Convolutional and LSTM Neural Networks

Similar documents
Convolutional and LSTM Neural Networks

Immuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning

Immuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning

COMP9444 Neural Networks and Deep Learning 5. Convolutional Networks

Flexible, High Performance Convolutional Neural Networks for Image Classification

Convolutional Neural Networks for Text Classification

Image Classification with TensorFlow: Radiomics 1p/19q Chromosome Status Classification Using Deep Learning

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil

Image Captioning using Reinforcement Learning. Presentation by: Samarth Gupta

Machine Learning For Personalized Cancer Vaccines. Alex Rubinsteyn February 9th, 2018 Data Science Salon Miami

Neuromorphic convolutional recurrent neural network for road safety or safety near the road

Visual semantics: image elements. Symbols Objects People Poses

The Immune Epitope Database Analysis Resource: MHC class I peptide binding predictions. Edita Karosiene, Ph.D.

Question 1 Multiple Choice (8 marks)

Networks and Hierarchical Processing: Object Recognition in Human and Computer Vision

Learning Convolutional Neural Networks for Graphs

Convolutional Neural Networks for Estimating Left Ventricular Volume

Deep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations

CS229 Final Project Report. Predicting Epitopes for MHC Molecules

An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns

Artificial Neural Networks

Factoid Question Answering

Antigen Recognition by T cells

arxiv: v2 [cs.cv] 19 Dec 2017

Attribution: University of Michigan Medical School, Department of Microbiology and Immunology

Recurrent Neural Networks

Application of Artificial Neural Networks in Classification of Autism Diagnosis Based on Gene Expression Signatures

Contents. Just Classifier? Rules. Rules: example. Classification Rule Generation for Bioinformatics. Rule Extraction from a trained network

Course Introduction. Neural Information Processing: Introduction. Notes. Administration

IMMUNOINFORMATICS: Bioinformatics Challenges in Immunology

DIABETIC RISK PREDICTION FOR WOMEN USING BOOTSTRAP AGGREGATION ON BACK-PROPAGATION NEURAL NETWORKS

Using stigmergy to incorporate the time into artificial neural networks

Deep Compression and EIE: Efficient Inference Engine on Compressed Deep Neural Network

Aggregated Sparse Attention for Steering Angle Prediction

arxiv: v1 [stat.ml] 23 Jan 2017

Deep learning on biomedical images. Ruben Hemelings Graduate VITO KU Leuven. Data Innova)on Summit March, #DIS2017

arxiv: v2 [cs.lg] 3 Apr 2019

Predicting Breast Cancer Survivability Rates

The Impact of Visual Saliency Prediction in Image Classification

NIH Public Access Author Manuscript Immunogenetics. Author manuscript; available in PMC 2014 September 01.

CS-E Deep Learning Session 4: Convolutional Networks

Medical Immunology. Contents and lecturing hours. immune system. antigen and adhesion molecule. Major histocompatibility. antigen.

Motivation: Attention: Focusing on specific parts of the input. Inspired by neuroscience.

MHC class I MHC class II Structure of MHC antigens:

Auto-Encoder Pre-Training of Segmented-Memory Recurrent Neural Networks

Why did the network make this prediction?

Disruption of Healthy Tissue by the Immune Response Autoimmune diseases: Inappropriate immune response against self-components

Cognitive Neuroscience History of Neural Networks in Artificial Intelligence The concept of neural network in artificial intelligence

Multilayer Perceptron Neural Network Classification of Malignant Breast. Mass

Basic Immunology. Lecture 5 th and 6 th Recognition by MHC. Antigen presentation and MHC restriction

Sequential Predictions Recurrent Neural Networks

Helminth worm, Schistosomiasis Trypanosomes, sleeping sickness Pneumocystis carinii. Ringworm fungus HIV Influenza

Deep Neural Networks: A New Framework for Modeling Biological Vision and Brain Information Processing

Reduction of Overfitting in Diabetes Prediction Using Deep Learning Neural Network

A HMM-based Pre-training Approach for Sequential Data

Automatic Classification of Perceived Gender from Facial Images

arxiv: v1 [cs.lg] 6 Oct 2016

Recognition of sign language gestures using neural networks

Introduction and Historical Background. August 22, 2007

CPSC81 Final Paper: Facial Expression Recognition Using CNNs

ML LAId bare. Cambridge Wireless SIG Meeting. Mary-Ann & Phil Claridge 23 November

Machine learning for neural decoding

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Audiovisual to Sign Language Translator

POC Brain Tumor Segmentation. vlife Use Case

MASTER S THESIS. Ali El Adi. Deep neural networks to forecast cardiac and respiratory deterioration of intensive care patients

Antigen Presentation to T lymphocytes

The Major Histocompatibility Complex (MHC)

Dual Path Network and Its Applications

ECG Signal Classification with Deep Learning Techniques

Practical Bayesian Optimization of Machine Learning Algorithms. Jasper Snoek, Ryan Adams, Hugo LaRochelle NIPS 2012

Medical Knowledge Attention Enhanced Neural Model. for Named Entity Recognition in Chinese EMR

Data mining for Obstructive Sleep Apnea Detection. 18 October 2017 Konstantinos Nikolaidis

Deep learning and non-negative matrix factorization in recognition of mammograms

Sign Language Recognition using Convolutional Neural Networks

MetaMHC: a meta approach to predict peptides binding to MHC molecules

COMPARING THE IMPACT OF ACCURATE INPUTS ON NEURAL NETWORKS

Cardiac Arrest Prediction to Prevent Code Blue Situation

Multi-attention Guided Activation Propagation in CNNs

Neural Information Processing: Introduction

Attribution: University of Michigan Medical School, Department of Microbiology and Immunology

Antigen capture and presentation to T lymphocytes

Deep Learning for Computer Vision

Diabetic Retinopathy Detection Using Eye Images

Learning Long-term Dependencies in Recurrent Neural Networks

Satoru Hiwa, 1 Kenya Hanawa, 2 Ryota Tamura, 2 Keisuke Hachisuka, 3 and Tomoyuki Hiroyasu Introduction

Automated diagnosis of pneumothorax using an ensemble of convolutional neural networks with multi-sized chest radiography images

Final Report: Automated Semantic Segmentation of Volumetric Cardiovascular Features and Disease Assessment

Dealing with Missing Values in Neural Network-Based Diagnostic Systems

Biological Inspiration. Joe Marino

Classıfıcatıon of Dıabetes Dısease Usıng Backpropagatıon and Radıal Basıs Functıon Network

Primary Tumor detection with EEG Signals using Wavelet Transform and Neural Network

Summary and discussion of: Why Does Unsupervised Pre-training Help Deep Learning?

ANN predicts locoregional control using molecular marker profiles of. Head and Neck squamous cell carcinoma

R Jagdeesh Kanan* et al. International Journal of Pharmacy & Technology

A Comparison of Deep Neural Network Training Methods for Large Vocabulary Speech Recognition

Shu Kong. Department of Computer Science, UC Irvine

PMR5406 Redes Neurais e Lógica Fuzzy. Aula 5 Alguns Exemplos

ACTIVE LEARNING OF THE GROUND TRUTH FOR RETINAL IMAGE SEGMENTATION

3D Deep Learning for Multi-modal Imaging-Guided Survival Time Prediction of Brain Tumor Patients

Transcription:

Convolutional and LSTM Neural Networks Vanessa Jurtz January 11, 2017

Contents Neural networks and GPUs Lasagne Peptide binding to MHC class II molecules Convolutional Neural Networks (CNN) Recurrent and LSTM Neural Networks Hyper-parameter optimization Training CNN and LSTM ensembles What do the networks learn?

Neural networks and GPUs A CPU (central processing unit) has few cores with lots of cache memory A GPU (graphics processing unit) has hundreds of cores but each with little memory GPUs are very good for lots of small operations that can be heavily parallelized, like matrix multiplication training a neural network requires lots of matrix multiplications!

Lasagne Why use Lasagne? python it is highly flexible as you will see in the afternoon exercise code can be run on a CPU or GPU my collaborators are using it too Lasagne Documentation

Peptide binding to MHC class II molecules Figure adapted from http://events.embo.org/12-antigen/ MHC II only expressed in professional antigen presenting cells (dentritic cells, macrophages, B cells etc) only a small fraction of peptides bind MHC II and are potential T cell epitopes polygenic and highly polymorphic

Peptides binding to MHCII molecules Nielsen et al. 2010 MHC Class II epitope predictive algorithms Immunology Peptides binding HLA-DR3 adapted from Kenneth Murphy Janeway s Immunobiology 8th edition fig 4.21 Garland Science Why is this a hard problem? binding peptides have different lengths the binding core can lie anywhere within a peptide large number of different MHC class II molecules

Feed Forward Neural Networks fully connected layers information only flows from input to output

CNN - Convolutional Neural Networks convolutional filter has specified input size slides over input sequence feeding into a different hidden neuron in each position weights of the filter are fixed as it slides over input and updated in backpropagation usually multiple filters of the same and/or different size are used

CNN - Convolutional Neural Networks Why could this be useful for predicting peptide binding to MHC class II? use filters of size 9 AA to focus on binding core integrate information about all 9-mers

CNN - Convolutional Neural Networks

CNN - Convolutional Neural Networks CNNs are known to work well for image analysis CIFAR example http://deeplearning.net/tutorial/lenet.html

Recurrent Neural Networks have connections between the neurons in the hidden layer have a memory learn what to store and what to ignore (Graves, 2015) Graves 2015 Supervised Sequence Labelling with Recurrent Neural Networks

LSTM memory blocks Instead of a hidden neuron we use an LSTM memory block: Graves 2015 Supervised Sequence Labelling with Recurrent Neural Networks

LSTM Networks Why make it more complicated? LSTM memory blocks allow to store and access information over longer periods of time Graves 2015 Supervised Sequence Labelling with Recurrent Neural Networks

LSTM Networks Input and output can be of variable length! inspired by: http://karpathy.github.io/2015/05/21/rnn-effectiveness/ Example peptide MHC class II binding: many to one

LSTM Networks LSTM networks are known to work well for natural language processing check out Andrej Karpathys blog to read up: Graves 2015

LSTM Networks Another biological application: protein secondary structure prediction http://chemistry.berea.edu/~biochemistry/ Anthony/Aequorin/Aequorin3.php Sønderby et al. 2015 Convolutional LSTM Networks for Subcellular Localization of Proteins http://arxiv.org/abs/1503.01919

Hyper-parameter optimization Parameters: weights Hyper-parameters: weight initialization weight update activation function data encoding dropout learning rate number of CNN filters / number of LSTM memory blocks hidden layer size Bergstra et al 2012 Random Search for Hyper-Parameter Optimization Journal of Machine Learning Research

Training CNN + LSTM ensembles - architecture CNN LSTM Peptide MHC II pseudo sequence + Peptide CNN Layer MHC II pseudo sequence LSTM Layer Dense Layer Dense Layer Binding Affinity Binding Affinity

Training CNN + LSTM ensembles - performance ensemble of 50% CNN and 50% LSTM ensemble of 50% NetMHCIIpan and 50% CNN+LSTM

What do these nets learn?

What do these nets learn? A B Subcellular Location Dense Layer Attention Function B-LSTM Layer CNN Layer C Protein D Model Accuracy A-LSTM 0.914 MultiLoc 0.767 E

Lasagne