genomics for systems biology / ISB2020 RNA sequencing (RNA-seq)

Similar documents
Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

RNA-seq Introduction

RNA SEQUENCING AND DATA ANALYSIS

RNA- seq Introduc1on. Promises and pi7alls

MODULE 3: TRANSCRIPTION PART II

Transcriptome Analysis

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays

Lecture 8 Understanding Transcription RNA-seq analysis. Foundations of Computational Systems Biology David K. Gifford

BIMM 143. RNA sequencing overview. Genome Informatics II. Barry Grant. Lecture In vivo. In vitro.

Transcriptional control in Eukaryotes: (chapter 13 pp276) Chromatin structure affects gene expression. Chromatin Array of nuc

Selective depletion of abundant RNAs to enable transcriptome analysis of lowinput and highly-degraded RNA from FFPE breast cancer samples

Ambient temperature regulated flowering time

Molecular Biology (BIOL 4320) Exam #2 May 3, 2004

High-Throughput Sequencing Course

Inference of Isoforms from Short Sequence Reads

Genetics. Instructor: Dr. Jihad Abdallah Transcription of DNA

Deploying the full transcriptome using RNA sequencing. Jo Vandesompele, CSO and co-founder The Non-Coding Genome May 12, 2016, Leuven

Bio 111 Study Guide Chapter 17 From Gene to Protein

EPIGENOMICS PROFILING SERVICES

Analyse de données de séquençage haut débit

Simple, rapid, and reliable RNA sequencing

RNA SEQUENCING AND DATA ANALYSIS

An RNA-Seq Strategy to Detect the Complete Coding and Non-Coding Transcriptome Including Full-Length Imprinted Macro ncrnas

TRANSCRIPTION. DNA à mrna

Molecular Biology (BIOL 4320) Exam #2 April 22, 2002

Mechanism of splicing

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing

Not IN Our Genes - A Different Kind of Inheritance.! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014

Mechanisms of alternative splicing regulation

Cellecta Overview. Started Operations in 2007 Headquarters: Mountain View, CA

Methods: Biological Data

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Global regulation of alternative splicing by adenosine deaminase acting on RNA (ADAR)

Studying Alternative Splicing

Small RNAs and how to analyze them using sequencing

Analysis and design of RNA sequencing experiments for identifying isoform regulation

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project

Molecular Cell Biology - Problem Drill 10: Gene Expression in Eukaryotes

Supplemental Methods RNA sequencing experiment

High-throughput transcriptome sequencing

CONTRACTING ORGANIZATION: Johns Hopkins University, Baltimore, MD

Long non-coding RNAs

DNA codes for RNA, which guides protein synthesis.

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis

Pre-mRNA has introns The splicing complex recognizes semiconserved sequences

Transcript reconstruction

Alternative RNA processing: Two examples of complex eukaryotic transcription units and the effect of mutations on expression of the encoded proteins.

AbSeq on the BD Rhapsody system: Exploration of single-cell gene regulation by simultaneous digital mrna and protein quantification

Introduction. Introduction

TITLE: Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer.

RNA Processing in Eukaryotes *

Circular RNAs (circrnas) act a stable mirna sponges

ACE ImmunoID Biomarker Discovery Solutions ACE ImmunoID Platform for Tumor Immunogenomics

Lectures 13: High throughput sequencing: Beyond the genome. Spring 2017 March 28, 2017

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells.

RNA- seq Introduc1on. Promises and pi7alls

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc.

Transcription and RNA processing

SCIENCE CHINA Life Sciences

An Analysis of MDM4 Alternative Splicing and Effects Across Cancer Cell Lines

Iso-Seq Method Updates and Target Enrichment Without Amplification for SMRT Sequencing

RNA-Seq Preparation Comparision Summary: Lexogen, Standard, NEB

Regulation of Gene Expression in Eukaryotes

MicroRNA in Cancer Karen Dybkær 2013

fl/+ KRas;Atg5 fl/+ KRas;Atg5 fl/fl KRas;Atg5 fl/fl KRas;Atg5 Supplementary Figure 1. Gene set enrichment analyses. (a) (b)

1. Investigate the structure of the trna Synthase in complex with a trna molecule. (pdb ID 1ASY).

ncounter Assay Automated Process Immobilize and align reporter for image collecting and barcode counting ncounter Prep Station

Protein Synthesis

Transcriptome and isoform reconstruc1on with short reads. Tangled up in reads

Supplementary Figures

Analysis of small RNAs from Drosophila Schneider cells using the Small RNA assay on the Agilent 2100 bioanalyzer. Application Note

AVENIO family of NGS oncology assays ctdna and Tumor Tissue Analysis Kits

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Transcription and RNA processing

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Eukaryotic small RNA Small RNAseq data analysis for mirna identification

ncounter Assay Automated Process Capture & Reporter Probes Bind reporter to surface Remove excess reporters Hybridize CodeSet to RNA

Chapter 10 - Post-transcriptional Gene Control

RECENT ADVANCES IN THE MOLECULAR DIAGNOSIS OF BREAST CANCER

Pseudogenes transcribed in breast invasive carcinoma show subtype-specific expression and cerna potential

Genome-wide Association Studies (GWAS) Pasieka, Science Photo Library

Effects of UBL5 knockdown on cell cycle distribution and sister chromatid cohesion

Small RNA Sequencing. Project Workflow. Service Description. Sequencing Service Specification BGISEQ-500 SERVICE OVERVIEW SAMPLE PREPARATION

Profiles of gene expression & diagnosis/prognosis of cancer. MCs in Advanced Genetics Ainoa Planas Riverola

Supplemental Materials. for. Conservation of an RNA Regulatory Map between Drosophila and. Mammals

TRANSCRIPTION CAPPING

A Statistical Framework for Classification of Tumor Type from microrna Data

Global Epigenetic and Transcriptional Trends among Two Rice Subspecies and Their Reciprocal Hybrids W

MutationTaster & RegulationSpotter

Computational Modeling and Inference of Alternative Splicing Regulation.

Alternative splicing. Biosciences 741: Genomics Fall, 2013 Week 6

Molecular Markers. Marcie Riches, MD, MS Associate Professor University of North Carolina Scientific Director, Infection and Immune Reconstitution WC

Screening for novel oncology biomarker panels using both DNA and protein microarrays. John Anson, PhD VP Biomarker Discovery

Role of Cnot6l in maternal mrna turnover

P. Tang ( 鄧致剛 ); PJ Huang ( 黄栢榕 ) g( ); g ( ) Bioinformatics Center, Chang Gung University.

The value of Omics to chemical risk assessment

Eukaryotic mrna is covalently processed in three ways prior to export from the nucleus:

Complexity DNA. Genome RNA. Transcriptome. Protein. Proteome. Metabolites. Metabolome

Transcription:

RNA sequencing (RNA-seq)

Module Outline MO 13-Mar-2017 RNA sequencing: Introduction 1 WE 15-Mar-2017 RNA sequencing: Introduction 2 MO 20-Mar-2017 Paper: PMID 25954002: Human genomics. The human transcriptome across tissues and individuals. WE 22-Mar-2017 RNA sequencing: Models for RNA-sequencing Data MO 27-Mar-2017 Paper: PMID 26813401: A survey of best practices for RNA-seq data analysis WE 29-Mar-2017 Paper: PMID 28049689: Understanding development and stem cells using single cell-based analyses of gene expression

High Throughput Technologies Measuring things in parallel, for example: Northern blot: Expression of one gene. Microarray: Expression of all known genes. NGS (e.g. RNASeq): The transcriptome. NGS: Next Generation Sequencing: Generates thousands of sequences in parallel Underlies many applications, not only RNASeq Dramatically reduced cost

Next Generation Sequencing - Sequencing target (e.g. a genome) - Library preparation (includes fragmentation) - Single vs. paired end - Different technologies / platforms Differ in: - Read length - Accuracy - Cost Each fragment gets sequenced in parallel Much more efficient than sequential sequencing

Next generation sequencing [NHGRI: http://www.genome.gov/sequencingcosts/ ]

What can we measure with NGS?

What can we measure with NGS? [Soon et al. 2013]

HighThroughput Technologies High-throughput technologies measure things in parallel Many based on Next Generation Sequencing We will focus on measuring transcription, BUT: NGS much more versatile. Experimental design, data processing and data analysis are increasingly demanding.

RNA sequencing (RNA-seq) Gene expression

Gene expression [Wikipedia] Transcription: pre-mrna RNA processing: mrna - 5 capping - 3 cleavage and polyadenylation - RNA splicing Translation: protein - Ribosome contains rrna. - rrna depletion for RNASeq - Poly(A) enrichment for RNASeq

Gene expression [Licatalosi & Darnell, NRG 2010] - Exons, Introns and UTRs - DNA -> pre-mrna -> mrna - Poly-A tails and sites

RNA sequencing RNA sequencing

RNA sequencing Sequence cdna to get information about RNA content of the sample. Rough idea: Total RNA of sample Poly(A) selection to target mrna (not always) rrna removal (not always) Reverse transcription (mrna to cdna) Fragmentation Sequencing

RNA sequencing steps: Quality control and then: Computational pipeline on the left - Many different methods, but steps essentially similar. - Alignment - Disambiguation / filtering - Abundance quantification [Mortazavi et al., Nature Methods 2008] RPKM: reads (mrna abundance) kilo base (transcript length) million reads mapped (library size) FPKM: single -> paired end reads -> fragments more reliable alignment TPM: Transcripts per million

RNA sequencing steps: Assign reads (fragments) to annotated exons

RNA sequencing steps: Ambiguous reads: Alternative: constitutively expressed exons [gene-level only].

RNA sequencing steps: Make new gene(models):

RNA sequencing steps: Calculate gene abundances (here: RPKM)

RNA sequencing (differential) RNA abundance and what RNA-seq can do

RNA expression / abundance RPKM FPKM TPM [Pachter 2013]

Differential gene expression 2.0 First things first: Differential gene expression: Using constitutively expressed exons. Far from trivial. Easier than -> [Trapnell et al., Nature Protocols 2012]

Comparison to microarray => RNA similar to microarray; better correlation for higher expression levels. [Guo et al. 2013]

Differential isoforms with RNA-seq RNAseq can distinguish isoforms. [Katz et al. 2010]

SNPs from gene expression data RNAseq can detect polymorphisms. [Zhao et al. 2014]

RNA sequencing Characterization of entire transcriptome Can be used for: Discovery (existence / expression of) genes, exons, and transcripts (alternative splicing) Change detection between conditions in Gene expression Isoform expression, and more Different types of RNAs (e.g., mirnas, ncrnas)