RNA-seq Introduction

Similar documents
RNA- seq Introduc1on. Promises and pi7alls

RNA- seq Introduc1on. Promises and pi7alls

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq)

Alternative RNA processing: Two examples of complex eukaryotic transcription units and the effect of mutations on expression of the encoded proteins.

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Transcriptional control in Eukaryotes: (chapter 13 pp276) Chromatin structure affects gene expression. Chromatin Array of nuc

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

EPIGENOMICS PROFILING SERVICES

Prokaryotes and eukaryotes alter gene expression in response to their changing environment

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

RESEARCHER S NAME: Làszlò Tora RESEARCHER S ORGANISATION: Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC)

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc.

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells

Bi 8 Lecture 17. interference. Ellen Rothenberg 1 March 2016

Mining for disease- causing genes in the other 98% of the human genome Daniel Gautheret

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

Small RNAs and how to analyze them using sequencing

Genome-wide Association Studies (GWAS) Pasieka, Science Photo Library

Where Splicing Joins Chromatin And Transcription. 9/11/2012 Dario Balestra

Analyse de données de séquençage haut débit

Circular RNAs (circrnas) act a stable mirna sponges

Peak-calling for ChIP-seq and ATAC-seq

Regulation of Gene Expression in Eukaryotes

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015

MicroRNA in Cancer Karen Dybkær 2013

Patterns of Histone Methylation and Chromatin Organization in Grapevine Leaf. Rachel Schwope EPIGEN May 24-27, 2016

RNA SEQUENCING AND DATA ANALYSIS

mirna Dr. S Hosseini-Asl

TRANSCRIPTION. DNA à mrna

Introduction to Systems Biology of Cancer Lecture 2

Mechanisms of alternative splicing regulation

Genetics and Genomics in Medicine Chapter 6 Questions

Overview: Conducting the Genetic Orchestra Prokaryotes and eukaryotes alter gene expression in response to their changing environment

Supplemental Figure S1. Tertiles of FKBP5 promoter methylation and internal regulatory region

DNA codes for RNA, which guides protein synthesis.

Profiles of gene expression & diagnosis/prognosis of cancer. MCs in Advanced Genetics Ainoa Planas Riverola

Genetics. Instructor: Dr. Jihad Abdallah Transcription of DNA

Small RNAs and how to analyze them using sequencing

Lecture 8 Understanding Transcription RNA-seq analysis. Foundations of Computational Systems Biology David K. Gifford

Alternative splicing. Biosciences 741: Genomics Fall, 2013 Week 6

Global Epigenetic and Transcriptional Trends among Two Rice Subspecies and Their Reciprocal Hybrids W

Nature Genetics: doi: /ng Supplementary Figure 1. Assessment of sample purity and quality.

Selective depletion of abundant RNAs to enable transcriptome analysis of lowinput and highly-degraded RNA from FFPE breast cancer samples

RECAP (1)! In eukaryotes, large primary transcripts are processed to smaller, mature mrnas.! What was first evidence for this precursorproduct

Session 6: Integration of epigenetic data. Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

Ambient temperature regulated flowering time

Morphogens: What are they and why should we care?

Iso-Seq Method Updates and Target Enrichment Without Amplification for SMRT Sequencing

Transcriptome and isoform reconstruc1on with short reads. Tangled up in reads

Eukaryotic Gene Regulation

Simple, rapid, and reliable RNA sequencing

PRECISION INSIGHTS. GPS Cancer. Molecular Insights You Can Rely On. Tumor-normal sequencing of DNA + RNA expression.

Transcriptome Analysis

Introduction. Introduction

The Blueprint of Life: DNA to Protein. What is genetics? DNA Structure 4/27/2011. Chapter 7

The Blueprint of Life: DNA to Protein

ChromHMM Tutorial. Jason Ernst Assistant Professor University of California, Los Angeles

Inference of Isoforms from Short Sequence Reads

Deploying the full transcriptome using RNA sequencing. Jo Vandesompele, CSO and co-founder The Non-Coding Genome May 12, 2016, Leuven

Molecular Biology (BIOL 4320) Exam #2 May 3, 2004

Nature Immunology: doi: /ni Supplementary Figure 1. Transcriptional program of the TE and MP CD8 + T cell subsets.

Transcription and RNA processing

Set the stage: Genomics technology. Jos Kleinjans Dept of Toxicogenomics Maastricht University, the Netherlands

MODULE 3: TRANSCRIPTION PART II

L I F E S C I E N C E S

Systematic exploration of autonomous modules in noisy microrna-target networks for testing the generality of the cerna hypothesis

Hands-on Activity Viral DNA Integration. Educator Materials

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012

Histone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis

cis-regulatory enrichment analysis in human, mouse and fly

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory

Paternal exposure and effects on microrna and mrna expression in developing embryo. Department of Chemical and Radiation Nur Duale

ChIP-seq data analysis

Nature Structural & Molecular Biology: doi: /nsmb.2419

Gene Regulation Part 2

Epigenetic Principles and Mechanisms Underlying Nervous System Function in Health and Disease Mark F. Mehler MD, FAAN

Transcriptome profiling of the developing male germ line identifies the mir-29 family as a global regulator during meiosis

Human Genome Project (HGP) An international research effort to sequence and map all of the genes of Homo sapiens (the human genome) Completed p in Apr

Integrated network analysis reveals distinct regulatory roles of transcription factors and micrornas

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays

Transcription and chromatin. General Transcription Factors + Promoter-specific factors + Co-activators

High AU content: a signature of upregulated mirna in cardiac diseases

Eukaryotic small RNA Small RNAseq data analysis for mirna identification

Phenomena first observed in petunia

Research Strategy: 1. Background and Significance

High-throughput transcriptome sequencing

Multi-omics data integration colon cancer using proteogenomics approach

High Throughput Sequence (HTS) data analysis. Lei Zhou

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

REGULATED AND NONCANONICAL SPLICING

Chapter 2. Aims & Objectives

RNA SEQUENCING AND DATA ANALYSIS

RNA and Protein Synthesis Guided Notes

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing

MicroRNA roles in signaling during lactation: an insight from differential expression, time course and pathway analyses of deep sequence data

Studying Alternative Splicing

1. Identify and characterize interesting phenomena! 2. Characterization should stimulate some questions/models! 3. Combine biochemistry and genetics

Transcript reconstruction

Transcription:

RNA-seq Introduction

DNA is the same in all cells but which RNAs that is present is different in all cells

There is a wide variety of different functional RNAs

Which RNAs (and sometimes then translated to proteins) varies between samples -Tissues -Cell types -Cell states -Individuals -Cells

RNA gives information on which genes that are expressed How DNA get transcribed to RNA (and sometimes then translated to proteins) varies between e. g. -Tissues -Cell types -Cell states -Individuals

ENCODE, the Encyclopedia of DNA Elements, is a project funded by the National Human Genome Research Institute to identify all regions of transcription, transcription factor association, chromatin structure and histone modification in the human genome sequence.

ENCyclopedia Of Dna Elements

Different kind of RNAs have different expression values Landscape of transcription in human cells, S Djebali et al. Nature 2012

What defines RNA depends on how you look at it Coverage Variants Abundance House keeping RNAs mrnas Regulatory RNAs Novel intergenic None Adapted from Landscape of transcription in human cells, S Djebali et al. Nature 2012

Defining functional DNA elements in the human genome Statement A priori, we should not expect the transcriptome to consist exclusively of functional RNAs. Why is that Zero tolerance for errant transcripts would come at high cost in the proofreading machinery needed to perfectly gate RNA polymerase and splicing activities, or to instantly eliminate spurious transcripts. In general, sequences encoding RNAs transcribed by noisy transcriptional machinery are expected to be less constrained, which is consistent with data shown here for very low abundance RNA Consequence Thus, one should have high confidence that the subset of the genome with large signals for RNA or chromatin signatures coupled with strong conservation is functional and will be supported by appropriate genetic tests. In contrast, the larger proportion of genome with reproducible but low biochemical signal strength and less evolutionary conservation is challenging to parse between specific functions and biological noise.

Biochemical evidence not enough to identify functional RNAs Defining functional DNA elements in the human genome Kellis M et al. PNAS 2014;111:6131-6138

RNA seq course

One gene many different mrnas

How are RNA-seq data generated? Sampling process

Depending on the different steps you will get different results RNA-> enrichments -> AAAAAAAA PolyA (mrna) RiboMinus (- rrna) Size <50 nt (mirna ).. library -> Size of fragment Strand specific 5 end specific 3 end specific.. reads -> Single end (1 read per fragment) Paired end (2 reads per fragment)

The RNA seq course From RNA seq to reads (Introduction) Mapping reads programs (Monday) Transcriptome reconstruction using reference (Monday) Transcriptome reconstruction without reference (Monday) QC analysis (Tuesday) Differential expression analysis (Tuesday) Gene set analysis (Tuesday) Multi Variate Analysis (Wednesday) mirna analysis (Wednesday)

Promises and pitfalls Long reads Low throughput (-) Complete transcripts (+) Only highly expressed genes (--) Expensive (-) Low background noise (+) Easy downstream analysis (+) short reads High throughput (+) Fractions of transcripts (-) Full dynamic range (+-) Unlimited dynamic range (+) Cheap (+) Low background noise (+) Strand specificity (+) Re-sequencing (+) 10000 1000 Signal 100 10 1 1 10 100 1000 10000 100000 1000000 # trancripts/cell EST MicroArray RNAseq

RNA seq reads correspond directly to abundance of RNAs in the sample

Map reads to reference

Transcriptome assembly using reference

Transcriptome assembly without reference

Quality control -samples might not be what you think they are Experiments go wrong 30 samples with 5 steps from samples to reads has 150 potential steps for errors Error rate 1/100 with 5 steps suggest that one of every 20 samples the reads does not represent the sample Mixing samples 30 samples with 5 steps from samples to reads has ~24M potential mix ups of samples Error rate 1/ 100 with 5 steps suggest that one of every 20 sample is mislabeled Combine the two steps and approximately one of every 10 samples are wrong

RNA QC Read quality Mapping statistics Transcript quality Compare between samples

Differential expression analysis using univariate analysis Typically univariate analysis (one gene at a time) even though we know that genes are not independent

Gene set analysis and data integration SCP hidden by gene downregulation?

microrna analysis (Berezikov et al. Genome Research, 2011.)

All the steps will affect the results All RNA

All the steps will affect the results Experimental setup All R A

All the steps will affect the results Lab work + RNA extraction Expeimental setu All R A

All the steps will affect the results RNA enrichment protocoll Expeimental setu All R A

All the steps will affect the results Sequencing machine Expeimental setu All R A

All the steps will affect the results Reference Expeimental setu All R A

All the steps will affect the results Mapping program Expeimental setu All R A

All the steps will affect the results Differential expression analysis program Expeimental setu All R A

Try to be as consistent as possible Differential expression analysis program Expeimental setu All R A Differential expression analysis Expeimental program setu All R A Differential expression analysis Expeimental program setu All R A Differential expression analysis Expeimental programsetu All R A