Small RNAs and how to analyze them using sequencing

Similar documents
Small RNAs and how to analyze them using sequencing

Eukaryotic small RNA Small RNAseq data analysis for mirna identification

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq)

RNA-seq Introduction

MicroRNA in Cancer Karen Dybkær 2013

MicroRNAs, RNA Modifications, RNA Editing. Bora E. Baysal MD, PhD Oncology for Scientists Lecture Tue, Oct 17, 2017, 3:30 PM - 5:00 PM

RNA- seq Introduc1on. Promises and pi7alls

Supplementary Figure 1

RNA-Seq Preparation Comparision Summary: Lexogen, Standard, NEB

Small RNA-Seq and profiling

Bi 8 Lecture 17. interference. Ellen Rothenberg 1 March 2016

V16: involvement of micrornas in GRNs

Prediction of micrornas and their targets

Cross species analysis of genomics data. Computational Prediction of mirnas and their targets

Arabidopsis thaliana small RNA Sequencing. Report

RNA- seq Introduc1on. Promises and pi7alls

Evidence for the biogenesis of more than 1,000 novel human micrornas

mirna-guided regulation at the molecular level

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1. Differential expression of mirnas from the pri-mir-17-92a locus.

he micrornas of Caenorhabditis elegans (Lim et al. Genes & Development 2003)

Transcriptome-wide analysis of microrna expression in the malaria mosquito Anopheles gambiae

Circular RNAs (circrnas) act a stable mirna sponges

Alan Weiner BIOCHEM 530 Friday, MEB 248 October 23, 2015 RNA structure, the ribosome, structure-based drug design

NOVEL FUNCTION OF mirnas IN REGULATING GENE EXPRESSION. Ana M. Martinez

Small RNA Sequencing. Project Workflow. Service Description. Sequencing Service Specification BGISEQ-500 SERVICE OVERVIEW SAMPLE PREPARATION

MicroRNA and Male Infertility: A Potential for Diagnosis

Supplemental Figure 1. Small RNA size distribution from different soybean tissues.

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

CRS4 Seminar series. Inferring the functional role of micrornas from gene expression data CRS4. Biomedicine. Bioinformatics. Paolo Uva July 11, 2012

Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples.

Mature microrna identification via the use of a Naive Bayes classifier

High AU content: a signature of upregulated mirna in cardiac diseases

Lecture 8 Understanding Transcription RNA-seq analysis. Foundations of Computational Systems Biology David K. Gifford

mirna Dr. S Hosseini-Asl

P. Tang ( 鄧致剛 ); PJ Huang ( 黄栢榕 ) g( 鄧致剛 ); g ( 黄栢榕 ) Bioinformatics Center, Chang Gung University.

Profiles of gene expression & diagnosis/prognosis of cancer. MCs in Advanced Genetics Ainoa Planas Riverola

BIMM 143. RNA sequencing overview. Genome Informatics II. Barry Grant. Lecture In vivo. In vitro.

Nature Structural & Molecular Biology: doi: /nsmb.2419

RIP-seq of BmAgo2-associated small RNAs reveal various types of small non-coding RNAs in the silkworm, Bombyx mori

omiras: MicroRNA regulation of gene expression

The RNA revolution: rewriting the fundamentals of genetics

Transcriptome profiling of the developing male germ line identifies the mir-29 family as a global regulator during meiosis

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells.

MicroRNAs: a new source of biomarkers in radiation response. Simone Moertl, Helmholtz Centre Munich

Transcriptional control in Eukaryotes: (chapter 13 pp276) Chromatin structure affects gene expression. Chromatin Array of nuc

Evidence for the biogenesis of more than 1,000 novel human micrornas

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Processing of RNA II Biochemistry 302. February 18, 2004 Bob Kelm

EPIGENOMICS PROFILING SERVICES

Transcriptome Analysis

Small RNA and PARE sequencing in flower bud reveal the involvement of srnas in endodormancy release of Japanese pear (Pyrus pyrifolia 'Kosui')

Chapter 10 - Post-transcriptional Gene Control

RNA interference induced hepatotoxicity results from loss of the first synthesized isoform of microrna-122 in mice

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

Identification of mirnas in Eucalyptus globulus Plant by Computational Methods

ANALYSIS OF SOYBEAN SMALL RNA POPULATIONS HLAING HLAING WIN THESIS

DNA codes for RNA, which guides protein synthesis.

Islamic University Faculty of Medicine

Human breast milk mirna, maternal probiotic supplementation and atopic dermatitis in offsrping

Profiling micrornas in lung tissue from pigs infected with Actinobacillus pleuropneumoniae

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

Cellecta Overview. Started Operations in 2007 Headquarters: Mountain View, CA

Oasis 2: improved online analysis of small RNA-seq data

High-Throughput Sequencing Course

Processing of RNA II Biochemistry 302. February 13, 2006

Inferring condition-specific mirna activity from matched mirna and mrna expression data

Marta Puerto Plasencia. microrna sponges

Epigenetic Principles and Mechanisms Underlying Nervous System Function in Health and Disease Mark F. Mehler MD, FAAN

Transcriptome and isoform reconstruc1on with short reads. Tangled up in reads

Analysis of 13 cell types reveals evidence for the expression of numerous novel primate- and tissue-specific micrornas

Processing of RNA II Biochemistry 302. February 14, 2005 Bob Kelm

Ch. 18 Regulation of Gene Expression

IPA Advanced Training Course

10/31/2017. micrornas and cancer. From the one gene-one enzyme hypothesis to. microrna DNA RNA. Transcription factors.

Cancer Genomes How to Analyze Your Own Genome

Prokaryotes and eukaryotes alter gene expression in response to their changing environment

cis-regulatory enrichment analysis in human, mouse and fly

Regulation of Gene Expression in Eukaryotes

RNA-seq: filtering, quality control and visualisation. COMBINE RNA-seq Workshop

A Statistical Framework for Classification of Tumor Type from microrna Data

Simple, rapid, and reliable RNA sequencing

MicroRNAs and Cancer

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays

Alternative RNA processing: Two examples of complex eukaryotic transcription units and the effect of mutations on expression of the encoded proteins.

RNA-Seq profiling of circular RNAs in human colorectal Cancer liver metastasis and the potential biomarkers

Deploying the full transcriptome using RNA sequencing. Jo Vandesompele, CSO and co-founder The Non-Coding Genome May 12, 2016, Leuven

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

RNA - protein interactions in mrna decay A case study on TTP and HuR in AMD

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells

Cellular MicroRNA and P Bodies Modulate Host-HIV-1 Interactions. 指導教授 : 張麗冠博士 演講者 : 黃柄翰 Date: 2009/10/19

Human Genome: Mapping, Sequencing Techniques, Diseases

EXPression ANalyzer and DisplayER

Lesson Flowchart. Why Gene Expression Regulation?; microrna in Gene Regulation: an overview; microrna Genomics and Biogenesis;

Over the past decade, it has become progressively more

Multi-omics data integration colon cancer using proteogenomics approach

REGULATED AND NONCANONICAL SPLICING

Table S1. Quantitative RT-PCR primers

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first

Introduction to Systems Biology of Cancer Lecture 2

Transcription:

Small RNAs and how to analyze them using sequencing RNA-seq Course November 8th 2017 Marc Friedländer ComputaAonal RNA Biology Group SciLifeLab / Stockholm University Special thanks to Jakub Westholm for sharing slides!

Small RNAs Small RNAs are species of short non-coding RNAs, by definiaon <200 nucleoades micrornas (mirnas) short interfering RNAs (sirnas) piwi associated RNAs (pirnas) clustered regularly interspaced short palindromic repeats (CRISPRs) mirtrons, cis-natrnas, tasi-rnas, enhancer RNAs and other strange things

1. Background on regulatory small RNAs

1993: Discovery of first mirna - lin-4 necessary for progression to L2 - it was found that lin-4 is an RNA! - part of this RNA is complementary to the 3 UTR of the gene lin-14

2000: a second, conserved, microrna is found

2001: many micrornas are found in various animals Using: - RNA structure predicaon - ComparaAve genomics - (low throughput) sequencing

microrna biogenesis Many enzymes etc. are involved: Drosha, Exp5, Dicer,... The end result is a ~22nt RNA loaded into an Argonaute complex. The microrna directs Argonaute to target genes, through base pairing with the 3 UTR (pos 2-8). This causes repression. (Winter et. Nature Cell Biol, 2009)

Target repression by micrornas (This is in animals. micrornas in plants work differently.) (Fabian, NSMB, 2012)

How do micrornas find their targets? In animals, micrornas find their targets through pairing between the microrna seed region (nucleoades 2-8) and the target transcript (Friedman et al. Genome Research, 2009) Such short matches are common à a microrna can have hundreds of targets. It is esamated that over half of all genes are targeted by micrornas.

MicroRNA target predicaon Besides seed pairing, other features are used in the target predicaons: ConservaAon (conserved target sites are more likely to be funcaonal) mrna structure (it s hard for a microrna to interact with a highly structured target mrna) Sequences around the target site (AU rich sequences around targets?) Many programs exist for microrna target predicaon (TargetScan, PicTar,..) These are not perfect. Target predicaon is hard, and a lot of details about the mechanism are sall not known.

MicroRNAs in animal genomes There are typically hundreds or thousands micrornas in animal genomes: Fly: ~300 microrna loci Mouse: ~1200 microrna loci Human: ~1900 microrna loci In a given Assue, their expression can range over more than 5 orders of magnitude (a few to > 100,000 molecules per cell)

micrornas regulate many biological processes and are involved in disease Development DifferenAaAon FormaAon of cell idenaty Stress response Cancer Cardiovascular disease Inflammatory disease Autoimmune disease

2. Small RNA sequencing

Sequencing Small RNA sequencing is similar to mrna sequencing, but: There is no poly-a selecaon. Instead RNA fragments are size selected (typically 15-30 nucleoades, to avoid contaminaaon by ribosomal RNA). Low complexity libraries à more sequencing problems FastQC results will look strange: Length NucleoAde content Sequence duplicaaon

Pre-processing of small RNA data I Since we are sequencing short RNA fragments, adaptor sequences end up in the reads too. Many programs available to remove adaptor sequences (cutadapt, fastx_clipper, Btrim..) We only want to keep the reads that had adaptors in them. GTTTCTGCATTTTCGTATGCCGTCTTCTGCTTGAA GTGGGTAGAACTTTGATTAATTCGTATGCCGTCTT GTTTGTAAATTCTGATCGTATGCCGTCTTCTGCTT GAATATATATAGATATATACATACATACTTATCGT GCTGACTTAGCTTGAAGCATAAATGGTCGTATGCC GACGATCTAGACGGTTTTCGCAGAATTCTGTTTAT Adapter missing

Pre-processing of small RNA data II micrornas are expected to be 20-25 nt. Short reads are probably not micrornas, and are hard to map uniquely GTTTCTGCATTTTCGTATGCCGTCTTCTGCTTGAA GTGGGTAGAACTTTGATTAATTCGTATGCCGTCTT GTTTGTAAATTCTGATCGTATGCCGTCTTCTGCTT GCTGACTTAGCTTGAAGCATAAATGGTCGTATGCC Long reads are probably not micrornas To short (Lau et al. Genome Research, 2010)

Pre-processing of small RNA data III Another useful QC step is to check which loci the reads map to: (Figure from Friedländer et al., PNAS, 2009)

Small RNA expression profiling -The number of Ames a small RNA is sequenced is a funcaon of its expression -to count this number, the sequenced small RNAs must first be compared to reference sequences -however some reference small RNA sequences are truncated, making mapping against them difficult -It is more robust to map the sequenced RNAs against the genome/precursors

Small RNA-seq is reproducible Sequencing frequency of micrornas in planarian biological replicates (Figure from Friedländer et al., PNAS. 2009)

Small RNA-seq cannot measure absolute abundances Sequencing frequency of 473 araficial micrornas in equal abundance (Figure from Linsen et al., Nature Methods. 2009)

Small RNA-seq can measure relaave abundances (fold-changes) Fold-changes: deep sequencing vs. qpcr (Figure from Linsen et al., Nature Methods. 2009)

IdenAfying differenaally expressed small RNAs -Once the sequence data is transformed to counts, they are in essence not different from ordinary RNA-seq data -microrna counts should be normalized to the total mirna counts in the sample (RPM) or to trimmed mean of M-values (TMM) -for comparisons between two datasets, an iniaal eyeballing works as sanity check Dedicated tools: - DEseq2 - edger - NOISEQ (Figure from Stoeckius et al., Nature Methods, 2009)

3. What can we learn from microrna expression analysis?

MicroRNA expression profiles classify human cancers microrna expression profiles cluster according to cancer type. (Lu et al. Nature 2005)

microrna profiles can be used to disanguish cancer subtypes (Chan et al. Trends in Molecular Medicine, 2010)

microrna profiles in cell lines vs. Assues PCA plot showing that microrna profiles in most cell lines are more similar to each other than to normal Assues. (Wen et al. Genome Research 2014)

microrna discovery by small RNA-seq: challenges NGS can detect hundreds of millions of small RNAs in one run - however, many of the sequenced RNAs are degradaaon products from: rrnas, trnas, mrnas, snrnas, snornas un-annotated transcripts - when the RNAs are mapped to the genome, they omen map to millions of loci - only a few hundreds of these loci are in fact microrna genes - thus, the non-trivial task of accurately classifying microrna gene loci remains!

mirdeep: first algorithm to discover micrornas in small RNA-seq data - first and most widely used algorithm for microrna discovery (>800 studies) - probabilisac (reports probability that a given sequence is a microrna) - independent of: species conserva3on informa3on genome annota3on state of genome assembly - incorporates our knowledge of microrna biogenesis

Key idea behind mirdeep(2) Novel micrornas are discovered in a three step process: 1: frequently sequenced RNAs are idenafied ( read stacks ) 2: the read stacks should overlap an RNA hairpin structure 3: the posiaon of the stacks in hairpin should conform to Dicer processing ( Dicer signature, a) Dicer signature (Figure from Friedländer et al., Nature Biotech. 2008)

Log-odds scoring funcaon Pre: the hairpin is a genuine microrna Bgr: the hairpin is a (non-microrna) background hairpin

Output is a list of microrna candidates, with scores, and a plot for each candidate: (Friedlä nder et. al. Nucleic Acids Research, 2011) mirdeep2 is installed on UPPMAX. There are also other programs, e.g. mircat2 which also finds other small RNAs.

Other strange small RNAs that show up in sequencing data pirnas tasi-rnas mirtrons yrnas trna fragments cis-natrnas Some of these are funcaonal Some are by products of RNA processing, and can be informaave (e.g. microrna loop sequences). Some are probably just noise.

The end