STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells

Similar documents
7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63.

RNA-seq Introduction

MIR retrotransposon sequences provide insulators to the human genome

Genome-Wide Localization of Protein-DNA Binding and Histone Modification by a Bayesian Change-Point Method with ChIP-seq Data

mirna Dr. S Hosseini-Asl

MicroRNA-mediated incoherent feedforward motifs are robust

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

MicroRNA in Cancer Karen Dybkær 2013

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and

Mature microrna identification via the use of a Naive Bayes classifier

MicroRNA expression profiling and functional analysis in prostate cancer. Marco Folini s.c. Ricerca Traslazionale DOSL

High AU content: a signature of upregulated mirna in cardiac diseases

Profiles of gene expression & diagnosis/prognosis of cancer. MCs in Advanced Genetics Ainoa Planas Riverola

Bi 8 Lecture 17. interference. Ellen Rothenberg 1 March 2016

Results. Abstract. Introduc4on. Conclusions. Methods. Funding

Computational Modeling of mirna Biogenesis

High Throughput Sequence (HTS) data analysis. Lei Zhou

Gene-microRNA network module analysis for ovarian cancer

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012

RNA- seq Introduc1on. Promises and pi7alls

Metabolic programming. Role of micrornas. M Elizabeth Tejero, PhD Laboratory of Nutrigenetics and Nutrigenomics INMEGEN Mexico City

Integrated analysis of sequencing data

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

EPIGENOMICS PROFILING SERVICES

TRANSCRIPTIONAL REGULATION OF MICRORNA GENES AND THE REGULATORY NETWORKS IN WHICH THEY PARTICIPATE. David Lee Corcoran

ChIP-seq analysis. J. van Helden, M. Defrance, C. Herrmann, D. Puthier, N. Servant, M. Thomas-Chollier, O.Sand

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Circular RNAs (circrnas) act a stable mirna sponges

Assignment 5: Integrative epigenomics analysis

Global Epigenetic and Transcriptional Trends among Two Rice Subspecies and Their Reciprocal Hybrids W

MODULE 3: TRANSCRIPTION PART II

reported to underlie diverse aspects of biology, including developmental timing, differentiation, proliferation, cell death, and metabolism.

Nature Structural & Molecular Biology: doi: /nsmb.2419

SUPPLEMENTAL DATA AGING, July 2014, Vol. 6 No. 7

MicroRNA and Male Infertility: A Potential for Diagnosis

he micrornas of Caenorhabditis elegans (Lim et al. Genes & Development 2003)

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq)

Histone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells

Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD,

Nature Genetics: doi: /ng Supplementary Figure 1. Assessment of sample purity and quality.

The Epigenome Tools 2: ChIP-Seq and Data Analysis

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells.

On the Reproducibility of TCGA Ovarian Cancer MicroRNA Profiles

Yue Wei 1, Rui Chen 2, Carlos E. Bueso-Ramos 3, Hui Yang 1, and Guillermo Garcia-Manero 1

Lecture 8 Understanding Transcription RNA-seq analysis. Foundations of Computational Systems Biology David K. Gifford

Supplementary Figure 1

Supplementary Figure 1. Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Nature Immunology: doi: /ni.

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc.

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes

ChromHMM Tutorial. Jason Ernst Assistant Professor University of California, Los Angeles

MicroRNA dysregulation in cancer. Systems Plant Microbiology Hyun-Hee Lee

Supplemental Figure S1. Tertiles of FKBP5 promoter methylation and internal regulatory region

Processing, integrating and analysing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data

The Insulator Binding Protein CTCF Positions 20 Nucleosomes around Its Binding Sites across the Human Genome

Obstacles and challenges in the analysis of microrna sequencing data

micrornas (mirna) and Biomarkers

Introduction to Systems Biology of Cancer Lecture 2

Research Article Identifying Liver Cancer-Related Enhancer SNPs by Integrating GWAS and Histone Modification ChIP-seq Data

Supplemental Figure 1. Small RNA size distribution from different soybean tissues.

Not IN Our Genes - A Different Kind of Inheritance.! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014

Reduced expression of SOX7 in ovarian cancer: a novel tumor suppressor through the Wnt/β-catenin signaling pathway

Allelic reprogramming of the histone modification H3K4me3 in early mammalian development

Transcriptional control in Eukaryotes: (chapter 13 pp276) Chromatin structure affects gene expression. Chromatin Array of nuc

Comparative DNA methylome analysis of endometrial carcinoma reveals complex and distinct deregulation of cancer promoters and enhancers

SUPPLEMENTAL INFORMATION

Human breast milk mirna, maternal probiotic supplementation and atopic dermatitis in offsrping

Eukaryotic small RNA Small RNAseq data analysis for mirna identification

Functions of microrna in response to cocaine stimulation

ChIP-seq data analysis

From reference genes to global mean normalization

Intrinsic cellular defenses against virus infection

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis

Assaying micrornas in biofluids for detection of drug induced cardiac injury. HESI Annual Meeting State of the Science Session June 8, 2011

A novel and universal method for microrna RT-qPCR data normalization

RNA- seq Introduc1on. Promises and pi7alls

Small RNAs and how to analyze them using sequencing

Mechanisms of alternative splicing regulation

Identification of mirnas in Eucalyptus globulus Plant by Computational Methods

Dominic J Smiraglia, PhD Department of Cancer Genetics. DNA methylation in prostate cancer

Peak-calling for ChIP-seq and ATAC-seq

Prediction of micrornas and their targets

MicroRNA roles in signaling during lactation: an insight from differential expression, time course and pathway analyses of deep sequence data

V16: involvement of micrornas in GRNs

H3K4 demethylase KDM5B regulates global dynamics of transcription elongation and alternative splicing in embryonic stem cells

Supplemental Data. Integrating omics and alternative splicing i reveals insights i into grape response to high temperature

DSB. Double-Strand Breaks causate da radiazioni stress ossidativo farmaci

Removal of Shelterin Reveals the Telomere End-Protection Problem

Supplementary Figures

NOVEL FUNCTION OF mirnas IN REGULATING GENE EXPRESSION. Ana M. Martinez

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1. Differential expression of mirnas from the pri-mir-17-92a locus.

The Role of micrornas in Pain. Phd Work of: Rehman Qureshi Advisor: Ahmet Sacan

mirt: A Database of Validated Transcription Start Sites of Human MicroRNAs

Supplementary Figure 1 IL-27 IL

Genetics and Genomics in Medicine Chapter 6 Questions

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Transcription:

CAMDA 2009 October 5, 2009 STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells Guohua Wang 1, Yadong Wang 1, Denan Zhang 1, Mingxiang Teng 1,2, Lang Li 2, and Yunlong Liu 2 Harbin Institute of Technology, Harbin, Heilongjiang, China Indiana University School of Medicine, Indianapolis, IN, USA

Outline Introduction microrna Motivation A failed attempt to construct micrornamediated regulatory network Strategy and Modeling Results Conclusion Acknowledgement

microrna regulation Small non-coding RNA Function: silencing genes through post-transcriptional regulation Single strand RNA (ssrna); 19-25 nucleotides (~22 nucleotides); Endogenous Accounting for more than 1% of the genome (>200 members per species, >800 in human); >2/3 of human genes are microrna target;

Motivation A failed attempt Initial goal: identify microrna regulatory network using microrna microarray and gene expression data Gene-Y mir-x In collaboration with Dr. Nakshatri Manuscript submitted

Motivation microrna biogenesis Three forms: Pri-miRNA (several thousands nt) Pre-miRNA (60-100 nt) Mature mirna (22 nt) Most micrornas are Pol II trascribed We only have pre- and mature mirna annotation Can we identify regulatory regions and transcription start site based on Pol II binding patterns? Transcription start site pri-mirna (>2000 nt) pre-mirna (100 nt) mature mirna (20 nt)

Strategy Identify Pol II binding pattern from expressed genes, and apply this pattern to identify microrna promoters. ChIP-seq data on Pol II Similar pattern for mirna? mir-21 Pol II binding feature surrounding highly expressed genes

Learn from coding genes Goal: Confirm whether Pol II binding follows certain pattern for high- and low- expressed genes; If there is a clear pattern, what does it look like. Strategy: Microarray data in HeLa cell (GSE 4483, Mense et al. Physiol Genomics 2006) Frequency silent Distance from transcription start site

A reasonable model Key features: A spike around TSS (transcription start site); In both promoter region and transcript region, the Pol II signals gradually go down; The signals in transcript region is higher than in the background region. Present genes Absent genes Things considered in the equation: Five important parameters for each gene: Signal Intensity in/on S: Transcription Start Site (TSS); B: Background region; T: Transcript region; Decay rate Kp: Promoter region Kt: Transcript region Each parameter follows gamma distribution genome-wide.

A closer look at the model 1. Genomic regions were divided into 200-bp bins. 2. The expected number of fragments falling into each bin follows Poisson distribution S ~ Gamma(α S, β S ) For each gene, 5 parameters decide the expected # of fragments (λ in Poisson) Signal Intensity in/on S: Transcription Start Site (TSS); B: Background region; T: Transcript region; Decay rate Kp: Promoter region Kt: Transcript region

Modeling!!! = = " " " " " " " " " " " " = " # # $ % & & ' ( ) ) ) ) ) = = GENE i i i Ti Pi i TSS i i TSS i ij T ij ij T ij ij P ij ij P ij N i j S a i E a i B a i K a Ti K a Pi TSS X TSS T X T j P X P e S e E e B e K e K X e X e X e Y Y X Y X 1 50 1 5 5 / 1 4 4 / 1 3 3 / 1 2 2 / 1 1 1 / 1 50 1 5 5 5 4 4 4 3 3 3 2 2 2 1 1 1 ) ( ) ( ) ( ) ( ) (!!! ) ) Pr(, Pr( ), Pr( * + * + * + * + * +,,, + * + * + * + * + *,,, - - - X: observed Pol II fragments in each bin Y: {B, T, S, Kp, Kt} missing data θ : {α B, β B, α T, β T, α S, β S, α Kt, β Kt, α Kp, β Kp } upstream bins downstream bins TSS bins Pr(Y θ)

Are these models working? Goal: evaluating whether the proposed model (Pol II binding pattern around TSS based on ChIPseq) can identify coding genes that are actively transcribed (array-derived) Gold standard (using genes with clear annotation no other genes within 10,000-bp): Positive set: 4,120 genes (Present on array) Negative set: 2,682 genes (Absent on array) Procedure 1/4 were used to train parameters 3/4 were used to make the call ROC curve are generated based upon correct prediction on gene expression. Conclusion The model worked well, at least for protein coding genes

Promoter identification for mirnas Strategy Parameters identified for coding genes 1. Predict whether certain mirnas are expressed 2. Find the promoter regions for expressed mirnas Putative promoter hsa-mir-96 hsa-mir-183

Results

Result summary Focusing on 419 annotated intergenic micrornas; Computational model identified 83 micrornas predicted to be transcribed in HeLa cells (FDR 20%); This is consistent with RNA-seq experiments on another project.

Distance between microrna and TSS Distance between annotated microrna and transcriptions start site varies Distance between mirna and TSS Longest Median Shortest 10000bp 3600bp 200bp Frequency mirna Distance Distance between microrna and TSS (bp)

Promoter length Promoter length varies. 90% decay 90% decay Promoter length Longest Median Shortest 4,989-bp 1,476-bp 397-bp promoter Frequency promoter length

Validation

Validation I conservation Higher conservation score is observed around transcription start site 0.11 Conservation Score 0.10 0.09 0.08 0.07 0.06 0.05-10000 -5000 TSS 5000 10000 Distance (bp) Conservation scores are retrieved from UCSC Genome Browser

Validation II CpG islands Number of total annotated CpG islands away from the transcription start site (TSS) in 83 identified micrornas. 60 50 microrna Random Number of CpG island 40 30 20 10 0-10000 -5000 TSS 5000 10000 Distance from TSS (bp)

Histone markers (other study) H3K4Me2 patterns for identified microrna promoters H3K4Me2 patterns gene published in Barski et al. Cell 2007

Relationship of predicted microrna promoters with STAT1 binding

Relationship with STAT1 binding Promoter regions of 41 micrornas (49.4%, out of 83) contain or overlap with STAT1 enriched regions. mir-1259

Relationship with STAT1 binding Promoter regions of 41 micrornas (49.4%, out of 83) contain or overlap with STAT1 enriched regions.

microrna mediated regulatory network? 83 micrornas are actively transcribed 41 of which are STAT1 targets 6,254 genes are STAT1 targets (within -1,000 to +500-bp from TSS) Gene-Y mir-x

Feed back loops microrna mediated regulatory network? Feed forward loops IFNγ STAT1 hsa-mir-27a hsa-mir-220c mir-x 72 mirs 1,753 genes mrna-y hsa-mir-607*, 24, 92a-1, 92b, 940, 220c, 505 *: conserved targets hsa-mir-220 is deregulated in 5 types of cancers, including: B-cell chronic lymphocytic leukemia Glioblastoma Lung cancer Pancreatic cancer Papillary thyroid carcinoma

Summary Genome-wide RNA polymerase II binding data (from ChIP-seq) is a good resource for identifying regulatory regions of microrna; Promoters of 83 micrornas were identified in HeLa cells. Important step to study microrna regulatory network; Similar methodology can also be applied to study regulatory mechanisms of other non-coding RNA, such as pi-rna; For coding genes, tissue-specific alternative promoters can be also identified.

Collaborators Tim Huang (OSU) Ken Nephew (Indiana U.) Changyu Shen (IUSM) Acknowledgement CAMDA Committee Simon Lin Pan Du Data Provider Gerstein group Funding Source US NIH-AA017941 US NIH-CA113001 China 863 High-Tech Program 2007AA02Z302 China Natural Science Foundation 60901075

Result summary Focusing on 419 annotated intergenic micrornas; Computational model identified 83 micrornas predicted to be transcribed in HeLa cells (FDR 20%); RNA-seq experiment RPKM in coding gene RPKM in pre-microrna