Session 6: Integration of epigenetic data. Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016

Similar documents
Accessing and Using ENCODE Data Dr. Peggy J. Farnham

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

The Epigenome Tools 2: ChIP-Seq and Data Analysis

EPIGENOMICS PROFILING SERVICES

Peak-calling for ChIP-seq and ATAC-seq

Nature Structural & Molecular Biology: doi: /nsmb.2419

ChromHMM Tutorial. Jason Ernst Assistant Professor University of California, Los Angeles

Activation of proto-oncogenes by disruption of chromosome neighborhoods

Transcript-indexed ATAC-seq for immune profiling

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

The Insulator Binding Protein CTCF Positions 20 Nucleosomes around Its Binding Sites across the Human Genome

Genome-wide Association Studies (GWAS) Pasieka, Science Photo Library

The search for cis-regulatory driver mutations in cancer genomes

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells.

Not IN Our Genes - A Different Kind of Inheritance.! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014

Epigenetics. Jenny van Dongen Vrije Universiteit (VU) Amsterdam Boulder, Friday march 10, 2017

High Throughput Sequence (HTS) data analysis. Lei Zhou

R. Piazza (MD, PhD), Dept. of Medicine and Surgery, University of Milano-Bicocca EPIGENETICS

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012

The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters.

Histones modifications and variants

Introduction to Systems Biology of Cancer Lecture 2

Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD,

RNA-seq Introduction

ChIP-seq data analysis

An epigenetic approach to understanding (and predicting?) environmental effects on gene expression

Processing, integrating and analysing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data

Eukaryotic transcription (III)

Chromatin marks identify critical cell-types for fine-mapping complex trait variants

Yingying Wei George Wu Hongkai Ji

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015

Genomic Methods in Cancer Epigenetic Dysregulation

Supplemental Figure 1. Genes showing ectopic H3K9 dimethylation in this study are DNA hypermethylated in Lister et al. study.

Epigenetics. Lyle Armstrong. UJ Taylor & Francis Group. f'ci Garland Science NEW YORK AND LONDON

Systems Analysis Of Chromatin-Related Protein Complexes In Cancer READ ONLINE

Genetics and Genomics in Medicine Chapter 6 Questions

Liposarcoma*Genome*Project*

Gene Expression DNA RNA. Protein. Metabolites, stress, environment

Ch. 18 Regulation of Gene Expression

RUNX1 and FPD/AML Translational Research. The Leukemia and Lymphoma Society / Babich Family Foundation Partnership. September 2016

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory

SUPPLEMENTARY INFORMATION. Intron retention is a widespread mechanism of tumor suppressor inactivation.

Supplemental Figure S1. Tertiles of FKBP5 promoter methylation and internal regulatory region

Nature Genetics: doi: /ng Supplementary Figure 1

Nature Methods: doi: /nmeth.3115

Supplemental Materials

CCCTC-Binding Factor. Synonyms. Definition. Characteristics

Jayanti Tokas 1, Puneet Tokas 2, Shailini Jain 3 and Hariom Yadav 3

Dominic J Smiraglia, PhD Department of Cancer Genetics. DNA methylation in prostate cancer

Data mining with Ensembl Biomart. Stéphanie Le Gras

Genome Control in Cell Identity and Disease! Development and cell identity Loss of cell identity and disease New diagnostics and therapeutics

3D genome organization in health and disease: emerging opportunities in cancer translational medicine

Assignment 5: Integrative epigenomics analysis

The epigenetic landscape of T cell subsets in SLE identifies known and potential novel drivers of the autoimmune response

Session 2: Biomarkers of epigenetic changes and their applicability to genetic toxicology

Epigenetics: The Future of Psychology & Neuroscience. Richard E. Brown Psychology Department Dalhousie University Halifax, NS, B3H 4J1

Nature Genetics: doi: /ng Supplementary Figure 1. Assessment of sample purity and quality.

SUPPLEMENTARY INFORMATION

Transcription and chromatin. General Transcription Factors + Promoter-specific factors + Co-activators

Sudin Bhattacharya Institute for Integrative Toxicology

Although DNA methylation has been considered the primary

Nature Immunology: doi: /ni Supplementary Figure 1. DNA-methylation machinery is essential for silencing of Cd4 in cytotoxic T cells.

Part-II: Statistical analysis of ChIP-seq data

Epigenetics Armstrong_Prelims.indd 1 04/11/2013 3:28 pm

The silence of the genes: clinical applications of (colorectal) cancer epigenetics

Mechanisms of alternative splicing regulation

Supplementary Figures

cis-regulatory enrichment analysis in human, mouse and fly

Histone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells

Patterns of Histone Methylation and Chromatin Organization in Grapevine Leaf. Rachel Schwope EPIGEN May 24-27, 2016

Dynamic Changes in Chromatin Accessibility Occur in CD8 + T Cells Responding to Viral Infection

Lecture 10. Eukaryotic gene regulation: chromatin remodelling

Reviewer #1 (Remarks to the Author)

LESSON 3.2 WORKBOOK. How do normal cells become cancer cells? Workbook Lesson 3.2

Precision medicine: How to exploit the growing knowledge on the evolving genomes of cells to improve cancer prevention and therapy.

DNA methylation & demethylation

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage

Supplemental Figure 1: Asymmetric chromatin maturation leads to epigenetic asymmetries on sister chromatids.

MIR retrotransposon sequences provide insulators to the human genome

How Cells Divide. Chapter 10

ChIP-seq analysis. J. van Helden, M. Defrance, C. Herrmann, D. Puthier, N. Servant, M. Thomas-Chollier, O.Sand

Regulation of Gene Expression in Eukaryotes

Stem Cell Epigenetics

Section D: The Molecular Biology of Cancer

TITLE: - Whole Genome Sequencing of High-Risk Families to Identify New Mutational Mechanisms of Breast Cancer Predisposition

Imprinting. Joyce Ohm Cancer Genetics and Genomics CGP-L2-319 x8821

Structural vs. nonstructural proteins

Epigenomics. Ivana de la Serna Block Health Science

Dynamic reprogramming of DNA methylation in SETD2- deregulated renal cell carcinoma

Epigenetics DNA methylation. Biosciences 741: Genomics Fall, 2013 Week 13. DNA Methylation

RALYL Hypermethylation: A Potential Diagnostic Marker of Esophageal Squamous Cell Carcinoma (ESCC) Junwei Liu, MD

Transcriptional control in Eukaryotes: (chapter 13 pp276) Chromatin structure affects gene expression. Chromatin Array of nuc

Open Access RESEARCH. Background Malaria is a major public health problem in many developing countries, with the parasite Plasmodium

Biomarker development in the era of precision medicine. Bei Li, Interdisciplinary Technical Journal Club

DNA replication timing and higher-order nuclear organization determine single nucleotide substitution patterns in cancer genomes

Clinical significance of genetic analysis in glioblastoma treatment

Soft Agar Assay. For each cell pool, 100,000 cells were resuspended in 0.35% (w/v)

New drugs in Acute Leukemia. Cristina Papayannidis, MD, PhD University of Bologna

Transcription:

Session 6: Integration of epigenetic data Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016

Utilizing complimentary datasets

Frequent mutations in chromatin regulators Watson et al, Emerging patterns of somatic mutations in cancer, Nat Rev Gen, 2013

Dawson & Kouzarides, Cell, 2012

High rate of mutation in chromatin regulators Kandoth et al, Nature, 2013

https://www.encodeproject.org

Chromatin immunoprecipitation (ChIP-seq) Transcription factors, chromatin-associate proteins, and other DNA-binding proteins (generally with formaldehyde crosslinking) Histone modifications (micrococcal nuclease treatment) Sonication to ~200-400 bp fragments Park PJ, Nature Reviews Genetics, 2009

ChIP-seq from clinical samples?

ENCODE (The Encyclopedia Of DNA Elements)

ENCODE (human + model organisms)

Integrating TCGA & ENCODE data Can we help interpret genome variations (e.g., germline or somatic mutations) with ENCODE-type data? Sequencing of cancer genomes has revealed many mutations in the non-coding regions. Do they belong to UTRs, enhancers, transcription factor binding sites, DNase hypersensitivity sites, etc.? Science, 2012

Chromatin landscape influences mutation rate Heterochromatin has elevated mutation rate Schuster-Bockler & Lehner,Chromatin organization is a major influence on regional mutation rates in human cancer cells. Nature, 2012

Chromatin state annotation of the genome Kharchenko et al, Nature, 2011

Annotating the genome with chromatin states Ernst et al, Mapping and analysis of chromatin state dynamics in nine human cell types, Nature, 2011

GWAS Non-coding variants enriched in regulatory DNA (DNase I hypersensitive sites) Maurano et al, Systematic localization of common disease-associated variation in regulatory DNA, Science, 2013

ATAC-seq Nature Methods, 2013

FunSeq Khurana et al, Integrative annotation of variants from 1092 humans: application to cancer genomics Science, 2013 Use enrichment of rare SNPs in 1000G as an estimate of purifying selection Regions of negative selection ( sensitive and ultrasensitive ) are enriched for disease-causing mutations

Roadmap Epigenomics

Roadmap Epigenomics

Matching genomic and epigenomic profiles There are no epigenetic data in TCGA other than DNA methylation Available epigenome data (ENCODE, Roadmap, etc) are mostly for normal tissues and cell lines - are they informative for my cancer genomes?

Cell-of-origin can be predicted by chromatin features Mutation rate is highly variable along the cancer genome Chromatin accessibility / chromatin modification / replication timing can explain the variance in mutation rates along cancer genomes (up to 86%) Cell type of origin of a cancer can be predicted based on the distribution of mutations along its genome. Polak et al, Cell-of-origin chromatin organization shapes the mutational landscape of cancer, Nature, 2015

Nucleosome footprint informs tissue-of-origin Sequencing cell-free DNA yields a genome-wide map of in vivo nucleosome occupancy! Identify tissues-of-origin of cell-free DNA using nucleosome maps Snyder et al, Cell-free DNA Comprises an In Vivo Nucleosome Footprint that Informs Its Tissues-Of-Origin, Cell, 2016

3D Interaction Data Dekker, Marti-Renom, Mirny, Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data, Nat Rev Gen, 2013

3D interactions: in situ Hi-C Rao et al, Cell, 2014

Hi-C Interaction Map Dekker et al, Nat Rev Gen, 2013

Histone modification & chromatin domains Ho, Jung, Liu et al, Comparative analysis of metazoan chromatin organization, Nature, 2014

Features in Hi-C data In situ Hi-C in GM12878 ~5 billion contacts, ~1 kb resolution. Rao et al, A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping, Cell, 2014

Promoter-enhancer loops are associated gene activation Rao et al, Cell, 2014

Features in Hi-C data Genomes are partitioned into contact domains (median: 185 kb), associated with distinct patterns of histone marks and segregated into subcompartments. ~10,000 loops. - frequently link promoters and enhancers - correlate with gene activation - conserved across cell types Loop anchors typically occur at domain boundaries and bind CTCF Rao et al, Cell, 2014

Disruption of CTCF binding Enhancers interact with their genes through loops. These interactions are usually contained within larger domains involving CTCF and cohesins Could somatic mutations disrupt the boundaries of these domains? Could the disruption of the boundaries lead to activation of oncogenes? Lucy Jung

Frequent mutations at CTCF-binding sites 213 colorectal tumors + ChIP-exo (CTCF/ Rad21) Katainen et al, CTCF/cohesin-binding sites are frequently mutated in cancer, Nature Genetics, 2015

Elevated mutations at CTCF binding sites across tumor types Lucy Jung

IDH mutant gliomas Flavahan et al, Insulator dysfunction and oncogene activation in IDH mutant gliomas, Nature, 2016 CTCF is methylationsensitive! Hypermethylation at cohesin/ctcf-binding sites compromises binding its binding Loss of CTCF at boundary -> aberrant enhancer interaction with PDGFRA (oncogene) Treatment with a demethylating agent partially restores insulator function and downregulates PDGFRA; CRISPR-mediated disruption of the CTCF motif in IDH wild-type gliomaspheres upregulates PDGFRA and increases proliferation

Oncogene activation via loop disruption in T-ALL Perturbation of boundaries in non-malignant cells activated oncogenes. Hnisz et al., Activation of proto-oncogenes by disruption of chromosome neighborhoods, Science, 2016

3D chromosomal interaction data A Common Fund project (2015-20) A large number of investigators with genomics and imaging expertise Will generate a large number of 3D interaction maps for the community