SUPPLEMENTARY INFORMATION

Similar documents
SUPPLEMENTARY FIGURES: Supplementary Figure 1

Nature Genetics: doi: /ng.2995

Nature Methods: doi: /nmeth.3115

The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from

Supplementary Figure 1. IDH1 and IDH2 mutation site sequences on WHO grade III

DNA methylation signatures for 2016 WHO classification subtypes of diffuse gliomas

SSM signature genes are highly expressed in residual scar tissues after preoperative radiotherapy of rectal cancer.

Supplemental Information. Molecular, Pathological, Radiological, and Immune. Profiling of Non-brainstem Pediatric High-Grade

Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first

Supplementary Figure 1. Metabolic landscape of cancer discovery pipeline. RNAseq raw counts data of cancer and healthy tissue samples were downloaded

Nature Genetics: doi: /ng Supplementary Figure 1. HOX fusions enhance self-renewal capacity.

SUPPLEMENTARY INFORMATION

SUPPLEMENTAL INFORMATION

gliomas. Fetal brain expected who each low-

Nature Medicine: doi: /nm.3967

SUPPLEMENTARY INFORMATION

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

Expert-guided Visual Exploration (EVE) for patient stratification. Hamid Bolouri, Lue-Ping Zhao, Eric C. Holland

Nature Structural & Molecular Biology: doi: /nsmb.2419

Supplementary Materials for

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63.

iplex genotyping IDH1 and IDH2 assays utilized the following primer sets (forward and reverse primers along with extension primers).

Meta-analysis of IDH-mutant cancers identifies EBF1 as a novel interaction partner for

SUPPLEMENTARY FIGURES

NIH Public Access Author Manuscript Nature. Author manuscript; available in PMC 2012 May 15.

Supplementary Figure 1. ALVAC-protein vaccines and macaque immunization. (A) Maximum likelihood

Supplementary Figure 1

Nature Medicine: doi: /nm.4439

Supplementary Figures

Comparison of Gene Set Analysis with Various Score Transformations to Test the Significance of Sets of Genes

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells.

Supplemental Figure S1. RANK expression on human lung cancer cells.

Supplementary Figures

Supplementary Information

Reviewer #1 (Remarks to the Author)

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and

Expanded View Figures

Journal: Nature Methods

Use Case 9: Coordinated Changes of Epigenomic Marks Across Tissue Types. Epigenome Informatics Workshop Bioinformatics Research Laboratory

S1 Appendix: Figs A G and Table A. b Normal Generalized Fraction 0.075

fl/+ KRas;Atg5 fl/+ KRas;Atg5 fl/fl KRas;Atg5 fl/fl KRas;Atg5 Supplementary Figure 1. Gene set enrichment analyses. (a) (b)

Integrative DNA methylome analysis of pan-cancer biomarkers in cancer discordant monozygotic twin-pairs

mir-509-5p and mir-1243 increase the sensitivity to gemcitabine by inhibiting

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples.

SUPPLEMENTARY INFORMATION

Nature Neuroscience: doi: /nn Supplementary Figure 1

Nature Immunology: doi: /ni Supplementary Figure 1. DNA-methylation machinery is essential for silencing of Cd4 in cytotoxic T cells.

Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022

COMPUTATIONAL OPTIMISATION OF TARGETED DNA SEQUENCING FOR CANCER DETECTION

Expanded View Figures

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

Nature Getetics: doi: /ng.3471

Title: Human breast cancer associated fibroblasts exhibit subtype specific gene expression profiles

SUPPLEMENTARY INFORMATION

Lentiviral Delivery of Combinatorial mirna Expression Constructs Provides Efficient Target Gene Repression.

EPIGENETIC RE-EXPRESSION OF HIF-2α SUPPRESSES SOFT TISSUE SARCOMA GROWTH

Supplementary Figure 1: LUMP Leukocytes unmethylabon to infer tumor purity

EXPression ANalyzer and DisplayER

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.

FOLLICULAR LYMPHOMA- ILLUMINA METHYLATION. Jude Fitzgibbon

underlying metastasis and recurrence in HNSCC, we analyzed two groups of patients. The

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach)

Large conserved domains of low DNA methylation maintained by Dnmt3a

Gene expression profiling predicts clinical outcome of prostate cancer. Gennadi V. Glinsky, Anna B. Glinskii, Andrew J. Stephenson, Robert M.

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1

Epigenetics. Jenny van Dongen Vrije Universiteit (VU) Amsterdam Boulder, Friday march 10, 2017

An annotated list of bivalent chromatin regions in human ES cells: a new tool for cancer epigenetic research

Supplementary information. Supplementary figure 1. Flow chart of study design

ASMS 2015 ThP 459 Glioblastoma Multiforme Subtype Classification: Integrated Analysis of Protein and Gene Expression Data

Supplementary Figure 1. Using DNA barcode-labeled MHC multimers to generate TCR fingerprints

Breeding scheme, transgenes, histological analysis and site distribution of SB-mutagenized osteosarcoma.

Supplementary Figure 1. Schematic diagram of o2n-seq. Double-stranded DNA was sheared, end-repaired, and underwent A-tailing by standard protocols.

Nature Genetics: doi: /ng.3731

SUPPLEMENTARY APPENDIX

Nature Immunology: doi: /ni Supplementary Figure 1. Transcriptional program of the TE and MP CD8 + T cell subsets.

Cluster Dendrogram. dist(cor(na.omit(tss.exprs.chip[, c(1:10, 24, 27, 30, 48:50, dist(cor(na.omit(tss.exprs.chip[, c(1:99, 103, 104, 109, 110,

Supplementary Figures

chapter 1 - fig. 2 Mechanism of transcriptional control by ppar agonists.

Chromatin marks identify critical cell-types for fine-mapping complex trait variants

(A) Cells grown in monolayer were fixed and stained for surfactant protein-c (SPC,

SubLasso:a feature selection and classification R package with a. fixed feature subset

Supplementary Information

Supplementary Figure S1 Expression of mir-181b in EOC (A) Kaplan-Meier

Nature Genetics: doi: /ng Supplementary Figure 1. Phenotypic characterization of MES- and ADRN-type cells.

Supplementary Materials for

Copyright 2014 The Authors.

TEB. Id4 p63 DAPI Merge. Id4 CK8 DAPI Merge

SUPPLEMENTARY INFORMATION

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD

p.r623c p.p976l p.d2847fs p.t2671 p.d2847fs p.r2922w p.r2370h p.c1201y p.a868v p.s952* RING_C BP PHD Cbp HAT_KAT11

SUPPLEMENTARY INFORMATION

Session 6: Integration of epigenetic data. Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016

Supplementary Materials

Understandable Statistics

Whole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute

Supplementary Information

Tissue of origin determines cancer-associated CpG island promoter hypermethylation patterns

Dynamic reprogramming of DNA methylation in SETD2- deregulated renal cell carcinoma

Transcription:

doi:10.1038/nature10866 a b 1 2 3 4 5 6 7 Match No Match 1 2 3 4 5 6 7 Turcan et al. Supplementary Fig.1 Concepts mapping H3K27 targets in EF CBX8 targets in EF H3K27 targets in ES SUZ12 targets in ES EED targets in EF SUZ12 targets in EF PRC targets in ES 0 10 20 30 40 -log 10 (Q-value) Supplementary Figure 1 Differential methylation in IDH mutant human astrocytes. (a) Box plots of relative methylation levels in passage 40 (P40) astrocytes expressing wild-type IDH1, mutant IDH1 (R132H), or neither. The average methylation levels of variant genes are shown. Data from two technical replicates for each of two biological replicates is shown. Error bars indicate 1 standard deviation (SD). (b) Concordance between PRC2 targets and genes methylated following introduction of mutant IDH1. Concepts transcriptional module mapping of genes with PRC2 occupancy with loci that are hypermethylated in astrocytes expressing mutant IDH1. Data is from passage 40. Each row shows individual PCR2 occupancy gene sets. Red=matching gene. EF=embryonic fibroblasts. ES=embryonic stem cells. WWW.NATURE.COM/NATURE 1

a Consensus CDF Delta area Change in Gini CDF Proportion increase delta(gini) Consensus index value # clusters # clusters b 2 WWW.NATURE.COM/NATURE

c 100% 100% 90% 90% 80% 80% 70% 70% 60% 50% Not Defined CpG Shelf 60% 50% Not Defined Body 5' UTR 40% CpG Shore 40% 3' UTR 30% 20% CpG Island 30% 20% TSS 200-1500bp TSS <200bp 10% 10% 0% Hypermethylated Promoter promoterassociated Associated Probes probes 0% Promoter Associated Hypermethylated gene body- Probes associated probes d Distribution of CpG sites among hypermethylated genes WWW.NATURE.COM/NATURE 3

e f 4 WWW.NATURE.COM/NATURE

g h WWW.NATURE.COM/NATURE 5

Supplementary Figure 2 K-means statistical output and validation of differentially methylated regions in human tumors. (a) Statistical output from K-means consensus clustering analysis on 81 glioma samples. Plot showing cumulative distribution function (CDF) for k=2 to k=5. The shape of the CDF plot allows one to determine the ideal k. Delta area curves are shown for each cluster number. Curve for each cluster number is shown as labeled in the legend. The corresponding Gini coefficient is listed in each case. The change in Gini curve is displayed, showing the change in Gini coefficient as a function of cluster number. Gini index reflects the area of the Lorenz curve. (b) Relative methylation of the genes analyzed in Fig. 2b in G-CIMP+ versus G-CIMP tumors. Whisker box plots summarizing each data set are shown (mean ± standard deviation (SD)). The boxes delineate the 25th to 75th percentile range. The P value indicates significance determined using ANOVA. Data from top 2% most variant probes. (c) The plot on the left shows the breakdown of hypermethylated probes that are located in CpG islands, shores, shelves, or not defined. The plot on the right shows breakdown of gene body-associated probes. TSS=transcriptional start site, UTR=untranslated region. (d) The pie chart indicates the relative locations (and overlap) of the hypermethylated CpG sites. The percentage of hypermethylated CpGs located in shores, islands, both or neither are shown. EpiTYPER was used to validate CpG island methylation states of parental and mutant IDH1 astrocytes at passages 2 and 50 (e); in CIMP+ and CIMP- LGGs (f); and shore methylation states in parental, mutant IDH1 astrocytes and CIMP+/CIMP- LGGs (g). Overall, these examples include (1) unmethylated genes that have undergone de novo methylation following introduction of mutant IDH1 in astrocytes, (2) genes with low levels of methylation that develop high methylation levels following introduction of mutant IDH1, and (3) representative genes from the two groups above that are methylated in CIMP+ LGGs. Each circle indicates a CpG dinucleotide. The percentage of methylation is shown by the color scale in the legend. The genomic location is noted. PAR, parental; Mut, IDH1 mutant; IVD, in vitro methylated DNA (methylated control); WGA, whole genome amplified DNA (unmethylated control). (h) Frequencies of IDH1 and IDH 2 mutations in G-CIMP+ and G-CIMPtumors. 6 WWW.NATURE.COM/NATURE

0 p<0.001 0 2 4 6 Time (years) Supplementary Figure 3 Survival benefit of CIMP+ phenotype in MSKCC dataset. Kaplan-Meier survival curve of MSKCC patients (n=72) with astrocytomas (n=31)(left), and grade 3 gliomas (n=43)(right). P values calculated by log-rank. MSKCC Cohort CIMP by Age 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% 40 yrs 40-50 yrs 50-60 yrs >60 yrs P=0.005 CIMP Negative CIMP Positive Supplementary Figure 4 CIMP distribution by age of patient. Stacked column chart showing CIMP phenotype by age in 81 patients. CIMP positive phenotype is associated with younger age. P value calculated by chi-squared test WWW.NATURE.COM/NATURE 7

MSKCC Cohort CIMP by Histology 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% CIMP Negative CIMP Positive P<0.005 Supplementary Figure 5 CIMP distribution by histology. Stacked column chart showing CIMP status by histology in 81 patients. Oilgodendroglioma histology is associated with the greatest incidence of CIMP positive phenotype. P value calculated by chi-squared test. CIMP by Histology and Grade 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% CIMP Negative CIMP Positive Supplementary Figure 6 CIMP distribution by histology and grade. Stacked column chart showing CIMP status by grade and histology. 8 WWW.NATURE.COM/NATURE

MSKCC LGG samples TCGA GBM samples Idh1 mut vs idh1 wt Idh1 wt vs idh1 mut # of perm. ES NES Nom p-val FDR q-val FWER p-val 10000 1 1.74 0 0 0 idh1 mut vs idh1 wt idh1 wt vs idh1 mut # of perm ES NES Nom p-val FDR q-val FWER p-val 1000 3 1.96 0 0 0 10000 3 1.94 0 0 0 1000-3 - 1.95 8.9E- 4 8.9E- 4 3.0E- 4 10000-3 - 1.94 0 0 0 Supplementary Figure 7 Enrichment of hypermethylated probes derived from IDH1 mutant astrocytes. 730 of the most hypermethylated probes in IDH1 mutant expressing astrocytes were significantly enriched in IDH1 mutant MSKCC lower grade gliomas (left, 1 panel). The GSEA enrichment scores and their significance are noted for 10,000 permutations. The same hypermethylated probes were positively enriched in IDH1 mutant TCGA GBM tumor samples when compared to IDH1 wild-type GBM samples. Similarly, these probes are not methylated in IDH1 wild-type TCGA GBM tumors when compared to IDH1 mutant TCGA GBM tumors (right, 2 panels). GSEA enrichment scores for 1,000 and 10,000 iterations are shown. Perm, permutations; ES, enrichment score; NES, normalized enrichment score, NOM, nominal; FDR; false discovery rate; FWER, family-wise error rate. Genes Glioma samples CIMP status CIMP+ CIMP- Supplementary Figure 8 Accuracy of 17 gene signature in predicting CIMP. Two-dimensional unsupervised clustering using the probes derived from the 17 gene signature divides the MSKCC cohort into two clusters, representing a predicted CIMP+ and predicted CIMP- phenotype based on expression data. The blue and red labels on the dendrogram represent CIMP based on methylation data. WWW.NATURE.COM/NATURE 9

Genes Glioma samples CIMP status Predicted CIMP+ Predicted CIMP- Supplementary Figure 9 Identification of predicted CIMP groups in Rembrandt validation cohort. Two dimensional unsupervised clustering using the probes derived from the 17 gene signature clusters the 115 patients with grade II and grade III glioma from the Rembrandt database into two groups, classified as predicted CIMP+ and predicted CIMPphenotype. Cluster A Cluster B Astrocytoma Age>50 Grade 2 Cumulative Survival 0 p=0.003 0 2 4 6 8 Time (years) Cumulative Survival 0 P<0.001 0 2 4 6 Time (years) Cumulative Survival 0 P<0.001 0 2 4 6 8 Time (years) Supplementary Figure 10 Survival benefit of CIMP+ phenotype in Rembrandt dataset. Kaplan-Meier survival curve shows an improvement in survival in patients predicted to be CIMP+ in the Rembrandt validation dataset. Displayed are survival curves demonstrating the survival benefit in astrocytoma, tumors with grade II histology, age >50. P value calculated by log-rank. 10 WWW.NATURE.COM/NATURE

a Genes Proneural TCGA samples IDH1 Wild-type Mutant Cluster A B b Survival (probability) 0 Proneural p=0.001 0 p=0.132 0 5 10 15 20 25 30 0 5 10 15 20 25 30 Neural Cluster A Cluster B Classical Mesenchymal Survival (probability) 0 p=92 0 5 10 15 20 25 30 Time (months) 0 p=04 0 5 10 15 20 25 30 Time (months) WWW.NATURE.COM/NATURE 11

Supplementary Figure 11 Classification of TCGA GBM data using mutant IDH1 gene signature (a) Gene signature (17 genes) derived from mutant IDH1 expressing astrocytes identifies two distinct groups within proneural group of GBMs (TCGA tumor set). Heatmap for proneural GBMs generated from unsupervised 2Dhierarchical clustering. Color scale indicates normalized expression level. (b) Kaplan-Meier survival curves for GBMs that either have (red) or do not have (black) the 17 gene signature for IDH1 mutation. Results are shown for each GBM transcriptional subgroup. Subgroup 1 (green, from a), which contains all IDH1 mutants, was associated with improved survival. No subgroups associated with differing prognosis were identified in other TCGA subgroups. 12 WWW.NATURE.COM/NATURE

Supplementary Figure 12 Concordance of transcriptional programs regulated by mutant IDH1 in astrocytes and G-CIMP in LGGs. Analyses of enriched transcriptional programs in G-CIMP LGGs and mutant IDH1 astrocytes by PANTHER as described in Methods. P value for significance is shown along the x-axis. Yellow lines indicate threshold of significance (P=0.05). a b IDH1 WT IDH1 R132H Expression fold change (IDH1 mutant / parental) BMI1 RIF1 DICER1 LIG4 NES Supplementary Figure 13 Expression of mutant IDH1 promotes a neurosphere phenotype and induces expression of stem cell-associated genes. (a) Astrocytes (P15) that express IDH R132H or isogenic controls were grown in media that supports neural stem cell growth as in Methods. Experiment was performed in triplicate. (b) Introduction of mutant IDH1 into astrocytes induces expression of multiple glioma stem cell-associated markers. Data from expression microarrays are shown. Error bars indicate 1 SD. Yellow line marks equality. Passage 27 cells were used in each case. WWW.NATURE.COM/NATURE 13

TET2 negative TET2 positive % of Max FLAG FSC 5-hydroxymethylcytosine Supplementary Figure 14 Increased 5-methylcytosine hydroxylation by TET2. TET2 transfected HEK 293T cells were gated as TET2 positive or TET2 negative by FLAG antibody. 5-hydroxymethylcytosine levels are overlaid as histograms for both TET2 positive and negative populations. Increased 5-hydroxymethylcytosine levels are observed in the TET2 positive population. 14 WWW.NATURE.COM/NATURE