Nature Genetics: doi: /ng Supplementary Figure 1

Size: px
Start display at page:

Download "Nature Genetics: doi: /ng Supplementary Figure 1"

Transcription

1 Supplementary Figure 1 Expression deviation of the genes mapped to gene-wise recurrent mutations in the TCGA breast cancer cohort (top) and the TCGA lung cancer cohort (bottom). For each gene (each pair of red and blue bars), the average degree of expression deviation is compared between the mutated samples (red bar) and non-mutated samples (blue bar). For 72% (top) and 73% (bottom) of the cases, the mutated samples exhibited a higher degree of expression deviation. Genes were sorted according to the degree of the difference between the mutated and non-mutated groups. All genes are collapsed in the box plots on the right.

2 Supplementary Figure 2 Effects of copy number alterations versus mutations. (a) The effects of copy number alterations for genes ordered as in Figure 1c. For each gene (each pair of red and blue bars), the percentage of samples with copy number alterations is compared between the mutated samples (red bar) and non-mutated samples (blue bar). All genes are collapsed in the box plots on the right. (b) Outcome of multiple linear regression quantitatively comparing the contribution of mutation recurrence and copy number alteration to expression perturbation. Copy number alteration was processed through the deviation metric used for expression perturbation. The contribution of mutation burden remained significant after adjusting for copy number alteration.

3 Supplementary Figure 3 Permutation tests for expression perturbation by mutations. Gene expression profiles based on the RNA sequencing of 92 breast and 90 lung cancer samples were obtained from the TCGA and matched with the whole-genome sequences of the TCGA cohort samples. Array-based gene expression profiles of breast cancer samples were obtained from the ArrayExpress archive (E-MTAB-1088) and matched with the whole-genome sequences of the Sanger cohort samples. We obtained expression difference for each gene between the mutated and non-mutated groups as the t statistic. To estimate expected expression differences, the sample grouping (mutated versus non-mutated) was randomized while maintaining the sample size of each group. This permutation was repeated 1,000 times, and the t value was recalculated each time.

4 Supplementary Figure 4 Directionality of expression perturbation by mutations. The x axis has genes ordered as in Figure 1c. The y axis indicates the fraction of cases (samples) that go in the same direction (either positive or negative expression deviation).

5 Supplementary Figure 5 Recurrence rate (the percentage of mutated samples in the given cohort) was obtained for each mutation (for site-specific recurrence) or each gene (for gene-level recurrence). The site-specific recurrence rate was mapped to the target gene via the same chromatin interactome used for the gene-level recurrence analysis. P value indicates the differences in the recurrence rate between the known cancer genes and other genes.

6 Supplementary Figure 6 Permutation tests for biological validity of the chromatin interaction data. We shuffled chromatin interaction links between enhancers and promoters. This procedure was repeated 1,000 times. High recurrence rates were observed for known cancer genes (Fig. 2c), cancer gene interacting partners (Fig. 2d), and genes with relevant GO terms (Fig. 2e). We tested whether these patterns disappear as target genes are randomly mapped to mutations. To this end, the average recurrence rate of the above three groups of genes was obtained and divided by that of all genes. These relative recurrence rates were significantly higher with the real chromatin interactome map (black bars) than with the 1,000 randomized maps (gray density plots).

7 Supplementary Figure 7 Results of site-specific LOOCV The proportion of positive votes by 1,000 decision trees in our random forest for site-specific LOOCV compared between truly recurrent mutations (green) and false, non-recurrent, mutations (red) depending on tumor and epigenome types.

8 Supplementary Table 1. Whole genome mutation data Cohort Cancer type Number of samples Number of point mutations Breast ,695 Sanger Lung 24 1,446,336 GIS Lung 30 1,763,572 Breast ,434 TCGA Lung 90 3,594,840

9 Supplementary Table 2. Long-range enhancer-promoter data Cancer type State Cell line Data source Number of interaction pairs MCF7 ChIA-PET 24,501 Cancer MCF7 IM-PET 25,746 MCF7, T47D DHS tag correlation 66,222 Breast MCF7, T47D CAGE expression correlation 18,362 HMEC IM-PET 20,254 Cell of origin HMEC DHS tag correlation 75,167 HMEC CAGE expression correlation 26,001 A549 IM-PET* 27,435 Cancer A549 DHS tag correlation 50,135 Lung A549 CAGE expression correlation 18,509 NHLF, IMR90 IM-PET 40,189 Cell of origin AGO4550, HPF, NHLF, WI38 DHS tag correlation 70,785 AGO4550, HPF, NHLF, WI38 CAGE expression correlation 25,400 DHS tag correlation: Correlation of tag density between promoter DHSs and non-promoter DHSs within ± 500kb of the tss and within the same topologically associated domain CAGE expression correlation: TSS-enhancer association based on CAGE expression levels of erna and mrna in the FANTOM5 panel *This data was generated as described in the Methods

10 Supplementary Table 4. Random forest input features Target gene features Binding TF features TFBS features Genomic features Cancer epigenome features Cell-of-origin epigenome features Feature ID Target_Cancer_Gene Target_HumanNet.CGS_L1 Target_HumanNet.CGS_L2 Target_Interactome.CGS_L1 Target_Interactome.CGS_L2 Target_GO_cell.death Target_GO_cell.differentiation Target_GO_cell.proliferation Target_GO_mitototic.cell.cycle Target_DEG_score Target_Expression Target_CNA TF_Cancer_Gene TF_HumanNet.CGS_L1 TF_HumanNet.CGS_L2 TF_Interactome.CGS_L1 TF_Interactome.CGS_L2 TF_GO_cell.death TF_GO_cell.differentiation TF_GO_cell.proliferation TF_GO_mitototic.cell.cycle TF_DEG_score diff_pval avg_pval TF_sign Dist_GWAS_2D Early.to.late_Rate PhastCons Dnase H3k20me1 H3k27ac H3k27me3 H3k36me3 H3k4me1 H3k4me2 H3k4me3 H3k79me2 H3k9ac H3k9me3 Orig_Dnase Orig_H3k20me1 Orig_H3k27ac Orig_H3k27me3 Orig_H3k36me3 Orig_H3k4me1 Orig_H3k4me2 Orig_H3k4me3 Orig_H3k79me2 Orig_H3k9ac Orig_H3k9me3 Description Whether the target gene is a known cancer gene Number of one-hop interactions of the target gene with known cancer genes in HumanNet Number of two-hop interactions of the target gene with known cancer genes in HumanNet Number of one-hop interactions of the target gene with known cancer genes in Interactome Number of two-hop interactions of the target gene with known cancer genes in Interactome Whether the target gene belongs to a Gene Ontology term related to cell death Whether the target gene belongs to a Gene Ontology term related to cell differentation Whether the target gene belongs to a Gene Ontology term related to cell proliferation Whether the target gene belongs to a Gene Ontology term related to mitotic cell cycle Differential expression of the target gene between cancer and normal tissues* Expression level of the target gene in cancer Copy number alteration of the target region Whether the regulator is a known cancer gene Number of one-hop interactions of the regulator with known cancer genes in HumanNet Number of two-hop interactions of the regulator with known cancer genes in HumanNet Number of one-hop interactions of the regulator with known cancer genes in Interactome Number of two-hop interactions of the regulator with known cancer genes in Interactome Whether the regulator belongs to a Gene Ontology term related to cell death Whether the regulator belongs to a Gene Ontology term related to cell differentation Whether the regulator belongs to a Gene Ontology term related to cell proliferation Whether the regulator belongs to a Gene Ontology term related to mitotic cell cycle Differential expression of the regulator gene between cancer and normal tissues* Change in the P value of the motif score by the mutation Average of the P values of the motif scores with and without the mutation Gain or loss of the TF binding site by the mutation Chromosomal distance from the mutation to the closest cancer GWAS locus Replication timing represented as the early-to-late ratio, (G1B+S1) /(S4+G2)** Evolutionary conservation of the underlying sequences as meaured by PhastCons DNase peak score in the cancer cell line H3K20me1 modification peak score in the cancer cell line H3K27ac modification peak score in the cancer cell line H3K27me3 modification peak score in the cancer cell line H3K36me3 modification peak score in the cancer cell line H3K4me1 modification peak score in the cancer cell line H3K4me2 modification peak score in the cancer cell line H3K4me3 modification peak score in the cancer cell line H3K79me2 modification peak score in the cancer cell line H3K9ac modification peak score in the cancer cell line H3K9me3 modification peak score in the cancer cell line DNase peak score in the cells of origin H3K20me1 modification peak score in the cells of origin H3K27ac modification peak score in the cells of origin H3K27me3 modification peak score in the cells of origin H3K36me3 modification peak score in the cells of origin H3K4me1 modification peak score in the cells of origin H3K4me2 modification peak score in the cells of origin H3K4me3 modification peak score in the cells of origin H3K79me2 modification peak score in the cells of origin H3K9ac modification peak score in the cells of origin H3K9me3 modification peak score in the cells of origin * Sigmoid normalization of the P value of the T test as (log P)/(1+log P) ** Ref:

11 Supplementary Table 7. Correlation of gene-level mutation burden and expression level Survival gene Survival P value (expressionsurvival correlation) Correlation between mutation burden and expression level Correlation P value SNORA E-05 SNORA14B E-04 LAMP E-03 MAPK E-03 PRSS E-03 PITPNM E-03 LOC E-03 APCS E-03 VCX3B E-03 CARS E-03 RPS6KA E-03

12 Supplementary Table 9. Gained motifs for ETS transcription factors by distal tert mutations Mutation ID Motif start Motif end P value ( < 1E-03) ETS family tert E-04 SPIC tert E-04 SPIC tert E-04 ELK1 tert E-04 GABPA tert E-05 FLI1 tert E-04 ELK1 tert E-04 GABPA tert E-04 ELK4 tert E-04 ERG tert E-04 ELK1 tert E-04 ELF1 tert E-04 GABPA tert E-04 ELK4 tert E-04 GABPA tert E-04 GABPA tert E-04 ETV3 tert E-04 GABPA tert E-04 ELK1 tert E-04 ELF1 tert E-04 ELK4 tert E-04 GABPA tert E-04 ETS1 tert E-04 ELK1 tert E-04 ELK4

13 Supplementary Note Background mutation model We first chose the mutations that appear in two or more samples either gene-wise or in a site-specific manner, and then measured the statistical significance of mutation occurrence in a target region by considering local background mutation rates. As for site recurrence, we defined the target region as 5, 10, or 20 bp windows centred on the mutation of interest. As for gene-level recurrence, the mutation-containing regulatory region that is defined in the associated chromatin interactome data was used as the target region. We estimated the expected mutation frequencies per site based on background mutation density in windows of varying size (1 kb, 5 kb, 10 kb, or 100 kb). Consequently, the probability that a given region is mutated can be given as Prob target region R is mutated = 1 1 Prob site s is mutated. Given a very low per-site mutation frequency, the expected regional probability of mutation occurrence, p, was approximated as follows:!! p = Prob target region R is mutated!! Prob site s is mutated. Given p and the total number of samples n, the number of samples in which the given region is mutated follows the binomial distribution, B(n, p). We used Poisson approximation with parameter np for the binomial distribution. Therefore, the statistical significance of mutation occurrence x in a given target region was represented as the P value,

14 !!! e P X x = 1!!" np! i!!!!, where x = 1,2,, n and X ~ Poisson np. Only the target regions with P < 0.05 were retained. For site-specific significance, we used the pan-cancer data that were described in the Whole genome cancer sequences section. Additionally, the significance of gene-level occurrence was tested. We mapped the pan-cancer data including irrelevant tumour types to the cell-type-specific chromatin interactome to estimate the expected gene-level probability of mutation occurrence as q = Prob target gene G is mutated E/N, where E is the number of the pan-cancer samples with the given gene mutated, and N is the total number of the pan-cancer samples. Then, the number of cohort samples with the given gene mutated was assumed to follow the binomial distribution, B(n, q). Under the null hypothesis that cell-type specific recurrence was not different from the pan-cancer recurrence, the P value can be calculated as follows: P Y y = 1!!!!!! n i q! (1 q)!!!, where y = 1,2,, n and Y ~ B n, q. Finally, the mutations with the gene recurrence P value < 0.01 or < 0.05 were included in the true set.

15 Cohort extension analysis When the Sanger cohort was used as the initial (reference) cohort, the TCGA cohort was employed as the extension (rescue) cohort, and vice versa. We performed the LOOCV as described above for the initial cohort. The threshold fraction of votes between positive and negative calls was determined by applying the Jenks natural breaks classification method 1 for the distribution of votes for the two classes (true or false) of mutations. In other words, we sought a threshold that can minimize each class s average deviation from the class s mean vote while maximizing each class s deviation from the mean votes of the other classes. We then looked for the mutations that were positively predicted as recurrent although not actually being recurrent in the initial reference cohort but turned out to be recurrent when the gene-level recurrence rate was obtained after combining the initial and extension cohorts. These mutations were referred to as rescued mutations. The rescue frequency was calculated as the fraction of the rescued mutations over all mutations, that is, Rescue frequency % = True!"#$%&'( True!"#$%&'( + False!"#$%&'( 100, where True!"#$%&'( indicates the number of true (recurrent) mutations in the combined cohort and False!"#$%&'( is the number of false (non-recurrent) mutations in the combined cohort. This frequency was obtained for the initial positive calls and negative calls, separately. Next, we wanted to estimate the expected number of samples required to achieve the rescue frequency of 100% for positive calls. First, by a simple linear extrapolation, this number could be obtained as N 100 Rescue frequency or N Positive True!"#$%&'(, where N is the number of samples in the extension (rescue) cohort and Positive indicates the number of all positive calls. However, not all positive calls can be true and expected to be rescued.

16 Therefore, we estimated the number of true positives by assessing the precision of our random forest classifier in the LOOCV for the given initial cohort and the Jenks threshold. Because the precision is the ratio of true positives over all positives, the adjusted number of required samples should be N Positive Precision True!"#$%&'(.However, false negatives also should be taken into account. Sensitivity or recall is the ratio of true positives over all trues (true positives and false negatives). We assessed the sensitivity of our random forest classifier by estimating the number of false negatives as the initial negative calls that showed recurrence upon cohort extension. Hence, the final adjusted number of required samples can be given as N Positive Precision Recall True!"#$%&'(. We repeated all these calculations for each initial-extension cohort combination and also separately for the predictions based on cancer epigenomes and cell-of-origin epigenomes. 1. Jenks, G. F. & Caspall, F. C. Error on Choropleth Maps: Definition, Measurement, Reduction. Ann. Assoc. Am. Geogr. 61, (1971).

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Supplementary Figure 1 7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Regions targeted by the Even and Odd ChIRP probes mapped to a secondary structure model 56 of the

More information

Nature Structural & Molecular Biology: doi: /nsmb.2419

Nature Structural & Molecular Biology: doi: /nsmb.2419 Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb

More information

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

Accessing and Using ENCODE Data Dr. Peggy J. Farnham 1 William M Keck Professor of Biochemistry Keck School of Medicine University of Southern California How many human genes are encoded in our 3x10 9 bp? C. elegans (worm) 959 cells and 1x10 8 bp 20,000

More information

Chromatin marks identify critical cell-types for fine-mapping complex trait variants

Chromatin marks identify critical cell-types for fine-mapping complex trait variants Chromatin marks identify critical cell-types for fine-mapping complex trait variants Gosia Trynka 1-4 *, Cynthia Sandor 1-4 *, Buhm Han 1-4, Han Xu 5, Barbara E Stranger 1,4#, X Shirley Liu 5, and Soumya

More information

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. Supplementary Figure 1 Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. (a) Pearson correlation heatmap among open chromatin profiles of different

More information

Supplementary Figure 1. Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Nature Immunology: doi: /ni.

Supplementary Figure 1. Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Nature Immunology: doi: /ni. Supplementary Figure 1 Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Expression of Mll4 floxed alleles (16-19) in naive CD4 + T cells isolated from lymph nodes and

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Figure 1. Pan-cancer analysis of global and local DNA methylation variation a) Variations in global DNA methylation are shown as measured by averaging the genome-wide

More information

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Kaifu Chen 1,2,3,4,5,10, Zhong Chen 6,10, Dayong Wu 6, Lili Zhang 7, Xueqiu Lin 1,2,8,

More information

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Philipp Bucher Wednesday January 21, 2009 SIB graduate school course EPFL, Lausanne ChIP-seq against histone variants: Biological

More information

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes.

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes. Supplementary Figure 1 Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes. (a,b) Values of coefficients associated with genomic features, separately

More information

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63.

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63. Supplementary Figure Legends Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63. A. Screenshot of the UCSC genome browser from normalized RNAPII and RNA-seq ChIP-seq data

More information

MIR retrotransposon sequences provide insulators to the human genome

MIR retrotransposon sequences provide insulators to the human genome Supplementary Information: MIR retrotransposon sequences provide insulators to the human genome Jianrong Wang, Cristina Vicente-García, Davide Seruggia, Eduardo Moltó, Ana Fernandez- Miñán, Ana Neto, Elbert

More information

Supervised Learner for the Prediction of Hi-C Interaction Counts and Determination of Influential Features. Tyler Yue Lab

Supervised Learner for the Prediction of Hi-C Interaction Counts and Determination of Influential Features. Tyler Yue Lab Supervised Learner for the Prediction of Hi-C Interaction Counts and Determination of Influential Features Tyler Derr @ Yue Lab tsd5037@psu.edu Background Hi-C is a chromosome conformation capture (3C)

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Assessment of sample purity and quality.

Nature Genetics: doi: /ng Supplementary Figure 1. Assessment of sample purity and quality. Supplementary Figure 1 Assessment of sample purity and quality. (a) Hematoxylin and eosin staining of formaldehyde-fixed, paraffin-embedded sections from a human testis biopsy collected concurrently with

More information

Reconstruction of enhancer-target networks in 935 samples of human primary cells, tissues and cell lines

Reconstruction of enhancer-target networks in 935 samples of human primary cells, tissues and cell lines Reconstruction of enhancer-target networks in 935 samples of human primary cells, tissues and cell lines Qin Cao, Christine Anyansi,2, Xihao Hu, Liangliang Xu 3, Lei Xiong 4, Wenshu Tang 3, Myth T.S. Mok

More information

of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed.

of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed. Supplementary Note The potential association and implications of HBV integration at known and putative cancer genes of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed. Human telomerase

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Figure 1. Heatmap of GO terms for differentially expressed genes. The terms were hierarchically clustered using the GO term enrichment beta. Darker red, higher positive

More information

Expanded View Figures

Expanded View Figures Solip Park & Ben Lehner Epistasis is cancer type specific Molecular Systems Biology Expanded View Figures A B G C D E F H Figure EV1. Epistatic interactions detected in a pan-cancer analysis and saturation

More information

Supplementary Figure 1: High-throughput profiling of survival after exposure to - radiation. (a) Cells were plated in at least 7 wells in a 384-well

Supplementary Figure 1: High-throughput profiling of survival after exposure to - radiation. (a) Cells were plated in at least 7 wells in a 384-well Supplementary Figure 1: High-throughput profiling of survival after exposure to - radiation. (a) Cells were plated in at least 7 wells in a 384-well plate at cell densities ranging from 25-225 cells in

More information

ChIP-seq analysis. J. van Helden, M. Defrance, C. Herrmann, D. Puthier, N. Servant, M. Thomas-Chollier, O.Sand

ChIP-seq analysis. J. van Helden, M. Defrance, C. Herrmann, D. Puthier, N. Servant, M. Thomas-Chollier, O.Sand ChIP-seq analysis J. van Helden, M. Defrance, C. Herrmann, D. Puthier, N. Servant, M. Thomas-Chollier, O.Sand Tuesday : quick introduction to ChIP-seq and peak-calling (Presentation + Practical session)

More information

Peak-calling for ChIP-seq and ATAC-seq

Peak-calling for ChIP-seq and ATAC-seq Peak-calling for ChIP-seq and ATAC-seq Shamith Samarajiwa CRUK Autumn School in Bioinformatics 2017 University of Cambridge Overview Peak-calling: identify enriched (signal) regions in ChIP-seq or ATAC-seq

More information

ChIP-seq data analysis

ChIP-seq data analysis ChIP-seq data analysis Harri Lähdesmäki Department of Computer Science Aalto University November 24, 2017 Contents Background ChIP-seq protocol ChIP-seq data analysis Transcriptional regulation Transcriptional

More information

Session 6: Integration of epigenetic data. Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016

Session 6: Integration of epigenetic data. Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016 Session 6: Integration of epigenetic data Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016 Utilizing complimentary datasets Frequent mutations in chromatin regulators

More information

Inferring Biological Meaning from Cap Analysis Gene Expression Data

Inferring Biological Meaning from Cap Analysis Gene Expression Data Inferring Biological Meaning from Cap Analysis Gene Expression Data HRYSOULA PAPADAKIS 1. Introduction This project is inspired by the recent development of the Cap analysis gene expression (CAGE) method,

More information

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22. Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.32 PCOS locus after conditioning for the lead SNP rs10993397;

More information

Yingying Wei George Wu Hongkai Ji

Yingying Wei George Wu Hongkai Ji Stat Biosci (2013) 5:156 178 DOI 10.1007/s12561-012-9066-5 Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis,

More information

CTCF-Mediated Functional Chromatin Interactome in Pluripotent Cells

CTCF-Mediated Functional Chromatin Interactome in Pluripotent Cells SUPPLEMENTARY INFORMATION CTCF-Mediated Functional Chromatin Interactome in Pluripotent Cells Lusy Handoko 1,*, Han Xu 1,*, Guoliang Li 1,*, Chew Yee Ngan 1, Elaine Chew 1, Marie Schnapp 1, Charlie Wah

More information

Supplementary information for: Human micrornas co-silence in well-separated groups and have different essentialities

Supplementary information for: Human micrornas co-silence in well-separated groups and have different essentialities Supplementary information for: Human micrornas co-silence in well-separated groups and have different essentialities Gábor Boross,2, Katalin Orosz,2 and Illés J. Farkas 2, Department of Biological Physics,

More information

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data Breast cancer Inferring Transcriptional Module from Breast Cancer Profile Data Breast Cancer and Targeted Therapy Microarray Profile Data Inferring Transcriptional Module Methods CSC 177 Data Warehousing

More information

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells.

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells. Supplementary Figure 1 Characteristics of SEs in T reg and T conv cells. (a) Patterns of indicated transcription factor-binding at SEs and surrounding regions in T reg and T conv cells. Average normalized

More information

Gene-microRNA network module analysis for ovarian cancer

Gene-microRNA network module analysis for ovarian cancer Gene-microRNA network module analysis for ovarian cancer Shuqin Zhang School of Mathematical Sciences Fudan University Oct. 4, 2016 Outline Introduction Materials and Methods Results Conclusions Introduction

More information

COMPUTATIONAL OPTIMISATION OF TARGETED DNA SEQUENCING FOR CANCER DETECTION

COMPUTATIONAL OPTIMISATION OF TARGETED DNA SEQUENCING FOR CANCER DETECTION COMPUTATIONAL OPTIMISATION OF TARGETED DNA SEQUENCING FOR CANCER DETECTION Pierre Martinez, Nicholas McGranahan, Nicolai Juul Birkbak, Marco Gerlinger, Charles Swanton* SUPPLEMENTARY INFORMATION SUPPLEMENTARY

More information

cis-regulatory enrichment analysis in human, mouse and fly

cis-regulatory enrichment analysis in human, mouse and fly cis-regulatory enrichment analysis in human, mouse and fly Zeynep Kalender Atak, PhD Laboratory of Computational Biology VIB-KU Leuven Center for Brain & Disease Research Laboratory of Computational Biology

More information

Plasma-Seq conducted with blood from male individuals without cancer.

Plasma-Seq conducted with blood from male individuals without cancer. Supplementary Figures Supplementary Figure 1 Plasma-Seq conducted with blood from male individuals without cancer. Copy number patterns established from plasma samples of male individuals without cancer

More information

Lentiviral Delivery of Combinatorial mirna Expression Constructs Provides Efficient Target Gene Repression.

Lentiviral Delivery of Combinatorial mirna Expression Constructs Provides Efficient Target Gene Repression. Supplementary Figure 1 Lentiviral Delivery of Combinatorial mirna Expression Constructs Provides Efficient Target Gene Repression. a, Design for lentiviral combinatorial mirna expression and sensor constructs.

More information

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory Computational aspects of ChIP-seq John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory ChIP-seq Using highthroughput sequencing to investigate DNA

More information

Processing, integrating and analysing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data

Processing, integrating and analysing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data Processing, integrating and analysing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data Bioinformatics methods, models and applications to disease Alex Essebier ChIP-seq experiment To

More information

Histone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells

Histone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells Histone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells Ondrej Podlaha 1, Subhajyoti De 2,3,4, Mithat Gonen 5, Franziska Michor 1 * 1 Department of Biostatistics

More information

Genetic alterations of histone lysine methyltransferases and their significance in breast cancer

Genetic alterations of histone lysine methyltransferases and their significance in breast cancer Genetic alterations of histone lysine methyltransferases and their significance in breast cancer Supplementary Materials and Methods Phylogenetic tree of the HMT superfamily The phylogeny outlined in the

More information

ChromHMM Tutorial. Jason Ernst Assistant Professor University of California, Los Angeles

ChromHMM Tutorial. Jason Ernst Assistant Professor University of California, Los Angeles ChromHMM Tutorial Jason Ernst Assistant Professor University of California, Los Angeles Talk Outline Chromatin states analysis and ChromHMM Accessing chromatin state annotations for ENCODE2 and Roadmap

More information

Supplemental Information. Genomic Characterization of Murine. Monocytes Reveals C/EBPb Transcription. Factor Dependence of Ly6C Cells

Supplemental Information. Genomic Characterization of Murine. Monocytes Reveals C/EBPb Transcription. Factor Dependence of Ly6C Cells Immunity, Volume 46 Supplemental Information Genomic Characterization of Murine Monocytes Reveals C/EBPb Transcription Factor Dependence of Ly6C Cells Alexander Mildner, Jörg Schönheit, Amir Giladi, Eyal

More information

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from Supplementary Figure 1 SEER data for male and female cancer incidence from 1975 2013. (a,b) Incidence rates of oral cavity and pharynx cancer (a) and leukemia (b) are plotted, grouped by males (blue),

More information

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training. Supplementary Figure 1 Behavioral training. a, Mazes used for behavioral training. Asterisks indicate reward location. Only some example mazes are shown (for example, right choice and not left choice maze

More information

Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD,

Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD, Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD, Crawford, GE, Ren, B (2007) Distinct and predictive chromatin

More information

Supplementary Information. Supplementary Figures

Supplementary Information. Supplementary Figures Supplementary Information Supplementary Figures.8 57 essential gene density 2 1.5 LTR insert frequency diversity DEL.5 DUP.5 INV.5 TRA 1 2 3 4 5 1 2 3 4 1 2 Supplementary Figure 1. Locations and minor

More information

Integrated network analysis reveals distinct regulatory roles of transcription factors and micrornas

Integrated network analysis reveals distinct regulatory roles of transcription factors and micrornas Integrated network analysis reveals distinct regulatory roles of transcription factors and micrornas Yu Guo 1,2,4, Katherine Alexander 1, Andrew G Clark 1, Andrew Grimson 1 and Haiyuan Yu 2,3* 1 Department

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Workflow of CDR3 sequence assembly from RNA-seq data.

Nature Genetics: doi: /ng Supplementary Figure 1. Workflow of CDR3 sequence assembly from RNA-seq data. Supplementary Figure 1 Workflow of CDR3 sequence assembly from RNA-seq data. Paired-end short-read RNA-seq data were mapped to human reference genome hg19, and unmapped reads in the TCR regions were extracted

More information

Whole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute

Whole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute Whole Genome and Transcriptome Analysis of Anaplastic Meningioma Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute Outline Anaplastic meningioma compared to other cancers Whole genomes

More information

Nature Immunology: doi: /ni Supplementary Figure 1. Transcriptional program of the TE and MP CD8 + T cell subsets.

Nature Immunology: doi: /ni Supplementary Figure 1. Transcriptional program of the TE and MP CD8 + T cell subsets. Supplementary Figure 1 Transcriptional program of the TE and MP CD8 + T cell subsets. (a) Comparison of gene expression of TE and MP CD8 + T cell subsets by microarray. Genes that are 1.5-fold upregulated

More information

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Department of Biomedical Informatics Department of Computer Science and Engineering The Ohio State University Review

More information

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies 2017 Contents Datasets... 2 Protein-protein interaction dataset... 2 Set of known PPIs... 3 Domain-domain interactions...

More information

EPIGENOMICS PROFILING SERVICES

EPIGENOMICS PROFILING SERVICES EPIGENOMICS PROFILING SERVICES Chromatin analysis DNA methylation analysis RNA-seq analysis Diagenode helps you uncover the mysteries of epigenetics PAGE 3 Integrative epigenomics analysis DNA methylation

More information

Supplementary Tables. Supplementary Figures

Supplementary Tables. Supplementary Figures Supplementary Files for Zehir, Benayed et al. Mutational Landscape of Metastatic Cancer Revealed from Prospective Clinical Sequencing of 10,000 Patients Supplementary Tables Supplementary Table 1: Sample

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Effect of HSP90 inhibition on expression of endogenous retroviruses. (a) Inducible shrna-mediated Hsp90 silencing in mouse ESCs. Immunoblots of total cell extract expressing the

More information

Supplemental Information. A Highly Sensitive and Robust Method. for Genome-wide 5hmC Profiling. of Rare Cell Populations

Supplemental Information. A Highly Sensitive and Robust Method. for Genome-wide 5hmC Profiling. of Rare Cell Populations Molecular ell, Volume 63 Supplemental Information Highly Sensitive and Robust Method for enome-wide hm Profiling of Rare ell Populations Dali Han, Xingyu Lu, lan H. Shih, Ji Nie, Qiancheng You, Meng Michelle

More information

Supplemental Figure 1. Genes showing ectopic H3K9 dimethylation in this study are DNA hypermethylated in Lister et al. study.

Supplemental Figure 1. Genes showing ectopic H3K9 dimethylation in this study are DNA hypermethylated in Lister et al. study. mc mc mc mc SUP mc mc Supplemental Figure. Genes showing ectopic HK9 dimethylation in this study are DNA hypermethylated in Lister et al. study. Representative views of genes that gain HK9m marks in their

More information

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, Supplementary Information Supplementary Figures Supplementary Figure 1. a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, gene ID and specifities are provided. Those highlighted

More information

The search for cis-regulatory driver mutations in cancer genomes

The search for cis-regulatory driver mutations in cancer genomes /, Vol. 6, No. 32 The search for cis-regulatory driver mutations in cancer genomes Rebecca C. Poulos 1, Mathew A. Sloane 1, Luke B. Hesson 1, Jason W. H. Wong 1 1 Prince of Wales Clinical School and Lowy

More information

Supplementary Figure 1. Quantile-quantile (Q-Q) plot of the log 10 p-value association results from logistic regression models for prostate cancer

Supplementary Figure 1. Quantile-quantile (Q-Q) plot of the log 10 p-value association results from logistic regression models for prostate cancer Supplementary Figure 1. Quantile-quantile (Q-Q) plot of the log 10 p-value association results from logistic regression models for prostate cancer risk in stage 1 (red) and after removing any SNPs within

More information

Lung Met 1 Lung Met 2 Lung Met Lung Met H3K4me1. Lung Met H3K27ac Primary H3K4me1

Lung Met 1 Lung Met 2 Lung Met Lung Met H3K4me1. Lung Met H3K27ac Primary H3K4me1 a Gained Met-VELs 1.5 1.5 -.5 Lung Met 1 Lung Met Lung Met 3 1. Lung Met H3K4me1 Lung Met H3K4me1 1 Lung Met H3K4me1 Lung Met H3K7ac 1.5 Lung Met H3K7ac Lung Met H3K7ac.8 Primary H3K4me1 Primary H3K7ac

More information

An epigenetic approach to understanding (and predicting?) environmental effects on gene expression

An epigenetic approach to understanding (and predicting?) environmental effects on gene expression www.collaslab.com An epigenetic approach to understanding (and predicting?) environmental effects on gene expression Philippe Collas University of Oslo Institute of Basic Medical Sciences Stem Cell Epigenetics

More information

Supplementary Materials for

Supplementary Materials for www.sciencetranslationalmedicine.org/cgi/content/full/7/283/283ra54/dc1 Supplementary Materials for Clonal status of actionable driver events and the timing of mutational processes in cancer evolution

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature10866 a b 1 2 3 4 5 6 7 Match No Match 1 2 3 4 5 6 7 Turcan et al. Supplementary Fig.1 Concepts mapping H3K27 targets in EF CBX8 targets in EF H3K27 targets in ES SUZ12 targets in ES

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Mutational signatures in BCC compared to melanoma.

Nature Genetics: doi: /ng Supplementary Figure 1. Mutational signatures in BCC compared to melanoma. Supplementary Figure 1 Mutational signatures in BCC compared to melanoma. (a) The effect of transcription-coupled repair as a function of gene expression in BCC. Tumor type specific gene expression levels

More information

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012 Elucidating Transcriptional Regulation at Multiple Scales Using High-Throughput Sequencing, Data Integration, and Computational Methods Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder

More information

Statistical analysis of RIM data (retroviral insertional mutagenesis) Bioinformatics and Statistics The Netherlands Cancer Institute Amsterdam

Statistical analysis of RIM data (retroviral insertional mutagenesis) Bioinformatics and Statistics The Netherlands Cancer Institute Amsterdam Statistical analysis of RIM data (retroviral insertional mutagenesis) Lodewyk Wessels Bioinformatics and Statistics The Netherlands Cancer Institute Amsterdam Viral integration Viral integration Viral

More information

Supplementary Information. Preferential associations between co-regulated genes reveal a. transcriptional interactome in erythroid cells

Supplementary Information. Preferential associations between co-regulated genes reveal a. transcriptional interactome in erythroid cells Supplementary Information Preferential associations between co-regulated genes reveal a transcriptional interactome in erythroid cells Stefan Schoenfelder, * Tom Sexton, * Lyubomira Chakalova, * Nathan

More information

Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022

Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022 96 APPENDIX B. Supporting Information for chapter 4 "changes in genome content generated via segregation of non-allelic homologs" Figure S1. Potential de novo CNV probes and sizes of apparently de novo

More information

Supplementary Materials for

Supplementary Materials for www.sciencesignaling.org/cgi/content/full/8/375/ra41/dc1 Supplementary Materials for Actin cytoskeletal remodeling with protrusion formation is essential for heart regeneration in Hippo-deficient mice

More information

The Epigenome Tools 2: ChIP-Seq and Data Analysis

The Epigenome Tools 2: ChIP-Seq and Data Analysis The Epigenome Tools 2: ChIP-Seq and Data Analysis Chongzhi Zang zang@virginia.edu http://zanglab.com PHS5705: Public Health Genomics March 20, 2017 1 Outline Epigenome: basics review ChIP-seq overview

More information

underlying metastasis and recurrence in HNSCC, we analyzed two groups of patients. The

underlying metastasis and recurrence in HNSCC, we analyzed two groups of patients. The Supplementary Figures Figure S1. Patient cohorts and study design. To define and interrogate the genetic alterations underlying metastasis and recurrence in HNSCC, we analyzed two groups of patients. The

More information

Sudin Bhattacharya Institute for Integrative Toxicology

Sudin Bhattacharya Institute for Integrative Toxicology Beyond the AHRE: the Role of Epigenomics in Gene Regulation by the AHR (or, Varied Applications of Computational Modeling in Toxicology and Ingredient Safety) Sudin Bhattacharya Institute for Integrative

More information

The Insulator Binding Protein CTCF Positions 20 Nucleosomes around Its Binding Sites across the Human Genome

The Insulator Binding Protein CTCF Positions 20 Nucleosomes around Its Binding Sites across the Human Genome The Insulator Binding Protein CTCF Positions 20 Nucleosomes around Its Binding Sites across the Human Genome Yutao Fu 1, Manisha Sinha 2,3, Craig L. Peterson 3, Zhiping Weng 1,4,5 * 1 Bioinformatics Program,

More information

Nature Biotechnology: doi: /nbt.1904

Nature Biotechnology: doi: /nbt.1904 Supplementary Information Comparison between assembly-based SV calls and array CGH results Genome-wide array assessment of copy number changes, such as array comparative genomic hybridization (acgh), is

More information

Analysis of the peroxisome proliferator-activated receptor-β/δ (PPARβ/δ) cistrome reveals novel co-regulatory role of ATF4

Analysis of the peroxisome proliferator-activated receptor-β/δ (PPARβ/δ) cistrome reveals novel co-regulatory role of ATF4 Khozoie et al. BMC Genomics 2012, 13:665 RESEARCH ARTICLE Open Access Analysis of the peroxisome proliferator-activated receptor-β/δ (PPARβ/δ) cistrome reveals novel co-regulatory role of ATF4 Combiz Khozoie

More information

SSM signature genes are highly expressed in residual scar tissues after preoperative radiotherapy of rectal cancer.

SSM signature genes are highly expressed in residual scar tissues after preoperative radiotherapy of rectal cancer. Supplementary Figure 1 SSM signature genes are highly expressed in residual scar tissues after preoperative radiotherapy of rectal cancer. Scatter plots comparing expression profiles of matched pretreatment

More information

Theta sequences are essential for internally generated hippocampal firing fields.

Theta sequences are essential for internally generated hippocampal firing fields. Theta sequences are essential for internally generated hippocampal firing fields. Yingxue Wang, Sandro Romani, Brian Lustig, Anthony Leonardo, Eva Pastalkova Supplementary Materials Supplementary Modeling

More information

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and Promoter Motif Analysis Shisong Ma 1,2*, Michael Snyder 3, and Savithramma P Dinesh-Kumar 2* 1 School of Life Sciences, University

More information

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells.

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells. SUPPLEMENTAL FIGURE AND TABLE LEGENDS Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells. A) Cirbp mrna expression levels in various mouse tissues collected around the clock

More information

VL Network Analysis ( ) SS2016 Week 3

VL Network Analysis ( ) SS2016 Week 3 VL Network Analysis (19401701) SS2016 Week 3 Based on slides by J Ruan (U Texas) Tim Conrad AG Medical Bioinformatics Institut für Mathematik & Informatik, Freie Universität Berlin 1 Motivation 2 Lecture

More information

Nature Genetics: doi: /ng Supplementary Figure 1

Nature Genetics: doi: /ng Supplementary Figure 1 Supplementary Figure 1 Multiple samples from five patients (P4, P8, P14, P15 and P17) with Barrett s esophagus and adjacent EAC show that the poor overlap is not a result of sampling bias. Bar graphs showing

More information

p.r623c p.p976l p.d2847fs p.t2671 p.d2847fs p.r2922w p.r2370h p.c1201y p.a868v p.s952* RING_C BP PHD Cbp HAT_KAT11

p.r623c p.p976l p.d2847fs p.t2671 p.d2847fs p.r2922w p.r2370h p.c1201y p.a868v p.s952* RING_C BP PHD Cbp HAT_KAT11 ARID2 p.r623c KMT2D p.v650fs p.p976l p.r2922w p.l1212r p.d1400h DNA binding RFX DNA binding Zinc finger KMT2C p.a51s p.d372v p.c1103* p.d2847fs p.t2671 p.d2847fs p.r4586h PHD/ RING DHHC/ PHD PHD FYR N

More information

Patterns of Histone Methylation and Chromatin Organization in Grapevine Leaf. Rachel Schwope EPIGEN May 24-27, 2016

Patterns of Histone Methylation and Chromatin Organization in Grapevine Leaf. Rachel Schwope EPIGEN May 24-27, 2016 Patterns of Histone Methylation and Chromatin Organization in Grapevine Leaf Rachel Schwope EPIGEN May 24-27, 2016 What does H3K4 methylation do? Plant of interest: Vitis vinifera Culturally important

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Phenotypic characterization of MES- and ADRN-type cells.

Nature Genetics: doi: /ng Supplementary Figure 1. Phenotypic characterization of MES- and ADRN-type cells. Supplementary Figure 1 Phenotypic characterization of MES- and ADRN-type cells. (a) Bright-field images showing cellular morphology of MES-type (691-MES, 700-MES, 717-MES) and ADRN-type (691-ADRN, 700-

More information

Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first

Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first intron IGLL5 mutation depicting biallelic mutations. Red arrows highlight the presence of out of phase

More information

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing Last update: 05/10/2017 MODULE 4: SPLICING Lesson Plan: Title MEG LAAKSO Removal of introns from messenger RNA by splicing Objectives Identify splice donor and acceptor sites that are best supported by

More information

Supplementary information

Supplementary information Supplementary information High fat diet-induced changes of mouse hepatic transcription and enhancer activity can be reversed by subsequent weight loss Majken Siersbæk, Lyuba Varticovski, Shutong Yang,

More information

The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0

The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0 The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0 Introduction Loss of erozygosity (LOH) represents the loss of allelic differences. The SNP markers on the SNP Array 6.0 can be used

More information

High Throughput Sequence (HTS) data analysis. Lei Zhou

High Throughput Sequence (HTS) data analysis. Lei Zhou High Throughput Sequence (HTS) data analysis Lei Zhou (leizhou@ufl.edu) High Throughput Sequence (HTS) data analysis 1. Representation of HTS data. 2. Visualization of HTS data. 3. Discovering genomic

More information

Journal: Nature Methods

Journal: Nature Methods Journal: Nature Methods Article Title: Network-based stratification of tumor mutations Corresponding Author: Trey Ideker Supplementary Item Supplementary Figure 1 Supplementary Figure 2 Supplementary Figure

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Fig 1. Comparison of sub-samples on the first two principal components of genetic variation. TheBritishsampleisplottedwithredpoints.The sub-samples of the diverse sample

More information

Nature Genetics: doi: /ng Supplementary Figure 1. HOX fusions enhance self-renewal capacity.

Nature Genetics: doi: /ng Supplementary Figure 1. HOX fusions enhance self-renewal capacity. Supplementary Figure 1 HOX fusions enhance self-renewal capacity. Mouse bone marrow was transduced with a retrovirus carrying one of three HOX fusion genes or the empty mcherry reporter construct as described

More information

The common colorectal cancer predisposition SNP rs at chromosome 8q24 confers potential to enhanced Wnt signaling

The common colorectal cancer predisposition SNP rs at chromosome 8q24 confers potential to enhanced Wnt signaling SUPPLEMENTARY INFORMATION The common colorectal cancer predisposition SNP rs6983267 at chromosome 8q24 confers potential to enhanced Wnt signaling Sari Tuupanen 1, Mikko Turunen 2, Rainer Lehtonen 1, Outi

More information

Nature Genetics: doi: /ng Supplementary Figure 1

Nature Genetics: doi: /ng Supplementary Figure 1 Supplementary Figure 1 Illustrative example of ptdt using height The expected value of a child s polygenic risk score (PRS) for a trait is the average of maternal and paternal PRS values. For example,

More information

The corrected Figure S1J is shown below. The text changes are as follows, with additions in bold and deletions in bracketed italics:

The corrected Figure S1J is shown below. The text changes are as follows, with additions in bold and deletions in bracketed italics: Correction H3K4me3 readth Is Linked to Cell Identity and Transcriptional Consistency érénice A. enayoun, Elizabeth A. Pollina, Duygu Ucar, Salah Mahmoudi, Kalpana Karra, Edith D. Wong, Keerthana Devarajan,

More information

Nature Genetics: doi: /ng Supplementary Figure 1

Nature Genetics: doi: /ng Supplementary Figure 1 Supplementary Figure 1 Replicability of blood eqtl effects in ileal biopsies from the RISK study. eqtls detected in the vicinity of SNPs associated with IBD tend to show concordant effect size and direction

More information

A prostate cancer susceptibility allele at 6q22 increases RFX6 expression by modulating HOXB13 chromatin binding

A prostate cancer susceptibility allele at 6q22 increases RFX6 expression by modulating HOXB13 chromatin binding Supplementary Information A prostate cancer susceptibility allele at 6q22 increases RFX6 expression by modulating HOXB13 chromatin binding Qilai Huang 1,2*, Thomas Whitington 3,4*, Ping Gao 1,2, Johan

More information

Supplementary Figure 1. Spitzoid Melanoma with PPFIBP1-MET fusion. (a) Histopathology (4x) shows a domed papule with melanocytes extending into the

Supplementary Figure 1. Spitzoid Melanoma with PPFIBP1-MET fusion. (a) Histopathology (4x) shows a domed papule with melanocytes extending into the Supplementary Figure 1. Spitzoid Melanoma with PPFIBP1-MET fusion. (a) Histopathology (4x) shows a domed papule with melanocytes extending into the deep dermis. (b) The melanocytes demonstrate abundant

More information

Hands-On Ten The BRCA1 Gene and Protein

Hands-On Ten The BRCA1 Gene and Protein Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such

More information

Nature Medicine: doi: /nm.4439

Nature Medicine: doi: /nm.4439 Figure S1. Overview of the variant calling and verification process. This figure expands on Fig. 1c with details of verified variants identification in 547 additional validation samples. Somatic variants

More information