Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from

Supplementary Figure 1 SEER data for male and female cancer incidence from 1975–2013. (a,b) Incidence rates of oral cavity and pharynx cancer (a) and leukemia (b) are plotted, grouped by males (blue), females (green), or all patients (black).

Supplementary Figure 2 Male and female mutation data across cancer types. (a) Ratio of female:male (F:M) coding mutations across all 4,126 cancers with exome sequencing by chromosome. The F:M ratio for the X chromosome is approximately double that for autosomes, as expected because females have two X chromosomes. On average, there are more mutations in male cancers than in female cancers across all chromosomes. This finding makes our approach to discover male-cancer-biased mutations on the X chromosome even more stringent because, to be enriched, they must be present at an M:F ratio that statistically exceeds the already male-biased background rate of mutation. (b) Female (red) and male (blue) somatic mutation rates (per Mb of DNA) by cancer type. Each dot represents one cancer sample. The horizontal bar represents the median number of mutations in female or male cancers. Numbers across the top of the graph are the number of tumor–normal pairs in each disease. Disease acronym definitions are provided in Supplementary Table 1.

Supplementary Figure 3 Defining copy number loss events on the X chromosome. (a) The M:F ratio of copy number loss events (y axis) in a sliding window of different lengths over the genome (x axis). The blue line represents the cutoff chosen to define focal events (Online Methods). (b) Distribution of the log2(copy number) in tumor as compared to paired normal samples by genomic site on the X chromosome in males (top) and females (bottom). The red line represents the cutoff below which focal copy number loss was called. (c) Distribution of estimated X-chromosome ploidy across 10,844 tumors in the Broad Institute TCGA/GDAC data set (http://gdac.broadinstitute.org/) as determined by SNP array (left). The red line represents the cutoff used to infer loss of the whole X chromosome in female tumors in our data set. Estimated X-chromosome copy numbers for female and male tumors in our data set with concurrent LOF or LOF/CN loss mutations are shown on the right.
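
The two cutoffs described in panels a and b can be read as a simple two-condition rule: a segment counts as a focal loss when its log2 tumor:normal copy ratio falls below one cutoff and its length stays within another. The Python sketch below illustrates that reading only. The function names and the numeric cutoffs in the usage lines are hypothetical placeholders; the actual values are given in the Online Methods, not in this legend.

```python
import math


def log2_copy_ratio(tumor_signal, normal_signal):
    """Return log2(tumor / normal) for one genomic segment."""
    return math.log2(tumor_signal / normal_signal)


def is_focal_loss(tumor_signal, normal_signal, segment_length_bp,
                  log2_cutoff, max_focal_length_bp):
    """Call a focal copy number loss.

    Both cutoffs must be supplied by the caller: this sketch deliberately
    has no defaults, because the legend does not state the values used.
    """
    ratio = log2_copy_ratio(tumor_signal, normal_signal)
    return ratio < log2_cutoff and segment_length_bp <= max_focal_length_bp


# Hypothetical placeholder cutoffs, for illustration only.
short_segment = is_focal_loss(1.0, 2.0, 500_000,
                              log2_cutoff=-0.5, max_focal_length_bp=1_000_000)
long_segment = is_focal_loss(1.0, 2.0, 5_000_000,
                             log2_cutoff=-0.5, max_focal_length_bp=1_000_000)
```

Here the first segment is short and has a log2 ratio of -1, so it is called as a focal loss; the second has the same ratio but exceeds the length cutoff, so it is not.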

Supplementary Figure 4 Genes with higher frequencies of loss-of-function mutations in male cancers identified by a log-likelihood ratio test. (a–c) Log-likelihood ratios for genes on the X chromosome across all 4,126 cancers (a), in clear cell kidney cancer (KIRC) (b), and in lower-grade glioma (LGG) (c) are shown. The log2(M:F ratio) of patients with mutations is plotted against the significance value. The size and color of each circle represent the number of patients with loss-of-function mutations for each gene. Genes with FDR < 0.1 for male-cancer-biased mutation are identified. ATRX is also shown, as it was identified as more frequent in male cancers by the permutation analysis (Table 1 and Fig. 2a).
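
For readers unfamiliar with this kind of test, the Python sketch below shows a generic binomial log-likelihood ratio test for a difference in mutation frequency between sexes, followed by Benjamini–Hochberg adjustment. It is not the authors' implementation: the model (one shared mutation frequency versus separate male and female frequencies, two-sided, chi-square with 1 degree of freedom) and all function names are assumptions made for illustration. The legend describes a test for male bias against the background mutation rate, which this sketch does not model.

```python
import math


def binom_loglik(k, n, p):
    """Binomial log-likelihood without the constant binomial coefficient."""
    ll = 0.0
    if k > 0:
        ll += k * math.log(p)
    if n - k > 0:
        ll += (n - k) * math.log(1.0 - p)
    return ll


def lrt_sex_bias(k_male, n_male, k_female, n_female):
    """Return (statistic, p_value) for H0: equal mutation frequency in both sexes."""
    p_null = (k_male + k_female) / (n_male + n_female)
    ll_null = (binom_loglik(k_male, n_male, p_null)
               + binom_loglik(k_female, n_female, p_null))
    ll_alt = (binom_loglik(k_male, n_male, k_male / n_male)
              + binom_loglik(k_female, n_female, k_female / n_female))
    stat = max(0.0, 2.0 * (ll_alt - ll_null))
    # Survival function of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):  # walk from the largest p-value down
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Made-up counts: 30 of 100 male vs 5 of 100 female tumors carry a mutation.
stat, p = lrt_sex_bias(30, 100, 5, 100)
```

Genes would then be reported when their adjusted value falls below the chosen FDR threshold (0.1 in the legend above).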

Supplementary Figure 5 Male-biased mutations are more frequent among X-chromosome genes that escape X-inactivation or have functional Y-chromosome homologs. (a) M:F ratio (left; P = 0.022 by t test; each bar represents the median; +, mean; box, interquartile range; whiskers, 10th–90th percentile) and scatterplot (right; P = 0.011 by Kolmogorov–Smirnov (K–S) test; each point represents the number of males versus the number of females with a mutation in that gene; the line represents linear regression fit) of putative loss-of-function SNV, indel, or copy number loss for 46 mutated X-chromosome genes that escape X-inactivation (Escape) and the remaining 595 mutated X-chromosome genes (Non-Escape). (b) M:F ratio (left; P = 0.058 by t test; plot as in a) and scatterplot (right; P = 0.068 by K–S test) of putative LOF mutations as above for X-chromosome genes that do (Y homolog; n = 17) or do not (Non-Y homolog; n = 624) have predicted functional Y-chromosome homologs.

Supplementary Figure 6 Determination of Y-chromosome loss from exome sequencing data. The relative number of reads between tumor and normal samples for females (red) and males (blue) on the Y chromosome is shown. The large blue dots represent samples called as having loss of Y chromosome, on the basis of having <25% of the mean number of reads among males.
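
The calling rule in this legend is a single threshold. The Python sketch below assumes that each male sample has one relative (tumor:normal) Y-chromosome read value and that the mean is taken over all male samples; that reading, the function name, and the example values are assumptions rather than details given in the legend.

```python
from statistics import mean


def call_y_loss(male_y_read_ratios, threshold_fraction=0.25):
    """Flag male samples whose relative Y read count is below
    threshold_fraction (25% by default) of the mean across males."""
    cutoff = threshold_fraction * mean(male_y_read_ratios.values())
    return {sample: ratio < cutoff
            for sample, ratio in male_y_read_ratios.items()}


# Made-up relative read counts for four male samples.
ratios = {"M1": 1.0, "M2": 0.9, "M3": 1.1, "M4": 0.1}
calls = call_y_loss(ratios)
```

With these values the male mean is 0.775, the cutoff is about 0.194, and only M4 is called as having lost the Y chromosome.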

Supplementary Figure 7 Allele-specific expression of EXITS genes in TCGA tumors. Each dot represents the average minor allele expression fraction from RNA-seq at heterozygous SNP sites detected in exome sequencing in known non-escape genes on the X chromosome or EXITS genes, in tumors without mutations in the indicated gene (red cross, mean; green square, median). Non-escape genes are the non-PAR X-chromosome genes not defined as stringent escape genes in Supplementary Table 8. Tumor types are as follows: GBM, glioblastoma; LGG, lower-grade glioma; HNSC, head and neck squamous cell carcinoma; KIRC, clear cell kidney cancer; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma. Violin plots were generated using the distributionplot.m function in MATLAB with default parameters.
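
The quantity behind each dot can be sketched as follows, assuming that RNA-seq read counts for the two alleles are available at every heterozygous SNP site and that sites are averaged with equal weight. The function names and the read counts are hypothetical; the legend does not specify how sites were weighted or filtered.

```python
def minor_allele_fraction(ref_reads, alt_reads):
    """Fraction of RNA-seq reads carrying the less-expressed allele at one site."""
    total = ref_reads + alt_reads
    if total == 0:
        raise ValueError("site has no RNA-seq coverage")
    return min(ref_reads, alt_reads) / total


def mean_minor_allele_fraction(sites):
    """Unweighted mean over (ref_reads, alt_reads) pairs at heterozygous SNP sites."""
    fractions = [minor_allele_fraction(ref, alt) for ref, alt in sites]
    return sum(fractions) / len(fractions)


# Made-up read counts: one balanced site and one skewed site.
example = mean_minor_allele_fraction([(50, 50), (90, 10)])
```

A value near 0.5 indicates roughly equal expression from both alleles, as expected for a gene that escapes X-inactivation, whereas a value near 0 indicates expression from essentially one allele.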

Supplementary Figure 8 Comparison of female and male expression levels of EXITS genes in non-mutated tumors. Female and male expression of ATRX, CNKSR2, DDX3X, KDM5C, and KDM6A in head and neck squamous cell carcinoma (HNSC), clear cell kidney cancer (KIRC), lung adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC), glioblastoma (GBM), and lower-grade glioma (LGG) are shown. Analysis is limited to tumors without mutations in the indicated genes. Each bar represents the median; box, interquartile range; whiskers, 10th–90th percentile. P values were calculated by K–S test; comparisons with P < 0.05 are shaded in blue.

Supplementary Figure 9 Allele-specific expression of EXITS genes in female GTEx normal tissues. Each dot represents the average minor allele expression fraction from RNA-seq at heterozygous SNP sites detected in exome sequencing in known non-escape genes on the X chromosome or EXITS genes, in a GTEx biopsy of the indicated normal tissue (red cross, mean; green square, median). Violin plots were generated using the distributionplot.m function in MATLAB with default parameters.

Supplementary Figure 10 ATRX expression in male and female tissues measured by RNA sequencing. (a) Expression of ATRX in normal male and female peripheral blood and brain in the GTEx data set is shown (distributions compared by Kolmogorov–Smirnov test). (b) Holzmann–Vollmer likelihood test of restricted (yellow) and unrestricted (transparent gray) normal mixture fits for males (left) and females (right). Top, maximum-likelihood fits with equal variance of the component mixtures; bottom, maximum-likelihood fits with unequal variance of the component mixtures. The P value of the Holzmann–Vollmer likelihood-ratio test for bimodality is indicated. (c) Wilcoxon rank-sum test P values for the difference in EXITS gene expression between males and females in the indicated tissues in the GTEx data set.

Supplementary Figure 11 Loss of Cnksr2 is associated with oncogenic phenotypes. (a) Unsupervised hierarchical clustering of Spearman rank correlations across nine biologically independent cultures of NIH 3T3 cells stably expressing lentiviruses with the indicated shRNAs. Red–blue color represents high–low correlation, as shown in the legend. shCnksr2-1 and shCnksr2-2 are independent shRNAs, each represented in three clones. shControl is a non-targeting shRNA (targeting RFP, a control gene not present in these cells). (b) Principal-component analysis of expression profiles in the same cells as in a. (c) Network enrichment map of GSEA results querying the C2:CGP database of MSigDB, showing the interconnected cluster with the largest number of gene set members. Red circles are gene sets enriched in shCnksr2 cells and are labeled; blue circles are gene sets enriched in shControl cells. The size of each circle represents the number of genes in the gene set, and lines connect gene sets with overlapping members. (d) GSEA results querying the H1:Hallmarks database of MSigDB showing enrichment of MTORC1 and KRAS signaling pathways in shCnksr2 cells. ES, enrichment score. (e) Immunoblotting for phospho-ERK, total ERK, or tubulin in NIH 3T3 cells stably expressing the indicated shRNAs. (f) Automated counts of colonies per field of NIH 3T3 cells stably expressing the indicated shRNAs grown in soft agar (each graph represents four fields counted in two independently transfected wells; *P < 0.05 by t test as compared to shRFP; error bars, s.e.m.). (g) Representative bright-field photomicrographs of the soft agar colony assays quantified in f.