Supplementary Figure 1. Spectrum and signatures of substitutions.

Similar documents
Characterisation of structural variation in breast. cancer genomes using paired-end sequencing on. the Illumina Genome Analyser

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes.

BWA alignment to reference transcriptome and genome. Convert transcriptome mappings back to genome space

NGS in tissue and liquid biopsy

MRC-Holland MLPA. Description version 06; 23 December 2016

SUPPLEMENTARY INFORMATION

Plasma-Seq conducted with blood from male individuals without cancer.

MSI positive MSI negative

Cancer. Questions about cancer. What is cancer? What causes unregulated cell growth? What regulates cell growth? What causes DNA damage?

Chapter 16 Mutations. Practice Questions:

Supplementary Figure 1. Estimation of tumour content

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

MRC-Holland MLPA. Description version 18; 09 September 2015

MRC-Holland MLPA. Related SALSA MLPA probemixes P190 CHEK2: Breast cancer susceptibility, genes included: CHEK2, ATM, PTEN, TP53.

Next generation diagnostics Bringing high-throughput sequencing into clinical application

Whole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute

Chromothripsis: A New Mechanism For Tumorigenesis? i Fellow s Conference Cheryl Carlson 6/10/2011

Molecular Genetics of Paediatric Tumours. Gino Somers MBBS, BMedSci, PhD, FRCPA Pathologist-in-Chief Hospital for Sick Children, Toronto, ON, CANADA

Using the Bravo Liquid-Handling System for Next Generation Sequencing Sample Prep

Germline Genetic Testing for Breast Cancer Risk

Tumor mutational burden and its transition towards the clinic

Supplementary Figure 1

Introduction to Genetics

Molecular mechanisms of human carcinogenesis

Nucleic Acid Testing - Oncology. Molecular Diagnosis. Gain/Loss of Nucleic Acid. Objectives. MYCN and Neuroblastoma. Molecular Diagnosis

Molecular Diagnosis. Nucleic acid based testing in Oncology

Transform genomic data into real-life results

Structural Variation and Medical Genomics

Global variation in copy number in the human genome

CNV Detection and Interpretation in Genomic Data

SALSA MLPA probemix P315-B1 EGFR

Nature Biotechnology: doi: /nbt.1904

Genetics and Genomics in Medicine Chapter 6 Questions

Genomic Instability. Kent Nastiuk, PhD Dept. Cancer Genetics Roswell Park Cancer Institute. RPN-530 Oncology for Scientist-I October 18, 2016

AllinaHealthSystems 1

SALSA MLPA KIT P060-B2 SMA

COSMIC - Catalogue of Somatic Mutations in Cancer

6.3 DNA Mutations. SBI4U Ms. Ho-Lau

CDH1 truncating alterations were detected in all six plasmacytoid-variant bladder tumors analyzed by whole-exome sequencing.

CRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery

Supplementary Figure 1: Classification scheme for non-synonymous and nonsense germline MC1R variants. The common variants with previously established

Supplementary results

MRC-Holland MLPA. Description version 30; 06 June 2017

Package ClusteredMutations

MRC-Holland MLPA. Description version 19;

Supplementary Figure 1 Overall study design

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

MRC-Holland MLPA. Description version 29; 31 July 2015

New: P077 BRCA2. This new probemix can be used to confirm results obtained with P045 BRCA2 probemix.

609G: Concepts of Cancer Genetics and Treatments (3 credits)

Cancer Genomics. Nic Waddell. Winter School in Mathematical and Computational Biology. July th

Supplementary Figure 1: Comparison of acgh-based and expression-based CNA analysis of tumors from breast cancer GEMMs.

Genomic tests to personalize therapy of metastatic breast cancers. Fabrice ANDRE Gustave Roussy Villejuif, France

DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK

Prevalence and clinical implications of BRCA1/2 germline mutations in Chinese women with breast cancer Yuntao Xie M.D., Ph.D.

SALSA MLPA probemix P175-A3 Tumour Gain Lot A3-0714: As compared to the previous version A2 (lot A2-0411), nine probes have a small change in length.

Early Embryonic Development

Secuenciación masiva: papel en la toma de decisiones

The mutations that drive cancer. Paul Edwards. Department of Pathology and Cancer Research UK Cambridge Institute, University of Cambridge

Genomic Medicine: What every pathologist needs to know

Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first

Assessing Your Patient s Breast Cancer Risk: Is Genetic Testing Necessary?

Genomic structural variation

Variations in Chromosome Structure & Function. Ch. 8

Single-strand DNA library preparation improves sequencing of formalin-fixed and paraffin-embedded (FFPE) cancer DNA

WHAT IS A GENE? CHROMOSOME DNA PROTEIN. A gene is made up of DNA. It carries instructions to make proteins.

Cancer. October is National Breast Cancer Awareness Month

EXAMPLE. - Potentially responsive to PI3K/mTOR and MEK combination therapy or mtor/mek and PKC combination therapy. ratio (%)

Next Generation Sequencing in Clinical Practice: Impact on Therapeutic Decision Making

COMPUTATIONAL OPTIMISATION OF TARGETED DNA SEQUENCING FOR CANCER DETECTION

oncogenes-and- tumour-suppressor-genes)

5 th July 2016 ACGS Dr Michelle Wood Laboratory Genetics, Cardiff

What All of Us Should Know About Cancer and Genetics

Klinisch belang van chromosomale translocatie detectie in sarcomen

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction

NEOPLASIA. 3. Which of the following tumour is benign a. Chondrosarcoma b. Osteochondroma c. Chondroblastoma d. Ewing s tumour e.

Virtual Journal Club: Front-Line Therapy and Beyond Recent Perspectives on ALK-Positive Non-Small Cell Lung Cancer.

SUPPLEMENTARY INFORMATION

Germline Testing for Hereditary Cancer with Multigene Panel

Nature Genetics: doi: /ng.2995

Inhibidores de PARP Una realidad? dónde y cuando?

Supplementary Figure 1

SALSA MLPA Probemix P014-B1 Chromosome 8 Lot B and B

Supplementary Figure 1. Spitzoid Melanoma with PPFIBP1-MET fusion. (a) Histopathology (4x) shows a domed papule with melanocytes extending into the

p.r623c p.p976l p.d2847fs p.t2671 p.d2847fs p.r2922w p.r2370h p.c1201y p.a868v p.s952* RING_C BP PHD Cbp HAT_KAT11

Molecular classification of breast cancer implications for pathologists. Sarah E Pinder

MEDICAL GENOMICS LABORATORY. Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG)

Introduction to Cancer Bioinformatics and cancer biology. Anthony Gitter Cancer Bioinformatics (BMI 826/CS 838) January 20, 2015

Chapter 9. Cells Grow and Reproduce

ESMO DOUBLE-HIT LYMPHOMAS

Case 1. ACCME/Disclosure. Clinical History. Dr. Mulligan has nothing to disclose

Carcinosarcoma Trial rial in s a in rare malign rare mali ancy

Nature Genetics: doi: /ng Supplementary Figure 1. Somatic coding mutations identified by WES/WGS for 83 ATL cases.

Figure S4. 15 Mets Whole Exome. 5 Primary Tumors Cancer Panel and WES. Next Generation Sequencing

WHAT IS A GENE? CHROMOSOME DNA PROTEIN. A gene is made up of DNA. It carries instructions to make proteins.

of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed.

Haematology Probes for Multiple Myeloma

6/8/17. Genetics 101. Professor, College of Medicine. President & Chief Medical Officer. Hereditary Breast and Ovarian Cancer 2017

Transcription:

Supplementary Figure 1. Spectrum and signatures of substitutions. a. Heatmaps of trinucleotide context of substitutions. Each square represents a substitution in a specific trinucleotide context, normalised against the prevalence of that trinucleotide in the human genome. b. Spectrum of substitutions. Contribution (%) of the six substitution classes to overall substitution burden. c. Contribution of signatures to individual tumours. Signatures nomenclature as per Alexandrov et al. (Nature 2013). Note that signature 30 is a novel signature, characterised by C>T mutations occuring mainly 5 of cyotinse or adenine (CpM). d. Genome wide substitution and indel burden.

Supplementary Figure 2. Analyses of indel size, context, and motifs. a. Indel size distribution in radiation-associated tumours. Clusters refer to results from a statistical comparison of the size distribution of radiation-associated tumours with non-radiation tumours including BRCA1 or BRCA2 germline deficient breast tumours (see methods). b. Indel motifs. Motifs of indel sequences were analysed as described in methods. Shown here are the frequencies of indels with discernible motifs. c. Comparison of deleted sequence with flanking 3 sequence. Left: Proportion of deletions that exhibit microhomology with 3 flanking sequence, or are repeats of the 3 flanking sequence, or neither. Right: Length of microhomology sequences in comparison to chance distribution. Differences are significant (p=2.2e-16, Kolmogorov Smirnov test). d. One base pair indels that occur in homonymous mononucleotide repeat tracks. e. Comparison of motifs in sequence on either side of indel. The frequency of each combination of motifs was measured. 0: no discernible motif; I: mononucleotide repeat; II: dinucleotide repeat; III: trinucleotide repeat; IV: tetranucleotide repeat.

Supplementary Figure 3. PCR gels validating balanced inversions. PCRs were performed in tumour and normal tissue DNA with primers placed on either side of the breakpoint of the rearrangement in question. T: Tumour DNA. N: Normal tissue DNA. Numbers refer to numbering of complete inversions as per Supplementary Table 6. Head: head end of inversion. Tail: tail end of inversion.

Supplementary Table 1. Overview of 12 radiation-associated tumours. Angio Spindle cell Osteo Breast cancer Tumour PD7188a PD7192a PD7530a PD7189a PD7191a PD7190a PD9056a PD9972a PD13489a PD8618a PD8622a PD8623a Substitutions 6318 6362 1750 7147 3540 9245 4685 5459 2785 1506 1809 2020 Kataegis* - - - - Yes Yes Yes - - - - - Indels 447 555 301 943 425 545 606 350 433 135 349 296 Rearrangements 108 71 49 51 177 321 180 76 156 6 59 31 Chromothripsis* - - - - Yes Yes Yes - Yes - - - Driver mutations TP53 Common CDKN2A cancer genes PTPRB Common PLCG1 angio cancer genes MYC PIK3CA Common CDH1 breast cancer genes FOXA1 CHEK2 FGFR1 KDM6A NF1 CASP8 ATRX PTEN Others Homozygous deletion Essential splice site substitution Deletion rearrangement # Essential splice site indel Inversion rearrangement ## Germline Missense Amplification missense Nonsense *Assessed visually # Rearrangement deletions deleted segments are < 1 Mb in size ## Amplicons minimum number of copies: 9 in tetraploid genomes; 5 in diploid genomes

Supplementary Table 2. Prostate tumour series. Case (ID used in this report) ID in Gundem et al. (reference 5) Type Inversions Substitution burden (truncal mutations) PD11328 A10 No radiation 1 4555 894 PD11329 A22 No radiation 1 3795 764 PD11333 A12 No radiation 1 2562 532 PD12337 A21 No radiation 1 4277 669 PD11331 A31 Late 1* 2056 580 PD11330 A29 Radiation 3 5865 963 PD11332 A32 Radiation 6 4440 1224 PD11334 A24 Radiation 1 3256 1373 PD11335 A34 Radiation 4 3344 720 PD13412 A17 Radiation 0 9872 1537 *Inversion was found in primary tumour only and was absent from metastases Indel burden (truncal mutations)

Supplementary Table 3: Genomic properties in hg19 Property Note Metric Source Ref Centromere UCSC genome doi:10.10 Telomere Same arm Distance to (log 10) browser hg19 gap 93/nar/gk track t1168 GC proportion of known GC content nucleotides Value in 100bp bins hg19 FASTA across genome Trinucleotide complexity Chromatin state Replication timing Genes Sum of 64 trinucleotide proportions squared. High value = low complexity. E129 (osteoblasts) for osteos, consensus across all 127 cell types for prostate (no prostate in ROADMAP) Average across NHEK (normal skin), GM12878 (normal blood), IMR90 (normal lung). High value = early replicating Known protein-coding genes One of 25 chromatin segmentation states Average waveletsmoothed signal in 1kb bins across genome ROADMAP UCSC genome browser UW Repli- Seq tracks http://genome.ucsc. edu/cgibin/hgfileui?db=hg1 9&g=wgEncodeUwR epliseq GENCODE v19 doi:10.10 38/nature 14248 doi:10.10 73/pnas. 0912402 107 doi:10.11 01/gr.135 350.111 LADs Tig3ET normal human embryonic lung fibroblasts Lamina associated domains from Guelen et al. doi:10.10 38/nature 06947 CpG islands 200+ bp with >50% GC content and more CG dinucleotides than expected (excluding repeat regions) UCSC genome browser CpG islands track doi:10.10 93/nar/gk t1168 Direct repeats 10 300nt repeated directly one or more times 0 10nt away G-quadruplexes Cruciform inverted repeat proximity Triplex mirror repeat proximity Short tandem repeat proximity Z-DNA motif proximity ALU repeat MIR repeat L1 repeat L2 repeat LTR repeat DNA repeat Simple repeat Four or more runs of three or more Gs, with 1 7nt spacers Repeats of 6 or more nt repeated inversely up to 4nt away Repeats of 10 or more nt with 90% of one strand made of pyrimidines and repeated as a mirror up to 8nt away 1 9nt repeated perfectly three or more times with no spacers Alternating purine-pyrimidine tracts of 10 or more nt, excluding AT/TA dinucleotide repeats microsatellites Is position in or near (upto 100bp away) this feature? (Yes/No) Non-B DNA database v2 Repeatmasker 20140131 library doi:10.10 93/nar/gk s955 http://ww w.repeat masker.o rg/genom es/hg19/ RepeatM asker- rm405- db20140 131/

Supplementary Note 1. Description and discussion of cases. Clinical details of each of the twelve cases included in this experiment were provided by collaborating pathologists and are summarised in the table below. The cohort comprised four osteos, three angios, three breast tumours and two spindle cell s. The latency period from primary tumour to secondary tumour ranged from 4 to 40 years (median 11 years). Note that one case, PD13489a, harboured a pathogenic germline mutation in TP53. The presence of pathogenic TP53 germline mutations was excluded from the other cases by analysis of sequencing reads from normal tissue DNA (see Methods). In addition, we excluded the presence of pathogenic germline mutations in BRCA1 and BRCA2 from the study cohort, because there was some similarity between radiation-associated tumours and breast tumours arising in patients with BRCA1 or BRCA2 germline mutations. Radiation-induced cancer has been defined as a malignancy which, following a latency period, forms in a field of radiation and is of a different tumour type than the original disease 1,2. A key challenge in studying radiation-associated tumours is to substantiate the diagnosis with appropriate biomarkers. Thus, it is not possible to definitively ascertain whether the tumours we studied are radiation-associated. However, for the reasons discussed below, the cases we included can reasonably be regarded as radiation-associated second malignancies. An alternative explanation for the presence of a common signature in four distinct cancer types could include a germline predisposing genotype. However, a common genotype would be unlikely to present across our diverse cohort of s and epithelial cancers (breast; prostate). Angios PD7188a; PD7530a; PD7192a. Secondary angios are defined by the presence of MYC amplification. All three cases under investigation harboured MYC amplification, and arose in a field of therapeutic radiation. Therefore, it is reasonable to consider these tumours to be radiation-associated. However, secondary angio with MYC amplification can also be caused by chronic lymphoedema, which in turn can be caused by ionising radiation and by surgery. Thus, in cases of secondary angio it is impossible distinguish with certainty between the effects of radiation, lymphoedema, and surgery. Osteos - PD7190a; PD9056a; PD9972a; PD13489a. Osteo is a rare malignancy of bone. Development of osteo represents a well-known complication of radiotherapy, especially in patients with certain cancer-predisposing germline mutations. The four cases that we have studied can reasonably be regarded to be radiation-associated second tumours. However, one can speculate that PD7190a represents a recurrence of the original tumour, giant cell tumour of bone, as the two tumours are morphologically similar. However, this possibility can be excluded as PD7190a lacked the pathognomonic H3F3A G34W mutation of giant cell tumour of bone 5.

Radiations-associated second malignancies with clinical details Note that all tumours arose in fields of ionising radiation. ID Sex Age at diagnosis Secondary tumour Primary tumour Latency (Yrs) Comments PD7188a M 19 Angio Spindle cell PD7192a F 61 Angio Breast cancer 4 PD7530a F 38 Angio Breast cancer 5 PD7190a M 25 Osteo PD9056a M N/A Osteo Giant cell tumour of bone Squamous cell carcinoma of tonsil 7 Anatomical site > thigh 28 Patient underwent wide local excision Patient underwent wide local excision with axillary clearance Osteo lacks H3F3A G34W mutation of giant cell tumour of bone N/A - PD9972a F 60 Osteo Ovarian carcinoma 40 - PD13489a F 20 Osteo PD7189a M 19 PD7191a F 60 PD8618a F 37 PD8622a F 17 PD8623a F 38 Spindle cell Spindle cell Breast cancer (hormone receptor positive) Breast cancer (hormone receptor positive) Breast cancer (hormone receptor positive) Embryonal rhabdomyo 9 Synovial 5 Breast cancer 16 Hodgkin s lymphoma Hodgkin s lymphoma Hodgkin s lymphoma 11 14 40 Patient has germline TP53 mutation Spindle cell lacks the t(18:x) translocation of synovial Case has been reviewed to ascertain that it is not a toid carcinoma Patient received mantle radiotherapy Patient received mantle radiotherapy Patient received mantle radiotherapy Spindle cell s PD7189a; PD7191a. Primary spindle cell is also an exceedingly rare tumour which decreases the odds of these tumours arising spontaneously in a field of ionising radiation. In case PD7189a, the primary and secondary tumour were synovial and spindle cell, respectively, which are related entities. The pathognomonic marker of synovial 6, a t(18:x) translocation, was absent from the spindle cell, PD7189a, indicating that this tumour is not a synovial and thus represents a de novo secondary tumour. PD7191a arose in the field of radiation for a previous breast cancer. In rare cases, a breast carcinoma may be mistaken as a spindle cell when morphologically it becomes toid. We specifically requested expert review of this tumour which concluded that the appearance was typical of a spindle cell and incompatible with a breast cancer.

Breast cancers - PD8618a, PD8622a, PD8623a. These case arose following mantle radiotherapy for Hodgkin s lymphoma. We chose these cases because mantle radiotherapy is a well-established cause of secondary breast cancer. In fact, it has been abandoned as a treatment of Hodgkin s disease. One could still argue that the tumours have arisen spontaneously, given the high incidence of spontaneous breast cancer. However, consideration of the attributable risk percentage (AR% = (relative risk 1) / relative risk) suggests that most cases of breast cancer following radiotherapy for lymphoma can be attributed to radiotherapy (assuming a relative risk of 15-25 for developing breast cancer after radiotherapy for Hodgkin s disease 7 ).

Supplementary Note 2. Comparison of radiation-exposed prostate tumours and radiationassociated second malignancies. The biological context of radiation-associated second malignancies and radiation-exposed prostate lesions differs in two ways, although fundamental similarities remain. First, the irradiated cells in prostate cancer are cancer cells as opposed to presumably normal cells in cases of second malignancies. The effects of ionising radiation may manifest differently in a cancer cell compared to a normal cell. However, one could also argue that the effects of ionising radiation on DNA are independent of whether the recipient cell is cancerous or normal. Our finding that deletions in radiationassociated tumours occur independent of sequence context and higher order chromatin structure would support this latter view. Second, one may argue that the readout of the effects of ionising radiation in radiation-exposed tumours is more acute than in radiation-associated second malignancies. A latency period of many years may lie between irradiation and the development of a second tumour in a field of radiation. It is possible that during this period the genomic effects of ionising radiation are altered or become buried amongst new mutations. Our data that the deletion signature is stronger amongst early mutations compared to late mutations indicates that burying has taken place. However, importantly the signature remains apparent when looking at all mutations together, early and late.

Supplementary References 1. Cahan, W.G., Woodard, H.Q. & et al. Sarcoma arising in irradiated bone; report of 11 cases. Cancer 1, 3 29 (1948). 2. Allan, J.M. & Travis, L.B. Mechanisms of therapy related carcinogenesis. Nat Rev Cancer 5, 943 55 (2005). 3. Guo, T. et al. Consistent MYC and FLT4 gene amplification in radiation induced angio but not in other radiation associated atypical vascular lesions. Genes Chromosomes Cancer 50, 25 33 (2011). 4. Kacker, C. et al. High frequency of MYC gene amplification is a common feature of radiation induced s. Further results from EORTC STBSG TL 01/01. Genes Chromosomes Cancer 52, 93 8 (2013). 5. Behjati, S. et al. Distinct H3F3A and H3F3B driver mutations define chondroblastoma and giant cell tumor of bone. Nat Genet 45, 1479 1482 (2013). 6. Smith, S., Reeves, B.R., Wong, L. & Fisher, C. A consistent chromosome translocation in synovial. Cancer Genet Cytogenet 26, 179 80 (1987). 7. Horwich, A. & Swerdlow, A.J. Second primary breast cancer after Hodgkin's disease. Br J Cancer 90, 294 8 (2004).