The$mitochondrial$and$autosomal$mutation$landscapes$of$ prostate$cancer$ $supplementary$material$
|
|
- Harold Bryant
- 5 years ago
- Views:
Transcription
1 The$mitochondrial$and$autosomal$mutation$landscapes$of$ prostate$cancer$ $supplementary$material$ Johan Lindberg 1, Ian G. Mills 2-4, Daniel Klevebring 1, Wennuan Liu 5, Mårten Neiman 1, Jianfeng Xu 5, Pernilla Wikström 6, Peter Wiklund 7, Fredrik Wiklund 8, Lars Egevad 9, Henrik Grönberg 8. 1 Department of Medical Epidemiology and Biostatistics, Science for Life Laboratory, Karolinska Institutet, Stockholm, Sweden. 2 Prostate Cancer Research Group, Centre for Molecular Medicine Norway, University of Oslo, Oslo, Norway. 3 Department of Urology, Oslo University Hospitals, Oslo, Norway. 4 Department of Cancer Prevention, Oslo University Hospitals, Oslo, Norway. 5 Center for Cancer Genomics, Wake Forest University School of Medicine, NC, USA. 6 Medical Biosciences, Pathology, Umeå University, Umeå, Sweden. 7 Department of Urology, Division of Surgery, Karolinska Hospital, Stockholm, Sweden. 8 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden. 9 Department of Pathology and Cytology, Karolinska University Hospital, Stockholm, Sweden.
2 Table$of$contents$ Supplementary,Text,...,1! Tissues,...,1! Analysis,of,exome,sequence,data,...,1! Validation,of,mitochondrial,sequence,data,quality,...,2! Analysis,of,RNA,sequencing,data,...,3! Estimation,of,cellular,frequencies,and,lowBpass,wholeBgenome,sequencing,...,4! Supplementary,Figures,...,5! Supplementary,Figure,1,...,5! Supplementary,Figure,2,...,6! Supplementary,Figure,3,...,7! Supplementary,references,...,8!!!!
3 Supplementary$Text$ Tissues& All samples were snap frozen in liquid nitrogen and stored at 80 C. Detailed information on handling of tissues was reported by Lindberg et al [1]. DNA and RNA were extracted from the same tumor samples using the Allprep kit (Qiagen, Hilden, Germany) according to standard protocol. RNA quality was assessed using the Agilent 2100 Bioanalyzer and RNA LabChips (Agilent, Santa Clara, CA, USA), which provides RNA integrity numbers (RIN). All RIN numbers were >7.2 (median: 9.3), which suggests excellent quality with little or no degradation. Analysis&of&exome&sequence&data& Except for tumor sample SWE-1 (2 75), all samples were sequenced using Illumina (San Diego, CA, USA) paired-end base pair (bp) sequencing. The sequence data were aligned to the human genome reference (hg19) using Burrows-Wheeler Aligner v0.6.1 (Dice Holdings Inc, New York, NY, USA) [2] with standard settings except parameter q (to trim off low-quality bases at the beginning or end of reads), which was set to 10. Duplicate reads were removed using the MarkDuplicates function in the software suit Picard v1.6.3 (Dice Holdings Inc, New York, NY, USA). Before identifying somatic variants, the sequence data were realigned and recalibrated using tools available in the Gene Analysis Toolkit (GATK) software bundle v1.6.4 (Broad Institute, Cambridge, MA, USA) [3]. Single nucleotide variants (SNVs) were identified using mutect v , a software tool developed at the Broad Institute (Cambridge, MA, USA) for the identification of somatic SNVs in tumor genomes. Only positions with 10 coverage in both tumor and normal genomes were considered for variant calling [4]. Although this marks the lower limit for mutant detection, 95% of variants harbored >15 coverage in both the tumor and the normal genome. Low-frequency variants (<10%) were filtered away if there was any evidence supporting the variant in the normal DNA, regardless of quality. In addition, we detected a subset of variants associated with misaligned reads near simple repeats. These were removed by implementing a strand-bias filter, since the variants were only supported by reads in one direction. Finally, a subset of variants was associated with low mapping quality, due to the presence of the same 100-mer located elsewhere in the genome save one position (namely, the false-positive SNV). These were removed using a mapping-quality filter requiring at least one read to harbor a mapping quality 30. MuTect has been validated in several publications, with excellent validation rates (>90%) [5 7]. We performed conventional Sanger sequencing on 24 autosomal highfrequency variants (>20%), which all were concordant between technologies (Supplementary Table 2). Validation of low-frequency variants is challenging, but the high concordance between variant allele frequencies using RNA sequencing (RNAseq) and exome data demonstrates a high level of performance even for lowfrequency variants (Supplementary Fig. 1). The Indel Genotyper v2.0 available in the! 1!
4 GATK was applied to identify insertions/deletions (indels) [3]. Standard settings were used except for read-depth requirement, which was set to 10 in the tumor sample and 15 in the normal sample. After removal of germline variants, all indels in simple repeats plus 3 bp were removed due to an inflation of misaligned reads causing falsepositive indels in these regions. Subsequently, all indels were manually examined to remove false positives according to Ding et al [8], retaining 12% of putative indels. Both SNVs and indels were annotated using Ensembl s variation application program interface (Ensembl, Hinxton, UK) [9]. To avoid inflation of genes found in metastasis primary pairs or as high-low frequency mutations in tumor samples from the same prostate, common mutations were assigned only to the tumor tissue that harbored it in the highest fraction in all subsequent comparisons, unless stated otherwise. Also, due to high similarity between right and left lymph node metastases of SWE-52, only metastasis SWE-52B was used for calculations unless stated otherwise. MuSiC, a software suite available from The Genome Institute at Washington University (St. Louis, MO, USA), was used to identify significantly mutated genes (SMGs) [10]. MuSiC calculates contextdependent mutation rates (eg, C positions in CpG dinucleotides have elevated mutation rates and should therefore be tested separately), which are tested against a background mutation rate (nonsynonymous mutations were tested vs the nonsynonymous background mutation rate). An overall p value is subsequently obtained for each gene by three different approaches (Fisher exact test, a likelihood test, and convolution). After false-discovery rate (FDR) correction for multiple testing [11], we assumed the same stringency threshold as used previously [7], where genes with a convolution FDR <0.15 and also Fisher exact and/or likelihood FDR <0.15 were considered significant (Supplementary Table 3). Also, two genes, with only multiple mutations in the same tumor, were removed from the list of significantly mutated genes (SMGs). Validation&of&mitochondrial&sequence&data&quality& We used the first 11 normal DNA samples that were assayed on Affy6 (Affymetrix Inc, Santa Clara, CA, USA) to map copy number alterations (CNAs) for validation of the quality of the sequence data mapping to the mitochondrial genome. This allowed for independent verification of 118 mitochondrial positions, on average, per sample. Mitochondrial germline variants from the sequence data were identified using samtools pileup v (Dice Holdings Inc, New York, NY, USA) [12]. In 1183 positions, the reference allele was called in the sequence data. Affy6 demonstrated a disconcordant call in only one position. One hundred-seventeen positions harbored nonreference alleles from the sequence data, of which five were disconcordant versus Affy6. In all five positions, the sequence data identified a heteroplasmic variant due to the presence of a minor allele (fraction: 12 17%) not called by Affy6. Subsequently, we mapped the tumor and normal DNA (all Caucasian subjects, 12 tumor-normal pairs) independently to the Yoruban (NC_001807) mitochondrial genome to determine if the nonreference variation identified in the normal DNA concurred with the tumor sample. On average, each sample carried 38 nonreference alleles. Of 450 positions, 14 demonstrated disconcordance between tumor/normal pairs. In all 14 positions, the reference allele was present as a minor allele (fraction:! 2!
5 all <20%), causing a homoplasmic call in the tumor mitochondrial genome relatively a heteroplasmic call in the normal DNA. Note, when investigating each disconcordant position in detail, the reference allele was present in 11/14 positions of the tumor DNA, albeit in too low frequency for single nucleotide polymorphism detection. Subsequently, we mapped the tumor and normal DNA samples to the Caucasian reference NC_ to identify somatic mutations. When comparing mitochondrial mutation rate to the autosomal mutation rate in protein coding genes, the following ratio was calculated: (total number of mitochondrial electron transport chain [ETC] mutations summarized for all samples / total number of megabases of mitochondrial ETC sequence with sufficient coverage summarized over all samples) / (total number of autosomal coding mutations summarized over all samples / total number of megabases of targeted autosomal regions with sufficient coverage summarized over all samples): (55 / 0.656) / (2162 / ) = The difference in mutation rate made us investigate alternative approaches for identifying SMGs in the mitochondrial genome. There was no difference in mutation rates in transition/transversion categories between coding and noncoding variants. Therefore, we performed a simple frequency test using the displacement (D) loop as background. Since the D loop harbored nine mutations, no ETC gene was mutated at a significantly higher mutation rate. The same approach was undertaken using synonymous mutations as a background. No ETC genes reached statistical significance, due to small sample size relative to the possible number of nonsynonymous/synonymous variants for each gene. Analysis&of&RNA&sequencing&data& RNAseq was performed using the Ovation RNA-Seq System V2 (Nugen Technologies Inc, San Carlos, CA, USA) according to standard protocols. Samples were pooled in equimolar amounts, and sequenced in pairs of eight in one lane on the Hiseq 2000 instrument (San Diego, CA, USA). This yielded reads on average per sample (range: to ). On average, reads per sample mapped to autosomal nonribosomal regions (range: to ). The RNAseq data were aligned to the human genome (hg19) using Tophat v2.0.0 ( [13]. Tophat depends on the Bowtie aligner v (Dice Holdings Inc, New York, NY, USA) and discovers splice sites in a prebuilt transcriptome index file. We used the Ensembl database for these purposes [14]. Subsequently, transcript assembly and abundance estimation was performed using Cufflinks ( [15] with upper-quartile normalization [16] to increase the robustness of expression estimates. To assess a proliferation score for each tumor, we undertook a similar approach as Cuzick and colleagues [17] but with modifications, since they used real-time quantitative polymerase chain reaction to assess expression levels and we did not have access to a control set of samples to define a baseline expression level for each gene. First, genes with fragments per kilobase of transcript per million mapped reads <0.01 were set to 0. Subsequently, the median log 2 expression value was assessed for each gene suggested by Cuzick and colleagues [17] to be associated to prostate cancer proliferation (56 genes in total). For each expressed cell cycle gene, we calculated the following ratio per sample: R = log 2 (expression level in sample X / median expression level over all samples).! 3!
6 Although the majority of genes were expressed in all samples, some were not. There was no significant correlation between the fraction of expressed cell cycle genes and the amount of sequence data per sample (r = 0.13; p = ). Therefore, we undertook two approaches to assess a final proliferation score: (1) log 2 [mean (2^R of all expressed cell cycle genes for sample X)], and (2) 95th quartile of all expressed cell cycle gene log 2 ratios (R) for sample X. We chose approach the second approach because it had to a better correlation to overall Gleason score (r = 0.4; p = 0.002). Estimation&of&cellular&frequencies&and&low<pass&whole<genome& sequencing& The same software, PyClone v0.2, used by Shah and colleagues to estimate cellular fractions harboring individual variants in triple-negative breast cancer [18] was applied to our exome data, in combination with Affy6 or low-pass whole genome sequencing data (Supplementary Table 1) using standard settings. Only sequence data supporting the reference/variant allele with base quality 20 or mapping quality 30 were used for raw frequency estimates. Also, SNVs with read depth <20 were excluded to increase the robustness of the cellular frequency estimations. PyClone clusters variants with similar frequencies, giving them similar cellular frequencies. Therefore, to facilitate plotting of bee swarms of cellular fractions of individual variants (Fig. 3, Supplementary Fig. 2-3), a small constant (eg, in the interval for Fig. 3) was added to each variant. Since samples SWE-54/SWE-55 were not processed using Affy6, we performed low-pass whole-genome sequencing using the same library preparation protocol as described previously [1], but by sequencing without performing targeted capture. We obtained, on average, 1.8-fold coverage throughout the human genome (range: in the whole set of samples from tumor samples SWE-54/ SWE-55). CNA profiles from low-pass whole-genome sequencing data were obtained using BIC-seq [19] with default parameters, except for a 1000 bp bin size, a window size of 300 reads, and lambda set to 2. After segmentation, regions that did not reach significance after adjustment for multiple testing (using correction = TRUE in BICseq:::getSummary) were forced to a log 2 ratio of 0.! 4!
7 Supplementary$Figures$ Supplementary&Figure&1& Variant fraction from RNA sequencing Variant fraction from exome sequencing Supplementary Fig. 1. Comparison of allelic fraction of mitochondrial variants obtained from RNAseq data and exome data. Only positions with 10x coverage in the RNAseq data, and bases with 20 in quality were accounted for. In total, 78 positions harbored enough coverage to be include in the comparison. The Pearson correlation coefficient was 0.93 (p-value = 2.2e-16).! 5!
8 Supplementary&Figure&2& 1 Estimated proportion of cells harboring variant SWE-55A SWE-55B Supplementary Fig. 2. Comparison of SNV cellular frequency estimates from multiple samples of SWE-55. The cellular frequency estimation procedure is described in the Supplementary Text. All variants identified in either SWE-55A or SWE-55B were investigated in both samples. Lines demark variants with supporting sequence data in both samples. The lines are colored according to the color of the sample harboring the variant in highest cellular frequency. In total, out of 50 SNVs identified in both samples, there were sequence data supporting SNVs in 43 positions for SWE-55A and 39 position in SWE-55B. Note, SWE-55A harbored the lowest cancer cellularity of all samples (~50%). Thereof the low estimated maximum SNV cellularity. Nevertheless, the variants present at the highest estimated proportion in the primary biopsy sample were also present in the cluster of highest estimated SNV proportions of the bone metastasis, although it was collected 7.6 years after the initial primary biopsy procedure. This suggests a monoclonal common origin.! 6!
9 Supplementary,Figure,3, Estimated proportion of cells harboring variant 0.5 Estimated proportion of cells harboring variant 0.5 Estimated proportion of cells harboring variant SWE-54A SWE-54B SWE-54A SWE-54C SWE-54B SWE-54C Supplementary Fig. 3. Comparison of SNV cellular frequency estimates from multiple samples of SWE-54. The same analysis as described for SWE-55 (Supplementary Fig. 2) was performed for SWE-54. In total, 155 unique SNVs were found in the targeted regions of all three samples. High-quality sequence data supported the presence of 140, 53 and 70 of the 155 variants in SWE-54A, SWE-54B and SWE-54C, respectively. Note, in the rightmost panel, the metastases samples have a high degree of similarity. In contrast (leftmost and middle panel), it is evident that the major clone of the sampled primary tumor tissue was not seeding the metastasis, as the high-cellular frequency variants of the metastases samples were present at low frequency in the primary tumor tissue. SWE-54A harbored the highest mutation rate of all samples in this dataset (6.6 mutations/mb), interestingly; this was not the case for the metastasis samples (1.0 mutations/mb).! 7
10 Supplementary,references, [1] Lindberg J, Klevebring D, Liu W, et al. Exome sequencing of prostate cancer supports the hypothesis of independent tumour origins. Eur Urol. In press. DOI: /j.eururo [2] Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009;25: [3] McKenna A, Hanna M, Banks E, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 2010;20: [4] Dees N, Zhang Q, Kandoth C, et al. MuSiC: identifying mutational significance in cancer genomes. Genome Res 2012;22: [5] Stransky N, Egloff AM, Tward AD, et al. The mutational landscape of head and neck squamous cell carcinoma. Science 2011;333: [6] Chapman MA, Lawrence MS, Keats JJ, et al. Initial genome sequencing and analysis of multiple myeloma. Nature 2011;471: [7] Cancer Genome Atlas Research Network. Integrated genomic analyses of ovarian carcinoma. Nature 2011;474: [8] Ding L, Ellis MJ, Li S, et al. Genome remodelling in a basal-like breast cancer metastasis and xenograft. Nature 2010;464: [9] Rios D, McLaren WM, Chen Y, et al. A database and API for variation, dense genotyping and resequencing data. BMC Bioinformatics 2010;11:238. [10] Dees ND, Zhang Q, Kandoth C, et al. MuSiC: Identifying mutational significance in cancer genomes. Genome Res 2012;22: [11] Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Statist Soc B 1995;57: [12] Li H, Handsaker B, Wysoker A, et al. The sequence alignment/map format and SAMtools. Bioinformatics 2009;25: [13] Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 2009;25: [14] Flicek P, Amode MR, Barrell D, et al. Ensembl Nucleic Acids Res 2011;39:D [15] Trapnell C, Williams BA, Pertea G, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol 2010;28: [16] Bullard JH, Purdom E, Hansen KD, Dudoit S. Evaluation of statistical methods for normalization and differential expression in mrna-seq experiments. BMC Bioinformatics 2010;11:94. [17] Cuzick J, Swanson GP, Fisher G, et al. Prognostic value of an RNA expression signature derived from cell cycle proliferation genes in patients with prostate cancer: a retrospective study. Lancet Oncol 2011;12: [18] Shah SP, Roth A, Goya R, et al. The clonal and mutational evolution spectrum of primary triple-negative breast cancers. Nature 2012;486: [19] Xi R, Hadjipanayis AG, Luquette LJ, et al. Copy number variation detection in whole-genome sequencing data using the Bayesian information criterion. Proc Natl Acad Sci U S A 2011;108:E ! 8!
Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers
Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies
More informationRASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays
Supplementary Materials RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays Junhee Seok 1*, Weihong Xu 2, Ronald W. Davis 2, Wenzhong Xiao 2,3* 1 School of Electrical Engineering,
More informationSupplemental Methods RNA sequencing experiment
Supplemental Methods RNA sequencing experiment Mice were euthanized as described in the Methods and the right lung was removed, placed in a sterile eppendorf tube, and snap frozen in liquid nitrogen. RNA
More informationDNA-seq Bioinformatics Analysis: Copy Number Variation
DNA-seq Bioinformatics Analysis: Copy Number Variation Elodie Girard elodie.girard@curie.fr U900 institut Curie, INSERM, Mines ParisTech, PSL Research University Paris, France NGS Applications 5C HiC DNA-seq
More informationAdvance Your Genomic Research Using Targeted Resequencing with SeqCap EZ Library
Advance Your Genomic Research Using Targeted Resequencing with SeqCap EZ Library Marilou Wijdicks International Product Manager Research For Life Science Research Only. Not for Use in Diagnostic Procedures.
More informationSingle-strand DNA library preparation improves sequencing of formalin-fixed and paraffin-embedded (FFPE) cancer DNA
www.impactjournals.com/oncotarget/ Oncotarget, Supplementary Materials 2016 Single-strand DNA library preparation improves sequencing of formalin-fixed and paraffin-embedded (FFPE) DNA Supplementary Materials
More informationThe Mitochondrial and Autosomal Mutation Landscapes of Prostate Cancer
EUROPEAN UROLOGY 63 (2013) 702 708 available at www.sciencedirect.com journal homepage: www.europeanurology.com Prostate Cancer The Mitochondrial and Autosomal Mutation Landscapes of Prostate Cancer Johan
More informationAVENIO family of NGS oncology assays ctdna and Tumor Tissue Analysis Kits
AVENIO family of NGS oncology assays ctdna and Tumor Tissue Analysis Kits Accelerating clinical research Next-generation sequencing (NGS) has the ability to interrogate many different genes and detect
More informationTumor mutational burden and its transition towards the clinic
Tumor mutational burden and its transition towards the clinic G C C A T C A C Wolfram Jochum Institute of Pathology Kantonsspital St.Gallen CH-9007 St.Gallen wolfram.jochum@kssg.ch 30th European Congress
More informationSUPPLEMENTARY INFORMATION
doi:10.1038/nature11396 A total of 2078 samples from a large sequencing project at decode were used in this study, 219 samples from 78 trios with two grandchildren who were not also members of other trios,
More informationNature Getetics: doi: /ng.3471
Supplementary Figure 1 Summary of exome sequencing data. ( a ) Exome tumor normal sample sizes for bladder cancer (BLCA), breast cancer (BRCA), carcinoid (CARC), chronic lymphocytic leukemia (CLLX), colorectal
More informationTranscriptome Analysis
Transcriptome Analysis Data Preprocessing Sample Preparation Illumina Sequencing Demultiplexing Raw FastQ Reference Genome (fasta) Reference Annotation (GTF) Reference Genome Analysis Tophat Accepted hits
More informationAD (Leave blank) TITLE: Genomic Characterization of Brain Metastasis in Non-Small Cell Lung Cancer Patients
AD (Leave blank) Award Number: W81XWH-12-1-0444 TITLE: Genomic Characterization of Brain Metastasis in Non-Small Cell Lung Cancer Patients PRINCIPAL INVESTIGATOR: Mark A. Watson, MD PhD CONTRACTING ORGANIZATION:
More informationAnalysis with SureCall 2.1
Analysis with SureCall 2.1 Danielle Fletcher Field Application Scientist July 2014 1 Stages of NGS Analysis Primary analysis, base calling Control Software FASTQ file reads + quality 2 Stages of NGS Analysis
More informationMutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research
Mutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research Application Note Authors John McGuigan, Megan Manion,
More informationSupplementary Figure 1. Estimation of tumour content
Supplementary Figure 1. Estimation of tumour content a, Approach used to estimate the tumour content in S13T1/T2, S6T1/T2, S3T1/T2 and S12T1/T2. Tissue and tumour areas were evaluated by two independent
More informationSupplementary Methods
Supplementary Methods Short Read Preprocessing Reads are preprocessed differently according to how they will be used: detection of the variant in the tumor, discovery of an artifact in the normal or for
More informationBreast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS
Breast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS dr sc. Ana Krivokuća Laboratory for molecular genetics Institute for Oncology and
More informationSupplementary Material for IPred - Integrating Ab Initio and Evidence Based Gene Predictions to Improve Prediction Accuracy
1 SYSTEM REQUIREMENTS 1 Supplementary Material for IPred - Integrating Ab Initio and Evidence Based Gene Predictions to Improve Prediction Accuracy Franziska Zickmann and Bernhard Y. Renard Research Group
More informationSupplementary Tables. Supplementary Figures
Supplementary Files for Zehir, Benayed et al. Mutational Landscape of Metastatic Cancer Revealed from Prospective Clinical Sequencing of 10,000 Patients Supplementary Tables Supplementary Table 1: Sample
More informationThe feasibility of circulating tumour DNA as an alternative to biopsy for mutational characterization in Stage III melanoma patients
The feasibility of circulating tumour DNA as an alternative to biopsy for mutational characterization in Stage III melanoma patients ASSC Scientific Meeting 13 th October 2016 Prof Andrew Barbour UQ SOM
More informationOn the Reproducibility of TCGA Ovarian Cancer MicroRNA Profiles
On the Reproducibility of TCGA Ovarian Cancer MicroRNA Profiles Ying-Wooi Wan 1,2,4, Claire M. Mach 2,3, Genevera I. Allen 1,7,8, Matthew L. Anderson 2,4,5 *, Zhandong Liu 1,5,6,7 * 1 Departments of Pediatrics
More informationFigure S4. 15 Mets Whole Exome. 5 Primary Tumors Cancer Panel and WES. Next Generation Sequencing
Figure S4 Next Generation Sequencing 15 Mets Whole Exome 5 Primary Tumors Cancer Panel and WES Get coverage of all variant loci for all three Mets Variant Filtering Sequence Alignments Index and align
More informationCONTRACTING ORGANIZATION: The Chancellor, Masters and Scholars of the University of Cambridge, Clara East, The Old Schools, Cambridge CB2 1TN
AWARD NUMBER: W81XWH-14-1-0110 TITLE: A Molecular Framework for Understanding DCIS PRINCIPAL INVESTIGATOR: Gregory Hannon CONTRACTING ORGANIZATION: The Chancellor, Masters and Scholars of the University
More informationVARIANT PRIORIZATION AND ANALYSIS INCORPORATING PROBLEMATIC REGIONS OF THE GENOME ANIL PATWARDHAN
VARIANT PRIORIZATION AND ANALYSIS INCORPORATING PROBLEMATIC REGIONS OF THE GENOME ANIL PATWARDHAN Email: apatwardhan@personalis.com MICHAEL CLARK Email: michael.clark@personalis.com ALEX MORGAN Email:
More informationSupplementary Figures
Supplementary Figures Supplementary Figure 1. Heatmap of GO terms for differentially expressed genes. The terms were hierarchically clustered using the GO term enrichment beta. Darker red, higher positive
More informationgenomics for systems biology / ISB2020 RNA sequencing (RNA-seq)
RNA sequencing (RNA-seq) Module Outline MO 13-Mar-2017 RNA sequencing: Introduction 1 WE 15-Mar-2017 RNA sequencing: Introduction 2 MO 20-Mar-2017 Paper: PMID 25954002: Human genomics. The human transcriptome
More informationunderlying metastasis and recurrence in HNSCC, we analyzed two groups of patients. The
Supplementary Figures Figure S1. Patient cohorts and study design. To define and interrogate the genetic alterations underlying metastasis and recurrence in HNSCC, we analyzed two groups of patients. The
More informationSupplementary Figure 1. General strategy to classify genes and identify TSGs.
Supplementary Figure 1. General strategy to classify genes and identify TSGs. Supplementary Figure 2. EE patterns of validated EECTPs (7 novel) in our 24 LUAD samples. Red indicates extremely highly expressed
More informationNGS in Cancer Pathology After the Microscope: From Nucleic Acid to Interpretation
NGS in Cancer Pathology After the Microscope: From Nucleic Acid to Interpretation Michael R. Rossi, PhD, FACMG Assistant Professor Division of Cancer Biology, Department of Radiation Oncology Department
More informationWHOLE EXOME SEQUENCING PIPELINE EVALUATION AND MUTATION DETECTION IN ESOPHAGEAL CANCER PATIENTS
WHOLE EXOME SEQUENCING PIPELINE EVALUATION AND MUTATION DETECTION IN ESOPHAGEAL CANCER PATIENTS SUMMARY Tran Thi Bich Ngoc 1 ; Ho Viet Hoanh 2 ; Vu Phuong Nhung 1 ; Nguyen Hai Ha 1 Nguyen Van Ba 2 ; Nguyen
More informationNGS in tissue and liquid biopsy
NGS in tissue and liquid biopsy Ana Vivancos, PhD Referencias So, why NGS in the clinics? 2000 Sanger Sequencing (1977-) 2016 NGS (2006-) ABIPrism (Applied Biosystems) Up to 2304 per day (96 sequences
More informationGenome. Institute. GenomeVIP: A Genomics Analysis Pipeline for Cloud Computing with Germline and Somatic Calling on Amazon s Cloud. R. Jay Mashl.
GenomeVIP: the Genome Institute at Washington University A Genomics Analysis Pipeline for Cloud Computing with Germline and Somatic Calling on Amazon s Cloud R. Jay Mashl October 20, 2014 Turnkey Variant
More informationRNA SEQUENCING AND DATA ANALYSIS
RNA SEQUENCING AND DATA ANALYSIS Length of mrna transcripts in the human genome 5,000 5,000 4,000 3,000 2,000 4,000 1,000 0 0 200 400 600 800 3,000 2,000 1,000 0 0 2,000 4,000 6,000 8,000 10,000 Length
More informationfl/+ KRas;Atg5 fl/+ KRas;Atg5 fl/fl KRas;Atg5 fl/fl KRas;Atg5 Supplementary Figure 1. Gene set enrichment analyses. (a) (b)
KRas;At KRas;At KRas;At KRas;At a b Supplementary Figure 1. Gene set enrichment analyses. (a) GO gene sets (MSigDB v3. c5) enriched in KRas;Atg5 fl/+ as compared to KRas;Atg5 fl/fl tumors using gene set
More informationAn integrated map of genetic variation from 1092 human genomes
SUPPLEMENTAL METHODS AND MATERIALS Whole genome sequencing Alignment: Short insert paired-end reads were aligned to the GRCh37 reference human genome with 1000 genomes decoy contigs using BWA-mem(1). Somatic
More informationRNA SEQUENCING AND DATA ANALYSIS
RNA SEQUENCING AND DATA ANALYSIS Download slides and package http://odin.mdacc.tmc.edu/~rverhaak/package.zip http://odin.mdacc.tmc.edu/~rverhaak/rna-seqlecture.zip Overview Introduction into the topic
More informationCONTRACTING ORGANIZATION: The Chancellor, Masters and Scholars of the University of Cambridge, Clara East, The Old Schools, Cambridge CB2 1TN
AWARD NUMBER: W81XWH-14-1-0110 TITLE: A Molecular Framework for Understanding DCIS PRINCIPAL INVESTIGATOR: Gregory Hannon CONTRACTING ORGANIZATION: The Chancellor, Masters and Scholars of the University
More informationIdentifying Mutations Responsible for Rare Disorders Using New Technologies
Identifying Mutations Responsible for Rare Disorders Using New Technologies Jacek Majewski, Department of Human Genetics, McGill University, Montreal, QC Canada Mendelian Diseases Clear mode of inheritance
More informationGolden Helix s End-to-End Solution for Clinical Labs
Golden Helix s End-to-End Solution for Clinical Labs Steven Hystad - Field Application Scientist Nathan Fortier Senior Software Engineer 20 most promising Biotech Technology Providers Top 10 Analytics
More informationResearch Strategy: 1. Background and Significance
Research Strategy: 1. Background and Significance 1.1. Heterogeneity is a common feature of cancer. A better understanding of this heterogeneity may present therapeutic opportunities: Intratumor heterogeneity
More informationFrequency(%) KRAS G12 KRAS G13 KRAS A146 KRAS Q61 KRAS K117N PIK3CA H1047 PIK3CA E545 PIK3CA E542K PIK3CA Q546. EGFR exon19 NFS-indel EGFR L858R
Frequency(%) 1 a b ALK FS-indel ALK R1Q HRAS Q61R HRAS G13R IDH R17K IDH R14Q MET exon14 SS-indel KIT D8Y KIT L76P KIT exon11 NFS-indel SMAD4 R361 IDH1 R13 CTNNB1 S37 CTNNB1 S4 AKT1 E17K ERBB D769H ERBB
More informationPREPARED FOR: U.S. Army Medical Research and Materiel Command Fort Detrick, Maryland
AD Award Number: W81XWH-10-1-0589 TITLE: Global Characterization of Protein Altering Mutations in Prostate Cancer PRINCIPAL INVESTIGATOR: Jay Shendure, M.D., Ph.D. CONTRACTING ORGANIZATION: University
More informationNature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from
Supplementary Figure 1 SEER data for male and female cancer incidence from 1975 2013. (a,b) Incidence rates of oral cavity and pharynx cancer (a) and leukemia (b) are plotted, grouped by males (blue),
More informationPerformance Characteristics BRCA MASTR Plus Dx
Performance Characteristics BRCA MASTR Plus Dx with drmid Dx for Illumina NGS systems Manufacturer Multiplicom N.V. Galileïlaan 18 2845 Niel Belgium Table of Contents 1. Workflow... 4 2. Performance Characteristics
More informationDetection of low-frequent mitochondrial DNA variants using SMRT sequencing
Detection of low-frequent mitochondrial DNA variants using SMRT sequencing Marjolein J.A. Weerts SMRT Leiden 2018 June 13 Content Mitochondrial DNA & liquid biopsy in oncology Pitfalls when studying human
More informationNature Methods: doi: /nmeth.3115
Supplementary Figure 1 Analysis of DNA methylation in a cancer cohort based on Infinium 450K data. RnBeads was used to rediscover a clinically distinct subgroup of glioblastoma patients characterized by
More informationChIP-seq data analysis
ChIP-seq data analysis Harri Lähdesmäki Department of Computer Science Aalto University November 24, 2017 Contents Background ChIP-seq protocol ChIP-seq data analysis Transcriptional regulation Transcriptional
More informationMATS: a Bayesian framework for flexible detection of differential alternative splicing from RNA-Seq data
Nucleic Acids Research Advance Access published February 1, 2012 Nucleic Acids Research, 2012, 1 13 doi:10.1093/nar/gkr1291 MATS: a Bayesian framework for flexible detection of differential alternative
More informationSelective depletion of abundant RNAs to enable transcriptome analysis of lowinput and highly-degraded RNA from FFPE breast cancer samples
DNA CLONING DNA AMPLIFICATION & PCR EPIGENETICS RNA ANALYSIS Selective depletion of abundant RNAs to enable transcriptome analysis of lowinput and highly-degraded RNA from FFPE breast cancer samples LIBRARY
More informationData mining with Ensembl Biomart. Stéphanie Le Gras
Data mining with Ensembl Biomart Stéphanie Le Gras (slegras@igbmc.fr) Guidelines Genome data Genome browsers Getting access to genomic data: Ensembl/BioMart 2 Genome Sequencing Example: Human genome 2000:
More informationCRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery
CRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery PacBio CoLab Session October 20, 2017 For Research Use Only. Not for use in diagnostics procedures. Copyright 2017 by Pacific Biosciences
More informationBWA alignment to reference transcriptome and genome. Convert transcriptome mappings back to genome space
Whole genome sequencing Whole exome sequencing BWA alignment to reference transcriptome and genome Convert transcriptome mappings back to genome space genomes Filter on MQ, distance, Cigar string Annotate
More informationPan-cancer analysis of expressed somatic nucleotide variants in long intergenic non-coding RNA
Pan-cancer analysis of expressed somatic nucleotide variants in long intergenic non-coding RNA Travers Ching 1,2, Lana X. Garmire 1,2 1 Molecular Biosciences and Bioengineering Graduate Program, University
More informationNature Genetics: doi: /ng Supplementary Figure 1. Rates of different mutation types in CRC.
Supplementary Figure 1 Rates of different mutation types in CRC. (a) Stratification by mutation type indicates that C>T mutations occur at a significantly greater rate than other types. (b) As for the
More informationVariant Classification. Author: Mike Thiesen, Golden Helix, Inc.
Variant Classification Author: Mike Thiesen, Golden Helix, Inc. Overview Sequencing pipelines are able to identify rare variants not found in catalogs such as dbsnp. As a result, variants in these datasets
More informationGenetic alterations of histone lysine methyltransferases and their significance in breast cancer
Genetic alterations of histone lysine methyltransferases and their significance in breast cancer Supplementary Materials and Methods Phylogenetic tree of the HMT superfamily The phylogeny outlined in the
More informationSupplemental Information. Regulatory T Cells Exhibit Distinct. Features in Human Breast Cancer
Immunity, Volume 45 Supplemental Information Regulatory T Cells Exhibit Distinct Features in Human Breast Cancer George Plitas, Catherine Konopacki, Kenmin Wu, Paula D. Bos, Monica Morrow, Ekaterina V.
More informationMODULE 4: SPLICING. Removal of introns from messenger RNA by splicing
Last update: 05/10/2017 MODULE 4: SPLICING Lesson Plan: Title MEG LAAKSO Removal of introns from messenger RNA by splicing Objectives Identify splice donor and acceptor sites that are best supported by
More informationSUPPLEMENTARY INFORMATION. Intron retention is a widespread mechanism of tumor suppressor inactivation.
SUPPLEMENTARY INFORMATION Intron retention is a widespread mechanism of tumor suppressor inactivation. Hyunchul Jung 1,2,3, Donghoon Lee 1,4, Jongkeun Lee 1,5, Donghyun Park 2,6, Yeon Jeong Kim 2,6, Woong-Yang
More informationSimple, rapid, and reliable RNA sequencing
Simple, rapid, and reliable RNA sequencing RNA sequencing applications RNA sequencing provides fundamental insights into how genomes are organized and regulated, giving us valuable information about the
More informationAVENIO ctdna Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB
Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB Analysis Kits Next-generation performance in liquid biopsies 2 Accelerating clinical research From liquid biopsy to next-generation
More information38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16
38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 PGAR: ASD Candidate Gene Prioritization System Using Expression Patterns Steven Cogill and Liangjiang Wang Department of Genetics and
More informationRNA-seq Introduction
RNA-seq Introduction DNA is the same in all cells but which RNAs that is present is different in all cells There is a wide variety of different functional RNAs Which RNAs (and sometimes then translated
More informationTITLE: Unique Genomic Alterations in Prostate Cancers in African American Men
AD Award Number: W81XWH-12-1-0046 TITLE: Unique Genomic Alterations in Prostate Cancers in African American Men PRINCIPAL INVESTIGATOR: Michael Ittmann, M.D., Ph.D. CONTRACTING ORGANIZATION: Baylor College
More informationClonal Evolution of saml. Johnnie J. Orozco Hematology Fellows Conference May 11, 2012
Clonal Evolution of saml Johnnie J. Orozco Hematology Fellows Conference May 11, 2012 CML: *bcr-abl and imatinib Melanoma: *braf and vemurafenib CRC: *k-ras and cetuximab Esophageal/Gastric: *Her-2/neu
More informationACE ImmunoID Biomarker Discovery Solutions ACE ImmunoID Platform for Tumor Immunogenomics
ACE ImmunoID Biomarker Discovery Solutions ACE ImmunoID Platform for Tumor Immunogenomics Precision Genomics for Immuno-Oncology Personalis, Inc. ACE ImmunoID When one biomarker doesn t tell the whole
More informationMEDICAL GENOMICS LABORATORY. Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG)
Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG) Ordering Information Acceptable specimen types: Fresh blood sample (3-6 ml EDTA; no time limitations associated with receipt)
More informationTranscript reconstruction
Transcript reconstruction Summary I Data types, file formats and utilities Annotation: Genomic regions Genes Peaks bedtools Alignment: Map reads BAM/SAM Samtools Aggregation: Summary files Wig (UCSC) TDF
More informationNEXT GENERATION SEQUENCING. R. Piazza (MD, PhD) Dept. of Medicine and Surgery, University of Milano-Bicocca
NEXT GENERATION SEQUENCING R. Piazza (MD, PhD) Dept. of Medicine and Surgery, University of Milano-Bicocca SANGER SEQUENCING 5 3 3 5 + Capillary Electrophoresis DNA NEXT GENERATION SEQUENCING SOLEXA-ILLUMINA
More informationDaehwan Kim September 2018
Daehwan Kim September 2018 Michael L. Rosenberg Assistant Professor, CPRIT Scholar (214) 645-1738 Lyda Hill Department of Bioinformatics infphilo@gmail.com University of Texas Southwestern Medical Center
More informationAnalyse de données de séquençage haut débit
Analyse de données de séquençage haut débit Vincent Lacroix Laboratoire de Biométrie et Biologie Évolutive INRIA ERABLE 9ème journée ITS 21 & 22 novembre 2017 Lyon https://its.aviesan.fr Sequencing is
More informationIdentification of genomic alterations in cervical cancer biopsies by exome sequencing
Chapter- 4 Identification of genomic alterations in cervical cancer biopsies by exome sequencing 105 4.1 INTRODUCTION Athough HPV has been identified as the prime etiological factor for cervical cancer,
More informationAbstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction
Optimization strategy of Copy Number Variant calling using Multiplicom solutions Michael Vyverman, PhD; Laura Standaert, PhD and Wouter Bossuyt, PhD Abstract Copy number variations (CNVs) represent a significant
More informationCancer Gene Panels. Dr. Andreas Scherer. Dr. Andreas Scherer President and CEO Golden Helix, Inc. Twitter: andreasscherer
Cancer Gene Panels Dr. Andreas Scherer Dr. Andreas Scherer President and CEO Golden Helix, Inc. scherer@goldenhelix.com Twitter: andreasscherer About Golden Helix - Founded in 1998 - Main outside investor:
More informationAmbient temperature regulated flowering time
Ambient temperature regulated flowering time Applications of RNAseq RNA- seq course: The power of RNA-seq June 7 th, 2013; Richard Immink Overview Introduction: Biological research question/hypothesis
More informationGlobal regulation of alternative splicing by adenosine deaminase acting on RNA (ADAR)
Global regulation of alternative splicing by adenosine deaminase acting on RNA (ADAR) O. Solomon, S. Oren, M. Safran, N. Deshet-Unger, P. Akiva, J. Jacob-Hirsch, K. Cesarkas, R. Kabesa, N. Amariglio, R.
More informationTITLE: - Whole Genome Sequencing of High-Risk Families to Identify New Mutational Mechanisms of Breast Cancer Predisposition
AD Award Number: W81XWH-13-1-0336 TITLE: - Whole Genome Sequencing of High-Risk Families to Identify New Mutational Mechanisms of Breast Cancer Predisposition PRINCIPAL INVESTIGATOR: Mary-Claire King,
More informationIdentification of Genetic Determinants of Metastasis and Clonal Relationships Between Primary and Metastatic Tumors
Washington University in St. Louis Washington University Open Scholarship All Theses and Dissertations (ETDs) Spring 5-1-2013 Identification of Genetic Determinants of Metastasis and Clonal Relationships
More informationSupplemental Figure legends
Supplemental Figure legends Supplemental Figure S1 Frequently mutated genes. Frequently mutated genes (mutated in at least four patients) with information about mutation frequency, RNA-expression and copy-number.
More informationThe 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis
The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis Tieliu Shi tlshi@bio.ecnu.edu.cn The Center for bioinformatics
More informationa) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,
Supplementary Information Supplementary Figures Supplementary Figure 1. a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, gene ID and specifities are provided. Those highlighted
More informationBelow, we included the point-to-point response to the comments of both reviewers.
To the Editor and Reviewers: We would like to thank the editor and reviewers for careful reading, and constructive suggestions for our manuscript. According to comments from both reviewers, we have comprehensively
More informationncounter Assay Automated Process Immobilize and align reporter for image collecting and barcode counting ncounter Prep Station
ncounter Assay ncounter Prep Station Automated Process Hybridize Reporter to RNA Remove excess reporters Bind reporter to surface Immobilize and align reporter Image surface Count codes Immobilize and
More informationSupplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first
Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first intron IGLL5 mutation depicting biallelic mutations. Red arrows highlight the presence of out of phase
More informationCDH1 truncating alterations were detected in all six plasmacytoid-variant bladder tumors analyzed by whole-exome sequencing.
Supplementary Figure 1 CDH1 truncating alterations were detected in all six plasmacytoid-variant bladder tumors analyzed by whole-exome sequencing. Whole-exome sequencing of six plasmacytoid-variant bladder
More informationSupplementary Materials for
www.sciencetranslationalmedicine.org/cgi/content/full/7/283/283ra54/dc1 Supplementary Materials for Clonal status of actionable driver events and the timing of mutational processes in cancer evolution
More informationIso-Seq Method Updates and Target Enrichment Without Amplification for SMRT Sequencing
Iso-Seq Method Updates and Target Enrichment Without Amplification for SMRT Sequencing PacBio Americas User Group Meeting Sample Prep Workshop June.27.2017 Tyson Clark, Ph.D. For Research Use Only. Not
More informationDeploying the full transcriptome using RNA sequencing. Jo Vandesompele, CSO and co-founder The Non-Coding Genome May 12, 2016, Leuven
Deploying the full transcriptome using RNA sequencing Jo Vandesompele, CSO and co-founder The Non-Coding Genome May 12, 2016, Leuven Roadmap Biogazelle the power of RNA reasons to study non-coding RNA
More informationTITLE: Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer.
AWARD NUMBER: W81XWH-14-1-0080 TITLE: Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer. PRINCIPAL INVESTIGATOR: Christopher B. Umbricht, MD, PhD CONTRACTING ORGANIZATION: Johns
More informationSupplementary Online Content
Supplementary Online Content Fumagalli D, Venet D, Ignatiadis M, et al. RNA Sequencing to predict response to neoadjuvant anti-her2 therapy: a secondary analysis of the NeoALTTO randomized clinical trial.
More informationPredominant contribution of cis-regulatory divergence in the evolution of mouse alternative splicing
Molecular Systems Biology Peer Review Process File Predominant contribution of cis-regulatory divergence in the evolution of mouse alternative splicing Mr. Qingsong Gao, Wei Sun, Marlies Ballegeer, Claude
More informationSUPPLEMENTARY APPENDIX
SUPPLEMENTARY APPENDIX 1) Supplemental Figure 1. Histopathologic Characteristics of the Tumors in the Discovery Cohort 2) Supplemental Figure 2. Incorporation of Normal Epidermal Melanocytic Signature
More informationYork criteria, 6 RA patients and 10 age- and gender-matched healthy controls (HCs).
MATERIALS AND METHODS Study population Blood samples were obtained from 15 patients with AS fulfilling the modified New York criteria, 6 RA patients and 10 age- and gender-matched healthy controls (HCs).
More informationNature Structural & Molecular Biology: doi: /nsmb.2419
Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb
More informationClasificación Molecular del Cáncer de Próstata. JM Piulats
Clasificación Molecular del Cáncer de Próstata JM Piulats Introduction The Gleason score is the major method for prostate cancer tissue grading and the most important prognostic factor in this disease.
More informationPatnaik SK, et al. MicroRNAs to accurately histotype NSCLC biopsies
Patnaik SK, et al. MicroRNAs to accurately histotype NSCLC biopsies. 2014. Supplemental Digital Content 1. Appendix 1. External data-sets used for associating microrna expression with lung squamous cell
More informationDr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc.
Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Topics Overview of Data Processing Pipeline Overview of Data Files 2 DNA Nano-Ball (DNB) Read Structure Genome : acgtacatgcattcacacatgcttagctatctctcgccag
More informationDetection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit
APPLICATION NOTE Ion PGM System Detection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit Key findings The Ion PGM System, in concert with the Ion ReproSeq PGS View Kit and Ion Reporter
More informationMyeloma Genetics what do we know and where are we going?
in partnership with Myeloma Genetics what do we know and where are we going? Dr Brian Walker Thames Valley Cancer Network 14 th September 2015 Making the discoveries that defeat cancer Myeloma Genome:
More information