A METHOD FOR FINDING NOVEL ASSOCIATIONS BETWEEN GENOME-WIDE COPY NUMBER AND DNA METHYLATION PATTERNS
|
|
- Morgan Robinson
- 5 years ago
- Views:
Transcription
1 A METHOD FOR FINDING NOVEL ASSOCIATIONS BETWEEN GENOME-WIDE COPY NUMBER AND DNA METHYLATION PATTERNS Man-Hung Eric Tang 1), Vinay Varadan 2), Sid Kamalakaran 2), Michael Q. Zhang 3), Nevenka Dimitrova 2), James Hicks 1). 1. Cold Spring Harbor Laboratory, 1 Bungtown Rd, NY 12724, USA 2. Philips Research North America, 345 Scarborough Rd, Briarcliff Manor, NY 10510, USA 3. The University of Texas at Dallas, Richardson T 75080, USA and Tsinghua University, Beijing, China mtang@cshl.edu Abstract We present a computational method that combines genome-wide DNA methylation and copy number variation data in an integrated fashion with the aim of finding mechanistic associations between genome instability and local DNA methylation changes. The method is applied to Luminal A breast cancer early-stage tumour samples and focuses on methylation events occurring at frequently rearranged genome locations. Our method accommodates array and sequencing platforms for methylation and DNA copy number estimates. We find significant local methylation changes in tumours tend to occur in the viscinity of breakpoint rich regions, with 80% of the differentially methylated regions occurring within 2Mb from a breakpoint rich locus. Keywords- breast cancer, genome instability, DNA methylation I. INTRODUCTION Breast cancer is a complex genetic disease characterized by multiple genetic and epigenetic changes which have been widely studied in the past two decades. Pioneering works by Perou et al. [1], Sørlie et al. [2] showed that breast tumour cells can be distinguished into five molecular subtypes with clinically different outcomes: Luminal A and B, HER2 positive, basal-like and normal-like. As new high-throughput methodologies have emerged, other genetic anomalies have been studied. Copy Number Variation profiling is a well established methodology to survey major chromosomal rearrangements in the genome. It has been shown in many studies [3,4,5] that CNV patterns are important discriminating features between subtypes of breast and other cancers. Similarly, the characterization of cancer methylomes and their corresponding normal profiles is an important aspect in biomarker discovery. Kamalakaran et al. [6] showed that Luminal and non-luminal breast cancer tumours have different methylation patterns and that differentially methylated genes between tumour and normal cells could be used as prognosis factors. Furthermore, epigenetic subtyping of breast cancer has also been addressed, for example in [7], describing the epigenotypes of Luminal A, B, HER2 positive and basal-like breast tumours. The relationship between gene expression, copy number and DNA methylation is still unclear. The problem has been often tackled in the gene expression perspective, looking at the impact of changes in copy number, methylation levels or both on gene expression, and with the aim of looking for potential therapeutic targets [6,8]. Alongside to these classic genefocused studies, it would be interesting to see whether epigenetic and genetic anomalies occur randomly and if genome instability could be associated with local variations of DNA methylation. For this we need to look at where changes occur rather than the actual levels as in the classic methods. We propose an integrative framework that combines CNV profiles and DNA methylation information of the same tumours focusing at events occurring at common chromosomal rearrangement breakpoints. We showed in Luminal A samples that genomic loci with local DNA methylation changes tend to occur significantly within 2 Mb from chromosomal breakpointdense regions. II. METHOD A. Tumour sample set We used the 119 Norwegian breast cancer dataset described in Sørlie et al. [2]. Each patient of the study is further classified into one of the following sub-groups: Luminal A tumour subtype (40 patients); Luminal B (15), ERBB2 positive (19), basal-like (12), normal-like (14), and 8 undefined. The normal tissue dataset consisted of 11 adjacent breast tissue samples. For each sample, we surveyed DNA methylation and copy number variation data using the experimental platforms described below. The statistical analysis was performed on Luminal A samples only, which represent the largest and most homogeneous group of our dataset. B. MOMA platform We surveyed the methylome of each tumour sample using the MOMA platform [9]. Each CpG island is covered by one or several MOMA fragments that undergo MspI cleavage and McrBC or mock digestion. McrBC and mock digested fragments are then labeled and hybridized on a chip. The hybridization ratio reflects the level of methylation of the probed CpG island. In total, the CpG islands annotated by the UCSC genome browser (hg17 build) are covered by MOMA fragments. The data is normalized by converting the hybridization log-ratios into the probabilistic space using an Expectation-Maximization (EM) method [6]. Each MOMA fragment is assigned one of the following states: high methylation (+1), low methylation (-1) and 0 for partial methylation.
2 C. ROMA platform To measure copy number variation across the genome, we used the ROMA platform described in Lucito et al. [5] The genome is covered by regularly spaced probes printed on an array, providing a coverage of the genome of nucleotides resolution. Copy number ratios are measured using the skin fibroblast CHPSKN-1 cell-line as reference. Since CHPSKN-1 cells come from a male individual, we focused our analysis on the 22 autosomes only. Copy number values are obtained using Circular Binary Segmentation [10]. D. Flow diagram Figure 1 presents the different steps of the analysis procedure. The model contains three layers: input methylation and copy number data (dotted line round boxes), computational modules (solid line round boxes) and output data (square boxes). In the pre-processing step, we derive the profile of copy number gain and losses in the studied set of breast tumours and define windows of similar copy number status (amplified or deleted) across the genome. For each of these windows, a statistical test is performed to evaluate whether the methylation distribution differs from the background. We then obtain a list of regions with local methylation deviations that we compare with loci with high chromosomal breakpoint density. E. CNV analysis across tumour samples We partitioned the genome into variable windows in which copy number ratio remains constant in each sample. Windows are determined by all the breakpoints obtained by segmentation of the copy number values in each sample using the CBS algorithm. Longer intervals describe regions that have very little copy number change across all the patients while short intervals correspond to regions with high copy number changes, ie many breaks across different samples. We defined three levels of amplification in order to bin samples into three categories. In each given interval, samples with a ROMA ratio greater than 1.1 are defined as amplified, deleted if their linear ratio is less than 0.9 and normal if their ROMA ratio fall between these two values. The thresholds that define the normal copy number ratio were chosen empirically to take into account the measurement noise around 1. The CNV profile of the dataset can be then plotted as the fraction of sample showing amplifications and deletions across (Figure 2). F. Detection of local methylation changes To identify local variations of DNA methylation in the 40 luminal A samples, we compared the distribution of methylations calls within each of the intervals defined by all the copy number breakpoints with the one observed across the genome. Each MOMA fragment is surveyed and we can associate to each fragment a triplet of observations accounting for the number of '+1's, '0's, and '-1's seen across all samples. For example, a window can be seen 30 times as +1, 3 times 0 and 7 times -1. Local changes in DNA methylation across the genome were identified using the Hotelling's t 2 -test, a generalization the Student's t-test for multivariate hypothesis testing. The null hypothesis H 0 is defined as the observed distribution of '+1's, '0's, and '-1's observed at each fragment across the MOMA platform. It is calculated based on observations. It has an expectation μ 0 =(μ 01,μ 02,μ 03 ) and covariance B. If a window contains n MOMA fragments, let 1, 2,.., n be n independent 3-dimensional vectors, n , 2,.., n is follows the normal law N(μ,B). Then, the T 2 statistics can be expressed as: where and S T 2 1 = n T 1 μ μ S μ μ n 1 = n 1 0 n μ = 1 n i= 1 i= 1 0 (1) i (2) μ μ T i are the sample maximum likelihood estimators of μ and B. Then T 2 has the Hotelling's T-square distribution and the statistic n p F = p n 1 T² has a Fisher's F distribution with p and n-p degrees of freedom, p=3 and parameter (μ - μ 0 ) T B -1 (μ - μ 0 ). To test whether the null hypothesis H 0 :μ=μ 0 is rejected, we compute the F statistics using the observations 1, 2,.., n of the 3-dimensional normal law N(μ,B) and derive the associated p-value. A window is considered to have a significant deviation in its methylation pattern if its p-value is smaller than G. Breakpoint dense regions We used the segment starts and end defined by the CBS algorithm for the CNV profile of each sample to define our breakpoints. We then calculated the density function and defined the center of the breakpoint dense region as the local maxima of the density. III. RESULTS A. Identification of local methylation changes within frequently recombined regions. Figure 2 summarizes the genome-wide analysis, integrating DNA copy number and DNA methylation for our 40 Luminal A. The aim is to find all local differentially methylated foci along the copy number aberration profile derived from the whole dataset. The top track (CNV) shows the frequency of gains and loses across the genome for all the tumours combined. The genome is partitioned in variable windows in which all the samples share similar copy number state: amplified or deleted. We compute the methylation profile of each of these windows and detect those showing local difference compared to the genome background. The mean methylation distribution of each window along the genome is shown by the tri-color stripe. The ratio of unmethylated, partially methylated and methylated states are respectively blue, yellow and red. The scores of significant loci are shown by the red peaks. i (3) (4)
3 Using a p-value cut-off of 10-3 after Benjamini and Hochberg FDR correction, we identified 66 regions in the genome with significant methylation deviation compare to the distribution seen genome-wide. We compared these location with the regions with high breakpoint density, shown in the bottom track of Figure 2. The results are described in the next section. B. Local methylation changes in Luminal A samples co-localize with breakpoint rich regions within 2Mb The overall picture of spatial distribution of local methylation change compared to background shown in Figure 2 seemed to indicate that they occur frequently near frequently rearranged genomic loci. In order to verify this, we plotted the cummulative fraction of methylation peaks as a function of the distance to the nearest breakpoint dense region (solid red line, Figure 3). The result showed indeed that the observed regions with strong methylation change compared to background occur within 2Mb of a breakpoint. The null distribution was estimated by randomizing the locations of these methylation changes 1000 times and plotting the mean cumulative distribution (dotted red line). We also tested where the difference between the two curves were the most significant (blue line). An FDR corrected Wilcoxon-test was performed between the two distance distributions, taking the one of the randomized data as a reference. Thus the statistics shows where the curved differ the most. We found that the difference of coverage is maximized at a distance of 2Mb where 80% of the local methylation changes in Luminal A co-localize with breakpoint rich regions. The random model on the other hand did not show such a high coverage with 70% of the simulated events falling within 2Mb. IV. CONCLUSIONS We designed a computational framework integrating DNA copy number and DNA methylation with the aim to uncover certain aspects of the mechanisms involved in large chromosomal rearrangements in breast cancer. CNV and DNA methylation are combined in an integrated fashion focusing on regions with frequent chromosomal rearrangments. We found that breakpoint rich genomic regions tend to coincide with local DNA methylation pattern changes. Genome instability is often associated to high density of LINE and SINE elements which have been shown to be methylation resistant in cancer cell lines. In future work, it will be interesting to investigate further the link between methylation, genome instability and retro-transposable repeats. REFERENCES [1] Perou CM, Sørlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, Pollack JR, Ross DT, Johnsen H, Akslen LA, Fluge O, Pergamenschikov A, Williams C, Zhu S, Lønning PE, Børresen-Dale AL, Brown PO, Botstein D., Molecular portraits of human breast tumours., Nature Aug 17;406(6797): [2] Sørlie T, Tibshirani R, Parker J, Hastie T, Marron JS, Nobel A, Deng S, Johnsen H, Pesich R, Geisler S, Demeter J, Perou CM, Lønning PE, Brown PO, Børresen-Dale AL, Botstein D, Repeated observation of breast tumor subtypes in independent gene expression data sets, Proc Natl Acad Sci U S A Jul 8;100(14): Epub 2003 Jun 26. [3] Bergamaschi A, Kim YH, Wang P, Sørlie T, Hernandez-Boussard T, Lonning PE, Tibshirani R, Børresen-Dale AL, Pollack JR, Distinct patterns of DNA copy number alteration are associated with different clinicopathological features and gene-expression subtypes of breast cancer, Genes Chromosomes Cancer Nov;45(11): [4] Chin K, DeVries S, Fridlyand J, Spellman PT, Roydasgupta R, Kuo WL, Lapuk A, Neve RM, Qian Z, Ryder T, Chen F, Feiler H, Tokuyasu T, Kingsley C, Dairkee S, Meng Z, Chew K, Pinkel D, Jain A, Ljung BM, Esserman L, Albertson DG, Waldman FM, Gray JW, Genomic and transcriptional aberrations linked to breast cancer pathophysiologies, Cancer Cell Dec;10(6): [5] Lucito R, Healy J, Alexander J, Reiner A, Esposito D, Chi M, Rodgers L, Brady A, Sebat J, Troge J, West JA, Rostan S, Nguyen KC, Powers S, Ye KQ, Olshen A, Venkatraman E, Norton L, Wigler M, Representational oligonucleotide microarray analysis: a high-resolution method to detect genome copy number variation, Genome Res Oct;13(10): [6] Kamalakaran S, Varadan V, Giercksky Russnes HE, Levy D, Kendall J, Janevski A, Riggs M, Banerjee N, Synnestvedt M, Schlichting E, Kåresen R, Shama Prasada K, Rotti H, Rao R, Rao L, Eric Tang MH, Satyamoorthy K, Lucito R, Wigler M, Dimitrova N, Naume B, Borresen-Dale AL, Hicks JB. DNA methylation patterns in luminal breast cancers differ from non-luminal subtypes and can identify relapse risk independent of other clinical variables. Mol Oncol Feb;5(1): Epub 2010 Dec 2. [7] Bediaga NG, Acha-Sagredo A, Guerra I, Viguri A, Albaina C, Ruiz Diaz I, Rezola R, Alberdi MJ, Dopazo J, Montaner D, de Renobales M, Fernández AF, Field JK, Fraga MF, Liloglou T, de Pancorbo MM. DNA methylation epigenotypes in breast cancer molecular subtypes, Breast Cancer Res Sep 29;12(5):R77. [8] Staaf J, Jönsson G, Ringnér M, Vallon-Christersson J, Grabau D, Arason A, Gunnarsson H, Agnarsson BA, Malmström PO, Johannsson OT, Loman N, Barkardottir RB, Borg A. High-resolution genomic and expression analyses of copy number alterations in HER2-amplified breast cancer. Breast Cancer Res. 2010;12(3):R25. Epub 2010 May 6. [9] Kamalakaran S, Kendall J, Zhao, Tang C, Khan S, Ravi K, Auletta T, Riggs M, Wang Y, Helland A, Naume B, Dimitrova N, Børresen-Dale AL, Hicks J, Lucito R, Methylation detection oligonucleotide microarray analysis: a high-resolution method for detection of CpG island methylation, Nucleic Acids Res Jul;37(12):e89. Epub 2009 May 27. [10] Venkatraman ES, Olshen AB, A faster circular binary segmentation algorithm for the analysis of array CGH data, Bioinformatics Mar 15;23(6): Epub 2007 Jan 18. ACKNOWLEDGMENT Philips Research grant to Cold Spring Harbor Laboratory and NIH ES and HG grants to MQZ.
4 Figure 1. Flowchart of the analysis pipeline. Figure 2. Detection of significant local changes in DNA methylation distribution across the genome. CNV profile of the 40 Luminal A samples are shown on the top track (CNV). Local deviations of the distribution of Methylated, UnMethylated and Partially methylated samples are computed in regions that are frequently amplified or deleted (orange peaks). The actual methylation distribution across samples is shown in the tri-color bar (red-yellow-blue). We compared the locations of the deviations with those where breakpoints frequently occur (Breakpoint density track)
5 Figure 3. Co-localization of regions with significant methylation change with breakpoint rich regions. The solid red-line represents the cummulative fraction of identified regions with methylation change occurring within a certain distance to the nearest breakpoint rich region. The true data significantly differs from random (dotted red line) at a distance of 2Mb.
Understanding DNA Copy Number Data
Understanding DNA Copy Number Data Adam B. Olshen Department of Epidemiology and Biostatistics Helen Diller Family Comprehensive Cancer Center University of California, San Francisco http://cc.ucsf.edu/people/olshena_adam.php
More informationRNA preparation from extracted paraffin cores:
Supplementary methods, Nielsen et al., A comparison of PAM50 intrinsic subtyping with immunohistochemistry and clinical prognostic factors in tamoxifen-treated estrogen receptor positive breast cancer.
More informationComputational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq
Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Philipp Bucher Wednesday January 21, 2009 SIB graduate school course EPFL, Lausanne ChIP-seq against histone variants: Biological
More informationGinkgo Interactive analysis and quality assessment of single-cell CNV data
Ginkgo Interactive analysis and quality assessment of single-cell CNV data @RobAboukhalil Robert Aboukhalil, Tyler Garvin, Jude Kendall, Timour Baslan, Gurinder S. Atwal, Jim Hicks, Michael Wigler, Michael
More informationComparison of Triple Negative Breast Cancer between Asian and Western Data Sets
2010 IEEE International Conference on Bioinformatics and Biomedicine Workshops Comparison of Triple Negative Breast Cancer between Asian and Western Data Sets Lee H. Chen Bioinformatics and Biostatistics
More informationSupplementary Figures
Supplementary Figures Supplementary Figure 1. Pan-cancer analysis of global and local DNA methylation variation a) Variations in global DNA methylation are shown as measured by averaging the genome-wide
More informationSUPPLEMENTAL INFORMATION
SUPPLEMENTAL INFORMATION GO term analysis of differentially methylated SUMIs. GO term analysis of the 458 SUMIs with the largest differential methylation between human and chimp shows that they are more
More informationNature Biotechnology: doi: /nbt.1904
Supplementary Information Comparison between assembly-based SV calls and array CGH results Genome-wide array assessment of copy number changes, such as array comparative genomic hybridization (acgh), is
More informationA Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer
A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer Hautaniemi, Sampsa; Ringnér, Markus; Kauraniemi, Päivikki; Kallioniemi, Anne; Edgren, Henrik; Yli-Harja, Olli; Astola,
More informationNature Methods: doi: /nmeth.3115
Supplementary Figure 1 Analysis of DNA methylation in a cancer cohort based on Infinium 450K data. RnBeads was used to rediscover a clinically distinct subgroup of glioblastoma patients characterized by
More informationAbstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction
Optimization strategy of Copy Number Variant calling using Multiplicom solutions Michael Vyverman, PhD; Laura Standaert, PhD and Wouter Bossuyt, PhD Abstract Copy number variations (CNVs) represent a significant
More informationAssociation of GSTP1 Methylation with Aggressive Phenotype in ER-positive Breast Cancer
Association of GSTP1 Methylation with Aggressive Phenotype in ER-positive Breast Cancer TOMOHIRO MIYAKE 1, TAKAHIRO NAKAYAMA 1, NAOFUMI KAGARA 1, NORIAKI YAMAMOTO 2, YUKIKO NAKAMURA 1, YOKO OTANI 1, KUMIKO
More informationComparison of segmentation methods in cancer samples
fig/logolille2. Comparison of segmentation methods in cancer samples Morgane Pierre-Jean, Guillem Rigaill, Pierre Neuvial Laboratoire Statistique et Génome Université d Évry Val d Éssonne UMR CNRS 8071
More informationRelationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes.
Supplementary Figure 1 Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes. (a,b) Values of coefficients associated with genomic features, separately
More informationNature Genetics: doi: /ng Supplementary Figure 1. Mutational signatures in BCC compared to melanoma.
Supplementary Figure 1 Mutational signatures in BCC compared to melanoma. (a) The effect of transcription-coupled repair as a function of gene expression in BCC. Tumor type specific gene expression levels
More informationBroad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes
Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Kaifu Chen 1,2,3,4,5,10, Zhong Chen 6,10, Dayong Wu 6, Lili Zhang 7, Xueqiu Lin 1,2,8,
More informationIntroduction to Discrimination in Microarray Data Analysis
Introduction to Discrimination in Microarray Data Analysis Jane Fridlyand CBMB University of California, San Francisco Genentech Hall Auditorium, Mission Bay, UCSF October 23, 2004 1 Case Study: Van t
More informationGenetic alterations of histone lysine methyltransferases and their significance in breast cancer
Genetic alterations of histone lysine methyltransferases and their significance in breast cancer Supplementary Materials and Methods Phylogenetic tree of the HMT superfamily The phylogeny outlined in the
More informationSUPPLEMENTARY INFORMATION
doi:10.1038/nature10866 a b 1 2 3 4 5 6 7 Match No Match 1 2 3 4 5 6 7 Turcan et al. Supplementary Fig.1 Concepts mapping H3K27 targets in EF CBX8 targets in EF H3K27 targets in ES SUZ12 targets in ES
More informationMammogram Analysis: Tumor Classification
Mammogram Analysis: Tumor Classification Term Project Report Geethapriya Raghavan geeragh@mail.utexas.edu EE 381K - Multidimensional Digital Signal Processing Spring 2005 Abstract Breast cancer is the
More informationStatistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies
Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies Stanford Biostatistics Workshop Pierre Neuvial with Henrik Bengtsson and Terry Speed Department of Statistics, UC Berkeley
More informationSUPPLEMENTARY APPENDIX
SUPPLEMENTARY APPENDIX 1) Supplemental Figure 1. Histopathologic Characteristics of the Tumors in the Discovery Cohort 2) Supplemental Figure 2. Incorporation of Normal Epidermal Melanocytic Signature
More informationHuman Cancer Genome Project. Bioinformatics/Genomics of Cancer:
Bioinformatics/Genomics of Cancer: Professor of Computer Science, Mathematics and Cell Biology Courant Institute, NYU School of Medicine, Tata Institute of Fundamental Research, and Mt. Sinai School of
More informationLTA Analysis of HapMap Genotype Data
LTA Analysis of HapMap Genotype Data Introduction. This supplement to Global variation in copy number in the human genome, by Redon et al., describes the details of the LTA analysis used to screen HapMap
More informationSupplementary note: Comparison of deletion variants identified in this study and four earlier studies
Supplementary note: Comparison of deletion variants identified in this study and four earlier studies Here we compare the results of this study to potentially overlapping results from four earlier studies
More informationGene Selection for Tumor Classification Using Microarray Gene Expression Data
Gene Selection for Tumor Classification Using Microarray Gene Expression Data K. Yendrapalli, R. Basnet, S. Mukkamala, A. H. Sung Department of Computer Science New Mexico Institute of Mining and Technology
More informationBoosted PRIM with Application to Searching for Oncogenic Pathway of Lung Cancer
Boosted PRIM with Application to Searching for Oncogenic Pathway of Lung Cancer Pei Wang Department of Statistics Stanford University Stanford, CA 94305 wp57@stanford.edu Young Kim, Jonathan Pollack Department
More informationThe 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis
The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis Tieliu Shi tlshi@bio.ecnu.edu.cn The Center for bioinformatics
More informationInteractive analysis and quality assessment of single-cell copy-number variations
Interactive analysis and quality assessment of single-cell copy-number variations Tyler Garvin, Robert Aboukhalil, Jude Kendall, Timour Baslan, Gurinder S. Atwal, James Hicks, Michael Wigler, Michael C.
More informationNature Structural & Molecular Biology: doi: /nsmb.2419
Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb
More information7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.
Supplementary Figure 1 7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Regions targeted by the Even and Odd ChIRP probes mapped to a secondary structure model 56 of the
More informationDigitizing the Proteomes From Big Tissue Biobanks
Digitizing the Proteomes From Big Tissue Biobanks Analyzing 24 Proteomes Per Day by Microflow SWATH Acquisition and Spectronaut Pulsar Analysis Jan Muntel 1, Nick Morrice 2, Roland M. Bruderer 1, Lukas
More informationBiostatistical modelling in genomics for clinical cancer studies
This work was supported by Entente Cordiale Cancer Research Bursaries Biostatistical modelling in genomics for clinical cancer studies Philippe Broët JE 2492 Faculté de Médecine Paris-Sud In collaboration
More informationGenerating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University
Role of Chemical lexposure in Generating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University CNV Discovery Reference Genetic
More informationNature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.
Supplementary Figure 1 Behavioral training. a, Mazes used for behavioral training. Asterisks indicate reward location. Only some example mazes are shown (for example, right choice and not left choice maze
More informationChIP-seq data analysis
ChIP-seq data analysis Harri Lähdesmäki Department of Computer Science Aalto University November 24, 2017 Contents Background ChIP-seq protocol ChIP-seq data analysis Transcriptional regulation Transcriptional
More informationAgilent s Copy Number Variation (CNV) Portfolio
Technical Overview Agilent s Copy Number Variation (CNV) Portfolio Abstract Copy Number Variation (CNV) is now recognized as a prevalent form of structural variation in the genome contributing to human
More informationA COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION
5-9 JATIT. All rights reserved. A COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION 1 H. Mahmoodian, M. Hamiruce Marhaban, 3 R. A. Rahim, R. Rosli, 5 M. Iqbal Saripan 1 PhD student, Department
More informationSingle SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach)
High-Throughput Sequencing Course Gene-Set Analysis Biostatistics and Bioinformatics Summer 28 Section Introduction What is Gene Set Analysis? Many names for gene set analysis: Pathway analysis Gene set
More informationWhite Paper Estimating Complex Phenotype Prevalence Using Predictive Models
White Paper 23-12 Estimating Complex Phenotype Prevalence Using Predictive Models Authors: Nicholas A. Furlotte Aaron Kleinman Robin Smith David Hinds Created: September 25 th, 2015 September 25th, 2015
More informationRisk-prediction modelling in cancer with multiple genomic data sets: a Bayesian variable selection approach
Risk-prediction modelling in cancer with multiple genomic data sets: a Bayesian variable selection approach Manuela Zucknick Division of Biostatistics, German Cancer Research Center Biometry Workshop,
More informationT. R. Golub, D. K. Slonim & Others 1999
T. R. Golub, D. K. Slonim & Others 1999 Big Picture in 1999 The Need for Cancer Classification Cancer classification very important for advances in cancer treatment. Cancers of Identical grade can have
More informationOutlier Analysis. Lijun Zhang
Outlier Analysis Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Extreme Value Analysis Probabilistic Models Clustering for Outlier Detection Distance-Based Outlier Detection Density-Based
More informationAIMS: Absolute Assignment of Breast Cancer Intrinsic Molecular Subtype
AIMS: Absolute Assignment of Breast Cancer Intrinsic Molecular Subtype Eric R. Paquet (eric.r.paquet@gmail.com), Michael T. Hallett (michael.t.hallett@mcgill.ca) 1 1 Department of Biochemistry, Breast
More informationR2: web-based genomics analysis and visualization platform
R2: web-based genomics analysis and visualization platform Overview Jan Koster Department of Oncogenomics Academic Medical Center (AMC) UvA, the Netherlands jankoster@amc.uva.nl jankoster@amc.uva.nl 1
More informationa) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,
Supplementary Information Supplementary Figures Supplementary Figure 1. a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, gene ID and specifities are provided. Those highlighted
More informationcn.mops - Mixture of Poissons for CNV detection in NGS data Günter Klambauer Institute of Bioinformatics, Johannes Kepler University Linz
Software Manual Institute of Bioinformatics, Johannes Kepler University Linz cn.mops - Mixture of Poissons for CNV detection in NGS data Günter Klambauer Institute of Bioinformatics, Johannes Kepler University
More informationFalse Discovery Rates and Copy Number Variation. Bradley Efron and Nancy Zhang Stanford University
False Discovery Rates and Copy Number Variation Bradley Efron and Nancy Zhang Stanford University Three Statistical Centuries 19th (Quetelet) Huge data sets, simple questions 20th (Fisher, Neyman, Hotelling,...
More informationVL Network Analysis ( ) SS2016 Week 3
VL Network Analysis (19401701) SS2016 Week 3 Based on slides by J Ruan (U Texas) Tim Conrad AG Medical Bioinformatics Institut für Mathematik & Informatik, Freie Universität Berlin 1 Motivation 2 Lecture
More informationComputational Investigation of Homologous Recombination DNA Repair Deficiency in Sporadic Breast Cancer
University of Massachusetts Medical School escholarship@umms Open Access Articles Open Access Publications by UMMS Authors 11-16-2017 Computational Investigation of Homologous Recombination DNA Repair
More informationSALSA MLPA probemix P315-B1 EGFR
SALSA MLPA probemix P315-B1 EGFR Lot B1-0215 and B1-0112. As compared to the previous A1 version (lot 0208), two mutation-specific probes for the EGFR mutations L858R and T709M as well as one additional
More informationNovember 9, Johns Hopkins School of Medicine, Baltimore, MD,
Fast detection of de-novo copy number variants from case-parent SNP arrays identifies a deletion on chromosome 7p14.1 associated with non-syndromic isolated cleft lip/palate Samuel G. Younkin 1, Robert
More informationFigure 1. Growth characteristics of GLI2 expressing cells in monolayer culture (A) Expression of GLI2 and downstream targets GLI1 and PTCH in control
Figure 1. Growth characteristics of GLI2 expressing cells in monolayer culture (A) Expression of GLI2 and downstream targets GLI1 and PTCH in control HaCaT Tet, uninduced HaCaT GLI2 and induced HaCaT GLI2
More informationModule 3: Pathway and Drug Development
Module 3: Pathway and Drug Development Table of Contents 1.1 Getting Started... 6 1.2 Identifying a Dasatinib sensitive cancer signature... 7 1.2.1 Identifying and validating a Dasatinib Signature... 7
More informationDetection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit
APPLICATION NOTE Ion PGM System Detection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit Key findings The Ion PGM System, in concert with the Ion ReproSeq PGS View Kit and Ion Reporter
More informationChromothripsis: A New Mechanism For Tumorigenesis? i Fellow s Conference Cheryl Carlson 6/10/2011
Chromothripsis: A New Mechanism For Tumorigenesis? i Fellow s Conference Cheryl Carlson 6/10/2011 Massive Genomic Rearrangement Acquired in a Single Catastrophic Event during Cancer Development Cell 144,
More informationProbability-Based Protein Identification for Post-Translational Modifications and Amino Acid Variants Using Peptide Mass Fingerprint Data
Probability-Based Protein Identification for Post-Translational Modifications and Amino Acid Variants Using Peptide Mass Fingerprint Data Tong WW, McComb ME, Perlman DH, Huang H, O Connor PB, Costello
More informationNature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from
Supplementary Figure 1 SEER data for male and female cancer incidence from 1975 2013. (a,b) Incidence rates of oral cavity and pharynx cancer (a) and leukemia (b) are plotted, grouped by males (blue),
More informationGenomic structural variation
Genomic structural variation Mario Cáceres The new genomic variation DNA sequence differs across individuals much more than researchers had suspected through structural changes A huge amount of structural
More informationSubLasso:a feature selection and classification R package with a. fixed feature subset
SubLasso:a feature selection and classification R package with a fixed feature subset Youxi Luo,3,*, Qinghan Meng,2,*, Ruiquan Ge,2, Guoqin Mai, Jikui Liu, Fengfeng Zhou,#. Shenzhen Institutes of Advanced
More informationGenome-wide copy-number calling (CNAs not CNVs!) Dr Geoff Macintyre
Genome-wide copy-number calling (CNAs not CNVs!) Dr Geoff Macintyre Structural variation (SVs) Copy-number variations C Deletion A B C Balanced rearrangements A B A B C B A C Duplication Inversion Causes
More informationMemorial Sloan-Kettering Cancer Center
Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center, Dept. of Epidemiology & Biostatistics Working Paper Series Year 2007 Paper 14 On Comparing the Clustering of Regression Models
More informationClinical Interpretation of Cancer Genomes
IGENZ Ltd, Auckland, New Zealand Clinical Interpretation of Cancer Genomes Dr Amanda Dixon-McIver www.igenz.co.nz 1992 Slovenia and Croatia gain independence USA and Russia declare the Cold War over Steffi
More informationCHROMOSOMAL MICROARRAY (CGH+SNP)
Chromosome imbalances are a significant cause of developmental delay, mental retardation, autism spectrum disorders, dysmorphic features and/or birth defects. The imbalance of genetic material may be due
More informationChapter 1. Introduction
Chapter 1 Introduction 1.1 Motivation and Goals The increasing availability and decreasing cost of high-throughput (HT) technologies coupled with the availability of computational tools and data form a
More informationMODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION TO BREAST CANCER DATA
International Journal of Software Engineering and Knowledge Engineering Vol. 13, No. 6 (2003) 579 592 c World Scientific Publishing Company MODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION
More informationComputational Analysis of Genome-Wide DNA Copy Number Changes
Computational Analysis of Genome-Wide DNA Copy Number Changes Lei Song Thesis submitted to the faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements
More informationMIR retrotransposon sequences provide insulators to the human genome
Supplementary Information: MIR retrotransposon sequences provide insulators to the human genome Jianrong Wang, Cristina Vicente-García, Davide Seruggia, Eduardo Moltó, Ana Fernandez- Miñán, Ana Neto, Elbert
More informationMALBAC Technology and Its Application in Non-invasive Chromosome Screening (NICS)
MALBAC Technology and Its Application in Non-invasive Chromosome Screening (NICS) The Power of One Adapted from Internet Single Cell Genomic Studies Ultra Low Sample Input Advances and applications of
More informationCanadian College of Medical Geneticists (CCMG) Cytogenetics Examination. May 4, 2010
Canadian College of Medical Geneticists (CCMG) Cytogenetics Examination May 4, 2010 Examination Length = 3 hours Total Marks = 100 (7 questions) Total Pages = 8 (including cover sheet and 2 pages of prints)
More informationSupplementary Figure 1
Supplementary Figure 1 Supplementary Fig. 1: Quality assessment of formalin-fixed paraffin-embedded (FFPE)-derived DNA and nuclei. (a) Multiplex PCR analysis of unrepaired and repaired bulk FFPE gdna from
More informationWhite Paper. Copy number variant detection. Sample to Insight. August 19, 2015
White Paper Copy number variant detection August 19, 2015 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 Fax: +45 86 20 12 22 www.clcbio.com
More informationComputer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015
Goals/Expectations Computer Science, Biology, and Biomedical (CoSBBI) We want to excite you about the world of computer science, biology, and biomedical informatics. Experience what it is like to be a
More informationInter-session reproducibility measures for high-throughput data sources
Inter-session reproducibility measures for high-throughput data sources Milos Hauskrecht, PhD, Richard Pelikan, MSc Computer Science Department, Intelligent Systems Program, Department of Biomedical Informatics,
More informationMicroarrays in primary breast cancer lessons from chemotherapy studies
Endocrine-Related Cancer (2001) 8 259 263 Microarrays in primary breast cancer lessons from chemotherapy studies P E Lønning, T Sørlie 1,2, C M Perou 2, P O Brown 3, D Botstein 2 and A-L Børresen-Dale
More informationSNPrints: Defining SNP signatures for prediction of onset in complex diseases
SNPrints: Defining SNP signatures for prediction of onset in complex diseases Linda Liu, Biomedical Informatics, Stanford University Daniel Newburger, Biomedical Informatics, Stanford University Grace
More informationGenomic complexity and arrays in CLL. Gian Matteo Rigolin, MD, PhD St. Anna University Hospital Ferrara, Italy
Genomic complexity and arrays in CLL Gian Matteo Rigolin, MD, PhD St. Anna University Hospital Ferrara, Italy Clinical relevance of genomic complexity (GC) in CLL GC has been identified as a critical negative
More informationTITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS)
TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS) AUTHORS: Tejas Prahlad INTRODUCTION Acute Respiratory Distress Syndrome (ARDS) is a condition
More informationSystematic Analysis for Identification of Genes Impacting Cancers
Systematic Analysis for Identification of Genes Impacting Cancers Arpita Singhal Stanford University Saint Francis High School ABSTRACT Currently, vast amounts of molecular information involving genomic
More informationof TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed.
Supplementary Note The potential association and implications of HBV integration at known and putative cancer genes of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed. Human telomerase
More informationTITLE: Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer.
AWARD NUMBER: W81XWH-14-1-0080 TITLE: Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer. PRINCIPAL INVESTIGATOR: Christopher B. Umbricht, MD, PhD CONTRACTING ORGANIZATION: Johns
More informationMammogram Analysis: Tumor Classification
Mammogram Analysis: Tumor Classification Literature Survey Report Geethapriya Raghavan geeragh@mail.utexas.edu EE 381K - Multidimensional Digital Signal Processing Spring 2005 Abstract Breast cancer is
More informationApplication of Array-based Comparative Genome Hybridization in Children with Developmental Delay or Mental Retardation
Pediatr Neonatol 2008;49(6):213 217 REVIEW ARTICLE Application of Array-based Comparative Genome Hybridization in Children with Developmental Delay or Mental Retardation Jao-Shwann Liang 1,2 *, Keiko Shimojima
More informationSupplementary Information
Supplementary Information 5-hydroxymethylcytosine-mediated epigenetic dynamics during postnatal neurodevelopment and aging By Keith E. Szulwach 1,8, Xuekun Li 1,8, Yujing Li 1, Chun-Xiao Song 2, Hao Wu
More informationMolecular Markers. Marcie Riches, MD, MS Associate Professor University of North Carolina Scientific Director, Infection and Immune Reconstitution WC
Molecular Markers Marcie Riches, MD, MS Associate Professor University of North Carolina Scientific Director, Infection and Immune Reconstitution WC Overview Testing methods Rationale for molecular testing
More informationDiagnosis of multiple cancer types by shrunken centroids of gene expression
Diagnosis of multiple cancer types by shrunken centroids of gene expression Robert Tibshirani, Trevor Hastie, Balasubramanian Narasimhan, and Gilbert Chu PNAS 99:10:6567-6572, 14 May 2002 Nearest Centroid
More informationMeasuring DNA Methylation with the MinION. Winston Timp Department of Biomedical Engineering Johns Hopkins University 12/1/16
Measuring DNA Methylation with the MinION Winston Timp Department of Biomedical Engineering Johns Hopkins University 12/1/16 Epigenetics: Modern Modern Definition of epigenetics involves heritable changes
More informationCNV Detection and Interpretation in Genomic Data
CNV Detection and Interpretation in Genomic Data Benjamin W. Darbro, M.D., Ph.D. Assistant Professor of Pediatrics Director of the Shivanand R. Patil Cytogenetics and Molecular Laboratory Overview What
More informationOncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies
OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies 2017 Contents Datasets... 2 Protein-protein interaction dataset... 2 Set of known PPIs... 3 Domain-domain interactions...
More informationSALSA MS-MLPA KIT ME011-A1 Mismatch Repair genes (MMR) Lot 0609, 0408, 0807, 0407
SALSA MS-MLPA KIT ME011-A1 Mismatch Repair genes (MMR) Lot 0609, 0408, 0807, 0407 The Mismatch Repair (MMR) system is critical for the maintenance of genomic stability. MMR increases the fidelity of DNA
More informationIntegrated Analysis of Copy Number and Gene Expression
Integrated Analysis of Copy Number and Gene Expression Nexus Copy Number provides user-friendly interface and functionalities to integrate copy number analysis with gene expression results for the purpose
More informationDiffVar: a new method for detecting differential variability with application to methylation in cancer and aging
Genome Biology This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. DiffVar: a new method for detecting
More informationLab 5: Testing Hypotheses about Patterns of Inheritance
Lab 5: Testing Hypotheses about Patterns of Inheritance How do we talk about genetic information? Each cell in living organisms contains DNA. DNA is made of nucleotide subunits arranged in very long strands.
More informationBreast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data
Breast cancer Inferring Transcriptional Module from Breast Cancer Profile Data Breast Cancer and Targeted Therapy Microarray Profile Data Inferring Transcriptional Module Methods CSC 177 Data Warehousing
More informationPeak-calling for ChIP-seq and ATAC-seq
Peak-calling for ChIP-seq and ATAC-seq Shamith Samarajiwa CRUK Autumn School in Bioinformatics 2017 University of Cambridge Overview Peak-calling: identify enriched (signal) regions in ChIP-seq or ATAC-seq
More informationMolecular Methods in the Diagnosis and Prognostication of Melanoma: Pros & Cons
Molecular Methods in the Diagnosis and Prognostication of Melanoma: Pros & Cons Ben J. Friedman, MD Senior Staff Physician Department of Dermatology Department of Pathology and Laboratory Medicine Henry
More informationSUPPLEMENTARY INFORMATION
doi: 1.138/nature8645 Physical coverage (x haploid genomes) 11 6.4 4.9 6.9 6.7 4.4 5.9 9.1 7.6 125 Neither end mapped One end mapped Chimaeras Correct Reads (million ns) 1 75 5 25 HCC1187 HCC1395 HCC1599
More informationCytogenetics 101: Clinical Research and Molecular Genetic Technologies
Cytogenetics 101: Clinical Research and Molecular Genetic Technologies Topics for Today s Presentation 1 Classical vs Molecular Cytogenetics 2 What acgh? 3 What is FISH? 4 What is NGS? 5 How can these
More informationStructural Variation and Medical Genomics
Structural Variation and Medical Genomics Andrew King Department of Biomedical Informatics July 8, 2014 You already know about small scale genetic mutations Single nucleotide polymorphism (SNPs) Deletions,
More informationncounter Data Analysis Guidelines for Copy Number Variation (CNV) Molecules That Count NanoString Technologies, Inc.
ncounter Data Analysis Guidelines for Copy Number Variation (CNV) NanoString Technologies, Inc. 530 Fairview Ave N Suite 2000 Seattle, Washington 98109 www.nanostring.com Tel: 206.378.6266 888.358.6266
More information