A METHOD FOR FINDING NOVEL ASSOCIATIONS BETWEEN GENOME-WIDE COPY NUMBER AND DNA METHYLATION PATTERNS

Size: px
Start display at page:

Download "A METHOD FOR FINDING NOVEL ASSOCIATIONS BETWEEN GENOME-WIDE COPY NUMBER AND DNA METHYLATION PATTERNS"

Transcription

1 A METHOD FOR FINDING NOVEL ASSOCIATIONS BETWEEN GENOME-WIDE COPY NUMBER AND DNA METHYLATION PATTERNS Man-Hung Eric Tang 1), Vinay Varadan 2), Sid Kamalakaran 2), Michael Q. Zhang 3), Nevenka Dimitrova 2), James Hicks 1). 1. Cold Spring Harbor Laboratory, 1 Bungtown Rd, NY 12724, USA 2. Philips Research North America, 345 Scarborough Rd, Briarcliff Manor, NY 10510, USA 3. The University of Texas at Dallas, Richardson T 75080, USA and Tsinghua University, Beijing, China mtang@cshl.edu Abstract We present a computational method that combines genome-wide DNA methylation and copy number variation data in an integrated fashion with the aim of finding mechanistic associations between genome instability and local DNA methylation changes. The method is applied to Luminal A breast cancer early-stage tumour samples and focuses on methylation events occurring at frequently rearranged genome locations. Our method accommodates array and sequencing platforms for methylation and DNA copy number estimates. We find significant local methylation changes in tumours tend to occur in the viscinity of breakpoint rich regions, with 80% of the differentially methylated regions occurring within 2Mb from a breakpoint rich locus. Keywords- breast cancer, genome instability, DNA methylation I. INTRODUCTION Breast cancer is a complex genetic disease characterized by multiple genetic and epigenetic changes which have been widely studied in the past two decades. Pioneering works by Perou et al. [1], Sørlie et al. [2] showed that breast tumour cells can be distinguished into five molecular subtypes with clinically different outcomes: Luminal A and B, HER2 positive, basal-like and normal-like. As new high-throughput methodologies have emerged, other genetic anomalies have been studied. Copy Number Variation profiling is a well established methodology to survey major chromosomal rearrangements in the genome. It has been shown in many studies [3,4,5] that CNV patterns are important discriminating features between subtypes of breast and other cancers. Similarly, the characterization of cancer methylomes and their corresponding normal profiles is an important aspect in biomarker discovery. Kamalakaran et al. [6] showed that Luminal and non-luminal breast cancer tumours have different methylation patterns and that differentially methylated genes between tumour and normal cells could be used as prognosis factors. Furthermore, epigenetic subtyping of breast cancer has also been addressed, for example in [7], describing the epigenotypes of Luminal A, B, HER2 positive and basal-like breast tumours. The relationship between gene expression, copy number and DNA methylation is still unclear. The problem has been often tackled in the gene expression perspective, looking at the impact of changes in copy number, methylation levels or both on gene expression, and with the aim of looking for potential therapeutic targets [6,8]. Alongside to these classic genefocused studies, it would be interesting to see whether epigenetic and genetic anomalies occur randomly and if genome instability could be associated with local variations of DNA methylation. For this we need to look at where changes occur rather than the actual levels as in the classic methods. We propose an integrative framework that combines CNV profiles and DNA methylation information of the same tumours focusing at events occurring at common chromosomal rearrangement breakpoints. We showed in Luminal A samples that genomic loci with local DNA methylation changes tend to occur significantly within 2 Mb from chromosomal breakpointdense regions. II. METHOD A. Tumour sample set We used the 119 Norwegian breast cancer dataset described in Sørlie et al. [2]. Each patient of the study is further classified into one of the following sub-groups: Luminal A tumour subtype (40 patients); Luminal B (15), ERBB2 positive (19), basal-like (12), normal-like (14), and 8 undefined. The normal tissue dataset consisted of 11 adjacent breast tissue samples. For each sample, we surveyed DNA methylation and copy number variation data using the experimental platforms described below. The statistical analysis was performed on Luminal A samples only, which represent the largest and most homogeneous group of our dataset. B. MOMA platform We surveyed the methylome of each tumour sample using the MOMA platform [9]. Each CpG island is covered by one or several MOMA fragments that undergo MspI cleavage and McrBC or mock digestion. McrBC and mock digested fragments are then labeled and hybridized on a chip. The hybridization ratio reflects the level of methylation of the probed CpG island. In total, the CpG islands annotated by the UCSC genome browser (hg17 build) are covered by MOMA fragments. The data is normalized by converting the hybridization log-ratios into the probabilistic space using an Expectation-Maximization (EM) method [6]. Each MOMA fragment is assigned one of the following states: high methylation (+1), low methylation (-1) and 0 for partial methylation.

2 C. ROMA platform To measure copy number variation across the genome, we used the ROMA platform described in Lucito et al. [5] The genome is covered by regularly spaced probes printed on an array, providing a coverage of the genome of nucleotides resolution. Copy number ratios are measured using the skin fibroblast CHPSKN-1 cell-line as reference. Since CHPSKN-1 cells come from a male individual, we focused our analysis on the 22 autosomes only. Copy number values are obtained using Circular Binary Segmentation [10]. D. Flow diagram Figure 1 presents the different steps of the analysis procedure. The model contains three layers: input methylation and copy number data (dotted line round boxes), computational modules (solid line round boxes) and output data (square boxes). In the pre-processing step, we derive the profile of copy number gain and losses in the studied set of breast tumours and define windows of similar copy number status (amplified or deleted) across the genome. For each of these windows, a statistical test is performed to evaluate whether the methylation distribution differs from the background. We then obtain a list of regions with local methylation deviations that we compare with loci with high chromosomal breakpoint density. E. CNV analysis across tumour samples We partitioned the genome into variable windows in which copy number ratio remains constant in each sample. Windows are determined by all the breakpoints obtained by segmentation of the copy number values in each sample using the CBS algorithm. Longer intervals describe regions that have very little copy number change across all the patients while short intervals correspond to regions with high copy number changes, ie many breaks across different samples. We defined three levels of amplification in order to bin samples into three categories. In each given interval, samples with a ROMA ratio greater than 1.1 are defined as amplified, deleted if their linear ratio is less than 0.9 and normal if their ROMA ratio fall between these two values. The thresholds that define the normal copy number ratio were chosen empirically to take into account the measurement noise around 1. The CNV profile of the dataset can be then plotted as the fraction of sample showing amplifications and deletions across (Figure 2). F. Detection of local methylation changes To identify local variations of DNA methylation in the 40 luminal A samples, we compared the distribution of methylations calls within each of the intervals defined by all the copy number breakpoints with the one observed across the genome. Each MOMA fragment is surveyed and we can associate to each fragment a triplet of observations accounting for the number of '+1's, '0's, and '-1's seen across all samples. For example, a window can be seen 30 times as +1, 3 times 0 and 7 times -1. Local changes in DNA methylation across the genome were identified using the Hotelling's t 2 -test, a generalization the Student's t-test for multivariate hypothesis testing. The null hypothesis H 0 is defined as the observed distribution of '+1's, '0's, and '-1's observed at each fragment across the MOMA platform. It is calculated based on observations. It has an expectation μ 0 =(μ 01,μ 02,μ 03 ) and covariance B. If a window contains n MOMA fragments, let 1, 2,.., n be n independent 3-dimensional vectors, n , 2,.., n is follows the normal law N(μ,B). Then, the T 2 statistics can be expressed as: where and S T 2 1 = n T 1 μ μ S μ μ n 1 = n 1 0 n μ = 1 n i= 1 i= 1 0 (1) i (2) μ μ T i are the sample maximum likelihood estimators of μ and B. Then T 2 has the Hotelling's T-square distribution and the statistic n p F = p n 1 T² has a Fisher's F distribution with p and n-p degrees of freedom, p=3 and parameter (μ - μ 0 ) T B -1 (μ - μ 0 ). To test whether the null hypothesis H 0 :μ=μ 0 is rejected, we compute the F statistics using the observations 1, 2,.., n of the 3-dimensional normal law N(μ,B) and derive the associated p-value. A window is considered to have a significant deviation in its methylation pattern if its p-value is smaller than G. Breakpoint dense regions We used the segment starts and end defined by the CBS algorithm for the CNV profile of each sample to define our breakpoints. We then calculated the density function and defined the center of the breakpoint dense region as the local maxima of the density. III. RESULTS A. Identification of local methylation changes within frequently recombined regions. Figure 2 summarizes the genome-wide analysis, integrating DNA copy number and DNA methylation for our 40 Luminal A. The aim is to find all local differentially methylated foci along the copy number aberration profile derived from the whole dataset. The top track (CNV) shows the frequency of gains and loses across the genome for all the tumours combined. The genome is partitioned in variable windows in which all the samples share similar copy number state: amplified or deleted. We compute the methylation profile of each of these windows and detect those showing local difference compared to the genome background. The mean methylation distribution of each window along the genome is shown by the tri-color stripe. The ratio of unmethylated, partially methylated and methylated states are respectively blue, yellow and red. The scores of significant loci are shown by the red peaks. i (3) (4)

3 Using a p-value cut-off of 10-3 after Benjamini and Hochberg FDR correction, we identified 66 regions in the genome with significant methylation deviation compare to the distribution seen genome-wide. We compared these location with the regions with high breakpoint density, shown in the bottom track of Figure 2. The results are described in the next section. B. Local methylation changes in Luminal A samples co-localize with breakpoint rich regions within 2Mb The overall picture of spatial distribution of local methylation change compared to background shown in Figure 2 seemed to indicate that they occur frequently near frequently rearranged genomic loci. In order to verify this, we plotted the cummulative fraction of methylation peaks as a function of the distance to the nearest breakpoint dense region (solid red line, Figure 3). The result showed indeed that the observed regions with strong methylation change compared to background occur within 2Mb of a breakpoint. The null distribution was estimated by randomizing the locations of these methylation changes 1000 times and plotting the mean cumulative distribution (dotted red line). We also tested where the difference between the two curves were the most significant (blue line). An FDR corrected Wilcoxon-test was performed between the two distance distributions, taking the one of the randomized data as a reference. Thus the statistics shows where the curved differ the most. We found that the difference of coverage is maximized at a distance of 2Mb where 80% of the local methylation changes in Luminal A co-localize with breakpoint rich regions. The random model on the other hand did not show such a high coverage with 70% of the simulated events falling within 2Mb. IV. CONCLUSIONS We designed a computational framework integrating DNA copy number and DNA methylation with the aim to uncover certain aspects of the mechanisms involved in large chromosomal rearrangements in breast cancer. CNV and DNA methylation are combined in an integrated fashion focusing on regions with frequent chromosomal rearrangments. We found that breakpoint rich genomic regions tend to coincide with local DNA methylation pattern changes. Genome instability is often associated to high density of LINE and SINE elements which have been shown to be methylation resistant in cancer cell lines. In future work, it will be interesting to investigate further the link between methylation, genome instability and retro-transposable repeats. REFERENCES [1] Perou CM, Sørlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, Pollack JR, Ross DT, Johnsen H, Akslen LA, Fluge O, Pergamenschikov A, Williams C, Zhu S, Lønning PE, Børresen-Dale AL, Brown PO, Botstein D., Molecular portraits of human breast tumours., Nature Aug 17;406(6797): [2] Sørlie T, Tibshirani R, Parker J, Hastie T, Marron JS, Nobel A, Deng S, Johnsen H, Pesich R, Geisler S, Demeter J, Perou CM, Lønning PE, Brown PO, Børresen-Dale AL, Botstein D, Repeated observation of breast tumor subtypes in independent gene expression data sets, Proc Natl Acad Sci U S A Jul 8;100(14): Epub 2003 Jun 26. [3] Bergamaschi A, Kim YH, Wang P, Sørlie T, Hernandez-Boussard T, Lonning PE, Tibshirani R, Børresen-Dale AL, Pollack JR, Distinct patterns of DNA copy number alteration are associated with different clinicopathological features and gene-expression subtypes of breast cancer, Genes Chromosomes Cancer Nov;45(11): [4] Chin K, DeVries S, Fridlyand J, Spellman PT, Roydasgupta R, Kuo WL, Lapuk A, Neve RM, Qian Z, Ryder T, Chen F, Feiler H, Tokuyasu T, Kingsley C, Dairkee S, Meng Z, Chew K, Pinkel D, Jain A, Ljung BM, Esserman L, Albertson DG, Waldman FM, Gray JW, Genomic and transcriptional aberrations linked to breast cancer pathophysiologies, Cancer Cell Dec;10(6): [5] Lucito R, Healy J, Alexander J, Reiner A, Esposito D, Chi M, Rodgers L, Brady A, Sebat J, Troge J, West JA, Rostan S, Nguyen KC, Powers S, Ye KQ, Olshen A, Venkatraman E, Norton L, Wigler M, Representational oligonucleotide microarray analysis: a high-resolution method to detect genome copy number variation, Genome Res Oct;13(10): [6] Kamalakaran S, Varadan V, Giercksky Russnes HE, Levy D, Kendall J, Janevski A, Riggs M, Banerjee N, Synnestvedt M, Schlichting E, Kåresen R, Shama Prasada K, Rotti H, Rao R, Rao L, Eric Tang MH, Satyamoorthy K, Lucito R, Wigler M, Dimitrova N, Naume B, Borresen-Dale AL, Hicks JB. DNA methylation patterns in luminal breast cancers differ from non-luminal subtypes and can identify relapse risk independent of other clinical variables. Mol Oncol Feb;5(1): Epub 2010 Dec 2. [7] Bediaga NG, Acha-Sagredo A, Guerra I, Viguri A, Albaina C, Ruiz Diaz I, Rezola R, Alberdi MJ, Dopazo J, Montaner D, de Renobales M, Fernández AF, Field JK, Fraga MF, Liloglou T, de Pancorbo MM. DNA methylation epigenotypes in breast cancer molecular subtypes, Breast Cancer Res Sep 29;12(5):R77. [8] Staaf J, Jönsson G, Ringnér M, Vallon-Christersson J, Grabau D, Arason A, Gunnarsson H, Agnarsson BA, Malmström PO, Johannsson OT, Loman N, Barkardottir RB, Borg A. High-resolution genomic and expression analyses of copy number alterations in HER2-amplified breast cancer. Breast Cancer Res. 2010;12(3):R25. Epub 2010 May 6. [9] Kamalakaran S, Kendall J, Zhao, Tang C, Khan S, Ravi K, Auletta T, Riggs M, Wang Y, Helland A, Naume B, Dimitrova N, Børresen-Dale AL, Hicks J, Lucito R, Methylation detection oligonucleotide microarray analysis: a high-resolution method for detection of CpG island methylation, Nucleic Acids Res Jul;37(12):e89. Epub 2009 May 27. [10] Venkatraman ES, Olshen AB, A faster circular binary segmentation algorithm for the analysis of array CGH data, Bioinformatics Mar 15;23(6): Epub 2007 Jan 18. ACKNOWLEDGMENT Philips Research grant to Cold Spring Harbor Laboratory and NIH ES and HG grants to MQZ.

4 Figure 1. Flowchart of the analysis pipeline. Figure 2. Detection of significant local changes in DNA methylation distribution across the genome. CNV profile of the 40 Luminal A samples are shown on the top track (CNV). Local deviations of the distribution of Methylated, UnMethylated and Partially methylated samples are computed in regions that are frequently amplified or deleted (orange peaks). The actual methylation distribution across samples is shown in the tri-color bar (red-yellow-blue). We compared the locations of the deviations with those where breakpoints frequently occur (Breakpoint density track)

5 Figure 3. Co-localization of regions with significant methylation change with breakpoint rich regions. The solid red-line represents the cummulative fraction of identified regions with methylation change occurring within a certain distance to the nearest breakpoint rich region. The true data significantly differs from random (dotted red line) at a distance of 2Mb.

Understanding DNA Copy Number Data

Understanding DNA Copy Number Data Understanding DNA Copy Number Data Adam B. Olshen Department of Epidemiology and Biostatistics Helen Diller Family Comprehensive Cancer Center University of California, San Francisco http://cc.ucsf.edu/people/olshena_adam.php

More information

RNA preparation from extracted paraffin cores:

RNA preparation from extracted paraffin cores: Supplementary methods, Nielsen et al., A comparison of PAM50 intrinsic subtyping with immunohistochemistry and clinical prognostic factors in tamoxifen-treated estrogen receptor positive breast cancer.

More information

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Philipp Bucher Wednesday January 21, 2009 SIB graduate school course EPFL, Lausanne ChIP-seq against histone variants: Biological

More information

Ginkgo Interactive analysis and quality assessment of single-cell CNV data

Ginkgo Interactive analysis and quality assessment of single-cell CNV data Ginkgo Interactive analysis and quality assessment of single-cell CNV data @RobAboukhalil Robert Aboukhalil, Tyler Garvin, Jude Kendall, Timour Baslan, Gurinder S. Atwal, Jim Hicks, Michael Wigler, Michael

More information

Comparison of Triple Negative Breast Cancer between Asian and Western Data Sets

Comparison of Triple Negative Breast Cancer between Asian and Western Data Sets 2010 IEEE International Conference on Bioinformatics and Biomedicine Workshops Comparison of Triple Negative Breast Cancer between Asian and Western Data Sets Lee H. Chen Bioinformatics and Biostatistics

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Figure 1. Pan-cancer analysis of global and local DNA methylation variation a) Variations in global DNA methylation are shown as measured by averaging the genome-wide

More information

SUPPLEMENTAL INFORMATION

SUPPLEMENTAL INFORMATION SUPPLEMENTAL INFORMATION GO term analysis of differentially methylated SUMIs. GO term analysis of the 458 SUMIs with the largest differential methylation between human and chimp shows that they are more

More information

Nature Biotechnology: doi: /nbt.1904

Nature Biotechnology: doi: /nbt.1904 Supplementary Information Comparison between assembly-based SV calls and array CGH results Genome-wide array assessment of copy number changes, such as array comparative genomic hybridization (acgh), is

More information

A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer

A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer Hautaniemi, Sampsa; Ringnér, Markus; Kauraniemi, Päivikki; Kallioniemi, Anne; Edgren, Henrik; Yli-Harja, Olli; Astola,

More information

Nature Methods: doi: /nmeth.3115

Nature Methods: doi: /nmeth.3115 Supplementary Figure 1 Analysis of DNA methylation in a cancer cohort based on Infinium 450K data. RnBeads was used to rediscover a clinically distinct subgroup of glioblastoma patients characterized by

More information

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction Optimization strategy of Copy Number Variant calling using Multiplicom solutions Michael Vyverman, PhD; Laura Standaert, PhD and Wouter Bossuyt, PhD Abstract Copy number variations (CNVs) represent a significant

More information

Association of GSTP1 Methylation with Aggressive Phenotype in ER-positive Breast Cancer

Association of GSTP1 Methylation with Aggressive Phenotype in ER-positive Breast Cancer Association of GSTP1 Methylation with Aggressive Phenotype in ER-positive Breast Cancer TOMOHIRO MIYAKE 1, TAKAHIRO NAKAYAMA 1, NAOFUMI KAGARA 1, NORIAKI YAMAMOTO 2, YUKIKO NAKAMURA 1, YOKO OTANI 1, KUMIKO

More information

Comparison of segmentation methods in cancer samples

Comparison of segmentation methods in cancer samples fig/logolille2. Comparison of segmentation methods in cancer samples Morgane Pierre-Jean, Guillem Rigaill, Pierre Neuvial Laboratoire Statistique et Génome Université d Évry Val d Éssonne UMR CNRS 8071

More information

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes.

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes. Supplementary Figure 1 Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes. (a,b) Values of coefficients associated with genomic features, separately

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Mutational signatures in BCC compared to melanoma.

Nature Genetics: doi: /ng Supplementary Figure 1. Mutational signatures in BCC compared to melanoma. Supplementary Figure 1 Mutational signatures in BCC compared to melanoma. (a) The effect of transcription-coupled repair as a function of gene expression in BCC. Tumor type specific gene expression levels

More information

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Kaifu Chen 1,2,3,4,5,10, Zhong Chen 6,10, Dayong Wu 6, Lili Zhang 7, Xueqiu Lin 1,2,8,

More information

Introduction to Discrimination in Microarray Data Analysis

Introduction to Discrimination in Microarray Data Analysis Introduction to Discrimination in Microarray Data Analysis Jane Fridlyand CBMB University of California, San Francisco Genentech Hall Auditorium, Mission Bay, UCSF October 23, 2004 1 Case Study: Van t

More information

Genetic alterations of histone lysine methyltransferases and their significance in breast cancer

Genetic alterations of histone lysine methyltransferases and their significance in breast cancer Genetic alterations of histone lysine methyltransferases and their significance in breast cancer Supplementary Materials and Methods Phylogenetic tree of the HMT superfamily The phylogeny outlined in the

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature10866 a b 1 2 3 4 5 6 7 Match No Match 1 2 3 4 5 6 7 Turcan et al. Supplementary Fig.1 Concepts mapping H3K27 targets in EF CBX8 targets in EF H3K27 targets in ES SUZ12 targets in ES

More information

Mammogram Analysis: Tumor Classification

Mammogram Analysis: Tumor Classification Mammogram Analysis: Tumor Classification Term Project Report Geethapriya Raghavan geeragh@mail.utexas.edu EE 381K - Multidimensional Digital Signal Processing Spring 2005 Abstract Breast cancer is the

More information

Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies

Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies Stanford Biostatistics Workshop Pierre Neuvial with Henrik Bengtsson and Terry Speed Department of Statistics, UC Berkeley

More information

SUPPLEMENTARY APPENDIX

SUPPLEMENTARY APPENDIX SUPPLEMENTARY APPENDIX 1) Supplemental Figure 1. Histopathologic Characteristics of the Tumors in the Discovery Cohort 2) Supplemental Figure 2. Incorporation of Normal Epidermal Melanocytic Signature

More information

Human Cancer Genome Project. Bioinformatics/Genomics of Cancer:

Human Cancer Genome Project. Bioinformatics/Genomics of Cancer: Bioinformatics/Genomics of Cancer: Professor of Computer Science, Mathematics and Cell Biology Courant Institute, NYU School of Medicine, Tata Institute of Fundamental Research, and Mt. Sinai School of

More information

LTA Analysis of HapMap Genotype Data

LTA Analysis of HapMap Genotype Data LTA Analysis of HapMap Genotype Data Introduction. This supplement to Global variation in copy number in the human genome, by Redon et al., describes the details of the LTA analysis used to screen HapMap

More information

Supplementary note: Comparison of deletion variants identified in this study and four earlier studies

Supplementary note: Comparison of deletion variants identified in this study and four earlier studies Supplementary note: Comparison of deletion variants identified in this study and four earlier studies Here we compare the results of this study to potentially overlapping results from four earlier studies

More information

Gene Selection for Tumor Classification Using Microarray Gene Expression Data

Gene Selection for Tumor Classification Using Microarray Gene Expression Data Gene Selection for Tumor Classification Using Microarray Gene Expression Data K. Yendrapalli, R. Basnet, S. Mukkamala, A. H. Sung Department of Computer Science New Mexico Institute of Mining and Technology

More information

Boosted PRIM with Application to Searching for Oncogenic Pathway of Lung Cancer

Boosted PRIM with Application to Searching for Oncogenic Pathway of Lung Cancer Boosted PRIM with Application to Searching for Oncogenic Pathway of Lung Cancer Pei Wang Department of Statistics Stanford University Stanford, CA 94305 wp57@stanford.edu Young Kim, Jonathan Pollack Department

More information

The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis

The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis Tieliu Shi tlshi@bio.ecnu.edu.cn The Center for bioinformatics

More information

Interactive analysis and quality assessment of single-cell copy-number variations

Interactive analysis and quality assessment of single-cell copy-number variations Interactive analysis and quality assessment of single-cell copy-number variations Tyler Garvin, Robert Aboukhalil, Jude Kendall, Timour Baslan, Gurinder S. Atwal, James Hicks, Michael Wigler, Michael C.

More information

Nature Structural & Molecular Biology: doi: /nsmb.2419

Nature Structural & Molecular Biology: doi: /nsmb.2419 Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb

More information

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Supplementary Figure 1 7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Regions targeted by the Even and Odd ChIRP probes mapped to a secondary structure model 56 of the

More information

Digitizing the Proteomes From Big Tissue Biobanks

Digitizing the Proteomes From Big Tissue Biobanks Digitizing the Proteomes From Big Tissue Biobanks Analyzing 24 Proteomes Per Day by Microflow SWATH Acquisition and Spectronaut Pulsar Analysis Jan Muntel 1, Nick Morrice 2, Roland M. Bruderer 1, Lukas

More information

Biostatistical modelling in genomics for clinical cancer studies

Biostatistical modelling in genomics for clinical cancer studies This work was supported by Entente Cordiale Cancer Research Bursaries Biostatistical modelling in genomics for clinical cancer studies Philippe Broët JE 2492 Faculté de Médecine Paris-Sud In collaboration

More information

Generating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University

Generating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University Role of Chemical lexposure in Generating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University CNV Discovery Reference Genetic

More information

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training. Supplementary Figure 1 Behavioral training. a, Mazes used for behavioral training. Asterisks indicate reward location. Only some example mazes are shown (for example, right choice and not left choice maze

More information

ChIP-seq data analysis

ChIP-seq data analysis ChIP-seq data analysis Harri Lähdesmäki Department of Computer Science Aalto University November 24, 2017 Contents Background ChIP-seq protocol ChIP-seq data analysis Transcriptional regulation Transcriptional

More information

Agilent s Copy Number Variation (CNV) Portfolio

Agilent s Copy Number Variation (CNV) Portfolio Technical Overview Agilent s Copy Number Variation (CNV) Portfolio Abstract Copy Number Variation (CNV) is now recognized as a prevalent form of structural variation in the genome contributing to human

More information

A COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION

A COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION 5-9 JATIT. All rights reserved. A COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION 1 H. Mahmoodian, M. Hamiruce Marhaban, 3 R. A. Rahim, R. Rosli, 5 M. Iqbal Saripan 1 PhD student, Department

More information

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach)

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach) High-Throughput Sequencing Course Gene-Set Analysis Biostatistics and Bioinformatics Summer 28 Section Introduction What is Gene Set Analysis? Many names for gene set analysis: Pathway analysis Gene set

More information

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models White Paper 23-12 Estimating Complex Phenotype Prevalence Using Predictive Models Authors: Nicholas A. Furlotte Aaron Kleinman Robin Smith David Hinds Created: September 25 th, 2015 September 25th, 2015

More information

Risk-prediction modelling in cancer with multiple genomic data sets: a Bayesian variable selection approach

Risk-prediction modelling in cancer with multiple genomic data sets: a Bayesian variable selection approach Risk-prediction modelling in cancer with multiple genomic data sets: a Bayesian variable selection approach Manuela Zucknick Division of Biostatistics, German Cancer Research Center Biometry Workshop,

More information

T. R. Golub, D. K. Slonim & Others 1999

T. R. Golub, D. K. Slonim & Others 1999 T. R. Golub, D. K. Slonim & Others 1999 Big Picture in 1999 The Need for Cancer Classification Cancer classification very important for advances in cancer treatment. Cancers of Identical grade can have

More information

Outlier Analysis. Lijun Zhang

Outlier Analysis. Lijun Zhang Outlier Analysis Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Extreme Value Analysis Probabilistic Models Clustering for Outlier Detection Distance-Based Outlier Detection Density-Based

More information

AIMS: Absolute Assignment of Breast Cancer Intrinsic Molecular Subtype

AIMS: Absolute Assignment of Breast Cancer Intrinsic Molecular Subtype AIMS: Absolute Assignment of Breast Cancer Intrinsic Molecular Subtype Eric R. Paquet (eric.r.paquet@gmail.com), Michael T. Hallett (michael.t.hallett@mcgill.ca) 1 1 Department of Biochemistry, Breast

More information

R2: web-based genomics analysis and visualization platform

R2: web-based genomics analysis and visualization platform R2: web-based genomics analysis and visualization platform Overview Jan Koster Department of Oncogenomics Academic Medical Center (AMC) UvA, the Netherlands jankoster@amc.uva.nl jankoster@amc.uva.nl 1

More information

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, Supplementary Information Supplementary Figures Supplementary Figure 1. a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, gene ID and specifities are provided. Those highlighted

More information

cn.mops - Mixture of Poissons for CNV detection in NGS data Günter Klambauer Institute of Bioinformatics, Johannes Kepler University Linz

cn.mops - Mixture of Poissons for CNV detection in NGS data Günter Klambauer Institute of Bioinformatics, Johannes Kepler University Linz Software Manual Institute of Bioinformatics, Johannes Kepler University Linz cn.mops - Mixture of Poissons for CNV detection in NGS data Günter Klambauer Institute of Bioinformatics, Johannes Kepler University

More information

False Discovery Rates and Copy Number Variation. Bradley Efron and Nancy Zhang Stanford University

False Discovery Rates and Copy Number Variation. Bradley Efron and Nancy Zhang Stanford University False Discovery Rates and Copy Number Variation Bradley Efron and Nancy Zhang Stanford University Three Statistical Centuries 19th (Quetelet) Huge data sets, simple questions 20th (Fisher, Neyman, Hotelling,...

More information

VL Network Analysis ( ) SS2016 Week 3

VL Network Analysis ( ) SS2016 Week 3 VL Network Analysis (19401701) SS2016 Week 3 Based on slides by J Ruan (U Texas) Tim Conrad AG Medical Bioinformatics Institut für Mathematik & Informatik, Freie Universität Berlin 1 Motivation 2 Lecture

More information

Computational Investigation of Homologous Recombination DNA Repair Deficiency in Sporadic Breast Cancer

Computational Investigation of Homologous Recombination DNA Repair Deficiency in Sporadic Breast Cancer University of Massachusetts Medical School escholarship@umms Open Access Articles Open Access Publications by UMMS Authors 11-16-2017 Computational Investigation of Homologous Recombination DNA Repair

More information

SALSA MLPA probemix P315-B1 EGFR

SALSA MLPA probemix P315-B1 EGFR SALSA MLPA probemix P315-B1 EGFR Lot B1-0215 and B1-0112. As compared to the previous A1 version (lot 0208), two mutation-specific probes for the EGFR mutations L858R and T709M as well as one additional

More information

November 9, Johns Hopkins School of Medicine, Baltimore, MD,

November 9, Johns Hopkins School of Medicine, Baltimore, MD, Fast detection of de-novo copy number variants from case-parent SNP arrays identifies a deletion on chromosome 7p14.1 associated with non-syndromic isolated cleft lip/palate Samuel G. Younkin 1, Robert

More information

Figure 1. Growth characteristics of GLI2 expressing cells in monolayer culture (A) Expression of GLI2 and downstream targets GLI1 and PTCH in control

Figure 1. Growth characteristics of GLI2 expressing cells in monolayer culture (A) Expression of GLI2 and downstream targets GLI1 and PTCH in control Figure 1. Growth characteristics of GLI2 expressing cells in monolayer culture (A) Expression of GLI2 and downstream targets GLI1 and PTCH in control HaCaT Tet, uninduced HaCaT GLI2 and induced HaCaT GLI2

More information

Module 3: Pathway and Drug Development

Module 3: Pathway and Drug Development Module 3: Pathway and Drug Development Table of Contents 1.1 Getting Started... 6 1.2 Identifying a Dasatinib sensitive cancer signature... 7 1.2.1 Identifying and validating a Dasatinib Signature... 7

More information

Detection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit

Detection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit APPLICATION NOTE Ion PGM System Detection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit Key findings The Ion PGM System, in concert with the Ion ReproSeq PGS View Kit and Ion Reporter

More information

Chromothripsis: A New Mechanism For Tumorigenesis? i Fellow s Conference Cheryl Carlson 6/10/2011

Chromothripsis: A New Mechanism For Tumorigenesis? i Fellow s Conference Cheryl Carlson 6/10/2011 Chromothripsis: A New Mechanism For Tumorigenesis? i Fellow s Conference Cheryl Carlson 6/10/2011 Massive Genomic Rearrangement Acquired in a Single Catastrophic Event during Cancer Development Cell 144,

More information

Probability-Based Protein Identification for Post-Translational Modifications and Amino Acid Variants Using Peptide Mass Fingerprint Data

Probability-Based Protein Identification for Post-Translational Modifications and Amino Acid Variants Using Peptide Mass Fingerprint Data Probability-Based Protein Identification for Post-Translational Modifications and Amino Acid Variants Using Peptide Mass Fingerprint Data Tong WW, McComb ME, Perlman DH, Huang H, O Connor PB, Costello

More information

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from Supplementary Figure 1 SEER data for male and female cancer incidence from 1975 2013. (a,b) Incidence rates of oral cavity and pharynx cancer (a) and leukemia (b) are plotted, grouped by males (blue),

More information

Genomic structural variation

Genomic structural variation Genomic structural variation Mario Cáceres The new genomic variation DNA sequence differs across individuals much more than researchers had suspected through structural changes A huge amount of structural

More information

SubLasso:a feature selection and classification R package with a. fixed feature subset

SubLasso:a feature selection and classification R package with a. fixed feature subset SubLasso:a feature selection and classification R package with a fixed feature subset Youxi Luo,3,*, Qinghan Meng,2,*, Ruiquan Ge,2, Guoqin Mai, Jikui Liu, Fengfeng Zhou,#. Shenzhen Institutes of Advanced

More information

Genome-wide copy-number calling (CNAs not CNVs!) Dr Geoff Macintyre

Genome-wide copy-number calling (CNAs not CNVs!) Dr Geoff Macintyre Genome-wide copy-number calling (CNAs not CNVs!) Dr Geoff Macintyre Structural variation (SVs) Copy-number variations C Deletion A B C Balanced rearrangements A B A B C B A C Duplication Inversion Causes

More information

Memorial Sloan-Kettering Cancer Center

Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center, Dept. of Epidemiology & Biostatistics Working Paper Series Year 2007 Paper 14 On Comparing the Clustering of Regression Models

More information

Clinical Interpretation of Cancer Genomes

Clinical Interpretation of Cancer Genomes IGENZ Ltd, Auckland, New Zealand Clinical Interpretation of Cancer Genomes Dr Amanda Dixon-McIver www.igenz.co.nz 1992 Slovenia and Croatia gain independence USA and Russia declare the Cold War over Steffi

More information

CHROMOSOMAL MICROARRAY (CGH+SNP)

CHROMOSOMAL MICROARRAY (CGH+SNP) Chromosome imbalances are a significant cause of developmental delay, mental retardation, autism spectrum disorders, dysmorphic features and/or birth defects. The imbalance of genetic material may be due

More information

Chapter 1. Introduction

Chapter 1. Introduction Chapter 1 Introduction 1.1 Motivation and Goals The increasing availability and decreasing cost of high-throughput (HT) technologies coupled with the availability of computational tools and data form a

More information

MODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION TO BREAST CANCER DATA

MODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION TO BREAST CANCER DATA International Journal of Software Engineering and Knowledge Engineering Vol. 13, No. 6 (2003) 579 592 c World Scientific Publishing Company MODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION

More information

Computational Analysis of Genome-Wide DNA Copy Number Changes

Computational Analysis of Genome-Wide DNA Copy Number Changes Computational Analysis of Genome-Wide DNA Copy Number Changes Lei Song Thesis submitted to the faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements

More information

MIR retrotransposon sequences provide insulators to the human genome

MIR retrotransposon sequences provide insulators to the human genome Supplementary Information: MIR retrotransposon sequences provide insulators to the human genome Jianrong Wang, Cristina Vicente-García, Davide Seruggia, Eduardo Moltó, Ana Fernandez- Miñán, Ana Neto, Elbert

More information

MALBAC Technology and Its Application in Non-invasive Chromosome Screening (NICS)

MALBAC Technology and Its Application in Non-invasive Chromosome Screening (NICS) MALBAC Technology and Its Application in Non-invasive Chromosome Screening (NICS) The Power of One Adapted from Internet Single Cell Genomic Studies Ultra Low Sample Input Advances and applications of

More information

Canadian College of Medical Geneticists (CCMG) Cytogenetics Examination. May 4, 2010

Canadian College of Medical Geneticists (CCMG) Cytogenetics Examination. May 4, 2010 Canadian College of Medical Geneticists (CCMG) Cytogenetics Examination May 4, 2010 Examination Length = 3 hours Total Marks = 100 (7 questions) Total Pages = 8 (including cover sheet and 2 pages of prints)

More information

Supplementary Figure 1

Supplementary Figure 1 Supplementary Figure 1 Supplementary Fig. 1: Quality assessment of formalin-fixed paraffin-embedded (FFPE)-derived DNA and nuclei. (a) Multiplex PCR analysis of unrepaired and repaired bulk FFPE gdna from

More information

White Paper. Copy number variant detection. Sample to Insight. August 19, 2015

White Paper. Copy number variant detection. Sample to Insight. August 19, 2015 White Paper Copy number variant detection August 19, 2015 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 Fax: +45 86 20 12 22 www.clcbio.com

More information

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015 Goals/Expectations Computer Science, Biology, and Biomedical (CoSBBI) We want to excite you about the world of computer science, biology, and biomedical informatics. Experience what it is like to be a

More information

Inter-session reproducibility measures for high-throughput data sources

Inter-session reproducibility measures for high-throughput data sources Inter-session reproducibility measures for high-throughput data sources Milos Hauskrecht, PhD, Richard Pelikan, MSc Computer Science Department, Intelligent Systems Program, Department of Biomedical Informatics,

More information

Microarrays in primary breast cancer lessons from chemotherapy studies

Microarrays in primary breast cancer lessons from chemotherapy studies Endocrine-Related Cancer (2001) 8 259 263 Microarrays in primary breast cancer lessons from chemotherapy studies P E Lønning, T Sørlie 1,2, C M Perou 2, P O Brown 3, D Botstein 2 and A-L Børresen-Dale

More information

SNPrints: Defining SNP signatures for prediction of onset in complex diseases

SNPrints: Defining SNP signatures for prediction of onset in complex diseases SNPrints: Defining SNP signatures for prediction of onset in complex diseases Linda Liu, Biomedical Informatics, Stanford University Daniel Newburger, Biomedical Informatics, Stanford University Grace

More information

Genomic complexity and arrays in CLL. Gian Matteo Rigolin, MD, PhD St. Anna University Hospital Ferrara, Italy

Genomic complexity and arrays in CLL. Gian Matteo Rigolin, MD, PhD St. Anna University Hospital Ferrara, Italy Genomic complexity and arrays in CLL Gian Matteo Rigolin, MD, PhD St. Anna University Hospital Ferrara, Italy Clinical relevance of genomic complexity (GC) in CLL GC has been identified as a critical negative

More information

TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS)

TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS) TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS) AUTHORS: Tejas Prahlad INTRODUCTION Acute Respiratory Distress Syndrome (ARDS) is a condition

More information

Systematic Analysis for Identification of Genes Impacting Cancers

Systematic Analysis for Identification of Genes Impacting Cancers Systematic Analysis for Identification of Genes Impacting Cancers Arpita Singhal Stanford University Saint Francis High School ABSTRACT Currently, vast amounts of molecular information involving genomic

More information

of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed.

of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed. Supplementary Note The potential association and implications of HBV integration at known and putative cancer genes of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed. Human telomerase

More information

TITLE: Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer.

TITLE: Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer. AWARD NUMBER: W81XWH-14-1-0080 TITLE: Total RNA Sequencing Analysis of DCIS Progressing to Invasive Breast Cancer. PRINCIPAL INVESTIGATOR: Christopher B. Umbricht, MD, PhD CONTRACTING ORGANIZATION: Johns

More information

Mammogram Analysis: Tumor Classification

Mammogram Analysis: Tumor Classification Mammogram Analysis: Tumor Classification Literature Survey Report Geethapriya Raghavan geeragh@mail.utexas.edu EE 381K - Multidimensional Digital Signal Processing Spring 2005 Abstract Breast cancer is

More information

Application of Array-based Comparative Genome Hybridization in Children with Developmental Delay or Mental Retardation

Application of Array-based Comparative Genome Hybridization in Children with Developmental Delay or Mental Retardation Pediatr Neonatol 2008;49(6):213 217 REVIEW ARTICLE Application of Array-based Comparative Genome Hybridization in Children with Developmental Delay or Mental Retardation Jao-Shwann Liang 1,2 *, Keiko Shimojima

More information

Supplementary Information

Supplementary Information Supplementary Information 5-hydroxymethylcytosine-mediated epigenetic dynamics during postnatal neurodevelopment and aging By Keith E. Szulwach 1,8, Xuekun Li 1,8, Yujing Li 1, Chun-Xiao Song 2, Hao Wu

More information

Molecular Markers. Marcie Riches, MD, MS Associate Professor University of North Carolina Scientific Director, Infection and Immune Reconstitution WC

Molecular Markers. Marcie Riches, MD, MS Associate Professor University of North Carolina Scientific Director, Infection and Immune Reconstitution WC Molecular Markers Marcie Riches, MD, MS Associate Professor University of North Carolina Scientific Director, Infection and Immune Reconstitution WC Overview Testing methods Rationale for molecular testing

More information

Diagnosis of multiple cancer types by shrunken centroids of gene expression

Diagnosis of multiple cancer types by shrunken centroids of gene expression Diagnosis of multiple cancer types by shrunken centroids of gene expression Robert Tibshirani, Trevor Hastie, Balasubramanian Narasimhan, and Gilbert Chu PNAS 99:10:6567-6572, 14 May 2002 Nearest Centroid

More information

Measuring DNA Methylation with the MinION. Winston Timp Department of Biomedical Engineering Johns Hopkins University 12/1/16

Measuring DNA Methylation with the MinION. Winston Timp Department of Biomedical Engineering Johns Hopkins University 12/1/16 Measuring DNA Methylation with the MinION Winston Timp Department of Biomedical Engineering Johns Hopkins University 12/1/16 Epigenetics: Modern Modern Definition of epigenetics involves heritable changes

More information

CNV Detection and Interpretation in Genomic Data

CNV Detection and Interpretation in Genomic Data CNV Detection and Interpretation in Genomic Data Benjamin W. Darbro, M.D., Ph.D. Assistant Professor of Pediatrics Director of the Shivanand R. Patil Cytogenetics and Molecular Laboratory Overview What

More information

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies 2017 Contents Datasets... 2 Protein-protein interaction dataset... 2 Set of known PPIs... 3 Domain-domain interactions...

More information

SALSA MS-MLPA KIT ME011-A1 Mismatch Repair genes (MMR) Lot 0609, 0408, 0807, 0407

SALSA MS-MLPA KIT ME011-A1 Mismatch Repair genes (MMR) Lot 0609, 0408, 0807, 0407 SALSA MS-MLPA KIT ME011-A1 Mismatch Repair genes (MMR) Lot 0609, 0408, 0807, 0407 The Mismatch Repair (MMR) system is critical for the maintenance of genomic stability. MMR increases the fidelity of DNA

More information

Integrated Analysis of Copy Number and Gene Expression

Integrated Analysis of Copy Number and Gene Expression Integrated Analysis of Copy Number and Gene Expression Nexus Copy Number provides user-friendly interface and functionalities to integrate copy number analysis with gene expression results for the purpose

More information

DiffVar: a new method for detecting differential variability with application to methylation in cancer and aging

DiffVar: a new method for detecting differential variability with application to methylation in cancer and aging Genome Biology This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. DiffVar: a new method for detecting

More information

Lab 5: Testing Hypotheses about Patterns of Inheritance

Lab 5: Testing Hypotheses about Patterns of Inheritance Lab 5: Testing Hypotheses about Patterns of Inheritance How do we talk about genetic information? Each cell in living organisms contains DNA. DNA is made of nucleotide subunits arranged in very long strands.

More information

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data Breast cancer Inferring Transcriptional Module from Breast Cancer Profile Data Breast Cancer and Targeted Therapy Microarray Profile Data Inferring Transcriptional Module Methods CSC 177 Data Warehousing

More information

Peak-calling for ChIP-seq and ATAC-seq

Peak-calling for ChIP-seq and ATAC-seq Peak-calling for ChIP-seq and ATAC-seq Shamith Samarajiwa CRUK Autumn School in Bioinformatics 2017 University of Cambridge Overview Peak-calling: identify enriched (signal) regions in ChIP-seq or ATAC-seq

More information

Molecular Methods in the Diagnosis and Prognostication of Melanoma: Pros & Cons

Molecular Methods in the Diagnosis and Prognostication of Melanoma: Pros & Cons Molecular Methods in the Diagnosis and Prognostication of Melanoma: Pros & Cons Ben J. Friedman, MD Senior Staff Physician Department of Dermatology Department of Pathology and Laboratory Medicine Henry

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi: 1.138/nature8645 Physical coverage (x haploid genomes) 11 6.4 4.9 6.9 6.7 4.4 5.9 9.1 7.6 125 Neither end mapped One end mapped Chimaeras Correct Reads (million ns) 1 75 5 25 HCC1187 HCC1395 HCC1599

More information

Cytogenetics 101: Clinical Research and Molecular Genetic Technologies

Cytogenetics 101: Clinical Research and Molecular Genetic Technologies Cytogenetics 101: Clinical Research and Molecular Genetic Technologies Topics for Today s Presentation 1 Classical vs Molecular Cytogenetics 2 What acgh? 3 What is FISH? 4 What is NGS? 5 How can these

More information

Structural Variation and Medical Genomics

Structural Variation and Medical Genomics Structural Variation and Medical Genomics Andrew King Department of Biomedical Informatics July 8, 2014 You already know about small scale genetic mutations Single nucleotide polymorphism (SNPs) Deletions,

More information

ncounter Data Analysis Guidelines for Copy Number Variation (CNV) Molecules That Count NanoString Technologies, Inc.

ncounter Data Analysis Guidelines for Copy Number Variation (CNV) Molecules That Count NanoString Technologies, Inc. ncounter Data Analysis Guidelines for Copy Number Variation (CNV) NanoString Technologies, Inc. 530 Fairview Ave N Suite 2000 Seattle, Washington 98109 www.nanostring.com Tel: 206.378.6266 888.358.6266

More information