Virtual CGH: Prediction of Novel Regions of Chromosomal Alterations in Tumor from Gene Expression Profiling

Size: px
Start display at page:

Download "Virtual CGH: Prediction of Novel Regions of Chromosomal Alterations in Tumor from Gene Expression Profiling"

Transcription

1 Virtual CGH: Prediction of ovel egions of Chromosomal Alterations in Tumor from Gene Expression Profiling Huimin Geng 1,3, Javeed qbal 1, Xutao Deng 2, Wing C. Chan 1 and Hesham Ali 3 1 Department of Pathology and Microbiology, niversity of ebraska Medical Center, Omaha, E Cedars-Sinai esearch nstitute, Cedars-Sinai Medical Center, Los Angeles, CA Department of Computer Science, niversity of ebraska at Omaha, Omaha, E {huimingeng, jiqbal, jchan}@unmc.edu, dengx@cshs.org, hali@mail.unomaha.edu Abstract The identification of genetic alterations using array Comparative Genomic Hybridization (CGH would provide important insights into the mechanisms of tumorogenesis. High resolution array CGH are very expensive and the experiment require a separate sample, therefore, we developed a computational method for predicting gains and losses of genomic DA segments based on ma expression profiles of tumor cell lines, and called this method a Virtual CGH (vcgh predictor. VCGH is performed through a novel algorithm in which each chromosomal segment is evaluated by the gene transcriptional profiles. The calculation yields a log of odds (LOD score for each chromosomal segment and this likelihood-based score is used to predict the correlation between ma expression patterns and DA copy number alterations. By aligning all regions of gains and losses from multiple cell lines we can identify minimal common regions of gains and losses which may contain potential oncogenes or tumor suppressors. This method can be used to screen transcriptional profiles of other malignancies for the identification of DA segmental loss or gain. 1. ntroduction Tumors often accumulate a large series of genomic imbalances, which either lead to activation of oncogenes or deletion of tumor suppressors [1-3]. An oncogene can cause the transformation of normal cells into cancerous tumor cells. A tumor suppressor gene is involved in slowing down cell growth and if inactivated, allows growth to progress without control. Comparative Genomic Hybridization (CGH is a molecular cytogenetic technique used to identify DA copy number changes and has been widely used in defining the putative chromosomal regions involved in tumor progression [4-7]. CGH relies on the hybridization of tumor DA to whole genome arrays and image extraction through high resolution scanners. Several methods have been developed to select regions of gains and losses from the experimental array CGH data [8-16]. The other microarray-based genome-wide technique called Gene Expression Profiling (GEP has been widely used to identify transcriptional signatures essential for tumor development based on ma level changes compared to normal DA [17]. Many studies have reported a number of genes that are differentially expressed between cancer and healthy patients or between cancer subtypes [18]. Both CGH and GEP techniques are based on hybridization of cancer versus normal DA or A and are highly expensive and require preparation of two different samples (A and DA which can be of a limitation in tumor biopsies. The ma transcript changes in cancer patients detected by GEP have been found to correlate to corresponding DA copy number alterations detected by CGH [10, 19, 20]. Gene dosage has a direct effect on the level of transcriptome and therefore GEP can be indirectly used to measure the genomic abnormalities in tumors. Compared with CGH, GEP is a fundamental technology. t accumulates relatively large amount of data. Also, the high-density arrays available from Affymetrix make it possible to measure the genomic changes indirectly due to their whole genome coverage. Therefore, it is advantageous to use GEP data which measures gene expression on the level of transcriptome to predict CGH which measures the alterations of DA copy numbers on the genomic level. This prediction will reduce the need for extra samples and additional costs which could be otherwise consumed by a separate CGH experiment, and may help to reveal and /07 $ EEE 1

2 confirm the links between DA and ma that may cause cancers. n this study, we focused on one type of cancers, atural Killer (K cell lymphoma. Eight K-cell lymphoma cell lines were profiled with GeneChips (plus 2 from Affymetrix. Then vcgh predictor was applied using expression levels and the chromosomal locations of the genes, to identify the chromosome regions with consistent patterns of over or under expression. Finally, some of these regions were validated with experimental array CGH using tiling arrays [4]. This paper is organized as follows. n section 2, we describe the vcgh method. n section 3, we apply this method to GEP data of eight cell lines of K-cell lymphoma and get the predicted genomic gain or loss regions. n section 4, we compare the prediction results with those from experimental array CGH. n section 5, we conclude that vcgh is a powerful tool that may significantly enhance the data analysis of cancer GEP. 2. Method Virtual CGH (vcgh is a novel algorithm that utilizes the changes in ma expression level to predict DA copy number alterations. The key of vcgh is to determine the magnitude of correlation between ma changes and DA copy number changes. Since the DA copy number changes are highly correlated to the spatial relationship of genes on the genome, in vcgh, we measure the correlation of ma level expression changes of a gene and its chromosomal location, and use a probabilistic model to estimate the likelihood of a chromosome segment being a genetically abnormal region. VCGH can be applied for two purposes. The first one is to predict DA copy number alterations using ma expression profiles. The second usage is to compare and combine vcgh results with experimental CGH data to study functional abnormalities Microarray data preprocessing First, we need to find differentially expressed genes between cancer and normal tissues and mark them as over-expressed or under-expressed. Fold change, t-test, SAM [21], or other methods can be applied for detecting genes that are differentially expressed. Then each gene is assigned a symbol,, or representing it is nduced, epressed, or nchanged in cancer compared with normal samples. The spatial order of the genes on each chromosome need be preserved. This data preprocessing transformed the original expression data into 22 symbolic vectors, each for one autosome. (We ignore X and Y chromosomes in this study Calculating LOD score n this step, we give a probability score to each chromosomal segment of a fixed size (e.g. six consecutive genes which shows the likelihood of this segment being a genomic gain or loss. ntuitively, if a chromosomal segment contains an enriched set of up or down genes, the correlation is high. This correlation is defined as a log of odds (LOD score which is computed from a probabilistic model illustrated below. Suppose now we focus on a chromosomal segment that contains n consecutive genes. We count the occurrences of each type of symbols and denote the counting results as n, n, and n, for the number of induced, repressed and unchanged genes, respectively. The likelihood of the observation of n, n, and n under certain hypothesis H is computed from the binomial distribution as in Eq. (1: n! ( nn Gain:, n n H,!( n n! n! n ( n Loss:, n H. n!( n! (1 where n n n, 1, and 1. L is the likelihood, H is the hypothesis, n is the number of genes, and is the parameter of the success probability in the binomial distribution. Hypothesis H is specified by parameter which is the underlying probability that certain symbol occurs. We can then use this basic probability model to evaluate the chromosomal segments of being abnormal regions. The likelihood of observing n, n, and n in a segment of n consecutive genes is computed under two different hypotheses: H o : The chromosomal segment does not belong to a DA gain/loss region. H 1 : The chromosomal segment belongs to a DA gain/loss region. H 0 implies that the observations of up-regulated and down-regulated genes do not correlate to their chromosomal spatial locations where the genomic gains and losses happen. On the other hand, H 1 spells out that the expression changes of genes do correlate to 2

3 their chromosomal locations where the genomic gains and losses happen. For null hypothesis H 0, the 0 s can be estimated by normal/normal hybridization for the chromosomal segments. For alternative hypothesis H 1, the 1 s can be estimated by cancer/normal hybridization for the corresponding chromosomal segments. n case normal/normal hybridization is not available, we can estimate 0 s (under H 0 by using cancer/normal hybridization, but using the whole genome instead of a segment, to estimate the background occurrence frequencies. The parameters 0 s (for hypotheses H 0 and 1 s (for hypothesis H 1 are estimated using the maximum likelihood estimator as in Eq. (2: ˆ, '', '', '', or '', 0, ˆ n, '', '', '', or '', 1, n (2 where is the total number of genes across all chromosomes; is the number of genes in certain gene categories across all chromosomes ( '', '', '', or '' representing induced, repressed, induced + unchanged, or repressed + unchanged, respectively; n is the total number of genes within a specific segment, and n is the number of genes in certain gene categories in the specific segment ( has the same meaning as mentioned in. The LOD score is the log base 10 of the likelihood ratio under the hypotheses of H 1 and H 0. Having the parameters estimated from the observations from the real GEP data, we could now calculate the LOD score of a segment of being a gain or loss region as follows: ( nn, n n H1 1, 1, ( nn, n n H0 0, 0, n ( n, n H1 1, 1, n ( n, n H0 0, 0, Gain: LOD log log Loss: LOD log log For example, we use a DA segment of a total of ten genes containing eight induced genes, one reduced gene and one unchanged gene to illustrate the LOD score computation for the gain. Suppose from the background (a total of genes from the whole genome, the parameters under H 0 have been estimated as: (3 ˆ ˆ ˆ 0, 0.1, 0, 0.1, 0, 0.8. Since n = 8, n = 1, n = 1 and n = 10, we have the parameters under H 1 be estimated as: ˆ ˆ n ˆ 1, 0.8, n 1, 0.1, 1, 0.1. n n n ( nn 8 2 1, 1, So, LODgain log10 log ( nn , 0, A LOD score of 5.9 means that the probability that H 1 is true (that this segment is a genomic gain is times more likely than it is that H 0 is true (that this segment is not a genomic gain. 3. Virtual CGH prediction from GEP 3.1. Gene expression profiling The vcgh predictor is applied to GEP on one normal K-cell sample from peripheral blood of healthy donors as reference and eight well characterized K-cell lymphoma cell lines. The sample preparation, A isolation and cda synthesis was performed strictly according to the Affymetrix protocol. The hybridizations from each sample were performed on Gene-Chip 133 plus 2 (Affymetrix, nc. All arrays were scanned using the Gene-chip scanner (Affymetrix and analyzed by the Affymetrix GeneChip Operating Software Data processing We uploaded all the gene expression data in BB- ArrayTool ( ArrayTools.html. The threshold of the intensity at the minimum value was set as 10 if the intensity was below the minimum, and normalization was applied using median over entire array. All the information regarding the probe sets, gene names, chromosomal and cytoband locations was downloaded using the BB-ArrayTools. The parameters under H 0 were estimated by using all spots/genes across the whole genome as indicated in Eq. (2. Figure 1 shows, under H 0, the log 2 fold difference between tumor and normal samples has normal distribution with the mean of and the standard deviation (s.d. of When the threshold was set at two fold, we get 3

4 ˆ ˆ ( , ; ˆ ( , , 0, ˆ 0, 0, For H 1, we split one file containing the expression values of all genes over 22 chromosomes to 22 files, each for one chromosome, and ordered those genes in a spatial order on the genome. The gene names and corresponding start and stop base pair positions were downloaded from CB Homo sapiens Genome Build We screened the neighboring genes along a chromosome by moving the window from the pter to the qter of each chromosome. For each segment of the chromosome, we calculated its LOD scores for both gain and loss as in Eq. (3 and draw graphics of a chromosomal position versus the likelihood that a segment (of a fixed window size, which is centered at that chromosomal position, is a genetic alterative region Abnormal regions identified by vcgh n our gene expression analysis, the transcription levels of genes in the abnormal regions identified by vcgh were > +2 for gains and < -2 for loss as compared to normal K sample profile. We set up the window size as six genes and calculate the LOD scores for each segment in each chromosome, for each of the eight cell lines. f a segment shows consistent gene expression either low or high along the regional cluster in all cell lines, then this segment is more likely to contain critical genes (oncogenes or tumor suppressors for the development of K-cell lymphoma. According to vcgh, we identified several chromosomes showing frequent gene expression alterations, such as chromosomes 1, 7, 13, 17 and 18. We show chromosomes 1, 13 and 18 as illustration in Figure 2 and Figure 3. n Figure 2, we graphically display the LOD scores of gain and loss for chromosomes 1 and 13, with loss on the left side of a chromosomal ideogram and gain on the right side. For example, the regions in chromosome 1 with increased expression in a set of consecutive genes were 1q41, 7q21.1-q22.1 and 17p13, and the regions with decreased expression were 1p36.1, 7p22.1, 17p13 and 18p11.2, in concordance with previously published results using conventional or BAC array CGH. Figure 3 was for the further comparison of the vcgh and the experimental CGH using losses of chromosome 18 as an example. The data is unpublished and will appear in another biological paper together with a detailed discussion of the biological meanings of those genomic alteration regions. Fig. 1. Histogram of log2 ratio of tumor over normal samples under H 0. t is a normal distribution (mean = and s.d. =

5 a, chromosome 1 b, chromosome 13 Fig. 2. LOD scores of gain and loss for chromosomes 1 and 13. The left side of a chromosomal ideogram represents Loss and right side represents Gain. The series in different colors indicate different cell lines. 4. Experimental CGH validation To validate the proposed vcgh predictor, we compared the recurrent regions predicted from GEP to the ones identified by experimental array CGH. We aligned the gain and loss regions from eight cell lines for each chromosome and mapped the alignment results along with the chromosomal ideograms. There were good associations between the experimental CGH and vcgh. To graphically view the comparison between the two methodologies, we draw the gain and loss regions from vcgh and the experimental CGH along with the chromosomal ideogram together. The associations of the two on chromosome 18 are shown in Figure 3 as an illustration. Human physical map information used in the graph was obtained from Vysis nc. ( 5

6 VCGH Fig. 3. Comparison of results from vcgh and experimental CGH for losses on chromosome 18. On the right side, regions of losses are indicated by the lines on the left of the chromosomal ideogram obtained from experimental CGH, and the graphs on the left side show the predicted loss regions from vcgh. 5. Conclusions Whole genome array CGH can be used to identify and analyze the genetic alterations and hence provide important insights into the genetic basis of tumors. While noticing that high resolution array CGH are very expensive and require a separate sample, we developed a vcgh method for predicting gains and losses of DA segments based on GEP. n vcgh, we calculate the correlation between ma changes and DA copy number changes using LOD score, which is the likelihood that a chromosomal segment belongs to a genetic gain or loss region. Experimental CGH Employing the vcgh predictor to GEP of eight cell lines of K-cell lymphoma, we predicted several genomic alterative regions most likely happened for K-cell lymphoma. The revealed abnormal regions of genetic alterations may contain oncogenes and tumor suppressors important in the pathogenesis of K-cell lymphoma. The loss of chromosome pieces indicates that the tumor suppressor genes (common in lymphoid malignancies may be located in these regions and those suppressor genes may be associated with the development of K lymphoma/leukemia. The gain of chromosome pieces indicates that the tumor oncogenes, which cause the transformation of normal cells into cancerous tumor cells, may be located within these regions. We have also performed experimental array CGH on the same cell lines to validate our method and the results showed that the predicted gain or loss regions were in accordance with the recurrent regions identified by the experimental array CGH. 6. Future work n this paper, we build a computational model to give an indication from experimental GEP to experimental CGH: GEP Virtual CGH CGH. Based on this preliminary work, we would like to continue on the following further studies: Cross link GEP and CGH. Combination of genomic profiling by array CGH and transcriptional profiling by microarrays may narrow down the number of candidate genes which are most likely participating in the malignancies. We would also provide a web-based search for the genes in the recurrent regions and the description of these genes as well. Possible candidate genes will be listed and require further investigation. Build a Bayesian framework by combining vcgh and array CGH. Many types of cancer and inherited diseases are known to have genome abnormalities, which may have a direct or indirect impact on gene expression patterns in the affected genomic region. esearchers commonly use A gene expression analysis as the default approach to cluster and classify patient samples, but often lack information about the underlying DA that may also undergo dramatic copy number change. Therefore, an integrated tool using both genomic profiling and transcriptional profiling has the advantage in evaluating DA copy number and A gene expression at the same time and is important for understanding the underlying biological mechanisms. 6

7 Acknowledgments This work was supported by the H grant number P from the BE program of the ational Center for esearch esources and by.s. Public Health Service grants CA36727 and CA84967 by the ational Cancer nstitute, Department of Health and Human Services. eferences [1] E.. Fearon and B. Vogelstein, "A genetic model for colorectal tumorigenesis," Cell, vol. 61, pp , [2] D. Hanahan and. A. Weinberg, "The hallmarks of cancer," Cell, vol. 100, pp , [3] C. J. Sherr, "Cancer cell cycles," Science, vol. 274, pp , [4] A. S. shkanian, C. A. Malloff, S. K. Watson,. J. DeLeeuw, B. Chi, B. P. Coe, A. Snijders, D. G. Albertson, D. Pinkel, M. A. Marra, V. Ling, C. MacAulay, and W. L. Lam, "A tiling resolution DA microarray with complete coverage of the human genome," at Genet, vol. 36, pp , [5] C. Baldwin, C. Garnis, L. Zhang, M. P. osin, and W. L. Lam, "Multiple microalterations detected at high frequency in oral cancer," Cancer es, vol. 65, pp , [6]. J. de Leeuw, J. J. Davies, A. osenwald, G. Bebb,. D. Gascoyne, M. J. Dyer, L. M. Staudt, J. A. Martinez- Climent, and W. L. Lam, "Comprehensive whole genome array CGH profiling of mantle cell lymphoma model genomes," Hum Mol Genet, vol. 13, pp , [7] M. Khojasteh, W. L. Lam,. K. Ward, and C. MacAulay, "A stepwise framework for the normalization of array CGH data," BMC Bioinformatics, vol. 6, pp. 274, [8] V. E. Olshen A., "Change-point analysis of array-based comparative genomic hybridization data.," n proceedings of Joint Statistical Meetings, pp , [9] S. A. Fridlyand J, Pinkel D, Albertson DG, Jain A, "Hidden Markov Models Approach to the Analysis of Array CGH Data," J Multivariate Anal, vol. 90, pp , [10] J.. Pollack, T. Sorlie, C. M. Perou, C. A. ees, S. S. Jeffrey, P. E. Lonning,. Tibshirani, D. Botstein, A. L. Borresen-Dale, and P. O. Brown, "Microarray analysis reveals a major direct role of DA copy number alteration in the transcriptional program of human breast tumors," Proc atl Acad Sci S A, vol. 99, pp , [11] A. M. Snijders,. owak,. Segraves, S. Blackwood,. Brown, J. Conroy, G. Hamilton, A. K. Hindle, B. Huey, K. Kimura, S. Law, K. Myambo, J. Palmer, B. Ylstra, J. P. Yue, J. W. Gray, A.. Jain, D. Pinkel, and D. G. Albertson, "Assembly of microarrays for genomewide measurement of DA copy number," at Genet, vol. 29, pp , [12] P. Wang, Y. Kim, J. Pollack, B. arasimhan, and. Tibshirani, "A method for calling gains and losses in array CGH data," Biostatistics, vol. 6, pp , [13]. Autio, S. Hautaniemi, P. Kauraniemi, O. Yli-Harja, J. Astola, M. Wolf, and A. Kallioniemi, "CGH-Plotter: MATLAB toolbox for CGH-data analysis," Bioinformatics, vol. 19, pp , [14] C. Cheng,. Kimmel, P. eiman, and L. P. Zhao, "Array rank order regression analysis for the detection of gene copy-number changes in human cancer," Genomics, vol. 82, pp , [15] G. Hodgson, J. H. Hager, S. Volik, S. Hariono, M. Wernick, D. Moore,. owak, D. G. Albertson, D. Pinkel, C. Collins, D. Hanahan, and J. W. Gray, "Genome scanning with array CGH delineates regional alterations in mouse islet carcinomas," at Genet, vol. 29, pp , [16] K. Jong, E. Marchiori, G. Meijer, A. V. Vaart, and B. Ylstra, "Breakpoint identification and smoothing of array comparative genomic hybridization data," Bioinformatics, vol. 20, pp , [17] T.. Golub, D. K. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek, J. P. Mesirov, H. Coller, M. L. Loh, J.. Downing, M. A. Caligiuri, C. D. Bloomfield, and E. S. Lander, "Molecular classification of cancer: class discovery and class prediction by gene expression monitoring," Science, vol. 286, pp , [18] S. amaswamy and T.. Golub, "DA microarrays in clinical oncology," J Clin Oncol, vol. 20, pp , [19] J. L. Phillips, S. W. Hayward, Y. Wang, J. Vasselli, C. Pavlovich, H. Padilla-ash, J.. Pezullo, B. M. Ghadimi, G. D. Grossfeld, A. ivera, W. M. Linehan, G.. Cunha, and T. ied, "The consequences of chromosomal aneuploidy on gene expression profiles in a cell line model for prostate carcinogenesis," Cancer es, vol. 61, pp , [20] P. Platzer, M. B. pender, K. Wilson, J. Willis, J. Lutterbaugh, A. osrati, J. K. Willson, D. Mack, T. ied, and S. Markowitz, "Silence of chromosomal amplifications in colon cancer," Cancer es, vol. 62, pp , [21] V. G. Tusher,. Tibshirani, and G. Chu, "Significance analysis of microarrays applied to the ionizing radiation response," Proc atl Acad Sci S A, vol. 98, pp ,

A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer

A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer Hautaniemi, Sampsa; Ringnér, Markus; Kauraniemi, Päivikki; Kallioniemi, Anne; Edgren, Henrik; Yli-Harja, Olli; Astola,

More information

Understanding DNA Copy Number Data

Understanding DNA Copy Number Data Understanding DNA Copy Number Data Adam B. Olshen Department of Epidemiology and Biostatistics Helen Diller Family Comprehensive Cancer Center University of California, San Francisco http://cc.ucsf.edu/people/olshena_adam.php

More information

Biostatistical modelling in genomics for clinical cancer studies

Biostatistical modelling in genomics for clinical cancer studies This work was supported by Entente Cordiale Cancer Research Bursaries Biostatistical modelling in genomics for clinical cancer studies Philippe Broët JE 2492 Faculté de Médecine Paris-Sud In collaboration

More information

Introduction to LOH and Allele Specific Copy Number User Forum

Introduction to LOH and Allele Specific Copy Number User Forum Introduction to LOH and Allele Specific Copy Number User Forum Jonathan Gerstenhaber Introduction to LOH and ASCN User Forum Contents 1. Loss of heterozygosity Analysis procedure Types of baselines 2.

More information

Vega: Variational Segmentation for Copy Number Detection

Vega: Variational Segmentation for Copy Number Detection Vega: Variational Segmentation for Copy Number Detection Sandro Morganella Luigi Cerulo Giuseppe Viglietto Michele Ceccarelli Contents 1 Overview 1 2 Installation 1 3 Vega.RData Description 2 4 Run Vega

More information

A COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION

A COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION 5-9 JATIT. All rights reserved. A COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION 1 H. Mahmoodian, M. Hamiruce Marhaban, 3 R. A. Rahim, R. Rosli, 5 M. Iqbal Saripan 1 PhD student, Department

More information

Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies

Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies Stanford Biostatistics Workshop Pierre Neuvial with Henrik Bengtsson and Terry Speed Department of Statistics, UC Berkeley

More information

Detection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit

Detection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit APPLICATION NOTE Ion PGM System Detection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit Key findings The Ion PGM System, in concert with the Ion ReproSeq PGS View Kit and Ion Reporter

More information

Boosted PRIM with Application to Searching for Oncogenic Pathway of Lung Cancer

Boosted PRIM with Application to Searching for Oncogenic Pathway of Lung Cancer Boosted PRIM with Application to Searching for Oncogenic Pathway of Lung Cancer Pei Wang Department of Statistics Stanford University Stanford, CA 94305 wp57@stanford.edu Young Kim, Jonathan Pollack Department

More information

Comparison of Gene Set Analysis with Various Score Transformations to Test the Significance of Sets of Genes

Comparison of Gene Set Analysis with Various Score Transformations to Test the Significance of Sets of Genes Comparison of Gene Set Analysis with Various Score Transformations to Test the Significance of Sets of Genes Ivan Arreola and Dr. David Han Department of Management of Science and Statistics, University

More information

Integrated Analysis of Copy Number and Gene Expression

Integrated Analysis of Copy Number and Gene Expression Integrated Analysis of Copy Number and Gene Expression Nexus Copy Number provides user-friendly interface and functionalities to integrate copy number analysis with gene expression results for the purpose

More information

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach)

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach) High-Throughput Sequencing Course Gene-Set Analysis Biostatistics and Bioinformatics Summer 28 Section Introduction What is Gene Set Analysis? Many names for gene set analysis: Pathway analysis Gene set

More information

T. R. Golub, D. K. Slonim & Others 1999

T. R. Golub, D. K. Slonim & Others 1999 T. R. Golub, D. K. Slonim & Others 1999 Big Picture in 1999 The Need for Cancer Classification Cancer classification very important for advances in cancer treatment. Cancers of Identical grade can have

More information

The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0

The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0 The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0 Introduction Loss of erozygosity (LOH) represents the loss of allelic differences. The SNP markers on the SNP Array 6.0 can be used

More information

Data analysis in microarray experiment

Data analysis in microarray experiment 16 1 004 Chinese Bulletin of Life Sciences Vol. 16, No. 1 Feb., 004 1004-0374 (004) 01-0041-08 100005 Q33 A Data analysis in microarray experiment YANG Chang, FANG Fu-De * (National Laboratory of Medical

More information

Chromothripsis: A New Mechanism For Tumorigenesis? i Fellow s Conference Cheryl Carlson 6/10/2011

Chromothripsis: A New Mechanism For Tumorigenesis? i Fellow s Conference Cheryl Carlson 6/10/2011 Chromothripsis: A New Mechanism For Tumorigenesis? i Fellow s Conference Cheryl Carlson 6/10/2011 Massive Genomic Rearrangement Acquired in a Single Catastrophic Event during Cancer Development Cell 144,

More information

Package golubesets. August 16, 2014

Package golubesets. August 16, 2014 Package golubesets August 16, 2014 Version 1.6.0 Title exprsets for golub leukemia data Author Todd Golub Maintainer Vince Carey Description representation

More information

Gene expression profiling predicts clinical outcome of prostate cancer. Gennadi V. Glinsky, Anna B. Glinskii, Andrew J. Stephenson, Robert M.

Gene expression profiling predicts clinical outcome of prostate cancer. Gennadi V. Glinsky, Anna B. Glinskii, Andrew J. Stephenson, Robert M. SUPPLEMENTARY DATA Gene expression profiling predicts clinical outcome of prostate cancer Gennadi V. Glinsky, Anna B. Glinskii, Andrew J. Stephenson, Robert M. Hoffman, William L. Gerald Table of Contents

More information

Gene expression analysis. Roadmap. Microarray technology: how it work Applications: what can we do with it Preprocessing: Classification Clustering

Gene expression analysis. Roadmap. Microarray technology: how it work Applications: what can we do with it Preprocessing: Classification Clustering Gene expression analysis Roadmap Microarray technology: how it work Applications: what can we do with it Preprocessing: Image processing Data normalization Classification Clustering Biclustering 1 Gene

More information

NIH Public Access Author Manuscript Best Pract Res Clin Haematol. Author manuscript; available in PMC 2010 June 1.

NIH Public Access Author Manuscript Best Pract Res Clin Haematol. Author manuscript; available in PMC 2010 June 1. NIH Public Access Author Manuscript Published in final edited form as: Best Pract Res Clin Haematol. 2009 June ; 22(2): 271 282. doi:10.1016/j.beha.2009.07.001. Analysis of DNA Microarray Expression Data

More information

Contents. 1.5 GOPredict is robust to changes in study sets... 5

Contents. 1.5 GOPredict is robust to changes in study sets... 5 Supplementary documentation for Data integration to prioritize drugs using genomics and curated data Riku Louhimo, Marko Laakso, Denis Belitskin, Juha Klefström, Rainer Lehtonen and Sampsa Hautaniemi Faculty

More information

MODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION TO BREAST CANCER DATA

MODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION TO BREAST CANCER DATA International Journal of Software Engineering and Knowledge Engineering Vol. 13, No. 6 (2003) 579 592 c World Scientific Publishing Company MODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION

More information

Bioimaging and Functional Genomics

Bioimaging and Functional Genomics Bioimaging and Functional Genomics Elisa Ficarra, EPF Lausanne Giovanni De Micheli, EPF Lausanne Sungroh Yoon, Stanford University Luca Benini, University of Bologna Enrico Macii,, Politecnico di Torino

More information

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data Breast cancer Inferring Transcriptional Module from Breast Cancer Profile Data Breast Cancer and Targeted Therapy Microarray Profile Data Inferring Transcriptional Module Methods CSC 177 Data Warehousing

More information

False Discovery Rates and Copy Number Variation. Bradley Efron and Nancy Zhang Stanford University

False Discovery Rates and Copy Number Variation. Bradley Efron and Nancy Zhang Stanford University False Discovery Rates and Copy Number Variation Bradley Efron and Nancy Zhang Stanford University Three Statistical Centuries 19th (Quetelet) Huge data sets, simple questions 20th (Fisher, Neyman, Hotelling,...

More information

Whole-genome detection of disease-associated deletions or excess homozygosity in a case control study of rheumatoid arthritis

Whole-genome detection of disease-associated deletions or excess homozygosity in a case control study of rheumatoid arthritis HMG Advance Access published December 21, 2012 Human Molecular Genetics, 2012 1 13 doi:10.1093/hmg/dds512 Whole-genome detection of disease-associated deletions or excess homozygosity in a case control

More information

Simple Discriminant Functions Identify Small Sets of Genes that Distinguish Cancer Phenotype from Normal

Simple Discriminant Functions Identify Small Sets of Genes that Distinguish Cancer Phenotype from Normal Genome Informatics 16(1): 245 253 (2005) 245 Simple Discriminant Functions Identify Small Sets of Genes that Distinguish Cancer Phenotype from Normal Gul S. Dalgin 1 Charles DeLisi 2,3 sdalgin@bu.edu delisi@bu.edu

More information

Computational Analysis of Genome-Wide DNA Copy Number Changes

Computational Analysis of Genome-Wide DNA Copy Number Changes Computational Analysis of Genome-Wide DNA Copy Number Changes Lei Song Thesis submitted to the faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements

More information

SNP array-based analyses of unbalanced embryos as a reference to distinguish between balanced translocation carrier and normal blastocysts

SNP array-based analyses of unbalanced embryos as a reference to distinguish between balanced translocation carrier and normal blastocysts J Assist Reprod Genet (2016) 33:1115 1119 DOI 10.1007/s10815-016-0734-0 TECHNOLOGICAL INNOVATIONS SNP array-based analyses of unbalanced embryos as a reference to distinguish between balanced translocation

More information

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays Supplementary Materials RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays Junhee Seok 1*, Weihong Xu 2, Ronald W. Davis 2, Wenzhong Xiao 2,3* 1 School of Electrical Engineering,

More information

ChIP-seq data analysis

ChIP-seq data analysis ChIP-seq data analysis Harri Lähdesmäki Department of Computer Science Aalto University November 24, 2017 Contents Background ChIP-seq protocol ChIP-seq data analysis Transcriptional regulation Transcriptional

More information

Human Cancer Genome Project. Bioinformatics/Genomics of Cancer:

Human Cancer Genome Project. Bioinformatics/Genomics of Cancer: Bioinformatics/Genomics of Cancer: Professor of Computer Science, Mathematics and Cell Biology Courant Institute, NYU School of Medicine, Tata Institute of Fundamental Research, and Mt. Sinai School of

More information

Harvard University. A Pseudolikelihood Approach for Simultaneous Analysis of Array Comparative Genomic Hybridizations (acgh)

Harvard University. A Pseudolikelihood Approach for Simultaneous Analysis of Array Comparative Genomic Hybridizations (acgh) Harvard University Harvard University Biostatistics Working Paper Series Year 2005 Paper 30 A Pseudolikelihood Approach for Simultaneous Analysis of Array Comparative Genomic Hybridizations (acgh) David

More information

Bayesian Prediction Tree Models

Bayesian Prediction Tree Models Bayesian Prediction Tree Models Statistical Prediction Tree Modelling for Clinico-Genomics Clinical gene expression data - expression signatures, profiling Tree models for predictive sub-typing Combining

More information

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing

More information

Comments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al.

Comments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al. Comments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al. Holger Höfling Gad Getz Robert Tibshirani June 26, 2007 1 Introduction Identifying genes that are involved

More information

Canadian College of Medical Geneticists (CCMG) Cytogenetics Examination. May 4, 2010

Canadian College of Medical Geneticists (CCMG) Cytogenetics Examination. May 4, 2010 Canadian College of Medical Geneticists (CCMG) Cytogenetics Examination May 4, 2010 Examination Length = 3 hours Total Marks = 100 (7 questions) Total Pages = 8 (including cover sheet and 2 pages of prints)

More information

Classification of cancer profiles. ABDBM Ron Shamir

Classification of cancer profiles. ABDBM Ron Shamir Classification of cancer profiles 1 Background: Cancer Classification Cancer classification is central to cancer treatment; Traditional cancer classification methods: location; morphology, cytogenesis;

More information

Structural Variation and Medical Genomics

Structural Variation and Medical Genomics Structural Variation and Medical Genomics Andrew King Department of Biomedical Informatics July 8, 2014 You already know about small scale genetic mutations Single nucleotide polymorphism (SNPs) Deletions,

More information

10. The SKY/CGH Database for Spectral Karyotyping and Comparative Genomic Hybridization Data

10. The SKY/CGH Database for Spectral Karyotyping and Comparative Genomic Hybridization Data 10. The SKY/CGH Database for Spectral Karyotyping and Comparative Genomic Hybridization Data Turid Knutsen, 1 Vasuki Gobu, 2 Rodger Knaus, 2 Thomas Ried, 1 and Karl Sirotkin 2 1. National Cancer Institute

More information

A Quick-Start Guide for rseqdiff

A Quick-Start Guide for rseqdiff A Quick-Start Guide for rseqdiff Yang Shi (email: shyboy@umich.edu) and Hui Jiang (email: jianghui@umich.edu) 09/05/2013 Introduction rseqdiff is an R package that can detect differential gene and isoform

More information

Introduction to Discrimination in Microarray Data Analysis

Introduction to Discrimination in Microarray Data Analysis Introduction to Discrimination in Microarray Data Analysis Jane Fridlyand CBMB University of California, San Francisco Genentech Hall Auditorium, Mission Bay, UCSF October 23, 2004 1 Case Study: Van t

More information

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies 2017 Contents Datasets... 2 Protein-protein interaction dataset... 2 Set of known PPIs... 3 Domain-domain interactions...

More information

Generating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University

Generating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University Role of Chemical lexposure in Generating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University CNV Discovery Reference Genetic

More information

Supplementary note: Comparison of deletion variants identified in this study and four earlier studies

Supplementary note: Comparison of deletion variants identified in this study and four earlier studies Supplementary note: Comparison of deletion variants identified in this study and four earlier studies Here we compare the results of this study to potentially overlapping results from four earlier studies

More information

Package propoverlap. R topics documented: February 20, Type Package

Package propoverlap. R topics documented: February 20, Type Package Type Package Package propoverlap February 20, 2015 Title Feature (gene) selection based on the Proportional Overlapping Scores Version 1.0 Date 2014-09-15 Author Osama Mahmoud, Andrew Harrison, Aris Perperoglou,

More information

Comparison of discrimination methods for the classification of tumors using gene expression data

Comparison of discrimination methods for the classification of tumors using gene expression data Comparison of discrimination methods for the classification of tumors using gene expression data Sandrine Dudoit, Jane Fridlyand 2 and Terry Speed 2,. Mathematical Sciences Research Institute, Berkeley

More information

Estimating the Number of Clusters in DNA Microarray Data

Estimating the Number of Clusters in DNA Microarray Data Estimating the Number of Clusters in DNA Microarray Data N. Bolshakova 1, F. Azuaje 2 1 Department of Computer Science, Trinity College Dublin, Ireland 2 School of Computing and Mathematics, University

More information

Cytogenetics Technologies, Companies & Markets

Cytogenetics Technologies, Companies & Markets Cytogenetics Technologies, Companies & Markets By Prof. K. K. Jain MD, FRACS, FFPM Jain PharmaBiotech Basel, Switzerland November 2018 A Jain PharmaBiotech Report A U T H O R ' S B I O G R A P H Y Professor

More information

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India 20th International Congress on Modelling and Simulation, Adelaide, Australia, 1 6 December 2013 www.mssanz.org.au/modsim2013 Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision

More information

Figure 1. Growth characteristics of GLI2 expressing cells in monolayer culture (A) Expression of GLI2 and downstream targets GLI1 and PTCH in control

Figure 1. Growth characteristics of GLI2 expressing cells in monolayer culture (A) Expression of GLI2 and downstream targets GLI1 and PTCH in control Figure 1. Growth characteristics of GLI2 expressing cells in monolayer culture (A) Expression of GLI2 and downstream targets GLI1 and PTCH in control HaCaT Tet, uninduced HaCaT GLI2 and induced HaCaT GLI2

More information

IN SPITE of a very quick development of medicine within

IN SPITE of a very quick development of medicine within INTL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 21, VOL. 6, NO. 3, PP. 281-286 Manuscript received July 1, 21: revised September, 21. DOI: 1.2478/v1177-1-37-9 Application of Density Based Clustering

More information

LAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival*

LAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival* LAB ASSIGNMENT 4 1 INFERENCES FOR NUMERICAL DATA In this lab assignment, you will analyze the data from a study to compare survival times of patients of both genders with different primary cancers. First,

More information

Assessment of Breast Cancer with Borderline HER2 Status Using MIP Microarray

Assessment of Breast Cancer with Borderline HER2 Status Using MIP Microarray Assessment of Breast Cancer with Borderline HER2 Status Using MIP Microarray Hui Chen, Aysegul A Sahin, Xinyan Lu, Lei Huo, Rajesh R Singh, Ronald Abraham, Shumaila Virani, Bal Mukund Mishra, Russell Broaddus,

More information

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Department of Biomedical Informatics Department of Computer Science and Engineering The Ohio State University Review

More information

Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project

Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project Introduction RNA splicing is a critical step in eukaryotic gene

More information

TITLE: Unique Genomic Alterations in Prostate Cancers in African American Men

TITLE: Unique Genomic Alterations in Prostate Cancers in African American Men AD Award Number: W81XWH-12-1-0046 TITLE: Unique Genomic Alterations in Prostate Cancers in African American Men PRINCIPAL INVESTIGATOR: Michael Ittmann, M.D., Ph.D. CONTRACTING ORGANIZATION: Baylor College

More information

Classification and Statistical Analysis of Auditory FMRI Data Using Linear Discriminative Analysis and Quadratic Discriminative Analysis

Classification and Statistical Analysis of Auditory FMRI Data Using Linear Discriminative Analysis and Quadratic Discriminative Analysis International Journal of Innovative Research in Computer Science & Technology (IJIRCST) ISSN: 2347-5552, Volume-2, Issue-6, November-2014 Classification and Statistical Analysis of Auditory FMRI Data Using

More information

Array-based Comparative Genomic Hybridization and Its Application to Cancer Genomes and Human Genetics

Array-based Comparative Genomic Hybridization and Its Application to Cancer Genomes and Human Genetics J Lung Cancer 2011;10(2):77-86 http://dx.doi.org/10.6058/jlc.2011.10.2.77 Array-based Comparative Genomic Hybridization and Its Application to Cancer Genomes and Human Genetics Microarray comparative genomic

More information

Efficacy of the Extended Principal Orthogonal Decomposition Method on DNA Microarray Data in Cancer Detection

Efficacy of the Extended Principal Orthogonal Decomposition Method on DNA Microarray Data in Cancer Detection 202 4th International onference on Bioinformatics and Biomedical Technology IPBEE vol.29 (202) (202) IASIT Press, Singapore Efficacy of the Extended Principal Orthogonal Decomposition on DA Microarray

More information

Cancer outlier differential gene expression detection

Cancer outlier differential gene expression detection Biostatistics (2007), 8, 3, pp. 566 575 doi:10.1093/biostatistics/kxl029 Advance Access publication on October 4, 2006 Cancer outlier differential gene expression detection BAOLIN WU Division of Biostatistics,

More information

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015 Goals/Expectations Computer Science, Biology, and Biomedical (CoSBBI) We want to excite you about the world of computer science, biology, and biomedical informatics. Experience what it is like to be a

More information

Expanded View Figures

Expanded View Figures Solip Park & Ben Lehner Epistasis is cancer type specific Molecular Systems Biology Expanded View Figures A B G C D E F H Figure EV1. Epistatic interactions detected in a pan-cancer analysis and saturation

More information

Cancer Gene Panels. Dr. Andreas Scherer. Dr. Andreas Scherer President and CEO Golden Helix, Inc. Twitter: andreasscherer

Cancer Gene Panels. Dr. Andreas Scherer. Dr. Andreas Scherer President and CEO Golden Helix, Inc. Twitter: andreasscherer Cancer Gene Panels Dr. Andreas Scherer Dr. Andreas Scherer President and CEO Golden Helix, Inc. scherer@goldenhelix.com Twitter: andreasscherer About Golden Helix - Founded in 1998 - Main outside investor:

More information

Opinion Microarrays and molecular markers for tumor classification Brian Z Ring and Douglas T Ross

Opinion Microarrays and molecular markers for tumor classification Brian Z Ring and Douglas T Ross http://genomebiology.com/2002/3/5/comment/2005.1 Opinion Microarrays and molecular markers for tumor classification Brian Z Ring and Douglas T Ross Address: Applied Genomics Inc., 525 Del Rey Ave #B, Sunnyvale,

More information

A Biclustering Based Classification Framework for Cancer Diagnosis and Prognosis

A Biclustering Based Classification Framework for Cancer Diagnosis and Prognosis A Biclustering Based Classification Framework for Cancer Diagnosis and Prognosis Baljeet Malhotra and Guohui Lin Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada T6G 2E8

More information

Molecular Staging for Survival Prediction of. Colorectal Cancer Patients

Molecular Staging for Survival Prediction of. Colorectal Cancer Patients Molecular Staging for Survival Prediction of Colorectal Cancer Patients STEVEN ESCHRICH 1, IVANA YANG 2, GREG BLOOM 1, KA YIN KWONG 2,DAVID BOULWARE 1, ALAN CANTOR 1, DOMENICO COPPOLA 1, MOGENS KRUHØFFER

More information

Global variation in copy number in the human genome

Global variation in copy number in the human genome Global variation in copy number in the human genome Redon et. al. Nature 444:444-454 (2006) 12.03.2007 Tarmo Puurand Study 270 individuals (HapMap collection) Affymetrix 500K Whole Genome TilePath (WGTP)

More information

RESEARCH ARTICLE. Introduction Despite the high prevalence of prostate cancer, its molecular basis continues to be poorly understood.

RESEARCH ARTICLE. Introduction Despite the high prevalence of prostate cancer, its molecular basis continues to be poorly understood. RESEARCH ARTICLE Neoplasia. Vol. 6, No. 3, May/June 2004, pp. 240 247 240 www.neoplasia.com High-Resolution Analysis of Gene Copy Number Alterations in Human Prostate Cancer Using CGH on cdna Microarrays:

More information

Gene Selection for Tumor Classification Using Microarray Gene Expression Data

Gene Selection for Tumor Classification Using Microarray Gene Expression Data Gene Selection for Tumor Classification Using Microarray Gene Expression Data K. Yendrapalli, R. Basnet, S. Mukkamala, A. H. Sung Department of Computer Science New Mexico Institute of Mining and Technology

More information

S1 Appendix: Figs A G and Table A. b Normal Generalized Fraction 0.075

S1 Appendix: Figs A G and Table A. b Normal Generalized Fraction 0.075 Aiello & Alter (216) PLoS One vol. 11 no. 1 e164546 S1 Appendix A-1 S1 Appendix: Figs A G and Table A a Tumor Generalized Fraction b Normal Generalized Fraction.25.5.75.25.5.75 1 53 4 59 2 58 8 57 3 48

More information

TITLE: Detection of Genetic Alterations in Breast Sentinel Lymph Node by Array-CGH

TITLE: Detection of Genetic Alterations in Breast Sentinel Lymph Node by Array-CGH AD Award Number: W81XWH-04-1-0671 TITLE: Detection of Genetic Alterations in Breast Sentinel Lymph Node by Array-CGH PRINCIPAL INVESTIGATOR: Luciane R.Cavalli CONTRACTING ORGANIZATION: Georgetown University

More information

Hands-On Ten The BRCA1 Gene and Protein

Hands-On Ten The BRCA1 Gene and Protein Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such

More information

Digitizing the Proteomes From Big Tissue Biobanks

Digitizing the Proteomes From Big Tissue Biobanks Digitizing the Proteomes From Big Tissue Biobanks Analyzing 24 Proteomes Per Day by Microflow SWATH Acquisition and Spectronaut Pulsar Analysis Jan Muntel 1, Nick Morrice 2, Roland M. Bruderer 1, Lukas

More information

A hierarchical two-phase framework for selecting genes in cancer datasets with a neuro-fuzzy system

A hierarchical two-phase framework for selecting genes in cancer datasets with a neuro-fuzzy system Technology and Health Care 24 (2016) S601 S605 DOI 10.3233/THC-161187 IOS Press S601 A hierarchical two-phase framework for selecting genes in cancer datasets with a neuro-fuzzy system Jongwoo Lim, Bohyun

More information

Supplementary Figure 1: Comparison of acgh-based and expression-based CNA analysis of tumors from breast cancer GEMMs.

Supplementary Figure 1: Comparison of acgh-based and expression-based CNA analysis of tumors from breast cancer GEMMs. Supplementary Figure 1: Comparison of acgh-based and expression-based CNA analysis of tumors from breast cancer GEMMs. (a) CNA analysis of expression microarray data obtained from 15 tumors in the SV40Tag

More information

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from Supplementary Figure 1 SEER data for male and female cancer incidence from 1975 2013. (a,b) Incidence rates of oral cavity and pharynx cancer (a) and leukemia (b) are plotted, grouped by males (blue),

More information

Gene expression analysis for tumor classification using vector quantization

Gene expression analysis for tumor classification using vector quantization Gene expression analysis for tumor classification using vector quantization Edna Márquez 1 Jesús Savage 1, Ana María Espinosa 2, Jaime Berumen 2, Christian Lemaitre 3 1 IIMAS, Universidad Nacional Autónoma

More information

DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK

DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK CHAPTER 6 DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK Genetic research aimed at the identification of new breast cancer susceptibility genes is at an interesting crossroad. On the one hand, the existence

More information

User Guide. Association analysis. Input

User Guide. Association analysis. Input User Guide TFEA.ChIP is a tool to estimate transcription factor enrichment in a set of differentially expressed genes using data from ChIP-Seq experiments performed in different tissues and conditions.

More information

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 PGAR: ASD Candidate Gene Prioritization System Using Expression Patterns Steven Cogill and Liangjiang Wang Department of Genetics and

More information

Nature Biotechnology: doi: /nbt.1904

Nature Biotechnology: doi: /nbt.1904 Supplementary Information Comparison between assembly-based SV calls and array CGH results Genome-wide array assessment of copy number changes, such as array comparative genomic hybridization (acgh), is

More information

Detecting robust gene signature through integrated analysis of multiple types of high-throughput data in liver cancer 1

Detecting robust gene signature through integrated analysis of multiple types of high-throughput data in liver cancer 1 Acta Pharmacol Sin 2007 Dec; 28 (12): 2005 2010 Full-length article Detecting robust gene signature through integrated analysis of multiple types of high-throughput data in liver cancer 1 Xin-yu ZHANG

More information

CNV Detection and Interpretation in Genomic Data

CNV Detection and Interpretation in Genomic Data CNV Detection and Interpretation in Genomic Data Benjamin W. Darbro, M.D., Ph.D. Assistant Professor of Pediatrics Director of the Shivanand R. Patil Cytogenetics and Molecular Laboratory Overview What

More information

Predictive Biomarkers

Predictive Biomarkers Uğur Sezerman Evolutionary Selection of Near Optimal Number of Features for Classification of Gene Expression Data Using Genetic Algorithms Predictive Biomarkers Biomarker: A gene, protein, or other change

More information

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes.

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes. Supplementary Figure 1 Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes. (a,b) Values of coefficients associated with genomic features, separately

More information

Large-scale identity-by-descent mapping discovers rare haplotypes of large effect. Suyash Shringarpure 23andMe, Inc. ASHG 2017

Large-scale identity-by-descent mapping discovers rare haplotypes of large effect. Suyash Shringarpure 23andMe, Inc. ASHG 2017 Large-scale identity-by-descent mapping discovers rare haplotypes of large effect Suyash Shringarpure 23andMe, Inc. ASHG 2017 1 Why care about rare variants of large effect? Months from randomization 2

More information

Progressive Genomic Instability in the FVB/Kras[superscript LA2] Mouse Model of Lung Cancer

Progressive Genomic Instability in the FVB/Kras[superscript LA2] Mouse Model of Lung Cancer Progressive Genomic Instability in the FVB/Kras[superscript LA2] Mouse Model of Lung Cancer The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story

More information

VL Network Analysis ( ) SS2016 Week 3

VL Network Analysis ( ) SS2016 Week 3 VL Network Analysis (19401701) SS2016 Week 3 Based on slides by J Ruan (U Texas) Tim Conrad AG Medical Bioinformatics Institut für Mathematik & Informatik, Freie Universität Berlin 1 Motivation 2 Lecture

More information

On Missing Data and Genotyping Errors in Association Studies

On Missing Data and Genotyping Errors in Association Studies On Missing Data and Genotyping Errors in Association Studies Department of Biostatistics Johns Hopkins Bloomberg School of Public Health May 16, 2008 Specific Aims of our R01 1 Develop and evaluate new

More information

Molecular Markers. Marcie Riches, MD, MS Associate Professor University of North Carolina Scientific Director, Infection and Immune Reconstitution WC

Molecular Markers. Marcie Riches, MD, MS Associate Professor University of North Carolina Scientific Director, Infection and Immune Reconstitution WC Molecular Markers Marcie Riches, MD, MS Associate Professor University of North Carolina Scientific Director, Infection and Immune Reconstitution WC Overview Testing methods Rationale for molecular testing

More information

SubLasso:a feature selection and classification R package with a. fixed feature subset

SubLasso:a feature selection and classification R package with a. fixed feature subset SubLasso:a feature selection and classification R package with a fixed feature subset Youxi Luo,3,*, Qinghan Meng,2,*, Ruiquan Ge,2, Guoqin Mai, Jikui Liu, Fengfeng Zhou,#. Shenzhen Institutes of Advanced

More information

Genomic structural variation

Genomic structural variation Genomic structural variation Mario Cáceres The new genomic variation DNA sequence differs across individuals much more than researchers had suspected through structural changes A huge amount of structural

More information

Challenges of CGH array testing in children with developmental delay. Dr Sally Davies 17 th September 2014

Challenges of CGH array testing in children with developmental delay. Dr Sally Davies 17 th September 2014 Challenges of CGH array testing in children with developmental delay Dr Sally Davies 17 th September 2014 CGH array What is CGH array? Understanding the test Benefits Results to expect Consent issues Ethical

More information

Mining gene expression profiles: expression signatures as cancer phenotypes

Mining gene expression profiles: expression signatures as cancer phenotypes Mining gene expression profiles: expression signatures as cancer phenotypes Joseph R. Nevins and Anil Potti Abstract Many examples highlight the power of gene expression profiles, or signatures, to inform

More information

Genetic alterations of histone lysine methyltransferases and their significance in breast cancer

Genetic alterations of histone lysine methyltransferases and their significance in breast cancer Genetic alterations of histone lysine methyltransferases and their significance in breast cancer Supplementary Materials and Methods Phylogenetic tree of the HMT superfamily The phylogeny outlined in the

More information

Cancer cytogenetics update 2005

Cancer cytogenetics update 2005 Atlas of Genetics and Cytogenetics in Oncology and Haematology Cancer cytogenetics update 2005 Felix Mitelman Department of Clinical Genetics, University of Lund, Lund, Sweden March 2005 By January 2005,

More information

Roadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23:

Roadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23: Roadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23:7332-7341 Presented by Deming Mi 7/25/2006 Major reasons for few prognostic factors to

More information

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers Dipak K. Dey, Ulysses Diva and Sudipto Banerjee Department of Statistics University of Connecticut, Storrs. March 16,

More information

Memorial Sloan-Kettering Cancer Center

Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center, Dept. of Epidemiology & Biostatistics Working Paper Series Year 2008 Paper 15 A Metastasis or a Second Independent Cancer?

More information