Network-assisted data analysis

Size: px
Start display at page:

Download "Network-assisted data analysis"

Transcription

1 Network-assisted data analysis Bing Zhang Department of Biomedical Informatics Vanderbilt University

2 Protein identification in shotgun proteomics Protein digestion LC-MS/MS Protein assembly Database search 2 BMIF310, Fall 2009

3 Protein assembly and classification Background a b Zhang, et al. J Proteome Res 6:3549, BMIF310, Fall 2009

4 Network-assisted protein identification: motivation Current protein assembly pipelines treat proteins as individual entities. Biologically interesting proteins may be eliminated due to insufficient experimental evidence. Most biological functions arise from interactions among proteins. Can we use protein interaction network information to improve protein identification? Hypothesis: an eliminated protein is more likely to be present in the original sample if it lies close to other confidently identified proteins in the protein interaction network. 4 BMIF310, Fall 2009

5 Do confidently identified proteins cluster together? Li et al. Mol Syst Biol,5:303, BMIF310, Fall 2009

6 Neighborhood majority voting Direct neighbors (local) Iterative majority voting (global) 6 Local Global BMIF310, Fall 2009

7 Module-based analysis Direct neighbor voting Module-based 7 BMIF310, Fall 2009

8 Method evaluation: Cross validation for proteins with known labels Yeast Yeast cell culture dataset YPIN, 5,666 nodes; 126,126 edges Results: sensitivity of 56% at the specificity of 90%. CEA is more accurate and robust than other methods Mouse Mouse organ datasets: brain, lung, placenta MPIN1: 9,776 nodes; 69,470 edges MPIN2: 12,271 nodes; 236,675 edges Results: sensitivity of ~45% at the specificity of 90% Li et al. Mol Syst Biol,5:303, BMIF310, Fall 2009

9 Method evaluation: Supporting evidence for rescued proteins Mouse-MPIN1 Proteins rescued Brain: 171 (12%) Lung: 181 (10%) Placenta: 156 (11%) Independent evidences Microarray EST library Publication Ratio of support ( 2 evidences) Li et al. Mol Syst Biol,5:303, BMIF310, Fall 2009

10 Application: Breast cancer data set (normal vs tumor) Rescued proteins Normal: 139 (23%) Tumor: 95 (8%) Rescued cancer-related proteins Ctnnb1 Top1 Cancer specific sub-networks Wnt signaling pathway Cell adhesion Apoptosis Li et al. Mol Syst Biol,5:303, BMIF310, Fall 2009

11 Extended hypothesis Hypothesis: an eliminated protein is more likely to be present in the original sample if it lies close to other confidently identified proteins in the protein interaction network. Extended hypothesis: Proteins lie close to each other in a protein interaction network are more likely to share common attributes. Protein expression Protein function Disease association 11 BMIF310, Fall 2009

12 Protein function prediction: motivation Gene Ontology annotation coverage in different organisms Sharan et al. Mol Syst Biol, 3:88, BMIF310, Fall 2009

13 Network-based protein function prediction Proteins that lie closer to one another in a protein interaction network are more likely to have similar function and involve in similar biological process. GO semantic similarity Sharan et al. Mol Syst Biol, 3:88, BMIF310, Fall 2009

14 Disease gene prioritization: motivation Most common genetic disorders may result form variants in many genes, each contributing only weak effects to the disease Quantitative trait locus mapping and genome-wide association studies identify large set of candidate disease genes Assessing and selecting genes for validation is nontrivial Causal genes for the same disease are often involved in a few biological processes or molecular pathways Limb-girdle muscular dystrophy: many disease genes involve in the dystrophy complex Fanconi anemia: 5 out of the 10 disease genes identified in a genetic study function in a nuclear complex 14 BMIF310, Fall 2009

15 Network-based disease causal gene prioritization: method Kohler et al. Am J Hum Genet. 82:949, BMIF310, Fall 2009

16 Network-based disease causal gene prioritization: result Kohler et al. Am J Hum Genet. 82:949, BMIF310, Fall 2009

17 Disease biomarker identification: gene-level approach Scoring individual genes to generate a marker set Use gene expression matrix to train a classifier High variability: two breast cancer biomarker sets of ~70 genes each shared only 3 genes in common Doesn t improve mechanistic understanding of cancer Chuang et al. Mol Systems Biol, 3:140, BMIF310, Fall 2009

18 Disease biomarker identification: pathway-level approach Scoring known pathways by the coherency of expression changes among their member genes More robust More sensitive Better interpretation The majority of human genes have not yet been assigned to a definitive pathway 18 BMIF310, Fall 2009

19 Disease biomarker identification: network approach Overlay gene expression data on the protein interaction network Search for subnetworks that are highly discriminative of metastasis Use subnetwork activity matrix to train a classifier Chuang et al. Mol Systems Biol, 3:140, BMIF310, Fall 2009

20 Subnetwork markers are more robust Chuang et al. Mol Systems Biol, 3:140, BMIF310, Fall 2009

21 Subnetwork markers are more informative 47.3% and 65.4% subnetwork markers showed functional enrichment 66 and 153 subnetworks corresponded to the major events that have been implicated in the progression of cancer Signaling of cell growth and survival Cell proliferation and replication Apoptosis Cell and tissue remodeling Circulation and coagulation Metabolism Detecting important non-differentially expressed genes TP53, KARS, HARS, ERBB2, PIK3CA Chuang et al. Mol Systems Biol, 3:140, BMIF310, Fall 2009

22 Summary Guilt by association: Proteins lie close to each other in a protein interaction network are more likely to share common attributes. Protein expression Protein function Disease-related proteins Network-based disease biomarkers 22 BMIF310, Fall 2009

23 Key references Li et al. Network-assisted protein identification and data interpretation in shotgun proteomics. Molecular Systems Biology, 5:303, 2009 Ideker et al. Protein networks in disease. Genome Research, 18:644, 2008 Sharan et al. Network-based prediction of protein function. Molecular Systems Biology, 3:88, BMIF310, Fall 2009

VL Network Analysis ( ) SS2016 Week 3

VL Network Analysis ( ) SS2016 Week 3 VL Network Analysis (19401701) SS2016 Week 3 Based on slides by J Ruan (U Texas) Tim Conrad AG Medical Bioinformatics Institut für Mathematik & Informatik, Freie Universität Berlin 1 Motivation 2 Lecture

More information

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Department of Biomedical Informatics Department of Computer Science and Engineering The Ohio State University Review

More information

Integrative Omics for The Systems Biology of Complex Phenotypes

Integrative Omics for The Systems Biology of Complex Phenotypes Integrative Omics for The Systems Biology of Complex Phenotypes Mehmet Koyutürk Case Western Reserve University (1) Electrical Engineering and Computer Science (2) Center for Proteomics and Bioinformatics

More information

Suppression of the Malignant Phenotype of Bladder Cancer Cells Investigations of the Proteome in Relation to Phenotype and Gene Expression

Suppression of the Malignant Phenotype of Bladder Cancer Cells Investigations of the Proteome in Relation to Phenotype and Gene Expression Suppression of the Malignant Phenotype of Bladder Cancer Cells Investigations of the Proteome in Relation to Phenotype and Gene Expression Topics Modulation of Phenotype by Extracellular Matrix. Bioinformatics

More information

Digitizing the Proteomes From Big Tissue Biobanks

Digitizing the Proteomes From Big Tissue Biobanks Digitizing the Proteomes From Big Tissue Biobanks Analyzing 24 Proteomes Per Day by Microflow SWATH Acquisition and Spectronaut Pulsar Analysis Jan Muntel 1, Nick Morrice 2, Roland M. Bruderer 1, Lukas

More information

Integrating Genome and Functional Genomics Data to Reveal Perturbed Signaling Pathways in Ovarian Cancers

Integrating Genome and Functional Genomics Data to Reveal Perturbed Signaling Pathways in Ovarian Cancers Integrating Genome and Functional Genomics Data to Reveal Perturbed Signaling Pathways in Ovarian Cancers Songjian Lu, PhD, Xinghua Lu, MD. PhD Dept. Biomedical Informatics, Univ. Pittsburgh, PA 15232

More information

Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains

Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains Dr. Sascha Losko, Dr. Karsten Wenger, Dr. Wenzel Kalus, Dr. Andrea Ramge, Dr. Jens Wiehler,

More information

What Impact will Proteomics, Including CSF Analysis, have on the Near-term Development of Biomarkers for Nervous System Diseases?

What Impact will Proteomics, Including CSF Analysis, have on the Near-term Development of Biomarkers for Nervous System Diseases? What Impact will Proteomics, Including CSF Analysis, have on the Near-term Development of Biomarkers for Nervous System Diseases? Daumier 1 Howard Schulman (howard.schulman@menlo.ppdi.com) Biomarker Discovery,

More information

Bayesian (Belief) Network Models,

Bayesian (Belief) Network Models, Bayesian (Belief) Network Models, 2/10/03 & 2/12/03 Outline of This Lecture 1. Overview of the model 2. Bayes Probability and Rules of Inference Conditional Probabilities Priors and posteriors Joint distributions

More information

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015 Goals/Expectations Computer Science, Biology, and Biomedical (CoSBBI) We want to excite you about the world of computer science, biology, and biomedical informatics. Experience what it is like to be a

More information

A Network Partition Algorithm for Mining Gene Functional Modules of Colon Cancer from DNA Microarray Data

A Network Partition Algorithm for Mining Gene Functional Modules of Colon Cancer from DNA Microarray Data Method A Network Partition Algorithm for Mining Gene Functional Modules of Colon Cancer from DNA Microarray Data Xiao-Gang Ruan, Jin-Lian Wang*, and Jian-Geng Li Institute of Artificial Intelligence and

More information

What can we contribute to cancer research and treatment from Computer Science or Mathematics? How do we adapt our expertise for them

What can we contribute to cancer research and treatment from Computer Science or Mathematics? How do we adapt our expertise for them From Bioinformatics to Health Information Technology Outline What can we contribute to cancer research and treatment from Computer Science or Mathematics? How do we adapt our expertise for them Introduction

More information

Principles of phylogenetic analysis

Principles of phylogenetic analysis Principles of phylogenetic analysis Arne Holst-Jensen, NVI, Norway. Fusarium course, Ås, Norway, June 22 nd 2008 Distance based methods Compare C OTUs and characters X A + D = Pairwise: A and B; X characters

More information

IPA Advanced Training Course

IPA Advanced Training Course IPA Advanced Training Course October 2013 Academia sinica Gene (Kuan Wen Chen) IPA Certified Analyst Agenda I. Data Upload and How to Run a Core Analysis II. Functional Interpretation in IPA Hands-on Exercises

More information

Identifying Potential Prognostic Biomarkers by Analyzing Gene Expression for Different Cancers

Identifying Potential Prognostic Biomarkers by Analyzing Gene Expression for Different Cancers Identifying Potential Prognostic Biomarkers by Analyzing Gene Expression for Different Cancers Pragya Verma Department of Mathematics, Bioinformatics and Computer Applications Maulana Azad National Institute

More information

CS2220 Introduction to Computational Biology

CS2220 Introduction to Computational Biology CS2220 Introduction to Computational Biology WEEK 8: GENOME-WIDE ASSOCIATION STUDIES (GWAS) 1 Dr. Mengling FENG Institute for Infocomm Research Massachusetts Institute of Technology mfeng@mit.edu PLANS

More information

Supplementary Information

Supplementary Information Supplementary Information Modelling the Yeast Interactome Vuk Janjić, Roded Sharan 2 and Nataša Pržulj, Department of Computing, Imperial College London, London, United Kingdom 2 Blavatnik School of Computer

More information

Proteomic Biomarker Discovery in Breast Cancer

Proteomic Biomarker Discovery in Breast Cancer Proteomic Biomarker Discovery in Breast Cancer Rob Baxter Laboratory for Cellular and Diagnostic Proteomics Kolling Institute, University of Sydney, Royal North Shore Hospital robert.baxter@sydney.edu.au

More information

I. Setup. - Note that: autohgpec_v1.0 can work on Windows, Ubuntu and Mac OS.

I. Setup. - Note that: autohgpec_v1.0 can work on Windows, Ubuntu and Mac OS. autohgpec: Automated prediction of novel disease-gene and diseasedisease associations and evidence collection based on a random walk on heterogeneous network Duc-Hau Le 1,*, Trang T.H. Tran 1 1 School

More information

Using Bayesian Networks to Analyze Expression Data. Xu Siwei, s Muhammad Ali Faisal, s Tejal Joshi, s

Using Bayesian Networks to Analyze Expression Data. Xu Siwei, s Muhammad Ali Faisal, s Tejal Joshi, s Using Bayesian Networks to Analyze Expression Data Xu Siwei, s0789023 Muhammad Ali Faisal, s0677834 Tejal Joshi, s0677858 Outline Introduction Bayesian Networks Equivalence Classes Applying to Expression

More information

Hao D. H., Ma W. G., Sheng Y. L., Zhang J. B., Jin Y. F., Yang H. Q., Li Z. G., Wang S. S., GONG Ming*

Hao D. H., Ma W. G., Sheng Y. L., Zhang J. B., Jin Y. F., Yang H. Q., Li Z. G., Wang S. S., GONG Ming* Comparison of transcriptomes and gene expression profiles of two chilling- and drought-tolerant and intolerant Nicotiana tabacum varieties under low temperature and drought stress Hao D. H., Ma W. G.,

More information

The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis

The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis Tieliu Shi tlshi@bio.ecnu.edu.cn The Center for bioinformatics

More information

Identifying Novel Targets for Non-Small Cell Lung Cancer Just How Novel Are They?

Identifying Novel Targets for Non-Small Cell Lung Cancer Just How Novel Are They? Identifying Novel Targets for Non-Small Cell Lung Cancer Just How Novel Are They? Dubovenko Alexey Discovery Product Manager Sonia Novikova Solution Scientist September 2018 2 Non-Small Cell Lung Cancer

More information

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 PGAR: ASD Candidate Gene Prioritization System Using Expression Patterns Steven Cogill and Liangjiang Wang Department of Genetics and

More information

New Developments in LC-IMS-MS Proteomic Measurements and Informatic Analyses

New Developments in LC-IMS-MS Proteomic Measurements and Informatic Analyses New Developments in LC-IMS-MS Proteomic Measurements and Informatic Analyses Erin Shammel Baker Kristin E. Burnum-Johnson, Xing Zhang, Cameron P. Casey, Yehia M. Ibrahim, Matthew E. Monroe, Tao Liu, Brendan

More information

* To whom correspondence should be addressed.

* To whom correspondence should be addressed. Modeling Cancer Progression via Pathway Dependencies Elena Edelman 1,2, Justin Guinney 1,2, Jen-TsanChi 1,3, Phillip G. Febbo 1,3,4,7, Sayan Mukherjee 1,5,6,7* 1 Institute for Genome Sciences & Policy,

More information

Multi-omics data integration colon cancer using proteogenomics approach

Multi-omics data integration colon cancer using proteogenomics approach Dept. of Medical Oncology Multi-omics data integration colon cancer using proteogenomics approach DTL Focus meeting, 29 August 2016 Thang Pham OncoProteomics Laboratory, Dept. of Medical Oncology VU University

More information

Table of content. -Supplementary methods. -Figure S1. -Figure S2. -Figure S3. -Table legend

Table of content. -Supplementary methods. -Figure S1. -Figure S2. -Figure S3. -Table legend Table of content -Supplementary methods -Figure S1 -Figure S2 -Figure S3 -Table legend Supplementary methods Yeast two-hybrid bait basal transactivation test Because bait constructs sometimes self-transactivate

More information

BIOINFORMATICS ORIGINAL PAPER

BIOINFORMATICS ORIGINAL PAPER BIOINFORMATICS ORIGINAL PAPER Vol. 25 no. 2 2009, pages 237 242 doi:10.1093/bioinformatics/btn613 Genetics and population analysis Prioritizing risk pathways: a novel association approach to searching

More information

Nature Neuroscience: doi: /nn Supplementary Figure 1

Nature Neuroscience: doi: /nn Supplementary Figure 1 Supplementary Figure 1 Illustration of the working of network-based SVM to confidently predict a new (and now confirmed) ASD gene. Gene CTNND2 s brain network neighborhood that enabled its prediction by

More information

NEpiC: a network-assisted algorithm for epigenetic studies using mean and variance combined signals

NEpiC: a network-assisted algorithm for epigenetic studies using mean and variance combined signals Published online 14 June 2016 Nucleic Acids Research, 2016, Vol. 44, No. 16 e134 doi: 10.1093/nar/gkw546 NEpiC: a network-assisted algorithm for epigenetic studies using mean and variance combined signals

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Effect of HSP90 inhibition on expression of endogenous retroviruses. (a) Inducible shrna-mediated Hsp90 silencing in mouse ESCs. Immunoblots of total cell extract expressing the

More information

Cancer gene discovery via network analysis of somatic mutation data. Insuk Lee

Cancer gene discovery via network analysis of somatic mutation data. Insuk Lee Cancer gene discovery via network analysis of somatic mutation data Insuk Lee Cancer is a progressive genetic disorder. Accumulation of somatic mutations cause cancer. For example, in colorectal cancer,

More information

University of Pittsburgh Cancer Institute UPMC CancerCenter. Uma Chandran, MSIS, PhD /21/13

University of Pittsburgh Cancer Institute UPMC CancerCenter. Uma Chandran, MSIS, PhD /21/13 University of Pittsburgh Cancer Institute UPMC CancerCenter Uma Chandran, MSIS, PhD chandran@pitt.edu 412-648-9326 2/21/13 University of Pittsburgh Cancer Institute Founded in 1985 Director Nancy Davidson,

More information

Supplementary Figure 1

Supplementary Figure 1 Supplementary Figure 1 An example of the gene-term-disease network automatically generated by Phenolyzer web server for 'autism'. The largest word represents the user s input term, Autism. The pink round

More information

Systems Level Analysis and Identification of Pathways and Networks Associated with Liver Fibrosis

Systems Level Analysis and Identification of Pathways and Networks Associated with Liver Fibrosis Systems Level Analysis and Identification of Pathways and Networks Associated with Liver Fibrosis Mohamed Diwan M. AbdulHameed 1, Gregory J. Tawa 1, Kamal Kumar 1, Danielle L. Ippolito 2, John A. Lewis

More information

Supplementary Figure 1. Schematic diagram of o2n-seq. Double-stranded DNA was sheared, end-repaired, and underwent A-tailing by standard protocols.

Supplementary Figure 1. Schematic diagram of o2n-seq. Double-stranded DNA was sheared, end-repaired, and underwent A-tailing by standard protocols. Supplementary Figure 1. Schematic diagram of o2n-seq. Double-stranded DNA was sheared, end-repaired, and underwent A-tailing by standard protocols. A-tailed DNA was ligated to T-tailed dutp adapters, circularized

More information

A bioinformatics driven cancer study

A bioinformatics driven cancer study The 11th China-Japan-Korea Bioinformatics Training Course & Symposium, June 16-18, Suzhou, China A bioinformatics driven cancer study Chao Li and Yixue Li yxli@sibs.ac.cn Shanghai Institutes for Bio-Medicine

More information

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies

More information

Integrative analysis of survival-associated gene sets in breast cancer

Integrative analysis of survival-associated gene sets in breast cancer Varn et al. BMC Medical Genomics (2015) 8:11 DOI 10.1186/s12920-015-0086-0 RESEARCH ARTICLE Open Access Integrative analysis of survival-associated gene sets in breast cancer Frederick S Varn 1, Matthew

More information

Chapter 5: Global Analysis of the Effect of Densin. Knockout on Gene Transcription in the Brain

Chapter 5: Global Analysis of the Effect of Densin. Knockout on Gene Transcription in the Brain Chapter 5: Global Analysis of the Effect of Densin Knockout on Gene Transcription in the Brain Introduction High throughput methods for analyzing transcriptional changes in response to deletion or mutation

More information

Drug repurposing and therapeutic anti-mirna predictions in oxldl-induced the proliferation of vascular smooth muscle cell associated diseases

Drug repurposing and therapeutic anti-mirna predictions in oxldl-induced the proliferation of vascular smooth muscle cell associated diseases Drug repurposing and therapeutic anti-mirna predictions in oxldl-induced the proliferation of vascular smooth muscle cell associated diseases Shun-Tsung Chen, Chien-Hung Huang, Victor C. Kok, Chi-Ying

More information

Integration of Genetic and Genomic Approaches for the Analysis of Chronic Fatigue Syndrome Implicates Forkhead Box N1

Integration of Genetic and Genomic Approaches for the Analysis of Chronic Fatigue Syndrome Implicates Forkhead Box N1 Integration of Genetic and Genomic Approaches for the Analysis of Chronic Fatigue Syndrome Implicates Forkhead Box N1 Angela Presson, Jeanette Papp, Eric Sobel, and Steve Horvath Biostatistics and Human

More information

MS1 and MS2 crosstalk in label free quantitation of mass spectrometry data independent acquisitions

MS1 and MS2 crosstalk in label free quantitation of mass spectrometry data independent acquisitions MS1 and MS2 crosstalk in label free quantitation of mass spectrometry data independent acquisitions MS1 528.18 +++ m/z 568.98 ++ m/z 678.34 ++ m/z MS2/SWATH June 9th, 2013 Matthew J. Rardin SIRT3 regulated

More information

Journal of Biomedical Discovery and Collaboration

Journal of Biomedical Discovery and Collaboration Journal of Biomedical Discovery and Collaboration A cognitive task analysis of a visual analytic workflow: Exploring molecular interaction networks in systems biology Barbara Mirel 1,*, Felix Eichinger

More information

Evaluating Classifiers for Disease Gene Discovery

Evaluating Classifiers for Disease Gene Discovery Evaluating Classifiers for Disease Gene Discovery Kino Coursey Lon Turnbull khc0021@unt.edu lt0013@unt.edu Abstract Identification of genes involved in human hereditary disease is an important bioinfomatics

More information

SAPLING: A Tool for Gene Network Analysis focusing on Psychiatric Genetics

SAPLING: A Tool for Gene Network Analysis focusing on Psychiatric Genetics SAPLING: A Tool for Gene Network Analysis focusing on Psychiatric Genetics sapling.cshl.edu Wim Verleyen, Ph.D. Gillis Lab Outline Motivation Disease-gene analysis Enrichment analysis Gene network analysis:

More information

Limsoon Wong Global-COE Workshop on Engineering/Information Science for Integrated Life Science and Predictive Medicine 28 February 2012

Limsoon Wong Global-COE Workshop on Engineering/Information Science for Integrated Life Science and Predictive Medicine 28 February 2012 A Novel Contextualization Approach to Proteomic Profile Analysis Limsoon Wong Global-COE Workshop on Engineering/Information Science for Integrated Life Science and Predictive Medicine 28 February 2012

More information

Shotgun metaproteomics of the human distal gut microbiota. Present by Lei Chen

Shotgun metaproteomics of the human distal gut microbiota. Present by Lei Chen Shotgun metaproteomics of the human distal gut microbiota Present by Lei Chen (lc6@indana.edu) Outline Background What are the goals? Materials and Methods Results Discussion Background The human gastrointestinal

More information

Gene Ontology and Functional Enrichment. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein

Gene Ontology and Functional Enrichment. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein Gene Ontology and Functional Enrichment Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein The parsimony principle: A quick review Find the tree that requires the fewest

More information

Validation of Genetic Determinants of Skeletal Diseases. Dr Kasia Piróg (Newcastle University)

Validation of Genetic Determinants of Skeletal Diseases. Dr Kasia Piróg (Newcastle University) Validation of Genetic Determinants of Skeletal Diseases Dr Kasia Piróg (Newcastle University) What is? Skeletal conditions Common Rare related to ageing and/or injury often multifactorial often difficult

More information

PathAct: a novel method for pathway analysis using gene expression profiles

PathAct: a novel method for pathway analysis using gene expression profiles www.bioinformation.net Hypothesis Volume 9(8) PathAct: a novel method for pathway analysis using gene expression profiles Kaoru Mogushi & Hiroshi Tanaka* Department of Bioinformatics, Division of Medical

More information

Nature Structural & Molecular Biology: doi: /nsmb.2419

Nature Structural & Molecular Biology: doi: /nsmb.2419 Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb

More information

PCxN: The pathway co-activity map: a resource for the unification of functional biology

PCxN: The pathway co-activity map: a resource for the unification of functional biology PCxN: The pathway co-activity map: a resource for the unification of functional biology Sheffield Institute for Translational Neurosciences Center for Integrative Genome Translation GENOME INFORMATICS

More information

Integrated Analysis of Copy Number and Gene Expression

Integrated Analysis of Copy Number and Gene Expression Integrated Analysis of Copy Number and Gene Expression Nexus Copy Number provides user-friendly interface and functionalities to integrate copy number analysis with gene expression results for the purpose

More information

Association mapping (qualitative) Association scan, quantitative. Office hours Wednesday 3-4pm 304A Stanley Hall. Association scan, qualitative

Association mapping (qualitative) Association scan, quantitative. Office hours Wednesday 3-4pm 304A Stanley Hall. Association scan, qualitative Association mapping (qualitative) Office hours Wednesday 3-4pm 304A Stanley Hall Fig. 11.26 Association scan, qualitative Association scan, quantitative osteoarthritis controls χ 2 test C s G s 141 47

More information

Characteriza*on of Soma*c Muta*ons in Cancer Genomes

Characteriza*on of Soma*c Muta*ons in Cancer Genomes Characteriza*on of Soma*c Muta*ons in Cancer Genomes Ben Raphael Department of Computer Science Center for Computa*onal Molecular Biology Soma*c Muta*ons and Cancer Clonal Theory (Nowell 1976) Passenger

More information

Simple, rapid, and reliable RNA sequencing

Simple, rapid, and reliable RNA sequencing Simple, rapid, and reliable RNA sequencing RNA sequencing applications RNA sequencing provides fundamental insights into how genomes are organized and regulated, giving us valuable information about the

More information

Cancer Genetics. What is Cancer? Cancer Classification. Medical Genetics. Uncontrolled growth of cells. Not all tumors are cancerous

Cancer Genetics. What is Cancer? Cancer Classification. Medical Genetics. Uncontrolled growth of cells. Not all tumors are cancerous Session8 Medical Genetics Cancer Genetics J avad Jamshidi F a s a U n i v e r s i t y o f M e d i c a l S c i e n c e s, N o v e m b e r 2 0 1 7 What is Cancer? Uncontrolled growth of cells Not all tumors

More information

AVENIO ctdna Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB

AVENIO ctdna Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB Analysis Kits Next-generation performance in liquid biopsies 2 Accelerating clinical research From liquid biopsy to next-generation

More information

PRINCESSA: Erstellung eines R-Paketes und einer zugehörigen Web-Applikation zur ProteinCoexpressions-Analyse

PRINCESSA: Erstellung eines R-Paketes und einer zugehörigen Web-Applikation zur ProteinCoexpressions-Analyse PRINCESSA: Erstellung eines R-Paketes und einer zugehörigen Web-Applikation zur ProteinCoexpressions-Analyse Studienprojekt im Vertiefungsgebiet Bioinformatik Ruhr-Universität Bochum July 11th, 2018 Dr.

More information

Global variation in copy number in the human genome

Global variation in copy number in the human genome Global variation in copy number in the human genome Redon et. al. Nature 444:444-454 (2006) 12.03.2007 Tarmo Puurand Study 270 individuals (HapMap collection) Affymetrix 500K Whole Genome TilePath (WGTP)

More information

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies 2017 Contents Datasets... 2 Protein-protein interaction dataset... 2 Set of known PPIs... 3 Domain-domain interactions...

More information

EXPression ANalyzer and DisplayER

EXPression ANalyzer and DisplayER EXPression ANalyzer and DisplayER Tom Hait Aviv Steiner Igor Ulitsky Chaim Linhart Amos Tanay Seagull Shavit Rani Elkon Adi Maron-Katz Dorit Sagir Eyal David Roded Sharan Israel Steinfeld Yossi Shiloh

More information

A quick review. The clustering problem: Hierarchical clustering algorithm: Many possible distance metrics K-mean clustering algorithm:

A quick review. The clustering problem: Hierarchical clustering algorithm: Many possible distance metrics K-mean clustering algorithm: The clustering problem: partition genes into distinct sets with high homogeneity and high separation Hierarchical clustering algorithm: 1. Assign each object to a separate cluster. 2. Regroup the pair

More information

Classification with microarray data

Classification with microarray data Classification with microarray data Aron Charles Eklund eklund@cbs.dtu.dk DNA Microarray Analysis - #27612 January 8, 2010 The rest of today Now: What is classification, and why do we do it? How to develop

More information

Figure S6. A-J) Annotated UVPD mass spectra for top ten peptides found among the peptides identified by Byonic but not SEQUEST + Percolator.

Figure S6. A-J) Annotated UVPD mass spectra for top ten peptides found among the peptides identified by Byonic but not SEQUEST + Percolator. Extending Proteome Coverage by Combining MS/MS Methods and a Modified Bioinformatics Platform adapted for Database Searching of Positive and Negative Polarity 193 nm Ultraviolet Photodissociation Mass

More information

A homologous mapping method for threedimensional reconstruction of protein networks reveals disease-associated mutations

A homologous mapping method for threedimensional reconstruction of protein networks reveals disease-associated mutations Huang et al. BMC Systems Biology 2018, 12(Suppl 2):13 https://doi.org/10.1186/s12918-018-0537-2 RESEARCH A homologous mapping method for threedimensional reconstruction of protein networks reveals disease-associated

More information

Comparative Effectiveness Research (CER) and Personalized Medicine: Policy, Science, and Business

Comparative Effectiveness Research (CER) and Personalized Medicine: Policy, Science, and Business Comparative Effectiveness Research (CER) and Personalized Medicine: How a comprehensive CER system can support personalized medicine Amy P. Abernethy, MD October 28, 2009 CER in Cancer Care? 2 Friends

More information

Systems Biology in Metabolomics

Systems Biology in Metabolomics Systems Biology in Metabolomics Maciej Lalowski PhD, Adjunct Professor Biomedicum Helsinki Helsinki University and Folkhälsan Institute of Genetics maciej.lalowski@helsinki.fi TrasMed Course 25.09.2014

More information

Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM. Bioinformatics, 2010

Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM. Bioinformatics, 2010 Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM. Bioinformatics, 2010 C.J.Vaske et al. May 22, 2013 Presented by: Rami Eitan Complex Genomic

More information

Supplemental Material

Supplemental Material 1 Supplemental Material Golomb, J.D, and Kanwisher, N. (2012). Higher-level visual cortex represents retinotopic, not spatiotopic, object location. Cerebral Cortex. Contents: - Supplemental Figures S1-S3

More information

Figure S4. 15 Mets Whole Exome. 5 Primary Tumors Cancer Panel and WES. Next Generation Sequencing

Figure S4. 15 Mets Whole Exome. 5 Primary Tumors Cancer Panel and WES. Next Generation Sequencing Figure S4 Next Generation Sequencing 15 Mets Whole Exome 5 Primary Tumors Cancer Panel and WES Get coverage of all variant loci for all three Mets Variant Filtering Sequence Alignments Index and align

More information

CoINcIDE: A framework for discovery of patient subtypes across multiple datasets

CoINcIDE: A framework for discovery of patient subtypes across multiple datasets Planey and Gevaert Genome Medicine (2016) 8:27 DOI 10.1186/s13073-016-0281-4 METHOD CoINcIDE: A framework for discovery of patient subtypes across multiple datasets Catherine R. Planey and Olivier Gevaert

More information

Basket and Umbrella Trial Designs in Oncology

Basket and Umbrella Trial Designs in Oncology Basket and Umbrella Trial Designs in Oncology Eric Polley Biomedical Statistics and Informatics Mayo Clinic Polley.Eric@mayo.edu Dose Selection for Cancer Treatment Drugs Stanford Medicine May 2017 1 /

More information

Models of cell signalling uncover molecular mechanisms of high-risk neuroblastoma and predict outcome

Models of cell signalling uncover molecular mechanisms of high-risk neuroblastoma and predict outcome Models of cell signalling uncover molecular mechanisms of high-risk neuroblastoma and predict outcome Marta R. Hidalgo 1, Alicia Amadoz 2, Cankut Çubuk 1, José Carbonell-Caballero 2 and Joaquín Dopazo

More information

CDC & Florida DOH Attribution

CDC & Florida DOH Attribution FCDS Annual Educational Conference Tampa, Florida July 19, 2018 Steven Peace, CTR 1 CDC & Florida DOH Attribution Funding for this conference was made possible (in part) by the Centers for Disease Control

More information

Fluxion Biosciences and Swift Biosciences Somatic variant detection from liquid biopsy samples using targeted NGS

Fluxion Biosciences and Swift Biosciences Somatic variant detection from liquid biopsy samples using targeted NGS APPLICATION NOTE Fluxion Biosciences and Swift Biosciences OVERVIEW This application note describes a robust method for detecting somatic mutations from liquid biopsy samples by combining circulating tumor

More information

Unsupervised Identification of Isotope-Labeled Peptides

Unsupervised Identification of Isotope-Labeled Peptides Unsupervised Identification of Isotope-Labeled Peptides Joshua E Goldford 13 and Igor GL Libourel 124 1 Biotechnology institute, University of Minnesota, Saint Paul, MN 55108 2 Department of Plant Biology,

More information

Multiplexed Cancer Pathway Analysis

Multiplexed Cancer Pathway Analysis NanoString Technologies, Inc. Multiplexed Cancer Pathway Analysis for Gene Expression Lucas Dennis, Patrick Danaher, Rich Boykin, Joseph Beechem NanoString Technologies, Inc., Seattle WA 98109 v1.0 MARCH

More information

LECTURE-15. itraq Clinical Applications HANDOUT. Isobaric Tagging for Relative and Absolute quantitation (itraq) is a quantitative MS

LECTURE-15. itraq Clinical Applications HANDOUT. Isobaric Tagging for Relative and Absolute quantitation (itraq) is a quantitative MS LECTURE-15 itraq Clinical Applications HANDOUT PREAMBLE Isobaric Tagging for Relative and Absolute quantitation (itraq) is a quantitative MS based method for quantifying proteins subject to various different

More information

CS 6824: Tissue-Based Map of the Human Proteome

CS 6824: Tissue-Based Map of the Human Proteome CS 6824: Tissue-Based Map of the Human Proteome T. M. Murali November 17, 2016 Human Protein Atlas Measure protein and gene expression using tissue microarrays and deep sequencing, respectively. Alternative

More information

Supporting Information

Supporting Information Supporting Information Retinal expression of small non-coding RNAs in a murine model of proliferative retinopathy Chi-Hsiu Liu 1, Zhongxiao Wang 1, Ye Sun 1, John Paul SanGiovanni 2, Jing Chen 1, * 1 Department

More information

Genomics Research. May 31, Malvika Pillai

Genomics Research. May 31, Malvika Pillai Genomics Research May 31, 2018 Malvika Pillai Outline for Research Discussion Why Informatics Read journal articles! Example Paper Presentation Research Pipeline How To Read A Paper If I m aiming to just

More information

Statement of research interest

Statement of research interest Statement of research interest Milos Hauskrecht My primary field of research interest is Artificial Intelligence (AI). Within AI, I am interested in problems related to probabilistic modeling, machine

More information

Title: Pathway-Based Classification of Cancer Subtypes

Title: Pathway-Based Classification of Cancer Subtypes Title: Pathway-Based Classification of Cancer Subtypes Running title: Pathway-based classification of cancer subtypes Shinuk Kim 1, Mark Kon 1,2*, Charles DeLisi 1 1 Bioinformatics program, Boston University,

More information

Ovarian stimulation and inflammation, not always friends! Marc-André Sirard DVM PhD, Canadian Research Chair in Reproduction Genomics

Ovarian stimulation and inflammation, not always friends! Marc-André Sirard DVM PhD, Canadian Research Chair in Reproduction Genomics Ovarian stimulation and inflammation, not always friends! Marc-André Sirard DVM PhD, Canadian Research Chair in Reproduction Genomics Conflict of interest Controlled Ovarian Stimulation Timing Test (COST2)GFI

More information

Gene interaction enrichment and network analysis to identify dysregulated pathways and their interactions in complex diseases

Gene interaction enrichment and network analysis to identify dysregulated pathways and their interactions in complex diseases Liu et al. BMC Systems Biology 2012, 6:65 RESEARCH ARTICLE Open Access Gene interaction enrichment and network analysis to identify dysregulated pathways and their interactions in complex diseases Yu Liu

More information

Variant Detection & Interpretation in a diagnostic context. Christian Gilissen

Variant Detection & Interpretation in a diagnostic context. Christian Gilissen Variant Detection & Interpretation in a diagnostic context Christian Gilissen c.gilissen@gen.umcn.nl 28-05-2013 So far Sequencing Johan den Dunnen Marja Jakobs Ewart de Bruijn Mapping Victor Guryev Variant

More information

Rare Variant Burden Tests. Biostatistics 666

Rare Variant Burden Tests. Biostatistics 666 Rare Variant Burden Tests Biostatistics 666 Last Lecture Analysis of Short Read Sequence Data Low pass sequencing approaches Modeling haplotype sharing between individuals allows accurate variant calls

More information

International Journal of Innovative Research in Computer and Communication Engineering. (An ISO 3297: 2007 Certified Organization)

International Journal of Innovative Research in Computer and Communication Engineering. (An ISO 3297: 2007 Certified Organization) Reconstruction of Gene Regulatory Network to Identify Prognostic Molecular Markers of the Reactive Stroma of Breast and Prostate Cancer Using Information Theoretic Approach Prof. Shanthi Mahesh 1, Dr.

More information

SYSTEMS BIOLOGY APPROACHES TO IDENTIFY MECHANISMS OF IMMUNE MEDIATED PROTECTION TRANSLATING RESEARCH INTO HEALTH

SYSTEMS BIOLOGY APPROACHES TO IDENTIFY MECHANISMS OF IMMUNE MEDIATED PROTECTION TRANSLATING RESEARCH INTO HEALTH SYSTEMS BIOLOGY APPROACHES TO IDENTIFY MECHANISMS OF IMMUNE MEDIATED PROTECTION TRANSLATING RESEARCH INTO HEALTH Novel assays to decipher protective immune responses Decoding the immune response to infectious

More information

Identifying Causal Genes and Dysregulated Pathways in Complex Diseases

Identifying Causal Genes and Dysregulated Pathways in Complex Diseases Identifying Causal Genes and Dysregulated Pathways in Complex Diseases Yoo-Ah Kim, Stefan Wuchty, Teresa M. Przytycka* National Center for Biotechnology Information, National Library of Medicine, National

More information

APPLICATION NOTE. Highly reproducible and Comprehensive Proteome Profiling of Formalin-Fixed Paraffin-Embedded (FFPE) Tissues Slices

APPLICATION NOTE. Highly reproducible and Comprehensive Proteome Profiling of Formalin-Fixed Paraffin-Embedded (FFPE) Tissues Slices APPLICATION NOTE Highly reproducible and Comprehensive Proteome Profiling of Formalin-Fixed Paraffin-Embedded (FFPE) Tissues Slices INTRODUCTION Preservation of tissue biopsies is a critical step to This

More information

Ginkgo Interactive analysis and quality assessment of single-cell CNV data

Ginkgo Interactive analysis and quality assessment of single-cell CNV data Ginkgo Interactive analysis and quality assessment of single-cell CNV data @RobAboukhalil Robert Aboukhalil, Tyler Garvin, Jude Kendall, Timour Baslan, Gurinder S. Atwal, Jim Hicks, Michael Wigler, Michael

More information

BIT 120. Copy of Cancer/HIV Lecture

BIT 120. Copy of Cancer/HIV Lecture BIT 120 Copy of Cancer/HIV Lecture Cancer DEFINITION Any abnormal growth of cells that has malignant potential i.e.. Leukemia Uncontrolled mitosis in WBC Genetic disease caused by an accumulation of mutations

More information

An expanded view of complex traits: from polygenic to omnigenic

An expanded view of complex traits: from polygenic to omnigenic BIRS 2017 An expanded view of complex traits: from polygenic to omnigenic How does human genetic variation drive variation in complex traits/disease risk? Yang I Li Stanford University Evan Boyle Jonathan

More information

Analysis of Cancer Omics Data In A Semantic Web Framework

Analysis of Cancer Omics Data In A Semantic Web Framework Analysis of Cancer Omics Data In A Semantic Web Framework Matt Holford Yale University 12/10/10 Matt Holford Analysis of Cancer Omics Data In A Semantic Web Framework 1/17 yspore Specialized Program in

More information

Integrated Proteomic and Genomic Analysis of Gastric Cancer

Integrated Proteomic and Genomic Analysis of Gastric Cancer SUPPORTING INFORMATION PAGE S-1 Integrated Proteomic and Genomic Analysis of Gastric Cancer Patient Tissues Julia Fangfei Yan 1, Hoguen Kim 2, Seul-Ki Jeong 3, Hyoung-Joo Lee 3, Manveen K. Sethi 4, Ling

More information

A comparative study of cancer proteins in the human protein-protein interaction network

A comparative study of cancer proteins in the human protein-protein interaction network RESEARCH Open Access A comparative study of cancer proteins in the human protein-protein interaction network Jingchun Sun 1,2, Zhongming Zhao 1,2,3* From The ISIBM International Joint Conference on Bioinformatics,

More information