SAPLING: A Tool for Gene Network Analysis focusing on Psychiatric Genetics

Size: px
Start display at page:

Download "SAPLING: A Tool for Gene Network Analysis focusing on Psychiatric Genetics"

Transcription

1 SAPLING: A Tool for Gene Network Analysis focusing on Psychiatric Genetics sapling.cshl.edu Wim Verleyen, Ph.D. Gillis Lab

2 Outline Motivation Disease-gene analysis Enrichment analysis Gene network analysis: network and algorithm User interface Experiment and method configuration Report generation

3 Motivation Disease-gene analysis User interface Report generation

4 Motivation Consistent functional interpretation of a disease gene list Traditional: enrichment analysis (what if there is no statistical significance?) Alternative: gene network analysis Provide brain-specific network data for analyzing promiscuous expression in the brain

5 Motivation GeneMANIA: Heterogeneous data resources Limited support for in-depth analysis DAPPLE: 1 data resources Supports very in-depth analysis New web application: Heterogeneous data resources In-depth analysis

6 Motivation Disease-gene analysis User interface Report generation

7 CHD8 DYRK1A ANK2 GRIN2B DSCAM CHD2 ARID1B KDM6B ADNP MED13L NCKAP1 ANKRD11 DIP2A SCN2A TNRC6B WDFY3 PHF2 WAC KDM5B POGZ RIMS1 FOXP1 GIGYF1 KATNAL2 KMT2E TBR1 TCF7L2 Disease-gene analysis Gene list: GWAS (common variants), exome sequencing (rare variants), disease-associated-genes, Gene Ontology (GO), etc. Analysis methods: Enrichment analysis: is the gene list enriched for a reference gene list. Gene network analysis method: predict function of (protein-coding) genes based on previous annotations. 1 Iossifov et al. (2014). The contribution of de novo coding mutations to autism spectrum disorder. Nature.

8 CHD8 DYRK1A ANK2 GRIN2B DSCAM CHD2 ARID1B KDM6B ADNP MED13L NCKAP1 ANKRD11 DIP2A SCN2A TNRC6B WDFY3 PHF2 WAC KDM5B POGZ RIMS1 FOXP1 GIGYF1 KATNAL2 KMT2E TBR1 TCF7L2 Network statistics Gene list Network: nodes represent genes edges represent interactions between genes node degree: # of connections of a node representation bias: node has a connection in the network

9 CHD8 DYRK1A ANK2 GRIN2B DSCAM CHD2 ARID1B KDM6B ADNP MED13L NCKAP1 ANKRD11 DIP2A SCN2A TNRC6B WDFY3 PHF2 WAC KDM5B POGZ RIMS1 FOXP1 GIGYF1 KATNAL2 KMT2E TBR1 TCF7L2 Gene network analysis method Gene list Probability function-associated-gene

10 Aggregation

11 Aggregation Aggregation of output scores of individual methods for gene function prediction.

12 Motivation Disease-gene analysis User interface Report generation

13 User A registration User interface

14 User interface A User registration B Experiment configuration Gene list CHD8 DYRK1A ANK2 GRIN2B DSCAM CHD2 ARID1B KDM6B ADNP MED13L NCKAP1 ANKRD11 DIP2A SCN2A TNRC6B WDFY3 PHF2 WAC KDM5B POGZ RIMS1 FOXP1 GIGYF1 KATNAL2 KMT2E TBR1 TCF7L2

15 User interface User registration Experiment configuration A B C Gene list CHD8 DYRK1A ANK2 GRIN2B DSCAM CHD2 ARID1B KDM6B ADNP MED13L NCKAP1 ANKRD11 DIP2A SCN2A TNRC6B WDFY3 PHF2 WAC KDM5B POGZ RIMS1 FOXP1 GIGYF1 KATNAL2 KMT2E TBR1 TCF7L2 Method configuration Network Algorithms Neighbor voting Logistic regression RankProp SGD-SVM Passive aggressive Random walk with restarts

16 User interface User registration Experiment configuration Method configuration A B C D Gene list Network CHD8 DYRK1A ANK2 GRIN2B DSCAM CHD2 ARID1B KDM6B ADNP MED13L NCKAP1 ANKRD11 DIP2A SCN2A Algorithms TNRC6B WDFY3 Neighbor voting PHF2 WAC KDM5B Logistic regression POGZ RIMS1 RankProp FOXP1 GIGYF1 SGD-SVM KATNAL2 KMT2E Passive aggressive TBR1 TCF7L2 Random walk with restarts Experiment execution

17 User interface User registration Experiment configuration Gene list CHD8 DYRK1A ANK2 GRIN2B DSCAM CHD2 ARID1B KDM6B ADNP MED13L NCKAP1 ANKRD11 DIP2A SCN2A TNRC6B WDFY3 PHF2 WAC KDM5B POGZ RIMS1 FOXP1 GIGYF1 KATNAL2 KMT2E TBR1 TCF7L2 Method configuration Network Algorithms Neighbor voting Logistic regression RankProp SGD-SVM Passive aggressive Random walk with restarts Experiment execution A B C D E Report generation

18 (B) Experiment configuration Gene list: any list of gene for which you would like to understand its generalizable properties. Experiment name: Each experiment allows you to collect analysis with different data resources and algorithms.

19 (C) Method configuration Algorithms Networks Neighbor voting RankProp Logistic regression SGD-SVM Passive aggressive Random walk with restarts DNA RNA Protein Sequence similarity (SQ) BLAST and pathogenicity Co-expression (Co) microarray bulk and single cell RNA Seq Protein-protein interaction (PPI) BioGRID, HIPPIE, I2D, IntAct, and GeneMANIA Semantic similarity (SM) Biological pathways KEGG and Reactome Disease Phenocarta Shared protein domains Pfam and InterPro

20 Heterogeneity of networks

21 Motivation Disease-gene analysis User interface Report generation

22 Report generation Enrichment analysis on the original gene list Extend gene list with novel candidate genes Performance evaluation for novel candidate genes Network visualization and enrichment analysis on extended gene list All methods finished? No Yes Consensus prediction of novel candidate genes

23 CHD8 DYRK1A ANK2 GRIN2B DSCAM CHD2 ARID1B KDM6B ADNP MED13L NCKAP1 ANKRD11 DIP2A SCN2A TNRC6B WDFY3 PHF2 WAC KDM5B POGZ RIMS1 FOXP1 GIGYF1 KATNAL2 KMT2E TBR1 TCF7L2 1 Rare variants in autism Recurrent de novo mutations for the Simons Simplex Collection (SSC) Import gene list for functional interpretation with SAPLING Full report can be found at sapling.cshl.edu Iossifov et al. (2014). The contribution of de novo coding mutations to autism spectrum disorder. Nature. 1

24 Enrichment analysis GO identifier Function Multiple test correction: Benjamini - Hochberg adjusted p-value

25 Enrichment analysis cont. GO identifier Overlapping genes between original and reference gene list Number of gene is reference gene list Number of overlapping genes

26 Network analysis: top 20 novel candidates Gene symbol Standardized ranked output score of method (neighbor voting and Phenocarta)

27 Network analysis: performance evaluation ROC curve of method (neighbor voting and InterPro) Performance comparison with reference gene lists

28 Network analysis: visualization Genes in original gene list Novel candidate genes Indirect connected genes Direct connection Indirect connection

29 Consensus prediction of novel candidates Top 20 candidate genes ROC curve for the consensus predictor

30 Conclusion User-defined network analysis and enrichment analysis SAPLING supports a field-wide functional interpretation of a disease-gene list Find sample reports at sapling.cshl.edu for ADHD and synaptic interactions Please do not hesitate to contact us if you would like to suggest any new features

31 Acknowledgements Gillis Lab at CSHL Jesse Gillis Sara Ballouz Megan Crow IT crowds at CSHL Supported by a grant from T. and V. Stanley

Role of Molecular Genetic Assessment for Autism in Clinical Setting

Role of Molecular Genetic Assessment for Autism in Clinical Setting Role of Molecular Genetic Assessment for Autism in Clinical Setting Young Shin Kim, MD, MS, MPH, PhD Director STAR Center for ASD and NDDs Psychiatric Genetic Epidemiology Associate Professor Dept. of

More information

Sensitivity always trumps specificity in every disorder in DSM5 No axes (as determined by APA) All disorders must have severity indices (as per APA)

Sensitivity always trumps specificity in every disorder in DSM5 No axes (as determined by APA) All disorders must have severity indices (as per APA) Sensitivity always trumps specificity in every disorder in DSM5 No axes (as determined by APA) All disorders must have severity indices (as per APA) No specification of research standards or methods General

More information

Rebecca Muhle, M.D. Ph.D. Yale Child Study Center Yale University The Autism Center Hospital for Special Care

Rebecca Muhle, M.D. Ph.D. Yale Child Study Center Yale University The Autism Center Hospital for Special Care Genetic Work-Up of Autism Spectrum Disorder in a Clinical Setting Rebecca Muhle, M.D. Ph.D. Yale Child Study Center Yale University The Autism Center Hospital for Special Care Disclosures statement I have

More information

The contribution of de novo coding mutations to autism spectrum disorder

The contribution of de novo coding mutations to autism spectrum disorder doi:10.1038/nature13908 The contribution of de novo coding mutations to autism spectrum disorder Ivan Iossifov 1 *, Brian J. O Roak,3 *, Stephan J. Sanders 4,5 *, Michael Ronemus 1 *, Niklas Krumm, Dan

More information

Strength of functional signature correlates with effect size in autism

Strength of functional signature correlates with effect size in autism Ballouz and Gillis Genome Medicine (217) 9:64 DOI 1.1186/s1373-17-455-8 RESEARCH Open Access Strength of functional signature correlates with effect size in autism Sara Ballouz and Jesse Gillis * Abstract

More information

Nature Neuroscience: doi: /nn Supplementary Figure 1

Nature Neuroscience: doi: /nn Supplementary Figure 1 Supplementary Figure 1 Illustration of the working of network-based SVM to confidently predict a new (and now confirmed) ASD gene. Gene CTNND2 s brain network neighborhood that enabled its prediction by

More information

VARPRISM: incorporating variant prioritization in tests of de novo mutation association

VARPRISM: incorporating variant prioritization in tests of de novo mutation association Hu et al. Genome Medicine (2016) 8:91 DOI 10.1186/s13073-016-0341-9 SOFTWARE VARPRISM: incorporating variant prioritization in tests of de novo mutation association Hao Hu 1, Hilary Coon 2, Man Li 3, Mark

More information

The Amazing Brain Webinar Series: Select Topics in Neuroscience and Child Development for the Clinician

The Amazing Brain Webinar Series: Select Topics in Neuroscience and Child Development for the Clinician The Amazing Brain Webinar Series: Select Topics in Neuroscience and Child Development for the Clinician Part VII Recent Advances in the Genetics of Autism Spectrum Disorders Abha R. Gupta, MD, PhD Jointly

More information

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 PGAR: ASD Candidate Gene Prioritization System Using Expression Patterns Steven Cogill and Liangjiang Wang Department of Genetics and

More information

SCALPEL MICRO-ASSEMBLY APPROACH TO DETECT INDELS WITHIN EXOME-CAPTURE DATA. Giuseppe Narzisi, PhD Schatz Lab

SCALPEL MICRO-ASSEMBLY APPROACH TO DETECT INDELS WITHIN EXOME-CAPTURE DATA. Giuseppe Narzisi, PhD Schatz Lab SCALPEL MICRO-ASSEMBLY APPROACH TO DETECT INDELS WITHIN EXOME-CAPTURE DATA Giuseppe Narzisi, PhD Schatz Lab November 14, 2013 Micro-Assembly Approach to detect INDELs 2 Outline Scalpel micro-assembly pipeline

More information

DAWN: a framework to identify autism genes and subnetworks using gene expression and genetics

DAWN: a framework to identify autism genes and subnetworks using gene expression and genetics Liu et al. Molecular Autism 2014, 5:22 RESEARCH Open Access DAWN: a framework to identify autism genes and subnetworks using gene expression and genetics Li Liu 1,JingLei 1, Stephan J Sanders 2,3, Arthur

More information

Reducing INDEL calling errors in whole genome and exome sequencing data.

Reducing INDEL calling errors in whole genome and exome sequencing data. Reducing INDEL calling errors in whole genome and exome sequencing data. Han Fang November 8, 2014 CSHL Biological Data Science Meeting Acknowledgments Lyon Lab Yiyang Wu Jason O Rawe Laura J Barron Max

More information

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies 2017 Contents Datasets... 2 Protein-protein interaction dataset... 2 Set of known PPIs... 3 Domain-domain interactions...

More information

Journal: Nature Methods

Journal: Nature Methods Journal: Nature Methods Article Title: Network-based stratification of tumor mutations Corresponding Author: Trey Ideker Supplementary Item Supplementary Figure 1 Supplementary Figure 2 Supplementary Figure

More information

Supplementary Figure 1

Supplementary Figure 1 Supplementary Figure 1 An example of the gene-term-disease network automatically generated by Phenolyzer web server for 'autism'. The largest word represents the user s input term, Autism. The pink round

More information

Nature Genetics: doi: /ng Supplementary Figure 1. PCA for ancestry in SNV data.

Nature Genetics: doi: /ng Supplementary Figure 1. PCA for ancestry in SNV data. Supplementary Figure 1 PCA for ancestry in SNV data. (a) EIGENSTRAT principal-component analysis (PCA) of SNV genotype data on all samples. (b) PCA of only proband SNV genotype data. (c) PCA of SNV genotype

More information

RACP Congress 2017 Genetics of Intellectual Disability and Autism: Past Present and Future 9 th May 2017

RACP Congress 2017 Genetics of Intellectual Disability and Autism: Past Present and Future 9 th May 2017 RACP Congress 2017 Genetics of Intellectual Disability and Autism: Past Present and Future 9 th May 2017 Why causation? Explanation for family Prognosis Recurrence risk and reproductive options Guide medical

More information

Van test naar diagnose naar

Van test naar diagnose naar Van test naar diagnose naar V therapie op maat Marjolein Kriek, LUMC Joris Veltman, RUNMC Exome diagnostics in genetically heterogeneous disease Joris Veltman, PhD Department of Human Genetics Radboud

More information

What can genetic studies tell us about ADHD? Dr Joanna Martin, Cardiff University

What can genetic studies tell us about ADHD? Dr Joanna Martin, Cardiff University What can genetic studies tell us about ADHD? Dr Joanna Martin, Cardiff University Outline of talk What do we know about causes of ADHD? Traditional family studies Modern molecular genetic studies How can

More information

SFARI Base SSC/Simons VIP/SPARK Researcher Welcome Packet. Last Updated: Friday, December 14, 2018

SFARI Base SSC/Simons VIP/SPARK Researcher Welcome Packet. Last Updated: Friday, December 14, 2018 SFARI Base SSC/Simons VIP/SPARK Researcher Welcome Packet Last Updated: Friday, December 14, 2018 Table of Contents SSC PARTICIPATION CRITERIA 1 Participants in the SSC 1 Inclusion Criteria 1 Exclusion

More information

Illuminating the genetics of complex human diseases

Illuminating the genetics of complex human diseases Illuminating the genetics of complex human diseases Michael Schatz Sept 27, 2012 Beyond the Genome @mike_schatz / #BTG2012 Outline 1. De novo mutations in human diseases 1. Autism Spectrum Disorder 2.

More information

IMPaLA tutorial.

IMPaLA tutorial. IMPaLA tutorial http://impala.molgen.mpg.de/ 1. Introduction IMPaLA is a web tool, developed for integrated pathway analysis of metabolomics data alongside gene expression or protein abundance data. It

More information

Recurrent de novo mutations in neurodevelopmental disorders: properties and clinical implications

Recurrent de novo mutations in neurodevelopmental disorders: properties and clinical implications Wilfert et al. Genome Medicine (2017) 9:101 DOI 10.1186/s13073-017-0498-x REVIEW Recurrent de novo mutations in neurodevelopmental disorders: properties and clinical implications Amy B. Wilfert 1, Arvis

More information

Variant Detection & Interpretation in a diagnostic context. Christian Gilissen

Variant Detection & Interpretation in a diagnostic context. Christian Gilissen Variant Detection & Interpretation in a diagnostic context Christian Gilissen c.gilissen@gen.umcn.nl 28-05-2013 So far Sequencing Johan den Dunnen Marja Jakobs Ewart de Bruijn Mapping Victor Guryev Variant

More information

Gene Ontology and Functional Enrichment. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein

Gene Ontology and Functional Enrichment. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein Gene Ontology and Functional Enrichment Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein The parsimony principle: A quick review Find the tree that requires the fewest

More information

Data mining with Ensembl Biomart. Stéphanie Le Gras

Data mining with Ensembl Biomart. Stéphanie Le Gras Data mining with Ensembl Biomart Stéphanie Le Gras (slegras@igbmc.fr) Guidelines Genome data Genome browsers Getting access to genomic data: Ensembl/BioMart 2 Genome Sequencing Example: Human genome 2000:

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature13908 Supplementary Tables Supplementary Table 1: Families in this study (.xlsx) All families included in the study are listed. For each family, we show: the genders of the probands and

More information

TOWARDS ACCURATE GERMLINE AND SOMATIC INDEL DISCOVERY WITH MICRO-ASSEMBLY. Giuseppe Narzisi, PhD Bioinformatics Scientist

TOWARDS ACCURATE GERMLINE AND SOMATIC INDEL DISCOVERY WITH MICRO-ASSEMBLY. Giuseppe Narzisi, PhD Bioinformatics Scientist TOWARDS ACCURATE GERMLINE AND SOMATIC INDEL DISCOVERY WITH MICRO-ASSEMBLY Giuseppe Narzisi, PhD Bioinformatics Scientist July 29, 2014 Micro-Assembly Approach to detect INDELs 2 Outline 1 Detecting INDELs:

More information

Network-assisted data analysis

Network-assisted data analysis Network-assisted data analysis Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Protein identification in shotgun proteomics Protein digestion LC-MS/MS Protein

More information

ISPG Residency Education Taskforce

ISPG Residency Education Taskforce ISPG Residency Education Taskforce What does genetics have to do with psychiatry? - psychiatric illnesses run in families - the major psychiatric disorders have a high heritability - specific genes may

More information

Comparing Multifunctionality and Association Information when Classifying Oncogenes and Tumor Suppressor Genes

Comparing Multifunctionality and Association Information when Classifying Oncogenes and Tumor Suppressor Genes 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Cancer Informatics Lecture

Cancer Informatics Lecture Cancer Informatics Lecture Mayo-UIUC Computational Genomics Course June 22, 2018 Krishna Rani Kalari Ph.D. Associate Professor 2017 MFMER 3702274-1 Outline The Cancer Genome Atlas (TCGA) Genomic Data Commons

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary information S1 (table) Examples of genes associated with ASD and their main clinical phenotypes. Genes Observations Chromatin remodelling MECP2 Females with a mutation in the X-linked MECP2

More information

Table of content. -Supplementary methods. -Figure S1. -Figure S2. -Figure S3. -Table legend

Table of content. -Supplementary methods. -Figure S1. -Figure S2. -Figure S3. -Table legend Table of content -Supplementary methods -Figure S1 -Figure S2 -Figure S3 -Table legend Supplementary methods Yeast two-hybrid bait basal transactivation test Because bait constructs sometimes self-transactivate

More information

De Novo Gene Disruptions in Children on the Autistic Spectrum

De Novo Gene Disruptions in Children on the Autistic Spectrum Article De Novo Gene Disruptions in Children on the Autistic Spectrum Ivan Iossifov, 1,6 Michael Ronemus, 1,6 Dan Levy, 1 Zihua Wang, 1 Inessa Hakker, 1 Julie Rosenbaum, 1 Boris Yamrom, 1 Yoon-ha Lee,

More information

Medical Advisory Council: Verified

Medical Advisory Council: Verified What is White Sutton Syndrome? White Sutton Syndrome (WHSUS) is a condition characterized by autism and developmental delay and/or intellectual disability, as well as a characteristic facial profile. Children

More information

Clustering with Genotype and Phenotype Data to Identify Subtypes of Autism

Clustering with Genotype and Phenotype Data to Identify Subtypes of Autism Clustering with Genotype and Phenotype Data to Identify Subtypes of Autism Project Category: Life Science Rocky Aikens (raikens) Brianna Kozemzak (kozemzak) December 16, 2017 1 Introduction and Motivation

More information

Sequencing studies implicate inherited mutations in autism

Sequencing studies implicate inherited mutations in autism NEWS Sequencing studies implicate inherited mutations in autism BY EMILY SINGER 23 JANUARY 2013 1 / 5 Unusual inheritance: Researchers have found a relatively mild mutation in a gene linked to Cohen syndrome,

More information

Autism Pathways Analysis: A Functional Framework and Clues for Further Investigation. Martha Herbert PhD MD Ya Wen PhD July 2016

Autism Pathways Analysis: A Functional Framework and Clues for Further Investigation. Martha Herbert PhD MD Ya Wen PhD July 2016 Autism Pathways Analysis: A Functional Framework and Clues for Further Investigation Martha Herbert PhD MD Ya Wen PhD July 2016 1 Report on pathway network analyses in autism, based on open-access paper

More information

Integrated Model of De Novo and Inherited Genetic Variants Yields Greater Power to Identify Risk Genes

Integrated Model of De Novo and Inherited Genetic Variants Yields Greater Power to Identify Risk Genes Integrated Model of De Novo and Inherited Genetic Variants Yields Greater Power to Identify Risk Genes Xin He 1, Stephan J. Sanders 2, Li Liu 3, Silvia De Rubeis 4,5, Elaine T. Lim 6,7, James S. Sutcliffe

More information

An expanded view of complex traits: from polygenic to omnigenic

An expanded view of complex traits: from polygenic to omnigenic BIRS 2017 An expanded view of complex traits: from polygenic to omnigenic How does human genetic variation drive variation in complex traits/disease risk? Yang I Li Stanford University Evan Boyle Jonathan

More information

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Department of Biomedical Informatics Department of Computer Science and Engineering The Ohio State University Review

More information

Big Data Meets DNA How Biological Data Science is improving our health, foods, and energy needs

Big Data Meets DNA How Biological Data Science is improving our health, foods, and energy needs Big Data Meets DNA How Biological Data Science is improving our health, foods, and energy needs Michael Schatz April 8, 2014 IEEE Fellows Night Syracuse @mike_schatz The secret of life Your DNA, along

More information

Talking Genomes with Your Patients. Meagan Cochran, MS, CGC Certified Genetic Counselor HudsonAlpha Institute for Biotechnology

Talking Genomes with Your Patients. Meagan Cochran, MS, CGC Certified Genetic Counselor HudsonAlpha Institute for Biotechnology Talking Genomes with Your Patients Meagan Cochran, MS, CGC Certified Genetic Counselor HudsonAlpha Institute for Biotechnology Objectives Review the importance of physician familiarity with genomic testing

More information

Gene-microRNA network module analysis for ovarian cancer

Gene-microRNA network module analysis for ovarian cancer Gene-microRNA network module analysis for ovarian cancer Shuqin Zhang School of Mathematical Sciences Fudan University Oct. 4, 2016 Outline Introduction Materials and Methods Results Conclusions Introduction

More information

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies

More information

The next 10 years of quantitative biology

The next 10 years of quantitative biology The next 10 years of quantitative biology Michael Schatz March 25, 2014 Keystone Meeting on Big Data in Biology @mike_schatz / #KSBigData Unsolved Questions in Biology What is your genome sequence? How

More information

Evaluating Classifiers for Disease Gene Discovery

Evaluating Classifiers for Disease Gene Discovery Evaluating Classifiers for Disease Gene Discovery Kino Coursey Lon Turnbull khc0021@unt.edu lt0013@unt.edu Abstract Identification of genes involved in human hereditary disease is an important bioinfomatics

More information

Clinical Genetics. Functional validation in a diagnostic context. Robert Hofstra. Leading the way in genetic issues

Clinical Genetics. Functional validation in a diagnostic context. Robert Hofstra. Leading the way in genetic issues Clinical Genetics Leading the way in genetic issues Functional validation in a diagnostic context Robert Hofstra Clinical Genetics Leading the way in genetic issues Future application of exome sequencing

More information

Introduction. Introduction

Introduction. Introduction Introduction We are leveraging genome sequencing data from The Cancer Genome Atlas (TCGA) to more accurately define mutated and stable genes and dysregulated metabolic pathways in solid tumors. These efforts

More information

Nature Methods: doi: /nmeth.3115

Nature Methods: doi: /nmeth.3115 Supplementary Figure 1 Analysis of DNA methylation in a cancer cohort based on Infinium 450K data. RnBeads was used to rediscover a clinically distinct subgroup of glioblastoma patients characterized by

More information

Predication-based Bayesian network analysis of gene sets and knowledge-based SNP abstractions

Predication-based Bayesian network analysis of gene sets and knowledge-based SNP abstractions Predication-based Bayesian network analysis of gene sets and knowledge-based SNP abstractions Skanda Koppula Second Annual MIT PRIMES Conference May 20th, 2012 Mentors: Dr. Gil Alterovitz and Dr. Amin

More information

Nature Genetics: doi: /ng Supplementary Figure 1

Nature Genetics: doi: /ng Supplementary Figure 1 Supplementary Figure 1 Illustrative example of ptdt using height The expected value of a child s polygenic risk score (PRS) for a trait is the average of maternal and paternal PRS values. For example,

More information

American Psychiatric Nurses Association

American Psychiatric Nurses Association Francis J. McMahon International Society of Psychiatric Genetics Johns Hopkins University School of Medicine Dept. of Psychiatry Human Genetics Branch, National Institute of Mental Health* * views expressed

More information

Clinical Genome Knowledge Base and Linked Data technologies. Aleksandar Milosavljevic

Clinical Genome Knowledge Base and Linked Data technologies. Aleksandar Milosavljevic Clinical Genome Knowledge Base and Linked Data technologies Aleksandar Milosavljevic Topics 1. ClinGen Resource project 2. Building the Clinical Genome Knowledge Base 3. Linked Data technologies 4. Using

More information

Using SuSPect to Predict the Phenotypic Effects of Missense Variants. Chris Yates UCL Cancer Institute

Using SuSPect to Predict the Phenotypic Effects of Missense Variants. Chris Yates UCL Cancer Institute Using SuSPect to Predict the Phenotypic Effects of Missense Variants Chris Yates UCL Cancer Institute c.yates@ucl.ac.uk Outline SAVs and Disease Development of SuSPect Features included Feature selection

More information

Big Data Meets DNA How Biological Data Science is improving our health, foods, and energy needs

Big Data Meets DNA How Biological Data Science is improving our health, foods, and energy needs Big Data Meets DNA How Biological Data Science is improving our health, foods, and energy needs Michael Schatz June 18, 2014 CSHL Public Lecture Series DNA: The secret of life Your DNA, along with your

More information

VL Network Analysis ( ) SS2016 Week 3

VL Network Analysis ( ) SS2016 Week 3 VL Network Analysis (19401701) SS2016 Week 3 Based on slides by J Ruan (U Texas) Tim Conrad AG Medical Bioinformatics Institut für Mathematik & Informatik, Freie Universität Berlin 1 Motivation 2 Lecture

More information

ARTICLE RESEARCH. Macmillan Publishers Limited. All rights reserved

ARTICLE RESEARCH. Macmillan Publishers Limited. All rights reserved Extended Data Figure 6 Annotation of drivers based on clinical characteristics and co-occurrence patterns. a, Putative drivers affecting greater than 10 patients were assessed for enrichment in IGHV mutated

More information

Lecture 20. Disease Genetics

Lecture 20. Disease Genetics Lecture 20. Disease Genetics Michael Schatz April 12 2018 JHU 600.749: Applied Comparative Genomics Part 1: Pre-genome Era Sickle Cell Anaemia Sickle-cell anaemia (SCA) is an abnormality in the oxygen-carrying

More information

Autism genetics: searching for specificity and convergence

Autism genetics: searching for specificity and convergence R E V I E W Autism genetics: searching for specificity and convergence Jamee M Berg 1,2,3 and Daniel H Geschwind 2,3,4 * Abstract Advances in genetics and genomics have improved our understanding of autism

More information

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach)

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach) High-Throughput Sequencing Course Gene-Set Analysis Biostatistics and Bioinformatics Summer 28 Section Introduction What is Gene Set Analysis? Many names for gene set analysis: Pathway analysis Gene set

More information

Supplement to SCnorm: robust normalization of single-cell RNA-seq data

Supplement to SCnorm: robust normalization of single-cell RNA-seq data Supplement to SCnorm: robust normalization of single-cell RNA-seq data Supplementary Note 1: SCnorm does not require spike-ins, since we find that the performance of spike-ins in scrna-seq is often compromised,

More information

Characterizing intra-host influenza virus populations to predict emergence

Characterizing intra-host influenza virus populations to predict emergence Characterizing intra-host influenza virus populations to predict emergence June 12, 2012 Forum on Microbial Threats Washington, DC Elodie Ghedin Center for Vaccine Research Dept. Computational & Systems

More information

NGS in Cancer Pathology After the Microscope: From Nucleic Acid to Interpretation

NGS in Cancer Pathology After the Microscope: From Nucleic Acid to Interpretation NGS in Cancer Pathology After the Microscope: From Nucleic Acid to Interpretation Michael R. Rossi, PhD, FACMG Assistant Professor Division of Cancer Biology, Department of Radiation Oncology Department

More information

Detection of copy number variations in PCR-enriched targeted sequencing data

Detection of copy number variations in PCR-enriched targeted sequencing data Detection of copy number variations in PCR-enriched targeted sequencing data German Demidov Parseq Lab, Saint-Petersburg University of Russian Academy of Sciences, current: Center for Genomic Regulation

More information

Application of Artificial Neural Networks in Classification of Autism Diagnosis Based on Gene Expression Signatures

Application of Artificial Neural Networks in Classification of Autism Diagnosis Based on Gene Expression Signatures Application of Artificial Neural Networks in Classification of Autism Diagnosis Based on Gene Expression Signatures 1 2 3 4 5 Kathleen T Quach Department of Neuroscience University of California, San Diego

More information

Genetics and Genetic Testing for Autism:

Genetics and Genetic Testing for Autism: STAR Training 2/22/2018 Genetics and Genetic Testing for Autism: Demystifying the Journey to Find a Cause Alyssa (Ah leesa) Blesson, MGC, CGC Certified Genetic Counselor Center for Autism and Related Disorders

More information

Leman Akoglu Hanghang Tong Jilles Vreeken. Polo Chau Nikolaj Tatti Christos Faloutsos SIAM International Conference on Data Mining (SDM 2013)

Leman Akoglu Hanghang Tong Jilles Vreeken. Polo Chau Nikolaj Tatti Christos Faloutsos SIAM International Conference on Data Mining (SDM 2013) Leman Akoglu Hanghang Tong Jilles Vreeken Polo Chau Nikolaj Tatti Christos Faloutsos 2013 SIAM International Conference on Data Mining (SDM 2013) What can we say about this list of authors? Use relational

More information

User s Manual Version 1.0

User s Manual Version 1.0 User s Manual Version 1.0 #639 Longmian Avenue, Jiangning District, Nanjing,211198,P.R.China. http://tcoa.cpu.edu.cn/ Contact us at xiaosheng.wang@cpu.edu.cn for technical issue and questions Catalogue

More information

SFARI Gene 2.0 User Guide

SFARI Gene 2.0 User Guide 1 SFARI Gene 2.0 User Guide This document is designed to acquaint the new user with SFARI Gene release 4.0, an integrated resource for autism research, and to provide enough information to allow the user

More information

Micro-RNA web tools. Introduction. UBio Training Courses. mirnas, target prediction, biology. Gonzalo

Micro-RNA web tools. Introduction. UBio Training Courses. mirnas, target prediction, biology. Gonzalo Micro-RNA web tools UBio Training Courses Gonzalo Gómez//ggomez@cnio.es Introduction mirnas, target prediction, biology Experimental data Network Filtering Pathway interpretation mirs-pathways network

More information

Coexpression Networks Implicate Human Midfetal Deep Cortical Projection Neurons in the Pathogenesis of Autism

Coexpression Networks Implicate Human Midfetal Deep Cortical Projection Neurons in the Pathogenesis of Autism Coexpression Networks Implicate Human Midfetal Deep Cortical Projection Neurons in the Pathogenesis of Autism A. Jeremy Willsey, 1,2 Stephan J. Sanders, 1,2 Mingfeng Li, 3,4 Shan Dong, 1, Andrew T. Tebbenkamp,

More information

Probabilistic retrieval and visualization of relevant experiments

Probabilistic retrieval and visualization of relevant experiments Probabilistic retrieval and visualization of relevant experiments Samuel Kaski Joint work with: José Caldas, Nils Gehlenborg, Ali Faisal, Alvis Brazma Motivation 2 How to best use collections of measurement

More information

Rumor Detection on Twitter with Tree-structured Recursive Neural Networks

Rumor Detection on Twitter with Tree-structured Recursive Neural Networks 1 Rumor Detection on Twitter with Tree-structured Recursive Neural Networks Jing Ma 1, Wei Gao 2, Kam-Fai Wong 1,3 1 The Chinese University of Hong Kong 2 Victoria University of Wellington, New Zealand

More information

Proposal form for the evaluation of a genetic test for NHS Service Gene Dossier

Proposal form for the evaluation of a genetic test for NHS Service Gene Dossier Proposal form for the evaluation of a genetic test for NHS Service Gene Dossier Test Disease Population Triad Disease name and description (please provide any alternative names you wish listed) (A)-Testing

More information

A framework for the interpretation of de novo mutation in human disease

A framework for the interpretation of de novo mutation in human disease A framework for the interpretation of de novo mutation in human disease The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters. Citation

More information

Research Article Breast Cancer Prognosis Risk Estimation Using Integrated Gene Expression and Clinical Data

Research Article Breast Cancer Prognosis Risk Estimation Using Integrated Gene Expression and Clinical Data BioMed Research International, Article ID 459203, 15 pages http://dx.doi.org/10.1155/2014/459203 Research Article Breast Cancer Prognosis Risk Estimation Using Integrated Gene Expression and Clinical Data

More information

1. Create a mutation rate table from intergenic SNPs for all possible trinucleotide to trinucleotide changes

1. Create a mutation rate table from intergenic SNPs for all possible trinucleotide to trinucleotide changes 1. Create a mutation rate table from intergenic SNPs for all possible trinucleotide to trinucleotide changes ATCGGCTGG ATCGACTGG CCTAGCTAA CCTGGCTAA CTCACCGGA CTCACTGGA Change AAA ACA AAA AGA AAA ATA AAC

More information

Characteriza*on of Soma*c Muta*ons in Cancer Genomes

Characteriza*on of Soma*c Muta*ons in Cancer Genomes Characteriza*on of Soma*c Muta*ons in Cancer Genomes Ben Raphael Department of Computer Science Center for Computa*onal Molecular Biology Soma*c Muta*ons and Cancer Clonal Theory (Nowell 1976) Passenger

More information

The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis

The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis Tieliu Shi tlshi@bio.ecnu.edu.cn The Center for bioinformatics

More information

AVENIO ctdna Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB

AVENIO ctdna Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB Analysis Kits Next-generation performance in liquid biopsies 2 Accelerating clinical research From liquid biopsy to next-generation

More information

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China A Vision-based Affective Computing System Jieyu Zhao Ningbo University, China Outline Affective Computing A Dynamic 3D Morphable Model Facial Expression Recognition Probabilistic Graphical Models Some

More information

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis Jian Xu, Ph.D. Children s Research Institute, UTSW Introduction Outline Overview of genomic and next-gen sequencing technologies

More information

p e r s p e c t i v e

p e r s p e c t i v e n e u r o g e n o m i c s Genome-scale neurogenetics: methodology and meaning Steven A McCarroll 1,2, Guoping Feng 1,3,4 & Steven E Hyman 1,5 Genetic analysis is currently offering glimpses into molecular

More information

The Immune Epitope Database Analysis Resource: MHC class I peptide binding predictions. Edita Karosiene, Ph.D.

The Immune Epitope Database Analysis Resource: MHC class I peptide binding predictions. Edita Karosiene, Ph.D. The Immune Epitope Database Analysis Resource: MHC class I peptide binding predictions Edita Karosiene, Ph.D. edita@liai.org IEDB Workshop October 29, 2015 Outline Introduction MHC-I peptide binding prediction

More information

Reporting TP53 gene analysis results in CLL

Reporting TP53 gene analysis results in CLL Reporting TP53 gene analysis results in CLL Mutations in TP53 - From discovery to clinical practice in CLL Discovery Validation Clinical practice Variant diversity *Leroy at al, Cancer Research Review

More information

Name: BIOS 703 MIDTERM EXAMINATIONS (5 marks per question, total = 100 marks)

Name: BIOS 703 MIDTERM EXAMINATIONS (5 marks per question, total = 100 marks) Name: BIOS 703 MIDTERM EXAMINATIONS (5 marks per question, total = 100 marks) You will have 75 minutest to complete this examination. Some of the questions refer to Crizotinib in ROS1- Rearranged Non Small-

More information

Contents. Just Classifier? Rules. Rules: example. Classification Rule Generation for Bioinformatics. Rule Extraction from a trained network

Contents. Just Classifier? Rules. Rules: example. Classification Rule Generation for Bioinformatics. Rule Extraction from a trained network Contents Classification Rule Generation for Bioinformatics Hyeoncheol Kim Rule Extraction from Neural Networks Algorithm Ex] Promoter Domain Hybrid Model of Knowledge and Learning Knowledge refinement

More information

Research Strategy: 1. Background and Significance

Research Strategy: 1. Background and Significance Research Strategy: 1. Background and Significance 1.1. Heterogeneity is a common feature of cancer. A better understanding of this heterogeneity may present therapeutic opportunities: Intratumor heterogeneity

More information

PSSV User Manual (V1.0)

PSSV User Manual (V1.0) PSSV User Manual (V1.0) 1. Introduction A novel pattern-based probabilistic approach, PSSV, is developed to identify somatic structural variations from WGS data. Specifically, discordant and concordant

More information

Discovery of Rare Mutations in Autism: Elucidating Neurodevelopmental Mechanisms

Discovery of Rare Mutations in Autism: Elucidating Neurodevelopmental Mechanisms Neurotherapeutics (2015) 12:553 571 DOI 10.1007/s13311-015-0363-9 REVIEW Discovery of Rare Mutations in Autism: Elucidating Neurodevelopmental Mechanisms Ece D. Gamsiz 1,4 & Laura N. Sciarra 1,2 & Abbie

More information

Predicting Sleep Using Consumer Wearable Sensing Devices

Predicting Sleep Using Consumer Wearable Sensing Devices Predicting Sleep Using Consumer Wearable Sensing Devices Miguel A. Garcia Department of Computer Science Stanford University Palo Alto, California miguel16@stanford.edu 1 Introduction In contrast to the

More information

8/23/2016 1:50 PM. As researchers sequence the DNA of thousands of kids with autism, dozens of genetic subgroups are emerging.

8/23/2016 1:50 PM. As researchers sequence the DNA of thousands of kids with autism, dozens of genetic subgroups are emerging. Se The Scientist» August 2016 Issue» Features As researchers sequence the DNA of thousands of kids with autism, dozens of genetic subgroups are emerging. By Megan Scudellari August 1, 2016 1.1K 25 181

More information

Clasificación Molecular del Cáncer de Próstata. JM Piulats

Clasificación Molecular del Cáncer de Próstata. JM Piulats Clasificación Molecular del Cáncer de Próstata JM Piulats Introduction The Gleason score is the major method for prostate cancer tissue grading and the most important prognostic factor in this disease.

More information

Genetic Counselling in relation to genetic testing

Genetic Counselling in relation to genetic testing Genetic Counselling in relation to genetic testing Dr Julie Vogt Consultant Geneticist West Midlands Regional Genetics Service September 2016 Disclosures for Research Support/P.I. Employee Consultant Major

More information

Gene Expression Analysis Web Forum. Jonathan Gerstenhaber Field Application Specialist

Gene Expression Analysis Web Forum. Jonathan Gerstenhaber Field Application Specialist Gene Expression Analysis Web Forum Jonathan Gerstenhaber Field Application Specialist Our plan today: Import Preliminary Analysis Statistical Analysis Additional Analysis Downstream Analysis 2 Copyright

More information

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models White Paper 23-12 Estimating Complex Phenotype Prevalence Using Predictive Models Authors: Nicholas A. Furlotte Aaron Kleinman Robin Smith David Hinds Created: September 25 th, 2015 September 25th, 2015

More information

Supplementary Figure 1. Experimental paradigm. A combination of genome and exome sequencing coupled with array-comparative genome hybridization was

Supplementary Figure 1. Experimental paradigm. A combination of genome and exome sequencing coupled with array-comparative genome hybridization was Supplementary Figure 1. Experimental paradigm. A combination of genome and exome sequencing coupled with array-comparative genome hybridization was performed on a total of 85 SS patients. Data filtration

More information