An expanded view of complex traits: from polygenic to omnigenic

Similar documents
BST227: Introduction to Statistical Genetics

Heritability enrichment of differentially expressed genes. Hilary Finucane PGC Statistical Analysis Call January 26, 2016

Lecture 20. Disease Genetics

An Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018

QTs IV: miraculous and missing heritability

Use and Interpreta,on of LD Score Regression. Brendan Bulik- Sullivan PGC Stat Analysis Call

Epigenetics. Jenny van Dongen Vrije Universiteit (VU) Amsterdam Boulder, Friday march 10, 2017

The genetics of complex traits Amazing progress (much by ppl in this room)

Nature Genetics: doi: /ng Supplementary Figure 1

Introduction of Genome wide Complex Trait Analysis (GCTA) Presenter: Yue Ming Chen Location: Stat Gen Workshop Date: 6/7/2013

Introduction to the Genetics of Complex Disease

Quantitative genetics: traits controlled by alleles at many loci

Heritability and genetic correlations explained by common SNPs for MetS traits. Shashaank Vattikuti, Juen Guo and Carson Chow LBM/NIDDK

Interaction of Genes and the Environment

Introduction to Genetics and Genomics

Nature Genetics: doi: /ng Supplementary Figure 1

Missing Heritablility How to Analyze Your Own Genome Fall 2013

Supplementary Figures

New Enhancements: GWAS Workflows with SVS

Supplementary Figures

5/2/18. After this class students should be able to: Stephanie Moon, Ph.D. - GWAS. How do we distinguish Mendelian from non-mendelian traits?

Rare Variant Burden Tests. Biostatistics 666

Introduction to linkage and family based designs to study the genetic epidemiology of complex traits. Harold Snieder

Emerging gene)cs in schizophrenia: Real challenges for crea)ng animal models

IS IT GENETIC? How do genes, environment and chance interact to specify a complex trait such as intelligence?

Imaging Genetics: Heritability, Linkage & Association

Dan Koller, Ph.D. Medical and Molecular Genetics

NIH Public Access Author Manuscript Nat Genet. Author manuscript; available in PMC 2012 September 01.

Supplementary Figure 1. Nature Genetics: doi: /ng.3736

MULTIFACTORIAL DISEASES. MG L-10 July 7 th 2014

Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance components analysis

Taking a closer look at trio designs and unscreened controls in the GWAS era

Chromatin marks identify critical cell-types for fine-mapping complex trait variants

Whole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute

Genome-wide Association Analysis Applied to Asthma-Susceptibility Gene. McCaw, Z., Wu, W., Hsiao, S., McKhann, A., Tracy, S.

Interaction of Genes and the Environment

American Psychiatric Nurses Association

The Inheritance of Complex Traits

CS2220 Introduction to Computational Biology

Session 6: Integration of epigenetic data. Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016

Discontinuous Traits. Chapter 22. Quantitative Traits. Types of Quantitative Traits. Few, distinct phenotypes. Also called discrete characters

Research Article Identifying Liver Cancer-Related Enhancer SNPs by Integrating GWAS and Histone Modification ChIP-seq Data

DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

What can genetic studies tell us about ADHD? Dr Joanna Martin, Cardiff University

Systems genetic evidence for a convergence of epilepsy and its co-morbidities on shared molecular pathways

Developing and evaluating polygenic risk prediction models for stratified disease prevention

Genes, Diseases and Lisa How an advanced ICT research infrastructure contributes to our health

Mining the Human Phenome Using Allelic Scores That Index Biological Intermediates

Nature Neuroscience: doi: /nn Supplementary Figure 1

3) It is not clear to me why the authors exclude blond hair from the red hair GWAS, and blond and red hair from the brown hair GWAS.

Genetics of COPD Prof. Ian P Hall

Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection

QTL Studies- Past, Present and Future. David Evans

Statistical Genetics. Matthew Stephens. Statistics Retreat, October 26th 2012

EIGEN: A SPECTRAL APPROACH TO THE INTEGRATION OF FUNCTIONAL GENOMICS ANNOTATIONS FOR BOTH CODING AND NONCODING SEQUENCE VARIANTS

Multiplex target enrichment using DNA indexing for ultra-high throughput variant detection

Mapping by recurrence and modelling the mutation rate

Extended Abstract prepared for the Integrating Genetics in the Social Sciences Meeting 2014

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

Title: Chapter 5 Recorded Lecture. Speaker: Amit Dhingra Created by: (remove if same as speaker) online.wsu.edu

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory

Detection and Validation of Clinically Relevant High Order Epistatic Interactions in a BRCA2 Positive Breast Cancer Population

Genetics of Behavior (Learning Objectives)

Integrated post-gwas analysis shed new light on the disease. mechanisms of schizophrenia

Fragile X Syndrome. Genetics, Epigenetics & the Role of Unprogrammed Events in the expression of a Phenotype

Notable papers in autism research in 2018

HHS Public Access Author manuscript Nat Genet. Author manuscript; available in PMC 2015 September 01.

Summary & general discussion

Genetics and Neurobiology of Mood Disorders

Cognitive, affective, & social neuroscience

The road to precision psychiatry: translating genetics into disease mechanisms

An emotional rollercoaster? - An update of affective lability in bipolar disorder. Sofie Ragnhild Aminoff Postdoc, NORMENT

GENOME-WIDE ASSOCIATION STUDIES

ANALYZING VARIATION IN EXPRESSION PROFILES AND EQTLS FOR GENES ASSOCIATED WITH SCHIZOPHRENIA BETWEEN CORTICAL BRAIN REGIONS

Nature Structural & Molecular Biology: doi: /nsmb.2419

Workgroup Webinar Tuesday, May 26, :00 p.m.

Discordance for exposures and disease phenotypes

Accepted Manuscript. Predicting polygenic risk of psychiatric disorders

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes.

Update on the Genetics of Autism and Rett syndrome

Insights into Complex Brain Functions Related to Schizophrenia Disorder through Causal Network Analysis

Genetics and Genomics in Medicine Chapter 8 Questions

GWAS mega-analysis. Speakers: Michael Metzker, Douglas Blackwood, Andrew Feinberg, Nicholas Schork, Benjamin Pickard

You submitted this quiz on Mon 6 May :02 PM IST (UTC +0530). You got a score of out of Your Answer Score Explanation

Using Network Flow to Bridge the Gap between Genotype and Phenotype. Teresa Przytycka NIH / NLM / NCBI

Your DNA extractions! 10 kb

Chapter 5 INTERACTIONS OF GENES AND THE ENVIRONMENT

Epigenetics: The Future of Psychology & Neuroscience. Richard E. Brown Psychology Department Dalhousie University Halifax, NS, B3H 4J1

Field wide development of analytic approaches for sequence data

Liver and adipose expression associated SNPs are enriched for association to Type 2 Diabetes

p e r s p e c t i v e

AN INTRODUCTION TO BEHAVIOR GENETICS. Terence J. Bazzett. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts 01375

Gamma gene expression in haemoglobin disorders

Novel Targets of disease modifying therapy for Parkinson disease. David G. Standaert, MD, PhD John N. Whitaker Professor and Chair of Neurology

Components of heritability in an Icelandic cohort

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

Genetics of Behavior (Learning Objectives)

BST227 Introduction to Statistical Genetics. Lecture 4: Introduction to linkage and association analysis

Transcription:

BIRS 2017 An expanded view of complex traits: from polygenic to omnigenic How does human genetic variation drive variation in complex traits/disease risk? Yang I Li Stanford University Evan Boyle Jonathan Pritchard

What did we learn from genome-wide association studies?! Very polygenic architecture.! Effect sizes are generally very small. (Kaiser 2012, Science)

! GWAS significant hits explain only a small % of heritability. (Maher 2008, Nature)

Complex traits variation mostly explained by common variants! GWAS significant hits explain only a small % of heritability.! But considering all common SNPs explains most of heritability.

Of course there may be some exceptions...! Rare variants and de novo mutations play larger role in e.g. Autism. (Sullivan et al., biorxiv, 2017)

Regulatory variation, not coding variation, drive variation in complex traits - Over 90% of GWAS SNPs are noncoding - Enrichment in chromatin marks, eqtls, sqtls in relevant cell-types (Farh et al., 2015; Trynka et al., 2013; Finucane et al., 2015; and many others ) (Li et al., 2016, Science)

What does this tell us about why people get disease?

The classic view would be that causal variants are concentrated in core genes, pathways that drive disease/traits. Figure from Wood et al 2014 NG! Synaptic pruning in Schizophrenia (Sekar et al., 2016)! Adipocyte differentiation in obesity (Smemo et al., 2014, Claussnitzer et al., 2015)

We argue that data from GWAS do not support this model: Heritability of many complex traits (e.g. schizophrenia) is (1) spread very widely across the genome and (2) shows limited pathway enrichment. Core disease genes (n < 200) Most expressed genes (n > 10K) Instead we propose an updated model that hypothesizes that most genes expressed in relevant tissues affect disease risk through highly-connected tissue-specific interaction networks. (We refer to this hypothesis as the Omnigenic model.)

Observation #1: For many traits, causal loci are spread nearly uniformly across the genome

Amount of schizophrenia heritability explained by each chromosome is highly correlated with its length 4% schizophrenia heritability 0% Loh Price, Nature Genetics 2015. Chromosome length (MB)! At a broad scale, causal SNPs are spread widely across the genome! Loh et al: >70% of MB windows in the genome contribute to schizophrenia heritability

Nearly all complex traits show a strong polygenic signature at a broad scale! 30 traits were considered ranging from autoimmune diseases, to anthropomorphic traits, to metabolic traits, etc... Shi et al. AJHG 2016

Replication of height signal throughout p-value range! Median effect of the 697 significant hits on height is 1.43mm, median effect of all SNPs is quite large: 1/10 th of that, i.e. 0.145mm.! Conservative back-of-the-envelope calculation: >23K independent genetic loci affect height (need >150K SNPs to explain height variance in human). Evan Boyle

For many traits, causal SNPs are spread nearly uniformly across the genome But disease genes are not, so causal SNPs might often target other genes?

Observation #2: GWAS signals are enriched in chromatin that is active in cell-types that make sense. CNS Connective Immune Pancreas Pancreas CNS Analysis by Stratified LD Score regression [Finucane Alkes Price (2015) NG]! GWAS SNPs generally affect cell-type-specific processes (classic view).! GWAS SNPs affect non-specific processes as long as they are active (omnigenic view).

Disease heritability is enriched in chromatin uniquely active in relevant cell types Evan Boyle Analysis using Stratified LD Score regression and annotations from Finucane et al. 2015

Observation #2A: It doesn t matter much whether the chromatin is broadly active, or uniquely active Evan Boyle Analysis using Stratified LD Score regression and annotations from Finucane et al. 2015

What matters is that they are active in the relevant cell type Evan Boyle Analysis using Stratified LD Score regression and annotations from Finucane et al. 2015

What matters is that they are active in the relevant cell type Evan Boyle Genetic effects are not mediated through cell-type-specific function? Analysis using Stratified LD Score regression and annotations from Finucane et al. 2015

Heritability near genes with preferential tissue expression Using stratified LD score regression! SCZ heritability more enriched near genes preferentially expressed in frontal cortex! SNPs near genes expressed broadly explained more total heritability (because they are more numerous)

! Hypothesis: Genes that do not have a direct function in disease pathways might play, in aggregate, a larger role in disease so long as they have a function in the relevant tissue

Observation #3: SNPs near genes with relevant functional ontologies explain only a small fraction of disease heritability! Relevant functional categories are enriched in heritability.! For all three diseases the category that explained the most heritability was simply the largest category, i.e. protein binding. Evan Boyle

Summary: Variants in most of the genome contributes to heritability The main shared feature of contributing variants is that they are in regions that are active in relevant tissues, but not necessarily in pathways directly relevant to disease.

Model: 3 types of genes Core genes: direct roles in disease (genes that make sense: eg synaptic genes for schizophrenia) Peripheral genes: Any gene expressed in the right cell types can affect regulation/function of core genes, and most of them do Inactive genes: Genes not expressed in the right cell types do not contribute to heritability

Small world property of networks: most nodes can be reached from every other node by a small number of steps Papers on small world property: applies to most real-life networks Watts & Strogatz 1998 Strogatz 2001! Suggests that most expressed genes may be close to core genes.! Network could be at the transcriptional, post-transcriptional level, and/ or an emergent property (e.g. cell function).

If the Small World property applies here then genes outside core pathways might cumulatively contribute more heritability than the much smaller number of genes inside core pathways Core genes

Summary: The contributions of thousands (tens of) of regulatory QTLs in peripheral genes might (paradoxically) drive most of the disease heritability. Therefore focusing only on core genes/pathways might never provide us with a full accounting of variation in disease risk.

Final remarks More implications/future directions: Mouse models for complex traits have limited use (dysregulated pathway only account for a small fraction of disease cases) Drugs may need to focus on altering system states rather than individual genes/pathways. We need a better understanding of how genes/ proteins interact together in a cell-type-specific fashion (could be PPI, transcriptional, etc..) More implications in our perspective: Boyle*, Li*, and Pritchard* (under review)

Acknowledgments Arbel Harpak (Stanford) William Greenleaf (Stanford) Alex Marson (UCSF) Joe Pickrell (NYGC/Columbia) Guy Sella (Columbia) Molly Przeworski (Columbia) Discussion/criticism welcome!

Height GWAS SNPs are enriched in functional elements! Although GWAS SNPs appear to be spread uniformly across the genome, they often fall within functional regions that suggests regulatory function.

Family-based GWAS confirms the signals are not driven by confounding from population structure! Median effect smaller likely due to higher rate of sign errors (sample size is 10% of GIANT). Evan Boyle