Imaging Genetics: Heritability, Linkage & Association

Imaging Genetics: Heritability, Linkage & Association David C. Glahn, PhD Olin Neuropsychiatry Research Center & Department of Psychiatry, Yale University July 17, 2011

Memory Activation & APOE ε4
Risk gene for Alzheimer's
16 APOE ε4 / 14 APOE ε3
Bookheimer et al., N Engl J Med, 2000

Fear Response & Serotonin Transporter Gene (SLC6A4) Hariri et al., Science, 2002

Human Genome 23 Chromosomes ~20-25,000 genes ~3 billion base pairs

Approaches to Genotyping
Candidate genes: genotype only markers in genes potentially related to the trait.
Pro: fast and easy; may be able to be more thorough with a higher density of markers.
Con: must get lucky in choice of genes; lower potential for something really novel.
Genome screen: genotype anonymous markers spanning the genome at regular intervals.
Pro: can identify previously unknown genes; covers all of the possibilities.
Con: slower and more expensive; may have lower marker density, which could translate to less power.

Questions for the Study of Complex Trait Genetics
1) Is this trait influenced by genetic factors? How strong are these genetic influences?
2) Which traits are influenced by the same genes?
3) Where are the genes that influence a trait?
4) What are the specific genes that influence the trait?
5) What specific genetic variants influence the trait and how do they interact with each other and with the environment?

Question 1: Heritability
Is this trait influenced by genetic factors? How strong are these genetic influences?

Defining Heritability Phenotype (P) = Genotype (G) + Environment (E)

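Variance Decomposition
$\sigma^2_p = \sigma^2_g + \sigma^2_e$
$\sigma^2_g = \sigma^2_a + \sigma^2_d$
$\sigma^2_e = \sigma^2_c + \sigma^2_{eu}$
where $\sigma^2_p$ = total phenotypic, $\sigma^2_g$ = genetic, $\sigma^2_e$ = environmental, $\sigma^2_a$ = additive genetic, $\sigma^2_d$ = dominance.
Sample estimates: $\hat{\mu} = \sum_i x_i / n$ and $\hat{\sigma}^2 = \sum_i (x_i - \hat{\mu})^2 / n$
Almasy & Blangero, Am J Hum Genet, 1998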

Narrow-Sense Heritability ($h^2$)
Heritability ($h^2$): the proportion of the phenotypic variance in a trait attributable to the additive effects of genes.
$h^2 = \sigma^2_a / \sigma^2_p$

Conceptualizing Heritability
Heritability estimates vary between 0 and 1.
0: genetic factors do not influence trait variance.
1: trait variance is completely under genetic control.
If $h^2 = 0.5$, then 50% of the phenotypic variation is due to genetic variation. This does not mean that the trait is 50% caused by genetics.
Stronger heritability does not imply simple genetics.

Estimating Heritability with Twins
Falconer's Method: $h^2 = 2(r_{MZ} - r_{DZ})$
$r_{MZ}$ = correlation between monozygotic co-twins
$r_{DZ}$ = correlation between dizygotic co-twins
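Falconer's formula is simple enough to sketch directly. The function name and the twin correlations below are hypothetical, chosen only for illustration; they are not taken from any study cited in these slides.

```python
def falconer_h2(r_mz: float, r_dz: float) -> float:
    """Falconer's estimate of heritability: h^2 = 2 * (r_MZ - r_DZ)."""
    return 2.0 * (r_mz - r_dz)


# Hypothetical twin correlations (illustration only).
h2 = falconer_h2(r_mz=0.80, r_dz=0.50)
print(f"h2 = {h2:.2f}")
```

Because the estimate is a difference of two sample correlations, it can fall outside the 0 to 1 range in small samples (for example when r_MZ is more than twice r_DZ), which is one reason larger twin and family samples are preferred.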

Variance Component Approach
Modeling the phenotype: $p = \mu + \sum_i \beta_i x_i + a + d + e$
$\mu$ = population mean
$\beta$ = regression coefficients
$x$ = scaled covariates
$a$ = additive genetic effects
$d$ = dominance genetic effects
$e$ = random environmental effects

Simple Kinship Matrix
Nuclear family: Dad (D), Mom (M), and children 1, 2, 3.

     D    M    1    2    3
D    1    0    ½    ½    ½
M    0    1    ½    ½    ½
1    ½    ½    1    ½    ½
2    ½    ½    ½    1    ½
3    ½    ½    ½    ½    1

Kinship Matrix
(Pedigree diagram: individuals 1, 2, 3 above; 4, 5 below.)

     1    2    3    4    5
1    1    0    0    ½    0
2    0    1    0    ½    ½
3    0    0    1    0    ½
4    ½    ½    0    1    ¼
5    0    ½    ½    ¼    1
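The entries in the two matrices above are twice the kinship coefficient, which is why the diagonal is 1 for non-inbred individuals. A minimal sketch of how such a matrix could be computed from a pedigree follows. The pedigree structure is an assumption inferred from the five-person matrix (4 is the child of 1 and 2, and 5 is the child of 2 and 3, making 4 and 5 half-siblings); the function names are hypothetical.

```python
from functools import lru_cache

# Assumed pedigree: person -> (father, mother); founders have no recorded parents.
# Parents must be listed before their children.
PEDIGREE = {
    1: (None, None),
    2: (None, None),
    3: (None, None),
    4: (1, 2),
    5: (2, 3),
}
ORDER = {person: rank for rank, person in enumerate(PEDIGREE)}


@lru_cache(maxsize=None)
def kinship(i, j):
    """Kinship coefficient between i and j by the standard recursion."""
    if i is None or j is None:
        return 0.0  # unknown parents of founders are treated as unrelated
    if i == j:
        father, mother = PEDIGREE[i]
        return 0.5 * (1.0 + kinship(father, mother))
    if ORDER[i] > ORDER[j]:
        i, j = j, i  # always recurse through the later-listed person
    father, mother = PEDIGREE[j]
    return 0.5 * (kinship(i, father) + kinship(i, mother))


def relationship_matrix():
    """Matrix of 2 * kinship, the quantity tabulated on the slide."""
    ids = list(PEDIGREE)
    return [[2.0 * kinship(a, b) for b in ids] for a in ids]


for row in relationship_matrix():
    print(row)
```

Under the assumed pedigree, the half-siblings 4 and 5 get an entry of ¼ and each parent-child pair gets ½, in agreement with the slide's matrix.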

Gray-Matter Thickness Heritability
Thompson et al., Nat Neurosci, 2001. 10 MZ / 10 DZ pairs

Gray-Matter Thickness Heritability
Kremen et al., NeuroImage, 2010. 110 MZ / 92 DZ pairs

Cortical Thickness & Surface Area
Winkler et al., NeuroImage, 2009. 486 family members

White-Matter Tracts (DTI) $h^2$
Kochunov et al., NeuroImage, 2010. 467 family members

White-Matter Hyperintensity $h^2$
Total Cranial: 0.91
Brain Parenchyma: 0.92
WMH: 0.73
Carmelli et al., Stroke, 1998. 74 MZ / 71 DZ pairs

Task-Based fMRI Heritability
Koten et al., Science, 2009. 10 MZ pairs / 10 sibs

Resting-State fMRI Heritability
$h^2 = 0.424$
Glahn et al., Proc Natl Acad Sci USA, 2010. 333 family members

Limitations of Heritability Estimates
1. Heritability is a population-level parameter, summarizing the strength of genetic influences on variation in a trait among members of the population. It doesn't tell you anything about particular individuals.
2. Heritability is an aggregate of the effects of multiple genes. It tells you nothing about how many genes influence a phenotype. A high heritability is not necessarily better if it is due to many, many genes.

Question 2: Pleiotropy
Which traits are influenced by the same genes?

Levels of Pleiotropy
(Figure: three panels showing Trait 1 and Trait 2 under no pleiotropy, partial pleiotropy, and full pleiotropy.)

Genetic Correlation (Pleiotropy)
Genetic correlation ($\rho_g$): a measure of the overlap in genetic effects between traits.
$\rho_g$ varies from -1 to 1.
0 = no pleiotropy; -1 or 1 = complete pleiotropy
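For reference, the genetic correlation can be written in terms of variance components. The slides do not give this formula; what follows is the standard quantitative-genetic definition, and the symbols $\sigma_{g_{12}}$ (additive genetic covariance between traits 1 and 2), $\rho_p$ (phenotypic correlation), and $\rho_e$ (environmental correlation) are notation introduced here for illustration.

```latex
\rho_g = \frac{\sigma_{g_{12}}}{\sqrt{\sigma^2_{g_1}\,\sigma^2_{g_2}}},
\qquad
\rho_p = \rho_g \sqrt{h^2_1\, h^2_2} + \rho_e \sqrt{(1 - h^2_1)(1 - h^2_2)}
```

The second expression shows why two traits can share most of their genetic influences yet correlate only weakly at the phenotypic level when either heritability is low.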

Cortical Thickness & Surface Area
Schmitt et al., Cerebral Cortex, 2008. 107 MZ / 47 DZ / 228 others
Winkler et al., NeuroImage, 2009. 486 family members

Cortical Thinning & IQ
$\rho_g$ = 1.0/1.0
$\rho_g$ = 1.0/0.7
Brans et al., J Neurosci, 2010. 77 MZ / 84 DZ pairs

White Matter Tracts & Working Memory
Superior longitudinal fasciculus, Spatial DRT: $\rho_g$ = 0.593
Karlsgodt et al., J Neurosci, 2010. 467 family members

Question 3: Localization
Where are the genes that influence a trait?

Two Common Methods for Gene Localization
Linkage analyses: test for co-segregation of phenotype and genotype within families. This is a function of the physical connections of genes on chromosomes.
Association analyses: test for deviations of phenotype-genotype combinations from those predicted by their separate frequencies. This is a function of linkage disequilibrium created by population history.

Association Studies
Pro: Can be done with unrelated individuals. Statistical methods are easy and fast. May be able to detect loci of smaller effect with a given sample size.
Con: Requires disequilibrium, which may not be present, making power difficult to estimate. Susceptible to allelic heterogeneity.

Genome Wide Association
Tests for correlation between genotype and phenotype.
Association analyses work when:
1) your genotyped marker is a functional polymorphism
2) your genotyped marker is in disequilibrium with a functional polymorphism
Currently: 700K to 1.5 million SNPs

Linkage Disequilibrium (LD)
Linkage disequilibrium is the non-random association of alleles at two or more loci.
LD = the population frequency of allelic combinations minus the frequency expected from random formation of haplotypes.
The level of LD is influenced by a number of factors including genetic linkage, selection, the rate of recombination, the rate of mutation, genetic drift, non-random mating, and population structure.
LD is unpredictable.
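The "observed minus expected" definition can be sketched for two biallelic loci. The coefficient D below follows the slide's definition; r² is a commonly used standardized measure that the slides do not mention and is included here as an addition. The function name and the frequencies are hypothetical.

```python
def ld_stats(p_ab: float, p_a: float, p_b: float):
    """Return (D, r^2) for allele A at locus 1 and allele B at locus 2.

    p_ab is the observed frequency of the A-B haplotype; p_a and p_b are
    the allele frequencies. Under random haplotype formation the expected
    haplotype frequency is p_a * p_b, so D = 0.
    """
    d = p_ab - p_a * p_b
    r2 = d * d / (p_a * (1.0 - p_a) * p_b * (1.0 - p_b))
    return d, r2


# Hypothetical frequencies (illustration only).
d, r2 = ld_stats(p_ab=0.40, p_a=0.50, p_b=0.50)
print(f"D = {d:.3f}, r2 = {r2:.3f}")
```

The r² formula assumes both loci are polymorphic (allele frequencies strictly between 0 and 1); otherwise the denominator is zero.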

Association, LD & Haplotypes
It is unlikely that the functional variants themselves are typed.
Association studies are dependent on LD between a genotyped marker and a functional variant.
This may change with complete sequencing.

Multiple testing: p-values
Using a p-value cutoff of 0.05 means that 5% of the time we will reject the null hypothesis (i.e. conclude that we have an association) when the null hypothesis is actually correct.
If we test 100 SNPs and use p < 0.05 as the cutoff for significance each time, we would expect 5 of those SNPs to be significant just by chance.

Multiple testing
The simplest correction is the Bonferroni: multiply each p-value by the total number of tests, or divide the required significance threshold by the number of tests (0.05 / number of tests). For 550K SNPs, this requires p < 0.05 / 550,000 ≈ 9.1×10⁻⁸.
This maintains an experiment-wide significance threshold, but may be too conservative when the tests are correlated, as they might be if some of the markers tested are in LD with each other.
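The arithmetic behind the expected number of chance findings and the Bonferroni threshold can be sketched as follows. The function names are hypothetical, and the p-values passed to the adjustment function are made up for illustration.

```python
def expected_false_positives(alpha: float, n_tests: int) -> float:
    """Expected number of chance 'hits' when every null hypothesis is true."""
    return alpha * n_tests


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test threshold that keeps the experiment-wide error rate at alpha."""
    return alpha / n_tests


def bonferroni_adjust(p_values):
    """Equivalent view: multiply each p-value by the number of tests, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


print(expected_false_positives(0.05, 100))   # 100 SNPs tested at p < 0.05
print(bonferroni_threshold(0.05, 550_000))   # 0.05 / 550,000
print(bonferroni_adjust([0.001, 0.02, 0.5]))
```

Both views give the same decisions: a test passes the divided threshold exactly when its multiplied p-value is below the original alpha.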

Cortical Thickness GWA
Association found at rs2342227 (p = 4.0×10⁻¹⁰), chr 13q31, near SLIT- & NTRK-like family, member 6.
SLITRKs are expressed predominantly in neural tissues and have neurite-modulating activity.
Associated with Tourette's Syndrome.
Glahn et al., ACNP, 2009

Temporal Lobe GWA
n = 742
rs10845840: p = 1.3×10⁻⁷
rs2456930: p = 3.1×10⁻⁷
Stein et al., NeuroImage, 2010

VBM & GWA
Stein et al., NeuroImage, 2010

Limitations of Association
A QTL may be in equilibrium with the other polymorphisms surrounding it. Disequilibrium need not be present.
Since LD need not be present, negative association results have implications only for the marker you have tested. Lack of association does not exclude the gene or region.
Population stratification: if the sample contains multiple populations that differ in the trait of interest, any locus whose allele frequencies differ between the populations will show association.

Example: Hypertension
African Americans (70% A, 30% B): AB AB AB AA AA AA AB AB
European Americans (50% A, 50% B): AB AB BB BB AA AA AB AB

Example: Hypertension
Affected (64% A, 36% B): AB AB AA AB AB AA AB
Unaffected (56% A, 44% B): AA AB AB BB BB AA AA AB AB
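The allele frequencies in this example can be recomputed from the genotype lists. In the sketch below the two population lists are taken from the slide; the split of the pooled genotypes into affected and unaffected groups is an assumption, chosen as the division of the slide's genotype run that reproduces the stated 64% and 56%. The function name is hypothetical.

```python
def allele_a_freq(genotypes):
    """Frequency of allele A in a list of two-letter genotypes such as 'AB'."""
    alleles = "".join(genotypes)
    return alleles.count("A") / len(alleles)


# Genotypes by population, as listed on the slide.
african_american = "AB AB AB AA AA AA AB AB".split()
european_american = "AB AB BB BB AA AA AB AB".split()

# Assumed grouping by disease status (reconstructed, see the note above).
affected = "AB AB AA AB AB AA AB".split()
unaffected = "AA AB AB BB BB AA AA AB AB".split()

for label, group in [
    ("African American", african_american),
    ("European American", european_american),
    ("Affected", affected),
    ("Unaffected", unaffected),
]:
    print(f"{label}: {allele_a_freq(group):.0%} A")
```

The African American frequency comes out at 11/16, which the slide rounds to 70%. Allele A looks enriched among affected individuals even though, in this constructed example, the difference could arise purely because the two populations differ both in allele frequency and in how often they appear in the affected group.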

Minimizing Limitations of Association
1. Match cases and controls carefully, or try to obtain subjects from a single well-defined population.
2. Use one of a variety of statistical approaches designed to deal with population stratification (e.g. TDT, genomic control).

Linkage Studies
Pro: Power to find a gene can be more easily quantified. Guaranteed to work if the sample size is large enough. Not influenced by allelic heterogeneity.
Con: Need a large sample of related individuals. "Large enough" may be too large to be practical.

Genetic Linkage Defined
Genetic loci that are physically close to one another tend to stay together during meiosis.
Independent assortment occurs when genes are on different chromosomes, or are separated by a great enough distance on the same chromosome that recombination occurs half of the time.
An exception to independent assortment develops when genes lie near one another on the same chromosome. When genes lie close together on the same chromosome, they are usually inherited as a single unit. Genes inherited in this way are said to be linked, and are referred to as "linkage groups."

Measuring Linkage: LOD Score
LOD = log10 (probability of birth sequence with a given linkage value / probability of birth sequence with no linkage)
$\mathrm{LOD} = \log_{10} \dfrac{(1-\theta)^{NR}\,\theta^{R}}{0.5^{NR+R}}$
NR denotes the number of non-recombinant offspring; R denotes the number of recombinant offspring.
θ = recombination fraction = R / (NR + R)
A LOD score ≥ 3.0 is considered evidence for linkage. A LOD score of 3 indicates 1000 to 1 odds that the linkage being observed did not occur by chance.
A LOD score ≤ -2.0 is considered evidence to exclude linkage.
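The LOD formula above can be evaluated directly for counted recombinants and non-recombinants. This is a minimal sketch for the simple phase-known case the formula describes, not the pedigree-based variance-component linkage used in the studies cited later; the function name and the counts are hypothetical.

```python
import math


def lod_score(n_nonrec: int, n_rec: int, theta: float) -> float:
    """log10 of [(1 - theta)^NR * theta^R] / 0.5^(NR + R).

    theta must be strictly greater than 0 whenever recombinants are observed.
    """
    log_likelihood = 0.0
    if n_nonrec:
        log_likelihood += n_nonrec * math.log10(1.0 - theta)
    if n_rec:
        log_likelihood += n_rec * math.log10(theta)
    # Subtract the log-likelihood under no linkage (theta = 0.5).
    return log_likelihood - (n_nonrec + n_rec) * math.log10(0.5)


# Hypothetical counts: 18 non-recombinants and 2 recombinants.
theta_hat = 2 / (18 + 2)  # R / (NR + R)
print(f"LOD at theta = {theta_hat}: {lod_score(18, 2, theta_hat):.2f}")
```

Working in log10 throughout avoids underflow when the number of offspring is large. With no recombinants at all and θ = 0, each informative offspring contributes log10(2) ≈ 0.3, so ten such offspring are needed to reach the LOD ≥ 3 threshold.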

Linkage with White-Matter Hyperintensities
Bivariate linkage for subcortical & ependymal HWM volumes (log of odds = 2.12) on chr 1 at 288 cM.
Kochunov et al., Stroke, 2009. 459 family members

Linkage with White-Matter Hyperintensities / Blood Pressure
(Figure: LOD score plot.)
Kochunov et al., Stroke, 2010. 357 family members

Linkage vs. Association
Association: you're testing for an excess of a specific combination of alleles at two loci. The same alleles must be traveling together at a population level. Detects effects of common variants.
Linkage: you're testing for an excess of the parental type. That parental type (i.e. the alleles traveling together) could be different in every family (i.e. linkage equilibrium) and you would still get linkage. Can detect the cumulative effect of multiple variants (including rare variants).

Determining Linkage Power
The power to map a QTL in a human linkage study is a function of:
1. Locus-specific heritability (genetic signal-to-noise ratio)
2. Sample size
3. Pedigree size and complexity

Sample size required for 80% power to detect linkage to a QTL at a LOD of 3

Question 4: Identification
What specific genes influence the trait?

Identifying a Causal Gene
Once a significant QTL is identified, additional genetic tests are needed to determine the exact identity of the gene.
Association: identifies a genomic region of ~500 kb (250 kb to either side of the association), determined by the general extent of linkage disequilibrium.
Linkage: detects the cumulative additive genetic signal of all functional variants within a much larger genomic region (e.g. 10-15 Mb).

VPS13A: A Candidate Gene for Brain Structure/Function
VPS13A is a 73-exon gene localizing to chromosome 9q21.
3174 amino acid protein
Involved in: chorea-acanthocytosis, schizophrenia, OCD, frontal lobe functioning
Bellis et al., ASHB, 2009

Association Analysis: VPS13A SNPs and Frontal Lobe Volume
rs13294928, rs13297451
Bellis et al., ASHB, 2009

Take Home Message
This may be the most important thing I say in this lecture.
"There is no one size fits all in genetic analysis of complex traits." Laura Almasy

Coffee Break