Estimating genetic variation within families

Similar documents
Resemblance between Relatives (Part 2) Resemblance Between Relatives (Part 2)

Introduction to linkage and family based designs to study the genetic epidemiology of complex traits. Harold Snieder

An Introduction to Quantitative Genetics

Quantitative Trait Analysis in Sibling Pairs. Biostatistics 666

Discontinuous Traits. Chapter 22. Quantitative Traits. Types of Quantitative Traits. Few, distinct phenotypes. Also called discrete characters

Non-parametric methods for linkage analysis

Psych 3102 Introduction to Behavior Genetics

Dan Koller, Ph.D. Medical and Molecular Genetics

An Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018

Introduction to Quantitative Genetics

We expand our previous deterministic power

For more information about how to cite these materials visit

It is shown that maximum likelihood estimation of

Performing. linkage analysis using MERLIN

Combined Linkage and Association in Mx. Hermine Maes Kate Morley Dorret Boomsma Nick Martin Meike Bartels

Nonparametric Linkage Analysis. Nonparametric Linkage Analysis

Your DNA extractions! 10 kb

MULTIFACTORIAL DISEASES. MG L-10 July 7 th 2014

Mendelian & Complex Traits. Quantitative Imaging Genomics. Genetics Terminology 2. Genetics Terminology 1. Human Genome. Genetics Terminology 3

The Efficiency of Mapping of Quantitative Trait Loci using Cofactor Analysis

Today s Topics. Cracking the Genetic Code. The Process of Genetic Transmission. The Process of Genetic Transmission. Genes

New Enhancements: GWAS Workflows with SVS

Chapter 2. Linkage Analysis. JenniferH.BarrettandM.DawnTeare. Abstract. 1. Introduction

Univariate modeling. Sarah Medland

Complex Multifactorial Genetic Diseases

A test of quantitative genetic theory using Drosophila effects of inbreeding and rate of inbreeding on heritabilities and variance components #

A Unified Sampling Approach for Multipoint Analysis of Qualitative and Quantitative Traits in Sib Pairs

Interaction of Genes and the Environment

HERITABILITY AND ITS GENETIC WORTH FOR PLANT BREEDING

Heritability. The concept

INTRODUCTION TO GENETIC EPIDEMIOLOGY (EPID0754) Prof. Dr. Dr. K. Van Steen

Variation (individual differences): Stature (in cm) in Dutch adolescent twins

BIOL 364 Population Biology Fairly testing the theory of evolution by natural selection with playing cards

Genome-wide association studies (case/control and family-based) Heather J. Cordell, Institute of Genetic Medicine Newcastle University, UK

Imaging Genetics: Heritability, Linkage & Association

SEMs for Genetic Analysis

Finding genes for complex traits: the GWAS revolution

Impacts of genomic. pre-selection on classical genetic evaluations. Vincent Ducrocq, Clotilde Patry. Vincent Ducrocq & Clotilde Patry

Multifactorial Inheritance. Prof. Dr. Nedime Serakinci

Interaction of Genes and the Environment

UNIT 6 GENETICS 12/30/16

Chapter 5 INTERACTIONS OF GENES AND THE ENVIRONMENT

Introduction of Genome wide Complex Trait Analysis (GCTA) Presenter: Yue Ming Chen Location: Stat Gen Workshop Date: 6/7/2013

Quantitative genetics: traits controlled by alleles at many loci

Learning Abilities and Disabilities

Biometrical genetics

25.1 QUANTITATIVE TRAITS

Regression-based Linkage Analysis Methods

Multivariate QTL linkage analysis suggests a QTL for platelet count on chromosome 19q

Diallel Analysis and its Applications in Plant Breeding

Overview of Animal Breeding

Quantitative Genetics

Critical assumptions of classical quantitative genetics and twin studies that warrant more attention

Role of Genomics in Selection of Beef Cattle for Healthfulness Characteristics

With the rapid advances in molecular biology, the near

Lecture 6 Practice of Linkage Analysis

Lecture 13: May 24, 2004

The Association Design and a Continuous Phenotype

Reflection Questions for Math 58B

C. Incorrect! Resemblance is not described by heritability. Heritability is a concept that is easily confused, so memorize the definition.

Chapter 6 Genetics. Total Assessment Guide (T.A.G.) 56, 57, 58, 59, 60, 61 22, 23, 25, 26, 27, 28, 29, 30, 31, 32, 34

Statistical Tests for X Chromosome Association Study. with Simulations. Jian Wang July 10, 2012

Genetics and Genomics in Medicine Chapter 8 Questions

Tutorial on Genome-Wide Association Studies

Genetics of Educational Attainment

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Behavioral genetics: The study of differences

Mendelian Genetics. KEY CONCEPT Mendel s research showed that traits are inherited as discrete units.

Heritability and genetic correlations explained by common SNPs for MetS traits. Shashaank Vattikuti, Juen Guo and Carson Chow LBM/NIDDK

additive genetic component [d] = rded

Quantitative Genetics. Statistics Overview: Mean. Statistics Overview: Variance. Statistics Overview: Distributions. Chapter 22

INTRODUCTION TO GENETIC EPIDEMIOLOGY (1012GENEP1) Prof. Dr. Dr. K. Van Steen

Bivariate Genetic Analysis

Replacing IBS with IBD: The MLS Method. Biostatistics 666 Lecture 15

MBG* Animal Breeding Methods Fall Final Exam

Comparing heritability estimates for twin studies + : & Mary Ellen Koran. Tricia Thornton-Wells. Bennett Landman

Decomposition of the Genotypic Value

The Inheritance of Complex Traits

12 MENDEL, GENES, AND INHERITANCE

Inbreeding and Inbreeding Depression

Honors Biology Test Chapter 9 - Genetics

Biology. Chapter 13. Observing Patterns in Inherited Traits. Concepts and Applications 9e Starr Evers Starr. Cengage Learning 2015

GENETICS - NOTES-

Sexual Reproduction & Inheritance

Genetics of common disorders with complex inheritance Bettina Blaumeiser MD PhD

Agro/ANSC/Biol/Gene/Hort 305 Fall, 2017 MENDELIAN INHERITANCE Chapter 2, Genetics by Brooker (Lecture outline) #2

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 6 Patterns of Inheritance

Rapid evolution towards equal sex ratios in a system with heterogamety

MENDELIAN GENETICS. MENDEL RULE AND LAWS Please read and make sure you understand the following instructions and knowledge before you go on.

Bias, precision and heritability of self-reported and clinically measured height in Australian twins

Frequency of church attendance in Australia and the United States: models of family resemblance

Heritability. The Extended Liability-Threshold Model. Polygenic model for continuous trait. Polygenic model for continuous trait.

Data and Theory Point to Mainly Additive Genetic Variance for Complex Traits

Human genetic susceptibility to leprosy: methodological issues in a linkage analysis of extended pedigrees from Karonga district, Malawi

Fundamentals of Genetics

STUDIES of the transmission of multifactorial traits in man have assumed

Ch 8 Practice Questions

GARRY R. CUTTING, MD PROFESSOR OF MEDICAL GENETICS DIRECTOR OF THE DNA DIAGNOSTIC LABORATORY THE JOHNS HOPKINS UNIVERSITY BALTIMORE, MARYLAND

Expression of Genetic Effects in the Environment. Expression of Genetic Effects in the Environment

By Mir Mohammed Abbas II PCMB 'A' CHAPTER CONCEPT NOTES

Transcription:

Estimating genetic variation within families Peter M. Visscher Queensland Institute of Medical Research Brisbane, Australia peter.visscher@qimr.edu.au 1

Overview Estimation of genetic parameters Variation in identity Applications mean and variance of genome-wide IBD sharing for sibpairs estimation of heritability of height genome partitioning of genetic variation 2

Estimation of genetic parameters Model expected covariance between relatives Genetics Environment Data correlation/regression of observations between relatives Statistical method ANOVA regression maximum likelihood Bayesian analysis 3

[Galton, 1889] 4

The height vs. pea debate (early 1900s) Biometricians Mendelians Do quantitative traits have the same hereditary and evolutionary properties as discrete characters? 5

Trait Qq qq QQ m-a m+d m+a RA Fisher (1918). Transactions of the Royal Society of Edinburgh 52: 399-433. 6

Genetic covariance between relatives cov G (y i,y j ) = a ij σ A2 + d ij σ D 2 a = additive coefficient of relationship = 2 * coefficient of kinship (= E(π)) d = coefficient of fraternity = Prob(2 alleles are IBD) 7

Examples (no inbreeding) Relatives a d MZ twins 1 1 Parent-offspring ½ 0 Fullsibs ½ ¼ Double first cousins ¼ 1 / 16 8

Controversy/confounding: nature vs nurture Is observed resemblance between relatives genetic or environmental? MZ & DZ twins (shared environment) Fullsibs (dominance & shared environment) Estimation and statistical inference Different models with many parameters may fit data equally well 9

Total mole count for MZ and DZ twins MZ twins - 153 pairs, r = 0.94 DZ twins - 199 pairs, r = 0.60 400 400 300 300 Twin 1 200 Twin 1 200 100 100 0 0 100 200 300 400 0 0 100 200 300 400 Twin 2 Twin 2 10

Sources of variation in Queensland school test results of 16-year olds 10% 78% 12% Additive genetic Shared environment Non-shared environment 11

An unbiased approach Estimate genetic variance within families 12

Actual or realised genetic relationship = proportion of genome shared IBD (π a ) Varies around the expectation Apart from parent-offspring and MZ twins Can be estimated using marker data 13

x 1/4 1/4 1/4 1/4 14

IDENTITY BY DESCENT Sib 1 Sib 2 4/16 = 1/4 sibs share BOTH parental alleles IBD = 2 8/16 = 1/2 sibs share ONE parental allele IBD = 1 4/16 = 1/4 sibs share NO parental alleles IBD = 0 15

Single locus Relatives E(π a ) var(π a ) Fullsibs ½ 1 / 8 Halfsibs ¼ 1 / 16 Double 1 st cousins ¼ 3 / 32 16

Several notations IBD Probability Actual IBD0 k 0 0 or 1 IBD1 k 1 0 or 1 IBD2 k 2 0 or 1 Σ=1 Σ=1 Realisations k 0 k 1 k 2 1 0 0 0 1 0 0 0 1 π a = ½k 1 + k 2 = R = 2θ π d = k 2 = xy 17 [e.g., LW Chapter 7; Weir and Hill 2011, Genetics Research]

n multiple unlinked loci Relatives E(π a ) var(π a ) Fullsibs ½ 1 / 8n Halfsibs ¼ 1 / 16n Double 1 st cousins ¼ 3 / 32n 18

Loci are on chromosomes Segregation of large chromosome segments within families increasing variance of IBD sharing Independent segregation of chromosomes decreasing variance of IBD sharing 19

Theoretical SD of π a Relatives 1 chrom (1 M) genome (35 M) Fullsibs 0.217 0.038 Halfsibs 0.154 0.027 Double 1 st cousins 0.173 0.030 20 [Stam 1980; Hill 1993; Guo 1996; Hill & Weir 2011]

Fullsibs: genome-wide (Total length L Morgan) var(π a ) 1/(16L) 1/(3L 2 ) [Stam 1980; Hill 1993; Guo 1996] var(π d ) 5/(64L) 1/(3L 2 ) var(π d )/ var(π a ) 1.3 if L = 35 Genome-wide variance depends more on total genome length than on the number of chromosomes 21

Fullsibs: Correlation additive and dominance relationships r(π a, π d ) = σ(π a ) / σ(π d ) [1/(16L) / (5/(64L))] 0.5 = 0.89. Using β(π a on π d ) = 1 Difficult but not impossible to disentangle additive and dominance variance NB Practical 22

Summary Additive and dominance (fullsibs) SD(π a ) SD(π d ) Single locus 0.354 0.433 One chromsome (1M) 0.217 0.247 Whole genome (35M) 0.038 0.043 Predicted correlation 0.89 (genome-wide π a and π d ) 23

Application (1) Aim: estimate genetic variance from actual relationships between fullsib pairs Two cohorts of Australian twin families Adolescent Adult Families 500 1512 Individuals 1201 3804 Sibpairs with genotypes 950 3451 Markers per individual 211-791 201-1717 Average marker spacing 6 cm 5 cm 24

Phenotype = height Application (1) Number of sibpairs with phenotypes and genotypes Adolescent cohort 931 Adult cohort 2444 Combined 3375 25

Mean IBD sharing across the genome for the jth sib pair was based on IBD estimated from Merlin every centimorgan and averaged at all 3491 points additive ˆ 3491 = ( j) i= 1 ˆ π a π a( ij) / 3491 dominance 3491 ( j) i= 1 πˆ = p d 2( ij) / 3491 26

And for the c th chromosome of length l c cm additive dominance ˆ π l c c = πˆ c / a ( j ) a ( ij ) c i= 1 l c ( j ) ˆ = p / d π i= 1 2( ij) l l c c 27

Mean and SD of genome-wide additive relationships 28

Mean and SD of genome-wide dominance relationships 29

Empirical and theoretical SD of additive relationships correlation = 0.98 (n = 4401) 30

Empirical and theoretical SD of dominance relationships correlation = 0.98 (n = 4401) 31

Additive and dominance relationships correlation = 0.91 (n= 4401) 32

Phenotypes After adjustment for sex and age: σ p = 7.7 cm σ p = 6.9 cm 33

Phenotypic correlation between siblings Raw After age & sex Adolescents 0.33 0.40 Adults 0.24 0.39 34

Models C = A = E = Family effect Genome-wide additive genetic Residual Full model Reduced model C + A + E C + E 35

Estimation Maximum Likelihood variance components Likelihood-ratio-test (LRT) to calculate P- values for hypotheses H 0 : A = 0 H 1 : A > 0 36

Estimates: null model (CE) Cohort Family effect (C) Adolescent 0.40 (0.34 0.45) Adult 0.39 (0.36 0.43) Combined 0.39 (0.36 0.42) 37

Estimates: full model (ACE) Cohort C A P Adolescent 0 0.80 0.0869 Adult 0 0.80 0.0009 Combined 0 0.80 0.0003 All family resemblance due to additive genetic variation 38

Sampling variances are large Cohort A (95% CI) Adolescent 0.80 (0.00 0.90) Adult 0.80 (0.43 0.86) Combined 0.80 (0.46 0.85) 39

F+A more accurately estimated Cohort C+A (95% CI) Adolescent 0.80 (0.36 0.90) Adult 0.80 (0.61 0.86) Combined 0.80 (0.62 0.85) Prediction of MZ correlation from fullsibs! 40

Power and SE of estimates True parameters (t) Sample size (n) Variance in genome-wide IBD sharing (var(π)) var( h ˆ2 ) (1 t 2 ) 2 / + [ ] (1 t 2 )( n var( π )) NCP = nh 4 var(π)(1+t 2 ) / (1-t 2 ) 2 41

Application (2) Genome partitioning of additive Aims genetic variance for height Estimate genetic variance from genome-wide IBD in larger sample Partition genetic variance to individual chromosomes using chromosome-wide coefficients of relationship Test hypotheses about the distribution of genetic variance in the genome 42

Sample # Sibpairs Sib Correlation AU 5952 0.43 US 3996 0.50 NL 1266 0.45 Total 11,214 0.46 43

Realised relationships Mean 0.499 Range 0.31 0.64 SD 0.036 44

Estimates from genome-wide additive and dominance coefficients ACE model Heritability 0.86 (0.49 0.95) P<0.0001 Family 0.03 (0.00 0.03) P=0.38 ADCE model Additive component 0.70 Dominance component 0.16 (P=0.35) 45

46

Estimates of chromosomal heritabilities 0.12 From combined chromosome analysis 0.10 0.08 0.06 0.04 0.02 y = 1.006x + 0.0001 R 2 = 0.9715 No epistasis? 0.00 0.00 0.02 0.04 0.06 0.08 0.10 0.12 From single chromosome analyses 47

Longer chromosomes explain more additive genetic variance: ~0.03 per 100 cm Estimate of heritability 0.14 0.12 0.10 0.08 0.06 0.04 0.02 0.00 WLS analysis: P<0.001; intercept NS 3 17 4 8 15 18 7 1 12 6 19 9 14 16 5 2 20 21 22 13 11 10 50 100 150 200 250 300 Length of chromosome (cm) 48

Estimates are consistent across countries 0.20 0.18 0.16 y = 0.9623x R 2 = 0.2813 3 Heritability USA 0.14 0.12 0.10 0.08 0.06 0.04 0.02 0.00 11 13 17 4 7 8 19 15 9 18 12 1 10 16 2 20 22 6 5 14 21 0.00 0.02 0.04 0.06 0.08 0.10 0.12 Heritability AU 49

Stepwise analyses: at least 6 chromosomes are needed to explain the additive genetic variance 0-2 0 1 2 3 4 5 6 7 8-4 -6 Scaled AIC -8-10 -12-14 -16-18 -20 Number of chromosomal additive genetic variance components 50

Hypothesis test Model h 2 c 2 df LRT Full (22 chrom.) 0.92 0.00 22 Genome-wide 0.86 0.03 1 19.2 Additive genetic variance in proportion to lenght not rejected 51

Data consistent with published QTL results 0.12 17 4 3 Estimate of heritability 0.10 0.08 0.06 0.04 Rank test: P=0.002 19 8 18 15 1 7 6 12 9 0.02 16 2 14 5 0.00 21 22 20 10 13 0 1 11 2 3 4 Number of publications with LOD > 1.9 52

Conclusions Empirical variation in genome-wide IBD sharing follows theoretical predictions Genetic variance can be estimated from genome-wide IBD within families results for height consistent with estimates from between-relative comparisons no assumptions about nature/nurture causes of family resemblance Genetic variance can be partitioned onto chromosomes 53

Conclusions With large sample sizes it will become possible to estimate dominance variance epistatic variance genome-wide parent-of-origin variance genetic relative risk to disease 54

Genetic architecture for height Additive genetic variance No QTL of large effects Chromosomes explain ~10% of genetic variance Consequences for genome-wide association 55

Other applications: breeding programmes Exploit variance in genome-wide IBD by using the realised A-matrix large increase in accuracy of selection if variance in identity is large family size is large Genomic Selection 56

Using the realised A-matrix: Reliability of EBV for an unphenotyped individual from n-1 phenotyped relatives (a simulation study) 57