Complex Trait Genetics in Animal Models. Will Valdar Oxford University

Similar documents
Genetics of extreme body size evolution in mice from Gough Island

QTL detection for traits of interest for the dairy goat industry

The Efficiency of Mapping of Quantitative Trait Loci using Cofactor Analysis

MBG* Animal Breeding Methods Fall Final Exam

THERE is broad interest in genetic loci (called quantitative

Introduction to linkage and family based designs to study the genetic epidemiology of complex traits. Harold Snieder

New Enhancements: GWAS Workflows with SVS

Received 5 April 2018; revised 31 July 2018; accepted 14 August 2018; published online 24 November 2018

Chromosomal regions underlying noncoagulation of milk in Finnish Ayrshire cows

Statistical Genetics : Gene Mappin g through Linkag e and Associatio n

outbred colonies are stocks inbred colonies are strains 3/22/2012 Mouse strains 2.500

Supplementary Figures

The genetic architecture of novel trophic specialists: higher effect sizes are associated with exceptional

Lecture 9: Hybrid Vigor (Heterosis) Michael Gore lecture notes Tucson Winter Institute version 18 Jan 2013

Dan Koller, Ph.D. Medical and Molecular Genetics

Chapter 2. Linkage Analysis. JenniferH.BarrettandM.DawnTeare. Abstract. 1. Introduction

Variation in linkage disequilibrium patterns between populations of different production types VERONIKA KUKUČKOVÁ, NINA MORAVČÍKOVÁ, RADOVAN KASARDA

The sex-specific genetic architecture of quantitative traits in humans

GENETIC LINKAGE ANALYSIS

Performing. linkage analysis using MERLIN

Statistical Tests for X Chromosome Association Study. with Simulations. Jian Wang July 10, 2012

An Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018

Role of Genomics in Selection of Beef Cattle for Healthfulness Characteristics

Name: PS#: Biol 3301 Midterm 1 Spring 2012

The domesticated brain: genetics of brain mass and brain structure in an avian species

Genetic Analysis of Anxiety Related Behaviors by Gene Chip and In situ Hybridization of the Hippocampus and Amygdala of C57BL/6J and AJ Mice Brains

Non-parametric methods for linkage analysis

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.

A Decision Rule for Quantitative Trait Locus Detection under the Extended Bayesian LASSO Model

Imaging Genetics: Heritability, Linkage & Association

GENETIC MODIFIERS. KENT W. HUNTER Laboratory of Population Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland

Introduction of Genome wide Complex Trait Analysis (GCTA) Presenter: Yue Ming Chen Location: Stat Gen Workshop Date: 6/7/2013

Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection

Estimating genetic variation within families

Genetics of COPD Prof. Ian P Hall

Genome-wide analysis reveals a complex pattern of genomic imprinting in mice

Complex Multifactorial Genetic Diseases

Over the past several decades, hundreds of

The Determination of the Genetic Order and Genetic Map for the Eye Color, Wing Size, and Bristle Morphology in Drosophila melanogaster

Breeding Designs for Recombinant Inbred Advanced Intercross Lines. Lewis-Sigler Institute for Integrative Genomics and Department of Ecology and

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

LOOKING FOR ARTHRITIS REGULATING GENES ON MOUSE CHROMOSOME 6 & 14

DESPITE decades of effort, the molecular variants

Variation in Measurement Error in Asymmetry Studies: A New Model, Simulations and Application

A Bayesian approach for constructing genetic maps when markers are miscoded

Statistical power and significance testing in large-scale genetic studies

MULTIFACTORIAL DISEASES. MG L-10 July 7 th 2014

The Pennsylvania State University. The Graduate School. College of Health and Human Development BONE QUALITY, MUSCLE MASS, AND ACTIVITY:

Big Data Training for Translational Omics Research. Session 1, Day 3, Liu. Case Study #2. PLOS Genetics DOI: /journal.pgen.

A test of quantitative genetic theory using Drosophila effects of inbreeding and rate of inbreeding on heritabilities and variance components #

Association mapping (qualitative) Association scan, quantitative. Office hours Wednesday 3-4pm 304A Stanley Hall. Association scan, qualitative

A search for quantitative trait loci exhibiting imprinting effects on mouse mandible size and shape

Your DNA extractions! 10 kb

Numerous hypothesis tests were performed in this study. To reduce the false positive due to

Inheritance of Aldehyde Oxidase in Drosophila melanogaster

Significance of the MHC

EXPLAINING patterns of biodiversity requires a mechanistic

Genetic resistance to diet-induced obesity in chromosome substitution strains of mice

Aspects of Statistical Modelling & Data Analysis in Gene Expression Genomics. Mike West Duke University

Ordinal Data Modeling

Mendel Short IGES 2003 Data Preparation. Eric Sobel. Department of of Human Genetics UCLA School of of Medicine

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

IDENTIFICATION OF QTLS FOR STARCH CONTENT IN SWEETPOTATO (IPOMOEA BATATAS (L.) LAM.)

J Jpn Coll Angiol, 2009, 49: collagen disease, genetic polymorphism, MRL mice, recombinant inbred strains, Cd72. MRL/Mp-lpr/lpr MRL/ lpr

Large-scale identity-by-descent mapping discovers rare haplotypes of large effect. Suyash Shringarpure 23andMe, Inc. ASHG 2017

the HLA complex Hanna Mustaniemi,

Effects of Stratification in the Analysis of Affected-Sib-Pair Data: Benefits and Costs

The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0

Kelvin Chan Feb 10, 2015

Quantitative Genetics

Table S1. Trait summary for Northern Leaf Blight resistance in the maize nested association mapping (NAM) population and individual NAM families

Genetics and Genomics in Medicine Chapter 8 Questions

Effect of Genetic Heterogeneity and Assortative Mating on Linkage Analysis: A Simulation Study

Ecological Statistics

Supervised Learner for the Prediction of Hi-C Interaction Counts and Determination of Influential Features. Tyler Yue Lab

25.1 QUANTITATIVE TRAITS

BST227: Introduction to Statistical Genetics

Ct=28.4 WAT 92.6% Hepatic CE (mg/g) P=3.6x10-08 Plasma Cholesterol (mg/dl)

Gene and Pathway Analysis of Metabolic Traits in Dairy Cows

Mapping of genes causing dyslexia susceptibility Clyde Francks Wellcome Trust Centre for Human Genetics University of Oxford Trinity 2001

Discontinuous Traits. Chapter 22. Quantitative Traits. Types of Quantitative Traits. Few, distinct phenotypes. Also called discrete characters

Genomics Research. May 31, Malvika Pillai

Genetic study of gutter-shaped root (GSR) in AKXL RI mouse strains using QTL analysis

Locating a modifier gene of Ovum mutant through crosses between DDK and C57BL/6J inbred strains in mice

Supplemental Data. Genome-wide Association of Copy-Number Variation. Reveals an Association between Short Stature

Alzheimer Disease and Complex Segregation Analysis p.1/29

Statistical Methods for Latent Class Quantitative Trait Loci Mapping

Quantitative Trait Loci Involved in Genetic Predisposition to Acute Alcohol Withdrawal in Mice

Genomewide Linkage of Forced Mid-Expiratory Flow in Chronic Obstructive Pulmonary Disease

Quantitative genetics: traits controlled by alleles at many loci

Genetic association analysis incorporating intermediate phenotypes information for complex diseases

QTL ANALYSIS OF VITAMIN E TRAIT IN CAPSICUM F2 POPULATION

Taking a closer look at trio designs and unscreened controls in the GWAS era

Supplementary Online Content

A Bayesian Framework for Parentage Analysis: The Value of Genetic and Other Biological Data

The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters

For a long time, people have observed that offspring look like their parents.

Method Comparison for Interrater Reliability of an Image Processing Technique in Epilepsy Subjects

Rare Variant Burden Tests. Biostatistics 666

Identification of loci and genes responsible for sodium and chloride exclusion in rootstocks for use in marker assisted selection

Transcription:

Complex Trait Genetics in Animal Models Will Valdar Oxford University

Mapping Genes for Quantitative Traits in Outbred Mice Will Valdar Oxford University

What s so great about mice? Share ~99% of genes with humans ~90% of the two genomes can be portioned into regions of conserved synteny Shorter lifespans You can do invasive experiments You can breed them as you like control the genetics

What is an inbred strain? F20 generation F = 99% BALB/c

http://www.informatics.jax.org/mgihome/genealogy/

one inbred strain

two inbred strains

Mouse model of anxiety

Mouse model of anxiety Anxious mouse Non anxious mouse

F2 cross Generation F0 F1 F2

anxious about average non-anxious

Quantitative Trait Locus anxious about average non-anxious

Quantitative Trait Locus anxious about average non-anxious

Quantitative Trait Locus anxious about average non-anxious

Quantitative Trait Locus anxious about average non-anxious QTL snp

Linear models Also known as ANOVA ANCOVA regression multiple regression linear regression

x +1 0-1 QTL snp

x y ax +1 0-1 QTL snp

x y ax +1 0-1 QTL snp

x y ax +1 0-1 a a QTL snp

Hypothesis testing H 0 : y H 1 : y ax

Hypothesis testing H 0 : y ~ 1 y H 1 : y ~ 1 + x y ax

Hypothesis testing H 0 : y ~ 1 y H 1 : y ~ 1 + x y ax H 1 vs H 0 : Does x explain a significant amount of the variation?

Hypothesis testing H 0 : y ~ 1 y H 1 : y ~ 1 + x y ax H 1 vs H 0 : Does x explain a significant amount of the variation? likelihood ratio LOD score

Hypothesis testing H 0 : y ~ 1 y H 1 : y ~ 1 + x y ax H 1 vs H 0 : Does x explain a significant amount of the variation? LOD score likelihood ratio Chi Square test p-value logp

Hypothesis testing H 0 : y ~ 1 y H 1 : y ~ 1 + x y ax H 1 vs H 0 : Does x explain a significant amount of the variation? LOD score likelihood ratio Chi Square test p-value logp linear models only SS explained / SS unexplained F-test (or t-test)

Hypothesis testing H 0 : y ~ 1 + x1 y a 1 x 1 H 1 : y ~ 1 + x1 + x2 y a x a x 1 1 2 2 H 1 vs H 0 : Does x2 explain a significant amount of the variation after accounting for x1? or is x2 significant conditional on x1?

F2 cross Generation F0 F1 F2

QTL

logp QTL

QTL logp highly significant snp non significant snp

logp QTL

logp QTL

Chromosome scan for F2 QTL goodness of fit (logp) significance threshold 0 200Mb position along whole chromosome (Mb) Typical chromosome

Advanced intercross lines (AILs) F0 F1 F2 F3 F4 Darvasi & Soller (1995) Genetics

F12 cross F0 F1 F2 F12

Chromosome scan for F12 QTL goodness of fit (logp) significance threshold 0 200Mb position along whole chromosome (Mb) Typical chromosome

Practical 1. Fitting a linear model to test a marker-phenotype association 2. Single marker association on an F2 3. Permutation test 4. Single marker association on an AIL (F12) 5. Conditional modelling of loci Start Firefox, File->Open and go to F:\valdar\ThursdayAfternoonAnimals\practical.R Start R

Practical: F2 cross Bonferroni = 2.6 permutation ~ 2.1 uncorrected = 1.3

Practical: F2 cross Bonferroni = 2.6 permutation ~ 2.1 uncorrected = 1.3 generalized extreme value (GEV) distribution

Practical: F12 phenotype ~ MARKER

Practical: F12 phenotype ~ MARKER phenotype ~ m37 + MARKER

Practical: F12 phenotype ~ MARKER phenotype ~ m37 + MARKER phenotype ~ MARKER

Practical: F12 phenotype ~ MARKER phenotype ~ m37 + MARKER phenotype ~ MARKER phenotype ~ m37 + MARKER

Practical: F12 phenotype ~ MARKER phenotype ~ m37 + MARKER phenotype ~ MARKER phenotype ~ m37 + MARKER phenotype ~ m37 + m29 + MARKER

F2 F18 F18 multilocus approach

F2 F18 F18 multilocus approach

F2 breeding in a small population F18 F18 multilocus approach

F2 population structure gross genetic differences between groups where groups = families F18 multilocus approach

F2 population structure gross genetic differences between groups where groups = families multilocus approach

Heterogeneous Stocks

Heterogeneous Stock Pseudo-random mating for 50 generations Avg. Distance Between Recombinations: HS ~2 cm

Heterogeneous Stock F2 Intercross x Pseudo-random mating for 50 generations F1 Avg. Distance Between Recombinations: HS ~2 cm F2 intercross ~30 cm F2

124 Phenotypes Anxiety [24] Asthma [13] Biochemistry [15] Bone Morphology [23] Diabetes [16] Haematology [15] Immunology [9] Weight/size related [8] Wound Healing [1]

Intraperitoneal Glucose Tolerance Test

How to select peaks: a simulated example

How to select peaks: a simulated example Simulate 7 x 5% QTLs (ie, 35% genetic effect) + 20% shared environment effect + 45% noise = 100% variance

Simulated example

phenotype ~?

phenotype ~ peak 1 +? condition on 1 peak

condition on 2 peaks phenotype ~ peak 1 + peak 2 +?

condition on 3 peaks phenotype ~ peak 1 + peak 2 + peak 3 +?

condition on 4 peaks phenotype ~ peak 1 + peak 2 + peak 3 + peak 4 +?

condition on 5 peaks phenotype ~ peak 1 + peak 2 + peak 3 + peak 4 + peak 5 +?

condition on 6 peaks phenotype ~ peak 1 + peak 2 + peak 3 + peak 4 + peak 5 + peak 6 +?

condition on 7 peaks phenotype ~ peak 1 + peak 2 + peak 3 + peak 4 + peak 5 + peak 6 + peak 7 +?

condition on 8 peaks phenotype ~ peak 1 + peak 2 + peak 3 + peak 4 + peak 5 + peak 6 + peak 7 + peak 8 +?

condition on 9 peaks phenotype ~ peak 1 + peak 2 + peak 3 + peak 4 + peak 5 + peak 6 + peak 7 + peak 8 + peak 9 +?

condition on 10 peaks phenotype ~ peak 1 + peak 2 + peak 3 + peak 4 + peak 5 + peak 6 + peak 7 + peak 8 + peak 9 + peak 10 +?

condition on 11 peaks phenotype ~ peak 1 + peak 2 + peak 3 + peak 4 + peak 5 + peak 6 + peak 7 + peak 8 + peak 9 + peak 10 + peak 11 +?

Peaks chosen by forward selection

Bootstrap sampling 10 subjects 1 2 3 4 5 6 7 8 9 10

Bootstrap sampling sample with replacement 1 1 2 2 10 subjects 3 4 5 6 2 3 5 5 bootstrap sample from 10 subjects 7 6 8 7 9 7 10 9

Forward selection on a bootstrap sample

Forward selection on a bootstrap sample

Forward selection on a bootstrap sample

Bootstrap evidence mounts up

Model Inclusion Probability In 1000 bootstraps

854 loci in all phenotypes, 84 diabetes loci

854 loci in all phenotypes, 84 diabetes loci

Servin B, Stephens M (2007)

Bayesian Multiple QTL modelling Kilpikari R, Sillanpaa MJ (2003) Bayesian analysis of multilocus association in quantitative and qualitative traits. Genet Epidemiol 25: 122-135 Yi N (2004) A unified Markov chain Monte Carlo framework for mapping multiple quantitative trait loci. Genetics 167: 967-975 Servin B, Stephens M (2007) Imputation-based analysis of association studies: candidate regions and quantitative traits. PLoS Genet 3: e114 Fridley BL (2008) Bayesian variable and model selection methods for genetic association studies. Genet Epidemiol.

The Collaborative Cross Heterogeneous Stocks (HS) Collaborative Cross outbreed outbred population recombinant inbred lines Churchill et al 2004; Broman 2005; Valdar et al 2006