BST227 Introduction to Statistical Genetics. Lecture 4: Introduction to linkage and association analysis

Similar documents
Introduction to linkage and family based designs to study the genetic epidemiology of complex traits. Harold Snieder

Imaging Genetics: Heritability, Linkage & Association

CS2220 Introduction to Computational Biology

(b) What is the allele frequency of the b allele in the new merged population on the island?

Introduction to the Genetics of Complex Disease

Mendelian Genetics and Beyond Chapter 4 Study Prompts

HST.161 Molecular Biology and Genetics in Modern Medicine Fall 2007

GENOME-WIDE ASSOCIATION STUDIES

Tutorial on Genome-Wide Association Studies

For more information about how to cite these materials visit

Mendelian Inheritance. Jurg Ott Columbia and Rockefeller Universities New York

Dan Koller, Ph.D. Medical and Molecular Genetics

Lecture 1 Mendelian Inheritance

5/2/18. After this class students should be able to: Stephanie Moon, Ph.D. - GWAS. How do we distinguish Mendelian from non-mendelian traits?

Using ancestry estimates as tools to better understand group or individual differences in disease risk or disease outcomes

Mendel s Methods: Monohybrid Cross

Mendelian Genetics using Fast Plants Report due Sept. 15/16. Readings: Mendelian genetics: Hartwell Chapter 2 pp , Chapter 5 pp

Laboratory. Mendelian Genetics

MULTIFACTORIAL DISEASES. MG L-10 July 7 th 2014

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 6 Patterns of Inheritance

HEREDITY = The passing of traits from parents to offspring. Transmitted by means of information stored in molecules of DNA.

An Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018

A gene is a sequence of DNA that resides at a particular site on a chromosome the locus (plural loci). Genetic linkage of genes on a single

For a long time, people have observed that offspring look like their parents.

Genetic association analysis incorporating intermediate phenotypes information for complex diseases

Mendelian & Complex Traits. Quantitative Imaging Genomics. Genetics Terminology 2. Genetics Terminology 1. Human Genome. Genetics Terminology 3

Unit 5 Review Name: Period:

When Mendel crossed 2 plants that were different in a single trait, he called that a monohybrid cross. The resulting offspring were called the F1

Ch 8 Practice Questions

BIOLOGY - CLUTCH CH.15 - CHROMOSOMAL THEORY OF INHERITANCE

By Mir Mohammed Abbas II PCMB 'A' CHAPTER CONCEPT NOTES

Fundamentals of Genetics

Roadmap. Inbreeding How inbred is a population? What are the consequences of inbreeding?

MENDELIAN GENETICS. Punnet Squares and Pea Plants

PRINCIPLE OF INHERITANCE AND

Transmission Disequilibrium Test in GWAS

MENDELIAN GENETIC CH Review Activity

The Chromosomal Basis Of Inheritance

GENETICS - NOTES-

Statistical power and significance testing in large-scale genetic studies

Mendelian Genetics. Biology 3201 Unit 3

Association mapping (qualitative) Association scan, quantitative. Office hours Wednesday 3-4pm 304A Stanley Hall. Association scan, qualitative

Welcome to the Genetic Code: An Overview of Basic Genetics. October 24, :00pm 3:00pm

LTA Analysis of HapMap Genotype Data

Genome-wide Association Analysis Applied to Asthma-Susceptibility Gene. McCaw, Z., Wu, W., Hsiao, S., McKhann, A., Tracy, S.

Genetics and Genomics in Medicine Chapter 8 Questions

What creates variation in the offspring of sexually reproducing organisms?

Assessing Accuracy of Genotype Imputation in American Indians

Non-parametric methods for linkage analysis

Heritability and genetic correlations explained by common SNPs for MetS traits. Shashaank Vattikuti, Juen Guo and Carson Chow LBM/NIDDK

Lecture 6: Linkage analysis in medical genetics

GENETIC LINKAGE ANALYSIS

Power Calculation for Testing If Disease is Associated with Marker in a Case-Control Study Using the GeneticsDesign Package

The Modern Genetics View

Biology. Chapter 13. Observing Patterns in Inherited Traits. Concepts and Applications 9e Starr Evers Starr. Cengage Learning 2015

MENDEL S LAWS AND MONOHYBRID CROSSES. Day 1 UNIT 6 : GENETICS

Chapter 15: The Chromosomal Basis of Inheritance

LINKAGE. Dr.Shivani Gupta, PGGCG-11, Chandigarh

Human Genetics 542 Winter 2018 Syllabus

SEX. Genetic Variation: The genetic substrate for natural selection. Sex: Sources of Genotypic Variation. Genetic Variation

Systems of Mating: Systems of Mating:

IN SILICO EVALUATION OF DNA-POOLED ALLELOTYPING VERSUS INDIVIDUAL GENOTYPING FOR GENOME-WIDE ASSOCIATION STUDIES OF COMPLEX DISEASE.

Downloaded from Chapter 5 Principles of Inheritance and Variation

Human Genetics 542 Winter 2017 Syllabus

Fundamentals of Genetics

Mendelian Genetics. Gregor Mendel. Father of modern genetics

Gregor Mendel and Genetics Worksheets

GENETICS NOTES. Chapters 12, 13, 14, 15 16

Unit 6.2: Mendelian Inheritance

Statistical Genetics : Gene Mappin g through Linkag e and Associatio n

GENETICS PREDICTING HEREDITY

MENDELIAN GENETICS. MENDEL RULE AND LAWS Please read and make sure you understand the following instructions and knowledge before you go on.

THE CHROMOSOMAL BASIS OF INHERITANCE CHAPTER 15

The Chromosomal Basis of Inheritance

Mendelian Genetics. You are who you are due to the interaction of HEREDITY and ENVIRONMENT. ENVIRONMENT: all outside forces that act on an organism.

Chapter 15 Notes 15.1: Mendelian inheritance chromosome theory of inheritance wild type 15.2: Sex-linked genes

Chapter 1 : Genetics 101

11-1: Introduction to Genetics

GENETICS - CLUTCH CH.2 MENDEL'S LAWS OF INHERITANCE.

Transmission Disequilibrium Methods for Family-Based Studies Daniel J. Schaid Technical Report #72 July, 2004

BST227: Introduction to Statistical Genetics

Problem set questions from Final Exam Human Genetics, Nondisjunction, and Cancer

Genetics and Pharmacogenetics in Human Complex Disorders (Example of Bipolar Disorder)

The Chromosomal Basis of Inheritance

Evolution II.2 Answers.

OCTOBER 21 Unit 5 Heredity 1. What is Heredity

Quantitative genetics: traits controlled by alleles at many loci

IS IT GENETIC? How do genes, environment and chance interact to specify a complex trait such as intelligence?

Mendel s Law of Heredity. Page 254

Ch. 23 The Evolution of Populations

Mendel Short IGES 2003 Data Preparation. Eric Sobel. Department of of Human Genetics UCLA School of of Medicine

The Foundations of Personalized Medicine

Ch 6.2 Inheritance in Sex Linked Genes.notebook April 19, 2018

Neatness 0 1 Accuracy Completeness Lab Class Procedure Total Lab Score

Chapter 15 - The Chromosomal Basis of Inheritance. A. Bergeron +AP Biology PCHS

Introduction of Genome wide Complex Trait Analysis (GCTA) Presenter: Yue Ming Chen Location: Stat Gen Workshop Date: 6/7/2013

On Missing Data and Genotyping Errors in Association Studies

TEACHER S GUIDE. Case Study. Lactose tolerance. Steve Cross, Bronwyn Terrill and colleagues. Wellcome Trust Sanger Institute Hinxton. Version 1.

Punne% Square Quiz A AP Tes2ng this week 15-Week Grades due next week Note: media center is hos2ng tes2ng Turn in all make-up work

Transcription:

BST227 Introduction to Statistical Genetics Lecture 4: Introduction to linkage and association analysis 1

Housekeeping Homework #1 due today Homework #2 posted (due Monday) Lab at 5:30PM today (FXB G13)

Reading The Fundamentals of Modern Statistical Genetics by Nan Laird and Christoph Lange Lecture title Reading 1 Background Chapter 1 2 Mendel s Laws, genetic models for disease Chapter 2 3 Hardy Weinberg Equilibrium and Recurrence risk Ratios Chapters 3, 41-2 4 An overview of linkage and association Chapter 5

Last time: Relationships between allele and genotype frequencies: Hardy Weinberg Equilibrium Today: Relationships between different loci

Y = 0 0 0 1 0 1 denotes the presence of a disease-causative variant at the DSL Goal: Locate the genetic variant(s) (ie the DSL) presumed to be biologically causal for a disease

Two main statistical methods for finding disease susceptibility loci Linkage Analysis: Based on recombination Association Analysis: Based on linkage disequilibrium (LD) 3

We take advantage of failure of Mendel s second law (Independent assortment) Figure 1 Example of Linkage Disequilibrium through generations Initial mutation () occurs on a chromosomal background (shaded) A (Many generations) Areas retaining LD with mutation 15

Mapping Strategies Linkage analysis Family data Few (<1000 genome-wide) markers required Can find potential disease loci located far from marker Low resolution (finds big candidate regions) Association analysis Case/Control data Relatively dense markers required (~1M genome-wide) Marker needs to be very close to DSL Higher resolution 9

Linkage and Association between a Genetic Marker and the DSL Marker (observed) LINKAGE: Based on physical concept of distance Two linked loci are on the same chromosome and close enough that they are not inherited independently Disease locus (unobserved) ASSOCIATION: Statistical concept A particular allele at a marker is associated with the disease variant (DSL) at the causal locus in the population Population concept, AKA Linkage Disequilibrium 14

Linkage vs Association: Statistical tests Linkage Two loci are linked if the recombination fraction is less than ½, ie the loci are NOT inherited independently Linkage analysis is based on testing if the recombination fraction between a marker and the DSL is = ½ Association Two loci are associated if the alleles at one locus are not independent of the alleles at another locus (allelic association) Association analysis is based on testing statistical independence between disease and a marker 12

Linkage Figure 1 Example of Linkage Disequilibrium through generations Initial mutation () occurs on a chromosomal background (shaded) A (Many generations) Areas retaining LD with mutation Marker (green arrow) is close to DSL () 16

No Linkage Figure 1 Example of Linkage Disequilibrium through generations Initial mutation () occurs on a chromosomal background (shaded) A (Many generations) Areas retaining LD with mutation Marker (green arrow) is far from DSL () 17

Basic Idea of Linkage analysis Based on recombination: θ = P(recombination occurs between two loci) If two loci are on top of each other, θ = 0 If two loci are far apart (or on different chromosomes), θ = ½ To test for linkage: Count number of recombinations we observe; estimate θ as proportion of recombinations Null hypothesis H0: θ = ½ Alternative hypothesis H1: θ < ½ Rejection of null implies there is linkage 18

Features of Linkage Analysis Must have family data with multiple affected individuals Requires being able to infer relationship between variant at DSL and disease trait Easiest with Mendelian 0/1 penetrance functions and mode of inheritance known Uses relatively few markers (<1000) for whole genome linkage analysis Rejecting null hypothesis of no linkage may implicate thousands of genes Very successful for Mendelian disorders, less so for complex 20

Association / Linkage Disequilibrium Figure 1 Example of Linkage Disequilibrium through generations Initial mutation () occurs on a chromosomal background (shaded) A (Many generations) Areas retaining LD with mutation 21

Haplotype Haplotype: Set of alleles at multiple loci on a particular chromosome transmitted from parent to child Red for haplotype from Mom, blue from Dad Genotype A/A C/T C/C T/T G/G A/A Haplotypes A A T C C C T T G G A A Phase Knowledge of the origin of alleles (either from mom or dad)

B is the DSL Linkage vs Linkage Disequilibrium A a B b C c D D A,a b,b c,c D,D a,a b,b C,c D,d a A B b C c D D A a B b C c d D A,a b,b C,c d,d A a B b C c D D a,a b,b c,c D,D A A B b C c D D a a b B c C d D a,a b,b C,c D,D a a b B C C D D A a B b C c d d A A B b C C d d a,a b,b c,c d,d A a B b C c d d Which loci are linked to B? All 4 loci are linked Which loci are in LD with B? Only B and C are in LD (ie associated at the population level)

Linkage Disequilibrium (LD) AKA Allelic Association LD usually exists when two loci are close (about 50kb maybe up to 300kb) When a mutation arises in a population, there is high LD between the mutation and other variants on the same chromosome Over generations, LD dissipates between mutation and loci far away via recombination 26

Ancestral alleles Linkage Disequilibrium (LD) AKA Allelic Association Y 0 1 0 0 Present-day alleles 0 1 1 0 1 Based on Abecasis 26

Using LD for Mapping: Association 25

Formal Definiton of LD LD refers to association between alleles at two markers on same haplotype (chromosome) B b A p AB p Ab p A a p ab p ab p a p B p b 1 p AB = proportion of chromosomes with an A and a B at the two loci 31

Linkage Equilibrium (absence of linkage) Linkage Equilibrium occurs when alleles at two loci are independent of each other: B b A p AB = p A p B p A a p a p B p b 1 Independence Linkage Equilibrium 32

Linkage Disequilibrium (LD) Association between alleles A and B Coefficient of linkage disequilibrium ( Disequilibrium coefficient ): D = p AB p A p B Problem: Range of D depends on margins Alternative: correlation, r = D/ p A p B p a p b nr 2 is effective sample size for testing marker instead DSL 33

Linkage Disequilibrium Bart Buitenhuis, BMC Genetics, 2016 https://doiorg/101186/s12863-016-0421-2

Genetic Association Analysis LD is the basis of Association Analysis For association analysis we observe a trait and a marker locus, usually not the DSL Test association between marker and a trait using ordinary association analysis methods Null hypothesis is no association between marker and the trait---guilt by Association Rejection implies DSL is in LD with the marker Spurious association occurs when 2 loci are not linked, ie, association between two loci that are on different chromosomes (possibly due to population substructure) 27

Features of Genetic Association Analysis Can use isolated cases and unrelated individuals or families Much more powerful than linkage analysis Requires hundreds of thousands of markers for a whole genome analysis (500k-1,000k ); a severe multiple comparison problem 28

Published Genome-Wide Associations through 12/2013 Published GWA at p 5X10-8 for 17 trait categories NHGRI GWA Catalog wwwgenomegov/gwastudies wwwebiacuk/fgpt/gwas/ 29