Transmission Disequilibrium Test in GWAS

Similar documents
Nonparametric Linkage Analysis. Nonparametric Linkage Analysis

Non-parametric methods for linkage analysis

Transmission Disequilibrium Methods for Family-Based Studies Daniel J. Schaid Technical Report #72 July, 2004

Introduction to linkage and family based designs to study the genetic epidemiology of complex traits. Harold Snieder

STATISTICAL GENETICS 98 Transmission Disequilibrium, Family Controls, and Great Expectations

Complex Multifactorial Genetic Diseases

Dan Koller, Ph.D. Medical and Molecular Genetics

BST227 Introduction to Statistical Genetics. Lecture 4: Introduction to linkage and association analysis

GENETIC LINKAGE ANALYSIS

Allowing for Missing Parents in Genetic Studies of Case-Parent Triads

Effect of Genetic Heterogeneity and Assortative Mating on Linkage Analysis: A Simulation Study

Human population sub-structure and genetic association studies

Genetics and Genomics in Medicine Chapter 8 Questions

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 6 Patterns of Inheritance

Ascertainment Through Family History of Disease Often Decreases the Power of Family-based Association Studies

Chapter 15 - The Chromosomal Basis of Inheritance. A. Bergeron +AP Biology PCHS

GENETICS - NOTES-

Comparison of Linkage-Disequilibrium Methods for Localization of Genes Influencing Quantitative Traits in Humans

Imaging Genetics: Heritability, Linkage & Association

Review and Evaluation of Methods Correcting for Population Stratification with a Focus on Underlying Statistical Principles

Chapter 2. Linkage Analysis. JenniferH.BarrettandM.DawnTeare. Abstract. 1. Introduction

B-4.7 Summarize the chromosome theory of inheritance and relate that theory to Gregor Mendel s principles of genetics

STATISTICAL ANALYSIS FOR GENETIC EPIDEMIOLOGY (S.A.G.E.) INTRODUCTION

REVIEWS. Family-based designs for genome-wide association studies

Indian Journal of Nephrology Indian J Nephrol 2001;11: 88-97

Chapter 15: The Chromosomal Basis of Inheritance

Replacing IBS with IBD: The MLS Method. Biostatistics 666 Lecture 15

Name: PS#: Biol 3301 Midterm 1 Spring 2012

Stat 531 Statistical Genetics I Homework 4

Power Calculation for Testing If Disease is Associated with Marker in a Case-Control Study Using the GeneticsDesign Package

Alzheimer Disease and Complex Segregation Analysis p.1/29

For a long time, people have observed that offspring look like their parents.

Name Class Date. KEY CONCEPT The chromosomes on which genes are located can affect the expression of traits.

Effects of Stratification in the Analysis of Affected-Sib-Pair Data: Benefits and Costs

Figure 1: Transmission of Wing Shape & Body Color Alleles: F0 Mating. Figure 1.1: Transmission of Wing Shape & Body Color Alleles: Expected F1 Outcome

Mendelian Genetics. KEY CONCEPT Mendel s research showed that traits are inherited as discrete units.

Linkage Disequilibrium in Recently Admixed Populations

MTH 225: Introductory Statistics

draw and interpret pedigree charts from data on human single allele and multiple allele inheritance patterns; e.g., hemophilia, blood types

Quantitative Genetic Methods to Dissect Heterogeneity in Complex Traits

Mendelian Genetics: Patterns of Inheritance

Publications (* denote senior corresponding author)

MENDELIAN GENETICS. MENDEL RULE AND LAWS Please read and make sure you understand the following instructions and knowledge before you go on.

Probability and Punnett Squares

Statistical Evaluation of Sibling Relationship

1997 Oxford University Press Human Molecular Genetics, 1997, Vol. 6, No

Type 1 diabetes, an early-onset autoimmune disease. Brief Genetics Report Association of IL4R Haplotypes With Type 1 Diabetes

Chapter 15: The Chromosomal Basis of Inheritance

A gene is a sequence of DNA that resides at a particular site on a chromosome the locus (plural loci). Genetic linkage of genes on a single

A Unified Sampling Approach for Multipoint Analysis of Qualitative and Quantitative Traits in Sib Pairs

Population Structure in Admixed Populations: Effect of Admixture Dynamics on the Pattern of Linkage Disequilibrium

Lecture 6: Linkage analysis in medical genetics

Linkage Mapping in Drosophila Melanogaster

Biology. Chapter 13. Observing Patterns in Inherited Traits. Concepts and Applications 9e Starr Evers Starr. Cengage Learning 2015

Biology: Life on Earth

Model of an F 1 and F 2 generation

Edited by Tak Wah Mak, Ontario Cancer Institute, Toronto, ON, Canada, and approved March 19, 2009 (received for review December 19, 2008)

Covariate Adjustment in Family-Based Association Studies

14.1 Human Chromosomes pg

Chapter 17 Genetics Crosses:

HLA Disease Associations Methods Manual Version (July 25, 2011)

Mendel and Genetics. Mr. Nagel Meade High School

Performing. linkage analysis using MERLIN

Patterns in Inheritance. Chapter 10

ARTICLE Identification of Susceptibility Genes for Cancer in a Genome-wide Scan: Results from the Colon Neoplasia Sibling Study

By Mir Mohammed Abbas II PCMB 'A' CHAPTER CONCEPT NOTES

Diabetologia 9 Springer-Verlag 1981

Parental Effects. Genomic Imprinting

For more information about how to cite these materials visit

Genome-wide association studies (case/control and family-based) Heather J. Cordell, Institute of Genetic Medicine Newcastle University, UK

Patterns of Inheritance

Section 8.1 Studying inheritance

Introduction to the Genetics of Complex Disease

DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK

Chapter 11 introduction to genetics 11.1 The work of Gregor mendel

Pedigree Construction Notes

Diabetologia. Originals. Will a three-allele model of inheritance explain the HLA data for Type 1 (insulin-dependent) diabetes? D. A.

Mendelism: the Basic principles of Inheritance

Genetics Practice Test

h f presence of a human leukocyte antigen (HLA)-linked disease-predisposing

SSN SBPM Workshop Exam One. Short Answer Questions & Answers

Lesson Overview 11.2 Applying Mendel s Principles

Genetics and Pharmacogenetics in Human Complex Disorders (Example of Bipolar Disorder)

The Statistics of Inheritance

1248 Letters to the Editor

MULTIFACTORIAL DISEASES. MG L-10 July 7 th 2014

Human genetic susceptibility to leprosy: methodological issues in a linkage analysis of extended pedigrees from Karonga district, Malawi

The Making of the Fittest: Natural Selection in Humans

New Enhancements: GWAS Workflows with SVS

INTRODUCTION TO GENETIC EPIDEMIOLOGY (1012GENEP1) Prof. Dr. Dr. K. Van Steen

Multifactorial Inheritance. Prof. Dr. Nedime Serakinci

Name Class Date. Review Guide. Genetics. The fundamental principles of genetics were first discovered by. What type of plant did he breed?.

Gregor Mendel and Genetics Worksheets

VOCABULARY somatic cell autosome fertilization gamete sex chromosome diploid homologous chromosome sexual reproduction meiosis

UNIT 6 GENETICS 12/30/16

Laws of Inheritance. Bởi: OpenStaxCollege

Genetics, Analysis & Principles/5e ANSWERS TO PROBLEM SETS CHAPTER 1

Chapter 02 Mendelian Inheritance

Finding genes. Genes and language, Part V: Dan Dediu. Dan Dediu

Genetics and Heredity Notes

Transcription:

Department of Computer Science Brown University, Providence sorin@cs.brown.edu November 10, 2010

Outline 1 Outline 2 3 4

The transmission/disequilibrium test (TDT) was intro- duced several years ago by Spielman et al. (1993) as a test for linkage between a complex disease and a genetic marker. The original intended use of the TDT was to test for linkage with a marker located near a candidate gene, in cases where disease association already had been found. However, even if prior evidence for association is absent, the TDT is valid and can be used to test any marker (or a set of markers) for which data are available from parents and one or more affected offspring.

The genetic model consists of both a locus D that contributes to disease susceptibility and a (possibly linked) marker locus M. The standard way to identify disease loci is to use classical (LOD) or nonparametric (affected-sib-pair [ASP]) methods to test for linkage with such a marker or a set of markers. It has been known for some time, however, that these methods may fail to detect a disease locus linked to a marker, even though the locus may be of biological significance (Cox et al. 1988; Spielman et al. 1989). Even when standard linkage tests fail to provide evi- dence, however, a disease locus linked to the marker may be suggested sometimes by the presence of a dis- ease association, usually established by a case-control study.

The underlying premise is that, if an association is found, it is likely to be due to linkage disequilibrium. (We use the term strictly to mean the presence of both linkage and association between marker and disease.) Since linkage disequilibrium is found only over very small map distances, close linkage between marker and disease susceptibility is implied.

However, it is a well- known result of population genetics that admixture, het- erogeneity, or stratification in a population can make it impossible to draw valid conclusions from a conven- tional case-control study, since these conditions ( popu- lation structure ) can give rise to substantial association even for unlinked loci. Accordingly, Spielman and Ewens developed the TDT to test for linkage in the presence of association that is, to distinguish this case (linkage disequilibrium) from associations that arise from population structure in the absence of linkage.

The TDT grew out of earlier proposals for avoiding incorrect conclusions from disease associations. Recog- nizing the problem, Falk and Rubinstein (1987) pro- posed the haplotype relative risk (HRR) as a family- based test for association, but they did not focus on linkage. Field et al. (1986), Thomson et al. (1989), and Thomson (1995) developed this approach further as a test for association. Ott s (1989) analysis of the mathe- matical model for the HRR was the point of departure for Spielman and Ewens development of the TDT, and Parsian et al. (1991) presented and applied a similar test without call- ing attention to its mathematical properties.

Conventional tests for linkage (e.g., LOD and ASP) require sibships with multiple offspring. In the TDT, by contrast, sibships with a single affected offspring can be used to detect linkage between disease and marker, provided that disease association with some particular marker allele is also present.

We consider a marker locus M, with two alleles M1 and M2, and obtain genotypes for affected individuals and their parents. In the most general form, the data to be analyzed are numbers of transmissions ; that is, for parents of each genotype (M1M1, M1M2, and M2M2, we determine the number of times that the M1 allele or the M2 allele was transmit- ted to an affected offspring. Spielman et al. (1993, table 2), denoted these counts as follows: a, number of times that M1Ml transmits M1 to af- fected offspring; b, number of times that M1M2 transmits M1 to af- fected offspring; c, number of times that M1M2 transmits M2 to af- fected offspring; d, number of times that M2M2 transmits M2 to af- fected offspring.

The counts may come from families that are simplex (i.e., data are from only one affected offspring), multi- plex (data are from two or more affected sibs), or multi- generational; and the population may exhibit structure. The null hypothesis of interest is that the marker and disease are unlinked. For any specified null hypothesis and a given set of data, there is a standard procedure ( Neyman-Pearson ; e.g., see Kendall and Stuart 1979) for obtaining the most powerful statistical test. Applica- tion of this procedure, for the present data and hypothe- sis, yields the TDT as the optimal test. When marker and disease are unlinked, data used in the TDT for related individuals are independent. It follows that data from a large pedigree may be used (if desired) by applying the TDT to each affected individual separately.

The TDT is carried out as follows The TDT is carried out as follows. The result obtained by applying Neyman-Pearson theory dictates that we use data (observations b and c) only from those parents who are heterozygous M1M2; this result holds regardless of whether there is population structure. The TDT statistic (b c)2 (b+c) ; it tests for equal numbers of transmissions of M1 and M2 from heterozygous parents to affected offspring. If there is linkage between marker and disease, as well as allelic association, b and c will tend to differ in value.

The TDT is carried out as follows The statistical significance of the TDT is tested by X2 ( McNemar Test ) or by the exact binomial test (see Spielman et al. 1993); a significant difference provides evidence that the marker is linked to the disease locus. Note that, if there is no linkage, alleles of M segregate independently of disease, so the presence of association (e.g., from population structure) will not cause b to dif- fer from c. Thus such an association would not lead us to infer linkage incorrectly.

The TDT is carried out as follows Similarly, when there is linkage but no association, there is also, on average, no tendency for b to differ from c. Thus the TDT can detect linkage only in the presence of association. In contrast, when tests of linkage are carried out by ASP methods, there is no requirement for association, since linkage is then detected as departure from random assortment within families. However, ASP methods do require the presence of two or more affected sibs. This restriction, which poses practical difficulties for study of diseases for which multiplex families are rare, does not apply to the TDT.

The structure of the TDT shows that one should use only heterozygous parents of total number b + c. The TDT tests whether the probabilities (0.5, 0.5) b (b+c), c (b+c) are about the same with The hypothesis is tested by a binomial (asymptotically chi-square) with one degree of freedom χ 2 = (b+c) (b 2 ) 2 + (b+c) 2 (b (b+c) 2 ) 2 (b+c) 2 = (b c)2 (b + c)