Linkage Disequilibrium in Recently Admixed Populations

Size: px
Start display at page:

Download "Linkage Disequilibrium in Recently Admixed Populations"

Transcription

1 Am. J. Hum. Genet. 60: , 1997 Mapping Genes Underlying Ethnic Differences in Disease Risk by Linkage Disequilibrium in Recently Admixed Populations Paul M. McKeigue Epidemiology Unit, Department of Epidemiology and Population Sciences, London School of Hygiene and Tropical Medicine Summary Where recent admixture has occurred between two populations that have different disease rates for genetic reasons, family-based association studies can be used to map the genes underlying these differences, if the ancestry of the alleles at each locus examined can be assigned to one of the two founding populations. This article explores the statistical power and design requirements of this approach. Markers suitable for assigning the ancestry of genomic regions could be defined by grouping alleles at closely spaced microsatellite loci into haplotypes, or generated by representational difference analysis. For a given relative risk between populations, the sample size required to detect a disease locus that accounts for this relative risk by linkage-disequilibrium mapping in an admixed population is not critically dependent on assumptions about genotype penetrances or allele frequencies. Using the transmission-disequilibrium test to search the genome for a locus that accounts for a relative risk of between 2 and 3 in a high-risk population, compared with a low-risk population, generally requires between 150 and 800 case-parent pairs of mixed descent. The optimal strategy is to conduct an - initial study using markers spaced at 10 cm with cases from the second and third generations of mixed descent, and then to map the disease loci more accurately in a subsequent study of a population with a longer history of admixture. This approach has greater statistical power than allele-sharing designs and has obvious applications to the genetics of hypertension, non-insulin-dependent diabetes, and obesity. Introduction Current attempts to map genes for complex traits are based on two main approaches: association studies using Received June 10, 1996; accepted for publication August 28, Address for correspondence and reprints: Dr. Paul M. McKeigue, Epidemiology Unit, Department of Epidemiology and Population Sciences, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom. pmckeigu@ lshtm.ac.uk 1997 by The American Society of Human Genetics. All rights reserved /97/ $ polymorphisms close to candidate genes or allele-sharing studies using affected sib pairs. Both these approaches have limitations: candidate gene studies depend on knowing which gene to look at, and allelesharing studies of a binary trait require very large sample sizes to detect a locus that contributes a recurrence risk ratio in siblings of <-1.5. An ideal method would combine the statistical power of association studies with the broad mapping resolution of linkage studies. Several authors have noted that it is possible to take advantage of the disequilibrium that results from recent admixture between two populations in which the frequencies of alleles influencing the trait are different (Chakraborty and Weiss 1988; Risch 1992; Stephens et al. 1994; Thomson 1995). Under these circumstances, linkage disequilibrium will be present over longer distances and the density of markers required for a whole genome search is less than in a population without history of recent admixture. Two previous groups have examined the statistical power and design requirements of this approach (Chakraborty and Weiss 1988; Stephens et al. 1994). These analyses were based on two assumptions that limit their application in practice. The first assumption was that disequilibrium between alleles at the marker locus and the disease locus can be measured directly because genotypes at the disease locus are easily distinguishable by the phenotypes they produce. This is an unrealistic model for the study of complex traits. In any case, if genotypes at a locus are easily distinguishable by the phenotypes they produce, the locus can be mapped by conventional linkage analysis. The second assumption made by these authors was that disequilibrium between unlinked genes could not be eliminated in the design of the association study. Gametic disequilibrium generated by admixture persists for a few generations, even when the loci are unlinked. Chakraborty and Weiss (1988) suggested that statistical tests based on the history of admixture could help establish whether disequilibrium between alleles was attributable to linkage, whereas Stephens et al. (1994) assumed that individuals whose ancestry included admixture within the last few generations would be excluded from the study. Neither of these measures is necessary if family-based study designs are used to test for association,

2 McKeigue: Mapping of Genes in Recently Admixed Groups as when analyzed correctly these studies detect disequilibrium only between linked genes (Ewens and Spielman 1995; Thomson 1995). A key advantage of family-based study designs is the ability to test for linkage disequilibrium in populations where admixture has occurred within the last few generations, because in such populations widely spaced markers can be used for a wholegenome search. The objective of this paper is to explore the practical possibilities for using linkage disequilibrium in recently admixed populations to map genes for complex traits and multifactorial diseases such as hypertension, diabetes, and obesity. The rationale for this approach is that admixture between populations that have markedly different disease rates or trait values for genetic reasons will generate linkage disequilibrium between genes influencing the trait and other alleles that have different frequencies in the two founding populations. A first step therefore is to consider the criteria for deciding whether ethnic differences in disease rates are likely to have a genetic explanation. Distinguishing between Genetic and Environmental Explanations for Ethnic Differences in Disease Rates Even where familial aggregation studies demonstrate that a trait has high heritability or high risk of recurrence in relatives of cases, it does not necessarily follow that ethnic differences in disease risk are attributable to genetic influences. The classical epidemiological approach to distinguishing between genetic and environmental explanations for between-population differences in disease rates is to study migrants (Reid 1971). When migrant groups living in the same environment have different disease rates that are not accounted for by adjusting for known environmental determinants of disease risk, genetic explanations should be considered. Genetic explanations are most likely where differences in disease rates persist even in migrants who have been settled outside the home country for several generations and where such differences are consistently found in all countries where the migrant group has settled. On these criteria, genetic factors are likely to underlie the high rates of coronary heart disease and non-insulin-dependent diabetes that have been reported in people of South Asian (Indian, Pakistani, Bangladeshi, and Sri Lankan) descent settled overseas (McKeigue et al. 1989). In contrast, genetic factors are unlikely to account for the low rates of coronary heart disease, colorectal cancer, and breast cancer in native Japanese compared with U.S. Whites; in second- and third-generation Japanese migrants to the United States, the rates of these diseases are close to those in U.S. Whites (Dunn 1975; Gordon 1982). The most-powerful technique for distinguishing between environmental and genetic explanations for ethnic differences in disease risk is to study populations in which the proportion of admixture of genes from highrisk and low-risk populations varies. If disease risk is correlated with the proportion of genes originating from the high-risk population, independently of environmental factors, this is consistent with genetic influences. Thus, in Mexican-Americans, prevalence of non-insulin-dependent diabetes is correlated with the percent of Native American admixture (Chakraborty et al. 1986). Some examples of ethnic differences in trait values that are likely to be attributable to genetic influences are listed in table 1. It is curious that hypertension, noninsulin-dependent diabetes, and obesity should be so prominent among the conditions for which genetic differentiation has apparently led to ethnic differences in disease risk. This may be because genes influencing risk of these diseases also influence traits on which there has been differential selection pressure, such as the ability to survive famine (Neel 1962). If there is less evidence of ethnic differences in cancer risk that are attributable to genetic influences, this may be because genes influencing cancer risk do not influence other traits on which there has been differential selection pressure. Whom to Study 189 European maritime expansion beginning in the fifteenth century led to admixture on a large scale with Native American, West African, and Pacific populations. Admixture in Asia has a longer history: thus, for instance, early Hindu sources refer to admixture between the inhabitants of Punjab and invaders from southwestern Asia who arrived around 1200 B.C. (Tinker 1989). It is not unusual for the offspring of mixed unions to form an endogamous subpopulation, isolated from the two founding populations by geographic or social barriers. An example is the Anglo-Indian population, descended from eighteenth-century intermarriages between Indian women and British officers of the East India Company (Younger 1987). Although searches of the whole genome may be feasible only in populations where admixture has occurred recently, studying populations with a longer history of admixture may help map disease loci more accurately. Because the power of the study depends on the relative risk between populations, the case definition and the distribution of cases by age and sex should be chosen to correspond with the categories that maximize the relative risk when the two populations are compared. For instance, to detect loci underlying the high rates of hypertension in people of West African descent compared with Europeans, the optimal group to study would be adult women, because the ethnic difference in blood pressure is largest in this group (Chaturvedi et al. 1993;

3 190 Table 1 Ethnic Differences in Disease Rates That Are Likely To Have a Genetic Component Am. J. Hum. Genet. 60: , 1997 DISEASE Non-Insulin-Dependent Diabetes Hypertension Obesity Coronary Heart Disease High-risk ethnic Many non-european groups, West Africans (Prineas Native Americans, South Asians groups including South Asians, West and Gillum 1985) Peninsular Arabs, (McKeigue et al. 1989) Africans, Peninsular Arabs, Pacific islanders, Aboriginal Australians, Pacific Aboriginal Australians, islanders, and Native West African women Americans (Zimmet 1992) (Hodge and Zimmet 1994); South Asians (central adiposity) (McKeigue et al. 1991) Low-risk ethnic Northern Europeans West African men groups (Miller et al. 1989) Typical relative risks 4 to 7 2 to 3 2 to 4 2 to 4 between high-risk and low-risk populations Manatunga et al. 1993) even though heritability of blood pressure is generally lower in adults than in children (Perusse et al. 1989). What Size of Relative Risk Should One Look For? On the assumption of a multiplicative model for the relative risks associated with combinations of genotypes at different loci, the relative risk associated with ethnicity can be partitioned into locus-specific relative risks: thus, two loci accounting for relative risks of 2 and 3 would generate a relative risk of 6. Usually we can only guess the number of loci likely to be involved, but the following argument suggests that loci where genetic variation accounts for ethnic differences in disease risk will generally be fewer in number than loci where genetic variation accounts for familial aggregation of disease risk within populations. Genes that underlie ethnic differences in the risk of a disease such as non-insulindependent diabetes are likely to influence traits for which differential selection pressure has occurred, such as ability to survive famine (Neel 1962). Presumably not all genetic loci influencing diabetes risk also influence traits on which there has been differential selection pressure, so only a subset of the loci where genetic variation influences the risk of diabetes will underlie ethnic differences in diabetes prevalence. For the examples in table 1, where the relative risks of disease in high-risk compared with low-risk populations are generally between 2 and 5, it is reasonable to begin by looking for loci where ethnic differences in allele frequencies account for relative risks of 2 to 3, about one-third to one-half of the observed ethnic difference in disease risk. It is possible that ethnic differences in disease risk depend on a balance between opposing effects at different loci: thus, in a population where selection pressure has led to high frequencies of an allele causing insulin resistance, there may have been selection for other traits such as increased beta-cell reserves, which allow glucose homeostasis to be maintained. In this case, the relative risk of diabetes attributable to a locus influencing insulin resistance could be larger than the total relative risk of diabetes between the two populations. Expected Frequency of Transmission of a Marker Allele from a Heterozygous Parent to Affected Offspring We first examine the properties of the transmissiondisequilibrium test, which tests for excess frequency of transmission of a marker allele to affected offspring from heterozygous parents (Ewens and Spielman 1995). The expected deviation from 50% transmission of the marker allele can be calculated for a simple two-locus model as follows, assuming random mating within the population under study. Suppose that there is linkage disequilibrium between alleles D1 and D2 which have frequencies PD and qd at a disease locus D, and alleles A1 and A2 which have frequencies PA and qa at a nearby marker locus A. With a gametic disequilibrium coefficient of A, the conditional probability R of a D1 allele given that the gamete carries an A1 allele is PD + A/PA, and the conditional probability S of a D1 allele given that the gamete carries an A2 allele is PD - A/qA. The expected frequency of transmission of the A1 al-

4 McKeigue: Mapping of Genes in Recently Admixed Groups lele from heterozygous parents to affected offspring is the probability H that the A1 allele is transmitted, given that the parental genotype is A1A2 and the offspring is affected. If gi is the risk of disease in individuals who have i A1 alleles, then from the rules of conditional probability we have ~qa + PAg2 (1) qa9g + g1 + PA92 If fi is the penetrance of the D genotype with i D1 alleles, then, assuming random mating, we can express gi in terms of fi, R and S as where g2= R2U + 2RV + W g= RSU + (R + S)V+ W, go = S2U + 2SV + W U = f2-2fi + fo, V = fi - fo, and W = fo. In an additive model, there is a linear relationship between penetrance and the number of copies of the D1 allele, so that U = 0 (Kempthorne 1969). In a fully recessive model, one copy of the D1 allele has the same genotypic value as no copies, so that V = 0. In a fully dominant model, one copy of the D1 allele has the same genotypic value as two copies, so that U + V = 0. Substituting the expressions for g2, g1, and go into equation (1) above, we have PD(PD + A/PA)U + (2PD + A/PA)V + W (2) PD(2PD + A/PA - A/qA)U + (4PD + A/PA - AlqA)V + 2W This expression depends only on the frequency of the D1 allele, the frequency of the marker allele A1, the gametic disequilibrium coefficient, and the penetrance ratios f[2fo and fi/fo. Thus, the expected transmission frequency does not depend on whether the disease is rare or common. When PA = qa = 0.5, equation (2) simplifies to 'l = A(PDU + V) (3) U +2PDV+W For an additive model, U = 0 and the expression simplifies further to =DAV (4) 2PDV + W' 191 Application of the Transmission-Disequilibrium Test to a Recently Admixed Population Consider two populations X and Y, where the frequency PDY of a disease-predisposing allele D1 in population Y is higher than the frequency PDX of this allele in population X, so that the relative risk of disease in population Y compared with population X is r. For simplicity, we restrict the analysis to an endogamous group formed by the offspring of mixed unions between population X and population Y. As noted earlier, this is not an unrealistic situation. The frequency of the D1 allele in this equally admixed population will be PD = (PDX + PDY)/2 - Suppose we examine a nearby marker locus A where the frequency of the A1 allele is higher (PAY) in population Y than in population X (frequency PAX). In the gametes that give rise to the first generation of mixed descent, the gametic disequilibrium coefficient is A1 = (PAY - PAX)(PDY - PDX)/4. After a further n - 2 generations of random mating, the disequilibrium coefficient A, - 1 in the gametes that produce the (n - 1)th generation of mixed descent will be An-1= (1-20)(1-0) 3A 1 where 0 is the recombination fraction between the two loci. The usual relation Ai + 1 = (1-0)Ai does not apply between the first and second generations, because the alleles are not in Hardy-Weinberg equilibrium until the second generation. If a transmission-disequilibrium test is applied to these parents and their affected offspring in the nth generation of mixed descent, the disequilibrium coefficient An in the gametes from heterozygous parents that produce these offspring will be An = (1-20)2(1-0)n 11/4pAqA Again the usual relation Ai + 1 = (1-0)Ai does not apply, because the test is restricted to parents heterozygous at the marker locus. In the gametes that give rise to these heterozygous parents, the disequilibrium coefficient is 114PAqA times the coefficient in the general population. In the gametes produced by these heterozygous parents, the disequilibrium coefficient is (1-20) times the coefficient in the parents themselves. Substituting these expressions for PD, PA, and A into equation (2) above gives the expected frequency of transmission of the A1 allele to affected offspring in the admixed population.

5 192 In practice, we would try to choose marker loci where certain alleles occur only in population X, or only in population Y. The optimal strategy is to use information from several closely spaced flanking markers, where necessary, to assign the parental alleles at each locus to X by descent or Y by descent. The transmission-disequilibrium test will then use at each locus only those parents who have one allele X by descent and one allele Y by descent. This assignment of marker alleles to X by descent or Y by descent is equivalent to using a marker locus A where the frequency PAX of the A1 allele in population X is 0 and the frequency PAY in population Y is 1. In an equally admixed population, this will be equivalent to PA = qa= 0.5, and the gametic disequilibrium coefficient in the first generation of mixed descent will be given by A = (PDY - PDX)/4. In practice, we would have no idea of the likely difference between allele frequencies PDX and PDY, but from epidemiological studies we will have an estimate of the relative risk r of disease in population Y compared with population X. The relationship between PDX and PDY is then YU + 2PDYV + W = r(p2xu + 2PDXV + W). Solving for PDY and substituting the result into equation (3) yields a value of HI in terms of r, the relative risk in population Y compared with population X, the frequency PDX of the disease predisposing allele in the low-risk population, and the risk ratios [2/fo and fi/fo. This transmission probability represents the maximal effect detectable in an equally admixed population, where the marker locus is close to the disease locus and the ancestry of parental alleles at the marker locus can be assigned accurately. If the effect of the D1 allele is additive, U = 0 and the expressions for PD and A simplify to and PD = [(r + 1)PDX + (r - 1)W/2V]/2 A = (r - 1)(2PDX + W/V)/8. Substituting these values for PD and A into equation (4), we have n = (r ) (5) 4(r +1)(5 Thus, with an additive model, an equally admixed Am. J. Hum. Genet. 60: , 1997 population, and markers that allow ancestry of parental marker alleles to be assigned accurately, the probability of transmission (and hence the sample size required to detect linkage) depends only on r, the relative risk between the two populations accounted for by the locus. For a given value of r, the required sample size is independent of any other assumptions about allele frequencies or penetrance. Sample Size For a given type I and type II error, the required sample size can be calculated from the standard test of the null hypothesis that a binomial probability is 0.5. Where the specific objective of the study is to detect loci that account for higher disease rates in one of the two founding populations, it is appropriate to use a one-sided test. For instance, in a study of a population of mixed Indian and European descent, we would plan to detect loci where alleles Indian by descent are transmitted to diabetic offspring more often than expected by chance, but we would probably ignore any loci where there was an excess transmission to diabetic offspring of alleles European by descent. With a one-sided type I error probability a and type II error probability of A, the required sample size n for a transmission probability HI is given by [0.5Za +H/(1 - Hi)Z1 12 n = Hl -0.5 (6) For an initial search of the whole genome, it is reasonable to set a type I error probability that will often generate one or two false-positive results, because any regions that show evidence of linkage disequilibrium will be followed up in further studies to map the loci more precisely. The threshold value of a that will yield one expected false-positive result in a whole genome search with closely spaced markers can be calculated from the formula given by Lander and Kruglyak (1995), which uses the crossover rate p between the genotypes being compared. In a study of the grandchildren of mixed unions, the crossover rate is 1, and setting the value of a to will give one expected false-positive result in a whole-genome search. In studies of populations with a longer history of admixture, the crossover rate will be higher. This will give finer mapping resolution but increase the effective number of independent statistical tests so that a lower value of a will be required to keep the expected number of false-positive results down to one. The sample sizes given here are based on a requirement of 90% power (i.e., i = 0.1), and a one-sided a of (equivalent to one expected false-positive result in a study of the grandchildren of mixed unions).

6 McKeigue: Mapping of Genes in Recently Admixed Groups On the assumption of an additive model, 633 case-parent pairs are required to detect a locus that accounts for a relative risk of 2 between the two founding populations, 381 case-parent pairs are required to detect a locus that generates a relative risk of 2.5, and 279 case-parent pairs are required to detect a locus that generates a relative risk of 3. The effects of assuming either a fully recessive model (fi = fo) or a fully dominant model (f2 = ft), and varying the frequency PDX of the D1 allele in population X are shown in table 2. To account for a relative risk of 2 in population Y compared with population X, two alternative values for the genotype risk ratio f2/fo are assumed: 5 and 10. The value of PDX is then allowed to vary from zero to the maximum possible value compatible with a relative risk of 2 between the two populations. Assuming a fully recessive model rather than an additive model reduces the required sample size by a fraction that depends on PDX. In the most extreme case, in which the D1 allele is absent in population X (i.e., PDX = 0), the Table 2 Sample Size Required To Detect a Locus That Generates a Relative Risk of 2 in Population Y Compared with Population X No. OF CASE-PARENT PAIRS f2/fo = 5 f2/fo = 10 PDX A. Fully Recessive Model (fo = fl) B. Fully Dominant Model (ft = f2) NOTE. -f2, fl, and fo are penetrances of genotypes D1D1, DID2, and D2D2, respectively. PDX is frequency of Di allele in population X. Sample sizes are based on 90% power and one-sided P-value of For an additive model and a relative risk of 2, required sample size is 633 case-parent pairs, whatever the values of [2, fi, fo, and PDXrequired sample size is reduced by about one-third. Assuming a fully dominant model rather than an additive model increases the required sample size, but, even on the most unfavorable assumption (a frequency of PDX so high that PDY is close to unity), the required sample size does not increase by more than about one-third. The power of the admixture disequilibrium study design is thus remarkably robust, whatever the underlying genetic model. For comparison, we can examine the power of an affected sib-pair study designed to detect such a locus, using the method described by Risch and Zhang (1995) for calculating the power of a study using pairs of sibs who have been selected on the values of a trait. The following calculations are based on 90% power to detect a locus where an additive genetic model with f2/fo = 5 generates a relative risk of 2 between population Y and population X, with a type I error rate equivalent to one expected false-positive result in a whole genome search. In this situation the required sample size depends critically on the frequency of the D1 allele: if the D1 allele is rare (which is possible in population X) or occurs with high frequency (which is possible in population Y), tens of thousands of sib pairs are required. Even in the most favorable situation, where the D1 allele frequency is -0.25, >1,500 affected sib pairs would be required: this is -10 times the sample size required for the admixture disequilibrium study design. Allele-sharing designs have less statistical power than association studies because large risk ratios between genotypes at a disease locus generate only modest recurrence risk ratios in relatives: thus, for an additive genetic model with f2/fo = 5 and a frequency of 0.25 for the D1 allele, the recurrence risk ratio in siblings is only 1.2. Misclassification of allele ancestry at the marker locus would seriously weaken the power to detect linkage disequilibrium in a recently admixed population. Thus, if the ancestry of 10% of alleles is misclassified, the gametic disequilibrium coefficient would be reduced by -20% and the sample size required to detect a disease locus would be increased by 56%. It is thus a crucial requirement of this design to be able to classify accurately the alleles at each locus as X by descent or Y by descent. What Markers Should Be Used and at What Density? In contrast to allele-sharing studies, marker loci for linkage-disequilibrium mapping do not have to be highly polymorphic: RFLPs would be suitable. The objective is to be able to classify the two alleles at a parental locus as XX by descent, XY by descent, or YY by descent. The first step in identifying such markers would be to assemble a bank of samples from each founding population and to look for marker alleles that occur in only

7 194 one of the two populations. Pooling of DNA samples from each population would speed this. The genetic distance between West Africans and Europeans is large enough for this to be feasible: '-8% of RFLPs with two or more alleles in Europeans have only one allele in West Africans (Poloni et al. 1995). Grouping the alleles at several closely spaced marker loci into haplotypes would allow ancestry of genomic regions to be assigned, since most haplotypes will contain at least one allele unique to one of the two founding populations. Groups of closely spaced microsatellite markers could be used to define such haplotypes. An alternative method of generating markers suitable for assigning ancestry would be to use representational difference analysis (Lisitsyn and Wigler 1993). Using pooled DNA from a large sample of individuals from population X to drive hybridization against DNA from population Y would detect base sequences present in population Y but absent in population X. By restricting the sample from population Y to two or three individuals, the chance of detecting base sequences that are rare in population Y (and therefore less useful for assigning ancestry) would be minimized. The strength of this approach is that it could be applied when the genetic distance between the two founding populations is too small for ancestry of genomic regions to be assigned using currently available markers. If base sequences that can be used to define ancestry exist, representational difference analysis can be relied on to detect them. The disadvantage of using representational difference analysis is that the base sequences would have to be mapped before they could be used as markers. The distance required between markers (or between groups of closely spaced markers used to define haplotypes) depends on the number of generations since admixture. Equations (5) and (6) show that the required sample size is approximately proportional to the square of the gametic disequilibrium coefficient A. In the grandchildren of mixed unions, the coefficient A2 will be decreased by a factor of (1-20) from its initial value A1 in their parents. After n generations since the mixed unions occurred (n > 2), the gametic disequilibrium coefficient An between marker locus and disease locus in the offspring of heterozygous parents will be (1-20)2(1-9)n - 3A1. Figure 1 shows the decay of the gametic disequilibrium coefficient A as a function of the distance from marker to disease locus and the number of generations since admixture. In the second and third generations of mixed descent, marker spacing of 10 cm will ensure that there is at least one marker locus at which A is within 20% of its initial value. In a population such as Anglo-Indians, where the main admixture occurred -10 generations ago (Younger 1987), marker spacing of o-4 cm would be necessary to ensure that the value of A was within 20% of its initial value. Interval E 0D._,l)._ _ 4- C 0 U- Al 1 Am. J. Hum. Genet. 60: , 1997 / /. ~~/ /.~ /: ///.1, I' / I-, / \,. \ IN, A2 A3 A5 A'-A Map distance (cm) from marker to disease locus Figure 1 Decay of gametic disequilibrium coefficient from its initial value as a function of distance between marker and disease locus and the number of generations since admixture. Al is the coefficient of disequilibrium in the gametes that produce the first generation of mixed descent, and A2, A3, A5, and Ajo are the coefficients of disequilibrium in the gametes from heterozygous parents that produce the 2d, 3d, 5th, and 10th generations of mixed descent. mapping techniques could maximize the information obtained at a given spacing of markers. Discussion In contrast to allele-sharing studies, which detect loci that contribute to familial aggregation of disease risk within populations, linkage-disequilibrium mapping in recently admixed populations would specifically detect the loci of genes that underlie ethnic differences in disease risk. One strength of the admixture disequilibrium mapping approach is that it is usually possible to assemble a sample in which several traits can be examined simultaneously. For instance, many populations with high rates of non-insulin-dependent diabetes also have high rates of hypertension and obesity. In an admixed population cases with more than one of these conditions are easy to find. The statistical analysis can then test for linkage disequilibrium with each condition separately, or with a combination of conditions such as diabetes and hypertension, which are likely to share genetic influences. With an allele-sharing design, the practical problems of identifying sib pairs affected with the same combination of conditions makes this more difficult. In the first generation of mixed descent, all individuals will have one allele X by descent and one allele Y by descent: all parents in this generation can therefore be included in a test of unequal transmission of alleles X by descent and Y by descent to affected offspring, pro- ""

8 McKeigue: Mapping of Genes in Recently Admixed Groups 195 vided that the ancestry of their alleles can be assigned accurately. In these first generation offspring of mixed unions it is sufficient to be able to identify one of the two alleles at each locus as X or Y by descent, because the ancestry of the other allele is then assigned by default. In subsequent generations where Hardy-Weinberg equilibrium is reached, only half the parents (in an equally admixed population) will have one allele X by descent and one allele Y by descent at any given locus. Thus, studying cases who are the grandchildren of mixed unions, rather than subsequent generations, not only maximizes the distances over which linkage disequilibrium can be detected but also doubles the effective sample size (number of heterozygous parents) obtained from a given number of affected offspring. The optimal design is therefore to conduct an initial study with the affected grandchildren of mixed unions and their parents. Regions of the genome that show evidence of linkage disequilibrium in such an initial study can then be examined in detail in a population with a longer history of admixture, which allows the disease loci to be mapped more accurately. Although the statistical power calculations in this paper are based on assuming random mating within an equally admixed population, the study design is valid even if there is nonrandom mating and unequal admixture between the two populations. For instance, cases who have one parent from population X and one parent who is the offspring of an XY union could be included, although such cases will contribute less information than those who have two parents of mixed descent. Admixture results in gametic disequilibrium between alleles at any pair of loci where the allele frequencies in the two founding populations are different, whether or not these loci are linked. With random mating, this gametic disequilibrium decays by a factor of (1-0) in each generation. If the loci are unlinked (O = 0.5), gametic disequilibrium will be halved in each generation. Thus, when admixture has occurred within the last two or three generations, disequilibrium between unlinked genes will be present, and population-based association studies cannot yield unequivocal evidence of linkage. The basis of the transmission-disequilibrium test's ability to eliminate disequilibrium between unlinked genes is that it uses information from heterozygous parents only. As noted earlier, when the parents are heterozygous at the marker locus, gametic disequilibrium decays by a factor of (1-20) in a single generation; when 0 = 0.5, the disequilibrium in the gametes produced by heterozygous parents is zero. Excess transmission of a marker allele from heterozygous parents to affected offspring is thus unequivocal evidence of linkage between the marker and an allele influencing susceptibility, even in a recently admixed population where stratification, nonrandom mating, and disequilibrium between alleles at unlinked loci are present (Ewens and Spielman 1995). Where parents are not available, as with diseases that affect mainly older adults, parental genotypes can be inferred from the genotypes of cases' sibs. The transmission-disequilibrium test is equivalent to analyzing a matched-pair case-control study in which the cases are the transmitted alleles, the controls are the untransmitted alleles, the exposure is the allele type, and each case is matched to a control within a stratum defined by the affected offspring-parent pair. As in a matched case-control study, conditional logistic regression (Breslow and Day 1980) can be used for the analysis, instead of a simple binomial proportion test; it is thus possible to test for gene-environment interactions, if covariates have been measured on the cases. Where suitable admixed populations exist, the main technical problem in undertaking a linkage-disequilibrium study is to assemble a set of markers from which ancestry of alleles can be accurately assigned to one of the two founding populations. It remains to be established whether the most efficient means of defining such markers is by haplotyping based on known microsatellite markers or by using representational difference analysis to detect base sequences unique to one of the two founding populations. If suitable markers can be identified, this analysis shows that linkage-disequilibrium mapping in recently admixed populations has much greater power than allele-sharing designs to detect loci contributing to ethnic differences in disease rates. With realistic sample sizes it is possible to detect loci that would account for one-third to one-half of the observed ethnic differences in prevalence of conditions such as hypertension, non-insulin-dependent diabetes, or obesity. Acknowledgments I am grateful to Tim Aitman and Steve Bennett for discussion and comments. References Breslow NE, Day NE (1980) Statistical methods in cancer research. Vol 1: The analysis of case-control studies. International Agency for Research on Cancer, Lyon, pp Chakraborty R, Ferrell RE, Stern MP, Haffner SM, Hazuda HP, Rosenthal M (1986) Relationship of prevalence of noninsulin-dependent diabetes mellitus to Amerindian admixture in the Mexican Americans of San Antonio, Texas. Genet Epidemiol 3: Chakraborty R, Weiss KM (1988) Admixture as a tool for finding linked genes and detecting that difference from allelic association between loci. Proc Natl-Acad Sci USA 85: Chaturvedi N. McKeigue PM, Marmot MG (1993) Resting

9 196 Am. J. Hum. Genet. 60: , 1997 and ambulatory blood pressure differences in Afro-Caribbeans and Europeans. Hypertension 22:90-96 Dunn JE Jr (1975) Cancer epidemiology in populations of the United States-with emphasis on Hawaii and Californiaand Japan. Cancer Res 35: Ewens WJ, Spielman RS (1995) The transmission/disequilibrium test: history, subdivision, and admixture. Am J Hum Genet 57: Gordon T (1982) Further mortality experience among Japanese Americans. Public Health Rep 97: Hodge AM, Zimmet PZ (1994) The epidemiology of obesity. Bailleres Clin Endocrinol Metab 8: Kempthorne 0 (1969) An introduction to genetic statistics, 2d ed. Iowa State University Press, Ames, pp Lander E, Kruglyak L (1995) Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results. Nat Genet 11: Lisitsyn N, Wigler M (1993) Cloning the differences between two complex genomes. Science 259: Manatunga AK, Jones JJ, Pratt JH (1993) Longitudinal assessment of blood pressures in black and white children. Hypertension 22:84-89 McKeigue PM, Miller GJ, Marmot MG (1989) Coronary heart disease in South Asians overseas: a review. J Clin Epidemiol 42: McKeigue PM, Shah B, Marmot MG (1991) Relation of central obesity and insulin resistance with high diabetes prevalence and cardiovascular risk in South Asians. Lancet 337: Miller GJ, Beckles GLA, Maude GH, Carson DC, Alexis SD, Price SGL, Byam NTA (1989) Ethnicity and other characteristics predictive of coronary heart disease in a developing country: principal results of the St. James survey, Trinidad. Int J Epidemiol 18: Neel JV (1962) Diabetes mellitus: a "thrifty" genotype rendered detrimental by "progress." Am J Hum Genet 14: Perusse L, Rice T, Bouchard C, Vogler GP, Rao DC (1989) Cardiovascular risk factors in a French-Canadian population: resolution of genetic and familial environmental effects on blood pressure by using extensive information on environmental correlates. Am J Hum Genet 45: Poloni ES, Excoffier L, Mountain JL, Langaney A, Cavalli- Sforza LL (1995) Nuclear DNA polymorphism in a Mandenka population from Senegal: comparison with eight other human populations. Ann Hum Genet 59:43-61 Prineas RJ, Gillum R (1985) U.S. epidemiology of hypertension in blacks. In: Hall WD, Saunders E, Shulman NB (eds) Hypertension in blacks: epidemiology, pathophysiology, and treatment. Year Book, Chicago, pp Reid DD (1971) The future of migrant studies. Israel J Med Sci 12: Risch N (1992) Mapping genes for complex diseases using association studies with recently admixed populations. Am J Hum Genet Suppl 51:13 Risch N. Zhang H (1995) Extreme discordant sib pairs for mapping quantitative trait loci in humans. Science 268: Stephens JC, Briscoe D, O'Brien SJ (1994) Mapping by admixture linkage disequilibrium in human populations: limits and guidelines. Am J Hum Genet 55: Thomson G (1995) Mapping disease genes: family-based association studies. Am J Hum Genet 57: Tinker H (1989) South Asia: a short history, 2d ed. Macmillan, Basingstoke, pp 4-5 Younger C (1987) Anglo-Indians: neglected children of the Raj. BR Publications, Delhi Zimmet PZ (1992) Kelly West lecture 1991: challenges in diabetes epidemiology from West to the rest. Diabetes Care 15:

Introduction to linkage and family based designs to study the genetic epidemiology of complex traits. Harold Snieder

Introduction to linkage and family based designs to study the genetic epidemiology of complex traits. Harold Snieder Introduction to linkage and family based designs to study the genetic epidemiology of complex traits Harold Snieder Overview of presentation Designs: population vs. family based Mendelian vs. complex diseases/traits

More information

Transmission Disequilibrium Test in GWAS

Transmission Disequilibrium Test in GWAS Department of Computer Science Brown University, Providence sorin@cs.brown.edu November 10, 2010 Outline 1 Outline 2 3 4 The transmission/disequilibrium test (TDT) was intro- duced several years ago by

More information

Dan Koller, Ph.D. Medical and Molecular Genetics

Dan Koller, Ph.D. Medical and Molecular Genetics Design of Genetic Studies Dan Koller, Ph.D. Research Assistant Professor Medical and Molecular Genetics Genetics and Medicine Over the past decade, advances from genetics have permeated medicine Identification

More information

(b) What is the allele frequency of the b allele in the new merged population on the island?

(b) What is the allele frequency of the b allele in the new merged population on the island? 2005 7.03 Problem Set 6 KEY Due before 5 PM on WEDNESDAY, November 23, 2005. Turn answers in to the box outside of 68-120. PLEASE WRITE YOUR ANSWERS ON THIS PRINTOUT. 1. Two populations (Population One

More information

Decomposition of the Genotypic Value

Decomposition of the Genotypic Value Decomposition of the Genotypic Value 1 / 17 Partitioning of Phenotypic Values We introduced the general model of Y = G + E in the first lecture, where Y is the phenotypic value, G is the genotypic value,

More information

CS2220 Introduction to Computational Biology

CS2220 Introduction to Computational Biology CS2220 Introduction to Computational Biology WEEK 8: GENOME-WIDE ASSOCIATION STUDIES (GWAS) 1 Dr. Mengling FENG Institute for Infocomm Research Massachusetts Institute of Technology mfeng@mit.edu PLANS

More information

GENETIC LINKAGE ANALYSIS

GENETIC LINKAGE ANALYSIS Atlas of Genetics and Cytogenetics in Oncology and Haematology GENETIC LINKAGE ANALYSIS * I- Recombination fraction II- Definition of the "lod score" of a family III- Test for linkage IV- Estimation of

More information

White Paper Estimating Genotype-Specific Incidence for One or Several Loci

White Paper Estimating Genotype-Specific Incidence for One or Several Loci White Paper 23-01 Estimating Genotype-Specific Incidence for One or Several Loci Authors: Mike Macpherson Brian Naughton Andro Hsu Joanna Mountain Created: September 5, 2007 Last Edited: November 18, 2007

More information

Ascertainment Through Family History of Disease Often Decreases the Power of Family-based Association Studies

Ascertainment Through Family History of Disease Often Decreases the Power of Family-based Association Studies Behav Genet (2007) 37:631 636 DOI 17/s10519-007-9149-0 ORIGINAL PAPER Ascertainment Through Family History of Disease Often Decreases the Power of Family-based Association Studies Manuel A. R. Ferreira

More information

Human population sub-structure and genetic association studies

Human population sub-structure and genetic association studies Human population sub-structure and genetic association studies Stephanie A. Santorico, Ph.D. Department of Mathematical & Statistical Sciences Stephanie.Santorico@ucdenver.edu Global Similarity Map from

More information

BST227 Introduction to Statistical Genetics. Lecture 4: Introduction to linkage and association analysis

BST227 Introduction to Statistical Genetics. Lecture 4: Introduction to linkage and association analysis BST227 Introduction to Statistical Genetics Lecture 4: Introduction to linkage and association analysis 1 Housekeeping Homework #1 due today Homework #2 posted (due Monday) Lab at 5:30PM today (FXB G13)

More information

Systems of Mating: Systems of Mating:

Systems of Mating: Systems of Mating: 8/29/2 Systems of Mating: the rules by which pairs of gametes are chosen from the local gene pool to be united in a zygote with respect to a particular locus or genetic system. Systems of Mating: A deme

More information

Using ancestry estimates as tools to better understand group or individual differences in disease risk or disease outcomes

Using ancestry estimates as tools to better understand group or individual differences in disease risk or disease outcomes Using ancestry estimates as tools to better understand group or individual differences in disease risk or disease outcomes Jill Barnholtz-Sloan, Ph.D. Assistant Professor Case Comprehensive Cancer Center

More information

PopGen4: Assortative mating

PopGen4: Assortative mating opgen4: Assortative mating Introduction Although random mating is the most important system of mating in many natural populations, non-random mating can also be an important mating system in some populations.

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Fig 1. Comparison of sub-samples on the first two principal components of genetic variation. TheBritishsampleisplottedwithredpoints.The sub-samples of the diverse sample

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

Effects of Stratification in the Analysis of Affected-Sib-Pair Data: Benefits and Costs

Effects of Stratification in the Analysis of Affected-Sib-Pair Data: Benefits and Costs Am. J. Hum. Genet. 66:567 575, 2000 Effects of Stratification in the Analysis of Affected-Sib-Pair Data: Benefits and Costs Suzanne M. Leal and Jurg Ott Laboratory of Statistical Genetics, The Rockefeller

More information

Lecture 1 Mendelian Inheritance

Lecture 1 Mendelian Inheritance Genes Mendelian Inheritance Lecture 1 Mendelian Inheritance Jurg Ott Gregor Mendel, monk in a monastery in Brünn (now Brno in Czech Republic): Breeding experiments with the garden pea: Flower color and

More information

SEX. Genetic Variation: The genetic substrate for natural selection. Sex: Sources of Genotypic Variation. Genetic Variation

SEX. Genetic Variation: The genetic substrate for natural selection. Sex: Sources of Genotypic Variation. Genetic Variation Genetic Variation: The genetic substrate for natural selection Sex: Sources of Genotypic Variation Dr. Carol E. Lee, University of Wisconsin Genetic Variation If there is no genetic variation, neither

More information

Complex Multifactorial Genetic Diseases

Complex Multifactorial Genetic Diseases Complex Multifactorial Genetic Diseases Nicola J Camp, University of Utah, Utah, USA Aruna Bansal, University of Utah, Utah, USA Secondary article Article Contents. Introduction. Continuous Variation.

More information

Population Structure in Admixed Populations: Effect of Admixture Dynamics on the Pattern of Linkage Disequilibrium

Population Structure in Admixed Populations: Effect of Admixture Dynamics on the Pattern of Linkage Disequilibrium Am. J. Hum. Genet. 68:198 207, 2001 Population Structure in Admixed Populations: Effect of Admixture Dynamics on the Pattern of Linkage Disequilibrium C. L. Pfaff, 1 E. J. Parra, 1 C. Bonilla, 1 K. Hiester,

More information

HARDY- WEINBERG PRACTICE PROBLEMS

HARDY- WEINBERG PRACTICE PROBLEMS HARDY- WEINBERG PRACTICE PROBLEMS PROBLEMS TO SOLVE: 1. The proportion of homozygous recessives of a certain population is 0.09. If we assume that the gene pool is large and at equilibrium and all genotypes

More information

Your DNA extractions! 10 kb

Your DNA extractions! 10 kb Your DNA extractions! 10 kb Quantitative characters: polygenes and environment Most ecologically important quantitative traits (QTs) vary. Distributions are often unimodal and approximately normal. Offspring

More information

Allowing for Missing Parents in Genetic Studies of Case-Parent Triads

Allowing for Missing Parents in Genetic Studies of Case-Parent Triads Am. J. Hum. Genet. 64:1186 1193, 1999 Allowing for Missing Parents in Genetic Studies of Case-Parent Triads C. R. Weinberg National Institute of Environmental Health Sciences, Research Triangle Park, NC

More information

During the hyperinsulinemic-euglycemic clamp [1], a priming dose of human insulin (Novolin,

During the hyperinsulinemic-euglycemic clamp [1], a priming dose of human insulin (Novolin, ESM Methods Hyperinsulinemic-euglycemic clamp procedure During the hyperinsulinemic-euglycemic clamp [1], a priming dose of human insulin (Novolin, Clayton, NC) was followed by a constant rate (60 mu m

More information

Introduction to the Genetics of Complex Disease

Introduction to the Genetics of Complex Disease Introduction to the Genetics of Complex Disease Jeremiah M. Scharf, MD, PhD Departments of Neurology, Psychiatry and Center for Human Genetic Research Massachusetts General Hospital Breakthroughs in Genome

More information

Indian Journal of Nephrology Indian J Nephrol 2001;11: 88-97

Indian Journal of Nephrology Indian J Nephrol 2001;11: 88-97 88 Indian Journal of Nephrology Indian J Nephrol 2001;11: 88-97 ARTICLE HLA gene and haplotype frequency in renal transplant recipients and donors of Uttar Pradesh (North India) S Agrawal, AK Singh, RK

More information

Supplementary Figure 1. Principal components analysis of European ancestry in the African American, Native Hawaiian and Latino populations.

Supplementary Figure 1. Principal components analysis of European ancestry in the African American, Native Hawaiian and Latino populations. Supplementary Figure. Principal components analysis of European ancestry in the African American, Native Hawaiian and Latino populations. a Eigenvector 2.5..5.5. African Americans European Americans e

More information

Diabetologia 9 Springer-Verlag 1993

Diabetologia 9 Springer-Verlag 1993 Diabetologia (1993) 36:234-238 Diabetologia 9 Springer-Verlag 1993 HLA-associated susceptibility to Type 2 (non-insulin-dependent) diabetes mellitus: the Wadena City Health Study S. S. Rich 1, L. R. French

More information

Mendelian Inheritance. Jurg Ott Columbia and Rockefeller Universities New York

Mendelian Inheritance. Jurg Ott Columbia and Rockefeller Universities New York Mendelian Inheritance Jurg Ott Columbia and Rockefeller Universities New York Genes Mendelian Inheritance Gregor Mendel, monk in a monastery in Brünn (now Brno in Czech Republic): Breeding experiments

More information

Bio 312, Spring 2017 Exam 3 ( 1 ) Name:

Bio 312, Spring 2017 Exam 3 ( 1 ) Name: Bio 312, Spring 2017 Exam 3 ( 1 ) Name: Please write the first letter of your last name in the box; 5 points will be deducted if your name is hard to read or the box does not contain the correct letter.

More information

Does Body Mass Index Adequately Capture the Relation of Body Composition and Body Size to Health Outcomes?

Does Body Mass Index Adequately Capture the Relation of Body Composition and Body Size to Health Outcomes? American Journal of Epidemiology Copyright 1998 by The Johns Hopkins University School of Hygiene and Public Health All rights reserved Vol. 147, No. 2 Printed in U.S.A A BRIEF ORIGINAL CONTRIBUTION Does

More information

MBG* Animal Breeding Methods Fall Final Exam

MBG* Animal Breeding Methods Fall Final Exam MBG*4030 - Animal Breeding Methods Fall 2007 - Final Exam 1 Problem Questions Mick Dundee used his financial resources to purchase the Now That s A Croc crocodile farm that had been operating for a number

More information

Nonparametric Linkage Analysis. Nonparametric Linkage Analysis

Nonparametric Linkage Analysis. Nonparametric Linkage Analysis Limitations of Parametric Linkage Analysis We previously discued parametric linkage analysis Genetic model for the disease must be specified: allele frequency parameters and penetrance parameters Lod scores

More information

Selection at one locus with many alleles, fertility selection, and sexual selection

Selection at one locus with many alleles, fertility selection, and sexual selection Selection at one locus with many alleles, fertility selection, and sexual selection Introduction It s easy to extend the Hardy-Weinberg principle to multiple alleles at a single locus. In fact, we already

More information

Transmission Disequilibrium Methods for Family-Based Studies Daniel J. Schaid Technical Report #72 July, 2004

Transmission Disequilibrium Methods for Family-Based Studies Daniel J. Schaid Technical Report #72 July, 2004 Transmission Disequilibrium Methods for Family-Based Studies Daniel J. Schaid Technical Report #72 July, 2004 Correspondence to: Daniel J. Schaid, Ph.D., Harwick 775, Division of Biostatistics Mayo Clinic/Foundation,

More information

MULTIFACTORIAL DISEASES. MG L-10 July 7 th 2014

MULTIFACTORIAL DISEASES. MG L-10 July 7 th 2014 MULTIFACTORIAL DISEASES MG L-10 July 7 th 2014 Genetic Diseases Unifactorial Chromosomal Multifactorial AD Numerical AR Structural X-linked Microdeletions Mitochondrial Spectrum of Alterations in DNA Sequence

More information

Non-parametric methods for linkage analysis

Non-parametric methods for linkage analysis BIOSTT516 Statistical Methods in Genetic Epidemiology utumn 005 Non-parametric methods for linkage analysis To this point, we have discussed model-based linkage analyses. These require one to specify a

More information

Case Studies in Ecology and Evolution

Case Studies in Ecology and Evolution 2 Genetics of Small Populations: the case of the Laysan Finch In 1903, rabbits were introduced to a tiny island in the Hawaiian archipelago called Laysan Island. That island is only 187 ha in size, in

More information

Performing. linkage analysis using MERLIN

Performing. linkage analysis using MERLIN Performing linkage analysis using MERLIN David Duffy Queensland Institute of Medical Research Brisbane, Australia Overview MERLIN and associated programs Error checking Parametric linkage analysis Nonparametric

More information

Cancer develops after somatic mutations overcome the multiple

Cancer develops after somatic mutations overcome the multiple Genetic variation in cancer predisposition: Mutational decay of a robust genetic control network Steven A. Frank* Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697-2525

More information

Inbreeding and Inbreeding Depression

Inbreeding and Inbreeding Depression Inbreeding and Inbreeding Depression Inbreeding is mating among relatives which increases homozygosity Why is Inbreeding a Conservation Concern: Inbreeding may or may not lead to inbreeding depression,

More information

DEFINITIONS: POPULATION: a localized group of individuals belonging to the same species

DEFINITIONS: POPULATION: a localized group of individuals belonging to the same species DEFINITIONS: POPULATION: a localized group of individuals belonging to the same species SPECIES: a group of populations whose individuals have the potential to interbreed and produce fertile offspring

More information

Chapter 2. Linkage Analysis. JenniferH.BarrettandM.DawnTeare. Abstract. 1. Introduction

Chapter 2. Linkage Analysis. JenniferH.BarrettandM.DawnTeare. Abstract. 1. Introduction Chapter 2 Linkage Analysis JenniferH.BarrettandM.DawnTeare Abstract Linkage analysis is used to map genetic loci using observations on relatives. It can be applied to both major gene disorders (parametric

More information

Analysis of single gene effects 1. Quantitative analysis of single gene effects. Gregory Carey, Barbara J. Bowers, Jeanne M.

Analysis of single gene effects 1. Quantitative analysis of single gene effects. Gregory Carey, Barbara J. Bowers, Jeanne M. Analysis of single gene effects 1 Quantitative analysis of single gene effects Gregory Carey, Barbara J. Bowers, Jeanne M. Wehner From the Department of Psychology (GC, JMW) and Institute for Behavioral

More information

INTERACTION BETWEEN NATURAL SELECTION FOR HETEROZYGOTES AND DIRECTIONAL SELECTION

INTERACTION BETWEEN NATURAL SELECTION FOR HETEROZYGOTES AND DIRECTIONAL SELECTION INTERACTION BETWEEN NATURAL SELECTION FOR HETEROZYGOTES AND DIRECTIONAL SELECTION MARGRITH WEHRLI VERGHESE 1228 Kingston Ridge Driue, Cary, N.C. 27511 Manuscript received May 5, 1973 Revised copy received

More information

Genetics and Genomics in Medicine Chapter 8 Questions

Genetics and Genomics in Medicine Chapter 8 Questions Genetics and Genomics in Medicine Chapter 8 Questions Linkage Analysis Question Question 8.1 Affected members of the pedigree above have an autosomal dominant disorder, and cytogenetic analyses using conventional

More information

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 6 Patterns of Inheritance

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 6 Patterns of Inheritance Chapter 6 Patterns of Inheritance Genetics Explains and Predicts Inheritance Patterns Genetics can explain how these poodles look different. Section 10.1 Genetics Explains and Predicts Inheritance Patterns

More information

Section 8.1 Studying inheritance

Section 8.1 Studying inheritance Section 8.1 Studying inheritance Genotype and phenotype Genotype is the genetic constitution of an organism that describes all the alleles that an organism contains The genotype sets the limits to which

More information

Downloaded from:

Downloaded from: Molokhia, M; Fanciulli, M; Petretto, E; Patrick, AL; McKeigue, P; Roberts, AL; Vyse, TJ; Aitman, TJ (2011) FCGR3B copy number variation is associated with systemic lupus erythematosus risk in Afro-Caribbeans.

More information

The Making of the Fittest: Natural Selection in Humans

The Making of the Fittest: Natural Selection in Humans INTRODUCTION MENDELIAN GENETICS, PROBABILITY, PEDIGREES, AND CHI-SQUARE STATISTICS Hemoglobin is a protein found in red blood cells (RBCs) that transports oxygen throughout the body. The hemoglobin protein

More information

Whole-genome detection of disease-associated deletions or excess homozygosity in a case control study of rheumatoid arthritis

Whole-genome detection of disease-associated deletions or excess homozygosity in a case control study of rheumatoid arthritis HMG Advance Access published December 21, 2012 Human Molecular Genetics, 2012 1 13 doi:10.1093/hmg/dds512 Whole-genome detection of disease-associated deletions or excess homozygosity in a case control

More information

Experiment 1. The aim here is to understand the pattern of

Experiment 1. The aim here is to understand the pattern of H A Ranganath and M T Tanuja Drosophila Stock Centre Department of Studies in Zoology University of Mysore Manasagangotri Mysore 570006, India. E-mail:drosrang@bgl.vsnl.net.in hranganath@hotmail.com Part

More information

GENETICS - NOTES-

GENETICS - NOTES- GENETICS - NOTES- Warm Up Exercise Using your previous knowledge of genetics, determine what maternal genotype would most likely yield offspring with such characteristics. Use the genotype that you came

More information

Genome-wide association study identifies variants in TMPRSS6 associated with hemoglobin levels.

Genome-wide association study identifies variants in TMPRSS6 associated with hemoglobin levels. Supplementary Online Material Genome-wide association study identifies variants in TMPRSS6 associated with hemoglobin levels. John C Chambers, Weihua Zhang, Yun Li, Joban Sehmi, Mark N Wass, Delilah Zabaneh,

More information

Imaging Genetics: Heritability, Linkage & Association

Imaging Genetics: Heritability, Linkage & Association Imaging Genetics: Heritability, Linkage & Association David C. Glahn, PhD Olin Neuropsychiatry Research Center & Department of Psychiatry, Yale University July 17, 2011 Memory Activation & APOE ε4 Risk

More information

A Comparison of Sample Size and Power in Case-Only Association Studies of Gene-Environment Interaction

A Comparison of Sample Size and Power in Case-Only Association Studies of Gene-Environment Interaction American Journal of Epidemiology ª The Author 2010. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. This is an Open Access article distributed under

More information

Multifactorial Inheritance

Multifactorial Inheritance S e s s i o n 6 Medical Genetics Multifactorial Inheritance and Population Genetics J a v a d J a m s h i d i F a s a U n i v e r s i t y o f M e d i c a l S c i e n c e s, Novemb e r 2 0 1 7 Multifactorial

More information

Introduction to Genetics and Genomics

Introduction to Genetics and Genomics 2016 Introduction to enetics and enomics 3. ssociation Studies ggibson.gt@gmail.com http://www.cig.gatech.edu Outline eneral overview of association studies Sample results hree steps to WS: primary scan,

More information

An Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018

An Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018 An Introduction to Quantitative Genetics I Heather A Lawson Advanced Genetics Spring2018 Outline What is Quantitative Genetics? Genotypic Values and Genetic Effects Heritability Linkage Disequilibrium

More information

Tutorial on Genome-Wide Association Studies

Tutorial on Genome-Wide Association Studies Tutorial on Genome-Wide Association Studies Assistant Professor Institute for Computational Biology Department of Epidemiology and Biostatistics Case Western Reserve University Acknowledgements Dana Crawford

More information

EPDs and Heterosis - What is the Difference?

EPDs and Heterosis - What is the Difference? EPDs and Heterosis - What is the Difference? By Steven D. Lukefahr KINGSVILLE, Texas: The value of Expected Progeny Differences or EPDs as a genetic tool of selection is widely accepted especially in the

More information

Review Risk for rheumatic disease in relation to ethnicity and admixture Mariam Molokhia and Paul McKeigue

Review Risk for rheumatic disease in relation to ethnicity and admixture Mariam Molokhia and Paul McKeigue Review Risk for rheumatic disease in relation to ethnicity and admixture Mariam Molokhia and Paul McKeigue London School of Hygiene and Tropical Medicine, London, UK Received: 22 December 1999 Revisions

More information

Effects of Inbreeding on Raven Matrices

Effects of Inbreeding on Raven Matrices Behavior Genetics, Vol. 14, No. 6, 1984 Effects of Inbreeding on Raven Matrices Nirupama Agrawal, ~ S. N. Sinha, 1 and Arthur R. Jensen z'3 Received 5 July 1984--Final 15 July 1984 Indian Muslim school

More information

additive genetic component [d] = rded

additive genetic component [d] = rded Heredity (1976), 36 (1), 31-40 EFFECT OF GENE DISPERSION ON ESTIMATES OF COMPONENTS OF GENERATION MEANS AND VARIANCES N. E. M. JAYASEKARA* and J. L. JINKS Department of Genetics, University of Birmingham,

More information

Pedigree Analysis Why do Pedigrees? Goals of Pedigree Analysis Basic Symbols More Symbols Y-Linked Inheritance

Pedigree Analysis Why do Pedigrees? Goals of Pedigree Analysis Basic Symbols More Symbols Y-Linked Inheritance Pedigree Analysis Why do Pedigrees? Punnett squares and chi-square tests work well for organisms that have large numbers of offspring and controlled mating, but humans are quite different: Small families.

More information

Roadmap. Inbreeding How inbred is a population? What are the consequences of inbreeding?

Roadmap. Inbreeding How inbred is a population? What are the consequences of inbreeding? 1 Roadmap Quantitative traits What kinds of variation can selection work on? How much will a population respond to selection? Heritability How can response be restored? Inbreeding How inbred is a population?

More information

Ch 4: Mendel and Modern evolutionary theory

Ch 4: Mendel and Modern evolutionary theory Ch 4: Mendel and Modern evolutionary theory 1 Mendelian principles of inheritance Mendel's principles explain how traits are transmitted from generation to generation Background: eight years breeding pea

More information

Bio 1M: Evolutionary processes

Bio 1M: Evolutionary processes Bio 1M: Evolutionary processes Evolution by natural selection Is something missing from the story I told last chapter? Heritable variation in traits Selection (i.e., differential reproductive success)

More information

Effect of Genetic Heterogeneity and Assortative Mating on Linkage Analysis: A Simulation Study

Effect of Genetic Heterogeneity and Assortative Mating on Linkage Analysis: A Simulation Study Am. J. Hum. Genet. 61:1169 1178, 1997 Effect of Genetic Heterogeneity and Assortative Mating on Linkage Analysis: A Simulation Study Catherine T. Falk The Lindsley F. Kimball Research Institute of The

More information

Association mapping (qualitative) Association scan, quantitative. Office hours Wednesday 3-4pm 304A Stanley Hall. Association scan, qualitative

Association mapping (qualitative) Association scan, quantitative. Office hours Wednesday 3-4pm 304A Stanley Hall. Association scan, qualitative Association mapping (qualitative) Office hours Wednesday 3-4pm 304A Stanley Hall Fig. 11.26 Association scan, qualitative Association scan, quantitative osteoarthritis controls χ 2 test C s G s 141 47

More information

Single Gene (Monogenic) Disorders. Mendelian Inheritance: Definitions. Mendelian Inheritance: Definitions

Single Gene (Monogenic) Disorders. Mendelian Inheritance: Definitions. Mendelian Inheritance: Definitions Single Gene (Monogenic) Disorders Mendelian Inheritance: Definitions A genetic locus is a specific position or location on a chromosome. Frequently, locus is used to refer to a specific gene. Alleles are

More information

Overview of Animal Breeding

Overview of Animal Breeding Overview of Animal Breeding 1 Required Information Successful animal breeding requires 1. the collection and storage of data on individually identified animals; 2. complete pedigree information about the

More information

Flexible Matching in Case-Control Studies of Gene-Environment Interactions

Flexible Matching in Case-Control Studies of Gene-Environment Interactions American Journal of Epidemiology Copyright 2004 by the Johns Hopkins Bloomberg School of Public Health All rights reserved Vol. 59, No. Printed in U.S.A. DOI: 0.093/aje/kwg250 ORIGINAL CONTRIBUTIONS Flexible

More information

Downloaded from Chapter 5 Principles of Inheritance and Variation

Downloaded from  Chapter 5 Principles of Inheritance and Variation Chapter 5 Principles of Inheritance and Variation Genetics: Genetics is a branch of biology which deals with principles of inheritance and its practices. Heredity: It is transmission of traits from one

More information

DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK

DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK CHAPTER 6 DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK Genetic research aimed at the identification of new breast cancer susceptibility genes is at an interesting crossroad. On the one hand, the existence

More information

Interaction of Genes and the Environment

Interaction of Genes and the Environment Some Traits Are Controlled by Two or More Genes! Phenotypes can be discontinuous or continuous Interaction of Genes and the Environment Chapter 5! Discontinuous variation Phenotypes that fall into two

More information

By Mir Mohammed Abbas II PCMB 'A' CHAPTER CONCEPT NOTES

By Mir Mohammed Abbas II PCMB 'A' CHAPTER CONCEPT NOTES Chapter Notes- Genetics By Mir Mohammed Abbas II PCMB 'A' 1 CHAPTER CONCEPT NOTES Relationship between genes and chromosome of diploid organism and the terms used to describe them Know the terms Terms

More information

Genome Scan Meta-Analysis of Schizophrenia and Bipolar Disorder, Part I: Methods and Power Analysis

Genome Scan Meta-Analysis of Schizophrenia and Bipolar Disorder, Part I: Methods and Power Analysis Am. J. Hum. Genet. 73:17 33, 2003 Genome Scan Meta-Analysis of Schizophrenia and Bipolar Disorder, Part I: Methods and Power Analysis Douglas F. Levinson, 1 Matthew D. Levinson, 1 Ricardo Segurado, 2 and

More information

The Inheritance of Fingerprint Patterns

The Inheritance of Fingerprint Patterns Am J Hum Genet 28:280-289, 1976 The Inheritance of Fingerprint Patterns HERMAN M. SLATIS,1 MARIASSA BAT-MIRIAM KATZNELSON,2 AND BATSHEVA BONNE-TAMIR3 Although there have been a few reports on the inheritance

More information

The Chromosomal Basis of Inheritance

The Chromosomal Basis of Inheritance The Chromosomal Basis of Inheritance Factors and Genes Mendel s model of inheritance was based on the idea of factors that were independently assorted and segregated into gametes We now know that these

More information

D. Hilton. Keywords Epidemiological methods, aging, prevalence.

D. Hilton. Keywords Epidemiological methods, aging, prevalence. Computational Methods in Official Statistics with an Example on Calculating and Predicting Diabetes Mellitus [DM] Prevalence in Different Age Groups within Australia in Future Years, in Light of the Aging

More information

Statistical Evaluation of Sibling Relationship

Statistical Evaluation of Sibling Relationship The Korean Communications in Statistics Vol. 14 No. 3, 2007, pp. 541 549 Statistical Evaluation of Sibling Relationship Jae Won Lee 1), Hye-Seung Lee 2), Hyo Jung Lee 3) and Juck-Joon Hwang 4) Abstract

More information

Appendix 1. Data shown in Table 1

Appendix 1. Data shown in Table 1 Age Mean BMI Source Group Status Region Country Population (years) Male Female Source Number Five Cities 25-64 22.7 Settibalija >20 19.87 19.53 South Andhra Pradesh 18-75 21.41 22.34 Wadabalija >20 20.09

More information

Inequalities in health due to ethnicity and social deprivation an analysis of primary care data from one inner city area over a three year period

Inequalities in health due to ethnicity and social deprivation an analysis of primary care data from one inner city area over a three year period Inequalities in health due to ethnicity and social deprivation an analysis of primary care data from one inner city area over a three year period Report to the National Audit Office Peter Schofield, PhD

More information

MENDELIAN GENETICS. MENDEL RULE AND LAWS Please read and make sure you understand the following instructions and knowledge before you go on.

MENDELIAN GENETICS. MENDEL RULE AND LAWS Please read and make sure you understand the following instructions and knowledge before you go on. MENDELIAN GENETICS Objectives Upon completion of this lab, students should: 1. Understand the principles and terms used in Mendelian genetics. 2. Know how to complete a Punnett square to estimate phenotypic

More information

Genome-wide Association Analysis Applied to Asthma-Susceptibility Gene. McCaw, Z., Wu, W., Hsiao, S., McKhann, A., Tracy, S.

Genome-wide Association Analysis Applied to Asthma-Susceptibility Gene. McCaw, Z., Wu, W., Hsiao, S., McKhann, A., Tracy, S. Genome-wide Association Analysis Applied to Asthma-Susceptibility Gene McCaw, Z., Wu, W., Hsiao, S., McKhann, A., Tracy, S. December 17, 2014 1 Introduction Asthma is a chronic respiratory disease affecting

More information

Summary. Introduction. Atypical and Duplicated Samples. Atypical Samples. Noah A. Rosenberg

Summary. Introduction. Atypical and Duplicated Samples. Atypical Samples. Noah A. Rosenberg doi: 10.1111/j.1469-1809.2006.00285.x Standardized Subsets of the HGDP-CEPH Human Genome Diversity Cell Line Panel, Accounting for Atypical and Duplicated Samples and Pairs of Close Relatives Noah A. Rosenberg

More information

New Enhancements: GWAS Workflows with SVS

New Enhancements: GWAS Workflows with SVS New Enhancements: GWAS Workflows with SVS August 9 th, 2017 Gabe Rudy VP Product & Engineering 20 most promising Biotech Technology Providers Top 10 Analytics Solution Providers Hype Cycle for Life sciences

More information

breast cancer; relative risk; risk factor; standard deviation; strength of association

breast cancer; relative risk; risk factor; standard deviation; strength of association American Journal of Epidemiology The Author 2015. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail:

More information

Chapter 4 INSIG2 Polymorphism and BMI in Indian Population

Chapter 4 INSIG2 Polymorphism and BMI in Indian Population Chapter 4 INSIG2 Polymorphism and BMI in Indian Population 4.1 INTRODUCTION Diseases like cardiovascular disorders (CVD) are emerging as major causes of death in India (Ghaffar A et. al., 2004). Various

More information

Mating Systems. 1 Mating According to Index Values. 1.1 Positive Assortative Matings

Mating Systems. 1 Mating According to Index Values. 1.1 Positive Assortative Matings Mating Systems After selecting the males and females that will be used to produce the next generation of animals, the next big decision is which males should be mated to which females. Mating decisions

More information

GENETIC DRIFT & EFFECTIVE POPULATION SIZE

GENETIC DRIFT & EFFECTIVE POPULATION SIZE Instructor: Dr. Martha B. Reiskind AEC 450/550: Conservation Genetics Spring 2018 Lecture Notes for Lectures 3a & b: In the past students have expressed concern about the inbreeding coefficient, so please

More information

The laws of Heredity. Allele: is the copy (or a version) of the gene that control the same characteristics.

The laws of Heredity. Allele: is the copy (or a version) of the gene that control the same characteristics. The laws of Heredity 1. Definition: Heredity: The passing of traits from parents to their offspring by means of the genes from the parents. Gene: Part or portion of a chromosome that carries genetic information

More information

Effects of age-at-diagnosis and duration of diabetes on GADA and IA-2A positivity

Effects of age-at-diagnosis and duration of diabetes on GADA and IA-2A positivity Effects of age-at-diagnosis and duration of diabetes on GADA and IA-2A positivity Duration of diabetes was inversely correlated with age-at-diagnosis (ρ=-0.13). However, as backward stepwise regression

More information

Statistical Tests for X Chromosome Association Study. with Simulations. Jian Wang July 10, 2012

Statistical Tests for X Chromosome Association Study. with Simulations. Jian Wang July 10, 2012 Statistical Tests for X Chromosome Association Study with Simulations Jian Wang July 10, 2012 Statistical Tests Zheng G, et al. 2007. Testing association for markers on the X chromosome. Genetic Epidemiology

More information

The Making of the Fittest: Natural Selection in Humans

The Making of the Fittest: Natural Selection in Humans MENDELIAN GENETICS, PROBABILITY, PEDIGREES, AND CHI-SQUARE STATISTICS INTRODUCTION Hemoglobin is a protein found in red blood cells that transports oxygen throughout the body. The hemoglobin protein consists

More information

Comparison of Linkage-Disequilibrium Methods for Localization of Genes Influencing Quantitative Traits in Humans

Comparison of Linkage-Disequilibrium Methods for Localization of Genes Influencing Quantitative Traits in Humans Am. J. Hum. Genet. 64:1194 105, 1999 Comparison of Linkage-Disequilibrium Methods for Localization of Genes Influencing Quantitative Traits in Humans Grier P. Page and Christopher I. Amos Department of

More information

An Introduction to Quantitative Genetics

An Introduction to Quantitative Genetics An Introduction to Quantitative Genetics Mohammad Keramatipour MD, PhD Keramatipour@tums.ac.ir ac ir 1 Mendel s work Laws of inheritance Basic Concepts Applications Predicting outcome of crosses Phenotype

More information

Strategies for Mapping Heterogeneous Recessive Traits by Allele- Sharing Methods

Strategies for Mapping Heterogeneous Recessive Traits by Allele- Sharing Methods Am. J. Hum. Genet. 60:965-978, 1997 Strategies for Mapping Heterogeneous Recessive Traits by Allele- Sharing Methods Eleanor Feingold' and David 0. Siegmund2 'Department of Biostatistics, Emory University,

More information