GENETIC LINKAGE ANALYSIS

Similar documents
Decomposition of the Genotypic Value

Lecture 6: Linkage analysis in medical genetics

Non-parametric methods for linkage analysis

Ch 8 Practice Questions

Introduction to linkage and family based designs to study the genetic epidemiology of complex traits. Harold Snieder

PopGen4: Assortative mating

Selection at one locus with many alleles, fertility selection, and sexual selection

Dan Koller, Ph.D. Medical and Molecular Genetics

Exam #2 BSC Fall. NAME_Key correct answers in BOLD FORM A

Complex Traits Activity INSTRUCTION MANUAL. ANT 2110 Introduction to Physical Anthropology Professor Julie J. Lesnik

Systems of Mating: Systems of Mating:

Lecture 13: May 24, 2004

Inbreeding and Inbreeding Depression

The laws of Heredity. Allele: is the copy (or a version) of the gene that control the same characteristics.

For more information about how to cite these materials visit

GENETICS - NOTES-

Nonparametric Linkage Analysis. Nonparametric Linkage Analysis

What we mean more precisely is that this gene controls the difference in seed form between the round and wrinkled strains that Mendel worked with

Roadmap. Inbreeding How inbred is a population? What are the consequences of inbreeding?

Pedigree Construction Notes

Population Genetics 4: Assortative mating

Transmission Disequilibrium Methods for Family-Based Studies Daniel J. Schaid Technical Report #72 July, 2004

Ch 10 Genetics Mendelian and Post-Medelian Teacher Version.notebook. October 20, * Trait- a character/gene. self-pollination or crosspollination

Unit 6.2: Mendelian Inheritance

BST227 Introduction to Statistical Genetics. Lecture 4: Introduction to linkage and association analysis

Chapter 10 Notes Patterns of Inheritance, Part 1

LTA Analysis of HapMap Genotype Data

Mendelian Genetics: Patterns of Inheritance

Genetics and Heredity Notes

Fundamentals of Genetics

(b) What is the allele frequency of the b allele in the new merged population on the island?

For a long time, people have observed that offspring look like their parents.

IB BIO I Genetics Test Madden

Transmission Disequilibrium Test in GWAS

Effect of Genetic Heterogeneity and Assortative Mating on Linkage Analysis: A Simulation Study

Mating Systems. 1 Mating According to Index Values. 1.1 Positive Assortative Matings

Downloaded from Chapter 5 Principles of Inheritance and Variation

Imaging Genetics: Heritability, Linkage & Association

12 MENDEL, GENES, AND INHERITANCE

Probability and Punnett Squares

Gregor Mendel and Genetics Worksheets

Mendelian Genetics. KEY CONCEPT Mendel s research showed that traits are inherited as discrete units.

Problem set questions from Final Exam Human Genetics, Nondisjunction, and Cancer

White Paper Estimating Genotype-Specific Incidence for One or Several Loci

Genetics: field of biology that studies heredity, or the passing of traits from parents to offspring Trait: an inherited characteristic, such as eye

Genetics. F 1 results. Shape of the seed round/wrinkled all round 5474 round, 1850 wrinkled 2.96 : 1

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 6 Patterns of Inheritance

Section 8.1 Studying inheritance

Genetics and Genomics in Medicine Chapter 8 Questions

Replacing IBS with IBD: The MLS Method. Biostatistics 666 Lecture 15

Laws of Inheritance. Bởi: OpenStaxCollege

Unit 7 Section 2 and 3

Meiosis and Genetics

Name Period. Keystone Vocabulary: genetics fertilization trait hybrid gene allele Principle of dominance segregation gamete probability

By Mir Mohammed Abbas II PCMB 'A' CHAPTER CONCEPT NOTES

An Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018

Any inbreeding will have similar effect, but slower. Overall, inbreeding modifies H-W by a factor F, the inbreeding coefficient.

Example: Colour in snapdragons

Chapter 15: The Chromosomal Basis of Inheritance

Biology. Chapter 13. Observing Patterns in Inherited Traits. Concepts and Applications 9e Starr Evers Starr. Cengage Learning 2015

Mendel s Methods: Monohybrid Cross

The Chromosomal Basis Of Inheritance

Quantitative Genetics

Mitosis and Meiosis. See Mitosis and Meiosis on the class web page

Chapter 15: The Chromosomal Basis of Inheritance

UNIT 6 GENETICS 12/30/16

Section 1 MENDEL S LEGACY

The vagaries of non-traditional mendelian recessive inheritance in uniparental disomy: AA x Aa = aa!

Parental Effects. Genomic Imprinting

Lab 5: Testing Hypotheses about Patterns of Inheritance

5.5 Genes and patterns of inheritance

BIOLOGY - CLUTCH CH.15 - CHROMOSOMAL THEORY OF INHERITANCE

Genetics Practice Test

Genetic basis of inheritance and variation. Dr. Amjad Mahasneh. Jordan University of Science and Technology

Sexual Reproduction & Inheritance

Patterns of Inheritance

Single Gene (Monogenic) Disorders. Mendelian Inheritance: Definitions. Mendelian Inheritance: Definitions

A gene is a sequence of DNA that resides at a particular site on a chromosome the locus (plural loci). Genetic linkage of genes on a single

Lecture 1 Mendelian Inheritance

Does Mendel s work suggest that this is the only gene in the pea genome that can affect this particular trait?

Mendelian Genetics. Biology 3201 Unit 3

Answers to Questions from old quizzes and exams Problem 1A (i). a (ii) c (iii) a (iv) d

VOCABULARY somatic cell autosome fertilization gamete sex chromosome diploid homologous chromosome sexual reproduction meiosis

Mendel and Heredity. Chapter 12

Genetics Review. Alleles. The Punnett Square. Genotype and Phenotype. Codominance. Incomplete Dominance

WHAT S IN THIS LECTURE?

Incomplete Dominance

This document is a required reading assignment covering chapter 4 in your textbook.

Model of an F 1 and F 2 generation

Genetics of extreme body size evolution in mice from Gough Island

Mendelism: the Basic principles of Inheritance

Bio 312, Spring 2017 Exam 3 ( 1 ) Name:

Keywords. Punnett Square forked line. gene allele dominant recessive character trait phenotype genotype

p and q can be thought of as probabilities of selecting the given alleles by

National Disease Research Interchange Annual Progress Report: 2010 Formula Grant

GENETICS - CLUTCH CH.2 MENDEL'S LAWS OF INHERITANCE.

Complex Multifactorial Genetic Diseases

The Modern Genetics View

Chapter 02 Mendelian Inheritance

Statistical Evaluation of Sibling Relationship

Transcription:

Atlas of Genetics and Cytogenetics in Oncology and Haematology GENETIC LINKAGE ANALYSIS * I- Recombination fraction II- Definition of the "lod score" of a family III- Test for linkage IV- Estimation of the recombination fraction V- Recombination fraction for a disease locus and a marker locus * Investigating the linked segregation of genes situated at different loci is a way of testing the independence of their transmission. This concept of independence is also reflected in the recombination fraction, θ, which is the percentage of the gametes transmitted by the parents to be recombined. If they are transmitted independently, there will be the same number of recombined gametes as there are parental gametes, and so θ = 1/2. If they are not transmitted independently, then the parenteral gametes are transmitted preferentially to the recombined gametes, and 0 θ<1/2. In this case, there is said to be "linkage" between the two loci. I- RECOMBINATION FRACTION Let us consider the caseof two loci, A and B, with two codominant alleles at each of these loci, A 1, A 2 and B 1, B 2 respectively. Such an individual can produce four types of gamete: A 1 B 1 A 2 B 1 Atlas Genet Cytogenet Oncol Haematol 2002; 4-679-

A 1 B 2 A 2 B 2 Two situations are possible: 1- The loci A and B are on different chromosome pairs Figure 1 In this case, the four gametes all have the same probability: 1/4. 2 The loci A and B are on the same chromosome pairs Here we have to distinguish between two possible situations: the alleles A 1 and B 1 may be on the same chromosome within the pair, in which case A 1 and B 1 are said to be "coupled"; or they may be on different chromosomes, in which case A 1 and B 1 are said to be in a state of "repulsion". Figure 2 For instance, let us suppose that A 1 and B 1 are "coupled". Four types of gametes are still produced. Atlas Genet Cytogenet Oncol Haematol 2002; 4-680-

Figure3 Gametes A 1 B 1 and A 2 B 2 are said to be "parental". In the offspring, as in the parents, A 1 is "coupled" with B 1 (and A 2 is "coupled" with B 2). The gametes A 1 B 2 and A 2 B 1 are therefore described as being "recombined". An uneven number of recombination or "crossing-over" phenomena have occurred between the A and B loci. Figure 4 Assuming that the crossing-over event for a pair of chromosomes follows Poisson s law, and knowing that a parental gamete has zero or an even number of crossingsover, whereas a recombined gamete has an odd number, we can show that the frequency of recombined gametes is always equal to or lower than that of the parenteral gametes and so 0 θ < 1/2 If θ = 1/2, then all the gamete types have the same probability and the alleles at the loci A and B loci are transmitted independently. Loci A and B are therefore said not to enhibit genetic linkage. This is the situation if A and B are on different pairs of chromosomes, and also if A and B are one the same pair, but at some distance from each other. However, if θ < 1/2, then the two loci are genetically linked. For a couple of which the genotypes at the A and B are known, the probability of observing the genotypes of the offspring depends on the value of θ. Let us assume the following crossing: Atlas Genet Cytogenet Oncol Haematol 2002; 4-681-

Figure 5 Therefore, such a couple can have 4 types of offspring Figure 6 Assuming that there is gamete equilibrium at the A and B loci, in parent 1 there is a probability of 1/2 that alleles A 1 and B 1 will be coupled, and a probability of 1/2 that they will be in repulsion. (1) A 1 and B 1 are coupled, so the probability that parent (1) provides the gametes A 1 B 1 and A 2 B 2 is (1-θ)/2 and the probability that this parent provides gametes A 1 B 2 and A 2 B 1 is θ/2. The probability that the couple will have child of type (1) or (2) is (1- θ)/2, and that of their having a type (3) or type (4) child is θ/2. The probability of finding n 1 children of type (1), n 2 of type (2), n 3 of type (3) and n 4 of type (4) is therefore [(1- θ)/2] n1+n2 x (θ/2) n3+n4 (2) A 1 and B 1 are in a state of repulsion, so the probability that parent (1) provides the gametes A 1 B 2 and A 2 B 1 is (1-θ)/2 and the probability that this parent provides gametes A 1 B 1 and A 2 B 2 is θ/2. The probability of the previous observation is therefore: (θ/2) n1+n2 x[(1-θ)/2] n3+n4 Atlas Genet Cytogenet Oncol Haematol 2002; 4-682-

So in the end, with no additional information about the A 1 and B 1 phase, and assuming that the alleles at the A and B loci are in a state of coupling equilibrium, the probability of finding n 1, n 2, n 3 and n 4 children in categories (1), (2), (3), (4) is: p(n 1,n 2,n 3,n 4 /θ)=1/2{[(1 -θ)/2] n1+n2 x (θ/2) n3+n4 + (θ/2) n1+n2 x [(1-θ)/2] n3+n4 } So the liklihood of θ for an observation n1, n2, n3, n4 can be written : L(θ/n1,n2,n3,n4)=1/2 {[(1-θ)/2] n1+n2 (θ/2) n3+n4 + (θ/2) n1+n2 [(1-θ)/2] n3+n4 } Special case: number of children n= 1 Regardless of the category to which this child belongs L(θ) = 1/2 [(1-θ)/2] + 1/2 [θ/2] = 1/4 The likelihood of this observation for the family does not depend on θ. We can say that such a family is not informative for θ. Informative families An "informative family" is a family for which the liklihood is a variable function of θ. One essential condition for a family to be informative is, therefore, that it has more than one child. Furthermore, at least one of the parents must be heterozygotic. Definition: if one of the parents is doubly heterozygotic and the other is A double homozygote, we have a backcross A single homozygote, we have a simple backcross A double heterozygote, we have a double intercross II- DEFINITION OF THE "LOD SCORE" OF A FAMILY Take a family of which we know the genotypes at the A and B loci of each of the members. Let L(θ) be the liklihood of a recombination fraction 0 θ < 1/2 L(1/2) be the liklihood of θ = 1/2, that is of independent segregation into A and B. The lod score of the family in θ is: Z(θ) = log 10 [L(θ)/L(1/2)] Z can be taken to be a function of θ defined over the range [0,1/2]. Lod score of a sample of families The liklihood of a value of θ for a sample of independent families is the product of the liklihoods of each family, and so the lod score of the whole sample will be the sum of the lod scores of each family. Atlas Genet Cytogenet Oncol Haematol 2002; 4-683-

III- TEST FOR LlNKAGE Several methods have been proposed to detect linkage: the U scores, the sib pair test, the likelihood ratios, the lod score method. The lod score method is the one most commonly used at present. The test procedure in the lod score method is sequential. Information, i.e. the number of families in the sample, is accumulated until it is possible to decide between the hypotheses H0 and H1 : H0 : genetic independence θ = 1/2 and Hl: linkage of θ 1 0 θ 1 < 1/2 The lod score of the θ 1 sample Z(θ 1 ) = log 10 [L(θ 1 )/L(l/2)] indicates the relative probabilities of finding that the sample is Hl or H0. Thus, a lod score of 3 means that the probability of finding that the sample is Hl is 1000 times greater than of finding that it is H0 ("lod = logarithm of the odds"). The decision thresholds of the test are usually set at -2 and +3, so that if: Z(θ 1 ) 3 H0 is rejected, and linkage is accepted. Z(θ 1 ) -2 linkage of θ 1 is rejected. -2 <Z(θ 1 ) < 3 it is impossible to decide between H0 and Hl. It is necessary to go on accumulating information. For the thresholds chosen, -2 and +3, we can show that: The first degree error, α < 10-3 The second degreee error, β < 10-2 The reliability, 1-ρ > 0.95 θ 1 The power, P(θ) > 0.80 θ 1 if the true value of θ < 0.10 Atlas Genet Cytogenet Oncol Haematol 2002; 4-684-

Figure 7 In fact, what is being tested is not a single value of θ 1 relative to θ = 1/2, but a whole set of values between 0 and 1/2, with a step of various size (0.01 or 0.05). If there is a value of θ 1 such that Z(θ 1 ) 3: linkage is concluded to exist. If there is a value of θ 1 such that Figure 8 Z(θ 1 ) = -2 The linkage is excluded for any θ θ 1 Atlas Genet Cytogenet Oncol Haematol 2002; 4-685-

Figure 9 If θ -2 < Z(θ) < 3, no conclusion can be drawn, the sample is not sufficiently informative. Figure 10 The proposed test has the advantage of being very simple, and of providing protection against falsely concluding linkage. However, some criticisms can be levelled, not only against the criteria chosen, but also against the entire principle of using a sequential procedure. The number of families typed is, indeed, rarely chosen in the light of the test results. Atlas Genet Cytogenet Oncol Haematol 2002; 4-686-

IV- ESTIMATION OF THE RECOMBINATION FRACTION If the test, on a sample of the family, has demonstrated linkage between the A and B loci, then one may want to estimate the recombination fraction for these loci. The estimated value of θ is the value which maximizes the function of the lod score Z, and this is equivalent to taking the value of θ for which the probability of observing linkage in the sample is greatest. V- RECOMBINATION FRACTION FOR A DISEASE LOCUS AND A MARKER LOCUS Let us assume we are dealing with a disease carried by a single gene, determined by an allele, g 0, located at a locus G (g 0 : harmful allele, G 0 : normal allele). We would like to be able to situate locus G relative to a marker locus T, which is known to occupy a given locus on the genome. To do this, we can use families with one or several individuals affected and in which the genotype of each member of the family is known with regard to the marker T. In order to be able to use the lod scores method described above, what is needed Figure 11 is to be able to extrapolate from the phenotype of the individuals (affected, not affected) to their genotype at locus G (or their genotypical probability at locus G). What we need to know is: 1. the frequency, g 0 2. the penetration vector f 1, f 2,f 3 f 1 = proba (/g 0 g 0 ) f 2 = proba (affected /g 0 G 0 ) f 3 = proba (affected /G 0 G 0 ) Atlas Genet Cytogenet Oncol Haematol 2002; 4-687-

It will often happen that the information available for the marker is not also genotypic, but phenotypic in nature. Once again, all possible genotypes must be envisaged. As a general rule, the information available about a family concerns the phenotype. To calculate the likelihood of θ, we must envisage all the possible genotype configurations at each of the loci, for this family, writing the likelihood of θ for each configuration, weighting it by the probability of this configuration, and knowing the phenotypes of individuals in A and B. Knowledge of the genetic parameters at each of the loci (gene frequency, penetration values) is therefore necessary before we can estimate θ. It is obvious that calculating the lod scores, despite being simple in theory, is in fact a lengthy and tedious business; specific software have been designed for linkage analysis. Analysis of gene linkage has made it possible to construct a gene map by locating the new polymorphisms relative to one other on the genome. The measurement used on the gene map is not the recombination fraction, which is not an additive datum, but the gene distance, which we will define below. Contributor(s) Written Citation 05-2002 Françoise Clerget-Darpoux This paper should be referenced as such : Clerget-Darpoux F. Genetic Linkage Analysis. Atlas Genet Cytogenet Oncol Haematol. May 2002. URL : http://atlasgeneticsoncology.org/educ/linkageshortid30031es.html Atlas of Genetics and Cytogenetics in Oncology and Haematology Atlas Genet Cytogenet Oncol Haematol 2002; 4-688-