Analysis of single gene effects 1. Quantitative analysis of single gene effects. Gregory Carey, Barbara J. Bowers, Jeanne M.
|
|
- Alexandrina Hamilton
- 5 years ago
- Views:
Transcription
1 Analysis of single gene effects 1 Quantitative analysis of single gene effects Gregory Carey, Barbara J. Bowers, Jeanne M. Wehner From the Department of Psychology (GC, JMW) and Institute for Behavioral Genetic (GC, BJB, JMW) University of Colorado, Boulder, CO, USA. Supported by Grant 2 M01 RR00051 from the General Clinical Research Center Program of the National Center for Research Resources, National Institute of Health, AA-11275, AA-03627, and R.C.A. AA to JMW.
2 Analysis of single gene effects 2 Abstract Traditionally, the analysis of a quantitative phenotype as a function of a two allele polymorphism at a single locus involves a oneway ANOVA with locus as the ANOVA factor and the three individual genotypes (e.g., aa, Aa, and AA) as the levels of the factor. Here, we propose a related method of analysis that is just as easy to implement as the oneway ANOVA but has the following advantages: (1) greater statistical power; (2) direct estimation of meaningful genetic parameters such as additive genetic variance at the locus and both broad and narrow heritability due to the locus; (3) provides estimates of effect size that also equal meaningful genetic parameters; (4) gives a direct test for dominance that avoids the problems with post hoc ANOVA tests; (5) yields an omnibus F test that is identical to that of the oneway ANOVA. The method is illustrated with data on the protein kinase C g locus in the mouse and behavior on the elevated plus-maze. Power calculations are also provided.
3 Analysis of single gene effects 3 Many designs involving genetics and neuroscience involve the comparison of the three genotypic means for a locus with various mechanisms used to control for (or to estimate) differences in background genetic variation. A simple and valid statistical technique for this type of problem is a oneway ANOVA in which genetic locus is the ANOVA factor and the individual genotypes at the locus are the three levels of the factor. Most planned experiments of this type strive to achieve equal sample sizes for the three cells in the ANOVA. While the resulting orthogonal design has attractive properties, it presents a problem for calculating quantities like narrow-sense heritability for the locus that may be of use to the researcher. Many traditional texts on quantitative genetics deal with panmictic populations and derive their formulas for heritability assuming, say, Hardy-Weinberg equilibrium. When an ANOVA design has equal sample sizes, then such formulas are inappropriate. Here, we present a simple method of analysis for single-gene problems that has several advantages over a oneway ANOVA but is just as easy to implement. The first advantage is that is statistically more powerful than the oneway ANOVA. The second advantage is that it calculates and tests the significance of quantitative parameters such as broad and narrow sense heritability due to the locus. (Note well that in this paper, the term heritability applies only to the variance explained by the locus under question and not to all polymorphic loci that influence the trait). A third advantage is that results can be presented in terms of effect sizes instead of p values, a trend that is recommended in contemporary statistics (Cohen, 1988). A fourth advantage is that, in addition to the information given above, it also provides an F test identical to that of a oneway ANOVA.
4 Analysis of single gene effects 4 A fifth advantage is that there is no need for post hoc tests to assess for the presence of dominant gene action. Analytical proof of these statements (and other assertions to follow) is provided by a technical document at and in some standard statistical texts such as Judd and McClelland (1989). The model also applies to observational data gathered on organisms in their natural habitat where genotypic frequencies are not equal (see the above website for the general quantitative equations). In this paper, however, we deal with the specific case of equal genotypic frequencies. Mathematical Model The model used here assumes two alleles per locus (which we designate as A and a), giving three genotypes AA, Aa, and aa. The overall notation is similar to that used in Falconer and Mackay (1996) and is presented in Table 1. [Insert Table 1 about here] Here m is the midpoint between the two homozygotes. The quantity a is the additive genetic effect and has the following interpretation: if we were to substitute allele A for allele a in a genotype, then we expect, on average, a phenotypic change of a units. The quantity d is the parameter for dominance. It measures the extent to which the mean of the heterozygote Aa deviates from the average of the two homozygotes. When d = 0, there is complete additivity.
5 Analysis of single gene effects 5 Data Arrangement The arrangement of data for the analysis is illustrated in Table 2. In addition to a column for genotype and one for phenotypic scores (the values of which are fictitious in this table), two new quantitative variables are created. The first of these is called alpha in Table 2 and it is used to obtain an estimate of the additive effect of an allelic substitution and also an estimate of narrow-sense (i.e., additive) heritability. The rule for constructing variable alpha is simple. Alpha equals 1 for one homozygote, equals 1 for the other homozygote, and equals 0 for the heterozygote. (Hint: let alpha equal 1 for the homozygote with the lower mean.) The second variable is called delta and it is used to assess the presence of dominance. If the genotype is a heterozygote, then the value of delta is 1; otherwise, delta equals 0. [Insert Table 2 about here] Sequence of Analysis After variables alpha and delta are constructed, two regressions are performed. The first regression, referred to by some authors as the compact model (Judd & McClelland, 1989), uses the phenotypic score as the dependent variable and variable alpha as the independent variable. The second, termed the augmented model, uses both variables alpha and delta as the independent variables. Interpretation of the Output The squared multiple correlation (R 2 ) from the first regression is an estimate of the narrow sense heritability or the proportion of additive genetic variance to phenotypic
6 Analysis of single gene effects 6 variance for the locus. Multiplying this R 2 by the phenotypic variance gives the additive genetic variance that this locus contributes to the trait. If there is no dominance (discussed below), then the regression coefficient for the variable alpha is a direct estimate of the additive effect of an allelic substitution (i.e., the quantity a in Table 1). Also, the test of significant for this coefficient is always the most powerful statistical test for genetic effects provided dominance is not strong. The regression for the augmented model tests for dominance. If we reject the null hypothesis of no dominance, then this regression gives additional important quantities; otherwise, we return to the first regression and present and interpret those results. The intercept from this multiple regression model equals the midpoint between the two homozygotes (i.e., the quantity m in Table 1). The regression coefficient for variable alpha equals the additive effect of an allelic substitution (i.e., the quantity a in Table 1). The regression coefficient for variable delta equals the deviation of the heterozygote mean from the midpoint (i.e., the quantity d in Table 1). The F statistic from augmented model is an omnibus F that assesses the fit of the whole model. It, along with its p value, will be identical to the F (and that F s p value) from a oneway ANOVA. For the types of sample sizes available for neuroscience research, the t test for the regression coefficient of variable alpha in either the first or second regression is almost always a more powerful statistic for testing the presence of genetic effects at the locus than the omnibus F. For those few cases in which the F test is more powerful (complete dominance, large additive effect, and large sample sizes), the maximal difference in power is about.03. However, in the rest of the parameter space, the difference in power favoring the t statistic can be appreciable up to.20.
7 Analysis of single gene effects 7 The appropriate test for dominance is the significance of the t statistic for the regression coefficient of variable delta. The p level for this t statistic will be identical to that of an F statistic that tests whether adding variable delta significantly increased R 2 over the first regression. This test for dominance will always be less powerful than the t test for variable alpha s regression coefficient. The squared multiple correlation from the second regression is an estimate of broad sense heritability or the proportion of phenotypic variance attributable to total genetic variance (additive plus dominance variance) at the gene. Thus, the proportion of phenotype variance attributable to dominance variance can be calculated by subtracting the R 2 from this regression from the R 2 of the first regression. Multiplying this quantity by the phenotypic standard deviation gives dominance variance in raw score units. A numerical example As an example, we analyze data collected and reported by Bowers et al. (2000) on behavior on an elevated plus-maze for mice lacking the gene for the g isoform of protein kinase C (PKCg knockouts) and their heterozygous and wild-type littermates. Two phenotypes are analyzed, both derived from a principal components analysis of the original variables presented in Table 1 of Bowers et al. (2000). These factors agree almost perfectly with those reported by Rodgers & Johnson (1995) using a different population of mice. The first phenotype is activity (marked by the total number of entrances, and entrances into the closed arms of the maze). The second phenotype is anxiety. Here, high scores are marked by a high percentage of time and of entrances into the closed arm while low scores are indexed by a high percentage of time and entrances
8 Analysis of single gene effects 8 into the open arms. Descriptive statistics for these two phenotypes are presented in Table 3. [Insert Table 3 about here] The values of alpha were assigned so that the PKCg knock out mice were given the value of 1, heterozygotes a value of 0, and the wild type homozygotes, a value of 1. Code from the Statistical Analysis System (SAS) to perform the regressions for the the anxiety phenotype is given in Appendix 1. The activity phenotype illustrates how the method operates for a system with only additive gene action. The results from the first regression, presented in Table 4, should be used to interpret whether or not the PKCg locus has an effect on activity. Here, one could interpret either the F statistic from the ANOVA table or the t statistic testing whether the parameter estimate for variable alpha is significantly different from 0. (Both statistics are equivalent because with only one independent variable the F statistic is the square of the t statistic and both will have identical p values.) Here, t = (p =.037), so we reject the null hypothesis of no genetic effect and conclude that the PKCg gene has an influence on overall activity in the elevated plus-maze. The value of the coefficient for variable alpha (i.e., our estimate of a) is -.42 indicating that, on average, a substitution of one wild type allele for null (i.e., knock out) allele reduces activity by.42 units. Here, the estimate of a may be viewed as an effect size expressed in the metric of the original data. This effect size may be standardized in one of two ways. First, the estimate of a may be divided by the error standard deviation (i.e., the square root of the error mean square). This gives a measure of effect size favored by statisticians such as Cohen
9 Analysis of single gene effects 9 (1988). In the present case, this gives -.42/.90 = Hence, the average effect of an allelic substitution is to change activity by.44 standard deviation units. The second way to standardize is to divide a by the phenotypic standard deviation. Because we used scores from a principal components analysis, the phenotypic standard deviations are 1.0, leaving a unchanged. A second way of expressing effect size is in terms of the proportion of variance explained. The statistic here is R 2, the squared multiple correlation that will be calculated and printed in any output. It just so happens that this quantity also equals narrow-sense heritability or the ratio of additive genetic variance to phenotypic variance. For this regression, the R 2 is.12, implying that 12% of phenotypic variance is attributable to additive genetic variance. [Insert Table 4 about here] The next step is to test for dominance by regressing the activity phenotype on both variables alpha and delta. Results are presented in Table 5. The critical statistic is the t statistic that tests whether the parameter estimate for delta is significantly difference from 0. Here, the value of t is.08 and its associated p value is.937. Hence, there is no evidence for dominance on activity, and we would return and interpret the first regression as the best model to explain the data. [Insert Table 5 about here] The results on activity also illustrate how the present method increases power to detect genetic effects over a oneway ANOVA. The ANOVA table from a oneway ANOVA is identical to that presented in Table 5. Here, the F value is 2.28 and its p value is.118, so one would not reject the null hypothesis. The typical conclusion would
10 Analysis of single gene effects 10 be that there is no evidence that the PKCg locus influences activity in the elevated plusmaze. On the other hand, we have seen that testing whether the coefficient for alpha differs from 0 results in a statistically significant finding. In summary, there is good evidence that the PKCg locus influences activity. All gene action appears to be additive and the estimate of both narrow and broad-sense heritability for the locus is.12. Whether an effect size of this magnitude is something that is worthwhile pursuing is, of course, a matter that should be determined by the substance of the problem and not the statistics. The anxiety phenotype is used to illustrate the method when dominance is important. The results of regressing anxiety on variable alpha are presented in Table 6. Here, the t statistic testing whether the coefficient for alpha equals is 3.09 (p =.004), so we conclude that there is evidence that the PKCg locus also influences anxiety. The R 2 from this regression (.22) equals the estimate of narrow sense heritability for anxiety. [Insert Table 6 about here] The results of the second regression are given in Table 7. The t statistic for the coefficient for variable delta equals 3.25 (p =.003), so we reject the null hypothesis of no dominance. Because there is evidence for dominance, we favor the parameter estimate for alpha using this regression instead of that from the first regression. (The two estimates are the same in the present case because there are equal sample sizes for the genotypes; when sample sizes differ, the estimates of alpha may be different in the two regressions). Here, the value of the coefficient for variable alpha is.56, implying that substituting one wild-type allele for a null PKCg allele has the average effect of increasing anxiety by.56 units. The value of the coefficient for variable delta (i.e., our
11 Analysis of single gene effects 11 estimate of d) is.91. This is larger than the value for a, so we might suspect heterosis. Let us postpone discussion of this topic to focus on interpretation of heritability. The R 2 for this second regression is.41. This is our estimate of broad sense heritability for anxiety. In short, variation in the PKCg locus accounts for about 41% of the variability in this anxiety measure in this population of mice. Because the R 2 from the first regression is an estimate of narrow-sense heritability, the contribution of dominance to broad sense heritability may be found by subtracting the R 2 from the first regression from that in the second regression. This gives =.19. Should we interpret the large estimate of d as overdominance? Certainly the mean of the heterozygote is consistent with this possibility. Most but not all modern regression software allows for a direct test of this hypothesis. When there is heterosis, then the value for d should be significantly greater than the value of a (or significantly less than the value of a, depending on which allele is dominant). One can test for this by constraining the regression coefficients for variables alpha and delta to be equal and then testing the significance of this model against the second regression given above. One simple SAS statement is sufficient for this test (see Appendix 1). For the present case, the test is not significant (F(1,33) = 1.14, p =.29). Hence, the value of d is not significantly greater than that for a, and there is no evidence for overdominance. Power Calculations of power are very easy when the number of animals in each cell is equal. Because the most powerful test is the t statistic for the regression coefficient of
12 Analysis of single gene effects 12 variable alpha, we base power on that statistic used in the augmented model. The numerator for the t is the hypothesized value of the regression coefficient for variable alpha (i.e., the quantity a in the model in Table 1). The denominator is the standard error of the regression coefficient and equals 2 1.5s error N, 2 where s error is the error variance and N is the total number of animals. The degrees of freedom for this test equal (N 2). With a little algebra, the noncentrality parameter for the t may be written as t nc = a s error N 1.5. Note that the a in this equation is the parameter on Table 1 and not the probability of a type I error. Note also the quantity a divided by the error standard deviation is the Cohen s standardized effect size. Power may then be calculated by finding the area under this noncentral t distribution that exceeds the critical value of t. A program for the calculation of power is available at Table 8 gives estimates of power as a function of Cohen s (1988) effect size and sample sizes. The calculations were made assuming complete additivity. Dominance will increase power because error variance will be reduced, but increase is so slight that the figures in Table 8 are close approximations, even to the case of complete dominance. The calculations suggest that 12 animals per group will result in power of.80 or better as long as the effect size is moderately large (i.e.,.60 or greater). [Insert Table 8 about here]
13 Analysis of single gene effects 13 Discussion The method advocated in this paper provides greater statistical power and more informative quantitative estimates than traditional statistical techniques for analyzing single gene effects. It provides two different ways to communicate effect sizes, one expressed in mean change units (i.e., the values of a and d) and the second in terms of variance components or heritabilities (R 2 ) at the locus. The major liability with the technique does not lie in the mathematics. Instead, it resides in the incorrect interpretation and extrapolation of the quantitative estimates. In experimental organisms, heritability estimates for a single gene are usually estimated within the context of a limited range of genetic backgrounds and in controlled environments. These estimates should never be extrapolated to general populations or treated and interpreted as heritability estimates derived from, say, human twin or adoption studies. For example, the mice used for the data examples give involved the genetic background of C57BL/6J and 129/SvEvTac inbreds and were all reared and tested in environmentally standardized conditions. The estimates of effect size and heritability pertain only to this genetic background and similar environments. These estimates can be very useful in guiding which of several future experiments to pursue, in calculating power for other studies, and perhaps even in pointing out a suitable candidate locus for other designs. They do not, however, imply that the PKCg locus must be involved in individual differences in anxiety in wild mouse populations. Finally, we offer a caveat. All statements made in this paper about power are based on equal genotypic frequencies. They will be also apply to the case where cell size
14 Analysis of single gene effects 14 differs, but only by a few animals. The results on power should not be extrapolated to natural populations in whom allele and genotypic frequencies can markedly differ.
15 Analysis of single gene effects 15 References Bowers,B.J., Collins,A.C., Tritto,T., & Wehner, J.M. (2000). Mice lacking PKCg exhibit decreased anxiety. Behavior Genetics 30: Cohen, J. (1988). Statistical power analysis for the behavioral sciences. Hillsdale NJ: L. Erlbaum. Falconer,D.S. & Mackay, T.F.C. (1996). Introduction to quantitative genetics, 4 th edition. Essex, England: Longman. Judd,C.M. & McClelland,G.H. (1989) Data analysis: A model-comparison approach. San Diego: Harcourt Brace Jovanovich. Rodgers, R.J. & Johnson, N.J.T. (1995) Factor analysis of spatiotemporal and ethological measures in the murine elevated plus-maze test of anxiety. Pharmacology Biochemistry and Behavior 52:
16 Analysis of single gene effects 16 Table 1. Notation for the single-gene model. Expected Variance Genotype Frequency Mean within Genotype aa f aa m a s Aa f Aa m + d s AA f AA m + a s
17 Analysis of single gene effects 17 Table 2. Example of a data set arranged for single-gene analysis. Phenotype Genotype alpha delta 8.3 AA Aa aa aa Aa AA 1 0
18 Analysis of single gene effects 18 Table 3. Means and standard deviations for activity and anxiety measures on an elevated plus-maze for mice lacking the gene for PKCg (knock outs) and their heterozygous and wild-type littermates. Activity Anxiety Genotype N Mean St. Dev. Mean St. Dev. Knock Out Heterozygote Wild Type
19 Analysis of single gene effects 19 Table 4. Results on a activity from the regression model with independent variable alpha. Sum of Mean Source df Squares Squares F p Model Error Total Standard Parameter Value Error t p alpha
20 Analysis of single gene effects 20 Table 5. Results on activity from the regression model with independent variables alpha and delta. Sum of Mean Source df Squares Squares F p Model Error Total Standard Parameter Value Error t p alpha delta
21 Analysis of single gene effects 21 Table 6. Results on anxiety from the regression using independent variable alpha. Sum of Mean Source df Squares Squares F p Model Error Total Standard Parameter Value Error t p alpha
22 Analysis of single gene effects 22 Table 7. Results on anxiety from the regression using independent variables alpha and delta. Sum of Mean Source df Squares Squares F p Model Error Total Standard Parameter Value Error t p alpha delta
23 Analysis of single gene effects 23 Table 8. Power for single-gene analysis using the t test for the regression coefficient of variable alpha in the full model: No dominance. Effect Size = a/s error Total N
24 Analysis of single gene effects 24 Appendix 1. SAS code for performing the regression analyses. Code for variable genotype: 1 = PKCg null mutant homozygotes; 2 = heterozygotes; 3 = wild-type homozygotes. A copy of this program is available at data plusmaze; input genotype activity anxiety; if genotype=1 then alpha = -1; else if genotype=2 then alpha = 0; else alpha = 1; delta = 0; if genotype=2 then delta = 1; datalines;
25 Analysis of single gene effects
26 Analysis of single gene effects run; title Bowers et al (2000) data on PKC-gamma and anxiety; proc reg data=plusmaze; var anxiety alpha delta; title2 first model; additive: model anxiety = alpha; run; title2 second model; total: model anxiety = alpha delta; run; title3 test for overdominance; test alpha = delta; run; quit;
The Association Design and a Continuous Phenotype
PSYC 5102: Association Design & Continuous Phenotypes (4/4/07) 1 The Association Design and a Continuous Phenotype The purpose of this note is to demonstrate how to perform a population-based association
More informationRoadmap. Inbreeding How inbred is a population? What are the consequences of inbreeding?
1 Roadmap Quantitative traits What kinds of variation can selection work on? How much will a population respond to selection? Heritability How can response be restored? Inbreeding How inbred is a population?
More informationDecomposition of the Genotypic Value
Decomposition of the Genotypic Value 1 / 17 Partitioning of Phenotypic Values We introduced the general model of Y = G + E in the first lecture, where Y is the phenotypic value, G is the genotypic value,
More informationANOVA. Thomas Elliott. January 29, 2013
ANOVA Thomas Elliott January 29, 2013 ANOVA stands for analysis of variance and is one of the basic statistical tests we can use to find relationships between two or more variables. ANOVA compares the
More informationOne-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels;
1 One-Way ANOVAs We have already discussed the t-test. The t-test is used for comparing the means of two groups to determine if there is a statistically significant difference between them. The t-test
More informationIntroduction to Quantitative Genetics
Introduction to Quantitative Genetics 1 / 17 Historical Background Quantitative genetics is the study of continuous or quantitative traits and their underlying mechanisms. The main principals of quantitative
More informationSelection at one locus with many alleles, fertility selection, and sexual selection
Selection at one locus with many alleles, fertility selection, and sexual selection Introduction It s easy to extend the Hardy-Weinberg principle to multiple alleles at a single locus. In fact, we already
More information(b) What is the allele frequency of the b allele in the new merged population on the island?
2005 7.03 Problem Set 6 KEY Due before 5 PM on WEDNESDAY, November 23, 2005. Turn answers in to the box outside of 68-120. PLEASE WRITE YOUR ANSWERS ON THIS PRINTOUT. 1. Two populations (Population One
More informationLab 5: Testing Hypotheses about Patterns of Inheritance
Lab 5: Testing Hypotheses about Patterns of Inheritance How do we talk about genetic information? Each cell in living organisms contains DNA. DNA is made of nucleotide subunits arranged in very long strands.
More informationLessons in biostatistics
Lessons in biostatistics The test of independence Mary L. McHugh Department of Nursing, School of Health and Human Services, National University, Aero Court, San Diego, California, USA Corresponding author:
More informationCHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to
CHAPTER - 6 STATISTICAL ANALYSIS 6.1 Introduction This chapter discusses inferential statistics, which use sample data to make decisions or inferences about population. Populations are group of interest
More informationAnalysis of Variance (ANOVA)
Research Methods and Ethics in Psychology Week 4 Analysis of Variance (ANOVA) One Way Independent Groups ANOVA Brief revision of some important concepts To introduce the concept of familywise error rate.
More informationAn Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018
An Introduction to Quantitative Genetics I Heather A Lawson Advanced Genetics Spring2018 Outline What is Quantitative Genetics? Genotypic Values and Genetic Effects Heritability Linkage Disequilibrium
More informationGeorgina Salas. Topics EDCI Intro to Research Dr. A.J. Herrera
Homework assignment topics 51-63 Georgina Salas Topics 51-63 EDCI Intro to Research 6300.62 Dr. A.J. Herrera Topic 51 1. Which average is usually reported when the standard deviation is reported? The mean
More information3 CONCEPTUAL FOUNDATIONS OF STATISTICS
3 CONCEPTUAL FOUNDATIONS OF STATISTICS In this chapter, we examine the conceptual foundations of statistics. The goal is to give you an appreciation and conceptual understanding of some basic statistical
More informationTesting Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results.
10 Learning Objectives Testing Means After reading this chapter, you should be able to: Related-Samples t Test With Confidence Intervals 1. Describe two types of research designs used when we select related
More informationYour DNA extractions! 10 kb
Your DNA extractions! 10 kb Quantitative characters: polygenes and environment Most ecologically important quantitative traits (QTs) vary. Distributions are often unimodal and approximately normal. Offspring
More informationBio 1M: Evolutionary processes
Bio 1M: Evolutionary processes Evolution by natural selection Is something missing from the story I told last chapter? Heritable variation in traits Selection (i.e., differential reproductive success)
More informationHERITABILITY INTRODUCTION. Objectives
36 HERITABILITY In collaboration with Mary Puterbaugh and Larry Lawson Objectives Understand the concept of heritability. Differentiate between broad-sense heritability and narrowsense heritability. Learn
More informationHARDY- WEINBERG PRACTICE PROBLEMS
HARDY- WEINBERG PRACTICE PROBLEMS PROBLEMS TO SOLVE: 1. The proportion of homozygous recessives of a certain population is 0.09. If we assume that the gene pool is large and at equilibrium and all genotypes
More informationBusiness Statistics Probability
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationHERITABILITY AND ITS GENETIC WORTH FOR PLANT BREEDING
HERITABILITY AND ITS GENETIC WORTH FOR PLANT BREEDING Author: Prasanta Kumar Majhi M. Sc. (Agri.), Junior Research Scholar, Department of Genetics and Plant Breeding, College of Agriculture, UAS, Dharwad,
More informationConfidence Intervals On Subsets May Be Misleading
Journal of Modern Applied Statistical Methods Volume 3 Issue 2 Article 2 11-1-2004 Confidence Intervals On Subsets May Be Misleading Juliet Popper Shaffer University of California, Berkeley, shaffer@stat.berkeley.edu
More informationEvolution II.2 Answers.
Evolution II.2 Answers. 1. (4 pts) Contrast the predictions of blending inheritance for F1 and F2 generations with those observed under Mendelian inheritance. Blending inheritance predicts both F1 and
More informationMULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES
24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter
More informationDiscontinuous Traits. Chapter 22. Quantitative Traits. Types of Quantitative Traits. Few, distinct phenotypes. Also called discrete characters
Discontinuous Traits Few, distinct phenotypes Chapter 22 Also called discrete characters Quantitative Genetics Examples: Pea shape, eye color in Drosophila, Flower color Quantitative Traits Phenotype is
More informationSystems of Mating: Systems of Mating:
8/29/2 Systems of Mating: the rules by which pairs of gametes are chosen from the local gene pool to be united in a zygote with respect to a particular locus or genetic system. Systems of Mating: A deme
More informationInvestigating the robustness of the nonparametric Levene test with more than two groups
Psicológica (2014), 35, 361-383. Investigating the robustness of the nonparametric Levene test with more than two groups David W. Nordstokke * and S. Mitchell Colp University of Calgary, Canada Testing
More informationFor more information about how to cite these materials visit
Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/
More informationBehavioral genetics: The study of differences
University of Lethbridge Research Repository OPUS Faculty Research and Publications http://opus.uleth.ca Lalumière, Martin 2005 Behavioral genetics: The study of differences Lalumière, Martin L. Department
More informationCh. 23 The Evolution of Populations
Ch. 23 The Evolution of Populations 1 Essential question: Do populations evolve? 2 Mutation and Sexual reproduction produce genetic variation that makes evolution possible What is the smallest unit of
More informationStatistical Tests for X Chromosome Association Study. with Simulations. Jian Wang July 10, 2012
Statistical Tests for X Chromosome Association Study with Simulations Jian Wang July 10, 2012 Statistical Tests Zheng G, et al. 2007. Testing association for markers on the X chromosome. Genetic Epidemiology
More informationadditive genetic component [d] = rded
Heredity (1976), 36 (1), 31-40 EFFECT OF GENE DISPERSION ON ESTIMATES OF COMPONENTS OF GENERATION MEANS AND VARIANCES N. E. M. JAYASEKARA* and J. L. JINKS Department of Genetics, University of Birmingham,
More informationAPPENDIX N. Summary Statistics: The "Big 5" Statistical Tools for School Counselors
APPENDIX N Summary Statistics: The "Big 5" Statistical Tools for School Counselors This appendix describes five basic statistical tools school counselors may use in conducting results based evaluation.
More informationHZAU MULTIVARIATE HOMEWORK #2 MULTIPLE AND STEPWISE LINEAR REGRESSION
HZAU MULTIVARIATE HOMEWORK #2 MULTIPLE AND STEPWISE LINEAR REGRESSION Using the malt quality dataset on the class s Web page: 1. Determine the simple linear correlation of extract with the remaining variables.
More informationINTENDED LEARNING OUTCOMES
FACTORIAL ANOVA INTENDED LEARNING OUTCOMES Revise factorial ANOVA (from our last lecture) Discuss degrees of freedom in factorial ANOVA Recognise main effects and interactions Discuss simple effects QUICK
More informationCHAPTER ONE CORRELATION
CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to
More informationChapter 19: Quantitative II Estimation & Testing
2000, Gregory Carey Chapter 19: Quantitative II - 1 Chapter 19: Quantitative II Estimation & Testing Introduction The previous chapter presented quantitative genetics from a conceptual view. We learned
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationMENDELIAN GENETICS. MENDEL RULE AND LAWS Please read and make sure you understand the following instructions and knowledge before you go on.
MENDELIAN GENETICS Objectives Upon completion of this lab, students should: 1. Understand the principles and terms used in Mendelian genetics. 2. Know how to complete a Punnett square to estimate phenotypic
More informationINTERACTION BETWEEN NATURAL SELECTION FOR HETEROZYGOTES AND DIRECTIONAL SELECTION
INTERACTION BETWEEN NATURAL SELECTION FOR HETEROZYGOTES AND DIRECTIONAL SELECTION MARGRITH WEHRLI VERGHESE 1228 Kingston Ridge Driue, Cary, N.C. 27511 Manuscript received May 5, 1973 Revised copy received
More informationLecture 7: Introduction to Selection. September 14, 2012
Lecture 7: Introduction to Selection September 14, 2012 Announcements Schedule of open computer lab hours on lab website No office hours for me week. Feel free to make an appointment for M-W. Guest lecture
More informationMendelian Genetics: Patterns of Inheritance
Mendelian Genetics: Patterns of Inheritance A Bit on Gregor Mendel Born to a poor farming family in what is now part of Czech Republic Attended Augustinian monastery (1843) Became an excellent teacher
More informationMating Systems. 1 Mating According to Index Values. 1.1 Positive Assortative Matings
Mating Systems After selecting the males and females that will be used to produce the next generation of animals, the next big decision is which males should be mated to which females. Mating decisions
More informationPedigree Analysis Why do Pedigrees? Goals of Pedigree Analysis Basic Symbols More Symbols Y-Linked Inheritance
Pedigree Analysis Why do Pedigrees? Punnett squares and chi-square tests work well for organisms that have large numbers of offspring and controlled mating, but humans are quite different: Small families.
More informationStill important ideas
Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement
More informationBio 312, Spring 2017 Exam 3 ( 1 ) Name:
Bio 312, Spring 2017 Exam 3 ( 1 ) Name: Please write the first letter of your last name in the box; 5 points will be deducted if your name is hard to read or the box does not contain the correct letter.
More informationCHAPTER 3 DATA ANALYSIS: DESCRIBING DATA
Data Analysis: Describing Data CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA In the analysis process, the researcher tries to evaluate the data collected both from written documents and from other sources such
More informationA SAS Macro to Investigate Statistical Power in Meta-analysis Jin Liu, Fan Pan University of South Carolina Columbia
Paper 109 A SAS Macro to Investigate Statistical Power in Meta-analysis Jin Liu, Fan Pan University of South Carolina Columbia ABSTRACT Meta-analysis is a quantitative review method, which synthesizes
More informationPopulation Genetics Simulation Lab
Name Period Assignment # Pre-lab: annotate each paragraph Population Genetics Simulation Lab Evolution occurs in populations of organisms and involves variation in the population, heredity, and differential
More informationModel of an F 1 and F 2 generation
Mendelian Genetics Casual observation of a population of organisms (e.g. cats) will show variation in many visible characteristics (e.g. color of fur). While members of a species will have the same number
More informationChapter 9. Factorial ANOVA with Two Between-Group Factors 10/22/ Factorial ANOVA with Two Between-Group Factors
Chapter 9 Factorial ANOVA with Two Between-Group Factors 10/22/2001 1 Factorial ANOVA with Two Between-Group Factors Recall that in one-way ANOVA we study the relation between one criterion variable and
More informationAn Introduction to Quantitative Genetics
An Introduction to Quantitative Genetics Mohammad Keramatipour MD, PhD Keramatipour@tums.ac.ir ac ir 1 Mendel s work Laws of inheritance Basic Concepts Applications Predicting outcome of crosses Phenotype
More information8/28/2017. If the experiment is successful, then the model will explain more variance than it can t SS M will be greater than SS R
PSY 5101: Advanced Statistics for Psychological and Behavioral Research 1 If the ANOVA is significant, then it means that there is some difference, somewhere but it does not tell you which means are different
More informationChapter 12: Introduction to Analysis of Variance
Chapter 12: Introduction to Analysis of Variance of Variance Chapter 12 presents the general logic and basic formulas for the hypothesis testing procedure known as analysis of variance (ANOVA). The purpose
More informationWhen bad things happen to good genes: mutation vs. selection
When bad things happen to good genes: mutation vs. selection Selection tends to increase the frequencies of alleles with higher marginal fitnesses. Does this mean that genes are perfect? No, mutation can
More informationThe Making of the Fittest: Natural Selection in Humans
INTRODUCTION MENDELIAN GENETICS, PROBABILITY, PEDIGREES, AND CHI-SQUARE STATISTICS Hemoglobin is a protein found in red blood cells (RBCs) that transports oxygen throughout the body. The hemoglobin protein
More informationThe Making of the Fittest: Natural Selection in Humans
MENDELIAN GENETICS, PROBABILITY, PEDIGREES, AND CHI-SQUARE STATISTICS INTRODUCTION Hemoglobin is a protein found in red blood cells that transports oxygen throughout the body. The hemoglobin protein consists
More informationSection 3.2 Least-Squares Regression
Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.
More informationUse the above variables and any you might need to construct to specify the MODEL A/C comparisons you would use to ask the following questions.
Fall, 2002 Grad Stats Final Exam There are four questions on this exam, A through D, and each question has multiple sub-questions. Unless otherwise indicated, each sub-question is worth 3 points. Question
More informationWhite Paper Estimating Genotype-Specific Incidence for One or Several Loci
White Paper 23-01 Estimating Genotype-Specific Incidence for One or Several Loci Authors: Mike Macpherson Brian Naughton Andro Hsu Joanna Mountain Created: September 5, 2007 Last Edited: November 18, 2007
More informationReadings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F
Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationTwo-Way Independent Samples ANOVA with SPSS
Two-Way Independent Samples ANOVA with SPSS Obtain the file ANOVA.SAV from my SPSS Data page. The data are those that appear in Table 17-3 of Howell s Fundamental statistics for the behavioral sciences
More informationLecture 5 Inbreeding and Crossbreeding. Inbreeding
Lecture 5 Inbreeding and Crossbreeding Bruce Walsh lecture notes Introduction to Quantitative Genetics SISG, Seattle 16 18 July 018 1 Inbreeding Inbreeding = mating of related individuals Often results
More informationAnswers to Questions from old quizzes and exams Problem 1A (i). a (ii) c (iii) a (iv) d
BIOLOGY 321 SPRING 2013 ANSWERS TO ASSIGNMENT SET #2 Answers to text questions: Chapter 2 http://fire.biol.wwu.edu/trent/trent/iga_10e_sm_chapter_02.pdf Chapter 3 http://fire.biol.wwu.edu/trent/trent/iga_10e_sm_chapter_03.pdf
More informationPopGen4: Assortative mating
opgen4: Assortative mating Introduction Although random mating is the most important system of mating in many natural populations, non-random mating can also be an important mating system in some populations.
More informationHypothesis Testing. Richard S. Balkin, Ph.D., LPC-S, NCC
Hypothesis Testing Richard S. Balkin, Ph.D., LPC-S, NCC Overview When we have questions about the effect of a treatment or intervention or wish to compare groups, we use hypothesis testing Parametric statistics
More informationMultiple Regression Analysis of Twin Data: Etiology of Deviant Scores versus Individuai Differences
Acta Genet Med Gemellol 37:205-216 (1988) 1988 by The Mendel Institute, Rome Received 30 January 1988 Final 18 Aprii 1988 Multiple Regression Analysis of Twin Data: Etiology of Deviant Scores versus Individuai
More informationChapter 13: Introduction to Analysis of Variance
Chapter 13: Introduction to Analysis of Variance Although the t-test is a useful statistic, it is limited to testing hypotheses about two conditions or levels. The analysis of variance (ANOVA) was developed
More informationd =.20 which means females earn 2/10 a standard deviation more than males
Sampling Using Cohen s (1992) Two Tables (1) Effect Sizes, d and r d the mean difference between two groups divided by the standard deviation for the data Mean Cohen s d 1 Mean2 Pooled SD r Pearson correlation
More informationHeritability. The concept
Heritability The concept What is the Point of Heritability? Is a trait due to nature or nurture? (Genes or environment?) You and I think this is a good point to address, but it is not addressed! What is
More informationDiallel Analysis and its Applications in Plant Breeding
Diallel Analysis and its Applications in Plant Breeding Madhu Choudhary*, Kana Ram Kumawat and Ravi Kumawat Department of Plant Breeding and Genetics, S.K.N. Agriculture University, Jobner-303329, Jaipur
More informationDEFINITIONS: POPULATION: a localized group of individuals belonging to the same species
DEFINITIONS: POPULATION: a localized group of individuals belonging to the same species SPECIES: a group of populations whose individuals have the potential to interbreed and produce fertile offspring
More informationBasic Statistics and Data Analysis in Work psychology: Statistical Examples
Basic Statistics and Data Analysis in Work psychology: Statistical Examples WORK PSYCHOLOGY INTRODUCTION In this chapter we examine a topic which is given too little coverage in most texts of this kind,
More informationMBG* Animal Breeding Methods Fall Final Exam
MBG*4030 - Animal Breeding Methods Fall 2007 - Final Exam 1 Problem Questions Mick Dundee used his financial resources to purchase the Now That s A Croc crocodile farm that had been operating for a number
More informationHungry Mice. NP: Mice in this group ate as much as they pleased of a non-purified, standard diet for laboratory mice.
Hungry Mice When laboratory mice (and maybe other animals) are fed a nutritionally adequate but near-starvation diet, they may live longer on average than mice that eat a normal amount of food. In this
More informationCHAPTER NINE DATA ANALYSIS / EVALUATING QUALITY (VALIDITY) OF BETWEEN GROUP EXPERIMENTS
CHAPTER NINE DATA ANALYSIS / EVALUATING QUALITY (VALIDITY) OF BETWEEN GROUP EXPERIMENTS Chapter Objectives: Understand Null Hypothesis Significance Testing (NHST) Understand statistical significance and
More informationS P O U S A L R ES E M B L A N C E I N PSYCHOPATHOLOGY: A C O M PA R I SO N O F PA R E N T S O F C H I LD R E N W I T H A N D WITHOUT PSYCHOPATHOLOGY
Aggregation of psychopathology in a clinical sample of children and their parents S P O U S A L R ES E M B L A N C E I N PSYCHOPATHOLOGY: A C O M PA R I SO N O F PA R E N T S O F C H I LD R E N W I T H
More informationSTATISTICS AND RESEARCH DESIGN
Statistics 1 STATISTICS AND RESEARCH DESIGN These are subjects that are frequently confused. Both subjects often evoke student anxiety and avoidance. To further complicate matters, both areas appear have
More informationBIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA
BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA ingle factor or One - Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group
More informationCHAPTER III METHODOLOGY
24 CHAPTER III METHODOLOGY This chapter presents the methodology of the study. There are three main sub-titles explained; research design, data collection, and data analysis. 3.1. Research Design The study
More information25. Two-way ANOVA. 25. Two-way ANOVA 371
25. Two-way ANOVA The Analysis of Variance seeks to identify sources of variability in data with when the data is partitioned into differentiated groups. In the prior section, we considered two sources
More informationBRONX COMMUNITY COLLEGE LIBRARY SUGGESTED FOR MTH 01 FUNDAMENTAL CONCEPTS & SKILLS IN ARITHMETIC & ALGEBRA
BRONX COMMUNITY COLLEGE LIBRARY SUGGESTED FOR MTH 01 FUNDAMENTAL CONCEPTS & SKILLS IN ARITHMETIC & ALGEBRA Textbook: Publisher: Developmental Mathematics, 4th edition by Johnson, Willis, and Hughes Addison
More informationSheila Barron Statistics Outreach Center 2/8/2011
Sheila Barron Statistics Outreach Center 2/8/2011 What is Power? When conducting a research study using a statistical hypothesis test, power is the probability of getting statistical significance when
More informationInferential Statistics for Radiation Scientists: A Brief Guide to Better Statistical Interpretations
Journal of Medical Imaging and Radiation Sciences Journal of Medical Imaging and Radiation Sciences 43 (2012) 121-131 Directed Reading Article Inferential Statistics for Radiation Scientists: A Brief Guide
More informationGENETIC LINKAGE ANALYSIS
Atlas of Genetics and Cytogenetics in Oncology and Haematology GENETIC LINKAGE ANALYSIS * I- Recombination fraction II- Definition of the "lod score" of a family III- Test for linkage IV- Estimation of
More informationOne-Way Independent ANOVA
One-Way Independent ANOVA Analysis of Variance (ANOVA) is a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment.
More informationTo open a CMA file > Download and Save file Start CMA Open file from within CMA
Example name Effect size Analysis type Level Tamiflu Symptom relief Mean difference (Hours to relief) Basic Basic Reference Cochrane Figure 4 Synopsis We have a series of studies that evaluated the effect
More informationMendel s Methods: Monohybrid Cross
Mendel s Methods: Monohybrid Cross Mendel investigated whether the white-flowered form disappeared entirely by breeding the F1 purple flowers with each other. Crossing two purple F1 monohybrid plants is
More informationDescriptive Statistics Lecture
Definitions: Lecture Psychology 280 Orange Coast College 2/1/2006 Statistics have been defined as a collection of methods for planning experiments, obtaining data, and then analyzing, interpreting and
More informationSomething to think about. What happens, however, when we have a sample with less than 30 items?
One-Sample t-test Remember In the last chapter, we learned to use a statistic from a large sample of data to test a hypothesis about a population parameter. In our case, using a z-test, we tested a hypothesis
More informationApplied Statistical Analysis EDUC 6050 Week 4
Applied Statistical Analysis EDUC 6050 Week 4 Finding clarity using data Today 1. Hypothesis Testing with Z Scores (continued) 2. Chapters 6 and 7 in Book 2 Review! = $ & '! = $ & ' * ) 1. Which formula
More informationPsych 3102 Introduction to Behavior Genetics
Psych 3102 Introduction to Behavior Genetics Lecture 12 Quantitative analysis Covariance between relatives Sources of covariance between relatives covariance a measure of shared variance (how similar the
More informationMendelian Genetics using Fast Plants Report due Sept. 15/16. Readings: Mendelian genetics: Hartwell Chapter 2 pp , Chapter 5 pp
1 Biology 423L Sept. 1/2 Mendelian Genetics using Fast Plants Report due Sept. 15/16. Readings: Mendelian genetics: Hartwell Chapter 2 pp. 13-27, Chapter 5 pp. 127-130. FASTPLANTS: Williams et al. (1986)
More informationUnequal Numbers of Judges per Subject
The Reliability of Dichotomous Judgments: Unequal Numbers of Judges per Subject Joseph L. Fleiss Columbia University and New York State Psychiatric Institute Jack Cuzick Columbia University Consider a
More informationThe Effect Sizes r and d in Hypnosis Research
Marty Sapp The Effect Sizes r and d in Hypnosis Research Marty Sapp, Ed.D. The effect sizes r and d and their confidence intervals can improve hypnosis research. This paper describes how to devise scientific
More informationCRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys
Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests
More informationPearson Edexcel International GCSE in Mathematics (Specification A) (9-1) Exemplar student answers with examiner comments
Pearson Edexcel International GCSE in Mathematics (Specification A) (9-1) Exemplar student answers with examiner comments 1 Contents About this booklet... 3 How to use this booklet.... 3 Guide on the use
More informationEmpirical Knowledge: based on observations. Answer questions why, whom, how, and when.
INTRO TO RESEARCH METHODS: Empirical Knowledge: based on observations. Answer questions why, whom, how, and when. Experimental research: treatments are given for the purpose of research. Experimental group
More information