Two Factor Analysis of Variance

Similar documents
BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

ANOVA in SPSS (Practical)

Dr. Kelly Bradley Final Exam Summer {2 points} Name

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

Multiple Linear Regression Analysis

One-Way Independent ANOVA

Testing the effect of two factors at different levels (two treatments). Examples: Yield of a crop varying with fertilizer type and seedling used.

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Instructions for doing two-sample t-test in Excel

Lessons in biostatistics

Simple Linear Regression One Categorical Independent Variable with Several Categories

Intro to SPSS. Using SPSS through WebFAS

CHAPTER III METHODOLOGY

Chapter 6 Analysis of variance and experimental designed

CHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA

Analysis of Variance (ANOVA) Program Transcript

Research Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process

Study Guide #2: MULTIPLE REGRESSION in education

Two-Way Independent ANOVA

STATS Relationships between variables: Correlation

Examining differences between two sets of scores

Section 6: Analysing Relationships Between Variables

1) What is the independent variable? What is our Dependent Variable?

CHAPTER TWO REGRESSION

USE AND MISUSE OF MIXED MODEL ANALYSIS VARIANCE IN ECOLOGICAL STUDIES1

Chapter 3: Describing Relationships

Inferential Statistics

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID #

Chapter 5: Field experimental designs in agriculture

Lesson 9: Two Factor ANOVAS

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Q: How do I get the protein concentration in mg/ml from the standard curve if the X-axis is in units of µg.

ANOVA. Thomas Elliott. January 29, 2013

LAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival*

Example 7.2. Autocorrelation. Pilar González and Susan Orbe. Dpt. Applied Economics III (Econometrics and Statistics)

25. Two-way ANOVA. 25. Two-way ANOVA 371

Math 075 Activities and Worksheets Book 2:

CHAPTER ONE CORRELATION

One-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels;

Chapter 1: Exploring Data

1. To review research methods and the principles of experimental design that are typically used in an experiment.

Exercise Verify that the term on the left of the equation showing the decomposition of "total" deviation in a two-factor experiment.

Chapter 7: Correlation

Part 8 Logistic Regression

Two-Way Independent Samples ANOVA with SPSS

Correlation and Regression

FORM C Dr. Sanocki, PSY 3204 EXAM 1 NAME

Advanced ANOVA Procedures

Comparison of two means

Math Section MW 1-2:30pm SR 117. Bekki George 206 PGH

Assignment #6. Chapter 10: 14, 15 Chapter 11: 14, 18. Due tomorrow Nov. 6 th by 2pm in your TA s homework box

Chapter 14. Inference for Regression Inference about the Model 14.1 Testing the Relationship Signi!cance Test Practice

EPS 625 INTERMEDIATE STATISTICS TWO-WAY ANOVA IN-CLASS EXAMPLE (FLEXIBILITY)

14.1: Inference about the Model

Variability. After reading this chapter, you should be able to do the following:

The Jumping Dog Quadratic Activity

COMPUTER-BASED BIOMETRICS MANUAL

A Comparison of Three Measures of the Association Between a Feature and a Concept

Introduction. Lecture 1. What is Statistics?

Use the above variables and any you might need to construct to specify the MODEL A/C comparisons you would use to ask the following questions.

M15_BERE8380_12_SE_C15.6.qxd 2/21/11 8:21 PM Page Influence Analysis 1

ExperimentalPhysiology

12.1 Inference for Linear Regression. Introduction

An Introduction to Research Statistics

SPSS Correlation/Regression

CLINICAL RESEARCH METHODS VISP356. MODULE LEADER: PROF A TOMLINSON B.Sc./B.Sc.(HONS) OPTOMETRY

Daniel Boduszek University of Huddersfield

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE

Multiple Regression Analysis

Psych 5741/5751: Data Analysis University of Boulder Gary McClelland & Charles Judd. Exam #2, Spring 1992

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data

Stat Wk 9: Hypothesis Tests and Analysis

CHAPTER 2 TAGUCHI OPTIMISATION TECHNIQUE

Lesson: A Ten Minute Course in Epidemiology

Basic Biostatistics. Chapter 1. Content

Section 3 Correlation and Regression - Teachers Notes

isc ove ring i Statistics sing SPSS

Answer all three questions. All questions carry equal marks.

Observational study is a poor way to gauge the effect of an intervention. When looking for cause effect relationships you MUST have an experiment.

Creative Commons Attribution-NonCommercial-Share Alike License

Thinking about Drinking

Daniel Boduszek University of Huddersfield

2 Assumptions of simple linear regression

Chapter 9. Factorial ANOVA with Two Between-Group Factors 10/22/ Factorial ANOVA with Two Between-Group Factors

Modeling Binary outcome

MA 151: Using Minitab to Visualize and Explore Data The Low Fat vs. Low Carb Debate

Hour 2: lm (regression), plot (scatterplots), cooks.distance and resid (diagnostics) Stat 302, Winter 2016 SFU, Week 3, Hour 1, Page 1

IAPT: Regression. Regression analyses

Chi-Square Goodness-of-Fit Test

Birds' Judgments of Number and Quantity

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results.

SUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

FRESH FRUIT JUICES-A STUDY ON CONSUMER PREFERENCE AND DEMOGRAPHIC PROFILING

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

Factorial Analysis of Variance

Chapter 4 DESIGN OF EXPERIMENTS

Worksheet 6 - Multifactor ANOVA models

Transcription:

BIOL 310 Two Factor Analysis of Variance In the previous discussions of analysis of variance (ANOVA), only one factor was involved. For example, in Chapter 7 the variable of interest in the sample problem was cholesterol concentration and the statistical test was to determine if sex affected the concentration. In that problem sex is termed the factor. In Chapter 9, the variable of interest in the sample problem was body weight and the test was to determine if the feed used (the factor) affected body weight. In that problem, four feeds were used and each type of feed is said to be a level of the factor. In many biological experiments, there are more than one factor involved. This lesson deals with a two factor analysis, which is dealt with by running a two-way ANOVA, which can then be followed by a post-test if necessary. One of the strengths of a two-way ANOVA is that it allows you to test three null hypotheses at one time, all of which may be biologically important. 1. Ho: there is no effect of factor 1 on the response variable (variable of interest) 2. Ho: there is no effect of factor 2 on the response variable 3. Ho: there is no interaction between factor 1 and factor 2 on the response variable The first two of these can be tested using the one factor ANOVA (one-way ANOVA) but the third one, the interaction, can be obtained only through the two-way ANOVA approach. Besides it is much more efficient to test all three null hypotheses in one single test (two-way ANOVA) rather than using a series of tests (one-way ANOVA). If it is found that there is a significant interaction between the two factors, then neither factor can be considered independently. An interaction means that the effect of one factor depends on the level of another factor. If the interaction is significant it could be due to synergism or interference. When two factors act synergistically, the combined effect is greater than the sum of the separate effects. In contrast, when two factors in combination inhibit each other s effects it is termed interference. Biologically, interactions are very important and can be extremely interesting; however, they can sometimes be a challenge to interpret. One way to think about all of the possible Hos being tested by a two-way ANOVA is to consider them as a series of linear equations. For instance, in a one-way ANOVA we are interested in the effect of a single factor (X) on the response variable (Y) then you could write this as a word equation in the form of Y = X. Using the example from Chapter 7, the word equation describing the analysis would be: cholesterol = sex. A two-way ANOVA then investigates three scenarios: Response = Factor1 Response = Factor2 Response = Factor1*Factor2 with the later being the potential interaction between the two factors. 89

Calculating a Two-way ANOVA Using MINITAB Although only slightly more complex mathematically, it is usually easiest to use a statistical package and a general linear model to conduct a two-way ANOVA. We will be using MINITAB in this class, but the data entry and output from the analysis are fairly standard. A standard MINITAB screen consists of two windows, a worksheet similar to EXCEL where data is entered and a session window where the results of the analysis are shown. We will first consider how to enter data for a two-way ANOVA. In MINITAB it is necessary to enter each factor as a separate column and the response variable in another. For example, for Problem 15.1 the worksheet would have three columns (Figure 15.1): 1) hormone or no hormone, 2) sex, and 3) calcium levels. The first two are factors and the last, calcium levels, is the response variable. The next step is to choose ANOVA from the Stat menu and then General Linear Model. The general linear model window is fairly self explanatory (Figure 15.2, below). The columns with appropriate data are listed in the Figure 15.1. Sample MINITAB worksheet from Problem 15.1. left-hand box, after selecting the appropriate one you chose Select and it will be transferred into the place selected. For instance, Ca level was entered into the Response: box by placing the curser in the box and then choosing Ca level from the options at the left. In fact, Ca level is the only option for the response variable since the other two columns are filled with categorical data. The Model box has each factor tested independently as well as the interaction (in this case, Hormone*Sex ). Once the model is set up as shown above the next step is to simply choose OK. The results will then appear in the Session window as shown in Figure 15.3. Figure 15.2. Example of general linear model window 90

Figure 15.3. MINITAB statistical output for a two-way ANOVA run as a general linear model using data from Problem 15.1. Example Problem (Problem 15.1) Investigators wished to know if there was a difference in plasma calcium levels of male and female birds given a hormone treatment. They obtained the data presented in Problem 16.1. The two factors are sex and hormone treatment. Both factors have two levels since there are male and female categories for sex and with and without hormone for hormone treatment. In this 2 x 2 analysis (two levels of each factor) there are five replicates (five birds) for each combination of the two factors; in other words, there are equal sample sizes for each of the experimental conditions. I. STATISTICAL STEPS A. Statement of Ho Ho: No effect of hormone treatment on the plasma Ca concentration Ho: No difference in plasma Ca concentration between male and female birds Ho: No interaction between sex and hormone treatment on the plasma Ca concentration B. Statistical test Since two factors are involved we need to use a two-way ANOVA, with possibly a multiple comparison post-test if necessary. C. Computation of test statistics The Minitab analysis results are shown above, in Figure 15.3. Ho: No effect of hormone treatment on the plasma Ca concentration MS Hormone 1386.1 F = = = 60.5,1/16 df, P < 0.001 MS Error 22.9 91

Ho: No difference in plasma Ca concentration between male and female birds MSSex 70.3 F = = = 3.07,1/16 df, P = 0.099 MS Error 22.9 Ho: No interaction between sex and hormone treatment on the plasma Ca concentration MS Interaction 4.9 F = = = 0.21,1/16 df, P = 0.650 MS Error 22.9 D. Determine the P of the test statistic Ho: No effect of hormone treatment on the plasma Ca concentration, P<0.05 Ho: No difference in plasma Ca concentration between male and female birds, P>0.05 Ho: No interaction between sex and hormone treatment on the plasma Ca concentration, P>0.05 E. Statistical Inference 1. The hormone treatment did have an effect on plasma Ca concentration. 2. There was no difference in plasma Ca concentrations of male and female birds Note of caution the only reason we can examine the effects of each of the factors is because the interaction term was not significant. Thus, it is often best to look at the analysis of the interaction first. II. BIOLOGICAL INTERPRETATION Results The hormone treatment significantly increased plasma calcium concentrations in birds, regardless of sex (Table 1). Calcium levels were on average 55% higher in birds that received the hormone treatment than those that did not (Figure 1). Table 1. Analysis of variance table for plasma calcium levels in birds of both sexes, half of which received a hormone treatment. Use of an asterisk denotes a significant effect at the 5% significance level. Source df SS MS F P Hormone Treatment 1 1386.11 1386.11 60.53 < 0.001* Sex 1 70.31 70.31 3.07 0.099 Interaction 1 4.90 4.90 0.21 0.650 Error 16 366.37 22.90 Total 19 1827.70 92

50 45 Calcium Level (mg/ml) 40 35 30 25 20 15 10 5 0 Female-No Female-Horm Male-No Male-Horm Sex / Treatment Figure 1. Plasma calcium levels (mg/ml) of male and female birds both with (closed symbols) and without (open symbols) a hormone treatment. Horizontal lines are the means and the boxes are the 95% confidence limits about the mean. Special Case of Random Factors In Problem 15.1, as with many biological experiments, the factors are assigned by the investigator and we refer to them as being fixed. However there are some situations in which one or both the factors are random (e.g. geographic location). In order to determine if a factor is random or fixed, biostatisticians have suggested that one should ask the question, If the observations were to be made several times, would the biology of the problem force the same choice of levels (e.g. sex) each time, or would any set of levels work? If the choice of levels is fixed by the problem, the factor is a fixed one; if the choice of levels is arbitrary, it is a random factor. It is important to determine if the factors are fixed or random because the general linear model two-way ANOVA procedure in MINITAB assumes that they are and calculates the F values based that case (Table 15.1). If the factors are not fixed, but instead are random, then a different procedure is appropriate: a Balanced ANOVA. The balanced ANOVA procedure works the same way as the general linear model procedure except that you also specify which of the factors are random, so that the F statistic is calculated appropriately. The model is the same as would be entered for a regular two-way ANOVA. Finally, if sample sizes are unequal, use General Linear Model, which is used in the same manner as Balanced ANOVA. Table 15.1. Computation of the F statistic for tests of significance in a two-factor ANOVA. Hypothesized Effect A & B Fixed A Fixed, B Random A & B Random Factor A Factor A MS Factor A MS Factor A MS Factor B Factor B MS Factor B MS Factor B MS Interaction 93

Problem Set - Two-way ANOVA 15.1) Investigators wished to know if there was a difference in plasma calcium levels (mg Ca ml -1 ) of male and female birds given a hormone treatment. No Hormone Hormone Female Male Female Male 16.5 14.5 39.1 32.0 18.4 11.0 26.2 23.8 12.7 10.8 21.3 28.8 14.0 14.3 35.8 25.0 12.8 10.0 40.2 29.3 15.2) Fish of each sex were given one of three different hormone treatments, after which the blood calcium was measured (mg Ca per 100 ml blood). The data are as follows: Males Females Treatment1 Treatment2 Treatment3 Treatment1 Treatment2 Treatment3 16.87 19.07 32.45 15.86 17.20 30.54 16.18 18.77 28.71 14.92 17.64 32.41 17.12 17.63 34.65 15.63 17.89 28.97 16.83 16.99 28.79 15.24 16.78 28.46 17.19 18.04 24.46 14.8 16.92 29.65 15.3) Investigators wished to know if the oxygen consumption rates (µl O 2 mg dry weight -1 min -1 ) varied between two species of limpets, Acmaea scabra and A. digitalis, at three concentrations of seawater. 100% 75% 50% A. scabra A. digitalis A. scabra A. digitalis A. scabra A. digitalis 7.16 8.26 6.14 6.14 5.20 13.20 4.47 4.95 11.11 10.50 9.63 14.50 6.78 14.00 3.86 10.00 5.20 8.39 9.90 6.49 9.74 14.60 6.38 10.20 13.60 16.10 10.40 11.60 7.17 10.40 5.75 5.44 18.80 11.10 13.40 17.70 8.93 9.66 5.49 5.80 6.37 7.18 11.80 9.90 9.74 11.80 14.50 12.30 15.4) Investigators wished to know if the number of insects varied between two creeks and at various times of year. They collected the following data: Month Shope Ball December 7 9 18 25 10 9 19 1 15 16 14 28 June 124 51 63 20 44 26 106 81 83 127 38 52 March 29 24 37 35 18 45 114 64 49 22 27 29 September 100 87 72 40 45 263 68 67 9 100 129 115 94