Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 8 One Way ANOVA and comparisons among means Introduction

Similar documents
Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction

Intro to SPSS. Using SPSS through WebFAS

Analysis of Variance (ANOVA) Program Transcript

One-Way Independent ANOVA

One-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels;

Chapter 11. Experimental Design: One-Way Independent Samples Design

10. LINEAR REGRESSION AND CORRELATION

Examining differences between two sets of scores

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

Lesson 9: Two Factor ANOVAS

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

Two-Way Independent ANOVA

Analysis of Variance: repeated measures

Table of Contents. Plots. Essential Statistics for Nursing Research 1/12/2017

ANOVA in SPSS (Practical)

CHAPTER ONE CORRELATION

Chapter 12: Introduction to Analysis of Variance

Using SPSS for Correlation

ANOVA. Thomas Elliott. January 29, 2013

Chapter 13: Introduction to Analysis of Variance

Two-Way Independent Samples ANOVA with SPSS

Experiment 1: Scientific Measurements and Introduction to Excel

Overview of Non-Parametric Statistics

Day 11: Measures of Association and ANOVA

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics

LAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival*

SUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK

Psychology of Perception Psychology 4165, Fall 2001 Laboratory 1 Weight Discrimination

2 Assumptions of simple linear regression

To open a CMA file > Download and Save file Start CMA Open file from within CMA

Experiment 1: Scientific Measurements and Introduction to Excel

appstats26.notebook April 17, 2015

Statistics for EES Factorial analysis of variance

isc ove ring i Statistics sing SPSS

Psychology of Perception Psychology 4165, Spring 2003 Laboratory 1 Weight Discrimination

PSY 216: Elementary Statistics Exam 4

Things you need to know about the Normal Distribution. How to use your statistical calculator to calculate The mean The SD of a set of data points.

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

A Penny for Your Thoughts: Scientific Measurements and Introduction to Excel

Chapter 9: Comparing two means

Section 6: Analysing Relationships Between Variables

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results.

Example of Interpreting and Applying a Multiple Regression Model

How to Conduct On-Farm Trials. Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona

USING STATCRUNCH TO CONSTRUCT CONFIDENCE INTERVALS and CALCULATE SAMPLE SIZE

Stat Wk 9: Hypothesis Tests and Analysis

CHAPTER TWO REGRESSION

Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties

The Effectiveness of Captopril

Analysis of Variance (ANOVA)

Lab 5a Exploring Correlation

Laboratory 10 Factors Controlling Phagocytosis in Tetrahymena,

Your Task: Find a ZIP code in Seattle where the crime rate is worse than you would expect and better than you would expect.

Appendix: Instructions for Treatment Index B (Human Opponents, With Recommendations)

Bangor University Laboratory Exercise 1, June 2008

Assignment #6. Chapter 10: 14, 15 Chapter 11: 14, 18. Due tomorrow Nov. 6 th by 2pm in your TA s homework box

Comparing 3 Means- ANOVA

Regression Including the Interaction Between Quantitative Variables

COMPUTER-BASED BIOMETRICS MANUAL

GST: Step by step Build Diary page

Psychology Research Methods Lab Session Week 10. Survey Design. Due at the Start of Lab: Lab Assignment 3. Rationale for Today s Lab Session

Simple Linear Regression One Categorical Independent Variable with Several Categories

The Logic of Data Analysis Using Statistical Techniques M. E. Swisher, 2016

Statisticians deal with groups of numbers. They often find it helpful to use

To open a CMA file > Download and Save file Start CMA Open file from within CMA

Math 075 Activities and Worksheets Book 2:

Chapter 9. Factorial ANOVA with Two Between-Group Factors 10/22/ Factorial ANOVA with Two Between-Group Factors

One way Analysis of Variance (ANOVA)

Small Group Presentations

Term Definition Example Amino Acids

Variability. After reading this chapter, you should be able to do the following:

Measuring the User Experience

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1

1. To review research methods and the principles of experimental design that are typically used in an experiment.

Tutorial 3: MANOVA. Pekka Malo 30E00500 Quantitative Empirical Research Spring 2016

Chapter 1: Managing workbooks

Impact of Group Rejections on Self Esteem

Basic Biostatistics. Chapter 1. Content

Business Statistics Probability

Worksheet 6 - Multifactor ANOVA models

1SCIENTIFIC METHOD PART A. THE SCIENTIFIC METHOD

Choosing a Significance Test. Student Resource Sheet

Statistics 571: Statistical Methods Summer 2003 Final Exam Ramón V. León

Correlation and Regression

Cleaning Up and Visualizing My Workout Data With JMP Shannon Conners, PhD JMP, SAS Abstract

Multiple Linear Regression Analysis

Hour 2: lm (regression), plot (scatterplots), cooks.distance and resid (diagnostics) Stat 302, Winter 2016 SFU, Week 3, Hour 1, Page 1

6. Unusual and Influential Data

Statistics 2. RCBD Review. Agriculture Innovation Program

Linear Regression in SAS

SPSS Correlation/Regression

Math 215, Lab 7: 5/23/2007

CLINICAL RESEARCH METHODS VISP356. MODULE LEADER: PROF A TOMLINSON B.Sc./B.Sc.(HONS) OPTOMETRY

Online Introduction to Statistics

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

Lab 5: Testing Hypotheses about Patterns of Inheritance

Exam 2 PS 306, Spring 2004

7. Bivariate Graphing

Transcription:

Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 8 One Way ANOVA and comparisons among means Introduction In this exercise, we will conduct one-way analyses of variance using two different platforms in JMP. We will also conduct planned and unplanned comparisons among means. Finally, we will learn how to conduct non-parametric correlation analysis. Objectives Learn to conduct one-way ANOVA using the Fit Y by X and Fit Model platform. Conduct planned and unplanned comparisons among means.. Exercise 1- Conducting one-way ANOVA Last week, we discussed the use of t tests (and their non-parametric equivalent) to compare two populations (groups or experimental treatments). Quinn and Keough (Chapter 8) and the JMP manual (Chapter 7) review the use of single factor analysis of variance (one-way ANOVA) to analyze experiments where more than two populations are compared. The results of the ANOVA tell us whether differences among group means exist (Fixed effects model). 1 As discussed in class, a similar approach is used in ANOVA as in regression analyses, namely that the ANOVA model is fit to minimize the sum of squares in the error variable. We will use a JMP data file that is closely related to the one we examined last week to practice conducting and interpreting one-way ANOVA. Opening the data file Open the file OldFieldSiteFinnishData.JMP. This week s data compares host plant characteristics and beetle performance on three species of Finnish willows, growing together in an abandoned field: Salix myrsinifolia, Salix pentandra, and Salix phylicifolia. Variables shown are the same as those used last week: LeafWater, LeafNitrogen, LeafToughness, TotalSalicylates, TotalPhenolics, RelLarGrowth, AreaEaten, and AveSurvival. Data for each host species are coded a different color. Comparing species values and testing statistical assumptions for nitrogen content 1. Select the Analyze and Fit Y by X commands, and drag HostSpec into the X, Factor box and LeafNitrogen into the Y, Response box before clicking OK. 2. Select the red triangle, Normal Quantile Plot and Plot Actual by Quantile. You will now see quantile plots, shown separately for each species, to the right of the scatterplots. Your JMP manual describes these plots on page 148-150. Check to make sure that ANOVA assumptions (normality and homogeneity of variance) appear to be satisfied. 3. Select the red triangle, and Means/Anova/t test. The output will show the Analysis of Variance results. On a separate sheet of paper, write down the numbers in the Summary of Fit and Analysis of Variance tables. Using the example on page 205 of your JMP 1 Or whether there is significant variation among groups (Random effects model).

Bio 345 Week 8-2- manual and a hand calculator, recalculate mean square column in the table, the F value, the root mean square error, and r square. Show your work, as done in the JMP manual. Determining which variables can be analyzed parametrically 4. Begin a table with nine rows, one for each variable, and repeat the inspection of quantile plots described in step 2. Next to each variable, write down whether the quantile plots appeared to show that the variable adheres to the normality and homogeneity of variance assumption. You should find three variables that clearly do not appear to adhere to the assumptions. For the variables that do adhere to assumptions, conduct ANOVAs and record the following as columns in your table: F, degrees of freedom in the numerator, degrees of freedom in the denominator, and P. 5. Carefully examine the three variables that failed to meet assumptions in the data table. One of them has an obvious problem for parametric analyses, because it affects the variance of one of the host species greatly. Which variable is it, and what is the problem? Transforming variables and analyzing them 6. We will try log-transforming the other two variables to see whether the transformed data conform to ANOVA assumptions. Select Column and New Column. When the next box opens, give the column the name logxxx where XXXX refers to the name of the original variable. Then select New Property and Formula. Using the formula editor (transcendental option), obtain the base 10 log of your variable, with the number 1 added to each value (e.g. logxxxx = log10(xxxx + 1). 2 7. Now select the Analyze and Fit Y by X platform again, and compare the quantile plots for each of the two transformed variables to the untransformed one. If you are satisfied that the transformation brings the variable into conformance with ANOVA assumptions, add two rows to your table, indicate the names of the log transformed variables, and include the same information for the transformed variables as you did for the untransformed ones in step 4. Using the Fit Model platform to save ANOVA residuals 8. Unfortunately, you cannot save predicted values or residuals within the Fit Y by X platform. Choose two variables that you analyzed above, select Analyze and Fit Model, place HostSpec into the Construct Model Effects box, and your two response variables into the Y box. Make sure that the Personality box continues to say Standard Least Squares and then press Run Model. 9. In the output window, select the red triangle next to Response Variablename and select Save Columns and Residuals. Repeat this action for the second response variable (output shown below the first one). Determining whether residuals are normally distributed 10. Select Analyze and Distribution and drag your two columns of residuals into the Y, Columns box. Then perform Shapiro-Wilks tests on each set of residuals to establish 2 We add one to each value to avoid problems that arise when base 10 logs are taken from values between zero and one (the log would be a negative value).

Bio 345 Week 8-3- whether they conform to expectations for a normal distribution. If you are unsure how to interpret the output from this analysis, please ask your instructor! 11. Write a brief summary of the results of your normality tests. Conducting non-parametric ANOVA 12. Your table, above, should contain one variable that could not be analyzed parametrically. The non-parametric methods in JMP, which you used last week to compare two species, can also be used when comparing more than two groups (see pages 222-224 of your JMP manual). Select Analyze and Fit Y by X with your remaining response variable as Y. Then select Nonparametric and Wilcoxon Test to obtain the results of a non parametric test of overall differences among the three host species. This test is equivalent to the Kruskal-Wallis test that is described in many statistics textbooks. 13. Write a sentence describing your findings for this last variable, based on the nonparametric analysis. Read the paragraphs in section 8.5.2 (pages 195-196 of Quinn and Keough) to gain some background into the Kruskal-Wallis test. Based on what you read, describe a potential problem with your use of the Kruskal-Wallis test with this variable. Exercise 2- Comparisons among means When we conduct ANOVA, the emphasis is on overall differences among groups or treatments, and not on whether individual groups differ from other groups. As scientists, however, we are often very interested in determining which means are significantly different. Two kinds of means comparisons exist. Planned comparisons or contrasts are made between means that we expect, for some reason that is a part of the experiment or study, should be different. We usually construct a contrast that compares groups of group means rather than each individual pair of means. Unplanned comparisons are pairwise tests of significant differences among means, usually conducted without knowing in advance which comparisons might be of interest. One could conduct individual t tests on each pair of means, but many statisticians believe that this procedure introduces the multiple comparisons problem of repeated significance testing (i.e. making many statistical tests increases the likelihood that two means will differ to a certain degree simply by chance). We will therefore use the Tukey s HSD test, which adjusts the significance level by the number of comparisons made. This test is discussed in Quinn and Keough (pages 199-200) and in your JMP manual (pages 212-213). In our case, we are only comparing three species, so three pairwise comparisons among species means are possible (MYR vs PHY), (MYR vs PEN), (PEN vs PHY). The multiple comparisons problem is much greater if you are comparing more groups of means. Unplanned comparisons 1. Begin a table with eight rows (one for each variable) and 10 columns, listed in the following order: sample size for Salix myrsinifolia, mean + standard error for S. myrsinifolia, sample size for S. pentandra, mean + standard error for S. pentandra, sample size for S. phylicifolia, mean + standard error for S. phylicifolia, a column for F, degrees of freedom in the numerator, degrees of freedom in the denominator, and the P value. You will use the last four columns when you make your planned comparisons (see below). Check with your instructor if you are not sure about the format of this table. I

Bio 345 Week 8-4- suggest that you enter your means and standard errors to 2-3 significant digits (e.g. 62.5 + 0.59). Remember to use your transformed data on this table if you found that a variable conformed to ANOVA assumptions better after transformation. 2. For each variable, select Analyze and Fit Y by X. Click on the red triangle to obtain Means and Stddev. Then click on it again to select Compare Means and All Pairs, Tukey HSD. You will see a new window appear, to the right of the graph, and output below that indicates (with positive values) which means are significantly different. Compare your output to the output shown on pages 183 and 184 of your JMP manual. Fill in the means and standard errors on your table. Then write, in lower case superscript, next to each mean, a letter. If there are no significant differences among the means, use the letter a for all three superscripts. If one species has higher values than the others, use superscript a for the higher mean and superscript b for the two lower means. Check with your instructor if you are not clear how to interpret the output or to decide which superscripts to use. Planned comparisons This data is well suited for a planned comparison. As you may have noticed, the willows S. myrsinifolia and S. pentandra both contain substantial amounts of salicylates, while the willow S. phylicifolia contains little or no salicylates. The beetle I studied in Europe, Phratora vitellinae, never fed on S. phylicifolia in nature. We can therefore compare the beetle s two natural host plants (MYR and PEN) to the species it does not use in nature (PHY). The results are of interest because they would show whether the two species used by the beetle share characteristics (high nutritive quality, high beetle growth and survival). 3. To conduct a planned comparison, you must use the Analyze and Fit Model platform. Construct your model using the variable LeafWater and press Run Model. To the right of the ANOVA output, you should see a leverage plot with the label HostSpec. This window shows values for the least-squares means, which are estimated in the ANOVA calculations. I prefer to report these in my own data analyses because they are adjusted for differences in sample size. In the present case, you should notice that the least squares means are pretty similar to the actual means in the far right column. 4. Click on the red triangle under HostSpec and select LSmeans plot. This shows you a graph of the least squares means and their standard errors and is a very useful feature in JMP. 5. Click on the red triangle under HostSpec again and select LSmeans contrast. You will see a new window open at the bottom of the output. Click in the plus box next to SalixMyr and then click in the plus box next to SalixPentandra. Now click on the minus box next to SalixPhylic. This sets up your contrast comparing the overall mean of PEN and MYR to the mean of PHY. Note that the two sets of contrasts sum to zero (as described just below formula 8.8, page 198 of Quinn and Keough). 6. Click on Done. Note that you are now given an F value, numerator degrees of freedom, denominator degrees of freedom, and a P value. Enter these values onto your table. 7. Repeat steps 3-6 for the other variables and fill in the table.

Bio 345 Week 8-5- Interpreting your results 8. Write a paragraph presenting and interpreting the results on your table. As you did last week, organize your paragraph by the kinds of variables tested (plant nutritional quality, plant chemistry, beetle performance). Indicate which variables differed significantly between species, and which differed little or not at all. Then summarize the information about the planned comparison between the two species used by the beetle and the other willow species. Use the results to answer the following question: do host species used by the beetle tend to be similar with respect to plant nutritional characteristics and beetle performance, or not? Tasks 1. Construct an ANOVA data and confirm how some values in the table are calculated directly from others, using an example from your JMP manual (page 176). 2. Examine a series of variables to determine whether they conform to ANOVA assumptions. Record the results of the ANOVA in a table, for those variables that did conform. Try to transform other variables to bring them into conformance and enter results from any transformed variables into your table. 3. Save residuals from two ANOVAs and compare their distributions to normal expectations using the Shapiro-Wilk test. 4. Conduct and interpret results from a non-parametric ANOVA on remaining variables. 5. Conduct unplanned comparisons tests among species means and use these to fill in a table of means with superscripts indicating which means are significantly different from others. 6. Conduct planned comparisons tests between the two species used by a beetle as natural host plants (S. myrsinifolia and S. pentandra) and the remaining species S. phylicifolia. Write a paragraph interpreting your results.