Tutorial 3: MANOVA. Pekka Malo 30E00500 Quantitative Empirical Research Spring 2016

Similar documents
(C) Jamalludin Ab Rahman

Profile Analysis. Intro and Assumptions Psy 524 Andrew Ainsworth

One-Way Independent ANOVA

1. You want to find out what factors predict achievement in English. Develop a model that

Repeated Measures ANOVA and Mixed Model ANOVA. Comparing more than two measurements of the same or matched participants

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

Small Group Presentations

isc ove ring i Statistics sing SPSS

Overview of Lecture. Survey Methods & Design in Psychology. Correlational statistics vs tests of differences between groups

Before we get started:

CHAPTER VI RESEARCH METHODOLOGY

ANOVA in SPSS (Practical)

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis

Readings Assumed knowledge

Examining differences between two sets of scores

Comparing 3 Means- ANOVA

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

EPS 625 INTERMEDIATE STATISTICS TWO-WAY ANOVA IN-CLASS EXAMPLE (FLEXIBILITY)

Analysis and Interpretation of Data Part 1

Supplementary Materials Are taboo words simply more arousing?

Daniel Boduszek University of Huddersfield

Business Research Methods. Introduction to Data Analysis

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

Introduction to SPSS. Katie Handwerger Why n How February 19, 2009

Day 11: Measures of Association and ANOVA

Two-Way Independent ANOVA

f WILEY ANOVA and ANCOVA A GLM Approach Second Edition ANDREW RUTHERFORD Staffordshire, United Kingdom Keele University School of Psychology

Advanced ANOVA Procedures

Choosing a Significance Test. Student Resource Sheet

Intro to SPSS. Using SPSS through WebFAS

Basic Features of Statistical Analysis and the General Linear Model

HANDOUTS FOR BST 660 ARE AVAILABLE in ACROBAT PDF FORMAT AT:

Stat Wk 9: Hypothesis Tests and Analysis

Basic Biostatistics. Chapter 1. Content

8/28/2017. If the experiment is successful, then the model will explain more variance than it can t SS M will be greater than SS R

Two-Way Independent Samples ANOVA with SPSS

Ecological Statistics

Reading Time [min.] Group

PSY 216: Elementary Statistics Exam 4

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

Chapter 11: Comparing several means

Chapter 11. Experimental Design: One-Way Independent Samples Design

Quantitative Methods in Computing Education Research (A brief overview tips and techniques)

Mantel-Haenszel Procedures for Detecting Differential Item Functioning

Investigating the robustness of the nonparametric Levene test with more than two groups

Analysis of Covariance (ANCOVA)

Previously, when making inferences about the population mean,, we were assuming the following simple conditions:

Principal Components Factor Analysis in the Literature. Stage 1: Define the Research Problem

Chapter 25. Paired Samples and Blocks. Copyright 2010 Pearson Education, Inc.

Levene Statistic df1 df2 Sig

Statistics 2. RCBD Review. Agriculture Innovation Program

Midterm Exam MMI 409 Spring 2009 Gordon Bleil

Unit 1 Exploring and Understanding Data

Analysis of Variance (ANOVA)

USER INSTITUTIONS OF OFFICE WORKERS IN GHANA: INVESTIGATING THE DIFFERENCES ABSTRACT

C.3 Repeated Measures ANOVA

1. What is the relationship between item response indecision and SDS summary code permutations?

7 Statistical Issues that Researchers Shouldn t Worry (So Much) About

SUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK

Chapter 14: More Powerful Statistical Methods

RESULTS. Chapter INTRODUCTION

Bangor University Laboratory Exercise 1, June 2008

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.

Regression Discontinuity Analysis

10. LINEAR REGRESSION AND CORRELATION

Statistics Guide. Prepared by: Amanda J. Rockinson- Szapkiw, Ed.D.

ANCOVA with Regression Homogeneity

Dr. Kelly Bradley Final Exam Summer {2 points} Name

Developing and Testing Hypotheses Kuba Glazek, Ph.D. Methodology Expert National Center for Academic and Dissertation Excellence Los Angeles

Overview of Non-Parametric Statistics

Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 8 One Way ANOVA and comparisons among means Introduction

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

investigate. educate. inform.

Cultural Intelligence: A Predictor of Ethnic Minority College Students Psychological Wellbeing

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

Sample Exam Paper Answer Guide

A Brief (very brief) Overview of Biostatistics. Jody Kreiman, PhD Bureau of Glottal Affairs

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

Validity, Reliability and Classical Assumptions

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

Understandable Statistics

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2

PSYCHOLOGY 320L Problem Set #4: Estimating Sample Size, Post Hoc Tests, and Two-Factor ANOVA

Multivariate methods in rehabilitation

Research Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process

Chapter 13: Factorial ANOVA

Reveal Relationships in Categorical Data

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

ANALYSIS OF VARIANCE (ANOVA): TESTING DIFFERENCES INVOLVING THREE OR MORE MEANS

Research Analysis MICHAEL BERNSTEIN CS 376

Nature Neuroscience: doi: /nn Supplementary Figure 1. Task timeline for Solo and Info trials.

Bonferroni Adjustments in Tests for Regression Coefficients Daniel J. Mundfrom Jamis J. Perrett Jay Schaffer

STATISTICS AND RESEARCH DESIGN

Daniel Boduszek University of Huddersfield

Reliability of Ordination Analyses

Analysis of Variance (ANOVA) Program Transcript

Correlation and Regression

Experimental Studies. Statistical techniques for Experimental Data. Experimental Designs can be grouped. Experimental Designs can be grouped

Transcription:

Tutorial 3: Pekka Malo 30E00500 Quantitative Empirical Research Spring 2016

Step 1: Research design Adequacy of sample size Choice of dependent variables Choice of independent variables (treatment effects) Optional: Use of covariates (MANCOVA) 2

Choice of dependent variables Don t try to include too many, use only as many variables as necessary Single badly chosen variable can distort the entire result Use only variables with a strong theoretical support 3

Choice of independent variables (factors or treatments ) Often researcher is interested in examining effects of several independent variables rather than using only a single variable à factorial design Sometimes researchers are for compelled to add more independent variables to control for characteristic that affect the dependent variables but is not part of the study design (e.g. geographic location, gender) à blocking factors Remember sample size limitations 4

Use of covariates (MANCOVA) Sometimes the research also has to control for effects of metric variables à covariates Objective of covariates is to 1. Eliminate any effects that influence only a portion of the respondents or 2. Effects that vary among the respondents Key benefits: 1. Helps to eliminate systematic error outside the control of the researcher (effects which could bias results) 2. Accounts for differences in responses due to unique characteristics of respondents Note: Number of covariates should not exceed the following thumbrule: 0.1 x sample size (number of groups -1) 5

Sample size Need to use larger samples than in ANOVA Minimum sample in each group must be greater than the number of dependent variables Recommended minimum number per group is 20 observations The higher the number of dependent variables, the greater the sample size needed to maintain statistical power 6

Sample size with power at 0.80 7

Step 2: Check assumptions Independence of observations Homogeneity of variance/covariance matrices Levene s univariate tests (similar process as in ANOVA) Box s M-test for covariance matrices Normality Check histograms and Kolmogorov-Smirnov tests Multicollinearity of dependent variables Sensitivity to outliers 8

Normality and independence Multivariate normality: All dependent variables should be normally distributed Any linear combination of the dependent variables should be also normally distributed Luckily is robust test and survives departures from multivariate normality When multivariate normality not satisfied, lesser power to detect main or interaction effects Independence of observations must not be violated! 9

Explore by variable Analyze > Descriptive statistics > Explore (plots) 10

Equality of covariance matrices For k multivariate populations, the hypothesis of equality of covariance matrices is Commonly tested using Box s M-test: Sensitive to the size of covariance matrices and the number of groups in the analysis Highly sensitive to departures from normality: always check univariate normality of measures before performing the test However, violation of the assumption doesn t generally have severe impact when groups are of approximately equal size 11

Equality of covariance matrices (2) What if group sizes differ more and Box s M-test is not within acceptable levels (alpha <.001)? Apply one of the many variance-stabilizing transformations on variables Use adjusted tests which do not assume equal variances: E.g. Games-Howell 12

Box s M-test in SPSS The test is obtained at the estimation step as part of GLM routine (see Step 3) 13

Step 3: Estimation and significance testing Choice of criteria for significance testing Assessment of statistical power Effects of dependent variable multicollinearity 14

Estimation with the General Linear Model (GLM) Classical techniques established more than 70 years ago, but nowadays models are commonly fitted using GLM can be viewed as a multivariate multiple regression model Reasons for using GLM: Single estimation procedure can be used to handle wide range of specifications Available in most statistical packages (e.g. SPSS, SAS) 15

Multivariate GLM in SPSS 16

Multivariate Tests () Wilks lambda (or U-statistic): Roy s greatest characteristic root (GCR): Pillai s criterion and Hotelling s T 2 Often referred to as the multivariate F-test Preferred when basic requirements (sample size, no violations, approximately equal sized groups) are met Advantages in power and specificity Most appropriate when dependent variables are strongly interrelated on one dimension Strongly affected by violations of assumptions Similar to Wilks lambda More robust and preferred when sample size decreases, unequal groups, or homogeneity of covariances is violated 17

18

Step 4: Interpretation of Results Interpret the effects of covariates, if employed Assess which dependent variable(s) exhibited differences across the groups of each treatment Identify whether the groups differ on a single dependent variable or the entire dependent variate 19

Interpreting Covariates If covariates are involved, they need to be interpreted first, since they act as controls on the dependent variate Assessment of overall impact: Evaluate impacts with and without covariates in the model Effective covariates should improve statistical power and reduce within-group variance Interpretation of covariates: Similar to regression analysis If their overall impact is significant, it is of interest to examine the strength of their predictive relationship with dependent variables 20

Main Effects of Treatments Main effect is typically described by the difference between groups on the dependent variables in the analysis Example: if gender has a significant main effect, we could look to the difference in means as a way of describing the impact (e.g. 7.5 for women vs. 6.0 for men) When treatment or factor has more than two levels, significant main effect does not guarantee that all groups are significantly different (at least one pair of groups is) 21

Identifying differences between individual groups Once you are confident that there are differences between groups, it is possible to perform a variety of tests to understand which groups differ Contrasts (~a priori tests): Planned comparisons based on scientific goals Post-hoc tests (~ decide after experiment): Performed for each dependent variable separately Used for situations where you can decide which comparisons you want to make after looking at the data 22

Impact of Interactions Any time more than one treatment or factor is used, there are interaction effects Interaction terms represent the joint effect of 2 or more factors No interaction: The effect of one factor is the same for each level of the other factor; allows direct interpretation of main effects independently Ordinal interaction Effects differ across the levels of another factor, but the group differences are always in the same direction Disordinal interaction Factors don t represent any consistent effect 23

Profile plots Profile plots (interaction plots) are useful for comparing marginal means in your model. A profile plot is a line plot in which each point indicates the estimated marginal mean of a dependent variable (adjusted for any covariates) at one level of a factor. If you have more than 1 independent variable: The levels of a second factor can be used to make separate lines. Each level in a third factor can be used to create a separate plot. 24

25

Example: Cereal evaluation Shape (ball, cube, star) Vs. Color (red, blue, green) Source: bestofbritish.ca 26

No interaction effect 27

Ordinal interaction 28

Disordinal interaction 29

Identifying differences between individual groups Once you are confident that there are differences between groups, it is possible to perform a variety of tests to understand which groups differ Contrasts (~a priori tests): Planned comparisons based on scientific goals Post-hoc tests (~ decide after experiment): Performed for each dependent variable separately Used for situations where you can decide which comparisons you want to make after looking at the data 30

Post-hoc Multiple Comparisons Commonly used tests for multiple comparisons: Bonferroni s test: based on Student s t-test, adjusts the significance level for multiple comparisons Tukey s honestly significant difference test: uses Studentized range statistic to make all pairwise comparisons between groups Choose a test based on the number of pairwise comparisons: Use Tukey for large number of pairs: More powerful than Bonferroni when several pairs of means considered Use Bonferroni for small number of pairs 31