kxk BG Factorial Designs kxk BG Factorial Designs Basic and Expanded Factorial Designs

Similar documents
Limitations of 2-cond Designs

Limitations of 2-cond Designs

Between Groups & Within-Groups ANOVA

Sources of Variance & ANOVA

One-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels;

Regression Including the Interaction Between Quantitative Variables

Factorial Analysis of Variance

UNEQUAL CELL SIZES DO MATTER

Interactions between predictors and two-factor ANOVA

Intro to Factorial Designs

Multiple Regression Models

Example of Interpreting and Applying a Multiple Regression Model

Study Guide #2: MULTIPLE REGRESSION in education

Chapter 13 Summary Experiments and Observational Studies

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Analysis of Variance (ANOVA)

Villarreal Rm. 170 Handout (4.3)/(4.4) - 1 Designing Experiments I

HPS301 Exam Notes- Contents

INTENDED LEARNING OUTCOMES

Lesson 9: Two Factor ANOVAS

Statistics 2. RCBD Review. Agriculture Innovation Program

Research Designs. Internal Validity

Introduction to Path Analysis

Intro to SPSS. Using SPSS through WebFAS

Business Statistics Probability

Meanings of Causation

Chapter 18 Repeated-Measures Analysis of Variance Cont.

Keppel, G. & Wickens, T. D. Design and Analysis Chapter 10: Introduction to Factorial Designs

8/28/2017. If the experiment is successful, then the model will explain more variance than it can t SS M will be greater than SS R

Chapter 6 Analysis of variance and experimental designed

State of Connecticut Department of Education Division of Teaching and Learning Programs and Services Bureau of Special Education

Complex Regression Models with Coded, Centered & Quadratic Terms

Comparing 3 Means- ANOVA

WELCOME! Lecture 11 Thommy Perlinger

Use the above variables and any you might need to construct to specify the MODEL A/C comparisons you would use to ask the following questions.

Chapter 5: Producing Data

The essential focus of an experiment is to show that variance can be produced in a DV by manipulation of an IV.

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

Effects of Nutrients on Shrimp Growth

CHAPTER TWO REGRESSION

One-Way Independent ANOVA

Research Process. the Research Loop. Now might be a good time to review the decisions made when conducting any research project.

PSYCHOLOGY 320L Problem Set #4: Estimating Sample Size, Post Hoc Tests, and Two-Factor ANOVA

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA

PSY 216: Elementary Statistics Exam 4

Two-Way Independent Samples ANOVA with SPSS

SUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK

ANOVA in SPSS (Practical)

investigate. educate. inform.

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID #

Testing the effect of two factors at different levels (two treatments). Examples: Yield of a crop varying with fertilizer type and seedling used.

SPSS output for 420 midterm study

Chapter 13. Experiments and Observational Studies

Advanced ANOVA Procedures

Profile Analysis. Intro and Assumptions Psy 524 Andrew Ainsworth

Study Guide for the Final Exam

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

SPSS output for 420 midterm study

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Chapter 13: Factorial ANOVA

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE

ANOVA. Thomas Elliott. January 29, 2013

How to CRITICALLY APPRAISE

Still important ideas

2-Group Multivariate Research & Analyses

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

The Logic of Data Analysis Using Statistical Techniques M. E. Swisher, 2016

Our plan for giving better care to people with dementia Oxleas Dementia

7 Statistical Issues that Researchers Shouldn t Worry (So Much) About

Organizing Scientific Thinking Using the QuALMRI Framework

Chapter 5: Field experimental designs in agriculture

ID# Exam 2 PS 217, Fall 2010

HS Exam 1 -- March 9, 2006

Analysis of Variance: repeated measures

Stanford Youth Diabetes Coaches Program Instructor Guide Class #1: What is Diabetes? What is a Diabetes Coach? Sample

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES

n Outline final paper, add to outline as research progresses n Update literature review periodically (check citeseer)

PSYCHOLOGY 300B (A01)

Correlation and Regression

Version No. 7 Date: July Please send comments or suggestions on this glossary to

Still important ideas

Comparing Two Means using SPSS (T-Test)

Political Science 15, Winter 2014 Final Review

Questionnaire on Anticipated Discrimination (QUAD)(1): is a self-complete measure comprising 14 items

ID# Exam 1 PS306, Spring 2005

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Research Approaches Quantitative Approach. Research Methods vs Research Design

Mild memory problems

11/24/2017. Do not imply a cause-and-effect relationship

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

Exam 3 PS 217, Spring 2011

2 Critical thinking guidelines

Psych 5741/5751: Data Analysis University of Boulder Gary McClelland & Charles Judd. Exam #2, Spring 1992

Statistical analysis DIANA SAPLACAN 2017 * SLIDES ADAPTED BASED ON LECTURE NOTES BY ALMA LEORA CULEN

Multilevel Techniques for Quality Control Charts of Recovery Outcomes

Why do Psychologists Perform Research?

MULTIPLE OLS REGRESSION RESEARCH QUESTION ONE:

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.

Transcription:

kxk BG Factorial Designs expanding the 2x2 design F & LSD for orthogonal factorial designs using the right tests for the right effects F & follow-up analyses for non-orthogonal designs Effect sizes for kxk Factorial Designs kxk BG Factorial Designs We ve worked extensively with the 2x2 design -- the basic factorial Larger factorial designs are often used for the same reasons that multiple-condition 1-factor designs are used... You may need more than 2 IV conditions to properly test a RH: Want multiple experimental conditions (qual or quant diffs) Want multiple treatment conditions (standard vs. none, etc) Want to dissect a multiple element treatment You might want to test the replicability of an IV s effect across more than two situations/settings testing the generality of a TX across gender requires just 2 conditions of the 2nd IV testing generality of that TX across ages would require more Basic and Expanded Factorial Designs The simplest factorial design is a 2x2, which can be expanded in two ways: 1) Adding conditions to one, the other, or both IVs 2x2 design 2x4 design 3x2 design 3x4 design

Adding Treatment Conditions to an IV Ways treatment conditions differ amount of treatment receiving therapy once vs. twice each week getting 0, 1, 5 or 10 practice trials before testing kind of treatment receiving Cognitive vs. Gestalt clinical therapy whether or not there is feedback on practice trials combinations of treatment components receiving both talk therapy vs. combined drug & talk therapy receiving 10 practices without feedback vs. 2 practices with feedback The Secret is to be sure the selection of conditions matches the research hypotheses you started with!!! Adding Control Conditions to an IV No Treatment control Asks if the Tx works better than nothing Standard Tx control Asks if the Tx works better than usual Best Practice Control Asks if the Tx works better than the best known Pseudo Tx Control Asks if TX works without a specific component The Secret is to be sure the selection of conditions matches the research hypotheses you started with!!! Designs, RH & results are communicated by drawing the boxes. Be sure to label each IV and specify each or its conditions. 1. Males and females completed the task, either under instructions to work quickly, work accurately, to work as quickly as possible without making unnecessary errors or no instructions. 2. Folks completed a depression questionnaire either under instructions to respond like someone with acute depression, respond like someone with chronic depression or respond like someone who is trying to fake being depressed. Participants were either clinical psychologists, clinical Ph.D., or volunteers from a local social club.

#1 was a 2x4 design that looks like this... Gender Male Female #2 was a 3 x 3 design that looks like... Respond like a.. Acute depressive Chronic depressive Fake depressive Instructions Quick Accurate Both None Participant Clinician Clin. Grad. Soc. Club Statistical Analysis of Orthogonal kxk Designs Only a couple of differences from the 2x2 1. Tell IVs and DV 2. Present data in table or figure 3. Determine if the interaction is significant if it is, describe it in terms of one of the sets of simple effects using LSD mmd to compare the cell means 4. Determine whether or not the first main effect is significant if so, describe it using LSD mmd compare 3+ marginal means determine if that main effect is descriptive or misleading using the interaction LSDmmd to compare SEs 5. Determine whether or not the second main effect is significant if so, describe it using LSD mmd compare 3+ marginal means determine if that main effect is descriptive or misleading using the interaction LSDmmd to compare SEs The omnibus ANOVA for the kxk is the same as for the 2x2 BG SS total = SS A + SS B + SS INT + SS Error df total = df A + df B + df INT + df Error (N - 1) = ( a -1) + (b-1) + (a-1)(b-1) + ab(n-1) SS A / df A SS B / df B SS INT / df INT F A = --------------- F B = -------------- F INT = ----------------- SS E / df E SS E / df E SS E / df E Notice there is a single error term that is used for all the Fs & all the follow-ups!

When do you need a LSDmmd value??? General rule: You will need an LSDmmd value to compare pairs of means whenever a significant effect has k>2 conditions #1 Whenever the interaction is significant LSDmmd for the cell means is needed to: describe the pattern of the SEs to describe the interaction pattern describe the pattern of the SEs to determine if corresponding ME is descriptive or misleading (necessary to do for each ME -- whether the ME is significant or not) #2 Whenever a 3+grp ME is significant LSDmmd for those marginal means is needed to: describe the pattern of that ME Kxk LSD follow-ups are a little different than the 2x2 the 2x2 uses the LSD only for comparing cell means describe the simple effects to explicate the interaction pattern not needed for MEs, since they involve only 2 conditions the kxk uses the LSD for comparing cell & marginal means different LSD mmd values are computed for different effects if the interaction is significant, then an LSD is computed to compare the cell means -- describe SEs, interaction, etc. If a ME with 2 conditions is significant - no LSD needed If a ME with 3 or more conditions is significant, then LSD is computed to compare the marginal means of that ME Be sure to use the proper n to compute each LSD n = mean number of data points used to compute the means being compared (more on demo sheet) What statistic is used for which factorial effects???? There will be 5 statistics 1. F Gender 2. F Age 3. Age LSD mmd 4. F Int 5. Int LSD mmd Age 5 10 15 Gender Male Female 30 30 30 20 30 25 25 30 27.5 25 30 27.5 15 Effects in this study 1. Main effect of gender 2. Main effect of age 3. 5 vs. 10 marginals 4. 5 vs. 15 marginals 5. 10 vs. 15 marginals 6. Interaction of age & gender 7. 5 vs. 10 yr old males 8. 5 vs. 15 yr old males 9. 10 vs. 15 yr old males 10. 5 vs. 10 yr old females 11. 5 vs. 15 yr old females 12. 10 vs. 15 yr old females 13. male vs. female 5 yr olds 14. male vs. female 10 yr olds 15. male vs. female 15 yr olds

Back to 100 males and 100 females completed the task, either under instructions to work quickly, work accurately, to work as quickly as possible without making unnecessary errors or no instructions. Gender Male Female For the interaction p =.03 will we need an LSD mmd to compare cell means? For the main effect of instruction p =.02 will we need an LSD mmd to compare marginal means? will we need an LSDmmd to compare cell means? For the main effect of gender p =.02 will we need an LSD mmd to compare marginal means? will we need an LSDmmd to compare cell means? Instruction Quick Accurate Both None Yep! sig. Int & k = 8! 200 / 8 = 25 Yep! sig. ME & k = 4! 200 / 4 = 50 Yep! sig. Int! 200 / 8 = 25 Nope k = 2! Yep! sig. Int! 200 / 8 = 25 Analysis of Non-orthogonal Factorial Designs Whether because of careful, intentional stratified sampling to match subpopulation proportions or because of sampling exigencies Many of our factor designs have unequal-n, resulting in non-orthogonality (collinearity, correlation) among our effects. Either way, we have to be certain our variance partitioning, significance tests and interpretations properly match up! Sum of Squares types: Type I SS each effect controlled for previous effects SStotal = SS(A) + SS(B A) + SS(AB A B) + SSerror Type II SS each main effect controlled for other main effect SStotal = SS(A B) + SS(B A) + SSerror Type III SS each effect controlled for all other effects Sstotal = SS(A B AB) + SS(B A AB) + SS(AB A B) + Sserror Type III SS is the most commonly used for non-orthogonal factorials! Why are Type III SS best?? Hold on, this takes a bit In multiple regression the collinearity among predictors is caused by the relationship among the constructs and is expected to replicate across samplings from the same population/setting/task-stim. However, for factorial designs, the collinearity among the effects is determined by relative cell sample sizes. So, unless there has been explicit stratified sampling, the collinearity among the effects is happenstance rather than reflecting the relationship between the constructs. Said differently, the effects confound each other differently depending on what the relative cell sample sizes happen to be (confounds vary with sample size??) By using Type III SS correcting each effect for the happenstance collinearity it has with the others we should replicate the same corrected effects, regardless of the collinearity among the effects (that s caused by whatever the relative cell sample sizes happen to be)!

Cell means in non-orthogonal factorial designs cell means are calculated, analyzed and interpreted the same in orthogonal and non-orthogonal designs! so, the analysis and interpretation of simple effects are the same in orthogonal and non-orthogonal designs! Marginal means 3 kinds in a non-orthogonal design!!! Unweighted marginal means computed as the average of the corresponding cell means, without regard to differential cell sample sizes - usually only used by mistake! Weighted marginal means computed as average of corresponding cell means weighted by differential cell sample sizes usually used in Descriptives Estimated marginal means marginal means estimated from the model used in EMMEANS significance testing More about weighted marginal means Weighted marginal means are the best estimate of the mean of the population represented by the aggregate of the corresponding cell means (because of the weighting) However, as we ve discussed, you must be very careful when producing and interpreting these marginal means, because, as an aggregate, they might not actually represent any population??? Also... Pay attention to this.!!! The weighted marginal means represent the difference between the groups including the confounding of that group difference by the other main effect and the interaction!!! Said differently the difference between the weighted marginal means is like a simple correlation it represents the bivariate relationship between the DV & that IV, without regard to how that relationship is confounded by other variables! More about estimated marginal means & main effect F-tests Estimated marginal means are the best estimate of the mean of the population represented by that condition of the main effect, taking in to account the relationships among the DV, the IVs & their interaction! How are they estimated? Each factorial model has a corresponding multiple regression model (much more later ) For a factorial design with the IVs A & B and interaction AB DV = b A *A + b B *B + b AB *AB For each participant, we can compute their estimated score from the model, and then compute the average estimated score for each marginal group! The main effect F-tests are tests of these estimated marginal means! (not the weighted or unweighted marginal means)!