Why Mixed Effects Models?

Similar documents
Midterm project due next Wednesday at 2 PM

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/24/2017. Do not imply a cause-and-effect relationship

How to analyze correlated and longitudinal data?

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers

The Vote! Winners. $100 Question from Ch 10 11/16/11

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

Lecture 21. RNA-seq: Advanced analysis

Distinguishing between Category-based and Similarity-based Induction

CSE 258 Lecture 2. Web Mining and Recommender Systems. Supervised learning Regression

Generalized Estimating Equations for Depression Dose Regimes

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Data Analysis Using Regression and Multilevel/Hierarchical Models

Week 12: Missing Data

Statistical reports Regression, 2010

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Small Group Presentations

Selection and Combination of Markers for Prediction

TITLE: Acquisition and generalization responses in aphasia treatment: Evidence from sentence-production treatment

CSE 258 Lecture 1.5. Web Mining and Recommender Systems. Supervised learning Regression

Fundamental Concepts for Using Diagnostic Classification Models. Section #2 NCME 2016 Training Session. NCME 2016 Training Session: Section 2


Study Guide #2: MULTIPLE REGRESSION in education

Index. Springer International Publishing Switzerland 2017 T.J. Cleophas, A.H. Zwinderman, Modern Meta-Analysis, DOI /

3 CONCEPTUAL FOUNDATIONS OF STATISTICS

Media, Discussion and Attitudes Technical Appendix. 6 October 2015 BBC Media Action Andrea Scavo and Hana Rohan

Making comparisons. Previous sessions looked at how to describe a single group of subjects However, we are often interested in comparing two groups

THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION

Categorization and Memory: Representation of Category Information Increases Memory Intrusions

Exemplars in Syntax: Evidence from Priming

Use of GEEs in STATA

Fitting discrete-data regression models in social science

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

Language: English Course level: Doctoral level

Clincial Biostatistics. Regression

Regression so far... Lecture 22 - Logistic Regression. Odds. Recap of what you should know how to do... At this point we have covered: Sta102 / BME102

Statistical Techniques. Masoud Mansoury and Anas Abulfaraj

The random variable must be a numeric measure resulting from the outcome of a random experiment.

Artificial intelligence and judicial systems: The so-called predictive justice. 20 April

Intelligent Systems. Discriminative Learning. Parts marked by * are optional. WS2013/2014 Carsten Rother, Dmitrij Schlesinger

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

IAPT: Regression. Regression analyses

MS&E 226: Small Data

Dan Byrd UC Office of the President

Modeling Binary outcome

Empirical assessment of univariate and bivariate meta-analyses for comparing the accuracy of diagnostic tests

Political Science 15, Winter 2014 Final Review

Mantel-Haenszel Procedures for Detecting Differential Item Functioning

Business Statistics Probability

Fundamental Clinical Trial Design

Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA

The Lens Model and Linear Models of Judgment

Biostatistics II

EPSE 594: Meta-Analysis: Quantitative Research Synthesis

Analyzing binary outcomes, going beyond logistic regression

(C) Jamalludin Ab Rahman

CHAPTER TWO REGRESSION

Controlling Bias & Confounding

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data

Explanatory Latent Trait Models: A Tale of Two Studies

THE STATSWHISPERER. Introduction to this Issue. Binary Logistic Regression: The Rock Star of Regression

Bayesian and Frequentist Approaches

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

A Race Model of Perceptual Forced Choice Reaction Time

The Simulacrum. What is it, how is it created, how does it work? Michael Eden on behalf of Sally Vernon & Cong Chen NAACCR 21 st June 2017

In this module I provide a few illustrations of options within lavaan for handling various situations.

Recognizing Ambiguity

Animal Cognition. Introduction to Cognitive Science

Available from Deakin Research Online:

Reflection Questions for Math 58B

Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points.

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC

Statistical analysis supporting the development of the guidance on dermal absorption

9 research designs likely for PSYC 2100

You must answer question 1.

Running head: INDIVIDUAL DIFFERENCES 1. Why to treat subjects as fixed effects. James S. Adelman. University of Warwick.

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID #

The Law of Attraction Myth Free Report

VARIABLES AND MEASUREMENT

Numerical Integration of Bivariate Gaussian Distribution

Experimental Design. Dewayne E Perry ENS C Empirical Studies in Software Engineering Lecture 8

SPSS output for 420 midterm study

Relational Versus Attributional Mode of Problem Solving?

ε error εˆ ~ Greenhouse Geisser correction (see p. 25) ε Huynh Feldt correction (see p. 25)

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

Ecological Statistics

Psychology 205, Revelle, Fall 2014 Research Methods in Psychology Mid-Term. Name:

Limited dependent variable regression models

Still important ideas

Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 1:

Can Angioplasty Improve Quality of Life for CAD Patients?

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics.

INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Statistical questions for statistical methods

Ingredients of Difficult Conversations

Transcription:

Why Mixed Effects Models?

Mixed Effects Models Recap/Intro Three issues with ANOVA Multiple random effects Categorical data Focus on fixed effects What mixed effects models do Random slopes Link functions Iterative fitting

Problem One: Multiple Random Effects Most studies sample both subjects and items Subject 1 Subject 2 Knight story Monkey story

Problem One: Crossed Random Effects Most studies sample both subjects and items Typically, subjects crossed with items Each subject sees a version of each item May also be only partially crossed Each subject sees only some of the items

...or Hierarchical Random Effects Most studies sample both subjects and items Typically, subjects crossed with items May also have one nested within the other (hierarchical) e.g. autobiographical memory How to incorporate this into model?

Problem One: Multiple Random Effects Why do we care about items, anyway? #1: Investigate robustness of effects across items Concern is that effect could be driven by just 1 or 2 items might not really be what we thought it was Psycholinguistics: View is that we studying language too, not just people Other areas of psychology have not tended to care about this Note: Including items in a model doesn't really confirm that the effect is robust across items. It's still possible to get a reliable effect driven by a small number of items. But it allows you investigate how variable the effect is across items and why different items might be differentially influenced.

Problem One: Multiple Random Effects Why do we care about items? #2: Violations of independence A BIG ISSUE Suppose Amélie and Zhenghan see items A & B but Tuan sees items C & D Likely that Amélie's results are more like Zhenghan's than like Tuan's But ANOVA assumes observations independent Even a small amount of dependency can lead to spurious results (Quene & van den Bergh, 2008) Dependency you didn't account for makes the variance look smaller than it actually is C A D B

What Constitutes an Item? Items assumed to be independently sampled sampled from population of relevant items 2 related words / sentences not independently sampled The coach knew you missed practice. The coach knew that you missed practice. Not a coincidence both are in your experiment! Should be considered the same item But 2 unrelated things can be different items ALL POSSIBLE DISCOURSES

Problem One: Crossed Random Effects ANOVA solution Subjects analysis: Average over multiple items for each subject Items analysis: Average over multiple subjects for each item Two sets of results Sometime combined with min F' An approximation of true min F Note: not real data or statistical tests F 1 = 18.31, p <.001 F 2 = 22.10, p <.0001

Problem One: Crossed Random Effects Some debate on how accurate min F' is Scott will admit to not be fully read up on this since I came in after people started switching to mixed effects models Somewhat less relevant now that we can use mixed effects models instead Note: not real data or statistical tests F 1 = 18.31, p <.001 F 2 = 22.10, p <.0001

Mixed Effects Models Recap/Intro Three issues with ANOVA Multiple random effects Categorical data Focus on fixed effects What mixed effects models do Random slopes Link functions Iterative fitting

Problem Two: Categorical Data ANOVA assumes our response is continuous RT: 833 ms But, we often want to look at categorical data 'Lightning hit the church. vs. The church was hit by lightning. Choice of syntactic structure Item recalled or not Region fixated in eye-tracking experiment

Problem One: Two: Categorical Data Traditional solution: Analyze proportions Violates assumptions of ANOVA Among other issues: ANOVA assumes normal distribution, which has infinite tails But proportions are clearly bounded Model could predict impossible values like 110% But 0 proportions 1

Problem One: Two: Categorical Data Traditional solution: Analyze proportions Violates assumptions of ANOVA Among other issues: ANOVA assumes normal distribution, which has infinite tails But proportions are clearly bounded Model could predict impossible values like 110% But 0 proportions 1

Problem One: Two: Categorical Data Traditional solution: Analyze proportions Violates assumptions of ANOVA Can lead to: Spurious effects (Type I error) Missing a true effects (Type II error)

Problem One: Two: Categorical Data Transformations improve the situation but don't solve it Empirical logit is good (Jaeger, 2008) Arcsine less so Situation is worse for very high or very low proportions (Jaeger, 2008).30 to.70 are OK

Problem One: Two: Categorical Data Why can't we just use logistic regression? Predict if each trial's response is in category A or category B This is essentially what we will end up doing But, if we are looking at things at a trial-bytrial basis... Need to control for the different items on each trial Problem One again!

Mixed Effects Models Recap/Intro Three issues with ANOVA Multiple random effects Categorical data Focus on fixed effects What mixed effects models do Random slopes Link functions Iterative fitting

Problem Three: Focus on Fixed Effects ANOVA doesn't characterize differences between subjects or items The bird that they spotted was a... ENDING Predictable Unpredictable MEAN READING TIME 283 ms 309 ms 26 ms We just have a mean effect No info. about how much it varies across participants or items cardinal pitohui

Problem Three: Focus on Fixed Effects Can try to account for some of this with an ANCOVA But not typically done And would have to be done separately for participants and items (Problem One again) Predictable Unpredictable MEAN 283 ms 309 ms 26 ms

Power of subjects analysis! Power of items analysis! Mixed Effects Models Recap/Intro Three issues with ANOVA Multiple random effects Categorical data Focused on fixed effects Captain MLM to the rescue! What mixed effects models do Random slopes Link functions Iterative fitting

Mixed Effects Models to the Rescue! ANOVA: Unit of analysis is cell mean MLM: Unit of analysis is individual trial!

Mixed Models to the Rescue! Look at individual trials Model outcome using regression RT Semantic categorization: Is it a dinosaur? = + Subject + Item Prime? Problem One solved!

Mixed Models to the Rescue! This means you will need your data formatted differently than you would for an ANOVA Each trial gets its own line

Mixed Models to the Rescue! Is this useful for what we care about? Stereotypical view of regression is that it's about predicting values In experimental settings we more typically want to know if Variable X matters Yes! We can test individual effects: Do they contribute to the model? e.g. does priming predict something about RT? RT = + + Prime? Subject Jason Item

Mixed Effects Models Recap/Intro Three issues with ANOVA Multiple random effects Categorical data Focus on fixed effects What mixed effects models do Random slopes Link functions Iterative fitting

88 ms Fixed vs. Random Slopes Fixed Slope: Same for all participants/items Random Slope: Can vary by participants/items RT = + + Prime? 26 ms + Laurel Stego.

315 ms Fixed vs. Random Slopes Fixed Slope: Same for all participants/items Random Slope: Can vary by participants/items RT = + + Prime? 26 ms Laurel Dr. L Example: Some items may show a larger priming effect than others +

Fixed vs. Random Slopes Fixed Slope: Same for all participants/items Random Slope: Can vary by participants/items Can also test what explains variation RT = + + Prime? 26 ms Laurel Dr. L e.g. Adding lexical frequency to the model may account for variation in priming effect + 15 ms + Lex.Freq. 300 ms

Fixed vs. Random Slopes Fixed Slope: Same for all participants/items Random Slope: Can vary by participants/items Can also test what explains variation RT = + + Prime? 26 ms Laurel Dr. L Problem Three Solved! + 15 ms + Lex.Freq. 300 ms

Mixed Effects Models Recap/Intro Three issues with ANOVA Multiple random effects Categorical data Focus on fixed effects What mixed effects models do Random slopes Link functions Iterative fitting

Link Functions Specifies how to connect predictors to the outcome RT + + 1300 ms Prime? Every model has one... Subject...sometimes, just the identity function With Gaussian (normal) data Item

Link Functions Specifies how to connect predictors to the outcome Accuracy + + Yes/No Prime? Subject Item For binomial (yes/no) outcomes: Model log odds to predict outcome Problem Two solved!

Link Functions Default link function for binomial data is logit (log odds) Odds: p(yes)/p(no) or p(yes)/[1-p(yes)] No upper bound, but lower bound at 0 Log Odds: ln(odds) Now unbounded at both ends Can also use probit Based on cumulative distribution function of normal distribution Very highly correlated with logit; almost always give you same results as logit Probit assumes slightly fewer hits at low end of distribution & slightly more hits at high end

Mixed Effects Models Recap/Intro Three issues with ANOVA Multiple random effects Categorical data Focus on fixed effects What mixed effects models do Random slopes Link functions Iterative fitting

One Caveat... Where do model results come from? (Answer: When a design matrix and a data matrix really love each other...)

One Caveat... Fitting ANOVA / linear regression has easy solution b = (X'X) -1 X'Y A few matrix multiplications a computer can do easily A closed form solution Like a beta machine you put your data in and automatically get the One True Model out

One Caveat... MEMs requires iteration Check various sets of betas until you find the best one R does this for you An estimation Not mathematically guaranteed to be best fit The best model: The one that smiles with its eyes Complicated models take longer to fit If too many parameters relative to data, might completely fail to converge (find the best set of betas) Scott's only experience with this is with multiple random slopes of interactions