Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Similar documents
Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2

REPEATED MEASURES DESIGNS

Structural Equation Modeling (SEM)

How to analyze correlated and longitudinal data?

Dan Byrd UC Office of the President

Available from Deakin Research Online:

Daniel Boduszek University of Huddersfield

Simple Linear Regression One Categorical Independent Variable with Several Categories

Data Analysis in Practice-Based Research. Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine

Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

TWO-DAY DYADIC DATA ANALYSIS WORKSHOP Randi L. Garcia Smith College UCSF January 9 th and 10 th

HPS301 Exam Notes- Contents

Centering Predictors

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

Class 7 Everything is Related

isc ove ring i Statistics sing SPSS

CHAPTER 3 RESEARCH METHODOLOGY

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data

Data Analysis Using Regression and Multilevel/Hierarchical Models

Introduction to Longitudinal Analysis

11/24/2017. Do not imply a cause-and-effect relationship

UNIVERSITY OF FLORIDA 2010

investigate. educate. inform.

Use of the Quantitative-Methods Approach in Scientific Inquiry. Du Feng, Ph.D. Professor School of Nursing University of Nevada, Las Vegas

In this chapter, we discuss the statistical methods used to test the viability

(C) Jamalludin Ab Rahman

IAPT: Regression. Regression analyses

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1

Differential Item Functioning

Appendix Part A: Additional results supporting analysis appearing in the main article and path diagrams

Longitudinal and Hierarchical Analytic Strategies for OAI Data

Correlation and regression

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research

Scale Building with Confirmatory Factor Analysis

PERSON LEVEL ANALYSIS IN LATENT GROWTH CURVE MODELS. Ruth E. Baldasaro

Overview of Lecture. Survey Methods & Design in Psychology. Correlational statistics vs tests of differences between groups

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS

f WILEY ANOVA and ANCOVA A GLM Approach Second Edition ANDREW RUTHERFORD Staffordshire, United Kingdom Keele University School of Psychology

Multifactor Confirmatory Factor Analysis

3 CONCEPTUAL FOUNDATIONS OF STATISTICS

Experimental Studies. Statistical techniques for Experimental Data. Experimental Designs can be grouped. Experimental Designs can be grouped

On Test Scores (Part 2) How to Properly Use Test Scores in Secondary Analyses. Structural Equation Modeling Lecture #12 April 29, 2015

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Donna L. Coffman Joint Prevention Methodology Seminar

MODELING HIERARCHICAL STRUCTURES HIERARCHICAL LINEAR MODELING USING MPLUS

George B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide

Still important ideas

Business Statistics Probability

VARIABLES AND MEASUREMENT

INTRODUCTION TO ECONOMETRICS (EC212)

Linear Regression in SAS

Manuscript Presentation: Writing up APIM Results

SUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK

Analyzing Longitudinal Data With Multilevel Models: An Example With Individuals Living With Lower Extremity Intra-Articular Fractures

Complex Regression Models with Coded, Centered & Quadratic Terms

Context and Change in Health Research: A Multilevel Model Approach

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments

Biostatistics II

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations)

THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION

Exploring the Factors that Impact Injury Severity using Hierarchical Linear Modeling (HLM)

Psychology Research Process

Addendum: Multiple Regression Analysis (DRAFT 8/2/07)

Industrial and Manufacturing Engineering 786. Applied Biostatistics in Ergonomics Spring 2012 Kurt Beschorner

Exploratory Longitudinal Profile Analysis via Multidimensional Scaling. Cody Ding University of Missouri - St. Louis. Page 1 of 10

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

Lecture 14: Adjusting for between- and within-cluster covariates in the analysis of clustered data May 14, 2009

Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction

Graphical Exploration of Statistical Interactions. Nick Jackson University of Southern California Department of Psychology 10/25/2013

Advanced ANOVA Procedures

Generalized Estimating Equations for Depression Dose Regimes

Confirmatory Factor Analysis of Preschool Child Behavior Checklist (CBCL) (1.5 5 yrs.) among Canadian children

Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of

Daniel Boduszek University of Huddersfield

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

1. Family context. a) Positive Disengaged

Repeated Measures ANOVA and Mixed Model ANOVA. Comparing more than two measurements of the same or matched participants

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis

EVALUATING THE INTERACTION OF GROWTH FACTORS IN THE UNIVARIATE LATENT CURVE MODEL. Stephanie T. Lane. Chapel Hill 2014

Unit 1 Exploring and Understanding Data

Chapter Eight: Multivariate Analysis

Hierarchical Linear Models I ICPSR 2012

Multiple Linear Regression Analysis

Daniel Boduszek University of Huddersfield

7 Statistical Issues that Researchers Shouldn t Worry (So Much) About

Many studies conducted in practice settings collect patient-level. Multilevel Modeling and Practice-Based Research

Political Science 15, Winter 2014 Final Review

Single-Factor Experimental Designs. Chapter 8

The Use of Latent Trajectory Models in Psychopathology Research

10. LINEAR REGRESSION AND CORRELATION

Analytic Strategies for the OAI Data

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

University of Pennsylvania. From the SelectedWorks of Penn CCT

Transcription:

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this course (and the next course) CLP 944: Lecture 1 1

Longitudinal data What is CLP 944 about? Same individual units of analysis measured at different occasions (which can range from milliseconds to decades) Repeated measures data Same individual units of analysis measured via different items, using different stimuli, or under different conditions Both of these fall under a more general category of multivariate data of varying complexity The link between them is the use of random effects to describe covariance of outcomes from the same unit CLP 944: Lecture 1 2

Data Requirements for Our Models A useful outcome variable: Has an interval scale* A one-unit difference means the same thing across all scale points In subscales, each contributing item has an equivalent scale *Other kinds of outcomes will be analyzed using generalized multilevel models instead, but estimation will be more challenging Has scores with the same meaning over observations Includes meaning of construct Includes how items relate to the scale Implies measurement invariance FANCY MODELS CANNOT SAVE BADLY MEASURED VARIABLES OR CONFOUNDED RESEARCH DESIGNS. CLP 944: Lecture 1 3

Requirements for Longitudinal Data Multiple OUTCOMES from same sampling unit (person)! 2 is the minimum, but just 2 can lead to problems: Only 1 kind of change is observable (1 difference) Can t distinguish real individual differences in change from error Repeated measures ANOVA is just fine for 2 observations Necessary assumption of sphericity is satisfied with only 2 observations even if compound symmetry doesn t hold More data is better (with diminishing returns) More occasions better description of the form of change More persons better estimates of amount of individual differences in change; better prediction of those individual differences More items/stimuli/groups more power to show effects of differences between items/stimuli/groups CLP 944: Lecture 1 4

Power in Longitudinal Data More occasions are better! Can examine more complex growth functions Can get more reliable individual growth parameters More people are better! Can get more reliable measures of individual differences Reliability of Slopes by Signal to Noise Ratio and # Occasions From Willett, 1989, pg. 597 CLP 944: Lecture 1 5

Levels of Analysis in Longitudinal Data Between-Person (BP) Variation: Level-2 INTER-individual Differences Time-Invariant All longitudinal studies begin as cross-sectional studies Within-Person (WP) Variation: Level-1 INTRA-individual Differences Time-Varying Only longitudinal studies can provide this extra information Longitudinal studies allow examination of both types of relationships simultaneously (and their interactions) Any variable measured over time usually has both BP and WP variation BP = more/less than other people; WP = more/less than one s average Later, Between-Item Variation will also show up as another Level-2 source of variation in other repeated measures data CLP 944: Lecture 1 6

A Longitudinal Data Continuum Within-Person Change: Systematic change Magnitude or direction of change can be different across individuals Growth curve models Time is meaningfully sampled Within-Person Fluctuation: No systematic change Outcome just varies/fluctuates over time (e.g., emotion, stress) Time is just a way to get lots of data per individual Pure WP Change Pure WP Fluctuation Time Time CLP 944: Lecture 1 7

What is a Multilevel Model (MLM)? Same as other terms you have heard of: General Linear Mixed Model (if you are from statistics) Mixed = Fixed and Random effects Random Coefficients Model (also if you are from statistics) Random coefficients = Random effects = latent variables/factors Hierarchical Linear Model (if you are from education) Not the same as hierarchical regression Special cases of MLM: Random Effects ANOVA or Repeated Measures ANOVA (Latent) Growth Curve Model (where Latent implies SEM) Within-Person Fluctuation Model (e.g., for daily diary data) Clustered/Nested Observations Model (e.g., for kids in schools) Cross-Classified Models (e.g., value-added models) Psychometric Models (e.g., factor analysis, item response theory) CLP 944: Lecture 1 8

The Two Sides of Any Model Model for the Means: Aka Fixed Effects, Structural Part of Model What you are used to caring about for testing hypotheses How the expected outcome for a given observation varies as a function of values on predictor variables Model for the Variance: Aka Random Effects and Residuals, Stochastic Part of Model What you are used to making assumptions about instead How residuals are distributed and related across observations (persons, groups, time, etc.) these relationships are called dependency and this is the primary way that multilevel models differ from general linear models (e.g., regression) CLP 944: Lecture 1 9

Dimensions for Organizing Models Outcome type: General (normal) vs. Generalized (not normal) Dimensions of sampling: One (so one variance term per outcome) vs. Multiple (so multiple variance terms per outcome) OUR WORLD General Linear Models: conditionally normal outcome distribution, fixed effects (identity link; only one dimension of sampling) Generalized Linear Models: any conditional outcome distribution, fixed effects through link functions, no random effects (one dimension) General Linear Mixed Models: conditionally normal outcome distribution, fixed and random effects (identity link, but multiple sampling dimensions) Generalized Linear Mixed Models: any conditional outcome distribution, fixed and random effects through link functions (multiple dimensions) Same concepts as for this week, but with more complexity in estimation Note: Least Squares is only for GLM Linear means fixed effects predict the link-transformed conditional mean of DV in a linear combination of (effect*predictor) + (effect*predictor) CLP 944: Lecture 1 10

Options for Longitudinal Models Although models and software are logically separate, longitudinal data can be analyzed via multiple analytic frameworks: Multilevel/Mixed Models Dependency over time, persons, groups, etc. is modeled via random effects (multivariate through levels using stacked/long data) Builds on GLM, generalizes easier to additional levels of analysis Structural Equation Models Dependency over time only is modeled via latent variables (single-level analysis using multivariate/wide data) Generalizes easier to broader analysis of latent constructs, mediation Because random effects and latent variables are the same thing, many longitudinal models can be specified/estimated either way And now Multilevel Structural Equation Models can do it all CLP 944: Lecture 1 11

What can MLM do for you? 1. Model dependency across observations Longitudinal, clustered, and/or cross-classified data? No problem! Tailor your model of sources of correlation to your data 2. Include categorical or continuous predictors at any level Time-varying, person-level, group-level predictors for each variance Explore reasons for dependency, don t just control for dependency 3. Does not require same data structure for each person Unbalanced or missing data? No problem! 4. You already know how (or you will soon)! Use SPSS Mixed, SAS Mixed, Stata, Mplus, R, HLM, MlwiN What s an intercept? What s a slope? What s a pile of variance? CLP 944: Lecture 1 12

1. Model Dependency Sources of dependency depend on the sources of variation created by your sampling design: residuals for outcomes from the same unit are likely to be related, which violates the GLM independence assumption Levels for dependency = levels of random effects Sampling dimensions can be nested e.g., time within person, time within group, trial within person If you can t figure out the direction of your nesting structure, odds are good you have a crossed sampling design instead e.g., persons crossed with items, raters crossed with targets To have a level, there must be random outcome variation due to sampling that remains after including the model s fixed effects e.g., treatment vs. control does not create another level of group (but it would if you had multiple treatment and multiple control groups) CLP 944: Lecture 1 13

Longitudinal dependency comes from Mean differences across sampling units (e.g., persons) Creates constant dependency over time Will be represented by a random intercept in our models Individual differences in effects of predictors Individual differences in change over time, stress reactivity Creates non-constant dependency, the size of which depends on the value of the predictor at each occasion Will be represented by random slopes in our models Non-constant within-person correlation for unknown reasons (time-dependent autocorrelation) Can add other patterns of correlation as needed for this CLP 944: Lecture 1 14

Why care about dependency? In other words, what happens if we have the wrong model for the variances (assume independence instead)? Validity of the tests of the predictors depends on having the most right model for the variances Estimates will usually be ok come from model for the means Standard errors (and thus p-values) can be compromised The sources of variation that exist in your outcome will dictate what kinds of predictors will be useful Between-Person variation needs Between-Person predictors Within-Person variation needs Within-Person predictors Between-Item variation needs Between-Item predictors CLP 944: Lecture 1 15

2. Include categorical or continuous predictors at any level of analysis ANOVA: test differences among discrete IV factor levels Between-Groups: Gender, Intervention, Age Groups Within-Subjects (Repeated Measures): Condition, Time Test main effects of continuous covariates (ANCOVA) Regression: test whether slopes relating predictors to outcomes are different from 0 Persons measured once, differ categorically or continuously on a set of time-invariant (person-level) covariates What if a predictor is assessed repeatedly (time-varying predictors) but can t be characterized by conditions? ANOVA or Regression won t work need MLM CLP 944: Lecture 1 16

2. Include categorical or continuous predictors at any level of analysis Some things don t change over measurements Sex, Ethnicity Time-Invariant Predictor = Person Level Some things do change over measurements Health Status, Stress Levels, Living Arrangements Time-Varying Predictor = Time Level Some predictors might be measured at higher levels Family SES, length of marriage, school size Interactions between levels may be included, too Does the effect of health status differ by gender and SES? Level: Time Person Family CLP 944: Lecture 1 17

3. Does not require same data structure per person (by accident or by design) RM ANOVA: uses multivariate (wide) data structure: ID Sex T1 T2 T3 T4 100 0 5 6 8 12 101 1 4 7. 11 People missing any data are excluded (data from ID 101 are not included at all) MLM: uses stacked (long) data structure: Only rows missing data are excluded 100 uses 4 cases 101 uses 3 cases ID Sex Time Y 100 0 1 5 100 0 2 6 100 0 3 8 100 0 4 12 101 1 1 4 101 1 2 7 101 1 3. 101 1 4 11 Time can also be unbalanced across people such that each person can have his or her own measurement schedule: Time 0.9 1.4 3.5 4.2 CLP 944: Lecture 1 18

4. You already know how! If you can do GLM, you can do MLM (and if you can do generalized linear models, you can do generalized multilevel models, too) How do you interpret an estimate for the intercept? the effect of a continuous variable? the effect of a categorical variable? a variance component ( pile of variance )? CLP 944: Lecture 1 19

CLP 944: This Semester s Topics Make Friends with SAS Review of linear models and interactions Within-person analysis via ANOVA Describing within-person fluctuation via alternative covariance structure models R matrix structures with and without a random intercept Describing within-person change via random effects Polynomial, piecewise, and nonlinear models Time-invariant (level-2) predictors Level-2 predictors in crossed repeated measures designs Models for non-normal outcomes CLP 944: Lecture 1 20

CLP 945: Next Semester s Topics Two-level models for persons in groups Time-varying predictors that show WP fluctuation Multivariate models For multiple outcomes For time-varying predictors that show WP change For multilevel SEM Models for accelerated longitudinal data Cross-classified models for persons in multiple groups Three-level models For measurement burst designs For persons in time-invariant groups Other topics as time permits CLP 944: Lecture 1 21

CLP 944: Coursework Structure SAS PROC MIXED or GLIMMIX for everything! Available in 3049 Dole during office hours and perhaps otherwise Available for purchase for $150 annually from KU software store Available remotely FOR FREE via KU ACF or SAS University edition 11(ish) homeworks (hopefully using an online system) Mix of data analysis, fill-in-the-blank results sections, and conceptual multiple choice questions (80 points total) 3 points will be deducted per late homework (so please keep up) Readings are not considered optional! BECAUASE I SPENT 5 YEARS WRITING THE BOOK Other sources that I ve found to be readable and useful 10 in-class quizzes (2 points each; must be here to take them) CLP 944: Lecture 1 22