Cross-Lagged Panel Analysis

Similar documents
Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Carrying out an Empirical Project

Problem Set 5 ECN 140 Econometrics Professor Oscar Jorda. DUE: June 6, Name

Current Directions in Mediation Analysis David P. MacKinnon 1 and Amanda J. Fairchild 2

Modeling Time-Dependent Association in Longitudinal Data: A Lag as Moderator Approach

Logistic regression: Why we often can do what we think we can do 1.

Assessing Studies Based on Multiple Regression. Chapter 7. Michael Ash CPPA

Exploring the Impact of Missing Data in Multiple Regression

Alternative Methods for Assessing the Fit of Structural Equation Models in Developmental Research

Instrumental Variables Estimation: An Introduction

Regression Discontinuity Analysis

Estimating Heterogeneous Choice Models with Stata

Ec331: Research in Applied Economics Spring term, Panel Data: brief outlines

A brief history of the Fail Safe Number in Applied Research. Moritz Heene. University of Graz, Austria

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).

Marno Verbeek Erasmus University, the Netherlands. Cons. Pros

Lecture II: Difference in Difference and Regression Discontinuity

Simultaneous Equation and Instrumental Variable Models for Sexiness and Power/Status

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

The Perils of Empirical Work on Institutions

Herman Aguinis Avram Tucker Distinguished Scholar George Washington U. School of Business

Introduction to Observational Studies. Jane Pinelis

Propensity Score Analysis Shenyang Guo, Ph.D.

How to Model Mediating and Moderating Effects. Hsueh-Sheng Wu CFDR Workshop Series Summer 2012

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics

Which Comes First: Poor Psychological Well-Being or Decreased Friendship Activity?

Methods for Addressing Selection Bias in Observational Studies

Identifying Endogenous Peer Effects in the Spread of Obesity. Abstract

WELCOME! Lecture 11 Thommy Perlinger

The Issue of Endogeneity within Theory-Based, Quantitative Management Accounting Research

Panel: Using Structural Equation Modeling (SEM) Using Partial Least Squares (SmartPLS)

EMPIRICAL STRATEGIES IN LABOUR ECONOMICS

26:010:557 / 26:620:557 Social Science Research Methods

1. What is the I-T approach, and why would you want to use it? a) Estimated expected relative K-L distance.

Advanced Handling of Missing Data

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

EC352 Econometric Methods: Week 07

Use of Structural Equation Modeling in Social Science Research

Moving beyond regression toward causality:

What Counts as Evidence? Panel Data and the Empirical Evaluation of Revised Modernization Theory

Is Knowing Half the Battle? The Case of Health Screenings

Two economists musings on the stability of locus of control

(b) empirical power. IV: blinded IV: unblinded Regr: blinded Regr: unblinded α. empirical power

REPEATED MEASURES DESIGNS

Studying the effect of change on change : a different viewpoint

Score Tests of Normality in Bivariate Probit Models

The Limits of Inference Without Theory

TOWARD IMPROVED USE OF REGRESSION IN MACRO-COMPARATIVE ANALYSIS

The Dynamic Effects of Obesity on the Wages of Young Workers

Recent advances in non-experimental comparison group designs

INTRODUCTION TO ECONOMETRICS (EC212)

How to Model Mediating and Moderating Effects. Hsueh-Sheng Wu CFDR Workshop Series November 3, 2014

Pros. University of Chicago and NORC at the University of Chicago, USA, and IZA, Germany

A critical look at the use of SEM in international business research

multilevel modeling for social and personality psychology

PubH 7405: REGRESSION ANALYSIS. Propensity Score

Quantitative Methods. Lonnie Berger. Research Training Policy Practice

Personalized Network Modeling in Psychopathology

Investigating the Invariance of Person Parameter Estimates Based on Classical Test and Item Response Theories

The Context of Detection and Attribution

PERCEIVED TRUSTWORTHINESS OF KNOWLEDGE SOURCES: THE MODERATING IMPACT OF RELATIONSHIP LENGTH

Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of

This exam consists of three parts. Provide answers to ALL THREE sections.

Session 1: Dealing with Endogeneity

Epidemiological study design. Paul Pharoah Department of Public Health and Primary Care

Introduction to Applied Research in Economics

Experimental Design. Dewayne E Perry ENS C Empirical Studies in Software Engineering Lecture 8

The Causal Ordering of Academic Self-concept and Academic Achievement: A Multiwave, Longitudinal Panel Analysis

Lecture II: Difference in Difference. Causality is difficult to Show from cross

Donna L. Coffman Joint Prevention Methodology Seminar

RUNNING HEAD: TESTING INDIRECT EFFECTS 1 **** TO APPEAR IN JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY ****

The Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX

Correlation and Regression

IMPACTS OF SOCIAL NETWORKS AND SPACE ON OBESITY. The Rights and Wrongs of Social Network Analysis

Running head: INDIVIDUAL DIFFERENCES 1. Why to treat subjects as fixed effects. James S. Adelman. University of Warwick.

Contents. What is item analysis in general? Psy 427 Cal State Northridge Andrew Ainsworth, PhD

Differential Item Functioning

Social Penetration Theory

Lecture 9 Internal Validity

Testing the Predictability of Consumption Growth: Evidence from China

Qualitative Data Analysis. Richard Boateng, PhD. Arguments with Qualitative Data. Office: UGBS RT18 (rooftop)

Experimental Research Design

Introduction to Applied Research in Economics Kamiljon T. Akramov, Ph.D. IFPRI, Washington, DC, USA

I. Introduction. Armin Falk IZA and University of Bonn April Falk: Behavioral Labor Economics: Psychology of Incentives 1/18

OLS Regression with Clustered Data

Experimental Psychology

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods

Non experimental. Cross sectional studies Panel (somewhat rare) Longitudinal (pooled time series) Case (s) study

Estimating treatment effects with observational data: A new approach using hospital-level variation in treatment intensity

Strategies to Measure Direct and Indirect Effects in Multi-mediator Models. Paloma Bernal Turnes

Using register data to estimate causal effects of interventions: An ex post synthetic control-group approach

Session 3: Dealing with Reverse Causality

The Logic of Causal Inference

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering

Confidence Intervals On Subsets May Be Misleading

SAMPLE SELECTION BIAS AND HECKMAN MODELS IN STRATEGIC MANAGEMENT RESEARCH

The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth

The Pretest! Pretest! Pretest! Assignment (Example 2)

Transcription:

Cross-Lagged Panel Analysis Michael W. Kearney Cross-lagged panel analysis is an analytical strategy used to describe reciprocal relationships, or directional influences, between variables over time. Cross-lagged panel models (CLPM), also referred to as cross-lagged path models and cross-lagged regression models, are estimated using panel data, or longitudinal data where each observation or person is recorded at multiple points in time. The models are considered "crossed" because they estimate relationships from one variable to another and vice-versa. They are considered "lagged" because they estimate relationships between variables across different time points. Taken together, cross-lagged panel models estimate the directional influence variables have on each other over time. The primary goal of cross-lagged panel models is to examine the causal influences between variables. In essence, cross-lagged panel analysis compares the relationship between variable X at Time 1 and variable Y at Time 2 with the relationship between variable Y at Time 1 and X at Time 2. It is widely used to examine the stability and relationships between variables over time to better understand how variables influence each other over time. This entry discusses cross-lagged panel analysis, an analytical strategy used in longitudinal communication research. It describes its rationales and origins in research. It also describes modern path-analytic approaches to cross-lagged panel analysis. Finally, this entry discusses some important assumptions and issues with cross-lagged panel analysis. Directional Influences Basic methods for testing causality have several limitations. Correlational analysis relies on theoretical inferences to make arguments about causality. Ph.D. Candidate, Communication Studies, University of Kansas. Kearney, M. W. (in press). Cross Lagged Panel Analysis. In M. R. Allen (Ed.), The SAGE Encyclopedia of Communication Research Methods. Thousand Oaks, CA: Sage 1

Since cross-sectional data represents only one moment in time, there is no way to determine if these inferences are correct. The experimental method utilizes randomization and control to provide a more robust method for examining causality. In many cases, however, randomization and control are not practical or even possible. For example, costs associated with recruiting truly random samples for multiple time points are often too expensive. And resources are not the only barriers to randomization. In many cases, randomization creates ethical dilemmas that make studies examining certain variables like aging or illness problematic. In these situations, researchers often turn to longitudinal research and cross-lagged panel analysis. Cross-Lagged Correlations Cross-lagged panel analysis is used to compare the relationship between variable X at Time 1 (X 1 ) and variable Y at Time 2 (Y 2 ) with the relationship between Y 1 and X 2. In the past, this was accomplished by examining zeroorder correlations. Cross-lagged correlations (CLC) were used to make arguments about causal directions between variables. Correlations of the same size indicated a reciprocal relationship. If one of the coefficients was larger, however, it suggested that changes in one variable lead to changes in the other variable and not the other way around. Comparing CLC thus provides some evidence of directional influence, but it has serious flaws. Several weaknesses have been identified in the cross-lagged correlations method. One weakness is that CLC do not account for contemporaneous relationships between variables. Contemporaneous relationships refer to the correlations between variables within the same time point. Another weakness is that CLC do not account for the stability of each construct across time points. Stability refers to the degree to which values of a variable are unchanging over time. As a result of these shortcomings, the CLC method has largely been discarded in favor of cross-lagged path (or regression) models. Cross-Lagged Panel Models Like the CLC method, cross-lagged path models compare cross-lagged relationships. In addition to allowing for the estimation of cross-lagged effects, cross-lagged path models also control for correlations within time-points and autoregressive effects, or stability, across time. Autoregressive effects describe the amount of stability in constructs over time. Smaller autoregressive coefficients (closer to zero) indicate more variance in the construct, 2

meaning less stability or influence from the previous time point. Larger autoregressive coefficients indicate little variance over time, meaning more stability or influence from the previous time point. The most basic cross-lagged panel model includes two constructs measured at two time points. Cross-lagged panel models assume that each time a construct is measured is a variable. The simplest model therefore consists of two X variables (x 1, x 2 ) and two Y variables (y 1, y 2 ). The model also includes ten parameters, making it just identified. These parameters include exogenous variances ψ x1, ψ y1, [synchronous] correlations r x1 y 1, r x2 y 2, crosslagged paths β 1, β 2, auto-regressive paths β 3, β 4, and endogenous residuals ζ x2, ζ y2. Estimates for cross-lagged effects now control for contemporaneous effects and variance across time (stability). Causal predominance can be examined by comparing standardized coefficients of the cross-lagged paths. This basic model can be easily extended for studies with three or more waves of measurement as well. Cross-lagged panel models can include more than two waves of measurement. An example of a three-wave cross-lagged panel model is presented in Figure 1. In the depicted model, paths are constrained to equality across time. Thus, β 1 represents the cross-lagged effects from X 1 on Y 2 and from X 2 on Y 3. The model also assumes that stability is a function of only the previous time point, meaning a log of one. These assumptions can change depending on the study, but theory should drive these decisions whenever possible. Unlike the two-wave models, models with more than three waves of measurement are typically over-identified, which allows for models with different error structures (autoregressive effects) and assumptions about variance over time to be compared. Issues and Assumptions in Cross-Lagged Panel Analysis Cross-lagged panel analysis is a useful tool for describing lagged relationships between two or more variables, though it is sometimes explicitly used as evidence of causality. In its most basic form, cross-lagged panel analysis attempts to identify causal predominance, which occurs when one variable influences another variable without also experiencing a reciprocal influence in return. Causal predominance is indicated when the effect of X at time 1 on Y at time 2 is large, while the effect of Y at time 1 on X at time 2 is zero. In such cases, X is considered the source variable and Y is considered the effect variable. The remainder of this entry covers common issues and 3

Figure 1 important assumptions that explain why many scholars recommend only using cross-lagged panel analysis for exploratory research. Synchronicity. Cross-lagged panel analysis makes several important assumptions. The first is the assumption of synchronicity, which assumes that measurements at each time point occurred at the exact same times. Although most studies are designed to measure variables simultaneous, complications during data collection frequently violate this assumption. Stationarity. Another assumption of cross-lagged panel analysis is that variables and relationships stay the same across time. This assumption, referred to as stationarity, relates to the stability of a construct as well as the nature of the relationships between constructs over time. As was previously discussed in models with three or more time points, there are varying degrees of stationarity, though very few theories offer guidance in this area. Comparing Cross-Lagged Coefficients. In order to make claims about causal predominance, cross-lagged path analysis typically includes comparing relative sizes of cross-lagged coefficients. This is accomplished by standardizing variables. Although each of the standardized variables can be described in similar terms, standardization does not necessarily address fundamental differences distributions. In some cases it may not be appropriate to assume the variables were measured on the same scale. 4

Measurement Error. Although cross-lagged panel analysis can also be done using structural equation modeling, many cross-lagged panel models assume that variables are measured without error. Of course, for many variables in communication research, this is clearly not the case, which leads to biased results. Some scholars have also argued that measurement error may also be misidentified as real change when models have only two time points. In these cases, measurement error could still confound results of structural equation models. Timeframe of Effect. Cross-lagged panel models also assume X 1 occurs before X 1, but it does not explicitly include time. Instead, it assumes the influence of one variable on another is a function of lag, or time between waves of measurement. The amount of lag can be any length of time, though it must be contextually appropriate to have meaningful interpretations. If the lag is too short, measurement will occur before the effects can be observed. If the lag is too long, the effects will dissipate before the next time of measurement. Omitted Variables. Cross-lagged panel analysis, in theory, assumes all possible variables were measured and included in the model. This definition of causality, which originated from econometrics, is unlikely to hold communication research. Given the uncertainty surrounding many communication variables, scholars phrase interpretations of cross-lagged panel analysis in terms of directions of "influence" rather than "causality." Stability. Cross-lagged panel models generally lack explicit theories of change. As such, autoregressive parameters are included to account for stability for everyone across time. This assumes there are no inter-individual differences, or differences between people, over time in stability. Interindividual differences that do exist, such as unobserved trait-like influences or dependencies, may bias results. References [1] Hamaker, E. L., Kuiper, R. M., & Grasman, R. P. (2015). A critique of the cross-lagged panel model. Psychological methods, 20 (1), 102. 5

[2] Kenny, D. A. (1975). Cross-lagged panel correlation: A test for spuriousness. Psychological Bulletin, 82 (6), 887. [3] Rogosa, D. (1980). A critique of cross-lagged correlation. Psychological Bulletin, 88 (2), 245. [4] Selig, J. P., & Little, T. D. (2012). Autoregressive and cross-lagged panel analysis for longitudinal data. In Laursen, B., & Little, T. D. (Ed.), Handbook of developmental research methods (pp. 265-278). New York: Guilford Press. [5] Newsom, J. T. (2015). Longitudinal structural equation modeling: A comprehensive introduction. New York: Taylor & Francis. pp 122-151. 6