Michael Hallquist, Thomas M. Olino, Paul A. Pilkonis University of Pittsburgh

Similar documents
Russian Journal of Agricultural and Socio-Economic Sciences, 3(15)

Small Sample Bayesian Factor Analysis. PhUSE 2014 Paper SP03 Dirk Heerwegh

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp

Impact of an equality constraint on the class-specific residual variances in regression mixtures: A Monte Carlo simulation study

Past Year Alcohol Consumption Patterns, Alcohol Problems and Alcohol-Related Diagnoses in the New Zealand Mental Health Survey

Ecological Statistics

Speech recognition in noisy environments: A survey

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

What can the Real World do for simulation studies? A comparison of exploratory methods

Alternative Methods for Assessing the Fit of Structural Equation Models in Developmental Research

Learning from data when all models are wrong

Multivariate Multilevel Models

Structure. Results: Evidence for Validity: Internal

Unit 1 Exploring and Understanding Data

Likelihood Ratio Based Computerized Classification Testing. Nathan A. Thompson. Assessment Systems Corporation & University of Cincinnati.

Bayesians methods in system identification: equivalences, differences, and misunderstandings

On the Targets of Latent Variable Model Estimation

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

Score Tests of Normality in Bivariate Probit Models

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Models and strategies for factor mixture analysis: Two examples concerning the structure underlying psychological disorders

Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination

Data Analysis Using Regression and Multilevel/Hierarchical Models

Intelligent Systems. Discriminative Learning. Parts marked by * are optional. WS2013/2014 Carsten Rother, Dmitrij Schlesinger

Technical Specifications

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill)

For general queries, contact

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling

Individual Differences in Attention During Category Learning

A Comparison of Several Goodness-of-Fit Statistics

A Comparison of Pseudo-Bayesian and Joint Maximum Likelihood Procedures for Estimating Item Parameters in the Three-Parameter IRT Model

Testing Hypotheses Regarding the Causes of Comorbidity: Examining the Underlying Deficits of Comorbid Disorders

MEA DISCUSSION PAPERS

Selection of Linking Items

Technical Appendix: Methods and Results of Growth Mixture Modelling

Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1. John M. Clark III. Pearson. Author Note

STATISTICAL INFERENCE PROCEDURE BY THE INFORMATION-BASED TEST AND ITS APPLICATION IN MARINE CLIMATOLOGY

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

PCA Enhanced Kalman Filter for ECG Denoising

Multivariate meta-analysis for non-linear and other multi-parameter associations

A NOVEL VARIABLE SELECTION METHOD BASED ON FREQUENT PATTERN TREE FOR REAL-TIME TRAFFIC ACCIDENT RISK PREDICTION

Manifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement Invariance Tests Of Multi-Group Confirmatory Factor Analyses

Advances In Measurement Modeling: Bringing Genetic Information Into Preventive Interventions And Getting The Phenotype Right

Chapter 1. Introduction

Ordinal Data Modeling

Statistics and Probability

Reveal Relationships in Categorical Data

How many speakers? How many tokens?:

MODELLING CHARACTER LEGIBILITY

16:35 17:20 Alexander Luedtke (Fred Hutchinson Cancer Research Center)

PLEASE SCROLL DOWN FOR ARTICLE. Full terms and conditions of use:

An Ideal Observer Model of Visual Short-Term Memory Predicts Human Capacity Precision Tradeoffs

Selection and Combination of Markers for Prediction

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison

Advanced Bayesian Models for the Social Sciences

Is There a General Factor of Prevalent Psychopathology During Adulthood?

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

Measurement Models for Behavioral Frequencies: A Comparison Between Numerically and Vaguely Quantified Reports. September 2012 WORKING PAPER 10

A Bayesian Nonparametric Model Fit statistic of Item Response Models

Mathematisches Forschungsinstitut Oberwolfach. Statistische und Probabilistische Methoden der Modellwahl

Self-Oriented and Socially Prescribed Perfectionism in the Eating Disorder Inventory Perfectionism Subscale

LEDYARD R TUCKER AND CHARLES LEWIS

Discriminant Analysis with Categorical Data

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

Assessment of a disease screener by hierarchical all-subset selection using area under the receiver operating characteristic curves

KRISTIAN ERIC MARKON. Phone: (319)

Online Appendix. According to a recent survey, most economists expect the economic downturn in the United

Hanne Søberg Finbråten 1,2*, Bodil Wilde-Larsson 2,3, Gun Nordström 3, Kjell Sverre Pettersen 4, Anne Trollvik 3 and Øystein Guttersrud 5

Different styles of modeling

Statistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes.

Basic concepts and principles of classical test theory

You must answer question 1.

The Influence of Test Characteristics on the Detection of Aberrant Response Patterns

Problem 1) Match the terms to their definitions. Every term is used exactly once. (In the real midterm, there are fewer terms).

A Memory Model for Decision Processes in Pigeons

Running head: STATISTICAL AND SUBSTANTIVE CHECKING

1999 Wiley-Liss, Inc. Cytometry 36:60 70 (1999)

Multilevel Latent Class Analysis: an application to repeated transitive reasoning tasks

On the Performance of Maximum Likelihood Versus Means and Variance Adjusted Weighted Least Squares Estimation in CFA

Neural Noise and Movement-Related Codes in the Macaque Supplementary Motor Area

HIV risk associated with injection drug use in Houston, Texas 2009: A Latent Class Analysis

The Statistical Analysis of Failure Time Data

Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Modeling unobserved heterogeneity in Stata

2014 Modern Modeling Methods (M 3 ) Conference, UCONN

Non-parametric methods for linkage analysis

accuracy (see, e.g., Mislevy & Stocking, 1989; Qualls & Ansley, 1985; Yen, 1987). A general finding of this research is that MML and Bayesian

General recognition theory of categorization: A MATLAB toolbox

A BAYESIAN SOLUTION FOR THE LAW OF CATEGORICAL JUDGMENT WITH CATEGORY BOUNDARY VARIABILITY AND EXAMINATION OF ROBUSTNESS TO MODEL VIOLATIONS

George B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide

Mediation Analysis With Principal Stratification

Package mdsdt. March 12, 2016

How few countries will do? Comparative survey analysis from a Bayesian perspective

Supplementary Materials. Instructions for Target Subjects (Taken from, and kindly shared by, Haselton & Gildersleeve, 2011).

JRC Community of Practice Meeting Panel VI: Ageing societies & Migration

Accuracy of Range Restriction Correction with Multiple Imputation in Small and Moderate Samples: A Simulation Study

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016

Bayesian Tolerance Intervals for Sparse Data Margin Assessment

Transcription:

Comparing the evidence for categorical versus dimensional representations of psychiatric disorders in the presence of noisy observations: a Monte Carlo study of the Bayesian Information Criterion and Akaike Information Criterion in latent variable models. Michael Hallquist, Thomas M. Olino, & Paul A. Pilkonis University of Pittsburgh

Introduction Considerable debate has centered on whether psychopathology is best represented by continuous dimensions or categorical subtypes (First, 2005; Kramer, Noda, & O Hara, 2004). Prior statistical research has focused on the ability of model selection criteria (e.g., the Bayesian Information Criterion) to adjudicate the latent structure of psychometric indicators.

Introduction Psychological Methods 2006, Vol. 11, No. 3, 228 243 Copyright 2006 by the American Psychological Association 1082-989X/06/$12.00 DOI: 10.1037/1082-989X.11.3.228 Information-Theoretic Latent Distribution Modeling: Distinguishing Discrete and Continuous Latent Variable Models Kristian E. Markon and Robert F. Krueger University of Minnesota, Twin Cities Campus Distinguishing between discrete and continuous latent variable distributions has become Markon & Krueger (2006): selection increasingly important in numerous domains of behavioral science. Here, the authors explore an information-theoretic approach to latent distribution modeling, in which the ability of latent distribution models to represent statistical information in observed data is emphasized. criteria (esp. BIC) can be used to probe The authors conclude that loss of statistical information with a decrease in the number of latent values provides an attractive basis for comparing discrete and continuous latent variable models. Theoretical considerations as well as the results of 2 Monte Carlo simulations indicate that information theory provides a sound basis modeling latent distributions and distinguishing between discrete and continuous latent variable models particular. the loss of statistical information resulting Keywords: information theory, latent class, latent trait, model selection from discretizing an underlying dimension. Methods for distinguishing between discrete and continvariable models. The approach has a number of benefits, including explicit formulation and comparison of competing e.g., if latent structure models, is quantification dimensional, of model fit, and situation within a larger latent variable modeling framework. We first review relationships between discrete and continuous latent distribution models and theory regarding in- estimating a few points along a latent formation-theoretic comparison of latent distribution models more generally. may Using be simulations, less we explore the distribution (oligovalued) behavior of information-theoretic criteria in comparisons between discrete and continuous latent distribution models, informative than a parametric under conditions in which the models are correctly and Here, we explore a novel approach to distinguishing be- incorrectly specified. We conclude that information-theoretic approaches provide approximating the dimension (e.g., an attractive basis normal). for comparisons uous latent distributions have become increasingly important in a variety of areas of behavioral science. For example, interest in these methods has grown greatly in clinical psychology, because of the importance of distinguishing between discrete versus continuous phenomena in etiology, assessment, and treatment of psychological disorders (e.g., Cole, 2004). Such methods have also become increasingly important in other areas of psychology and are being applied in other fields of behavioral science as well, such as in the study of language impairment (e.g., Dollaghan, 2004). tween discrete and continuous latent structure. This approach is based on information-theoretic comparison of discrete and continuous latent distribution models within a logistic latent variable model framework. We make a dis- between latent distribution models, including comparisons between discrete and continuous latent distribution models.

Introduction Factor mixture models that combine categorical and dimensional structure can often be resolved by model selection criteria (Lubke & Neale, 2005). Resolution of dimensional, categorical, and hybrid latent structure is clearest when 1. indicators are well separated across latent classes 2. there are more indicators 3. there is little unmodeled within-class residual association among indicators

Introduction Yet simulation studies have only tested multivariate normal (or skew-normal) data with little attention to the influence of scatter/outliers on latent structure decisions. There has been recent interest in the robustness of clustering methods to data that include scatter (Maitra & Ramler, 2008). In the case of empirical studies of mental disorders, data are rarely normal and heterogeneity sometimes manifests as indicators that are difficult to cluster.

Scientific question If mental disorders in the population were sorted by severity (e.g., low, medium, high), would we be able to resolve categorical versus dimensional structure when some observations are noisy? Parameterization: latent profile model (LPM) with diagonal covariance matrices.

Model selection criteria Generally: Model log-likelihood Penalty coefficient Number of parameters Akaike Information Criterion (AIC): α = 2 Bayesian Information Criterion (BIC): α = log(n) Finite-sample adjusted AIC (AIC C ) α = 2*[n/(n-κ-1)]

Asymptotic properties of AIC and BIC AIC Efficient: tends to select the model that minimizes prediction error as sample size increases. Primarily concerned with minimizing the relative Kullback-Leibler (K-L) distance between a statistical model and the unknown mechanisms giving rise to the observed data. BIC Consistent: true model will be selected with increasing probability as sample size increases. Vrieze 2012; Yang 2005

Simulation Conditions Number of latent classes: 3 (equal prob.) Number of indicators: 5 Sample size: 45, 90, 180, 360, 720, 1440 Mean separation:.25,.5, 1.0, 1.5, 2.0, 3.0 Proportion of noisy observations added: 0%, 2.5%, 5%, 10%, 15%, 20%, 25%, 30% Number of replications: 200/condition

Simulation methods Within-class multivariate normal data generated using the MixSim package in R. Noise observations added by randomly sampling observations that were outside the 99% ellipsoids of concentration for all clusters (Maitra & Ramler, 2009). Noise observations were uniformly distributed with bounds -1.5*IQR < x < 1.5*IQR

Simulation Pipeline For each replication, simulate mvnorm Gaussian mixtures using MixSim in R Add some proportion (0 0.3) of scatter observations outside of 99% ellipsoid bounds Analyze simulated data in Mplus 7.0 using LPM and CFA Summarize and visualize results across conditions

Example of clean latent structure n = 720, LC separation = 2.0 SD, 0% Noise

Example of noisy latent structure n = 720, LC separation = 2.0 SD, 10% Noise

Discussion When indicators were separated by 1SD or less, scatter observations (esp. > 10%) seriously degraded latent structure. For models where indicators were well separated (2 3 SDs), parameter coverage for LPMs was relatively robust to scatter. Adding scatter to data separated by < 1SD shifted evidence toward categorical model. For indicators separated by > 1SD, adding scatter weakened evidence for categorical model. Greater latent separation was more important than sample size in determining coverage of latent means.

Discussion For moderate sample sizes (n = 360 or less) and low latent separation (0.75SD or less) BIC tends to select dimensional model, whereas AIC prefers categorical model regardless of scatter. AIC C fell in the middle, but has a steeper penalty than BIC at small sample sizes.

Future directions Exploring how scatter observations are clustered (normalized mutual information). New approaches to scatter: corruption of selected observations by adding random deviate. Testing clustering algorithms (e.g., k-clips; Maitral & Ramler, 2009) that identify scatter and remove these observations from substantive clusters. Testing performance of model selection criteria for dimensional data with scatter.