Different styles of modeling


Different styles of modeling Marieke Timmerman m.e.timmerman@rug.nl 19 February 2015 Different styles of modeling (19/02/2015) What is psychometrics? 1/40

Overview
1 Breiman (2001). Statistical modeling: the two cultures
    Summary
2 Different cases - different cultures?
    1. Causal inference
    2. Supporting a theory
    3. C & W's criteria for stochastic models
    4. Prediction - exploratorily assessing relevant predictors
    5. Accurate prediction from predictor set
    6. Distributional modeling, e.g., growth curves
3 Exemplary cases
    Randomized clinical trial
    Measuring developmental level
    Identifying personality traits using the lexical approach

1. Statistical modeling: the two cultures
Breiman, L. (2001). Statistical modeling: the two cultures. Statistical Science, 16, 199-231.
Goals in data analysis
  Prediction: What will the responses be, given certain observed variables (e.g., input variables)?
  Information: Extract knowledge on the nature of the relationships (e.g., between input and response variables)
Two cultures
  Data modeling
  Algorithmic modeling

Data Modeling
Stochastic data model
Examples
  response variables = f(observed input variables, parameters, noise)
  response variables = f(latent variables, parameters, noise)
Model validation
  Breiman: yes-no goodness-of-fit tests, residual examination

Algorithmic Modeling
Core goal: predict the outcome measurement from a set of features
Core focus on...
  properties of algorithms, rather than data modeling
  prediction accuracy

Algorithmic Modeling
Prediction accuracy check: typically via some form of cross-validation
from: http://cse3521.artifice.cc/classification-evaluation.html
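A minimal sketch of k-fold cross-validation as a check on prediction accuracy. The one-predictor least-squares model, the simulated data, the choice of k = 5 and all function names are assumptions made for illustration; none of them come from the lecture.

```python
import random


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b * x."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = my - b * mx
    return a, b


def k_fold_mse(xs, ys, k=5, seed=0):
    """Average held-out mean squared error over k folds."""
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]  # k disjoint test sets
    fold_mse = []
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in idx if i not in held_out]
        # Fit on the training part only, then score on the held-out part.
        a, b = fit_line([xs[i] for i in train_idx], [ys[i] for i in train_idx])
        errors = [(ys[i] - (a + b * xs[i])) ** 2 for i in test_idx]
        fold_mse.append(sum(errors) / len(errors))
    return sum(fold_mse) / k


# Simulated data (an assumption): y = 2 + 3x + noise with unit variance.
rng = random.Random(1)
xs = [rng.uniform(0, 10) for _ in range(100)]
ys = [2 + 3 * x + rng.gauss(0, 1) for x in xs]
cv_mse = k_fold_mse(xs, ys, k=5)
print("cross-validated MSE:", round(cv_mse, 3))
```

Because every observation is predicted by a model that never saw it, the cross-validated error estimates accuracy on new data rather than fit to the sample at hand.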

Problems in Data Modeling
  Focus on the model, rather than the question / sample
  Lack of fit often not detected via goodness-of-fit tests / residual examination
  Good fit does not imply that the model represents reality

Lessons from Breiman
Rashomon: Multiplicity of models: different models, with different interpretations, with about equal fit for a single data set
  In data modeling, e.g., regression
  Also in algorithmic modeling: reduce the problem, i.e., improve prediction accuracy, by aggregating over a large number of competing models
Occam: Conflict between simplicity and accuracy
Bellman: Dimensionality - curse or blessing?
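The Rashomon effect in regression can be illustrated with a small simulation. The sketch below assumes two nearly collinear predictors, x1 and x2 (hypothetical names and hypothetical data), and fits two competing one-predictor models to the same outcome. The models tell different stories ("y depends on x1" versus "y depends on x2") but fit about equally well.

```python
import random


def r_squared(xs, ys):
    """R^2 of the simple least-squares regression of ys on xs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy ** 2 / (sxx * syy)


# Simulated data (an assumption): x2 is x1 plus a little noise,
# and the outcome depends on both predictors.
rng = random.Random(7)
n = 500
x1 = [rng.gauss(0, 1) for _ in range(n)]
x2 = [a + rng.gauss(0, 0.1) for a in x1]
y = [a + b + rng.gauss(0, 0.5) for a, b in zip(x1, x2)]

r2_model_1 = r_squared(x1, y)  # model 1: y regressed on x1 only
r2_model_2 = r_squared(x2, y)  # model 2: y regressed on x2 only
print("R^2 with x1:", round(r2_model_1, 3))
print("R^2 with x2:", round(r2_model_2, 3))
```

Goodness of fit alone cannot choose between the two models here, which is the multiplicity problem the slide describes.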

2. Different cases - different cultures?
  1. Causal inference
  2. Supporting a theory
  3. Cox & Wermuth's criteria for stochastic models
  4. Prediction - exploratorily assessing relevant predictors
  5. Accurate prediction from predictor set
  6. Distributional modeling (e.g., growth curves)

1. Causal inference
Experimental design, randomized clinical trial
Potential outcome model (Rubin, 1974)
Requirements (Sagarin et al., 2014):
  the effect of a cause is always relative to another cause
  each observation unit (e.g., patient) is potentially exposable to any one of the causes
  a cause must temporally precede its effect
Viewed in terms of prediction: What would happen if a future patient received treatment A, and what if that patient received treatment B?
Sagarin, B.J., West, S.G., Ratnikov, A., Homan, W.K., Ritchie, T.D., & Hansen, E.J. (2014). Treatment noncompliance in randomized experiments: Statistical approaches and design issues. Psychological Methods, 19(3), 317-333. http://dx.doi.org/10.1037/met0000013
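A minimal simulation of the potential outcome idea, assuming made-up outcome distributions and a true average effect of 1.5 in favour of treatment A (all numbers and variable names are hypothetical). Each patient has an outcome under A and an outcome under B, but only the one belonging to the assigned treatment is observed. Under random assignment, the difference in observed group means estimates the average causal effect.

```python
import random

rng = random.Random(42)
n = 2000

# Hypothetical potential outcomes for every patient: y_a if treated with A,
# y_b if treated with B. In real data only one of the two is ever observed.
y_a = [rng.gauss(10, 2) for _ in range(n)]
y_b = [ya - 1.5 + rng.gauss(0, 1) for ya in y_a]
true_effect = sum(a - b for a, b in zip(y_a, y_b)) / n

# Random assignment: True means the patient receives A, False means B.
gets_a = [rng.random() < 0.5 for _ in range(n)]
observed_a = [y_a[i] for i in range(n) if gets_a[i]]
observed_b = [y_b[i] for i in range(n) if not gets_a[i]]

# Difference in observed means between the two randomized groups.
estimate = sum(observed_a) / len(observed_a) - sum(observed_b) / len(observed_b)
print("true average effect (A minus B):", round(true_effect, 3))
print("estimate from randomized groups:", round(estimate, 3))
```

The estimate answers the prediction question on the slide at the group level: what a future patient could expect, on average, under A compared with B.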

2. Supporting a theory
To test a theory with free parameters is to... (Roberts & Pashler, 2000):
  determine how the theory constrains possible outcomes (i.e., what it predicts)
  assess how firmly actual outcomes agree with those constraints
  determine if plausible alternative outcomes would have been inconsistent with the theory, allowing for the variability of the data
Roberts, S., & Pashler, H. (2000). How persuasive is a good fit? A comment on theory testing. Psychological Review, 107, 358-367.

3. Cox & Wermuth's criteria for [stochastic] models (1996, section 1.8)
  Link with underlying substantive knowledge
  Link with previously published work, for comparison
  Some indication or pointer towards a process that might have generated the data
  Model must be rich enough to capture central features of interest (i.e., include key parameters, e.g., treatment effect)
  ...
  Model should be consistent with data under analysis
Cox, D.R. & Wermuth, N. (1996). Multivariate Dependencies - Models, Analysis and Interpretation. London: Chapman & Hall.

4. Prediction - exploratorily assessing relevant predictors
Aim: to accurately predict treatment success
  treatment-subgroup interaction, subgroup-treatment effect interaction, or treatment-covariate interaction
Question: How to identify good and useful predictors?

4. Prediction - Qualitative interaction trees
to identify qualitative treatment-subgroup interactions
Dusseldorp, E., & Van Mechelen, I. (2014). Qualitative interaction trees: a tool to identify qualitative treatment-subgroup interactions. Statistics in Medicine, 33(2), 219-237.

4. Prediction - Qualitative interaction trees
Case: large number of moderators, absence of clear a priori hypotheses
Problems in identifying moderators: multiplicity, and spurious interactions that cannot be replicated in follow-up studies
QUINT (Dusseldorp & Van Mechelen, 2014):
  1 Check on presence of qualitative interaction
  2 Identify combinations of dichotomized moderators that are most important for qualitative treatment-subgroup interactions
  3 Result: binary tree, partitioning the total sample into three groups. Patients for whom...
      treatment A is better than treatment B
      treatment B is better than A
      it does not make any difference

4. QUINT
Start: group of N patients randomly assigned to one of two treatments, A and B
Before treatment: set of baseline variables, i.e., categorical and/or continuous background characteristics (e.g., severity of disease), and possibly the outcome variable
After treatment: one primary continuous outcome variable
Outcome variable measured before and after treatment: a change score, the slope of the response over time, or the time to an event can be used as outcome for QUINT

4. QUINT - conditions
Aim: find the best partition of all patients, using the baseline variables, into two or three mutually exclusive and exhaustive subgroups (i.e., partition classes)
  $p_1$: $A > B$; $p_2$: $A < B$; $p_3$: $A \approx B$
Conditions
  1 Difference in treatment outcome component: in $p_1$ and $p_2$, the difference in outcome between treatments A and B should be as large as possible
  2 Cardinality component: $p_1$ and $p_2$ should comprise as many patients as possible
The partitioning criterion ensures maximization of the conjunction of the two components
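A toy illustration of how the two components can pull together when searching for a single split. The slides do not give the form of the QUINT partitioning criterion, so the function below is a stand-in and not the published criterion. It assumes one baseline variable, a single binary split, and a cardinality-weighted sum of absolute treatment differences that counts only when the sign of the difference reverses between the two leaves. The data and all names are hypothetical.

```python
def mean(values):
    return sum(values) / len(values)


def leaf_difference(patients):
    """Mean outcome under A minus mean outcome under B within one leaf."""
    a = [p["y"] for p in patients if p["treatment"] == "A"]
    b = [p["y"] for p in patients if p["treatment"] == "B"]
    if not a or not b:
        return 0.0
    return mean(a) - mean(b)


def toy_criterion(patients, cut):
    """Stand-in criterion (NOT the QUINT criterion): reward large,
    opposite-signed treatment differences in large leaves."""
    left = [p for p in patients if p["severity"] <= cut]
    right = [p for p in patients if p["severity"] > cut]
    if not left or not right:
        return 0.0
    d_left, d_right = leaf_difference(left), leaf_difference(right)
    if d_left * d_right >= 0:
        return 0.0  # no sign reversal, so no qualitative interaction
    n = len(patients)
    # Difference component weighted by the share of patients in each leaf.
    return len(left) / n * abs(d_left) + len(right) / n * abs(d_right)


# Hypothetical data: A works better at low severity, B at high severity.
patients = []
for severity in range(1, 11):
    y_a, y_b = (8.0, 4.0) if severity <= 4 else (3.0, 7.0)
    patients.append({"severity": severity, "treatment": "A", "y": y_a})
    patients.append({"severity": severity, "treatment": "B", "y": y_b})

best_cut = max(range(1, 10), key=lambda c: toy_criterion(patients, c))
print("best cut on severity:", best_cut)
print("toy criterion at best cut:", toy_criterion(patients, best_cut))
```

In this constructed example the search recovers the cut at severity 4, where the left leaf corresponds to $p_1$ (A better) and the right leaf to $p_2$ (B better). For real analyses, the quint R package by the method's authors is the place to look.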

4. Stepwise binary splitting procedure
At each step: maximize the partitioning criterion
After each split: all leaves are re-assigned afresh to subgroups (i.e., a nonrecursive procedure)

4. Sequential partitioning algorithm
Algorithm:
  1 Stepwise procedure to optimize criterion C
  2 Stopping criterion: when C is maximized, and 4 boundary conditions are met
  3 Pruning: reduce the tree resulting from step 2, to ensure a well-fitting tree for future data as well (using a bootstrap procedure)

4. Quint in action - Improvement in depression
Background characteristics and outcome

4. Quint in action - Improvement in depression
Final tree

5. Accurate prediction from set of predictors
Aim: to accurately predict, full stop
The relationship between predictors and criterion is a black box
Example: Movieweb - individually tailored suggestions for other movies of interest

6. Models for the distribution of criterion scores, as a function of predictors
  Growth curve (e.g., length), as a function of age and gender
  Norming (i.e., test scores), as a function of age
from: http://cse3521.artifice.cc/classification-evaluation.html

Example of norming
Van Wiechen scheme: each Dutch child is assessed during 8 visits at the Child Health Care Center
Developmental scores (D-score) as a function of age
from: Jacobusse G.W., Buuren S. van, Verkerk P.H. (2006). An interval scale for development of children aged 0-2 years. Statistics in Medicine, 25(13), 2272-2283.

3. Exemplary cases
Randomized clinical trial (RCT): Effectiveness of treatments for panic disorder (Van Apeldoorn et al., 2010)
Treatments: cognitive behavioral therapy (CBT), medication (SSRI), or both (CBT+SSRI)
Research questions:
  Which treatment is most effective, in the short term and in the long term?
  Are there any differential effects ("what works for whom")?

RCT - Design
pre-test
9 months treatment, one of:
  18 sessions CBT
  9 sessions SSRI
  18 sessions CBT + 9 sessions SSRI
post-test I
3 months treatment, one of:
  3 sessions CBT
  3 sessions SSRI (taper-off)
  3 sessions CBT + 3 sessions SSRI
post-test II
6 months after post-test II: follow-up I
12 months after post-test II: follow-up II

RCT - Patient flow

RCT - Panic disorder treatment
Outcome measures: indicators of anxiety, depression, quality of life
Predictors
  treatment, time
  duration of illness, level of agoraphobia (no,..., severe), benzodiazepine use (yes, no)
  patient type (completer, no-taper, drop-out)

RCT - Model-based expected scores

RCT - Questions
Which approach appears to be most reasonable: data modeling or algorithmic modeling? Or somewhere in between...
  Causal inference
  Supporting a theory
  C & W's criteria for stochastic models
  Prediction - exploratorily assessing relevant predictors
  Accurate prediction from predictor set
  Distributional modeling (e.g., growth curves)
The authors used a random coefficient model, using a data modeling approach.
  Limitations?
  Alternative ideas for modeling?

Measuring developmental level
Van Wiechen scheme: each Dutch child is assessed during 8 visits at the Child Health Care Center
Examples of items (from: Jacobusse G.W., Buuren S. van, Verkerk P.H. (2006). An interval scale for development of children aged 0-2 years. Statistics in Medicine, 25(13), 2272-2283.)

Rasch model
$P(X_{ijt} = 1 \mid \theta_{it}, \delta_j) = \frac{\exp(\theta_{it} - \delta_j)}{1 + \exp(\theta_{it} - \delta_j)}$, with $\theta_{it}$ the developmental level of child $i$ at age $t$ and $\delta_j$ the difficulty of item $j$
Empirical pass rates as a function of age
from: Jacobusse G.W., Buuren S. van, Verkerk P.H. (2006). An interval scale for development of children aged 0-2 years. Statistics in Medicine, 25(13), 2272-2283.
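A short sketch of the Rasch pass probability: the logistic function of the gap between a child's developmental level and an item's difficulty. The function name and the example values for theta and delta are hypothetical; only the logistic form of the Rasch model is taken as given.

```python
import math


def rasch_probability(theta, delta):
    """Probability of passing an item of difficulty delta for a child
    with developmental level theta, under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - delta)))


# Hypothetical values: one item of difficulty 0, children at three levels.
item_difficulty = 0.0
for level in (-1.0, 0.0, 1.0):
    p = rasch_probability(level, item_difficulty)
    print(f"theta = {level:+.1f}: P(pass) = {p:.3f}")
```

When level and difficulty coincide the pass probability is exactly 0.5, and it rises monotonically with the developmental level. That is why empirical pass rates plotted against age or D-score are expected to follow an S-shaped curve.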

Fit
$P(X_{ijt} = 1 \mid \theta_{it}, \delta_j) = \frac{\exp(\theta_{it} - \delta_j)}{1 + \exp(\theta_{it} - \delta_j)}$, with $\theta_{it}$ the developmental level of child $i$ at age $t$ and $\delta_j$ the difficulty of item $j$
Empirical pass rates as a function of D-score
from: Jacobusse G.W., Buuren S. van, Verkerk P.H. (2006). An interval scale for development of children aged 0-2 years. Statistics in Medicine, 25(13), 2272-2283.

Measuring developmental level - Questions
Which approach appears to be most reasonable: data modeling or algorithmic modeling? Or somewhere in between...
  Causal inference
  Supporting a theory
  C & W's criteria for stochastic models
  Prediction - exploratorily assessing relevant predictors
  Accurate prediction from predictor set
  Distributional modeling (e.g., growth curves)
The authors used a random coefficient model, using a data modeling approach.
  Limitations?
  Alternative ideas for modeling?

Identifying personality traits using the lexical approach
Psycholexical research: use trait taxonomies
  Around 30 trait taxonomies are known, in different languages, e.g., Albanian, Arabic, English, Indian, Japanese
  Taxonomies: from a dictionary, all trait-descriptive words (i.e., adjectives) are selected; this leads to a large number of adjectives (range 300-600)
Typical approach: perform principal component analysis on sample ratings
Outcome: trait dimensions, e.g., extraversion, agreeableness
Key question: which traits are pancultural?

The search for a pancultural trait structure
Included ratings from 11 taxonomies in different languages
1. Translate all non-English adjectives into English
2. Identify adjectives with the same meaning
     yielded 1993 unique trait variables, of which 1071 occurred only in a single language
     retained only the 922 trait variables that occurred in at least two languages
     only 2 trait variables occurred in all 11 languages (impulsive and sentimental), and another 13 trait variables in 10 languages (e.g., ambitious, conscientious, consistent, creative, emotional, generous, industrious...)
   Data: 11 languages (sample size range 369-991), total 7104 participants, on 922 trait variables, with structural missing data (trait variable not present in taxonomy)
3. Perform Simultaneous Component Analysis on the data

Simultaneous Components - across 11 countries
A, agreeableness; C, conscientiousness; Dyn, dynamism;...
from: De Raad, B. et al. (2014). Towards a Pan-cultural Personality Structure: Input from 11 Psycholexical Studies. European Journal of Personality, 28(5), 497-510. doi:10.1002/per.1953

Fit - variance accounted for (VAF) per country, for SCA and PCA

Fit - Ratio ($\mathrm{VAF}_{SCA} / \mathrm{VAF}_{PCA}$) and Difference ($\mathrm{VAF}_{SCA} - \mathrm{VAF}_{PCA}$)

The search for a pancultural trait structure - Questions
Which approach appears to be most reasonable: data modeling or algorithmic modeling? Or somewhere in between...
  Causal inference
  Supporting a theory
  C & W's criteria for stochastic models
  Prediction - exploratorily assessing relevant predictors
  Accurate prediction from predictor set
  Distributional modeling (e.g., growth curves)
The authors used a random coefficient model, using a data modeling approach.
  Limitations?
  Alternative ideas for modeling?