Fundamental Concepts for Using Diagnostic Classification Models. Section #2 NCME 2016 Training Session. NCME 2016 Training Session: Section 2

Similar documents
Diagnostic Classification Models

Blending Psychometrics with Bayesian Inference Networks: Measuring Hundreds of Latent Variables Simultaneously

JONATHAN TEMPLIN LAINE BRADSHAW THE USE AND MISUSE OF PSYCHOMETRIC MODELS

Factors Affecting the Item Parameter Estimation and Classification Accuracy of the DINA Model

VARIABLES AND MEASUREMENT

Measuring mathematics anxiety: Paper 2 - Constructing and validating the measure. Rob Cavanagh Len Sparrow Curtin University

Group Assignment #1: Concept Explication. For each concept, ask and answer the questions before your literature search.

Chapter 1 Introduction to Educational Research

Author s response to reviews

6. A theory that has been substantially verified is sometimes called a a. law. b. model.

Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information:

Chapter 02 Developing and Evaluating Theories of Behavior

Parallel Forms for Diagnostic Purpose

Methodological Issues in Measuring the Development of Character

COMBINING SCALING AND CLASSIFICATION: A PSYCHOMETRIC MODEL FOR SCALING ABILITY AND DIAGNOSING MISCONCEPTIONS LAINE P. BRADSHAW

Reasoning: The of mathematics Mike Askew

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS

Workshop Overview. Diagnostic Measurement. Theory, Methods, and Applications. Session Overview. Conceptual Foundations of. Workshop Sessions:

Lecturer: Dr. Emmanuel Adjei Department of Information Studies Contact Information:

for Scaling Ability and Diagnosing Misconceptions Laine P. Bradshaw James Madison University Jonathan Templin University of Georgia Author Note

Everything DiSC Manual

Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida

Model-based Diagnostic Assessment. University of Kansas Item Response Theory Stats Camp 07

RATER EFFECTS AND ALIGNMENT 1. Modeling Rater Effects in a Formative Mathematics Alignment Study

A Correlation of. to the. Common Core State Standards for Mathematics Grade 1

CHAPTER VI RESEARCH METHODOLOGY

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research

PSYCHOLOGY (413) Chairperson: Sharon Claffey, Ph.D.

A Multidimensional, Hierarchical Self-Concept Page. Herbert W. Marsh, University of Western Sydney, Macarthur. Introduction

Structural Equation Modeling (SEM)

Item Response Theory. Steven P. Reise University of California, U.S.A. Unidimensional IRT Models for Dichotomous Item Responses

GENERALIZABILITY AND RELIABILITY: APPROACHES FOR THROUGH-COURSE ASSESSMENTS

A Correlation of. to the. Common Core State Standards for Mathematics Grade 2

Introduction to Geoscience Education Research Methods

Measurement Invariance (MI): a general overview

Donald L. Patrick PhD, MSPH, Laurie B. Burke RPh, MPH, Chad J. Gwaltney PhD, Nancy Kline Leidy PhD, Mona L. Martin RN, MPA, Lena Ring PhD

Investigating the Invariance of Person Parameter Estimates Based on Classical Test and Item Response Theories

Introduction to the HBDI and the Whole Brain Model. Technical Overview & Validity Evidence

THE IMPORTANCE OF MENTAL OPERATIONS IN FORMING NOTIONS

Occupational Therapy (OC_THR)

Task Force Background Donald Patrick. Appreciation to Reviewers. ISPOR PRO GRP Task Force Reports

Connexion of Item Response Theory to Decision Making in Chess. Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison

Expert judgements in risk and reliability analysis

University of Alberta

THE USE OF CRONBACH ALPHA RELIABILITY ESTIMATE IN RESEARCH AMONG STUDENTS IN PUBLIC UNIVERSITIES IN GHANA.

Validity and reliability of measurements

Answers to end of chapter questions

TYPES OF RESEARCH (CONTD )

Goal #1: To train students who are thoroughly grounded in the science of psychology and its application to health and disease.

IDENTIFYING DATA CONDITIONS TO ENHANCE SUBSCALE SCORE ACCURACY BASED ON VARIOUS PSYCHOMETRIC MODELS

Factors Influencing Undergraduate Students Motivation to Study Science

Comprehensive Statistical Analysis of a Mathematics Placement Test

Introduction to biostatistics & Levels of measurement

Study Endpoint Considerations: Final PRO Guidance and Beyond

SEMINAR ON SERVICE MARKETING

11/24/2017. Do not imply a cause-and-effect relationship

FUNDAMENTALS OF PSYCHOANALYTIC THOUGHT COURSE CATALOG ACADEMIC YEAR

Scale Building with Confirmatory Factor Analysis

Multifactor Confirmatory Factor Analysis

Standard KOA3 5=3+2 5=4+1. Sarah Krauss CCLM^2 Project Summer 2012

Why Mixed Effects Models?

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

PÄIVI KARHU THE THEORY OF MEASUREMENT

Rasch Versus Birnbaum: New Arguments in an Old Debate

TRANSLATING RESEARCH INTO PRACTICE

STA630 Research Methods Solved MCQs By

EDUCATIONAL PSYCHOLOGY (EPSY)

The Psychometric Development Process of Recovery Measures and Markers: Classical Test Theory and Item Response Theory

Structural Validation of the 3 X 2 Achievement Goal Model

Description of components in tailored testing

computation and interpretation of indices of reliability. In

Utilizing the NIH Patient-Reported Outcomes Measurement Information System

CRIMINAL JUSTICE (CJ)

Validity and reliability of measurements

Re-Examining the Role of Individual Differences in Educational Assessment

How Does Analysis of Competing Hypotheses (ACH) Improve Intelligence Analysis?

On Test Scores (Part 2) How to Properly Use Test Scores in Secondary Analyses. Structural Equation Modeling Lecture #12 April 29, 2015

Table 1 Hypothesized Relations between Motivation and Self-Regulation

Bruno D. Zumbo, Ph.D. University of Northern British Columbia

Basic Concepts in Research and DATA Analysis

Implicit Information in Directionality of Verbal Probability Expressions

Alignment in Educational Testing: What it is, What it isn t, and Why it is Important

MIETTINEN OS: ON EXCELLENCE OF EPIDEMIOLOGIC ACADEMIA

Parent initiated Why evaluate (3) Questions about program appropriateness School initiated Conflict resolution Component of a regular peer review proc

2 Types of psychological tests and their validity, precision and standards

The University Curriculum Committee will meet at 3:00 p.m. on Wednesday, March 4 in University Hall 282. A G E N D A

Occupational Therapy. Undergraduate. Graduate. Accreditation & Certification. Financial Aid from the Program. Faculty. Occupational Therapy 1

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Using Connectionist Models to Evaluate Examinees Response Patterns to Achievement Tests

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) *

In this chapter we discuss validity issues for quantitative research and for qualitative research.

Computer Adaptive-Attribute Testing

Experimental Design. Dewayne E Perry ENS C Empirical Studies in Software Engineering Lecture 8

Contents. What is item analysis in general? Psy 427 Cal State Northridge Andrew Ainsworth, PhD

Chapter 4 DESIGN OF EXPERIMENTS

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data

EXECUTIVE SUMMARY 9. Executive Summary

Transcription:

Fundamental Concepts for Using Diagnostic Classification Models Section #2 NCME 2016 Training Session NCME 2016 Training Session: Section 2

Lecture Overview Nature of attributes What s in a name? Grain sizes explained Q-matrices Attribute hierarchies Types of attribute hierarchies Reporting of attribute profiles Developing cognitive processing models NCME 2016 Training Session: Section 2 2

THE NATURE OF ATTRIBUTES NCME 2016 Training Session: Section 2 3

The Nature of Attributes Different labels have been suggested in the literature for the latent variables: Latent characteristics Latent traits Elements of processes Attributes Each of these terms carries with it a specific connotation that reflects important beliefs about what analysts hope they can learn from applying DCMs NCME 2016 Training Session: Section 2 4

Meanings of Terms Latent characteristic: Highlights that the mental components that are of theoretical interest are unobserved, which is why they are represented with latent variables in DCMs Latent trait: Highlights that the mental components of interests are believed to be stable across time in contrast to latent states that change over time Element: Refers to the fact that the mental components are building blocks of a larger conceptual entity, which is often, but not necessarily, the cognitive response process NCME 2016 Training Session: Section 2 5

Meanings of Terms Attribute: Is perhaps most frequently used in the measurement literature on diagnostic assessments Has a history of use in the literature on factor analysis (e.g., McDonald, 1999) Typically synonymous with the terms latent trait and latent characteristic We use the term attribute for individual latent variables The dominant discourse on DCMs Attribute profile denotes a particular constellation of latent variable values for a particular respondent NCME 2016 Training Session: Section 2 6

Types of Attributes NCME 2016 Training Session: Section 2 7

Grain Sizes of Attributes The degree of definitional specificity of an attribute is often referred to as the definitional grain size The grain size is driven by the level of specificity with which one desires to make statements about respondents The grain size of an attribute is the resolution with which an investigator dissects a cognitive response process and describes its constituent components NCME 2016 Training Session: Section 2 8

Practical Issues with Grain Sizes It is possible to decompose individual attributes for more complex tasks further That would increase the number of attributes As the number of attributes increases, the number of latent variables in a DCM increases Attribute profiles and item parameters may become impossible to estimate statistically It is important to fix the number of attributes to a statistically manageable number for a given diagnostic assessment length and respondent sample size That number is rapidly increasing as technology gets better NCME 2016 Training Session: Section 2 9

Q-matrices The specification of which attributes are constitutive of the response process for each item is done numerically in a table with a particular structure called a Q-matrix (Tatsuoka, 1983) A Q-matrix traditionally contains the items in the rows and the attributes in the columns Its entries consist of 1s and 0s indicating whether or not an attribute is required to respond to an item A 1 indicates the item measures the attribute Identical to factor pattern matrices in CFA Define which parameters are set to zero in the DCM NCME 2016 Training Session: Section 2 10

Example Q-matrix NCME 2016 Training Session: Section 2 11

ATTRIBUTE HIERARCHIES NCME 2016 Training Session: Section 2 12

Attribute Hierarchies Attribute hierarchies: specifications of the attribute dependencies in the population of respondents By implication they represent hypotheses about which attribute profiles should be observed in a sample Suppose that mastery of Attribute 1 is prerequisite to mastery of Attribute 2: Attribute profiles where the second but not the first attribute is mastered must logically be empty in the population NCME 2016 Training Session: Section 2 13

Types of Attribute Hierarchies NCME 2016 Training Session: Section 2 14

Additional Matrices Implied by Attribute Hierarchies There are additional attribute matrices implied by attribute hierarchies: Adjacency Matrix Lists hierarchically dependent attributes Reachability Matrix Lists which attributes can be reached by others Comes from Adjacency Matrix Used in Attribute Hierarchy/Rule Space techniques We use these matrices to reduce the number of parameters in the structural model (see Chapter 8) Adjacency Matrix is a network/graph theoretic entity NCME 2016 Training Session: Section 2 15

REPORTING OF ATTRIBUTE PROFILES NCME 2016 Training Session: Section 2 16

Example Profile Report (Page 1) NCME 2016 Training Session: Section 2 17

Example Profile Report (Page 2) NCME 2016 Training Session: Section 2 18

DEVELOPING COGNITIVE PROCESSING MODELS NCME 2016 Training Session: Section 2 19

Attributes Exist Because of Theory Current applications of DCMs typically involve attributes that are defined via a theory of response processing supported by research in applied cognitive psychology and educational measurement In order to use DCMs to represent and, ideally, validate cognitive processing models one first needs to have developed a plausible model from theory and empirical investigations Cognitive response processes can be decomposed more easily into their constituent attributes for tasks that are narrower in scope and are can be solved with fewer alternative response strategies that rely on combinations of different attributes NCME 2016 Training Session: Section 2 20

Methods Used for Cognitive Models Verbal reports and protocol studies Both require that the items under consideration are presented to a sample of respondents from the population who are probed about the way they respond to them Eye-tracking research Evidence for the cognitive processes that respondents engage in comes from the physiological manifestations of these processes Expert panels Ask experts to describe the cognitive processes behind item responses based on prior research and experience with assessments in the domain NCME 2016 Training Session: Section 2 21

Limitations of the Methodologies Researchers from disciplines outside of educational measurement are often disconcerted with the use of certain terms and procedures in the educational measurement literature on diagnostic assessment In cognitive psychology, the dominant focus is on understanding the basic mental architecture of human beings and its consequences on how human beings process information to solve particular tasks In differential psychology, the dominant focus is on explaining intra- and inter-individual differences on these mental components and capacities and their structural relationships in a population NCME 2016 Training Session: Section 2 22

APPLYING CONCEPTS: THE DIAGNOSING TEACHERS MULTIPLICATIVE REASONING PROJECT NCME 2016 Training Session: Section 2 23

Introduction Diagnosing Teachers Multiplicative Reasoning* (DTMR) NSF funded grant (DRL-0903411) Goal was to create a test that will assess fine-grained components of teachers reasoning multiplicatively with rational numbers The test was used to Tailor professional development to teachers needs Quantitatively study teachers fine-grained abilities to reason multiplicatively Quantify findings based on extensive qualitative research base Generalize to larger populations NCME 2016 Training Session: Section 2 24

Big Picture Most psychometric models are designed to measure a unidimensional continuous trait or ability Examples of continuous traits Student s math ability at the 8th grade level In-service teachers mathematical knowledge for teaching number and operations The content area of focus for this study As a result, many tests are designed to measure a unidimensional ability This project took a different approach A multidimensional diagnostic approach Using a new class of psychometric models NCME 2016 Training Session: Section 2 25

Diagnosing Multiplicative Reasoning Instead of measuring an overall ability to reason multiplicatively with fractions, we can break that continuous trait down into more fine-grained cognitive facilities or attributes: Ability to identify appropriate referent units for numbers Ability to partition quantities and iterate unit fractions Ability to identify appropriate arithmetic operations Ability to make multiplicative comparisons We treat these attributes as categorical Dichotomous (have two categories) Mastery of an attribute ( = 1) or non-mastery of an attribute ( = 0) NCME 2016 Training Session: Section 2 26

Example Item This item is analogous to Item 22 on the DTMR test» Measures Referent Unit (Attribute 1) and Partitioning and Iterating (Attribute 2) Singapore CDM Workshop, Lecture 3 27

Groups According to Attribute Mastery The groups are based on patterns of mastery according to the set of attributes A classification of each individual skill results in a classification into one of these 16 patterns 2 A possible patterns or groups: Pattern RU PI APP MC 1 0 0 0 0 2 0 0 0 1 3 0 0 1 0 4 0 0 1 1 5 0 1 0 0 6 0 1 0 1 7 0 1 1 0 8 0 1 1 1 9 1 0 0 0 10 1 0 0 1 11 1 0 1 0 12 1 0 1 1 13 1 1 0 0 14 1 1 0 1 15 1 1 1 0 16 1 1 1 1 NCME 2016 Training Session: Section 2 28

Designing Diagnostic Tests Diagnostic tests are written so that each item measures one or more of the attributes The attributes measured by each item are recorded in a Q-matrix Describes whether an item measures an attribute (q = 1) or not (q = 0) Mapping is established by content experts Confirmed by item response interviews First several items on DTMR test: RU PI APP MC Item 1 1 0 0 0 Item 2 0 0 1 0 Item 3 1 0 0 0 Item 4 1 0 0 1 NCME 2016 Training Session: Section 2 29

Attribute Hierarchies We tested the following hierarchies using the Hierarchical Diagnostic Classification Model* (HDCM) All hierarchies fit significantly worse (p<.001) than the no hierarchy No Hierarchy Hierarchy 1 Hierarchy 2 α 1 α 2 α 2 α 3 α 4 α 2 α 3 α 4 α 3 α 4 α 1 α 1 Hierarchy 3 Hierarchy 7 α 2 α 3 α 4 α 2 α 3 α 4 α 1 α 1 *Templin, J., & Bradshaw, L. (2011). Hierarchical diagnostic classification models: A family of models for estimating and testing attribute hierarchies. Manuscript under review.

Wrapping Up In this lecture we defined: Attributes Q-matrices Attribute hierarchies How cognitive theories are built Limitations of the approaches NCME 2016 Training Session: Section 2 31