The Psychometric Development Process of Recovery Measures and Markers: Classical Test Theory and Item Response Theory
|
|
- Colin Henry
- 5 years ago
- Views:
Transcription
1 The Psychometric Development Process of Recovery Measures and Markers: Classical Test Theory and Item Response Theory Kate DeRoche, M.A. Mental Health Center of Denver Antonio Olmos, Ph.D. Mental Health Center of Denver Susan Hutchinson, Ph.D. University of Northern Colorado CJ McKinney, M.A. Mental Health Center of Denver
2 What presentation is this? We plan to combine 2 presentations: The Psychometrics of a Self-Report Recovery Measure: A Comparison of Classical Test Theory and Rasch Modeling Mental Health Center of Denver s (MHCD) Recovery Markers: Development of a Clinician Based Measure of Recovery From Mental Illness into one presentation with 2 examples (scales under development) We will focus on the process for the development of two measurement tools, including advantages and limitations of psychometric methodologies: 1 st Classical Test Theory 2 nd Rasch Models in IRT 3 rd Additional Models in IRT
3 Presentation Overview Warning: We have a wealth of data to discuss Overview: Briefly describe CTT and IRT Recovery initiative at MHCD and its implications for outcomes measures Example 1: Consumer Recovery Measure CTT Analysis Rasch Model Example 2: Recovery Markers CTT Analysis Partial Credit Model Nominal IRT Model Lessons Learned How many of you have used Item Response Theory, or have a background knowledge of how it works?
4 Comparison of CTT and IRT (Hambleton, Swaminathan, Roger, 1991) Classical Test Theory (CTT) Item characteristics are sample and test dependent Items are commonly at an equal level of the trait Reliability and assumption of paralleltest, difficult to obtain Equal standard error for all participants Item Response Theory (IRT) Separation of item parameters and participants ability Items are monotonically increasing in the latent trait Assumptions of unidimensionality & local independence Multiple models (1PL, 2PL & 3PL)
5 Models of CTT and IRT CTT has a single model IRT includes a collection of models (validity issues in model selection) 1PL (Rasch Models) N=100 s Difficulty parameters 2PL N= 1,000 s Difficulty and item discrimination parameters 3PL N= 10,000 s Difficulty, item discrimination, & pseudo guessing parameters
6 The Recovery Initiative at MHCD and its implications for outcomes Measurement The Mental Health Center of Denver is large, nonprofit community-based mental health center providing services for adults, children and families In the Evaluation and Research Department, we hold the belief that models should be data driven Evaluation of Recovery through 3 measures: Consumer Recovery Measure Recovery Markers Recovery Enhancing Environment Recovery Marker We have created 2 instruments to evaluate mental health recovery (latent trait) in adults consumers from two different perspectives Recovery Enhancing Environment Mental Health Recovery as defined at MHCD Consumer Recovery Measure
7 Theory and Measurement The relationship between theory and measurement is critical for latent constructs to be able to provide a feedback loop for quality improvement Measurement revised the underlying constructs of a theory which, in turn, revises the measurement tool Continuous Process Improve Theory Improve Theory Initially Define Theory Improve Measures Improve Measures Develop Measures
8 Example 1: Consumer Recovery Measures Measures recovery from the consumer s point of view in 5 domains: Active/growth orientation Hope Symptom s interference Safety Social network
9 Step 1: Classical Test Theory Conducted an Exploratory Factor analysis which revealed 5 factors and explained approximately 57% of the variance Conducted Cronbach s alpha reliability analysis Active/growth orientation (α = 0.67) Hope (α = 0.77) Symptom Interference (α = 0.88) Safety (α =.72) Social Network (α = 0.63) Total Scale = 0.88 According to this statistics we should conclude that the scale is not bad.
10 Step 2: Rasch Modeling Rating Scale Model (1PL) In IRT, a validity issue is selecting a model that is appropriate for your data. Most commonly, you begin with the simplest model (Rasch Model) and if it fits, you can stop, if not you can try a more complicated model. (some people do not agree with this concept) For example, our data is a Likert-type scale so we used a Rasch Rating Scale Model, which produced the following reliabilities: Domain Number of items Marginal Reliability (IRT) Active Hope Symptom Safe Social Total CRM The IRT analysis produced acceptable reliabilities, but further analysis showed something rather interesting:
11 Additional Information Provided by Rasch Rating Scale Model (IRT): Item Person Map High Score (High Recovery) Low Scores (Low Recovery) Consumers: Items: Q4 (3.36)-HOPE Q5 (2.57)- ACTIVE Q8 (2.14)- HOPE Q2 (1.72)- ACTIVE Q9 (1.29)- ACTIVE Q17 (1.16)- SOCIAL Q11 (.90)- SAFE Q7 (2.63)- HOPE Hard Items Q20 (1.76)- SYMP & Q21(2.08)- SYMP Q18 (1.43) SYMP Q14 (1.34)- SAFE & Q15 (1.33) SYMP Q12 (1.24)- SOCIAL Q19 (1.03) SOCIAL Q16 (.83)- SAFE Notice, that all of the items are at a higher level of recovery than the consumers This finding forced us to change our interpretation about the psychometrics of the scale Easy Items
12 Comparison of CTT and IRT results for Example 1 By only reviewing the CTT analysis the psychometrics seemed fine With the additional information provided by the Rasch model we understand that our questions are too difficult for our sample, Therefore, we need to create more items that will measure less recovery (to measure small changes)
13 Example 2: Recovery Markers Indicators usually associated with individual's recovery, but are not necessary for recovery (thus the term markers ) Includes 6 dimensions with varying response sets: Employment (8 response categories) Education/Training (7 response categories) Active/Growth orientation (6 response categories) Symptom Interference (5 response categories) Housing * (9 response categories) Engagement/role with service provider (6 response categories) Substance Abuse- level of use (6 response categories) Substance Abuse- level of change* (5 response categories)
14 Clinicians, rather than consumers complete this scale on a predetermined schedule dependent on the team type (High/medium CM versus outpatient)
15 Step 1: CTT Reliability and Factor Analysis Internal Reliability estimated Cronbach s alpha = 0.67 An EFA revealed 2 correlated factors (52.11%; 1 factor: 30%) A CFA was conducted on the 1 factor solution x²(11)= 26.32, p= (n = 776) RMSEA = GFI = 0.99 CFI = 0.99 This analysis told us that the scale could be improved, but did not explain how or where
16 Step 2: Partial Credit Rasch Model PERSONS MAP OF ITEMS <frequ> <less> 2 + T EDUCATION S. T EMPLOYMENT.#.##.### I #### ACTIVE/GROWTH SUBSTANCE_ABUSE-change 0.###### S+M.####### ENGAGEMENT/ROLE-ServProv.######### SUBSTANCE_ABUSE-use.######## SYMPTOM_INTERFERENCE.############ ########### M ########### S -1.###### +.#######.##### S.###.## HOUSING.## T. -2.# T <rare> <more> Index suggesting good model fit for persons -Mean Square Infit =.99 -Mean Square Outfit =1.0 Index suggesting moderate model fit for items -Mean Square Infit = Mean Square Outfit =1.03 Education & Employment are too difficult for the sample Housing is the easiest item Big gaps with no items measuring the majority of participants
17 Step 3: Nominal Response Model The Nominal Response model is based on the 2 PL and requires more participants (1,000 s) Allows us to view the order of the responses within an item, to make sure they are ordered We can change the response categories to make sure that they are ordered in terms of difficulty
18 Hard Example of Nominal Output Employment Education Active/Growth C- Active Job Search (3.29) F- Full time college (4.77) E- Part time college (.3.66) 4.0 F- Very high (3.41) D- Non-paid work/volunteer (2.64) Difficulty of Item H- Full time independent (1.59) G- Part time independent (.76) E- Part time supported (-.37) F- Full time supported (-1.22) B- Interest in work, no action (-1.99) A- No interest in work (-2.71) D- noncredit training (-.10) G- Recent Grad (-.20) C- Active education/training search (-1.00) B-interest in education, no action (-1.05) A- No interest in education (-2.82) E- High (-.75) D- Moderate out MH system (-1.62) B- Low (-2.28) C- Moderate in MH system (-2.37) Easy A- Very low (-3.88) There are issues of improper ordering, large gaps, where there are not responses & clumps of responses
19 We have met with our advisory committee and proposed some changes that will take care of the problems described Implementation is underway for the new version (2.0) and we should start collecting data soon
20 Review of Psychometric Process 1 st CTT analysis Determine reliability 2 nd Rasch modeling Determine model fit (reliability), participants ability level & item difficulty 3 rd Nominal Model Determine model fit (reliability) and ordering of responses within items As you increase the complexity of the measurement model, you also increase the assumptions required
21 Lessons Learned Requires time to educate yourself, critical to use appropriate model for your data In IRT literature be prepared to read conflicting pieces of information regarding model usage (Rasch vs. IRT) If you have stakeholders that want to be involved in the analysis (and we believe they should!), be prepared with example concepts (i.e. IQ) Sample size requirements Have resource to conduct analysis, stakeholder buy in Purchase software (winstep, bilog, etc..) Computer memory (Maximum Likelihood estimation)
22 Take home Message Measurement is a critical step in any evaluation IRT is beneficial and allows you to see more aspects of measurement than CTT alone As we increase our understanding with IRT, we also increase our assumptions Regardless of which method you use, understanding the benefits and limitations of your measurement model will help to interpret your data
23 Questions??? Contact Information: Kate DeRoche, M.A. (303) Antonio Olmos, Ph.D. (303)
24 IRT Resources IRT 101 Reise, S. P., Ainsworth, A. T. & Haviland, M. G. (2005). Item Response Theory: Fundamentals, Applications, and Promise in Psychological Research. Current Directions in Psychological Science, 14, IRT Books Hambleton, R. K., Swaminathan, H. & Rogers, H. J. (1991). Fundamentals of Item Response Theory. Newbury Park, CA: Sage Publication, Inc. Embretson, S. E. & Reise, S. P. (2000). Item Response Theory for Psychologists. Mahwah, NJ: Lawrence Erlbaum Associates And many more resources
Techniques for Explaining Item Response Theory to Stakeholder
Techniques for Explaining Item Response Theory to Stakeholder Kate DeRoche Antonio Olmos C.J. Mckinney Mental Health Center of Denver Presented on March 23, 2007 at the Eastern Evaluation Research Society
More informationThe MHSIP: A Tale of Three Centers
The MHSIP: A Tale of Three Centers P. Antonio Olmos-Gallo, Ph.D. Kathryn DeRoche, M.A. Mental Health Center of Denver Richard Swanson, Ph.D., J.D. Aurora Research Institute John Mahalik, Ph.D., M.P.A.
More informationReferences. Embretson, S. E. & Reise, S. P. (2000). Item response theory for psychologists. Mahwah,
The Western Aphasia Battery (WAB) (Kertesz, 1982) is used to classify aphasia by classical type, measure overall severity, and measure change over time. Despite its near-ubiquitousness, it has significant
More informationComprehensive Statistical Analysis of a Mathematics Placement Test
Comprehensive Statistical Analysis of a Mathematics Placement Test Robert J. Hall Department of Educational Psychology Texas A&M University, USA (bobhall@tamu.edu) Eunju Jung Department of Educational
More informationInstrument equivalence across ethnic groups. Antonio Olmos (MHCD) Susan R. Hutchinson (UNC)
Instrument equivalence across ethnic groups Antonio Olmos (MHCD) Susan R. Hutchinson (UNC) Overview Instrument Equivalence Measurement Invariance Invariance in Reliability Scores Factorial Invariance Item
More informationConstruct Validity of Mathematics Test Items Using the Rasch Model
Construct Validity of Mathematics Test Items Using the Rasch Model ALIYU, R.TAIWO Department of Guidance and Counselling (Measurement and Evaluation Units) Faculty of Education, Delta State University,
More informationProceedings of the 2011 International Conference on Teaching, Learning and Change (c) International Association for Teaching and Learning (IATEL)
EVALUATION OF MATHEMATICS ACHIEVEMENT TEST: A COMPARISON BETWEEN CLASSICAL TEST THEORY (CTT)AND ITEM RESPONSE THEORY (IRT) Eluwa, O. Idowu 1, Akubuike N. Eluwa 2 and Bekom K. Abang 3 1& 3 Dept of Educational
More informationThe Functional Outcome Questionnaire- Aphasia (FOQ-A) is a conceptually-driven
Introduction The Functional Outcome Questionnaire- Aphasia (FOQ-A) is a conceptually-driven outcome measure that was developed to address the growing need for an ecologically valid functional communication
More informationMultilevel Techniques for Quality Control Charts of Recovery Outcomes
Multilevel Techniques for Quality Control Charts of Recovery Outcomes INFORMS Annual Meeting 2009 San Diego, CA October, 11 th, 2009 Linda Laganga, PhD* (Linda.Laganga@mhcd.org) CJ McKinney, MA Kate DeRoche,
More informationItem Response Theory: Methods for the Analysis of Discrete Survey Response Data
Item Response Theory: Methods for the Analysis of Discrete Survey Response Data ICPSR Summer Workshop at the University of Michigan June 29, 2015 July 3, 2015 Presented by: Dr. Jonathan Templin Department
More informationBy Hui Bian Office for Faculty Excellence
By Hui Bian Office for Faculty Excellence 1 Email: bianh@ecu.edu Phone: 328-5428 Location: 1001 Joyner Library, room 1006 Office hours: 8:00am-5:00pm, Monday-Friday 2 Educational tests and regular surveys
More informationUsing the Rasch Modeling for psychometrics examination of food security and acculturation surveys
Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys Jill F. Kilanowski, PhD, APRN,CPNP Associate Professor Alpha Zeta & Mu Chi Acknowledgements Dr. Li Lin,
More informationValidating Measures of Self Control via Rasch Measurement. Jonathan Hasford Department of Marketing, University of Kentucky
Validating Measures of Self Control via Rasch Measurement Jonathan Hasford Department of Marketing, University of Kentucky Kelly D. Bradley Department of Educational Policy Studies & Evaluation, University
More informationItem Response Theory. Steven P. Reise University of California, U.S.A. Unidimensional IRT Models for Dichotomous Item Responses
Item Response Theory Steven P. Reise University of California, U.S.A. Item response theory (IRT), or modern measurement theory, provides alternatives to classical test theory (CTT) methods for the construction,
More informationItem and response-category functioning of the Persian version of the KIDSCREEN-27: Rasch partial credit model
Jafari et al. Health and Quality of Life Outcomes 2012, 10:127 SHORT REPORT Open Access Item and response-category functioning of the Persian version of the KIDSCREEN-27: Rasch partial credit model Peyman
More informationPersonality Traits Effects on Job Satisfaction: The Role of Goal Commitment
Marshall University Marshall Digital Scholar Management Faculty Research Management, Marketing and MIS Fall 11-14-2009 Personality Traits Effects on Job Satisfaction: The Role of Goal Commitment Wai Kwan
More informationUsing Analytical and Psychometric Tools in Medium- and High-Stakes Environments
Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Greg Pope, Analytics and Psychometrics Manager 2008 Users Conference San Antonio Introduction and purpose of this session
More informationDoes factor indeterminacy matter in multi-dimensional item response theory?
ABSTRACT Paper 957-2017 Does factor indeterminacy matter in multi-dimensional item response theory? Chong Ho Yu, Ph.D., Azusa Pacific University This paper aims to illustrate proper applications of multi-dimensional
More informationUsing Item Response Theory to Evaluate Self-directed Learning Readiness Scale
Journal of Educational and Developmental Psychology; Vol. 8, No. ; 8 ISSN 97-56 E-ISSN 97-5 Published by Canadian Center of Science and Education Using Item Response Theory to Evaluate Self-directed Learning
More informationInvestigating the Invariance of Person Parameter Estimates Based on Classical Test and Item Response Theories
Kamla-Raj 010 Int J Edu Sci, (): 107-113 (010) Investigating the Invariance of Person Parameter Estimates Based on Classical Test and Item Response Theories O.O. Adedoyin Department of Educational Foundations,
More informationItem Response Theory (IRT): A Modern Statistical Theory for Solving Measurement Problem in 21st Century
International Journal of Scientific Research in Education, SEPTEMBER 2018, Vol. 11(3B), 627-635. Item Response Theory (IRT): A Modern Statistical Theory for Solving Measurement Problem in 21st Century
More informationTurning Output of Item Response Theory Data Analysis into Graphs with R
Overview Turning Output of Item Response Theory Data Analysis into Graphs with R Motivation Importance of graphing data Graphical methods for item response theory Why R? Two examples Ching-Fan Sheu, Cheng-Te
More informationCYRINUS B. ESSEN, IDAKA E. IDAKA AND MICHAEL A. METIBEMU. (Received 31, January 2017; Revision Accepted 13, April 2017)
DOI: http://dx.doi.org/10.4314/gjedr.v16i2.2 GLOBAL JOURNAL OF EDUCATIONAL RESEARCH VOL 16, 2017: 87-94 COPYRIGHT BACHUDO SCIENCE CO. LTD PRINTED IN NIGERIA. ISSN 1596-6224 www.globaljournalseries.com;
More informationAuthor s response to reviews
Author s response to reviews Title: The validity of a professional competence tool for physiotherapy students in simulationbased clinical education: a Rasch analysis Authors: Belinda Judd (belinda.judd@sydney.edu.au)
More informationPsychometrics in context: Test Construction with IRT. Professor John Rust University of Cambridge
Psychometrics in context: Test Construction with IRT Professor John Rust University of Cambridge Plan Guttman scaling Guttman errors and Loevinger s H statistic Non-parametric IRT Traces in Stata Parametric
More informationEvaluation of the Short-Form Health Survey (SF-36) Using the Rasch Model
American Journal of Public Health Research, 2015, Vol. 3, No. 4, 136-147 Available online at http://pubs.sciepub.com/ajphr/3/4/3 Science and Education Publishing DOI:10.12691/ajphr-3-4-3 Evaluation of
More informationContents. What is item analysis in general? Psy 427 Cal State Northridge Andrew Ainsworth, PhD
Psy 427 Cal State Northridge Andrew Ainsworth, PhD Contents Item Analysis in General Classical Test Theory Item Response Theory Basics Item Response Functions Item Information Functions Invariance IRT
More informationCHAPTER 7 RESEARCH DESIGN AND METHODOLOGY. This chapter addresses the research design and describes the research methodology
CHAPTER 7 RESEARCH DESIGN AND METHODOLOGY 7.1 Introduction This chapter addresses the research design and describes the research methodology employed in this study. The sample and sampling procedure is
More informationITEM RESPONSE THEORY ANALYSIS OF THE TOP LEADERSHIP DIRECTION SCALE
California State University, San Bernardino CSUSB ScholarWorks Electronic Theses, Projects, and Dissertations Office of Graduate Studies 6-2016 ITEM RESPONSE THEORY ANALYSIS OF THE TOP LEADERSHIP DIRECTION
More informationChapter 9. Youth Counseling Impact Scale (YCIS)
Chapter 9 Youth Counseling Impact Scale (YCIS) Background Purpose The Youth Counseling Impact Scale (YCIS) is a measure of perceived effectiveness of a specific counseling session. In general, measures
More informationCentre for Education Research and Policy
THE EFFECT OF SAMPLE SIZE ON ITEM PARAMETER ESTIMATION FOR THE PARTIAL CREDIT MODEL ABSTRACT Item Response Theory (IRT) models have been widely used to analyse test data and develop IRT-based tests. An
More informationChapter 2. Test Development Procedures: Psychometric Study and Analytical Plan
Chapter 2 Test Development Procedures: Psychometric Study and Analytical Plan Introduction The Peabody Treatment Progress Battery (PTPB) is designed to provide clinically relevant measures of key mental
More informationAndré Cyr and Alexander Davies
Item Response Theory and Latent variable modeling for surveys with complex sampling design The case of the National Longitudinal Survey of Children and Youth in Canada Background André Cyr and Alexander
More informationAssessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies. Xiaowen Zhu. Xi an Jiaotong University.
Running head: ASSESS MEASUREMENT INVARIANCE Assessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies Xiaowen Zhu Xi an Jiaotong University Yanjie Bian Xi an Jiaotong
More informationInfluences of IRT Item Attributes on Angoff Rater Judgments
Influences of IRT Item Attributes on Angoff Rater Judgments Christian Jones, M.A. CPS Human Resource Services Greg Hurt!, Ph.D. CSUS, Sacramento Angoff Method Assemble a panel of subject matter experts
More informationLikelihood Ratio Based Computerized Classification Testing. Nathan A. Thompson. Assessment Systems Corporation & University of Cincinnati.
Likelihood Ratio Based Computerized Classification Testing Nathan A. Thompson Assessment Systems Corporation & University of Cincinnati Shungwon Ro Kenexa Abstract An efficient method for making decisions
More informationPresented By: Yip, C.K., OT, PhD. School of Medical and Health Sciences, Tung Wah College
Presented By: Yip, C.K., OT, PhD. School of Medical and Health Sciences, Tung Wah College Background of problem in assessment for elderly Key feature of CCAS Structural Framework of CCAS Methodology Result
More informationNonparametric DIF. Bruno D. Zumbo and Petronilla M. Witarsa University of British Columbia
Nonparametric DIF Nonparametric IRT Methodology For Detecting DIF In Moderate-To-Small Scale Measurement: Operating Characteristics And A Comparison With The Mantel Haenszel Bruno D. Zumbo and Petronilla
More informationOak Meadow Autonomy Survey
Oak Meadow Autonomy Survey Patricia M. Meehan, Ph.D. August 7, 214 1 Contents Contents 3 List of Figures 3 List of Tables 3 1 Introduction 4 2 Data 4 3 Determining the Number of Factors 5 4 Proposed Model
More informationvalidscale: A Stata module to validate subjective measurement scales using Classical Test Theory
: A Stata module to validate subjective measurement scales using Classical Test Theory Bastien Perrot, Emmanuelle Bataille, Jean-Benoit Hardouin UMR INSERM U1246 - SPHERE "methods in Patient-centered outcomes
More informationScale Building with Confirmatory Factor Analysis
Scale Building with Confirmatory Factor Analysis Latent Trait Measurement and Structural Equation Models Lecture #7 February 27, 2013 PSYC 948: Lecture #7 Today s Class Scale building with confirmatory
More informationMaking a psychometric. Dr Benjamin Cowan- Lecture 9
Making a psychometric Dr Benjamin Cowan- Lecture 9 What this lecture will cover What is a questionnaire? Development of questionnaires Item development Scale options Scale reliability & validity Factor
More informationA Comparison of Traditional and IRT based Item Quality Criteria
A Comparison of Traditional and IRT based Item Quality Criteria Brian D. Bontempo, Ph.D. Mountain ment, Inc. Jerry Gorham, Ph.D. Pearson VUE April 7, 2006 A paper presented at the Annual Meeting of the
More informationValidation the Measures of Self-Directed Learning: Evidence from Confirmatory Factor Analysis and Multidimensional Item Response Analysis
Doi:10.5901/mjss.2015.v6n4p579 Abstract Validation the Measures of Self-Directed Learning: Evidence from Confirmatory Factor Analysis and Multidimensional Item Response Analysis Chaiwichit Chianchana Faculty
More informationCollege Student Self-Assessment Survey (CSSAS)
13 College Student Self-Assessment Survey (CSSAS) Development of College Student Self Assessment Survey (CSSAS) The collection and analysis of student achievement indicator data are of primary importance
More informationOn indirect measurement of health based on survey data. Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state
On indirect measurement of health based on survey data Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state A scaling model: P(Y 1,..,Y k ;α, ) α = item difficulties
More informationASSESSING THE UNIDIMENSIONALITY, RELIABILITY, VALIDITY AND FITNESS OF INFLUENTIAL FACTORS OF 8 TH GRADES STUDENT S MATHEMATICS ACHIEVEMENT IN MALAYSIA
1 International Journal of Advance Research, IJOAR.org Volume 1, Issue 2, MAY 2013, Online: ASSESSING THE UNIDIMENSIONALITY, RELIABILITY, VALIDITY AND FITNESS OF INFLUENTIAL FACTORS OF 8 TH GRADES STUDENT
More informationREPORT. Technical Report: Item Characteristics. Jessica Masters
August 2010 REPORT Diagnostic Geometry Assessment Project Technical Report: Item Characteristics Jessica Masters Technology and Assessment Study Collaborative Lynch School of Education Boston College Chestnut
More informationItem Analysis: Classical and Beyond
Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Modified for EPE/EDP 711 by Kelly Bradley on January 8, 2013 Why is item analysis relevant? Item analysis provides
More informationAdaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida
Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida and Oleksandr S. Chernyshenko University of Canterbury Presented at the New CAT Models
More informationConnexion of Item Response Theory to Decision Making in Chess. Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan
Connexion of Item Response Theory to Decision Making in Chess Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan Acknowledgement A few Slides have been taken from the following presentation
More informationMeasurement Invariance (MI): a general overview
Measurement Invariance (MI): a general overview Eric Duku Offord Centre for Child Studies 21 January 2015 Plan Background What is Measurement Invariance Methodology to test MI Challenges with post-hoc
More informationFactor Analysis. MERMAID Series 12/11. Galen E. Switzer, PhD Rachel Hess, MD, MS
Factor Analysis MERMAID Series 2/ Galen E Switzer, PhD Rachel Hess, MD, MS Ways to Examine Groups of Things Groups of People Groups of Indicators Cluster Analysis Exploratory Factor Analysis Latent Class
More informationThe Influence of Psychological Empowerment on Innovative Work Behavior among Academia in Malaysian Research Universities
DOI: 10.7763/IPEDR. 2014. V 78. 21 The Influence of Psychological Empowerment on Innovative Work Behavior among Academia in Malaysian Research Universities Azra Ayue Abdul Rahman 1, Siti Aisyah Panatik
More informationA Comparison of Several Goodness-of-Fit Statistics
A Comparison of Several Goodness-of-Fit Statistics Robert L. McKinley The University of Toledo Craig N. Mills Educational Testing Service A study was conducted to evaluate four goodnessof-fit procedures
More informationlinking in educational measurement: Taking differential motivation into account 1
Selecting a data collection design for linking in educational measurement: Taking differential motivation into account 1 Abstract In educational measurement, multiple test forms are often constructed to
More informationRasch-scaling of PHQ-9 and GAD-7 Consequences for repeated assessments. Jan R. Böhnke & Jaime Delgadillo
Rasch-scaling of PHQ-9 and GAD-7 Consequences for repeated assessments Jan R. Böhnke & Jaime Delgadillo NHS::IAPT Improving Access to Psychological Therapies "Improving Access to Psychological Therapies
More informationRunning head: PRELIM KSVS SCALES 1
Running head: PRELIM KSVS SCALES 1 Psychometric Examination of a Risk Perception Scale for Evaluation Anthony P. Setari*, Kelly D. Bradley*, Marjorie L. Stanek**, & Shannon O. Sampson* *University of Kentucky
More informationThe Modification of Dichotomous and Polytomous Item Response Theory to Structural Equation Modeling Analysis
Canadian Social Science Vol. 8, No. 5, 2012, pp. 71-78 DOI:10.3968/j.css.1923669720120805.1148 ISSN 1712-8056[Print] ISSN 1923-6697[Online] www.cscanada.net www.cscanada.org The Modification of Dichotomous
More informationEvaluating and restructuring a new faculty survey: Measuring perceptions related to research, service, and teaching
Evaluating and restructuring a new faculty survey: Measuring perceptions related to research, service, and teaching Kelly D. Bradley 1, Linda Worley, Jessica D. Cunningham, and Jeffery P. Bieber University
More informationA Rasch Analysis of the Statistical Anxiety Rating Scale
University of Wyoming From the SelectedWorks of Eric D Teman, J.D., Ph.D. 2013 A Rasch Analysis of the Statistical Anxiety Rating Scale Eric D Teman, Ph.D., University of Wyoming Available at: https://works.bepress.com/ericteman/5/
More informationThe Development of Scales to Measure QISA s Three Guiding Principles of Student Aspirations Using the My Voice TM Survey
The Development of Scales to Measure QISA s Three Guiding Principles of Student Aspirations Using the My Voice TM Survey Matthew J. Bundick, Ph.D. Director of Research February 2011 The Development of
More informationAlternative Methods for Assessing the Fit of Structural Equation Models in Developmental Research
Alternative Methods for Assessing the Fit of Structural Equation Models in Developmental Research Michael T. Willoughby, B.S. & Patrick J. Curran, Ph.D. Duke University Abstract Structural Equation Modeling
More informationTHE USE OF CRONBACH ALPHA RELIABILITY ESTIMATE IN RESEARCH AMONG STUDENTS IN PUBLIC UNIVERSITIES IN GHANA.
Africa Journal of Teacher Education ISSN 1916-7822. A Journal of Spread Corporation Vol. 6 No. 1 2017 Pages 56-64 THE USE OF CRONBACH ALPHA RELIABILITY ESTIMATE IN RESEARCH AMONG STUDENTS IN PUBLIC UNIVERSITIES
More informationTesting the Multiple Intelligences Theory in Oman
Available online at www.sciencedirect.com ScienceDirect Procedia - Social and Behavioral Sciences 190 ( 2015 ) 106 112 2nd GLOBAL CONFERENCE on PSYCHOLOGY RESEARCHES, 28-29, November 2014 Testing the Multiple
More informationSurvey Sampling Weights and Item Response Parameter Estimation
Survey Sampling Weights and Item Response Parameter Estimation Spring 2014 Survey Methodology Simmons School of Education and Human Development Center on Research & Evaluation Paul Yovanoff, Ph.D. Department
More informationText-based Document. Psychometric Evaluation of the Diabetes Self-Management Instrument-Short Form (DSMI-20) Lin, Chiu-Chu; Lee, Chia-Lun
The Henderson Repository is a free resource of the Honor Society of Nursing, Sigma Theta Tau International. It is dedicated to the dissemination of nursing research, researchrelated, and evidence-based
More informationUSE OF DIFFERENTIAL ITEM FUNCTIONING (DIF) ANALYSIS FOR BIAS ANALYSIS IN TEST CONSTRUCTION
USE OF DIFFERENTIAL ITEM FUNCTIONING (DIF) ANALYSIS FOR BIAS ANALYSIS IN TEST CONSTRUCTION Iweka Fidelis (Ph.D) Department of Educational Psychology, Guidance and Counselling, University of Port Harcourt,
More informationConstruct Invariance of the Survey of Knowledge of Internet Risk and Internet Behavior Knowledge Scale
University of Connecticut DigitalCommons@UConn NERA Conference Proceedings 2010 Northeastern Educational Research Association (NERA) Annual Conference Fall 10-20-2010 Construct Invariance of the Survey
More informationPsychometrics for Beginners. Lawrence J. Fabrey, PhD Applied Measurement Professionals
Psychometrics for Beginners Lawrence J. Fabrey, PhD Applied Measurement Professionals Learning Objectives Identify key NCCA Accreditation requirements Identify two underlying models of measurement Describe
More informationManifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement Invariance Tests Of Multi-Group Confirmatory Factor Analyses
Journal of Modern Applied Statistical Methods Copyright 2005 JMASM, Inc. May, 2005, Vol. 4, No.1, 275-282 1538 9472/05/$95.00 Manifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement
More informationA PRELIMINARY EXAMINATION OF THE ICAR PROGRESSIVE MATRICES TEST OF INTELLIGENCE
A PRELIMINARY EXAMINATION OF THE ICAR PROGRESSIVE MATRICES TEST OF INTELLIGENCE Jovana Jankovski 1,2, Ivana Zečević 2 & Siniša Subotić 3 2 Faculty of Philosophy, University of Banja Luka 3 University of
More informationThe revised short-form of the Eating Beliefs Questionnaire: Measuring positive, negative, and permissive beliefs about binge eating
Burton and Abbott Journal of Eating Disorders (2018) 6:37 https://doi.org/10.1186/s40337-018-0224-0 RESEARCH ARTICLE Open Access The revised short-form of the Eating Beliefs Questionnaire: Measuring positive,
More informationEVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS
DePaul University INTRODUCTION TO ITEM ANALYSIS: EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS Ivan Hernandez, PhD OVERVIEW What is Item Analysis? Overview Benefits of Item Analysis Applications Main
More informationRUNNING HEAD: EVALUATING SCIENCE STUDENT ASSESSMENT. Evaluating and Restructuring Science Assessments: An Example Measuring Student s
RUNNING HEAD: EVALUATING SCIENCE STUDENT ASSESSMENT Evaluating and Restructuring Science Assessments: An Example Measuring Student s Conceptual Understanding of Heat Kelly D. Bradley, Jessica D. Cunningham
More informationTHE PROCESS OF MENTAL HEALTH RECOVERY/RESILIENCY IN CHILDREN AND ADOLESCENTS
THE PROCESS OF MENTAL HEALTH RECOVERY/RESILIENCY IN CHILDREN AND ADOLESCENTS Kate DeRoche Erica Gosselin Antonio Olmos Riley Rhodes Mental Health Center of Denver Presented at the American Evaluation Association,
More informationAnumber of studies have shown that ignorance regarding fundamental measurement
10.1177/0013164406288165 Educational Graham / Congeneric and Psychological Reliability Measurement Congeneric and (Essentially) Tau-Equivalent Estimates of Score Reliability What They Are and How to Use
More informationBuilding Evaluation Scales for NLP using Item Response Theory
Building Evaluation Scales for NLP using Item Response Theory John Lalor CICS, UMass Amherst Joint work with Hao Wu (BC) and Hong Yu (UMMS) Motivation Evaluation metrics for NLP have been mostly unchanged
More informationDetecting Suspect Examinees: An Application of Differential Person Functioning Analysis. Russell W. Smith Susan L. Davis-Becker
Detecting Suspect Examinees: An Application of Differential Person Functioning Analysis Russell W. Smith Susan L. Davis-Becker Alpine Testing Solutions Paper presented at the annual conference of the National
More informationWorld Academy of Science, Engineering and Technology International Journal of Psychological and Behavioral Sciences Vol:8, No:1, 2014
Validity and Reliability of Competency Assessment Implementation (CAI) Instrument Using Rasch Model Nurfirdawati Muhamad Hanafi, Azmanirah Ab Rahman, Marina Ibrahim Mukhtar, Jamil Ahmad, Sarebah Warman
More informationDevelopment of the Mental, Emotional, and Bodily Toughness Inventory in Collegiate Athletes and Nonathletes
Journal of Athletic Training 2008;43(2):125 132 g by the National Athletic Trainers Association, Inc www.nata.org/jat original research Development of the Mental, Emotional, and Bodily Toughness Inventory
More informationTHE PSYCHOMETRIC VALIDATION OF THE PRINCIPAL PRACTICES QUESTIONNAIRE BASED ON ITEM RESPONSE THEORY
THE PSYCHOMETRIC VALIDATION OF THE PRINCIPAL PRACTICES QUESTIONNAIRE BASED ON ITEM RESPONSE THEORY Corinne Jacqueline Perera *, Bambang Sumintono a, Jiang Na b a Institute of Educational Leadership, Faculty
More informationThe State COPAS Scales: Development, Validation, and Confirmatory Analyses. Cathy Lawrenz, Rod Van Whitlock, Bernard Lubin
State COPAS Scales 1 Running Head: STATE COPAS SCALES The State COPAS Scales: Development, Validation, and Confirmatory Analyses Cathy Lawrenz, Rod Van Whitlock, Bernard Lubin University of Missouri-Kansas
More informationInstrument Validation Study
Instrument Validation Study REGARDING LEADERSHIP CIRCLE PROFILE By Industrial Psychology Department Bowling Green State University INSTRUMENT VALIDATION STUDY EXECUTIVE SUMMARY AND RESPONSE TO THE RECOMMENDATIONS
More informationFundamental Concepts for Using Diagnostic Classification Models. Section #2 NCME 2016 Training Session. NCME 2016 Training Session: Section 2
Fundamental Concepts for Using Diagnostic Classification Models Section #2 NCME 2016 Training Session NCME 2016 Training Session: Section 2 Lecture Overview Nature of attributes What s in a name? Grain
More informationMarc J. Tassé, PhD Nisonger Center UCEDD
FINALLY... AN ADAPTIVE BEHAVIOR SCALE FOCUSED ON PROVIDING PRECISION AT THE DIAGNOSTIC CUT-OFF. How Item Response Theory Contributed to the Development of the DABS Marc J. Tassé, PhD UCEDD The Ohio State
More informationExploratory Factor Analysis Student Anxiety Questionnaire on Statistics
Proceedings of Ahmad Dahlan International Conference on Mathematics and Mathematics Education Universitas Ahmad Dahlan, Yogyakarta, 13-14 October 2017 Exploratory Factor Analysis Student Anxiety Questionnaire
More informationJournal of Educational and Psychological Studies - Sultan Qaboos University (Pages ) Vol.7 Issue
Journal of Educational and Psychological Studies - Sultan Qaboos University (Pages 537-548) Vol.7 Issue 4 2013 Constructing a Scale of Attitudes toward School Science Using the General Graded Unfolding
More informationModel fit and robustness? - A critical look at the foundation of the PISA project
Model fit and robustness? - A critical look at the foundation of the PISA project Svend Kreiner, Dept. of Biostatistics, Univ. of Copenhagen TOC The PISA project and PISA data PISA methodology Rasch item
More informationMULTIPLE-CHOICE ITEMS ANALYSIS USING CLASSICAL TEST THEORY AND RASCH MEASUREMENT MODEL
Man In India, 96 (1-2) : 173-181 Serials Publications MULTIPLE-CHOICE ITEMS ANALYSIS USING CLASSICAL TEST THEORY AND RASCH MEASUREMENT MODEL Adibah Binti Abd Latif 1*, Ibnatul Jalilah Yusof 1, Nor Fadila
More informationUnderstanding Social Norms, Enjoyment, and the Moderating Effect of Gender on E-Commerce Adoption
Association for Information Systems AIS Electronic Library (AISeL) SAIS 2010 Proceedings Southern (SAIS) 3-1-2010 Understanding Social Norms, Enjoyment, and the Moderating Effect of Gender on E-Commerce
More informationStudents' perceived understanding and competency in probability concepts in an e- learning environment: An Australian experience
University of Wollongong Research Online Faculty of Engineering and Information Sciences - Papers: Part A Faculty of Engineering and Information Sciences 2016 Students' perceived understanding and competency
More informationEmpowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison
Empowered by Psychometrics The Fundamentals of Psychometrics Jim Wollack University of Wisconsin Madison Psycho-what? Psychometrics is the field of study concerned with the measurement of mental and psychological
More informationSchool Administrators Level of Self-Esteem and its Relationship To Their Trust in Teachers. Mualla Aksu, Soner Polat, & Türkan Aksu
School Administrators Level of Self-Esteem and its Relationship To Their Trust in Teachers Mualla Aksu, Soner Polat, & Türkan Aksu What is Self-Esteem? Confidence in one s own worth or abilities (http://www.oxforddictionaries.com/definition/english/self-esteem)
More informationPUBLIC KNOWLEDGE AND ATTITUDES SCALE CONSTRUCTION: DEVELOPMENT OF SHORT FORMS
PUBLIC KNOWLEDGE AND ATTITUDES SCALE CONSTRUCTION: DEVELOPMENT OF SHORT FORMS Prepared for: Robert K. Bell, Ph.D. National Science Foundation Division of Science Resources Studies 4201 Wilson Blvd. Arlington,
More informationComparison of Computerized Adaptive Testing and Classical Methods for Measuring Individual Change
Comparison of Computerized Adaptive Testing and Classical Methods for Measuring Individual Change Gyenam Kim Kang Korea Nazarene University David J. Weiss University of Minnesota Presented at the Item
More informationINTRODUCTION TO ITEM RESPONSE THEORY APPLIED TO FOOD SECURITY MEASUREMENT. Basic Concepts, Parameters and Statistics
INTRODUCTION TO ITEM RESPONSE THEORY APPLIED TO FOOD SECURITY MEASUREMENT Basic Concepts, Parameters and Statistics The designations employed and the presentation of material in this information product
More informationDetermining Differential Item Functioning in Mathematics Word Problems Using Item Response Theory
Determining Differential Item Functioning in Mathematics Word Problems Using Item Response Theory Teodora M. Salubayba St. Scholastica s College-Manila dory41@yahoo.com Abstract Mathematics word-problem
More informationRunning head: CFA OF STICSA 1. Model-Based Factor Reliability and Replicability of the STICSA
Running head: CFA OF STICSA 1 Model-Based Factor Reliability and Replicability of the STICSA The State-Trait Inventory of Cognitive and Somatic Anxiety (STICSA; Ree et al., 2008) is a new measure of anxiety
More informationResearch Brief Reliability of the Static Risk Offender Need Guide for Recidivism (STRONG-R)
Research Brief Reliability of the Static Risk Offender Need Guide for Recidivism (STRONG-R) Xiaohan Mei, M.A. Zachary Hamilton, Ph.D. Washington State University 1 Reliability/Internal Consistency of STRONG-R
More information