Techniques for Explaining Item Response Theory to Stakeholder

Size: px
Start display at page:

Download "Techniques for Explaining Item Response Theory to Stakeholder"

Transcription

1 Techniques for Explaining Item Response Theory to Stakeholder Kate DeRoche Antonio Olmos C.J. Mckinney Mental Health Center of Denver Presented on March 23, 2007 at the Eastern Evaluation Research Society Conference in Absecon, NJ

2 Overview of Presentation Current issues with applying IRT in evaluation Explaining the reasoning for using IRT in evaluation Explaining and involving stakeholder in the IRT analysis Explaing IRT to Stakeholder 2

3 IRT in Evaluation There has been an increase in the application of IRT in evaluation Due to the advantages it provides (Hambleton, Swaminathan, Roger, 1991) Multiple applications of IRT for evaluation purposes: Psychometrics-measurement validation Need to shorten a measurement tool Rank elements of a dimension for difficulty Equating instruments Differential Item Functioning (DIF) Explaing IRT to Stakeholder 3

4 Limitation to applying IRT in Evaluation: Less training in measurement as evaluators/researchers Can teach IRT in a class but difficult to explain to people who do not understand psychometrics and advanced statistics. PROBLEM: It is difficult for evaluators to apply Item Response Theory techniques because of the problems related to: 1. Explaining the advantages of IRT in simple terms 2. Explaining the results of IRT so that stakeholders can be involved in the analysis process Explaing IRT to Stakeholder 4

5 Explaining the Reasoning for Using IRT Why do we want to use a more complex method?

6 Explaining Benefits of IRT Hallmark: Separation of item and person parameters (or item/person invariance) Instead, There are 2 things: 1. Are all of our items equal? Should they all equally contribute to our score or are some questions harder than other? 2. Test created with Classical Test Thoery can be very reliable (i.e. very consistent for a person to score the same, if measured twice) and not measure all of our participants well Explaing IRT to Stakeholder 6

7 Are Items Equal? Example for assumptions of CTT: Question 1: Question 2: I feel sad often I think about suicide often Question 3: I sleep more than usual Question 4: I want to be alone more than usual 1 point 1 point 1 point 1 point Example for assumptions of IRT: Question 1: Question 2: I feel sad often 1/2 point I think about suicide often Question 3: I sleep more than usual 3/4 point Question 4: I want to be alone more than usual 1 1/4 point 1 1/2 points Explaing IRT to Stakeholder 7

8 True Depression More Depressed Test level on Trait MMPI- Depression Section Beck s Depression Inventory Could create a measure and depending on who you give it to, it may be created for a specific population MMPI measures a more depressed sample better Beck s measure a less depressed sample better Less Depressed With out IRT would not know the level of the trait that we are measuring (AKA. Which measure is harder) Explaing IRT to Stakeholder 8

9 Explaining IRT results to Stakeholder You can use IRT analysis and involve stakeholder!!!

10 IQ Hard 4.0 Response- G (Super-dupper smart)-(3.0) Scaling Example with IQ Assuming an order that A is a very easy item (i.e. what is your name?) and G is a very hard question (what does floccinaucinihillipilification mean?) It is assumed that if you get a higher response correct, then you got all of the below responses correct, even though you may or may not have been tested for the items below (i.e. if you get D correct, then you got C, B, & A correct). The higher the response you get correct, the higher score you receive. If you get G correct, then your IQ score is 145 Difficulty of Response Response- F (Super-Smart)-(2.0) Response- E (Smart)-(1.0) Response- D (average)-(0.0) Response- C (Below average)-(-1.0) Response- B (border-line mental retardation)-(-2.0) If you get F correct, then your IQ score is 130 If you get E correct, then your IQ score is 115 If you get D correct, then your IQ score is 100 If you get C correct, then your IQ score is 85 If you get B correct, then your IQ score is 70 Response-A (mentally retarded)-(-3.0) If you get A correct, then your IQ score is 55 Easy -4.0 Ideally, this is what a scale should look like, with even intervals of potential responses across the scale. Explaing IRT to Stakeholder 10

11 Problem 1: Clumping The items are clumping together (i.e. E, D & C). Hard 4.0 Example 2 Response- G (Super-dupper smart) (3.0) IQ 145 Response- F (Super-Smart)- (2.8) IQ 142 This means that responses E, D, & C are basically measuring the same thing, and only discriminate between 106 to 102 and 102 to 97. We could remove D, E, or C, but this should be determined based on context and other IRT outcomes. WE DO NOT WANT CLUMPS OF RESPONSES Difficulty of Response Easy Response- E (Smart) (.4) IQ 106 Response- D (average) (.1) IQ 102 Response- C (Below average) (-.2) IQ 97 Response- B (border-line mental retardation) Response-A (mentally retarded) Explaing IRT to Stakeholder 11

12 Problem #2: Full Range Hard Difficulty of Response Easy Example Response- F (Super-Smart) (.5) IQ 107 Response- G (Super-dupper smart)- (0) IQ 100 Response- E (Smart) (-.5) IQ 93 Response- D (average) (-1.25) IQ 81 Response- C (Below average) (-2) IQ 70 Response- B (border-line mental retardation) (-2.8) IQ 58 Response-A (mentally retarded) (-3.5) IQ 48 The scale does not contain any responses above (+0.5) suggesting that the highest IQ we can measure is 107. People with IQ score higher than 107 (i.e. 130) would only be able to know that they have an IQ above 107. THE RESPONSES NEED TO RANGE FROM +3 TO -3 Explaing IRT to Stakeholder 12

13 Problem #3: Ordering Notice that the order of easy to hard goes A,B,C, D, E, G, then F, suggesting that F and G are out of order. A response of what we think is hard (only for supper-duper smart people) is really not that hard and will only produce an IQ score of 100, not 145 as assumed. THE RESPONSES NEED TO BE IN CORRECT ORDER Hard Difficulty of Response Easy Example Response- F (Super-Smart) (.5) IQ 107 Response- G (Super-dupper smart)- (0) IQ 100 Response- E (Smart) (-.5) IQ 93 Response- D (average) (-1.25) IQ 81 Response- C (Below average) (-2) IQ 70 Response- B (border-line mental retardation) (-2.8) IQ 58 Response-A (mentally retarded) (-3.5) IQ 48 Explaing IRT to Stakeholder 13

14 Problem #4: Large Gaps Hard 4.0 Example 2 Response- G (Super-dupper smart) (3.0) IQ 145 Response- F (Super-Smart)- (2.8) IQ 142 Again we see gaps, but they are within the scale (not just at the top or bottom). Difficulty of Response Response- E (Smart) (.4) IQ 106 Response- D (average) (.1) IQ 102 Response- C (Below average) (-.2) IQ 97 This suggest that there are no items able to measure an IQ score between 141 to 107, and between 96 and 56 people are not able to receive score in this area Response-A (mentally retarded) WE CANNOT HAVE LARGE GAPS Explaing IRT to Stakeholder 14

15 Activity with Stakeholder Keep in mind the 4 problems presented (C.R.O.G.)- Clumping, Range, Order & Gaps Present the results of the IRT item map Have problems listed with potential reason and solutions Should have also previously reviewed other IRT output for the software program (ICC s, Information functions, a, b, or c parameters, infit, outfit, etc ) Stakeholders were able interpret the results in terms of their program Added context to the results Most importantly, stakeholders felt that they were involved in the process Explaing IRT to Stakeholder 15

16 How to Present the Results Two ways the item map was presented 1. Display only the current item map Beneficial for individuals familiar with looking at data or graphs Beneficial for long measures when viewing individual items 2. Present the current item map and ideal item map, with suggestions and explanations at the bottom Beneficial for individuals not as familiar with data, graphs, or have number anxiety Ideal for short measure when viewing individual items because it takes up a lot of room Explaing IRT to Stakeholder 16

17 Example 1: Current Item Map Hard Employment Education Active/Growth C- Active Job Search (3.29) F- Full time college (4.77) E- Part time college (.3.66) 4.0 F- Very high (3.41) D- Non-paid work/volunteer (2.64) Difficulty of Item H- Full time independent (1.59) G- Part time independent (.76) E- Part time supported (-.37) F- Full time supported (-1.22) B- Interest in work, no action (-1.99) A- No interest in work (-2.71) D- noncredit training (-.10) G- Recent Grad (-.20) C- Active education/training search (-1.00) B-interest in education, no action (-1.05) A- No interest in education (-2.82) E- High (-.75) D- Moderate out MH system (-1.62) B- Low (-2.28) C- Moderate in MH system (-2.37) Easy A- Very low (-3.88) Explaing IRT to Stakeholder 17

18 Example 2: Current & Ideal Item Maps 4.0 Current Status: F- Very high (3.41) Active/Growth 4.0 Ideal: F- Very high (3.41) High (??) E- High (-.75) E- Moderate/High (-.75) -2.0 D- Moderate out MH system (-1.62) B- Low (-2.28) C- Moderate in MH system (-2.37) -2.0 D- Moderate/low (-1.62) B & C- Low (-2.28 to -2.37) -4.0 A- Very low (-3.88) Problems: #2 Ordering, #3 Clustering, & #4 Clumping Suggestion: 1. Combined low and moderate in MH systems, because their definitions are very similar, and clinicians can not differentiate between the 2 2. Since, B & C are combined and called low, then moderate out of MH should be call moderate low and high should be called moderate high 3. There is a very large gap between high and very high, when looking at their definition the only difference is participating in activates that are other focused in very high, and self-focused in high. Therefore, in terms of recovery, moving from self to other focused activities is a large step, and there should be other levels added in. Therefore, Explaing a new IRT response to Stakeholder should be added that includes participating in actives that are 18 approximately 50% self-focused and 50% other focuses, called high. Any suggestions regarding this issues? -4.0 A- Very low (-3.88)

19 Applications in Other Domains Any commonly understood test that has a normed mean and standard deviation Education- standard test scores SAT, GRE, K-12 state testing, t-scores Health- any biological measure heart rate, blood pressure, blood sugar Explaing IRT to Stakeholder 19

20 Questions??? For future questions or comments: Kate DeRoche (303) Explaing IRT to Stakeholder 20

21 IRT Resources IRT 101 Reise, S. P., Ainsworth, A. T. & Haviland, M. G. (2005). Item Response Theory: Fundamentals, Applications, and Promise in Psychological Research. Current Directions in Psychological Science, 14, IRT Books Hambleton, R. K., Swaminathan, H. & Rogers, H. J. (1991). Fundamentals of Item Response Theory. Newbury Park, CA: Sage Publication, Inc. Embretson, S. E. & Reise, S. P. (2000). Item Response Theory for Psychologists. Mahwah, NJ: Lawrence Erlbaum Associates And many more resources Explaing IRT to Stakeholder 21

The Psychometric Development Process of Recovery Measures and Markers: Classical Test Theory and Item Response Theory

The Psychometric Development Process of Recovery Measures and Markers: Classical Test Theory and Item Response Theory The Psychometric Development Process of Recovery Measures and Markers: Classical Test Theory and Item Response Theory Kate DeRoche, M.A. Mental Health Center of Denver Antonio Olmos, Ph.D. Mental Health

More information

The MHSIP: A Tale of Three Centers

The MHSIP: A Tale of Three Centers The MHSIP: A Tale of Three Centers P. Antonio Olmos-Gallo, Ph.D. Kathryn DeRoche, M.A. Mental Health Center of Denver Richard Swanson, Ph.D., J.D. Aurora Research Institute John Mahalik, Ph.D., M.P.A.

More information

Investigating the Invariance of Person Parameter Estimates Based on Classical Test and Item Response Theories

Investigating the Invariance of Person Parameter Estimates Based on Classical Test and Item Response Theories Kamla-Raj 010 Int J Edu Sci, (): 107-113 (010) Investigating the Invariance of Person Parameter Estimates Based on Classical Test and Item Response Theories O.O. Adedoyin Department of Educational Foundations,

More information

Contents. What is item analysis in general? Psy 427 Cal State Northridge Andrew Ainsworth, PhD

Contents. What is item analysis in general? Psy 427 Cal State Northridge Andrew Ainsworth, PhD Psy 427 Cal State Northridge Andrew Ainsworth, PhD Contents Item Analysis in General Classical Test Theory Item Response Theory Basics Item Response Functions Item Information Functions Invariance IRT

More information

References. Embretson, S. E. & Reise, S. P. (2000). Item response theory for psychologists. Mahwah,

References. Embretson, S. E. & Reise, S. P. (2000). Item response theory for psychologists. Mahwah, The Western Aphasia Battery (WAB) (Kertesz, 1982) is used to classify aphasia by classical type, measure overall severity, and measure change over time. Despite its near-ubiquitousness, it has significant

More information

Multilevel Techniques for Quality Control Charts of Recovery Outcomes

Multilevel Techniques for Quality Control Charts of Recovery Outcomes Multilevel Techniques for Quality Control Charts of Recovery Outcomes INFORMS Annual Meeting 2009 San Diego, CA October, 11 th, 2009 Linda Laganga, PhD* (Linda.Laganga@mhcd.org) CJ McKinney, MA Kate DeRoche,

More information

Determining Differential Item Functioning in Mathematics Word Problems Using Item Response Theory

Determining Differential Item Functioning in Mathematics Word Problems Using Item Response Theory Determining Differential Item Functioning in Mathematics Word Problems Using Item Response Theory Teodora M. Salubayba St. Scholastica s College-Manila dory41@yahoo.com Abstract Mathematics word-problem

More information

Validating Measures of Self Control via Rasch Measurement. Jonathan Hasford Department of Marketing, University of Kentucky

Validating Measures of Self Control via Rasch Measurement. Jonathan Hasford Department of Marketing, University of Kentucky Validating Measures of Self Control via Rasch Measurement Jonathan Hasford Department of Marketing, University of Kentucky Kelly D. Bradley Department of Educational Policy Studies & Evaluation, University

More information

Item Response Theory: Methods for the Analysis of Discrete Survey Response Data

Item Response Theory: Methods for the Analysis of Discrete Survey Response Data Item Response Theory: Methods for the Analysis of Discrete Survey Response Data ICPSR Summer Workshop at the University of Michigan June 29, 2015 July 3, 2015 Presented by: Dr. Jonathan Templin Department

More information

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Greg Pope, Analytics and Psychometrics Manager 2008 Users Conference San Antonio Introduction and purpose of this session

More information

Comprehensive Statistical Analysis of a Mathematics Placement Test

Comprehensive Statistical Analysis of a Mathematics Placement Test Comprehensive Statistical Analysis of a Mathematics Placement Test Robert J. Hall Department of Educational Psychology Texas A&M University, USA (bobhall@tamu.edu) Eunju Jung Department of Educational

More information

Construct Validity of Mathematics Test Items Using the Rasch Model

Construct Validity of Mathematics Test Items Using the Rasch Model Construct Validity of Mathematics Test Items Using the Rasch Model ALIYU, R.TAIWO Department of Guidance and Counselling (Measurement and Evaluation Units) Faculty of Education, Delta State University,

More information

USE OF DIFFERENTIAL ITEM FUNCTIONING (DIF) ANALYSIS FOR BIAS ANALYSIS IN TEST CONSTRUCTION

USE OF DIFFERENTIAL ITEM FUNCTIONING (DIF) ANALYSIS FOR BIAS ANALYSIS IN TEST CONSTRUCTION USE OF DIFFERENTIAL ITEM FUNCTIONING (DIF) ANALYSIS FOR BIAS ANALYSIS IN TEST CONSTRUCTION Iweka Fidelis (Ph.D) Department of Educational Psychology, Guidance and Counselling, University of Port Harcourt,

More information

Proceedings of the 2011 International Conference on Teaching, Learning and Change (c) International Association for Teaching and Learning (IATEL)

Proceedings of the 2011 International Conference on Teaching, Learning and Change (c) International Association for Teaching and Learning (IATEL) EVALUATION OF MATHEMATICS ACHIEVEMENT TEST: A COMPARISON BETWEEN CLASSICAL TEST THEORY (CTT)AND ITEM RESPONSE THEORY (IRT) Eluwa, O. Idowu 1, Akubuike N. Eluwa 2 and Bekom K. Abang 3 1& 3 Dept of Educational

More information

Item Response Theory (IRT): A Modern Statistical Theory for Solving Measurement Problem in 21st Century

Item Response Theory (IRT): A Modern Statistical Theory for Solving Measurement Problem in 21st Century International Journal of Scientific Research in Education, SEPTEMBER 2018, Vol. 11(3B), 627-635. Item Response Theory (IRT): A Modern Statistical Theory for Solving Measurement Problem in 21st Century

More information

Survey Sampling Weights and Item Response Parameter Estimation

Survey Sampling Weights and Item Response Parameter Estimation Survey Sampling Weights and Item Response Parameter Estimation Spring 2014 Survey Methodology Simmons School of Education and Human Development Center on Research & Evaluation Paul Yovanoff, Ph.D. Department

More information

Does factor indeterminacy matter in multi-dimensional item response theory?

Does factor indeterminacy matter in multi-dimensional item response theory? ABSTRACT Paper 957-2017 Does factor indeterminacy matter in multi-dimensional item response theory? Chong Ho Yu, Ph.D., Azusa Pacific University This paper aims to illustrate proper applications of multi-dimensional

More information

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison Empowered by Psychometrics The Fundamentals of Psychometrics Jim Wollack University of Wisconsin Madison Psycho-what? Psychometrics is the field of study concerned with the measurement of mental and psychological

More information

Item Response Theory. Steven P. Reise University of California, U.S.A. Unidimensional IRT Models for Dichotomous Item Responses

Item Response Theory. Steven P. Reise University of California, U.S.A. Unidimensional IRT Models for Dichotomous Item Responses Item Response Theory Steven P. Reise University of California, U.S.A. Item response theory (IRT), or modern measurement theory, provides alternatives to classical test theory (CTT) methods for the construction,

More information

Item and response-category functioning of the Persian version of the KIDSCREEN-27: Rasch partial credit model

Item and response-category functioning of the Persian version of the KIDSCREEN-27: Rasch partial credit model Jafari et al. Health and Quality of Life Outcomes 2012, 10:127 SHORT REPORT Open Access Item and response-category functioning of the Persian version of the KIDSCREEN-27: Rasch partial credit model Peyman

More information

Psychometrics for Beginners. Lawrence J. Fabrey, PhD Applied Measurement Professionals

Psychometrics for Beginners. Lawrence J. Fabrey, PhD Applied Measurement Professionals Psychometrics for Beginners Lawrence J. Fabrey, PhD Applied Measurement Professionals Learning Objectives Identify key NCCA Accreditation requirements Identify two underlying models of measurement Describe

More information

Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida

Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida and Oleksandr S. Chernyshenko University of Canterbury Presented at the New CAT Models

More information

Validation of an Analytic Rating Scale for Writing: A Rasch Modeling Approach

Validation of an Analytic Rating Scale for Writing: A Rasch Modeling Approach Tabaran Institute of Higher Education ISSN 2251-7324 Iranian Journal of Language Testing Vol. 3, No. 1, March 2013 Received: Feb14, 2013 Accepted: March 7, 2013 Validation of an Analytic Rating Scale for

More information

By Hui Bian Office for Faculty Excellence

By Hui Bian Office for Faculty Excellence By Hui Bian Office for Faculty Excellence 1 Email: bianh@ecu.edu Phone: 328-5428 Location: 1001 Joyner Library, room 1006 Office hours: 8:00am-5:00pm, Monday-Friday 2 Educational tests and regular surveys

More information

Exploratory Factor Analysis Student Anxiety Questionnaire on Statistics

Exploratory Factor Analysis Student Anxiety Questionnaire on Statistics Proceedings of Ahmad Dahlan International Conference on Mathematics and Mathematics Education Universitas Ahmad Dahlan, Yogyakarta, 13-14 October 2017 Exploratory Factor Analysis Student Anxiety Questionnaire

More information

HARRISON ASSESSMENTS DEBRIEF GUIDE 1. OVERVIEW OF HARRISON ASSESSMENT

HARRISON ASSESSMENTS DEBRIEF GUIDE 1. OVERVIEW OF HARRISON ASSESSMENT HARRISON ASSESSMENTS HARRISON ASSESSMENTS DEBRIEF GUIDE 1. OVERVIEW OF HARRISON ASSESSMENT Have you put aside an hour and do you have a hard copy of your report? Get a quick take on their initial reactions

More information

WHO Quality of Life. health other than the cause of a disease or the side effects that come along with it. These other

WHO Quality of Life. health other than the cause of a disease or the side effects that come along with it. These other WHO Quality of Life Overview of the WHO Quality of Life As healthcare progresses globally, so does that evolution of scientific research on healthcare assessments and practices. Healthcare services have

More information

Influences of IRT Item Attributes on Angoff Rater Judgments

Influences of IRT Item Attributes on Angoff Rater Judgments Influences of IRT Item Attributes on Angoff Rater Judgments Christian Jones, M.A. CPS Human Resource Services Greg Hurt!, Ph.D. CSUS, Sacramento Angoff Method Assemble a panel of subject matter experts

More information

Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys

Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys Jill F. Kilanowski, PhD, APRN,CPNP Associate Professor Alpha Zeta & Mu Chi Acknowledgements Dr. Li Lin,

More information

Connexion of Item Response Theory to Decision Making in Chess. Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan

Connexion of Item Response Theory to Decision Making in Chess. Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan Connexion of Item Response Theory to Decision Making in Chess Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan Acknowledgement A few Slides have been taken from the following presentation

More information

CHAPTER 7 RESEARCH DESIGN AND METHODOLOGY. This chapter addresses the research design and describes the research methodology

CHAPTER 7 RESEARCH DESIGN AND METHODOLOGY. This chapter addresses the research design and describes the research methodology CHAPTER 7 RESEARCH DESIGN AND METHODOLOGY 7.1 Introduction This chapter addresses the research design and describes the research methodology employed in this study. The sample and sampling procedure is

More information

André Cyr and Alexander Davies

André Cyr and Alexander Davies Item Response Theory and Latent variable modeling for surveys with complex sampling design The case of the National Longitudinal Survey of Children and Youth in Canada Background André Cyr and Alexander

More information

FIGURE 1-The Cognitive Model. Core belief. I m incompetent. Intermediate belief. If I don t understand something perfectly, then I m dumb

FIGURE 1-The Cognitive Model. Core belief. I m incompetent. Intermediate belief. If I don t understand something perfectly, then I m dumb FIGURE 1-The Cognitive Model Core belief I m incompetent Intermediate belief If I don t understand something perfectly, then I m dumb Situation Automatic thoughts Reactions Reading this book This is too

More information

The Classification Accuracy of Measurement Decision Theory. Lawrence Rudner University of Maryland

The Classification Accuracy of Measurement Decision Theory. Lawrence Rudner University of Maryland Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago, April 23-25, 2003 The Classification Accuracy of Measurement Decision Theory Lawrence Rudner University

More information

Instrument equivalence across ethnic groups. Antonio Olmos (MHCD) Susan R. Hutchinson (UNC)

Instrument equivalence across ethnic groups. Antonio Olmos (MHCD) Susan R. Hutchinson (UNC) Instrument equivalence across ethnic groups Antonio Olmos (MHCD) Susan R. Hutchinson (UNC) Overview Instrument Equivalence Measurement Invariance Invariance in Reliability Scores Factorial Invariance Item

More information

insight. Psychological tests to help support your work with medical patients

insight. Psychological tests to help support your work with medical patients insight. Psychological tests to help support your work with medical patients C O M P R E H E N S I V E Shedding light on important issues Sometimes a closer view is all you need to find the answers you

More information

ITEM RESPONSE THEORY ANALYSIS OF THE TOP LEADERSHIP DIRECTION SCALE

ITEM RESPONSE THEORY ANALYSIS OF THE TOP LEADERSHIP DIRECTION SCALE California State University, San Bernardino CSUSB ScholarWorks Electronic Theses, Projects, and Dissertations Office of Graduate Studies 6-2016 ITEM RESPONSE THEORY ANALYSIS OF THE TOP LEADERSHIP DIRECTION

More information

Using Performance and Outcome Measures for Enhancing the Quality of Behavioral Health Care Services

Using Performance and Outcome Measures for Enhancing the Quality of Behavioral Health Care Services Using Performance and Outcome Measures for Enhancing the Quality of Behavioral Health Care Services Dennis Morrison, PhD Chief Clinical Officer Netsmart Technologies @DrDennyM Knowledge Without Data is

More information

Sampling for Impact Evaluation. Maria Jones 24 June 2015 ieconnect Impact Evaluation Workshop Rio de Janeiro, Brazil June 22-25, 2015

Sampling for Impact Evaluation. Maria Jones 24 June 2015 ieconnect Impact Evaluation Workshop Rio de Janeiro, Brazil June 22-25, 2015 Sampling for Impact Evaluation Maria Jones 24 June 2015 ieconnect Impact Evaluation Workshop Rio de Janeiro, Brazil June 22-25, 2015 How many hours do you expect to sleep tonight? A. 2 or less B. 3 C.

More information

INTRODUCTION TO ITEM RESPONSE THEORY APPLIED TO FOOD SECURITY MEASUREMENT. Basic Concepts, Parameters and Statistics

INTRODUCTION TO ITEM RESPONSE THEORY APPLIED TO FOOD SECURITY MEASUREMENT. Basic Concepts, Parameters and Statistics INTRODUCTION TO ITEM RESPONSE THEORY APPLIED TO FOOD SECURITY MEASUREMENT Basic Concepts, Parameters and Statistics The designations employed and the presentation of material in this information product

More information

Rasch-scaling of PHQ-9 and GAD-7 Consequences for repeated assessments. Jan R. Böhnke & Jaime Delgadillo

Rasch-scaling of PHQ-9 and GAD-7 Consequences for repeated assessments. Jan R. Böhnke & Jaime Delgadillo Rasch-scaling of PHQ-9 and GAD-7 Consequences for repeated assessments Jan R. Böhnke & Jaime Delgadillo NHS::IAPT Improving Access to Psychological Therapies "Improving Access to Psychological Therapies

More information

2 Types of psychological tests and their validity, precision and standards

2 Types of psychological tests and their validity, precision and standards 2 Types of psychological tests and their validity, precision and standards Tests are usually classified in objective or projective, according to Pasquali (2008). In case of projective tests, a person is

More information

Manifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement Invariance Tests Of Multi-Group Confirmatory Factor Analyses

Manifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement Invariance Tests Of Multi-Group Confirmatory Factor Analyses Journal of Modern Applied Statistical Methods Copyright 2005 JMASM, Inc. May, 2005, Vol. 4, No.1, 275-282 1538 9472/05/$95.00 Manifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement

More information

The Value of an Electronic Health Record System in Lean Process Improvement and Research Innovation

The Value of an Electronic Health Record System in Lean Process Improvement and Research Innovation POMS College of Service Operations Research Incubator, April 30, 2009, Orlando, FL The Value of an Electronic Health Record System in Lean Process Improvement and Research Innovation Linda LaGanga, Ph.D.

More information

Children's Depression Inventory 2nd Edition: Parent Maria Kovacs, Ph.D.

Children's Depression Inventory 2nd Edition: Parent Maria Kovacs, Ph.D. Children's Depression Inventory 2nd Edition: Parent Maria Kovacs, Ph.D. Assessment Report This Assessment Report is intended for use by qualified assessors only, and is not to be shown or presented to

More information

Technical Specifications

Technical Specifications Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically

More information

These questionnaires are used by psychology services to help us understand how people feel. One questionnaire measures how sad people feel.

These questionnaires are used by psychology services to help us understand how people feel. One questionnaire measures how sad people feel. ADAPTED PHQ-9 & GAD-7 QUESTIONNAIRES How to fill in these questionnaires: These questionnaires are used by psychology services to help us understand how people feel. One questionnaire measures how sad

More information

During the past century, mathematics

During the past century, mathematics An Evaluation of Mathematics Competitions Using Item Response Theory Jim Gleason During the past century, mathematics competitions have become part of the landscape in mathematics education. The first

More information

Known-Groups Validity 2017 FSSE Measurement Invariance

Known-Groups Validity 2017 FSSE Measurement Invariance Known-Groups Validity 2017 FSSE Measurement Invariance A key assumption of any latent measure (any questionnaire trying to assess an unobservable construct) is that it functions equally across all different

More information

Model fit and robustness? - A critical look at the foundation of the PISA project

Model fit and robustness? - A critical look at the foundation of the PISA project Model fit and robustness? - A critical look at the foundation of the PISA project Svend Kreiner, Dept. of Biostatistics, Univ. of Copenhagen TOC The PISA project and PISA data PISA methodology Rasch item

More information

The Lens Model and Linear Models of Judgment

The Lens Model and Linear Models of Judgment John Miyamoto Email: jmiyamot@uw.edu October 3, 2017 File = D:\P466\hnd02-1.p466.a17.docm 1 http://faculty.washington.edu/jmiyamot/p466/p466-set.htm Psych 466: Judgment and Decision Making Autumn 2017

More information

On indirect measurement of health based on survey data. Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state

On indirect measurement of health based on survey data. Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state On indirect measurement of health based on survey data Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state A scaling model: P(Y 1,..,Y k ;α, ) α = item difficulties

More information

Mantel-Haenszel Procedures for Detecting Differential Item Functioning

Mantel-Haenszel Procedures for Detecting Differential Item Functioning A Comparison of Logistic Regression and Mantel-Haenszel Procedures for Detecting Differential Item Functioning H. Jane Rogers, Teachers College, Columbia University Hariharan Swaminathan, University of

More information

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971)

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971) Ch. 5: Validity Validity History Griggs v. Duke Power Ricci vs. DeStefano Defining Validity Aspects of Validity Face Validity Content Validity Criterion Validity Construct Validity Reliability vs. Validity

More information

EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS

EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS DePaul University INTRODUCTION TO ITEM ANALYSIS: EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS Ivan Hernandez, PhD OVERVIEW What is Item Analysis? Overview Benefits of Item Analysis Applications Main

More information

Part 5. Clare s Recovery Story

Part 5. Clare s Recovery Story Part 5 Clare s Recovery Story Clare s story is about someone who used exposure and habituation to treat her panic disorder with agoraphobia. Exposure and habituation is a technique that breaks the cycle

More information

about Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat.

about Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat. Brad Pilon 1 2 3 ! For many people, the best thing about Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat.! However, this still means there

More information

Author s response to reviews

Author s response to reviews Author s response to reviews Title: The validity of a professional competence tool for physiotherapy students in simulationbased clinical education: a Rasch analysis Authors: Belinda Judd (belinda.judd@sydney.edu.au)

More information

RUNNING HEAD: EVALUATING SCIENCE STUDENT ASSESSMENT. Evaluating and Restructuring Science Assessments: An Example Measuring Student s

RUNNING HEAD: EVALUATING SCIENCE STUDENT ASSESSMENT. Evaluating and Restructuring Science Assessments: An Example Measuring Student s RUNNING HEAD: EVALUATING SCIENCE STUDENT ASSESSMENT Evaluating and Restructuring Science Assessments: An Example Measuring Student s Conceptual Understanding of Heat Kelly D. Bradley, Jessica D. Cunningham

More information

MEASURING MIDDLE GRADES STUDENTS UNDERSTANDING OF FORCE AND MOTION CONCEPTS: INSIGHTS INTO THE STRUCTURE OF STUDENT IDEAS

MEASURING MIDDLE GRADES STUDENTS UNDERSTANDING OF FORCE AND MOTION CONCEPTS: INSIGHTS INTO THE STRUCTURE OF STUDENT IDEAS MEASURING MIDDLE GRADES STUDENTS UNDERSTANDING OF FORCE AND MOTION CONCEPTS: INSIGHTS INTO THE STRUCTURE OF STUDENT IDEAS The purpose of this study was to create an instrument that measures middle grades

More information

APPLYING THE RASCH MODEL TO PSYCHO-SOCIAL MEASUREMENT A PRACTICAL APPROACH

APPLYING THE RASCH MODEL TO PSYCHO-SOCIAL MEASUREMENT A PRACTICAL APPROACH APPLYING THE RASCH MODEL TO PSYCHO-SOCIAL MEASUREMENT A PRACTICAL APPROACH Margaret Wu & Ray Adams Documents supplied on behalf of the authors by Educational Measurement Solutions TABLE OF CONTENT CHAPTER

More information

Assessment: Interviews, Tests, Techniques. Clinical Psychology Lectures

Assessment: Interviews, Tests, Techniques. Clinical Psychology Lectures Lecture 6 Assessment: Interviews, Tests, Techniques Clinical Psychology Lectures Psychodiagnostic Assessment Also termed: personality assessment, diagnostic assessment, pretreatment assessments or psychological

More information

CLINICAL NEUROPSYCHOLOGY AND COGNITIVE NEUROLOGY OF PARKINSON'S DISEASE AND OTHER MOVEMENT DISORDERS BY ALEXANDER I. TROSTER

CLINICAL NEUROPSYCHOLOGY AND COGNITIVE NEUROLOGY OF PARKINSON'S DISEASE AND OTHER MOVEMENT DISORDERS BY ALEXANDER I. TROSTER CLINICAL NEUROPSYCHOLOGY AND COGNITIVE NEUROLOGY OF PARKINSON'S DISEASE AND OTHER MOVEMENT DISORDERS BY ALEXANDER I. TROSTER DOWNLOAD EBOOK : CLINICAL NEUROPSYCHOLOGY AND COGNITIVE MOVEMENT DISORDERS BY

More information

Introduction to Item Response Theory

Introduction to Item Response Theory Introduction to Item Response Theory Prof John Rust, j.rust@jbs.cam.ac.uk David Stillwell, ds617@cam.ac.uk Aiden Loe, bsl28@cam.ac.uk Luning Sun, ls523@cam.ac.uk www.psychometrics.cam.ac.uk Goals Build

More information

CYRINUS B. ESSEN, IDAKA E. IDAKA AND MICHAEL A. METIBEMU. (Received 31, January 2017; Revision Accepted 13, April 2017)

CYRINUS B. ESSEN, IDAKA E. IDAKA AND MICHAEL A. METIBEMU. (Received 31, January 2017; Revision Accepted 13, April 2017) DOI: http://dx.doi.org/10.4314/gjedr.v16i2.2 GLOBAL JOURNAL OF EDUCATIONAL RESEARCH VOL 16, 2017: 87-94 COPYRIGHT BACHUDO SCIENCE CO. LTD PRINTED IN NIGERIA. ISSN 1596-6224 www.globaljournalseries.com;

More information

Turning Output of Item Response Theory Data Analysis into Graphs with R

Turning Output of Item Response Theory Data Analysis into Graphs with R Overview Turning Output of Item Response Theory Data Analysis into Graphs with R Motivation Importance of graphing data Graphical methods for item response theory Why R? Two examples Ching-Fan Sheu, Cheng-Te

More information

Examining Factors Affecting Language Performance: A Comparison of Three Measurement Approaches

Examining Factors Affecting Language Performance: A Comparison of Three Measurement Approaches Pertanika J. Soc. Sci. & Hum. 21 (3): 1149-1162 (2013) SOCIAL SCIENCES & HUMANITIES Journal homepage: http://www.pertanika.upm.edu.my/ Examining Factors Affecting Language Performance: A Comparison of

More information

1. Evaluate the methodological quality of a study with the COSMIN checklist

1. Evaluate the methodological quality of a study with the COSMIN checklist Answers 1. Evaluate the methodological quality of a study with the COSMIN checklist We follow the four steps as presented in Table 9.2. Step 1: The following measurement properties are evaluated in the

More information

In this session we re focusing on a high level overview of the Collaborative Care approach.

In this session we re focusing on a high level overview of the Collaborative Care approach. In this session we re focusing on a high level overview of the Collaborative Care approach. 1 The evidence base for collaborative care is extremely strong. More than 80 RCTs demonstrate this approach is

More information

Basic Immunology: Functions And Disorders Of The Immune System, 4e PDF

Basic Immunology: Functions And Disorders Of The Immune System, 4e PDF Basic Immunology: Functions And Disorders Of The Immune System, 4e PDF Understand all the essential concepts in immunology with Basic Immunology: Functions and Disorders of the Immune System! This concise,

More information

Development of a descriptive fit statistic for the Rasch model

Development of a descriptive fit statistic for the Rasch model Development of a descriptive fit statistic for the Rasch model Purya Baghaei, Takuya Yanagida, Moritz Heene To cite this version: Purya Baghaei, Takuya Yanagida, Moritz Heene. Development of a descriptive

More information

Improving Measurement of Ambiguity Tolerance (AT) Among Teacher Candidates. Kent Rittschof Department of Curriculum, Foundations, & Reading

Improving Measurement of Ambiguity Tolerance (AT) Among Teacher Candidates. Kent Rittschof Department of Curriculum, Foundations, & Reading Improving Measurement of Ambiguity Tolerance (AT) Among Teacher Candidates Kent Rittschof Department of Curriculum, Foundations, & Reading What is Ambiguity Tolerance (AT) and why should it be measured?

More information

Running head: PRELIM KSVS SCALES 1

Running head: PRELIM KSVS SCALES 1 Running head: PRELIM KSVS SCALES 1 Psychometric Examination of a Risk Perception Scale for Evaluation Anthony P. Setari*, Kelly D. Bradley*, Marjorie L. Stanek**, & Shannon O. Sampson* *University of Kentucky

More information

Psychology UNIT 1: PSYCHOLOGY AS A SCIENCE. Core

Psychology UNIT 1: PSYCHOLOGY AS A SCIENCE. Core Core provides a solid overview of the field's major domains: methods, biopsychology, cognitive and developmental psychology, and variations in individual and group behavior. By focusing on significant

More information

Tasks of Executive Control TEC. Interpretive Report. Developed by Peter K. Isquith, PhD, Robert M. Roth, PhD, Gerard A. Gioia, PhD, and PAR Staff

Tasks of Executive Control TEC. Interpretive Report. Developed by Peter K. Isquith, PhD, Robert M. Roth, PhD, Gerard A. Gioia, PhD, and PAR Staff Tasks of Executive Control TEC Interpretive Report Developed by Peter K. Isquith, PhD, Robert M. Roth, PhD, Gerard A. Gioia, PhD, and PAR Staff Client Information Client Name: Sample Client Client ID:

More information

Development of the Mental, Emotional, and Bodily Toughness Inventory in Collegiate Athletes and Nonathletes

Development of the Mental, Emotional, and Bodily Toughness Inventory in Collegiate Athletes and Nonathletes Journal of Athletic Training 2008;43(2):125 132 g by the National Athletic Trainers Association, Inc www.nata.org/jat original research Development of the Mental, Emotional, and Bodily Toughness Inventory

More information

Facebook Therapy? Why Do People Share Self-Relevant Content Online? Eva Buechel. University of Miami. Jonah Berger. University of Pennsylvania

Facebook Therapy? Why Do People Share Self-Relevant Content Online? Eva Buechel. University of Miami. Jonah Berger. University of Pennsylvania Facebook Therapy 1 Facebook Therapy? Why Do People Share Self-Relevant Content Online? Eva Buechel University of Miami Jonah Berger University of Pennsylvania Word Count: 1000 Contact Information: Jonah

More information

MULTIPLE-CHOICE ITEMS ANALYSIS USING CLASSICAL TEST THEORY AND RASCH MEASUREMENT MODEL

MULTIPLE-CHOICE ITEMS ANALYSIS USING CLASSICAL TEST THEORY AND RASCH MEASUREMENT MODEL Man In India, 96 (1-2) : 173-181 Serials Publications MULTIPLE-CHOICE ITEMS ANALYSIS USING CLASSICAL TEST THEORY AND RASCH MEASUREMENT MODEL Adibah Binti Abd Latif 1*, Ibnatul Jalilah Yusof 1, Nor Fadila

More information

Mental Toughness overview

Mental Toughness overview Mental Toughness overview Mental health: a state of well-being Mental health is defined as a state of well-being in which every individual realizes his or her own potential, can cope with the normal stresses

More information

Chapter 7: Descriptive Statistics

Chapter 7: Descriptive Statistics Chapter Overview Chapter 7 provides an introduction to basic strategies for describing groups statistically. Statistical concepts around normal distributions are discussed. The statistical procedures of

More information

REPORT. Technical Report: Item Characteristics. Jessica Masters

REPORT. Technical Report: Item Characteristics. Jessica Masters August 2010 REPORT Diagnostic Geometry Assessment Project Technical Report: Item Characteristics Jessica Masters Technology and Assessment Study Collaborative Lynch School of Education Boston College Chestnut

More information

The Psychometric Principles Maximizing the quality of assessment

The Psychometric Principles Maximizing the quality of assessment Summer School 2009 Psychometric Principles Professor John Rust University of Cambridge The Psychometric Principles Maximizing the quality of assessment Reliability Validity Standardisation Equivalence

More information

Measuring mathematics anxiety: Paper 2 - Constructing and validating the measure. Rob Cavanagh Len Sparrow Curtin University

Measuring mathematics anxiety: Paper 2 - Constructing and validating the measure. Rob Cavanagh Len Sparrow Curtin University Measuring mathematics anxiety: Paper 2 - Constructing and validating the measure Rob Cavanagh Len Sparrow Curtin University R.Cavanagh@curtin.edu.au Abstract The study sought to measure mathematics anxiety

More information

was also my mentor, teacher, colleague, and friend. It is tempting to review John Horn s main contributions to the field of intelligence by

was also my mentor, teacher, colleague, and friend. It is tempting to review John Horn s main contributions to the field of intelligence by Horn, J. L. (1965). A rationale and test for the number of factors in factor analysis. Psychometrika, 30, 179 185. (3362 citations in Google Scholar as of 4/1/2016) Who would have thought that a paper

More information

Report for the TEWV Children and Young People s Directorate Clinical Audit & Effectiveness Annual Presentation Day 2014.

Report for the TEWV Children and Young People s Directorate Clinical Audit & Effectiveness Annual Presentation Day 2014. Report for the TEWV Children and Young People s Directorate Clinical Audit & Effectiveness Annual Presentation Day 204. Acknowledgements We would like to thank the Foundation for Professionals in Services

More information

The Mental Toughness Development Toolkit

The Mental Toughness Development Toolkit The Mental Toughness Development Toolkit The Mental Toughness Development Toolkit is an array of experiential exercises which can be run with individuals and/or small and large groups. It also works well

More information

Using Generalizability Theory to Investigate the Psychometric Property of an Assessment Center in Indonesia

Using Generalizability Theory to Investigate the Psychometric Property of an Assessment Center in Indonesia Using Generalizability Theory to Investigate the Psychometric Property of an Assessment Center in Indonesia Urip Purwono received his Ph.D. (psychology) from the University of Massachusetts at Amherst,

More information

THE TEST STATISTICS REPORT provides a synopsis of the test attributes and some important statistics. A sample is shown here to the right.

THE TEST STATISTICS REPORT provides a synopsis of the test attributes and some important statistics. A sample is shown here to the right. THE TEST STATISTICS REPORT provides a synopsis of the test attributes and some important statistics. A sample is shown here to the right. The Test reliability indicators are measures of how well the questions

More information

linking in educational measurement: Taking differential motivation into account 1

linking in educational measurement: Taking differential motivation into account 1 Selecting a data collection design for linking in educational measurement: Taking differential motivation into account 1 Abstract In educational measurement, multiple test forms are often constructed to

More information

Goldsmith. Marshall. FEEDFORWARD Coaching For Your Future. Coaching For Your Future. MMIX ChartHouse Learning. All Rights Reserved. What Is It?

Goldsmith. Marshall. FEEDFORWARD Coaching For Your Future. Coaching For Your Future. MMIX ChartHouse Learning. All Rights Reserved. What Is It? FOR PREVIEW ONLY Marshall Goldsmith Table of Contents.............................................. 2 Get 360 Feedback...3 Pick a Behavior...4 Respond.......................................... 4 Involve...6

More information

Edinburgh Research Explorer

Edinburgh Research Explorer Edinburgh Research Explorer A reaction time study of responses to trait and ability emotional intelligence test items Citation for published version: Austin, EJ 2009, 'A reaction time study of responses

More information

The Influence of Test Characteristics on the Detection of Aberrant Response Patterns

The Influence of Test Characteristics on the Detection of Aberrant Response Patterns The Influence of Test Characteristics on the Detection of Aberrant Response Patterns Steven P. Reise University of California, Riverside Allan M. Due University of Minnesota Statistical methods to assess

More information

Comparison of Computerized Adaptive Testing and Classical Methods for Measuring Individual Change

Comparison of Computerized Adaptive Testing and Classical Methods for Measuring Individual Change Comparison of Computerized Adaptive Testing and Classical Methods for Measuring Individual Change Gyenam Kim Kang Korea Nazarene University David J. Weiss University of Minnesota Presented at the Item

More information

Program Evaluation Presentation

Program Evaluation Presentation Program Evaluation Presentation Rosalie Hall Young Parent Employment Program October 19, 2011 Young Parent Employment Program (YPEP) YPEP is federally funded employment program with 2.5 FTE of staff allocated

More information

Understanding University Students Implicit Theories of Willpower for Strenuous Mental Activities

Understanding University Students Implicit Theories of Willpower for Strenuous Mental Activities Understanding University Students Implicit Theories of Willpower for Strenuous Mental Activities Success in college is largely dependent on students ability to regulate themselves independently (Duckworth

More information

Connecting Career and Mental Health Counseling: Integrating Theory and Practice

Connecting Career and Mental Health Counseling: Integrating Theory and Practice Connecting Career and Mental Health Counseling: Integrating Theory and Practice Janet G. Lenz, Ph.D. Gary W. Peterson, Ph.D. Robert C. Reardon, Ph.D. Andrew R. Morrison, M.S. Florida State University Denise

More information

An Introduction to Missing Data in the Context of Differential Item Functioning

An Introduction to Missing Data in the Context of Differential Item Functioning A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute

More information

Objective List Theories

Objective List Theories 2015.09.23 Objective List Theories Table of contents 1 Ideal Desire Satisfaction 2 Remote Desires 3 Objective List Theories Desires that would harm one if satisfied Gasoline and water. Suicidal teenager.

More information

Reflect on the Types of Organizational Structures. Hierarch of Needs Abraham Maslow (1970) Hierarchy of Needs

Reflect on the Types of Organizational Structures. Hierarch of Needs Abraham Maslow (1970) Hierarchy of Needs Reflect on the Types of Organizational Structures 1 Hierarch of Needs Abraham Maslow (1970) Self- Actualization or Self- Fulfillment Esteem Belonging, Love, and Social Activities Safety and Security Psychological

More information

Psychometrics in context: Test Construction with IRT. Professor John Rust University of Cambridge

Psychometrics in context: Test Construction with IRT. Professor John Rust University of Cambridge Psychometrics in context: Test Construction with IRT Professor John Rust University of Cambridge Plan Guttman scaling Guttman errors and Loevinger s H statistic Non-parametric IRT Traces in Stata Parametric

More information