Techniques for Explaining Item Response Theory to Stakeholder
|
|
- Blanche Shelton
- 5 years ago
- Views:
Transcription
1 Techniques for Explaining Item Response Theory to Stakeholder Kate DeRoche Antonio Olmos C.J. Mckinney Mental Health Center of Denver Presented on March 23, 2007 at the Eastern Evaluation Research Society Conference in Absecon, NJ
2 Overview of Presentation Current issues with applying IRT in evaluation Explaining the reasoning for using IRT in evaluation Explaining and involving stakeholder in the IRT analysis Explaing IRT to Stakeholder 2
3 IRT in Evaluation There has been an increase in the application of IRT in evaluation Due to the advantages it provides (Hambleton, Swaminathan, Roger, 1991) Multiple applications of IRT for evaluation purposes: Psychometrics-measurement validation Need to shorten a measurement tool Rank elements of a dimension for difficulty Equating instruments Differential Item Functioning (DIF) Explaing IRT to Stakeholder 3
4 Limitation to applying IRT in Evaluation: Less training in measurement as evaluators/researchers Can teach IRT in a class but difficult to explain to people who do not understand psychometrics and advanced statistics. PROBLEM: It is difficult for evaluators to apply Item Response Theory techniques because of the problems related to: 1. Explaining the advantages of IRT in simple terms 2. Explaining the results of IRT so that stakeholders can be involved in the analysis process Explaing IRT to Stakeholder 4
5 Explaining the Reasoning for Using IRT Why do we want to use a more complex method?
6 Explaining Benefits of IRT Hallmark: Separation of item and person parameters (or item/person invariance) Instead, There are 2 things: 1. Are all of our items equal? Should they all equally contribute to our score or are some questions harder than other? 2. Test created with Classical Test Thoery can be very reliable (i.e. very consistent for a person to score the same, if measured twice) and not measure all of our participants well Explaing IRT to Stakeholder 6
7 Are Items Equal? Example for assumptions of CTT: Question 1: Question 2: I feel sad often I think about suicide often Question 3: I sleep more than usual Question 4: I want to be alone more than usual 1 point 1 point 1 point 1 point Example for assumptions of IRT: Question 1: Question 2: I feel sad often 1/2 point I think about suicide often Question 3: I sleep more than usual 3/4 point Question 4: I want to be alone more than usual 1 1/4 point 1 1/2 points Explaing IRT to Stakeholder 7
8 True Depression More Depressed Test level on Trait MMPI- Depression Section Beck s Depression Inventory Could create a measure and depending on who you give it to, it may be created for a specific population MMPI measures a more depressed sample better Beck s measure a less depressed sample better Less Depressed With out IRT would not know the level of the trait that we are measuring (AKA. Which measure is harder) Explaing IRT to Stakeholder 8
9 Explaining IRT results to Stakeholder You can use IRT analysis and involve stakeholder!!!
10 IQ Hard 4.0 Response- G (Super-dupper smart)-(3.0) Scaling Example with IQ Assuming an order that A is a very easy item (i.e. what is your name?) and G is a very hard question (what does floccinaucinihillipilification mean?) It is assumed that if you get a higher response correct, then you got all of the below responses correct, even though you may or may not have been tested for the items below (i.e. if you get D correct, then you got C, B, & A correct). The higher the response you get correct, the higher score you receive. If you get G correct, then your IQ score is 145 Difficulty of Response Response- F (Super-Smart)-(2.0) Response- E (Smart)-(1.0) Response- D (average)-(0.0) Response- C (Below average)-(-1.0) Response- B (border-line mental retardation)-(-2.0) If you get F correct, then your IQ score is 130 If you get E correct, then your IQ score is 115 If you get D correct, then your IQ score is 100 If you get C correct, then your IQ score is 85 If you get B correct, then your IQ score is 70 Response-A (mentally retarded)-(-3.0) If you get A correct, then your IQ score is 55 Easy -4.0 Ideally, this is what a scale should look like, with even intervals of potential responses across the scale. Explaing IRT to Stakeholder 10
11 Problem 1: Clumping The items are clumping together (i.e. E, D & C). Hard 4.0 Example 2 Response- G (Super-dupper smart) (3.0) IQ 145 Response- F (Super-Smart)- (2.8) IQ 142 This means that responses E, D, & C are basically measuring the same thing, and only discriminate between 106 to 102 and 102 to 97. We could remove D, E, or C, but this should be determined based on context and other IRT outcomes. WE DO NOT WANT CLUMPS OF RESPONSES Difficulty of Response Easy Response- E (Smart) (.4) IQ 106 Response- D (average) (.1) IQ 102 Response- C (Below average) (-.2) IQ 97 Response- B (border-line mental retardation) Response-A (mentally retarded) Explaing IRT to Stakeholder 11
12 Problem #2: Full Range Hard Difficulty of Response Easy Example Response- F (Super-Smart) (.5) IQ 107 Response- G (Super-dupper smart)- (0) IQ 100 Response- E (Smart) (-.5) IQ 93 Response- D (average) (-1.25) IQ 81 Response- C (Below average) (-2) IQ 70 Response- B (border-line mental retardation) (-2.8) IQ 58 Response-A (mentally retarded) (-3.5) IQ 48 The scale does not contain any responses above (+0.5) suggesting that the highest IQ we can measure is 107. People with IQ score higher than 107 (i.e. 130) would only be able to know that they have an IQ above 107. THE RESPONSES NEED TO RANGE FROM +3 TO -3 Explaing IRT to Stakeholder 12
13 Problem #3: Ordering Notice that the order of easy to hard goes A,B,C, D, E, G, then F, suggesting that F and G are out of order. A response of what we think is hard (only for supper-duper smart people) is really not that hard and will only produce an IQ score of 100, not 145 as assumed. THE RESPONSES NEED TO BE IN CORRECT ORDER Hard Difficulty of Response Easy Example Response- F (Super-Smart) (.5) IQ 107 Response- G (Super-dupper smart)- (0) IQ 100 Response- E (Smart) (-.5) IQ 93 Response- D (average) (-1.25) IQ 81 Response- C (Below average) (-2) IQ 70 Response- B (border-line mental retardation) (-2.8) IQ 58 Response-A (mentally retarded) (-3.5) IQ 48 Explaing IRT to Stakeholder 13
14 Problem #4: Large Gaps Hard 4.0 Example 2 Response- G (Super-dupper smart) (3.0) IQ 145 Response- F (Super-Smart)- (2.8) IQ 142 Again we see gaps, but they are within the scale (not just at the top or bottom). Difficulty of Response Response- E (Smart) (.4) IQ 106 Response- D (average) (.1) IQ 102 Response- C (Below average) (-.2) IQ 97 This suggest that there are no items able to measure an IQ score between 141 to 107, and between 96 and 56 people are not able to receive score in this area Response-A (mentally retarded) WE CANNOT HAVE LARGE GAPS Explaing IRT to Stakeholder 14
15 Activity with Stakeholder Keep in mind the 4 problems presented (C.R.O.G.)- Clumping, Range, Order & Gaps Present the results of the IRT item map Have problems listed with potential reason and solutions Should have also previously reviewed other IRT output for the software program (ICC s, Information functions, a, b, or c parameters, infit, outfit, etc ) Stakeholders were able interpret the results in terms of their program Added context to the results Most importantly, stakeholders felt that they were involved in the process Explaing IRT to Stakeholder 15
16 How to Present the Results Two ways the item map was presented 1. Display only the current item map Beneficial for individuals familiar with looking at data or graphs Beneficial for long measures when viewing individual items 2. Present the current item map and ideal item map, with suggestions and explanations at the bottom Beneficial for individuals not as familiar with data, graphs, or have number anxiety Ideal for short measure when viewing individual items because it takes up a lot of room Explaing IRT to Stakeholder 16
17 Example 1: Current Item Map Hard Employment Education Active/Growth C- Active Job Search (3.29) F- Full time college (4.77) E- Part time college (.3.66) 4.0 F- Very high (3.41) D- Non-paid work/volunteer (2.64) Difficulty of Item H- Full time independent (1.59) G- Part time independent (.76) E- Part time supported (-.37) F- Full time supported (-1.22) B- Interest in work, no action (-1.99) A- No interest in work (-2.71) D- noncredit training (-.10) G- Recent Grad (-.20) C- Active education/training search (-1.00) B-interest in education, no action (-1.05) A- No interest in education (-2.82) E- High (-.75) D- Moderate out MH system (-1.62) B- Low (-2.28) C- Moderate in MH system (-2.37) Easy A- Very low (-3.88) Explaing IRT to Stakeholder 17
18 Example 2: Current & Ideal Item Maps 4.0 Current Status: F- Very high (3.41) Active/Growth 4.0 Ideal: F- Very high (3.41) High (??) E- High (-.75) E- Moderate/High (-.75) -2.0 D- Moderate out MH system (-1.62) B- Low (-2.28) C- Moderate in MH system (-2.37) -2.0 D- Moderate/low (-1.62) B & C- Low (-2.28 to -2.37) -4.0 A- Very low (-3.88) Problems: #2 Ordering, #3 Clustering, & #4 Clumping Suggestion: 1. Combined low and moderate in MH systems, because their definitions are very similar, and clinicians can not differentiate between the 2 2. Since, B & C are combined and called low, then moderate out of MH should be call moderate low and high should be called moderate high 3. There is a very large gap between high and very high, when looking at their definition the only difference is participating in activates that are other focused in very high, and self-focused in high. Therefore, in terms of recovery, moving from self to other focused activities is a large step, and there should be other levels added in. Therefore, Explaing a new IRT response to Stakeholder should be added that includes participating in actives that are 18 approximately 50% self-focused and 50% other focuses, called high. Any suggestions regarding this issues? -4.0 A- Very low (-3.88)
19 Applications in Other Domains Any commonly understood test that has a normed mean and standard deviation Education- standard test scores SAT, GRE, K-12 state testing, t-scores Health- any biological measure heart rate, blood pressure, blood sugar Explaing IRT to Stakeholder 19
20 Questions??? For future questions or comments: Kate DeRoche (303) Explaing IRT to Stakeholder 20
21 IRT Resources IRT 101 Reise, S. P., Ainsworth, A. T. & Haviland, M. G. (2005). Item Response Theory: Fundamentals, Applications, and Promise in Psychological Research. Current Directions in Psychological Science, 14, IRT Books Hambleton, R. K., Swaminathan, H. & Rogers, H. J. (1991). Fundamentals of Item Response Theory. Newbury Park, CA: Sage Publication, Inc. Embretson, S. E. & Reise, S. P. (2000). Item Response Theory for Psychologists. Mahwah, NJ: Lawrence Erlbaum Associates And many more resources Explaing IRT to Stakeholder 21
The Psychometric Development Process of Recovery Measures and Markers: Classical Test Theory and Item Response Theory
The Psychometric Development Process of Recovery Measures and Markers: Classical Test Theory and Item Response Theory Kate DeRoche, M.A. Mental Health Center of Denver Antonio Olmos, Ph.D. Mental Health
More informationThe MHSIP: A Tale of Three Centers
The MHSIP: A Tale of Three Centers P. Antonio Olmos-Gallo, Ph.D. Kathryn DeRoche, M.A. Mental Health Center of Denver Richard Swanson, Ph.D., J.D. Aurora Research Institute John Mahalik, Ph.D., M.P.A.
More informationInvestigating the Invariance of Person Parameter Estimates Based on Classical Test and Item Response Theories
Kamla-Raj 010 Int J Edu Sci, (): 107-113 (010) Investigating the Invariance of Person Parameter Estimates Based on Classical Test and Item Response Theories O.O. Adedoyin Department of Educational Foundations,
More informationContents. What is item analysis in general? Psy 427 Cal State Northridge Andrew Ainsworth, PhD
Psy 427 Cal State Northridge Andrew Ainsworth, PhD Contents Item Analysis in General Classical Test Theory Item Response Theory Basics Item Response Functions Item Information Functions Invariance IRT
More informationReferences. Embretson, S. E. & Reise, S. P. (2000). Item response theory for psychologists. Mahwah,
The Western Aphasia Battery (WAB) (Kertesz, 1982) is used to classify aphasia by classical type, measure overall severity, and measure change over time. Despite its near-ubiquitousness, it has significant
More informationMultilevel Techniques for Quality Control Charts of Recovery Outcomes
Multilevel Techniques for Quality Control Charts of Recovery Outcomes INFORMS Annual Meeting 2009 San Diego, CA October, 11 th, 2009 Linda Laganga, PhD* (Linda.Laganga@mhcd.org) CJ McKinney, MA Kate DeRoche,
More informationDetermining Differential Item Functioning in Mathematics Word Problems Using Item Response Theory
Determining Differential Item Functioning in Mathematics Word Problems Using Item Response Theory Teodora M. Salubayba St. Scholastica s College-Manila dory41@yahoo.com Abstract Mathematics word-problem
More informationValidating Measures of Self Control via Rasch Measurement. Jonathan Hasford Department of Marketing, University of Kentucky
Validating Measures of Self Control via Rasch Measurement Jonathan Hasford Department of Marketing, University of Kentucky Kelly D. Bradley Department of Educational Policy Studies & Evaluation, University
More informationItem Response Theory: Methods for the Analysis of Discrete Survey Response Data
Item Response Theory: Methods for the Analysis of Discrete Survey Response Data ICPSR Summer Workshop at the University of Michigan June 29, 2015 July 3, 2015 Presented by: Dr. Jonathan Templin Department
More informationUsing Analytical and Psychometric Tools in Medium- and High-Stakes Environments
Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Greg Pope, Analytics and Psychometrics Manager 2008 Users Conference San Antonio Introduction and purpose of this session
More informationComprehensive Statistical Analysis of a Mathematics Placement Test
Comprehensive Statistical Analysis of a Mathematics Placement Test Robert J. Hall Department of Educational Psychology Texas A&M University, USA (bobhall@tamu.edu) Eunju Jung Department of Educational
More informationConstruct Validity of Mathematics Test Items Using the Rasch Model
Construct Validity of Mathematics Test Items Using the Rasch Model ALIYU, R.TAIWO Department of Guidance and Counselling (Measurement and Evaluation Units) Faculty of Education, Delta State University,
More informationUSE OF DIFFERENTIAL ITEM FUNCTIONING (DIF) ANALYSIS FOR BIAS ANALYSIS IN TEST CONSTRUCTION
USE OF DIFFERENTIAL ITEM FUNCTIONING (DIF) ANALYSIS FOR BIAS ANALYSIS IN TEST CONSTRUCTION Iweka Fidelis (Ph.D) Department of Educational Psychology, Guidance and Counselling, University of Port Harcourt,
More informationProceedings of the 2011 International Conference on Teaching, Learning and Change (c) International Association for Teaching and Learning (IATEL)
EVALUATION OF MATHEMATICS ACHIEVEMENT TEST: A COMPARISON BETWEEN CLASSICAL TEST THEORY (CTT)AND ITEM RESPONSE THEORY (IRT) Eluwa, O. Idowu 1, Akubuike N. Eluwa 2 and Bekom K. Abang 3 1& 3 Dept of Educational
More informationItem Response Theory (IRT): A Modern Statistical Theory for Solving Measurement Problem in 21st Century
International Journal of Scientific Research in Education, SEPTEMBER 2018, Vol. 11(3B), 627-635. Item Response Theory (IRT): A Modern Statistical Theory for Solving Measurement Problem in 21st Century
More informationSurvey Sampling Weights and Item Response Parameter Estimation
Survey Sampling Weights and Item Response Parameter Estimation Spring 2014 Survey Methodology Simmons School of Education and Human Development Center on Research & Evaluation Paul Yovanoff, Ph.D. Department
More informationDoes factor indeterminacy matter in multi-dimensional item response theory?
ABSTRACT Paper 957-2017 Does factor indeterminacy matter in multi-dimensional item response theory? Chong Ho Yu, Ph.D., Azusa Pacific University This paper aims to illustrate proper applications of multi-dimensional
More informationEmpowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison
Empowered by Psychometrics The Fundamentals of Psychometrics Jim Wollack University of Wisconsin Madison Psycho-what? Psychometrics is the field of study concerned with the measurement of mental and psychological
More informationItem Response Theory. Steven P. Reise University of California, U.S.A. Unidimensional IRT Models for Dichotomous Item Responses
Item Response Theory Steven P. Reise University of California, U.S.A. Item response theory (IRT), or modern measurement theory, provides alternatives to classical test theory (CTT) methods for the construction,
More informationItem and response-category functioning of the Persian version of the KIDSCREEN-27: Rasch partial credit model
Jafari et al. Health and Quality of Life Outcomes 2012, 10:127 SHORT REPORT Open Access Item and response-category functioning of the Persian version of the KIDSCREEN-27: Rasch partial credit model Peyman
More informationPsychometrics for Beginners. Lawrence J. Fabrey, PhD Applied Measurement Professionals
Psychometrics for Beginners Lawrence J. Fabrey, PhD Applied Measurement Professionals Learning Objectives Identify key NCCA Accreditation requirements Identify two underlying models of measurement Describe
More informationAdaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida
Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida and Oleksandr S. Chernyshenko University of Canterbury Presented at the New CAT Models
More informationValidation of an Analytic Rating Scale for Writing: A Rasch Modeling Approach
Tabaran Institute of Higher Education ISSN 2251-7324 Iranian Journal of Language Testing Vol. 3, No. 1, March 2013 Received: Feb14, 2013 Accepted: March 7, 2013 Validation of an Analytic Rating Scale for
More informationBy Hui Bian Office for Faculty Excellence
By Hui Bian Office for Faculty Excellence 1 Email: bianh@ecu.edu Phone: 328-5428 Location: 1001 Joyner Library, room 1006 Office hours: 8:00am-5:00pm, Monday-Friday 2 Educational tests and regular surveys
More informationExploratory Factor Analysis Student Anxiety Questionnaire on Statistics
Proceedings of Ahmad Dahlan International Conference on Mathematics and Mathematics Education Universitas Ahmad Dahlan, Yogyakarta, 13-14 October 2017 Exploratory Factor Analysis Student Anxiety Questionnaire
More informationHARRISON ASSESSMENTS DEBRIEF GUIDE 1. OVERVIEW OF HARRISON ASSESSMENT
HARRISON ASSESSMENTS HARRISON ASSESSMENTS DEBRIEF GUIDE 1. OVERVIEW OF HARRISON ASSESSMENT Have you put aside an hour and do you have a hard copy of your report? Get a quick take on their initial reactions
More informationWHO Quality of Life. health other than the cause of a disease or the side effects that come along with it. These other
WHO Quality of Life Overview of the WHO Quality of Life As healthcare progresses globally, so does that evolution of scientific research on healthcare assessments and practices. Healthcare services have
More informationInfluences of IRT Item Attributes on Angoff Rater Judgments
Influences of IRT Item Attributes on Angoff Rater Judgments Christian Jones, M.A. CPS Human Resource Services Greg Hurt!, Ph.D. CSUS, Sacramento Angoff Method Assemble a panel of subject matter experts
More informationUsing the Rasch Modeling for psychometrics examination of food security and acculturation surveys
Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys Jill F. Kilanowski, PhD, APRN,CPNP Associate Professor Alpha Zeta & Mu Chi Acknowledgements Dr. Li Lin,
More informationConnexion of Item Response Theory to Decision Making in Chess. Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan
Connexion of Item Response Theory to Decision Making in Chess Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan Acknowledgement A few Slides have been taken from the following presentation
More informationCHAPTER 7 RESEARCH DESIGN AND METHODOLOGY. This chapter addresses the research design and describes the research methodology
CHAPTER 7 RESEARCH DESIGN AND METHODOLOGY 7.1 Introduction This chapter addresses the research design and describes the research methodology employed in this study. The sample and sampling procedure is
More informationAndré Cyr and Alexander Davies
Item Response Theory and Latent variable modeling for surveys with complex sampling design The case of the National Longitudinal Survey of Children and Youth in Canada Background André Cyr and Alexander
More informationFIGURE 1-The Cognitive Model. Core belief. I m incompetent. Intermediate belief. If I don t understand something perfectly, then I m dumb
FIGURE 1-The Cognitive Model Core belief I m incompetent Intermediate belief If I don t understand something perfectly, then I m dumb Situation Automatic thoughts Reactions Reading this book This is too
More informationThe Classification Accuracy of Measurement Decision Theory. Lawrence Rudner University of Maryland
Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago, April 23-25, 2003 The Classification Accuracy of Measurement Decision Theory Lawrence Rudner University
More informationInstrument equivalence across ethnic groups. Antonio Olmos (MHCD) Susan R. Hutchinson (UNC)
Instrument equivalence across ethnic groups Antonio Olmos (MHCD) Susan R. Hutchinson (UNC) Overview Instrument Equivalence Measurement Invariance Invariance in Reliability Scores Factorial Invariance Item
More informationinsight. Psychological tests to help support your work with medical patients
insight. Psychological tests to help support your work with medical patients C O M P R E H E N S I V E Shedding light on important issues Sometimes a closer view is all you need to find the answers you
More informationITEM RESPONSE THEORY ANALYSIS OF THE TOP LEADERSHIP DIRECTION SCALE
California State University, San Bernardino CSUSB ScholarWorks Electronic Theses, Projects, and Dissertations Office of Graduate Studies 6-2016 ITEM RESPONSE THEORY ANALYSIS OF THE TOP LEADERSHIP DIRECTION
More informationUsing Performance and Outcome Measures for Enhancing the Quality of Behavioral Health Care Services
Using Performance and Outcome Measures for Enhancing the Quality of Behavioral Health Care Services Dennis Morrison, PhD Chief Clinical Officer Netsmart Technologies @DrDennyM Knowledge Without Data is
More informationSampling for Impact Evaluation. Maria Jones 24 June 2015 ieconnect Impact Evaluation Workshop Rio de Janeiro, Brazil June 22-25, 2015
Sampling for Impact Evaluation Maria Jones 24 June 2015 ieconnect Impact Evaluation Workshop Rio de Janeiro, Brazil June 22-25, 2015 How many hours do you expect to sleep tonight? A. 2 or less B. 3 C.
More informationINTRODUCTION TO ITEM RESPONSE THEORY APPLIED TO FOOD SECURITY MEASUREMENT. Basic Concepts, Parameters and Statistics
INTRODUCTION TO ITEM RESPONSE THEORY APPLIED TO FOOD SECURITY MEASUREMENT Basic Concepts, Parameters and Statistics The designations employed and the presentation of material in this information product
More informationRasch-scaling of PHQ-9 and GAD-7 Consequences for repeated assessments. Jan R. Böhnke & Jaime Delgadillo
Rasch-scaling of PHQ-9 and GAD-7 Consequences for repeated assessments Jan R. Böhnke & Jaime Delgadillo NHS::IAPT Improving Access to Psychological Therapies "Improving Access to Psychological Therapies
More information2 Types of psychological tests and their validity, precision and standards
2 Types of psychological tests and their validity, precision and standards Tests are usually classified in objective or projective, according to Pasquali (2008). In case of projective tests, a person is
More informationManifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement Invariance Tests Of Multi-Group Confirmatory Factor Analyses
Journal of Modern Applied Statistical Methods Copyright 2005 JMASM, Inc. May, 2005, Vol. 4, No.1, 275-282 1538 9472/05/$95.00 Manifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement
More informationThe Value of an Electronic Health Record System in Lean Process Improvement and Research Innovation
POMS College of Service Operations Research Incubator, April 30, 2009, Orlando, FL The Value of an Electronic Health Record System in Lean Process Improvement and Research Innovation Linda LaGanga, Ph.D.
More informationChildren's Depression Inventory 2nd Edition: Parent Maria Kovacs, Ph.D.
Children's Depression Inventory 2nd Edition: Parent Maria Kovacs, Ph.D. Assessment Report This Assessment Report is intended for use by qualified assessors only, and is not to be shown or presented to
More informationTechnical Specifications
Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically
More informationThese questionnaires are used by psychology services to help us understand how people feel. One questionnaire measures how sad people feel.
ADAPTED PHQ-9 & GAD-7 QUESTIONNAIRES How to fill in these questionnaires: These questionnaires are used by psychology services to help us understand how people feel. One questionnaire measures how sad
More informationDuring the past century, mathematics
An Evaluation of Mathematics Competitions Using Item Response Theory Jim Gleason During the past century, mathematics competitions have become part of the landscape in mathematics education. The first
More informationKnown-Groups Validity 2017 FSSE Measurement Invariance
Known-Groups Validity 2017 FSSE Measurement Invariance A key assumption of any latent measure (any questionnaire trying to assess an unobservable construct) is that it functions equally across all different
More informationModel fit and robustness? - A critical look at the foundation of the PISA project
Model fit and robustness? - A critical look at the foundation of the PISA project Svend Kreiner, Dept. of Biostatistics, Univ. of Copenhagen TOC The PISA project and PISA data PISA methodology Rasch item
More informationThe Lens Model and Linear Models of Judgment
John Miyamoto Email: jmiyamot@uw.edu October 3, 2017 File = D:\P466\hnd02-1.p466.a17.docm 1 http://faculty.washington.edu/jmiyamot/p466/p466-set.htm Psych 466: Judgment and Decision Making Autumn 2017
More informationOn indirect measurement of health based on survey data. Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state
On indirect measurement of health based on survey data Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state A scaling model: P(Y 1,..,Y k ;α, ) α = item difficulties
More informationMantel-Haenszel Procedures for Detecting Differential Item Functioning
A Comparison of Logistic Regression and Mantel-Haenszel Procedures for Detecting Differential Item Functioning H. Jane Rogers, Teachers College, Columbia University Hariharan Swaminathan, University of
More informationValidity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971)
Ch. 5: Validity Validity History Griggs v. Duke Power Ricci vs. DeStefano Defining Validity Aspects of Validity Face Validity Content Validity Criterion Validity Construct Validity Reliability vs. Validity
More informationEVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS
DePaul University INTRODUCTION TO ITEM ANALYSIS: EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS Ivan Hernandez, PhD OVERVIEW What is Item Analysis? Overview Benefits of Item Analysis Applications Main
More informationPart 5. Clare s Recovery Story
Part 5 Clare s Recovery Story Clare s story is about someone who used exposure and habituation to treat her panic disorder with agoraphobia. Exposure and habituation is a technique that breaks the cycle
More informationabout Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat.
Brad Pilon 1 2 3 ! For many people, the best thing about Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat.! However, this still means there
More informationAuthor s response to reviews
Author s response to reviews Title: The validity of a professional competence tool for physiotherapy students in simulationbased clinical education: a Rasch analysis Authors: Belinda Judd (belinda.judd@sydney.edu.au)
More informationRUNNING HEAD: EVALUATING SCIENCE STUDENT ASSESSMENT. Evaluating and Restructuring Science Assessments: An Example Measuring Student s
RUNNING HEAD: EVALUATING SCIENCE STUDENT ASSESSMENT Evaluating and Restructuring Science Assessments: An Example Measuring Student s Conceptual Understanding of Heat Kelly D. Bradley, Jessica D. Cunningham
More informationMEASURING MIDDLE GRADES STUDENTS UNDERSTANDING OF FORCE AND MOTION CONCEPTS: INSIGHTS INTO THE STRUCTURE OF STUDENT IDEAS
MEASURING MIDDLE GRADES STUDENTS UNDERSTANDING OF FORCE AND MOTION CONCEPTS: INSIGHTS INTO THE STRUCTURE OF STUDENT IDEAS The purpose of this study was to create an instrument that measures middle grades
More informationAPPLYING THE RASCH MODEL TO PSYCHO-SOCIAL MEASUREMENT A PRACTICAL APPROACH
APPLYING THE RASCH MODEL TO PSYCHO-SOCIAL MEASUREMENT A PRACTICAL APPROACH Margaret Wu & Ray Adams Documents supplied on behalf of the authors by Educational Measurement Solutions TABLE OF CONTENT CHAPTER
More informationAssessment: Interviews, Tests, Techniques. Clinical Psychology Lectures
Lecture 6 Assessment: Interviews, Tests, Techniques Clinical Psychology Lectures Psychodiagnostic Assessment Also termed: personality assessment, diagnostic assessment, pretreatment assessments or psychological
More informationCLINICAL NEUROPSYCHOLOGY AND COGNITIVE NEUROLOGY OF PARKINSON'S DISEASE AND OTHER MOVEMENT DISORDERS BY ALEXANDER I. TROSTER
CLINICAL NEUROPSYCHOLOGY AND COGNITIVE NEUROLOGY OF PARKINSON'S DISEASE AND OTHER MOVEMENT DISORDERS BY ALEXANDER I. TROSTER DOWNLOAD EBOOK : CLINICAL NEUROPSYCHOLOGY AND COGNITIVE MOVEMENT DISORDERS BY
More informationIntroduction to Item Response Theory
Introduction to Item Response Theory Prof John Rust, j.rust@jbs.cam.ac.uk David Stillwell, ds617@cam.ac.uk Aiden Loe, bsl28@cam.ac.uk Luning Sun, ls523@cam.ac.uk www.psychometrics.cam.ac.uk Goals Build
More informationCYRINUS B. ESSEN, IDAKA E. IDAKA AND MICHAEL A. METIBEMU. (Received 31, January 2017; Revision Accepted 13, April 2017)
DOI: http://dx.doi.org/10.4314/gjedr.v16i2.2 GLOBAL JOURNAL OF EDUCATIONAL RESEARCH VOL 16, 2017: 87-94 COPYRIGHT BACHUDO SCIENCE CO. LTD PRINTED IN NIGERIA. ISSN 1596-6224 www.globaljournalseries.com;
More informationTurning Output of Item Response Theory Data Analysis into Graphs with R
Overview Turning Output of Item Response Theory Data Analysis into Graphs with R Motivation Importance of graphing data Graphical methods for item response theory Why R? Two examples Ching-Fan Sheu, Cheng-Te
More informationExamining Factors Affecting Language Performance: A Comparison of Three Measurement Approaches
Pertanika J. Soc. Sci. & Hum. 21 (3): 1149-1162 (2013) SOCIAL SCIENCES & HUMANITIES Journal homepage: http://www.pertanika.upm.edu.my/ Examining Factors Affecting Language Performance: A Comparison of
More information1. Evaluate the methodological quality of a study with the COSMIN checklist
Answers 1. Evaluate the methodological quality of a study with the COSMIN checklist We follow the four steps as presented in Table 9.2. Step 1: The following measurement properties are evaluated in the
More informationIn this session we re focusing on a high level overview of the Collaborative Care approach.
In this session we re focusing on a high level overview of the Collaborative Care approach. 1 The evidence base for collaborative care is extremely strong. More than 80 RCTs demonstrate this approach is
More informationBasic Immunology: Functions And Disorders Of The Immune System, 4e PDF
Basic Immunology: Functions And Disorders Of The Immune System, 4e PDF Understand all the essential concepts in immunology with Basic Immunology: Functions and Disorders of the Immune System! This concise,
More informationDevelopment of a descriptive fit statistic for the Rasch model
Development of a descriptive fit statistic for the Rasch model Purya Baghaei, Takuya Yanagida, Moritz Heene To cite this version: Purya Baghaei, Takuya Yanagida, Moritz Heene. Development of a descriptive
More informationImproving Measurement of Ambiguity Tolerance (AT) Among Teacher Candidates. Kent Rittschof Department of Curriculum, Foundations, & Reading
Improving Measurement of Ambiguity Tolerance (AT) Among Teacher Candidates Kent Rittschof Department of Curriculum, Foundations, & Reading What is Ambiguity Tolerance (AT) and why should it be measured?
More informationRunning head: PRELIM KSVS SCALES 1
Running head: PRELIM KSVS SCALES 1 Psychometric Examination of a Risk Perception Scale for Evaluation Anthony P. Setari*, Kelly D. Bradley*, Marjorie L. Stanek**, & Shannon O. Sampson* *University of Kentucky
More informationPsychology UNIT 1: PSYCHOLOGY AS A SCIENCE. Core
Core provides a solid overview of the field's major domains: methods, biopsychology, cognitive and developmental psychology, and variations in individual and group behavior. By focusing on significant
More informationTasks of Executive Control TEC. Interpretive Report. Developed by Peter K. Isquith, PhD, Robert M. Roth, PhD, Gerard A. Gioia, PhD, and PAR Staff
Tasks of Executive Control TEC Interpretive Report Developed by Peter K. Isquith, PhD, Robert M. Roth, PhD, Gerard A. Gioia, PhD, and PAR Staff Client Information Client Name: Sample Client Client ID:
More informationDevelopment of the Mental, Emotional, and Bodily Toughness Inventory in Collegiate Athletes and Nonathletes
Journal of Athletic Training 2008;43(2):125 132 g by the National Athletic Trainers Association, Inc www.nata.org/jat original research Development of the Mental, Emotional, and Bodily Toughness Inventory
More informationFacebook Therapy? Why Do People Share Self-Relevant Content Online? Eva Buechel. University of Miami. Jonah Berger. University of Pennsylvania
Facebook Therapy 1 Facebook Therapy? Why Do People Share Self-Relevant Content Online? Eva Buechel University of Miami Jonah Berger University of Pennsylvania Word Count: 1000 Contact Information: Jonah
More informationMULTIPLE-CHOICE ITEMS ANALYSIS USING CLASSICAL TEST THEORY AND RASCH MEASUREMENT MODEL
Man In India, 96 (1-2) : 173-181 Serials Publications MULTIPLE-CHOICE ITEMS ANALYSIS USING CLASSICAL TEST THEORY AND RASCH MEASUREMENT MODEL Adibah Binti Abd Latif 1*, Ibnatul Jalilah Yusof 1, Nor Fadila
More informationMental Toughness overview
Mental Toughness overview Mental health: a state of well-being Mental health is defined as a state of well-being in which every individual realizes his or her own potential, can cope with the normal stresses
More informationChapter 7: Descriptive Statistics
Chapter Overview Chapter 7 provides an introduction to basic strategies for describing groups statistically. Statistical concepts around normal distributions are discussed. The statistical procedures of
More informationREPORT. Technical Report: Item Characteristics. Jessica Masters
August 2010 REPORT Diagnostic Geometry Assessment Project Technical Report: Item Characteristics Jessica Masters Technology and Assessment Study Collaborative Lynch School of Education Boston College Chestnut
More informationThe Psychometric Principles Maximizing the quality of assessment
Summer School 2009 Psychometric Principles Professor John Rust University of Cambridge The Psychometric Principles Maximizing the quality of assessment Reliability Validity Standardisation Equivalence
More informationMeasuring mathematics anxiety: Paper 2 - Constructing and validating the measure. Rob Cavanagh Len Sparrow Curtin University
Measuring mathematics anxiety: Paper 2 - Constructing and validating the measure Rob Cavanagh Len Sparrow Curtin University R.Cavanagh@curtin.edu.au Abstract The study sought to measure mathematics anxiety
More informationwas also my mentor, teacher, colleague, and friend. It is tempting to review John Horn s main contributions to the field of intelligence by
Horn, J. L. (1965). A rationale and test for the number of factors in factor analysis. Psychometrika, 30, 179 185. (3362 citations in Google Scholar as of 4/1/2016) Who would have thought that a paper
More informationReport for the TEWV Children and Young People s Directorate Clinical Audit & Effectiveness Annual Presentation Day 2014.
Report for the TEWV Children and Young People s Directorate Clinical Audit & Effectiveness Annual Presentation Day 204. Acknowledgements We would like to thank the Foundation for Professionals in Services
More informationThe Mental Toughness Development Toolkit
The Mental Toughness Development Toolkit The Mental Toughness Development Toolkit is an array of experiential exercises which can be run with individuals and/or small and large groups. It also works well
More informationUsing Generalizability Theory to Investigate the Psychometric Property of an Assessment Center in Indonesia
Using Generalizability Theory to Investigate the Psychometric Property of an Assessment Center in Indonesia Urip Purwono received his Ph.D. (psychology) from the University of Massachusetts at Amherst,
More informationTHE TEST STATISTICS REPORT provides a synopsis of the test attributes and some important statistics. A sample is shown here to the right.
THE TEST STATISTICS REPORT provides a synopsis of the test attributes and some important statistics. A sample is shown here to the right. The Test reliability indicators are measures of how well the questions
More informationlinking in educational measurement: Taking differential motivation into account 1
Selecting a data collection design for linking in educational measurement: Taking differential motivation into account 1 Abstract In educational measurement, multiple test forms are often constructed to
More informationGoldsmith. Marshall. FEEDFORWARD Coaching For Your Future. Coaching For Your Future. MMIX ChartHouse Learning. All Rights Reserved. What Is It?
FOR PREVIEW ONLY Marshall Goldsmith Table of Contents.............................................. 2 Get 360 Feedback...3 Pick a Behavior...4 Respond.......................................... 4 Involve...6
More informationEdinburgh Research Explorer
Edinburgh Research Explorer A reaction time study of responses to trait and ability emotional intelligence test items Citation for published version: Austin, EJ 2009, 'A reaction time study of responses
More informationThe Influence of Test Characteristics on the Detection of Aberrant Response Patterns
The Influence of Test Characteristics on the Detection of Aberrant Response Patterns Steven P. Reise University of California, Riverside Allan M. Due University of Minnesota Statistical methods to assess
More informationComparison of Computerized Adaptive Testing and Classical Methods for Measuring Individual Change
Comparison of Computerized Adaptive Testing and Classical Methods for Measuring Individual Change Gyenam Kim Kang Korea Nazarene University David J. Weiss University of Minnesota Presented at the Item
More informationProgram Evaluation Presentation
Program Evaluation Presentation Rosalie Hall Young Parent Employment Program October 19, 2011 Young Parent Employment Program (YPEP) YPEP is federally funded employment program with 2.5 FTE of staff allocated
More informationUnderstanding University Students Implicit Theories of Willpower for Strenuous Mental Activities
Understanding University Students Implicit Theories of Willpower for Strenuous Mental Activities Success in college is largely dependent on students ability to regulate themselves independently (Duckworth
More informationConnecting Career and Mental Health Counseling: Integrating Theory and Practice
Connecting Career and Mental Health Counseling: Integrating Theory and Practice Janet G. Lenz, Ph.D. Gary W. Peterson, Ph.D. Robert C. Reardon, Ph.D. Andrew R. Morrison, M.S. Florida State University Denise
More informationAn Introduction to Missing Data in the Context of Differential Item Functioning
A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute
More informationObjective List Theories
2015.09.23 Objective List Theories Table of contents 1 Ideal Desire Satisfaction 2 Remote Desires 3 Objective List Theories Desires that would harm one if satisfied Gasoline and water. Suicidal teenager.
More informationReflect on the Types of Organizational Structures. Hierarch of Needs Abraham Maslow (1970) Hierarchy of Needs
Reflect on the Types of Organizational Structures 1 Hierarch of Needs Abraham Maslow (1970) Self- Actualization or Self- Fulfillment Esteem Belonging, Love, and Social Activities Safety and Security Psychological
More informationPsychometrics in context: Test Construction with IRT. Professor John Rust University of Cambridge
Psychometrics in context: Test Construction with IRT Professor John Rust University of Cambridge Plan Guttman scaling Guttman errors and Loevinger s H statistic Non-parametric IRT Traces in Stata Parametric
More information