Analyzing Teacher Professional Standards as Latent Factors of Assessment Data: The Case of Teacher Test-English in Saudi Arabia
|
|
- Cecil Simmons
- 5 years ago
- Views:
Transcription
1 Analyzing Teacher Professional Standards as Latent Factors of Assessment Data: The Case of Teacher Test-English in Saudi Arabia 1
2 Introduction The Teacher Test-English (TT-E) is administered by the NCA as an assessment tool for teacher certification in KSA. The 69 operational test items used in this study are grouped into five content domains: Domain 1: Language Pedagogy (29 items), Domain 2: Curriculum Design (15 items), Domain 3: Theoretical Knowledge (13 items), Domain 4. Theoretical Application (8 items), and Domain 5. Language Proficiency (4 items). These domains are further operationalized into 16 professional standards (PSs) Domain 1: Language Pedagogy (29 items), Domain 2: Curriculum Design (15 items), Domain 3: Theoretical Knowledge (13 items), Domain 4. Theoretical Application (8 2
3 3
4 4
5 Purpose of the study The purpose is to examine the performance of English language teacher candidates on 16 professional standards (PSs) targeted with the TT-E and to test for gender differences on the entire TT-E construct and separately by PSs. To perform analyses at true score (error-free) level, the TT-E construct and individual PSs are treated as latent factors in the framework of confirmatory factor analysis (CFA). Testing for preliminary assumptions required under this approach is also targeted with this study (e.g., testing for differential item functioning of TT-E items for gender). 5
6 Research Questions 6
7 Method Data: The binary scores (1 = correct,0 = incorrect) of 19,167 English language teacher candidates on 69 items of the TT-E; (59.8% females and 40.2% males). Statistical Analysis: Latent variable modeling (LVM) in CFA framework using the computer program Mplus (Muthén & Muthén, 2010). The testing for differential item functioning (DIF) was performed via IRT analysis using the computer program Xcalibre 4.2 (Guyer & Thompson, 2012). 7
8 Results The results related to RQ1 and RQ2, obtained within CFA, show that the TT-E data are essentially unidimensional; that is, there is one dominant ability measured by TT-E. Also, the one-factor CFA model fit the data separately for males and females; i.e., there is configural invariance of the TT-E structure across gender. The score reliability, estimated via a latent variable modeling approach, was found to be for the entire sample, for the sample of males, and for the sample of females. 8
9 Testing for Data Fit of the One-Factor CFA Model With TT-E Data: Total Sample and by Gender 9
10 10
11 RQ4 results: Correlations Among Professional Standards 11
12 12
13 Results on RQ5 (b): CFA-based MIMIC model for gender differences on 16 professional standards (PS1,, PS16) 13
14 Females outperform males on all PSs, except in the case of no gender difference on one professional standard, PS11: Teachers have a general knowledge of the language as a system. 14
15 Gender differences on domains of professional standards 15
16 16
17 17
18 Conclusion 1. The TT-E data are essentially unidimensional; i.e., there is one dominant ability that underlies TT-E data, referred to as General Teacher Ability on Professional Standards (GTAPS). 2. The unidimensional factor structure of the TT-E holds also for males and females separately. 3. There is no differential item functioning (DIF) of TT-E items against males or females; i.e., examinees with the same ability (same GTAPS) have equal chances to answer any item correctly regardless of their gender. 18
19 4. Females outperform males on the general TT-E ability (GTAPS scores), with a difference of medium effect size. 5. There is no gender difference on PS11: Teachers have a general knowledge of the language as a system but females do better than males on all other 15 PSs, with a difference of small effect size on three PSs (PS3, PS6, and PS13) and medium effect size on 12 PSs (PS1, PS2, PS4, PS5, PS7, PS8, PS9, PS10, PS12, PS14, PS15, and PS16). 6. The pattern of differences among PSs (individually or grouped into domains) is similar for males and females, but the PS performance of females is higher compared to males. 19
20 Thank You! SHUKRAN 20
Brent Duckor Ph.D. (SJSU) Kip Tellez, Ph.D. (UCSC) BEAR Seminar April 22, 2014
Brent Duckor Ph.D. (SJSU) Kip Tellez, Ph.D. (UCSC) BEAR Seminar April 22, 2014 Studies under review ELA event Mathematics event Duckor, B., Castellano, K., Téllez, K., & Wilson, M. (2013, April). Validating
More informationInvestigating the Invariance of Person Parameter Estimates Based on Classical Test and Item Response Theories
Kamla-Raj 010 Int J Edu Sci, (): 107-113 (010) Investigating the Invariance of Person Parameter Estimates Based on Classical Test and Item Response Theories O.O. Adedoyin Department of Educational Foundations,
More informationUsing the Rasch Modeling for psychometrics examination of food security and acculturation surveys
Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys Jill F. Kilanowski, PhD, APRN,CPNP Associate Professor Alpha Zeta & Mu Chi Acknowledgements Dr. Li Lin,
More informationContents. What is item analysis in general? Psy 427 Cal State Northridge Andrew Ainsworth, PhD
Psy 427 Cal State Northridge Andrew Ainsworth, PhD Contents Item Analysis in General Classical Test Theory Item Response Theory Basics Item Response Functions Item Information Functions Invariance IRT
More informationManifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement Invariance Tests Of Multi-Group Confirmatory Factor Analyses
Journal of Modern Applied Statistical Methods Copyright 2005 JMASM, Inc. May, 2005, Vol. 4, No.1, 275-282 1538 9472/05/$95.00 Manifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement
More informationPSYCHOMETRICS APPLIED TO HEALTHCARE PROFESSIONS EDUCATION
PSYCHOMETRICS APPLIED TO HEALTHCARE PROFESSIONS EDUCATION COURSE PROGRAMME Psychometric properties such as reliability and validity are essential components in the utility of assessment in medical education.
More informationGENERALIZABILITY AND RELIABILITY: APPROACHES FOR THROUGH-COURSE ASSESSMENTS
GENERALIZABILITY AND RELIABILITY: APPROACHES FOR THROUGH-COURSE ASSESSMENTS Michael J. Kolen The University of Iowa March 2011 Commissioned by the Center for K 12 Assessment & Performance Management at
More informationBruno D. Zumbo, Ph.D. University of Northern British Columbia
Bruno Zumbo 1 The Effect of DIF and Impact on Classical Test Statistics: Undetected DIF and Impact, and the Reliability and Interpretability of Scores from a Language Proficiency Test Bruno D. Zumbo, Ph.D.
More informationPublished by European Centre for Research Training and Development UK (
DETERMINATION OF DIFFERENTIAL ITEM FUNCTIONING BY GENDER IN THE NATIONAL BUSINESS AND TECHNICAL EXAMINATIONS BOARD (NABTEB) 2015 MATHEMATICS MULTIPLE CHOICE EXAMINATION Kingsley Osamede, OMOROGIUWA (Ph.
More informationInfluences of IRT Item Attributes on Angoff Rater Judgments
Influences of IRT Item Attributes on Angoff Rater Judgments Christian Jones, M.A. CPS Human Resource Services Greg Hurt!, Ph.D. CSUS, Sacramento Angoff Method Assemble a panel of subject matter experts
More informationOn indirect measurement of health based on survey data. Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state
On indirect measurement of health based on survey data Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state A scaling model: P(Y 1,..,Y k ;α, ) α = item difficulties
More information1. Evaluate the methodological quality of a study with the COSMIN checklist
Answers 1. Evaluate the methodological quality of a study with the COSMIN checklist We follow the four steps as presented in Table 9.2. Step 1: The following measurement properties are evaluated in the
More informationMeasurement Invariance (MI): a general overview
Measurement Invariance (MI): a general overview Eric Duku Offord Centre for Child Studies 21 January 2015 Plan Background What is Measurement Invariance Methodology to test MI Challenges with post-hoc
More informationHaving your cake and eating it too: multiple dimensions and a composite
Having your cake and eating it too: multiple dimensions and a composite Perman Gochyyev and Mark Wilson UC Berkeley BEAR Seminar October, 2018 outline Motivating example Different modeling approaches Composite
More informationAssessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies. Xiaowen Zhu. Xi an Jiaotong University.
Running head: ASSESS MEASUREMENT INVARIANCE Assessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies Xiaowen Zhu Xi an Jiaotong University Yanjie Bian Xi an Jiaotong
More informationTechnical Specifications
Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically
More informationITEM RESPONSE THEORY ANALYSIS OF THE TOP LEADERSHIP DIRECTION SCALE
California State University, San Bernardino CSUSB ScholarWorks Electronic Theses, Projects, and Dissertations Office of Graduate Studies 6-2016 ITEM RESPONSE THEORY ANALYSIS OF THE TOP LEADERSHIP DIRECTION
More informationPurpose of Workshop. Faculty. Culturally Sensitive Research. ISOQOL Workshop 19 Oct 2005
Introduction to ing Health-Related Measures in Diverse Populations Pre-conference Workshop 2 The International Society for Quality of Life Research Center for Aging in Diverse Communities: A Resource Center
More informationThe Effect of Guessing on Assessing Dimensionality in Multiple-Choice Tests: A Monte Carlo Study with Application. Chien-Chi Yeh
The Effect of Guessing on Assessing Dimensionality in Multiple-Choice Tests: A Monte Carlo Study with Application by Chien-Chi Yeh B.S., Chung Yuan Christian University, 1988 M.Ed., National Tainan Teachers
More informationMethodological Issues in Measuring the Development of Character
Methodological Issues in Measuring the Development of Character Noel A. Card Department of Human Development and Family Studies College of Liberal Arts and Sciences Supported by a grant from the John Templeton
More informationProceedings of the 2011 International Conference on Teaching, Learning and Change (c) International Association for Teaching and Learning (IATEL)
EVALUATION OF MATHEMATICS ACHIEVEMENT TEST: A COMPARISON BETWEEN CLASSICAL TEST THEORY (CTT)AND ITEM RESPONSE THEORY (IRT) Eluwa, O. Idowu 1, Akubuike N. Eluwa 2 and Bekom K. Abang 3 1& 3 Dept of Educational
More informationAndré Cyr and Alexander Davies
Item Response Theory and Latent variable modeling for surveys with complex sampling design The case of the National Longitudinal Survey of Children and Youth in Canada Background André Cyr and Alexander
More informationConstruct Invariance of the Survey of Knowledge of Internet Risk and Internet Behavior Knowledge Scale
University of Connecticut DigitalCommons@UConn NERA Conference Proceedings 2010 Northeastern Educational Research Association (NERA) Annual Conference Fall 10-20-2010 Construct Invariance of the Survey
More informationEXPLORING DIFFERENTIAL ITEM FUNCTIONING AMONG HAWAI I RESIDENTS ON THE BOSTON NAMING TEST
EXPLORING DIFFERENTIAL ITEM FUNCTIONING AMONG HAWAI I RESIDENTS ON THE BOSTON NAMING TEST A THESIS SUBMITTED TO THE GRADUATE DIVISION OF THE UNIVERSITY OF HAWAI I AT MĀNOA IN PARTIAL FULFILLMENT OF THE
More informationItem Analysis: Classical and Beyond
Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Modified for EPE/EDP 711 by Kelly Bradley on January 8, 2013 Why is item analysis relevant? Item analysis provides
More informationModels in Educational Measurement
Models in Educational Measurement Jan-Eric Gustafsson Department of Education and Special Education University of Gothenburg Background Measurement in education and psychology has increasingly come to
More informationPsychometric Methods for Investigating DIF and Test Bias During Test Adaptation Across Languages and Cultures
Psychometric Methods for Investigating DIF and Test Bias During Test Adaptation Across Languages and Cultures Bruno D. Zumbo, Ph.D. Professor University of British Columbia Vancouver, Canada Presented
More informationOn the usefulness of the CEFR in the investigation of test versions content equivalence HULEŠOVÁ, MARTINA
On the usefulness of the CEFR in the investigation of test versions content equivalence HULEŠOVÁ, MARTINA MASARY K UNIVERSITY, CZECH REPUBLIC Overview Background and research aims Focus on RQ2 Introduction
More informationDevelopment, Standardization and Application of
American Journal of Educational Research, 2018, Vol. 6, No. 3, 238-257 Available online at http://pubs.sciepub.com/education/6/3/11 Science and Education Publishing DOI:10.12691/education-6-3-11 Development,
More informationDifferential Item Functioning
Differential Item Functioning Lecture #11 ICPSR Item Response Theory Workshop Lecture #11: 1of 62 Lecture Overview Detection of Differential Item Functioning (DIF) Distinguish Bias from DIF Test vs. Item
More informationThe Classification Accuracy of Measurement Decision Theory. Lawrence Rudner University of Maryland
Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago, April 23-25, 2003 The Classification Accuracy of Measurement Decision Theory Lawrence Rudner University
More informationComparing DIF methods for data with dual dependency
DOI 10.1186/s40536-016-0033-3 METHODOLOGY Open Access Comparing DIF methods for data with dual dependency Ying Jin 1* and Minsoo Kang 2 *Correspondence: ying.jin@mtsu.edu 1 Department of Psychology, Middle
More informationPaul Irwing, Manchester Business School
Paul Irwing, Manchester Business School Factor analysis has been the prime statistical technique for the development of structural theories in social science, such as the hierarchical factor model of human
More informationCopyright. Hwa Young Lee
Copyright by Hwa Young Lee 2012 The Dissertation Committee for Hwa Young Lee certifies that this is the approved version of the following dissertation: Evaluation of Two Types of Differential Item Functioning
More informationEmpowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison
Empowered by Psychometrics The Fundamentals of Psychometrics Jim Wollack University of Wisconsin Madison Psycho-what? Psychometrics is the field of study concerned with the measurement of mental and psychological
More informationSocial Studies 4 8 (118)
Purpose Social Studies 4 8 (118) The purpose of the Social Studies 4 8 test is to measure the requisite knowledge and skills that an entry-level educator in this field in Texas public schools must possess.
More informationComprehensive Statistical Analysis of a Mathematics Placement Test
Comprehensive Statistical Analysis of a Mathematics Placement Test Robert J. Hall Department of Educational Psychology Texas A&M University, USA (bobhall@tamu.edu) Eunju Jung Department of Educational
More informationAN ASSESSMENT OF ITEM BIAS USING DIFFERENTIAL ITEM FUNCTIONING TECHNIQUE IN NECO BIOLOGY CONDUCTED EXAMINATIONS IN TARABA STATE NIGERIA
American International Journal of Research in Humanities, Arts and Social Sciences Available online at http://www.iasir.net ISSN (Print): 2328-3734, ISSN (Online): 2328-3696, ISSN (CD-ROM): 2328-3688 AIJRHASS
More informationUSING MULTIDIMENSIONAL ITEM RESPONSE THEORY TO REPORT SUBSCORES ACROSS MULTIPLE TEST FORMS. Jing-Ru Xu
USING MULTIDIMENSIONAL ITEM RESPONSE THEORY TO REPORT SUBSCORES ACROSS MULTIPLE TEST FORMS By Jing-Ru Xu A DISSERTATION Submitted to Michigan State University in partial fulfillment of the requirements
More informationCritical Evaluation of the Beach Center Family Quality of Life Scale (FQOL-Scale)
Critical Evaluation of the Beach Center Family Quality of Life Scale (FQOL-Scale) Alyssa Van Beurden M.Cl.Sc (SLP) Candidate University of Western Ontario: School of Communication Sciences and Disorders
More informationScaling TOWES and Linking to IALS
Scaling TOWES and Linking to IALS Kentaro Yamamoto and Irwin Kirsch March, 2002 In 2000, the Organization for Economic Cooperation and Development (OECD) along with Statistics Canada released Literacy
More informationProgressive Matrices
Seeing Reason: Visuospatial Ability, Sex Differences and the Raven s Progressive Matrices Nicolette Amanda Waschl School of Psychology, University of Adelaide A thesis submitted in fulfillment of the requirements
More informationCYRINUS B. ESSEN, IDAKA E. IDAKA AND MICHAEL A. METIBEMU. (Received 31, January 2017; Revision Accepted 13, April 2017)
DOI: http://dx.doi.org/10.4314/gjedr.v16i2.2 GLOBAL JOURNAL OF EDUCATIONAL RESEARCH VOL 16, 2017: 87-94 COPYRIGHT BACHUDO SCIENCE CO. LTD PRINTED IN NIGERIA. ISSN 1596-6224 www.globaljournalseries.com;
More informationConfirmatory Factor Analysis of Preschool Child Behavior Checklist (CBCL) (1.5 5 yrs.) among Canadian children
Confirmatory Factor Analysis of Preschool Child Behavior Checklist (CBCL) (1.5 5 yrs.) among Canadian children Dr. KAMALPREET RAKHRA MD MPH PhD(Candidate) No conflict of interest Child Behavioural Check
More informationConnexion of Item Response Theory to Decision Making in Chess. Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan
Connexion of Item Response Theory to Decision Making in Chess Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan Acknowledgement A few Slides have been taken from the following presentation
More informationSurvey Question. What are appropriate methods to reaffirm the fairness, validity reliability and general performance of examinations?
Clause 9.3.5 Appropriate methodology and procedures (e.g. collecting and maintaining statistical data) shall be documented and implemented in order to affirm, at justified defined intervals, the fairness,
More informationParameter Estimation with Mixture Item Response Theory Models: A Monte Carlo Comparison of Maximum Likelihood and Bayesian Methods
Journal of Modern Applied Statistical Methods Volume 11 Issue 1 Article 14 5-1-2012 Parameter Estimation with Mixture Item Response Theory Models: A Monte Carlo Comparison of Maximum Likelihood and Bayesian
More informationSelection and estimation in exploratory subgroup analyses a proposal
Selection and estimation in exploratory subgroup analyses a proposal Gerd Rosenkranz, Novartis Pharma AG, Basel, Switzerland EMA Workshop, London, 07-Nov-2014 Purpose of this presentation Proposal for
More informationSection 5. Field Test Analyses
Section 5. Field Test Analyses Following the receipt of the final scored file from Measurement Incorporated (MI), the field test analyses were completed. The analysis of the field test data can be broken
More informationSex Differences in Fluid Reasoning: Manifest and Latent Estimates from the Cognitive Abilities Test
J. Intell. 2014, 2, 36-55; doi:10.3390/jintelligence2020036 Article OPEN ACCESS Journal of Intelligence ISSN 2079-3200 www.mdpi.com/journal/jintelligence Sex Differences in Fluid Reasoning: Manifest and
More informationAn Exploratory Case Study of the Use of Video Digitizing Technology to Detect Answer-Copying on a Paper-and-Pencil Multiple-Choice Test
An Exploratory Case Study of the Use of Video Digitizing Technology to Detect Answer-Copying on a Paper-and-Pencil Multiple-Choice Test Carlos Zerpa and Christina van Barneveld Lakehead University czerpa@lakeheadu.ca
More informationBasic concepts and principles of classical test theory
Basic concepts and principles of classical test theory Jan-Eric Gustafsson What is measurement? Assignment of numbers to aspects of individuals according to some rule. The aspect which is measured must
More informationMeasures of children s subjective well-being: Analysis of the potential for cross-cultural comparisons
Measures of children s subjective well-being: Analysis of the potential for cross-cultural comparisons Ferran Casas & Gwyther Rees Children s subjective well-being A substantial amount of international
More informationAPPLYING THE RASCH MODEL TO PSYCHO-SOCIAL MEASUREMENT A PRACTICAL APPROACH
APPLYING THE RASCH MODEL TO PSYCHO-SOCIAL MEASUREMENT A PRACTICAL APPROACH Margaret Wu & Ray Adams Documents supplied on behalf of the authors by Educational Measurement Solutions TABLE OF CONTENT CHAPTER
More informationPsychometrics in context: Test Construction with IRT. Professor John Rust University of Cambridge
Psychometrics in context: Test Construction with IRT Professor John Rust University of Cambridge Plan Guttman scaling Guttman errors and Loevinger s H statistic Non-parametric IRT Traces in Stata Parametric
More informationMaximum Marginal Likelihood Bifactor Analysis with Estimation of the General Dimension as an Empirical Histogram
Maximum Marginal Likelihood Bifactor Analysis with Estimation of the General Dimension as an Empirical Histogram Li Cai University of California, Los Angeles Carol Woods University of Kansas 1 Outline
More informationAdaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida
Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida and Oleksandr S. Chernyshenko University of Canterbury Presented at the New CAT Models
More informationDetection of Gender related DIF in the Foreign Language Classroom Anxiety Scale
KURAM VE UYGULAMADA EĞİTİM BİLİMLERİ EDUCATIONAL SCIENCES: THEORY & PRACTICE Received: November 26, 2016 Accepted: February 5, 2018 OnlineFirst: March 5, 2018 Copyright 2018 EDAM www.estp.com.tr DOI 10.12738/estp.2018.1.0606
More informationImpact of Violation of the Missing-at-Random Assumption on Full-Information Maximum Likelihood Method in Multidimensional Adaptive Testing
A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute
More informationDetection of Differential Test Functioning (DTF) and Differential Item Functioning (DIF) in MCCQE Part II Using Logistic Models
Detection of Differential Test Functioning (DTF) and Differential Item Functioning (DIF) in MCCQE Part II Using Logistic Models Jin Gong University of Iowa June, 2012 1 Background The Medical Council of
More informationThree Generations of DIF Analyses: Considering Where It Has Been, Where It Is Now, and Where It Is Going
LANGUAGE ASSESSMENT QUARTERLY, 4(2), 223 233 Copyright 2007, Lawrence Erlbaum Associates, Inc. Three Generations of DIF Analyses: Considering Where It Has Been, Where It Is Now, and Where It Is Going HLAQ
More informationComputerized Mastery Testing
Computerized Mastery Testing With Nonequivalent Testlets Kathleen Sheehan and Charles Lewis Educational Testing Service A procedure for determining the effect of testlet nonequivalence on the operating
More informationParallel Forms for Diagnostic Purpose
Paper presented at AERA, 2010 Parallel Forms for Diagnostic Purpose Fang Chen Xinrui Wang UNCG, USA May, 2010 INTRODUCTION With the advancement of validity discussions, the measurement field is pushing
More informationUtilizing the NIH Patient-Reported Outcomes Measurement Information System
www.nihpromis.org/ Utilizing the NIH Patient-Reported Outcomes Measurement Information System Thelma Mielenz, PhD Assistant Professor, Department of Epidemiology Columbia University, Mailman School of
More informationRUNNING HEAD: THE SOCIAL PROVISIONS SCALE AND BI-FACTOR-ESEM
RUNNING HEAD: THE SOCIAL PROVISIONS SCALE AND BI-FACTOR-ESEM Construct Validity of the Social Provisions Scale: A Bifactor Exploratory Structural Equation Modeling Approach Harsha N. Perera University
More informationDifferential Item Functioning from a Compensatory-Noncompensatory Perspective
Differential Item Functioning from a Compensatory-Noncompensatory Perspective Terry Ackerman, Bruce McCollaum, Gilbert Ngerano University of North Carolina at Greensboro Motivation for my Presentation
More informationExamining Construct Stability Across Career Stage Cohorts
Eastern Kentucky University Encompass Online Theses and Dissertations Student Scholarship 2011 Examining Construct Stability Across Career Stage Cohorts Deborah L. Kinney Eastern Kentucky University Follow
More informationModel fit and robustness? - A critical look at the foundation of the PISA project
Model fit and robustness? - A critical look at the foundation of the PISA project Svend Kreiner, Dept. of Biostatistics, Univ. of Copenhagen TOC The PISA project and PISA data PISA methodology Rasch item
More informationINVESTIGATING FIT WITH THE RASCH MODEL. Benjamin Wright and Ronald Mead (1979?) Most disturbances in the measurement process can be considered a form
INVESTIGATING FIT WITH THE RASCH MODEL Benjamin Wright and Ronald Mead (1979?) Most disturbances in the measurement process can be considered a form of multidimensionality. The settings in which measurement
More informationA structural equation modeling approach for examining position effects in large scale assessments
DOI 10.1186/s40536-017-0042-x METHODOLOGY Open Access A structural equation modeling approach for examining position effects in large scale assessments Okan Bulut *, Qi Quo and Mark J. Gierl *Correspondence:
More informationMCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and. Lord Equating Methods 1,2
MCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and Lord Equating Methods 1,2 Lisa A. Keller, Ronald K. Hambleton, Pauline Parker, Jenna Copella University of Massachusetts
More informationExamination of the Application of Item Response Theory to the Angoff Standard Setting Procedure
University of Massachusetts Amherst ScholarWorks@UMass Amherst Open Access Dissertations 9-2013 Examination of the Application of Item Response Theory to the Angoff Standard Setting Procedure Jerome Cody
More informationRunning head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1. John M. Clark III. Pearson. Author Note
Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1 Nested Factor Analytic Model Comparison as a Means to Detect Aberrant Response Patterns John M. Clark III Pearson Author Note John M. Clark III,
More informationIncorporating Measurement Nonequivalence in a Cross-Study Latent Growth Curve Analysis
Structural Equation Modeling, 15:676 704, 2008 Copyright Taylor & Francis Group, LLC ISSN: 1070-5511 print/1532-8007 online DOI: 10.1080/10705510802339080 TEACHER S CORNER Incorporating Measurement Nonequivalence
More informationThe Self Group Distinction Scale: A new approach to measure individualism and collectivism in adolescents
Psychological Test and Assessment Modeling, Volume 56, 2014 (3), 304-313 The Self Group Distinction Scale: A new approach to measure individualism and collectivism in adolescents Takuya Yanagida 1, Dagmar
More informationUCLA UCLA Electronic Theses and Dissertations
UCLA UCLA Electronic Theses and Dissertations Title Detection of Differential Item Functioning in the Generalized Full-Information Item Bifactor Analysis Model Permalink https://escholarship.org/uc/item/3xd6z01r
More informationRegistered Radiologist Assistant (R.R.A. ) 2016 Examination Statistics
Registered Radiologist Assistant (R.R.A. ) Examination Statistics INTRODUCTION This report summarizes the results of the Registered Radiologist Assistant (R.R.A. ) examinations developed and administered
More informationEffects of Local Item Dependence
Effects of Local Item Dependence on the Fit and Equating Performance of the Three-Parameter Logistic Model Wendy M. Yen CTB/McGraw-Hill Unidimensional item response theory (IRT) has become widely used
More informationUvA-DARE (Digital Academic Repository)
UvA-DARE (Digital Academic Repository) Standaarden voor kerndoelen basisonderwijs : de ontwikkeling van standaarden voor kerndoelen basisonderwijs op basis van resultaten uit peilingsonderzoek van der
More informationHanne Søberg Finbråten 1,2*, Bodil Wilde-Larsson 2,3, Gun Nordström 3, Kjell Sverre Pettersen 4, Anne Trollvik 3 and Øystein Guttersrud 5
Finbråten et al. BMC Health Services Research (2018) 18:506 https://doi.org/10.1186/s12913-018-3275-7 RESEARCH ARTICLE Open Access Establishing the HLS-Q12 short version of the European Health Literacy
More informationA longitudinal comparison of depression in later life in the US and England
A longitudinal comparison of depression in later life in the US and England Bram Vanhoutte, Stephen Jivraj & James Nazroo Centre for Survey and Census Research, University of Manchester Elsa wave 5 Launch,
More informationDetermining Differential Item Functioning in Mathematics Word Problems Using Item Response Theory
Determining Differential Item Functioning in Mathematics Word Problems Using Item Response Theory Teodora M. Salubayba St. Scholastica s College-Manila dory41@yahoo.com Abstract Mathematics word-problem
More informationModel-based Diagnostic Assessment. University of Kansas Item Response Theory Stats Camp 07
Model-based Diagnostic Assessment University of Kansas Item Response Theory Stats Camp 07 Overview Diagnostic Assessment Methods (commonly called Cognitive Diagnosis). Why Cognitive Diagnosis? Cognitive
More informationTwo Studies Investigating the Reliability and Validity of the English ACTFL OPIc with Korean Test Takers
Two Studies Investigating the Reliability and Validity of the English ACTFL OPIc with Korean Test Takers The ACTFL OPIc Validation Project Technical Report Updated March 23, 2008 1 Authored by: Eric A.
More informationRUNNING HEAD: EVALUATING SCIENCE STUDENT ASSESSMENT. Evaluating and Restructuring Science Assessments: An Example Measuring Student s
RUNNING HEAD: EVALUATING SCIENCE STUDENT ASSESSMENT Evaluating and Restructuring Science Assessments: An Example Measuring Student s Conceptual Understanding of Heat Kelly D. Bradley, Jessica D. Cunningham
More informationIntroduction to Factor Analysis. Hsueh-Sheng Wu CFDR Workshop Series June 18, 2018
Introduction to Factor Analysis Hsueh-Sheng Wu CFDR Workshop Series June 18, 2018 1 Outline Why do sociologists need factor analysis? What is factor analysis? Sternberg s triangular love theory Some data
More informationPsychometrics for Beginners. Lawrence J. Fabrey, PhD Applied Measurement Professionals
Psychometrics for Beginners Lawrence J. Fabrey, PhD Applied Measurement Professionals Learning Objectives Identify key NCCA Accreditation requirements Identify two underlying models of measurement Describe
More informationAssessing Measurement Invariance of the Teachers Perceptions of Grading Practices Scale across Cultures
Assessing Measurement Invariance of the Teachers Perceptions of Grading Practices Scale across Cultures Xing Liu Assistant Professor Education Department Eastern Connecticut State University 83 Windham
More informationInitial Report on the Calibration of Paper and Pencil Forms UCLA/CRESST August 2015
This report describes the procedures used in obtaining parameter estimates for items appearing on the 2014-2015 Smarter Balanced Assessment Consortium (SBAC) summative paper-pencil forms. Among the items
More informationItem Analysis Explanation
Item Analysis Explanation The item difficulty is the percentage of candidates who answered the question correctly. The recommended range for item difficulty set forth by CASTLE Worldwide, Inc., is between
More informationThe Patient-Reported Outcomes Measurement Information
ORIGINAL ARTICLE Practical Issues in the Application of Item Response Theory A Demonstration Using Items From the Pediatric Quality of Life Inventory (PedsQL) 4.0 Generic Core Scales Cheryl D. Hill, PhD,*
More informationGeorge B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide
George B. Ploubidis The role of sensitivity analysis in the estimation of causal pathways from observational data Improving health worldwide www.lshtm.ac.uk Outline Sensitivity analysis Causal Mediation
More informationMultidimensional Item Response Theory in Clinical Measurement: A Bifactor Graded- Response Model Analysis of the Outcome- Questionnaire-45.
Brigham Young University BYU ScholarsArchive All Theses and Dissertations 2012-05-22 Multidimensional Item Response Theory in Clinical Measurement: A Bifactor Graded- Response Model Analysis of the Outcome-
More informationReferences. Embretson, S. E. & Reise, S. P. (2000). Item response theory for psychologists. Mahwah,
The Western Aphasia Battery (WAB) (Kertesz, 1982) is used to classify aphasia by classical type, measure overall severity, and measure change over time. Despite its near-ubiquitousness, it has significant
More informationDiagnostic Classification Models
Diagnostic Classification Models Lecture #13 ICPSR Item Response Theory Workshop Lecture #13: 1of 86 Lecture Overview Key definitions Conceptual example Example uses of diagnostic models in education Classroom
More informationBackground. Workshop. Using the WHOQOL in NZ
Using the WHOQOL in NZ Australasian Mental Health Outcomes Conference Workshop Using WHOQOL in New Zealand Prof Rex Billington, Dr Daniel Shepherd, & Dr Chris Krägeloh Rex Billington Chris Krageloh What
More informationCOMBINING SCALING AND CLASSIFICATION: A PSYCHOMETRIC MODEL FOR SCALING ABILITY AND DIAGNOSING MISCONCEPTIONS LAINE P. BRADSHAW
COMBINING SCALING AND CLASSIFICATION: A PSYCHOMETRIC MODEL FOR SCALING ABILITY AND DIAGNOSING MISCONCEPTIONS by LAINE P. BRADSHAW (Under the Direction of Jonathan Templin and Karen Samuelsen) ABSTRACT
More informationFundamental Concepts for Using Diagnostic Classification Models. Section #2 NCME 2016 Training Session. NCME 2016 Training Session: Section 2
Fundamental Concepts for Using Diagnostic Classification Models Section #2 NCME 2016 Training Session NCME 2016 Training Session: Section 2 Lecture Overview Nature of attributes What s in a name? Grain
More informationTurning Output of Item Response Theory Data Analysis into Graphs with R
Overview Turning Output of Item Response Theory Data Analysis into Graphs with R Motivation Importance of graphing data Graphical methods for item response theory Why R? Two examples Ching-Fan Sheu, Cheng-Te
More informationA Moderated Nonlinear Factor Model for the Development of Commensurate Measures in Integrative Data Analysis
Multivariate Behavioral Research, 49:214 231, 2014 Copyright C Taylor & Francis Group, LLC ISSN: 0027-3171 print / 1532-7906 online DOI: 10.1080/00273171.2014.889594 A Moderated Nonlinear Factor Model
More information