Psychometric Details of the 20-Item UFFM-I Conscientiousness Scale

Documentation Prepared By: Nathan T. Carter & Rachel L. Williamson
Applied Psychometric Laboratory at The University of Georgia
Last Updated: July 2014

Development of the UFFM-I was supported by funding from the UGA Provost Research Seed Grant awarded to the first author.

Note: This scale may be used freely by any not-for-profit researcher or practitioner with appropriate citation of the scale and its scoring program in resulting publications and/or reports. For-profit entities must obtain permission from the first author.

1. Content Development and Item Pool Selection

This document describes the technical details of the Unfolding Five Factor Model (UFFM-I) 20-item Conscientiousness scale and reports its psychometric properties. The UFFM-I Conscientiousness scale was developed by first creating six 9-item facet scales according to the Costa and McCrae (1995) structure of FFM trait facets:

   1. Competence
   2. Orderliness
   3. Dutifulness
   4. Achievement Motivation
   5. Self-Discipline
   6. Cautiousness

From these six scales, two 14-item measures were developed to represent the higher-order factors, Orderliness and Industriousness, identified by DeYoung et al. (2007).

Items were written by graduate and undergraduate research assistants in the Applied Psychometric Laboratory trained by the author. For each facet, each of three item writers wrote 60 items (20 positively worded, 20 negatively worded, and 20 moderately worded), resulting in a pool of 180 items per facet. This pool of items was checked for quality of wording and redundancies. The resulting item pool (approximately 120 items per facet) was then evaluated independently by four undergraduate research assistants, who (1) rated the extremity of each item on a 7-point scale and (2) completed a Q-sorting task, assigning each item to the facet they believed it belonged to (raters were provided with all facet definitions). For an item to be retained, it had to meet two criteria: (1) at least 75% of raters had to assign the item to the correct facet; and (2) the standard deviation of the item extremity ratings had to be 1 or less. For each facet, 30 items were selected whose mean extremity ratings spanned the entire 7-point scale range uniformly.

2. Selection of Items for the 20-Item General Conscientiousness Scale

After selection of 180 items (30 items per facet), item response theory (IRT) analyses were conducted on a sample of 732 participants.
Data were collected via the Amazon Mechanical Turk crowdsourcing website. The generalized graded unfolding model (GGUM; Roberts, Donoghue, & Laughlin, 2000) was chosen as the unfolding IRT model due to its applicability to polytomous items. Each 30-item set for the six facets was analyzed separately using the GGUM2004 software program (Roberts, Fang, Cui, & Wang, 2006). For each facet, a 9-item scale was developed by selecting items that maximized model-data fit and score reliability across the trait continuum (details on facets are also available on the Unfolding Project website).

Next, the 9-item scales for Orderliness, Dutifulness, and Cautiousness were combined to form a scale representing the higher-order Orderliness factor, and the 9-item scales for Competence, Achievement Motivation, and Self-Discipline were combined to form a scale representing the higher-order Industriousness factor. These two 27-item combinations were analyzed separately, and 14 items were chosen from each that maximized model-data fit and score reliability across the trait continuum while maintaining nearly equal representation of each facet (details on the Orderliness and Industriousness scales are also available on the Unfolding Project website).

Finally, the two scales representing the DeYoung et al. (2007) factors were combined and analyzed together. Items were again selected to maximize model-data fit and score reliability while maintaining a balanced representation of the lower-order facets.

3. Item Response Theory Analysis of the UFFM-I Conscientiousness Scale

In this section, we present analyses of the UFFM-I Conscientiousness scale. The scale was analyzed with the full GGUM (Model 8) in the GGUM2004 program (Roberts, Fang, Cui, & Wang, 2006), which uses marginal maximum likelihood to estimate item parameters and expected a posteriori (EAP) estimation for person scores. Model-data fit was assessed using the MODFIT v2.0 program (Stark, 2007), which calculates the χ²/df ratio item fit statistic (adjusted to N = 3,000 to promote more accurate fit assessment; see Chernyshenko et al., 2001). Ratios less than 3 are considered indicative of good fit.
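The sample-size adjustment mentioned above can be illustrated with a short sketch. It assumes the form of the adjustment commonly given in the IRT fit literature, in which the part of χ² in excess of its degrees of freedom is rescaled from the observed N to a target N of 3,000. The function name, argument names, and the floor at zero are illustrative assumptions; this is not taken from MODFIT itself and may differ from what that program computes.

```python
def adjusted_chi2_df_ratio(chi2: float, df: int, n: int, n_target: int = 3000) -> float:
    """Return a chi-square/df ratio rescaled to a sample of size n_target.

    Assumed form: adjusted chi2 = n_target * (chi2 - df) / n + df.
    The floor at zero is an assumption made here so the ratio is never negative.
    """
    if df <= 0 or n <= 0:
        raise ValueError("df and n must be positive")
    adjusted = n_target * (chi2 - df) / n + df
    return max(adjusted, 0.0) / df


if __name__ == "__main__":
    # Hypothetical values for illustration only (not results from this report).
    print(adjusted_chi2_df_ratio(chi2=6.0, df=5, n=732))
```

Under this form, a χ² equal to its degrees of freedom yields a ratio of 1 at any sample size, and the adjustment leaves the ratio unchanged when the observed N already equals 3,000.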
Table 1 shows that all χ²/df ratios for item singles were below 3, indicating excellent model-data fit. Ratios for doubles and triples were slightly high, indicating the potential for a moderate degree of local dependence (i.e., items are interrelated for reasons other than conscientiousness), which is not surprising given the hierarchical nature of this general trait variable (i.e., due to the presence of lower-order facets).

Table 1. Frequency table for adjusted (N = 3,000) χ²/df ratios for the UFFM-I Conscientiousness scale.

            <1    1<2    2<3    3<4    4<5    5<7    >7    M    SD
Singles
Doubles
Triples

Graphical analyses comparing observed scores with the scores that would be expected (or predicted) by the estimated model confirmed the excellent model-data fit of the UFFM-I Conscientiousness scale (see Figure 1).

Figure 1. Model-data fit plot

The GGUM has two primary parameters for consideration: δ and α. The δ parameter is an estimate of how extreme an item is, and corresponds to the trait level, θ, of persons who would be likely to fully endorse the item. For example, a person who fully endorses Item 12, "I always go above and beyond what is expected," would be expected to be 3.18 SD above the mean. On the other hand, a person who fully endorses Item 16, "I wouldn't describe myself as messy or clean; my organization is average," would be expected to be around 0.13 SD below the mean.

Figure 2. Item characteristic curve

A truly unfolding scale would be expected to have approximately as many moderate items as extreme items, which would imply a curvilinear relationship between traditional sum scores and unfolding scores. The Test Characteristic Curve (TCC) relates unfolding scores (θ) to traditional Likert-type sum scores. As shown in Figure 3, the TCC clearly shows the expected curvilinear relationship between the two types of scores, with the inflection point of the curve occurring approximately at the mean (0) of unfolding scores.

Figure 3. Test characteristic curve

Finally, a crucial consideration in all IRT analyses is the Test Information Curve (TIC), which describes the reliability of measurement (called information in IRT) at different levels of the θ estimates (i.e., unfolding scores). One of the major advantages of a truly unfolding measure is that including a variety of items allows for more reliable measurement across the trait continuum than scales developed using traditional approaches. As shown in Figure 4, the TIC indicates that reliability of measurement is relatively constant across the distribution of unfolding scores. This means that at most levels of conscientiousness, the UFFM-I scale is similarly reliable.

Figure 4. Test information curve

To summarize, the UFFM-I scale for measuring general conscientiousness showed excellent model-data fit, truly unfolding properties, and high reliability across the trait continuum. Use of this scale is likely to be highly advantageous in applications where it is important to distinguish between persons whose conscientiousness is extremely high and those whose conscientiousness is moderately high.

4. Scoring the UFFM-I Conscientiousness Scale in Small Samples

A major goal of the Unfolding Project is to make the use of unfolding measures more accessible. This is accomplished by providing a scoring program for each scale developed. For this scale, the program is an SPSS macro titled UFFM-I CONSCORE, a specialized modification of GUMSCORE SPSS (Carter & LoPilato, 2014); both are available at the Applied Psychometric Laboratory website. The difference between these programs is that whereas GUMSCORE SPSS requires users to enter their own item parameters, the UFFM-I CONSCORE program has item parameters built in specifically for use with the 20-item UFFM-I Conscientiousness measure.
To run this program, the user need only:

   (1) Collect data using the full UFFM-I Conscientiousness scale with a 6-point scale (strongly agree to strongly disagree).
   (2) Download the UFFM-I CONSCORE folder directly to the C:\ drive.
   (3) Enter the data into the Resp_File.sav file, in the order shown in the file titled UFFM-I Conscientiousness (20 Items).pdf or in the order indicated by the Item # column shown in Appendix A (note that in Appendix A the items have been sorted by their item location, and data should not be entered in the order in which they are listed there).
   (4) Run the portion of the SPSS syntax below the first line (from the command DEFINE to the command !enddefine; this defines the macro titled !conscscore).
   (5) Change the parameter p on the first line to reflect the number of persons entered into the dataset (the default of this parameter is p=1).
   (6) Select and run the first line of the syntax (beginning with !conscscore).

The result will be an SPSS .sav file titled RESULTS_Theta_SE.sav, which will include unfolding scores, or Thetas, and their standard errors (SEs). This file will also include the response pattern for each person so that the user can verify that the data were read into the program correctly. The program scores respondents using EAP estimation with 30 quadrature points (the same approach used in GGUM2004) and the item parameters reported on the last page of this document. Therefore, respondents scored from different samples will be placed on the same scale as the calibration sample.

The UFFM-I measure and associated scoring program may be used for any not-for-profit research or practice. Any for-profit use requires the permission of the first author of this report.
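The EAP scoring described in this section can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the UFFM-I CONSCORE macro: it uses the GGUM category probability function of Roberts et al. (2000), 30 equally spaced quadrature points on [-4, 4] with a standard normal prior (the spacing, range, and prior are assumptions), and responses coded 0 (strongly disagree) to 5 (strongly agree), which is also an assumption about coding. The function names and the item parameters in the demo are hypothetical and are not the UFFM-I estimates.

```python
import math


def ggum_probs(theta, alpha, delta, taus):
    """GGUM category probabilities for one item.

    taus holds tau_1..tau_C; tau_0 = 0 is prepended here.
    Returns probabilities for observed categories 0..C.
    """
    c = len(taus)
    m = 2 * c + 1
    cum_tau = [0.0]
    for tau in taus:
        cum_tau.append(cum_tau[-1] + tau)
    d = theta - delta
    numer = [
        math.exp(alpha * (z * d - cum_tau[z]))
        + math.exp(alpha * ((m - z) * d - cum_tau[z]))
        for z in range(c + 1)
    ]
    total = sum(numer)
    return [value / total for value in numer]


def eap_score(responses, items, n_quad=30, lo=-4.0, hi=4.0):
    """Return (theta_hat, se) by EAP over an equally spaced grid.

    responses: category codes 0..C, or None for a missing response.
    items: list of (alpha, delta, taus) tuples, one per item.
    """
    step = (hi - lo) / (n_quad - 1)
    nodes = [lo + i * step for i in range(n_quad)]
    posterior = []
    for theta in nodes:
        log_like = 0.0
        for z, (alpha, delta, taus) in zip(responses, items):
            if z is None:
                continue  # skip missing responses
            log_like += math.log(ggum_probs(theta, alpha, delta, taus)[z])
        prior = math.exp(-0.5 * theta * theta)  # unnormalized N(0, 1) density
        posterior.append(math.exp(log_like) * prior)
    norm = sum(posterior)
    theta_hat = sum(t * p for t, p in zip(nodes, posterior)) / norm
    variance = sum((t - theta_hat) ** 2 * p for t, p in zip(nodes, posterior)) / norm
    return theta_hat, math.sqrt(variance)


if __name__ == "__main__":
    # Hypothetical parameters for three items (NOT the UFFM-I estimates).
    taus = [-1.2, -0.9, -0.6, -0.4, -0.2]
    items = [(1.0, -1.5, taus), (1.0, 0.0, taus), (1.0, 1.5, taus)]
    theta_hat, se = eap_score([1, 3, 5], items)
    print(f"theta = {theta_hat:.3f}, SE = {se:.3f}")
```

Because the scoring step treats item parameters as fixed, the same function can be applied to one respondent or many, which is why scores from different samples land on the calibration metric.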

REFERENCES

Carter, N. T., & LoPilato, A. (2014). GUMSCORE SPSS [software program]. University of Georgia: Athens, GA. Available at:

Chernyshenko, O. S., Stark, S., Chan, K. Y., Drasgow, F., & Williams, B. (2001). Fitting item response theory models to two personality inventories: Issues and insights. Multivariate Behavioral Research, 36.

Costa, P. T., & McCrae, R. R. (1995). Domains and facets: Hierarchical personality assessment using the revised NEO personality inventory. Journal of Personality Assessment, 64.

DeYoung, C. G., Quilty, L. C., & Peterson, J. B. (2007). Between facets and domains: 10 aspects of the Big Five. Journal of Personality and Social Psychology, 93.

Roberts, J. S., Donoghue, J. R., & Laughlin, J. E. (2000). A general item response theory model for unfolding unidimensional polytomous responses. Applied Psychological Measurement, 24.

Roberts, J. S., Fang, H., Cui, W., & Wang, Y. (2006). GGUM2004: A Windows-based program to estimate parameters in the generalized graded unfolding model. Applied Psychological Measurement, 30.

Stark, S. (2007). MODFIT version 2.0 [software program].

APPENDIX: Item Parameters and Test Content for the 20-Item UFFM-I Conscientiousness Scale

(Note: In applications using the UFFM-I CONSCORE program, data should be entered in the order indicated by Item #.)

ITEM #   ITEM CONTENT
1        I tend to do just enough work to get by
         I procrastinate a lot
         I do not keep my room clean
         My performance at work is always adequate, no more and no less
         I am good about getting things done on time but sometimes I do not manage my time well
         I let my room get kind of messy but I don't let it get out of control
         I wouldn't describe myself as messy or clean, my organization is average
         I would say my self-discipline is about the same as most people
         I prefer to be above average at things but don't have to be the very best
         I would say I understand things at a normal pace
         I usually excel in what I'm doing but occasionally I'll do mediocre at something
         I follow the rules about as much as most people
7        I would say I am more disciplined than most, but there are a lot of people with better self-discipline than me
         I love to win, but I am not a sore loser
20       I always respect authority, even if I disagree with them
         If there is a problem, I can usually solve it
         I would never jump into doing something without thinking about it
         I am very well organized
         I always follow through with my plans
         I always go above and beyond what is expected


The Big Five factors in personality. A primer. by Paul O Olson, MBA, Mc Psychology Introduction This paper explains the Big Five or Five Factor Model of personality and its application in the IEF Cultural survey. This model is state of the art of personality

More information

Assessing the Validity and Reliability of the Teacher Keys Effectiveness. System (TKES) and the Leader Keys Effectiveness System (LKES)

Assessing the Validity and Reliability of the Teacher Keys Effectiveness. System (TKES) and the Leader Keys Effectiveness System (LKES) Assessing the Validity and Reliability of the Teacher Keys Effectiveness System (TKES) and the Leader Keys Effectiveness System (LKES) of the Georgia Department of Education Submitted by The Georgia Center

More information

Measurement Error 2: Scale Construction (Very Brief Overview) Page 1

Measurement Error 2: Scale Construction (Very Brief Overview) Page 1 Measurement Error 2: Scale Construction (Very Brief Overview) Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 22, 2015 This handout draws heavily from Marija

More information

IJPSS Volume 2, Issue 7 ISSN:

IJPSS Volume 2, Issue 7 ISSN: Teachers` as a Leader and their Traits: Evidence from secondary level Dr.Tahseen Mehmood Aslam* Zulfiqar Ali** Ijaz Ahmad Tatlah*** Dr Muhammad Iqbal**** _ Abstract Teacher must act as a leader because

More information

2 Psychological Processes : An Introduction

2 Psychological Processes : An Introduction 2 Psychological Processes : An Introduction 2.1 Introduction In our everyday life we try to achieve various goals through different activities, receive information from our environment, learn about many

More information

A Simulation Study on Methods of Correcting for the Effects of Extreme Response Style

A Simulation Study on Methods of Correcting for the Effects of Extreme Response Style Article Erschienen in: Educational and Psychological Measurement ; 76 (2016), 2. - S. 304-324 https://dx.doi.org/10.1177/0013164415591848 A Simulation Study on Methods of Correcting for the Effects of

More information

Item Response Theory for Polytomous Items Rachael Smyth

Item Response Theory for Polytomous Items Rachael Smyth Item Response Theory for Polytomous Items Rachael Smyth Introduction This lab discusses the use of Item Response Theory (or IRT) for polytomous items. Item response theory focuses specifically on the items

More information

INTRODUCTION TO ASSESSMENT OPTIONS

INTRODUCTION TO ASSESSMENT OPTIONS ASTHMA IMPACT A brief guide to the PROMIS Asthma Impact instruments: PEDIATRIC PROMIS Pediatric Item Bank v2.0 Asthma Impact PROMIS Pediatric Item Bank v1.0 Asthma Impact* PROMIS Pediatric Short Form v2.0

More information

Behavior Rating Inventory of Executive Function BRIEF. Interpretive Report. Developed by SAMPLE

Behavior Rating Inventory of Executive Function BRIEF. Interpretive Report. Developed by SAMPLE Behavior Rating Inventory of Executive Function BRIEF Interpretive Report Developed by Peter K. Isquith, PhD, Gerard A. Gioia, PhD, and PAR Staff Client Information Client Name : Sample Client Client ID

More information

Extraversion. The Extraversion factor reliability is 0.90 and the trait scale reliabilities range from 0.70 to 0.81.

Extraversion. The Extraversion factor reliability is 0.90 and the trait scale reliabilities range from 0.70 to 0.81. MSP RESEARCH NOTE B5PQ Reliability and Validity This research note describes the reliability and validity of the B5PQ. Evidence for the reliability and validity of is presented against some of the key

More information

Bruno D. Zumbo, Ph.D. University of Northern British Columbia

Bruno D. Zumbo, Ph.D. University of Northern British Columbia Bruno Zumbo 1 The Effect of DIF and Impact on Classical Test Statistics: Undetected DIF and Impact, and the Reliability and Interpretability of Scores from a Language Proficiency Test Bruno D. Zumbo, Ph.D.

More information

Oak Meadow Autonomy Survey

Oak Meadow Autonomy Survey Oak Meadow Autonomy Survey Patricia M. Meehan, Ph.D. August 7, 214 1 Contents Contents 3 List of Figures 3 List of Tables 3 1 Introduction 4 2 Data 4 3 Determining the Number of Factors 5 4 Proposed Model

More information

An Item Response Theory Integration of Normal and Abnormal Personality Scales

An Item Response Theory Integration of Normal and Abnormal Personality Scales Personality Disorders: Theory, Research, and Treatment 2010 American Psychological Association 2010, Vol. 1, No. 1, 5 21 1949-2715/10/$12.00 DOI: 10.1037/a0018136 An Item Response Theory Integration of

More information

Psychometric Toolbox USER S GUIDE

Psychometric Toolbox USER S GUIDE 1 PT Psychometric Toolbox USER S GUIDE Prepared by: Pere J. Ferrando David Navarro-González Urbano Lorenzo-Seva Please reference this document as: Ferrando, P.J., Navarro-González, D. & Lorenzo-Seva, U.

More information

Everything DiSC 363 for Leaders. Research Report. by Inscape Publishing

Everything DiSC 363 for Leaders. Research Report. by Inscape Publishing Everything DiSC 363 for Leaders Research Report by Inscape Publishing Introduction Everything DiSC 363 for Leaders is a multi-rater assessment and profile that is designed to give participants feedback

More information

Glasgow, 1055 Great Western Road, Glasgow G12 0XH, UK

Glasgow, 1055 Great Western Road, Glasgow G12 0XH, UK Title and running title: Conscientiousness predicts diurnal preference Alexandra L. Hogben* 1 Jason Ellis* 2 Simon N. Archer 1 Malcolm von Schantz 1 * These authors contributed equally to this work 1 Surrey

More information

Paul Irwing, Manchester Business School

Paul Irwing, Manchester Business School Paul Irwing, Manchester Business School Factor analysis has been the prime statistical technique for the development of structural theories in social science, such as the hierarchical factor model of human

More information

Computerized Mastery Testing

Computerized Mastery Testing Computerized Mastery Testing With Nonequivalent Testlets Kathleen Sheehan and Charles Lewis Educational Testing Service A procedure for determining the effect of testlet nonequivalence on the operating

More information

Multiple Act criterion:

Multiple Act criterion: Common Features of Trait Theories Generality and Stability of Traits: Trait theorists all use consistencies in an individual s behavior and explain why persons respond in different ways to the same stimulus

More information

Intro to SPSS. Using SPSS through WebFAS

Intro to SPSS. Using SPSS through WebFAS Intro to SPSS Using SPSS through WebFAS http://www.yorku.ca/computing/students/labs/webfas/ Try it early (make sure it works from your computer) If you need help contact UIT Client Services Voice: 416-736-5800

More information

ABOUT PHYSICAL ACTIVITY

ABOUT PHYSICAL ACTIVITY PHYSICAL ACTIVITY A brief guide to the PROMIS Physical Activity instruments: PEDIATRIC PROMIS Pediatric Item Bank v1.0 Physical Activity PROMIS Pediatric Short Form v1.0 Physical Activity 4a PROMIS Pediatric

More information

Foreword. Did you know that developing your strengths those things you re good at and actually

Foreword. Did you know that developing your strengths those things you re good at and actually F Foreword Did you know that developing your strengths those things you re good at and actually enjoy doing makes it eighteen times more likely you ll describe yourself as flourishing at work? Given most

More information

Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1. John M. Clark III. Pearson. Author Note

Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1. John M. Clark III. Pearson. Author Note Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1 Nested Factor Analytic Model Comparison as a Means to Detect Aberrant Response Patterns John M. Clark III Pearson Author Note John M. Clark III,

More information

Product Interest and Engagement Scale, Beta (PIES-beta): Initial Development

Product Interest and Engagement Scale, Beta (PIES-beta): Initial Development Product Interest and Engagement Scale, Beta (PIES-beta): Initial Development Christopher N. Chapman Microsoft Corporation Microsoft Way (cchap) Redmond, WA 98052 USA chris.chapman@microsoft.com Michal

More information

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE Spring 20 11, Volume 1, Issue 1 THE STATSWHISPERER The StatsWhisperer Newsletter is published by staff at StatsWhisperer. Visit us at: www.statswhisperer.com Introduction to this Issue The current issue

More information

Development of the Web Users Self Efficacy scale (WUSE)

Development of the Web Users Self Efficacy scale (WUSE) Development of the Web Users Self Efficacy scale (WUSE) Eachus, P and Cassidy, SF Title Authors Type URL Published Date 2004 Development of the Web Users Self Efficacy scale (WUSE) Eachus, P and Cassidy,

More information

2-Group Multivariate Research & Analyses

2-Group Multivariate Research & Analyses 2-Group Multivariate Research & Analyses Research Designs Research hypotheses Outcome & Research Hypotheses Outcomes & Truth Significance Tests & Effect Sizes Multivariate designs Increased effects Increased

More information

The Youth Experience Survey 2.0: Instrument Revisions and Validity Testing* David M. Hansen 1 University of Illinois, Urbana-Champaign

The Youth Experience Survey 2.0: Instrument Revisions and Validity Testing* David M. Hansen 1 University of Illinois, Urbana-Champaign The Youth Experience Survey 2.0: Instrument Revisions and Validity Testing* David M. Hansen 1 University of Illinois, Urbana-Champaign Reed Larson 2 University of Illinois, Urbana-Champaign February 28,

More information

Chapter 8: Regression

Chapter 8: Regression Chapter 8: Regression Labcoat Leni s Real Research I want to be loved (on Facebook) Problem Ong, E. Y. L., et al. (2011). Personality and Individual Differences, 50(2), 180 185. Social media websites such

More information

Structural Equation Modeling (SEM)

Structural Equation Modeling (SEM) Structural Equation Modeling (SEM) Today s topics The Big Picture of SEM What to do (and what NOT to do) when SEM breaks for you Single indicator (ASU) models Parceling indicators Using single factor scores

More information

Evaluating the quality of analytic ratings with Mokken scaling

Evaluating the quality of analytic ratings with Mokken scaling Psychological Test and Assessment Modeling, Volume 57, 2015 (3), 423-444 Evaluating the quality of analytic ratings with Mokken scaling Stefanie A. Wind 1 Abstract Greatly influenced by the work of Rasch

More information

Moderators of the Tailored Adaptive Personality Assessment System Validity

Moderators of the Tailored Adaptive Personality Assessment System Validity Technical Report 1357 Moderators of the Tailored Adaptive Personality Assessment System Validity Stephen Stark, Oleksandr S. Chernyshenko, Christopher D. Nye, Fritz Drasgow Drasgow Consulting Group Leonard

More information

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison Empowered by Psychometrics The Fundamentals of Psychometrics Jim Wollack University of Wisconsin Madison Psycho-what? Psychometrics is the field of study concerned with the measurement of mental and psychological

More information

Detecting Aberrant Responding on Unidimensional Pairwise Preference Tests: An Application of based on the Zinnes Griggs Ideal Point IRT Model

Detecting Aberrant Responding on Unidimensional Pairwise Preference Tests: An Application of based on the Zinnes Griggs Ideal Point IRT Model University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School January 2013 Detecting Aberrant Responding on Unidimensional Pairwise Preference Tests: An Application of based

More information

Scaling TOWES and Linking to IALS

Scaling TOWES and Linking to IALS Scaling TOWES and Linking to IALS Kentaro Yamamoto and Irwin Kirsch March, 2002 In 2000, the Organization for Economic Cooperation and Development (OECD) along with Statistics Canada released Literacy

More information

Chapter 9. Youth Counseling Impact Scale (YCIS)

Chapter 9. Youth Counseling Impact Scale (YCIS) Chapter 9 Youth Counseling Impact Scale (YCIS) Background Purpose The Youth Counseling Impact Scale (YCIS) is a measure of perceived effectiveness of a specific counseling session. In general, measures

More information

The Difference Analysis between Demographic Variables and Personal Attributes The Case of Internal Auditors in Taiwan

The Difference Analysis between Demographic Variables and Personal Attributes The Case of Internal Auditors in Taiwan The Difference Analysis between Demographic Variables and Personal Attributes The Case of Internal Auditors in Taiwan Li-Jia Chiu and Neng-Tang Norman Huang Department of Technology Application and Human

More information

Sawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc.

Sawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc. Sawtooth Software RESEARCH PAPER SERIES MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB Bryan Orme, Sawtooth Software, Inc. Copyright 009, Sawtooth Software, Inc. 530 W. Fir St. Sequim,

More information

Personal Style Inventory Item Revision: Confirmatory Factor Analysis

Personal Style Inventory Item Revision: Confirmatory Factor Analysis Personal Style Inventory Item Revision: Confirmatory Factor Analysis This research was a team effort of Enzo Valenzi and myself. I m deeply grateful to Enzo for his years of statistical contributions to

More information

Family Expectations, Self-Esteem, and Academic Achievement among African American College Students

Family Expectations, Self-Esteem, and Academic Achievement among African American College Students Family Expectations, Self-Esteem, and Academic Achievement among African American College Students Mia Bonner Millersville University Abstract Previous research (Elion, Slaney, Wang and French, 2012) found

More information

Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys

Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys Jill F. Kilanowski, PhD, APRN,CPNP Associate Professor Alpha Zeta & Mu Chi Acknowledgements Dr. Li Lin,

More information

Personality Traits and Life Satisfaction among Working Men and Women

Personality Traits and Life Satisfaction among Working Men and Women Personality Traits and Life Satisfaction among Working Men and Women Azra Khanam Education University Lahore Campus, Pakistan Abstract The present research investigated personality traits and life satisfaction

More information

Incorporating Measurement Nonequivalence in a Cross-Study Latent Growth Curve Analysis

Incorporating Measurement Nonequivalence in a Cross-Study Latent Growth Curve Analysis Structural Equation Modeling, 15:676 704, 2008 Copyright Taylor & Francis Group, LLC ISSN: 1070-5511 print/1532-8007 online DOI: 10.1080/10705510802339080 TEACHER S CORNER Incorporating Measurement Nonequivalence

More information