how good is the Instrument? Dr Dean McKenzie

Size: px
Start display at page:

Download "how good is the Instrument? Dr Dean McKenzie"

Transcription

1 how good is the Instrument? Dr Dean McKenzie BA(Hons) (Psychology) PhD (Psych Epidemiology) Senior Research Fellow (Abridged Version) Full version to be presented July

2 Goals To briefly summarize the basic tools and concepts needed to answer the question Is this instrument any good? Mainly Psych instrument examples (my background) but also applies to instruments from other health areas 2

3 Psycho-Statistical Inventions Cohen s effect size / kappa / power analysis Cronbach s alpha (internal consistency) factor analysis (1904) item response theory, Rasch modelling, partial credit Likert scale, semantic differential McNemar s test Spearman correlation Test reliability Types of data: nominal, ordinal, interval, ratio (Stanley Smith Stephens, Science 1946, 103: ) (list partly based on Cohen et al (2013) 3

4 numbers into Information F First female member of the Royal Statistical Society A coxcomb or polar area diagram Rehmeyer J (2008) Florence Nightingale: the passionate statistician. ScienceNews, 26 November

5 A typical Instrument, the Kessler Psychological Distress Scale (K10) Developed in US by Ronald Kessler et al, tested in Australia, employed in Victorian Population Health Survey, Australian National Health Surveys etc 10 questions such as in the past 4 weeks, about how often did you feel worthless and in the past 4 weeks, about how often did you feel nervous Similar in scope to David Goldberg s GHQ (General Health Questionnaire, 1972) but K10 is public domain (GHQ licensed by NFER-Nelson) Andrews G, Slade T (2001) Australian and New Zealand Journal of Public Health, 25, 6:

6 Determining instrument/test Quality Is this instrument any good? Reliability: Does it give consistent results, across different raters / situations? Validity: Does it measure what it is supposed to? Does instrument measure what it is supposed to, consistently and well, in the population(s) that it is applied to, here & now 6

7 Test Reliability = Repeatability or consistency Test-retest: Should achieve same scores, each time (if testing characteristics that are stable across time, such as intelligence, personality, rather than less stable characteristics, such as mood, pain levels but even then, stability may be an unrealistic assumption, Cohen et al 2013) Inter-rater: Different interviewers / raters should reach same conclusions / score levels 7

8 Measuring Test Reliability Measured by intraclass* correlation, should be 0.70 and above (1.0 is maximum) (e.g. if rater A s scores are 1,3,5 and rater B s scores are 2,4,6 (Karl) Pearson correlation would be 1.0 (perfect), but the raters never actually assign same score!) intraclass correlation often used for reliability & (Jacob) Cohen s kappa* chance-corrected for binary/categorical measures (*please see Streiner & Norman 2008 for more details) 8

9 Reliability? Internal consistency? Split half: correlation between first half of test/instrument and second half, or between odd numbered and even numbered items, should be 0.70 or above (Lee) Cronbach s alpha: equivalent to the average of all possible split halves of instrument, should be 0.70 or above Cronbach s alpha size is affected by number of items, and although everyone reports it, may not be sure what it actually means! 9

10 Internal consistency: Alpha Virtually every published measure will have adequate to good internal consistency (alpha) Miles J & Gilbert P (2005) A handbook of research methods for clinical & health psychology. Oxford. p Choosing a test merely on basis of Cronbach s alpha is a bit like choosing a restaurant on basis of whether the chefs wash their hands (can only hope that they all do so!) 10

11 Measuring Test Validity Appropriateness, meaningfulness, usefulness Face validity: extent to which test appears (at face value) to measure what it is intended to be measuring Content validity: extent to which all items reflect the characteristic:, e.g. depression Criterion validity: extent to which items correlate with a criterion (e.g. depression) the instrument supposedly measures (concurrent validity) or predict future scores / performance / diagnosis (predictive validity) Construct validity: concepts such as quality of life, intelligence, depression, cannot be measured directly, but measures of depression should correlate with each other (convergent validity) 11

12 Reliability/Validity: how high? In general, correlations for validity and reliability should be 0.70 or above, but validity and reliability values may not be provided in papers using an instrument, reliability and validity values are generally provided in papers defining the instrument (perhaps also original scoring manual) or testing the instrument (e.g., with younger or older adults, non-english speaking background etc) 12

13 Correlation Rule of thumb: correlation 0.8 to 1.0 very strong 0.6 to 0.8 strong 0.4 to 0.6 moderate Salkind (2013) Square of the correlation is the % of variation on one variable/measure/rater, accounted for by variation in other variable/measure/rater, so correlation of 0.7 = ~50% or half of the variation explained (variation remaining is known as unexplained, or the coefficient of alienation ) Reliability and validity should just report the size of the correlation (which should be 0.70 or above), not the statistical significance p <0.05 (with large enough samples, even trivial correlations can be significantly different from zero) 13

14 Quality checklist 1 Why use this particular instrument or test? 2 Are there are published guidelines for the use of this test? 3 Is this instrument reliable? (0.70 or above, recent, Australia, subpopulations?) 4 Is this instrument valid? (0.70 or above, recent, Australia, subpopulations?) 5 Is this instrument cost-effective (e.g. performance vs length)? 6 What inferences/conclusions may reasonably be made from this test, and how generalizable are the findings? (e.g. older adults, hospital patients, military / veterans?) Cohen et al (2013), pp

15 Guide to (psych) Tests (charge for use, unless accessed through library) Corcoran, K., Fischer, J. (2013). Measures for clinical practice and research. 5 th ed. Volume 2: Adults. Oxford. e.g pages of measures of contentment, hardiness, loneliness, mindfulness, pain (e.g. 1 to 10 scale, sad face scale, PIQ-6) etc etc (reports Cronbach s alpha internal consistency, but remember that this is often used, but not very meaningful and is a function of how long the instrument is) 15

16 Supplement tests with open-ended questions? Since the early days of computers, openended questions have been regarded as problematic, although may provide extra information and insight compared with conventional instruments e.g. mixed method quantitative qualitative (e.g. Nvivo/QDA Miner/atlas.ti) or content analytic (e.g. Leximancer/Wordstat/LIWC) approaches Krippendorff K (2013) Content analysis. 3 rd ed. Sage. Richards L (2009) Handling qualitative data: a practical guide. 2 nd ed. Sage. Teddlie C, Tashakkori A (2009) Foundations of mixed methods. Sage. 16

17 Be sensitive, but specific Cut-points are generally chosen so as to maintain a balance between sensitivity and specificity Sensitivity = probability of being identified as positive by the screening test if person actually has the disease / diagnosis Specificity = probability of being identified as negative by the screening test if person doesn t have the disorder / diagnosis (function of disorder prevalence, but values of 0.70 or higher preferred) Positive predictive value = probability of person with positive test result actually having the disorder Negative predictive value = probability of person with negative test result not actually having the disorder Harris M, Taylor G (2008) Medical statistics made easy. 2 nd ed. Scion. 17

18 Future directions? Techniques for analysis of instruments constantly being developed but design of instruments themselves hasn t changed much in 50 years or more, i.e. pen and paper or simple computerized versions thereof Lots of scope for new instruments / new developments Bartram D, Hambleton RK (2006) Computer-based testing and the internet: issues and advances. Wiley. Wagner-Menghin, Masters GN (2013) Adaptive testing for psychological assessment. Journal of Applied Measurement, 14,

19 Conclusion Assess the reliability (consistency) and validity (appropriateness, meaningfulness) Who has it been applied to, further reading as required And in terms of the empirical evidence as well as your clinical judgement, is the instrument any good? Is it the right horse for the course? 19

20 Further / Future Reading Cohen RJ, Swerdlik ME, Sturman ED (2013) Psychological testing and assessment: an introduction to tests and measurement. 8 th ed. McGraw-Hill. Garson GD (2013) Validity and reliability. (Free ebook from or $US5 in Kindle format from Harvey J, Taylor V (2013) Measuring health and wellbeing. Sage. Salkind N (2013) Tests & measurement for people who (think they) hate tests & measurement. 2 nd ed. Sage. Streiner DL, Norman GR (2008) Health measurement scales: a practical guide to their development and use. 4 th ed. Oxford. 20

Reliability and Validity checks S-005

Reliability and Validity checks S-005 Reliability and Validity checks S-005 Checking on reliability of the data we collect Compare over time (test-retest) Item analysis Internal consistency Inter-rater agreement Compare over time Test-Retest

More information

alternate-form reliability The degree to which two or more versions of the same test correlate with one another. In clinical studies in which a given function is going to be tested more than once over

More information

DATA is derived either through. Self-Report Observation Measurement

DATA is derived either through. Self-Report Observation Measurement Data Management DATA is derived either through Self-Report Observation Measurement QUESTION ANSWER DATA DATA may be from Structured or Unstructured questions? Quantitative or Qualitative? Numerical or

More information

ADMS Sampling Technique and Survey Studies

ADMS Sampling Technique and Survey Studies Principles of Measurement Measurement As a way of understanding, evaluating, and differentiating characteristics Provides a mechanism to achieve precision in this understanding, the extent or quality As

More information

11-3. Learning Objectives

11-3. Learning Objectives 11-1 Measurement Learning Objectives 11-3 Understand... The distinction between measuring objects, properties, and indicants of properties. The similarities and differences between the four scale types

More information

PTHP 7101 Research 1 Chapter Assignments

PTHP 7101 Research 1 Chapter Assignments PTHP 7101 Research 1 Chapter Assignments INSTRUCTIONS: Go over the questions/pointers pertaining to the chapters and turn in a hard copy of your answers at the beginning of class (on the day that it is

More information

N Utilization of Nursing Research in Advanced Practice, Summer 2008

N Utilization of Nursing Research in Advanced Practice, Summer 2008 University of Michigan Deep Blue deepblue.lib.umich.edu 2008-07 N 536 - Utilization of Nursing Research in Advanced Practice, Summer 2008 Tzeng, Huey-Ming Tzeng, H. (2008, October 1). Utilization of Nursing

More information

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971)

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971) Ch. 5: Validity Validity History Griggs v. Duke Power Ricci vs. DeStefano Defining Validity Aspects of Validity Face Validity Content Validity Criterion Validity Construct Validity Reliability vs. Validity

More information

Closed Coding. Analyzing Qualitative Data VIS17. Melanie Tory

Closed Coding. Analyzing Qualitative Data VIS17. Melanie Tory Closed Coding Analyzing Qualitative Data Tutorial @ VIS17 Melanie Tory A code in qualitative inquiry is most often a word or short phrase that symbolically assigns a summative, salient, essence capturing,

More information

Overview of Experimentation

Overview of Experimentation The Basics of Experimentation Overview of Experiments. IVs & DVs. Operational Definitions. Reliability. Validity. Internal vs. External Validity. Classic Threats to Internal Validity. Lab: FP Overview;

More information

Validity refers to the accuracy of a measure. A measurement is valid when it measures what it is suppose to measure and performs the functions that

Validity refers to the accuracy of a measure. A measurement is valid when it measures what it is suppose to measure and performs the functions that Validity refers to the accuracy of a measure. A measurement is valid when it measures what it is suppose to measure and performs the functions that it purports to perform. Does an indicator accurately

More information

Reliability & Validity Dr. Sudip Chaudhuri

Reliability & Validity Dr. Sudip Chaudhuri Reliability & Validity Dr. Sudip Chaudhuri M. Sc., M. Tech., Ph.D., M. Ed. Assistant Professor, G.C.B.T. College, Habra, India, Honorary Researcher, Saha Institute of Nuclear Physics, Life Member, Indian

More information

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971)

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971) Ch. 5: Validity Validity History Griggs v. Duke Power Ricci vs. DeStefano Defining Validity Aspects of Validity Face Validity Content Validity Criterion Validity Construct Validity Reliability vs. Validity

More information

Making a psychometric. Dr Benjamin Cowan- Lecture 9

Making a psychometric. Dr Benjamin Cowan- Lecture 9 Making a psychometric Dr Benjamin Cowan- Lecture 9 What this lecture will cover What is a questionnaire? Development of questionnaires Item development Scale options Scale reliability & validity Factor

More information

Reliability AND Validity. Fact checking your instrument

Reliability AND Validity. Fact checking your instrument Reliability AND Validity Fact checking your instrument General Principles Clearly Identify the Construct of Interest Use Multiple Items Use One or More Reverse Scored Items Use a Consistent Response Format

More information

Lecture Week 3 Quality of Measurement Instruments; Introduction SPSS

Lecture Week 3 Quality of Measurement Instruments; Introduction SPSS Lecture Week 3 Quality of Measurement Instruments; Introduction SPSS Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit Overview Quality of Measurement Instruments Introduction SPSS Read:

More information

Saville Consulting Wave Professional Styles Handbook

Saville Consulting Wave Professional Styles Handbook Saville Consulting Wave Professional Styles Handbook PART 4: TECHNICAL Chapter 19: Reliability This manual has been generated electronically. Saville Consulting do not guarantee that it has not been changed

More information

The Concept of Validity

The Concept of Validity The Concept of Validity Dr Wan Nor Arifin Unit of Biostatistics and Research Methodology, Universiti Sains Malaysia. wnarifin@usm.my Wan Nor Arifin, 2017. The Concept of Validity by Wan Nor Arifin is licensed

More information

CHAPTER VI RESEARCH METHODOLOGY

CHAPTER VI RESEARCH METHODOLOGY CHAPTER VI RESEARCH METHODOLOGY 6.1 Research Design Research is an organized, systematic, data based, critical, objective, scientific inquiry or investigation into a specific problem, undertaken with the

More information

26:010:557 / 26:620:557 Social Science Research Methods

26:010:557 / 26:620:557 Social Science Research Methods 26:010:557 / 26:620:557 Social Science Research Methods Dr. Peter R. Gillett Associate Professor Department of Accounting & Information Systems Rutgers Business School Newark & New Brunswick 1 Overview

More information

PÄIVI KARHU THE THEORY OF MEASUREMENT

PÄIVI KARHU THE THEORY OF MEASUREMENT PÄIVI KARHU THE THEORY OF MEASUREMENT AGENDA 1. Quality of Measurement a) Validity Definition and Types of validity Assessment of validity Threats of Validity b) Reliability True Score Theory Definition

More information

HPS301 Exam Notes- Contents

HPS301 Exam Notes- Contents HPS301 Exam Notes- Contents Week 1 Research Design: What characterises different approaches 1 Experimental Design 1 Key Features 1 Criteria for establishing causality 2 Validity Internal Validity 2 Threats

More information

Variable Measurement, Norms & Differences

Variable Measurement, Norms & Differences Variable Measurement, Norms & Differences 1 Expectations Begins with hypothesis (general concept) or question Create specific, testable prediction Prediction can specify relation or group differences Different

More information

Validity and reliability of measurements

Validity and reliability of measurements Validity and reliability of measurements 2 Validity and reliability of measurements 4 5 Components in a dataset Why bother (examples from research) What is reliability? What is validity? How should I treat

More information

Doctoral Dissertation Boot Camp Quantitative Methods Kamiar Kouzekanani, PhD January 27, The Scientific Method of Problem Solving

Doctoral Dissertation Boot Camp Quantitative Methods Kamiar Kouzekanani, PhD January 27, The Scientific Method of Problem Solving Doctoral Dissertation Boot Camp Quantitative Methods Kamiar Kouzekanani, PhD January 27, 2018 The Scientific Method of Problem Solving The conceptual phase Reviewing the literature, stating the problem,

More information

Research Questions and Survey Development

Research Questions and Survey Development Research Questions and Survey Development R. Eric Heidel, PhD Associate Professor of Biostatistics Department of Surgery University of Tennessee Graduate School of Medicine Research Questions 1 Research

More information

EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS

EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS DePaul University INTRODUCTION TO ITEM ANALYSIS: EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS Ivan Hernandez, PhD OVERVIEW What is Item Analysis? Overview Benefits of Item Analysis Applications Main

More information

Factor structure and psychometric properties of the General Health Questionnaire. (GHQ-12) among Ghanaian adolescents

Factor structure and psychometric properties of the General Health Questionnaire. (GHQ-12) among Ghanaian adolescents Factor structure and psychometric properties of the General Health Questionnaire (GHQ-12) among Ghanaian adolescents Franklin N Glozah 1* and David J Pevalin 1 1 School of Health and Human Sciences, University

More information

Examining the Psychometric Properties of The McQuaig Occupational Test

Examining the Psychometric Properties of The McQuaig Occupational Test Examining the Psychometric Properties of The McQuaig Occupational Test Prepared for: The McQuaig Institute of Executive Development Ltd., Toronto, Canada Prepared by: Henryk Krajewski, Ph.D., Senior Consultant,

More information

VARIABLES AND MEASUREMENT

VARIABLES AND MEASUREMENT ARTHUR SYC 204 (EXERIMENTAL SYCHOLOGY) 16A LECTURE NOTES [01/29/16] VARIABLES AND MEASUREMENT AGE 1 Topic #3 VARIABLES AND MEASUREMENT VARIABLES Some definitions of variables include the following: 1.

More information

Importance of Good Measurement

Importance of Good Measurement Importance of Good Measurement Technical Adequacy of Assessments: Validity and Reliability Dr. K. A. Korb University of Jos The conclusions in a study are only as good as the data that is collected. The

More information

Reliability and Validity

Reliability and Validity Reliability and Today s Objectives Understand the difference between reliability and validity Understand how to develop valid indicators of a concept Reliability and Reliability How accurate or consistent

More information

Establishing Interrater Agreement for Scoring Criterion-based Assessments: Application for the MEES

Establishing Interrater Agreement for Scoring Criterion-based Assessments: Application for the MEES Establishing Interrater Agreement for Scoring Criterion-based Assessments: Application for the MEES Mark C. Hogrebe Washington University in St. Louis MACTE Fall 2018 Conference October 23, 2018 Camden

More information

THE USE OF CRONBACH ALPHA RELIABILITY ESTIMATE IN RESEARCH AMONG STUDENTS IN PUBLIC UNIVERSITIES IN GHANA.

THE USE OF CRONBACH ALPHA RELIABILITY ESTIMATE IN RESEARCH AMONG STUDENTS IN PUBLIC UNIVERSITIES IN GHANA. Africa Journal of Teacher Education ISSN 1916-7822. A Journal of Spread Corporation Vol. 6 No. 1 2017 Pages 56-64 THE USE OF CRONBACH ALPHA RELIABILITY ESTIMATE IN RESEARCH AMONG STUDENTS IN PUBLIC UNIVERSITIES

More information

Statistics for Psychosocial Research Session 1: September 1 Bill

Statistics for Psychosocial Research Session 1: September 1 Bill Statistics for Psychosocial Research Session 1: September 1 Bill Introduction to Staff Purpose of the Course Administration Introduction to Test Theory Statistics for Psychosocial Research Overview: a)

More information

Quality of Life Assessment of Growth Hormone Deficiency in Adults (QoL-AGHDA)

Quality of Life Assessment of Growth Hormone Deficiency in Adults (QoL-AGHDA) Quality of Life Assessment of Growth Hormone Deficiency in Adults (QoL-AGHDA) Guidelines for Users May 2016 B1 Chorlton Mill, 3 Cambridge Street, Manchester, M1 5BY, UK Tel: +44 (0)161 226 4446 Fax: +44

More information

DATA GATHERING. Define : Is a process of collecting data from sample, so as for testing & analyzing before reporting research findings.

DATA GATHERING. Define : Is a process of collecting data from sample, so as for testing & analyzing before reporting research findings. DATA GATHERING Define : Is a process of collecting data from sample, so as for testing & analyzing before reporting research findings. 2012 John Wiley & Sons Ltd. Measurement Measurement: the assignment

More information

The study of relationship between optimism (positive thinking) and mental health (Case study: students of Islamic Azad University of Bandar Abbas)

The study of relationship between optimism (positive thinking) and mental health (Case study: students of Islamic Azad University of Bandar Abbas) ORIGINAL ARTICLE Received 8 April. 2017 Accepted 24 Aguste. 2017 Vol. 6, Issue 4, 98-106, 2017 Academic Journal of Psychological Studies ISSN: 2333-0821 ajps.worldofresearches.com The study of relationship

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 381 Exam #2 E Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) An approach to qualitative research that focuses on building theories is known

More information

and Screening Methodological Quality (Part 2: Data Collection, Interventions, Analysis, Results, and Conclusions A Reader s Guide

and Screening Methodological Quality (Part 2: Data Collection, Interventions, Analysis, Results, and Conclusions A Reader s Guide 03-Fink Research.qxd 11/1/2004 10:57 AM Page 103 3 Searching and Screening Methodological Quality (Part 2: Data Collection, Interventions, Analysis, Results, and Conclusions A Reader s Guide Purpose of

More information

Gezinskenmerken: De constructie van de Vragenlijst Gezinskenmerken (VGK) Klijn, W.J.L.

Gezinskenmerken: De constructie van de Vragenlijst Gezinskenmerken (VGK) Klijn, W.J.L. UvA-DARE (Digital Academic Repository) Gezinskenmerken: De constructie van de Vragenlijst Gezinskenmerken (VGK) Klijn, W.J.L. Link to publication Citation for published version (APA): Klijn, W. J. L. (2013).

More information

Construct Reliability and Validity Update Report

Construct Reliability and Validity Update Report Assessments 24x7 LLC DISC Assessment 2013 2014 Construct Reliability and Validity Update Report Executive Summary We provide this document as a tool for end-users of the Assessments 24x7 LLC (A24x7) Online

More information

Collecting & Making Sense of

Collecting & Making Sense of Collecting & Making Sense of Quantitative Data Deborah Eldredge, PhD, RN Director, Quality, Research & Magnet Recognition i Oregon Health & Science University Margo A. Halm, RN, PhD, ACNS-BC, FAHA Director,

More information

Measurement is the process of observing and recording the observations. Two important issues:

Measurement is the process of observing and recording the observations. Two important issues: Farzad Eskandanian Measurement is the process of observing and recording the observations. Two important issues: 1. Understanding the fundamental ideas: Levels of measurement: nominal, ordinal, interval

More information

Interrater and Intrarater Reliability of the Assisting Hand Assessment

Interrater and Intrarater Reliability of the Assisting Hand Assessment Interrater and Intrarater Reliability of the Assisting Hand Assessment Marie Holmefur, Lena Krumlinde-Sundholm, Ann-Christin Eliasson KEY WORDS hand pediatric reliability OBJECTIVE. The aim of this study

More information

LANGUAGE TEST RELIABILITY On defining reliability Sources of unreliability Methods of estimating reliability Standard error of measurement Factors

LANGUAGE TEST RELIABILITY On defining reliability Sources of unreliability Methods of estimating reliability Standard error of measurement Factors LANGUAGE TEST RELIABILITY On defining reliability Sources of unreliability Methods of estimating reliability Standard error of measurement Factors affecting reliability ON DEFINING RELIABILITY Non-technical

More information

CHAPTER III RESEARCH METHODOLOGY

CHAPTER III RESEARCH METHODOLOGY CHAPTER III RESEARCH METHODOLOGY Research methodology explains the activity of research that pursuit, how it progress, estimate process and represents the success. The methodological decision covers the

More information

32.5. percent of U.S. manufacturers experiencing unfair currency manipulation in the trade practices of other countries.

32.5. percent of U.S. manufacturers experiencing unfair currency manipulation in the trade practices of other countries. TECH 646 Analysis of Research in Industry and Technology PART III The Sources and Collection of data: Measurement, Measurement Scales, Questionnaires & Instruments, Sampling Ch. 11 Measurement Lecture

More information

ESTABLISHING VALIDITY AND RELIABILITY OF ACHIEVEMENT TEST IN BIOLOGY FOR STD. IX STUDENTS

ESTABLISHING VALIDITY AND RELIABILITY OF ACHIEVEMENT TEST IN BIOLOGY FOR STD. IX STUDENTS International Journal of Educational Science and Research (IJESR) ISSN(P): 2249-6947; ISSN(E): 2249-8052 Vol. 4, Issue 4, Aug 2014, 29-36 TJPRC Pvt. Ltd. ESTABLISHING VALIDITY AND RELIABILITY OF ACHIEVEMENT

More information

Psychologist use statistics for 2 things

Psychologist use statistics for 2 things Psychologist use statistics for 2 things O Summarize the information from the study/experiment O Measures of central tendency O Mean O Median O Mode O Make judgements and decisions about the data O See

More information

Editorial: An Author s Checklist for Measure Development and Validation Manuscripts

Editorial: An Author s Checklist for Measure Development and Validation Manuscripts Journal of Pediatric Psychology Advance Access published May 31, 2009 Editorial: An Author s Checklist for Measure Development and Validation Manuscripts Grayson N. Holmbeck and Katie A. Devine Loyola

More information

By Hui Bian Office for Faculty Excellence

By Hui Bian Office for Faculty Excellence By Hui Bian Office for Faculty Excellence 1 Email: bianh@ecu.edu Phone: 328-5428 Location: 1001 Joyner Library, room 1006 Office hours: 8:00am-5:00pm, Monday-Friday 2 Educational tests and regular surveys

More information

Scaling the quality of clinical audit projects: a pilot study

Scaling the quality of clinical audit projects: a pilot study International Journal for Quality in Health Care 1999; Volume 11, Number 3: pp. 241 249 Scaling the quality of clinical audit projects: a pilot study ANDREW D. MILLARD Scottish Clinical Audit Resource

More information

Ch. 11 Measurement. Measurement

Ch. 11 Measurement. Measurement TECH 646 Analysis of Research in Industry and Technology PART III The Sources and Collection of data: Measurement, Measurement Scales, Questionnaires & Instruments, Sampling Ch. 11 Measurement Lecture

More information

Validity and reliability of measurements

Validity and reliability of measurements Validity and reliability of measurements 2 3 Request: Intention to treat Intention to treat and per protocol dealing with cross-overs (ref Hulley 2013) For example: Patients who did not take/get the medication

More information

On the purpose of testing:

On the purpose of testing: Why Evaluation & Assessment is Important Feedback to students Feedback to teachers Information to parents Information for selection and certification Information for accountability Incentives to increase

More information

Do not write your name on this examination all 40 best

Do not write your name on this examination all 40 best Student #: Do not write your name on this examination Research in Psychology I; Final Exam Fall 200 Instructor: Jeff Aspelmeier NOTE: THIS EXAMINATION MAY NOT BE RETAINED BY STUDENTS This exam is worth

More information

Measures. David Black, Ph.D. Pediatric and Developmental. Introduction to the Principles and Practice of Clinical Research

Measures. David Black, Ph.D. Pediatric and Developmental. Introduction to the Principles and Practice of Clinical Research Introduction to the Principles and Practice of Clinical Research Measures David Black, Ph.D. Pediatric and Developmental Neuroscience, NIMH With thanks to Audrey Thurm Daniel Pine With thanks to Audrey

More information

Constructing Indices and Scales. Hsueh-Sheng Wu CFDR Workshop Series June 8, 2015

Constructing Indices and Scales. Hsueh-Sheng Wu CFDR Workshop Series June 8, 2015 Constructing Indices and Scales Hsueh-Sheng Wu CFDR Workshop Series June 8, 2015 1 Outline What are scales and indices? Graphical presentation of relations between items and constructs for scales and indices

More information

The Research Roadmap Checklist

The Research Roadmap Checklist 1/5 The Research Roadmap Checklist Version: December 1, 2007 All enquires to bwhitworth@acm.org This checklist is at http://brianwhitworth.com/researchchecklist.pdf The element details are explained at

More information

An International Study of the Reliability and Validity of Leadership/Impact (L/I)

An International Study of the Reliability and Validity of Leadership/Impact (L/I) An International Study of the Reliability and Validity of Leadership/Impact (L/I) Janet L. Szumal, Ph.D. Human Synergistics/Center for Applied Research, Inc. Contents Introduction...3 Overview of L/I...5

More information

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604 Measurement and Descriptive Statistics Katie Rommel-Esham Education 604 Frequency Distributions Frequency table # grad courses taken f 3 or fewer 5 4-6 3 7-9 2 10 or more 4 Pictorial Representations Frequency

More information

Ursuline College Accelerated Program

Ursuline College Accelerated Program Ursuline College Accelerated Program CRITICAL INFORMATION! DO NOT SKIP THIS LINK BELOW... BEFORE PROCEEDING TO READ THE UCAP MODULE, YOU ARE EXPECTED TO READ AND ADHERE TO ALL UCAP POLICY INFORMATION CONTAINED

More information

Georgina Salas. Topics EDCI Intro to Research Dr. A.J. Herrera

Georgina Salas. Topics EDCI Intro to Research Dr. A.J. Herrera Homework assignment topics 32-36 Georgina Salas Topics 32-36 EDCI Intro to Research 6300.62 Dr. A.J. Herrera Topic 32 1. Researchers need to use at least how many observers to determine interobserver reliability?

More information

Agreement Coefficients and Statistical Inference

Agreement Coefficients and Statistical Inference CHAPTER Agreement Coefficients and Statistical Inference OBJECTIVE This chapter describes several approaches for evaluating the precision associated with the inter-rater reliability coefficients of the

More information

English 10 Writing Assessment Results and Analysis

English 10 Writing Assessment Results and Analysis Academic Assessment English 10 Writing Assessment Results and Analysis OVERVIEW This study is part of a multi-year effort undertaken by the Department of English to develop sustainable outcomes assessment

More information

Ch. 11 Measurement. Paul I-Hai Lin, Professor A Core Course for M.S. Technology Purdue University Fort Wayne Campus

Ch. 11 Measurement. Paul I-Hai Lin, Professor  A Core Course for M.S. Technology Purdue University Fort Wayne Campus TECH 646 Analysis of Research in Industry and Technology PART III The Sources and Collection of data: Measurement, Measurement Scales, Questionnaires & Instruments, Sampling Ch. 11 Measurement Lecture

More information

02a: Test-Retest and Parallel Forms Reliability

02a: Test-Retest and Parallel Forms Reliability 1 02a: Test-Retest and Parallel Forms Reliability Quantitative Variables 1. Classic Test Theory (CTT) 2. Correlation for Test-retest (or Parallel Forms): Stability and Equivalence for Quantitative Measures

More information

Chapter -6 Reliability and Validity of the test Test - Retest Method Rational Equivalence Method Split-Half Method

Chapter -6 Reliability and Validity of the test Test - Retest Method Rational Equivalence Method Split-Half Method Chapter -6 Reliability and Validity of the test 6.1 Introduction 6.2 Reliability of the test 6.2.1 Test - Retest Method 6.2.2 Rational Equivalence Method 6.2.3 Split-Half Method 6.3 Validity of the test

More information

Associate Prof. Dr Anne Yee. Dr Mahmoud Danaee

Associate Prof. Dr Anne Yee. Dr Mahmoud Danaee Associate Prof. Dr Anne Yee Dr Mahmoud Danaee 1 2 What does this resemble? Rorschach test At the end of the test, the tester says you need therapy or you can't work for this company 3 Psychological Testing

More information

Psychometric Evaluation of Self-Report Questionnaires - the development of a checklist

Psychometric Evaluation of Self-Report Questionnaires - the development of a checklist pr O C e 49C c, 1-- Le_ 5 e. _ P. ibt_166' (A,-) e-r e.),s IsONI 53 5-6b

More information

CHAPTER III RESEARCH METHOD. method the major components include: Research Design, Research Site and

CHAPTER III RESEARCH METHOD. method the major components include: Research Design, Research Site and CHAPTER III RESEARCH METHOD This chapter presents the research method and design. In this research method the major components include: Research Design, Research Site and Access, Population and Sample,

More information

Figure 1: Design and outcomes of an independent blind study with gold/reference standard comparison. Adapted from DCEB (1981b)

Figure 1: Design and outcomes of an independent blind study with gold/reference standard comparison. Adapted from DCEB (1981b) Page 1 of 1 Diagnostic test investigated indicates the patient has the Diagnostic test investigated indicates the patient does not have the Gold/reference standard indicates the patient has the True positive

More information

The Short NART: Cross-validation, relationship to IQ and some practical considerations

The Short NART: Cross-validation, relationship to IQ and some practical considerations British journal of Clinical Psychology (1991), 30, 223-229 Printed in Great Britain 2 2 3 1991 The British Psychological Society The Short NART: Cross-validation, relationship to IQ and some practical

More information

Monday 4 June 2018 Afternoon Time allowed: 2 hours

Monday 4 June 2018 Afternoon Time allowed: 2 hours Oxford ambridge and RS Level Psychology H567/01 Research methods Monday 4 June 2018 fternoon Time allowed: 2 hours *6906008764* You must have: a scientific or graphical calculator * H 5 6 7 0 1 * First

More information

Global Clinical Trials Innovation Summit Berlin October 2016

Global Clinical Trials Innovation Summit Berlin October 2016 Global Clinical Trials Innovation Summit Berlin 20-21 October 2016 BIOSTATISTICS A FEW ESSENTIALS: USE AND APPLICATION IN CLINICAL RESEARCH Berlin, 20 October 2016 Dr. Aamir Shaikh Founder, Assansa Here

More information

Validity, Reliability, and Defensibility of Assessments in Veterinary Education

Validity, Reliability, and Defensibility of Assessments in Veterinary Education Instructional Methods Validity, Reliability, and Defensibility of Assessments in Veterinary Education Kent Hecker g Claudio Violato ABSTRACT In this article, we provide an introduction to and overview

More information

Sample Exam Questions Psychology 3201 Exam 1

Sample Exam Questions Psychology 3201 Exam 1 Scientific Method Scientific Researcher Scientific Practitioner Authority External Explanations (Metaphysical Systems) Unreliable Senses Determinism Lawfulness Discoverability Empiricism Control Objectivity

More information

Measurement. 500 Research Methods Mike Kroelinger

Measurement. 500 Research Methods Mike Kroelinger Measurement 500 Research Methods Mike Kroelinger Levels of Measurement Nominal Lowest level -- used to classify variables into two or more categories. Cases placed in the same category must be equivalent.

More information

What are Indexes and Scales

What are Indexes and Scales ISSUES Exam results are on the web No student handbook, will have discussion questions soon Next exam will be easier but want everyone to study hard Biggest problem was question on Research Design Next

More information

Lessons in biostatistics

Lessons in biostatistics Lessons in biostatistics : the kappa statistic Mary L. McHugh Department of Nursing, National University, Aero Court, San Diego, California Corresponding author: mchugh8688@gmail.com Abstract The kappa

More information

TLQ Reliability, Validity and Norms

TLQ Reliability, Validity and Norms MSP Research Note TLQ Reliability, Validity and Norms Introduction This research note describes the reliability and validity of the TLQ. Evidence for the reliability and validity of is presented against

More information

Evaluating the Merits of Using Brief Measures of PTSD or General Mental Health Measures in Two-Stage PTSD Screening

Evaluating the Merits of Using Brief Measures of PTSD or General Mental Health Measures in Two-Stage PTSD Screening MILITARY MEDICINE, 179, 12:1497, 2014 Evaluating the Merits of Using Brief Measures of PTSD or General Mental Health Measures in Two-Stage PTSD Screening Nicole M. Steele, BSc (Hons)*; Helen P. Benassi,

More information

The Bengali Adaptation of Edinburgh Postnatal Depression Scale

The Bengali Adaptation of Edinburgh Postnatal Depression Scale IOSR Journal of Nursing and Health Science (IOSR-JNHS) e-issn: 2320 1959.p- ISSN: 2320 1940 Volume 4, Issue 2 Ver. I (Mar.-Apr. 2015), PP 12-16 www.iosrjournals.org The Bengali Adaptation of Edinburgh

More information

Conducting Survey Research. John C. Ricketts

Conducting Survey Research. John C. Ricketts + Conducting Survey Research John C. Ricketts + What is survey research? + Question When is survey research the best method available? A. when collecting original data B. when describing a population too

More information

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when.

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when. INTRO TO RESEARCH METHODS: Empirical Knowledge: based on observations. Answer questions why, whom, how, and when. Experimental research: treatments are given for the purpose of research. Experimental group

More information

Collecting & Making Sense of

Collecting & Making Sense of Collecting & Making Sense of Quantitative Data Deborah Eldredge, PhD, RN Director, Quality, Research & Magnet Recognition i Oregon Health & Science University Margo A. Halm, RN, PhD, ACNS-BC, FAHA Director,

More information

Class 7 Everything is Related

Class 7 Everything is Related Class 7 Everything is Related Correlational Designs l 1 Topics Types of Correlational Designs Understanding Correlation Reporting Correlational Statistics Quantitative Designs l 2 Types of Correlational

More information

Chronicles of Dental Research

Chronicles of Dental Research REVIEW ARTICLE Validity and reliability of a questionnaire: a literature review Shyamalima Bhattacharyya 1 Ramneek Kaur 1 Sukirat Kaur 1 Syed Amaan Ali 1 Abstarct Questionnaires form an important part

More information

Chapter 3 Psychometrics: Reliability and Validity

Chapter 3 Psychometrics: Reliability and Validity 34 Chapter 3 Psychometrics: Reliability and Validity Every classroom assessment measure must be appropriately reliable and valid, be it the classic classroom achievement test, attitudinal measure, or performance

More information

Section 6: Analysing Relationships Between Variables

Section 6: Analysing Relationships Between Variables 6. 1 Analysing Relationships Between Variables Section 6: Analysing Relationships Between Variables Choosing a Technique The Crosstabs Procedure The Chi Square Test The Means Procedure The Correlations

More information

MEASUREMENT, SCALING AND SAMPLING. Variables

MEASUREMENT, SCALING AND SAMPLING. Variables MEASUREMENT, SCALING AND SAMPLING Variables Variables can be explained in different ways: Variable simply denotes a characteristic, item, or the dimensions of the concept that increases or decreases over

More information

AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK

AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK Hanny Pradana, Gatot Sutapa, Luwandi Suhartono Sarjana Degree of English Language Education, Teacher

More information

X I UNIVER5 ITY. Of 1LLI NOIS OF THL B R. A R. Y

X I UNIVER5 ITY. Of 1LLI NOIS OF THL B R. A R. Y X I B R. A R. Y OF THL UNIVER5 ITY Of 1LLI NOIS 370 Return this book on or before the Latest Date stamped below. University of Illinois Library D«:!i,g; m 13«76 K 2 2 m L161 H41 Digitized by the Internet

More information

SOME NOTES ON STATISTICAL INTERPRETATION

SOME NOTES ON STATISTICAL INTERPRETATION 1 SOME NOTES ON STATISTICAL INTERPRETATION Below I provide some basic notes on statistical interpretation. These are intended to serve as a resource for the Soci 380 data analysis. The information provided

More information

ACDI. An Inventory of Scientific Findings. (ACDI, ACDI-Corrections Version and ACDI-Corrections Version II) Provided by:

ACDI. An Inventory of Scientific Findings. (ACDI, ACDI-Corrections Version and ACDI-Corrections Version II) Provided by: + ACDI An Inventory of Scientific Findings (ACDI, ACDI-Corrections Version and ACDI-Corrections Version II) Provided by: Behavior Data Systems, Ltd. P.O. Box 44256 Phoenix, Arizona 85064-4256 Telephone:

More information

Chapter 7: Descriptive Statistics

Chapter 7: Descriptive Statistics Chapter Overview Chapter 7 provides an introduction to basic strategies for describing groups statistically. Statistical concepts around normal distributions are discussed. The statistical procedures of

More information

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Greg Pope, Analytics and Psychometrics Manager 2008 Users Conference San Antonio Introduction and purpose of this session

More information

Measurement of Resilience Barbara Resnick, PHD,CRNP

Measurement of Resilience Barbara Resnick, PHD,CRNP Measurement of Resilience Barbara Resnick, PHD,CRNP Resilience Definition Resilience has been defined in many ways and is considered both a process and a personality trait. Resilience generally refers

More information