Reliability and Validity

Today's Objectives
- Understand the difference between reliability and validity
- Understand how to develop valid indicators of a concept

Reliability and Validity

Reliability: How accurate or consistent is the measure?
- Would two people understand a question in the same way?
- Would the same person give the same answers under similar circumstances?

Validity: Does the measure capture what it is intended to measure?
- Does the measure actually reflect the concept?
- Do the findings reflect the opinions, attitudes, and behaviors of the target population?

A measure can be:
- Reliable but not valid
- Valid but not reliable
- Valid and reliable

Levels of Reliability

Example: Person's weight (listed from LOW to HIGH reliability)
- Estimate on the part of the subject
- Estimate on the part of the observer
- Old bathroom scale
- Industrial scale

Reliability

Reliability is the consistency of your measurement, or the degree to which an instrument measures the same way each time it is used under the same conditions with the same subjects. In short, it is the repeatability of your measurement. A measure is considered reliable if a person's scores on the same test given twice are similar. It is important to remember that reliability is not measured; it is estimated.

Here is a simple example to illustrate this. Suppose that you have bathroom weight scales and these weight scales are broken. The weight scales represent the methodology. One person weighs you with these scales and obtains a result. Then the weight scales are passed along to another person. The second person follows the same procedure, uses the same broken weight scales, and weighs you. The two people, using the same broken weight scales, arrive at similar measurements. The results are reliable, because two (or perhaps more) people obtained them consistently with the same faulty scales. Although the results are reliable, they may not be valid. That is, because the scales are faulty, the results are not a true indicator of the real weight.

Reliability
- Accuracy, precision, or consistency of measurement
- Degree to which measures are free from error and therefore yield consistent results
- Reliable measures mean the same data would have been collected under similar circumstances
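The broken-scale example can be made concrete with a small simulation. This is only an illustrative sketch: the true weight, the size of the scale's error, and the amount of random noise are assumed values chosen for the demonstration, not figures from the slides.

```python
import random
import statistics

# All numbers below are assumed for illustration; none come from the slides.
TRUE_WEIGHT_KG = 70.0    # hypothetical true weight of the person
SCALE_OFFSET_KG = 8.0    # hypothetical constant error of the broken scales
SCALE_NOISE_KG = 0.2     # hypothetical small random error per reading


def weigh(rng):
    """Return one reading from the broken scales."""
    return TRUE_WEIGHT_KG + SCALE_OFFSET_KG + rng.gauss(0.0, SCALE_NOISE_KG)


rng = random.Random(0)  # fixed seed so the run is repeatable
rater_a = [weigh(rng) for _ in range(10)]  # first person's readings
rater_b = [weigh(rng) for _ in range(10)]  # second person's readings

# Consistency: how far apart are the two raters' average readings?
rater_gap = abs(statistics.mean(rater_a) - statistics.mean(rater_b))

# Accuracy: how far is the overall average reading from the true weight?
bias = statistics.mean(rater_a + rater_b) - TRUE_WEIGHT_KG

print(f"gap between raters:   {rater_gap:.2f} kg")
print(f"bias vs. true weight: {bias:.2f} kg")
```

Under these assumptions the two raters agree closely (a small gap, so the readings are reliable) while both are far from the true weight (a large bias, so the readings are not valid).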

Methods used to determine reliability
- Test-retest method: administer the same measures to the same respondents at two separate points in time
- Split-half method: correlate one-half of a scale with the other half
- Calculate reliability coefficient: statistical test that measures the internal consistency of a set of items

How to improve Reliability?
- Quality of items: concise statements, homogeneous words (some sort of uniformity)
- Adequate sampling of content domain: comprehensiveness of items
- Longer assessment: less distorted by chance factors
- Developing a scoring plan (esp. for subjective items: rubrics)
- Ensure VALIDITY

Food Quality
What items would you include to get adequate sampling of content domain?

Program Satisfaction
- I like the after-school program
- I like the after-school teachers
- I would sign up again for the after-school program
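The three estimation methods listed above (test-retest, split-half, and an internal-consistency coefficient) can be sketched in a few lines. This is a minimal sketch under stated assumptions: the ratings are made-up data, the function names are hypothetical, Cronbach's alpha is used as one common choice of reliability coefficient (the slides do not name a specific one), and the Spearman-Brown adjustment in the split-half function is a standard addition that the slides do not mention.

```python
import statistics


def pearson(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    mx, my = statistics.mean(x), statistics.mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ss_x = sum((a - mx) ** 2 for a in x)
    ss_y = sum((b - my) ** 2 for b in y)
    return cov / (ss_x * ss_y) ** 0.5


def retest_reliability(time1, time2):
    """Test-retest: correlate the same respondents' scores at two times."""
    return pearson(time1, time2)


def split_half(responses):
    """Split-half: correlate odd-item totals with even-item totals.

    The Spearman-Brown step (not in the slides) adjusts the correlation
    between two half-length scales to estimate full-length reliability.
    """
    odd = [sum(row[0::2]) for row in responses]
    even = [sum(row[1::2]) for row in responses]
    r_half = pearson(odd, even)
    return 2 * r_half / (1 + r_half)


def cronbach_alpha(responses):
    """Cronbach's alpha, one common internal-consistency coefficient."""
    k = len(responses[0])                      # number of items
    items = list(zip(*responses))              # one tuple of scores per item
    item_vars = sum(statistics.variance(col) for col in items)
    total_var = statistics.variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - item_vars / total_var)


# Made-up 1-5 ratings: rows are respondents, columns are four items.
responses = [
    [5, 4, 5, 4],
    [4, 4, 4, 3],
    [2, 1, 2, 2],
    [3, 3, 4, 3],
    [1, 2, 1, 1],
    [4, 5, 4, 5],
]
# Made-up total scores for the same six respondents at two points in time.
time1 = [18, 15, 7, 13, 5, 18]
time2 = [17, 16, 8, 12, 6, 18]

print(f"test-retest r:    {retest_reliability(time1, time2):.2f}")
print(f"split-half (SB):  {split_half(responses):.2f}")
print(f"Cronbach's alpha: {cronbach_alpha(responses):.2f}")
```

Each function returns an estimate, in keeping with the earlier point that reliability is estimated rather than measured directly; values closer to 1 indicate more consistent scores.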

Validity
- The ability of a scale to measure what it is intended to measure
- The extent to which a measure reflects the real meaning of the concept under consideration
- The extent to which a measure reflects the opinions and behaviors of the population under investigation
- Cannot be valid unless also reliable

Validity

Validity refers to the degree to which a study accurately reflects or assesses the specific concept that the researcher is attempting to measure. While reliability is concerned with the accuracy of the actual measuring instrument or procedure, validity is concerned with the study's success at measuring what the researchers set out to measure.

Validity
- Depends on the purpose of the measure. E.g., a ruler may be a valid measuring device for length, but isn't very valid for measuring volume
- Measuring what it is supposed to
- Must be inferred from evidence; cannot be directly measured

What would be valid measures of:
- Intelligence?
- Religiosity?
- Knowledge of RPTS 336 material?
- Tourism motivations?
- Commitment to a leisure activity?
- Satisfaction with a leisure service?
- Environmental ethic?

Types of validity

Face (content) validity: professional agreement that variables cover the range of meanings included within the concept
- Items should be evaluated for their presumed relevance
- Items should cover a range of ideas rather than a single topic area
- Items should be evaluated in terms of the abilities of the individuals under investigation

Types of validity

Construct validity: the degree to which a measure relates to other variables, as expected, within a given system of theoretical relationships
- Example: Satisfaction and Program Quality

Predictive validity: extent to which a measure predicts some future event
- Example: Self-esteem and GPA

Factors that can lower Validity
- Unclear directions
- Difficult reading vocabulary and sentence structure
- Ambiguity in statements
- Inadequate time limits
- Inappropriate level of difficulty
- Poorly constructed test items
- Test items inappropriate for the outcomes being measured

Continued:
- Tests that are too short
- Improper arrangement of items (complex to easy?)
- Identifiable patterns of answers
- Teaching
- Administration and scoring
- Students
- Nature of criterion

External Validity

Answers the question of generalizability: to what populations or settings can this effect be generalized?

Two aspects:
- Population validity
- Ecological validity

Population Validity

Is the actual sample representative of the theoretical population? To determine, need to identify:
- Theoretical population
- Accessible population
- Sampling design and selected sample
- Actual sample