Chapter -6 Reliability and Validity of the test Test - Retest Method Rational Equivalence Method Split-Half Method

Size: px
Start display at page:

Download "Chapter -6 Reliability and Validity of the test Test - Retest Method Rational Equivalence Method Split-Half Method"

Transcription

1 Chapter -6 Reliability and Validity of the test 6.1 Introduction 6.2 Reliability of the test Test - Retest Method Rational Equivalence Method Split-Half Method 6.3 Validity of the test 6.4 Conclusion

2 Chapter-6 Reliability and Validity of the test 6.1 Introduction Standardized tests provide uniform procedures for administering and scoring the instrument. The same questions are asked each time the test is used, with a set of directions, that specifies how the test should be administered. Psychological tests are among the most useful tools of educational research. They have been devised to evaluate or measure behaviour in a standardized way for the purpose of selection, classification, prediction and guidance as well as for the evaluation of educational programmes. For the purpose of educational research, standardized tests are the most commonly usedc ones because they are considered-more objective than the non-standardized ones. A standardized test is one that has specific directions for administration and scoring, a fixed set of test items, and has been administered to representative samples taken from the population for whom the test is intended for the purpose of establishing norms. Thus, the chief value of standardized test for research purposes in education lies in their use as tools of comparison. It may be noted that standardize tests are as objective as possible their scoring is unambiguous and simple. A researcher must have to evaluate the validity, reliability of the test. In the present chapter, the investigator has described the procedure to find out the validity and reliability of the self - made vocabulary test at length. 6.2 Reliability of the Test Standardized tests are always reliable. No doubts in the results of the test itself. A data collection test must be reliable, that is it must have the ability to consistently yield the same results when repeated 287

3 measurements are taken of the same individuals under the same conditions. Reliability refers to the consistency of measurement, the extent to which the results are similar over different forms of the same instrument or occasions of data collecting. In the words of Freeman (1965), The term reliability has two closely related but somewhat different connotations in psychological testing. First, it refers to the extent to which a test is internally consistent, that is, consistency of results obtained throughout the test when administered once. In other words, how accurately, is the test measuring at a particular time? Second, reliability refers to the extent to which a measuring device yields consistent results upon testing and retesting^ That is, how dependable is it for predictive purposes? 1 A test is reliable to the extent that it measures what ever it is measuring consistently. In tests that have a high coefficient of reliability, errors of measurement have been reduced to a minimum. Reliable tests are stable in whatever they measure and yield comparable scores on repeated administration. According to F.H. Brown (1983), How consistently a test measures overtime, occasions or samples of items the degree to which test scores are influenced by measurement errors. Indices include (reliability co-efficient) and the standard error of measurement. 288

4 In the words of Charles Jackson (1982), Reliability is the degree to which test scores are free from errors of measurement.3 According to John W. Best and James V. Kahan (2005), The reliability of a test may be raised by increasing the number of items of equal quality to the other items. Carefully designed directions for the administration of the test with no variation from group to group, providing an atmosphere free from distractions and one that minimizes boredom and fatigue, will also improve the reliability of the testing instrument.4 According to the above statements, we may sum up to say that, A test may be reliable even though it is not valid. However, for a test to be valid, it must be reliable. That is, a test can consistently measure (reliability) nothing of interest (be invalid), but if a test measures what it is designed to measure (validity), it must do so consistently (reliability). There are four procedures in common use for assessing the reliability of a test they include (1) The test - retest method (2) The alternate or Parallel forms method (3) the split half method and (4) The rational equivalence method. From the above methods three of them are use to find out the reliability of the test for the present study. (1) The Test - Retest Method. (2) The Rational Equivalent Method (3) The Split- Half Method. 289

5 6.2.1 The Test- Retest Method In this method the same test is re-administered shortly after the firs administration, and the two sets of scores are correlates to obtain the reliability of the test. It is known as very easy method to establish the reliability of the test. In the present study, the investigator has selected the students of std. 8, 9 and group sample for the test - retest administration. The test has given to the same group of students after one week. According to the scores of the students in the test the frequency is presented in Table 6.1. Table 6.1 Reliability Value of Test - Retest (One Week) Scores of the test Class Fy l Fx According to the above Table 6.1, the value of correlation of test- retest is carried out so, the value of test - retest reliability of the 290

6 vocabulary test is achieved very high and describe the positive corelation. So, it can be said that the test is reliable. To establish the reliability value of the test, it has given to the students of std. 9 after two weeks. According to the scores of the test, the frequency is presented in Table 6.2 Table 6.2 Reliability Value of Test - Retest (Two Weeks) Scores of the test Class Fy Fx According to the above table 6.2. The value of correlation of testretest is carried out So, the value of test - retest reliability is higher. The value of co-relation is significant at 0.01 level as its value is lower than the one week test - retest. It is definitely said that the reliability of the test is carried out higher. So, the efficiency of the students in 291

7 vocabulary test is stable. So, the test is successfully measured the vocabulary of the students Rational Equivalence Method Such type of method of reliability is evolved to get an estimate of the reliability of a test, free from the objection raised against the methods discussed. Two forms of a test are defined as equivalent when corresponding items are interchangeable, when the inter - item correlations are the same for both the forms. Two internal consistency formula developed by Kuder - Richardson are often used to obtain coefficients of equivalence for test where one point is given for every correct answers and zero for a wrong answer. Reliability by KR-21 Table 6.3 Frequency Distribution of the difference between odd and even items Difference Frequency N=

8 Table 6.4 Reliability and SE (Rullon) SD of difference SD of total score Rtt SEv Table 6.5 Reliability and SE (Flangon) N 90 SD of odd nos SD o f even nos SD of total scores v« SEv Table 6.6 Reliability and SE (Kundcr - Richardson) KR21 Mean SD o f total score Rtt OT7 o J& V N 90 According to the above table 6.5. The mean is carried out and the SD is carried out Accordint to the values, the value of V 293

9 is carried out Which describes the positive correlation. So, it can be said that the test has achieved good reliability S p lit- Half Method In this method, the test is divided into two equivalent halves and the scores on the half of the items are correlated with the scores on the other half from the reliability of the half - test, the self - correlation of the whole test is then estimated by spearman - brown prophecy formula. The items of the test can be divided into two sets in a variety of ways. This method of reliability measures the internal reliability of the test and if the two halves do not correlate highly it suggest that they are not measuring the same thing, Moreover, the method has the advantage of controlling the fatigue and practice effects. According to the B.K. Tuckman (1975), A co-efficient of reliability obtained by correlating scores on one half of a test with scores on the other half and applying the spearman - Brown formula to adjust for the doubled length of the total test. Generally but not necessarily to the two halves consists of the old numbered and the even numbered items.5 For the present test, of add and even numbered equations are divided into two part. The sample of 90 students has taken for the present calculation. The frequency distribution of the available scores are described in Table

10 Table 6.7 Frequency distribution of odd and even numbers Odd Even Class Intervals Frequency Class Intervals Frequency Total 90 Total 90 According to the table 6.7, the spit - half reliability is carried out So, overall the reliability value is carried out The available reliability of the present test is good with the help of the split-half method. 6.3 Validity of the Test Validity refers to the degree to which evidence and theory support the interpretation of the test scores entailed by proposed uses of test. That is, validity has to do with both the attributes of the test and the uses to which it is put. When tests are used for more than one purpose, these 295

11 needs to be evidence of validity for each of these uses. Considering the number and variety of tests and their uses, the type of validity. The test, as a data collection tool, must produce information that is not only relevant but free from systematic errors that is, it must produce valid information. In general, a test is valid if it measures wheat is claims to measure. A test, however, does not possess universal and eternal validity. It may be valid for use in one situation but invalid if used in another. Lee J. Cronbach (1964) says that, A test which helps in making one decision in a particular research situation may have no value at all for another. This means that a researcher should not ask the general questions. Is tais a valid test?. The pertinent question to ask is how valid is this test for the decision. I wish to make? or more generally for what decision is this test valid? 6 In other words of W.J. Bramble (1984), The degree to which a test measures what it is suppose to measure, the experiments are attributable to the treatments.7 These are different types of validity including (1) Content validity (2) Functional validity (3) Factorial validity (4) Co-current validity For the present study, the investigator has used the following types of validity method of established the validity o f the vocabulary test. 296

12 Functional Validity In the present - study, the investigator has checked the validity of the test with the scores of the MIQ test of Dr. J.H. Shah with the scores of the vocabulary test. The co-relation of the scores of both the test has carried out the v value Which is appropriated valid for the present test. The v value is significant at 0.01 level. This shows that the test is valid. The investigator has checked validity of the present test with the help of the marks of English subject in first from exam of the students of std. 10. The correlation of the scores of the vocabulary test and the marks of English subject is carried the v value Which is appropriately achieved higher. So, it can be said that the test is valid according to the correlation of the above marks. The v value is significant at 0.01 level. According to the interpretation of correlation, the test indicates significant positive correlation. This shows that test is valid. Co-current validity The teacher s rating has prepared according to the scores of the vocabulary test and the efficiency of the students. The rating scale has given to the teachers who have taught English subject in the school. The rating scale has prepared for the 50 students of std. 10. The X2 has calculated for the present rating scale. Which is presented in Table

13 Table 6.8 Efficiency of the students in the vocabulary test & English Subject Teacher s Rating Scale Class Very Very Good Medium Poor intervals\ Good Poor Total Total 'j According to the Table 6.8, the value of X is carried out 90. According to that the correlation has been calculated with the help of the following formula, C = 1 x : N + x* Here, %2 = Qui Square N = No. of Students C = Correlation From the above frequency, the C value is carried Which is significant at 0.01 level. Which means that the present test is valid. 298

14 6.4 Conclusion In the present study, the row scores of the present test has analyzed according to the different statistical method. So, that the usability value of the test can be achieved at length. For the present test, the reliability and validity of the test has been find out with the help of different methods as the important steps of the standardization of the present test. It is the great achievement for the present test, that it s validity value has carried out higher and on the other side the reliability value has also carried out higher. So, it acceptable that, the process of the standardization of the present vocabulary test has established the reliability and validity scientificallus with the help of that, it can be said that the test is absolutely standardized 299

15 References 1. Frank S. Freeman (1965); Theory and Practice of Psychological Testing, (New Delhi : Oxford and IBH Publishing Co. India edition p F.H. Brown (1983); Principle of Education and Psychological Testing, (Third Edition) New York : Holt Rinehard and Winston, p Charles Jackson (1982); Understanding Psychological Testing, Bombay : Jaico Publishing p John W. Best & James V. Kahan (2005); Research in Education (Ninth Edition), New Delhi : Pearson Education (Singapore) Pvt. Ltd., Indian Branch p B. K. Tuckman (1975); Measuring Educational Outcomes Fundamentals of Testing, New York : Harcourt Brasce Jabvanovich International Edition, p Lee, J. Cronbach (1964); Essentials of Psychological Testing, New York : Harper and Row, International Education p W.J. Bramble (1979); Understanding and Conducting Research (Second Edition) Singapore : Mc-Graw Hill Book Co. p

LANGUAGE TEST RELIABILITY On defining reliability Sources of unreliability Methods of estimating reliability Standard error of measurement Factors

LANGUAGE TEST RELIABILITY On defining reliability Sources of unreliability Methods of estimating reliability Standard error of measurement Factors LANGUAGE TEST RELIABILITY On defining reliability Sources of unreliability Methods of estimating reliability Standard error of measurement Factors affecting reliability ON DEFINING RELIABILITY Non-technical

More information

ESTABLISHING VALIDITY AND RELIABILITY OF ACHIEVEMENT TEST IN BIOLOGY FOR STD. IX STUDENTS

ESTABLISHING VALIDITY AND RELIABILITY OF ACHIEVEMENT TEST IN BIOLOGY FOR STD. IX STUDENTS International Journal of Educational Science and Research (IJESR) ISSN(P): 2249-6947; ISSN(E): 2249-8052 Vol. 4, Issue 4, Aug 2014, 29-36 TJPRC Pvt. Ltd. ESTABLISHING VALIDITY AND RELIABILITY OF ACHIEVEMENT

More information

Construct Reliability and Validity Update Report

Construct Reliability and Validity Update Report Assessments 24x7 LLC DISC Assessment 2013 2014 Construct Reliability and Validity Update Report Executive Summary We provide this document as a tool for end-users of the Assessments 24x7 LLC (A24x7) Online

More information

Interpreting the Item Analysis Score Report Statistical Information

Interpreting the Item Analysis Score Report Statistical Information Interpreting the Item Analysis Score Report Statistical Information This guide will provide information that will help you interpret the statistical information relating to the Item Analysis Report generated

More information

VARIABLES AND MEASUREMENT

VARIABLES AND MEASUREMENT ARTHUR SYC 204 (EXERIMENTAL SYCHOLOGY) 16A LECTURE NOTES [01/29/16] VARIABLES AND MEASUREMENT AGE 1 Topic #3 VARIABLES AND MEASUREMENT VARIABLES Some definitions of variables include the following: 1.

More information

Reliability, validity, and all that jazz

Reliability, validity, and all that jazz Reliability, validity, and all that jazz Dylan Wiliam King s College London Introduction No measuring instrument is perfect. The most obvious problems relate to reliability. If we use a thermometer to

More information

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison Empowered by Psychometrics The Fundamentals of Psychometrics Jim Wollack University of Wisconsin Madison Psycho-what? Psychometrics is the field of study concerned with the measurement of mental and psychological

More information

A Scale to Measure the Attitude of Farmers Towards Kisan Call Centre

A Scale to Measure the Attitude of Farmers Towards Kisan Call Centre Journal of Extension Education Vol. 27 No. 4, 2015 A Scale to Measure the Attitude of Farmers Towards Shely Mery Koshy 1 and N. Kishore Kumar 2 ABSTRACT is one of the active services provided by the Government

More information

Making a psychometric. Dr Benjamin Cowan- Lecture 9

Making a psychometric. Dr Benjamin Cowan- Lecture 9 Making a psychometric Dr Benjamin Cowan- Lecture 9 What this lecture will cover What is a questionnaire? Development of questionnaires Item development Scale options Scale reliability & validity Factor

More information

HPS301 Exam Notes- Contents

HPS301 Exam Notes- Contents HPS301 Exam Notes- Contents Week 1 Research Design: What characterises different approaches 1 Experimental Design 1 Key Features 1 Criteria for establishing causality 2 Validity Internal Validity 2 Threats

More information

Reliability & Validity Dr. Sudip Chaudhuri

Reliability & Validity Dr. Sudip Chaudhuri Reliability & Validity Dr. Sudip Chaudhuri M. Sc., M. Tech., Ph.D., M. Ed. Assistant Professor, G.C.B.T. College, Habra, India, Honorary Researcher, Saha Institute of Nuclear Physics, Life Member, Indian

More information

Reliability and Validity checks S-005

Reliability and Validity checks S-005 Reliability and Validity checks S-005 Checking on reliability of the data we collect Compare over time (test-retest) Item analysis Internal consistency Inter-rater agreement Compare over time Test-Retest

More information

DATA GATHERING. Define : Is a process of collecting data from sample, so as for testing & analyzing before reporting research findings.

DATA GATHERING. Define : Is a process of collecting data from sample, so as for testing & analyzing before reporting research findings. DATA GATHERING Define : Is a process of collecting data from sample, so as for testing & analyzing before reporting research findings. 2012 John Wiley & Sons Ltd. Measurement Measurement: the assignment

More information

CHAPTER 7 VALIDITY 7.1 INTRODUCTION THE CONCEPT OF VALIDITY THE METHOD OF DETERMINING VARIDITY TYPE OF VALIDITY...

CHAPTER 7 VALIDITY 7.1 INTRODUCTION THE CONCEPT OF VALIDITY THE METHOD OF DETERMINING VARIDITY TYPE OF VALIDITY... 235 CHAPTER 7 VALIDITY CONTENTS PAGE 7.1 INTRODUCTION... 236 7.2 THE CONCEPT OF VALIDITY...236 7.3 THE METHOD OF DETERMINING VARIDITY...237 7.4 TYPE OF VALIDITY... 238 7.5 CONTENT VALIDITY... 241 7.6 CONSTRUCT

More information

CHAPTER-III METHODOLOGY

CHAPTER-III METHODOLOGY CHAPTER-III METHODOLOGY 3.1 INTRODUCTION This chapter deals with the methodology employed in order to achieve the set objectives of the study. Details regarding sample, description of the tools employed,

More information

Introduction to Reliability

Introduction to Reliability Reliability Thought Questions: How does/will reliability affect what you do/will do in your future job? Which method of reliability analysis do you find most confusing? Introduction to Reliability What

More information

RELIABILITY AND VALIDITY (EXTERNAL AND INTERNAL)

RELIABILITY AND VALIDITY (EXTERNAL AND INTERNAL) UNIT 2 RELIABILITY AND VALIDITY (EXTERNAL AND INTERNAL) Basic Process/Concept in Research Structure 2.0 Introduction 2.1 Objectives 2.2 Reliability 2.3 Methods of Estimating Reliability 2.3.1 External

More information

alternate-form reliability The degree to which two or more versions of the same test correlate with one another. In clinical studies in which a given function is going to be tested more than once over

More information

CHAPTER 3 METHOD AND PROCEDURE

CHAPTER 3 METHOD AND PROCEDURE CHAPTER 3 METHOD AND PROCEDURE Previous chapter namely Review of the Literature was concerned with the review of the research studies conducted in the field of teacher education, with special reference

More information

Reliability AND Validity. Fact checking your instrument

Reliability AND Validity. Fact checking your instrument Reliability AND Validity Fact checking your instrument General Principles Clearly Identify the Construct of Interest Use Multiple Items Use One or More Reverse Scored Items Use a Consistent Response Format

More information

11-3. Learning Objectives

11-3. Learning Objectives 11-1 Measurement Learning Objectives 11-3 Understand... The distinction between measuring objects, properties, and indicants of properties. The similarities and differences between the four scale types

More information

Validity, Reliability, and Defensibility of Assessments in Veterinary Education

Validity, Reliability, and Defensibility of Assessments in Veterinary Education Instructional Methods Validity, Reliability, and Defensibility of Assessments in Veterinary Education Kent Hecker g Claudio Violato ABSTRACT In this article, we provide an introduction to and overview

More information

By Hui Bian Office for Faculty Excellence

By Hui Bian Office for Faculty Excellence By Hui Bian Office for Faculty Excellence 1 Email: bianh@ecu.edu Phone: 328-5428 Location: 1001 Joyner Library, room 1006 Office hours: 8:00am-5:00pm, Monday-Friday 2 Educational tests and regular surveys

More information

FACTORIAL CONSTRUCTION OF A LIKERT SCALE

FACTORIAL CONSTRUCTION OF A LIKERT SCALE FACTORIAL CONSTRUCTION OF A LIKERT SCALE Mukhtar Ahmad Wani (Ph.D. JMI) Lecturer, School Education Department (J&K Govt.) Email: mukhtar137897@st.jmi.ac.in, wanimukhtar913@gmail.com Abstract: The purpose

More information

Chapter 3 Psychometrics: Reliability and Validity

Chapter 3 Psychometrics: Reliability and Validity 34 Chapter 3 Psychometrics: Reliability and Validity Every classroom assessment measure must be appropriately reliable and valid, be it the classic classroom achievement test, attitudinal measure, or performance

More information

PG Diploma in Counseling & Behaviour Modification PGDCBM Scheme of Examinations

PG Diploma in Counseling & Behaviour Modification PGDCBM Scheme of Examinations DIRECTORATE OF DISTANCE EDUCATION GURU JAMBHESHWAR UNIVERSITY OF SCIENCE & TECHNOLOGY, HISAR COURSE CURRICULUM FOR PG Diploma in Counseling & Behaviour Modification PGDCBM Scheme of Examinations Pass Marks:

More information

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Greg Pope, Analytics and Psychometrics Manager 2008 Users Conference San Antonio Introduction and purpose of this session

More information

AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK

AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK Hanny Pradana, Gatot Sutapa, Luwandi Suhartono Sarjana Degree of English Language Education, Teacher

More information

The Effect of Guessing on Item Reliability

The Effect of Guessing on Item Reliability The Effect of Guessing on Item Reliability under Answer-Until-Correct Scoring Michael Kane National League for Nursing, Inc. James Moloney State University of New York at Brockport The answer-until-correct

More information

Validity refers to the accuracy of a measure. A measurement is valid when it measures what it is suppose to measure and performs the functions that

Validity refers to the accuracy of a measure. A measurement is valid when it measures what it is suppose to measure and performs the functions that Validity refers to the accuracy of a measure. A measurement is valid when it measures what it is suppose to measure and performs the functions that it purports to perform. Does an indicator accurately

More information

Examining the Psychometric Properties of The McQuaig Occupational Test

Examining the Psychometric Properties of The McQuaig Occupational Test Examining the Psychometric Properties of The McQuaig Occupational Test Prepared for: The McQuaig Institute of Executive Development Ltd., Toronto, Canada Prepared by: Henryk Krajewski, Ph.D., Senior Consultant,

More information

CHAPTER IV METHOD AND PROCEDURE

CHAPTER IV METHOD AND PROCEDURE CHAPTER IV METHOD AND PROCEDURE In the preceding chapters we have stated the objectives of the present study and have also reviewed related research studies and their designs. The purpose of this chapter

More information

Discrimination Weighting on a Multiple Choice Exam

Discrimination Weighting on a Multiple Choice Exam Proceedings of the Iowa Academy of Science Volume 75 Annual Issue Article 44 1968 Discrimination Weighting on a Multiple Choice Exam Timothy J. Gannon Loras College Thomas Sannito Loras College Copyright

More information

DATA is derived either through. Self-Report Observation Measurement

DATA is derived either through. Self-Report Observation Measurement Data Management DATA is derived either through Self-Report Observation Measurement QUESTION ANSWER DATA DATA may be from Structured or Unstructured questions? Quantitative or Qualitative? Numerical or

More information

2 Types of psychological tests and their validity, precision and standards

2 Types of psychological tests and their validity, precision and standards 2 Types of psychological tests and their validity, precision and standards Tests are usually classified in objective or projective, according to Pasquali (2008). In case of projective tests, a person is

More information

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604 Measurement and Descriptive Statistics Katie Rommel-Esham Education 604 Frequency Distributions Frequency table # grad courses taken f 3 or fewer 5 4-6 3 7-9 2 10 or more 4 Pictorial Representations Frequency

More information

Comparative Analysis of the Reliability and Validity Coefficients of WAEC, NECO and NABTEB Constructed Mathematics Examination

Comparative Analysis of the Reliability and Validity Coefficients of WAEC, NECO and NABTEB Constructed Mathematics Examination Comparative Analysis of the Reliability and Validity Coefficients of WAEC, NECO and NABTEB Constructed Mathematics Examination Prof. S. O. Bandele Ekiti State University, Ado-Ekiti, Ekiti State, Nigeria.

More information

Empirical Demonstration of Techniques for Computing the Discrimination Power of a Dichotomous Item Response Test

Empirical Demonstration of Techniques for Computing the Discrimination Power of a Dichotomous Item Response Test Empirical Demonstration of Techniques for Computing the Discrimination Power of a Dichotomous Item Response Test Doi:10.5901/jesr.2014.v4n1p189 Abstract B. O. Ovwigho Department of Agricultural Economics

More information

Overview of Experimentation

Overview of Experimentation The Basics of Experimentation Overview of Experiments. IVs & DVs. Operational Definitions. Reliability. Validity. Internal vs. External Validity. Classic Threats to Internal Validity. Lab: FP Overview;

More information

International Journal of Research and Review E-ISSN: ; P-ISSN:

International Journal of Research and Review  E-ISSN: ; P-ISSN: International Journal of Research and Review www.gkpublication.in E-ISSN: 2349-9788; P-ISSN: 2454-2237 Original Research Article Impact of Self Concept and Emotional Intelligence on Adjustment of Adults

More information

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity Measurement & Variables - Initial step is to conceptualize and clarify the concepts embedded in a hypothesis or research question with

More information

RELATIONSHIP BETWEEN SOCIO ECONOMIC STATUS AND EDUCATIONAL ASPIRATION AMONG HIGHER SECONDARY SCHOOL STUDENT A STUDY. Sarita Soy 1

RELATIONSHIP BETWEEN SOCIO ECONOMIC STATUS AND EDUCATIONAL ASPIRATION AMONG HIGHER SECONDARY SCHOOL STUDENT A STUDY. Sarita Soy 1 96 RELATIONSHIP BETWEEN SOCIO ECONOMIC STATUS AND EDUCATIONAL ASPIRATION AMONG HIGHER SECONDARY SCHOOL STUDENT A STUDY. Sarita Soy 1 Abstract Educational aspiration and Socio-economic status plays important

More information

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship

More information

Basic Statistics and Data Analysis in Work psychology: Statistical Examples

Basic Statistics and Data Analysis in Work psychology: Statistical Examples Basic Statistics and Data Analysis in Work psychology: Statistical Examples WORK PSYCHOLOGY INTRODUCTION In this chapter we examine a topic which is given too little coverage in most texts of this kind,

More information

THE TEST STATISTICS REPORT provides a synopsis of the test attributes and some important statistics. A sample is shown here to the right.

THE TEST STATISTICS REPORT provides a synopsis of the test attributes and some important statistics. A sample is shown here to the right. THE TEST STATISTICS REPORT provides a synopsis of the test attributes and some important statistics. A sample is shown here to the right. The Test reliability indicators are measures of how well the questions

More information

Verbal Reasoning: Technical information

Verbal Reasoning: Technical information Verbal Reasoning: Technical information Issues in test construction In constructing the Verbal Reasoning tests, a number of important technical features were carefully considered by the test constructors.

More information

Psychologist use statistics for 2 things

Psychologist use statistics for 2 things Psychologist use statistics for 2 things O Summarize the information from the study/experiment O Measures of central tendency O Mean O Median O Mode O Make judgements and decisions about the data O See

More information

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971)

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971) Ch. 5: Validity Validity History Griggs v. Duke Power Ricci vs. DeStefano Defining Validity Aspects of Validity Face Validity Content Validity Criterion Validity Construct Validity Reliability vs. Validity

More information

Reliability and Validity

Reliability and Validity Reliability and Validity Why Are They Important? Check out our opening graphics. In a nutshell, do you want that car? It's not reliable. Would you recommend that car magazine (Auto Tester Weakly) to a

More information

Reliability and Validity

Reliability and Validity Reliability and Today s Objectives Understand the difference between reliability and validity Understand how to develop valid indicators of a concept Reliability and Reliability How accurate or consistent

More information

Adjustment status of students in relation to intelligence

Adjustment status of students in relation to intelligence Adjustment status of students in relation to intelligence Charanjit Kaur Dhillon 41, Officers Colony, Ferozepur City, Punjab-152002, India ABSTRACT Adjustment is a process by which an individual varies

More information

how good is the Instrument? Dr Dean McKenzie

how good is the Instrument? Dr Dean McKenzie how good is the Instrument? Dr Dean McKenzie BA(Hons) (Psychology) PhD (Psych Epidemiology) Senior Research Fellow (Abridged Version) Full version to be presented July 2014 1 Goals To briefly summarize

More information

Lesson 9: Two Factor ANOVAS

Lesson 9: Two Factor ANOVAS Published on Agron 513 (https://courses.agron.iastate.edu/agron513) Home > Lesson 9 Lesson 9: Two Factor ANOVAS Developed by: Ron Mowers, Marin Harbur, and Ken Moore Completion Time: 1 week Introduction

More information

CHAPTER IV CONSTRUCTION OF MORAL JUDGEMENT TEST, VALIDATION, DESCRIPTION AND ADMINISTRATION

CHAPTER IV CONSTRUCTION OF MORAL JUDGEMENT TEST, VALIDATION, DESCRIPTION AND ADMINISTRATION CHAPTER IV CONSTRUCTION OF MORAL JUDGEMENT TEST, VALIDATION, DESCRIPTION AND ADMINISTRATION 4.1 INTRODUCTION The present study is aimed at assessing the moral judgment among secondary school teachers of

More information

CHAPTER III RESEARCH METHODOLOGY

CHAPTER III RESEARCH METHODOLOGY CHAPTER III RESEARCH METHODOLOGY Research methodology explains the activity of research that pursuit, how it progress, estimate process and represents the success. The methodological decision covers the

More information

Validity and Reliability. PDF Created with deskpdf PDF Writer - Trial ::

Validity and Reliability. PDF Created with deskpdf PDF Writer - Trial :: Validity and Reliability PDF Created with deskpdf PDF Writer - Trial :: http://www.docudesk.com Validity Is the translation from concept to operationalization accurately representing the underlying concept.

More information

Binary Diagnostic Tests Two Independent Samples

Binary Diagnostic Tests Two Independent Samples Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary

More information

Interpretation of Data and Statistical Fallacies

Interpretation of Data and Statistical Fallacies ISSN: 2349-7637 (Online) RESEARCH HUB International Multidisciplinary Research Journal Research Paper Available online at: www.rhimrj.com Interpretation of Data and Statistical Fallacies Prof. Usha Jogi

More information

MATH : Design and Analysis of Clinical Trials

MATH : Design and Analysis of Clinical Trials MATH 654-102: Design and Analysis of Clinical Trials MID-TERM EXAM Spring, 2012 (Time allowed: TWO AND HALF HOURS) INSTRUCTIONS TO STUDENTS: 1. This test contains FIVE questions and comprises SIX printed

More information

Survey Question. What are appropriate methods to reaffirm the fairness, validity reliability and general performance of examinations?

Survey Question. What are appropriate methods to reaffirm the fairness, validity reliability and general performance of examinations? Clause 9.3.5 Appropriate methodology and procedures (e.g. collecting and maintaining statistical data) shall be documented and implemented in order to affirm, at justified defined intervals, the fairness,

More information

CHAPTER 3 METHODOLOGY, DATA COLLECTION AND DATA ANALYSIS

CHAPTER 3 METHODOLOGY, DATA COLLECTION AND DATA ANALYSIS CHAPTER 3 METHODOLOGY, DATA COLLECTION AND DATA ANALYSIS TABLE OF CONTENTS Titles Page Nos. CHAPTER 3: METHODOLOGY, DATA COLLECTION 88 97 AND DATA ANALYSIS 3.1 Objectives of the Study 89 3.2 Research Design

More information

PM12 Validity P R O F. D R. P A S Q U A L E R U G G I E R O D E P A R T M E N T O F B U S I N E S S A N D L A W

PM12 Validity P R O F. D R. P A S Q U A L E R U G G I E R O D E P A R T M E N T O F B U S I N E S S A N D L A W PM12 Validity P R O F. D R. P A S Q U A L E R U G G I E R O D E P A R T M E N T O F B U S I N E S S A N D L A W Internal and External Validity The concept of validity is very important in PE. To make PE

More information

Physical Intelligence of University of Technology MARA Sport Science Students

Physical Intelligence of University of Technology MARA Sport Science Students Physical Intelligence of University of Technology MARA Sport Science Students Tan Chee Hian Lecturer Faculty of Sport Science & Recreation University of Technology MARA Shah Alam, Selangor 1 Physical Intelligence

More information

CHAPTER - III METHODOLOGY CONTENTS. 3.1 Introduction. 3.2 Attitude Measurement & its devices

CHAPTER - III METHODOLOGY CONTENTS. 3.1 Introduction. 3.2 Attitude Measurement & its devices 102 CHAPTER - III METHODOLOGY CONTENTS 3.1 Introduction 3.2 Attitude Measurement & its devices 3.2.1. Prior Scales 3.2.2. Psychophysical Scales 3.2.3. Sigma Scales 3.2.4. Master Scales 3.3 Attitude Measurement

More information

CLINICAL RESEARCH METHODS VISP356. MODULE LEADER: PROF A TOMLINSON B.Sc./B.Sc.(HONS) OPTOMETRY

CLINICAL RESEARCH METHODS VISP356. MODULE LEADER: PROF A TOMLINSON B.Sc./B.Sc.(HONS) OPTOMETRY DIVISION OF VISION SCIENCES SESSION: 2006/2007 DIET: 1ST CLINICAL RESEARCH METHODS VISP356 LEVEL: MODULE LEADER: PROF A TOMLINSON B.Sc./B.Sc.(HONS) OPTOMETRY MAY 2007 DURATION: 2 HRS CANDIDATES SHOULD

More information

EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS

EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS DePaul University INTRODUCTION TO ITEM ANALYSIS: EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS Ivan Hernandez, PhD OVERVIEW What is Item Analysis? Overview Benefits of Item Analysis Applications Main

More information

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA Data Analysis: Describing Data CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA In the analysis process, the researcher tries to evaluate the data collected both from written documents and from other sources such

More information

Statistics for Psychosocial Research Session 1: September 1 Bill

Statistics for Psychosocial Research Session 1: September 1 Bill Statistics for Psychosocial Research Session 1: September 1 Bill Introduction to Staff Purpose of the Course Administration Introduction to Test Theory Statistics for Psychosocial Research Overview: a)

More information

Intelligence. Exam 3. Conceptual Difficulties. What is Intelligence? Chapter 11. Intelligence: Ability or Abilities? Controversies About Intelligence

Intelligence. Exam 3. Conceptual Difficulties. What is Intelligence? Chapter 11. Intelligence: Ability or Abilities? Controversies About Intelligence Exam 3?? Mean: 36 Median: 37 Mode: 45 SD = 7.2 N - 399 Top Score: 49 Top Cumulative Score to date: 144 Intelligence Chapter 11 Psy 12000.003 Spring 2009 1 2 What is Intelligence? Intelligence (in all cultures)

More information

An Introduction to Classical Test Theory as Applied to Conceptual Multiple-choice Tests

An Introduction to Classical Test Theory as Applied to Conceptual Multiple-choice Tests An Introduction to as Applied to Conceptual Multiple-choice Tests Paula V. Engelhardt Tennessee Technological University, 110 University Drive, Cookeville, TN 38505 Abstract: A number of conceptual multiple-choice

More information

Figure 1: Design and outcomes of an independent blind study with gold/reference standard comparison. Adapted from DCEB (1981b)

Figure 1: Design and outcomes of an independent blind study with gold/reference standard comparison. Adapted from DCEB (1981b) Page 1 of 1 Diagnostic test investigated indicates the patient has the Diagnostic test investigated indicates the patient does not have the Gold/reference standard indicates the patient has the True positive

More information

Basic concepts and principles of classical test theory

Basic concepts and principles of classical test theory Basic concepts and principles of classical test theory Jan-Eric Gustafsson What is measurement? Assignment of numbers to aspects of individuals according to some rule. The aspect which is measured must

More information

Choosing the Correct Statistical Test

Choosing the Correct Statistical Test Choosing the Correct Statistical Test T racie O. Afifi, PhD Departments of Community Health Sciences & Psychiatry University of Manitoba Department of Community Health Sciences COLLEGE OF MEDICINE, FACULTY

More information

X I UNIVER5 ITY. Of 1LLI NOIS OF THL B R. A R. Y

X I UNIVER5 ITY. Of 1LLI NOIS OF THL B R. A R. Y X I B R. A R. Y OF THL UNIVER5 ITY Of 1LLI NOIS 370 Return this book on or before the Latest Date stamped below. University of Illinois Library D«:!i,g; m 13«76 K 2 2 m L161 H41 Digitized by the Internet

More information

Biostatistics 2 - Correlation and Risk

Biostatistics 2 - Correlation and Risk BROUGHT TO YOU BY Biostatistics 2 - Correlation and Risk Developed by Pfizer January 2018 This learning module is intended for UK healthcare professionals only. PP-GEP-GBR-0957 Date of preparation Jan

More information

Dr. Babasaheb Ambedkar Marathwada University, Aurangabad.

Dr. Babasaheb Ambedkar Marathwada University, Aurangabad. Dr. Babasaheb Ambedkar Marathwada University, Aurangabad. PSYCHOLOGY SYIIABUS FOR - B. A. With effective from 2009-10. B. A. Second Year (Annual Pattern) Paper III : Psychology for Living Marks-75 Paper

More information

Effect of Adjustment on the Academic Performance of Urdu Medium Male and Female Secondary Level Students

Effect of Adjustment on the Academic Performance of Urdu Medium Male and Female Secondary Level Students Effect of Adjustment on the Academic Performance of Urdu Medium Male and Female Secondary Level Students Mohd. Moshahid Dept. of Education & Training, MANUU, Hyderabad-32, India. ABSTRACT This paper makes

More information

Intelligence. Exam 3. iclicker. My Brilliant Brain. What is Intelligence? Conceptual Difficulties. Chapter 10

Intelligence. Exam 3. iclicker. My Brilliant Brain. What is Intelligence? Conceptual Difficulties. Chapter 10 Exam 3 iclicker Mean: 32.8 Median: 33 Mode: 33 SD = 6.4 How many of you have one? Do you think it would be a good addition for this course in the future? Top Score: 49 Top Cumulative Score to date: 144

More information

ADMS Sampling Technique and Survey Studies

ADMS Sampling Technique and Survey Studies Principles of Measurement Measurement As a way of understanding, evaluating, and differentiating characteristics Provides a mechanism to achieve precision in this understanding, the extent or quality As

More information

Intelligence. PSYCHOLOGY (8th Edition) David Myers. Intelligence. Chapter 11. What is Intelligence?

Intelligence. PSYCHOLOGY (8th Edition) David Myers. Intelligence. Chapter 11. What is Intelligence? PSYCHOLOGY (8th Edition) David Myers PowerPoint Slides Aneeq Ahmad Henderson State University Worth Publishers, 2006 1 Intelligence Chapter 11 2 Intelligence What is Intelligence? Is Intelligence One General

More information

Bijay Lal Pradhan, M Sc Statistics, FDPM (IIMA) 2

Bijay Lal Pradhan, M Sc Statistics, FDPM (IIMA) 2 Bijay Lal Pradhan Measurement and Scaling 1) Definition of measurement and scale 2) Type of Physical scale i. Nominal Scale iii. Interval scale ii. Ordinal Scale iv. Ratio Scale 3) Need of scaling 4) Criteria

More information

DESIGNING AND VALIDATION OF SOCIO- PSYCHOLOGICAL AND SEXUAL CONSTRAINTS QUESTIONNAIRE FOR FEMALE ATHLETES

DESIGNING AND VALIDATION OF SOCIO- PSYCHOLOGICAL AND SEXUAL CONSTRAINTS QUESTIONNAIRE FOR FEMALE ATHLETES DESIGNING AND VALIDATION OF SOCIO- PSYCHOLOGICAL AND SEXUAL CONSTRAINTS QUESTIONNAIRE FOR FEMALE ATHLETES Dr. G.S. Brar Professor Department of Physical Education, Panjab University Chandigarh Veena Rani

More information

Using Lertap 5 in a Parallel-Forms Reliability Study

Using Lertap 5 in a Parallel-Forms Reliability Study Lertap 5 documents series. Using Lertap 5 in a Parallel-Forms Reliability Study Larry R Nelson Last updated: 16 July 2003. (Click here to branch to www.lertap.curtin.edu.au.) This page has been published

More information

Probability and Statistics Chapter 1 Notes

Probability and Statistics Chapter 1 Notes Probability and Statistics Chapter 1 Notes I Section 1-1 A is the science of collecting, organizing, analyzing, and interpreting data in order to make decisions 1 is information coming from observations,

More information

The Psychometric Principles Maximizing the quality of assessment

The Psychometric Principles Maximizing the quality of assessment Summer School 2009 Psychometric Principles Professor John Rust University of Cambridge The Psychometric Principles Maximizing the quality of assessment Reliability Validity Standardisation Equivalence

More information

Inferential Statistics

Inferential Statistics Inferential Statistics and t - tests ScWk 242 Session 9 Slides Inferential Statistics Ø Inferential statistics are used to test hypotheses about the relationship between the independent and the dependent

More information

ISSN Electronic International Interdisciplinary Research Journal (EIIRJ) Bi-monthly Reviewed Journal July/Aug 2013

ISSN Electronic International Interdisciplinary Research Journal (EIIRJ) Bi-monthly Reviewed Journal July/Aug 2013 ACHIEVEMENT MOTIVATION OF SENIOR SECONDARY SCHOOL STUDENTS IN RELATION TO THEIR SELF CONFIDENCE Dr. Sushma Gupta Assistant Prof. Deptt of Education, Kurukshetra University Kurukshetra Asha Rani Research

More information

3 CONCEPTUAL FOUNDATIONS OF STATISTICS

3 CONCEPTUAL FOUNDATIONS OF STATISTICS 3 CONCEPTUAL FOUNDATIONS OF STATISTICS In this chapter, we examine the conceptual foundations of statistics. The goal is to give you an appreciation and conceptual understanding of some basic statistical

More information

Sleeping Beauty is told the following:

Sleeping Beauty is told the following: Sleeping beauty Sleeping Beauty is told the following: You are going to sleep for three days, during which time you will be woken up either once Now suppose that you are sleeping beauty, and you are woken

More information

A Scale to Measure Attitude of Agricultural Professionals towards Social Media in Tamil Nadu, India

A Scale to Measure Attitude of Agricultural Professionals towards Social Media in Tamil Nadu, India International Journal of Current Microbiology and Applied Sciences ISSN: 2319-7706 Special Issue-4 (September-2017) pp. 37-44 Journal homepage: http://www.ijcmas.com Original Research Article A Scale to

More information

AND ITS VARIOUS DEVICES. Attitude is such an abstract, complex mental set. up that its measurement has remained controversial.

AND ITS VARIOUS DEVICES. Attitude is such an abstract, complex mental set. up that its measurement has remained controversial. CHAPTER III attitude measurement AND ITS VARIOUS DEVICES Attitude is such an abstract, complex mental set up that its measurement has remained controversial. Psychologists studied attitudes of individuals

More information

The Effect of Brain Gym Exercises on Self- Esteem and Sensory Processing Speed on High School Hearing Impaired Students

The Effect of Brain Gym Exercises on Self- Esteem and Sensory Processing Speed on High School Hearing Impaired Students The International Journal of Indian Psychology ISSN 2348-5396 (e) ISSN: 2349-3429 (p) Volume 4, Issue 2, No. 93, DIP: 18.01.136/20170402 ISBN: 978-1-365-78193-3 http://www.ijip.in January-March, 2017 The

More information

Lecture Week 3 Quality of Measurement Instruments; Introduction SPSS

Lecture Week 3 Quality of Measurement Instruments; Introduction SPSS Lecture Week 3 Quality of Measurement Instruments; Introduction SPSS Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit Overview Quality of Measurement Instruments Introduction SPSS Read:

More information

Chapter 7: Descriptive Statistics

Chapter 7: Descriptive Statistics Chapter Overview Chapter 7 provides an introduction to basic strategies for describing groups statistically. Statistical concepts around normal distributions are discussed. The statistical procedures of

More information

Field data reliability analysis of highly reliable item

Field data reliability analysis of highly reliable item Field data reliability analysis of highly reliable item David Valis University of Defence, Czech Republic david.valis@unob.cz Miroslav Koucky Technical University of Liberec, Czech Republic miroslav.koucky@tul.cz

More information

Reliability. Internal Reliability

Reliability. Internal Reliability 32 Reliability T he reliability of assessments like the DECA-I/T is defined as, the consistency of scores obtained by the same person when reexamined with the same test on different occasions, or with

More information

Using Standard and Asymmetric Confidence Intervals. Utilisation d intervalles de confiance standards et asymétriques. Christopher J.

Using Standard and Asymmetric Confidence Intervals. Utilisation d intervalles de confiance standards et asymétriques. Christopher J. Using Standard and Asymmetric Confidence Intervals Utilisation d intervalles de confiance standards et asymétriques KEY WORDS CONFIDENCE INTERVAL MEASUREMENT ERROR CONSISTENCY COEFFICIENT AGREEMENT COEFFICIENT

More information

Reliability, validity, and all that jazz

Reliability, validity, and all that jazz Reliability, validity, and all that jazz Dylan Wiliam King s College London Published in Education 3-13, 29 (3) pp. 17-21 (2001) Introduction No measuring instrument is perfect. If we use a thermometer

More information

CHAPTER LEARNING OUTCOMES

CHAPTER LEARNING OUTCOMES EXPERIIMENTAL METHODOLOGY CHAPTER LEARNING OUTCOMES When you have completed reading this article you will be able to: Define what is an experiment Explain the role of theory in educational research Justify

More information