EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS

Size: px
Start display at page:

Download "EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS"

Transcription

1 DePaul University INTRODUCTION TO ITEM ANALYSIS: EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS Ivan Hernandez, PhD

2 OVERVIEW What is Item Analysis? Overview Benefits of Item Analysis Applications Main Statistics of Item Analysis Item Difficulty Item Discrimination Test Reliability Implementing Item Analysis D2L Excel SPSS 2

3 WHAT IS ITEM ANALYSIS? 3

4 WHAT IS ITEM ANALYSIS Consider the following... Imagine you have a multiple-choice test Every student sees a collection of questions Each question has different choices; only one choice is correct We use the overall score on the exam to assess the student s aptitude/ability We want to know who understands the material and who doesn t We want to make sure that the student s score is stable Question: How do we know we assessing are the student's ability as well as we can? Answer: Item Analysis 4

5 WHAT IS ITEM ANALYSIS What is Item Analysis? Statistically analyzing your multiple choice test items So that you can ensure that your items are effectively evaluating student ability Item Analysis is a collection of techniques Many tools/methods to analyze questions Examples: Item Difficulty Item Discrimination Internal Consistency Differential Item Functioning 5

6 6 ITEM ANALYSIS PROCESS Item Analysis is an Iterative and Continuous Process Teach Content Area Consider learning goals Consider student abilities Write Test Items Have items reflect overall content area Eliminate ambiguous or misleading items Administer Exam Perform Item Analysis All students should be given sufficient time to complete exam Testing conditions should be consistent for all students Examine item performance Evaluate reasons for poor performance

7 BENEFITS 7

8 8 PURPOSE OF ITEM ANALYSIS Primary questions that can be answered by Item Analysis Were any of the questions too difficult or easy? How well do the questions identify those students who knew the material from those that did not? (The more of these questions, the better your exam can precisely measure ability)2 How consistent are the exam s questions?3 : Item Difficulty How stable are people s scores?3 2: Item Discrimination 3: Reliability

9 BENEFITS OF ITEM ANALYSIS Benefits of using Item Analysis Test Development - Improve test quality Assess ability more efficiently Produce scores that are stable Make the exam more coherent Precision - Quantify measurement of assessment Identify areas of improvement Understand characteristics of exam Have clear indication of item quality 9

10 APPLICATIONS

11 APPLICATIONS OF ITEM ANALYSIS Item Analysis can answer many questions ( Am I able to know who has ability and who does not, Am I able to get a fine-grained view on ability, How consistent are scores ) Many fields are concerned with knowing the answer to those questions: Academic exams (GRE, LSAT, ACT) Employee selection (Wunderlic, Situational Judgement Test) Personality assessment (Five Factor Model, Emotional Intelligence)

12 APPLICATIONS OF ITEM ANALYSIS: ACADEMIC EXAMS 2 Academic Exams: Purpose is to assess a student s ability relative to their peers Item analysis helps construct an exam where: The extremes (high and low scorers) are identifiable Scores are separated (Little overlap between students) Exam scores are consistent across examinations

13 APPLICATIONS OF ITEM ANALYSIS: EMPLOYEE SELECTION 3 Employee selection: Purpose is to select employees that have the highest ability in the workplace Item analysis helps construct an employee questionnaire where Employees abilities are distinguished from one another Slightly better employees are noticeable than slightly worse employees Scores on the employee questionnaire are consistent

14 APPLICATIONS OF ITEM ANALYSIS: PERSONALITY ASSESSMENT 4 Personality assessment: Purpose is to evaluate the behavioral tendencies of individuals Item analysis helps create a questionnaire where People high on a personality trait and low on a personality can be detected People s personality scores are fine grained People s personality score is consistent across evaluations

15 PRELIMINARY PREPARATION 5

16 BEFORE BEGINNING ITEM ANALYSIS Almost all item analysis procedures require the exam data to be a specific format Columns = Exam questions Rows = An individual examinee Cells = Whether the examinee got the question right () or wrong () 6

17 7 BEFORE BEGINNING ITEM ANALYSIS Almost all item analysis procedures require the exam data to be a specific format Columns = Exam questions Rows = An individual examinee Cells = Whether the examinee got the question right () or wrong () Examinee Question Question 2 Question 3 Question 4 Anne Bob Chelsea Dan Erica Fred

18 8 BEFORE BEGINNING ITEM ANALYSIS Almost all item analysis procedures require the exam data to be a specific format Columns = Exam questions Rows = An individual examinee Cells = Whether the examinee got the question right () or wrong () Examinee Question Question 2 Question 3 Question 4 Anne Bob Chelsea Dan Erica Fred

19 9 BEFORE BEGINNING ITEM ANALYSIS Almost all item analysis procedures require the exam data to be a specific format Columns = Exam questions Rows = An individual examinee Cells = Whether the examinee got the question right () or wrong () Examinee Question Question 2 Question 3 Question 4 Anne Bob Chelsea Dan Erica Fred

20 2 BEFORE BEGINNING ITEM ANALYSIS Almost all item analysis procedures require the exam data to be a specific format Columns = Exam questions Rows = An individual examinee Cells = Whether the examinee got the question right () or wrong () Examinee Question Question 2 Question 3 Question 4 Anne Bob Chelsea Dan Erica Fred

21 ACTIVITY: ITEM DIFFICULTY ) Complete the activity sheet passed out by the presenter 2) The activity sheet contains five students responses to five different multiple choice question 3) Create a data matrix that is formatted in a manner for item-analysis a) Each row is a different student b) Each column is a different question c) Each cell indicates whether the student answered the question correctly () or incorrectly() EXERCISE 2

22 ITEM DIFFICULTY 22

23 ITEM DIFFICULTY: WHAT IS IT? What is Item Difficulty? Item difficulty is how easy or hard a question is Examples If no one got the question right, the item is difficult If everyone got the question right, the item is considered easy If half the people got the question right, then the item is somewhere between easy and hard. 23

24 ITEM DIFFICULTY: WHY DOES IT MATTER? 24 Why Does Item Difficulty Matter? You want your test to provide information on the full range of people s ability If you don t pay attention to Item Difficulty, you don t get a precise measure of ability Helps ensure the full spectrum of ability is represented You want scores to be roughly symmetric If true ability has a bell-shaped distribution, then your estimated ability should have a bell-shaped distribution

25 25 ITEM DIFFICULTY: WHY DOES IT MATTER? You want your test to provide information on people s ranges of ability If question too easy -> everyone gets the question right -> you don t know who is on the lower end of ability If question too hard -> everyone gets question wrong -> you don t know who is on the higher end of ability Therefore, it is important to pay attention to item difficulty to have it be just right Student Question Alice % Bob % Cindy % Too Easy - Cannot tell who has more/less ability Student Question Alice % Bob % Cindy % Too hard - Cannot tell who has more/less ability

26 26 ITEM DIFFICULTY: WHY DOES IT MATTER? You want scores to be roughly symmetric Ability tends to follow a roughly normal distribution If item difficulty is too high or low, then scores will be truncated (prevents symmetry) Difficulty is too high Difficulty is just right Difficulty is too easy Therefore, it is important to pay attention to item difficulty to have it be just right

27 27 ITEM DIFFICULTY: EXAMPLE Examinee Q Q2 Q3 Alice Bob Imagine a test with THREE questions and FIVE examinees Everyone got Question correct Cindy Dan Erin Everyone got Question 2 wrong FOUR people got Question correct

28 ITEM DIFFICULTY: EXAMPLE Examinee Q Q2 28 Q3 Alice Bob Cindy Dan Erin Total 5 4 We can total up how many people got each question correct by taking the sum for each question Question = 5 Question 2 = Question 3 = 4

29 29 ITEM DIFFICULTY: EXAMPLE Examinee Q Q2 Q3 Alice Bob Cindy Dan Erin Total 5 We can total up how many people got each question correct by taking the sum for each question Question = 5 Question 2 = Question 3 = 4 Divide the total number of people who got the question correct by the total number of people who took the test, and you have the item s difficulty Question # Total correct/ Total test takers Item Difficulty Question 5 / 5 =. % Question 2 / 5 =. % Question 3 4/ 5 =.8 8% 4

30 ITEM DIFFICULTY: HOW TO CALCULATE IT How to Calculate Item Difficulty Count how many people answered the answer at all = N Count how many people answered the question correctly = P Item Difficulty = P / N 3

31 ITEM DIFFICULTY: HOW TO INTERPRET IT 3 How to Interpret Item Difficulty Think of it as Item Easiness Ranges from % to % Larger values = Easier Smaller values = Harder If value is at an ideal sweet spot, then your test can better separate high ability people from low ability people (discussed in next section) Ideal values depend on how answer choices there are: Two answer choices (true and false) =.75 Three answer choices=.67 Four answer choices=.63 Five answer choices=.6

32 ITEM DIFFICULTY: DIAGNOSTICS 32 Low item-difficulty can be problematic: Indicates that people, regardless of ability, could not answer the question correctly If Item Difficulty is too Low / Item is too Hard (<.25 or.3): The item may have been miskeyed The item may be too challenging relative to the overall level of ability of the class The item may be ambiguous or not written clearly There may be more than one correct answer.

33 ITEM DIFFICULTY: IMPROVEMENTS How to Improve Low Item Difficulty (Item is too hard) Make sure all items are keyed correctly Find a less challenging concept to assess Improve the class ability (typically via re-instruction or practice) Find where people are being confused and clarify in the question Make the question more specific 33

34 ITEM DIFFICULTY: PRECAUTIONS 34 Precautions on using Item Difficulty Not meaningful if Too few respondents (small sample size) Goal is to assessing mastery of specific concepts/problems The recommended values (.6-.75) assume you want to assess people s ability relative to others If you are concerned with content mastery, you want all items answered correctly Test had a short time limit ( speed test ) - later items seem difficult

35 35 ACTIVITY EXERCISE ) Break out into 4 separate groups 2) Each group will be assigned an item-difficulty value 3) Think of a question you can ask your fellow attendees that would probably have a difficulty value close to your group s assigned value Example: If you are assigned a difficulty value of.25, what is a question you could ask that only 25% of the attendees would know 4) Think of 4 multiple choice options to go with your question, one of which is right Remember: Item Difficulty: Percentage of people answering the question correctly 5) When you are ready, have one member of the group go up to the presenter and share the group s question and answers Larger values = Easier Smaller values = Harder 7) Access the spreadsheet link provided, and compute the difficulty for each question 6) WHEN TOLD THE SURVEY IS READY BY THE PRESENTER: Complete the combined survey online (link will be provided) - Skip your question 8) How close was the actual difficulty to the difficulty you were assigned?

36 ITEM DISCRIMINATION 36

37 ITEM DISCRIMINATION: WHAT IS IT? 37 What is Item Discrimination? Item discrimination is how much a question relates to a person s overall knowledge on the exam / ability High item discrimination = You either know it or you don t Examples of Question with Good Discrimination Smart people know the answer to the question, and low ability people don t know People who studied get the question right, people who didn t study get the question wrong

38 ITEM DISCRIMINATION Examinee Q Q2 38 Q3 Alice Bob Cindy Dan Erin Imagine a test with THREE questions and FIVE students

39 39 ITEM DISCRIMINATION Examinee Q Q2 Q3 Total Alice % Bob % Cindy 33% Dan 33% Erin 33% Imagine a test with THREE questions and FIVE students Alice and Bob did the best (perfect scores) Cindy, Erin, and Dan did the worst (33%)

40 4 ITEM DISCRIMINATION Examinee Q Q2 Q3 Total Alice % Bob % Cindy 33% Dan 33% Erin 33% Imagine a test with THREE questions and FIVE students Alice and Bob did the best (perfect scores) Cindy, Erin, and Dan did the worst (33%) Which question s performance best predicts who will score high on the exam (Alice and Bob) and who will score low (Cindy, Dan, and Erin)?

41 4 ITEM DISCRIMINATION Examinee Q Q2 Q3 Total Alice % Bob % Cindy 33% Dan 33% Erin 33% How people did on Question predicts who will score high and who will score low on the exam People who answered question correctly, got a % People who answered question incorrectly, got a % Question has good discrimination

42 42 ITEM DISCRIMINATION Examinee Q Q2 Q3 Total Alice % Bob % Cindy 33% Dan 33% How people did on Question 3 does not predicts who will score high and who will score low on the exam People who answered question 3 correctly, got a % Many who scored 33% also got question 3 correct Erin 33% Question 3 has worse discrimination than Question

43 ITEM DISCRIMINATION: HOW TO CALCULATE IT 43 How to Calculate Item Discrimination Make one column that has whether people got the question right () or wrong () = Question Scores Make another column that has people s total score on the exam ( to %) = Total Scores Item Discrimination = (Pearson s) correlation between Question scores and Total scores Commonly called Point-biserial correlation or Item-total correlation Often corrected by removing the item s score from the total score

44 ITEM DISCRIMINATION: HOW TO CALCULATE IT Examinee Q Q2 Q3 Total Alice 3 Bob 3 Cindy Dan Erin 44 Item-Discrimination = correlation between the item s performance and the total performance Correlation between Q column and Total column Discrimination =. Correlation between Q2 column and Total column Discrimination =.67 Correlation between Q3 column and Total column Discrimination =.4 Larger values indicate the item can better predict overall test performance

45 ITEM DISCRIMINATION: HOW TO INTERPRET IT 45 How to Interpret Item Discrimination Ranges from - to + (Almost always positive) Larger positive values = Question strongly relates to ability Smaller values = Question does not relate ability much Ideal values are positive and high (above +.2) positive = those who correctly answer a particular item also tend to do well on the test overall = no relationship between exam knowledge and getting the question right negative = the more you know, the less likely you are to get the question right If above.2, item is useful for describing people s overall ability

46 ITEM DISCRIMINATION: DIAGNOSTICS 46 Low item discrimination is problematic: Suggests that people who know the concepts really well overall, were not any more likely to understand the specific concept of the question If Item Discrimination is too Low (<.2): The item may be miskeyed Item may not represent domain of interest Item concept may not be taught well The item may be ambiguous The item maybe misleading The item may be too easy or difficult (everyone got question right or wrong)

47 ITEM DISCRIMINATION: IMPROVEMENTS To Improve Low Item Discrimination (<.2): Check to make sure the item is keyed correctly Check to make sure item is conceptually relevant Modify the instruction to explain the concept better Make the question more specific Ask how students interpreted the question Ensure that difficulty is at the ideal level, given the number of response options 47

48 ITEM DISCRIMINATION: PRECAUTIONS 48 Precautions on using Item Discrimination Not meaningful if Too few respondents (small sample size) Too few questions (doesn t capture topic area) Item difficulty too low or high (no variation for correlation) Partial credit for answers (some answers are less wrong than others)

49 ACTIVITY: ITEM DISCRIMINATION EXERCISE 49 ) Think to yourself about a multiple choice exam you might give in your respective field for a specific topic 2) What kinds of questions would you ask? 3) Which of those questions, if answered correctly would indicate that this person understands the topic well as a whole? a) This question has good discrimination b) Can tell you who likely has high knowledge and who has lower knowledge 4) Which of those questions, if answered correctly doesn t necessarily indicate that this person understands the topic well? a) This question has poor discrimination b) Cannot separate the high knowledge from the low knowledge students

50 RELIABILITY 5

51 TEST RELIABILITY: WHAT IS IT What is Test Reliability? Test Reliability = Consistency of Scores People s observed exam score is a mixture of true ability and error True ability = what you actually know for the entire topic Error = Fatigue, Misreading Question, Luck 5

52 TEST RELIABILITY: WHAT IS IT? 52 What Reliability Means? If test % reliable, then the score a person receives is their true score, and they get the same score each time they retake the exam There was no error on the exams Luck had nothing to do with the scores No questions were misread You were not tired The score is based completely on ability If our test is reliable, then a student s ability is reflected in the score received You are capturing pure ability instead of ability + error

53 TEST RELIABILITY: WHAT IS IT 53 What Reliability Means? If test not % reliability, then the score a person receives may be either higher or lower than their actual true score. Next score might be different Error played a big role in the scores You got lucky the first time You misread some questions You were tired Your true ability wasn t reflected in the scores If our test is unreliable, then a student s ability is not reflected in the score received

54 TEST RELIABILITY: WHAT IS IT 54 Ways of Measuring Reliability Test-retest reliability: Consistency from one examination point and another Parallel forms reliability: Consistency from one exam form and another Internal consistency reliability: Consistency of items with other items We re going to focus on internal consistency because it is the easiest to measure, and also provides highly useful information

55 INTERNAL CONSISTENCY: WHAT IS IT 55 What is Internal Consistency? Internal consistency is how consistent the items are with the other items High internal consistency = Questions are highly correlated (address a similar topic) and many questions Low internal consistency = Questions are unrelated and few questions Internal consistency measured with Cronbach s Alpha (also called KR-2 )

56 INTERNAL CONSISTENCY: WHAT DOES IT MATTER 56 Why Does Internal Consistency Matter? Cronbach s Alpha provides a lower-bound on the reliability of an exam If you know an exam internal consistency, then you know the worst-case of its reliability Reliability = correlation of scores on exam with scores on another equivalent exam Good internal consistency makes it more likely the students scores are stable

57 INTERNAL CONSISTENCY: HOW TO INTERPRET IT How to Interpret Internal Consistency Ranges from -Infinity to +Infinity (Almost always positive) Larger positive values = Test is highly reliable Smaller values = Test is not very reliable Ideal values are positive and high (above +.7) Alpha Interpretation >.9 Excellent.9 >.8 Good.8 >.7 Acceptable.7 > 6 Marginal <. 6 Poor 57

58 INTERNAL CONSISTENCY: HOW TO CALCULATE IT 58 How to Calculate Internal Consistency Count the number of questions = K For each question, make a column that has whether people got the question right () or wrong () Calculate the correlation between each right/wrong column and every other right/wrong column If K questions, then (K * (K-) / 2) comparisons If 2 questions, then (2 * (2-) / 2) comparisons If 4 questions, then (4 * (4-) / 2) comparisons Calculate the average inter-item correlation = r Apply the following formula:

59 ACTIVITY: TEST RELIABILITY 59 Internal consistency is affected by the test length and average inter-item correlation Test length: The more questions on the test, the more reliable the test will be. EXERCISE Average inter-item correlation: The more the questions address a single common domain, the more reliable the test will be - All questions pertain to the same topic area = Higher average correlation between question scores - All questions pertain to the disparate topic areas = Lower average correlation between question scores ) Imagine we ask students: a) Calculate the area of a hexagon b) Find the hypotenuse of a right triangle c) Find the missing angle in a triangle d) Find the radius of a circle 2) What additional question could we ask that would probably INCREASE the average inter-item correlation? 3) What additional question could we ask that would probably DECREASE the average inter-item correlation?

60 TEST RELIABILITY: DIAGNOSTICS 6 If Test Reliability is too Low (<.7): The item may be miskeyed Items represent too many distinct dimensions (too many concepts being asked) Too few items Items are not written clearly Items have poor difficulty and discrimination

61 TEST RELIABILITY: IMPROVEMENT To Improve Low Test Reliability (<.7): Check that items are keyed correctly Check that items are assessing common domain Increase the number of items (Increasing the number of items by 5% increases reliability by ~.) Clarify ambiguous items Recheck item difficulty and discrimination 6

62 TEST RELIABILITY: PRECAUTIONS Precautions on using Test Reliability Not meaningful if too few respondents (small sample size) Cronbach s Alpha only provides a lower-bound estimate of reliability - the actual reliability could be much higher 62

63 IMPLEMENTING ITEM ANALYSIS 63

64 IMPLEMENTING ITEM ANALYSIS You have several options for implementing item analysis D2L Excel SPSS 64

65 IMPLEMENTING ITEM ANALYSIS: D2L D2L provides many of the item analysis statistics Make a Quiz on D2L and collect responses Go to Quizzes Click on the dropdown menu next to the quiz name Click Statistics Click on Question Stats Average Grade = Item Difficulty Point-Biserial = Discrimination Discrimination Index = Similar to Point-Biserial, but not recommended 65

66 IMPLEMENTING ITEM ANALYSIS: D2L - STEPS VISUALIZED () Go to D2L Course Page and Click on Quizzes (3) Click on Question Stats to view Item Analysis 66 (2) Click on the menu for a specific quiz and select Statistics (4) Each item has its own analysis statistics

67 IMPLEMENTING ITEM ANALYSIS: D2L - THE OUTPUT Point Biserial = Item Discrimination Average Grade = Item Difficulty 67

68 IMPLEMENTING ITEM ANALYSIS: EXCEL You can calculate item analysis with Excel Need an Excel spreadsheet that has Each row is a different student Each column is a different question Each cell is either or indicating whether the student got the corresponding question right or wrong Example Spreadsheet: 68

69 IMPLEMENTING ITEM ANALYSIS: EXCEL - ENTER IN DATA 69 Enter in the data in the first Sheet, in the same format shown previously in the lesson

70 IMPLEMENTING ITEM ANALYSIS: EXCEL - EXAM ITEM STATISTICS 7 Each Item has its own Difficulty and Discrimination Score

71 IMPLEMENTING ITEM ANALYSIS: EXCEL - EXAM TEST STATISTICS 7 Cronbach s alpha = Internal Consistency

72 IMPLEMENTING ITEM ANALYSIS: SPSS 72 You can calculate item analysis with SPSS (or R, SPSS, Minitab, Stata) Access SPSS for free via DePaul Virtual Labs Enter the data as you would for an Excel spreadsheet Each row is a different student Each column is a different question Each cell is either or indicating whether the student got the corresponding question right or wrong At the top menu, go to Analyze -> Scale -> Reliability Analysis Move the Test Questions to the Items panel Click Statistics and ask for item, scale, and scale if item deleted statistics Click OK

73 IMPLEMENTING ITEM ANALYSIS: SPSS - STEP - CHOOSE ANALYSIS 73

74 IMPLEMENTING ITEM ANALYSIS: SPSS - STEP 2 - SELECT VARIABLES 74

75 IMPLEMENTING ITEM ANALYSIS: SPSS - STEP 3 - INTERPRETATION 75 Cronbach s alpha = Internal consistency Item Mean = Item difficulty Correct Item-Total Correlation = Item discrimination

76 76 SUMMARY

77 SUMMARY 77 Item Analysis can provide useful information when examining multiple choice tests How difficult were the questions? How well does a question contribute to understanding a person s performance? How reliable are the overall test scores? It is important to consider the reasons for why an item is performing poorly Items can perform poorly due to wording ambiguity, lack of ability in that domain, miscoding, lack of conceptual relevance, instructional issues Item Analysis is an iterative process - takes time

78 SUMMARY There are many tools for performing item analysis Ones we discussed: D2L, Excel, SPSS Many others available: Stata, SAS, R, PSPP, jmetrik 78

79 SUMMARY 79 Item Analysis is a collection of tools - there are more out there Alternative analysis - Are the incorrect answers equally likely to be chosen? Differential Item Functioning - Are the items fair between groups of people? Factor analysis - What underlying constructs are the items measuring? Item-Response Theory - What are people s ability, when you take into account the difficulty and discrimination of the item that people s answered correctly/incorrectly

80 Q&A

Reliability and Validity checks S-005

Reliability and Validity checks S-005 Reliability and Validity checks S-005 Checking on reliability of the data we collect Compare over time (test-retest) Item analysis Internal consistency Inter-rater agreement Compare over time Test-Retest

More information

Introduction to Reliability

Introduction to Reliability Reliability Thought Questions: How does/will reliability affect what you do/will do in your future job? Which method of reliability analysis do you find most confusing? Introduction to Reliability What

More information

Psychology Research Methods Lab Session Week 10. Survey Design. Due at the Start of Lab: Lab Assignment 3. Rationale for Today s Lab Session

Psychology Research Methods Lab Session Week 10. Survey Design. Due at the Start of Lab: Lab Assignment 3. Rationale for Today s Lab Session Psychology Research Methods Lab Session Week 10 Due at the Start of Lab: Lab Assignment 3 Rationale for Today s Lab Session Survey Design This tutorial supplements your lecture notes on Measurement by

More information

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison Empowered by Psychometrics The Fundamentals of Psychometrics Jim Wollack University of Wisconsin Madison Psycho-what? Psychometrics is the field of study concerned with the measurement of mental and psychological

More information

Chapter 7: Descriptive Statistics

Chapter 7: Descriptive Statistics Chapter Overview Chapter 7 provides an introduction to basic strategies for describing groups statistically. Statistical concepts around normal distributions are discussed. The statistical procedures of

More information

Lab 4: Alpha and Kappa. Today s Activities. Reliability. Consider Alpha Consider Kappa Homework and Media Write-Up

Lab 4: Alpha and Kappa. Today s Activities. Reliability. Consider Alpha Consider Kappa Homework and Media Write-Up Lab 4: Alpha and Kappa Today s Activities Consider Alpha Consider Kappa Homework and Media Write-Up Reliability Reliability refers to consistency Types of reliability estimates Test-retest reliability

More information

Intro to SPSS. Using SPSS through WebFAS

Intro to SPSS. Using SPSS through WebFAS Intro to SPSS Using SPSS through WebFAS http://www.yorku.ca/computing/students/labs/webfas/ Try it early (make sure it works from your computer) If you need help contact UIT Client Services Voice: 416-736-5800

More information

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc. Chapter 23 Inference About Means Copyright 2010 Pearson Education, Inc. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it d be nice to be able

More information

Using Lertap 5 in a Parallel-Forms Reliability Study

Using Lertap 5 in a Parallel-Forms Reliability Study Lertap 5 documents series. Using Lertap 5 in a Parallel-Forms Reliability Study Larry R Nelson Last updated: 16 July 2003. (Click here to branch to www.lertap.curtin.edu.au.) This page has been published

More information

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971)

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971) Ch. 5: Validity Validity History Griggs v. Duke Power Ricci vs. DeStefano Defining Validity Aspects of Validity Face Validity Content Validity Criterion Validity Construct Validity Reliability vs. Validity

More information

Progress Monitoring Handouts 1

Progress Monitoring Handouts 1 Progress Monitoring Handouts Teacher Administration Scripts, Teacher Sheets, and Student Sheets Reading Letter Sound Fluency (LSF)..2 Word Identification Fluency (WIF)...5 Passage Reading Fluency (PRF)

More information

Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys

Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys Using the Rasch Modeling for psychometrics examination of food security and acculturation surveys Jill F. Kilanowski, PhD, APRN,CPNP Associate Professor Alpha Zeta & Mu Chi Acknowledgements Dr. Li Lin,

More information

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA 15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA Statistics does all kinds of stuff to describe data Talk about baseball, other useful stuff We can calculate the probability.

More information

Examining the Psychometric Properties of The McQuaig Occupational Test

Examining the Psychometric Properties of The McQuaig Occupational Test Examining the Psychometric Properties of The McQuaig Occupational Test Prepared for: The McQuaig Institute of Executive Development Ltd., Toronto, Canada Prepared by: Henryk Krajewski, Ph.D., Senior Consultant,

More information

Psychological testing

Psychological testing Psychological testing Lecture 12 Mikołaj Winiewski, PhD Test Construction Strategies Content validation Empirical Criterion Factor Analysis Mixed approach (all of the above) Content Validation Defining

More information

Choosing Life: empowerment, Action, Results! CLEAR Menu Sessions. Adherence 1: Understanding My Medications and Adherence

Choosing Life: empowerment, Action, Results! CLEAR Menu Sessions. Adherence 1: Understanding My Medications and Adherence Choosing Life: empowerment, Action, Results! CLEAR Menu Sessions Adherence 1: Understanding My Medications and Adherence This page intentionally left blank. Understanding My Medications and Adherence Session

More information

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971)

Validity. Ch. 5: Validity. Griggs v. Duke Power - 2. Griggs v. Duke Power (1971) Ch. 5: Validity Validity History Griggs v. Duke Power Ricci vs. DeStefano Defining Validity Aspects of Validity Face Validity Content Validity Criterion Validity Construct Validity Reliability vs. Validity

More information

STAT 200. Guided Exercise 4

STAT 200. Guided Exercise 4 STAT 200 Guided Exercise 4 1. Let s Revisit this Problem. Fill in the table again. Diagnostic tests are not infallible. We often express a fale positive and a false negative with any test. There are further

More information

From the scenario below please identify the situation, thoughts, and emotions/feelings.

From the scenario below please identify the situation, thoughts, and emotions/feelings. Introduction to Mental Gremlins: Example From the scenario below please identify the situation, thoughts, and emotions/feelings. Bob has been working for Big Corporation for 12 years and has his annual

More information

Reliability & Validity Dr. Sudip Chaudhuri

Reliability & Validity Dr. Sudip Chaudhuri Reliability & Validity Dr. Sudip Chaudhuri M. Sc., M. Tech., Ph.D., M. Ed. Assistant Professor, G.C.B.T. College, Habra, India, Honorary Researcher, Saha Institute of Nuclear Physics, Life Member, Indian

More information

Designing a Questionnaire

Designing a Questionnaire Designing a Questionnaire What Makes a Good Questionnaire? As a rule of thumb, never to attempt to design a questionnaire! A questionnaire is very easy to design, but a good questionnaire is virtually

More information

Interpreting the Item Analysis Score Report Statistical Information

Interpreting the Item Analysis Score Report Statistical Information Interpreting the Item Analysis Score Report Statistical Information This guide will provide information that will help you interpret the statistical information relating to the Item Analysis Report generated

More information

Appendix: Instructions for Treatment Index B (Human Opponents, With Recommendations)

Appendix: Instructions for Treatment Index B (Human Opponents, With Recommendations) Appendix: Instructions for Treatment Index B (Human Opponents, With Recommendations) This is an experiment in the economics of strategic decision making. Various agencies have provided funds for this research.

More information

Two-Way Independent ANOVA

Two-Way Independent ANOVA Two-Way Independent ANOVA Analysis of Variance (ANOVA) a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment. There

More information

Using SPSS for Correlation

Using SPSS for Correlation Using SPSS for Correlation This tutorial will show you how to use SPSS version 12.0 to perform bivariate correlations. You will use SPSS to calculate Pearson's r. This tutorial assumes that you have: Downloaded

More information

Types of Tests. Measurement Reliability. Most self-report tests used in Psychology and Education are objective tests :

Types of Tests. Measurement Reliability. Most self-report tests used in Psychology and Education are objective tests : Measurement Reliability Objective & Subjective tests Standardization & Inter-rater reliability Properties of a good item Item Analysis Internal Reliability Spearman-Brown Prophesy Formla -- α & # items

More information

Item Analysis Explanation

Item Analysis Explanation Item Analysis Explanation The item difficulty is the percentage of candidates who answered the question correctly. The recommended range for item difficulty set forth by CASTLE Worldwide, Inc., is between

More information

Impact of Group Rejections on Self Esteem

Impact of Group Rejections on Self Esteem Impact of Group Rejections on Self Esteem In 2005, researchers from the School of Physical Education at the University of Otago carried out a study to assess the impact of rejection on physical self esteem.

More information

Getting a DIF Breakdown with Lertap

Getting a DIF Breakdown with Lertap Getting a DIF Breakdown with Lertap Larry Nelson Curtin University of Technology Document date: 8 October 2009 website: www.lertap.curtin.edu.au This document shows how Lertap 5 may be used to look for

More information

HARRISON ASSESSMENTS DEBRIEF GUIDE 1. OVERVIEW OF HARRISON ASSESSMENT

HARRISON ASSESSMENTS DEBRIEF GUIDE 1. OVERVIEW OF HARRISON ASSESSMENT HARRISON ASSESSMENTS HARRISON ASSESSMENTS DEBRIEF GUIDE 1. OVERVIEW OF HARRISON ASSESSMENT Have you put aside an hour and do you have a hard copy of your report? Get a quick take on their initial reactions

More information

Reliability, validity, and all that jazz

Reliability, validity, and all that jazz Reliability, validity, and all that jazz Dylan Wiliam King s College London Published in Education 3-13, 29 (3) pp. 17-21 (2001) Introduction No measuring instrument is perfect. If we use a thermometer

More information

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Greg Pope, Analytics and Psychometrics Manager 2008 Users Conference San Antonio Introduction and purpose of this session

More information

Test Reliability Basic Concepts

Test Reliability Basic Concepts Research Memorandum ETS RM 18-01 Test Reliability Basic Concepts Samuel A. Livingston January 2018 ETS Research Memorandum Series EIGNOR EXECUTIVE EDITOR James Carlson Principal Psychometrician ASSOCIATE

More information

Related KidsHealth Links

Related KidsHealth Links Grades 9 to 12 Health Problems Series KidsHealth.org/classroom Teacher s Guide This guide includes: Standards Related Links Discussion Questions Activities for Students Reproducible Materials is so common

More information

Chapter 12. The One- Sample

Chapter 12. The One- Sample Chapter 12 The One- Sample z-test Objective We are going to learn to make decisions about a population parameter based on sample information. Lesson 12.1. Testing a Two- Tailed Hypothesis Example 1: Let's

More information

Bayes Theorem Application: Estimating Outcomes in Terms of Probability

Bayes Theorem Application: Estimating Outcomes in Terms of Probability Bayes Theorem Application: Estimating Outcomes in Terms of Probability The better the estimates, the better the outcomes. It s true in engineering and in just about everything else. Decisions and judgments

More information

ORIENTATION SAN FRANCISCO STOP SMOKING PROGRAM

ORIENTATION SAN FRANCISCO STOP SMOKING PROGRAM ORIENTATION SAN FRANCISCO STOP SMOKING PROGRAM PURPOSE To introduce the program, tell the participants what to expect, and set an overall positive tone for the series. AGENDA Item Time 0.1 Acknowledgement

More information

Choosing Life: Empowerment, Action, Results! CLEAR Menu Sessions. Substance Use Risk 2: What Are My External Drug and Alcohol Triggers?

Choosing Life: Empowerment, Action, Results! CLEAR Menu Sessions. Substance Use Risk 2: What Are My External Drug and Alcohol Triggers? Choosing Life: Empowerment, Action, Results! CLEAR Menu Sessions Substance Use Risk 2: What Are My External Drug and Alcohol Triggers? This page intentionally left blank. What Are My External Drug and

More information

for DUMMIES How to Use this Guide Why Conduct Research? Research Design - Now What?

for DUMMIES How to Use this Guide Why Conduct Research? Research Design - Now What? EAGALA Research for DUMMIES How to Use this Guide This guide was created for the people who really want to do research, but don t know where to start. It is a general outline created so that those who

More information

Meeting-5 MEASUREMENT 8-1

Meeting-5 MEASUREMENT 8-1 Meeting-5 MEASUREMENT 8-1 Measurement Measurement Process: 1. Selecting observable empirical events 2. Using numbers or symbols to represent aspects of the events being measured 3. Applying a mapping rule

More information

Statisticians deal with groups of numbers. They often find it helpful to use

Statisticians deal with groups of numbers. They often find it helpful to use Chapter 4 Finding Your Center In This Chapter Working within your means Meeting conditions The median is the message Getting into the mode Statisticians deal with groups of numbers. They often find it

More information

Reliability AND Validity. Fact checking your instrument

Reliability AND Validity. Fact checking your instrument Reliability AND Validity Fact checking your instrument General Principles Clearly Identify the Construct of Interest Use Multiple Items Use One or More Reverse Scored Items Use a Consistent Response Format

More information

Overview of Experimentation

Overview of Experimentation The Basics of Experimentation Overview of Experiments. IVs & DVs. Operational Definitions. Reliability. Validity. Internal vs. External Validity. Classic Threats to Internal Validity. Lab: FP Overview;

More information

11-3. Learning Objectives

11-3. Learning Objectives 11-1 Measurement Learning Objectives 11-3 Understand... The distinction between measuring objects, properties, and indicants of properties. The similarities and differences between the four scale types

More information

Beattie Learning Disabilities Continued Part 2 - Transcript

Beattie Learning Disabilities Continued Part 2 - Transcript Beattie Learning Disabilities Continued Part 2 - Transcript In class Tuesday we introduced learning disabilities and looked at a couple of different activities that are consistent and representative of

More information

Quizzes (and relevant lab exercises): 20% Midterm exams (2): 25% each Final exam: 30%

Quizzes (and relevant lab exercises): 20% Midterm exams (2): 25% each Final exam: 30% 1 Intro to statistics Continued 2 Grading policy Quizzes (and relevant lab exercises): 20% Midterm exams (2): 25% each Final exam: 30% Cutoffs based on final avgs (A, B, C): 91-100, 82-90, 73-81 3 Numerical

More information

Lecture Week 3 Quality of Measurement Instruments; Introduction SPSS

Lecture Week 3 Quality of Measurement Instruments; Introduction SPSS Lecture Week 3 Quality of Measurement Instruments; Introduction SPSS Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit Overview Quality of Measurement Instruments Introduction SPSS Read:

More information

How is ethics like logistic regression? Ethics decisions, like statistical inferences, are informative only if they re not too easy or too hard 1

How is ethics like logistic regression? Ethics decisions, like statistical inferences, are informative only if they re not too easy or too hard 1 How is ethics like logistic regression? Ethics decisions, like statistical inferences, are informative only if they re not too easy or too hard 1 Andrew Gelman and David Madigan 2 16 Jan 2015 Consider

More information

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests Objectives Quantifying the quality of hypothesis tests Type I and II errors Power of a test Cautions about significance tests Designing Experiments based on power Evaluating a testing procedure The testing

More information

Choosing Life: Empowerment, Action, Results! CLEAR Menu Sessions. Substance Use Risk 5: Drugs, Alcohol, and HIV

Choosing Life: Empowerment, Action, Results! CLEAR Menu Sessions. Substance Use Risk 5: Drugs, Alcohol, and HIV Choosing Life: Empowerment, Action, Results! CLEAR Menu Sessions Substance Use Risk 5: This page intentionally left blank. Session Aims: (70 Minutes) To understand the health consequences of drugs and

More information

How to Work with the Patterns That Sustain Depression

How to Work with the Patterns That Sustain Depression How to Work with the Patterns That Sustain Depression Module 5.2 - Transcript - pg. 1 How to Work with the Patterns That Sustain Depression How the Grieving Mind Fights Depression with Marsha Linehan,

More information

Reliability, validity, and all that jazz

Reliability, validity, and all that jazz Reliability, validity, and all that jazz Dylan Wiliam King s College London Introduction No measuring instrument is perfect. The most obvious problems relate to reliability. If we use a thermometer to

More information

So far. INFOWO Lecture M5 Homogeneity and Reliability. Homogeneity. Homogeneity

So far. INFOWO Lecture M5 Homogeneity and Reliability. Homogeneity. Homogeneity So far INFOWO Lecture M5 Homogeneity and Reliability Peter de Waal Department of Information and Computing Sciences Faculty of Science, Universiteit Utrecht Descriptive statistics Scores and probability

More information

LANGUAGE TEST RELIABILITY On defining reliability Sources of unreliability Methods of estimating reliability Standard error of measurement Factors

LANGUAGE TEST RELIABILITY On defining reliability Sources of unreliability Methods of estimating reliability Standard error of measurement Factors LANGUAGE TEST RELIABILITY On defining reliability Sources of unreliability Methods of estimating reliability Standard error of measurement Factors affecting reliability ON DEFINING RELIABILITY Non-technical

More information

Chapter 7: Correlation

Chapter 7: Correlation Chapter 7: Correlation Smart Alex s Solutions Task 1 A student was interested in whether there was a positive relationship between the time spent doing an essay and the mark received. He got 45 of his

More information

Fundamental Concepts for Using Diagnostic Classification Models. Section #2 NCME 2016 Training Session. NCME 2016 Training Session: Section 2

Fundamental Concepts for Using Diagnostic Classification Models. Section #2 NCME 2016 Training Session. NCME 2016 Training Session: Section 2 Fundamental Concepts for Using Diagnostic Classification Models Section #2 NCME 2016 Training Session NCME 2016 Training Session: Section 2 Lecture Overview Nature of attributes What s in a name? Grain

More information

BACKGROUND CHARACTERISTICS OF EXAMINEES SHOWING UNUSUAL TEST BEHAVIOR ON THE GRADUATE RECORD EXAMINATIONS

BACKGROUND CHARACTERISTICS OF EXAMINEES SHOWING UNUSUAL TEST BEHAVIOR ON THE GRADUATE RECORD EXAMINATIONS ---5 BACKGROUND CHARACTERISTICS OF EXAMINEES SHOWING UNUSUAL TEST BEHAVIOR ON THE GRADUATE RECORD EXAMINATIONS Philip K. Oltman GRE Board Professional Report GREB No. 82-8P ETS Research Report 85-39 December

More information

Intelligence. Exam 3. iclicker. My Brilliant Brain. What is Intelligence? Conceptual Difficulties. Chapter 10

Intelligence. Exam 3. iclicker. My Brilliant Brain. What is Intelligence? Conceptual Difficulties. Chapter 10 Exam 3 iclicker Mean: 32.8 Median: 33 Mode: 33 SD = 6.4 How many of you have one? Do you think it would be a good addition for this course in the future? Top Score: 49 Top Cumulative Score to date: 144

More information

A Broad-Range Tailored Test of Verbal Ability

A Broad-Range Tailored Test of Verbal Ability A Broad-Range Tailored Test of Verbal Ability Frederic M. Lord Educational Testing Service Two parallel forms of a broad-range tailored test of verbal ability have been built. The test is appropriate from

More information

Chapter 14 My Recovery Plan for My Life

Chapter 14 My Recovery Plan for My Life 281 1 Decision 4 Activities 5 People 6 Feelings 7 Life Style 8 History 3 Exposure 2 Body Chapter 14 My Recovery Plan for My Life In this chapter you can begin to map out a longer view of your recovery

More information

Problem Situation Form for Parents

Problem Situation Form for Parents Problem Situation Form for Parents Please complete a form for each situation you notice causes your child social anxiety. 1. WHAT WAS THE SITUATION? Please describe what happened. Provide enough information

More information

The 3 Things NOT To Do When You Quit Smoking

The 3 Things NOT To Do When You Quit Smoking The 3 Things NOT To Do When You Quit Smoking Here are the 3 common mistakes people make when they try to quit smoking: 1. They think quitting will be hard not true 2. They think they have no willpower

More information

ITEM ANALYSIS OF MID-TRIMESTER TEST PAPER AND ITS IMPLICATIONS

ITEM ANALYSIS OF MID-TRIMESTER TEST PAPER AND ITS IMPLICATIONS ITEM ANALYSIS OF MID-TRIMESTER TEST PAPER AND ITS IMPLICATIONS 1 SARITA DESHPANDE, 2 RAVINDRA KUMAR PRAJAPATI 1 Professor of Education, College of Humanities and Education, Fiji National University, Natabua,

More information

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics Psy201 Module 3 Study and Assignment Guide Using Excel to Calculate Descriptive and Inferential Statistics What is Excel? Excel is a spreadsheet program that allows one to enter numerical values or data

More information

Item Analysis for Beginners

Item Analysis for Beginners Questionmark Conference 2018 Savannah A s s e s s f o r S u c c e s s Item Analysis for Beginners John Kleeman Questionmark Executive Director and Founder All rights reserved. Questionmark is a registered

More information

Goldsmith. Marshall. FEEDFORWARD Coaching For Your Future. Coaching For Your Future. MMIX ChartHouse Learning. All Rights Reserved. What Is It?

Goldsmith. Marshall. FEEDFORWARD Coaching For Your Future. Coaching For Your Future. MMIX ChartHouse Learning. All Rights Reserved. What Is It? FOR PREVIEW ONLY Marshall Goldsmith Table of Contents.............................................. 2 Get 360 Feedback...3 Pick a Behavior...4 Respond.......................................... 4 Involve...6

More information

15.301/310, Managerial Psychology Prof. Dan Ariely Lecture 2: The Validity of Intuitions

15.301/310, Managerial Psychology Prof. Dan Ariely Lecture 2: The Validity of Intuitions Introduction: 15.301/310, Managerial Psychology Prof. Dan Ariely Lecture 2: The Validity of Intuitions We are beginning recitation sections this week. Topic: generating ideas for your project. We have

More information

Measuring impact. William Parienté UC Louvain J PAL Europe. povertyactionlab.org

Measuring impact. William Parienté UC Louvain J PAL Europe. povertyactionlab.org Measuring impact William Parienté UC Louvain J PAL Europe povertyactionlab.org Course overview 1. What is evaluation? 2. Measuring impact 3. Why randomize? 4. How to randomize 5. Sampling and Sample Size

More information

Saville Consulting Wave Professional Styles Handbook

Saville Consulting Wave Professional Styles Handbook Saville Consulting Wave Professional Styles Handbook PART 4: TECHNICAL Chapter 19: Reliability This manual has been generated electronically. Saville Consulting do not guarantee that it has not been changed

More information

By Hui Bian Office for Faculty Excellence

By Hui Bian Office for Faculty Excellence By Hui Bian Office for Faculty Excellence 1 Email: bianh@ecu.edu Phone: 328-5428 Location: 1001 Joyner Library, room 1006 Office hours: 8:00am-5:00pm, Monday-Friday 2 Educational tests and regular surveys

More information

Making Your Treatment Work Long-Term

Making Your Treatment Work Long-Term Making Your Treatment Work Long-Term How to keep your treatment working... and why you don t want it to fail Regardless of the particular drugs you re taking, your drugs will only work when you take them.

More information

Motivation CURRENT MOTIVATION CONSTRUCTS

Motivation CURRENT MOTIVATION CONSTRUCTS Motivation CURRENT MOTIVATION CONSTRUCTS INTEREST and ENJOYMENT TASK VALUE GOALS (Purposes for doing) INTRINSIC vs EXTRINSIC MOTIVATION EXPECTANCY BELIEFS SELF-EFFICACY SELF-CONCEPT (Self-Esteem) OUTCOME

More information

Lesson 8 STD & Responsible Actions

Lesson 8 STD & Responsible Actions Lesson 8 STD & Responsible Actions Overview This lesson reinforces taking care of sexual health by teaching responsible actions around STD prevention. After discussing the responsibilities of a person

More information

Assignment 4: True or Quasi-Experiment

Assignment 4: True or Quasi-Experiment Assignment 4: True or Quasi-Experiment Objectives: After completing this assignment, you will be able to Evaluate when you must use an experiment to answer a research question Develop statistical hypotheses

More information

Healthy Self. Lesson 1 Self Esteem

Healthy Self. Lesson 1 Self Esteem Healthy Self Lesson 1 Self Esteem Starter Self Esteem Quiz FALSE 1. A person has high self-esteem if they are always concerned about what other people think of them. TRUE 2. Self-esteem means being proud

More information

My Review of John Barban s Venus Factor (2015 Update and Bonus)

My Review of John Barban s Venus Factor (2015 Update and Bonus) My Review of John Barban s Venus Factor (2015 Update and Bonus) December 26, 2013 by Erin B. White 202 Comments (Edit) This article was originally posted at EBWEIGHTLOSS.com Venus Factor is a diet program

More information

John Smith 20 October 2009

John Smith 20 October 2009 John Smith 20 October 2009 2009 MySkillsProfile.com. All rights reserved. Introduction The Emotional Competencies Questionnaire (ECQ) assesses your current emotional competencies and style by asking you

More information

Tech Talk: Using the Lafayette ESS Report Generator

Tech Talk: Using the Lafayette ESS Report Generator Raymond Nelson Included in LXSoftware is a fully featured manual score sheet that can be used with any validated comparison question test format. Included in the manual score sheet utility of LXSoftware

More information

Introduction to statistics Dr Alvin Vista, ACER Bangkok, 14-18, Sept. 2015

Introduction to statistics Dr Alvin Vista, ACER Bangkok, 14-18, Sept. 2015 Analysing and Understanding Learning Assessment for Evidence-based Policy Making Introduction to statistics Dr Alvin Vista, ACER Bangkok, 14-18, Sept. 2015 Australian Council for Educational Research Structure

More information

Cognitive Restructuring

Cognitive Restructuring Cognitive Restructuring Cognitive Restructuring Cognitive Restructuring is an evidence based intervention for the treatment of low mood or anxiety, recommended by the National Institute for Health and

More information

Demonstrating Client Improvement to Yourself and Others

Demonstrating Client Improvement to Yourself and Others Demonstrating Client Improvement to Yourself and Others Making Sure Client Numbers Reflect Client Reality (Part 3 of 3) Greg Vinson, Ph.D. Senior Researcher and Evaluation Manager Center for Victims of

More information

Goal for Today. Making a Better Test: Using New Multiple Choice Question Varieties with Confidence

Goal for Today. Making a Better Test: Using New Multiple Choice Question Varieties with Confidence Making a Better Test: Using New Multiple Choice Question Varieties with Confidence Catherine Davis, CGFNS Int l David Foster, Kryterion, Inc. Goal for Today Create an appreciation for, and an optimism

More information

What is Data? Part 2: Patterns & Associations. INFO-1301, Quantitative Reasoning 1 University of Colorado Boulder

What is Data? Part 2: Patterns & Associations. INFO-1301, Quantitative Reasoning 1 University of Colorado Boulder What is Data? Part 2: Patterns & Associations INFO-1301, Quantitative Reasoning 1 University of Colorado Boulder August 29, 2016 Prof. Michael Paul Prof. William Aspray Overview This lecture will look

More information

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity Measurement & Variables - Initial step is to conceptualize and clarify the concepts embedded in a hypothesis or research question with

More information

Understanding Pain. Teaching Plan: Guidelines for Teaching this Lesson

Understanding Pain. Teaching Plan: Guidelines for Teaching this Lesson Understanding Pain Teaching Plan: Guidelines for Teaching this Lesson Lesson Overview This one-hour lesson plan is about pain and how your workers should respond to and care for residents with pain. You

More information

CP316 Microprocessor Sysytems and Interfacing Evaluation Results Wilfrid Laurier University

CP316 Microprocessor Sysytems and Interfacing Evaluation Results Wilfrid Laurier University CP316 Microprocessor Sysytems and Interfacing Terry Sturtevant This evaluation for the purpose of evaluating my teaching methods and your impressions of the course. It is anonymous and you can omit any

More information

New Food Label Pages Diabetes Self-Management Program Leader s Manual

New Food Label Pages Diabetes Self-Management Program Leader s Manual New Food Label Pages The FDA has released a new food label, so we have adjusted Session 4 and provided a handout of the new label. Participants use the handout instead of looking at the label in the book

More information

AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK

AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK Hanny Pradana, Gatot Sutapa, Luwandi Suhartono Sarjana Degree of English Language Education, Teacher

More information

One-Way Independent ANOVA

One-Way Independent ANOVA One-Way Independent ANOVA Analysis of Variance (ANOVA) is a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment.

More information

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Visit us on the World Wide Web at: www.pearsoned.co.uk Pearson Education Limited 2014

More information

FIGURE 1-The Cognitive Model. Core belief. I m incompetent. Intermediate belief. If I don t understand something perfectly, then I m dumb

FIGURE 1-The Cognitive Model. Core belief. I m incompetent. Intermediate belief. If I don t understand something perfectly, then I m dumb FIGURE 1-The Cognitive Model Core belief I m incompetent Intermediate belief If I don t understand something perfectly, then I m dumb Situation Automatic thoughts Reactions Reading this book This is too

More information

Section 4 - Dealing with Anxious Thinking

Section 4 - Dealing with Anxious Thinking Section 4 - Dealing with Anxious Thinking How do we challenge our unhelpful thoughts? Anxiety may decrease if we closely examine how realistic and true our unhelpful/negative thoughts are. We may find

More information

To open a CMA file > Download and Save file Start CMA Open file from within CMA

To open a CMA file > Download and Save file Start CMA Open file from within CMA Example name Effect size Analysis type Level Tamiflu Symptom relief Mean difference (Hours to relief) Basic Basic Reference Cochrane Figure 4 Synopsis We have a series of studies that evaluated the effect

More information

Variability. After reading this chapter, you should be able to do the following:

Variability. After reading this chapter, you should be able to do the following: LEARIG OBJECTIVES C H A P T E R 3 Variability After reading this chapter, you should be able to do the following: Explain what the standard deviation measures Compute the variance and the standard deviation

More information

Measuring the User Experience

Measuring the User Experience Measuring the User Experience Collecting, Analyzing, and Presenting Usability Metrics Chapter 2 Background Tom Tullis and Bill Albert Morgan Kaufmann, 2008 ISBN 978-0123735584 Introduction Purpose Provide

More information

Boise State University Foundational Studies Program Course Application Form

Boise State University Foundational Studies Program Course Application Form Boise State University Foundational Studies Program Course Application Form Due to the Foundational Studies Program by August 19, 2011 After the Foundational Studies Program has approved a course, departments

More information

V. Measuring, Diagnosing, and Perhaps Understanding Objects

V. Measuring, Diagnosing, and Perhaps Understanding Objects V. Measuring, Diagnosing, and Perhaps Understanding Objects Our purpose when undertaking this venture was not to explain data or even to build better instruments. It may not seem like it based on the discussion

More information

An Introduction to Research Statistics

An Introduction to Research Statistics An Introduction to Research Statistics An Introduction to Research Statistics Cris Burgess Statistics are like a lamppost to a drunken man - more for leaning on than illumination David Brent (alias Ricky

More information

USING STATCRUNCH TO CONSTRUCT CONFIDENCE INTERVALS and CALCULATE SAMPLE SIZE

USING STATCRUNCH TO CONSTRUCT CONFIDENCE INTERVALS and CALCULATE SAMPLE SIZE USING STATCRUNCH TO CONSTRUCT CONFIDENCE INTERVALS and CALCULATE SAMPLE SIZE Using StatCrunch for confidence intervals (CI s) is super easy. As you can see in the assignments, I cover 9.2 before 9.1 because

More information

How attract your dream clients

How attract your dream clients How attract your dream clients Would you like to bring healing where it's truly needed to people who appreciate what you do? Or would prefer to stagger back home after a treatment feeling tired and drained?

More information