Identifying Extraneous Threats to Test Validity for Improving Tests

Size: px
Start display at page:

Download "Identifying Extraneous Threats to Test Validity for Improving Tests"

Transcription

1 Identifying Extraneous Threats to Test Validity for Improving Tests Yi Du, Ph.D. Data Recognition Corporation Presentation at the 40th National Conference on Student Assessment, June 22,

2 If only it were so easy, we don t need testing professionals in this industry 2

3 State assessment designs may leave room for extraneous variable effects It may not be realistic that using a strictly experimental design which has randomized samples, pre- and post-tests, and comparison groups for a statewide assessment. Statewide assessments, in general, are based on an one sample post-test only design. This pre-experimental design provides the least amount of control and allows much room for extraneous variables possibly affecting the assessment outcomes. 3

4 An Example of Variable Map of An Assessment Age of student Gender of student Students SES X A Curriculum and textbook X B Instruction Approaches School and Classroom Environment O Classroom learning Y Student Math Achievement Students Ethnicity X C Students prior knowledge and skills 4

5 More Efforts Have Been Made in Providing Evidence on Instrument Validity Construct - consistency across concept, construct and instrument. Criterion-predictive - how well performance on one instrument predicts score on a second but different one at a future date. Criterion- concurrent - consistency between scores on a measure and a second independent instrument or classification. Content - adequacy of instrument in representing subject matter and skills. 5

6 Less Efforts Were Made in the Entire Test Process for Test Score Validity External Validity ensuring that both the samples and the test conditions under which the test is carried out are representative of the population and the situations to which the results are to apply; Statistical validity the appropriate choice of statistical methods, which is dependent on decisions made about measuring instruments and influenced by data collection process. Internal validity ensuring the design allows for the control of all other possible contributing of extraneous variables. 6

7 Test Score Validity May Be Beyond the Instrument Validity Planning Stage External Validity Statistical Validity Execution Stage State questions & hypotheses, identify variables Determine design structure Identify population & sample Design instruments & classify: operational definitions Select statistical test for resolving hypotheses Carry out plan, collect data Analyse data, draw conclusions & evaluate process Internal Validity Construct Validity 7

8 Possible extraneous variables in test process may affect test score validity Extraneous variables - those that could be competing independent variables, influencing the dependent variable, but are not of interest to the assessment. Item drifting Rater drifting Test/item overexposure Different choice of statistical/measurement tools Different administration approaches Test design effect 8

9 Possible Extraneous Variables Confounding in the Testing Confounding - occurs when the cause of the effect seen in the dependent variable is not unequivocally attributable to the independent variable alone. School program effectiveness may be confounded with the composition of students and faculty Students ethnicity background may be confounded with students social economic status Students scores on CR questions in social studies may be confounded with students writing levels. 9

10 Possible Identified Extraneous Variables Identified variables- those we know about that will affect the dependent variable. Students background: gender, age, education, social class. School characteristics Curriculum and instruction approaches 10

11 Possible Extraneous Variables Introduced by the Testing Itself Sampling: Sample representativeness Sample stability loss over time Time delay or ahead reduce sample quality Poor sampling procedures or non-random sampling Underestimate the sampling error variance or sample size with application of incorrect formulas, i.e., stratified sampling by using simple random sampling formulas Measuring growth with students loss over time Choices of analysis methods and procedures The impact of design effects The impact of equating error variance (Gary Phillips, NCME 2010) 11

12 Evidence of Extraneous Variable Threats (1) Rater Drifting 12

13 Rater Drifting from A Statewide Grade 8 Mathematics Assessment CR Anchor Item N 2008 Score 2009 Score Correl. Mean Dif SD Dif t-test Prob. Item Item These scores were from the same students responding to the same CR items. The only difference is the rater groups. It seems that raters in 2008 were more harsh than those in

14 Student Scores with Different Years of Raters 2009 Score Total 2008 Score Total

15 Comparison of the Mean Scale Score With and Without Rater Effect Adjustment in the Linking Linking with Rater Effect Adjustment Linking without Rater Effect Adjustment Mean SD Mean SD The mean scale scores were higher with rater effect adjustment than non-rater effect adjustment. 15

16 Proportion Difference (%) in Performance Level Classification N = (1.7%) students were affected Linking with Rater Effect Adjustment (Spring Operation) BB B P A Liking without Rater Effect Adjustment BB 12.8 B P 26.6 A

17 Rater Drifting from a Statewide Writing Assessment 2009 Operational Score Adjustment None Difficulty Rater Both 2008 Statistics N 50,000 50,000 50,000 50,000 50,000 % in Performa nce Level Equating Constants Mean SD Level Level Level Level Mult Add

18 Rater Drifting Rater Stringencies Different raters over years Possible different rater training over years Same raters change due to environmental factors Same raters change due to psychological factors 18

19 When Rater Drifting Occurs? Use CR items as anchors for over-year equating Use intact form which includes CR or essay items Use pre-equated forms which include CR or essay items 19

20 How to effectively identify and monitor rater drifting Use MC items only as anchors in equating Use trend score method (Tate, 1999) Rescore CR items or essays and add rater shift onto the equating constant Use auto-scoring? 20

21 Evidence of Extraneous Variable Threats (2) Item Drifting 21

22 Item Drifting Occurs Environmental factors Different curriculum and instructions approaches Different learning practices School or home environmental changes Societal environment Testing factors Item over-exposure Anchor items or items in intact forms were edited 22

23 Edits to Intact Forms or Anchor Items Item Used in a Spring Operational Test The first time that Linda saved someone s life, she knew they needed her help because A. a storm was coming. B. she heard yelling in the street. C. she saw signal lights from home. D. she was patrolling the harbor in a boat. Revised Item Used in the Later Year In 1990, when Linda saved someone s life, she knew they needed her help because A. a storm was coming. B. she heard yelling in the street. C. she saw signal lights from home. D. she was patrolling the harbor in a boat. 23

24 The Edits Reflected on the Raw-to-Scaled Score Table Raw Score Original RS-SS Table Scale Score SS SEM Updated RS-SS Table Scale Score Standard Error Difference

25 Evidence of Extraneous Variable Threats (3) Sampling 25

26 Sample Differences Result In Different Raw-to- Scale Score Tables Form A Form B Diff Sample N Median Mean SD N Median Mean Old Sample 4, , New Sample 4, , SD Median Mean (SE) *

27 Partial Raw-to-Scaled Score Tables for Old and New Samples Old Sample New sample Differences Form A Form B Form A Form B Old Sample New Sample

28 Evidence of Extraneous Variable Threats (4) Test Design Effect Slides # 35 through #37 were from and modified based on Score Drift and Design Effect by G. W. Phillips at NCME

29 How does the design effect impact estimates of sample size (n)? One of design effects is in the spiraling of field test forms associated with field testing. Let s assume that we have 12 field test forms that will be administered to 12,000 students in the state. If we could use a simple random sample (SRS) we would draw 12,000 students from across the state and administer each form to a random sub-sample of 1000 students. The standard error of the mean for each form would equal 5 scaled score units as Design 1 shows. 29

30 How does the design effect impact estimates of sample size (n)? The effect of clustering in field test samples Spiraling Design Forms are administered SE of the Mean Effective Sample Size and needed sample Design 1 (SRS) Random statewide sample /1000 Design 2 (cluster) Design 3 (cluster) Spiraled by students within classrooms Spiraled by classrooms within schools / / 4200 Design 4 (cluster) Spiraled by schools within districts /20800 Design 5 (cluster) Spiraled by districts /

31 How does the design effect impact estimates of sample size (n)? When we do sampling at the group level as designs 2 through 5, we need to add the cluster variance into the error variance. If we use the formulas used for SRS for sample size or sampling errors in the cluster sampling, we actually underestimate the error variance as the table shows. Consequently, their statistics (e.g., p-values, pointbiserials, different item function statistics, item parameter estimates, means, correlations, equating error, etc) have substantially more error associated with them. 31

32 How would we be aware of extraneous variables threats to the tests Be careful with every step of assessment process: Test design: Be careful with post-equating designs and pre-equating designs Rater drifting For anchor items For pre-equating forms For intact forms Be careful of editing anchor items and intact forms Be careful of changing the testing time, content subject sequence, and test format change over years Be careful of the choice of statistical methods 32

33 How would we be aware of extraneous variables threats to the tests Be careful of Sampling: Sample representativeness Poor sampling procedures or non-random sampling Appropriately estimate sample size or standard errors Be careful of not only the choice of statistical methods, but also different computation software, and different versions of the same software 33

34 What A Testing Professional Can do? We have responsibilities: statistical methods/tools can only solve the problems that we want to solve. It is up to the test professionals to construct the test that takes into account possible extraneous variables. We need to ensure the score validity as much as possible. More possible extraneous variables are accounted for, we actually ensure the validity of the test better. Find the balanced between the best and the ideal plan. In the real testing world with limited control of extraneous variables, we may have to realize some extraneous variables are controllable but not all of them. Thus, we need a balanced between the best plan and the ideal plan. 34

35 Thank you for your listening and your time! 35

An Alternative to the Trend Scoring Method for Adjusting Scoring Shifts. in Mixed-Format Tests. Xuan Tan. Sooyeon Kim. Insu Paek.

An Alternative to the Trend Scoring Method for Adjusting Scoring Shifts. in Mixed-Format Tests. Xuan Tan. Sooyeon Kim. Insu Paek. An Alternative to the Trend Scoring Method for Adjusting Scoring Shifts in Mixed-Format Tests Xuan Tan Sooyeon Kim Insu Paek Bihua Xiang ETS, Princeton, NJ Paper presented at the annual meeting of the

More information

SJSU Annual Program Assessment Form Academic Year

SJSU Annual Program Assessment Form Academic Year SJSU Annual Program Assessment Form Academic Year 2015 2016 Department: Industrial and Systems Engineering Program: B.S. ISE College: Engineering Program Website: http://ise.sjsu.edu/content/bs ise Link

More information

Interpreting the Item Analysis Score Report Statistical Information

Interpreting the Item Analysis Score Report Statistical Information Interpreting the Item Analysis Score Report Statistical Information This guide will provide information that will help you interpret the statistical information relating to the Item Analysis Report generated

More information

The knowledge and skills involved in clinical audit

The knowledge and skills involved in clinical audit The knowledge and skills involved in clinical A list of possible knowledge and skills related to clinical is in the box. Knowledge and skills involved in clinical Knowledge Skills The clinical process

More information

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison Empowered by Psychometrics The Fundamentals of Psychometrics Jim Wollack University of Wisconsin Madison Psycho-what? Psychometrics is the field of study concerned with the measurement of mental and psychological

More information

investigate. educate. inform.

investigate. educate. inform. investigate. educate. inform. Research Design What drives your research design? The battle between Qualitative and Quantitative is over Think before you leap What SHOULD drive your research design. Advanced

More information

GENERALIZABILITY AND RELIABILITY: APPROACHES FOR THROUGH-COURSE ASSESSMENTS

GENERALIZABILITY AND RELIABILITY: APPROACHES FOR THROUGH-COURSE ASSESSMENTS GENERALIZABILITY AND RELIABILITY: APPROACHES FOR THROUGH-COURSE ASSESSMENTS Michael J. Kolen The University of Iowa March 2011 Commissioned by the Center for K 12 Assessment & Performance Management at

More information

HPS301 Exam Notes- Contents

HPS301 Exam Notes- Contents HPS301 Exam Notes- Contents Week 1 Research Design: What characterises different approaches 1 Experimental Design 1 Key Features 1 Criteria for establishing causality 2 Validity Internal Validity 2 Threats

More information

Assessment with Multiple-Choice Questions in Medical Education: Arguments for Selected-Response Formats

Assessment with Multiple-Choice Questions in Medical Education: Arguments for Selected-Response Formats Assessment with Multiple-Choice Questions in Medical Education: Arguments for Selected-Response Formats Congreso Nacional De Educacion Medica Puebla, Mexico 11 January, 2007 Steven M. Downing, PhD Department

More information

Investigations in Number, Data, and Space, Grade 4, 2nd Edition 2008 Correlated to: Washington Mathematics Standards for Grade 4

Investigations in Number, Data, and Space, Grade 4, 2nd Edition 2008 Correlated to: Washington Mathematics Standards for Grade 4 Grade 4 Investigations in Number, Data, and Space, Grade 4, 2nd Edition 2008 4.1. Core Content: Multi-digit multiplication (Numbers, Operations, Algebra) 4.1.A Quickly recall multiplication facts through

More information

Does the Relationship between Motivation and Performance Differ with Ability?

Does the Relationship between Motivation and Performance Differ with Ability? James Madison University JMU Scholarly Commons Department of Graduate Psychology - Faculty Scholarship Department of Graduate Psychology 4-1999 Does the Relationship between Motivation and Performance

More information

Pediatrics Milestones and Meaningful Assessment Translating the Pediatrics Milestones into Assessment Items for use in the Clinical Setting

Pediatrics Milestones and Meaningful Assessment Translating the Pediatrics Milestones into Assessment Items for use in the Clinical Setting Pediatrics Milestones and Meaningful Assessment Translating the Pediatrics Milestones into Assessment Items for use in the Clinical Setting Ann Burke Susan Guralnick Patty Hicks Jeanine Ronan Dan Schumacher

More information

Comparing Multiple-Choice, Essay and Over-Claiming Formats as Efficient Measures of Knowledge

Comparing Multiple-Choice, Essay and Over-Claiming Formats as Efficient Measures of Knowledge Comparing Multiple-Choice, Essay and Over-Claiming Formats as Efficient Measures of Knowledge Kevin M. Williams, Bryce Westlake, & Delroy L. Paulhus The University of British Columbia Poster presented

More information

25. Two-way ANOVA. 25. Two-way ANOVA 371

25. Two-way ANOVA. 25. Two-way ANOVA 371 25. Two-way ANOVA The Analysis of Variance seeks to identify sources of variability in data with when the data is partitioned into differentiated groups. In the prior section, we considered two sources

More information

Experimental Psychology

Experimental Psychology Title Experimental Psychology Type Individual Document Map Authors Aristea Theodoropoulos, Patricia Sikorski Subject Social Studies Course None Selected Grade(s) 11, 12 Location Roxbury High School Curriculum

More information

ISC- GRADE XI HUMANITIES ( ) PSYCHOLOGY. Chapter 2- Methods of Psychology

ISC- GRADE XI HUMANITIES ( ) PSYCHOLOGY. Chapter 2- Methods of Psychology ISC- GRADE XI HUMANITIES (2018-19) PSYCHOLOGY Chapter 2- Methods of Psychology OUTLINE OF THE CHAPTER (i) Scientific Methods in Psychology -observation, case study, surveys, psychological tests, experimentation

More information

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604 Measurement and Descriptive Statistics Katie Rommel-Esham Education 604 Frequency Distributions Frequency table # grad courses taken f 3 or fewer 5 4-6 3 7-9 2 10 or more 4 Pictorial Representations Frequency

More information

Performance-Based Assessments. Performance-Based Tests

Performance-Based Assessments. Performance-Based Tests Performance-Based Assessments Performance-Based Tests Performance Assessments Referred to as authentic assessments or alternative assessments (Grunlund & Linn, 2000:160-161) (a) authentic -- stresses on

More information

A Cross-validation of easycbm Mathematics Cut Scores in. Oregon: Technical Report # Daniel Anderson. Julie Alonzo.

A Cross-validation of easycbm Mathematics Cut Scores in. Oregon: Technical Report # Daniel Anderson. Julie Alonzo. Technical Report # 1104 A Cross-validation of easycbm Mathematics Cut Scores in Oregon: 2009-2010 Daniel Anderson Julie Alonzo Gerald Tindal University of Oregon Published by Behavioral Research and Teaching

More information

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES CHAPTER SIXTEEN Regression NOTE TO INSTRUCTORS This chapter includes a number of complex concepts that may seem intimidating to students. Encourage students to focus on the big picture through some of

More information

The Necker Cube An Alternative Measure of Direct Attention

The Necker Cube An Alternative Measure of Direct Attention Proceedings of The National Conference On Undergraduate Research (NCUR) 2011 Ithaca College, New York March 31 - April 2, 2011 The Necker Cube An Alternative Measure of Direct Attention Jessica Hurlbut

More information

ADDES-3 QUICK SCORE SCHOOL VERION PROFILE SAMPLE

ADDES-3 QUICK SCORE SCHOOL VERION PROFILE SAMPLE ATTENTION DEFICIT DISORDERS EVALUATION SCALE-THIRD EDITIO SCHOOL VERSION RATING FORM PROFILE SHEET SUMMARY OF SCORES Name: Andrew Thomas Gender: Male Raw Standard Standard School: Midvale Middle School

More information

Analyzing a Food Nutrition Label for Sodium Intake. Content Analysis Report. Renee Adams Gail Arakaki Ryan Tanaka Michelle Yokota

Analyzing a Food Nutrition Label for Sodium Intake. Content Analysis Report. Renee Adams Gail Arakaki Ryan Tanaka Michelle Yokota Analyzing a Food Nutrition Label for Sodium Intake Content Analysis Report Renee Adams Gail Arakaki Ryan Tanaka Michelle Yokota adamsrk@hawaii.edu gailk@hawaii.edu rktanaka@hawaii.edu yokotam@hawaii.edu

More information

Smarter Balanced Interim Assessment Blocks Total Number of Items and hand scoring Requirements by Grade and Subject.

Smarter Balanced Interim Assessment Blocks Total Number of Items and hand scoring Requirements by Grade and Subject. Smarter Balanced Interim Assessment Blocks of Items and hand scoring Requirements by Grade and Subject. The following tables are intended to assist coordinators, site coordinators, and test administrators

More information

Between Groups & Within-Groups ANOVA

Between Groups & Within-Groups ANOVA Between Groups & Within-Groups ANOVA BG & WG ANOVA Partitioning Variation making F making effect sizes Things that influence F Confounding Inflated within-condition variability Integrating stats & methods

More information

Introduction to statistics Dr Alvin Vista, ACER Bangkok, 14-18, Sept. 2015

Introduction to statistics Dr Alvin Vista, ACER Bangkok, 14-18, Sept. 2015 Analysing and Understanding Learning Assessment for Evidence-based Policy Making Introduction to statistics Dr Alvin Vista, ACER Bangkok, 14-18, Sept. 2015 Australian Council for Educational Research Structure

More information

RATER EFFECTS AND ALIGNMENT 1. Modeling Rater Effects in a Formative Mathematics Alignment Study

RATER EFFECTS AND ALIGNMENT 1. Modeling Rater Effects in a Formative Mathematics Alignment Study RATER EFFECTS AND ALIGNMENT 1 Modeling Rater Effects in a Formative Mathematics Alignment Study An integrated assessment system considers the alignment of both summative and formative assessments with

More information

Preparation Action Reflection

Preparation Action Reflection Preparation Action Reflection Preparation: Meet a recognized need in the community What is the need in the community? Every 5 minutes someone in the United States is diagnosed with a blood cancer. Every

More information

Chapter 9 Experimental Research (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters.

Chapter 9 Experimental Research (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters. Chapter 9 Experimental Research (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters.) In this chapter we talk about what experiments are, we

More information

Experiment 1: Scientific Measurements and Introduction to Excel

Experiment 1: Scientific Measurements and Introduction to Excel Experiment 1: Scientific Measurements and Introduction to Excel Reading: Chapter 1 of your textbook and this lab handout. Learning Goals for Experiment 1: To use a scientific notebook as a primary record

More information

Designing Experiments... Or how many times and ways can I screw that up?!?

Designing Experiments... Or how many times and ways can I screw that up?!? www.geo.uzh.ch/microsite/icacogvis/ Designing Experiments... Or how many times and ways can I screw that up?!? Amy L. Griffin AutoCarto 2012, Columbus, OH Outline When do I need to run an experiment and

More information

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity Measurement & Variables - Initial step is to conceptualize and clarify the concepts embedded in a hypothesis or research question with

More information

Lecture 12: Normal Probability Distribution or Normal Curve

Lecture 12: Normal Probability Distribution or Normal Curve 12_normalcurve.pdf Michael Hallstone, Ph.D. hallston@hawaii.edu Lecture 12: Normal Probability Distribution or Normal Curve The real importance of this lecture is to show you what a normal curve looks

More information

Differential Item Functioning

Differential Item Functioning Differential Item Functioning Lecture #11 ICPSR Item Response Theory Workshop Lecture #11: 1of 62 Lecture Overview Detection of Differential Item Functioning (DIF) Distinguish Bias from DIF Test vs. Item

More information

Psychometrics for Beginners. Lawrence J. Fabrey, PhD Applied Measurement Professionals

Psychometrics for Beginners. Lawrence J. Fabrey, PhD Applied Measurement Professionals Psychometrics for Beginners Lawrence J. Fabrey, PhD Applied Measurement Professionals Learning Objectives Identify key NCCA Accreditation requirements Identify two underlying models of measurement Describe

More information

Alignment in Educational Testing: What it is, What it isn t, and Why it is Important

Alignment in Educational Testing: What it is, What it isn t, and Why it is Important Alignment in Educational Testing: What it is, What it isn t, and Why it is Important Stephen G. Sireci University of Massachusetts Amherst Presentation delivered at the Connecticut Assessment Forum Rocky

More information

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this

More information

Experimental Design. Dewayne E Perry ENS C Empirical Studies in Software Engineering Lecture 8

Experimental Design. Dewayne E Perry ENS C Empirical Studies in Software Engineering Lecture 8 Experimental Design Dewayne E Perry ENS 623 Perry@ece.utexas.edu 1 Problems in Experimental Design 2 True Experimental Design Goal: uncover causal mechanisms Primary characteristic: random assignment to

More information

CP Statistics Sem 1 Final Exam Review

CP Statistics Sem 1 Final Exam Review Name: _ Period: ID: A CP Statistics Sem 1 Final Exam Review Multiple Choice Identify the choice that best completes the statement or answers the question. 1. A particularly common question in the study

More information

Environmental Issues. Chapter 3, Section 1

Environmental Issues. Chapter 3, Section 1 Environmental Issues Chapter 3, Section 1 1 Chapter 3 Table of Contents- Glue on S page 57 Page # Topic 58 Background Knowledge 58 Define environmental issue 59 Issues (2) & Debate (2) 60 Notes 3.1 61

More information

CAUTIONS ABOUT THE PRACTICE EXAM

CAUTIONS ABOUT THE PRACTICE EXAM Dr. Michael Passer Psych 209, U. of Washington CAUTIONS ABOUT THE PRACTICE EXAM DEAR STUDENT: This practice tests consists of questions from past exams. By taking this practice test, you should gain an

More information

MCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and. Lord Equating Methods 1,2

MCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and. Lord Equating Methods 1,2 MCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and Lord Equating Methods 1,2 Lisa A. Keller, Ronald K. Hambleton, Pauline Parker, Jenna Copella University of Massachusetts

More information

Psychometrics, Measurement Validity & Data Collection

Psychometrics, Measurement Validity & Data Collection Psychometrics, Measurement Validity & Data Collection Psychometrics & Measurement Validity Measurement & Constructs Kinds of items and their combinations Properties of a good measure Data Collection Observational

More information

1 Motion. What is motion? Focus Questions: How do we measure it? How do we show it? Name: pd date

1 Motion. What is motion? Focus Questions: How do we measure it? How do we show it? Name: pd date 1 Motion Name: pd date Quick Write Focus Question: draw motion use words to describe motion (do not use move, movement, etc) Motion Notes definition symbol formula motion position initial position final

More information

Do not write your name on this examination all 40 best

Do not write your name on this examination all 40 best Student #: Do not write your name on this examination Research in Psychology I; Final Exam Fall 200 Instructor: Jeff Aspelmeier NOTE: THIS EXAMINATION MAY NOT BE RETAINED BY STUDENTS This exam is worth

More information

Math Released Item Grade 3. Find the Area and Identify Equal Areas 1749-M23082

Math Released Item Grade 3. Find the Area and Identify Equal Areas 1749-M23082 Math Released Item 2018 Grade 3 Find the Area and Identify Equal Areas 1749-M23082 Anchor Set A1 A6 With Annotations Prompt 1749-M23082 Rubric Part A Score Description 1 This part of the item is machine

More information

Linking Mixed-Format Tests Using Multiple Choice Anchors. Michael E. Walker. Sooyeon Kim. ETS, Princeton, NJ

Linking Mixed-Format Tests Using Multiple Choice Anchors. Michael E. Walker. Sooyeon Kim. ETS, Princeton, NJ Linking Mixed-Format Tests Using Multiple Choice Anchors Michael E. Walker Sooyeon Kim ETS, Princeton, NJ Paper presented at the annual meeting of the American Educational Research Association (AERA) and

More information

A Penny for Your Thoughts: Scientific Measurements and Introduction to Excel

A Penny for Your Thoughts: Scientific Measurements and Introduction to Excel A Penny for Your Thoughts: Scientific Measurements and Introduction to Excel Pre-lab Assignment: Introduction Reading: 1. Chapter sections 1.4 through 1.6 in your course text. 2. This lab handout. Questions:

More information

Importance of Good Measurement

Importance of Good Measurement Importance of Good Measurement Technical Adequacy of Assessments: Validity and Reliability Dr. K. A. Korb University of Jos The conclusions in a study are only as good as the data that is collected. The

More information

Confounds: Threats to Validity. Why Are They Important?

Confounds: Threats to Validity. Why Are They Important? Confounds: Threats to Validity Why Are They Important? Previously, we asked whether you would recommend that car magazine (Auto Tester Weakly) to a friend. We concluded that its recommendations were not

More information

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Greg Pope, Analytics and Psychometrics Manager 2008 Users Conference San Antonio Introduction and purpose of this session

More information

Experiment 1: Scientific Measurements and Introduction to Excel

Experiment 1: Scientific Measurements and Introduction to Excel Experiment 1: Scientific Measurements and Introduction to Excel Reading: Chapter 1 of your textbook and this lab handout. Learning Goals for Experiment 1: To use a scientific notebook as a primary record

More information

Measurement is the process of observing and recording the observations. Two important issues:

Measurement is the process of observing and recording the observations. Two important issues: Farzad Eskandanian Measurement is the process of observing and recording the observations. Two important issues: 1. Understanding the fundamental ideas: Levels of measurement: nominal, ordinal, interval

More information

CHAPTER 3 METHOD AND PROCEDURE

CHAPTER 3 METHOD AND PROCEDURE CHAPTER 3 METHOD AND PROCEDURE Previous chapter namely Review of the Literature was concerned with the review of the research studies conducted in the field of teacher education, with special reference

More information

The Impact of Statistically Adjusting for Rater Effects on Conditional Standard Errors for Performance Ratings

The Impact of Statistically Adjusting for Rater Effects on Conditional Standard Errors for Performance Ratings 0 The Impact of Statistically Adjusting for Rater Effects on Conditional Standard Errors for Performance Ratings Mark R. Raymond, Polina Harik and Brain E. Clauser National Board of Medical Examiners 1

More information

Equine-Assisted Psychotherapy s Impact on the Self-Efficacy of Adult Female Victims of Interpersonal Violence

Equine-Assisted Psychotherapy s Impact on the Self-Efficacy of Adult Female Victims of Interpersonal Violence Equine-Assisted Psychotherapy s Impact on the Self-Efficacy of Adult Female Victims of Interpersonal Violence Dr. Wanda Whittlesey-Jerome, Principal Investigator, & Karen Longenecker, Observer, New Mexico

More information

I Feel: Stressed Lesson Plan

I Feel: Stressed Lesson Plan I Feel: Stressed Lesson Plan Course Description This course is designed to define stress and identify its causes. It will also discuss ways to reduce stress and suggest who young people can turn to when

More information

How Can We Improve the Blood Donation Process in the United States?

How Can We Improve the Blood Donation Process in the United States? How Can We Improve the Blood Donation Process in the United States? By H. H. Modeling Team: H. H. and Hilary Johnson Wilson High School Math Modeling and System Dynamics May 8, 009 System Dynamics Advisor:

More information

Interviewer: Tell us about the workshops you taught on Self-Determination.

Interviewer: Tell us about the workshops you taught on Self-Determination. INTERVIEW WITH JAMIE POPE This is an edited translation of an interview by Jelica Nuccio on August 26, 2011. Jelica began by explaining the project (a curriculum for SSPs, and for Deaf-Blind people regarding

More information

1.2A: Subitize (0-10) Use dominoes (real tiles or domino pictures), dot cards, 10-frames cards, and/or other tools with structured arrangements.

1.2A: Subitize (0-10) Use dominoes (real tiles or domino pictures), dot cards, 10-frames cards, and/or other tools with structured arrangements. FIRST GRADE 1.2 NUMBER AND OPERATIONS. The student applies mathematical process standards to represent and compare whole numbers, the relative position and magnitude of whole numbers, and relationships

More information

Kathleen Schrader DNS, RN Dean Math, Science & Engineering Irvine Valley College. 2/26/2010 Schrader, ACCCA San Francisco 2/2010 1

Kathleen Schrader DNS, RN Dean Math, Science & Engineering Irvine Valley College. 2/26/2010 Schrader, ACCCA San Francisco 2/2010 1 Kathleen Schrader DNS, RN Dean Math, Science & Engineering Irvine Valley College 2/26/2010 Schrader, ACCCA San Francisco 2/2010 1 http://www.youtube.com/watch?v=aayclwgmx5w 2/26/2010 Schrader, ACCCA San

More information

MTH 225: Introductory Statistics

MTH 225: Introductory Statistics Marshall University College of Science Mathematics Department MTH 225: Introductory Statistics Course catalog description Basic probability, descriptive statistics, fundamental statistical inference procedures

More information

FORM C Dr. Sanocki, PSY 3204 EXAM 1 NAME

FORM C Dr. Sanocki, PSY 3204 EXAM 1 NAME PSYCH STATS OLD EXAMS, provided for self-learning. LEARN HOW TO ANSWER the QUESTIONS; memorization of answers won t help. All answers are in the textbook or lecture. Instructors can provide some clarification

More information

I. Introduction and Data Collection B. Sampling. 1. Bias. In this section Bias Random Sampling Sampling Error

I. Introduction and Data Collection B. Sampling. 1. Bias. In this section Bias Random Sampling Sampling Error I. Introduction and Data Collection B. Sampling In this section Bias Random Sampling Sampling Error 1. Bias Bias a prejudice in one direction (this occurs when the sample is selected in such a way that

More information

Technical Specifications

Technical Specifications Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically

More information

Psychometric Properties of the Mean Opinion Scale

Psychometric Properties of the Mean Opinion Scale Psychometric Properties of the Mean Opinion Scale James R. Lewis IBM Voice Systems 1555 Palm Beach Lakes Blvd. West Palm Beach, Florida jimlewis@us.ibm.com Abstract The Mean Opinion Scale (MOS) is a seven-item

More information

Psychology Research Process

Psychology Research Process Psychology Research Process Logical Processes Induction Observation/Association/Using Correlation Trying to assess, through observation of a large group/sample, what is associated with what? Examples:

More information

Lesson 8 Descriptive Statistics: Measures of Central Tendency and Dispersion

Lesson 8 Descriptive Statistics: Measures of Central Tendency and Dispersion Lesson 8 Descriptive Statistics: Measures of Central Tendency and Dispersion Learning Objectives All students will define key terms and calculate different descriptive statistics. All students should be

More information

Welcome to OSA Training Statistics Part II

Welcome to OSA Training Statistics Part II Welcome to OSA Training Statistics Part II Course Summary Using data about a population to draw graphs Frequency distribution and variability within populations Bell Curves: What are they and where do

More information

John A. Nunnery, Ed.D. Executive Director, The Center for Educational Partnerships Old Dominion University

John A. Nunnery, Ed.D. Executive Director, The Center for Educational Partnerships Old Dominion University An Examination of the Effect of a Pilot of the National Institute for School Leadership s Executive Development Program on School Performance Trends in Massachusetts John A. Nunnery, Ed.D. Executive Director,

More information

AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK

AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK AN ANALYSIS ON VALIDITY AND RELIABILITY OF TEST ITEMS IN PRE-NATIONAL EXAMINATION TEST SMPN 14 PONTIANAK Hanny Pradana, Gatot Sutapa, Luwandi Suhartono Sarjana Degree of English Language Education, Teacher

More information

STRENGTHENING CHILDREN S ORAL LANGUAGE AND LITERACY: ASKING QUESTIONS

STRENGTHENING CHILDREN S ORAL LANGUAGE AND LITERACY: ASKING QUESTIONS STRENGTHENING CHILDREN S ORAL LANGUAGE AND LITERACY: ASKING QUESTIONS ORAL LANGUAGE SUPPORTING EARLY LITERACY RESEARCH PROJECT Hugh McCusker OLSEL Project Officer Asking Questions Background to the Project

More information

How to Compute Mean Scores: CFAI-W

How to Compute Mean Scores: CFAI-W How to Compute Mean Scores: CFAI-W Here are the steps for computing the mean scores for the CFAI-W subscales. Compute each mean to four decimal places. Step 1 Reverse-score the following items: 5,7,8,14,15,16,21,23,28,38,40,42,43,45,46,47,49,53,56,61,63,69,73,79,80,81,82

More information

What Is the Fat Intake of a Typical Middle School Student?

What Is the Fat Intake of a Typical Middle School Student? What Is the Fat Intake of a Typical Middle School Student? Health professionals believe that the average American diet contains too much fat. Although there is no Recommended Daily Allowance for fat, it

More information

Multiple Regression Models

Multiple Regression Models Multiple Regression Models Advantages of multiple regression Parts of a multiple regression model & interpretation Raw score vs. Standardized models Differences between r, b biv, b mult & β mult Steps

More information

Basic concepts and principles of classical test theory

Basic concepts and principles of classical test theory Basic concepts and principles of classical test theory Jan-Eric Gustafsson What is measurement? Assignment of numbers to aspects of individuals according to some rule. The aspect which is measured must

More information

AS Psychology Curriculum Plan & Scheme of work

AS Psychology Curriculum Plan & Scheme of work AS Psychology Curriculum Plan & Scheme of work 2015-16 Week Content Further detail and reference to specification H/w. Reading & Notes, Resources, Extension activities Hodder textbook pages 1-12 Hodder

More information

Fostering a Culture of

Fostering a Culture of Trauma Sensitive Schools: Fostering a Culture of Begins with Adults MSCA Fall Conference 2017 Jenni Beck & Michelle Hofmann Raymore-Peculiar School District Be kinder than necessary because everyone you

More information

Statistics Coursework Free Sample. Statistics Coursework

Statistics Coursework Free Sample. Statistics Coursework Statistics Coursework For my initial investigation I am going to compare results on the following hypothesis, to see if people s intelligence affects their height and their ability to memorise a certain

More information

Chapter Three: Sampling Methods

Chapter Three: Sampling Methods Chapter Three: Sampling Methods The idea of this chapter is to make sure that you address sampling issues - even though you may be conducting an action research project and your sample is "defined" by

More information

3 CONCEPTUAL FOUNDATIONS OF STATISTICS

3 CONCEPTUAL FOUNDATIONS OF STATISTICS 3 CONCEPTUAL FOUNDATIONS OF STATISTICS In this chapter, we examine the conceptual foundations of statistics. The goal is to give you an appreciation and conceptual understanding of some basic statistical

More information

A Programming Approach to the CS Principles Data Task

A Programming Approach to the CS Principles Data Task A Programming Approach to the CS Principles Data Task (from The Beauty and Joy of Computing team) UC Berkeley Sr Lecturer SOE Dan Garcia bjc.berkeley.edu Broadening Participation in HS CS! New Course:

More information

Colorado Student Assessment Program

Colorado Student Assessment Program Colorado Student Assessment Program Technical Report 2007 Submitted to the Colorado Department of Education November 2007 Developed and published under contract with Colorado Department of Education by

More information

Student Performance Q&A:

Student Performance Q&A: Student Performance Q&A: 2009 AP Statistics Free-Response Questions The following comments on the 2009 free-response questions for AP Statistics were written by the Chief Reader, Christine Franklin of

More information

Introduction: Speaker. Introduction: Buros. Buros & Education. Introduction: Participants. Goal 10/5/2012

Introduction: Speaker. Introduction: Buros. Buros & Education. Introduction: Participants. Goal 10/5/2012 Introduction: Speaker PhD in Educational Measurement University of Nebraska-Lincoln October 28, 2012 CRITICAL TESTING AND MEASUREMENT CONCEPTS: ASSESSMENT PROFESSIONALS 13 years experience HE Assessment

More information

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this

More information

Demonstrating validity

Demonstrating validity Demonstrating validity Nivja de Jong & Jelle Goeman What is validity? Construct validity Criterion validity Face validity Content validity Consequential validity Validity, back to basics Cattell, 1946;

More information

STEP II Conceptualising a Research Design

STEP II Conceptualising a Research Design STEP II Conceptualising a Research Design This operational step includes two chapters: Chapter 7: The research design Chapter 8: Selecting a study design CHAPTER 7 The Research Design In this chapter you

More information

FOR TEACHERS ONLY. The University of the State of New York REGENTS HIGH SCHOOL EXAMINATION INTEGRATED ALGEBRA

FOR TEACHERS ONLY. The University of the State of New York REGENTS HIGH SCHOOL EXAMINATION INTEGRATED ALGEBRA FOR TEACHERS ONLY The University of the State of New Yk REGENTS HIGH SCHOOL EXAMINATION INTEGRATED ALGEBRA Thursday, January 28, 2010 1:15 to 4:15 p.m., only SCORING KEY AND RATING GUIDE Mechanics of Rating

More information

Research Approaches Quantitative Approach. Research Methods vs Research Design

Research Approaches Quantitative Approach. Research Methods vs Research Design Research Approaches Quantitative Approach DCE3002 Research Methodology Research Methods vs Research Design Both research methods as well as research design are crucial for successful completion of any

More information

PSY 250. Experimental Design: The Basic Building Blocks. Simple between subjects design. The Two-Group Design 7/25/2015. Experimental design

PSY 250. Experimental Design: The Basic Building Blocks. Simple between subjects design. The Two-Group Design 7/25/2015. Experimental design Experimental Design: The Basic Building Blocks PSY 250 Experimental design The general plan for selecting participants, assigning participants to experimental conditions, controlling extraneous variables,

More information

HOW STATISTICS IMPACT PHARMACY PRACTICE?

HOW STATISTICS IMPACT PHARMACY PRACTICE? HOW STATISTICS IMPACT PHARMACY PRACTICE? CPPD at NCCR 13 th June, 2013 Mohamed Izham M.I., PhD Professor in Social & Administrative Pharmacy Learning objective.. At the end of the presentation pharmacists

More information

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when.

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when. INTRO TO RESEARCH METHODS: Empirical Knowledge: based on observations. Answer questions why, whom, how, and when. Experimental research: treatments are given for the purpose of research. Experimental group

More information

Chapter 2 Norms and Basic Statistics for Testing MULTIPLE CHOICE

Chapter 2 Norms and Basic Statistics for Testing MULTIPLE CHOICE Chapter 2 Norms and Basic Statistics for Testing MULTIPLE CHOICE 1. When you assert that it is improbable that the mean intelligence test score of a particular group is 100, you are using. a. descriptive

More information

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1 Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola 1.1-1 Chapter 1 Introduction to Statistics 1-1 Review and Preview 1-2 Statistical Thinking 1-3

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Session 401. Creating Assessments that Effectively Measure Knowledge, Skill, and Ability

Session 401. Creating Assessments that Effectively Measure Knowledge, Skill, and Ability Practical and Effective Assessments in e-learning Session 401 Creating Assessments that Effectively Measure Knowledge, Skill, and Ability Howard Eisenberg, Questionmark Effectively Measuring Knowledge,

More information

Diagnostic Classification Models

Diagnostic Classification Models Diagnostic Classification Models Lecture #13 ICPSR Item Response Theory Workshop Lecture #13: 1of 86 Lecture Overview Key definitions Conceptual example Example uses of diagnostic models in education Classroom

More information

Interpretation Clinical significance: what does it mean?

Interpretation Clinical significance: what does it mean? Interpretation Clinical significance: what does it mean? Patrick Marquis, MD, MBA Mapi Values - Boston DIA workshop Assessing Treatment Impact Using PROs: Challenges in Study Design, Conduct and Analysis

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Objectives: Students will: Construct and interpret a scatterplot for a set of bivariate data. Compute and interpret the correlation, r, between two variables. Demonstrate

More information