STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES)

Similar documents
The t-test: Answers the question: is the difference between the two conditions in my experiment "real" or due to chance?

Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 1:

One-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels;

Lesson 11.1: The Alpha Value

Applied Statistical Analysis EDUC 6050 Week 4

CHAPTER OBJECTIVES - STUDENTS SHOULD BE ABLE TO:

USING STATCRUNCH TO CONSTRUCT CONFIDENCE INTERVALS and CALCULATE SAMPLE SIZE

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

APPENDIX N. Summary Statistics: The "Big 5" Statistical Tools for School Counselors

Module 28 - Estimating a Population Mean (1 of 3)

Chapter 12. The One- Sample

Psychology Research Process

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

PSYCHOLOGY 300B (A01) One-sample t test. n = d = ρ 1 ρ 0 δ = d (n 1) d

Creative Commons Attribution-NonCommercial-Share Alike License

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results.

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Designing Psychology Experiments: Data Analysis and Presentation

Conduct an Experiment to Investigate a Situation

The Single-Sample t Test and the Paired-Samples t Test

Psychology Research Process

Sheila Barron Statistics Outreach Center 2/8/2011

Unit 1 Exploring and Understanding Data

Lecture 18: Controlled experiments. April 14

Missy Wittenzellner Big Brother Big Sister Project

The Scientific Method

25. Two-way ANOVA. 25. Two-way ANOVA 371

Instructions for doing two-sample t-test in Excel

EXERCISE: HOW TO DO POWER CALCULATIONS IN OPTIMAL DESIGN SOFTWARE

Population. Sample. AP Statistics Notes for Chapter 1 Section 1.0 Making Sense of Data. Statistics: Data Analysis:

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

Designed Experiments have developed their own terminology. The individuals in an experiment are often called subjects.

Limitations of 2-cond Designs

BIOSTATISTICAL METHODS

appstats26.notebook April 17, 2015

MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1. Lecture 27: Systems Biology and Bayesian Networks

Name: Date: Solubility Lab - Worksheet #3 Level 1

INVESTIGATION SLEEPLESS DRIVERS

Scientific Inquiry Review

Something to think about. What happens, however, when we have a sample with less than 30 items?

Part I: Alcohol Metabolization Explore and Explain

Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion

Chapter 7: Descriptive Statistics

TOC: VE examples, VE student surveys, VE diagnostic questions Virtual Experiments Examples

How to Conduct On-Farm Trials. Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona

Control Chart Basics PK

Chapter 13 Summary Experiments and Observational Studies

Math 075 Activities and Worksheets Book 2:

CCM6+7+ Unit 12 Data Collection and Analysis

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010

Handout on Perfect Bayesian Equilibrium

Kepler tried to record the paths of planets in the sky, Harvey to measure the flow of blood in the circulatory system, and chemists tried to produce

Midterm project due next Wednesday at 2 PM

UF#Stats#Club#STA#2023#Exam#1#Review#Packet# #Fall#2013#

The Epidemic Model 1. Problem 1a: The Basic Model

More Designs. Section 4.2B

Reflection Questions for Math 58B

Six Sigma Glossary Lean 6 Society

Never P alone: The value of estimates and confidence intervals

Lab 2: The Scientific Method. Summary

MiSP Solubility Lab L3

Name Psychophysical Methods Laboratory

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics

Using Your Robo-Pad Remote Control

Chapter 12: Introduction to Analysis of Variance

Standard Deviation and Standard Error Tutorial. This is significantly important. Get your AP Equations and Formulas sheet

Final Exam Practice Test

DIFFERENCE BETWEEN TWO MEANS: THE INDEPENDENT GROUPS T-TEST

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

One-Way Independent ANOVA

EPS 625 INTERMEDIATE STATISTICS TWO-WAY ANOVA IN-CLASS EXAMPLE (FLEXIBILITY)

What is Data? Part 2: Patterns & Associations. INFO-1301, Quantitative Reasoning 1 University of Colorado Boulder

THIS PROBLEM HAS BEEN SOLVED BY USING THE CALCULATOR. A 90% CONFIDENCE INTERVAL IS ALSO SHOWN. ALL QUESTIONS ARE LISTED BELOW THE RESULTS.

Research Methods II, Spring Term Logic of repeated measures designs

Chapter 11: Experiments and Observational Studies p 318

Statistics: Interpreting Data and Making Predictions. Interpreting Data 1/50

Comparing Two Means using SPSS (T-Test)

Statistical inference provides methods for drawing conclusions about a population from sample data.

Limitations of 2-cond Designs

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

MITOCW conditional_probability

FUNCTIONAL INJURY PREVENTION EXERCISES Part 2 The Hip Complex Jerry Shreck

What do you think of the following research? I m interested in whether a low glycemic index diet gives better control of diabetes than a high

Still important ideas

How should each run feel?!

Our goal in this section is to explain a few more concepts about experiments. Don t be concerned with the details.

12-Week Vertical Jump Program Trial Version (first 5 weeks of program)

Homework Exercises for PSYC 3330: Statistics for the Behavioral Sciences

Student Performance Q&A:

Inferential Statistics

Faculty of Physical Education & Sport Sciences, Diala University. Seif El Din Watheq :

Lesson 1: Distributions and Their Shapes

Welcome to this series focused on sources of bias in epidemiologic studies. In this first module, I will provide a general overview of bias.

1) What is the independent variable? What is our Dependent Variable?

Center of Dilation. Vertical Distance: 6. Horizontal Distance: 4

Comparison of two means

t-test for r Copyright 2000 Tom Malloy. All rights reserved

Chapter 11. Experimental Design: One-Way Independent Samples Design

Transcription:

STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES) In baseball after a player gets a hit, they need to decide whether to stop at first base, or try to stretch their hit from a single to a double. Does the path that they take to round first base make much of a difference in how quickly they can get to second? For example (see Figure 1), is it better to take a narrow angle (minimizing the distance) or a wide angle (for improved turn speed) around first base? (This exploration is based on an actual study reported in a master s thesis by W. F. Woodward in 1970.) Figure 1. Two methods of rounding first base when a player plans to run to second base One reasonable experimental design would be to sample 20 players, and randomly assign 10 to run with the wide angle and the other 10 to run with the narrow angle. Date: November 14, 2017. 1

2 STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES) (1) Some runners are faster than others. How does random assignment control for this, so that speed is not likely to be a confounding variable in this study? (Remember the critical property of a confounding variable) (2) The sample statistic x narrow x wide can be influenced by many factors; some systematic, some random. A possible systematic factor is any inherent superiority of one method over the other. Even though random assignment removes any systematic effect of intrinsic player speed (the two groups will be evenly matched on average across all possible assignments into groups), for any particular instance of this experiment, we may happen to get faster runners in one group over another that is, there is still a random influence of player speed which adds variability to the difference. There are other random factors as well that might contribute variability, such as tiredness, wind, basepath condition, etc. These other things are mostly unavoidable, but can you think of a different way of conducting the experiment that would remove the effect of intrinsic player speed on the difference of means?

STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES) 3 One thing we could do is have each runner use both base running angles. That way, our two samples come from the same set of players, and any difference we see has to be due either to the difference in method, or to other random factors like tiredness, wind speed, etc.... (3) If we wanted to test whether these two methods differ, we would set about asking how likely the observed difference is to have arisen by random chance. How does the fact that the samples come from the same players affect the sizes of difference that we expect to see by chance alone? Would you interpret a difference of 0.1 seconds differently if it came from within-player comparisons vs a between-groups comparison? How should this affect your willingness to reject H 0? (4) Suppose we had each player run to second using the wide angle method first, and then the narrow angle method. What problem does this design have?

4 STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES) In the real study, each player used both methods, with a rest in between, and the order was randomly assigned separately for each player. The data for the first ten players below (time in seconds that it took the player to get from the point 35 feet past home to the point 15 feet before second base): Player 1 2 3 4 5 6 7 8 9 10... narrow time 5.50 5.70 5.60 5.50 5.85 5.55 5.40 5.50 5.15 5.80... wide time 5.55 5.75 5.50 5.40 5.70 5.60 5.35 5.35 5.00 5.70... All 22 players data is shown in a pair of dot plots below: narrow wide 5.0 5.2 5.4 5.6 5.8 6.0 6.2 Time (sec.) (5) Based on the dotplots, do you see strong evidence that one method tends to produce slower times than the other? How are you deciding? (6) Considering the paired nature of these data, what would be a better method for comparing these running times? What do you suggest using as the response variable in this study? What population parameter and sample statistic would you focus on?

STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES) 5 With a paired design, we analyze the differences in the response between the two treatments. In this case we would calculate the difference in running times between the wide and narrow angles for each player and then analyze the sample of differences. Below, the dotplots are shown again, but now with lines connecting the pairs of times from each player. narrow wide 5.0 5.2 5.4 5.6 5.8 6.0 6.2 Time (sec.) (7) Do you notice any pattern to the gray lines? Does this change your subjective sense of whether there is a meaningful difference between the two methods? (8) By focusing on the sample of differences for each player, we reduce the data to a single sample (of differences), drawn from a single population (of differences). Our hypotheses concern the population mean of differences, µ D. State the relevant null and alternative hypotheses in terms of this parameter.

6 STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES) (9) Since we have reduced the problem to a single population and a single sample, we can apply our confidence interval and hypothesis test for a single mean without modification. What can we say about the sampling distribution of differences of means? (Shape, center, spread) (10) Define x D as the sample mean of differences, and s D as the standard deviation x of the same. What distribution will the standardized test statistic, D 0 s D / have n if the null hypothesis is true? How can we find a P -value? (Don t actually do it yet; just describe the process) (11) How can we construct a confidence interval? (Don t actually do it yet; just describe the process)

STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES) 7 The sample distribution of differences is shown in the dot plot below. The sample mean of the differences (wide minus narrow) is x D = 0.075 (depicted by the vertical dashed line). The sample standard deviation of differences is s D = 0.09 (the points x ± s are drawn with light dotted lines). 0.2 0.1 0.0 0.1 0.2 Wide Minus Narrow (12) Construct a 95% confidence interval for the population difference, µ D. How did you find the critical values (standardized endpoints)? (13) Compute the P -value for x D under H 0. What do you conclude in context?

8 STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES) (14) For comparison, let s suppose we had the same data, but it came from two separate groups of players. How do you expect the confidence interval and P -value to change? (Hint: think back to question (3)) (15) The individual means are x wide = 5.459 seconds and x narrow = 5.534 seconds. The individual sample standard deviations are s wide = 0.273 seconds and s narrow = 0.260 seconds. Find the estimated standard error for x wide x narrow. (16) Use the standard error, together with a critical T value (you ll need StatKey or R for that part), to compute a 95% confidence interval (use the min(n A 1, n B 1) rule for the degrees of freedom). Was your intuition correct about the relative widths of the intervals? (17) Compute an observed T -statistic, and find a P -value (you ll need StatKey or R to convert your test statistic to a P -value. Was your intuition correct about the relative sizes of the P -values?