Chapter 3. Producing Data

Similar documents
Chapter 3. Producing Data

CHAPTER 5: PRODUCING DATA

Vocabulary. Bias. Blinding. Block. Cluster sample

aps/stone U0 d14 review d2 teacher notes 9/14/17 obj: review Opener: I have- who has

Sampling. (James Madison University) January 9, / 13

AP Statistics Exam Review: Strand 2: Sampling and Experimentation Date:

Chapter 5: Producing Data

UNIT I SAMPLING AND EXPERIMENTATION: PLANNING AND CONDUCTING A STUDY (Chapter 4)

Observational study is a poor way to gauge the effect of an intervention. When looking for cause effect relationships you MUST have an experiment.

3.2 Designing Experiments

Chapter 9. Producing Data: Experiments. BPS - 5th Ed. Chapter 9 1

Chapter 3 Producing Data

Section 6.1 Sampling. Population each element (or person) from the set of observations that can be made (entire group)

Chapter 1 - Sampling and Experimental Design

Math 140 Introductory Statistics

Examining Relationships Least-squares regression. Sections 2.3

Section 6.1 Sampling. Population each element (or person) from the set of observations that can be made (entire group)

Sta 309 (Statistics And Probability for Engineers)

Chapter 2. The Data Analysis Process and Collecting Data Sensibly. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

Unit 1 Exploring and Understanding Data

Variable Data univariate data set bivariate data set multivariate data set categorical qualitative numerical quantitative

You can t fix by analysis what you bungled by design. Fancy analysis can t fix a poorly designed study.

Chapter 1 Data Collection

AP Statistics Chapter 5 Multiple Choice

More Designs. Section 4.2B

REVIEW FOR THE PREVIOUS LECTURE

Observation Studies, Sampling Designs and Bias

Designed Experiments have developed their own terminology. The individuals in an experiment are often called subjects.

Chapter 1: Exploring Data

Chapter 5: Producing Data Review Sheet

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1

Chapter 4 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Objectives. Data Collection 8/25/2017. Section 1-3. Identify the five basic sample techniques

Unit 3: Collecting Data. Observational Study Experimental Study Sampling Bias Types of Sampling

Problems for Chapter 8: Producing Data: Sampling. STAT Fall 2015.

Population. population. parameter. Census versus Sample. Statistic. sample. statistic. Parameter. Population. Example: Census.

Chapter 1: Data Collection Pearson Prentice Hall. All rights reserved

Experimental Design There is no recovery from poorly collected data!

Soci708 Statistics for Sociologists

august 3, 2018 What do you think would have happened if we had time to do the same activity but with a sample size of 10?

Define the population Determine appropriate sample size Choose a sampling design Choose an appropriate research design

STA 291 Lecture 4 Jan 26, 2010

Data = collections of observations, measurements, gender, survey responses etc. Sample = collection of some members (a subset) of the population

Chapter 13 Summary Experiments and Observational Studies

Sampling Reminders about content and communications:

P. 266 #9, 11. p. 289 # 4, 6 11, 14, 17

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

MAT 155. Chapter 1 Introduction to Statistics. Key Concept. Basics of Collecting Data. August 20, S1.5_3 Collecting Sample Data

An observational study observes individuals and measures variables of interest but does not attempt to influence the responses.

Section 1.1 What is Statistics?

MATH-134. Experimental Design

Chapter 1: Introduction to Statistics

CHAPTER 9: Producing Data: Experiments

CHAPTER 4 Designing Studies

Dr. Allen Back. Oct. 7, 2016

General Biostatistics Concepts

Math 124: Modules 3 and 4. Sampling. Designing. Studies. Studies. Experimental Studies Surveys. Math 124: Modules 3 and 4. Sampling.

I. Introduction and Data Collection B. Sampling. 1. Bias. In this section Bias Random Sampling Sampling Error

Ch 1.1 & 1.2 Basic Definitions for Statistics

Probability and Statistics Chapter 1 Notes

Experimental and survey design

AP Statistics Exam III Multiple Choice Questions

GATHERING DATA. Chapter 4

Intro to Survey Design and Issues. Sampling methods and tips

Review+Practice. May 30, 2012

Math 124: Module 3 and Module 4

Aim: Intro Chp. 4 Designing Studies

Section 4.3 Using Studies Wisely. Honors Statistics. Aug 23-8:26 PM. Daily Agenda. 1. Check homework C4# Group Quiz on

STAT 111 SEC 006 PRACTICE EXAM 1: SPRING 2007

Quiz 4.1C AP Statistics Name:

Overview: Part I. December 3, Basics Sources of data Sample surveys Experiments

3. For a $5 lunch with a 55 cent ($0.55) tip, what is the value of the residual?

Dr. Allen Back. Sep. 30, 2016

Unit notebook May 29, 2015

3.2 Designing Experiments

Sampling for Success. Dr. Jim Mirabella President, Mirabella Research Services, Inc. Professor of Research & Statistics

Handout 16: Opinion Polls, Sampling, and Margin of Error

Lecture 4: Chapter 3, Section 4 Designing Studies (Focus on Experiments)

Review. Chapter 5. Common Language. Ch 3: samples. Ch 4: real world sample surveys. Experiments, Good and Bad

Lecture 4: Chapter 3, Section 4. (Focus on Experiments) Designing Studies. Looking Back: Review. Definitions

Design, Sampling, and Probability

For each of the following cases, describe the population, sample, population parameters, and sample statistics.

Chapter 13. Experiments and Observational Studies

Villarreal Rm. 170 Handout (4.3)/(4.4) - 1 Designing Experiments I

I can explain how under coverage, nonresponse, and question wording can lead to bias in a sample survey. Strive p. 67; Textbook p.

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date:

MATH 2300: Statistical Methods. What is Statistics?

Introduction, Evidence, and Sampling

Name: Class: Date: 1. Use Scenario 4-6. Explain why this is an experiment and not an observational study.

Gathering. Useful Data. Chapter 3. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc.

Moore, IPS 6e Chapter 03

Methodological skills

Sampling and Data Collection

Handout 1: Introduction to the Research Process and Study Design STAT 335 Fall 2016

Chapter 2 Designing Observational Studies and Experiments Section 1 Simple Random Sampling

Ch 4 Practice Test. Multiple Choice Identify the choice that best completes the statement or answers the question. Scenario 4-1

Lecture Start

Chapter 5 & 6 Review. Producing Data Probability & Simulation

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

Psychology: The Science

Transcription:

Chapter 3 Producing Data

Types of data collected Anecdotal data data collected haphazardly (not representative!!) Available data existing data (examples: internet, library, census bureau,.) Gather own data (takes money and time to get own data)

Some terminology Population the entire group of individuals or objects of interest (answers the question: Who?) Sample subset of the population on which information is obtained. Census-sample is the entire population Variable characteristics of interest

Observational study vs Experiment Observational study A study that observes individuals and measures variables of interest but does not attempt to influence the response. Experiment A study that imposes some treatment on individuals in order to record their response.

Types of variables Response variable the outcome of the study. Explanatory variable variable(s) that attempt to explain the changes in the response Examples: Smoking and lung cancer Running on a treadmill and heart rate

Classroom Examples One study of cell phones and the risk of brain cancer looked at a group of 469 people who have brain cancer. The investigators matched each cancer patient with a person of the same sex, age, and race who did not have brain cancer, then asked about use of cell phones. Result: Our data suggest that use of handheld cellular telephones is not associated with the risk of brain cancer. Is this an observational study or experiment? Why? What are the explanatory and response variables? A typical hour of prime-time television shows 3-5 violent acts. Linking family interviews and police records shows a clear association between time spent watching TV as a child and later aggressive behavior. Is this an observational study or experiment? What are the explanatory and response variables? Suggest some lurking variables that could explain the aggressive behavior. An educational software company wants to compare the effectiveness of its computer animation for teaching cell biology with that of a textbook presentation. The company tests the biological knowledge of each group of first year college students, then randomly divides them into two groups. One group uses the animation, and the other studies the text. The company retests all the students and compares the increase in understanding of cell biology in the two groups. Is this an observational study or experiment? What are the explanatory and response variables?

3.1 Design of Experiments Experimental units individual on which experiment is done. Treatment specific experimental condition Factors = explanatory variables Placebo false treatment to control for psychological effects. Example: Gastric freezing is a clever treatment for ulcers in the upper intestine. The patient swallows a deflated balloon with tubes attached, then a refrigerated liquid is pumped through the balloon for an hour (cooling will reduce production of acid and relieve ulcers). An experiment reported in the Journal of the American Medical Association showed that gastric freezing did reduce acid production and relieve ulcer pain. Later experiment included a control group (34% of the treatment group improved..38% of the placebo group improved). Joint effects combination of levels of two or more factors. Example: A maker of fabric for clothing is setting up a new line to finish the raw fabric. The line will use either metal rollers or natural-bristle rollers to raise the surface of the fabric; a dyeing cycle time of either 30 minutes or 40 minutes and a temperature of either 150 or 175 degrees Celsius. Four specimens of fabric will be subjected to each treatment and scored for quality. What are the factors and the treatments? How many units (fabric specimens) does the experiment require?

Experiments continued Experiments provide good evidence for causation (able to control lurking variables) Confounded variables variable(s) associated with the response, but are not of interest; effects cannot be separated from the effect of the explanatory explanatory variable on the response Bias systematically favors certain outcomes.

Experiments continued Randomization is very important in experiments helps to ensure groups are as similar as possible. The three principles of Experimental Design are Control Randomize Repeat How can we randomize? Draw names out of a hat, use table of random digits, computer software (calculator), phone-random digit dialing

Using R to randomize First, you need to set the seed > set.seed(put seed number in here) Then sample >sample(seq(1:n),sample size,replace=false) Assign class to two groups

Completely Randomized Design (with one treatment group and one control group) Group 1-Treatment Random Assignment Group 2- Control Compare results

More on Experiments Single blind individual receiving treatment does not know what treatment they are receiving. Double blind individual getting treatment and individual recording outcome do not know which treatment was administered.

Block design One way to control for confounding variables is to block on them. A block design first breaks the experimental units into blocks according to the blocking variable (for example, if one is blocking on gender, first place units into female and male blocks ).

Example of Block Design The progress of a type of cancer differs in women and men. A clinical experiment to compare three therapies for this cancer therefore treats sex as a blocking variable. Two separate randomizations are done, one assigning the female subjects to the treatment and the other assigning the male subjects. Draw this design.

Matched Pairs Design A special type of block design is called Matched pairs design. Can only compare two treatments (hence the pairs ). Block usually consists of units as similar as possible (self, twins, husband and wife).

Example of Matched Pairs design Does talking on a hands-free cell phone distract drivers? Undergraduate students drove in a high-fidelity driving simulator equipped with hands-free cell phone. Each student drove once while talking on the cell phone and once without talking on the cell phone. The order for each student was randomly assigned. The car ahead breaks: how quickly does the subject respond?

3.2 Sampling Design Voluntary response sample (call-in polls, comment cards) are very biased bad sampling design. Want to get a probability sample. A probability sample is a sample chosen by chance (will look at four of them in this course).

Different Types of Probability samples SRS (Simple Random Sample) every sample of size n has the same chance of being selected. Stratified random sample first divide into groups (strata), and then take a SRS from each stratum. Cluster sample first divide into clusters, and then take a SRS of clusters (once a cluster is chosen, every unit in that cluster is in the sample).

Probability samples continued Multistage sampling design at each stage, a probability sample is obtained. Problems with sample surveys Undercoverage Nonresponse Response bias

Towards statistical inference Use information from sample (known information) to infer about the population (unknown) Statistics information from a sample Parameter information from a population Sampling variability information from a sample will differ from one sample to the next. Sample statistics will have a predictable pattern (referred to as sampling distribution)

Bias and variability Figure 3.14 Introduction to the Practice of Statistics, Sixth Edition

Figure 3.15 Introduction to the Practice of Statistics, Sixth Edition 2009 W.H. Freeman and Company

3.4 Continued Want statistics that are unbiased and have low variability. How can we eliminate or at least reduce the bias? Use a random sample and good instruments. How to increase precision? Larger sample Population size does not effect precision!!! Sample size does.

Definition, pg 184 Introduction to the Practice of Statistics, Sixth Edition 2009 W.H. Freeman and Company Statistical Significance

Definition, pg 225 Introduction to the Practice of Statistics, Sixth Edition 2009 W.H. Freeman and Company Ethics