Bias in Sampling. MATH 130, Elements of Statistics I. J. Robert Buchanan. Fall Department of Mathematics

Similar documents
Confidence Intervals and Sampling Design. Lecture Notes VI

MATH& 146 Lesson 4. Section 1.3 Study Beginnings

Problems for Chapter 8: Producing Data: Sampling. STAT Fall 2015.

Sampling and Data Collection

Math 124: Modules 3 and 4. Sampling. Designing. Studies. Studies. Experimental Studies Surveys. Math 124: Modules 3 and 4. Sampling.

Data = collections of observations, measurements, gender, survey responses etc. Sample = collection of some members (a subset) of the population

Principle underlying all of statistics

Math 124: Module 3 and Module 4

MATH-134. Experimental Design

Sampling Controlled experiments Summary. Study design. Patrick Breheny. January 22. Patrick Breheny Introduction to Biostatistics (BIOS 4120) 1/34

If you could interview anyone in the world, who. Polling. Do you think that grades in your school are inflated? would it be?

Introduction to Statistics

BIAS: The design of a statistical study shows bias if it systematically favors certain outcomes.

Introduction; Study design

Study Design. Study design. Patrick Breheny. January 23. Patrick Breheny Introduction to Biostatistics (171:161) 1/34

Ch 4 Practice Test. Multiple Choice Identify the choice that best completes the statement or answers the question. Scenario 4-1

STATISTICS: METHOD TO GET INSIGHT INTO VARIATION IN A POPULATIONS If every unit in the population had the same value,say

Chapter 3. Producing Data

THE PUBLIC S PRIORITIES FOR CONGRESS AND PRESIDENT TRUMP IN THE POST- THANKSGIVING PERIOD

ZIKA VIRUS AND THE ELECTION SEASON

Sampling Reminders about content and communications:

Homework #2 is due next Friday at 5pm.

How are polls conducted? by Frank Newport, Lydia Saad, David Moore from Where America Stands, 1997 John Wiley & Sons, Inc.

Chapter 5: Producing Data Review Sheet

Chapter 7. Sampling Techniques

Math 140 Introductory Statistics

RECOMMENDED CITATION: Pew Research Center, October 2014, Most Are Confident in Government s Ability to Prevent Major Ebola Outbreak in U.S.

Chapter 1 Data Collection

ELECTION NIGHT 2018 HOT SHEET

OPIOID USE IN NEW JERSEY: PUBLIC FAVORS TREATMENT MORE THAN PENALTIES

1 PEW RESEARCH CENTER

Serbia Tracking Poll Fourth Wave. Prepared By Penn, Schoen & Berland Associates Inc. September 20, 2000

Center for Urban Initiatives and Research Wisconsin Public Health Survey December 2011 N=626. Frequency Tables (Weighted)

Chapter 1: Data Collection Pearson Prentice Hall. All rights reserved

Polling. Polling. History of Polling

8.2 Relative Frequency

THE OPIOID CRISIS: AMERICANS FAVOR COMPASSION AND TREATMENT FOR ADDICTS OF ALL STRIPES

Item Nonresponse and the 10-Point Response Scale in Telephone Surveys 1

Moore, IPS 6e Chapter 03

STA 291 Lecture 4 Jan 26, 2010

PRESIDENT OBAMA, THE SUPREME COURT AND THE ECONOMY May 6-12, 2009

Oncology Clinical Research & Race: Statistical Principles

Variable Data univariate data set bivariate data set multivariate data set categorical qualitative numerical quantitative

OPINIONS CONCERNING DRUG LAW REFORM IN HAWAII

WEDNESDAY JUNE 20, 2018

Experimental Design There is no recovery from poorly collected data!

CAMPAIGN FOR TOBACCO-FREE KIDS SURVEY OF WASHINGTON VOTERS Raising the Age of Tobacco Sales DECEMBER 2015.

American Views on Stem Cell Research Summary of Survey Findings. Results for America

NEW JERSEY RESIDENTS DON T KNOW OR LIKE MUCH ABOUT COMMON CORE

Oregon Political Issues N = 618 Active Voters April 28 30

Attitudes about Opioids among North Carolina Voters

Serbia Election Tracking Polls. Prepared By Penn, Schoen & Berland Associates Inc. September 1, 2000

Funding Health and Long-Term Care: A Survey on Increasing North Carolina's Cigarette and Alcohol Taxes

THE PUBLIC AND GENETIC EDITING, TESTING, AND THERAPY

Tim Johnson Survey Research Laboratory University of Illinois at Chicago September 2011

Public Views of the Zika Virus Outbreak

AP Statistics Exam Review: Strand 2: Sampling and Experimentation Date:

CHAPTER 2 SOCIOLOGICAL RESEARCH METHODS

AP Statistics Chapter 5 Multiple Choice

Two Branches Of Statistics

Sampling for Success. Dr. Jim Mirabella President, Mirabella Research Services, Inc. Professor of Research & Statistics

Section Introduction

FINNISH NATIONAL HEALTH INTERVIEW SURVEY USING BLAISE CATI

Too much About Right Too Little 10% 46% 37%

RECOMMENDED CITATION: Pew Research Center, February 2015, 83% Say Measles Vaccine Is Safe for Healthy Children

Arab American Voters 2014

Lecture 7 Section 2.5. Mon, Sep 8, 2008

People have used random sampling for a long time

JUNE 2000 HEALTH NEWS INTEREST INDEX

Diageo/The Hotline Poll Conducted By:

Chapter 4 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Political Science 15, Winter 2014 Final Review

Sampling. (James Madison University) January 9, / 13

MississippiTaxeson CigaretesandFood:A SurveyofSelf-Identified RegisteredVotersAge18+

NEW JERSEY: LEGAL WEED SEEN AS ECONOMIC BOON

Creative Commons Attribution-NonCommercial-Share Alike License

Poll of New York State Voters in Target State Senate Districts!

Lecture Start

P. 266 #9, 11. p. 289 # 4, 6 11, 14, 17

Section 1.1 What is Statistics?

A Survey of Public Opinion on Secondhand Smoke Related Issues in Bourbon County, KY

RECOMMENDED CITATION: Pew Research Center, December, 2014, Perceptions of Job News Trend Upward

Survey of Pennsylvanians on the Issue of the Swine Flu KEY FINDINGS REPORT

2015 Survey on Prescription Drugs

CHAPTER 5: PRODUCING DATA

SURVEY RESEARCH. Topic #9. Measurement and assessment of opinions, attitudes, etc. Usually by means of questionnaires and sampling methods.

THE ARAB AMERICAN VOTE September 27, 2012

Orange County Voter Survey

involving young people in decision making a survey of local authorities research briefing 10 August 2001

For each of the following cases, describe the population, sample, population parameters, and sample statistics.

Majority approve of legalized, regulated, taxed cannabis

Increasing the Cigarette Tax Rate in Wyoming to Maintain State Programs: An AARP Survey

Chapter 2. The Data Analysis Process and Collecting Data Sensibly. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

GENDER AND RESPONSE EFFECTS IN A PRE-ELECTION POLL: ILLINOIS 1992

Issues in the 2000 Election: Health Care

Review+Practice. May 30, 2012

NEW JERSEY: SUPPORT FOR LEGAL WEED STAYS HIGH

FINAL WEIGHTED FREQUENCIES N=1,315 GEVs Including Oversamples

august 3, 2018 What do you think would have happened if we had time to do the same activity but with a sample size of 10?

Americans Views on Moonshot Initiative and Cancer Research

Transcription:

Bias in Sampling MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2018

Bias If the results of the sample are not representative of the population, then the sample has bias. Remark: bias could mean to give preference to selecting some individuals over others; it could also mean that certain responses are more likely to occur in the sample than in the population.

Sources of Bias There are three sources of bias: 1. Sampling Bias 2. Nonresponse Bias 3. Response Bias

Sampling Bias Sampling bias means that the technique used to obtain the individuals to be in the sample tends to favor one part of the population over another. Undercoverage is a type of sampling bias. Undercoverage occurs when the proportion of one segment of the population is lower in a sample than it is in the population.

Example: 1936 Presidential Election The magazine Literary Digest predicted that Alfred M. Landon would defeat Franklin D. Roosevelt in the 1936 presidential election. The Literary Digest conducted a poll based on a list of its subscribers, telephone directories, and automobile owners. On the basis of the results, the Literary Digest predicted that Landon would win with 57% of the popular vote. However, Roosevelt won the election with about 62% of the popular vote. This election took place during the height of the Great Depression. In 1936, most subscribers to the magazine, households with telephones, and automobile owners were Republican, the party of Landon. There was undercoverage of Democrats.

Nonresponse Bias Nonresponse bias exists when individuals selected to be in the sample who do not respond to the survey have different opinions from those who do. Remark: Nonresponse can be improved through the use of callbacks or rewards/incentives.

Example: Nonresponse Bias The federal government s Current Population Survey has a response rate of about 92%, but it varies depending on the age of the individual. For example, the response rate for 20- to 29-year-olds is 85%, and for individuals 70 and older, it is 99%. Response rates in random digit dialing (RDD) telephone surveys are typically around 70%, e-mail survey response rates hover around 40%, and mail surveys can have response rates as high as 60%.

Response Bias Response bias exists when the answers on a survey do not reflect the true feelings of the respondent. Types of response bias: Interviewer error Misrepresented answers (ask someone how many push-ups they can do in 1 minute versus having them show you how many push-ups they can do in 1 minute). Words used in survey question Order of the questions or words within the question

Words Used in Survey Question Consider two ways of asking for opinions regarding estate taxes: 1. Do you oppose the reduction of estate taxes? 2. Do you favor or oppose the reduction of estate taxes? The second question is considered balanced. How would responses to the following two questions likely differ? 1. Do you think the United States should forbid public speeches against democracy? 2. Do you think the United States should allow public speeches against democracy?

Ordering of Questions or Words Many surveys will rearrange the order of the questions within a questionnaire so that responses are not affected by prior questions. Consider the following two questions: 1. Do you think the United States should let Communist newspaper reporters from other countries come in here and send back to their papers the news as they see it? 2. Do you think a Communist country such as Russia should let American newspaper reporters come in and send back to America the news as they see it? Effect of question order: For surveys conducted in 1980 in which the questions appeared in the order (1, 2), 54.7% of respondents answered yes to (1) and 63.7% answered yes to (2). If the questions were ordered (2, 1), then 74.6% answered yes to (1) and 81.9% answered yes to (2).

Open Questions An open question allows the respondent to express his or her response. Example What is the most important problem facing America s youth today?

Closed Questions A closed question requires the respondent to choose from a list of predetermined responses. Example What is the most important problem facing America s youth today? 1. Drugs 2. Violence 3. Single-parent homes 4. Promiscuity 5. Peer pressure Remark: In closed questions, the possible responses should be rearranged because respondents are likely to choose early choices in a list rather than later choices.

Sources of Error in Sampling Non-sampling errors are errors that result from the survey process. They are due to the nonresponse of individuals selected to be in the survey, inaccurate responses, poorly worded questions, or bias in the selection of individuals to be given the survey. Sampling error is the error that results from using sampling to estimate the information regarding a population. This type of error occurs because a sample gives incomplete information about the population.