Chapter 1: Data Collection Pearson Prentice Hall. All rights reserved

Similar documents
Ch 1.1 & 1.2 Basic Definitions for Statistics

Sampling. (James Madison University) January 9, / 13

Mini-Lecture 1.1 Introduction to the Practice of Statistics

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1

The Nature of Probability and Statistics

Chapter 1 Data Collection

Introduction to Statistics

Vocabulary. Bias. Blinding. Block. Cluster sample

Sta 309 (Statistics And Probability for Engineers)

aps/stone U0 d14 review d2 teacher notes 9/14/17 obj: review Opener: I have- who has

Unit 1 Exploring and Understanding Data

Section 6.1 Sampling. Population each element (or person) from the set of observations that can be made (entire group)

Population. population. parameter. Census versus Sample. Statistic. sample. statistic. Parameter. Population. Example: Census.

AP Statistics Exam Review: Strand 2: Sampling and Experimentation Date:

Introduction to Biostatics.

Chapter 2. The Data Analysis Process and Collecting Data Sensibly. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

You can t fix by analysis what you bungled by design. Fancy analysis can t fix a poorly designed study.

Sampling Reminders about content and communications:

Design, Sampling, and Probability

Chapter 1: Introduction to Statistics

IS STATISTICS 100% CORRECT?

Data Collection. MATH 130, Elements of Statistics I. J. Robert Buchanan. Fall Department of Mathematics

Chapter 1: The Nature of Probability and Statistics

Section 6.1 Sampling. Population each element (or person) from the set of observations that can be made (entire group)

Section 1.1 What is Statistics?

Chapter 3. Producing Data

Variable Data univariate data set bivariate data set multivariate data set categorical qualitative numerical quantitative

CH 13 Experiments and observational Studies

Probability and Statistics Chapter 1 Notes

Examining Relationships Least-squares regression. Sections 2.3

Chapter 3. Producing Data

Slide 1 - Introduction to Statistics Tutorial: An Overview Slide notes

Data = collections of observations, measurements, gender, survey responses etc. Sample = collection of some members (a subset) of the population

General Biostatistics Concepts

Unit 3: Collecting Data. Observational Study Experimental Study Sampling Bias Types of Sampling

INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ

Chapter 1: Exploring Data

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when.

Chapter 1 - Sampling and Experimental Design

HOW STATISTICS IMPACT PHARMACY PRACTICE?

Chapter 5: Producing Data

Example 1. October 1, / 14

Section 1: Exploring Data

Chapter 1 - The Nature of Probability and Statistics

1. If a variable has possible values 2, 6, and 17, then this variable is

CHAPTER 5: PRODUCING DATA

Statistics are commonly used in most fields of study and are regularly seen in newspapers, on television, and in professional work.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Observational study is a poor way to gauge the effect of an intervention. When looking for cause effect relationships you MUST have an experiment.

Chapter 1: Introduction to Statistics

REVIEW FOR THE PREVIOUS LECTURE

Experimental Design There is no recovery from poorly collected data!

Copyright 2014, 2011, and 2008 Pearson Education, Inc. 1-1

UNIT I SAMPLING AND EXPERIMENTATION: PLANNING AND CONDUCTING A STUDY (Chapter 4)

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

Chapter 13. Experiments and Observational Studies

Ch. 1 Collecting and Displaying Data

UNIVERSITY OF THE FREE STATE DEPARTMENT OF COMPUTER SCIENCE AND INFORMATICS CSIS6813 MODULE TEST 2

AP Statistics Experimental Design. Penny Smeltzer

Chapter 1: Introduction to Statistics

Data collection, summarizing data (organization and analysis of data) The drawing of inferences about a population from a sample taken from

MATH 2300: Statistical Methods. What is Statistics?

Overview: Part I. December 3, Basics Sources of data Sample surveys Experiments

AP Stats Review for Midterm

Methodological skills

Review+Practice. May 30, 2012

AP Statistics Chapter 5 Multiple Choice

Class 1. b. Sampling a total of 100 Californians, where individuals are randomly selected from each major ethnic group.

Chapter 1: Statistical Basics

Moore, IPS 6e Chapter 03

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

Test Bank for Privitera, Statistics for the Behavioral Sciences

I. Introduction and Data Collection B. Sampling. 1. Bias. In this section Bias Random Sampling Sampling Error

The Practice of Statistics 1 Week 2: Relationships and Data Collection

Collecting Data Example: Does aspirin prevent heart attacks?

7) A tax auditor selects every 1000th income tax return that is received.

Define the population Determine appropriate sample size Choose a sampling design Choose an appropriate research design

Creative Commons Attribution-NonCommercial-Share Alike License

Introduction to Statistics and Research Design. Arlo Clark-Foos

Chapter 4 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date:

Chapter 1 Overview. Created by Tom Wegleitner, Centreville, Virginia. Copyright 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Lecture (chapter 1): Introduction

Designed Experiments have developed their own terminology. The individuals in an experiment are often called subjects.

Introduction, Evidence, and Sampling

Statistics. Nur Hidayanto PSP English Education Dept. SStatistics/Nur Hidayanto PSP/PBI

CHAPTER 1. YAKUP ARI,Ph.D.(C)

What Is Statistics. Chapter 1

What Is Statistics. Learning Objectives. Definition. Who Uses Statistics? 12/9/2015

Do Now Prob & Stats 8/26/14 What conclusions can you draw from this bar graph?

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies

A) I only B) II only C) III only D) II and III only E) I, II, and III

in explaning the result of research studies In planning and decision making are supported by data

full file at

For each of the following cases, describe the population, sample, population parameters, and sample statistics.

STA Module 1 Introduction to Statistics and Data

Summer AP Statistic. Chapter 4 : Sampling and Surveys: Read What s the difference between a population and a sample?

I. Identifying the question Define Research Hypothesis and Questions

Villarreal Rm. 170 Handout (4.3)/(4.4) - 1 Designing Experiments I

Transcription:

Chapter 1: Data Collection 2010 Pearson Prentice Hall. All rights reserved 1-1

Statistics is the science of collecting, organizing, summarizing, and analyzing information to draw conclusions or answer questions. In addition, statistics is about providing a measure of confidence in any conclusions. The information referred to in the definition is data. Data are a fact or proposition used to draw a conclusion or make a decision. Data describe characteristics of of an individual. 1-2

The entire group of individuals to be studied is called the population. An individual is a person or object that is a member of the population being studied. A sample is a subset of the population that is being studied. 1-3

Descriptive statistics consist of organizing and summarizing data. Descriptive statistics describe data through numerical summaries, tables, and graphs. 1-4

Inferential statistics uses methods that take results from a sample, extends them to the population, and measures the reliability of the result. 1-5

A statistic is a numerical summary based on a sample. A parameter is a numerical summary of a population. 1-6

EXAMPLE PARAMETER VERSUS STATISTIC Statistic or parameter: Suppose the percentage of all students on your campus who have a job is 84.9%. This value represents a parameter because it is a numerical summary of a population. Suppose a sample of 250 students is obtained, and from this sample we find that 86.3% have a job. This value represents a statistic because it is a numerical summary based on a sample. 2010 Pearson Prentice Hall. All rights reserved 1-7

2010 Pearson Prentice Hall. All rights reserved 1-8

Variables are the characteristics of the individuals within the population. The list of observations a variable assumes is called data. 1-9

Qualitative or Categorical variables allow for classification of individuals based on some attribute or characteristic. Quantitative variables provide numerical measures of individuals. Arithmetic operations such as addition and subtraction can be performed on the values of the quantitative variable and provide meaningful results. 1-10

EXAMPLE Distinguishing between Qualitative and Quantitative Variables Researcher Elisabeth Kvaavik and others studied factors that affect the eating habits of adults in their mid-thirties. (Source: Kvaavik E, et. al. Psychological explanatorys of eating habits among adults in their mid-30 s (2005) International Journal of Behavioral Nutrition and Physical Activity (2)9.) Classify each of the following variables considered in the study as qualitative or quantitative. a. Nationality b. Number of children c. Household income in the previous year d. Level of education Qualitative Quantitative Qualitative Quantitative e. Daily intake of whole grains (measured in grams per day) Quantitative 2010 Pearson Prentice Hall. All rights reserved 1-11

A discrete variable is a quantitative variable that either has a finite number of possible values or a countable number of possible values. The term countable means the values result from counting such as 0, 1, 2, 3, and so on. A continuous variable is a quantitative variable that has an infinite number of possible values it can take on and can be measured to any desired level of accuracy. 1-12

EXAMPLE Distinguishing between Qualitative and Quantitative Variables Researcher Elisabeth Kvaavik and others studied factors that affect the eating habits of adults in their mid-thirties. (Source: Kvaavik E, et. al. Psychological explanatorys of eating habits among adults in their mid-30 s (2005) International Journal of Behavioral Nutrition and Physical Activity (2)9.) Classify each of the following quantitative variables considered in the study as discrete or continuous. a. Number of children Discrete b. Household income in the previous year Continuous c. Daily intake of whole grains (measured in grams per day) Continuous 1-13

A variable is at the nominal level of measurement if the values of the variable name, label, or categorize. In addition, the naming scheme does not allow for the values of the variable to be arranged in a ranked, or specific, order. A variable is at the ordinal level of measurement if it has the properties of the nominal level of measurement and the naming scheme allows for the values of the variable to be arranged in a ranked, or specific, order. A variable is at the interval level of measurement if it has the properties of the ordinal level of measurement and the differences in the values of the variable have meaning. A value of zero in the interval level of measurement does not mean the absence of the quantity. Arithmetic operations such as addition and subtraction can be performed on values of the variable. A variable is at the ratio level of measurement if it has the properties of the interval level of measurement and the ratios of the values of the variable have meaning. A value of zero in the ratio level of measurement means the absence of the quantity. Arithmetic operations such as multiplication and division can be performed on the values of the variable. 1-14

EXAMPLE DETERMINING THE LEVEL OF MEASUREMENT OF A VARIABLE A study was conducted to assess school eating patterns in high schools in the United States. The study analyzed the impact of vending machines and school policies on student food consumption. A total of 1088 students in 20 schools were surveyed. (Source: Neumark-Sztainer D, French SA, Hannan PJ, Story M and Fulkerson JA (2005) School lunch and snacking patterns among high school students: associations with school food environment and policies. International Journal of Behavioral Nutrition and Physical Activity 2005, (2)14.) Classify each of the following variables considered in the study as qualitative or quantitative. a. Number of snack and soft drink vending machines in the school Ratio b. Whether or not the school has a closed campus policy during lunch Nominal c. Class rank (Freshman, Sophomore, Junior, Senior) Ordinal d. Number of days per week a student eats school lunch Ratio 2010 Pearson Prentice Hall. All rights reserved 1-15

In research, we wish to determine how varying the amount of an explanatory variable affects the value of a response variable. 1-16

An observational study measures the value of the response variable without attempting to influence the value of either the response or explanatory variables. That is, in an observational study, the researcher observes the behavior of the individuals in the study without trying to influence the outcome of the study. If a researcher assigns the individuals in a study to a certain group, intentionally changes the value of the explanatory variable, and then records the value of the response variable for each group, the researcher is conducting a designed experiment. 1-17

Confounding in a study occurs when the effects of two or more explanatory variables are not separated. Therefore, any relation that may exist between an explanatory variable and the response variable may be due to some other variable or variables not accounted for in the study. A lurking variable is an explanatory variable that was not considered in a study, but that affect the value of the response variable in the study. In addition, lurking variables are typically related to any explanatory variables considered in the study. 1-18

A census is a list of all individuals in a population along with certain characteristics of each individual. 1-19

Random sampling is the process of using chance to select individuals from a population to be included in the sample. 1-20

A sample of size n from a population of size N is obtained through simple random sampling if every possible sample of size n has an equally likely chance of occurring. The sample is then called a simple random sample. 1-21

A stratified sample is one obtained by separating the population into homogeneous, nonoverlapping groups called strata, and then obtaining a simple random sample from each stratum. 1-22

A systematic sample is obtained by selecting every k th individual from the population. The first individual selected is a random number between 1 and k. 1-23

A cluster sample is obtained by selecting all individuals within a randomly selected collection or group of individuals. 1-24

1-25

A convenience sample is one in which the individuals in the sample are easily obtained. Any studies that use this type of sampling generally have results that are suspect. Results should be looked upon with extreme skepticism. 2010 Pearson Prentice Hall. All rights reserved 1-26

If the results of the sample are not representative of the population, then the sample has bias. Three Sources of Bias 1. Sampling Bias 2. Nonresponse Bias 3. Response Bias 1-27

Sampling bias means that the technique used to obtain the individuals to be in the sample tend to favor one part of the population over another. Undercoverage is a type of sampling bias. Undercoverage occurs when the proportion of one segment of the population is lower in a sample than it is in the population. 1-28

Nonresponse bias exists when individuals selected to be in the sample who do not respond to the survey have different opinions from those who do. Nonresponse can be improved through the use of callbacks or rewards/incentives. 1-29

Response bias exists when the answers on a survey do not reflect the true feelings of the respondent. Types of Response Bias 1. Interviewer error 2. Misrepresented answers 3. Words used in survey question 4. Order of the questions or words within the question 1-30

An experiment is a controlled study conducted to determine the effect of varying one or more explanatory variables or factors has on a response variable. Any combination of the values of the factors is called a treatment. The experimental unit (or subject) is a person, object or some other well-defined item upon which a treatment is applied. A control group serves as a baseline treatment that can be used to compare to other treatments. A placebo is an innocuous medication, such as a sugar tablet, that looks, tastes, and smells like the experimental medication. 1-31

Blinding refers to nondisclosure of the treatment an experimental unit is receiving. A single-blind experiment is one in which the experimental unit (or subject) does not know which treatment he or she is receiving. A double-blind experiment is one in which neither the experimental unit nor the researcher in contact with the experimental unit knows which treatment the experimental unit is receiving. 1-32