Chapter 1. Picturing Distributions with Graphs

Similar documents
STP226 Brief Class Notes Instructor: Ela Jackiewicz

Probability and Statistics. Chapter 1

Population. Sample. AP Statistics Notes for Chapter 1 Section 1.0 Making Sense of Data. Statistics: Data Analysis:

Chapter 1: Exploring Data

Medical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months?

Section 1.2 Displaying Quantitative Data with Graphs. Dotplots

AP Statistics. Semester One Review Part 1 Chapters 1-5

2.4.1 STA-O Assessment 2

q2_2 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Section I: Multiple Choice Select the best answer for each question.

Introduction to Statistical Data Analysis I

M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60

Organizing Data. Types of Distributions. Uniform distribution All ranges or categories have nearly the same value a.k.a. rectangular distribution

Lesson 9 Presentation and Display of Quantitative Data

STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points.

Section 1: Exploring Data

Unit 1 Exploring and Understanding Data

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Stem-and-Leaf Displays. Example: Binge Drinking. Stem-and-Leaf Displays 1/29/2016. Section 3.2: Displaying Numerical Data: Stem-and-Leaf Displays

Choosing a Significance Test. Student Resource Sheet

LOTS of NEW stuff right away 2. The book has calculator commands 3. About 90% of technology by week 5

V. Gathering and Exploring Data

Statistical Methods Exam I Review

Chapter 1. Where Do Data Come From? Chapter 1 1

Statistics. Nur Hidayanto PSP English Education Dept. SStatistics/Nur Hidayanto PSP/PBI

Test 1C AP Statistics Name:

Summarizing Data. (Ch 1.1, 1.3, , 2.4.3, 2.5)

Introduction. Lecture 1. What is Statistics?

10/4/2007 MATH 171 Name: Dr. Lunsford Test Points Possible

Outline. Practice. Confounding Variables. Discuss. Observational Studies vs Experiments. Observational Studies vs Experiments

Chapter 1: Introduction to Statistics

Undertaking statistical analysis of

Stats 95. Statistical analysis without compelling presentation is annoying at best and catastrophic at worst. From raw numbers to meaningful pictures

PRINTABLE VERSION. Quiz 1. True or False: The amount of rainfall in your state last month is an example of continuous data.

CHAPTER 3 Describing Relationships

12.1 Inference for Linear Regression. Introduction

Data, frequencies, and distributions. Martin Bland. Types of data. Types of data. Clinical Biostatistics

AP Stats Review for Midterm

Chapter 1: Exploring Data

Unit 7 Comparisons and Relationships

Math 2200 First Mid-Term Exam September 22, 2010

Analysis and Interpretation of Data Part 1

CCM6+7+ Unit 12 Data Collection and Analysis

Biostatistics. Donna Kritz-Silverstein, Ph.D. Professor Department of Family & Preventive Medicine University of California, San Diego

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions

AP Psych - Stat 1 Name Period Date. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Name AP Statistics UNIT 1 Summer Work Section II: Notes Analyzing Categorical Data

Still important ideas

AP STATISTICS 2010 SCORING GUIDELINES (Form B)

Chapter 1: Explaining Behavior

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions

Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points.

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables

Identify two variables. Classify them as explanatory or response and quantitative or explanatory.

Paper Airplanes & Scientific Methods

UF#Stats#Club#STA#2023#Exam#1#Review#Packet# #Fall#2013#

What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Frequency distributions

Making Inferences from Experiments

A) I only B) II only C) III only D) II and III only E) I, II, and III

Test 1 Version A STAT 3090 Spring 2018

Quantitative Methods in Computing Education Research (A brief overview tips and techniques)

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604

Frequency Distributions

Example The median earnings of the 28 male students is the average of the 14th and 15th, or 3+3

STOR 155 Section 2 Midterm Exam 1 (9/29/09)

Business Statistics Probability

M 140 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

Lesson 1: Distributions and Their Shapes

Lauren DiBiase, MS, CIC Associate Director Public Health Epidemiologist Hospital Epidemiology UNC Hospitals

SCATTER PLOTS AND TREND LINES

CHAPTER 2. MEASURING AND DESCRIBING VARIABLES

MINUTE TO WIN IT: NAMING THE PRESIDENTS OF THE UNITED STATES

Averages and Variation

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010

Frequency Distributions

Observational studies; descriptive statistics

(a) 50% of the shows have a rating greater than: impossible to tell

How to interpret scientific & statistical graphs

Displaying the Order in a Group of Numbers Using Tables and Graphs

Readings: Textbook readings: OpenStax - Chapters 1 4 Online readings: Appendix D, E & F Online readings: Plous - Chapters 1, 5, 6, 13

CP Statistics Sem 1 Final Exam Review

STAT243 LS: Intro to Probability and Statistics Quiz 1, Feb 10, 2017 KEY

STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS

Still important ideas

Missy Wittenzellner Big Brother Big Sister Project

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

Distributions and Samples. Clicker Question. Review

PubHlth Introductory Biostatistics Practice Test I (Without Unit 3 Questions)

Understandable Statistics

Lecture 10: Chapter 5, Section 2 Relationships (Two Categorical Variables)

Statistics: Interpreting Data and Making Predictions. Interpreting Data 1/50

STA Module 9 Confidence Intervals for One Population Mean

Unit 1 Outline Science Practices. Part 1 - The Scientific Method. Screencasts found at: sciencepeek.com. 1. List the steps of the scientific method.

AP Psych - Stat 2 Name Period Date. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MA 151: Using Minitab to Visualize and Explore Data The Low Fat vs. Low Carb Debate

Department of Statistics TEXAS A&M UNIVERSITY STAT 211. Instructor: Keith Hatfield

REVIEW from Chapter 1 : Key Elements of a Statistical Problem

Transcription:

Chapter 1 Picturing Distributions with Graphs

Statistics Statistics is a science that involves the extraction of information from numerical data obtained during an experiment or from a sample. It involves the design of the experiment or sampling procedure, the collection and analysis of the data, and making inferences (statements) about the population based upon information in a sample.

Individuals and Variables Individuals (outcomes) the objects described by a set of data may be people, animals, or things Variable any characteristic of an individual can take different values for different individuals

Variables Categorical Places an individual into one of several groups or categories Quantitative (Numerical) Takes numerical values for which arithmetic operations such as adding and averaging make sense

Case Study The Effect of Hypnosis on the Immune System reported in Science News, Sept. 4, 1993, p. 153

Case Study The Effect of Hypnosis on the Immune System Objective: To determine if hypnosis strengthens the disease-fighting capacity of immune cells.

Case Study 65 college students. 33 easily hypnotized 32 not easily hypnotized white blood cell counts measured all students viewed a brief video about the immune system.

Case Study Students randomly assigned to one of three conditions subjects hypnotized, given mental exercise subjects relaxed in sensory deprivation tank control group (no treatment)

Case Study white blood cell counts re-measured after one week the two white blood cell counts are compared for each group results hypnotized group showed larger jump in white blood cells easily hypnotized group showed largest immune enhancement

Case Study Variables measured categorical quantitative Easy or difficult to achieve hypnotic trance Group assignment Pre-study white blood cell count Post-study white blood cell count

Case Study Weight Gain Spells Heart Risk for Women Weight, weight change, and coronary heart disease in women. W.C. Willett, et. al., vol. 273(6), Journal of the American Medical Association, Feb. 8, 1995. (Reported in Science News, Feb. 4, 1995, p. 108)

Case Study Weight Gain Spells Heart Risk for Women Objective: To recommend a range of body mass index (a function of weight and height) in terms of coronary heart disease (CHD) risk in women.

Case Study Study started in 1976 with 115,818 women aged 30 to 55 years and without a history of previous CHD. Each woman s weight (body mass) was determined. Each woman was asked her weight at age 18.

Case Study The cohort of women were followed for 14 years. The number of CHD (fatal and nonfatal) cases were counted (1292 cases).

Case Study Variables measured quantitative categorical Age (in 1976) Weight in 1976 Weight at age 18 Incidence of coronary heart disease Smoker or nonsmoker Family history of heart disease

picture) Distribution (getting the quantitative Tells what values a variable takes and how often it takes these values Can be a table, graph, or function

Displaying Distributions Categorical variables Pie charts (when categories make a whole) Bar graphs Quantitative variables Histograms Stemplots (stem-and-leaf plots)

Class Make-up on First Day Data Table Year Count Percent Freshman 18 41.9% Sophomore 10 23.3% Junior 6 14.0% Senior 9 20.9% Total 43 100.1%

Class Make-up on First Day Pie Chart

Class Make-up on First Day Bar Graph

Example: U.S. Solid Waste (2000) Data Table Material Weight (million tons) Percent of total Food scraps 25.9 11.2 % Glass 12.8 5.5 % Metals 18.0 7.8 % Paper, paperboard 86.7 37.4 % Plastics 24.7 10.7 % Rubber, leather, textiles 15.8 6.8 % Wood 12.7 5.5 % Yard trimmings 27.7 11.9 % Other 7.5 3.2 % Total 231.9 100.0 %

Example: U.S. Solid Waste (2000) Pie Chart

Example: U.S. Solid Waste (2000) Bar Graph (cannot use Pie Chart since categories do not make a whole)

Histograms For quantitative variables that take many values Divide the possible values into class intervals (we will only consider equal widths) Count how many observations fall in each interval (may change to percents) Draw picture representing distribution

Case Study Weight Data Introductory Statistics class Spring, 1997 Virginia Commonwealth University

Weight Data 192 110 195 180 170 215 152 120 170 130 130 125 135 185 120 155 101 194 110 165 185 220 180 128 212 175 140 187 180 119 203 157 148 260 165 185 150 106 170 210 123 172 180 165 186 139 175 127 150 100 106 133 124

Weight Data: Frequency Table Weight Group Count 100-120 9 120< - 140 11 140< - 160 6 160< - 180 13 180< - 200 8 200< - 220 5 220< - 240 0 240< - 260 1 sqrt(53) = 7.2, or 8 intervals; range (260 100=160) / 8 = 20 = class width

Weight Data: Histogram Number of students 100 120 140 160 180 200 220 240 260 Weight * Left endpoint is included in the group, right endpoint is not.

Histograms: Class Intervals How many intervals? One rule is to calculate the square root of the sample size, and round up. Size of intervals? Divide range of data (max min) by number of intervals desired, and round to convenient number Pick intervals so each observation can only fall in exactly one interval (no overlap)

Examining the Distribution of Quantitative Data Overall pattern of graph Deviations from overall pattern Shape of the data Center of the data Spread of the data (Variation) Outliers

Shape of the Data Symmetric bell shaped other symmetric shapes Asymmetric right skewed left skewed Unimodal, bimodal

Symmetric Bell-Shaped

Symmetric Mound-Shaped

Symmetric Uniform

Asymmetric Skewed to the Left

Asymmetric Skewed to the Right

Outliers Extreme values that fall outside the overall pattern (the data point 260 in previous histogram example) May occur naturally May occur due to error in recording May occur due to error in measuring

Time Plots A time plot shows behavior over time. Time is always on the horizontal axis, and the variable being measured is on the vertical axis. Look for an overall pattern (trend), and deviations from this trend. Connecting the data points by lines may emphasize this trend. Look for patterns that repeat at known regular intervals (seasonal variations).

Class Make-up on First Day (Fall Semesters: 1985-1993) 70% 60% Class Make-up On First Day Percent of Class That Are Freshman 50% 40% 30% 20% 10% 0% 1985 1986 1987 1988 1989 1990 1991 1992 1993 Year of Fall Semester

Average Tuition (Public vs. Private)

Stemplots (Stem-and-Leaf Plots) For quantitative variables Separate each observation into a stem (first part of the number) and a leaf (the remaining part of the number) Write the stems in a vertical column; draw a vertical line to the right of the stems Write each leaf in the row to the right of its stem; order leaves if desired

1 2 Weight Data 192 110 195 180 170 215 152 120 170 130 130 125 135 185 120 155 101 194 110 165 185 220 180 128 212 175 140 187 180 119 203 157 148 260 165 185 150 106 170 210 123 172 180 165 186 139 175 127 150 100 106 133 124

Weight Data: Stemplot (Stem & Leaf Plot) Key 20 3 means 203 pounds Stems = 10 s Leaves = 1 s 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 5 2 2 192 152 135

Weight Data: Stemplot (Stem & Leaf Plot) Key 20 3 means 203 pounds Stems = 10 s Leaves = 1 s 10 0166 11 009 12 0034578 13 00359 14 08 15 00257 16 555 17 000255 18 000055567 19 245 20 3 21 025 22 0 23 24 25 26 0

Extended Stem-and-Leaf Plots If there are very few stems (when the data cover only a very small range of values), then we may want to create more stems by splitting the original stems.

Extended Stem-and-Leaf Plots Example: if all of the data values were between 150 and 179, then we may choose to use the following stems: 15 15 16 16 17 17 Leaves 0-4 would go on each upper stem (first 15 ), and leaves 5-9 would go on each lower stem (second 15 ).