Statistics: Interpreting Data and Making Predictions. Interpreting Data 1/50

Size: px
Start display at page:

Download "Statistics: Interpreting Data and Making Predictions. Interpreting Data 1/50"

Transcription

1 Statistics: Interpreting Data and Making Predictions Interpreting Data 1/50

2 Last Time Last time we discussed central tendency; that is, notions of the middle of data. More specifically we discussed the mean (= average) and median. Depending on the situation, one of these may be more indicative of the middle of the data than the others. We will see today that the talking about the middle of the data shows only part of the story. On Tuesday we discussed some weather data for San Francisco. Let s look at weather data for both San Francisco and Las Cruces. We will also calculate the mean and median for both data sets. Interpreting Data 2/50

3 Histograms (= Bar Charts) Since numbers often do not resonate with people, giving visual representations of data often makes the data more meaningful and understandable. We will talk about a couple ways to represent data visually. One we saw last time. A histogram, or bar chart, is the most common way to represent numerical data. We ll illustrate histograms with weather data for high temperatures on January 1 in San Francisco. We ll see some different pictures representing the same data. Interpreting Data 3/50

4 Here is a histogram we saw last time. Interpreting Data 4/50

5 Each value, in this case temperature, is drawn with a vertical bar. The height of the bar represents how many times that value occurs. The values are listed on the horizontal axis in increasing value from left to right. We can get a different look at the data by graphing a range of temperature values rather than each value. The following chart does this. Interpreting Data 5/50

6 Clicker Question Estimate the average high temperature from this chart and enter your estimate with your clicker. Interpreting Data 6/50

7 Answer Answering something between 57 and 59 would be reasonable. The actual average is Because of graphing ranges of temperatures, we can t get the actual average from this kind of chart, but we can get a reasonable estimate fairly quickly. Interpreting Data 7/50

8 One thing that is often done with visual displays is to compare two sets of data. Let s look at some more weather data. We will show the high temperatures on January 1 from 1977 to 2007 for both San Francisco and Las Cruces. We will also calculate the mean and median for both data sets. We will look at histograms for both sets of data. We can graph each set separately or both together. We ll see both. Interpreting Data 8/50

9 Interpreting Data 9/50

10 Trying to see how temperatures in the two cities compare is really hard with the table of data above. We ll see that we can get some sense of the difference with histograms. One interesting point of this data is the following calculation of central tendency, which we ll demonstrate with an Excel spreadsheet. The results are San Francisco Las Cruces Mean Median The mean and median are virtually identical for the two cities. We will now draw histograms in a similar way as we did earlier. Interpreting Data 10/50

11 San Francisco and Las Cruces High Temperatures, Jan. 1 Interpreting Data 11/50

12 Interpreting Data 12/50

13 Clicker Question What both stands out to you about these and could be significant? A One uses red and one uses blue B The Las Cruces data is more spread out C The biggest bars for SF are larger than those for LC D A and B E B and C Interpreting Data 13/50

14 Answer It is true, and significant, that the Las Cruces data is more spread out. That means there is more variation of high temperatures for Las Cruces than San Francisco. This is due, in part, to the ocean s moderation of the San Francisco weather versus Las Cruces being in the desert. It is true that the highest bars for San Francisco are higher than those for Las Cruces. This is actually a consequence of the previous observation. We ll see other graphs that share this property. Interpreting Data 14/50

15 While the mean and median of the two sets of temperature data are nearly the same, the graphical representation makes the data look much different. The data for Las Cruces is spread out much more than that of San Francisco. A calculation of the middle of the data only presents part of the story. The dispersion or deviation of the data is also an important part of the data. While there are several measures of deviation, the most common one is called standard deviation. Interpreting Data 15/50

16 Standard Deviation The most basic property of standard deviation is: The larger the standard deviation, the more spread out the data. That is, the larger the deviation, the more the data is away from the middle. We won t formally define standard deviation or give a formula for how to calculate it. Spreadsheets have built in functions to calculate standard deviation. Interpreting Data 16/50

17 The point of measuring deviation is to give a sense of how far data is from the middle, or the average. Standard deviation approximately measures the average of how far data is from the middle. For example, the standard deviations of the San Francisco and Las Cruces weather data are 4.0 and 8.0, respectively. The standard deviation for Las Cruces is much larger than for San Francisco, reflecting that the Las Cruces data is more spread out. Interpreting Data 17/50

18 Some Coin Flipping Data Let s look at the experiment of flipping a coin repeatedly. We will simulate this with Excel and the computer program Maple. In the experiment we simulate a bunch of people flipping 100 coins and determining the percentage of flips which came up heads. We ll first look at an Excel spreadsheet, Coin Flip Distribution.xlsx. Interpreting Data 18/50

19 Excel isn t the best way to simulate a large amount of data. The following charts were created with the program Maple. We used this program to demonstrate RSA encryption in the beginning of the semester. Interpreting Data 19/50

20 Simulation of flipping 100 coins 1,000 times Interpreting Data 20/50

21 Simulation of flipping 100 coins 10,000 times Interpreting Data 21/50

22 Simulation of flipping 100 coins 100,000 times Interpreting Data 22/50

23 Simulation of flipping 100 coins 500,000 times Interpreting Data 23/50

24 Simulation of flipping 100 coins 1,000,000 times Interpreting Data 24/50

25 As the number of flips gets larger and larger, the graph looks more and more regular. Note that the graphs are roughly symmetric, and that the middle of each graph is at 50, the expected percentage of heads. Q Does the shape of the graphs, especially the latter ones, look at all familiar? A Yes B No Interpreting Data 25/50

26 The Bell Curve (or Normal Curve) Interpreting Data 26/50

27 The importance of the bell curve is that as the number of trials gets larger and larger, histograms generally look more and more like a bell curve. The particular shape of the bell curve reflects the mean and the standard deviation. The center of the curve represents the mean. How wide or thin is the curve is an indication of the standard deviation. The larger the standard deviation the wider is the curve. Interpreting Data 27/50

28 Bell Curves with Different Standard Deviations Standard Deviation = 5 Standard Deviation = 2 For both of these graphs the mean is 50. Interpreting Data 28/50

29 The blue and red graph have mean 0 and the green graph has mean 2. The blue graph has the smallest standard deviation, followed by the green graph, and finally by the red graph, which has the largest standard deviation. Interpreting Data 29/50

30 Standard Deviation and Number of Coins Flipped Let s go back to the coin flipping experiment. How do things change if each person changes the number of flips? The following graphs represent a simulation of 50,000 people flipping a coin repeatedly. Each person determines the percentages of heads on their flips. The first graph represents each person flipping a coin 10 times and recording the percentage of heads. The second graph represents each person flipping 100 times. Interpreting Data 30/50

31 Clicker Question Which graph represents the most variability in the data? A The graph on the left B The graph on the right 10 flips per person 100 flips per person Interpreting Data 31/50

32 Answer The first graph has more variability. While nearly all the data of the second graph is between 40 and 60, a lot of the data in the first graph is outside that range. Interpreting Data 32/50

33 Pie Charts We will come back to uses of standard deviation, but we ll talk about another visual representions of data first. Pie charts are used to break a whole into pieces, typically with percentages, according to different categories. Here is an example. Interpreting Data 33/50

34 Pie charts are often used to give visual displays of data. The size of a wedge is proportional to the size of the data it represents. For example, to represent 25%, you draw a wedge that is 25%, or 1/4, of the full circle. Pie charts are useful typically to show relative sizes of a few categories. When some categories are close in size, they lose some of their effectiveness. Trying to make a pie chart fancier is a good way to make the data less clear, and even to encourage misconceptions about the data. Interpreting Data 34/50

35 Clicker Question Between the blue and the green regions, which represents the largest number? A The green region B The blue region C They represent the same number D I can t tell Interpreting Data 35/50

36 Answer Whether or not it appears to be the case, the blue and green regions both represent 25%. These were created by Excel. The spreadsheet Pie Charts.xlsx shows two variants of a pie chart for the same data. Interpreting Data 36/50

37 For another example of a badly designed pie chart, how easy is it to get information from the following chart? Interpreting Data 37/50

38 Misleading Charts Here are some webpages demonstrating poorly designed, misleading, or incorrect charts. lovestats.wordpress.com/2009/03/11/pie-charts-our-evil-friend n html obamapacman.com/2011/12/google-android-director-presentsmisleading-chart wonkette.com/wp-content/uploads/2009/11/193.jpg Interpreting Data 38/50

39 Back to Standard Deviation In terms of the normal curve, standard deviation can be interpreted approximately with the following rules of thumb: 68% of all data is within 1 standard deviation of the mean. 95% of all data is within 2 standard deviations of the mean. 99.7% of all data is within 3 standard deviations of the mean. Interpreting Data 39/50

40 The letter σ is an abbreviation for the standard deviation, and µ for the mean. Finding the area under a curve was one of the problems that led to the development of Calculus in the 17th century. Interpreting Data 40/50

41 How to Tell if a Coin is Unfair? We ll use the normal distribution to get some sense on how to tell if a coin is fair. Suppose you flip a coin and get at least 60% heads. Can you conclude the coin is unfair? Interpreting Data 41/50

42 Clicker Question Q Suppose you flip a coin 10 times and get 6 heads. Do you think this is good evidence to conclude the coin is unfair? A Yes B No A No it really isn t. If you flip a coin 10 times, you are pretty likely to get 6 or more heads quite often. In fact, the probability of getting 6 or more heads is a little over 30%, which we can estimate with our spreadsheet. Interpreting Data 42/50

43 Let s suppose we flip a coin 100 times and get at least 60% heads. We can ask what is the probability that that happens. Let s imagine we do this many many times. As our simulations indicate, we can consider the distribution of trials giving us a bell curve. Based on the data from Coin Flip Distribution.xlsx, the mean for this curve is 50% and the standard deviation is 5%. Then 60% is two standard deviations to the right of the mean. The amount of data to the right of 60% is then approximately 2.25% of the data. So, there is only about a 2% chance that flipping a coin 100 times results in at least 60% heads, but that is not so small. Unless you had a reason to think the coin might be unfair, it would probably be hard to argue from this data that the coin is unfair, even though it is not too likely to get at least 60% heads. Interpreting Data 43/50

44 To think a little further about it, suppose a class of 100 students each flipped a coin 100 times. Even if everybody had a fair coin, we d expect, on average, 2 of the 100 students to get at least 60% heads. Therefore, while getting this many heads isn t too likely for any one person, with enough people it will happen. Interpreting Data 44/50

45 Heights A common example of a data set that looks like a bell curve is heights of adult males (or females). The webpage distribution says that the average height of an adult male is about 70 inches (5 10 ) with a standard deviation of 3 inches. Let s explore some questions about this data. Recall the following rules of thumb: 68% of all data is within 1 standard deviation of the mean. 95% of all data is within 2 standard deviations of the mean. 99.7% of all data is within 3 standard deviations of the mean. Interpreting Data 45/50

46 Clicker Question Q What is the probability of an adult male having height between 5 7 and 6 1? Enter a two-digit number to represent the probability as a percentage. A The answer is approximately 68%; since the average is 5 10 and the standard deviation is 3 inches, we are asking for the percentage of men within 1 standard deviation of the mean. Interpreting Data 46/50

47 Clicker Question Q What is the probability of an adult male having height between and 6 1 and 6 4? Enter a two-digit number to represent the probability as a percentage. A The answer is approximately 13.6%; we are asking for the percentage between 1 and 2 standard deviations above the mean. Interpreting Data 47/50

48 Clicker Question If a person is 7 2 tall, they are 4 standard deviations above the mean. The probability of having height that much or more is This can be found from the website stattrek.com/online-calculator/normal.aspx Q If there are about 2 billion adult males on the planet, what would be the expected number at least 7 2 tall? A The expected number is 2,000,000, = 6,000. Interpreting Data 48/50

49 Do you believe this number? The website estimates that there are around 2,800 people over 7 feet tall. However, the website claims there are 20,000 people over 7 feet tall. One thing this shows is that information on the web is not always accurate. A more subtle issue is that the assumption that heights follow a normal distribution may be approximately correct, but extreme values may not fit the model as well as values near the average. Interpreting Data 49/50

50 Next Week and Homework # 10 Next week is the final week of the semester. On Tuesday we will discuss fractals and focus on their artistic nature. On Thursday we ll wrap up the semester by watching a clip from the Simpson s Treehouse of Horror VI. In that episode lots of math and science is mentioned in the background, and we ll talk about that. Homework #10 is on the class website. It is due by next Thursday by midnight. Interpreting Data 50/50

Module 28 - Estimating a Population Mean (1 of 3)

Module 28 - Estimating a Population Mean (1 of 3) Module 28 - Estimating a Population Mean (1 of 3) In "Estimating a Population Mean," we focus on how to use a sample mean to estimate a population mean. This is the type of thinking we did in Modules 7

More information

Lesson 9 Presentation and Display of Quantitative Data

Lesson 9 Presentation and Display of Quantitative Data Lesson 9 Presentation and Display of Quantitative Data Learning Objectives All students will identify and present data using appropriate graphs, charts and tables. All students should be able to justify

More information

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010 OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010 SAMPLING AND CONFIDENCE INTERVALS Learning objectives for this session:

More information

Applied Statistical Analysis EDUC 6050 Week 4

Applied Statistical Analysis EDUC 6050 Week 4 Applied Statistical Analysis EDUC 6050 Week 4 Finding clarity using data Today 1. Hypothesis Testing with Z Scores (continued) 2. Chapters 6 and 7 in Book 2 Review! = $ & '! = $ & ' * ) 1. Which formula

More information

Standard Deviation and Standard Error Tutorial. This is significantly important. Get your AP Equations and Formulas sheet

Standard Deviation and Standard Error Tutorial. This is significantly important. Get your AP Equations and Formulas sheet Standard Deviation and Standard Error Tutorial This is significantly important. Get your AP Equations and Formulas sheet The Basics Let s start with a review of the basics of statistics. Mean: What most

More information

Population. Sample. AP Statistics Notes for Chapter 1 Section 1.0 Making Sense of Data. Statistics: Data Analysis:

Population. Sample. AP Statistics Notes for Chapter 1 Section 1.0 Making Sense of Data. Statistics: Data Analysis: Section 1.0 Making Sense of Data Statistics: Data Analysis: Individuals objects described by a set of data Variable any characteristic of an individual Categorical Variable places an individual into one

More information

Chapter 7: Descriptive Statistics

Chapter 7: Descriptive Statistics Chapter Overview Chapter 7 provides an introduction to basic strategies for describing groups statistically. Statistical concepts around normal distributions are discussed. The statistical procedures of

More information

Descriptive Statistics Lecture

Descriptive Statistics Lecture Definitions: Lecture Psychology 280 Orange Coast College 2/1/2006 Statistics have been defined as a collection of methods for planning experiments, obtaining data, and then analyzing, interpreting and

More information

Unit 7 Comparisons and Relationships

Unit 7 Comparisons and Relationships Unit 7 Comparisons and Relationships Objectives: To understand the distinction between making a comparison and describing a relationship To select appropriate graphical displays for making comparisons

More information

Gage R&R. Variation. Allow us to explain with a simple diagram.

Gage R&R. Variation. Allow us to explain with a simple diagram. Gage R&R Variation We ve learned how to graph variation with histograms while also learning how to determine if the variation in our process is greater than customer specifications by leveraging Process

More information

AP Statistics TOPIC A - Unit 2 MULTIPLE CHOICE

AP Statistics TOPIC A - Unit 2 MULTIPLE CHOICE AP Statistics TOPIC A - Unit 2 MULTIPLE CHOICE Name Date 1) True or False: In a normal distribution, the mean, median and mode all have the same value and the graph of the distribution is symmetric. 2)

More information

Organizing Data. Types of Distributions. Uniform distribution All ranges or categories have nearly the same value a.k.a. rectangular distribution

Organizing Data. Types of Distributions. Uniform distribution All ranges or categories have nearly the same value a.k.a. rectangular distribution Organizing Data Frequency How many of the data are in a category or range Just count up how many there are Notation x = number in one category n = total number in sample (all categories combined) Relative

More information

Appendix B Statistical Methods

Appendix B Statistical Methods Appendix B Statistical Methods Figure B. Graphing data. (a) The raw data are tallied into a frequency distribution. (b) The same data are portrayed in a bar graph called a histogram. (c) A frequency polygon

More information

Clever Hans the horse could do simple math and spell out the answers to simple questions. He wasn t always correct, but he was most of the time.

Clever Hans the horse could do simple math and spell out the answers to simple questions. He wasn t always correct, but he was most of the time. Clever Hans the horse could do simple math and spell out the answers to simple questions. He wasn t always correct, but he was most of the time. While a team of scientists, veterinarians, zoologists and

More information

t-test for r Copyright 2000 Tom Malloy. All rights reserved

t-test for r Copyright 2000 Tom Malloy. All rights reserved t-test for r Copyright 2000 Tom Malloy. All rights reserved This is the text of the in-class lecture which accompanied the Authorware visual graphics on this topic. You may print this text out and use

More information

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA 15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA Statistics does all kinds of stuff to describe data Talk about baseball, other useful stuff We can calculate the probability.

More information

Undertaking statistical analysis of

Undertaking statistical analysis of Descriptive statistics: Simply telling a story Laura Delaney introduces the principles of descriptive statistical analysis and presents an overview of the various ways in which data can be presented by

More information

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc. Chapter 23 Inference About Means Copyright 2010 Pearson Education, Inc. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it d be nice to be able

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Unit 2: Probability and distributions Lecture 3: Normal distribution

Unit 2: Probability and distributions Lecture 3: Normal distribution Unit 2: Probability and distributions Lecture 3: Normal distribution Statistics 101 Thomas Leininger May 23, 2013 Announcements 1 Announcements 2 Normal distribution Normal distribution model 68-95-99.7

More information

Never P alone: The value of estimates and confidence intervals

Never P alone: The value of estimates and confidence intervals Never P alone: The value of estimates and confidence Tom Lang Tom Lang Communications and Training International, Kirkland, WA, USA Correspondence to: Tom Lang 10003 NE 115th Lane Kirkland, WA 98933 USA

More information

Available as a Powerpoint data file

Available as a Powerpoint data file Available as a Powerpoint data file The Elsmar Cove http://elsmar.com/courses.html Slide 1 Statistics Statistical Process Control & Control Charting Slide 2 Recommended Statistical Course Attendance Basic

More information

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion Chapter 8 Estimating with Confidence Lesson 2: Estimating a Population Proportion What proportion of the beads are yellow? In your groups, you will find a 95% confidence interval for the true proportion

More information

Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points.

Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points. Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points. 1. The bell-shaped frequency curve is so common that if a population has this shape, the measurements are

More information

Probability and Statistics. Chapter 1

Probability and Statistics. Chapter 1 Probability and Statistics Chapter 1 Individuals and Variables Individuals and Variables Individuals are objects described by data. Individuals and Variables Individuals are objects described by data.

More information

Lesson 11.1: The Alpha Value

Lesson 11.1: The Alpha Value Hypothesis Testing Lesson 11.1: The Alpha Value The alpha value is the degree of risk we are willing to take when making a decision. The alpha value, often abbreviated using the Greek letter α, is sometimes

More information

Lecture 12: Normal Probability Distribution or Normal Curve

Lecture 12: Normal Probability Distribution or Normal Curve 12_normalcurve.pdf Michael Hallstone, Ph.D. hallston@hawaii.edu Lecture 12: Normal Probability Distribution or Normal Curve The real importance of this lecture is to show you what a normal curve looks

More information

PROBABILITY Page 1 of So far we have been concerned about describing characteristics of a distribution.

PROBABILITY Page 1 of So far we have been concerned about describing characteristics of a distribution. PROBABILITY Page 1 of 9 I. Probability 1. So far we have been concerned about describing characteristics of a distribution. That is, frequency distribution, percentile ranking, measures of central tendency,

More information

Statistics Coursework Free Sample. Statistics Coursework

Statistics Coursework Free Sample. Statistics Coursework Statistics Coursework For my initial investigation I am going to compare results on the following hypothesis, to see if people s intelligence affects their height and their ability to memorise a certain

More information

Test 1C AP Statistics Name:

Test 1C AP Statistics Name: Test 1C AP Statistics Name: Part 1: Multiple Choice. Circle the letter corresponding to the best answer. 1. At the beginning of the school year, a high-school teacher asks every student in her classes

More information

I Can t Stand BOREDOM!

I Can t Stand BOREDOM! I Can t Stand BOREDOM! A Teenager s Guide to ADHD Booklet 8 In a Series Chesterfield County Public Schools 1 My biggest problem with school is boredom! Boredom is the Number 1 problem for people with ADHD.

More information

STT315 Chapter 2: Methods for Describing Sets of Data - Part 2

STT315 Chapter 2: Methods for Describing Sets of Data - Part 2 Chapter 2.5 Interpreting Standard Deviation Chebyshev Theorem Empirical Rule Chebyshev Theorem says that for ANY shape of data distribution at least 3/4 of all data fall no farther from the mean than 2

More information

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA Data Analysis: Describing Data CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA In the analysis process, the researcher tries to evaluate the data collected both from written documents and from other sources such

More information

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics Psy201 Module 3 Study and Assignment Guide Using Excel to Calculate Descriptive and Inferential Statistics What is Excel? Excel is a spreadsheet program that allows one to enter numerical values or data

More information

3.2 Least- Squares Regression

3.2 Least- Squares Regression 3.2 Least- Squares Regression Linear (straight- line) relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these

More information

Making Inferences from Experiments

Making Inferences from Experiments 11.6 Making Inferences from Experiments Essential Question How can you test a hypothesis about an experiment? Resampling Data Yield (kilograms) Control Group Treatment Group 1. 1.1 1.2 1. 1.5 1.4.9 1.2

More information

Frequency Distributions

Frequency Distributions Frequency Distributions In this section, we look at ways to organize data in order to make it more user friendly. It is difficult to obtain any meaningful information from the data as presented in the

More information

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604 Measurement and Descriptive Statistics Katie Rommel-Esham Education 604 Frequency Distributions Frequency table # grad courses taken f 3 or fewer 5 4-6 3 7-9 2 10 or more 4 Pictorial Representations Frequency

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement

More information

Section 1.2 Displaying Quantitative Data with Graphs. Dotplots

Section 1.2 Displaying Quantitative Data with Graphs. Dotplots Section 1.2 Displaying Quantitative Data with Graphs Dotplots One of the simplest graphs to construct and interpret is a dotplot. Each data value is shown as a dot above its location on a number line.

More information

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data TECHNICAL REPORT Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data CONTENTS Executive Summary...1 Introduction...2 Overview of Data Analysis Concepts...2

More information

Experimental Design Process. Things you can change or vary: Things you can measure or observe:

Experimental Design Process. Things you can change or vary: Things you can measure or observe: Experimental Design Process Things you can change or vary: Things you can measure or observe: Choosing Variables I will change (independent variable): I will measure (dependent variable): I will not change,

More information

Welcome to OSA Training Statistics Part II

Welcome to OSA Training Statistics Part II Welcome to OSA Training Statistics Part II Course Summary Using data about a population to draw graphs Frequency distribution and variability within populations Bell Curves: What are they and where do

More information

Statisticians deal with groups of numbers. They often find it helpful to use

Statisticians deal with groups of numbers. They often find it helpful to use Chapter 4 Finding Your Center In This Chapter Working within your means Meeting conditions The median is the message Getting into the mode Statisticians deal with groups of numbers. They often find it

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Statistics: Bar Graphs and Standard Error

Statistics: Bar Graphs and Standard Error www.mathbench.umd.edu Bar graphs and standard error May 2010 page 1 Statistics: Bar Graphs and Standard Error URL: http://mathbench.umd.edu/modules/prob-stat_bargraph/page01.htm Beyond the scatterplot

More information

Patrick Breheny. January 28

Patrick Breheny. January 28 Confidence intervals Patrick Breheny January 28 Patrick Breheny Introduction to Biostatistics (171:161) 1/19 Recap Introduction In our last lecture, we discussed at some length the Public Health Service

More information

HARRISON ASSESSMENTS DEBRIEF GUIDE 1. OVERVIEW OF HARRISON ASSESSMENT

HARRISON ASSESSMENTS DEBRIEF GUIDE 1. OVERVIEW OF HARRISON ASSESSMENT HARRISON ASSESSMENTS HARRISON ASSESSMENTS DEBRIEF GUIDE 1. OVERVIEW OF HARRISON ASSESSMENT Have you put aside an hour and do you have a hard copy of your report? Get a quick take on their initial reactions

More information

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion Chapter 8 Estimating with Confidence Lesson 2: Estimating a Population Proportion Conditions for Estimating p These are the conditions you are expected to check before calculating a confidence interval

More information

(2) In each graph above, calculate the velocity in feet per second that is represented.

(2) In each graph above, calculate the velocity in feet per second that is represented. Calculus Week 1 CHALLENGE 1-Velocity Exercise 1: Examine the two graphs below and answer the questions. Suppose each graph represents the position of Abby, our art teacher. (1) For both graphs above, come

More information

Example The median earnings of the 28 male students is the average of the 14th and 15th, or 3+3

Example The median earnings of the 28 male students is the average of the 14th and 15th, or 3+3 Lecture 3 Nancy Pfenning Stats 1000 We learned last time how to construct a stemplot to display a single quantitative variable. A back-to-back stemplot is a useful display tool when we are interested in

More information

The Wellbeing Course. Resource: Mental Skills. The Wellbeing Course was written by Professor Nick Titov and Dr Blake Dear

The Wellbeing Course. Resource: Mental Skills. The Wellbeing Course was written by Professor Nick Titov and Dr Blake Dear The Wellbeing Course Resource: Mental Skills The Wellbeing Course was written by Professor Nick Titov and Dr Blake Dear About Mental Skills This resource introduces three mental skills which people find

More information

V. Gathering and Exploring Data

V. Gathering and Exploring Data V. Gathering and Exploring Data With the language of probability in our vocabulary, we re now ready to talk about sampling and analyzing data. Data Analysis We can divide statistical methods into roughly

More information

Quantitative Literacy: Thinking Between the Lines

Quantitative Literacy: Thinking Between the Lines Quantitative Literacy: Thinking Between the Lines Crauder, Noell, Evans, Johnson Chapter 6: Statistics 2013 W. H. Freeman and Company 1 Chapter 6: Statistics Lesson Plan Data summary and presentation:

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Control Chart Basics PK

Control Chart Basics PK Control Chart Basics Primary Knowledge Unit Participant Guide Description and Estimated Time to Complete Control Charts are one of the primary tools used in Statistical Process Control or SPC. Control

More information

Sleeping Beauty is told the following:

Sleeping Beauty is told the following: Sleeping beauty Sleeping Beauty is told the following: You are going to sleep for three days, during which time you will be woken up either once Now suppose that you are sleeping beauty, and you are woken

More information

Part III Taking Chances for Fun and Profit

Part III Taking Chances for Fun and Profit Part III Taking Chances for Fun and Profit Chapter 8 Are Your Curves Normal? Probability and Why it Counts What You Will Learn in Chapter 8 How probability relates to statistics Characteristics of the

More information

Frequency Distributions

Frequency Distributions Frequency Distributions In this section, we look at ways to organize data in order to make it user friendly. We begin by presenting two data sets, from which, because of how the data is presented, it is

More information

FIGHTING DRUG-RESISTANT MALARIA

FIGHTING DRUG-RESISTANT MALARIA FIGHTING DRUG-RESISTANT MALARIA Read the daily message and record notes of important points and deliverables. (5 min) NSEI 1.3.1 WARM-UP Open your pinned Daily Message Notes Add the today s date and the

More information

Pneumococcal Vaccines: Questions and Answers

Pneumococcal Vaccines: Questions and Answers Pneumococcal Vaccines: Questions and s Question 1 What is pneumococcal disease? What is pneumococcal disease? So pneumococcal disease is a group of diseases that are caused by a bacteria. Most of us are

More information

C-1: Variables which are measured on a continuous scale are described in terms of three key characteristics central tendency, variability, and shape.

C-1: Variables which are measured on a continuous scale are described in terms of three key characteristics central tendency, variability, and shape. MODULE 02: DESCRIBING DT SECTION C: KEY POINTS C-1: Variables which are measured on a continuous scale are described in terms of three key characteristics central tendency, variability, and shape. C-2:

More information

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Activity 1 Examining Data From Class Background Download

More information

Summarizing Data. (Ch 1.1, 1.3, , 2.4.3, 2.5)

Summarizing Data. (Ch 1.1, 1.3, , 2.4.3, 2.5) 1 Summarizing Data (Ch 1.1, 1.3, 1.10-1.13, 2.4.3, 2.5) Populations and Samples An investigation of some characteristic of a population of interest. Example: You want to study the average GPA of juniors

More information

Section 3.2 Least-Squares Regression

Section 3.2 Least-Squares Regression Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.

More information

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions Copyright 2010, 2007, 2004 Pearson Education, Inc. Standard Error Both of the sampling distributions we ve looked at are Normal. For proportions For means

More information

Beattie Learning Disabilities Continued Part 2 - Transcript

Beattie Learning Disabilities Continued Part 2 - Transcript Beattie Learning Disabilities Continued Part 2 - Transcript In class Tuesday we introduced learning disabilities and looked at a couple of different activities that are consistent and representative of

More information

Self-Injury. What is it? How do I get help? Adapted from Signs of Self-Injury Program

Self-Injury. What is it? How do I get help? Adapted from Signs of Self-Injury Program Self-Injury What is it? How do I get help? Adapted from Signs of Self-Injury Program Why are we doing this? *Prevention of self-injury-it s happening so why ignore it? *Statistics show high prevalence

More information

Descriptive Research a systematic, objective observation of people.

Descriptive Research a systematic, objective observation of people. What is going on in this picture? We cannot say exactly, but we can describe what we see. Thus we have.. Descriptive Research a systematic, objective observation of people. Any research that observes and

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 11 + 13 & Appendix D & E (online) Plous - Chapters 2, 3, and 4 Chapter 2: Cognitive Dissonance, Chapter 3: Memory and Hindsight Bias, Chapter 4: Context Dependence Still

More information

Variability. After reading this chapter, you should be able to do the following:

Variability. After reading this chapter, you should be able to do the following: LEARIG OBJECTIVES C H A P T E R 3 Variability After reading this chapter, you should be able to do the following: Explain what the standard deviation measures Compute the variance and the standard deviation

More information

Statistics for Psychology

Statistics for Psychology Statistics for Psychology SIXTH EDITION CHAPTER 3 Some Key Ingredients for Inferential Statistics Some Key Ingredients for Inferential Statistics Psychologists conduct research to test a theoretical principle

More information

about Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat.

about Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat. Brad Pilon 1 2 3 ! For many people, the best thing about Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat.! However, this still means there

More information

Introduction to Statistical Data Analysis I

Introduction to Statistical Data Analysis I Introduction to Statistical Data Analysis I JULY 2011 Afsaneh Yazdani Preface What is Statistics? Preface What is Statistics? Science of: designing studies or experiments, collecting data Summarizing/modeling/analyzing

More information

MITOCW conditional_probability

MITOCW conditional_probability MITOCW conditional_probability You've tested positive for a rare and deadly cancer that afflicts 1 out of 1000 people, based on a test that is 99% accurate. What are the chances that you actually have

More information

Statistics is a broad mathematical discipline dealing with

Statistics is a broad mathematical discipline dealing with Statistical Primer for Cardiovascular Research Descriptive Statistics and Graphical Displays Martin G. Larson, SD Statistics is a broad mathematical discipline dealing with techniques for the collection,

More information

Choosing Life: Empowerment, Action, Results! CLEAR Menu Sessions. Substance Use Risk 2: What Are My External Drug and Alcohol Triggers?

Choosing Life: Empowerment, Action, Results! CLEAR Menu Sessions. Substance Use Risk 2: What Are My External Drug and Alcohol Triggers? Choosing Life: Empowerment, Action, Results! CLEAR Menu Sessions Substance Use Risk 2: What Are My External Drug and Alcohol Triggers? This page intentionally left blank. What Are My External Drug and

More information

STP226 Brief Class Notes Instructor: Ela Jackiewicz

STP226 Brief Class Notes Instructor: Ela Jackiewicz CHAPTER 2 Organizing Data Statistics=science of analyzing data. Information collected (data) is gathered in terms of variables (characteristics of a subject that can be assigned a numerical value or nonnumerical

More information

Eating and Sleeping Habits of Different Countries

Eating and Sleeping Habits of Different Countries 9.2 Analyzing Scatter Plots Now that we know how to draw scatter plots, we need to know how to interpret them. A scatter plot graph can give us lots of important information about how data sets are related

More information

Introduction. Lecture 1. What is Statistics?

Introduction. Lecture 1. What is Statistics? Lecture 1 Introduction What is Statistics? Statistics is the science of collecting, organizing and interpreting data. The goal of statistics is to gain information and understanding from data. A statistic

More information

Conditional Distributions and the Bivariate Normal Distribution. James H. Steiger

Conditional Distributions and the Bivariate Normal Distribution. James H. Steiger Conditional Distributions and the Bivariate Normal Distribution James H. Steiger Overview In this module, we have several goals: Introduce several technical terms Bivariate frequency distribution Marginal

More information

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Greg Pope, Analytics and Psychometrics Manager 2008 Users Conference San Antonio Introduction and purpose of this session

More information

Chapter 1. Picturing Distributions with Graphs

Chapter 1. Picturing Distributions with Graphs Chapter 1 Picturing Distributions with Graphs Statistics Statistics is a science that involves the extraction of information from numerical data obtained during an experiment or from a sample. It involves

More information

M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60

M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60 M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-10 10 11 3 12 4 13 3 14 10 15 14 16 10 17 7 18 4 19 4 Total 60 Multiple choice questions (1 point each) For questions

More information

Filling the Bins - or - Turning Numerical Data into Histograms. ISC1057 Janet Peterson and John Burkardt Computational Thinking Fall Semester 2016

Filling the Bins - or - Turning Numerical Data into Histograms. ISC1057 Janet Peterson and John Burkardt Computational Thinking Fall Semester 2016 * Filling the Bins - or - Turning Numerical Data into Histograms ISC1057 Janet Peterson and John Burkardt Computational Thinking Fall Semester 2016 A histogram is a kind of chart which shows patterns in

More information

Unit 1: Perception and Dreaming

Unit 1: Perception and Dreaming Write your name here Surname Other names Pearson Edexcel GSE entre Number Psychology Unit 1: Perception and Dreaming andidate Number Tuesday 3 June 2014 fternoon Time: 1 hour 15 minutes You do not need

More information

Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 1:

Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 1: Research Methods 1 Handouts, Graham Hole,COGS - version 10, September 000: Page 1: T-TESTS: When to use a t-test: The simplest experimental design is to have two conditions: an "experimental" condition

More information

111, section 8.6 Applications of the Normal Distribution

111, section 8.6 Applications of the Normal Distribution 111, section 8.6 Applications of the Normal Distribution notes by Tim Pilachowski A probability density function f(x) for a continuous random variable has two necessary characteristics. 1. f(x) 0 for all

More information

How to interpret scientific & statistical graphs

How to interpret scientific & statistical graphs How to interpret scientific & statistical graphs Theresa A Scott, MS Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott 1 A brief introduction Graphics:

More information

Making charts in Excel

Making charts in Excel Making charts in Excel Use Excel file MakingChartsInExcel_data We ll start with the worksheet called treatment This shows the number of admissions (not necessarily people) to Twin Cities treatment programs

More information

Chapter 12. The One- Sample

Chapter 12. The One- Sample Chapter 12 The One- Sample z-test Objective We are going to learn to make decisions about a population parameter based on sample information. Lesson 12.1. Testing a Two- Tailed Hypothesis Example 1: Let's

More information

Biostatistics. Donna Kritz-Silverstein, Ph.D. Professor Department of Family & Preventive Medicine University of California, San Diego

Biostatistics. Donna Kritz-Silverstein, Ph.D. Professor Department of Family & Preventive Medicine University of California, San Diego Biostatistics Donna Kritz-Silverstein, Ph.D. Professor Department of Family & Preventive Medicine University of California, San Diego (858) 534-1818 dsilverstein@ucsd.edu Introduction Overview of statistical

More information

Measuring the User Experience

Measuring the User Experience Measuring the User Experience Collecting, Analyzing, and Presenting Usability Metrics Chapter 2 Background Tom Tullis and Bill Albert Morgan Kaufmann, 2008 ISBN 978-0123735584 Introduction Purpose Provide

More information

Paper Airplanes & Scientific Methods

Paper Airplanes & Scientific Methods Paper Airplanes & Scientific Methods Scientific Inquiry refers to the many different ways in which scientists investigate the world. Scientific investigations are done to answer questions and solve problems.

More information

Name AP Statistics UNIT 1 Summer Work Section II: Notes Analyzing Categorical Data

Name AP Statistics UNIT 1 Summer Work Section II: Notes Analyzing Categorical Data Name AP Statistics UNIT 1 Summer Work Date Section II: Notes 1.1 - Analyzing Categorical Data Essential Understanding: How can I represent the data when it is treated as a categorical variable? I. Distribution

More information

Lesson 1: Distributions and Their Shapes

Lesson 1: Distributions and Their Shapes Lesson 1 Name Date Lesson 1: Distributions and Their Shapes 1. Sam said that a typical flight delay for the sixty BigAir flights was approximately one hour. Do you agree? Why or why not? 2. Sam said that

More information

THIS PROBLEM HAS BEEN SOLVED BY USING THE CALCULATOR. A 90% CONFIDENCE INTERVAL IS ALSO SHOWN. ALL QUESTIONS ARE LISTED BELOW THE RESULTS.

THIS PROBLEM HAS BEEN SOLVED BY USING THE CALCULATOR. A 90% CONFIDENCE INTERVAL IS ALSO SHOWN. ALL QUESTIONS ARE LISTED BELOW THE RESULTS. Math 117 Confidence Intervals and Hypothesis Testing Interpreting Results SOLUTIONS The results are given. Interpret the results and write the conclusion within context. Clearly indicate what leads to

More information

Comparing Two Means using SPSS (T-Test)

Comparing Two Means using SPSS (T-Test) Indira Gandhi Institute of Development Research From the SelectedWorks of Durgesh Chandra Pathak Winter January 23, 2009 Comparing Two Means using SPSS (T-Test) Durgesh Chandra Pathak Available at: https://works.bepress.com/durgesh_chandra_pathak/12/

More information

First Problem Set: Answers, Discussion and Background

First Problem Set: Answers, Discussion and Background First Problem Set: Answers, Discussion and Background Part I. Intuition Concerning Probability Do these problems individually Answer the following questions based upon your intuitive understanding about

More information

Chapter 1: Alternative Forced Choice Methods

Chapter 1: Alternative Forced Choice Methods Chapter 1: Alternative Forced Choice Methods Section 1.1 Birdnapping Lewiston man confounded by stolen parrot art Lewiston, Minn Jim Schloegel stood under the shade of a giant bird cage and raised both

More information