Statistics: Bar Graphs and Standard Error

Similar documents
Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.

Chapter 12. The One- Sample

Chapter 7: Descriptive Statistics

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

Paper Airplanes & Scientific Methods

MITOCW ocw f99-lec20_300k

Unit 7 Comparisons and Relationships

Sheila Barron Statistics Outreach Center 2/8/2011

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

7. Bivariate Graphing

Lesson 9: Two Factor ANOVAS

t-test for r Copyright 2000 Tom Malloy. All rights reserved

Undertaking statistical analysis of

Section 3.2 Least-Squares Regression

PROSTATE CANCER SCREENING SHARED DECISION MAKING VIDEO

Statistics: Interpreting Data and Making Predictions. Interpreting Data 1/50

5 Ways To Get Rid Of The Baby fat 1

Business Statistics Probability

Organizing Data. Types of Distributions. Uniform distribution All ranges or categories have nearly the same value a.k.a. rectangular distribution

Appendix B Statistical Methods

C-1: Variables which are measured on a continuous scale are described in terms of three key characteristics central tendency, variability, and shape.

CHAPTER ONE CORRELATION

Still important ideas

3.2 Least- Squares Regression

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics

UNIT 1CP LAB 1 - Spaghetti Bridge

PROBABILITY Page 1 of So far we have been concerned about describing characteristics of a distribution.

Still important ideas

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA

Reliability, validity, and all that jazz

USING STATCRUNCH TO CONSTRUCT CONFIDENCE INTERVALS and CALCULATE SAMPLE SIZE

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Population. Sample. AP Statistics Notes for Chapter 1 Section 1.0 Making Sense of Data. Statistics: Data Analysis:

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of

Cell Processes: Osmosis

If you could interview anyone in the world, who. Polling. Do you think that grades in your school are inflated? would it be?

6 Simple Ways To Conquer Your Fears Of Returning To School

Table of Contents. Introduction. 1. Diverse Weighing scale models. 2. What to look for while buying a weighing scale. 3. Digital scale buying tips

7 Secrets To Gain Weight And Build Muscle Fast For Hard Gainers

What Constitutes a Good Contribution to the Literature (Body of Knowledge)?

Something to think about. What happens, however, when we have a sample with less than 30 items?

Psychology Research Process

One-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels;

Unit 1 Outline Science Practices. Part 1 - The Scientific Method. Screencasts found at: sciencepeek.com. 1. List the steps of the scientific method.

Data, frequencies, and distributions. Martin Bland. Types of data. Types of data. Clinical Biostatistics

1SCIENTIFIC METHOD PART A. THE SCIENTIFIC METHOD

The Bell Curve of Mature Buck Antler Scores: When You Manage a Buck Herd, You Manage a Bell Curve of Antler Scores By Stuart W.

Gage R&R. Variation. Allow us to explain with a simple diagram.

From Here To Macros: 4 Steps To Better Nutrition

ORAL HYGIENE SESSION 2

EXERCISE: HOW TO DO POWER CALCULATIONS IN OPTIMAL DESIGN SOFTWARE

Steps to writing a lab report on: factors affecting enzyme activity

Two-Way Independent ANOVA

Making An Exercise Plan

Patrick Breheny. January 28

Making charts in Excel

Eating and Sleeping Habits of Different Countries

ORAL HYGIENE SESSION 2

Bouncing Ball Lab. Name

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Predictive analytics can spot patients not taking their medicine

Paper Airplanes & Scientific Methods

CCM6+7+ Unit 12 Data Collection and Analysis

A Critical View of JUPITER

Goldsmith. Marshall. FEEDFORWARD Coaching For Your Future. Coaching For Your Future. MMIX ChartHouse Learning. All Rights Reserved. What Is It?

If I Do 50 Sit Ups A Day Will I Get A Six Pack

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion

Audio: In this lecture we are going to address psychology as a science. Slide #2

#1. What is SAD and how will we resolve it?

Readings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14

Psychology Research Process

Section 4 Decision-making

Math for Liberal Arts MAT 110: Chapter 5 Notes

Rife Machines - Do They Really Work Like They Did In the 1930's?

FDA/CFSAN: Guidance on How to Understand a...e the Nutrition Facts Panel on Food Labels

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion

Observational studies; descriptive statistics

Iron Deficiency Anemia in Patients with Inflammatory Bowel Disease ReachMD Page 1 of 7

MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1. Lecture 27: Systems Biology and Bayesian Networks

BBC Learning English 6 Minute English Chill and lose weight!

STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES)

ANOVA in SPSS (Practical)

Section 1.2 Displaying Quantitative Data with Graphs. Dotplots

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

3 CONCEPTUAL FOUNDATIONS OF STATISTICS

Making Inferences from Experiments

Planning and Carrying Out an Investigation. Name:

Section 6: Analysing Relationships Between Variables

Inferential Statistics

Never P alone: The value of estimates and confidence intervals

Offseason Training: Nutritional Troubleshooting and FAQ Section

MARKET ANIMAL WEIGHT PROJECTION WORKSHEET

Understanding Correlations The Powerful Relationship between Two Independent Variables

Statistics for Psychology

11 - Blood Tests (Completed 05/29/18) Transcript by Rev.com. Page 1 of 5

1 The conceptual underpinnings of statistical power

appstats26.notebook April 17, 2015

The Scientific Method Scientific method

Transcription:

www.mathbench.umd.edu Bar graphs and standard error May 2010 page 1 Statistics: Bar Graphs and Standard Error URL: http://mathbench.umd.edu/modules/prob-stat_bargraph/page01.htm Beyond the scatterplot In this module, we'll spice things up with some other kinds of graphs. Below are two graphs relating to fish food. The one on the left is obviously a scatterplot. The one on the right is a bar graph -- it has, um, bars on it. But how do you know that you need to use a bargraph for your data? Here are the data tables for these two graphs: Scatterplot vs. Barchart Amount of Fish2Whale (g/day) 2 3 4 5 6 8 10 Growth (g/day) 0.1 0.12 0.14 0.14 0.17 0.17 0.17 Type of fishfood Budget fude Ball- Mart Fish-o-matic Fish2Whale Growth (g/day) 0.05 0.06 0.07 0.15 What is the essential difference between these two data tables? If you said, the first datatable is all numbers, but the second datatable has some stuff that is not numbers, then you are correct. " Some stuff that is not numbers" is known in the mathematical world as "non-quantitative data". The word "quantitative" means able to be quantified or counted. In other words, a number. "Nonquantitative data" is also called "qualitative data." In this case, "qualitative" means data that you can qualify, or describe, but not count. Quantitative data can be put on a quantitative axis, but qualitative data can't. You just can't type of fish food into units of tens or fifties or hundreds. So, I have four categories in this data table, and the best I can do is simply list each category on x-axis.

www.mathbench.umd.edu Bar graphs and standard error May 2010 page 2 Practice with quantitative and qualitative. In each case below, decide whether the data presented is qualitative or quantitative. Scatter or Bars? Questions You do an experiment in which you measure growth of fish depending on the percentage of protein in their diet. You compare average growth of three different species of fish, all of which are fed 3 grams of Fish2Whale per day. You take 20 fish and measure their nose-to-tail length. You then feed each fish 1.5 grams/day of Fish2Whale for the first week and 2 grams/day for the second week. At the end of the experiment, you measure each fish's final noseto-tail length. You start with 30 fish, which you classify as "healthy", "unhealthy" or "dying", and feed each group a mixture of Fish-O-Matic and Ballmart fish food. At the end of 2 weeks, you measure the weight of each fish. You start with 6 tanks (1 week old, 2 weeks old, 3 weeks old, etc) and measure how long, on average, the fish in each tank spend eating in minutes per day. You start with 3 tanks (newly hatched, juvenile, and adult fish) and measure how long, on average, the fish in each tank spend eating eating in minutes per day. Answers Percent protein can be measured in numbers, therefore use a scatterplot "Species" is not quantifiable, therefore you need a barchart. You need to compare final to initial length, both of which can be measured with numbers. Therefore, use a scatterplot. The categories "healthy", "unhealthy", and "dying" are qualitative, so use a barchart. You can measure fish age in weeks, so use a scatterplot. Your age categories are not measured using numbers, so you must use a barchart. Here is a rule of thumb: if your x-axis contains qualitative data, you must use some sort of bargraph. If you're x-axis contains quantitative data, you will usually use some kind of scatter plot or line plot -- unless you have only a very few data points. How to make a bar chart. Since you already know how to make a scatterplot, making bar chart will be a lot easier. Here is a comparison: For a Scatterplot Choose x and y Label x, y and top Decide minimums and maximums Decide distance for ticks Make a legend Add data For a barchart Same! Same! (except x usually has no units Only for y Only for y Same Make rectangles, not dots

www.mathbench.umd.edu Bar graphs and standard error May 2010 page 3 How to make a fancier bar chart We're not even going to practice making bar charts, but we'll go right on to adding bells and whistles. Remember that your boss wants you to show that Fish2Whale is better than the competitor brands (Budget Fude, Ballmart, and Fish-O-Matic). On the first screen, I showed one possible graph: To make this graph, your boss started with 4 equal-size fish and fed one type of food to each fish. Although the results are certainly promising (the Fish2Whale fish ended up considerably bigger than the other 3 fish) it is far from being a slam-dunk. Most importantly, your boss only tested each food on a single fish. As we discussed in the module on the Normal Distribution, virtually all scientific experiments involve some level of natural variability. Some fish are naturally larger, stronger, happier, whatever... and they grow faster. So what you need to do is repeat -- or more impressively, to "replicate" -- your treatements using several similar fish. Time passes. Fish grow. Finally the results of your experiment are ready... good news for the company. Here is your datatable, and a possible graph: Fish # 1 2 3 4 5 6 7 8 9 10 11 12 Brand BF BF BF Bmart Bmart Bmart F-O-M F-O-M F-O-M F2W F2W F2W Final Length (mm) 16 17 18 14 17 20 12 17 22 19 22 25

www.mathbench.umd.edu Bar graphs and standard error May 2010 page 4 Hmm, this graph is kinda funny looking. Other than a criminal use of gradient fills, what did your boss do wrong? Drawing 12 big rectangles seems like a little bit of overkill. Maybe there is a way to summarize the data? What we really want to know is, what was the AVERAGE growth for each type of fishfood, right? So let's try taking some averages. Now our datatable looks like this: Fish # 1 to 3 4 to 6 7 to 9 10 to 12 Brand BF BMart F-O-M F2W!! AVERAGE Final Length (mm) 17 17 17 22 And your new, improved graph looks like this: Info lost and found I think we can agree that the summarized barchart is more elegant than the unsummarized one. But unfortunately, we lost some information when we summarized (averaged) the data. You can no longer look at the graph and get any sense of how much growth varied. If you look at the graph below, you can see that one brands produced very CONSISTENT growth rates, and one brand produced very VARIABLE growth rates. That is, one brand showed high variability, one showed low variability, and two showed medium levels.

www.mathbench.umd.edu Bar graphs and standard error May 2010 page 5 The standard deviation (SD), which we talked about in module 2, is one way to measure how consistent or variable a result is. Recall that a small standard deviation would mean the normal distribution is skinny and tall, and most of the measurements (final fish sizes) are about the same. A big standard deviation would mean the normal distribution is wide and low, and the measurements (final fish sizes) are all over the place. Unfortunately, SD is a bit tricky to calculate, not to mention time-consuming and frustrating. What we're going to do instead is the AVERAGE deviation, which is a good approximation of the standard deviation. Say you have 10 fish, with an average length of 100. The first fish actually measures 91 mm, so that one DEVIATES from the average by 9 mm. The second one measures 105 mm, so that one DEVIATES by 5 mm (notice it doesn't matter whether the deviation is above or below average). If you find all 10 deviations AND THEN AVERAGE THEM, you will get the average deviation. This is very similar to the formula for the standard deviation, except the SD has some squares and square roots tossed in that make it play nicely with other statistics. Fish-O-Matic had the highest variability of any of the four brands. What is the approximate standard deviation of growth for this brand? Fish # 7 8 9 Brand F-O-M F-O-M F-O-M Final Length (mm) 12 17 22 Use average deviation : Remember that the Standard Deviation is almost the same as the AVERAGE deviation -- how far on average a given fish deviates from the average final length. The average final length: (12 + 17 + 22)/3 = 17 mm. The deviation for fish #7: Fish #7 deviates by (17-12)= 5 mm. The average deviation : The three fish deviate by 5, 0, and 5 mm. Take the average of these deviations to get... I think I have the answer: 10/3 = 3.33 mm

www.mathbench.umd.edu Bar graphs and standard error May 2010 page 6 The actual standard deviation is a bit larger than the average deviation -- in this case 5 mm rather than 3.33. Below is a table of each brand, the average growth, and the standard deviation: Fish # 1 to 3 4 to 6 7 to 9 10 to 12 Brand BF BM FOM F2W!! AVERAGE Growth (mg/day) 17 17 17 22 Standard Deviation of Growth (mg/day) 1 3 5 3 Assuming that growth of fish is normally distributed (it should be, since many factors contribute), and that we have accurately measured the mean and standard deviation of growth for each type of fish food, then the distributions of actual growth should look like this: But the summarized version of our barchart loses all of this information.

www.mathbench.umd.edu Bar graphs and standard error May 2010 page 7 Error bars So the question is, how can we average the data but still keep enough information to get a good sense of what the unsummarized data looked like? This is where statistics comes to the rescue. In fact, there is even more than one way to do this in statistics. I'll show you one method on this page, and a second method later. The First Way: Say you want to know how much the data varied. For example, the company buying Fish2Whale might simply want to know the range of fish sizes they can reasonably expect after 4 weeks. In this case you would use the standard deviation of final fish size. As you saw on the last screen, the "standard deviation" is calculated with a slightly different formula than the "average deviation". However, you can use the average deviation formula to get a general idea of the SD, and you can calculate the SD automatically by using a graphing calculator or a spreadsheet. Once you know the mean and standard deviation of the data, you can make your bar chart. You need to label, range, scale, and fill in your axes as usual. HOWEVER, when you determine the maximum values for your axes, make sure to consider the average PLUS 1 SD. Finally you make bars for each average value and add "error bars" for each standard error. The "error bars" are not rectangles, but vertical lines with a little cross bar at the top and bottom. The line starts at the top of the rectangle and the length of the line represents the size of the standard deviation (in other words, the line stops at mean + standard deviation). You can optionally do the same thing heading down as well, as shown on the graph above. Practice with error bars If you have any uncertainty as to how to make a bar chart with error bars, here is a set of data to practice with,

www.mathbench.umd.edu Bar graphs and standard error May 2010 page 8 Question: how does final size differ among male and female fish? The data in this column is summarized in the table to the right. fish # sex size 1 female 18 2 female 18 3 female 21 4 female 23 5 male 8 6 male 10 7 male 10 8 male 12 male average 20 10 Average deviation 2 1 What is the largest value for y needed? 22 female Another way to add info: the standard error Graphs using standard deviation (SD) tell you what a big population of fish would look like -- whether their sizes would be all uniform, or somewhat raggedy, or totally raggedy. Sometimes, though, you don't really care what a population looks like, you just want to know, did a treatment (like Fish2Whale instead of other competing brands) make a difference on average? In that case you measure a bunch of fish because you're trying to get a really good estimate of the average effect, despite whatever raggediness might be present in the populations. Let's say your company decides to go all out to prove that Fish2Whale really is better than the competition. They convert a supply closet into an acquarium, hatch 400 fish, and tell you to do a HUGE experiment. The whole idea of the HUGE experiment is to get a really accurate measurement of the effect of Fish2Whale, despite the natural differences such as temperature, light, initial size of fish, solar flares, and ESP phenomena. The return on their investment? Really small error bars. But how do you get small error bars? Just using 400 fish WON'T give you a smaller SD. A huge population will be just as "ragged" as a small population. Instead, you need to use a quantity called the "standard error", or SE, which is the same as the standard deviation DIVIDED BY the square root of the sample size. Since you fed 100 fish with Fish2Whale, you get to divide the standard deviation of each result by 10 (i.e., the square root of 100). Likewise with each of the other 3 brands. So your reward for all that work is that your error bars are much smaller:

www.mathbench.umd.edu Bar graphs and standard error May 2010 page 9 Why should you care about small error bars? Well, as a rule of thumb, if the SE error bars for the 2 treatments do not overlap, then you have shown that the treatment made a difference. So, in order to show that Fish2Whale really is better than the competitors, NOT ONLY does the mean growth need to be higher, but (mean minus SE) for Fish2Whale must be bigger than (mean plus SE) for the other brands. In other words, the error bars shouldn't overlap. It's a little easier to see on a graph: No overlap means the 2 treatments really had different effects (on average). If there is overlap, then the two treatments did NOT have different effects (on average). The good news is, you already know how to make this kind of graph. Just use the SE instead of SD and you're good. The same graph both ways If you get confused between standard deviation and standard error, here are some suggestions: 1. Standard deviation is about how far members of the population deviate from the average. It is a characteristic of the population, so it doesn't depend on how many members you sample (except that if you only sample a few members, you won't get a good estimate of SD). If you are trying to describe variability in a population, you probably want to use standard deviation. 2. Standard error is about how much error you, as an experimenter, cannot rule out from your results. SE falls steadily as you sample more and more members of the population. No matter how "ragged" the population is, some TRUE mean exists, and SE tells you how accurately you can measure this true mean. If you are trying to support or reject a hypothesis -- in other words, when you are reporting on the results of an experiment -- you will most likely be using Standard Error for your error bars. Which graph will be more useful for each question below?

www.mathbench.umd.edu Bar graphs and standard error May 2010 page 10 The Budget Fude people take exception to your results, and carry out a ginormous experiment in an Olympic-size swimming pool divided in half, containing 10,000 fish. They claim that under the right temperature conditions, Budget Fude is superior to Fish2Whale. Are they interested in SD or SE? The University wants to choose a brand of fishfood to feed their fish during experminents on critical temperature limits. It is important that the brand they choose produces fish that are as similar as possible. Are they interested in SD or SE? Review For a scatterplot For a barchart Choose x and y same! Label x, y, and top same! (except x usually has no "units") Decide minimums and maximums only for y -- and include the error bars Decide distance for ticks only for y Make a legend same Add data! make rectangles, not dots SD vs. SE SD How far members of the population deviate from the average Quantifies the population Does NOT depend on sample size Use to characterize the population SE How far off your estimate of the mean is Quantifies your experiment DOES depend on sample size (a lot!) Use to test your results