Unit 2: Probability and distributions Lecture 3: Normal distribution
|
|
- Chloe Manning
- 6 years ago
- Views:
Transcription
1 Unit 2: Probability and distributions Lecture 3: Normal distribution Statistics 101 Thomas Leininger May 23, 2013
2 Announcements 1 Announcements 2 Normal distribution Normal distribution model Rule Standardizing with Z scores Calculating percentiles Recap 3 Evaluating the normal approximation 4 Examples (time permitting) Normal probability and quality control Finding cutoff points Statistics 101
3 Announcements Announcements Problem Set #2 due tomorrow Quiz #1 tomorrow Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
4 1 Announcements 2 Normal distribution Normal distribution model Rule Standardizing with Z scores Calculating percentiles Recap 3 Evaluating the normal approximation 4 Examples (time permitting) Normal probability and quality control Finding cutoff points Statistics 101
5 Normal distribution Unimodal and symmetric, bell shaped curve Most variables are nearly normal, but none are exactly normal Denoted as N(µ, σ) Normal with mean µ and standard deviation σ Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
6 Heights of males blog.okcupid.com/ index.php/ the-biggest-lies-in-online-dating/ Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
7 Heights of males The male heights on OkCupid very nearly follow the expected normal distribution except the whole thing is shifted to the right of where it should be. Almost universally guys like to add a couple inches. You can also see a more subtle vanity at work: starting at roughly 5 8, the top of the dotted curve tilts even further rightward. This means that guys as they get closer to six feet round up a bit more than usual, stretching for that coveted psychological benchmark. blog.okcupid.com/ index.php/ the-biggest-lies-in-online-dating/ Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
8 Heights of females blog.okcupid.com/ index.php/ the-biggest-lies-in-online-dating/ Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
9 Heights of females When we looked into the data for women, we were surprised to see height exaggeration was just as widespread, though without the lurch towards a benchmark height. blog.okcupid.com/ index.php/ the-biggest-lies-in-online-dating/ Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
10 Normal distribution model 1 Announcements 2 Normal distribution Normal distribution model Rule Standardizing with Z scores Calculating percentiles Recap 3 Evaluating the normal approximation 4 Examples (time permitting) Normal probability and quality control Finding cutoff points Statistics 101
11 Normal distribution model Normal distributions with different parameters µ: mean, σ: standard deviation N(µ = 0, σ = 1) N(µ = 19, σ = 4) Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
12 Rule 1 Announcements 2 Normal distribution Normal distribution model Rule Standardizing with Z scores Calculating percentiles Recap 3 Evaluating the normal approximation 4 Examples (time permitting) Normal probability and quality control Finding cutoff points Statistics 101
13 Rule Rule For nearly normally distributed data, about 68% falls within 1 SD of the mean, about 95% falls within 2 SD of the mean, about 99.7% falls within 3 SD of the mean. It is possible for observations to fall 4, 5, or more standard deviations away from the mean, but these occurrences are very rare if the data are nearly normal. 68% 95% 99.7% µ 3σ µ 2σ µ σ µ µ + σ µ + 2σ µ + 3σ Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
14 Rule Describing variability using the Rule SAT scores are distributed nearly normally with mean 1500 and standard deviation 300. Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
15 Rule Describing variability using the Rule SAT scores are distributed nearly normally with mean 1500 and standard deviation % of students score between 1200 and 1800 on the SAT. 95% of students score between 900 and 2100 on the SAT. 99.7% of students score between 600 and 2400 on the SAT. 68% 95% 99.7% Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
16 Rule Number of hours of sleep on school nights We can approximate this with a normal distribution (a bit of a stretch here, but it seems to hold in larger samples) mean = 6.88 sd = Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
17 Rule Number of hours of sleep on school nights We can approximate this with a normal distribution (a bit of a stretch here, but it seems to hold in larger samples) Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
18 Rule Number of hours of sleep on school nights We can approximate this with a normal distribution (a bit of a stretch here, but it seems to hold in larger samples) % Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
19 Rule Number of hours of sleep on school nights We can approximate this with a normal distribution (a bit of a stretch here, but it seems to hold in larger samples) % 75 % Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
20 Rule Number of hours of sleep on school nights We can approximate this with a normal distribution (a bit of a stretch here, but it seems to hold in larger samples) % 95 % 75 % Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
21 Standardizing with Z scores 1 Announcements 2 Normal distribution Normal distribution model Rule Standardizing with Z scores Calculating percentiles Recap 3 Evaluating the normal approximation 4 Examples (time permitting) Normal probability and quality control Finding cutoff points Statistics 101
22 Standardizing with Z scores SAT scores are distributed nearly normally with mean 1500 and standard deviation 300. ACT scores are distributed nearly normally with mean 21 and standard deviation 5. A college admissions officer wants to determine which of the two applicants scored better on their standardized test with respect to the other test takers: Pam, who earned an 1800 on her SAT, or Jim, who scored a 24 on his ACT? Pam Jim Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
23 Standardizing with Z scores Standardizing with Z scores Since we cannot just compare these two raw scores, we instead compare how many standard deviations beyond the mean each observation is. Pam s score is = 1 standard deviation above the mean. Jim s score is = 0.6 standard deviations above the mean. Jim Pam Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
24 Standardizing with Z scores Standardizing with Z scores (cont.) These are called standardized scores, or Z scores. Z score of an observation is the number of standard deviations it falls above or below the mean. Z scores Z = observation mean SD Z scores are defined for distributions of any shape, but only when the distribution is normal can we use Z scores to calculate percentiles. Observations that are more than 2 SD away from the mean ( Z > 2) are usually considered unusual. Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
25 Standardizing with Z scores Percentiles Percentile is the percentage of observations that fall below a given data point. Graphically, percentile is the area below the probability distribution curve to the left of that observation Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
26 Standardizing with Z scores Approximately what percent of students score below 1800 on the SAT? (Hint: Use the % rule. The mean is 1500 and the SD is 300.) Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
27 Standardizing with Z scores Approximately what percent of students score below 1800 on the SAT? (Hint: Use the % rule. The mean is 1500 and the SD is 300.) = 32% 32/2 = 16% = 84% Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
28 Standardizing with Z scores Jim or Pam? So who had a higher score Jim or Pam? Pam got an 1800 on the SAT (mean 1500, SD 300). Jim got a 24 on the ACT (mean 21, SD 5). Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
29 Standardizing with Z scores Jim or Pam? So who had a higher score Jim or Pam? Pam got an 1800 on the SAT (mean 1500, SD 300). Jim got a 24 on the ACT (mean 21, SD 5) Pam: Z Pam = 300 Percentile: 84% = 1.0 Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
30 Standardizing with Z scores Jim or Pam? So who had a higher score Jim or Pam? Pam got an 1800 on the SAT (mean 1500, SD 300). Jim got a 24 on the ACT (mean 21, SD 5) Pam: Z Pam = 300 Percentile: 84% = Jim: Z Jim = = Percentile: 73% Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
31 Standardizing with Z scores Jim or Pam? So who had a higher score Jim or Pam? Pam got an 1800 on the SAT (mean 1500, SD 300). Jim got a 24 on the ACT (mean 21, SD 5) Pam: Z Pam = 300 Percentile: 84% = Jim: Z Jim = = Percentile: 73% images/ gallery/ 10.jpg Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
32 Calculating percentiles 1 Announcements 2 Normal distribution Normal distribution model Rule Standardizing with Z scores Calculating percentiles Recap 3 Evaluating the normal approximation 4 Examples (time permitting) Normal probability and quality control Finding cutoff points Statistics 101
33 Calculating percentiles Calculating percentiles - using computation There are many ways to compute percentiles/areas under the curve: R: > pnorm(1800, mean = 1500, sd = 300) [1] Applet: htmls/ SOCR Distributions.html Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
34 Calculating percentiles Calculating percentiles - using tables Second decimal place of Z Z You ll find a similar table in Appendix B in the back of the book. Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
35 Recap 1 Announcements 2 Normal distribution Normal distribution model Rule Standardizing with Z scores Calculating percentiles Recap 3 Evaluating the normal approximation 4 Examples (time permitting) Normal probability and quality control Finding cutoff points Statistics 101
36 Recap Question Which of the following is false? (a) Majority of Z scores in a right skewed distribution are negative. (b) In skewed distributions the Z score of the mean might be different than 0. (c) For a normal distribution, IQR is less than 2 SD. (d) Z scores are helpful for determining how unusual a data point is compared to the rest of the data in the distribution. Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
37 Recap Question Which of the following is false? (a) Majority of Z scores in a right skewed distribution are negative. (b) In skewed distributions the Z score of the mean might be different than 0. (c) For a normal distribution, IQR is less than 2 SD. (d) Z scores are helpful for determining how unusual a data point is compared to the rest of the data in the distribution. Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
38 Evaluating the normal approximation 1 Announcements 2 Normal distribution Normal distribution model Rule Standardizing with Z scores Calculating percentiles Recap 3 Evaluating the normal approximation 4 Examples (time permitting) Normal probability and quality control Finding cutoff points Statistics 101
39 Evaluating the normal approximation Normal probability plot A histogram and normal probability plot of a sample of 100 male heights. Male heights (inches) Theoretical Quantiles male heights (in.) Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
40 Evaluating the normal approximation Anatomy of a normal probability plot Data are plotted on the y-axis of a normal probability plot, and theoretical quantiles (following a normal distribution) on the x-axis. If there is a one-to-one relationship between the data and the theoretical quantiles, then the data follow a nearly normal distribution. Since a one-to-one relationship would appear as a straight line on a scatter plot, the closer the points are to a perfect straight line, the more confident we can be that the data follow the normal model. Constructing a normal probability plot requires calculating percentiles and corresponding z-scores for each observation, which is tedious. Therefore we generally rely on software when making these plots. Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
41 Evaluating the normal approximation Below is a histogram and normal probability plot for the NBA heights from the season. Do these data appear to follow a normal distribution? Height (inches) Theoretical quantiles Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
42 Evaluating the normal approximation Below is a histogram and normal probability plot for the NBA heights from the season. Do these data appear to follow a normal distribution? Height (inches) Theoretical quantiles Why do the points on the normal probability have jumps? Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
43 Evaluating the normal approximation Construct a normal probability plot for the data set given below and determine if the data follow an approximately normal distribution. 3.46, 4.02, 5.09, 2.33, 6.47 Observation i x i Percentile = i n Corrsponding Z i Since the points on the normal probability plot seem to follow a straight line we can say that the distribution is nearly normal. Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
44 Evaluating the normal approximation Construct a normal probability plot for the data set given below and determine if the data follow an approximately normal distribution. 3.46, 4.02, 5.09, 2.33, 6.47 Observation i x i Percentile = i n Corrsponding Z i Since the points on the normal probability plot seem to follow a straight line we can say that the distribution is nearly normal. Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
45 Evaluating the normal approximation Normal probability plot and skewness Right Skew - If the plotted points appear to bend up and to the left of the normal line that indicates a long tail to the right. Left Skew - If the plotted points bend down and to the right of the normal line that indicates a long tail to the left. Short Tails - An S shaped-curve indicates shorter than normal tails, i.e. narrower than expected. Long Tails - A curve which starts below the normal line, bends to follow it, and ends above it indicates long tails. That is, you are seeing more variance than you would expect in a normal distribution, i.e. wider than expected. Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
46 Examples (time permitting) 1 Announcements 2 Normal distribution Normal distribution model Rule Standardizing with Z scores Calculating percentiles Recap 3 Evaluating the normal approximation 4 Examples (time permitting) Normal probability and quality control Finding cutoff points Statistics 101
47 Examples (time permitting) Normal probability and quality control 1 Announcements 2 Normal distribution Normal distribution model Rule Standardizing with Z scores Calculating percentiles Recap 3 Evaluating the normal approximation 4 Examples (time permitting) Normal probability and quality control Finding cutoff points Statistics 101
48 Examples (time permitting) Normal probability and quality control Six sigma The term six sigma process comes from the notion that if one has six standard deviations between the process mean and the nearest specification limit, as shown in the graph, practically no items will fail to meet specifications. en.wikipedia.org/ wiki/ Six Sigma Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
49 Examples (time permitting) Normal probability and quality control Question At Heinz ketchup factory the amounts which go into bottles of ketchup are supposed to be normally distributed with mean 36 oz. and standard deviation 0.11 oz. Once every 30 minutes a bottle is selected from the production line, and its contents are noted precisely. If the amount of ketchup in the bottle is below 35.8 oz. or above 36.2 oz., then the bottle fails the quality control inspection. What percent of bottles have fewer than 35.8 ounces of ketchup? (a) less than 0.15% (b) between 0.15% and 2.5% (c) between 2.5% and 16% (d) between 16% and 50% Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
50 Examples (time permitting) Normal probability and quality control Question At Heinz ketchup factory the amounts which go into bottles of ketchup are supposed to be normally distributed with mean 36 oz. and standard deviation 0.11 oz. Once every 30 minutes a bottle is selected from the production line, and its contents are noted precisely. If the amount of ketchup in the bottle is below 35.8 oz. or above 36.2 oz., then the bottle fails the quality control inspection. What percent of bottles have fewer than 35.8 ounces of ketchup? (a) less than 0.15% Let X = amount of ketchup in a bottle: X N(µ = 36, σ = 0.11) (b) between 0.15% and 2.5% (c) between 2.5% and 16% (d) between 16% and 50% Z = = 1.82 Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
51 Examples (time permitting) Normal probability and quality control Finding the exact probability - using the Z table Second decimal place of Z Z Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
52 Examples (time permitting) Normal probability and quality control Finding the exact probability - using R > pnorm(-1.82, mean = 0, sd = 1) [1] Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
53 Examples (time permitting) Normal probability and quality control Finding the exact probability - using R > pnorm(-1.82, mean = 0, sd = 1) [1] OR Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
54 Examples (time permitting) Normal probability and quality control Finding the exact probability - using R > pnorm(-1.82, mean = 0, sd = 1) [1] OR > pnorm(35.8, mean = 36, sd = 0.11) [1] Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
55 Examples (time permitting) Normal probability and quality control Question At Heinz ketchup factory the amounts which go into bottles of ketchup are supposed to be normally distributed with mean 36 oz. and standard deviation 0.11 oz. Once every 30 minutes a bottle is selected from the production line, and its contents are noted precisely. If the amount of the bottle goes below 35.8 oz. or above 36.2 oz., then the bottle fails the quality control inspection. What percent of bottles pass the quality control inspection? (a) 1.82% (b) 3.44% (c) 6.88% (d) 93.12% (e) 96.56% Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
56 Examples (time permitting) Normal probability and quality control P(35.8 < X < 36.2) =? = Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
57 Examples (time permitting) Normal probability and quality control P(35.8 < X < 36.2) =? = Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
58 Examples (time permitting) Normal probability and quality control P(35.8 < X < 36.2) =? = Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
59 Examples (time permitting) Normal probability and quality control P(35.8 < X < 36.2) =? = Z 35.8 = = Z 36.2 = = P(35.8 < X < 36.2) = P( 1.82 < Z < 1.82) = = Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
60 Examples (time permitting) Finding cutoff points 1 Announcements 2 Normal distribution Normal distribution model Rule Standardizing with Z scores Calculating percentiles Recap 3 Evaluating the normal approximation 4 Examples (time permitting) Normal probability and quality control Finding cutoff points Statistics 101
61 Examples (time permitting) Finding cutoff points Body temperatures of healthy humans are distributed nearly normally with mean 98.2 F and standard deviation 0.73 F. What is the cutoff for the lowest 3% of human body temperatures? Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
62 Examples (time permitting) Finding cutoff points Body temperatures of healthy humans are distributed nearly normally with mean 98.2 F and standard deviation 0.73 F. What is the cutoff for the lowest 3% of human body temperatures? 0.03? 98.2 Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
63 Examples (time permitting) Finding cutoff points Body temperatures of healthy humans are distributed nearly normally with mean 98.2 F and standard deviation 0.73 F. What is the cutoff for the lowest 3% of human body temperatures? Z ? 98.2 Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
64 Examples (time permitting) Finding cutoff points Body temperatures of healthy humans are distributed nearly normally with mean 98.2 F and standard deviation 0.73 F. What is the cutoff for the lowest 3% of human body temperatures? Z ? 98.2 P(X < x) = 0.03 P(Z < -1.88) = 0.03 obs mean Z = x 98.2 = 1.88 SD 0.73 x = ( ) = 96.8 Mackowiak, Wasserman, and Levine (1992), A Critical Appraisal of 98.6 Degrees F, the Upper Limit of the Normal Body Temperature, and Other Legacies of Carl Reinhold August Wunderlick. Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
65 Examples (time permitting) Finding cutoff points Question Body temperatures of healthy humans are distributed nearly normally with mean 98.2 F and standard deviation 0.73 F. What is the cutoff for the highest 10% of human body temperatures? (a) 99.1 (b) 97.3 (c) 99.4 (d) 99.6 Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
66 Examples (time permitting) Finding cutoff points Z ? Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
67 Examples (time permitting) Finding cutoff points Z ? P(X > x) = 0.10 P(Z < 1.28) = 0.90 obs mean Z = x 98.2 = 1.28 SD 0.73 x = ( ) = 99.1 Statistics 101 (Thomas Leininger) U2 - L3: Normal distribution May 23, / 30
Normal Distribution. Many variables are nearly normal, but none are exactly normal Not perfect, but still useful for a variety of problems.
Review Probability: likelihood of an event Each possible outcome can be assigned a probability If we plotted the probabilities they would follow some type a distribution Modeling the distribution is important
More informationQuantitative Literacy: Thinking Between the Lines
Quantitative Literacy: Thinking Between the Lines Crauder, Noell, Evans, Johnson Chapter 6: Statistics 2013 W. H. Freeman and Company 1 Chapter 6: Statistics Lesson Plan Data summary and presentation:
More informationChapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.
Chapter 23 Inference About Means Copyright 2010 Pearson Education, Inc. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it d be nice to be able
More informationAP Statistics TOPIC A - Unit 2 MULTIPLE CHOICE
AP Statistics TOPIC A - Unit 2 MULTIPLE CHOICE Name Date 1) True or False: In a normal distribution, the mean, median and mode all have the same value and the graph of the distribution is symmetric. 2)
More informationApplied Statistical Analysis EDUC 6050 Week 4
Applied Statistical Analysis EDUC 6050 Week 4 Finding clarity using data Today 1. Hypothesis Testing with Z Scores (continued) 2. Chapters 6 and 7 in Book 2 Review! = $ & '! = $ & ' * ) 1. Which formula
More informationModule 28 - Estimating a Population Mean (1 of 3)
Module 28 - Estimating a Population Mean (1 of 3) In "Estimating a Population Mean," we focus on how to use a sample mean to estimate a population mean. This is the type of thinking we did in Modules 7
More informationStatistics for Psychology
Statistics for Psychology SIXTH EDITION CHAPTER 3 Some Key Ingredients for Inferential Statistics Some Key Ingredients for Inferential Statistics Psychologists conduct research to test a theoretical principle
More information12.1 Inference for Linear Regression. Introduction
12.1 Inference for Linear Regression vocab examples Introduction Many people believe that students learn better if they sit closer to the front of the classroom. Does sitting closer cause higher achievement,
More informationNormal Random Variables
Normal Random Variables The distribution associated with Normal random variable is called Normal distribution. Carl Friedrich Gauss analyzed astronomical data using Normal distribution and defined the
More informationObservational studies; descriptive statistics
Observational studies; descriptive statistics Patrick Breheny August 30 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 38 Observational studies Association versus causation
More informationChapter 7: Descriptive Statistics
Chapter Overview Chapter 7 provides an introduction to basic strategies for describing groups statistically. Statistical concepts around normal distributions are discussed. The statistical procedures of
More informationStatistics: Interpreting Data and Making Predictions. Interpreting Data 1/50
Statistics: Interpreting Data and Making Predictions Interpreting Data 1/50 Last Time Last time we discussed central tendency; that is, notions of the middle of data. More specifically we discussed the
More informationChapter 1: Introduction to Statistics
Chapter 1: Introduction to Statistics Variables A variable is a characteristic or condition that can change or take on different values. Most research begins with a general question about the relationship
More informationAP Statistics. Semester One Review Part 1 Chapters 1-5
AP Statistics Semester One Review Part 1 Chapters 1-5 AP Statistics Topics Describing Data Producing Data Probability Statistical Inference Describing Data Ch 1: Describing Data: Graphically and Numerically
More informationUNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016
UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016 STAB22H3 Statistics I, LEC 01 and LEC 02 Duration: 1 hour and 45 minutes Last Name: First Name:
More informationAppendix B Statistical Methods
Appendix B Statistical Methods Figure B. Graphing data. (a) The raw data are tallied into a frequency distribution. (b) The same data are portrayed in a bar graph called a histogram. (c) A frequency polygon
More informationAP Psych - Stat 1 Name Period Date. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
AP Psych - Stat 1 Name Period Date MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) In a set of incomes in which most people are in the $15,000
More informationProbability and Statistics. Chapter 1
Probability and Statistics Chapter 1 Individuals and Variables Individuals and Variables Individuals are objects described by data. Individuals and Variables Individuals are objects described by data.
More informationSTAT 200. Guided Exercise 4
STAT 200 Guided Exercise 4 1. Let s Revisit this Problem. Fill in the table again. Diagnostic tests are not infallible. We often express a fale positive and a false negative with any test. There are further
More informationClassical Psychophysical Methods (cont.)
Classical Psychophysical Methods (cont.) 1 Outline Method of Adjustment Method of Limits Method of Constant Stimuli Probit Analysis 2 Method of Constant Stimuli A set of equally spaced levels of the stimulus
More informationStandard Deviation and Standard Error Tutorial. This is significantly important. Get your AP Equations and Formulas sheet
Standard Deviation and Standard Error Tutorial This is significantly important. Get your AP Equations and Formulas sheet The Basics Let s start with a review of the basics of statistics. Mean: What most
More informationStill important ideas
Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement
More informationUnit 1 Exploring and Understanding Data
Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile
More informationWelcome to OSA Training Statistics Part II
Welcome to OSA Training Statistics Part II Course Summary Using data about a population to draw graphs Frequency distribution and variability within populations Bell Curves: What are they and where do
More informationIntroduction to Statistical Data Analysis I
Introduction to Statistical Data Analysis I JULY 2011 Afsaneh Yazdani Preface What is Statistics? Preface What is Statistics? Science of: designing studies or experiments, collecting data Summarizing/modeling/analyzing
More informationBusiness Statistics Probability
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationSTA Module 9 Confidence Intervals for One Population Mean
STA 2023 Module 9 Confidence Intervals for One Population Mean Learning Objectives Upon completing this module, you should be able to: 1. Obtain a point estimate for a population mean. 2. Find and interpret
More informationLesson 9 Presentation and Display of Quantitative Data
Lesson 9 Presentation and Display of Quantitative Data Learning Objectives All students will identify and present data using appropriate graphs, charts and tables. All students should be able to justify
More informationCHAPTER 3 DATA ANALYSIS: DESCRIBING DATA
Data Analysis: Describing Data CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA In the analysis process, the researcher tries to evaluate the data collected both from written documents and from other sources such
More informationAP Psych - Stat 2 Name Period Date. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
AP Psych - Stat 2 Name Period Date MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) In a set of incomes in which most people are in the $15,000
More informationData, frequencies, and distributions. Martin Bland. Types of data. Types of data. Clinical Biostatistics
Clinical Biostatistics Data, frequencies, and distributions Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/ Types of data Qualitative data arise when individuals
More informationChapter 2: The Normal Distributions
Chapter 2: The Normal Distributions Use the following to answer questions 1-3: 1. For this density curve, which of the following is true? a) It is symmetric. c) The median is 1. b) The total area under
More informationChapter 1: Exploring Data
Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!
More informationReadings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F
Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions
More informationCHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships 3.1 Scatterplots and Correlation The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Reading Quiz 3.1 True/False 1.
More informationThe normal curve and standardisation. Percentiles, z-scores
The normal curve and standardisation Percentiles, z-scores The normal curve Frequencies (histogram) Characterised by: Central tendency Mean Median Mode uni, bi, multi Positively skewed, negatively skewed
More informationNormal Distribution: Homework *
OpenStax-CNX module: m16978 1 Normal Distribution: Homework * Susan Dean Barbara Illowsky, Ph.D. This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 2.0 Exercise
More informationSTOR 155 Section 2 Midterm Exam 1 (9/29/09)
STOR 155 Section 2 Midterm Exam 1 (9/29/09) Name: PID: Instructions: Both the exam and the bubble sheet will be collected. On the bubble sheet, print your name and ID number, sign the honor pledge, also
More informationHomework Exercises for PSYC 3330: Statistics for the Behavioral Sciences
Homework Exercises for PSYC 3330: Statistics for the Behavioral Sciences compiled and edited by Thomas J. Faulkenberry, Ph.D. Department of Psychological Sciences Tarleton State University Version: July
More information2.4.1 STA-O Assessment 2
2.4.1 STA-O Assessment 2 Work all the problems and determine the correct answers. When you have completed the assessment, open the Assessment 2 activity and input your responses into the online grading
More informationPart 1. For each of the following questions fill-in the blanks. Each question is worth 2 points.
Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points. 1. The bell-shaped frequency curve is so common that if a population has this shape, the measurements are
More informationCHAPTER ONE CORRELATION
CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationRegression. Lelys Bravo de Guenni. April 24th, 2015
Regression Lelys Bravo de Guenni April 24th, 2015 Outline Regression Simple Linear Regression Prediction of an individual value Estimate Percentile Ranks Regression Simple Linear Regression The idea behind
More information111, section 8.6 Applications of the Normal Distribution
111, section 8.6 Applications of the Normal Distribution notes by Tim Pilachowski A probability density function f(x) for a continuous random variable has two necessary characteristics. 1. f(x) 0 for all
More informationReminders/Comments. Thanks for the quick feedback I ll try to put HW up on Saturday and I ll you
Reminders/Comments Thanks for the quick feedback I ll try to put HW up on Saturday and I ll email you Final project will be assigned in the last week of class You ll have that week to do it Participation
More informationSTATISTICS AND RESEARCH DESIGN
Statistics 1 STATISTICS AND RESEARCH DESIGN These are subjects that are frequently confused. Both subjects often evoke student anxiety and avoidance. To further complicate matters, both areas appear have
More informationResults & Statistics: Description and Correlation. I. Scales of Measurement A Review
Results & Statistics: Description and Correlation The description and presentation of results involves a number of topics. These include scales of measurement, descriptive statistics used to summarize
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Statistics Final Review Semeter I Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) The Centers for Disease
More informationLesson 1: Distributions and Their Shapes
Lesson 1 Name Date Lesson 1: Distributions and Their Shapes 1. Sam said that a typical flight delay for the sixty BigAir flights was approximately one hour. Do you agree? Why or why not? 2. Sam said that
More information3 CONCEPTUAL FOUNDATIONS OF STATISTICS
3 CONCEPTUAL FOUNDATIONS OF STATISTICS In this chapter, we examine the conceptual foundations of statistics. The goal is to give you an appreciation and conceptual understanding of some basic statistical
More informationPreviously, when making inferences about the population mean,, we were assuming the following simple conditions:
Chapter 17 Inference about a Population Mean Conditions for inference Previously, when making inferences about the population mean,, we were assuming the following simple conditions: (1) Our data (observations)
More informationLOTS of NEW stuff right away 2. The book has calculator commands 3. About 90% of technology by week 5
1.1 1. LOTS of NEW stuff right away 2. The book has calculator commands 3. About 90% of technology by week 5 1 Three adventurers are in a hot air balloon. Soon, they find themselves lost in a canyon in
More informationStatistical Methods and Reasoning for the Clinical Sciences
Statistical Methods and Reasoning for the Clinical Sciences Evidence-Based Practice Eiki B. Satake, PhD Contents Preface Introduction to Evidence-Based Statistics: Philosophical Foundation and Preliminaries
More informationStandard Scores. Richard S. Balkin, Ph.D., LPC-S, NCC
Standard Scores Richard S. Balkin, Ph.D., LPC-S, NCC 1 Normal Distributions While Best and Kahn (2003) indicated that the normal curve does not actually exist, measures of populations tend to demonstrate
More informationBiostatistics. Donna Kritz-Silverstein, Ph.D. Professor Department of Family & Preventive Medicine University of California, San Diego
Biostatistics Donna Kritz-Silverstein, Ph.D. Professor Department of Family & Preventive Medicine University of California, San Diego (858) 534-1818 dsilverstein@ucsd.edu Introduction Overview of statistical
More informationStudents will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of
Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of numbers. Also, students will understand why some measures
More information10/4/2007 MATH 171 Name: Dr. Lunsford Test Points Possible
Pledge: 10/4/2007 MATH 171 Name: Dr. Lunsford Test 1 100 Points Possible I. Short Answer and Multiple Choice. (36 points total) 1. Circle all of the items below that are measures of center of a distribution:
More informationMBA 605 Business Analytics Don Conant, PhD. GETTING TO THE STANDARD NORMAL DISTRIBUTION
MBA 605 Business Analytics Don Conant, PhD. GETTING TO THE STANDARD NORMAL DISTRIBUTION Variables In the social sciences data are the observed and/or measured characteristics of individuals and groups
More informationChapter 20: Test Administration and Interpretation
Chapter 20: Test Administration and Interpretation Thought Questions Why should a needs analysis consider both the individual and the demands of the sport? Should test scores be shared with a team, or
More informationChapter Three in-class Exercises. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Name Chapter Three in-class Exercises MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) The table below lists the populations, in thousands, of several
More informationObjectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests
Objectives Quantifying the quality of hypothesis tests Type I and II errors Power of a test Cautions about significance tests Designing Experiments based on power Evaluating a testing procedure The testing
More informationMedical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months?
Medical Statistics 1 Basic Concepts Farhad Pishgar Defining the data Population and samples Except when a full census is taken, we collect data on a sample from a much larger group called the population.
More informationSummarizing Data. (Ch 1.1, 1.3, , 2.4.3, 2.5)
1 Summarizing Data (Ch 1.1, 1.3, 1.10-1.13, 2.4.3, 2.5) Populations and Samples An investigation of some characteristic of a population of interest. Example: You want to study the average GPA of juniors
More informationEstimation. Preliminary: the Normal distribution
Estimation Preliminary: the Normal distribution Many statistical methods are only valid if we can assume that our data follow a distribution of a particular type, called the Normal distribution. Many naturally
More informationConditional Distributions and the Bivariate Normal Distribution. James H. Steiger
Conditional Distributions and the Bivariate Normal Distribution James H. Steiger Overview In this module, we have several goals: Introduce several technical terms Bivariate frequency distribution Marginal
More information4.3 Measures of Variation
4.3 Measures of Variation! How much variation is there in the data?! Look for the spread of the distribution.! What do we mean by spread? 1 Example Data set:! Weight of contents of regular cola (grams).
More informationStill important ideas
Readings: OpenStax - Chapters 1 11 + 13 & Appendix D & E (online) Plous - Chapters 2, 3, and 4 Chapter 2: Cognitive Dissonance, Chapter 3: Memory and Hindsight Bias, Chapter 4: Context Dependence Still
More informationSTAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES)
STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES) In baseball after a player gets a hit, they need to decide whether to stop at first base, or try to stretch their hit from a single to a double. Does the
More informationM 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60
M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-10 10 11 3 12 4 13 3 14 10 15 14 16 10 17 7 18 4 19 4 Total 60 Multiple choice questions (1 point each) For questions
More information(a) 50% of the shows have a rating greater than: impossible to tell
q 1. Here is a histogram of the Distribution of grades on a quiz. How many students took the quiz? What percentage of students scored below a 60 on the quiz? (Assume left-hand endpoints are included in
More informationOn the purpose of testing:
Why Evaluation & Assessment is Important Feedback to students Feedback to teachers Information to parents Information for selection and certification Information for accountability Incentives to increase
More informationV. Gathering and Exploring Data
V. Gathering and Exploring Data With the language of probability in our vocabulary, we re now ready to talk about sampling and analyzing data. Data Analysis We can divide statistical methods into roughly
More informationReadings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14
Readings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14 Still important ideas Contrast the measurement of observable actions (and/or characteristics)
More informationStats 95. Statistical analysis without compelling presentation is annoying at best and catastrophic at worst. From raw numbers to meaningful pictures
Stats 95 Statistical analysis without compelling presentation is annoying at best and catastrophic at worst. From raw numbers to meaningful pictures Stats 95 Why Stats? 200 countries over 200 years http://www.youtube.com/watch?v=jbksrlysojo
More informationStatistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions
Readings: OpenStax Textbook - Chapters 1 5 (online) Appendix D & E (online) Plous - Chapters 1, 5, 6, 13 (online) Introductory comments Describe how familiarity with statistical methods can - be associated
More informationOCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010
OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010 SAMPLING AND CONFIDENCE INTERVALS Learning objectives for this session:
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 5, 6, 7, 8, 9 10 & 11)
More informationM 140 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75
M 140 est 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDI! Problem Max. Points Your Points 1-10 10 11 10 12 3 13 4 14 18 15 8 16 7 17 14 otal 75 Multiple choice questions (1 point each) For questions
More informationCHAPTER 2. MEASURING AND DESCRIBING VARIABLES
4 Chapter 2 CHAPTER 2. MEASURING AND DESCRIBING VARIABLES 1. A. Age: name/interval; military dictatorship: value/nominal; strongly oppose: value/ ordinal; election year: name/interval; 62 percent: value/interval;
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 10, 11) Please note chapter
More informationExample The median earnings of the 28 male students is the average of the 14th and 15th, or 3+3
Lecture 3 Nancy Pfenning Stats 1000 We learned last time how to construct a stemplot to display a single quantitative variable. A back-to-back stemplot is a useful display tool when we are interested in
More informationPooling Subjective Confidence Intervals
Spring, 1999 1 Administrative Things Pooling Subjective Confidence Intervals Assignment 7 due Friday You should consider only two indices, the S&P and the Nikkei. Sorry for causing the confusion. Reading
More informationStatistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions
Readings: OpenStax Textbook - Chapters 1 5 (online) Appendix D & E (online) Plous - Chapters 1, 5, 6, 13 (online) Introductory comments Describe how familiarity with statistical methods can - be associated
More informationPRINTABLE VERSION. Quiz 1. True or False: The amount of rainfall in your state last month is an example of continuous data.
Question 1 PRINTABLE VERSION Quiz 1 True or False: The amount of rainfall in your state last month is an example of continuous data. a) True b) False Question 2 True or False: The standard deviation is
More informationMath 1680 Class Notes. Chapters: 1, 2, 3, 4, 5, 6
Math 1680 Class Notes Chapters: 1, 2, 3, 4, 5, 6 Chapter 1. Controlled Experiments Salk vaccine field trial: a randomized controlled double-blind design 1. Suppose they gave the vaccine to everybody, and
More informationBasic Statistics 01. Describing Data. Special Program: Pre-training 1
Basic Statistics 01 Describing Data Special Program: Pre-training 1 Describing Data 1. Numerical Measures Measures of Location Measures of Dispersion Correlation Analysis 2. Frequency Distributions (Relative)
More informationStatistical Techniques. Masoud Mansoury and Anas Abulfaraj
Statistical Techniques Masoud Mansoury and Anas Abulfaraj What is Statistics? https://www.youtube.com/watch?v=lmmzj7599pw The definition of Statistics The practice or science of collecting and analyzing
More informationSTT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points.
STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points. For Questions 1 & 2: It is known that the distribution of starting salaries for MSU Education majors has
More informationSTT315 Chapter 2: Methods for Describing Sets of Data - Part 2
Chapter 2.5 Interpreting Standard Deviation Chebyshev Theorem Empirical Rule Chebyshev Theorem says that for ANY shape of data distribution at least 3/4 of all data fall no farther from the mean than 2
More informationStatistics is a broad mathematical discipline dealing with
Statistical Primer for Cardiovascular Research Descriptive Statistics and Graphical Displays Martin G. Larson, SD Statistics is a broad mathematical discipline dealing with techniques for the collection,
More informationAP Stats Review for Midterm
AP Stats Review for Midterm NAME: Format: 10% of final grade. There will be 20 multiple-choice questions and 3 free response questions. The multiple-choice questions will be worth 2 points each and the
More informationIdentify two variables. Classify them as explanatory or response and quantitative or explanatory.
OLI Module 2 - Examining Relationships Objective Summarize and describe the distribution of a categorical variable in context. Generate and interpret several different graphical displays of the distribution
More informationMATH 227 CP 8 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
MATH 227 CP 8 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Find the indicated critical z value. 1) Find z /2 for = 0.07. 1) 2) Find the value of z
More informationHow Faithful is the Old Faithful? The Practice of Statistics, 5 th Edition 1
How Faithful is the Old Faithful? The Practice of Statistics, 5 th Edition 1 Who Has Been Eating My Cookies????????? Someone has been steeling the cookie I bought for your class A teacher from the highschool
More informationEmpirical Rule ( rule) applies ONLY to Normal Distribution (modeled by so called bell curve)
Chapter 2.5 Interpreting Standard Deviation Chebyshev Theorem Empirical Rule Chebyshev Theorem says that for ANY shape of data distribution at least 3/4 of all data fall no farther from the mean than 2
More informationFrequency distributions
Applied Biostatistics distributions Martin Bland Professor of Health Statistics University of York http://www-users.york.ac.uk/~mb55/ Types of data Qualitative data arise when individuals may fall into
More informationUnit 7 Comparisons and Relationships
Unit 7 Comparisons and Relationships Objectives: To understand the distinction between making a comparison and describing a relationship To select appropriate graphical displays for making comparisons
More informationSheila Barron Statistics Outreach Center 2/8/2011
Sheila Barron Statistics Outreach Center 2/8/2011 What is Power? When conducting a research study using a statistical hypothesis test, power is the probability of getting statistical significance when
More informationPractice First Midterm Exam
Practice First Midterm Exam Statistics 200 (Pfenning) This is a closed book exam worth 150 points. You are allowed to use a calculator and a two-sided sheet of notes. There are 9 problems, with point values
More information