The Confidence Interval. Finally, we can start making decisions!

Similar documents
Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Chapter 19. Confidence Intervals for Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

STA Module 9 Confidence Intervals for One Population Mean

STA Learning Objectives. What is Population Proportion? Module 7 Confidence Intervals for Proportions

STA Module 7 Confidence Intervals for Proportions

Previously, when making inferences about the population mean,, we were assuming the following simple conditions:

Chapter 8: Estimating with Confidence

Chapter 8 Estimating with Confidence

Chapter 25. Paired Samples and Blocks. Copyright 2010 Pearson Education, Inc.

Comparison of two means

Chapter 8: Estimating with Confidence

Confidence Intervals. Chapter 10

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion

Exam 4 Review Exercises

Module 28 - Estimating a Population Mean (1 of 3)

THIS PROBLEM HAS BEEN SOLVED BY USING THE CALCULATOR. A 90% CONFIDENCE INTERVAL IS ALSO SHOWN. ALL QUESTIONS ARE LISTED BELOW THE RESULTS.

10.1 Estimating with Confidence. Chapter 10 Introduction to Inference

Student Performance Q&A:

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion

9. Interpret a Confidence level: "To say that we are 95% confident is shorthand for..

Creative Commons Attribution-NonCommercial-Share Alike License

Basic Statistics for Comparing the Centers of Continuous Data From Two Groups

Chi-Square Goodness-of-Fit Test

Chapter 11. Inference for Means. Inference For Means: 11.1" The t # Distribution 11.1" Inference for a Mean 11.2" Comparing Two Means

The normal curve and standardisation. Percentiles, z-scores

PSYCHOLOGY 300B (A01) One-sample t test. n = d = ρ 1 ρ 0 δ = d (n 1) d

Two-Way Independent ANOVA

Recognizing Ambiguity

An Introduction to Bayesian Statistics

Reflection Questions for Math 58B

Hypothesis Testing. Richard S. Balkin, Ph.D., LPC-S, NCC

The t-test: Answers the question: is the difference between the two conditions in my experiment "real" or due to chance?

The Z Test. Assignment. Terms You Should Know. Psychological Statistics The Z Test. G&W, Chapter 6. Z-test

Kepler tried to record the paths of planets in the sky, Harvey to measure the flow of blood in the circulatory system, and chemists tried to produce

appstats26.notebook April 17, 2015

The following command was executed on their calculator: mean(randnorm(m,20,16))

Inferential Statistics: An Introduction. What We Will Cover in This Section. General Model. Population. Sample

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

Statistical inference provides methods for drawing conclusions about a population from sample data.

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

Applied Statistical Analysis EDUC 6050 Week 4

Business Statistics Probability

Sheila Barron Statistics Outreach Center 2/8/2011

extraction can take place. Another problem is that the treatment for chronic diseases is sequential based upon the progression of the disease.

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010

Constructing a Bivariate Table:

CHAPTER 8 Estimating with Confidence

Study on Gender in Physics

Something to think about. What happens, however, when we have a sample with less than 30 items?

One-Way Independent ANOVA

Creative Commons Attribution-NonCommercial-Share Alike License

Stat 13, Intro. to Statistical Methods for the Life and Health Sciences.

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

Chapter 12: Analysis of covariance, ANCOVA

Never P alone: The value of estimates and confidence intervals

A point estimate is a single value that has been calculated from sample data to estimate the unknown population parameter. s Sample Standard Deviation

Lecture 12: Normal Probability Distribution or Normal Curve

* σ = The Z Test. Formulas and Symbols You Should Know. Assignment: Heiman Chapter 10. Terms You Should Know.

SECTION I Number of Questions 15 Percent of Total Grade 50

111, section 8.6 Applications of the Normal Distribution

Cult of Statistical Significance 8/03/

Instructions for doing two-sample t-test in Excel

A point estimate is a single value that has been calculated from sample data to estimate the unknown population parameter. s Sample Standard Deviation

Normal Distribution: Homework *

Comparing multiple proportions

Psychology 205, Revelle, Fall 2014 Research Methods in Psychology Mid-Term. Name:

Statistical Inference

HOW DO YOU REALLY KEEP YOUR KIDS SAFE FROM ADDICTION?

3.2 Least- Squares Regression

The Single-Sample t Test and the Paired-Samples t Test

1) What is the independent variable? What is our Dependent Variable?

Chapter 14. Inference for Regression Inference about the Model 14.1 Testing the Relationship Signi!cance Test Practice

Section I: Multiple Choice Select the best answer for each question.

FAQs about Provider Profiles on Breast Cancer Screenings (Mammography) Q: Who receives a profile on breast cancer screenings (mammograms)?

Lessons in biostatistics

Lecture 12A: Chapter 9, Section 1 Inference for Categorical Variable: Confidence Intervals

Designing an experiment 7 TH /8 TH GRADE SCIENCE

Carrying out an Empirical Project

Simple Linear Regression the model, estimation and testing

YSU Students. STATS 3743 Dr. Huang-Hwa Andy Chang Term Project 2 May 2002

Editorial Note: this manuscript has been previously reviewed at another journal that is not operating a transparent peer review scheme.

Welcome to the Offseason Hockey Training System!

USING STATCRUNCH TO CONSTRUCT CONFIDENCE INTERVALS and CALCULATE SAMPLE SIZE

Population. population. parameter. Census versus Sample. Statistic. sample. statistic. Parameter. Population. Example: Census.

AP STATISTICS 2013 SCORING GUIDELINES

14.1: Inference about the Model

Math 261 Exam I Spring Name:

Assignment 4: True or Quasi-Experiment

Practice First Midterm Exam

Frequency Distributions

Math 243 Sections , 6.1 Confidence Intervals for ˆp

Chapter 6: Confidence Intervals

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Still important ideas

LIVING BETTER WITH HEART FAILURE. starts with talking about your symptoms

t-test Tutorial Aliza McConnahey & Josh Petkash

MATH 1040 Skittles Data Project

Stay Married with the FIT Technique Go from Pissed off to Peaceful in Three Simple Steps!

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Transcription:

The Confidence Interval Finally, we can start making decisions!

Reminder The Central Limit Theorem (CLT) The mean of a random sample is a random variable whose sampling distribution can be approximated by a Normal model. The larger the sample, the better the approximation will be.

The Population of Interest Still true Sampling Model The Sample

Where the CLT is used now! We should not be surprised if 95% of various polls gave results that were near the mean but varied above and below that by no more than two standard deviations. This is what we mean by sampling error. It s not really an error at all, but just variability you d expect to see from one sample to another. A better term would be sampling variability.

As usual, 2 types of data Categorical Quantitative

Categorical First up, categorical data.

Assumptions Assumptions Independence Do we have good justification to believe samples are independent? What is it?

Conditions Randomization do we have random samples? Conditions n<10% of population (is our sample too big?) Success/Failure (is our sample too small?) np 10 AND n(1-p) 10

One- Proportion z- Interval When the conditions are met, we are ready to find the confidence interval for the population proportion, p. Interval= p ± z S. D. ( p) Where the critical value, z*, depends on the particular confidence level, C, that you want. Slide 19-9

One- Proportion z- Interval Interval= p ± z p 1 Where the critical value, z*, depends on the particular confidence level, C, that you want. n p Slide 19-10

Quantitative Next up, quantitative data

William Sealy Gosset A historical digression 1836-1937 Had a GREAT job.

What a job Guinness Brewery brewmaster in charge of quality control so Guinness had exactly the same flavor, color, alcohol content, etc.

Really, not kidding. No joke. One of the very crucial, important, and most useful models in statistics was created because of making better beer.

They take their beer very seriously in Ireland. He found that the z* formula gave him much too narrow of an interval. His model was wrong and NOT USEFUL. So he created a completely new model. The Student s-t model.

Gosset s t The Student s t-models form a whole family of related distributions that depend on a parameter known as degrees of freedom. We often denote degrees of freedom as df, and the model as t df.

The t-statistic (t-score) is calculated the same as the z- statistic (or z-score), but uses the Central Limit Theorem! The t-statistic t = x μ s n Slide 23-17

The t- distribution As the degrees of freedom increase, the t- models look more and more like the Normal. In fact, the t-model with infinite degrees of freedom is exactly Normal. Slide 23-18

Assumptions for 1 sample t interval Assumptions Independence Do we have good justification to believe samples are independent? What is it?

Conditions Randomization do we have random samples? Conditions n<10% of population when we draw w/o replacement (is our sample too big?) Is our sample Large Enough to be Nearly Normal? (n>30 usually is but graphing the info is a better check.)

When the conditions are met, we are ready to find the confidence interval for the population mean, μ. 1 sample t interval interval = x ± t n 1 S. D. x The critical value t n 1 depends on the particular confidence level, C, that you specify and on the number of degrees of freedom, n 1, which we get from the sample size.

1 sample t interval interval = x ± t n 1 s n The critical value t n 1 depends on the particular confidence level, C, that you specify and on the number of degrees of freedom, n 1, which we get from the sample size.

ENOUGH Theory, just tell us how to do the problems! Guess what the first step to doing the problems is? Yup, you guessed it: Categorical Quantitative data is how we decide which to use!

Don t P.A.N.I.C at Confidence Intervals P: Parameter: State your parameter, whether it is a mean or a proportion in context. A: Assumptions: Check all 4 of your assumptions: Check the R.I.M.M. N: Name: your Confidence Interval, do the math I: Interval: Write the answer in interval notation C: Conclusion: in context of the problem.

Categorical CI In 1998 a San Diego reproductive clinic reported 49 live births to 207 women under the age of 40 who had previously been unable to conceive, and claimed a 25% success rate. Give a 90% confidence interval for the success rate at this clinic, and determine if their claim is justified. P: The question gives a proportion, p, of live births, with p=49/207 =.2367 A: It is reasonable to assume that the conceptions are independent but not random, but I will assume the births are representative of the population. n=207 < 10% of all such conceptions in San Diego per year, and np=207*.2367=49 >10. nq =207*.7633 =158 > 10.

Categorical CI In 1998 a San Diego reproductive clinic reported 49 live births to 207 women under the age of 40 who had previously been unable to conceive, and claimed a 25% success rate. Give a 90% confidence interval for the success rate at this clinic, and determine if their claim is justified. N: Therefore, a 1 proportion z-interval may be used. p ± z p(1 p) n I: (.188,.285) = 49 207 ± 1.645 49 207 158 207 207 =.2367 ±.048596

Categorical CI In 1998 a San Diego reproductive clinic reported 49 live births to 207 women under the age of 40 who had previously been unable to conceive, and claimed a 25% success rate. Give a 90% confidence interval for the success rate at this clinic, and determine if their claim is justified. C: About 90% of random samples of size 207 will capture the true proportion of women under 40 who will conceive in this clinic. The interval is (.188,.285), and since the claim of 25% (.25) is within the interval the data does not refute their claim. Therefore, their claim is justified.

Quantitative CI A sample of Alzheimer's patients are tested to assess the amount of time in stage IV sleep. It has been hypothesized that individuals suffering from Alzheimer's Disease may spend less time per night in the deeper stages of sleep. Number of minutes spent is Stage IV sleep is recorded for sixty-one patients. The sample produced a mean of 48 minutes (S=14 minutes) of stage IV sleep over a 24 hour period of time with a normal distribution. Compute a 95 percent confidence interval for this data. What does this information tell you about a particular individual's (an Alzheimer's patient) stage IV sleep? P: The parameter of interest is the minutes of stage IV sleep of Alzheimer s patients; mean = 48 minutes. A: There is reason to believe that the sleep of one patient is independent of the others, but the sample is not random but volunteer. We can assume this sample is representative of the population to compensate. n<10% of the population, because there are more than 610 Alzheimer s patients, and normality is given.

Quantitative CI A sample of Alzheimer's patients are tested to assess the amount of time in stage IV sleep. It has been hypothesized that individuals suffering from Alzheimer's Disease may spend less time per night in the deeper stages of sleep. Number of minutes spent is Stage IV sleep is recorded for sixty-one patients. The sample produced a mean of 48 minutes (S=14 minutes) of stage IV sleep over a 24 hour period of time with a normal distribution. Compute a 95 percent confidence interval for this data. What does this information tell you about a particular individual's (an Alzheimer's patient) stage IV sleep? N: This is a one sample t-interval I: x ± t σ = 48 ± 1.671 14 n 61 48 ± 3.585 Interval = (44.415, 51.585) = 48 ± 2.000 1.79252 =

Quantitative CI A sample of Alzheimer's patients are tested to assess the amount of time in stage IV sleep. It has been hypothesized that individuals suffering from Alzheimer's Disease may spend less time per night in the deeper stages of sleep. Number of minutes spent is Stage IV sleep is recorded for sixty-one patients. The sample produced a mean of 48 minutes (S=14 minutes) of stage IV sleep over a 24 hour period of time with a normal distribution. Compute a 95 percent confidence interval for this data. What does this information tell you about a particular individual's (an Alzheimer's patient) stage IV sleep? C: About 95% of random samples of size 61 will capture the true mean minutes of type IV sleep of Alzheimer s patients. The interval is (44.415, 51.585), which means 95% of the time, samples of size 61 will capture the true mean between 44.415 and 51.585 minutes of type IV sleep.