Estimating Means with Confidence

Similar documents
Intro to Scientific Analysis (BIO 100) THE t-test. Plant Height (m)

Chapter 23 Summary Inferences about Means

Chapter 18 - Inference about Means

Chapter 7 - Hypothesis Tests Applied to Means

Distribution of sample means. Estimation

Chapter 7 - Hypothesis Tests Applied to Means

ANALYZING ECOLOGICAL DATA

Confidence Intervals and Point Estimation

Chapter 8 Student Lecture Notes 8-1

Chapter 21. Recall from previous chapters: Statistical Thinking. Chapter What Is a Confidence Interval? Review: empirical rule

Estimation and Confidence Intervals

Statistics 11 Lecture 18 Sampling Distributions (Chapter 6-2, 6-3) 1. Definitions again

Objectives. Sampling Distributions. Overview. Learning Objectives. Statistical Inference. Distribution of Sample Mean. Central Limit Theorem

Statistics for Managers Using Microsoft Excel Chapter 7 Confidence Interval Estimation

5/7/2014. Standard Error. The Sampling Distribution of the Sample Mean. Example: How Much Do Mean Sales Vary From Week to Week?

Chapter 8 Descriptive Statistics

Recall, general format for all sampling distributions in Ch. 9: The sampling distribution of the sample statistic is approximately normal, with:

Concepts Module 7: Comparing Datasets and Comparing a Dataset with a Standard

Statistics Lecture 13 Sampling Distributions (Chapter 18) fe1. Definitions again

Recall, general format for all sampling distributions in Ch. 9:

Lecture 18b: Practice problems for Two-Sample Hypothesis Test of Means

Lecture Outline. BIOST 514/517 Biostatistics I / Applied Biostatistics I. Paradigm of Statistics. Inferential Statistic.

Measures of Spread: Standard Deviation

Sec 7.6 Inferences & Conclusions From Data Central Limit Theorem

CHAPTER 8 ANSWERS. Copyright 2012 Pearson Education, Inc. Publishing as Addison-Wesley

Sampling Distributions and Confidence Intervals

Review for Chapter 9

How is the President Doing? Sampling Distribution for the Mean. Now we move toward inference. Bush Approval Ratings, Week of July 7, 2003

Standard deviation The formula for the best estimate of the population standard deviation from a sample is:

Objectives. Types of Statistical Inference. Statistical Inference. Chapter 19 Confidence intervals: Estimating with confidence

23.3 Sampling Distributions

Maths skills. for biologists Biology. Planning field investigations. Planning field investigations. Asking ecological questions

Statistical Analysis and Graphing

Technical Assistance Document Algebra I Standard of Learning A.9

Caribbean Examinations Council Secondary Education Certificate School Based Assessment Additional Math Project

Practical Basics of Statistical Analysis

Appendix C: Concepts in Statistics

EDEXCEL NATIONAL CERTIFICATE UNIT 28 FURTHER MATHEMATICS FOR TECHNICIANS OUTCOME 1- ALGEBRAIC TECHNIQUES TUTORIAL 3 - STATISTICAL TECHNIQUES

Two Data sets. Variability. Data Example with the range. Issues with the range. Central Tendency tells part of the story

2014 International Journal of Medical Science Research and Practice available on

Measuring Dispersion

Sample Size Determination

REVIEW for Exam 2. Chapters 9 13 (& chi-square in ch8)

A Generalized Difference-cum-Ratio Type Estimator for the Population Variance in Double Sampling

DISTRIBUTION AND PROPERTIES OF SPERMATOZOA IN DIFFERENT FRACTIONS OF SPLIT EJACULATES*

Calibration Approach based Estimation of Finite Population Total under Two Stage Sampling

GOALS. Describing Data: Numerical Measures. Why a Numeric Approach? Concepts & Goals. Characteristics of the Mean. Graphic of the Arithmetic Mean

A Supplement to Improved Likelihood Inferences for Weibull Regression Model by Yan Shen and Zhenlin Yang

CCXCIII. VITAMIN A DETERMINATION: RELA- AND PHYSICAL METHODS OF TEST. TION BETWEEN THE BIOLOGICAL, CHEMICAL

5.1 Description of characteristics of population Bivariate analysis Stratified analysis

Methodology National Sports Survey SUMMARY

Modified Early Warning Score Effect in the ICU Patient Population

Simple intervention to improve detection of hepatitis B and hepatitis C in general practice

Teacher Manual Module 3: Let s eat healthy

talking about Men s Health...

Autism Awareness Education. April 2018

Introduction. The Journal of Nutrition Methodology and Mathematical Modeling

Chapter - 8 BLOOD PRESSURE CONTROL AND DYSLIPIDAEMIA IN PATIENTS ON DIALYSIS

Project Title: A1C and Diabetic Control

GSK Medicine Study Number: Title: Rationale: Study Period: Objectives: Primary Secondary Indication: Study Investigators/Centers: Research Methods

What are minimal important changes for asthma measures in a clinical trial?

STATISTICAL ANALYSIS & ASTHMATIC PATIENTS IN SULAIMANIYAH GOVERNORATE IN THE TUBER-CLOSES CENTER

Study No.: Title: Rationale: Phase: Study Period: Study Design: Centres: Indication: Treatment: Objectives: Primary Outcome/Efficacy Variable:

INTERNATIONAL STUDENT TIMETABLE SYDNEY CAMPUS

Bayesian Sequential Estimation of Proportion of Orthopedic Surgery of Type 2 Diabetic Patients Among Different Age Groups A Case Study of Government

APPROVAL REQUIRED. By approving this proof you are confirming that the contact information is correct.

So... we make an error when we estimate

Estimating Income Variances by Probability Sampling: A Case Study

Since the early 1930s until the early 2000s, Quantitative scoring of an interferon-c assay for differentiating active from latent tuberculosis

Reporting Checklist for Nature Neuroscience

Training with inspiratory pressure support in patients with severe COPD

Retention in HIV care among a commercially insured population,

Event detection. Biosignal processing, S Autumn 2017

Visual Acuity Screening of Children 6 Months to 3 Years of Age

Ch 9 In-class Notes: Cell Reproduction

JUST THE MATHS UNIT NUMBER STATISTICS 3 (Measures of dispersion (or scatter)) A.J.Hobson

Should We Care How Long to Publish? Investigating the Correlation between Publishing Delay and Journal Impact Factor 1

Lecture 4: Distribution of the Mean of Random Variables

QUANTITATIVE STUDIES ON THE CILIATE GLAUCOMA

NIH Public Access Author Manuscript J Low Genit Tract Dis. Author manuscript; available in PMC 2014 April 01.

Infertility and subfecundity in population-based samples from Denmark, Germany, Italy, Poland and Spain

(4) n + 1. n+1. (1) 2n 1 (2) 2 (3) n 1 2 (1) 1 (2) 3 (1) 23 (2) 25 (3) 27 (4) 30

Plantar Pressure Difference: Decision Criteria of Motor Relearning Feedback Insole for Hemiplegic Patients

Determination of the Sampling Technique to Estimate the Performance at Grade Five Scholarship Examination

A longitudinal study of self-assessment accuracy

Epilepsy and Family Dynamics

Ovarian Cancer Survival

Automatic reasoning evaluation in diet management based on an Italian cookbook

Drug use in Ireland and Northern Ireland

Basic Requirements. of meeting cow herd production and profitability goals for the beef cattle enterprise.

The value of population campaigns offering free-of-charge HIV-testing: Observational study in a town in Cameroon

Repeatability of the Glaucoma Hemifield Test in Automated Perimetry

On The Use of Coefficient of Variation and 1, 2

Spinal Flexibility and Individual Factors That Influence It

COMPARISON OF A NEW MICROCRYSTALLINE

What is a file system Link? Hard & Symbolic Links. What is a file system Link? What is a file system Link? Murray Saul Seneca College

IMPAIRED THEOPHYLLINE CLEARANCE IN PATIENTS WITH COR PULMONALE

Primary: To assess the change on the subject s quality of life between diagnosis and the first 3 months of treatment.

GSK Medicine: Study Number: Title: Rationale: Study Period: Objectives: Indication: Study Investigators/Centers: Research Methods:

Ida Leida M.Thaha, Mega Marindrawati Rochka 1, Muh. Syafar 2

Transcription:

Today: Chapter, cofidece iterval for mea Aoucemet Ueful ummary table: Samplig ditributio: p. 353 Cofidece iterval: p. 439 Hypothei tet: p. 534 Homework aiged today ad Wed, due Friday. Fial exam eat aigmet will be et oo. Homework: (Due Fri March 5) Chapter : #30bc, 48, 86*. *Ue R Commader for #86. Data file liked to webite. Both 48 ad 86 cout double, poit each. Chapter Copyright 0 Brook/Cole, Cegage Learig Etimatig Mea with Cofidece Recall: A parameter i a populatio characteritic value i uually ukow. We etimate the parameter uig ample iformatio. Chapter : C.I. for mea A tatitic, or etimate, i a characteritic of a ample. A tatitic etimate a parameter. A cofidece iterval i a iterval of value computed from ample data that i likely to iclude the true populatio value. The cofidece level for a iterval decribe our cofidece i the procedure we ued. We are cofidet that mot of the cofidece iterval we compute uig our procedure will iclude the true populatio value. 3 Copyright 0 Brook/Cole, Cegage Learig, updated by J. Utt, March 03 4 Recall from Chapter 0 A cofidece iterval or iterval etimate for ay of the five parameter ca be expreed a Sample etimate ± multiplier tadard error where the multiplier i a umber baed o the cofidece level deired ad determied from the tadard ormal (z) ditributio (for proportio) or Studet t-ditributio (for mea). Sample etimate = ample tatitic. 5 Three Etimatio Situatio Ivolvig Mea Situatio. Mea of a quatitative variable. Example: What i the mea umber of facebook fried UCI tudet have (for thoe o facebook)? What i the mea umber of word a -year old kow? Populatio parameter: (pelled mu ad proouced mew ) = populatio mea for the variable Sample etimate: x = ample mea for the variable, baed o a ample of ize. 6

Etimatig the Populatio Mea of Paired Differece Situatio. Data meaured i pair, take differece, etimate the mea of the populatio of differece: What i the mea differece i blood preure before ad after learig meditatio? (d i = differece for pero i) What i the mea differece i hour/day pet tudyig ad pet watchig televiio for college tudet? Populatio parameter: d (called mu d) Sample etimate: d = the ample mea for the differece, baed o a ample of pair, where the differece i computed for each pair. Differece i two mea Situatio 3. Etimatig the differece betwee two populatio mea for idepedet ample Example: How much differece i there i the mea of what male tudet ad female tudet expect to ear a a tartig alary after graduatio? (Quetio o 0 cla urvey.) How much differece i there i the mea IQ for childre whoe mom moked ad did t durig pregacy? Populatio parameter: = differece betwee the two populatio mea Sample etimate: x x = differece betwee the two ample mea Thi require idepedet ample 7 8 Recall from Chapter 0 A cofidece iterval or iterval etimate for ay of the five parameter ca be expreed a Sample etimate ± multiplier tadard error Stadard error (i geeral) Rough Defiitio: The tadard error of a ample tatitic meaure, roughly, the average differece betwee the tatitic ad the populatio parameter. Thi average differece i over all poible radom ample of a give ize that ca be take from the populatio. Techical Defiitio: The tadard error of a ample tatitic i the etimated tadard deviatio of the amplig ditributio for the tatitic. 9 0 Situatio : Stadard Error of the Mea d.( x ) = I practice: We do t kow o we etimate it uig Replacig with i the tadard deviatio expreio give u a etimate that i called the tadard error of x. x ) = Chapter 9 weight lo example: 5 = 5 weight loe, σ = 5; d..( x) = poud 5 Suppoe ample tadard deviatio i = 4.74 poud So the tadard error of the mea i 4.74/5 = 0.948 poud Icreaig the Size of the Sample Suppoe we take = 00 weight loe itead of jut 5. The tadard deviatio of the mea would be 5 d.( x ) = 0. 5 poud 00 For ample of = 5, ample mea are likely to rage betwee 8 ± 3 poud => 5 to poud For ample of = 00, ample mea are likely to rage oly betwee 8 ±.5 poud => 6.5 to 9.5 poud Larger ample ted to reult i more accurate etimate of populatio value tha maller ample

Stadard Error of a Sample Mea x), ample tadard deviatio Example: Mea umber of Facebook fried Cla Survey, Witer 0: Oly thoe o Facebook. About how may Facebook fried do you have? Miitab provide e., R Commader doe t, but provide Statitic Summarie Numerical ummarie, check Stadard Deviatio Miitab for the example: Variable N Mea Media StDev SE Mea Facebook 57 46.0 404.0 30. 8.8 30. e.. x 8.8 57 3 Situatio : Stadard Error of the mea of paired differece d d ) where d = ample tadard deviatio for the differece Example: How much taller (or horter) are daughter tha their mother thee day? d = 3.4 (for idividual) = 93 pair (daughter mother) d =.3 iche d 3.4 d. 33 e. 93 4 Situatio 3: Stadard Error of the Differece Betwee Two Sample Mea (upooled) x x) Example.3 Loe More Weight by Diet or Exercie? Study: = 4 me o diet, = 47 me o exercie routie Diet: Lot a average of 7. kg with td dev of 3.7 kg Exercie: Lot a average of 4.0 kg with td dev of 3.9 kg So, x x 7. 4.0 3. kg 3.7 3.9 ad x x) 4 47 0.8 Recall from Chapter 0 A cofidece iterval or iterval etimate for ay of the five parameter ca be expreed a Sample etimate ± multiplier tadard error The multiplier i a umber baed o the cofidece level deired ad determied from the tadard ormal ditributio (for proportio) or Studet t- ditributio (for mea). 5 6 Studet t-ditributio: Replacig with Dilemma: we geerally do t kow. Uig we have: x x ( x ) t x) / If the ample ize i mall, thi tadardized tatitic will ot have a N(0,) ditributio but rather a t-ditributio with degree of freedom (df). NOTE: Ue t* for all 3 ituatio ivolvig mea, but Fidig the t-multiplier Excel: See page 4. R Commader: Ditributio Cotiuou ditributio t ditributio t quatile Example: 95% CI for mea whe = 0 Probabilitie: α/ (for 95%, ue.05) Degree of freedom ( = 0, o df = 9) Lower tail Give egative of the t-multiplier Ex:.05, 9, lower tail.657, multiplier.6 Table A. (ee page 4 for itructio) Table A. i o page 670 or tur page iide back cover (eay to ue compared to z!) differet df formula for two idepedet ample 7 8 3

Example: df = 9 95% cofidece t* =.6 Cofidece Iterval for Oe Mea or Paired Data A Cofidece Iterval for a Populatio Mea * * x t e.. x x t where the multiplier t* i the value i a t-ditributio with degree of freedom = df = - uch that the area betwee -t* ad t* equal the deired cofidece level. (Foud from Excel, R Commader or Table A..) Coditio: Populatio of meauremet i bell-haped (o major kewe or outlier) ad r. of ay ize > ; OR Populatio of meauremet i ot bell-haped, but a large radom ample i meaured, 30. etc 9 0 95% C.I. for Mea Aticipated Startig Salary Data from 0 urvey, what do you expect your tartig alary to be after you graduate? (or after grad/prof chool?) = 44 (Some outlier at $00K, $50K, $500K) Sample mea = $63,075 Sample tadard deviatio = $46,607 46,607 Stadard error of the mea = 44,984 Multiplier = t* with df of 00 =.98 (cloet i Table A.) Sample etimate ± multiplier tadard error 63,075 ±.98 984 63,075 ± 5908 $57,67 to $68,983 C.I. for ome other urvey Q Variable N Mea StDev SE Mea 95% CI Facebook 57 46.0 30. 8.8 (45.0, 499.0) Icome00 60 353 797 446 ( 65, 440) StudetLoa 4 759 33973 84 (37, 83) HourStudy 64 5.36 4.03 0.59 (4.85, 5.87) Note the extremely large tadard deviatio for all of thee. Obviouly they are ot bell-haped variable! Example: Hitogram for tudy hour Frequecy Hitogram of HourStudy 00 80 60 40 0 0 6 8 4 30 36 HourStudy 3 Paired Data Cofidece Iterval Data: two variable for idividual or pair; ue the differece d = x x. Populatio parameter: d = mea of differece for the populatio (ame a ). Sample etimate: d = ample mea of the differece Stadard deviatio ad tadard error: d = tadard deviatio of the ample of differece; d e. d Cofidece iterval for d : d t * e. d, where df = for the multiplier t*. 4 4

Fid 90% C.I. for differece: (daughter mother) height differece How much taller (or horter) are daughter tha their mother thee day? = 93 pair (daughter mother), d =.3 iche d = 3.4 iche, o d 3.4 e. d. 33 93 Multiplier = t* with 9 df for 90% C.I. =.66 (ue df=90) Sample etimate ± multiplier tadard error.3 ±.66 0.33.3 ± 0.55 0.75 to.85 iche (doe ot cover 0) Cofidece iterval iterpretatio We are 95% cofidet that the mea tudy hour per week for Stat 7, for all tudet over all time (who would complete a urvey??) i betwee 4.85 ad 5.87 hour We are 90% cofidet that the mea height differece betwee female college tudet ad their mother i betwee 0.75 ad.85 iche, with tudet beig taller tha their mother 5 6 Example: Small ample, o check for outlier Data: Hour pet tudyig for thoe tudet who atted cla 0 or time a week; = 0 tudet Create a 95% CI for tudy hour for tudet who do t atted cla Small, o check for kewe ad outlier HourStudy 6 5 4 3 Boxplot of HourStudy Note: Boxplot how o obviou kewe ad o outlier 7 Example, cotiued (tudy hour) Reult:.89 x 3.7,.89, ad e.. x 0.60 0 Multiplier t* from Table A. with df = 9 i t* =.6 95% Cofidece Iterval: 3.7.6(0.6) => 3.7.36 =>.34 to 5.06 hour Iterpretatio: We are 95% cofidet that the mea of the tudy hour per week for Stat 7 for tudet who do t atted cla (ad are repreeted by thi ample) i covered by the iterval from.34 to 5.06 hour per week. (Compare to 95% C.I. for everyoe, 4.85 to 5.87 hour) 8.4 CI for Differece Betwee Two Mea (Idepedet ample) A CI for the Differece Betwee Two Mea (Idepedet Sample, upooled cae): * x x t where t* i the value i a t-ditributio with area betwee -t* ad t* equal to the deired cofidece level. Neceary Coditio Two ample mut be idepedet. Either Populatio of meauremet both bell-haped, ad radom ample of ay ize are meaured. or Large ( 30) radom ample are meaured. 9 30 5

Degree of Freedom The t-ditributio i oly approximately correct ad df formula i complicated (Welch approx): Statitical oftware ca ue the above approximatio, but if doe by had the ue a coervative df = maller of ( ) ad ( ). 3 Example: Aticipated Startig Salary for Me/Wome Two-ample T for StartSalary (Miitab output) Group N Mea StDev SE Mea Me 87 69356 44937 488 Wome 56 5677 3485 5 Differece = mu (Me) - mu (Wome) Etimate for differece: 585, df = 33 95% CI for differece: (830, 3340) (44937) (3485) x x t,585.98 87 56 * Iterpretatio: We are 95% certai that the mea aticipated tartig alary for me i betwee $830 ad $3,340 higher the the mea aticipated tartig alary for wome, for the populatio of tudet repreeted by thi 3 ample. Approximate 95% CI For ufficietly large ample, the iterval Sample etimate Stadard error i a approximate 95% cofidece iterval for a populatio parameter. Note: Except for very mall degree of freedom, the multiplier t* for 95% cofidece iterval i cloe to. So, i ofte ued to approximate, rather tha fidig degree of freedom. For itace, for 95% C.I.: t*(30) =.04, t*(60) =.00, t*(90) =.99, t*( ) = z* =.96 33 Example.3 Diet v Exercie Study: = 4 me o diet, = 47 me exercie Diet: Lot a average of 7. kg with td dev of 3.7 kg Exercie: Lot a average of 4.0 kg with td dev of 3.9 kg So, x x 7. 4.0 3. kg ad x x) Approximate 95% Cofidece Iterval: 3. (.8) => 3..6 =>.58 to 4.8 kg 0.8kg Note: We are 95% cofidet the iterval.58 to 4.8 kg cover the additioal mea weight lo for dieter compared to thoe who exercied. The iterval doe ot cover 0, o a real differece i likely to hold for the populatio a well. 34 Uig R Commader Doe tet ad C.I. i ame tep Read i or eter data et Statitic Mea Sigle ample t-tet Idepedet ample t-tet (require data i oe colum, ad group code i aother) Paired t-tet (require the origial data i two eparate colum) Example: Compare tudy hour for thoe who drik ad thoe who do t Data New data et give ame, eter data Oe colum for Drik or Do t drik, oe for Study hour Statitic Mea Idepedet ample t-tet Chooe the alterative (, >, <) ad cofidece level Welch Two Sample t-tet data: HourStudy by Drik t =.993, df = 70.556, p-value = 0.050 alterative hypothei: true differece i mea i ot equal to 0 95 percet cofidece iterval: -0.0065343 3.45080754 ample etimate: mea i group Do't drik mea i group Drik 6.500000 4.775735 35 36 6

Should check for outlier if mall ample() Graph Boxplot Plot by Group Thee are large ample, fortuately, becaue very kewed! But till look like thoe who drik have fewer tudy hour Cofidece iterval applet: Illutrate the ame cocept a the had-o team project lat Friday. http://www.romachace.com/applet/cofim/cofim.html http://www.romachace.com/applet/newcofim/cofim.html 37 7