Comparing Means among Two (or More) Independent Populations. John McGready Johns Hopkins University

Similar documents
Comparing Proportions between Two Independent Populations. John McGready Johns Hopkins University

Confounding and Effect Modification. John McGready Johns Hopkins University

Section B. Variability in the Normal Distribution: Calculating Normal Scores

Comparing Proportions between Two Independent Populations. John McGready Johns Hopkins University

Section G. Comparing Means between More than Two Independent Populations

Section F. Measures of Association: Risk Difference, Relative Risk, and the Odds Ratio

Section D. Another Non-Randomized Study Design: The Case-Control Design

Section C. Sample Size Determination for Comparing Two Proportions

Describing Data: Part I. John McGready Johns Hopkins University

Statistics for Psychosocial Research Lecturer: William Eaton

Study Designs. Lecture 3. Kevin Frick

Proposal Writing. Lynda Burton, ScD Johns Hopkins University

[Source: Freedman, Pisani, Purves (1998) Statistics, 3rd ed., Ch 2.]

Proteins and Amino Acids. Benjamin Caballero, MD, PhD Johns Hopkins University

The Impact of Pandemic Influenza on Public Health

Experimental design. Basic principles

How to Choose the Wrong Model. Scott L. Zeger Department of Biostatistics Johns Hopkins Bloomberg School

Lecture 3: Digestion and Dieting Physiology

Section H. Case Study: Risk Assessment Issues Associated with Chloroform

Health Issues for Aging Populations

Alcohol. Gordon Smith, MD (MB, ChB Otago), MPH Professor University of Maryland National Study Center for Trauma and EMS

Critical Analysis of Popular Diets and Supplements

The Nutrition Transition

Section B. Self-Regulation, Attention, and Problem Solving

Evaluation and Assessment of Health Policy. Gerard F. Anderson, PhD Johns Hopkins University

Section D. Genes whose Mutation can lead to Initiation

Epidemiology of Mood Disorders II: Analytic Epidemiology and the Search for Etiologic Clues. William W. Eaton, PhD Johns Hopkins University

Epidemiologic Methods for Evaluating Screening Programs. Rosa M. Crum, MD, MHS Johns Hopkins University

Severe Mental Disorders. Etheldreda Nakimuli-Mpungu, MMed (Psych), MBChB Johns Hopkins University

Economic Evaluation. Introduction to Economic Evaluation

Epidemiology of Diarrheal Diseases. Robert Black, MD, MPH Johns Hopkins University

Justice in Research Ethics: International Issues. Nancy E. Kass, ScD Johns Hopkins University

STD Epidemiology. Jonathan Zenilman, MD Johns Hopkins University

Section C. Biomarkers of Early Biological Effect and Altered Structure and Function

En & IH Engineering Aspects of Public Health Crises. Quiz 1

Defining Mental Disorders. Judy Bass, MPH, PhD Johns Hopkins University

TB Epidemiology. Richard E. Chaisson, MD Johns Hopkins University Center for Tuberculosis Research

Things you need to know about the Normal Distribution. How to use your statistical calculator to calculate The mean The SD of a set of data points.

Social Capital. Ginny Tedrow. Health Behavior Change at the Individual, Household and Community Levels

Monitoring and Evaluation IN ACTION! Duff Gillespie

Master Class: Fundamentals of Lung Cancer

One way Analysis of Variance (ANOVA)

Chapter 8 Estimating with Confidence

Introduction, Nosology, and History. William W. Eaton, PhD Johns Hopkins University

STI Prevention: Housekeeping and How We Use Public Health. M. Terry Hogan, MPH Johns Hopkins University

Costing in Vaccine Planning and Programming

Breast Cancer Basics. Clinical Oncology for Public Health Professionals. Ben Ho Park, MD, PhD

New Drug Targets. Sean T. Prigge, PhD

Changing People s Behavior. Larry Wissow Professor Health, Behavior and Society Johns Hopkins School of Public Health

Creative Commons Attribution-NonCommercial-Share Alike License

Meanings of medications, metonymy. Peter Winch. Health Behavior Change at the Individual, Household and Community Levels

Measurement of Psychopathology in Populations. William W. Eaton, PhD Johns Hopkins University

Michael J. McQuestion, PhD, MPH Johns Hopkins University. Measurement Concepts and Introduction to Problem Solving

Economic Evaluation. Defining the Scope of a Costeffectiveness

Summarizing Data. (Ch 1.1, 1.3, , 2.4.3, 2.5)

Post Procedural Care

SAMPLE ASSESSMENT TASKS MATHEMATICS ESSENTIAL GENERAL YEAR 11

Salt Iodization in Haiti: Challenges to Improving Salt Production Quality and Recommendations for Pursuing Iodization

CHAPTER 8 Estimating with Confidence

CRITICAL APPRAISAL SKILLS PROGRAMME Making sense of evidence about clinical effectiveness. 11 questions to help you make sense of case control study

Learning from nature for a healthier start Staged infant formulas with Lacprodan mimic the changing protein profile of breast milk

Section C. Intrinsic and Extrinsic Influences on Infant and Early Childhood Regulatory Development

2018 OPTIONS FOR INDIVIDUAL MEASURES: REGISTRY ONLY. MEASURE TYPE: Outcome

Importance of Cell Stock Concentration for Accurate Target Cell Recovery

111, section 8.6 Applications of the Normal Distribution

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.

Psychology 2019 v1.3. IA2 high-level annotated sample response. Student experiment (20%) August Assessment objectives

Previously, when making inferences about the population mean,, we were assuming the following simple conditions:

GUIDANCE FOR SAMPLING ART CLINICS IN COUNTRIES COMBINING SURVEILLANCE OF PRE-TREATMENT HIV DRUG RESISTANCE AND ACQUIRED HIV DRUG RESISTANCE AT 12 AND

Reflection Questions for Math 58B

Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies

Quantitative Methods in Computing Education Research (A brief overview tips and techniques)

LECTURE 27 ENERGY IN THE BODY

Essential Skills for Evidence-based Practice: Statistics for Therapy Questions

Module 28 - Estimating a Population Mean (1 of 3)

2017 OPTIONS FOR INDIVIDUAL MEASURES: REGISTRY ONLY. MEASURE TYPE: Process

An Introduction to Bayesian Statistics

Students were asked to report how far (in miles) they each live from school. The following distances were recorded. 1 Zane Jackson 0.

Sheila Barron Statistics Outreach Center 2/8/2011

2019 COLLECTION TYPE: MIPS CLINICAL QUALITY MEASURES (CQMS) MEASURE TYPE: Patient Reported Outcome High Priority

2018 OPTIONS FOR INDIVIDUAL MEASURES: REGISTRY ONLY. MEASURE TYPE: Process

Table of Contents. Plots. Essential Statistics for Nursing Research 1/12/2017

DENOMINATOR: All patients undergoing anterior or apical pelvic organ prolapse (POP) surgery

Don t overdo it! Weighing and weight maintenance

Don t overdo it! Weighing and weight maintenance

New Paradigms of Type 2 Diabetes. From:

Measure #265: Biopsy Follow-Up National Quality Strategy Domain: Communication and Care Coordination

RAG Rating Indicator Values

1. Find your BMR (basal metabolic rate) formula: 2. Calculating your TDEE (total daily energy expenditure):

ANALYSIS OF BODY FAT AMONG AMRAVATI CITY ADOLESCENT SCHOOL BOYS

Hypothesis testing: comparig means

Introduction. We can make a prediction about Y i based on X i by setting a threshold value T, and predicting Y i = 1 when X i > T.

2019 COLLECTION TYPE: MIPS CLINICAL QUALITY MEASURES (CQMS) MEASURE TYPE: Patient Reported Outcome High Priority

Conditional Distributions and the Bivariate Normal Distribution. James H. Steiger

Basic Biostatistics. Chapter 1. Content

Sample Size Considerations. Todd Alonzo, PhD

Measure #265: Biopsy Follow-Up National Quality Strategy Domain: Communication and Care Coordination

Pre-Lab #7: Nutrition

Individual-level models of health behavior. Caitlin Kennedy. Health Behavior Change at the Individual, Household and Community Levels 224.

Transcription:

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this site. Copyright 2009, The Johns Hopkins University and John McGready. All rights reserved. Use of these materials permitted only in accordance with license rights granted. Materials provided AS IS ; no representations or warranties provided. User assumes all responsibility for use, and all liability related thereto, and must independently review all materials for accuracy and efficacy. May contain materials owned by others. User is responsible for obtaining permissions for use from third parties as needed.

Comparing Means among Two (or More) Independent Populations John McGready Johns Hopkins University

Lecture Topics CIs for mean difference between two independent populations Two sample t-test Non-parametric alternative, Mann Whitney (FYI, optional) Comparing means amongst more than two independent populations: ANOVA 3

Section A Two Sample t-test: The Resulting Confidence Interval

Comparing Two Independent Groups A Low Carbohydrate as Compared with a Low Fat Diet in Severe Obesity * 132 severely obese subjects randomized to one of two diet groups Subjects followed for a six month period At the end of study period Subjects on the low-carbohydrate diet lost more weight than those on a low fat diet (95% confidence interval for the difference in weight loss between groups, -1.6 to -6.2 kg; p <.01) Source: * Samaha, F., et al. A low-carbohydrate as compared with a low-fat diet in severe obesity, New England Journal of Medicine, 348: 21. 5

Comparing Two Independent Groups: Diet Types Study Scientific question Is weight change associated with diet type? Low-Carb Diet Group Low-Fat Number of subjects (n) 64 68 Mean weight change (kg) Post-diet less pre-diet Standard deviation of weight changes (kg) -5.7-1.8 8.6 3.9 6

Diet Type and Weight Change 95% CIs for weight change by diet group Low Carb: Low Fat: 7

Comparing Two Independent Groups: Diet Types Study In statistical terms, is there a non-zero difference in the average weight change for the subjects on the low-fat diet as compared to subjects on the low-carbohydrate diet? 95% CIs for each diet group mean weight change do not overlap, but how do you quantify for the difference? The comparison of interest is not paired There are different subjects in each diet group For each subject a change in weight (after diet before weight) was computed However, the authors compared the changes in weight between two independent groups! 8

Comparing Two Independent Groups How do we calculate Confidence interval for difference? p-value to determine if the difference in two groups is significant? Since we have large samples (both greater than 60) we know the sampling distributions of the sample means in both groups are approximately normal It turns out the difference of quantities, which are (approximately) normally distributed, are also normally distributed 9

Sampling Distribution: Difference in Sample Means So, the big news is... The sampling distribution of the difference of two sample means, each based on large samples, approximates a normal distribution This sampling distribution is centered at the true mean difference, µ 1 - µ 2 10

Simulated Sampling Dist n of Sample Mean Weight Loss Simulated sampling distribution of sample mean weight change: low carbohydrate diet group 11

Simulated Sampling Dist n of Sample Mean Weight Loss Simulated sampling distribution of sample mean weight change: low fat diet group 12

Simulated Sampling Dist n of Sample Mean Weight Loss Simulated sampling distribution of sample mean weight change: low fat diet group 13

Simulated Sampling Dist n of Sample Mean Weight Loss Side by side boxplots 14

95% Confidence Interval for Difference in Means Our most general formula The best estimate of a population mean difference based on sample means: Here, may represent the sample mean weight loss for the 64 subjects on the low carbohydrate diet, and the mean weight less for the 68 subjects on the low fat diet 15

95% CI for Difference in Means: Diet Types Study So, : hence the formula for the 95% CI for µ 1 - µ 2 is: Where = standard error of the difference of two sample means 16

Two Independent (Unpaired) Groups The standard error of the difference for two independent samples is calculated differently than we did for paired designs With paired design we reduced data on two samples to one set of differences between two groups Statisticians have developed formulas for the standard error of the difference These formulas depend on sample sizes in both groups and standard deviations in both groups The is greater than either or Why do you think this is? 17

Principle Variation from independent sources can be added Why do you think this is additive Of course, we don t know σ 1 and σ 2 : so we estimate with s1 and s2 to get an estimated standard error: 18

Comparing Two Independent Groups: Diet Types Study Recall the data from the weight change/diet type study Low-Carb Diet Group Low-Fat Number of subjects (n) 64 68 Mean weight change (kg) Post-diet less pre-diet Standard deviation of weight changes (kg) -5.7-1.8 8.6 3.9 19

95% CI for Difference in Means: Diet Types Study So in this example, the estimated 95% for the true mean difference in weight between the low-carbohydrate and low-fat diet groups is: 20

From Article Subjects on the low-carbohydrate diet lost more weight than those on a low fat diet (95% confidence interval for the difference in weight loss between groups, -1.6 to -6.2 kg; p<.01) So those on the low carb diet lost more on average by 3.9 kg: after accounting for sampling variability this excess average loss over the low-fat diet group could be as small as 1.6 kg or as large as 6.2 kg This confidence interval does not include 0, suggesting a real population level association between type of diet (low-carb or low-fat) and weight loss 21