Sample Size, Power and Sampling Methods

Similar documents
Statistical Power Sampling Design and sample Size Determination

Sampling for Impact Evaluation. Maria Jones 24 June 2015 ieconnect Impact Evaluation Workshop Rio de Janeiro, Brazil June 22-25, 2015

Chapter Three: Sampling Methods

You can t fix by analysis what you bungled by design. Fancy analysis can t fix a poorly designed study.

Chapter 3. Producing Data

Chapter 02. Basic Research Methodology

Planning Sample Size for Randomized Evaluations.

Unit 1 Exploring and Understanding Data

AP Statistics Exam Review: Strand 2: Sampling and Experimentation Date:

BIOSTATISTICS. Dr. Hamza Aduraidi

aps/stone U0 d14 review d2 teacher notes 9/14/17 obj: review Opener: I have- who has

SAMPLING AND SAMPLE SIZE

Sampling and Power Calculations East Asia Regional Impact Evaluation Workshop Seoul, South Korea

Choose a fertility clinic: update

Fundamental Clinical Trial Design

Vocabulary. Bias. Blinding. Block. Cluster sample

Study Guide for the Final Exam

Variable Data univariate data set bivariate data set multivariate data set categorical qualitative numerical quantitative

Abdul Latif Jameel Poverty Action Lab Executive Training: Evaluating Social Programs Spring 2009

04/12/2014. Research Methods in Psychology. Chapter 6: Independent Groups Designs. What is your ideas? Testing

Design of Experiments & Introduction to Research

Where does "analysis" enter the experimental process?

Chapter 5: Producing Data

Sample Size Considerations. Todd Alonzo, PhD

Sample size and sampling for quantitative research A. Bowling; Research Methods in Health(2002): pp

MATH-134. Experimental Design

Experimental Design There is no recovery from poorly collected data!

Unit 3: Collecting Data. Observational Study Experimental Study Sampling Bias Types of Sampling

Survival Skills for Researchers. Study Design

Chapter 2. The Data Analysis Process and Collecting Data Sensibly. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

Observation Studies, Sampling Designs and Bias

Outline. Chapter 3: Random Sampling, Probability, and the Binomial Distribution. Some Data: The Value of Statistical Consulting

Statistical Sampling: An Overview for Criminal Justice Researchers April 28, 2016

STA 291 Lecture 4 Jan 26, 2010

Population. population. parameter. Census versus Sample. Statistic. sample. statistic. Parameter. Population. Example: Census.

Study Design STUDY DESIGN CASE SERIES AND CROSS-SECTIONAL STUDY DESIGN

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha

Chapter Three Research Methodology

UNIT I SAMPLING AND EXPERIMENTATION: PLANNING AND CONDUCTING A STUDY (Chapter 4)

INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ

Probability Models for Sampling

Statistical inference provides methods for drawing conclusions about a population from sample data.

I. Introduction and Data Collection B. Sampling. 1. Bias. In this section Bias Random Sampling Sampling Error

REVIEW FOR THE PREVIOUS LECTURE

Math 140 Introductory Statistics

Clinical Research Scientific Writing. K. A. Koram NMIMR

Section 6.1 Sampling. Population each element (or person) from the set of observations that can be made (entire group)

Sheila Barron Statistics Outreach Center 2/8/2011

Math 124: Modules 3 and 4. Sampling. Designing. Studies. Studies. Experimental Studies Surveys. Math 124: Modules 3 and 4. Sampling.

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

Chapter 3. Producing Data

Review Sheet. 2) Which branch of science broken into chemistry and physics? a. Life science b. Earth science c. Biology d.

Collecting & Making Sense of

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

CHAPTER OBJECTIVES - STUDENTS SHOULD BE ABLE TO:

Quantitative survey methods

In this second module in the clinical trials series, we will focus on design considerations for Phase III clinical trials. Phase III clinical trials

CHAPTER 3 METHOD AND PROCEDURE

Class 1. b. Sampling a total of 100 Californians, where individuals are randomly selected from each major ethnic group.

Sampling. (James Madison University) January 9, / 13

Design, Sampling, and Probability

The Logic of Data Analysis Using Statistical Techniques M. E. Swisher, 2016

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

How to Randomise? (Randomisation Design)

e.com/watch?v=hz1f yhvojr4 e.com/watch?v=kmy xd6qeass

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when.

Biostatistics. Donna Kritz-Silverstein, Ph.D. Professor Department of Family & Preventive Medicine University of California, San Diego

Sampling Reminders about content and communications:

Designing Experiments... Or how many times and ways can I screw that up?!?

Lecture Start

Survey Research Methodology

Research Questions, Variables, and Hypotheses: Part 2. Review. Hypotheses RCS /7/04. What are research questions? What are variables?

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1

P. 266 #9, 11. p. 289 # 4, 6 11, 14, 17

Chapter 1 Data Types and Data Collection. Brian Habing Department of Statistics University of South Carolina. Outline

Experimental Research in HCI. Alma Leora Culén University of Oslo, Department of Informatics, Design

VALIDITY OF QUANTITATIVE RESEARCH

CHAPTER 5: PRODUCING DATA

Collecting & Making Sense of

Chapter 13 Summary Experiments and Observational Studies

Our goal in this section is to explain a few more concepts about experiments. Don t be concerned with the details.

Power & Sample Size. Dr. Andrea Benedetti

Variables Research involves trying to determine the relationship between two or more variables.

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Lab 2: The Scientific Method. Summary

Introduction to statistics Dr Alvin Vista, ACER Bangkok, 14-18, Sept. 2015

Psychology Research Process

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

CHL 5225 H Advanced Statistical Methods for Clinical Trials. CHL 5225 H The Language of Clinical Trials

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

THIS CHAPTER COVERS: The importance of sampling. Populations, sampling frames, and samples. Qualities of a good sample.

Research Methodology. Characteristics of Observations. Variables 10/18/2016. Week Most important know what is being observed.

Introduction to Program Evaluation

BIOSTATISTICS. Mahmoud Al Hussami, PhD., DSc. Associate Professor of Epidemiology

Section 6.1 Sampling. Population each element (or person) from the set of observations that can be made (entire group)

NEW JERSEY RESIDENTS DON T KNOW OR LIKE MUCH ABOUT COMMON CORE

Math 124: Module 3 and Module 4

Research Design. Beyond Randomized Control Trials. Jody Worley, Ph.D. College of Arts & Sciences Human Relations

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

Transcription:

Sample Size, Power and Sampling Methods Mary Ann McBurnie, PhD Senior Investigator, Kaiser Permanente Center for Health Research Steering Committee Chair, Community Health Applied Research Network (CHARN) Jonathan N. Tobin, PhD Professor, Albert Einstein College of Medicine of Yeshiva University President/CEO CLINICAL DIRECTORS NETWORK, INC. (CDN)

Acknowledgment Material in this presentation was developed as part of the curriculum for the international Methods in Epidemiologic, Clinical and Operations Research (MECOR) program sponsored by the American Thoracic Society (ATS) Designed for physicians and health care professionals Intended to strengthen capacity and leadership in research related to respiratory conditions, critical care and sleep medicine in middle and low income countries

Testing a hypothesis We want to make inferences about a population from a sample. i.e., we have a hypothesis we want to test We need to choose the number of observations to include in a study sample. The larger the sample, the greater the power of the statistical test => Sample size determines statistical power

Testing a hypothesis What is (statistical) power? It is the probability that we will observe an intervention effect (based on data from our sample) when an intervention effect actually exists.

Testing a hypothesis: Example Say we want to test a text messaging intervention to remind diabetes patients to take their meds on schedule, which should result in lower HbA1c levels. We randomly assign patients to the text message intervention or to the usual care control groups. Define the mean difference between HbA1c scores in the intervention and non intervention groups as our effect size: diff = tx no tx diff is the effect size

Testing a hypothesis: Example We specify mutually exclusive null and alternative hypotheses : Null hypothesis: there is no difference in mean HbA1c between patients receiving the text messages and those not: H 0 : diff = 0 Alternative (2-sided) hypothesis : mean HbA1c differs between patients who do and do not get the text intervention: H A : diff = 0

The truth vs what we observe What is the truth? Either the intervention is effective or it isn t (we don t know which) What do we observe (we only do the study in a sample of the intended population) Either we will observe an effect on HbA1c level when we analyze the data from our sample or we won t We want to optimize our chances of: Observing an effect in our data if the truth is that the intervention really works. NOT observing an effect if the truth is that the intervention really doesn t work

What we see (in our sample) Testing a Hypothesis The Truth (unknown to us) Our text intervention is: Effective Not Effective We observe a difference in HbA1c We don t observe a difference in HbA1c OK error! error! OK

Testing a hypothesis Let s think about these possibilities in terms of probabilities

What we see (in our sample) Testing a Hypothesis The Truth (unknown to us) Our text intervention: Works Doesn t Work We observe a difference in HbA1c We don t observe a difference in HbA1c Probability of deciding there is an effect when there really is one Probability of deciding there is no effect, when there really is one Probability of deciding there is an effect when there really isn t one Probability of deciding there is no effect when there really isn t one

What we see (in our sample) Testing a Hypothesis The Truth (unknown to us) Our text intervention: Works Doesn t Work We observe a difference in HbA1c We don t observe a difference in HbA1c Power = 1- Type II error = Type I error = 1-

What we see (in our sample) Testing a Hypothesis The Truth (unknown to us) Our text intervention: Works Doesn t Work We observe a difference in HbA1c We don t observe a difference in HbA1c Power = 1- Usually >.80 Type II error = Usually <.20 Type I error = Usually <.05 1- Usually >.95

Testing a Hypothesis Setting =.05 is traditional (and arbitrary) i.e., we accept a 5/100 (or 1 in 20) chance of making type I error ( i.e., deciding that there IS an effect when there really ISN T) Setting power =.80, (i.e. =.20) means we expect to have at least an 80% chance of detecting an effect when there really is one.

Testing a Hypothesis So if we want to have 80% power and a 5% significance level how many patients do we need?

Sample Size (N) depends on Type I error = Power = 1- Effect size = Estimate of expected treatment effect, diff tx no tx Estimate of variability in treatment effect, Will therapy affect everyone to the same degree or does the effect vary substantially, affecting some patients a lot and others very little?

Sample Size Some common sample size applications assume that your data are normally distributed

A simple sample size calculation: 2n = 4(Z 1- + Z 1- ) 2

±1 SD, coverage=68.26% ±2 SDs, coverage=95.46% ±1.96 SDs, coverage=95% ±3 SDs, coverage=99.73% Normal Distribution 1SD 2SD 3SD

Distributions for Ho and Ha1 Ho: =0 Ha1: >0 Ho=0 Ha1 > 0 β α α: Type I error β: Type II error = 0 >0 Must choose cut point corresponding to : Z EnCoRE Presentation, Feb 17, 2015

Increase in Sample Size Tightens the Distributions Decreases the Overlap Both α and β are decreased =0 >0 Z

Trade-offs between sample size, power and effect size If we want to increase power, say from 80% to 90%, we must increase sample size If we want to detect a smaller effect size, say a 10% increase in survival instead of a 20% increase, we must increase sample size

What is β? Quiz Type II error: The probability of failing to reject the null hypothesis when it is false What is α? Type I error: The probability of rejecting the null hypothesis when it is true What is 1-β? Power: the probability of accepting the alternative hypothsis when it is true.

Resources for power and sample size calculations http://davidmlane.com/hyperstat/power.html http://homepage.stat.uiowa.edu/~rlenth/power/

Sampling Methods

Sampling Methods for selecting subjects for study e.g., people, organizations, trees, etc.) from a population of interest. Often we want to generalize results of study to the larger population.

Sampling Theoretical population vs accessible population Usually not feasible to sample full population. => Sample some members of a theoretical population as representative of the entire population Multi-step process Identify population of interest Identify accessible population Develop sampling frame Draw sample Contact/recruit subjects/participants

Types of sampling Convenience sampling = identify a group of people you can get to "conveniently Examples: hospital staff, market women, senior housing No "formal" sampling frame is used Not able to calculate confidence intervals Useful for many purposes Probability sampling = random selection SRS ( simple random sample ) Stratified sample CS ( cluster sample )

Random Sampling Involves creation of a "Sampling frame" (list of members of group to be sampled). Assures equal probability of selection for all members of the population. Estimates will be "unbiased" (precise statistical meaning: average of multiple samples will have population mean) Note: random does not mean "haphazard"

Simple Random Sampling (SRS) SRS Process: Develop sampling frame: List accessible population of N subjects from which n subjects will be drawn (e.g., all individuals/hhs). Use random process, e.g. random number table, to generate "n" numbers between 1 and N. Identify "n" individuals in sample corresponding to the "n" numbers generated. Examples: phonebook - random digit dialing.

Simple Random Sampling (SRS) Advantages: Most basic form of sampling The gold standard to which other methods are compared. Disadvantages: All individuals/hh must be identified prior to sampling May be unrealistic - time, money. Selected individuals/hhs may be highly dispersed. Visiting each may be very time consuming

Stratified Random Sampling Divide population in homogeneous (mutually exclusive) subgroups (i.e., strata ) and do SRS within each group Advantages Enough cases from each strata to make meaningful inferences on key subgroups (e.g., small minority groups). Generally have more statistical precision than SRS if the strata are homogeneous: => the variability within-groups is lower than the variability for the population as a whole. => smaller sample sizes Examples of common strata: age, race, gender, educational attainment level, socioeconomic status, rare traits, diseases or conditions

Cluster Sampling (CS) CS Process Construct sampling frame using groups or clusters of individuals (or HHs) without identifying or listing each one. Cluster = villages, towns, districts, urban blocks, etc. List clusters Take a random sample of the clusters Obtain list of individuals/hhs only for those clusters selected in the sample Sample a random sample of individuals/hhs within each selected cluster

Advantages: Cluster Sampling (CS) Economy: listing costs/travel costs. Feasibility: most countries will have lists of population by groups (villages, towns) Allows a larger sample size than SRS due to decreased costs for same resources (money, person-time), it s possible to gain greater precision with cluster sampling Disadvantages: Estimate not as precise as SRS for the same sample size. Design Effect = Variance Cluster Sampling/Variance SRS. Factor by which to increase CS sample size to obtain same precision as for SRS Example: WHO Expanded Programme on Immunization (EPI) sampling method.

Cluster Sampling (CS) Considerations Budget transport costs can be high, esp. in rural areas Travel/logistics Define center/periphery of urban areas. How to account for appartments vs houses need to establish unambiguous methods, eliminate personal choice of researchers Number of surveys in given area per day. Supplies Technical Can estimate optimal cluster size if you know:» transport costs to each cluster» cost for interviewing each respondent»roh

Questions?

Thank You!

How sample size affects the power to detect differences 1 0.8 0.6 0.4 0.2 0 0 1 2 3 4 5 6 7 8 9 10 Difference in Population Means 560 600 800 1200 12000 n

How δ affects the relationship between power and sample size 1 0.8 0.6 0.4 0.2 0 9 17 6 4 2 0 200 400 600 800 1000 Sample size

How variability affects the relationship between power and sample size 1 0.8 0.6 0.4 0.2 0 0 200 400 600 800 1000 Sample size 30 39 58 70