e.com/watch?v=hz1f yhvojr4 e.com/watch?v=kmy xd6qeass

Similar documents
NEED A SAMPLE SIZE? How to work with your friendly biostatistician!!!

Fundamental Clinical Trial Design

Statistical Essentials in Interpreting Clinical Trials Stuart J. Pocock, PhD

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

Clinical Trials. Susan G. Fisher, Ph.D. Dept. of Clinical Sciences

BIOSTATISTICAL METHODS

Lesson 11.1: The Alpha Value

How to CRITICALLY APPRAISE

95% 2.5% 2.5% +2SD 95% of data will 95% be within of data will 1.96 be within standard deviations 1.96 of sample mean

Power & Sample Size. Dr. Andrea Benedetti

Overview of Clinical Study Design Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine US National

Questions for the Table Group

SAMPLING AND SAMPLE SIZE

III. WHAT ANSWERS DO YOU EXPECT?

Political Science 15, Winter 2014 Final Review

Bios 6648: Design & conduct of clinical research

CLINICAL PROTOCOL DEVELOPMENT

Statistics for Clinical Trials: Basics of Phase III Trial Design

Living well today...32 Hope for tomorrow...32

Methods for Determining Random Sample Size

Biostatistics 3. Developed by Pfizer. March 2018

How do Categories Work?

PROSTATE CANCER SCREENING SHARED DECISION MAKING VIDEO

Chapter 12. The One- Sample

Journal Club. 1. Develop a PICO (Population, Intervention, Comparison, Outcome) question for this study

What Constitutes a Good Contribution to the Literature (Body of Knowledge)?

Lesson 9: Two Factor ANOVAS

Sample Size, Power and Sampling Methods

Statistical Considerations for Research Design. Analytic Goal. Analytic Process

NEW SEX THERAPY: ACTIVE TREATMENT OF SEXUAL DYSFUNCTIONS BY HELEN SINGER KAPLAN

Crossing boundaries between disciplines: A perspective on Basil Bernstein s legacy

Viewpoints on Setting Clinical Trial Futility Criteria

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

Roles of Non-HDL Cholesterol in Risk Assessment and Treatment

This is a repository copy of Practical guide to sample size calculations: superiority trials.

Design and analysis of clinical trials

State of Connecticut Department of Education Division of Teaching and Learning Programs and Services Bureau of Special Education

Phase II Design. Kim N. Chi NCIC/NCIC-CTG NEW INVESTIGATORS CLINICAL TRIALS COURSE

Full title: A likelihood-based approach to early stopping in single arm phase II cancer clinical trials

Visit Names

Bayesian and Frequentist Approaches

INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ

European Federation of Statisticians in the Pharmaceutical Industry (EFSPI)

Audio: In this lecture we are going to address psychology as a science. Slide #2

Sample Size Considerations. Todd Alonzo, PhD

Do Now. Complete the KWL chart answering the following question: WHAT IS SCIENCE? The last column should be left blank.

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013

Challenges in the Design and Analysis of Non-Inferiority Trials: A Case Study. Key Points from FDA Guidance

10/9/2018. Ways to Measure Variables. Three Common Types of Measures. Scales of Measurement

Introduction to Clinical Trials - Day 1 Session 2 - Screening Studies

Top 10 Tips for Successful Searching ASMS 2003

Power of a Clinical Study

1 The conceptual underpinnings of statistical power

The Impact of Erectile Dysfunction on Partners of Men with ED

Methodological aspects of non-inferiority and equivalence trials

For more information about how to cite these materials visit

UNIT 5 - Association Causation, Effect Modification and Validity

SAMPLE SIZE AND POWER

How to Interpret a Clinical Trial Result

Survival Skills for Researchers. Study Design

UNIT. Experiments and the Common Cold. Biology. Unit Description. Unit Requirements

After Adrenal Cancer Treatment

Flat Belly Fix Review

Adaptive designs: When are they sensible?

New Cleveland Catholic Diocese ministry reaches out to those who have loved ones who are incarcerated

Making comparisons. Previous sessions looked at how to describe a single group of subjects However, we are often interested in comparing two groups

Free Data! Using Publicly- Available Data for Your Research Project

Experimental Research in HCI. Alma Leora Culén University of Oslo, Department of Informatics, Design

Chapter 8. Learning Objectives 9/10/2012. Research Principles and Evidence Based Practice

t-test for r Copyright 2000 Tom Malloy. All rights reserved

The! Lie Detection Cheat Sheet!

Herpes Zoster Vaccination: New Recommendations for Shingles Prevention - Frankly Speaking EP 50

A Critical View of JUPITER

A Helping Model of Problem Solving

7 PRINCIPLES TO QUIT SMOKING FOREVER

Biost 524: Design of Medical Studies

Planning Sample Size for Randomized Evaluations.

appstats26.notebook April 17, 2015

This is a repository copy of Practical guide to sample size calculations: non-inferiority and equivalence trials.

After Soft Tissue Sarcoma Treatment

Critical Appraisal. Dave Abbott Senior Medicines Information Pharmacist

Conversations: Let s Talk About Bladder Cancer

Clever Hans the horse could do simple math and spell out the answers to simple questions. He wasn t always correct, but he was most of the time.

Gary Duhon, PhD, Professor of School Psychology OSU

Statistical Considerations in Pilot Studies

Sheila Barron Statistics Outreach Center 2/8/2011

Economics 2010a. Fall Lecture 11. Edward L. Glaeser

Two-sample Categorical data: Measuring association

Is there any way you might be better off if you quit? What happens when you think about it? What do you imagine will happen if you don t change?

Statin Intolerance: Keys to Patient Counseling and Guidance

ADVICE TO PATIENTS REQUESTING PSA MEASUREMENT FREQUENTLY-ASKED QUESTIONS

UNEQUAL CELL SIZES DO MATTER

CLINICAL DECISION USING AN ARTICLE ABOUT TREATMENT JOSEFINA S. ISIDRO LAPENA MD, MFM, FPAFP PROFESSOR, UPCM

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

DON M. PALLAIS, CPA 14 Dahlgren Road Richmond, Virginia Telephone: (804) Fax: (804)

BY BRUCE ECKER COHERENCE THERAPY PRACTICE MANUAL AND TRAINING GUIDE [SPIRAL-BOUND] BY BRUCE ECKER, LAUREL HULLEY

Stat Wk 9: Hypothesis Tests and Analysis

SECOND TRADITION SKIT

9 research designs likely for PSYC 2100

Ultrasound: Improving Breast Cancer Detection

Transcription:

What you need to know before talking to your statistician about sample size Sharon D. Yeatts, Ph.D. Associate Professor of Biostatistics Data Coordination Unit Department of Public Health Sciences Medical University of South Carolina yeatts@musc.edu

Disclosures PRISMS (A Phase 3b, Double-blind, Multicenter Study to Evaluate the Efficacy and Safety of Alteplase in Patients with Mild Stroke: Rapidly Improving Symptoms and Minor Neurologic Deficits) I receive consultant fees from Genentech for my role on the Steering Committee. This presentation will not include information on unlabeled use of any commercial products or investigational use that is not yet approved for any purpose.

Conversations between Biostatisticians and Scientists https://www.youtub e.com/watch?v=hz1f yhvojr4 https://www.youtub e.com/watch?v=kmy xd6qeass

Disclaimer This talk is geared towards the design of a superiority trial: a two arm, randomized trial, where the intention is to show that one treatment is better than the other with regard to the primary outcome where the deliverable is the pvalue from a hypothesis test There are many other trial designs, where the objective of the trial might be something other than a demonstration of superiority (futility, non-inferiority, etc) the endgame might be something other than a hypothesis test (estimate of precision, selection, dose-finding) certain factors described here will be more/less/not relevant at all

Why Worry about Power/Sample Size? Provides assurance that the trial has a reasonable probability of being conclusive Allows one to determine the sample size necessary, so that resources are efficiently allocated Ethical Issues Study too large implies some subjects needlessly exposed, resources needlessly spent Study too small implies potential for misleading conclusions, unnecessary experimentation

Factors in Determining Sample Size Level of significance α (pre-specified) Power (targeted) Minimum Scientifically Important Difference Variability in the response Experimental Design

First things first What s the research question? Should be stated in the form of testable hypotheses. What s the population? What s the response variable? What are you comparing? At what timepoint?

2 states of nature: 2 hypotheses Alternative hypothesis (H A ) The statement we hope to be able to conclude (ie, the intervention affects outcome). The statement about a population that is true if the null hypothesis is not true. Null hypothesis (H 0 ) Nothing unusual is occurring (ie, the intervention does not affect outcome). The statement we hope to contradict with data.

Proof by Contradiction We conceptualize nature as being in one state or the other. Null hypothesis Alternative hypothesis We collect data from the real world. If the data clearly contradict the null hypothesis, we trust the data and conclude that the alternative must be true. If random noise, measurement error or chance can account for observations, then there is no need to formulate a more complicated explanation.

Four Possible Outcomes True State of the Null Hypothesis Decision Reject H 0 (and conclude H A, that the intervention works) FTR H 0 (there is insufficient evidence to conclude H A, that the intervention works) H 0 TRUE (intervention doesn t work) Type I error Level of Significance α H 0 FALSE (intervention works) Power Type II error β Related to Power

Level of Significance α Interpretation: Even when the null hypothesis is true, we re going to conclude the alternative α(100)% of the time. Prespecified in trial design How often are you willing to accept a false positive? One-sided α 0.10 One-sided α 0.05 One-sided α 0.01

Power The probability that a statistical test will reject H 0 when H 0 is false Power=1-β Interpretation: For a given value under the alternative hypothesis, we re going to reject the null in favor of the alternative (1-β)100% of the time. Related to the sample size

Impact on Power Level of Significance One-sided α 0.10 One-sided α 0.05 One-sided α 0.01

Minimum Scientifically Important Difference the smallest difference which would mandate, in the absence of serious side effects and/or excessive cost, a change in scientific practice/ understanding Larger the difference, smaller the sample size

Variability Is the outcome continuous or categorical? Continuous Need estimate of standard deviation/variance Dichotomous Need estimate of control rate

Variability Larger the difference, smaller the sample size ignores contribution of variability

Example: Continuous Outcome Phase 2 clinical trial in subjects with acute ICH Intervention intended to impact health-related quality of life, as assessed by NeuroQOL Assume MCID 0.5 units

Example: Binary Outcome Phase 2 clinical trial in subjects with acute ICH Intervention intended to impact functional independence (mrs 0-2) Assume MCID absolute 10%

Example: Binary Outcome Phase 2 clinical trial in subjects with acute ICH What if we want to show that the effect is less than 10%, in order to discard the intervention if it is futile?

Before asking about sample size be prepared to talk about Level of significance α (set) Power (target) Expected variability in response based on relevant clinical literature Better yet, a range of plausible values Minimum Scientifically Important Difference what s the smallest difference which will change practice? If the sample size proves to make the trial not feasible, there s room for compromise. Experimental Design

Experimental Design Considerations Controls Do you need a concurrent control arm? Can you make use of historical controls? Can subjects serve as their own control? Are there multiple questions which can be answered in the same design? Is a hypothesis test the best way to achieve your goal? Dose-finding Selection Sample size calculation depends on the method of analysis

Justifying Sample Size How many subjects? Is the sample size adequate for testing the hypothesis? How was the sample size determined? Do we need more? Can we answer questions with fewer?

Sample Size: Keeping It Small Study continuous rather than binary outcome (if variability does not increase) change in tumor size instead of recurrence Study surrogate outcome where effect is large cholesterol reduction rather than mortality

Sample Size: Calculation Calculate N specify difference to be detected specify variability (continuous) or control % Specify target power OR Calculate detectable difference: specify N specify target power specify variability (continuous) or control %

Sample Size Software Freeware on the Web http://www.stat.uiowa.edu/~rlenth/power/ http://hedwig.mgh.harvard.edu/sample_size/size.html http://www.bio.ri.ccf.org/power.html R Purchase Software http://www.powerandprecision.com/ Nquery: www.statsolusa.com PASS Statistical collaborators will have lots of options available to them.

Sample Size Software From the Iowa website: I receive quite a few questions that start with something like this: "I'm not much of a stats person, but I tried [details...] am I doing it right?" Please compare this with: "I don't know much about heart surgery, but my wife is suffering from... and I plan to operate... can you advise me?" Folks, just because you can plug numbers into a program doesn't change the fact that if you don't know what you're doing, you're almost guaranteed to get meaningless results -- if not dangerously misleading ones. Statistics really is like rocket science; it isn't easy, even to us who have studied it for a long time. Anybody who think it's easy surely lacks a deep enough knowledge to understand why it isn't! If your scientific integrity matters, and statistics is a mystery to you, then you need expert help. Find a statistician in your company or at a nearby university, and talk to her face-to-face if possible. It may well cost money. It's worth it.

Before asking about sample size be prepared to talk about Level of significance α (set) Power (target) Expected variability in response based on relevant clinical literature Better yet, a range of plausible values Minimum Scientifically Important Difference what s the smallest difference which will change practice? If the sample size proves to make the trial not feasible, there s room for compromise. Experimental Design Logistics

Logistics Anticipated recruitment Treatment crossovers Protocol nonadherence LTFU/consent withdrawal

Famous Last Words "You can't fix by analysis what you bungled by design." - Light, Singer and Willett To call in the statistician after the experiment is done may be no more than asking him to perform a postmortem examination: he may be able to say what the experiment died of. - RA Fisher