How to Make Sense of Statistics Reported in the Medical Literature

Similar documents
Outline. What is Evidence-Based Practice? EVIDENCE-BASED PRACTICE. What EBP is Not:

Updates in Therapeutics 2015: The Pharmacotherapy Preparatory Review & Recertification Course

Critical Appraisal of Evidence

Interpretation of Epidemiologic Studies

Quantifying the Benefits and Harms of Interventions: Relative vs. Absolute Risk

Biosta's'cs Board Review. Parul Chaudhri, DO Family Medicine Faculty Development Fellow, UPMC St Margaret March 5, 2016

Disclosures No relationships (not even to an employer) No off-label uses. Cholesterol Lowering Guidelines: What now?

8/18/10. Disclosure. Objectives. A Pervasive Concern. Evidence Based Medicine, Family Physicians, and Knowledge Translation (KATIE)

Level 2: Critical Appraisal of Research Evidence

Critical Appraisal of Evidence A Focus on Intervention/Treatment Studies

EBM: Therapy. Thunyarat Anothaisintawee, M.D., Ph.D. Department of Family Medicine, Ramathibodi Hospital, Mahidol University

GLOSSARY OF GENERAL TERMS

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013

Critical Appraisal Series

CVD risk assessment using risk scores in primary and secondary prevention

the standard deviation (SD) is a measure of how much dispersion exists from the mean SD = square root (variance)

John J.P. Kastelein MD PhD Professor of Medicine Dept. of Vascular Medicine Academic Medial Center / University of Amsterdam

Critical Appraisal of a Meta-Analysis: Rosiglitazone and CV Death. Debra Moy Faculty of Pharmacy University of Toronto

Do the sample size assumptions for a trial. addressing the following question: Among couples with unexplained infertility does

Critical Appraisal. Dave Abbott Senior Medicines Information Pharmacist

EBM, Study Design and Numbers. David Frankfurter, MD Professor OB/GYN The George Washington University

CLINICAL DECISION USING AN ARTICLE ABOUT TREATMENT JOSEFINA S. ISIDRO LAPENA MD, MFM, FPAFP PROFESSOR, UPCM

Common Statistical Issues in Biomedical Research

Comparison of Performance in Biostatistics and Epidemiology across USMLE Steps 1, 2, and 3 D Swanson, W Ouyang K Holtzman, M Johnson, and S Haist

Understanding Statistics for Research Staff!

Placebo-Controlled Statin Trials EXPLAINING THE DECREASE IN DEATHS FROM CHD! PREVENTION OF CARDIOVASCULAR DISEASE IN WOMEN EXPLAINING THE DECREASE IN

Who cares about fractures! is more important. October 3, 2014 CSIM Workshop Brian Wirzba, MD, FRCPC, FACP Clinical Professor Grey Nuns Hospital

Expressions and illustra0ons of treatment effect. Ka$e Program Dalhousie University

Welcome to the 19th Annual wow, almost 20 years Drug Therapy Decision Making Course

Journal Club: Fairfax Internal Medicine. Stephanie Shin PGY-2 Bianca Ummat PGY-2 July 27, 2012

Understanding NNT- Patient s and Physicians Perspective

Intro to Evidence Based Medicine

Global Clinical Trials Innovation Summit Berlin October 2016

Confounding and Interaction

Branko N Huisa M.D. Assistant Professor of Neurology UNM Stroke Center

Sultan Qaboos University Hospital Medicines Information Course. Assessment

An Introduction to Epidemiology

THERAPY WORKSHEET: page 1 of 2 adapted from Sackett 1996

Evidence Based Medicine

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

SESSION 3 11 AM 12:30 PM

Epidemiology and Prevention of Stroke

The Royal College of Pathologists Journal article evaluation questions

OVERVIEW WOMEN S HEALTH: YEAR IN REVIEW

Introduction to research methods and critical appraisal WEEK 11

Assignment #6. Chapter 10: 14, 15 Chapter 11: 14, 18. Due tomorrow Nov. 6 th by 2pm in your TA s homework box

The Healthy User Effect: Ubiquitous and Uncontrollable S. R. Majumdar, MD MPH FRCPC FACP

CRITICAL APPRAISAL WORKSHEET

Dawn Matherne Meyer PhD,RN,FNP-C. Assistant Professor University of California San Diego

JAMA. 2011;305(24): Nora A. Kalagi, MSc

INTERPRETATION OF STUDY FINDINGS: PART I. Julie E. Buring, ScD Harvard School of Public Health Boston, MA

CARDIOVASCULAR HEALTH LIVING A HEART HEALTHY LIFE. Matt Handley MD Group Health Physicians

JUPITER NEJM Poll. Panel Discussion: Literature that Should Have an Impact on our Practice: The JUPITER Study

Is there enough evidence for DAPT after endovascular intervention for PAOD?

Introduction to systematic reviews/metaanalysis

Learning Objectives 9/9/2013. Hypothesis Testing. Conflicts of Interest. Descriptive statistics: Numerical methods Measures of Central Tendency

9/4/2013. Decision Errors. Hypothesis Testing. Conflicts of Interest. Descriptive statistics: Numerical methods Measures of Central Tendency

Principles of meta-analysis

SESSION 5 2:20 3:35 PM

Critical Appraisal of RCT

Biostatistics 2 - Correlation and Risk

Hypertension Guidelines: Lessons for Primary Care. Paul A James MD Professor and Chair Department of Family Medicine University of Washington

Epidemiology & Evidence Based Medicine. Patrick Linden & Matthew McCall

Kathryn M. Rexrode, MD, MPH. Assistant Professor. Division of Preventive Medicine Brigham and Women s s Hospital Harvard Medical School

La terapia antiaggregante nel paziente con stroke

Placebo-Controlled Statin Trials

Dyslipidemia: Lots of Good Evidence, Less Good Interpretation.

LDL Cholesterol Lowering with Evolocumab and Outcomes in Patients with Peripheral Artery Disease: Insights from the FOURIER Trial

Case Presentation. Rafael Bitzur The Bert W Strassburger Lipid Center Sheba Medical Center Tel Hashomer

Understanding Risk Factors for Stroke

Reducing CVD globally through combination approaches to prevention: the polypill. Salim Yusuf

The Engaged Patient: Understanding and Participating in Shared Health Decisions

Lies, damn lies, & clinical trials. Paul Young

Presentation Outline. Introduction to Biomedical Research Designs. EBM: A Practical Definition. Grade the Evidence (Example McMaster Grading System)

Overview. Goals of Interpretation. Methodology. Reasons to Read and Evaluate

Should we prescribe aspirin and statins to all subjects over 65? (Or even all over 55?) Terje R.Pedersen Oslo University Hospital Oslo, Norway

How to CRITICALLY APPRAISE

CEA or CAS for asymptomatic carotid stenosis which patients benefit most?

Evidence Based Medicine

MMS/Mass Coalition Program, Nov. 4, 2008 Patients with AF: Who Should be on Warfarin?

Evaluating you relationships

Do Women Benefit From Statins for Primary Prevention?: Controversy, Challenges and Consensus

Glossary of Practical Epidemiology Concepts

#s in EVERYDAY LIFE OUTLINE Class 3: Medical and Health Studies Ricki Lewis

Lipid Panel Management Refresher Course for the Family Physician

Introduction to Clinical Trials - Day 1 Session 2 - Screening Studies

FIRST DO NO HARM: AN EBM ODYSSEY. Cathleen Colón-Emeric, MD, MHS Geriatrics, Department of Medicine Durham VA GRECC

Risk Study. Section for Clinical Epidemiology and Biostatistics. Definition

To evaluate a single epidemiological article we need to know and discuss the methods used in the underlying study.

Bias and confounding. Mads Kamper-Jørgensen, associate professor, Section of Social Medicine

Update in Hypertension

Providing High Value Cost-Conscious Care:

To open a CMA file > Download and Save file Start CMA Open file from within CMA

Is it worth offering cardiovascular disease prevention to the elderly? Prof. Dr. Helmut Gohlke Herz-Zentrum Bad Krozingen, Germany

Critical Review Form Therapy

Epidemiologic Study Designs. (RCTs)

This article is the second in a series in which I

MANAGEMENT OF HYPERTENSION: TREATMENT THRESHOLDS AND MEDICATION SELECTION

Garvan Ins)tute Biosta)s)cal Workshop 7/5/2015. Tuan V. Nguyen. Garvan Ins)tute of Medical Research Sydney, Australia

Types of Data. Systematic Reviews: Data Synthesis Professor Jodie Dodd 4/12/2014. Acknowledgements: Emily Bain Australasian Cochrane Centre

Transcription:

How to Make Sense of Statistics Reported in the Medical Literature Colin P. West, MD, PhD Division of General Internal Medicine Division of Biomedical Statistics and Informatics Denise M. Dupras, MD, PhD Division of Primary Care Internal Medicine

Disclosures Dr. West has no known conflicts of interest to disclose. Dr. Dupras has no known conflicts of interest to disclose.

Session Objectives Highlight several common issues in published research that lead to misinterpretation of study findings. Promote the EBM principle of enlightened skepticism when reading medical literature. Challenge the dogma that EBM and medical statistics are dull by having fun and telling several bad jokes.

Tip #1: Clinical vs Statistical Significance Background Statistical significance UNLIKELY to occur by chance Is this a real effect? Not = important or meaningful Clinical significance Is this effect important or meaningful?

Tip #1: Clinical vs Statistical Significance Outcomes If you can measure it, you can ask if a difference is statistically significant Specific outcomes impact clinical significance Change in systolic BP versus stroke Visual analog scale for pain

Tip #1: Clinical vs Statistical Significance Example Effect of statins on carotid intimal thickness Meta-analysis 10 studies, 3443 subjects Results Beneficial effect of statins (p<0.00001) -0.02235 (-0.02656, -0.01614) mm/y Kang et al. Effects of statin therapy on the progression of carotid atherosclerosis: a systematic review and metaanalysis. Atherosclerosis. 2004 Dec; 177(2): 433-42.

Tip #1: Clinical vs Statistical Significance Questions to ask yourself: Would the reported effect matter to an individual patient? Would the reported effect change public health if it were applied to a larger group of patients?

Tip #2: Absolute vs Relative Risk Background Differences between 2 groups Absolute = actual difference Example Treatment group 10% Placebo group 30% Absolute risk reduction = 20% Number needed to Treat = 1/ARR NNT = 5

Tip #2: Absolute vs Relative Risk Background Relative = change in one group compared to another group ( ) Example Treatment group 10% Placebo group 30% Relative risk = 1/3 or 33% Relative risk reduction = 67%

Tip #2: Absolute vs Relative Risk An Example Drug Placebo Events 10 50 No Event 90 50 % Event 10% 50% ARR 40% NNT 1/0.4 2.5 RRR 40%/50% 80% Event 10 50 No Event 990 950 % Event 1% 5% ARR 4% NNT 1/0.04 250 RRR 4%/5% 80%

Tip #2: Absolute vs Relative Risk An Example Drug Placebo Events 10 50 No Event 90 50 % Event 10% 50% ARR 40% NNT 1/0.4 2.5 RRR 40%/50% 80% Event 10 50 No Event 9990 9950 % Event 0.1% 0.5% ARR 0.4% NNT 1/0.004 250 RRR 0.4%/0.5% 80%

Tip #2: Absolute vs Relative Risk What do pharmaceutical ads report? What is reported on the news? RRR

Tip #3: P Values and Confidence Intervals Background What is a p value anyway? The probability of rejecting the null hypothesis (typically that two treatments do not differ) IF IN FACT THEY DO NOT DIFFER. It is NOT the probability that the study finding is false!

Tip #3: P Values and Confidence Intervals Background What does this mean? If we think the treatment effect is implausible going into the study, even a significant p value may not make the effect likely. This is analogous to interpretation of a positive diagnostic test in a patient for whom the diagnosis is very unlikely the test often does not confirm the diagnosis! Ioannidis JP. Why most published research findings are false. PLoS 2005;2:124.

Tip #3: P Values and Confidence Intervals Confidence interval A measure of the precision of your estimates. A 95% confidence interval for your outcome means (roughly) that 95% of similarly conducted studies would yield an interval that includes the true estimate Often interpreted as the true effect is in this interval, with 95% confidence Not quite right, but close enough for many circumstances

Tip #3: P Values and Confidence Intervals One useful tip is to look at the confidence interval associated with the reported p value If very wide, the results may not be as trustworthy. May be due to small sample size or small number of outcomes In either case, USE CAUTION! May be due to implausible or improbable association If it seems too good to be true

Tip #3: P Values and Confidence Intervals Another useful tip is to consider the clinical impact of each end of the confidence interval If these are similar, the confidence interval is precise enough If these differ markedly, however, the confidence interval lacks precision sufficient for clear clinical decision-making

Tip #3: P Values and Confidence Intervals Consider the precision of each of the following 95% CI s for a relative risk, for which no effect is indicated by RR=1: (11,15) (1.1,212) (0.1,14) (0.97,1.02)

Tip #3: P Values and Confidence Intervals Example: PPA and risk of hemorrhagic stroke In women taking high doses as appetite suppressant (not as a cold medication), OR 16.58, p value 0.02. 95% CI: (1.51, 182.21)! Phenylpropanolamine and the risk of hemorrhagic stroke. Kernan WN et al. NEJM 2000;343:1826-32. Phenylpropanolamine and hemorrhagic stroke in the hemorrhagic stroke project: a reappraisal in the context of science, the Food and Drug Administration, and the law. Stier BG et al. Ann Epidemiol 2006;16:49-52.

Tip #4: Absence of Evidence Is Not Evidence of Absence Another Example: Based on a nonsignificant statistical test it may be declared that there is no association between an exposure and an outcome. Are these confidence intervals for a RR telling us the same thing? (0.99, 1.01) (0.50, 2.75)

Tip #4: Absence of Evidence Is Not Evidence of Absence Example: Pelvic lymphadenectomy vs. none in early-stage endometrial carcinoma RR of death was 1.20 (95% CI 0.70, 2.07), p=0.50. Reported as similar risks C/w 30% risk reduction or doubling of risk, both clinically important Systematic pelvic lymphadenectomy vs no lymphadenectomy in early-stage endometrial carcinoma: randomized clinical trial. Panici et al. J Natl Cancer Inst 2008;100;1707-16.

Tip #5: Multiple Testing and the Isolated Significant P Value Background Assuming the null hypothesis is true, the probability of falsely rejecting it with a p value of 0.05 is 5%, and the probability of correctly rejecting it is 95%. If we run 20 tests, the probability all of them are correct is (0.95) 20 = 0.36. So there is a 64% chance at least one of the conclusions when 20 tests are run is WRONG and we can t tell which one it is!

Tip #5: Multiple Testing and the Isolated Significant P Value The tip is to look skeptically at a table of many results when one is selected out of many for further discussion. Suspect data dredging, or torturing the data until it confesses What makes this more believable? Biologically plausible association Prior evidence Clear a priori primary hypothesis

Tip #5: Multiple Testing and the Isolated Significant P Value Example: Caffeine and the risk of breast cancer In women with benign breast disease consuming at least 4 cups of coffee daily, RR for breast cancer was 1.35 (95% CI 1.01-1.80). 230 statistical tests reported 18 statistically significant results Would expect 12 by chance alone, and very few of the results were strongly significant Caffeine consumption and the risk of breast cancer in a large prospective cohort of women. Ishitani K et al. Arch Intern Med 2008;168:2022-31.

Summary 5 Tips for enlightened skepticism Clinical vs statistical significance Absolute vs relative risk P values and confidence intervals Absence of evidence is not evidence of absence Multiple testing and the isolated significant p value Thank You!