Generalizing the right question, which is?

Similar documents
Beyond the intention-to treat effect: Per-protocol effects in randomized trials

Beyond Controlling for Confounding: Design Strategies to Avoid Selection Bias and Improve Efficiency in Observational Studies

COMMENTARY: DATA ANALYSIS METHODS AND THE RELIABILITY OF ANALYTIC EPIDEMIOLOGIC RESEARCH. Ross L. Prentice. Fred Hutchinson Cancer Research Center

State of the art pharmacoepidemiological study designs for post-approval risk assessment

Per-Protocol Analyses of Pragmatic Trials

Right Answers, Wrong Ques2ons. Ralph I Horwitz

Brief introduction to instrumental variables. IV Workshop, Bristol, Miguel A. Hernán Department of Epidemiology Harvard School of Public Health

Comparison And Application Of Methods To Address Confounding By Indication In Non- Randomized Clinical Studies

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis

Long-Term Use of Long-Acting Insulin Analogs and Breast Cancer Incidence in Women with Type 2 Diabetes.

EBM: Therapy. Thunyarat Anothaisintawee, M.D., Ph.D. Department of Family Medicine, Ramathibodi Hospital, Mahidol University

Comparative Effectiveness Research Using Observational Data: Active Comparators to Emulate Target Trials with Inactive Comparators

The ROBINS-I tool is reproduced from riskofbias.info with the permission of the authors. The tool should not be modified for use.

Estimands in klinischen Studien: was ist das und geht mich das was an?

Methods to Estimate the Comparative Effectiveness of Clinical Strategies that Administer the Same Intervention at Different Times

However, if instead, CHD risk is plotted on a doubling scale (as in slide 2) then there is a

OBSERVATIONAL STUDIES IN A LEARNING HEALTH SYSTEM

DRAFT (Final) Concept Paper On choosing appropriate estimands and defining sensitivity analyses in confirmatory clinical trials

Accounting for treatment switching/discontinuation in comparative effectiveness studies

Comparisons of Dynamic Treatment Regimes using Observational Data

STUDY DESIGN (continued) Thomas Grieve, DC Department of Research

Treatment changes in cancer clinical trials: design and analysis

Kathryn M. Rexrode, MD, MPH. Assistant Professor. Division of Preventive Medicine Brigham and Women s s Hospital Harvard Medical School

Research in Real-World Settings: PCORI s Model for Comparative Clinical Effectiveness Research

Experimental Design. Terminology. Chusak Okascharoen, MD, PhD September 19 th, Experimental study Clinical trial Randomized controlled trial

Is There An Association?

Marian L Neuhouser, PhD, RD

Observational Study Designs. Review. Today. Measures of disease occurrence. Cohort Studies

CAN EFFECTIVENESS BE MEASURED OUTSIDE A CLINICAL TRIAL?

Adjusting the Crossover Effect in Overall Survival Analysis Using a Rank Preserving Structural Failure Time Model: The Case of Sunitinib GIST Trial

Current Use of Unopposed Estrogen and Estrogen Plus Progestin and the Risk of Acute Myocardial Infarction Among Women With Diabetes

FIRST RENAL REPLACEMENT

The CARI Guidelines Caring for Australians with Renal Impairment. Cardiovascular Risk Factors

Study Designs in Epidemiology

Postmenopausal hormone therapy - cardiac disease risks and benefits

Chapter 2: Identification and Care of Patients With Chronic Kidney Disease

The RoB 2.0 tool (individually randomized, cross-over trials)

Evidence Based Practice

Real-world data in pragmatic trials

Osler Journal Club Outcomes Research

Can you guarantee that the results from your observational study are unaffected by unmeasured confounding? H.Hosseini

Perspectives on Large Simple Trials

INTERPRETATION OF STUDY FINDINGS: PART I. Julie E. Buring, ScD Harvard School of Public Health Boston, MA

Supplement 2. Use of Directed Acyclic Graphs (DAGs)

Evidence Table. Study Type: Randomized controlled trial. Study Aim: To compare frequent nocturnal hemodialysis and conventional in-center dialysis.

Okayama University, Japan

Critical Appraisal Series

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University

Estimands, Missing Data and Sensitivity Analysis: some overview remarks. Roderick Little

Strategies for handling missing data in randomised trials

Systematic Reviews and Meta- Analysis in Kidney Transplantation

Updates in Therapeutics 2015: The Pharmacotherapy Preparatory Review & Recertification Course

Propensity Score Methods for Analyzing Observational Data Like Randomized Experiments: Challenges and Solutions for Rare Outcomes and Exposures

WEIGHING UP THE RISKS OF HRT. Department of Endocrinology Chris Hani Baragwanath Academic Hospital

AVOIDING BIAS AND RANDOM ERROR IN DATA ANALYSIS

Epidemiologic study designs

exposure/intervention

Breast cancer risk with postmenopausal hormonal treatment

Modeling the annual costs of postmenopausal prevention therapy: raloxifene, alendronate, or estrogen-progestin therapy Mullins C D, Ohsfeldt R L

The New England Journal of Medicine

Antihypertensive Trial Design ALLHAT

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University

Evidence Based Medicine

Examining Facility Level Data

Pros and Cons of Clinical Trials vs. Observational Studies. Beth Devine PCORP Summer Institute July 14, 2015

Non-inferiority trials and switch from non-inferiority to superiority. D Costagliola U 943 INSERM and UPMC Paris 06

WHI, HERS y otros estudios: Su significado en la clinica diária. Manuel Neves-e-Castro

rosuvastatin, 5mg, 10mg, 20mg, film-coated tablets (Crestor ) SMC No. (725/11) AstraZeneca UK Ltd.

Misperceptions still exist that cardiovascular disease is not a real problem for women.

Rapid appraisal of the literature: Identifying study biases

A Bayesian Perspective on Unmeasured Confounding in Large Administrative Databases

Clinical research in AKI Timing of initiation of dialysis in AKI

Statistical Methods in Medical Research

Sensitivity Analysis in Observational Research: Introducing the E-value

A (Constructive/Provocative) Critique of the ICH E9 Addendum

GRADE. Grading of Recommendations Assessment, Development and Evaluation. British Association of Dermatologists April 2018

Chapter 2: Identification and Care of Patients With CKD

Evidence Based Medicine

Everolimus for the Treatment of Advanced Renal Cell Carcinoma (arcc ) - Additional Analysis Using RPSFT

Analysis methods for improved external validity

CAUSAL INFERENCE IN HIV/AIDS RESEARCH: GENERALIZABILITY AND APPLICATIONS

Clinical Evidence: Asking the Question and Understanding the Answer. Keeping Up to Date. Keeping Up to Date

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University

How to improve the reliability of Single Arm Trials

USRDS UNITED STATES RENAL DATA SYSTEM

HT: Where do we stand after WHI?

Hormone Replacement Therapy (HRT) Benefits & Risks - The Facts

Placebo-Controlled Statin Trials EXPLAINING THE DECREASE IN DEATHS FROM CHD! PREVENTION OF CARDIOVASCULAR DISEASE IN WOMEN EXPLAINING THE DECREASE IN

Complier Average Causal Effect (CACE)

Main objective of Epidemiology. Statistical Inference. Statistical Inference: Example. Statistical Inference: Example

ICH E9(R1) Technical Document. Estimands and Sensitivity Analysis in Clinical Trials STEP I TECHNICAL DOCUMENT TABLE OF CONTENTS

Inverse Probability of Censoring Weighting for Selective Crossover in Oncology Clinical Trials.

Choosing the Comparison Group in CER

Clarifying questions about risk factors : predictors versus explanation

Appendix 2 Quality assessment tools. Cochrane risk of bias tool for RCTs. Support for judgment

1. Draft checklist for judging on quality of animal studies (Van der Worp et al., 2010)

Real World Evidence in the Treatment of Ovarian Cancer. Elizabeth Eisenhauer MD FRCPC Queen s University

研究設計理論 ( 一 ): RANDOMIZED TRIALS. 醫學研究初階課程 General Medicine, CGMH 劉素旬醫師 03/12/2013

EU regulatory concerns about missing data: Issues of interpretation and considerations for addressing the problem. Prof. Dr.

Transcription:

Generalizing RCT results to broader populations IOM Workshop Washington, DC, April 25, 2013 Generalizing the right question, which is? Miguel A. Hernán Harvard School of Public Health

Observational studies for CER Criticized because of potential for confounding and other biases Praised because of faster results lower cost fewer ethical problems better transportability (generalizability) of effect estimates to clinical practice than RCTs 5/7/2013 Observational - Randomized 2

Reasons for better transportability of observational estimates Patients similar to those in clinical settings Patients in RCTs are highly selected Long follow-up allows to study clinical outcomes Many RCTs rely on surrogate outcomes Realistic implementation of treatments/interventions Often RCTs include intensive monitoring and additional measures to increase adherence 5/7/2013 Observational - Randomized 3

What if RCTs and observational studies had Same patients Same follow-up Same interventions Would they be answering the same question then? Only if data analysis were the same 5/7/2013 Observational - Randomized 4

Consider RCT and observational study with similar design except that baseline treatment assignment is not randomly assigned in the observational study Example: an open-label randomized trial and an observational follow-up study to estimate the effect of postmenopausal hormone therapy on the risk of coronary heart disease The RCT is just a follow-up study with baseline randomization 5/7/2013 Observational - Randomized 5

Follow-up studies with and without baseline randomization Perhaps a more useful classification than RCTs and observational studies? every RCT is an OS on day 2 Incidentally, I also wanted a T-shirt Think large simple trials and so-called pragmatic trials Benefits of baseline randomization overshadowed by large noncompliance, loss to follow-up? Typically not data collected to adjust for these biases 5/7/2013 Observational - Randomized 6

Follow-up studies with and without baseline randomization are often analyzed differently Example: intention to treat analysis in randomized trial; as treated analysis in observational studies Why? The analytic approach depends on the causal question (which is the same) and study design (which is almost the same) 5/7/2013 Observational - Randomized 7

Is it because of confounding adjustment? Observational studies need adjustment for baseline confounders RCTs do not, at least when they are large But, other than adjustment for baseline confounding, analysis should be identical both designs need adjustment for timevarying confounding and selection bias because decisions after baseline are not randomly assigned under either design 5/7/2013 Observational - Randomized 8

The analytic approach is determined by the question We often compare the estimates of randomized and observational studies which implies we believe they answer the same causal question Example: the effects of statins and of postmenopausal hormone therapy on the risk of coronary heart disease What causal question is that? Let us review a few possible questions 5/7/2013 Observational - Randomized 9

Causal questions in follow-up studies 1. The effect of being assigned to treatment, regardless of treatment received Intention to treat (ITT) effect Interventions to be compared: be assigned to treatment A at baseline and remain in the study until it ends be assigned to treatment B at baseline and remain in the study until it ends Requires adjustment for time-varying selection bias due to loss to follow-up 5/7/2013 Observational - Randomized 10

Causal questions in follow-up studies 2. The effect of receiving the treatment regimes specified in the study protocol Per protocol effect Example of interventions to be compared: receive treatment A continuously between baseline and study end (unless toxicity arises) receive treatment B continuously between baseline and study end (unless toxicity arises) Requires adjustment for time-varying confounding and selection bias 5/7/2013 Observational - Randomized 11

Causal questions in follow-up studies 3. The effect of receiving treatment regimes other than the ones specified in the study protocol Related to as treated effects Example of interventions to be compared: receive A as per protocol receive B but switch from B to A if LDLcholesterol raises above 160 mg/dl (4.1 mmol/l) Requires adjustment for time-varying confounding and selection bias 5/7/2013 Observational - Randomized 12

ITT, per protocol effects vs. ITT, per protocol analyses Typical ITT, per protocol, and as treated analyses of randomized trials Usually do not adjust for time-varying (post-baseline) variables Possibly biased estimates of ITT, per protocol, and as treated effects (Robins et al, since 1986) Warning: Instrumental variable analyses require separate discussion 5/7/2013 Observational - Randomized 13

Efficacy and effectiveness These terms are probably useful in simple settings with short-term interventions, but ambiguous in complex settings with sustained interventions over long periods An explicit definition of the interventions that define the causal effect of interest is more informative. ITT effect does not necessarily measure effectiveness in the real world Per protocol effect may measure effectiveness but does not generally measure efficacy 5/7/2013 Observational - Randomized 14

Observational studies analyzed like randomized trials We need to define the observational analogs of intention to treat per protocol or as treated effects to know what question to generalize Hint: New user designs are a first step towards estimating ITT effects in observational studies 5/7/2013 Observational - Randomized 15

and vice versa Because the analysis of many RCTs require adjustment for time-varying variables to estimate intention to treat per protocol as treated causal effects Hint: Adjustment for time-varying variables may require techniques like inverse probability weighting or the g-formula Robins 1986 5/7/2013 Observational - Randomized 16

Conclusions 1. Question of interest must be clearly specified ITT, per protocol, other? In both RCTs and observational studies 2. Analysis of observational studies and RCTs must be the same Except adjustment for baseline confounding 3. The terms effectiveness and efficacy are too vague to define causal questions 5/7/2013 Observational - Randomized 17

Supplementary materials 5/7/2013 Observational - Randomized 18

4 examples of follow-up studies with and without randomization 1. Classic cohort study Nurses Health Study Postmenopausal hormone therapy and coronary heart disease (CHD) 2. Randomized trial Women s Health Initiative Postmenopausal hormone therapy and breast cancer 3. Electronic medical records THIN Statins and coronary heart disease (CHD) 4. Claims database USRDS Medicare Epoetin and mortality 5/7/2013 Observational - Randomized 19

Example #1 Observational study analyzed like a randomized experiment Question: What is the effect of postmenopausal hormone therapy on risk of coronary heart disease in postmenopausal women? Data: Nurses Health Study Hernán et al. Epidemiology 2008 5/7/2013 Observational - Randomized 20

Answers Observational studies >30% lower risk in current users compared with never users e.g., HR 0.68 in Nurses Health Study (Grodstein et al. J Women s Health 2006) Randomized trial >20% higher risk in initiators compared with noninitiators HR 1.24 in Women s Health Initiative (Manson et al. NEJM 2003) 5/7/2013 Observational - Randomized 21

WHI: ITT effect estimates Hazard ratio (95% CI) of CHD Overall 1.23 (0.99, 1.53) Years of follow-up 0-2 1.51 (1.06, 2.14) >2-5 1.31 (0.93, 1.83) >5 0.67 (0.41, 1.09) Years since menopause <10 0.89 (0.54, 1.44) 10-20 1.24 (0.86, 1.80) >20 1.65 (1.14, 2.40) 5/7/2013 Observational - Randomized 22

Randomized trial estimated the ITT effect What is the CHD risk in women assigned to initiation of hormone therapy compared with women assigned to no initiation of hormone therapy? Design and analysis: Women randomly assigned to initiation of hormone therapy or placebo Analytic approach: Compare risk between initiators (incident users) and nonusers of hormone therapy 5/7/2013 Observational - Randomized 23

Observational studies did not estimate ITT effect What is the CHD risk in women who are currently taking hormone therapy compared with women who are not? Design and analysis: Women are asked about therapy use Analytic approach: Compare risk between prevalent users and nonusers of hormone therapy (current vs. never) 5/7/2013 Observational - Randomized 24

Current vs. never contrast does not address any relevant question Consider a woman wondering whether to start hormone therapy The current vs. never contrast does not provide the information she needs Consider a woman wondering whether to stop hormone therapy The current vs. never contrast does not provide the information she needs Nothing interesting to generalize! 5/7/2013 Observational - Randomized 25

What if observational data used to estimate analog of ITT effect? Use observational data to answer same question as randomized trial Comparing the risk in incident users vs. nonusers Re-analyze observational studies to estimate the observational analog of the ITT effect 5/7/2013 Observational - Randomized 26

ITT effect estimates WHI NHS Overall 1.23 (0.99, 1.53) 1.05 (0.82, 1.34) Years of follow-up 0-2 1.51 (1.06, 2.14) 1.43 (0.92, 2.23) >2 1.07 (0.81, 1.41) 0.91 (0.72, 1.16) Years since menopause <10 0.89 (0.54, 1.44) 0.88 (0.63, 1.21) 10-20 1.24 (0.86, 1.80) 1.13 (0.85, 1.49) >20 1.65 (1.14, 2.40) -- 5/7/2013 Observational - Randomized 27

When same question is asked No shocking observationalrandomized discrepancies for ITT estimates though wide CIs in both studies Any residual confounding? Probably, but insufficient to explain the original discrepancy 5/7/2013 Observational - Randomized 28

But ITT analyses are problematic Hernán, Hernández-Díaz. Clinical Trials 2012 ITT effect affected by adherence Imperfect adherence in both randomized and observational studies ITT inappropriate for safety outcomes We also estimated per protocol effect Inverse probability (IP) weighted analyses to adjust for noncompliance Again no randomized-observational discrepancies Toh et al. Ann Intern Med 2010 5/7/2013 Observational - Randomized 29

EXAMPLE #2 Randomized experiment analyzed like an observational study Question: What is the effect of postmenopausal hormone therapy on risk of breast cancer in postmenopausal women? Data: Women s Health Initiative randomized clinical trial Toh et al. Epidemiology 2010; 21:528-539 5/7/2013 Observational - Randomized 30

Methodologic approach to estimate per protocol effect Estimate IP weights to adjust for timevarying confounding Need data on post-randomization variables Estimate IP weighted hazards model to estimate Hazard ratios Survival (or cumulative incidence) Compare survival curves for continuous treatment vs. no treatment Standardize curves to baseline variables 5/7/2013 Observational - Randomized 31

Hazard ratio of breast cancer Hormone therapy vs. placebo Intention to treat analysis 1.25 (1.01, 1.54) Per protocol analysis (IP weighted) 1.68 (1.24 to 2.28) Suppose you are a woman considering initiation of hormone therapy and who plans to take it as instructed by your doctor Which hazard ratio do you want? 5/7/2013 Observational - Randomized 32

EXAMPLE #3 Electronic medical records Question: What is the effect of statin therapy on CHD risk? Data: UK THIN (electronic medical records) ~75,000 eligible patients Used to emulate a sequence of observational trials of statin initiation Generalization of new-user design Danaei et al. Statistical Methods in Medical Research 2013 5/7/2013 Observational - Randomized 33

CONSORT flowchart of emulated trials 32.9 million potential person-trials (617,432 patients) 844,800 eligible person-trials (74,806 patients) - 3.2 million prior statin use or <2 years on database - 12.5 million major chronic diseases - 16.4 million no recent data on confounders 13,599 initiators 831,201 non-initiators - 245 died - 62 lost to follow-up - 16,094 died - 2,894 lost to follow-up - 117 cases - 13,175 alive and event free at the end of follow-up - 6,218 cases - 805,995 alive and event-free at the end of follow-up 5/7/2013 Observational - Randomized 35

Hazard ratio (95% CI) of CHD THIN trials 2000-2006 Intention-totreat analysis Per-protocol analysis Unique cases 635 488 Unique persons 74,806 74,806 Cases 6,335 4,849 Person- trials 844,800 844,800 Age-sex adjusted 1.29 (1.06, 1.56) 1.54 (1.09, 2.18) Adjusted for covariates 0.89 (0.73, 1.09) 0.84 (0.54, 1.30) Adjusted for covariates (excluding first year of follow-up) 0.71 (0.53, 0.94) 0.53 (0.27, 1.02) 5/7/2013 Observational - Randomized 36

What if we had compared prevalent (not incident) users vs. nonusers? Current users HR: 1.42 (1.16, 1.73) Persistent (1 yr) current users HR: 1.05 Persistent (2 yrs) current users HR: 0.77 (0.51, 1.18) We can get any result we want by changing the definition of current user! Confounding-selection bias tradeoff Nothing interesting to generalize here! See also Danaei et al. Am J Epidemiol 2012 5/7/2013 Observational - Randomized 37

EXAMPLE #4 Health claims database Question: What is the effect of different doses of epoetin therapy on the mortality risk of patients undergoing hemodialysis? Data: US Renal Data System (Medicare claims database) ~18,000 eligible elderly patients Zhang et al. CJASN 2009; 21:638-644 5/7/2013 Observational - Randomized 38

ITT effect not interesting here Because epoetin dose and use usually changes every month If we assigned individuals to whatever dose they are receiving at baseline Adherence too low Effect greatly attenuated Therefore per protocol analysis only 5/7/2013 Observational - Randomized 39

Survival under 3 epoetin dosing regimes Zhang et al. CJASN 2009; 21:638-644 5/7/2013 Observational - Randomized 40

References RCTs Robins JM. Correction for non-compliance in equivalence trials. Statistics in Medicine 1998; 17:269-302. Mark S, Robins JM. A method for the analysis of randomized trials with compliance information: An application to the multiple risk factor trial. Controlled Clinical Trials 1993; 14: 79-97. Hernán MA, Hernández-Díaz S. Beyond the intention to treat in comparative effectiveness research. Clinical Trials 2012; 9(1):48-55 Toh S, Hernán MA. Causal inference from longitudinal studies with baseline randomization. International Journal of Biostatistics 2008; 4(1): Article 22. Toh S, Hernández-Díaz S, Logan R, Robins JM, Hernán MA. Estimating absolute risks in the presence of nonadherence: an application to a follow-up study with baseline randomization. Epidemiology 2010; 21(4):528-39 Hernán MA. Comments on Lauer s How the debate about comparative effectiveness research should impact the future of clinical trials. Statistics in Medicine 2012; 31(25):3060-3061. 5/7/2013 Observational - Randomized 41

References Observational studies - Methods Robins JM (1997). Causal Inference from Complex Longitudinal Data. Latent Variable Modeling and Applications to Causality. Lecture Notes in Statistics (120), Berkane M, Ed. NY: Springer Verlag, pp. 69-117. Robins JM, Hernán MA. Estimation of the causal effects of time-varying exposures. In: Longitudinal Data Analysis. Fitzmaurice G, Davidian M, Verbeke G, Molenberghs G, eds. New York: Chapman and Hall/CRC Press, 2008, pp. 553-599. Hernán MA, Robins JM. Observational studies analyzed like randomized experiments: Best of both worlds. Epidemiology 2008; 19(6):789-792 Hernán MA. With great data comes great responsibility. Publishing comparative effectiveness research in EPIDEMIOLOGY. Epidemiology 2011; 22(3):290-291 Institute of Medicine. Ethical and Scientific Issues in Studying the Safety of Approved Drugs. Washington, DC: The National Academies Press, 2012 5/7/2013 Observational - Randomized 42

References Observational studies - Applications Hernán MA, Alonso A, Logan R, Grodstein F, Michels KB, Willett WC, Manson JE, Robins JM. Observational studies analyzed like randomized experiments: an application to postmenopausal hormone therapy and coronary heart disease. Epidemiology 2008; 19(6):766-779 Danaei G, García-Rodríguez L, Cantero OF, Logan RW, Hernán MA. Observational data for comparative effectiveness research: an emulation of a randomized trial of statins for primary prevention of coronary heart disease. Statistical Methods in Medical Research 2013; 22(1): 70-96 Danaei G, Tavakkoli M, Hernán MA. Bias in observational studies of prevalent users: Lessons for comparative effectiveness from a metaanalysis of statins. American Journal of Epidemiology 2012; 175(4): 250-262 Zhang Y, Thamer M, Cotter D, Kaufman J, Hernán MA. Estimated effect of epoetin dosage on survival among elderly hemodialysis patients in the United States. Clinical Journal of the American Society of Nephrology 2009; 4(3):638-644 5/7/2013 Observational - Randomized 43