Hypothetical Scenario

Similar documents
5/10/2010. ISPOR Good Research Practices for Comparative Effectiveness Research: Indirect Treatment Comparisons Task Force

An Empirical Assessment of Bivariate Methods for Meta-analysis of Test Accuracy

Discussion Leaders. Workshop Outline. David J. Vanness, PhD. Ágnes Benedict, MA, MS. Background

Modelling heterogeneity variances in multiple treatment comparison meta-analysis Are informative priors the better solution?

The use of continuous data versus binary data in MTC models: A case study in rheumatoid arthritis

Methods Research Report. An Empirical Assessment of Bivariate Methods for Meta-Analysis of Test Accuracy

Comparison of Meta-Analytic Results of Indirect, Direct, and Combined Comparisons of Drugs for Chronic Insomnia in Adults: A Case Study

WinBUGS : part 1. Bruno Boulanger Jonathan Jaeger Astrid Jullion Philippe Lambert. Gabriele, living with rheumatoid arthritis

1 Executive summary. Background

Bayesian Inference Bayes Laplace

BAYESIAN HYPOTHESIS TESTING WITH SPSS AMOS

Ordinal Data Modeling

An Introduction to Using WinBUGS for Cost-Effectiveness Analyses in Health Economics

Fundamental Clinical Trial Design

Models for potentially biased evidence in meta-analysis using empirically based priors

Systematic review with multiple treatment comparison metaanalysis. on interventions for hepatic encephalopathy

Mediation Analysis With Principal Stratification

Bayesian Statistics Estimation of a Single Mean and Variance MCMC Diagnostics and Missing Data

Hierarchical network meta-analysis models to address sparsity of events and differing treatment classifications with regard to adverse outcomes

GUIDELINE COMPARATORS & COMPARISONS:

Introduction to Bayesian Analysis 1

Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination

Effective Health Care Program

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

A Brief Introduction to Bayesian Statistics

Bayesian Mediation Analysis

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill)

A Case Study: Two-sample categorical data

Bayesian Methods p.3/29. data and output management- by David Spiegalhalter. WinBUGS - result from MRC funded project with David

Bayesian and Frequentist Approaches

Meta-analysis of few small studies in small populations and rare diseases

Evidence synthesis with limited studies

Combining Risks from Several Tumors Using Markov Chain Monte Carlo

Using Statistical Principles to Implement FDA Guidance on Cardiovascular Risk Assessment for Diabetes Drugs

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Bayesian Estimation of a Meta-analysis model using Gibbs sampler

Index. Springer International Publishing Switzerland 2017 T.J. Cleophas, A.H. Zwinderman, Modern Meta-Analysis, DOI /

Applications of Bayesian methods in health technology assessment

Comparing treatments evaluated in studies forming disconnected networks of evidence: A review of methods

Method Comparison for Interrater Reliability of an Image Processing Technique in Epilepsy Subjects

Network Meta-Analysis for Comparative Effectiveness Research

What is indirect comparison?

Indirect Comparisons: heroism or heresy. Neil Hawkins With acknowledgements to Sarah DeWilde

Chemoprevention of colorectal cancer in individuals with previous colorectal neoplasia: systematic review and network meta-analysis

Individual Differences in Attention During Category Learning

Draft Methods Report Number XX

Advanced Bayesian Models for the Social Sciences

Systematic Reviews and meta-analyses of Diagnostic Test Accuracy. Mariska Leeflang

Rheumatoid arthritis in adults: diagnosis and management

Methods for meta-analysis of individual participant data from Mendelian randomization studies with binary outcomes

Meta-analysis using individual participant data: one-stage and two-stage approaches, and why they may differ

Bayesian Meta-Analysis in Drug Safety Evaluation

Multilevel IRT for group-level diagnosis. Chanho Park Daniel M. Bolt. University of Wisconsin-Madison

TNF-alpha inhibitors for ankylosing spondylitis(review)

A COMPARISON OF BAYESIAN MCMC AND MARGINAL MAXIMUM LIKELIHOOD METHODS IN ESTIMATING THE ITEM PARAMETERS FOR THE 2PL IRT MODEL

Center for Evidence-based Policy

Meta-analysis of two studies in the presence of heterogeneity with applications in rare diseases

How do we combine two treatment arm trials with multiple arms trials in IPD metaanalysis? An Illustration with College Drinking Interventions

You must answer question 1.

Technology appraisal guidance Published: 9 August 2017 nice.org.uk/guidance/ta466

MS&E 226: Small Data

Bayesian meta-analysis of Papanicolaou smear accuracy

Data Analysis Using Regression and Multilevel/Hierarchical Models

University of Bristol - Explore Bristol Research. Publisher's PDF, also known as Version of record

Technology appraisal guidance Published: 22 February 2012 nice.org.uk/guidance/ta247

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis

Methods for assessing consistency in Mixed Treatment Comparison Meta-analysis

Kelvin Chan Feb 10, 2015

Bayesian Multiparameter Evidence Synthesis to Inform Decision Making: A Case Study in Metastatic Hormone-Refractory Prostate Cancer

A Comparison of Methods of Estimating Subscale Scores for Mixed-Format Tests

Agenda. Frequent Scenario WS I: Treatment Comparisons and Network Meta-Analysis: Learning the Basics

Live WebEx meeting agenda

Session 7: The Sliding Dichotomy 7.1 Background 7.2 Principles 7.3 Hypothetical example 7.4 Implementation 7.5 Example: CRASH Trial

Empirical evidence on sources of bias in randomised controlled trials: methods of and results from the BRANDO study

Computational methods for evidence-based decision support

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

Empirical assessment of univariate and bivariate meta-analyses for comparing the accuracy of diagnostic tests

Meta Analysis. David R Urbach MD MSc Outcomes Research Course December 4, 2014

May 31, Re: Submission of revised manuscript (BMJ R1) Dear Drs. Villanueva and Godlee

Running Head: BAYESIAN MEDIATION WITH MISSING DATA 1. A Bayesian Approach for Estimating Mediation Effects with Missing Data. Craig K.

Heavy menstrual bleeding (update)

A Multilevel Testlet Model for Dual Local Dependence

Public observer slides

Methods for evidence synthesis in the case of very few studies. Guido Skipka IQWiG, Cologne, Germany

An Introduction to Bayesian Statistics

Criteria Inclusion criteria Exclusion criteria. despite treatment with csdmards, NSAIDs, and/or previous anti-tnf therapy and/or

META-ANALYSIS FOR 2 x 2 TABLES: A BAYESIAN APPROACH

The response to the Committee request for additional analyses in section 1.4, MSD follows:

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Simulation evaluation of statistical properties of methods for indirect and mixed treatment comparisons

Annual Rheumatology & Therapeutics Review for Organizations & Societies

ST440/550: Applied Bayesian Statistics. (10) Frequentist Properties of Bayesian Methods

Bayesian Adjustments for Misclassified Data. Lawrence Joseph

Critical appraisal of pharmacoeconomic studies comparing TNF-α antagonists for rheumatoid arthritis treatment

Using Indirect Comparisons to Support a Health Technology Assessment (HTA)

How Valuable are Multiple Treatment Comparison Methods in Evidence-Based Health-Care Evaluation?

Treatment effect estimates adjusted for small-study effects via a limit meta-analysis

Cost-effectiveness of apremilast (Otezla )

Transcription:

Using Indirect and Multiple Treatment Comparisons Methods To Estimate Comparative Effectiveness of Alternative Treatments Workshop September 27, 2010 Beth Devine, PharmD, MBA, PhD Rafael Alfonso C., MD, MSc UW Pharmaceutical Outcomes Research & Policy Program Hypothetical Scenario Your institution is deciding on treatments for Rheumatoid Arthritis and YOU have been selected to review the comparative effectiveness of biologic therapies in patients who have inadequate response to oral DMARDs, especially Methotrexate. Based on the manuscript to be published in Pharmacotherpy by: Devine BE, Alfonso Cristancho R, Sullivan S. Evidence Based Medicine Framework Question PICO Literature search Validity/Strength of Infer rence Time Spent in Critical Appraisa al Appraisal Levels of evidence Integrate evidence 1

Literature Search We performed a literature search to identify the best available evidence of biological therapies for RA in patients with MTX IR Only RCT comparing biologic therapies to placebo or MTX, no head to head comparison across all the available treatments Based on the manuscript to be published in Pharmacotherpy by: Devine BE, Alfonso Cristancho R, Sullivan S. Network of studies available Based on the manuscript to be published in Pharmacotherpy by: Devine BE, Alfonso Cristancho R, Sullivan S. Meta analysis Quantitatively combined results of comparable studies of the same agent to obtain overall estimate of effect. Indirect Treatment Comparison Statistical comparison of two or more agents that have not been directly compared to each other, but that have one comparator in common, thus creating a network Mixed Treatment Comparison Extension of ITC where both direct and indirect evidence is included Validity of evidence synthesis relies on methods that: appreciate within trial randomization and Limit bias due to lack of randomization across trials Jansen. Value in Health 2008;11(5):956-64; Lumley. Stat Med 2002;21:2313-24; Lu & Ades. Stat Med 2004;23:3105-24; Sutton & Ades. Pharmacoeconomics 2008;26(9);753-67 2

ITC/MTC Assumptions Song. BMJ 2009;338:b1174 Statistical Methods Fixed effects models We believe all trials estimate the same treatment effect Mantel Haenszel method Peto method Random effects models We believe there is variability between studies; that there is a true mean effect and the effect from each study is distributed around that mean DerSimonian Laird method Statistical Methods Frequentist Probability is the limit of relative frequency (Pr(A B) Assumes experiment can be repeated under identical conditions Only applies to events that have not yet occurred Once data collected, probability no longer applies! After data collected can only describe uncertainty Applies to repetitititititive phenomena Flipping a coin Hollenbeak.ISPOR. May 15, 2010 Thompson. The Stata Journal 2006;6(4):530-49 Bayesian Based on subjective probability Probability is my degree of belief in a statement Probability y describes uncertainty Uncertainty has a probability distribution Obtain a distribution of your model parameters Parameters are random and data are fixed Calculate 95% credible intervals Probability the event lies within the interval P(B A) = [P(A B)p(B)] / P(A) 3

Statistical Methods Monte Carlo simulation estimates random sequence of chains, where next chain relies only on its immediate predecessor Markov chain Markov chain Monte Carlo simulation (MCMC) set up a Markov chain whose distribution is the posterior distribution Chain must run to convergence before estimating posterior probabilities burn ins A special type of algorithm cycles through each model parameter one at a time is called Gibbs sampling WinBUGS = Bayesian inference Using Gibbs Sampling, for Windows http://www.mrc-bsu.cam.ac.uk/bugs/ or Lunn, Thomas, Best, Speigelhalter. Stat Comput 2000;10:325-37 Thompson. The Stata Journal 2006;6(4):530-49 Methods: Model Development Two random effects logistic regression models (6 and 12 months), using the ACR 50 as the primary outcome. Randomization was preserved MTX and/or placebo, as common comparators to biologics. Mean disease duration and Mean baseline HAQ DI score were included as meta regression covariates, to account for heterogeneity between trials These covariates have prognostic value in determining the effect of RA treatment Based on the manuscript to be published in Pharmacotherpy by: Devine BE, Alfonso Cristancho R, Sullivan S. Methods: Model Compilation and Running the Models Our statistical model is represented by the following equations: For the control arms: logit (p i 0) = α i + β i m i 0 For the treatment arms: logit (pij) = αi + βimij + θi + γ1x1i + γ2x2i and j=1, Ji We ran three chains and 20,000 iterations after a burn in of 10,000 iterations. We assessed the accuracy of the posterior assessments by calculating the Monte Carlo error for each parameter. For nomenclature see notes below Based on the manuscript to be published in Pharmacotherpy by: Devine BE, Alfonso Cristancho R, Sullivan S. 4

Results: 6 months In the 6 month model, when compared to placebo, all biologic DMARDS and MTX were significantly more efficacious and ranked in the following order: Certolizumab (median log odds ratio=2.6), tocilizumab (1.7), rituximab (1.6), infliximab (1.6), etanercept (1.4), adulimumab (1.4), golimumab (1.4), abatacept (1.2), anakinra (1.0), MTX (0.8). There were 45 pairwise comparisons. Certolizumab was significantly more efficacious than MTX; none others were significant. Based on the manuscript to be published in Pharmacotherpy by: Devine BE, Alfonso Cristancho R, Sullivan S. 6-month Model - Pairwise comparisons between biologic DMARDS and MTX (99.9% CrI) 6 month model - Odds ratios by treatment and disease duration compared to comparator (95% CrI) 5

Results: 12 months The rank order in the 12 month analysis was: Certolizumab (2.0), rituximab (2.0), adalimumab (1.4), infliximab (1.4), etanercept (0.9), abatacept (0.6), and MTX (0.8). There were 21 pairwisecomparisons; nonewere significant. The results of the model parameterized by therapeutic class revealed that each class was more efficacious than placebo. In pairwise comparisons, each class was more efficacious than MTX, but none more efficacious than another. Based on the manuscript to be published in Pharmacotherpy by: Devine BE, Alfonso Cristancho R, Sullivan S. Limitations We assumed treatment effects are exchangeable between studies for both biologic and MTX treatment We assumed treatment effect similar between study arms withineachstudy didnotdifferentiate differentiate amongdoses We evaluated only one outcome ACR50(not safety) Meta regression controlled for some, but not all, between study heterogeneity Our results are dependent on the studies included We did not address publication bias Based on the manuscript to be published in Pharmacotherpy by: Devine BE, Alfonso Cristancho R, Sullivan S. Conclusions Using emerging ITC methods enabled us to compare the efficacy of biologic DMARDS for treatment of RA in the absence of head to head trials. Our methods enabled us to rank order these treatments. Further analysis by drug and by therapeutic class suggests that biologic DMARDS are similarly efficacious. Based on the manuscript to be published in Pharmacotherpy by: Devine BE, Alfonso Cristancho R, Sullivan S. 6

Now Let s Practice using WinBUGS! Advantage of Bayesian analysis in ITC/ MTC is that it allows calculation of the probability of which treatment is best http://www.mrc-bsu.cam.ac.uk/bugs/ or Lunn, Thomas, Best, Speigelhalter. Stat Comput 2000;10:325-37 Outcome Measures How is your outcome of interest measured? Binary (e.g. dead or alive) Continuous (e.g blood pressure) Categorical/ordinal (e.g. severity scale) Bi Binary outcomes most common We will consider here Continuous Similar approach to binary Ordinal More complex and more rare 7

Binary Outcome Measures Binary outcome data from a comparative study can be expressed in a 2 x 2 table RCT Failure/Dead New Treatment A B Control C d Success/Alive Three common outcome measures: Odds ratios, risk ratios, risk differences Binary Outcome Measures Odds ratios OR = ad/bc Var ln(or) = 1/a + 1/b + 1/c + 1/d Work on log (ln) scale due to better statistical properties Relative Risk RR (a/(a+b))/ (c/(c+d)) Var ln(or) = 1/a 1/(a+b) + 1/c 1/(c+d) Work on log (ln) scale due to better statistical properties Binary Outcome Measures Risk Difference RD = (a/(a+b)) (c/(c+d)) Var = [p 1 / (1 p 1 ]/n 1 + [p 2 / (1 p 2 ]/n 2 ] Where: p 1 = a/(a+b) p 2 = c/(c+d) n 1 = a+b and n 2 = c+d 8

Fixed Effects Model Statistical homogeneity Formally assume: Y i = Normal(d,V i ) Random error We estimate the common true effect, d Point estimate True effect Generic Fixed Effect Y i ~ Normal(d,V i ) where i= 1.N studies Y i is the observed effect in study i with Variance V i All studies assumed to be measuring the same underlying effect size, d For a Bayesian analysis, a prior distribution must be specified for d Choice of Prior for d Often, amount of information in studies is large enough to render any prior of little importance therefore choice not critical Often specified as vague or flat Often specified as vague or flat E.g. If meta analysis is on ln(or) scale, could specify d~normal (0, 10 5 ) This states a priori we would be 95% certain that true value of d is between [0±1.96( 10 5 )] On OR scale: 10 269 to 10 269 very flat! 9

Fixed Effect with Prior Y i ~ Normal(d,V i ) where i= 1.N studies d ~ Normal(0, 10 5 ) Models are specified in WinBUGS using formulas similar to this algebra Note: Normal distributions are specified by mean and precision where precision = 1/variance Estimate model paramter using MCMC, rather than inverse weighting of variance Example: Meta analysis, RCTs of effect of aspirin preventing death after acute MIs Study Aspirin Group Placebo Group Deaths Total Deaths Total MRC 1 49 615 67 624 CDP 44 758 64 771 MRC 2 102 832 126 850 GASP 32 317 38 309 PARIS 85 810 52 406 AMIS 246 2267 219 2257 ISIS 2 1570 8587 1720 8600 Fleiss 1993 Example: Calculation: Log(OR) & Variance For MCR 1 Survive Die Aspirin 566 49 Placebo 557 67 OR=(566*67)/ 67)/ (557*49) = 1.389 Log(ln)OR = 0.3289 Variance lnor = 1/566 + 1/49 + 1/557 + 1/67 = 0.0389 Note this is OR for Survival If 2x2 table contains any zeros, common to add 0.5 to those cells before calculations 10

Example:Aspirin Data to be Combined Study OR LnOR (Y i ) Var(lnOR) (V i ) Weight (1/V i ) MRC 1 1.39 0.33 0.04 25.71 CDP 1.47 0.39 0.04 24.29 MRC 2 1.25 0.22 0.02 48.77 GASP 1.25 0.22 0.06 15.44 PARIS 1.25 0.23 0.02 28.41 AMIS 0.88 0.12 0.01 103.92 ISIS 2 1.12 0.11 0.002 664.26 Note: ISIS-2 with small variance and large weight (1/0.002) Launch WinBUGS Click on WinBUGS14.exe Click File Open Load aspirin FE.odc Components of WinBUGS.odc file Model { < Likelihood> <Prior distributions> } #Data <List or column format> #Starting Values <List or mixture of list and column format> 11

Steps for Running a Model in WinBUGS 1. Make model active. Doodles: If in own window, click title bar. If in compound document, double click the doodle (should have hairy border). Text: Simply highlight the word model at the beginning of your model. 2. Bring up Model Specification Tool (menu: Model > Specification) 3. Click check model Should see model is syntactically correct in lower left corner of window. 4. Highlight first row of data containing variable labels (if in rectangular format) 5. Click load data Should see data is loaded in lower left corner of window. 6. If using multiple chains, enter number in num of chains box. Otherwise, proceed. 7. Click compile Should see model is compiled in lower left corner of window. 8. Highlight line containing initial values: list( ) 9. Click load inits If using multiple chains, you will need to repeat steps 8 9 for each chain. Should see model is initialized. 10. Bring up Sample Monitor Tool (menu: Inference > Samples) Enter name of each node you wish to monitor and click set 11. Bring up Update Tool (menu: Model > Update) 12. Enter a number of samples to take and click update. Should see model is updating. Load and Check Model 12

Load and Check Data Compile Compile Model Model Load Initials 13

Pooled OR: median 1.12 (1.05 to 1.19) 14

Random Effects Model Model Within studies Y i ~Normal( i,vi) Across studies ~Normal(d 2 I Normal(d, ) d=solid line =dotted lines 2 = variability between studies (heterogeneity) Y 5 Trial-specific effects=dotted lines True Mean Effect = solid line 5 Generic Random Effect Y i ~ Normal(,V i ) where i= 1.N studies i ~ Normal(d, 2 ) As for fixed effect, Y i is observed effect in study i with variance V i Now study specific effects, I are allowed to be different from each other and are assumed to be sampled from a Normal distribution with mean d and variance 2 For a Bayesian analysis, a prior distribution is required for 2 as well as for d Choice of Prior for 2 This is a little trickier than for d Variances cannot be negative so Normal distribution is not a good choice Examples in WinBUGS Manual use Uniform distribution. E.g. ~ Uniform (0,10) of 10 is massive, because we are working with ORs; even of 1 or 2 is large Specification of vague priors on variance components is complex and is an active area of research 15

Generic Random Effects Model Load aspirin RE.odc 16

Results of Aspirin RE model Pooled OR: median 1.149 (0.976 1.434) OR now contains 1 Heterogeneity (log scale)=0.02 (95% CrI: 0 0.4) Compare with FE model: Pooled OR: median 1.12 (1.05 to 1.19) Bayesian CrI wider than classical CI 2 is random variable and uncertainty is included in pooled result Random Effects Binomial Model Load aspirin RE bin.odc If ORs are being pooled it is possible to specify a model that does not require assumption of normality of study effect sizes Useful when events are rare Avoids need to add continuity correction factors when cells with 0 events exist Considered more exact Based on logistic regression Random Effects Binomial Model r Ai ~Bin(p Ai, n Ai ) r Bi ~Bin(p Bi, n Bi ) logit(p Ai )=µ i logit(p Bi )=µ i + i i ~N(d, 2 ) where i=1...n studies n and n = total number of patientsin ith study n Ai and n Bi = total number of patients in ith study r Ai and r Bi = total number of events in each group p Ai and p Bi = underlying probabilities of event in each group µ i = estimated log odds of event in group A (requires prior) I = log (OR) between groups d=true mean effect; =heterogeneity between studies Logit(p)=log((p)/(1 p)) unconstrained scale used logistic regression 17

Compare our Three Odds Ratios and CrIs Fixed effects Normal Distribution OR=1.12 (95% CrI: 1.05, 1.19) Random Effects Normal Distribution OR=1.15 (95% CrI: 0.97, 1.44) Random Effects Binary Distribution OR=1.14 (95% CrI: 0.98, 1.44) MCMC Basics Now that we ve run a few models consider sensitivity analyses Sensitivity to prior distributions esp. important for distributions of variance/precision parameters Sensitivityto initialvalues Sensitivity to initial values Multiple chains using very different starting values & comparing using Brooks Gelman Rubin Statistics Length of burn in & ample: examine history/trace plots Auto correlation high auto correlation suggests samples are not independent & poor/slow mixing of chains: Run model longer or reparameterize (center variables) Convergence diagnostics & CODA & BOA 18

Interpreting Random Effects A single parameter cannot adequately summarize heterogeneous effects Therefore estimation and reporting of 2 is important This tells us how much variability there is between estimates from the population of studies In some instances studies contain both beneficial and harmful effects very important! Looking to the Future (The Future is Here!) Data Sources Evidence Synthesis RCT1 RCT2 Obs 1 Routine Care Meta- Analysis General Synthesis Bayes Theorem Combination Model Clinical Adverse Inputs (w/ Effects Effects undertainty) Decision Model Tx A Fib Warfarin NO Warfarin Stroke No Strk Stroke No Strk Bleed NoBld Bleed NoBld Bleed NoBld Bleed NoBld Utility Utility Utility Utility Utility Utility Utility Utility Cost Cost Cost Cost Cost Cost Cost Costs Questions? 19