Methods for adjusting survival estimates in the presence of treatment crossover a simulation study

Similar documents
Assessing methods for dealing with treatment switching in clinical trials: A follow-up simulation study

NICE DSU TECHNICAL SUPPORT DOCUMENT 16: ADJUSTING SURVIVAL TIME ESTIMATES IN THE PRESENCE OF TREATMENT SWITCHING

The (mis)use of treatment switching adjustment methods in health technology assessment busting some myths!

Adjusting overall survival for treatment switch

The Logic of Causal Inference. Uwe Siebert, MD, MPH, MSc, ScD

Everolimus for the Treatment of Advanced Renal Cell Carcinoma (arcc ) - Additional Analysis Using RPSFT

Inverse Probability of Censoring Weighting for Selective Crossover in Oncology Clinical Trials.

Adjusting for treatment switching in the METRIC study shows further improved overall survival with trametinib compared with chemotherapy

Technology appraisal guidance Published: 27 July 2016 nice.org.uk/guidance/ta399

Treatment changes in cancer clinical trials: design and analysis

Statistical Challenges in Immunotherapy: Non Proportional Hazard Model. BBS / PSI CIT event 15-June-2017, Basel Claude BERGE, Roche

Statistical Models for Censored Point Processes with Cure Rates

Exploring and Validating Surrogate Endpoints in Colorectal Cancer. Center, Pittsburgh, USA

Statistical, clinical and ethical considerations when minimizing confounding for overall survival in cancer immunotherapy trials

Case Studies in Bayesian Augmented Control Design. Nathan Enas Ji Lin Eli Lilly and Company

Models for HSV shedding must account for two levels of overdispersion

Clinical trial design issues and options for the study of rare diseases

The NCPE has issued a recommendation regarding the use of pertuzumab for this indication. The NCPE do not recommend reimbursement of pertuzumab.

Beyond the intention-to treat effect: Per-protocol effects in randomized trials

Survival analysis beyond the Cox-model

Everolimus, lutetium-177 DOTATATE and sunitinib for treating unresectable or metastatic neuroendocrine tumours with disease progression MTA

Comparing treatments evaluated in studies forming disconnected networks of evidence: A review of methods

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3

Cost-effectiveness of osimertinib (Tagrisso )

Adjusting the Crossover Effect in Overall Survival Analysis Using a Rank Preserving Structural Failure Time Model: The Case of Sunitinib GIST Trial

Technology appraisal guidance Published: 16 May 2018 nice.org.uk/guidance/ta520

Technology appraisal guidance Published: 25 February 2015 nice.org.uk/guidance/ta333

Section 3: Economic evaluation

Dabrafenib for treating unresectable or metastatic BRAF V600 mutation-positive melanoma

State of the art pharmacoepidemiological study designs for post-approval risk assessment

Clinical Study Synopsis for Public Disclosure

Fundamental Clinical Trial Design

Individual- and trial-level surrogacy in colorectal cancer

Technology appraisal guidance Published: 18 July 2018 nice.org.uk/guidance/ta531

NICE decisions on health care provisions in England

ASSESSING LONG-TERM BENEFITS OF IMMUNOTHERAPY BASED ON EARLY TUMOR ASSESSMENT DATA

Instrumental Variables Estimation: An Introduction

Your submission states that the adjusted indirect comparison methodology was based largely on the publication by Glenny et al

A Comparison of Shape and Scale Estimators of the Two-Parameter Weibull Distribution

CHL 5225 H Advanced Statistical Methods for Clinical Trials. CHL 5225 H The Language of Clinical Trials

Overview. Goals of Interpretation. Methodology. Reasons to Read and Evaluate

Technology appraisal guidance Published: 27 June 2012 nice.org.uk/guidance/ta257

Nivolumab for adjuvant treatment of resected stage III and IV melanoma [ID1316] STA Lead team presentation: Cost Effectiveness Part 1

Partitioned survival analysis for decision modelling in health care: a critical review

Technology appraisal guidance Published: 27 January 2016 nice.org.uk/guidance/ta380

Using dynamic prediction to inform the optimal intervention time for an abdominal aortic aneurysm screening programme

Technology appraisal guidance Published: 7 October 2015 nice.org.uk/guidance/ta357

Received: 14 April 2016, Accepted: 28 October 2016 Published online 1 December 2016 in Wiley Online Library

Individual Participant Data (IPD) Meta-analysis of prediction modelling studies

BEST PRACTICES FOR IMPLEMENTATION AND ANALYSIS OF PAIN SCALE PATIENT REPORTED OUTCOMES IN CLINICAL TRIALS

Technology appraisal guidance Published: 29 June 2011 nice.org.uk/guidance/ta227

Technology appraisal guidance Published: 27 January 2016 nice.org.uk/guidance/ta381

Case Study in Placebo Modeling and Its Effect on Drug Development

Dealing with Treatment Switches in Cost effectiveness Analysis: Martin Pitt Martin Hoyle Peninsula Technology Assessment Group Exeter

Regulatory Challenges in Reviewing Oncology Product Applications

ASSESSMENT OF LEAD-TIME BIAS IN ESTIMATES OF RELATIVE SURVIVAL FOR BREAST CANCER

Introduction to Survival Analysis Procedures (Chapter)

Lead team presentation Brentuximab vedotin for relapsed or refractory systemic anaplastic large cell lymphoma (STA)

Making comparisons. Previous sessions looked at how to describe a single group of subjects However, we are often interested in comparing two groups

CLINICAL ASSISTED REPRODUCTION

Technology appraisal guidance Published: 12 December 2012 nice.org.uk/guidance/ta269

Cost-effectiveness of nivolumab with ipilimumab (Opdivo with Yervoy ) for the treatment of advanced (unresectable or metastatic) melanoma.

Simple Linear Regression

Daratumumab for treating relapsed and refractory multiple myeloma after a proteasome inhibitor and an immunomodulatory agent [ID933]

Generalizing the right question, which is?

Technology appraisal guidance Published: 28 September 2016 nice.org.uk/guidance/ta406

Technology appraisal guidance Published: 22 October 2014 nice.org.uk/guidance/ta321

Recursive Partitioning Method on Survival Outcomes for Personalized Medicine

Modelling Research Productivity Using a Generalization of the Ordered Logistic Regression Model

Bayesian Multiparameter Evidence Synthesis to Inform Decision Making: A Case Study in Metastatic Hormone-Refractory Prostate Cancer

Section A: Clarification on effectiveness data. Licensed population

TWISTED SURVIVAL: IDENTIFYING SURROGATE ENDPOINTS FOR MORTALITY USING QTWIST AND CONDITIONAL DISEASE FREE SURVIVAL. Beth A.

Real-world observational data in costeffectiveness analyses: Herceptin as a case study

Module 14: Missing Data Concepts

Technology appraisal guidance Published: 21 March 2018 nice.org.uk/guidance/ta512

THE UNIVERSITY OF OKLAHOMA HEALTH SCIENCES CENTER GRADUATE COLLEGE A COMPARISON OF STATISTICAL ANALYSIS MODELING APPROACHES FOR STEPPED-

Technology appraisal guidance Published: 22 June 2016 nice.org.uk/guidance/ta395

NICE DSU TECHNICAL SUPPORT DOCUMENT 19: PARTITIONED SURVIVAL ANALYSIS FOR DECISION MODELLING IN REPORT BY THE DECISION SUPPORT UNIT

Treatment effect estimates adjusted for small-study effects via a limit meta-analysis

Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome

SCHOOL OF MATHEMATICS AND STATISTICS

Estimands in oncology

The Statistical Analysis of Failure Time Data

Generalized Estimating Equations for Depression Dose Regimes

Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California

Optimal full matching for survival outcomes: a method that merits more widespread use

Technology appraisal guidance Published: 23 April 2014 nice.org.uk/guidance/ta310

Slides for Committee CIC redacted

THE ISSUE OF STAGE AT DIAGNOSIS

Strategies for handling missing data in randomised trials

Background Comparative effectiveness of nivolumab

Sample-size re-estimation in Multiple Sclerosis trials

ATTACH YOUR SAS CODE WITH YOUR ANSWERS.

Chair s presentation Lutetium (177lu) oxodotreotide for treating unresectable or metastatic neuroendocrine tumours in people with progressive disease

Glossary of abbreviations

Anale. Seria Informatică. Vol. XVI fasc Annals. Computer Science Series. 16 th Tome 1 st Fasc. 2018

Sample size calculation for a stepped wedge trial

NICE Single Technology Appraisal of cetuximab for the treatment of recurrent and /or metastatic squamous cell carcinoma of the head and neck

Big Data Challenges & Opportunities. L. Miriam Dickinson, PhD

Choice of appropriate endpoints in clinical trials. Fortunato Ciardiello Second University of Naples

Transcription:

Methods for adjusting survival estimates in the presence of treatment crossover a simulation study Nicholas Latimer, University of Sheffield Collaborators: Paul Lambert, Keith Abrams, Michael Crowther and James Morden Please note these are not final results: do not disseminate results without permission from authors

2 Contents Extensions compared to James study Data generation Scenarios Performance measures Methods Results Conclusions Please note: results subject to minor changes re-running due to minor errors in data generating model

3 Extensions Allow the treatment effect to change over time, based upon a timedependent covariate Generate a time-dependent covariate that represents the continuous progression of the disease as this increases, the relative treatment effect falls This involves relaxing the equal treatment effect assumption Allow the treatment crossover decision to be based upon timedependent covariates, rather than baseline characteristics Include observational-based methods IPCW SNM with g-estimation

Data Generation (1) 4 Used a two-stage Weibull model to generate underlying survival times and a time-dependent covariate (called CEA ) Longitudinal model for CEA (for ith patient at time t): where is the random intercept is the slope for a patient in the control arm is the slope for a patient in the treatment arm (all is the change in the intercept for a patient with bad prognosis compared to a patient without bad prognosis Picked parameter values such that CEA increased over time, more slowly in the experimental group, and was higher in the badprog group

5 Data Generation (2) The survival hazard function was based upon a Weibull (see Bender et al 2005): In our case, where α is the log hazard ratio (the treatment effect) is the impact of a bad prognosis baseline covariate on survival is the coefficient of CEA, indicating its effect on survival We used this to generate our survival times So, CEA has an effect on survival

6 Data Generation (3) We then estimated the treatment effect over time (in terms of an acceleration factor) based upon Collett s HR to AF formula: We used this to inflate survival times of crossover patients, based upon the time-point at which they started receiving the experimental treatment NOTE: This equation is wrong Collett s formula is only applicable when there are proportional hazards, and we do not have proportional hazards due to our time-dependent covariate Working on this Note unlikely to change our results as we are still doing what we intended applying a lower treatment effect to crossover patients

0.00 0.25 0.50 0.75 1.00 Acceleration Factor 7 Data Generation (4) We then selected parameter values in order that realistic datasets were created: Kaplan-Meier survival estimates 3 2.5 2 1.5 1 0 200 400 600 analysis time 800 1000 1200 0.5 Number at risk trtrand = 0 243 77 45 30 25 17 0 trtrand = 1 257 158 117 97 73 60 0 trtrand = 0 trtrand = 1 1 100 200 300 400 500 600 700 800 900 1000 Time (days)

Data Generation (5) 8 We made several assumptions about the crossover mechanism : 1. Crossover could only occur after disease progression (disease progression was approximately half of OS, calculated for each patient using a beta(5,5) distribution) 2. Crossover could only occur at 3 consultations following disease progression These were set at 21 day intervals Probability of crossover highest at initial consultation, then falls in second and third 3. Crossover probability depended on time-dependent covariates: CEA value at progression (high value reduced chance of crossover) Time to disease progression (high value increased chance of crossover) This was altered in scenarios to test a simpler mechanism where probability only depended on CEA Given all this, CEA was a time-dependent confounder

Scenarios 9 Variable Value Alternative Sample size 500 Number of prognosis groups (prog) 2 Probability of good prognosis 0.5 Probability of poor prognosis 0.5 Maximum follow-up time 3 years (1095 days) Multiplication of OS survival time due Log hazard ratio = 0.5 to bad prognosis group Survival time distribution Alter parameters to test two levels of disease severity Initially assumed treatment effect Alter to test two levels of treatment effect Time-dependence of treatment effect Treatment effect received depends upon CEA at time of crossover. However set α to zero in some scenarios. Also include additional treatment effect decrement in crossover patients in some scenarios to approximate a continued reduction in treatment effect over time in these patients Probability of switching treatment Test two levels of treatment crossover proportions over time Prognosis of crossover patients Test three crossover mechanisms in which different groups become more likely to cross over

Scenarios 10 Variable Value Alternative Sample size 500 Number of prognosis groups (prog) 2 Probability of good prognosis 0.5 Probability of poor prognosis 0.5 Maximum follow-up time 3 years (1095 days) Multiplication of OS survival time due to bad prognosis group Log hazard ratio = 0.5 Survival time distribution Alter parameters to test two levels of disease (2) severity Initially assumed treatment effect Alter to test two levels of treatment effect Time-dependence of treatment effect Treatment effect received depends upon CEA at time of crossover. However set α to zero in some scenarios. Also include additional treatment effect decrement in crossover patients in some scenarios to approximate a continued reduction in treatment effect over time in these patients Probability of switching treatment Test two levels of treatment crossover proportions over time Prognosis of crossover patients Test three crossover mechanisms in which different groups become more likely to cross over

Scenarios 11 Variable Value Alternative Sample size 500 Number of prognosis groups (prog) 2 Probability of good prognosis 0.5 Probability of poor prognosis 0.5 Maximum follow-up time 3 years (1095 days) Multiplication of OS survival time due to bad prognosis group Log hazard ratio = 0.5 Survival time distribution Alter parameters to test two levels of disease (2) severity Initially assumed treatment effect Alter to test two levels of treatment effect (4) Time-dependence of treatment effect Treatment effect received depends upon CEA at time of crossover. However set α to zero in some scenarios. Also include additional treatment effect decrement in crossover patients in some scenarios to approximate a continued reduction in treatment effect over time in these patients Probability of switching treatment Test two levels of treatment crossover proportions over time Prognosis of crossover patients Test three crossover mechanisms in which different groups become more likely to cross over

Scenarios 12 Variable Value Alternative Sample size 500 Number of prognosis groups (prog) 2 Probability of good prognosis 0.5 Probability of poor prognosis 0.5 Maximum follow-up time 3 years (1095 days) Multiplication of OS survival time due to bad prognosis group Log hazard ratio = 0.5 Survival time distribution Alter parameters to test two levels of disease (2) severity Initially assumed treatment effect Alter to test two levels of treatment effect (4) Time-dependence of treatment effect Treatment effect received depends upon CEA at time of crossover. However set α to zero in some scenarios. Also include additional treatment effect decrement in crossover patients in some scenarios to approximate a continued reduction in treatment effect over time in these patients Probability of switching treatment Test two levels of treatment crossover proportions over time Prognosis of crossover patients Test three crossover mechanisms in which different groups become more likely to cross over (8) (12)

Scenarios 13 Variable Value Alternative Sample size 500 Number of prognosis groups (prog) 2 Probability of good prognosis 0.5 Probability of poor prognosis 0.5 Maximum follow-up time 3 years (1095 days) Multiplication of OS survival time due to bad prognosis group Log hazard ratio = 0.5 Survival time distribution Alter parameters to test two levels of disease (2) severity Initially assumed treatment effect Alter to test two levels of treatment effect (4) Time-dependence of treatment effect Treatment effect received depends upon CEA at time of crossover. However set α to zero in some scenarios. Also include additional treatment effect decrement in crossover patients in some scenarios to approximate a continued reduction in treatment effect over time in these patients (8) (12) Probability of switching treatment Test two levels of treatment crossover proportions (24) over time Prognosis of crossover patients Test three crossover mechanisms in which different groups become more likely to cross over

Scenarios 14 Variable Value Alternative Sample size 500 Number of prognosis groups (prog) 2 Probability of good prognosis 0.5 Probability of poor prognosis 0.5 Maximum follow-up time 3 years (1095 days) Multiplication of OS survival time due to bad prognosis group Log hazard ratio = 0.5 Survival time distribution Alter parameters to test two levels of disease (2) severity Initially assumed treatment effect Alter to test two levels of treatment effect (4) Time-dependence of treatment effect Treatment effect received depends upon CEA at time of crossover. However set α to zero in some scenarios. Also include additional treatment effect decrement in crossover patients in some scenarios to approximate a continued reduction in treatment effect over time in these patients (8) (12) Probability of switching treatment Test two levels of treatment crossover proportions (24) over time Prognosis of crossover patients Test three crossover mechanisms in which different groups become more likely to cross over (72) This combined to 72 scenarios

Scenarios 15 Variable Value Alternative Sample size 500 Number of prognosis groups (prog) 2 Probability of good prognosis 0.5 Probability of poor prognosis 0.5 Maximum follow-up time 3 years (1095 days) Multiplication of OS survival time due Log hazard ratio = 0.5 to bad prognosis group Survival time distribution Alter parameters to test two levels of disease severity Initially assumed treatment effect Alter to test two levels of treatment effect Time-dependence of treatment effect Treatment effect received depends upon CEA at time of crossover. However set α to zero in some scenarios. Also include additional treatment effect decrement in crossover patients in some scenarios to approximate a continued reduction in treatment effect over time in these patients Probability of switching treatment Test two levels of treatment crossover proportions over time Prognosis of crossover patients Test three crossover mechanisms in which different groups become more likely to cross over

16 Performance measures Similar to James study We used bias as our primary performance measure Also assessed coverage However, because our treatment effect is time-dependent there is not a true HR or AF Therefore we used restricted mean survival as our true measure. We estimated the truth from our survivor function equations This is highly relevant for the context of economic evaluation But means that we had to estimate restricted mean survival for each crossover method not just the adjusted HR or AF

17 Estimating survival Three broad approaches assessed (all estimated out to 3 years): 1. Survivor function approach Apply treatment effect to survivor function (or hazard function) estimated for experimental group calculate AUC 2. Extrapolation approach Extrapolate counterfactual dataset to required time-point (only relevant for RPSFTM/IPE approaches) calculate AUC 3. Shrinkage approach Use estimated acceleration factor to shrink survival times in crossover patients in order to obtain an adjusted dataset calculate AUC (only relevent for AF-based approaches)

18 Methods Naive methods ITT Exclude crossover patients (PPexc) Censor crossover patients (PPcens) Treatment group as a time-dependent covariate (TDCM) Treatment crossover as a time-dependent indicator (XOTDCM)

19 Methods Complex methods RPSFTM with log-rank test (with and without covariates) IPE algorithm (Weibull and exponential versions, with and without covariates) IPCW SNM with g-estimation Two-stage Weibull method (Weib 2m) Note, we did not include: Walker et al s method due to poor performance in James study Loey and Goetghebeur s method as only for all-or-nothing compliance Law and Kaldor s method as fundamentally flawed And only included log-rank test version of RPSFTM

AUC mean bias (%) 20 Results (1) Randomisation-based methods worked very well when the treatment effect was not time-dependent, eg: 60.00 50.00 40.00 30.00 20.00 10.00 0.00-10.00-20.00 3 4 7 8 15 16 19 20 27 IPCW method performed poorly when crossover proportion was very high SNM method performed poorly Naive methods performed poorly Two-stage Weibull produced low bias ITT Weib 2m IPE Weib wc sfunc IPCW sfunc RPSFTM wc sfunc

AUC mean bias (%) 21 Results (2) 50.00 40.00 30.00 20.00 10.00-10.00-20.00-30.00 Randomisation-based methods produced large bias when the treatment effect was time-dependent, eg: 0.00 ITT Weib 2m IPE Weib wc sfunc IPE Weib wc shrink IPCW sfunc RPSFTM wc sfunc RPSFTM wc shrink 1 2 5 6 13 14 17 18 RPSFTM/IPE shrinkage approach performed better, but this is flawed IPCW method performed better than randomisation-based approaches providing crossover proportion was less than 90%, but stilll gave considerable bias

AUC mean bias (%) 22 Results (3) When there was an additional treatment effect decrement in crossover patients, indicating a particularly strong time-dependent treatment effect, the randomisation-based methods performed even less well: 50.00 40.00 30.00 20.00 10.00 0.00-10.00-20.00-30.00 ITT IPCW sfunc Weib 2m RPSFTM wc sfunc IPE Weib wc sfunc RPSFTM wc shrink IPE Weib wc shrink 9 10 11 12 21 22 23 24 33 IPCW is unaffected by this, and becomes more likely to produce least bias (excluding two-stage Weibull approach)

23 Conclusions RPSFTM / IPE survivor function methods produce very low levels of bias when the treatment effect is not time-dependent When the treatment effect (in terms of an AF) is 20-30% lower in crossover patients RPSFTM / IPE survivor function approaches produce high levels of bias (>10%) Shrinkage approaches perform with lower bias but these methods are flawed When the treatment effect decrement is >30% IPCW produces less bias than any RPSFTM / IPE variant, providing <90% of at-risk patients crossover But significant bias remains Where applicable, two-stage methods are worthy of consideration Survivor function approaches generally produce lower bias than extrapolation approaches, due to the loss of information associated with recensoring and the effect of this on the extrapolation