Individualized Treatment Effects Using a Non-parametric Bayesian Approach

Similar documents
Bayesian analysis of heterogeneous treatment effects for patient-centered outcomes research

Bayesians methods in system identification: equivalences, differences, and misunderstandings

The Statistical Analysis of Failure Time Data

Standards for Heterogeneity of Treatment Effect (HTE)

Evidence synthesis with limited studies

Case Studies in Bayesian Augmented Control Design. Nathan Enas Ji Lin Eli Lilly and Company

MS&E 226: Small Data

Bayesian Nonparametric Methods for Precision Medicine

Statistical Methods for the Evaluation of Treatment Effectiveness in the Presence of Competing Risks

TRIPODS Workshop: Models & Machine Learning for Causal I. & Decision Making

Bayesian Dose Escalation Study Design with Consideration of Late Onset Toxicity. Li Liu, Glen Laird, Lei Gao Biostatistics Sanofi

Advanced IPD meta-analysis methods for observational studies

Using dynamic prediction to inform the optimal intervention time for an abdominal aortic aneurysm screening programme

Introduction to Survival Analysis Procedures (Chapter)

Decision Making in Confirmatory Multipopulation Tailoring Trials

Biostatistics II

DRAFT (Final) Concept Paper On choosing appropriate estimands and defining sensitivity analyses in confirmatory clinical trials

Bayesian Inference. Thomas Nichols. With thanks Lee Harrison

Sawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc.

LOGO. Statistical Modeling of Breast and Lung Cancers. Cancer Research Team. Department of Mathematics and Statistics University of South Florida

cloglog link function to transform the (population) hazard probability into a continuous

A Case Study: Two-sample categorical data

For general queries, contact

Introduction. Patrick Breheny. January 10. The meaning of probability The Bayesian approach Preview of MCMC methods

Design for Targeted Therapies: Statistical Considerations

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1

A (Constructive/Provocative) Critique of the ICH E9 Addendum

Hierarchy of Statistical Goals

Progress in Risk Science and Causality

Bayes Linear Statistics. Theory and Methods

Benefit - Risk Analysis for Oncology Clinical Trials

BIOSTATISTICAL METHODS

Knowledge Discovery and Data Mining. Testing. Performance Measures. Notes. Lecture 15 - ROC, AUC & Lift. Tom Kelsey. Notes

Meta-analysis using individual participant data: one-stage and two-stage approaches, and why they may differ

How to interpret scientific & statistical graphs

Bayesian Joint Modelling of Longitudinal and Survival Data of HIV/AIDS Patients: A Case Study at Bale Robe General Hospital, Ethiopia

Fundamental Clinical Trial Design

2014 Modern Modeling Methods (M 3 ) Conference, UCONN

Analysis methods for improved external validity

An Introduction to Bayesian Statistics

Introduction to Meta-analysis of Accuracy Data

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

Supplement for: CD4 cell dynamics in untreated HIV-1 infection: overall rates, and effects of age, viral load, gender and calendar time.

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

Biomarkers in oncology drug development

Statistical methods for the meta-analysis of full ROC curves

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Bayesian Joint Modelling of Benefit and Risk in Drug Development

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.

Data Analysis Using Regression and Multilevel/Hierarchical Models

Ecological Statistics

Practical Bayesian Design and Analysis for Drug and Device Clinical Trials

Ch.20 Dynamic Cue Combination in Distributional Population Code Networks. Ka Yeon Kim Biopsychology

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

PRACTICAL STATISTICS FOR MEDICAL RESEARCH

NEW METHODS FOR SENSITIVITY TESTS OF EXPLOSIVE DEVICES

A novel approach to estimation of the time to biomarker threshold: Applications to HIV

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

STATISTICS AND RESEARCH DESIGN

The Pretest! Pretest! Pretest! Assignment (Example 2)

STATS8: Introduction to Biostatistics. Overview. Babak Shahbaba Department of Statistics, UCI

Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets

Predicting Breast Cancer Survival Using Treatment and Patient Factors

Graphical Modeling Approaches for Estimating Brain Networks

Statistics and Probability

Ch 1.1 & 1.2 Basic Definitions for Statistics

NIH Public Access Author Manuscript Conf Proc IEEE Eng Med Biol Soc. Author manuscript; available in PMC 2013 February 01.

SUPPLEMENTARY MATERIAL

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Effective Implementation of Bayesian Adaptive Randomization in Early Phase Clinical Development. Pantelis Vlachos.

A Mini-conference on Bayesian Methods at Lund University 5th of February, 2016 Room C121, Lux building, Helgonavägen 3, Lund University.

A web application for conducting the continual reassessment method

Bayesian Models for Combining Data Across Subjects and Studies in Predictive fmri Data Analysis

George B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide

BREAST CANCER EPIDEMIOLOGY MODEL:

Search e Fall /18/15

Bayesian methods in health economics

T-Statistic-based Up&Down Design for Dose-Finding Competes Favorably with Bayesian 4-parameter Logistic Design

Abstract: Heart failure research suggests that multiple biomarkers could be combined

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp

Genome-Wide Localization of Protein-DNA Binding and Histone Modification by a Bayesian Change-Point Method with ChIP-seq Data

Linear Regression in SAS

Motivation: Fraud Detection

Bayesian additive decision trees of biomarker by treatment interactions for predictive biomarkers detection and subgroup identification

Using Electronic Health Records Data for Predictive and Causal Inference About the HIV Care Cascade

Biostatistics for Med Students. Lecture 1

BEST PRACTICES FOR IMPLEMENTATION AND ANALYSIS OF PAIN SCALE PATIENT REPORTED OUTCOMES IN CLINICAL TRIALS

CHL 5225 H Advanced Statistical Methods for Clinical Trials. CHL 5225 H The Language of Clinical Trials

Appropriate Statistical Methods to Account for Similarities in Binary Outcomes Between Fellow Eyes

Cumulative metformin use and its impact on survival in gastric cancer patients after INDEX

Package QualInt. February 19, 2015

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California

Drug Supply for Adaptive Trials

Does Machine Learning. In a Learning Health System?

Summarising and validating test accuracy results across multiple studies for use in clinical practice

Joint modeling and dynamic predictions with applications to cancer research

Problem set 2: understanding ordinary least squares regressions

Transcription:

Individualized Treatment Effects Using a Non-parametric Bayesian Approach Ravi Varadhan Nicholas C. Henderson Division of Biostatistics & Bioinformatics Department of Oncology Johns Hopkins University July 27, 2017

Acknowledgements Drs. Tom Louis and Gary Rosner Patient-Centered Outcomes Research Institute (PCORI) Award (ME-1303-5896) Dr. Dan Scharfstein and the CTSA Supplement Award 2

Why is HTE Important? The paradox of the clinical trial is that it is the best way to assess whether an intervention works, but is arguably the worst way to assess who benefits from it. (Mant, 1999) 3

Heterogeneity of Treatment Effect (HTE) Heterogeneity of Treatment Effect (HTE) refers to variability in treatment response that is attributable to observable differences in patient characteristics. Consistency of treatment effect across key patient subgroups - subgroup analysis - most prevalent form of HTE assessment Tests for treatment-covariate interactions - less common. 4

HTE Goals/Questions Our view: HTE assessment encompasses a broad range of related goals and questions. This goes beyond conventional subgroup analysis and treatment-covariate interactions. Key questions of interest include: Quantifying overall heterogeneity in treatment response. Estimating the proportion of patients that benefit from treatment Detection of cross-over (qualitative) interactions. Estimation and prediction of individualized treatment effects. Optimal allocation of treatments to individuals 5

Modeling HTE In contrast to subgroup analysis, many important HTE questions could be directly addressed if a sufficiently rich model describing patient outcomes were available. Bayesian nonparametric methods are well-suited to provide this individual-level view of HTE. Bayesian nonparametrics allow construction of flexible models for patient outcomes coupled with probability modeling of all unknown quantities Motivation of this work: Develop a flexible, non-parametric approach that can address many of the previously highlighted HTE goals. 6

Why Bayes? The phrase heterogeneity of treatment effects implies an underlying distribution of treatment effects Thus a Bayesian framework appears natural Emphasizes estimation of treatment effect heterogeneity rather than hypothesis testing. Well-suited to estimation with many parameters and small subgroups. Tends to prevent over-fitting. 7

Why Bayes? Direct probability statements for questions of interest: e.g., what is the probability that a given individual will benefit from the treatment? Customized treatment recommendations - can utilize the posterior for each individual, can directly weigh efficacy versus safety. 8

Time-to-Event Data and Notation Our focus here is on cases where patient outcomes are recorded as time-to-events. For the i th patient, we observe Y i = min{t i, C i } T i - failure time C i - censoring time δ i = 1 if T i C i, and δ i = 0 if T i > C i A i - treatment assignment, A i = 1 or A i = 0 x i - a collection of baseline covariates 9

Accelerated Failure Time (AFT) Models and Individualized Treatment Effects We assume patients are randomly assigned to one of two treatments A = 1 or A = 0. Consider the AFT model for log-failure time T i log T i = m(a i, x i) }{{} Regression Function + W }{{} i ; E(W i ) = 0 Error Term The Individualized Treatment Effect θ(x i ) for the i th patient is the difference between expected log-failure under treatment A = 1 and expected log-failure time under treatment A = 0 θ(x i) = E [ [ ] log T i A i = 1, x i] E log Ti A i = 0, x i = m(1, x i) m(0, x i). The ratio of expected failure times offers a more interpretable measure of treatment effect ξ(x i ) = E[ ] T i A i = 1, x i E [ ] = exp{θ(x i )} T i A i = 0, x i 10

Accelerated Failure Time (AFT) Models In contrast to Cox-proportional hazards model, AFT models provide a direct relationship between survival times and patient covariates. AFT models have a nice interpretation as a regression with log-time as the response. They provide interpretable measures of treatment effect: i.e., differences in expected log-survival time or ratios in expected survival, 11

Modeling the Regression Function Our flexible approach to modeling the regression function m(a i, x i) of the AFT model is to use Bayesian additive regression trees (BART) Advantages of BART: Good at handling interactions and non-linearities Very effective as an off-the-shelf method - works quite well without any hyperparameter tuning. Seamlessly incorporates both discrete and continuous predictors. Automatically provides measures of uncertainty despite the complex nature of the model. 12

Using the Nonparametric AFT model to assess HTE The posterior distribution of all unknowns in the AFT model can be used to assess a variety of questions. For example, Point estimates of covariate-specific treatment effects The distribution of treatment effects Proportion of patient expected to benefit from treatment Qualitative interactions. The posterior can potentially be utilized in an individualized decision analysis 13

Application: The SOLVD trial A placebo-controlled trial studying the efficacy of the drug Enalapril in chronic heart failure patients 2, 569 enrolled in the treatment trial and 4, 228 enrolled in the prevention trial We utilized 18 patient covariates common to both trials (e.g., age, gender, ejection fraction) 14

The AFTrees package The methods discussed here are implemented using the R package: AFTrees. ## An example: library(aftrees) solvd.fit <- AFTrees(X, y, status, ndpost = 2000) ## X - design matrix ## y - follow-up time ## status - event indicator (1 if event, 0 otherwise) ## ndpost - number of posterior draws The AFTrees package is available for download at http://www.hteguru.com/software 15

Individualized treatment effect estimates for all patients in the SOLVD treatment and prevention trials. 6000 5000 Patient index 4000 3000 2000 1000 0 0.5 0.0 0.5 1.0 1.5 Difference in expected log failure time 16

Distribution of treatment effects in the SOLVD trials. 2.0 Treatment Trial Prevention Trial 1.5 Density 1.0 0.5 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 treatment effect (ratio in expected survival) 17

Proportion Benefiting The proportion of patients benefiting is the proportion of individuals with a positive treatment effect (i.e., θ(x i ) > 0) Q = 1 n n 1{θ(x i ) > 0} = 1 n i=1 n 1{ξ(x i ) > 1} i=1 Alternatively, one could define the proportion benefiting relative to a clinically relevant threshold ε > 0, i.e., Q ɛ = 1 n n i=1 1{θ(x i) > ε}. An estimate of Q is obtained from taking the area under the curve to the right of 1 in the graph of treatment effect distribution (shown on the previous slide). The estimated proportions of patients benefiting were 95.6% and 89.1% in the SOLVD-T and SOLVD-P trials respectively. 18

Evidence of Benefit Treatment Trial Prevention Trial P{ξ(x i ) > 1 data} (0.99, 1] 51.38 20.47 P{ξ(x i ) > 1 data} (0.95, 0.99] 24.69 23.71 P{ξ(x i ) > 1 data} (0.75, 0.95] 20.08 41.98 P{ξ(x i ) > 1 data} (0.25, 0.75] 3.85 13.84 P{ξ(x i ) > 1 data} [0, 0.25] 0.00 0.00 Table: For each trial, the percentage of patients whose estimated posterior probability of treatment benefit lies within each of the intervals (0.99, 1], (0.95, 0.99], (0.75, 0.95], (0.25, 0.75], and [0, 0.25]. 19

Evidence of Differential Treatment Effect For patient i, the posterior probability of a greater than average treatment effect may be defined as D i = P{θ(x i ) θ data}, θ = 1 n θ(x i ) n and the posterior probability of a differential treatment effect is D i = max{1 2D i, 2D i 1}. Note that D i will be close to 1 whenever D i is either close to 1 or close to 0. Treatment Trial i=1 Prevention Trial Di > 0.95 19.36 7.30 Di > 0.80 41.93 31.58 Table: For each trial, the percentage of patients that show strong (i.e., D i >.95) and mild (i.e., D i > 0.80 but D i 0.95) evidence of differential treatment effect. 20

Individual-Specific Posterior Survival Curves in SOLVD treatment trial 1.0 0.8 Survival Probability 0.6 0.4 0.2 0.0 Enalapril Placebo Enalapril KM estimate Placebo KM estimate 0 500 1000 1500 Time 21

Examining Qualitative Interactions Beyond quantitative heterogeneity, examination of qualitative interactions is often of key interest. Qualitative Interaction: occurs when the treatment effect in one subgroup has a different sign than in another subgroup. The presence of qualitative interactions can be examined by looking at the posterior histogram. For pre-specified subgroups of interest such as male vs. female, we can look at the posteriors of the subgroup-level treatment effects θ male = θ female = 1 N male i male 1 θ(x i ) N female i female θ(x i ) 22

SOLVD data: posterior of θ male and θ female P { sign(θ male ) sign(θ female ) data } = 0.035 3 Male Female Density 2 1 0 0.2 0.0 0.2 0.4 0.6 0.8 1.0 difference in log survival time (days) 23

Variable Importance: Partial Dependence Plots 0.600 0.9 0.595 0.8 difference in log survival 0.590 0.585 0.580 difference in log survival 0.7 0.6 0.575 0.5 0.570 30 40 50 60 70 80 age 10 15 20 25 30 35 ejection fraction 24

Summary A flexible Bayesian approach for HTE assessment Fully non-parametric Default hyper-parameter settings, hence minimal user input Easy to use R package: AFTrees http://www.hteguru.com/software Give it a try and send us feedback! 25