An Introduction to Dynamic Treatment Regimes

Similar documents
The Right Treatment for the Right Patient (at the Right Time) Marie Davidian Department of Statistics North Carolina State University

SMARTs in Practice: Case Studies in Design and Analysis. Marie Davidian and Eric B. Laber

Developing Adaptive Health Interventions

SMART Clinical Trial Designs for Dynamic Treatment Regimes

Introduction to Adaptive Interventions and SMART Study Design Principles

Instrumental Variables Estimation: An Introduction

Mathematical-Statistical Modeling to Inform the Design of HIV Treatment Strategies and Clinical Trials

Structural Approach to Bias in Meta-analyses

DRAFT (Final) Concept Paper On choosing appropriate estimands and defining sensitivity analyses in confirmatory clinical trials

Beyond the intention-to treat effect: Per-protocol effects in randomized trials

Lecture 1: Introduction to Personalized Medicine. Donglin Zeng, Department of Biostatistics, University of North Carolina

Lecture 5: Sequential Multiple Assignment Randomized Trials (SMARTs) for DTR. Donglin Zeng, Department of Biostatistics, University of North Carolina

Mathematical-Statistical Modeling to Inform the Design of HIV Treatment Strategies and Clinical Trials

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

60 minutes. This is the 4 th module of a 6 module Seminar on experimental designs for building optimal adaptive health interventions.

AVOIDING BIAS AND RANDOM ERROR IN DATA ANALYSIS

Propensity scores: what, why and why not?

The RoB 2.0 tool (individually randomized, cross-over trials)

Confounding by indication developments in matching, and instrumental variable methods. Richard Grieve London School of Hygiene and Tropical Medicine

Lecture 2: Foundations of Concept Learning

ISPOR Task Force Report: ITC & NMA Study Questionnaire

Our goal in this section is to explain a few more concepts about experiments. Don t be concerned with the details.

50 min. SMART Study Designs for Developing Adaptive Interventions. SA Murphy 1

Comparisons of Dynamic Treatment Regimes using Observational Data

Regulatory Challenges in Reviewing Oncology Product Applications

Online Supplementary Material

Statistical, clinical and ethical considerations when minimizing confounding for overall survival in cancer immunotherapy trials

BIOSTATISTICAL METHODS

Comparing Dynamic Treatment Regimes Using Repeated-Measures Outcomes: Modeling Considerations in SMART Studies

CHL 5225 H Advanced Statistical Methods for Clinical Trials. CHL 5225 H The Language of Clinical Trials

Overview of Clinical Study Design Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine US National

In this second module in the clinical trials series, we will focus on design considerations for Phase III clinical trials. Phase III clinical trials

Automatic learning of adaptive treatment strategies. using kernel-based reinforcement learning

About Reading Scientific Studies

Introduction to Applied Research in Economics Kamiljon T. Akramov, Ph.D. IFPRI, Washington, DC, USA

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

Sequential, Multiple Assignment, Randomized Trials

Lecture Outline Biost 517 Applied Biostatistics I. Statistical Goals of Studies Role of Statistical Inference

Research Design & Protocol Development

Getting ready for propensity score methods: Designing non-experimental studies and selecting comparison groups

3 Phase III Clinical Trials

Placebo and Belief Effects: Optimal Design for Randomized Trials

Inverse Probability of Censoring Weighting for Selective Crossover in Oncology Clinical Trials.

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

Getting SMART about Adaptive Interventions: A Conceptual Introduction Rona L. Levy With many thanks to Daniel Almirall

Systemic Treatment of Acute Myeloid Leukemia (AML)

Assessing the impact of unmeasured confounding: confounding functions for causal inference

Phase II Design. Kim N. Chi NCIC/NCIC-CTG NEW INVESTIGATORS CLINICAL TRIALS COURSE

Randomized Controlled Trial

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Multiple Regression Models

Fundamental Clinical Trial Design

Methods to control for confounding - Introduction & Overview - Nicolle M Gatto 18 February 2015

Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_

Eligibility Form. 1. Patient Profile. (This form must be completed before the first dose is dispensed.) Request prior approval for enrolment

Propensity Score Methods to Adjust for Bias in Observational Data SAS HEALTH USERS GROUP APRIL 6, 2018

Methodological requirements for realworld cost-effectiveness assessment

Version No. 7 Date: July Please send comments or suggestions on this glossary to

Analysis A step in the research process that involves describing and then making inferences based on a set of data.

Research Approach & Design. Awatif Alam MBBS, Msc (Toronto),ABCM Professor Community Medicine Vice Provost Girls Section

OHDSI Tutorial: Design and implementation of a comparative cohort study in observational healthcare data

Introduction to Observational Studies. Jane Pinelis

PubH 7405: REGRESSION ANALYSIS. Propensity Score

Introduction to Applied Research in Economics

Treatment changes in cancer clinical trials: design and analysis

Introduction to Design and Analysis of SMARTs

Technical Specifications

Villarreal Rm. 170 Handout (4.3)/(4.4) - 1 Designing Experiments I

Effective Implementation of Bayesian Adaptive Randomization in Early Phase Clinical Development. Pantelis Vlachos.

on Overall Survival in Acute Leukemia

This memo includes background of the project, the major themes within the comments, and specific issues that require further CSAC/HITAC action.

Design for Targeted Therapies: Statistical Considerations

Statistical Decision Theory and Bayesian Analysis

On Adaptive Interventions and SMART

The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0

Supplementary Online Content

16:35 17:20 Alexander Luedtke (Fred Hutchinson Cancer Research Center)

CASE STUDY 2: VOCATIONAL TRAINING FOR DISADVANTAGED YOUTH

J 13 (10) : , 1995 STUDY DESIGN AND CONDUCT

Doing High Quality Field Research. Kim Elsbach University of California, Davis

Adaptive Interventions Treatment Modelling and Regimen Optimization Using Sequential Multiple Assignment Randomized Trials (Smart) and Q- Learning

Biostatistics Primer

ICH E9 (R1) Addendum on Estimands and Sensitivity Analysis

Estimands. EFPIA webinar Rob Hemmings, Frank Bretz October 2017

Using Statistical Principles to Implement FDA Guidance on Cardiovascular Risk Assessment for Diabetes Drugs

Lecture 2. Key Concepts in Clinical Research

Estimands and Sensitivity Analysis in Clinical Trials E9(R1)

SWITCH Trial. A Sequential Multiple Adaptive Randomization Trial

Estimands for time to event endpoints in oncology and beyond

Instrumental Variables I (cont.)

DATA CORE MEETING. Observational studies in dementia and the MELODEM initiative. V. Shane Pankratz, Ph.D.

Generalizing the right question, which is?

Rise of the Machines

EC352 Econometric Methods: Week 07

A Brief Introduction to Bayesian Statistics

Recent advances in non-experimental comparison group designs

Implementing double-robust estimators of causal effects

National Surgical Adjuvant Breast and Bowel Project (NSABP) Foundation Annual Progress Report: 2009 Formula Grant

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Transcription:

An Introduction to Dynamic Treatment Regimes Marie Davidian Department of Statistics North Carolina State University http://www4.stat.ncsu.edu/davidian 1/64 Dynamic Treatment Regimes Webinar

Outline What is a dynamic treatment regime, and why study them? Clinical trials to study dynamic treatment regimes Thinking in terms of dynamic treatment regimes Constructing dynamic treatment regimes Discussion 2/64 Dynamic Treatment Regimes Webinar

Hot topic Personalized Medicine Source of graphic: http://www.personalizedmedicine.com/ 3/64 Dynamic Treatment Regimes Webinar

A perspective on personalized medicine Clinical practice: Clinicians make (a series of) treatment decisions(s) over the course of a patient s disease or disorder Key decision points in the disease process Fixed schedule, milestone in the disease process, event necessitating a decision Several treatment options at each decision point Accruing information on the patient 4/64 Dynamic Treatment Regimes Webinar

A perspective on personalized medicine Clinical practice: Clinicians make (a series of) treatment decisions(s) over the course of a patient s disease or disorder Key decision points in the disease process Fixed schedule, milestone in the disease process, event necessitating a decision Several treatment options at each decision point Accruing information on the patient Personalize treatment to the patient 4/64 Dynamic Treatment Regimes Webinar

A perspective on personalized medicine Clinical practice: Clinicians make (a series of) treatment decisions(s) over the course of a patient s disease or disorder Key decision points in the disease process Fixed schedule, milestone in the disease process, event necessitating a decision Several treatment options at each decision point Accruing information on the patient Personalize treatment to the patient That is: Treatment in practice involves sequential decision-making based on accruing information Suggests thinking about and studying treatment from this perspective... 4/64 Dynamic Treatment Regimes Webinar

Clinical decision-making How are these decisions made? Clinical judgment Practice guidelines based on study results, expert opinion Synthesize all information on a patient up to the point of the decision to determine the next treatment action 5/64 Dynamic Treatment Regimes Webinar

Clinical decision-making How are these decisions made? Clinical judgment Practice guidelines based on study results, expert opinion Synthesize all information on a patient up to the point of the decision to determine the next treatment action Can clinical decision-making be formalized and made evidence-based? 5/64 Dynamic Treatment Regimes Webinar

Dynamic treatment regime Dynamic treatment regime: A set of sequential decision rules, each corresponding to a key decision point Each rule dictates the treatment to be given from among the available options based on the accrued information on the patient to that point Taken together, the rules define an algorithm for making treatment decisions Dynamic because the treatment action can vary depending on the accrued information Ideally, provides an evidence-based approach to personalized treatment 6/64 Dynamic Treatment Regimes Webinar

Treatment regime Terminology/Convention: Often, treatment regime is used to refer generally to any approach to deciding on treatment And dynamic treatment regime is reserved for the case where patient information is used We will use these terms interchangeably In fact: Many common situations can be cast as involving (dynamic) treatment regimes 7/64 Dynamic Treatment Regimes Webinar

ADHD therapy Sequential (scheduled) decision points Decision 1: Low dose therapy 2 options: medication or behavior modification Subsequent monthly decisions: Responders Continue initial therapy Non-responders 2 options: add the other therapy or increase dose of current therapy Objective: Improved end-of-school-year performance Example from Susan Murphy, University of Michigan 8/64 Dynamic Treatment Regimes Webinar

Cancer treatment Two (milestone) decision points: Decision 1 : Induction chemotherapy (options C 1, C 2 ) Decision 2 : Maintenance treatment for patients who respond (options M 1, M 2 ) Salvage chemotherapy for those who don t respond (options S 1, S 2 ) Objective : Maximize survival time 9/64 Dynamic Treatment Regimes Webinar

Possible treatment regimes Possible rules at Decision 1: Give C 1 (non-dynamic ) 10/64 Dynamic Treatment Regimes Webinar

Possible treatment regimes Possible rules at Decision 1: Give C 1 (non-dynamic ) If age < 50, progesterone receptor level < 10 fmol, RAD51 mutation, then give C 1, else, give C 2 10/64 Dynamic Treatment Regimes Webinar

Possible treatment regimes Possible rules at Decision 1: Give C 1 (non-dynamic ) If age < 50, progesterone receptor level < 10 fmol, RAD51 mutation, then give C 1, else, give C 2 If patient is a Libra, Scorpio, or Sagittarius, give C 1, else, give C 2 10/64 Dynamic Treatment Regimes Webinar

Possible treatment regimes Possible rules at Decision 1: Give C 1 (non-dynamic ) If age < 50, progesterone receptor level < 10 fmol, RAD51 mutation, then give C 1, else, give C 2 If patient is a Libra, Scorpio, or Sagittarius, give C 1, else, give C 2 Possible rules at Decision 2: If patient responds, give maintenance M 1 ; if does not respond, give salvage S 1 (dynamic ) 10/64 Dynamic Treatment Regimes Webinar

Possible treatment regimes Possible rules at Decision 1: Give C 1 (non-dynamic ) If age < 50, progesterone receptor level < 10 fmol, RAD51 mutation, then give C 1, else, give C 2 If patient is a Libra, Scorpio, or Sagittarius, give C 1, else, give C 2 Possible rules at Decision 2: If patient responds, give maintenance M 1 ; if does not respond, give salvage S 1 (dynamic ) If patient responds, age < 60, CEA > 10 ng/ml, progesterone receptor level < 8 fmol, give M 1, else, give M 2 ; if does not respond, age > 65, P53 mutation, CA 15-3 > 25 units/ml, then give S 1, else, give S 2 10/64 Dynamic Treatment Regimes Webinar

Possible treatment regimes Result: Rules, and thus regimes, can be simple or complex (or not realistic ) More complex rules involve more personalization and more closely mimic clinical practice There is an infinitude of possible rules at each decision point, and thus an infinitude of possible regimes Ultimate goal : Find the best or optimal regime Regimes of interest and optimal depend on the question For definiteness, assume larger outcomes are preferred 11/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 1. Classical treatment comparison: Focus on a single decision point 12/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 1. Classical treatment comparison: Focus on a single decision point Cancer example: Decision 1 Two regimes of interest: Give C 1 vs. Give C 2 Class of regimes of interest is D = { Give C 1, Give C 2 } 12/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 1. Classical treatment comparison: Focus on a single decision point Cancer example: Decision 1 Two regimes of interest: Give C 1 vs. Give C 2 Class of regimes of interest is D = { Give C 1, Give C 2 } Usual question : If all patients in the population were to be given C 1, would mean outcome (mean survival time ) be different from (better than ) that if all patients in the population were to be given C 2? 12/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 1. Classical treatment comparison: Focus on a single decision point Cancer example: Decision 1 Two regimes of interest: Give C 1 vs. Give C 2 Class of regimes of interest is D = { Give C 1, Give C 2 } Usual question : If all patients in the population were to be given C 1, would mean outcome (mean survival time ) be different from (better than ) that if all patients in the population were to be given C 2? Optimal regime in D: The regime such that, if all patients in the population were to receive treatment according to it, mean outcome would be the largest among all regimes in D (here, Give C 1 or Give C 2 ) 12/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 2. Which is the best treatment sequence? Multiple decision points 13/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 2. Which is the best treatment sequence? Multiple decision points Cancer example: Eight dynamic regimes of interest: 13/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 2. Which is the best treatment sequence? Multiple decision points Cancer example: Eight dynamic regimes of interest: 1. Give C 1 followed by (M 1 if response, S 1 if no response) 2. Give C 1 followed by (M 1 if response, S 2 if no response) 3. Give C 1 followed by (M 2 if response, S 1 if no response) 4. Give C 1 followed by (M 2 if response, S 2 if no response) 5. Give C 2 followed by (M 1 if response, S 1 if no response) 6. Give C 2 followed by (M 1 if response, S 2 if no response) 7. Give C 2 followed by (M 2 if response, S 1 if no response) 8. Give C 2 followed by (M 2 if response, S 2 if no response) Class D of interest contains these 8 regimes 13/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 2. Which is the best treatment sequence? Multiple decision points Cancer example: Eight dynamic regimes of interest: 1. Give C 1 followed by (M 1 if response, S 1 if no response) 2. Give C 1 followed by (M 1 if response, S 2 if no response) 3. Give C 1 followed by (M 2 if response, S 1 if no response) 4. Give C 1 followed by (M 2 if response, S 2 if no response) 5. Give C 2 followed by (M 1 if response, S 1 if no response) 6. Give C 2 followed by (M 1 if response, S 2 if no response) 7. Give C 2 followed by (M 2 if response, S 1 if no response) 8. Give C 2 followed by (M 2 if response, S 2 if no response) Class D of interest contains these 8 regimes Question: Comparison of mean outcomes if all patients in the population were to follow each regime 13/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 2. Which is the best treatment sequence? Multiple decision points Cancer example: Eight dynamic regimes of interest: 1. Give C 1 followed by (M 1 if response, S 1 if no response) 2. Give C 1 followed by (M 1 if response, S 2 if no response) 3. Give C 1 followed by (M 2 if response, S 1 if no response) 4. Give C 1 followed by (M 2 if response, S 2 if no response) 5. Give C 2 followed by (M 1 if response, S 1 if no response) 6. Give C 2 followed by (M 1 if response, S 2 if no response) 7. Give C 2 followed by (M 2 if response, S 1 if no response) 8. Give C 2 followed by (M 2 if response, S 2 if no response) Class D of interest contains these 8 regimes Question: Comparison of mean outcomes if all patients in the population were to follow each regime Optimal regime in D: The regime such that, if all patients were to receive treatment according to it, mean outcome would be the largest among all regimes in D 13/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 3. Best dynamic regime in a feasible class? Single or multiple decision points Cancer example: Decision 1 14/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 3. Best dynamic regime in a feasible class? Single or multiple decision points Cancer example: Decision 1 X 1 = (lots of) patient information available at Decision 1 In resource-limited setting, interested in rules depending on a subset of X 1 routinely collected, e.g., of form If age < η 1 and PR < η 2 give C 2 ; else give C 1 PR = progesterone receptor level 14/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 3. Best dynamic regime in a feasible class? Single or multiple decision points Cancer example: Decision 1 X 1 = (lots of) patient information available at Decision 1 In resource-limited setting, interested in rules depending on a subset of X 1 routinely collected, e.g., of form If age < η 1 and PR < η 2 give C 2 ; else give C 1 PR = progesterone receptor level Class D of interest consists of all regimes of this form (so for all values of η 1 and η 2 ) 14/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 3. Best dynamic regime in a feasible class? Single or multiple decision points Cancer example: Decision 1 X 1 = (lots of) patient information available at Decision 1 In resource-limited setting, interested in rules depending on a subset of X 1 routinely collected, e.g., of form If age < η 1 and PR < η 2 give C 2 ; else give C 1 PR = progesterone receptor level Class D of interest consists of all regimes of this form (so for all values of η 1 and η 2 ) Optimal regime in D: The regime defined by values η opt 1, η opt 2 such that, if all patients in the population were to receive treatment according to it, mean outcome would be the largest among all regimes in D 14/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 4. Optimal overall dynamic treatment regime: Single or multiple decision points Cancer example: Two decision points 15/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 4. Optimal overall dynamic treatment regime: Single or multiple decision points Cancer example: Two decision points X 1 = patient information available at Decision 1, X 2 = additional information collected between Decisions 1 and 2 Accrued information at each decision Decision 1 H 1 = X 1 Decision 2 H 2 = {X 1, A 1, X 2 } 15/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 4. Optimal overall dynamic treatment regime: Single or multiple decision points Cancer example: Two decision points X 1 = patient information available at Decision 1, X 2 = additional information collected between Decisions 1 and 2 Accrued information at each decision Decision 1 H 1 = X 1 Decision 2 H 2 = {X 1, A 1, X 2 } Class D of interest: All possible sets of rules {d 1 (H 1 ), d 2 (H 2 )} Each rule takes as input the accrued information and outputs a treatment from among the available options 15/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes 4. Optimal overall dynamic treatment regime: Single or multiple decision points Cancer example: Two decision points X 1 = patient information available at Decision 1, X 2 = additional information collected between Decisions 1 and 2 Accrued information at each decision Decision 1 H 1 = X 1 Decision 2 H 2 = {X 1, A 1, X 2 } Class D of interest: All possible sets of rules {d 1 (H 1 ), d 2 (H 2 )} Each rule takes as input the accrued information and outputs a treatment from among the available options Optimal regime in D: {d opt 1 (H 1 ), d opt 2 (H 2 )} such that, if all patients were to receive treatment according to it, mean outcome would be the largest among all regimes in D 15/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes In all of Cases 1 4: A set of rules at each of K decision points, K = 1 or 2, depending on accrued information Dynamic treatment regime Decision 1 H 1 = X 1 Decision 2 H 2 = {X 1, A 1, X 2 } d = d 1 (H 1 ) or d = {d 1 (H 1 ), d 2 (H 2 )} 16/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes In all of Cases 1 4: A set of rules at each of K decision points, K = 1 or 2, depending on accrued information Dynamic treatment regime Decision 1 H 1 = X 1 Decision 2 H 2 = {X 1, A 1, X 2 } d = d 1 (H 1 ) or d = {d 1 (H 1 ), d 2 (H 2 )} Case 1 : K = 1, rules of form (simple ) d 1 (H 1 ) = C j for all H 1, j = 1, 2 16/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes In all of Cases 1 4: A set of rules at each of K decision points, K = 1 or 2, depending on accrued information Dynamic treatment regime Decision 1 H 1 = X 1 Decision 2 H 2 = {X 1, A 1, X 2 } d = d 1 (H 1 ) or d = {d 1 (H 1 ), d 2 (H 2 )} Case 1 : K = 1, rules of form (simple ) d 1 (H 1 ) = C j for all H 1, j = 1, 2 Case 2 : K = 2, rules of form (simple ) X 2 contains response status d 1 (H 1 ) = C j for all H 1, j = 1, 2 d 2 (H 2 ) = M k if response, S l if no response, k, l = 1, 2 16/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes In all of Cases 1 4: A set of rules at each of K decision points, K = 1 or 2, depending on accrued information Dynamic treatment regime Decision 1 H 1 = X 1 Decision 2 H 2 = {X 1, A 1, X 2 } d = d 1 (H 1 ) or d = {d 1 (H 1 ), d 2 (H 2 )} 17/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes In all of Cases 1 4: A set of rules at each of K decision points, K = 1 or 2, depending on accrued information Dynamic treatment regime Decision 1 H 1 = X 1 Decision 2 H 2 = {X 1, A 1, X 2 } d = d 1 (H 1 ) or d = {d 1 (H 1 ), d 2 (H 2 )} Case 3 : K = 1, code {C 1, C 2 } = {0, 1}, rules of form d 1 (H 1 ) = I(age < η 1, PR < η 2 ) 17/64 Dynamic Treatment Regimes Webinar

Classes of treatment regimes In all of Cases 1 4: A set of rules at each of K decision points, K = 1 or 2, depending on accrued information Dynamic treatment regime Decision 1 H 1 = X 1 Decision 2 H 2 = {X 1, A 1, X 2 } d = d 1 (H 1 ) or d = {d 1 (H 1 ), d 2 (H 2 )} Case 3 : K = 1, code {C 1, C 2 } = {0, 1}, rules of form d 1 (H 1 ) = I(age < η 1, PR < η 2 ) Case 4 : K = 2, general rules {d 1 (H 1 ), d 2 (H 2 )}; e.g., with two options coded as {0, 1} at each decision d 1 (H 1 ) = I(η T 1 H 1 > 0), d 2 (H 2 ) = I(η T 2 H 2 > 0) Rules involve linear combinations of accrued information 17/64 Dynamic Treatment Regimes Webinar

Studying dynamic treatment regimes How do we find an optimal treatment regime within a class of interest? Required : Appropriate data Case 1. Classical, single decision treatment comparison : Data from a standard clinical trial comparing C 1 and C 2 Case 2. Optimal treatment sequence for two decision points (simple dynamic treatment regimes) We will return to Cases 3 and 4 later 18/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Recall: In our example, D consists of eight regimes 1. Give C 1 followed by (M 1 if response, S 1 if no response) 2. Give C 1 followed by (M 1 if response, S 2 if no response) 3. Give C 1 followed by (M 2 if response, S 1 if no response) 4. Give C 1 followed by (M 2 if response, S 2 if no response) 5. Give C 2 followed by (M 1 if response, S 1 if no response) 6. Give C 2 followed by (M 1 if response, S 2 if no response) 7. Give C 2 followed by (M 2 if response, S 1 if no response) 8. Give C 2 followed by (M 2 if response, S 2 if no response) How do we compare the regimes in D and identify the best? 19/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Can t we base this on data from a series of previous trials? In one trial, C 1 was compared against C 2 in terms of response rate 20/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Can t we base this on data from a series of previous trials? In one trial, C 1 was compared against C 2 in terms of response rate In another trial, M 1 and M 2 were compared on the basis of survival time in subjects who responded to their induction chemotherapy 20/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Can t we base this on data from a series of previous trials? In one trial, C 1 was compared against C 2 in terms of response rate In another trial, M 1 and M 2 were compared on the basis of survival time in subjects who responded to their induction chemotherapy In yet another, S 1 and S 2 were compared (survival ) in subjects for whom induction therapy did not induce response 20/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Can t we base this on data from a series of previous trials? In one trial, C 1 was compared against C 2 in terms of response rate In another trial, M 1 and M 2 were compared on the basis of survival time in subjects who responded to their induction chemotherapy In yet another, S 1 and S 2 were compared (survival ) in subjects for whom induction therapy did not induce response Can t we just piece together the results from these separate trials to figure out the best regime? E.g., figure out the best C treatment for inducing response and then the best M and S treatments for prolonging survival? Wouldn t the regime that uses these have to have the best mean outcome? 20/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes One problem with this: Delayed effects E.g., C 1 may yield a higher proportion of responders than C 2 but may also have other effects that render subsequent maintenance treatments less effective in terms of mean survival time Implication : Must study entire regimes in the same patients 21/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes One problem with this: Delayed effects E.g., C 1 may yield a higher proportion of responders than C 2 but may also have other effects that render subsequent maintenance treatments less effective in terms of mean survival time Implication : Must study entire regimes in the same patients Data for doing this: Design a clinical trial expressly for this purpose (next ) Use longitudinal observational data, where treatments actually received at each decision point have been recorded (with other information) 21/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Clinical trials: An eight arm trial subjects randomized to the jth arm follow the jth regime A Sequential, Multiple Assignment, Randomized Trial (next slide... ) How to analyze the data to compare regimes and find the optimal regime? What else can be learned from such trials? 22/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes SMART: Sequential, Multiple Assignment, Randomized Trial (Randomization at s) M 1 Response M 2 C 1 No Response S 1 S 2 Cancer Response M 1 C 2 M 2 No Response S 1 Pioneered by Susan Murphy, Phil Lavori, and others S 2 23/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Embedded regimes: The eight regimes in D are embedded in the SMART M 1 Response M 2 C 1 No Response S 1 S 2 Cancer Response M 1 C 2 M 2 No Response S 1 S 2 24/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Examples of SMARTs: SMARTs have been carried out or are ongoing, mainly in behavioral disorders; see http://methodology.psu.edu/ra/smart/projects SMARTs have also been done in oncology (coming up... ) 25/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Examples of SMARTs: SMARTs have been carried out or are ongoing, mainly in behavioral disorders; see http://methodology.psu.edu/ra/smart/projects SMARTs have also been done in oncology (coming up... ) Remarks: There is really no conceptual difference between randomizing up front or sequentially Advantages and disadvantages, e.g., consent, balance Important : Making efficient use of the data Seminal reference: Murphy SA. (2005). An experimental design for the development of adaptive treatment strategies, Statistics in Medicine, 24, 1455 1481. 25/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Remark 1: Individuals following the same regime can have different realized treatment experiences, e.g., Give C 1 followed by (M 1 if response, S 1 if no response) Subject 1 : Receives C 1, responds, receives M 1 Subject 2 : Receives C 1, does not respond, receives S 1 Both subjects experiences are consistent with following this regime 26/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Remark 1: Individuals following the same regime can have different realized treatment experiences, e.g., Give C 1 followed by (M 1 if response, S 1 if no response) Subject 1 : Receives C 1, responds, receives M 1 Subject 2 : Receives C 1, does not respond, receives S 1 Both subjects experiences are consistent with following this regime Remark 2: Individuals following different regimes can have the same realized treatment experience, e.g., experience C 1 Response M 1 is consistent with having followed EITHER OF regimes C 1 followed by (M 1 if response, S 1 if no response) C 1 followed by (M 1 if response, S 2 if no response) 26/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Remark 3: Do not confuse the regime with the possible realized experiences that can result from following it C 1 followed by response followed by M 1 and C 1 followed by no response followed by S 1 are not regimes but are possible results of following the above regime The regime is the algorithm (set of rules) 27/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Remark 3: Do not confuse the regime with the possible realized experiences that can result from following it C 1 followed by response followed by M 1 and C 1 followed by no response followed by S 1 are not regimes but are possible results of following the above regime The regime is the algorithm (set of rules) Remark 4: Do not confuse dynamic treatment regimes themselves or SMARTs with response-adaptive clinical trial designs for classical treatment comparisons A dynamic treatment regime is an algorithm for treating a single patient This has nothing to do with other patients in a study An adaptive trial is one in which the data are used to alter the design (e.g., drop an arm, sample size) The design of a SMART does not change 27/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Estimation of mean outcome (e.g., mean survival): Usual approach under up-front randomization : estimate mean for regime j by sample average outcome based on subjects randomized to regime j only 28/64 Dynamic Treatment Regimes Webinar

Clinical trials for studying treatment regimes Estimation of mean outcome (e.g., mean survival): Usual approach under up-front randomization : estimate mean for regime j by sample average outcome based on subjects randomized to regime j only However : Subjects will have realized experiences consistent with more than one regime! This can be exploited to improve precision... 28/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Demonstration: A certain kind of SMART is common in oncology...... but way these trials are usually analyzed does not focus on comparing the embedded dynamic treatment regimes and finding the best treatment sequence We demonstrate the general principle of how to exploit realized experiences consistent with more than one regime to do this Reference: Lunceford JK, Davidian M, Tsiatis AA. (2002). Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials. Biometrics, 58, 48 57. 29/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Cancer and Leukemia Group B (CALGB) Protocol 8923: Double-blind, placebo-controlled trial of 338 elderly subjects with acute myelogenous leukemia (AML) with randomizations at two key decision points 30/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Cancer and Leukemia Group B (CALGB) Protocol 8923: Double-blind, placebo-controlled trial of 338 elderly subjects with acute myelogenous leukemia (AML) with randomizations at two key decision points Decision 1 : Subjects randomized to either standard induction chemotherapy C 1 OR standard induction therapy + granulocyte-macrophage colony-stimulating factor (GM-CSF ) C 2 (two options) Decision 2 : If response, subjects randomized to M 1, M 2 = intensification/maintenance treatments I, II (two options) If no response, only one option: follow-up with physician All subjects followed for the outcome survival time 30/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Four possible regimes: The class D of interest comprises 1. C 1 followed by (M 1 if response, else follow-up) (C 1 M 1 ) 2. C 1 followed by (M 2 if response, else follow-up) (C 1 M 2 ) 3. C 2 followed by (M 1 if response, else follow-up) (C 2 M 1 ) 4. C 2 followed by (M 2 if response, else follow-up) (C 2 M 2 ) 31/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Schematic of CALGB 8923: Randomization at s Non- Response Follow-up Chemo + Placebo Intensification I Response Intensification II AML Non- Response Follow-up Chemo + GM-CSF Intensification I Response Intensification II 32/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Standard analysis: Compare response rates to C 1 and C 2 Compare survival between M 1 and M 2 among responders Compare survival between C 1 and C 2 regardless of subsequent response Does not address the embedded regimes 33/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Goal: Find the regime in D such that, if all patients in the population were to receive treatment according to it, mean survival would be the largest 34/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Goal: Find the regime in D such that, if all patients in the population were to receive treatment according to it, mean survival would be the largest Estimate mean survival if all patients followed each of the four embedded regimes C j M k, j = 1, 2, k = 1, 2 Use data from all subjects whose realized experience is consistent with having followed C j M k I.e., subjects with either C j response M k C j no response follow up with physician 34/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Statistical framework: Causal inference perspective Characterize in terms of potential outcomes Consider first: Classical single decision treatment comparison 35/64 Dynamic Treatment Regimes Webinar

Statistical framework Case 1: Classical, single decision treatment comparison D = { Give C 1, Give C 2 } Hypothesize potential outcomes under each regime in D 36/64 Dynamic Treatment Regimes Webinar

Statistical framework Case 1: Classical, single decision treatment comparison D = { Give C 1, Give C 2 } Hypothesize potential outcomes under each regime in D Y (1) = outcome that would be achieved if a randomly chosen patient from the population were to follow regime Give C 1 ; Y (2) defined analogously E(Y (1) ) = the mean outcome if all patients in the population were to follow Give C 1 ; E(Y (2) ) analogously 36/64 Dynamic Treatment Regimes Webinar

Statistical framework Case 1: Classical, single decision treatment comparison D = { Give C 1, Give C 2 } Hypothesize potential outcomes under each regime in D Y (1) = outcome that would be achieved if a randomly chosen patient from the population were to follow regime Give C 1 ; Y (2) defined analogously E(Y (1) ) = the mean outcome if all patients in the population were to follow Give C 1 ; E(Y (2) ) analogously Usual question : If all patients in the population were to be given C 1, would mean outcome be different from (better than ) that if all patients were to be given C 2? Compare E(Y (1) ) and E(Y (2) ) 36/64 Dynamic Treatment Regimes Webinar

Statistical framework Clinical trial: Do not observe Y (1) and Y (2) on each subject If A = 1 (2) if subject randomized to Give C 1 ( Give C 2 ), we do observe (Y, A), where Y = Y (1) I(A = 1) + Y (2) I(A = 2) 37/64 Dynamic Treatment Regimes Webinar

Statistical framework Clinical trial: Do not observe Y (1) and Y (2) on each subject If A = 1 (2) if subject randomized to Give C 1 ( Give C 2 ), we do observe (Y, A), where Y = Y (1) I(A = 1) + Y (2) I(A = 2) By randomization, Y (1), Y (2) A E(Y (1) ) = E(Y (1) A = 1) = E(Y A = 1) and similarly for E(Y (2) ) Thus, from observed data (Y i, A i ), i = 1,..., n (iid), can estimate n E(Y (1) i=1 ) by Y ii(a i = 1) n i=1 I(A i = 1), the usual sample average, and E(Y (2) ) similarly 37/64 Dynamic Treatment Regimes Webinar

Statistical framework Case 2: Optimal treatment sequence for two decision points D = { C j M k, j, k = 1, 2 } Hypothesize potential outcomes under each regime in D 38/64 Dynamic Treatment Regimes Webinar

Statistical framework Case 2: Optimal treatment sequence for two decision points D = { C j M k, j, k = 1, 2 } Hypothesize potential outcomes under each regime in D Y (jk) = survival time that would be achieved if a randomly chosen patient from the population were to follow C j M k 38/64 Dynamic Treatment Regimes Webinar

Statistical framework Case 2: Optimal treatment sequence for two decision points D = { C j M k, j, k = 1, 2 } Hypothesize potential outcomes under each regime in D Y (jk) = survival time that would be achieved if a randomly chosen patient from the population were to follow C j M k Question : Compare mean survival if all patients followed each of C j M k, j, k = 1, 2 Compare (estimate ) E(Y (jk) ), j, k = 1, 2 Or survival probabilities S jk (t) = pr(y (jk) > t) = E{I(Y (jk) > t)}, j, k = 1, 2 Assume no censoring (can be generalized ) 38/64 Dynamic Treatment Regimes Webinar

Statistical framework Clinical trial (e.g., SMART): Do not observe Y (jk), j, k = 1, 2 Can we make a connection between potential outcomes and observed data as we did in Case 1? 39/64 Dynamic Treatment Regimes Webinar

Statistical framework Clinical trial (e.g., SMART): Do not observe Y (jk), j, k = 1, 2 Can we make a connection between potential outcomes and observed data as we did in Case 1? Consider j = 1; j = 2 similar Observed for each subject: (R, RZ, Y ) Y = survival time R = 1 if subject responds to C 1, R = 0 if not Z = k for responder randomized to M k, k = 1, 2 (not defined if R = 0) 39/64 Dynamic Treatment Regimes Webinar

Statistical framework Clinical trial (e.g., SMART): Do not observe Y (jk), j, k = 1, 2 Can we make a connection between potential outcomes and observed data as we did in Case 1? Consider j = 1; j = 2 similar Observed for each subject: (R, RZ, Y ) Y = survival time R = 1 if subject responds to C 1, R = 0 if not Z = k for responder randomized to M k, k = 1, 2 (not defined if R = 0) Assume when R = 0, Y (11), Y (12) are the same ; then Y = (1 R)Y (11) + RI(Z = 1)Y (11) + RI(Z = 2)Y (12) From observed data (R i, R i Z i, Y i ), i = 1,..., n (iid), Estimate E(Y (11) ), E(Y (12) ) and similarly for j = 2 39/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Consider j = 1: Responders to C 1 are randomized to M 1 with probability π = 1/2 Nonresponders to C 1 follow up Half of responders get M 1, half get M 2 Estimate mean survival for C 1 M 1 by weighted average Nonresponders represent themselves weight = 1 Each responder who got M 1 represents him/herself and another similar subject who got randomized to M 2 weight = 2 Estimator for C 1 M 2, switch roles Note : Survival times from nonresponders are used to estimate the means for both C 1 M 1 and C 1 M 2 40/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Formally: For j = 1 (j = 2 similar), (R i, R i Z i, Y i ), i = 1,..., n Y i = survival time for subject i R i = 1 if i responds to C 1, R i = 0 if not Z i = k for responder randomized to M k, k = 1, 2 pr(z i = 1 R i = 1) = π (= 1/2 in previous) 41/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Formally: For j = 1 (j = 2 similar), (R i, R i Z i, Y i ), i = 1,..., n Y i = survival time for subject i R i = 1 if i responds to C 1, R i = 0 if not Z i = k for responder randomized to M k, k = 1, 2 pr(z i = 1 R i = 1) = π (= 1/2 in previous) Estimators for E(Y (11) ): Q i = 1 R i + R i I(Z i = 1) π 1 n 1 n Q i Y i i=1 or ( n i=1 Q i) 1 n i=1 Q i Y i Q i = 0 if i is inconsistent with C 1 M 1 (consistent with C 1 M 2 ) Q i = 1 if R i = 0 Q i = π 1 if R i = 1 and Z i = 1 Similarly for E(Y (12) ) 41/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Estimators for E(Y (11) ): Q i = 1 R i + R i I(Z i = 1) π 1 n 1 n Q i Y i i=1 or ( n i=1 Q i) 1 n i=1 Q i Y i Can show : E(QY ) = E(Y (11) ), E(Q) = 1 And similarly for j, k = 1, 2 Consistent estimators for E(Y (jk) ) (Appendix) Estimators for E(Y (jk) ), k = 1, 2, are correlated Can derive statistics for comparison identify optimal regime in D 42/64 Dynamic Treatment Regimes Webinar

Estimating mean outcome for embedded regimes Remarks: Subjects may die before having a chance to respond nonresponders at the time of death (R = 0) Survival time may be right-censored can incorporate inverse probability of censoring weighting Randomization at each decision is key subjects are prognostically similar Can be generalized to arbitrary number of decisions, numbers of options at each 43/64 Dynamic Treatment Regimes Webinar

Considerations: Designing SMARTs Class of regimes should involve key decision points where it is feasible to randomize And with more than one treatment option and no consensus on choice among options Simplicity small numbers of decision points and options Embedded regimes should have simple decision rules ; e.g., depending only on a few variables (response status ) Criteria and methods for sample size determination is an open problem Critical : Collect rich patient information at baseline and between decision points to inform development of more complex, optimal regimes (e.g., Cases 3 and 4) More shortly... 44/64 Dynamic Treatment Regimes Webinar

Designing SMARTs Schematic of CALGB 8923: Randomization at s Non- Response Follow-up Chemo + Placebo Intensification I Response Intensification II AML Non- Response Follow-up Chemo + GM-CSF Intensification I Response Intensification II 45/64 Dynamic Treatment Regimes Webinar

Thinking in terms of dynamic treatment regimes Questions not addressed in a conventional clinical trial: If a treatment is effective, what should be the duration of administration? How would the randomized treatments have compared if no patients had discontinued their assigned treatments? 46/64 Dynamic Treatment Regimes Webinar

Thinking in terms of dynamic treatment regimes Questions not addressed in a conventional clinical trial: If a treatment is effective, what should be the duration of administration? How would the randomized treatments have compared if no patients had discontinued their assigned treatments? Such questions can be cast as questions about dynamic treatment regimes Available data are almost always observational Databases from registries Databases from completed clinical trials 46/64 Dynamic Treatment Regimes Webinar

Thinking in terms of dynamic treatment regimes Example: Optimal treatment duration ESPRIT trial Integrilin vs. placebo in PCI/stent patients Primary analysis : Integrilin superior Protocol : Infusion duration of 18 24 hours with mandatory stopping for adverse events Duration of infusion left to physician discretion What should be the recommended treatment duration? Data are observational with respect to this question 47/64 Dynamic Treatment Regimes Webinar

Thinking in terms of dynamic treatment regimes Example: Optimal treatment duration ESPRIT trial Integrilin vs. placebo in PCI/stent patients Primary analysis : Integrilin superior Protocol : Infusion duration of 18 24 hours with mandatory stopping for adverse events Duration of infusion left to physician discretion What should be the recommended treatment duration? Data are observational with respect to this question More precisely: Treatment duration of t hours means infuse for t hours or until an adverse event requiring stopping, whichever comes first This is a dynamic treatment regime for each t because realized duration depends on the adverse event status Johnson BA, Tsiatis AA. (2004). Estimating mean response as a function of treatment duration in an observational study, where duration may be informatively censored. Biometrics, 60, 315 323. 47/64 Dynamic Treatment Regimes Webinar

Thinking in terms of dynamic treatment regimes Duration regime of t hours: AE before t hours Stop infusion immediately Start Integrilin infusion No AE before t hours Stop infusion at t hours D = { all regimes of the form infuse for t hours or until an adverse event requiring stopping, whichever comes first for 18 t 24 } Objective : Find t opt [18, 24] leading to largest mean outcome (probability of no CVD event in 30 days) 48/64 Dynamic Treatment Regimes Webinar

Thinking in terms of dynamic treatment regimes Example: Treatment comparison in presence of treatment discontinuation SYNERGY trial - enoxaparin (ENOX) vs. unfractionated heparin (UFH) in ACS patients (open label ) Primary (intent-to-treat) analysis : No difference Lots of treatment discontinuation (switching, stopping) Some mandatory due to adverse events, some at clinician/patient discretion How do the treatments compare if there were no discontinuation? 49/64 Dynamic Treatment Regimes Webinar

Thinking in terms of dynamic treatment regimes Example: Treatment comparison in presence of treatment discontinuation SYNERGY trial - enoxaparin (ENOX) vs. unfractionated heparin (UFH) in ACS patients (open label ) Primary (intent-to-treat) analysis : No difference Lots of treatment discontinuation (switching, stopping) Some mandatory due to adverse events, some at clinician/patient discretion How do the treatments compare if there were no discontinuation? Objective: Compare the two dynamic treatment regimes Take ENOX (UFH) until completion or discontinuation for mandatory reasons Zhang M, Tsiatis AA, Davidian M, Pieper KS, Mahaffey KW. (2011). Inference on treatment effects from a clinical trial in the presence of premature treatment discontinuation: The SYNERGY trial. Biostatistics, 12, 258 269. 49/64 Dynamic Treatment Regimes Webinar

Studying regimes based on observational data Again: Data are observational with respect to these questions Decisions on duration, treatment discontinuation were not randomized Made at clinician/patient discretion 50/64 Dynamic Treatment Regimes Webinar

Studying regimes based on observational data Again: Data are observational with respect to these questions Decisions on duration, treatment discontinuation were not randomized Made at clinician/patient discretion Difficulties for studying regimes: Confounding subjects receiving one treatment or another may not be prognostically similar E.g., subjects who discontinued may be sicker, older, etc Standard methods are available to adjust for confounding, e.g., regression, propensity scores, etc, assuming no unmeasured confounders However, the time-dependent nature of treatment causes additional complications 50/64 Dynamic Treatment Regimes Webinar

Studying regimes based on observational data Time-dependent confounding: Treatments actually received over time depend on accruing information Temptation : Adjust for such time-dependent confounding E.g., a Cox model for outcome including time-dependent intermediate variables and treatments However : Part of the effect of treatment on outcome may be mediated through intermediate variables Adjustment would incorrectly remove this effect and hence misrepresent the true treatment effect 51/64 Dynamic Treatment Regimes Webinar

Studying regimes based on observational data Resolution: Requires a generalization of no unmeasured confounders Unverifiable from the observed data 52/64 Dynamic Treatment Regimes Webinar

Studying regimes based on observational data Resolution: Requires a generalization of no unmeasured confounders Unverifiable from the observed data Sequential randomization assumption: At any point where a treatment decision is made, the treatment received (among the options available) depends only on the accrued information on the patient and not additionally on his/her future prognosis At some level, this must be true In a SMART, this is automatically true by randomization With observational data, is tenable only if all accrued information used to make decisions is available in the database 52/64 Dynamic Treatment Regimes Webinar

Studying regimes based on observational data Under sequential randomization: Inference on dynamic treatment regimes Can use weighted methods similar to those discussed earlier for Case 2, extended to multiple decision points Critical difference : Rather than weighting based on known randomization probabilities, weighting is based on the propensities of receiving treatment at each decision as a function of accrued information Modeling/estimation of propensities 53/64 Dynamic Treatment Regimes Webinar

Studying regimes based on observational data Under sequential randomization: Inference on dynamic treatment regimes Can use weighted methods similar to those discussed earlier for Case 2, extended to multiple decision points Critical difference : Rather than weighting based on known randomization probabilities, weighting is based on the propensities of receiving treatment at each decision as a function of accrued information Modeling/estimation of propensities Moral: Many complex questions can be posed in terms of a class of dynamic treatment regimes Methods are available for inference on regimes in the class 53/64 Dynamic Treatment Regimes Webinar

Constructing dynamic treatment regimes Cases 3 and 4: More complex regimes focused on personalizing treatment to the patient Case 3 : D = specified class of feasible regimes Case 4 : D = all possible regimes Rules involve accrued information on the patient 54/64 Dynamic Treatment Regimes Webinar

Constructing dynamic treatment regimes Cases 3 and 4: More complex regimes focused on personalizing treatment to the patient Case 3 : D = specified class of feasible regimes Case 4 : D = all possible regimes Rules involve accrued information on the patient Can we estimate an optimal regime within these classes? From data from a SMART in which detailed accruing information was collected? From data from an observational database? 54/64 Dynamic Treatment Regimes Webinar

Characterizing an optimal regime Demonstration: Characterize an optimal regime d opt in the class D of all possible regimes d (Case 4 ) Single decision point Two treatment options coded as {0, 1} d D is a single rule d 1 (X 1 ) taking values 0 or 1 Data from a conventional clinical trial (simplest SMART) (X 1i, A 1i, Y i ), i = 1,..., n (iid) A 1 is treatment received taking values {0, 1} Assume large outcomes are better 55/64 Dynamic Treatment Regimes Webinar

Characterizing an optimal regime Potential outcome for a regime: For any regime d D Y (0) and Y (1) are potential outcomes if a randomly chosen patient were to receive treatments 0 and 1, respectively 56/64 Dynamic Treatment Regimes Webinar

Characterizing an optimal regime Potential outcome for a regime: For any regime d D Y (0) and Y (1) are potential outcomes if a randomly chosen patient were to receive treatments 0 and 1, respectively Potential outcome if a randomly chosen patient were to follow regime d Y (d) = Y (1) I{d(X 1 ) = 1} + Y (0) I{d(X 1 ) = 0} = Y (1) d(x 1 ) + Y (0) {1 d(x 1 )} E(Y (d) ) = mean outcome if all patients in the population were to follow regime d 56/64 Dynamic Treatment Regimes Webinar

Characterizing an optimal regime Potential outcome for a regime: For any regime d D Y (0) and Y (1) are potential outcomes if a randomly chosen patient were to receive treatments 0 and 1, respectively Potential outcome if a randomly chosen patient were to follow regime d Y (d) = Y (1) I{d(X 1 ) = 1} + Y (0) I{d(X 1 ) = 0} = Y (1) d(x 1 ) + Y (0) {1 d(x 1 )} E(Y (d) ) = mean outcome if all patients in the population were to follow regime d Optimal regime d opt : d opt maximizes E(Y (d) ) among all d D Can we estimate d opt satisfying this from the trial data? 56/64 Dynamic Treatment Regimes Webinar

Estimating an optimal regime Observed outcome: Y = Y (1) I(A 1 = 1) + Y (0) I(A 1 = 0) = Y (1) A 1 + Y (0) (1 A 1 ) By randomization, Y (0), Y (1) A 1 X 1 E(Y (1) X 1 ) = E(Y (1) X 1, A 1 = 1) = E(Y X 1, A 1 = 1) and similarly for Y (0) 57/64 Dynamic Treatment Regimes Webinar

Estimating an optimal regime Observed outcome: Y = Y (1) I(A 1 = 1) + Y (0) I(A 1 = 0) = Y (1) A 1 + Y (0) (1 A 1 ) By randomization, Y (0), Y (1) A 1 X 1 E(Y (1) X 1 ) = E(Y (1) X 1, A 1 = 1) = E(Y X 1, A 1 = 1) and similarly for Y (0) Thus: E(Y (d) ) = E{ E(Y (d) X 1 ) } [ ] = E E(Y (1) X 1 )d(x 1 ) + E(Y (0) X 1 ){1 d(x 1 )} [ ] = E E(Y (1) X 1, A 1 = 1)d(X 1 ) + E(Y (0) X 1, A 1 = 0){1 d(x 1 )} [ ] = E E(Y X 1, A 1 = 1)d(X 1 ) + E(Y X 1, A 1 = 0){1 d(x 1 )} 57/64 Dynamic Treatment Regimes Webinar

Estimating an optimal regime Recall: We wish to maximize [ ] E(Y (d) ) = E E(Y X 1, A 1 = 1)d(X 1 )+E(Y X 1, A 1 = 0){1 d(x 1 )} Clearly : E(Y (d) ) is maximized by d opt (X 1 ) = I{ E(Y X 1, A 1 = 1) > E(Y X 1, A 1 = 0) } E(Y X 1, A 1 ) is the regression of outcome on baseline information and treatment received 58/64 Dynamic Treatment Regimes Webinar

Estimating an optimal regime Recall: We wish to maximize [ ] E(Y (d) ) = E E(Y X 1, A 1 = 1)d(X 1 )+E(Y X 1, A 1 = 0){1 d(x 1 )} Clearly : E(Y (d) ) is maximized by d opt (X 1 ) = I{ E(Y X 1, A 1 = 1) > E(Y X 1, A 1 = 0) } E(Y X 1, A 1 ) is the regression of outcome on baseline information and treatment received Suggests: Posit a regression model for E(Y X 1, A 1 ) Q(X 1, A 1 ; β) Fit the model to trial data Q(X 1, A 1 ; β) Estimated optimal regime d opt (X 1 ) = I{ Q(X 1, 1; β) > Q(X 1, 0; β) } 58/64 Dynamic Treatment Regimes Webinar

Estimating an optimal regime Recall: We wish to maximize [ ] E(Y (d) ) = E E(Y X 1, A 1 = 1)d(X 1 )+E(Y X 1, A 1 = 0){1 d(x 1 )} Clearly : E(Y (d) ) is maximized by d opt (X 1 ) = I{ E(Y X 1, A 1 = 1) > E(Y X 1, A 1 = 0) } E(Y X 1, A 1 ) is the regression of outcome on baseline information and treatment received Suggests: Posit a regression model for E(Y X 1, A 1 ) Q(X 1, A 1 ; β) Fit the model to trial data Q(X 1, A 1 ; β) Estimated optimal regime d opt (X 1 ) = I{ Q(X 1, 1; β) > Q(X 1, 0; β) } Issue : What if the model Q(X 1, A 1 ; β) is misspecified? 58/64 Dynamic Treatment Regimes Webinar

Estimating an optimal regime Shameless promotion: Discussion of estimation of an optimal regime within a broad class of regimes D with a focus on personalized treatment as in Cases 3 and 4 merits its own shortcourse Robustness to misspecification of models? Alternative approaches? Extension to multiple decision points? Etc, etc... 59/64 Dynamic Treatment Regimes Webinar

Estimating an optimal regime Shameless promotion: Discussion of estimation of an optimal regime within a broad class of regimes D with a focus on personalized treatment as in Cases 3 and 4 merits its own shortcourse Robustness to misspecification of models? Alternative approaches? Extension to multiple decision points? Etc, etc... Personalized Medicine and Dynamic Treatment Regimes Half-day shortcourse at 2015 ENAR Spring Meeting (Sunday, March 15, morning) 59/64 Dynamic Treatment Regimes Webinar

Estimating an optimal regime Shameless promotion: Discussion of estimation of an optimal regime within a broad class of regimes D with a focus on personalized treatment as in Cases 3 and 4 merits its own shortcourse Robustness to misspecification of models? Alternative approaches? Extension to multiple decision points? Etc, etc... Personalized Medicine and Dynamic Treatment Regimes Half-day shortcourse at 2015 ENAR Spring Meeting (Sunday, March 15, morning) Forthcoming book: Kosorok, M. R. and Moodie, E. E. M. (2015). Adaptive Treatment Strategies in Practice: Planning Trials and Analyzing Data for Personalized Medicine. SIAM. 59/64 Dynamic Treatment Regimes Webinar

Discussion Dynamic treatment regimes formalize clinical decision-making and provide a framework for personalized treatment A broad range of problems can be cast in terms of dynamic treatment regimes SMARTs are the gold standard data source for estimation of dynamic treatment regimes Design considerations for SMARTs? Broader adoption? Implications for how treatments are evaluated? Estimation of optimal treatment regimes is a wide open area of research 60/64 Dynamic Treatment Regimes Webinar

Thought Leaders 2013 MacArthur Fellow Susan Murphy and Jamie Robins 61/64 Dynamic Treatment Regimes Webinar