Non-inferiority: Issues for today and developments for tomorrow. Kevin Carroll AstraZeneca

Similar documents
European Federation of Statisticians in the Pharmaceutical Industry (EFSPI)

Issues regarding non-inferiority within the anti-bacterials area. Jon Armstrong, AstraZeneca Pharmaceuticals

This is a repository copy of Practical guide to sample size calculations: non-inferiority and equivalence trials.

Noninferiority Clinical Trials

Guidance Document for Claims Based on Non-Inferiority Trials

Understanding noninferiority trials

Methodological aspects of non-inferiority and equivalence trials

Meta-analysis to compare combination therapies: A case study in kidney transplantation

Fundamental Clinical Trial Design

Statistics for Clinical Trials: Basics of Phase III Trial Design

Evaluating Adaptive Dose Finding Designs and Methods

HOW ARE NON-INFERIORITY MARGINS SELECTED IN NON- INFERIORITY TRIALS AND HOW DO THEY VARY WITHIN AND ACROSS FOUR MAJOR DISEASE DOMAINS?

Rolling Dose Studies

Choice of appropriate endpoints in clinical trials. Fortunato Ciardiello Second University of Naples

Biomarkers in oncology drug development

Discussion Meeting for MCP-Mod Qualification Opinion Request. Novartis 10 July 2013 EMA, London, UK

DRAFT (Final) Concept Paper On choosing appropriate estimands and defining sensitivity analyses in confirmatory clinical trials

Regulatory Considerations for Determining Vaccine Efficacy U.S. FDA Perspective

THIS PROBLEM HAS BEEN SOLVED BY USING THE CALCULATOR. A 90% CONFIDENCE INTERVAL IS ALSO SHOWN. ALL QUESTIONS ARE LISTED BELOW THE RESULTS.

The Non-inferiority Margin in Diabetes Trials

Trials and Tribulations of Non-Inferiority

Regulatory Hurdles for Drug Approvals

Case Studies in Bayesian Augmented Control Design. Nathan Enas Ji Lin Eli Lilly and Company

Bayesian and Frequentist Approaches

A (Constructive/Provocative) Critique of the ICH E9 Addendum

EXERCISE: HOW TO DO POWER CALCULATIONS IN OPTIMAL DESIGN SOFTWARE

Innovative Approaches for Designing and Analyzing Adaptive Dose-Ranging Trials

COMMITTEE FOR PROPRIETARY MEDICINAL PRODUCTS (CPMP) POINTS TO CONSIDER ON MISSING DATA

(Regulatory) views on Biomarker defined Subgroups

Dynamic borrowing of historical data: Performance and comparison of existing methods based on a case study

Bayesian approaches to analysing studies and summarizing evidences

Guidance for Industry

Challenges in the Design and Analysis of Non-Inferiority Trials: A Case Study. Key Points from FDA Guidance

Biostatistics 3. Developed by Pfizer. March 2018

Effective Implementation of Bayesian Adaptive Randomization in Early Phase Clinical Development. Pantelis Vlachos.

Current issues in non-inferiority trials

Come leggere i risultati di uno studio clinico How to read the results of a clinical trial

Health authorities are asking for PRO assessment in dossiers From rejection to recognition of PRO

A Case Study: Two-sample categorical data

LONDON CANCER NEW DRUGS GROUP RAPID REVIEW. Erlotinib for the third or fourth-line treatment of NSCLC January 2012

Phase II Design. Kim N. Chi NCIC/NCIC-CTG NEW INVESTIGATORS CLINICAL TRIALS COURSE

What is indirect comparison?

Guideline on the evaluation of medicinal products indicated for treatment of bacterial infections (CPMP/EWP/558/95 rev 2)

The Roles of Short Term Endpoints in. Clinical Trial Planning and Design

PrEP & Microbicide Studies

Using Statistical Principles to Implement FDA Guidance on Cardiovascular Risk Assessment for Diabetes Drugs

Pacira v. FDA: Summary of Declaration by Lee Jen Wei, PhD Concluding that the Pivotal Hemorrhoidectomy Study for EXPAREL Demonstrated a Treatment

WATCHMAN PROTECT AF Study Rev. 6

Accelerating Phase II-III Oncology Drug Development Through the Use of Adaptive Designs

Safeguarding public health Subgroup Analyses: Important, Infuriating and Intractable

Non-inferiority trials and switch from non-inferiority to superiority. D Costagliola U 943 INSERM and UPMC Paris 06

Making comparisons. Previous sessions looked at how to describe a single group of subjects However, we are often interested in comparing two groups

How to use prior knowledge and still give new data a chance?

EUROPEAN COMMISSION HEALTH AND FOOD SAFETY DIRECTORATE-GENERAL. PHARMACEUTICAL COMMITTEE 21 October 2015

SAWP BMWP GTWP. Other WPs SWP QWP BWP CHMP BPWP. PhVWP VWP CTWP EWP

Comparison of Futility Monitoring Methods Using RTOG Clinical Trials. Q. Ed Zhang, PhD

Psychology, 2010, 1: doi: /psych Published Online August 2010 (

Bayesian Joint Modelling of Benefit and Risk in Drug Development

Rare Disease Workshop 5: Post Marketing Confirmatory Studies in Rare Diseases

Systematic Reviews and Meta- Analysis in Kidney Transplantation

How to Interpret a Clinical Trial Result

Safeguarding public health CHMP's view on multiplicity; through assessment, advice and guidelines

The Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX

Discrimination and Reclassification in Statistics and Study Design AACC/ASN 30 th Beckman Conference

Full title: A likelihood-based approach to early stopping in single arm phase II cancer clinical trials

Determining the size of a vaccine trial

Practical Statistical Reasoning in Clinical Trials

Benefit - Risk Analysis for Oncology Clinical Trials

Population Enrichment Designs Case Study of a Large Multinational Trial

Benefit/Risk Assessment A Critical Need

Squamous Cell Carcinoma Standard and Novel Targets.

Chapter 12: Introduction to Analysis of Variance

3. Fixed-sample Clinical Trial Design

Lecture 2: Learning and Equilibrium Extensive-Form Games

Protocol F1K-MC-EVDP Efficacy and Safety of Drotrecogin Alfa (Activated) in Adult Patients with Septic Shock PROWESS SHOCK

BACKGROUND + GENERAL COMMENTS

Practical Bayesian Design and Analysis for Drug and Device Clinical Trials

Supplementary Online Content

Performance Outcome Measures: A Regulatory Perspective

Common Statistical Issues in Biomedical Research

Sample Size, Power and Sampling Methods

EMA Workshop on Multiplicity Issues in Clinical Trials 16 November 2012, EMA, London, UK

Innovative Analytic Approaches in the Context of a Global Pediatric IBD Drug Development

ISPOR Task Force Report: ITC & NMA Study Questionnaire

Draft ICH Guidance on Estimands and Sensitivity Analyses: Why and What?

XII Michelangelo Foundation Seminar

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

Approaches to Integrating Evidence From Animal and Human Studies in Chemical Assessments

Application of Bayesian Extrapolation in Pediatric Drug Development Program

Benefit-Risk Assessment via Case Studies: Key Considerations & Best Practices

RAG Rating Indicator Values

Broad and clinically important benefits beyond the initial registrational endpoints are now reported.

Missing data in clinical trials: making the best of what we haven t got.

Possibilities and risks associated with directly moving from Phase I to Phase III: Adaptive designs and Co

Enhancing Regulators Confidence in your Quality Risk Management Work Products

Technology appraisal guidance Published: 28 September 2016 nice.org.uk/guidance/ta411

July 7, Dockets Management Branch (HFA-305) Food and Drug Administration 5630 Fishers Lane, Rm Rockville, MD 20852

Sample Size Considerations. Todd Alonzo, PhD

SOME STATISTICAL CONSIDERATIONS IN THE DESIGN AND ANALYSIS OF EQUIVALENCE STUDIES USING PHARMACODYNAMIC AND CLINICAL ENDPOINTS

Outline of Part III. SISCR 2016, Module 7, Part III. SISCR Module 7 Part III: Comparing Two Risk Models

Transcription:

Non-inferiority: Issues for today and developments for tomorrow Kevin Carroll AstraZeneca

Contents Showing drug effectiveness and requirements for approval Approaches to NI assessment PhRMA CDIG PISC team view A possible mulit-stage approach to AC design and analysis Summary

Contents Showing drug effectiveness and requirements for approval Approaches to NI assessment PhRMA CDIG PISC team view A possible mulit-stage approach to AC design and analysis Summary

Dr R Pazdur, Director, Division of Oncology Drug Products, CDER, FDA For regular approval of a drug, the sponsor must demonstrate that the drug is safe and effective in adequate and well-controlled trials. The effectiveness must be demonstrated on an endpoint that the agency believes to represent clinical benefit, usually survival, disease symptom amelioration or established surrogates for these. The sponsor is not obligated to show that the drug is safer and/or more effective than an approved drug. Oncologic Drugs Advisory Committee, July 27, 2004: Transcript, pg 17: http://www.fda.gov/ohrms/dockets/ac/04/transcripts/2004-4060t1.htm

The statutory requirement for regulatory approval The drug is effective There is a positive benefit:risk New drugs do not have to be shown to have superior efficacy vs existing agents, otherwise only one therapy could ever be approved and available at any given time Approval of a new drug would logically necessitate the currently approved standard to be withdrawn This would result in chaos and a total loss of therapeutic options to the patient and physician

How might we show effectiveness? Directly, via a placebo controlled trial This is, and remains, the gold standard for effectiveness Indirectly via an active-controlled trial When a placebo controlled trial is either unethical or impractical

There really is no such thing as an NI trial There are only active-control (AC) trials with differing objectives When placebo control is either unethical or impractical, effectiveness is established via an AC trial by either Showing drug is better than control and, thus, drug is better than placebo or Showing indirectly, by reference to historical data, that drug is better than placebo

The first objective of an AC NI trial is not NI The true and first regulatory purpose of an activecontrolled NI trial is to establish indirectly that a new therapy would have beaten placebo if a placebo controlled trial could have been conducted. Examination of relative efficacy of new to control by, say, showing a given amount of the control effect, say 25%, 50% or 75%, has been retained might be considered as a further, descriptive and subsidiary objective.

Contents Showing drug effectiveness and requirements for approval Approaches to NI assessment PhRMA CDIG PISC team view A possible mulit-stage approach to AC design and analysis Summary

Two approaches to NI assessment in AC trials Preservation of control effect Superiority to putative placebo not considered sufficient The new drug must, in addition, preserve some fraction of active control s effect The fixed margin The largest loss of effectiveness that could be tolerated clinically for drug relative to control Denoted by δ Must show that the CI for the difference between drug and control does excludes δ

Define: Parameters for an AC trial γ TC = Effect of drug vs control Estimated as B TC (with variance V TC ) in AC trial γ CP = Effect of control vs placebo Estimated as B CP (with variance V CP ) from historical trial(s)) γ TP = Effect of drug vs placebo Indirectly estimated from as B TC (with variance V TC ) from B TC (V TC ) and B CP (V CP )

Preservation of effect For preservation of effect, want to show: γ TC (1-f)*γ CP Where 0 f 1 is the preservation factor Declare NI if: The 95% CI for γ TC - (1-f)*γ CP excludes zero {B TC - (1-f)* B CP } 1.96* (V TC + (1-f) 2 * V CP ) > 0

Fixed margin For fixed margin, not uncommon to: Base δ on lower end of 95% CI for γ CP δ = (1-f) * [B CP -1.96* (V CP )] Declare NI if: The 95% CI for effect of drug vs control excludes the margin, i.e. if B TC - 1.96* (V TC ) > δ {B TC - (1-f)* B CP } 1.96*{ (V TC ) + (1-f)* (V PC )}] > 0

Logical inconsistencies with preservation of effect (1) The requirement for approval is to show effectiveness, γ TP > 0 For preservation of effect, need to show γ TP > f*γ CP Why should this requirement depend on the study design?

Logical inconsistencies with preservation of effect (2) The regulatory bias inherent in applying the preservation criterion leads to logical inconsistencies, as illustrated by the following example: Suppose C and T were both evaluated in placebo-controlled trials, but C was approved first Assume γ TP > γ CP FDA s requirement that T preserve f% of C s benefit over placebo will in many plausible instances lead to rejection of T (even though T may be a better drug than C!)

Logical inconsistencies with preservation of effect (3) The results here indicate that both C and T are superior to placebo, and they suggest that T may be even better than C, but since C was approved first, the preservation of effect criterion logically requires that T cannot be approved. C vs P T vs P 0 f*b CP

Logical issues with the fixed margin Two sponsors compare their new drugs T1 and T2 against some active control C First sponsor specifies NI limit =1.30 Second sponsor specifies NI limit=1.15 Suppose upper 95% CL for T1:C is 1.25 and for T2:C is 1.20. Conclusion: T1 is non-inferior while T2 is not, despite being able to rule out a lesser for disadvantage for T2.

Do We Need a Non-Inferiority Margin? If the goal is demonstration of effectiveness: Need to show γ TP > 0, or γ CP + γ TC > 0 This is estimated by B CP + B TC, with Variance V CP + V TC The Synthesis Method provides and efficient test for non-inferiority: (B CP +B TC )/sqrt(v CP +V TC ) > 1.96 Where Does the Need for a Margin Come From?

One Standard of Evidence The standard of evidence for effectiveness of a new treatment T is: Statistically significant evidence that γ TP > 0 Why should an arbitrarily higher standard of evidence (γ TP > y > 0) be applied when an AC trial has been performed? The preservation margin is necessarily arbitrary, in the sense that there will be values below the margin for which there is no meaningful clinical difference in efficacy from a value above the margin. Preserving less than f% does not imply that T is an ineffective treatment. In contrast, γ TP = 0 has a definite objective clinical meaning. Requiring a higher standard of evidence for AC trials institutes a regulatory bias in favor of the first drug to be approved.

Contents Showing drug effectiveness and requirements for approval Approaches to NI assessment PhRMA CDIG PISC team view A possible mulit-stage approach to AC design and analysis Summary

PhRMA CDIG PISC Team (Non-Inferiority / Active-Controlled Trials) Steven Snapinn (Amgen): Chair Kevin Carroll (Astra Zeneca) Christy Chuang-Stein (Pfizer) Yu-Yun Ho (Novartis) Qi Jiang (Amgen) Gang Li (J&J) Patrick Peterson (Lilly) Yong-Cheng Wang (J&J) Matilde Sanchez (Arenapharm) Rick Sax (Astra Zeneca)

Proposed Approach Basic Principles Required degree of efficacy to support approval should be independent of study design If weaker drug is approvable then the Better drug should be approvable Methodologic weaknesses of NI trials should Be addressed separately Proposal Rigorous demonstration of Any Efficacy (γ TP > 0) Magnitude of treatment effect evaluated through point estimate via the synthesis method Term Non-Inferiority seems inappropriate prefer Active Control Efficacy Trial

Common criticisms We re allowing for new drugs to be ineffective when we do NI trials; we want positive superiority trials A superiority trial with p<0.05 does not rule out a new drug is ineffective We accept a 1 in 40 chance that the drug is ineffective Constancy is the Achilles heel of NI trials But this issue applies to all AC trials regardless of the objective if the control arm under performs, then p<0.05 might not be convincing if it over performs then we increase the risk of a false negative Assay sensitivity Regardless of the objective, a poorly conduct AC trial raises a legitimate concern and PP and ITT analysis would always need to be close Methods in development to examine assay sensitivity and constancy (discounting, Snappin 2008)

Contents Showing drug effectiveness and requirements for approval Approaches to NI assessment PhRMA CDIG PISC team view A possible mulit-stage approach to AC design and analysis Summary

Three stage proposal for design and analysis Stage 1 Design Calculate the number of deaths to establish indirectly new treatment is better than (putative) placebo with 90% power, 2.5% significance. Sample size formulae provided by Simon (1999) or Rothmann (2003)

Proposal for design and for analysis (contd) Stage 2 Analysis (a) Demonstrate absolute efficacy by indirect analysis vs placebo using historical data Stage 3 Analysis (b) Having passed Stage 2, then describe the relative efficacy of new to control on a continuum in terms of to what extent the new drug has retained the efficacy of the control Effect Retention Likelihood

Effect Retention Likelihood Both analysis stages can be simultaneously visualised in an effect retention likelihood plot Trial results can be expressed in a natural hierarchy, starting with the likelihood that the new drug is better than a putative placebo, followed by the likelihood that any given fraction of the control effect has been retained through to the likelihood that the new drug is better than the control

How might this proposal be classified? Consistent with thinking in CHMP guidance Bayesian in nature Historical data are prior, the active control trial is the data, and the effect retention likelihood is the posterior Coincides with Simon, 1999, under uninformative priors for effect of new drug and effect of placebo

A simple example HR[placeo:control]=1.5, p=0.005, 95% CI (1.13, 1.99) An AC trial with new drug vs control is planned To show efficacy with 90% power, 2.5% 1-sided α 800 events, delta 1.26

Drug vs. control effect retention likelihood for trial conducted with 800 events 100% 90% 80% 99.4% 97.8% 90.0% Prob retaining 70% 60% 50% 40% Probability drug better than control Probability Probability drug retains drug retains 50% 75% of control of the effect control effect HR T:C = 1.00 30% 20% 10% 0% 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% % Effect retained

Setting a delta in the protocol Analysis will take place with 800 events. With this amount of information,noting the effect of the control has previously been estimated to be 0.50, p=0.005 and accepting constancy, this trial will be able to show that New is significantly better than placebo with 90% power. In practice, this equates to observing a HR of new to control of 1.09 or less, upper 97.5% CL <1.26.

Pemetrexed 1 example Pemetrexed vs. docetaxel in 2 nd line NSCLC, N=571 HR=0.99 (0.82-1.20), 409 deaths NI limit in protocol = 1.11 78%* docetaxel effect retention Docetaxel previously improved survival cf placebo (BSC) with HR=0.56 95% CI (0.35, 0.88) 1 Nasser Hanna et al, J Clin Oncol 22:1589-1597, 2004. * retention required to give p=0.025 (1-sided) for non-inferiority

Pemetrexed not non-inferior to docetaxel (p=0.11) 1 Cannot say with >97.5% confidence that true HR <1.1 or pemetrexed retained >78% of docetaxel effect The primary objective in the present Alimta trial was not achieved. Neither superiority nor non-inferiority to docetaxel were adequately demonstrated. 2 Dr R Pazdur, Director Office of Oncology Drug Products, FDA 1: 1-sided test that true HR<1.1 2: http://www.fda.gov/ohrms/dockets/ac/04/transcripts/2004-4060t1.htm

Effect retention likelihood for pemetrexed 100% 99.1% 90% 97.6% 89% probability 80% 70% 60% 50% 40% 30% 20% Probability pemetrexed better than BSC HR v placebo<1 Probability pemetrexed retains 50% of docetaxel effect HR v placebo<0.75 89% probability pemetrexed retains 78% of docetaxel effect HR v placebo<0.64 10% 0% Better 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% Better 100% than than BSC docetaxel Fraction of control effect retained

EPAR Scientific Discussion Although non-inferiority was not formally established, the data submitted are robust enough to conclude that a clinically significant inferiority of pemetrexed to docetaxel in terms of efficacy in this population is unlikely

Contents Showing drug effectiveness and requirements for approval Approaches to NI assessment PhRMA CDIG PISC team view A possible mulit-stage approach to AC design and analysis Summary

Summary (1) One standard of evidence (superiority to placebo) should be maintained for the assessment of effectiveness (γ TP > 0); this standard should not be determined by trial design. In situations where a placebo control is not possible necessitating an AC trial, requiring an arbitrary non-zero preservation of the control effect leads to serious logical inconsistencies; the requirement for approval should be convincing demonstration of effectiveness, γ TP > 0. Fixed NI margins are equally arbitary, statistically inefficient and highly conservative and should be abandoned

Summary (2) Analysis should be based on the synthesis method The synthesis approach can test for superiority to placebo, accounting for the variability within both the AC trial and the historical data. Methodologic issues associated with AC trials should be addressed as far as possible in the design, but will inevitably be a key feature in evaluating and interpreting the data A natural extension of the synthesis approach is a retention likelihood analysis which can be used to simultaneously assess the strength of evidence for γ TP > 0 and gauge the relative efficacy between drug and control right through to γ TC > 0

Thank you Any questions?