Statistical issues in HIV trial design. Andrew Hill Senior Visiting Research Fellow Liverpool University

Similar documents
What are the most promising opportunities for dose optimisation?

The next generation of ART regimens

Switching ARV Regimens: Managing Toxicity and Improving Tolerability; Switches & Class-Sparing Approaches

ART and Prevention: What do we know?

HIV Treatment Update. Anton Pozniak Consultant Physician, Director of HIV Services Chelsea and Westminster Hospital, London

HIV Treatment: New and Veteran Drugs Classes

SA HIV Clinicians Society Adult ART guidelines

Pharmacological considerations on the use of ARVs in pregnancy

Comprehensive Guideline Summary

Selected Issues in HIV Clinical Trials

Pediatric HIV Infection and the Medical Management of Pregnant Women infected with HIV. Ernesto Parra, M.D., M.P.H.

HIV Clinical Management: Antiretroviral Therapy and Drug Resistance

Professor Jeffery Lennox

Selected Issues in HIV Clinical Trials

Criteria for Oral PrEP

HIV Treatment: State of the Art 2013

Are the current doses of ARV correct. Richard Elion MD Associate Adjunct Clinical Professor of Medicine Johns Hopkins School of Medicine

ART Treatment. ART Treatment

Treatment strategies for the developing world

HIV - Therapy Principles

Optimizing 2 nd and 3 rd Line Antiretroviral Therapy in Children and Adolescents

HIV Treatment Update. Awewura Kwara, MD, MPH&TM Associate Professor of Medicine and Infectious Diseases Brown University

Management of patients with antiretroviral treatment failure: guidelines comparison

Clinical support for reduced drug regimens. David A Cooper The University of New South Wales Sydney, Australia

What's new in the WHO ART guidelines How did markets react?

BHIVA antiretroviral treatment guidelines 2015

Josep Mallolas Hospital Clínic Barcelona

Antiretroviral Treatment Strategies: Clinical Case Presentation

Second-Line Therapy NORTHWEST AIDS EDUCATION AND TRAINING CENTER

BHIVA guidelines on the treatment of HIV-1-positive adults with antiretroviral therapy. START & other changes

Management of ART Failure. EACS Advanced HIV Course 2015 Dr Nicky Mackie

Virological suppression and PIs. Diego Ripamonti Malattie Infettive - Bergamo

Susan L. Koletar, MD

Second and third line paediatric ART strategies

Antiretroviral Therapy: Current Recommendations, New Drugs, and Novel Strategies. Joel Gallant, MD, MPH Johns Hopkins University School of Medicine

First line ART Rilpirivine A New NNRTI. Chris Jack Physician, Durdoc Centre ethekwini

Dr Carole Wallis, PhD Medical Director, BARC-SA Head of the Specialty Molecular Division, Lancet Laboratories, South Africa

The use of antiretroviral agents during pregnancy in Canada and compliance with North-American guidelines

Cases from the Clinic(ians): Case-Based Panel Discussion

12th European AIDS Conference / EACS ARV Therapies and Therapeutic Strategies A CME Newsletter

STRIBILD (aka. The Quad Pill)

Impact of baseline HIV-1 RNA levels on initial highly active antiretroviral therapy outcome: a meta-analysis of 12,370 patients in 21 clinical trials*

Disclosures. Update on HIV Drug Therapy: A Case based Discussion. Case # 1: Dr. Grant has received grant support from BMS, Gilead, Janssen, and Viiv

Continuing Education for Pharmacy Technicians

Update on HIV Drug Resistance. Daniel R. Kuritzkes, MD Division of Infectious Diseases Brigham and Women s Hospital Harvard Medical School

DTG Versus LPV/r in Second Line (DAWNING): Outcomes by WHO- Recommended NRTI Backbone

HIV Update Allegra CPD Day Program Port Elizabeth Dr L E Nojoko

Reduced Drug Regimens

Pretreatment drug resistance and new treatment paradigms in firstline

Selecting an Initial Antiretroviral Therapy (ART) Regimen

Clinical skills building - HIV drug resistance

Susan L. Koletar, MD

Crafting an ART Regimen for Initiation or Salvage: Are NRTI s Necessary?

More Options, Some Opinions Initial Therapies for HIV Judith S. Currier, MD

SINGLE. Efficacy and safety of dolutegravir (DTG) in treatment-naïve subjects

CNS Toxicity of Integrase Inhibitors Myth or Reality? Ignacio Pérez Valero Unidad de VIH. Hospital U. La Paz

Didactic Series. CROI 2014 Update. March 27, 2014

ARVs in Development: Where do they fit?

Bon Usage des Antirétroviraux dans l Infection par le VIH

SELECTING THE BEST ART FOR EACH PATIENT

Efavirenz vs dolutegravir for 1st line ART: Is it time to change? The argument AGAINST. Graeme Meintjes University of Cape Town

Resistance Analyses of Integrase Strand Transfer Inhibitors within Phase 3 Clinical Trials of Treatment-Naive Patients

The results of the ARTEN study. Vicente Soriano Hospital Carlos III, Madrid, Spain

Somnuek Sungkanuparph, M.D.

2016 Perinatal Treatment Guidelines Update

Resistance Workshop. 3rd European HIV Drug

Professor José Arribas

BHIVA Clinical Audit. Management of patients who switch therapy; re-audit of patients starting therapy from naïve

The Use of Integrase Inhibitors In Latin America: From Guidelines to the Real World Ernesto Martínez B., MD Internal Medicine, Infectious Diseases

Rajesh T. Gandhi, M.D.

Does Resistance Still Matter? Daniel R. Kuritzkes, M.D. Division of Infectious Diseases Brigham and Women s Hospital Harvard Medical School

Case # 1. Case #1 (cont d)

Introduction to HIV Drug Resistance. Kevin L. Ard, MD, MPH Massachusetts General Hospital Harvard Medical School

Antiretroviral Therapy: What to Start

NNRTI Resistance NORTHWEST AIDS EDUCATION AND TRAINING CENTER

Dr Alan Winston. Imperial College Healthcare NHS Trust London. 7-8 October 2010, Queen Elizabeth II Conference Centre, London.

HIV and the Central Nervous System Impact of Drug Distribution Scott L. Letendre, MD. Professor of Medicine University of California, San Diego

Terapia del paciente naive con un régimen de Inhibidor de la proteasa Dr. Jose R Arribas IX Curso de avances en Infección VIH y hepatitis virales

TDF containing ART: Efficacy and Safety. Dr Lloyd B. Mulenga Adult Infectious Diseases Centre University Teaching Hospital Lusaka, Zambia

Purpose Methods Demographics of patients in the study Outcome. Efficacy Adverse Event. Limitation

2-Drug regimens in HIV Anton Pozniak MD FRCP

Simplifying HIV Treatment Now and in the Future

Real Life Experience of Dolutegravir and Lamivudine Dual Therapy As a Switching Regimen in HIVTR Cohort

The Future of HIV: Advances in Drugs and Research. Shauna Gunaratne December 17, 2018

Pediatric Antiretroviral Resistance Challenges

This graph displays the natural history of the HIV disease. During acute infection there is high levels of HIV RNA in plasma, and CD4 s counts

WHO mission on ART optimization in Belarus March 30-31, April 1, 2016

Changes in cellular HIV DNA levels during the MONET trial: 144 Weeks of darunavir/ritonavir monotherapy versus DRV/r + 2NRTIs

ACTHIV 2018: A State-of-the-Science Conference for Frontline Health Professionals

ART: The New, The Old and The Ugly

Update on CADO/PADO: what are the challenges in using the current guidelines and foreseen ARV revisions: opportunities and challenges

HIV in in Women Women

Integrase Inhibitors in the Treatment HIV-Infection. Andrew Zolopa, MD Stanford University

Actualización y Futuro en VIH

Prima linea: dovremmo evitare i PI nella terapia di prima linea per i loro effetti non desiderati? Giuseppina Liuzzi

MDR HIV and Total Therapeutic Failure. Douglas G. Fish, MD Albany Medical College Albany, New York Cali, Colombia March 30, 2007

Drug toxicities: Safest PIs. Michelle Moorhouse 14 Apr 2016

Management of Treatment-Experienced Patients: New Agents and Rescue Strategies. Joel E. Gallant, MD, MPH Johns Hopkins University School of Medicine

Case 1 continued. Case 1 (cont) 12/8/16. MMAH Debate Panel Thursday, December 8, Case 1

First-Line Antiretroviral Therapy for Treatment and Prevention:

Transcription:

Statistical issues in HIV trial design Andrew Hill Senior Visiting Research Fellow Liverpool University

Type 1 error It is statistically significant! (after 100 different statistical tests)

Type 1 error: significant p value after many tests When you throw two dice, the chance of getting a double six: p=0.028

Type 1 error: atenolol Randomised trial was submitted to the Lancet Reviewers asked for several subgroup analyses Authors did not agree and so they tested for treatment effect by zodiac sign Patients with zodiac sign of Sagittarius showed a strong benefit, while others did not have a significant benefit

Type 1 error in HIV trials? 50 100 statistical tests on safety endpoints Enfuvirtide: bacterial pneumonia? Interim and final analyses: Raltegravir significant benefit over EFV in STARTMRK only seen after 4 analyses Darunavir significant benefit over LPV/r In ARTEMIS only seen after 4 analyses MONET trial non-inferiority seen at Week 48, then not seen in later analyses Are these real differences, or Type 1 errors?

Type 1 error: checks Double six: p=0.028 A clinical trial only has the power to test the primary endpoint Everything else is secondary and should be validated in another trial Count how many tests and p values shown in the paper? Be careful in the safety sections How significant was the result: p<0.05 (2 sixes), p<0.01 (3 sixes), p<0.001 (4 sixes)? You can correct for multiple tests, but p value drops: p<0.005 for 10 tests, p<0.001 for 50 tests

Clinical significance? The difference is statistically significant, but is it clinically significant?

LDL cholesterol (mg/dl) Change in LDL to Week 48 TAF versus TDF Study 104 (n=867) Study 111 (n=866) 16 14 12 10 8 6 4 2 0 Will there be more myocardial infarctions on TAF than TDF? Data from metaanalysis needed TAF/FTC + ELV/c TDF/FTC + ELV/c TAF/FTC + ELV/c TAF/FTC + ELV/c

egfr (ml/min) Gilead press release, September 2014 Change in egfr to Week 48 TAF versus TDF Study 104 (n=867) Study 111 (n=866) 0-2 -4-6 -8-10 -12-6,8-3.6-10,4-5,7-6.2-11,9 Will there be more renal failure on TAF than TDF? Data from metaanalysis needed -14 TAF/FTC + ELV/c TDF/FTC + ELV/c TAF/FTC + ELV/c TAF/FTC + ELV/c

Type 2 error There is no difference between the arms But the study was too small to show a difference

Sample size calculations 1. You can look at cohort studies to get estimates of endpoints, to use in clinical trial design. 2. For example, how many people would need to be enrolled in a clinical trial of an HIV vaccine to lower transmission rates by 70%? 3. Assume 80% power, 5% significance level, one primary analysis

Cohort studies and HIV vaccine trials Country Population HIV transmission (% per year) USA STD clinic / Heterosexuals 0.5% Canada Men having Sex with Men 0.6% Brazil IV drug users 0.8% Tanzania Factory workers 1.5% UK Men having Sex with Men 1.5% USA IV drug users 2.0% Australia Men having Sex with Men 2.1% Kenya Commercial Sex Workers 3.5% Congo Discordant couples 4.0% Zimbabwe New mothers 4.8% China IV Drug Users 5.0% Thailand IV Drug Users 5.8% Cote D Ivoire Commercial Sex Workers 6.0% Uganda Rural adults 6.5% USA STD clinics / MSM 7.1% Thailand IV Drug Users 17% South Africa Women / Antenatal clinics 17% Tanzania Police Officers 20%

Sample sizes for HIV vaccine trials Sample size for an HIV vaccine trial (2 arms, 70% reduction in transmission) 9000 8000 7000 6000 5000 4000 The sample size for this trial could range between 200-8200 patients, depending on the cohort study used to estimate HIV transmission rates. Make sure to assess as many cohorts as possible 3000 2000 1000 0 0 2 4 6 8 10 12 14 16 18 20 HIV Transmission rate per year (%)

Non-inferiority trials: key points Non-inferiority trials are powered to show that the new treatment is NO WORSE than the control The trial is powered with a delta estimate, so we can conclude that the new treatment is no worse than control by this delta (10 12% for HIV trials) The control treatment needs to be the recognised standard of care, and show a typical level of efficacy versus reference trials Hill A, Sabin S. AIDS 2008;22:913 21.

Non-inferiority trials: key terms 95% confidence intervals -12% Difference between treatment arms 0 Point estimate Point of no difference between treatments Delta, or non-inferiority margin Control arm better New drug better Hill A, Sabin S. AIDS 2008;22:913 21.

Non-inferiority trials: outcomes -10% or -12% Difference between treatment arms 0 New drug is: Inferior Significantly worse, but non-inferior Non-inferiority not demonstrated Non-inferior Superior Control arm better New drug better Hill A, Sabin S. AIDS 2008;22:913 21.

HIV trials where non-inferiority was not shown HIV RNA <50 copies/ml at Week 48 difference between treatment arms Trial: new drug and control -12% -10% 0-8% -16.8% 1.4% BICOMBO: ABC/3TC vs TDF/FTC -6% -12.8% 0.9% 2NN: NVP vs EFV -11.2% -5% 1.2%* MERIT: MVC vs EFV -10.4% -4% 1.5% Gilead 903: TDF/3TC vs d4t/3tc Control arm better New drug better * Assumes symmetrical confidence intervals

Underpowered studies DTG and TB 2NRTI + DTG n=60 Treatment naïve TB co-infected 2NRTI + EFV n=60 Study is too small to establish non-inferiority for DTG vs EFV in TB co-infection Strong interaction between DTG and rifampicin does double dosing cover this? How do we use DTG in Africa without clinical validation of efficacy? www.clinicaltrials.gov

Type 2 error: checks Not significantly different does not mean the same A new treatment needs to be NON-INFERIOR to the standard of care to be acceptable: look at the 95% confidence intervals of the difference between the arms Nevirapine has never shown true non-inferiority versus efavirenz in naïve patients Abacavir has never shown true non-inferiority versus tenofovir in treatment naïve or experienced patients

Intent to Treat analyses: a reality check Intent to Treat analysis (FDA TLOVR): includes all patients randomised, even if not compliant with the protocol Per Protocol analysis: patients with major protocol violations are excluded (not well standardised) As Treated analysis: only virological endpoints are included Results need to be consistent across these analyses

Per Protocol analysis why do we do it? Example 1 a patient was randomised to Arm A, but wanted to take the drugs in Arm B. He took the Arm B drugs for 3 years and had HIV RNA <50 copies/ml at the end of the trial. Example 2 a drug addict entered a trial and took cocaine for 2 weeks, then HIV drugs for 2 weeks, for three months. The patient showed virological failure. Example 3 a patient was randomised but then died in a car accident before taking the first dose of medication

2NN trial: EFV is significantly better than NVP when only the treated patients are included 100% 90% ITT - All ITT - Exposed 80% 70% 62.3% 56.3% 65.4% 57.7% 60% 50% 40% 30% 20% 10% p=0.091 p=0.029 0% EFV (n=400) NVP (n=387) EFV (n=381) NVP (n=378) ITT exposed = Per Protocol analysis: only includes patients who actually took at least one dose of study medication Van Leth et al, Lancet 2004.

Endpoints in HIV clinical trials a mixture versus Virological failure Discontinuation for adverse events or other reasons

The FDA Snapshot endpoint Success or failure depends only on the HIV RNA level at the time of the endpoint (e.g. Week 48) If HIV RNA is below 50 copies/ml at this time, the patient is a success If HIV RNA is at or above 50 copies/ml FOR ANY REASON, the patient is a failure Advantages Simple endpoint to work with Can be easily analysed across trials Disadvantages HIV RNA assays are variable, HIV RNA can be 50-200 copies/ml by chance Patient could have HIV RNA <50 before and after Week 48, but could still be classified as failure It is more accurate to use two or more HIV RNA measures

What is virological failure? In regulatory trials, only 25 30% of endpoints are virological Discontinuation for adverse events or other reasons can also be classified as virological failure True virological failure and discontinuations may not be balanced across the arms

Summary efficacy data from registrational trials (data source: FDA drug labels) Percentage of patients with virological failure by FDA TLOVR endpoint, in three categories. HIV RNA endpoint of 400 copies/ml at Week 48. Trial Arm N % of patients with VF d/c AE d/c other DMP-006 ZDV/3TC/EFV 422 6 7 17 FTC301A FTC/ddI/EFV 286 3 7 9 FTC301A d4t/ddi/efv 285 11 14 8 EPV2001 ZDV/3TC/EFV 278 8 7 18 EPV2001 ZDV/3TC/EFV 276 8 12 14 Gilead 903 TDF/3TC/EFV 299 6 7 8 Gilead 903 d4t/3tc/efv 301 4 7 7 CNA30021 ABC/3TC/EFV 384 5 13 11 CNA30021 ABC/3TC/EFV 386 5 11 3 APV 3002 ABC/3TC/fAPV 322 6 10 15 Abott 863 d4t/3tc/lpv/r 326 9 6 10 Gilead 934 TDF/FTC/EFV 244 2 4 10 Gilead 934 ZDV/3TC/EFV 243 4 9 14 KLEAN ABC/3TC/fAPV/r 434 6 5 16 KLEAN ABC/3TC/LPV/r 444 7 5 17 VF = true virological failure; d/c AE = discontinuation for adverse events; d/c other = discontinuation for other reasons

CASTLE trial: 96-week efficacy analysis ATV/r is superior by ITT, not by On Treatment analysis ITT analysis On Treatment analysis 100 90 80 70 +6.1%, 95% CI: +0.3, +12.0% +1.6%, 95% CI: -3.1%, +6.2% 89% 88% 74% 68% HIV RNA <50 (%) 60 50 40 30 20 10 Table EFF 4-5 0 ATV/r LPV/r ATV/r LPV/r N=440 N=443 N=440 N=443 Molina et al. ICAAC 2008; Abstract H1250-d.

SINGLE: FDA Snapshot, Week 144 HIV RNA <50 copies/ml at Week 96, n (%) Categories of response and failure ABC/3TC/DTG N=414 TDF/FTC/EFV N=419 HIV RNA <50 copies/ml 72% 63% Virological failures 10% 7% Discontinuation of treatment 18% 30% Drug resistance 0% 1.4% More virological failures in the DTG arm No significant difference in drug resistance between the arms

Patients (%) MERIT: percentage of patients with undetectable HIV-1 RNA at Week 48 (primary endpoint) 100 90 80 70 60 50 40 30 20 10 0 N= EFV + CBV MERIT Study 48 weeks 73.1 36 1 MVC + CBV 70.6 69.3 36 0 EFV + CBV <400 copies/ml <50 copies/ml -3.0* (-9.5 ) -4.2* (-10.9 ) *Difference (adjusted for randomisation strata) Lower bound of 1-sided 97.5% confidence interval; non-inferiority margin = -10% Per-protocol analysis: <400 copies/ml difference = -4.1 (-10.5 ), <50 copies/ml difference = -4.4 (-11.2 ) 36 1 65.3 36 0 MVC + CBV Intent-to-treat (ITT) analysis

MERIT trial: summary of discontinuations through 48 weeks Includes all patients who received at least one dose of study medication Reason for discontinuation EFV + CBV N=361 MVC + CBV N=360 All, n (%) 91 (25.2) 97 (26.9) Adverse event, n (%) 49 (13.6) 15 (4.2) Lack of efficacy, n (%) 15 (4.2) 43 (11.9) Other reason, n (%) 9 (2.5) 14 (3.9) Withdrew consent or lost to follow-up, n (%) 18 (5.0) 25 (6.9) MERIT Study 48 weeks

What are the consequences of virological failure? Patients could fail with high level drug resistance, or no resistance Patients could discontinue the randomised treatment with HIV RNA <50, and switch straight onto an alternative: not a failure in ACTG trials A patient could decide to go on holiday, stop taking ARV s, then come back and be re-suppressed on original Rx Patients could have a serious drug toxicity (e.g. Stevens-Johnsons syndrome) Why do we count all these endpoints equally???

Percentage of genotyped first-line HAART failures with NNRTI, M184V, TAMS and PI resistance 100% 90% 91.1% 80% 70% 69.0% 60% 57.3% 50% 40% 37.2% 30% 20% 10% 0% 0.3% Major NNRTI 16.5% M184V/I 0.2% 1.0% Any TAMS Major PI Major NNRTI M184V/I 1.0% 0.0% Any TAMS Major PI Major NNRTI M184V/I 11.2% Any TAMS 0.0% Major PI PI Studies NNRTI Studies Developing World NNRTI Studies Gupta et al. BHIVA 2008.

TAF Phase 3 trials: FDA Snapshot, Wk 96 HIV RNA <50 copies/ml at Week 96, n (%) Categories of response and failure TAF/FTC/ELV N=866 TDF/FTC/ELV N=867 HIV RNA <50 copies/ml 87% 85% Virological failures 5% 5% Grade 3 or 4 clinical adverse events 12% 12% Grade 3 or 4 lab abnormalities 28% 25% No difference in overall efficacy or safety between TAF and TDF German IQWIG rejects application for TAF Value versus generic TDF?

Are these patients virological failures? FDA Snapshot algorithm: failure is simply having HIV RNA >50 copies/ml at Week 48, or being off study drug TLOVR algorithm: failure is stopping randomised treatment, rebound of HIV RNA >50 copies/ml at any time, or never suppressed <50 copies/ml ACTG trials: failure is rebound of HIV RNA >200 copies/ml at any time, or never suppressed. Switches in treatment allowed

HIV RNA copies/ml SENSE: HIV RNA versus time, patient 3 Treatment arm: efavirenz Patient did not take randomised treatment daily, between Day 1 and Week 6 1,000,000 100,000 10,000 1,000 100 10 Listing EFF: 1 25/02/11 412,000 3,440 131,000 26,900 33,400 Phenotype: 3TC resistant EFV resistant NVP resistant ETR sensitive Time - Weeks 51,800 M184I/M/V (NRTI) L74I/L (NRTI) V90I (NNRTI) K103N (NNRTI) Patient switched to TDF/ZDV/ATV/r 26,100 0 10 20 30 40 50 60 732 61

HIV RNA (copies/ml) Patient in MONET trial Treatment arm: DRV/r Patient interrupted treatment for 1 month no resistance ARV treatment started: 1999 Prior antiretrovirals taken: ABC, ddi, 3TC, ZDV, EFV, NFV, LPV/r Nadir CD4 count: 191 Hepatitis C: Antibody Positive HCV RNA PCR: <50 NRTI intensification: No 7 14 Status: Completed trial Time in MONET trial (weeks) Week 4 Week 48 Week 96 Week 144 DRV PK 9110 65 3310 2380 RTV PK 257 19 120 392 Adherence >95% 95% >95% >95% Adverse events / Investigator comments: Dr Pulido, Spain Poor adherence: Patient interrrupted treatment by her own decision, for a month. Listing EFF 1, PK 1 08/03/11 Red graph numbers = Optical density readings for HIV RNA samples <50 copies/ml

HIV RNA copies/ml SENSE: HIV RNA versus time, patient 3 (etravirine arm) 1,000,000 100,000 359,000 78,800 FDA Snapshot = success TLOVR = failure ACTG = success 10,000 1,000 245 Genotype: No mutations Phenotype: ETR sensitive 100 104 100 <50 <50 <50 <50 10 Listing EFF: 1 25/02/11 0 10 20 30 40 50 60 Time - Weeks

Endpoints: checks If the treatment arms had equivalent efficacy, was this also true looking only at the virological endpoints? If there was a difference between the arms, what was the reason for this? One treatment could be more effective (HIV RNA suppression) One treatment could be better tolerated People might stop treatment in one arm for other reasons ACTG trials analysis: uses only virological endpoints: discontinuation for adverse events or other reasons are not endpoints. This raises efficacy rates for ACTG trials versus TLOVR algorithm MRC PIVOT Trial: should resistance at failure be the new endpoint?

Cohort studies Why do we believe the results, when they are not randomised?

Real-life evaluation of treatments There could be patients who are included in the final approved drug label, but who were excluded from the Phase 3 trials (e.g. CNS abnormalities on dolutegravir). There may be unresolved safety issues which can only be answered by long-term follow up. For example, does treating HIV+ pregnant women with antietrovirals lead to toxicity in their children? Sometimes you cannot randomise

Real-life evaluation of treatments Clinical trials often have strict inclusion criteria so will not represent all patients who would be treated with a drug. Screen failure rates in Janssen HIV clinical trials (naïve) Trial Screened Screen Failures (n, %) THRIVE (RPV) 947 267 (26%) ECHO (RPV) 948 254 (27%) ARTEMIS (DRV/r) 843 154 (18%) SENSE (ETR) 193 36 (19%)

Detecting rare adverse events in cohorts Rare toxicities are often only seen after a drug has been approved large numbers needed to detect rare but serious events Total Sample size 25000 20000 15000 10000 5000 Number of patients required to detect a 100% rise in risk Attempted suicide, efavirenz (0.3%) Drug-related hepatitis, darunavir (0.5%) Severe GI bleeding, aspirin (1.8%) 0 0 1 2 3 4 5 Background risk of serious events (%)

Sources of bias in cohort studies Correlation to causation false associations (no control group) Channelling bias patients with certain risk factors are put onto particular treatments, creating a false association between the risk factor and the treatment Confounding factors factors are not corrected for in multivariate analyses Lead-time bias an intervention changes the time when a disease is first detected Behavioural disinhibition the behaviour of people can counteract the potential benefits of a treatment Regression to the mean people who are ill at baseline can get better with no intervention at all

Vitamin A antioxidant to prevent cancer?

Vitamin A: is it good for you? In cohort studies, Vitamin A supplements were associated with improved survival In randomised trials, Vitamin A supplements were associated with worse survival Taking Vitamin A could be a marker of health awareness It is very hard to eliminate bias from analyses of cohort studies

Tenofovir and survival in HIV: Africa Randomised studies have shown improved tolerability for tenofovir versus zidovudine. Two studies have evaluated the effects of tenofovir versus ZDV or d4t on survival in African patients in large access programmes: Study 1: 18,866 patients in PEPFAR, Zambia. Tenofovir associated with a 43% increase in mortality, compared with zidovudine. Chi et al 2011 (JAIDS, 58: 475-480) Study 2: 6,196 patients in Johannesbourg, South Africa Tenofovir associated with a 29% decrease in mortality, compared with zidovudine. Velen et al, PLoS One 2013, 8(5)

Cohort studies versus randomised trials We can never eliminate the chance of bias from cohort studies Randomised trials should remain the gold standard for making decisions When looking at results from a cohort study: Have the same results been seen in other cohorts? Is there another explanation for the results (e.g. channelling bias)? Remember the experience from cohorts in other disease areas

Absolute risk versus relative risk Absolute risk is for making decisions. Relative risk should be strictly for research A rise in risk from 1/1000 to 2/1000 is a doubling in relative risk Relative risk can be alarming for patients! e.g. abacavir is associated with a 50% rise in the risk of heart attacks The number one cause of death for people with HIV in the UK is late diagnosis Relative risk is often misused and misinterpreted Much better to express results for a population of 100 patients: what percentage would develop safety issues or resistance on a new treatment?

Treatment-emergent NRTI / NNRTI resistance at failure in randomised trials of efavirenz (ITT) Trial (n) Treatment M184 I/V Major NNRTI mutations mutations Gilead 903 TDF/3TC/EFV (n=299) 12 (4.0%) 16 (5.4%) Gilead 934 TDF/FTC/EFV (n=255) 2 (0.8%) 9 (3.5%) CNAA3024 ABC/3TC/EFV (n=324) 2 (0.6%) 4 (1.2%) CNAA3021 ABC/3TC/EFV (n=770) 15 (1.9%) 14 (1.8%) Gilead 934 ZDV/3TC/EFV (n=254) 7 (2.8%) 16 (6.3%) CNAA3024 ZDV/3TC/EFV (n=325) 4 (1.2%) 4 (1.2%) EPV20001 ZDV/3TC/EFV (n=554) 14 (2.5%) 19 (3.4%) Abbott M03-613 ZDV/3TC/EFV (n=51) 1 (2.0%) 1 (2.0%) Gilead 903 d4t/3tc/efv (n=301) 8 (2.7%) 12 (4.0%) 2NN ZDV/3TC/EFV (n=400) 10 (2.5%) n.d. ECHO/THRIVE TDF/FTC/EFV (n=682) 9 (1.3%) 15 (2.2%) Total 1.6% 2.4% ECHO/THRIVE: Cohen et al. WAC, Vienna, 2010. Meta-analysis: Gupta el al. CID 2008;47:712 22.

Publication bias There are many examples of HIV clinical trials which were never published; often these had unfavourable outcomes Some of these were presented at conferences but never written up They can be very hard to find by MEDLINE searches Are you only seeing part of the story about a treatment? Look for trials in www.clinicaltrials.gov, or the European Summary of Product Characteristics (SMPC) Systematic reviews often include all available results, including conference presentations

Percent HIV RNA <400 DMP-005 trial: ZDV/3TC + EFV 200, 400, 600 mg o.d. Presented, not published 100 80 60 40 20 Efavirenz 200 mg + ZDV/3TC Efavirenz 400 mg + ZDV/3TC Efavirenz 600 mg + ZDV/3TC 0 0 2 4 6 8 10 12 14 16 Weeks in study EFV 200 mg N= 32 34 34 30 29 32 31 EFV 400 mg N= 31 31 33 28 30 28 28 EFV 600 mg N= 32 29 32 28 30 27 28 Haas et al. 5th CROI 1998; Abstract 698.

Levels of evidence to support treatment guidelines System of grading evidence to support types of treatment and care Used by WHO, IAS and other groups in their guidelines documents Example A1: supported by large prospective randomised trials B and C: cohort studies down to expert opinion

Virological suppression comparing 3TC and FTC-including regimens Ford N, Shubber Z, Hill A, Vitoria M, et al. (2013) Comparative Efficacy of Lamivudine and Emtricitabine: A Systematic Review and Meta-Analysis of Randomized Trials. PLoS ONE 8(11): e79981. doi:10.1371/journal.pone.0079981 http://www.plosone.org/article/info:doi/10.1371/journal.pone.0079981

4 randomised trials of PI/r + NRTI versus PI/r + 2NRTIs HIV RNA <50 copies/ml (switch = failure endpoint) Overall, in 4 randomised trials of 1090 patients, PI/r + 3TC showed HIV RNA suppression rates 4% higher than PI/r + 2NRTIs This difference was within the limits for non-inferiority (lower 95% confidence interval -1% There was no evidence for heterogeneity between the trials (p=0.10). Favours 3-drug treatment Favours 2-drug treatment

Conclusions 7 key questions to ask: 1. What was the primary endpoint of the study? What was this result? 2. Was the study randomized? 3. Was the study properly powered to show a difference? 4. What was the difference in true virological endpoints between the arms? 5. Is the difference between the treatments clinically meaningful? 6. Are you seeing all of the evidence on a treatment? 7. Do you believe results from a cohort study, if randomized trials say something else?