MEASURING THE UNDIAGNOSED FRACTION:

Similar documents
Estimate of the Number of Persons Living with HIV in Massachusetts

BRINGING MODELS DOWN TO EARTH

Strengthening Linkages: why & how to connect our dots

Case Studies in Bayesian Augmented Control Design. Nathan Enas Ji Lin Eli Lilly and Company

Regional Collaboration to Optimize the Cascade: The Northwestern Public Health CFAR Consortium

Estimates of New HIV Infections in the United States

2018 HIV/AIDS Epidemiology Update 2017 Data. James Dowling Health Program Coordinator Division of Public Health

Appendix NEW METHOD FOR ESTIMATING HIV INCIDENCE AND TIME FROM INFECTION TO DIAGNOSIS USING HIV SURVEILLANCE DATA: RESULTS FOR FRANCE

What is a probability? Two schools in statistics: frequentists and Bayesians.

Estimates of New HIV Infections in the United States

2017 HIV/AIDS Epidemiology Update 2016 Data. James Dowling Health Program Coordinator Division of Public Health

Spatial and Temporal Data Fusion for Biosurveillance

UNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER

Estimating HIV incidence in the United States from HIV/AIDS surveillance data and biomarker HIV test results

Improving Patient Engagement in HIV Care: Health Department Strategies

U.S. Counties Vulnerability to Rapid Dissemination of HIV/HCV Infections Among People Who Inject Drugs

BAYESIAN HYPOTHESIS TESTING WITH SPSS AMOS

Ordinal Data Modeling

Introduction. Patrick Breheny. January 10. The meaning of probability The Bayesian approach Preview of MCMC methods

HIV & AIDS Cases in Alameda County

An Introduction to Multiple Imputation for Missing Items in Complex Surveys

Second generation HIV surveillance: Better data for decision making

Bayesian Inference Bayes Laplace

A Brief Introduction to Bayesian Statistics

Supplementary notes for lecture 8: Computational modeling of cognitive development

Mathematical-Statistical Modeling to Inform the Design of HIV Treatment Strategies and Clinical Trials

Bayesian Statistics Estimation of a Single Mean and Variance MCMC Diagnostics and Missing Data

SCOPE OF HIV/AIDS IN MINNESOTA

Estimated HIV Incidence in the United States,

County of Orange Health Care Agency, Public Health Services HIV/AIDS Surveillance and Monitoring Program

MS&E 226: Small Data

Estimating and comparing cancer progression risks under varying surveillance protocols: moving beyond the Tower of Babel

Defining and Measuring Recent infection

BALTIMORE COUNTY HIV/AIDS EPIDEMIOLOGICAL PROFILE Fourth Quarter Data reported throuh December 31, 2007

The University of Texas MD Anderson Cancer Center Division of Quantitative Sciences Department of Biostatistics. CRM Suite. User s Guide Version 1.0.

HIV Incidence Report, Sexually Transmitted Diseases, HIV and Tuberculosis Section, Epidemiology and Surveillance Unit

(2) In each graph above, calculate the velocity in feet per second that is represented.

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

Data inputs for HIV models and how they inform HIV prevention interventions Dobromir Dimitrov

Strategic Planning for HIV/AIDS

Understanding HIV/AIDS: The current state of the HIV epidemic in British Columbia Mark Gilbert, MD, MHSc, FRCPC

BREAST CANCER EPIDEMIOLOGY MODEL:

Mathematical-Statistical Modeling to Inform the Design of HIV Treatment Strategies and Clinical Trials

Improving efficiency in health. Hepatitis C virus session

Design of a Community Randomized HIV Prevention Trial in Botswana

Supplement for: CD4 cell dynamics in untreated HIV-1 infection: overall rates, and effects of age, viral load, gender and calendar time.

THE GOOD, THE BAD, & THE UGLY: WHAT WE KNOW TODAY ABOUT LCA WITH DISTAL OUTCOMES. Bethany C. Bray, Ph.D.

PSI RESEARCH & METRICS TOOLKIT

Sexual Health, HIV, and STDs

Estimation of delay to diagnosis and incidence in HIV using indirect evidence of infection dates

Bayesian and Frequentist Approaches

Epidemiology: Overview of Key Concepts and Study Design. Polly Marchbanks

Washington State Getting to the End of AIDS

The Epidemic Model 1. Problem 1a: The Basic Model

Bayesian Hierarchical Models for Fitting Dose-Response Relationships

Sample Size Considerations. Todd Alonzo, PhD

Women In Science Scholars Annual Meeting Highlights October 3, 2016

PROTECT YOURSELF + PROTECT YOUR PARTNER. trichomoniasis THE FACTS

Case-control studies. Hans Wolff. Service d épidémiologie clinique Département de médecine communautaire. WHO- Postgraduate course 2007 CC studies

In 2015, blacks comprised 12% of the US population, but accounted for 45% of those infected with HIV. Whites were 62% of the population, but

Ending the HIV epidemic in New York City: Innovations and Progress

Methodologic issues relating to the use of STARHS to determine HIV incidence: The Ontario experience

Mountain West AIDS Education and Training Center

11/8/2016. The Challenge of HIV Treatment

Using a validated computer simulation to assess HIV prevention efforts in Connecticut

Behind the Cascade: Analyzing Spatial Patterns Along the HIV Care Continuum

Bayesian additive decision trees of biomarker by treatment interactions for predictive biomarkers detection and subgroup identification

HIV/AIDS Epidemiology in Alameda County: State of the County Report

Missouri Statewide Integrated HIV Prevention and Care Plan, including the Statewide Coordinated Statement of Need, CY

10.1 Estimating with Confidence. Chapter 10 Introduction to Inference

Linkage of hepatitis and HIV surveillance systems to improve completeness of injection drug use risk data for co-infected Floridians

Kelvin Chan Feb 10, 2015

Statistical Hocus Pocus? Assessing the Accuracy of a Diagnostic Screening Test When You Don t Even Know Who Has the Disease

To provide you with necessary knowledge and skills to accurately perform 3 HIV rapid tests and to determine HIV status.

HIV Housing Care Continuum Webinar 1 August 3, 2016

Salmonella outbreaks: Assessing causes and trends

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

Joanne Stekler, MD, MPH Assistant Professor of Medicine Deputy Director, Public Health Seattle & King County HIV/STD Program

Cross-Border Cooperation in the Global Fight against Communicable Diseases

Outline. Successful Integration of Hepatitis Vaccination Services into Programs for High-Risk Adults An Update of State-Based Programs

Method Comparison for Interrater Reliability of an Image Processing Technique in Epilepsy Subjects

New SASI Analysis: In the Deep South, Significant Percentages of People Most Impacted by HIV Live Outside Large Urban Areas

Quality Measurement in HIV Testing in Managed Care or Large Health Systems

Advanced Decision Analysis: Markov Models

HIV EPIDEMIOLOGY IN NEW YORK CITY

Workbook Method for estimating HIV prevalence in low level or concentrated epidemics

Stochastic models for the spread of infection

Black MSM, HIV, and the Social Determinants of Health Imperative

WinBUGS : part 1. Bruno Boulanger Jonathan Jaeger Astrid Jullion Philippe Lambert. Gabriele, living with rheumatoid arthritis

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010

Long-term Survival of Children with Human Immunodeficiency Virus Infection in New York City: Estimates from Population-based Surveillance Data

Backcalculating HIV incidence and predicting AIDS in Australia, Cambodia and Vietnam. Australia

Step 1: Learning Objectives

EPI-LOG. West Virginia. Statewide Disease Facts & Comparisons. New surveillance techniques show HIV infection rates went underestimated, says CDC

Summary Report: Survey of Hepatitis C Virus counseling and testing services at HIV counseling and testing sites, and health service sites.

George B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide

Data Use to Inform HIV Programs and Policies. Usma Khan, MS Hilary Spindler, MPH Prevention and Public Health Group Global Health Sciences

2016 NYS HIV Quality of Care Review

New Jersey HIV/AIDS Epidemiologic Overview, 2017 (Data based upon the HIV/AIDS Reporting System ehars, unless otherwise noted.)

Minnesota Cancer Alliance SUMMARY OF MEMBER INTERVIEWS REGARDING EVALUATION

Transcription:

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 1 MEASURING THE UNDIAGNOSED FRACTION: Understanding the UW and CDC back-calculation models Martina Morris, PhD Director, UW CFAR SPRC Jeanette K Birnbaum, PhD Research Scientist, UW CFAR Based on work originally developed by Ian Fellows, PhD

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 2 Outline 1. Back-calculation, the basics The original method, developed in the 1980s 2. Current back calculation methods UW: Testing history back-calculation CDC: Extended back-calculation 3. Comparing the results for WA State Future work

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 3 BACK-CALCULATION The basics

Cases Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 4 Basic idea What you see now Is based on infections that happened in the past Time Can you use new diagnoses to back-calculate past incidence?

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 5 Time from Incidence to Diagnosis Imagine an HIV Dx always happens within 3 years of infection 25% get Dx in first year 50% in second year 25% in third year TID distribution of Time from Infection to Diagnosis

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 6 Time from Incidence to Diagnosis With this TID (25%, 50%, 25%) And 100 new infections this year The observed HIV Dx curve in the future would look like this: Observed Diagnoses 100 80 60 40 20 0 True incidence: 100 cases in year 1 only Dx=1 Dx=2 Dx=3 25% 50% 25% One year of incidence (n=100), distributed over 3 years of Dx TID

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 7 Tracking in tabular form From infections (unobserved) to diagnoses (observed) Year of Incidence (t) # New Infections 10 8 6 4 2 0 2013 (z=0) Year of Diagnosis (t+z) 2014 (z=1) 2015 (z=2) 2013 100 25 50 25 2014 2015 Dx=1 Dx=2 Dx=3 25% 50% 25% Obs Dx(t+z ) = New Infections(t) * TID(t+z )

Observed HIV Dx Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 8 With multiple years of incidence? Assume constant incidence (100 cases each year) And the same TID (25-50-25%) Annual observed HIV Dx is now a mix of cases from previous (up to 3) years Diagnosis by year: Constant Incidence 300 250 Incident cases from: Year 1 200 150 100 50 0 True incidence: constant 1 2 3 4 Year of Dx Note: With constant incidence Obs Dx = Incidence after max TID

Observed HIV Dx Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 9 With multiple years of incidence? Assume constant incidence (100 cases each year) And the same TID (25-50-25%) Annual observed HIV Dx is now a mix of cases from previous (up to 3) years Diagnosis by year: Constant Incidence 300 250 200 Incident cases from: Year 1 Year 2 150 100 50 0 True incidence: constant 1 2 3 4 Year of Dx Note: With constant incidence Obs Dx = Incidence after max TID

Observed HIV Dx Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 10 With multiple years of incidence? Assume constant incidence (100 cases each year) And the same TID (25-50-25%) Annual observed HIV Dx is now a mix of cases from previous (up to 3) years Diagnosis by year: Constant Incidence 300 250 200 Incident cases from: Year 1 Year 2 Year 3 150 100 50 0 True incidence: constant 1 2 3 4 Year of Dx Note: With constant incidence and TID Obs Dx = Incidence after max TID

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 11 Multi-year incidence, tabular form Year of Incidence (t) New Infections Obs Dx( t+z ) = Year of Diagnosis (t+z) 2013 2014 2015 2016 2017 2013 100 25 50 25 2014 100 25 50 25 2015 100 25 50 25 Total Dx 300 25 75 100 We re assuming constant incidence Z z=0 New Infections TID(t+z) Future years TID t+0 = 25% t+1 = 50% t+2 = 25%

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 12 Undiagnosed cases calculation Year of Incidence New Infections Year of Diagnosis 2013 2014 2015 2013 100 25 50 25 2014 100 25 50 2015 100 25 Totals 300 25 75 100 300 incident by 2015 200 diagnosed by 2015 Undiagnosed (2015) = Cumulative incidence Cumulative diagnosed = 300 200 = 100

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 13 This would be straightforward, if If you could observe everything Incidence TID Dx cases But we only observe Dx cases Year of Incidence 2013 2014 2015 Totals New Infections Year of Diagnosis 2013 2014 2015 100 25 50 25 100 25 50 100 25 300 25 75 100 If we can estimate the TID from some other data, then we can back calculate the new infections, and the undiagnosed cases Z Obs Dx( t+z ) = z=0 New Infections TID(t+z)

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 14 Start with observed Dx Year of Incidence (t) 2013 2014 2015 Year of Diagnosis (t+z) 2013 2014 2015 2016 2017 Total Dx 25 75 100 Obs Dx( t+z ) = Z z=0 New Infections TID(t+z)

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 15 Use an estimated TID Year of Incidence (t) Year of Diagnosis (t+z) 2013 2014 2015 2016 2017 2013 25 50 0.25*NI 2014 25 0.50*NI 25 2015 0.25*NI 50 25 Total Dx 25 75 100 t+0 = 25% t+1 = 50% t+2 = 25% TID Obs Dx( t+z ) = Z z=0 New Infections TID(t+z)

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 16 Back-fill in all the cells Year of Incidence (t) Year of Diagnosis (t+z) 2013 2014 2015 2016 2017 2013 0.25*NI 0.50*NI 0.25*NI 2014 0.25*NI 0.50*NI 0.25*NI 2015 0.25*NI 0.50*NI 0.25*NI Total Dx 25 75 100 t+0 = 25% t+0 = 25% t+1 = 50% NI=New Infections Obs Dx( t+z ) = Z k=0 New Infections TID(t+z)

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 17 And solve for the NI (New Infections) Year of Incidence (t) New Infections Year of Diagnosis (t+z) 2013 2014 2015 2016 2017 2013 100 25 50 25 2014 100 25 50 25 2015 100 25 50 25 Total Dx 300 25 75 100 Obs Dx( t+z ) = Z z=0 New Infections TID(t+z)

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 18 Note (1) Year of Incidence (t) New Infections Year of Diagnosis (t+z) 2013 2014 2015 2016 2017 2013 100 25 50 25 2014 100 25 50 25 2015 100 25 50 25 Total Dx 300 25 75 100 We assume constant incidence here, but the approach also works if incidence is changing

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 19 Note (2) Year of Incidence (t) New Infections Year of Diagnosis (t+z) 2013 2014 2015 2016 2017 2013 100 25 50 25 2014 100 25 50 25 2015 100 25 50 25 Total Dx 300 25 75 100 We assume a constant TID here too, but the approach also works if the TID is changing

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 20 Note (3) Year of Incidence (t) New Infections Year of Diagnosis (t+z) 2013 2014 2015 2016 2017 2013 100 25 50 25 2014 100 25 50 25 2015 100 25 50 25 Total Dx 300 25 75 100 We can also use this method to project diagnoses forward

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 21 Summary Observe Dx( t+z ) = Z z=0 Back calculate Estimate from other data New Infections TID(t+z)

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 22 ORIGINAL BACK-CALCULATION From the way way back

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 23 Context First used in 1986 by Brookmeyer and Gail for the HIV epidemic At that time, only AIDS Dx were available So the goal was to back calculate HIV incidence from AIDS Dx Not much data available http://www.stat.wisc.edu/~yandell/stat/50-year/brookmeyer_ron.pdf

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 24 Original back-calculation Observe AIDS Dx(t+z ) = Uses AIDS Dx, and AIDS TID Z z=0 Back calculate HIV Incidence(t+Z z) AIDS TID(t+z) Incidence is not assumed to be constant Estimate from data* * Data sources (all from the 1980s): Multicenter Hemophilia Cohort Study (N=373) International Registry of Seroconverters (MSM, N=1020) Amsterdam cohort studies (IDU, N=173; MSM, N=348) Multicenter AIDS Cohort Study (MSM, N=1861)

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 25 Estimating the AIDS TID ( Incubation Period ) Multiple approaches Different parametric forms (Weibull, Gamma) Different # stages of infection (2-5) Problems Time of infection usually not known Loss to follow-up Representativeness of cohorts Treatment delays AIDS onset

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 26 Key improvement to the method Incorporating data on HIV Dx Necessary because treatment dramatically reduced AIDS Dx Possible because HIV Dx became reportable

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 27 Fast forward to now Before testing + treatment Original AIDS Back-Calc Current UW Testing History Back-Calc CDC Extended Back-Calc

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 28 UW vs CDC: Overview Similarities: Both incorporate data on observed HIV Dx But use this in very different ways Both use complex algorithms for estimation UW : EM Algorithm (EM = Expectation-Maximization) CDC : Bayesian MCMC (MCMC = Markov Chain Monte Carlo ) Differences: Use different information in the back-calc process Select and weight cases from the Dx populations differently

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 29 UW TESTING HISTORY METHOD Originally developed by Ian Fellows, PhD Fellows, I., M. Morris, J. Dombrowski, S. Buskin, A. Bennett and M. R. Golden (2015). "A New Method for Estimating the Number of Undiagnosed HIV Infected Based on HIV Testing History, with an Application to Men Who Have Sex with Men in Seattle/King County, WA." PLOS One 10(7): e0129551.

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 30 UW Testing History back-calculation HIV Dx(t+z ) = A simple approach: Use HIV Dx to back calculate HIV incidence So for this we need to estimate the HIV TID Z k=0 Back calculate HIV Incidence(t+Z z) HIV TID(t+z)

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 31 Use testing histories to estimate the TID past Window of possible infection present Last negative test HIV diagnosis Testing histories give us an infection window, but When did infection occur in the window? What if people never had a LNT, or have missing data?

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 32 Full Model overview (we ll take it in pieces) Stratify by the LNT Last Negative Test (LNT) 1 TID Estimation: Undiagnosed estimation Yes No Window = ITI Window = min(18, age-16) Base Case Uniform distn across window Upper Bound All inf at start of window Incidence Back- Calculation Missing Assume MAR 2 3 3 important assumptions

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 33 1. Model for repeat testers Stratify by the LNT Last Negative Test (LNT) TID Estimation: Undiagnosed estimation Yes No Window = ITI ITI = inter-test interval Base Case Uniform distn across window Upper Bound All inf at start of window Incidence Back- Calculation Missing 1 First assumption: Distribution of infection probability

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 34 Assumption 1: Infection probability distribution past Window of possible infection present Last negative test HIV diagnosis # infections Base Case Upper Bound # infections Last neg test Diagnosis Last neg test Diagnosis Uniform: Distributes the probability of infection uniformly across the possible interval At last neg test: Probability=1 that infection occurred on the day after the last negative test

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 35 2. Model for Dx with no previous test Stratify by the LNT Last Negative Test (LNT) TID Estimation: Undiagnosed estimation Yes No Window = min(18, age-16) Base Case Uniform distn across window Upper Bound All inf at start of window Incidence Back- Calculation Missing 2 Second assumption: Window length

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 36 Assumption 2: Window length if no previous test AIDS TID 95% of HIV+ progress to AIDS in 18 years (Lui 1996) Age 16 is the median age of sexual debut in the US 95% So we take the minimum of these as the window length past Window of possible infection present And then apply base case or upper bound assumption for the distribution of infection probability Min(18 years ago, age-16) HIV diagnosis

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 37 3. Model for cases missing test info Stratify by the LNT Last Negative Test (LNT) Yes No Missing TID Estimation: Base Case Uniform distn across window Upper Bound All inf at start of window Assume MAR Undiagnosed estimation Incidence Back- Calculation 3 Third Assumption: Treatment of missing cases

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 38 Assumption 3: If Dx case is missing test information We have two options: Include these when estimating the TID distribution Assuming the maximum possible infection window IMPACT: A conservative (longer) estimate of the time spent undiagnosed. Exclude these when estimating the TID distribution We still use them in the back-calculation, we just give them the TID estimated from the other cases Assumes these cases are missing at random (MAR)

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 39 Full Model overview Stratify by the LNT Last Negative Test (LNT) 1 TID Estimation: Undiagnosed estimation Yes No Window = ITI Window = min(18, age-16) Base Case Uniform distn across window Upper Bound All inf at start of window Incidence Back- Calculation Missing Assume MAR 2 3 3 important assumptions

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 40 Results WA State: ITI dist n (2006-2015) The distribution of inter-test intervals For all non-missing cases 46% have an LNT (24% HIV/AIDS Dx) 12% no LNT (51% HIV/AIDS Dx) 42% of cases are missing, so not included in TID estimation (39% HIV/AIDS Dx)

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 41 Results WA State: HIV TID (2006-2014) Test found no significant variation in TID over time 64% 45% Estimate Mean TID % UnDx at 1yr Base Case 2.5 years 45% Upper Bound 5 years 64%

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 42 Results WA State: Incidence & UnDx Observed Dx (black) Estimated Incidence (colors) Obs=Est and Base=UB suggests relative stability in the dynamics Estimated UnDx cases Upper bound ~ 2x Base Case 2014: ~100 new cases/qtr 1200-2500 UnDx cases

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 43 CDC EXTENDED BACK CALC An, Q., J. Kang, R. Song and H. I. Hall (2015). "A Bayesian hierarchical model with novel prior specifications for estimating HIV testing rates." Statistics in Medicine (in press).

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 44 CDC Extended Back-Calc Overview Uses AIDS Dx, like original method Adds data on HIV Dx Stratifies observed cases by HIV Dx only vs. Concurrent HIV/AIDS diagnosis (AIDS Dx within 1 year of HIV Dx) And it uses a Bayesian estimation approach So it will need prior distributions to start the estimation algorithm

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 45 CDC Model overview Stratifies by concurrent HIV/AIDS dx Undiagnosed estimation Data Cleaning Dx Cases AIDS in same year HIV only 1 AIDS TID Testing Rates (estimated) Incidence AND Testing rate Back- Calculation Priors are part of the Bayesian estimation method Priors on testing rate distributions Priors on incidence distributions 2 3

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 46 Key difference: competing risks of Dx In the UW TH model HIV+ person faces only one risk : the risk of HIV Dx by testing In the CDC model HIV+ person faces two risks: AIDS Dx or HIV Dx So the model has to specify how these processes unfold over time HIV+ Event at time t AIDS Dx HIV Dx UnDx Depends on: AIDS TID (Dx), not tested AIDS TID (nodx), not tested earlier, tested now AIDS TID (nodx), not tested

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 47 Assumption 1: Historical AIDS TID This is how they distribute the probability of HIV infection back in time for someone diagnosed with AIDS Gamma(2,4) past Window of possible infection present Max time to AIDS AIDS diagnosis

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 48 Assumption 2: HIV testing rate priors HIV testing rates are not observed they are estimated Huh? But these rates are used to estimate incidence. So how can they be estimated too?

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 49 Bayesian estimation in a nutshell Say you have a parameter you want to estimate, p But you don t have a simple formula for it in terms of the data you observe (here HIV Dx and AIDS Dx) Draw a starting value from a distribution Reflects what we think/know about the value of p This is the prior distribution Plug it in and calculate the predicted Dx Compare the predicted Dx to the observed Update the estimate of p in the prior Repeat until predicted=observed

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 50 Assumption 2: HIV testing rate priors HIV testing rates are not observed they are estimated Prior distributions for the annual rates, 1977-present Mean testing rate (starting values for prior) Example priors for different means The testing rate is for HIV+ persons (not the pop n)

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 51 Assumption 3: HIV incidence prior HIV incidence is also estimated in this model Specifies another prior distribution to start Mean incidence (starting values for prior) (mean HIV Dx over all years) Single prior, but posterior estimates allowed to vary by year

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 52 Overview of the CDC model Two unknown parameter sets to estimate: Annual testing rates {p i H } Annual HIV incidence {λ i } Two observed data series Annual HIV Dx Annual AIDS Dx One fixed assumption (the AIDS TID) Two Bayesian priors for testing rates and HIV incidence

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 53 CDC Results: National testing rates 1985-2010 Mean rate estimate stabilizes at around 22% per year

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 54 CDC Results: National mean time to HIV Dx (Note: WA analysis starts in 2006, est 2.5-5 yrs) 2010: 3.3 years

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 55 CDC Results: National UnDx estimate for 2012 CDC National Model Undiagnosed Fraction 12.8 (12.0-13.6) Total Undiagnosed 1,218,400 (1,207,100 1,228,200) 156,300 (144,100 165,900) Note: This requires estimating the number of persons living with HIV, and we re not describing how they do that here

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 56 Other aspects of both models Both can be used to estimate the UnDx Fraction But need an estimate of PLWH for the denominator Both models assume there is some stability in the year to year changes (smoothing) For UW method: Incidence counts in adjacent years are smoothed For CDC method: Both incidence and testing rates are smoothed Both models can stratify estimation by other factors Sex Risk exposure Geography (though this raises issues about modeling migration) But small sample sizes will lead to unstable estimates

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 57 Summary of model differences Uses of HIV Dx UW estimates the HIV TID to back-calculate HIV incidence Using measured inter-test intervals for HIV Dx when available Relies on the max AIDS TID window for cases Dx on their first test Variation in the TID by year can be evaluated and incorporated CDC estimates testing rates as part of the HIV incidence back-calculation Calibrated to best fit observed HIV and AIDS Dx trends Relies also on the AIDS TID The annual average rate is allowed to vary over time Uses of AIDS Dx UW does not use this (but could be adapted) CDC uses this to estimate both testing rates and HIV incidence

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 58 COMPARING RESULTS FOR WA What we know now, and plans for future investigation

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 59 WA state estimates for 2012 All UnDx Fraction Undiagnosed Total CDC* UW % DIFFERENCE 11.0 (7.7 15.0) 1,700 (1,200--2,400) 15,500 (14,900--16,100) 10.6 (18.8) -3.8% 1,410 (2,750) -20.6% 13,310 (14,650) -16.5% MSM UnDx Fraction 11.7 (7.5 16.5) 6.8 (12.6) -72.1% Undiagnosed 1,200 (730--1,800) 647 (1274) -85.5% Total 10,300 (9,900--10,800) 9,519 (10,147) -8.2% *Source: WA DOH

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 60 Potential Sources of Differences PLWH denominator for estimating the undiagnosed fraction Doesn t seem to be the driving difference since the totals are similar So, that leaves one of: Case selection/weighting CDC does adjustments and weighting for reporting delays and missing data But I m guessing this is not the primary driver Model structure and assumptions If testing histories provide more precision for MSM, our estimates may be better Not sure what could lead to their estimates being better

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 61 Future Work Compare results using identical datasets Accommodate weights in testing history method to use CDC data Can t run CDC method on our data since it needs to go back to 1977, which requires doing all their data cleaning relevant to the older cases Test both models on a mock dataset in which incidence is known Simulation study Generate the mock data from an independent model of HIV natural history

Friday, May 27, 2016 SPRC-PHSKC Lunchbox Talks 62 THANK YOU