Meta-analysis of external validation studies

Similar documents
Systematic reviews of prognostic studies 3 meta-analytical approaches in systematic reviews of prognostic studies

Systematic reviews of prognostic studies: a meta-analytical approach

Individual Participant Data (IPD) Meta-analysis of prediction modelling studies

External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: opportunities and challenges

Systematic reviews of prediction modeling studies: planning, critical appraisal and data collection

Summarising and validating test accuracy results across multiple studies for use in clinical practice

Meta-analysis of few small studies in small populations and rare diseases

Meta-analysis of two studies in the presence of heterogeneity with applications in rare diseases

Meta-analysis of clinical prediction models

Development, validation and application of risk prediction models

From single studies to an EBM based assessment some central issues

Meta-analysis using individual participant data: one-stage and two-stage approaches, and why they may differ

Advanced IPD meta-analysis methods for observational studies

Web Extra material. Comparison of non-laboratory-based risk scores for predicting the occurrence of type 2

Methods for meta-analysis of individual participant data from Mendelian randomization studies with binary outcomes

Bayesian meta-analysis of Papanicolaou smear accuracy

Detecting small-study effects and funnel plot asymmetry in meta-analysis of survival data: A comparison of new and existing tests

Methods for evidence synthesis in the case of very few studies. Guido Skipka IQWiG, Cologne, Germany

Models for potentially biased evidence in meta-analysis using empirically based priors

It s hard to predict!

Multivariate meta-analysis for non-linear and other multi-parameter associations

Study protocol v. 1.0 Systematic review of the Sequential Organ Failure Assessment score as a surrogate endpoint in randomized controlled trials

Introduction to Meta-analysis of Accuracy Data

Treatment effect estimates adjusted for small-study effects via a limit meta-analysis

Analysis of left-censored multiplex immunoassay data: A unified approach

Explicit inclusion of treatment in prognostic modelling was recommended in observational and randomised settings

Methods Research Report. An Empirical Assessment of Bivariate Methods for Meta-Analysis of Test Accuracy

An Empirical Assessment of Bivariate Methods for Meta-analysis of Test Accuracy

Using Statistical Principles to Implement FDA Guidance on Cardiovascular Risk Assessment for Diabetes Drugs

Estimating interaction on an additive scale between continuous determinants in a logistic regression model

Applications of Bayesian methods in health technology assessment

A Comparison of Methods of Estimating Subscale Scores for Mixed-Format Tests

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers

Module Overview. What is a Marker? Part 1 Overview

Multivariate meta-analysis using individual participant data

Using mixture priors for robust inference: application in Bayesian dose escalation trials

Bayesian random-effects meta-analysis made simple

An Introduction to Bayesian Statistics

Statistical Tolerance Regions: Theory, Applications and Computation

Summary of main challenges and future directions

The prevalence and history of knee osteoarthritis in general practice: a case control study

AJAE appendix for Defining Access to Health Care: Evidence on the Importance of. Quality and Distance in Rural Tanzania * Heather Klemick

Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP I 5/2/2016

CHAMP: CHecklist for the Appraisal of Moderators and Predictors

AN INDEPENDENT VALIDATION OF QRISK ON THE THIN DATABASE

Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination

Kernel Density Estimation for Random-effects Meta-analysis

Practical Bayesian Design and Analysis for Drug and Device Clinical Trials

Reporting and Methods in Clinical Prediction Research: A Systematic Review

Template 1 for summarising studies addressing prognostic questions

Meta-analyses triggered by previous (false-)significant findings: problems and solutions

Prediction models for cardiovascular disease risk in the general population: systematic review

Estimating HIV incidence in the United States from HIV/AIDS surveillance data and biomarker HIV test results

Beyond RevMan: Meta-Regression. Analysis in Context

Genetic risk prediction for CHD: will we ever get there or are we already there?

Sensitivity, specicity, ROC

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Bayesian Dose Escalation Study Design with Consideration of Late Onset Toxicity. Li Liu, Glen Laird, Lei Gao Biostatistics Sanofi

Bayesian hierarchical modelling

Combining Predictors for Classification Using the Area Under the ROC Curve

Fixed Effect Combining

Overview of Multivariable Prediction Modelling. Methodological Conduct & Reporting: Introducing TRIPOD guidelines

Small-area estimation of mental illness prevalence for schools

What is probability. A way of quantifying uncertainty. Mathematical theory originally developed to model outcomes in games of chance.

Empirical assessment of univariate and bivariate meta-analyses for comparing the accuracy of diagnostic tests

Discrimination and Reclassification in Statistics and Study Design AACC/ASN 30 th Beckman Conference

Reflection Questions for Math 58B

5/10/2010. ISPOR Good Research Practices for Comparative Effectiveness Research: Indirect Treatment Comparisons Task Force

Performance Assessment for Radiologists Interpreting Screening Mammography

Standards for Heterogeneity of Treatment Effect (HTE)

PRACTICAL STATISTICS FOR MEDICAL RESEARCH

Cancer Incidence Predictions (Finnish Experience)

Some alternatives for Inhomogeneous Poisson Point Processes for presence only data

Correlation and regression

Downloaded from:

Where a licence is displayed above, please note the terms and conditions of the licence govern your use of this document.

EPI 200C Final, June 4 th, 2009 This exam includes 24 questions.

Identifying patients with undetected pancreatic cancer in primary care:

Evidence synthesis with limited studies

Improved control for confounding using propensity scores and instrumental variables?

Seeing the Forest & Trees: novel analyses of complex interventions to guide health system decision making

Empirical evidence on sources of bias in randomised controlled trials: methods of and results from the BRANDO study

Bayesian Modeling of Multivariate Spatial Binary Data with Application to Dental Caries

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

Overview. All-cause mortality for males with colon cancer and Finnish population. Relative survival

Bayes-Verfahren in klinischen Studien

Biostatistics Primer

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Designing a Bayesian randomised controlled trial in osteosarcoma. How to incorporate historical data?

(A) (B) 2019 American Diabetes Association. Published online at

TWISTED SURVIVAL: IDENTIFYING SURROGATE ENDPOINTS FOR MORTALITY USING QTWIST AND CONDITIONAL DISEASE FREE SURVIVAL. Beth A.

RAG Rating Indicator Values

Karel G.M. Moons UMC Utrecht Julius Center for Health Sciences and Primary Care,

Systematic Reviews and meta-analyses of Diagnostic Test Accuracy. Mariska Leeflang

Heart Rate in Patients with Coronary Artery Disease - the Lower the Better? An Analysis from the Treating to New Targets (TNT) trial

Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome

Today Retrospective analysis of binomial response across two levels of a single factor.

A note on the graphical presentation of prediction intervals in random-effects meta-analyses

Summary HTA. HTA-Report Summary

Transcription:

Meta-analysis of external validation studies Thomas Debray, PhD Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, The Netherlands Cochrane Netherlands, Utrecht, The Netherlands T.Debray@umcutrecht.nl

Meta-analysis of external validation studies - Introduction Prediction ˆ Turn available information about individuals into a statement about the probability:... of having a particular disease diagnosis... of developing a particular event prognosis ˆ Use of multiple predictors Subject characteristics History and physical examination results Imaging results (Bio)markers ˆ Typical aims to inform patients and their families to guide treatment and other clinical decisions to create risk groups Vigo, July 10, 2017 TPA Debray 2/23

Meta-analysis of external validation studies - Introduction What is a good model? Accurate predictions CALIBRATION DISCRIMINATION IMPACT Improve patient outcomes Ability to distinguish between low and high risk patients Good and consistent performance across different settings and populations GENERALITY SUPPORT Influence decision making Vigo, July 10, 2017 TPA Debray 3/23

Meta-analysis of external validation studies - Introduction Numerous models for same target population + outcomes 1995 We believe that the main reasons why doctors reject published prognostic models are lack of clinical credibility and lack of evidence that a prognostic model can support decisions about patient care. Wyatt JC, Altman DG. Commentary: Prognostic models: clinically useful or quickly forgotten? BMJ. 1995;311(7019):1539 41. Vigo, July 10, 2017 TPA Debray 4/23

Meta-analysis of external validation studies - Introduction Numerous models for same target population + outcomes 1995 2012 Comparing risk prediction models should be routine when deriving a new model for the same purpose Collins GS, Moons KGM. Comparing risk prediction models. BMJ. 2012;344:e3186 e3186. Vigo, July 10, 2017 TPA Debray 5/23

Meta-analysis of external validation studies - Introduction Numerous models for same target population + outcomes 1995 2012 2016 There is an excess of models predicting incident CVD in the general population. The usefulness of most of the models remains unclear. Damen JAAG, Hooft L, et al. Prediction models for cardiovascular disease risk in the general population: systematic review. BMJ. 2016;353:i2416. Vigo, July 10, 2017 TPA Debray 6/23

Meta-analysis of external validation studies - Introduction Numerous models for same target population + outcomes 1995 2012 2016 2017 Formal guidance for systematic review and meta-analysis Vigo, July 10, 2017 TPA Debray 7/23

Motivating example Previous guidance focused on meta-analysis of logistic regression models. Hence, focus of today is on survival models. Framingham Risk Score (Wilson et al. 1998) ˆ Model type: Cox regression ˆ Outcome: Fatal or non-fatal coronary heart disease (CHD) ˆ Timing: Initial CHD within 10 years ˆ Evidence: 24 validations in male populations Summarize estimates of model performance ˆ Concordance statistic (cstat) ˆ Ratio of observed versus expected events (OE) ˆ Calibration slope (slope) Vigo, July 10, 2017 TPA Debray 8/23

Statistical framework - data extraction Key problem: Poor and inconsistent reporting of prediction model performance. Standard error of the c-statistic [ ] c (1 c) 1 + n 1 c 2 c + m c 1+c SE(c) mn with c the reported c-statistic, n the number of observed events, m the total number of non-events and m = n = 1 2 (m + n) 1. Newcombe RG. Confidence intervals for an effect size measure based on the Mann-Whitney statistic. Part 2: asymptotic methods and evaluation. Stat Med. 2006;25(4):559 73. Vigo, July 10, 2017 TPA Debray 9/23

Estimated versus reported standard error of the c-statistic 0.100 EuroSCORE II Framingham Reported standard error 0.075 0.050 0.025 0.000 0.000 0.025 0.050 0.075 0.1000.000 0.025 0.050 0.075 0.100 Estimated standard error Vigo, July 10, 2017 TPA Debray 10/23

Statistical framework - data extraction Ratio of observed versus expected events ˆ Reference value: 1 ˆ Observed survival probability (S KM,t ) ˆ Expected (predicted) event rate (P E,t ) (O:E) t = 1 S KM,t P E,t SE(O:E) t = 1 P E,t SE(S KM,t ) If unavailable, SE(S KM,t ) can be approximated as well: SE(S KM,t ) S KM,t (1 S KM,t ) N t Vigo, July 10, 2017 TPA Debray 11/23

Extraction of event rates: an example Vigo, July 10, 2017 TPA Debray 12/23

Statistical framework - data extraction Calibration slope ˆ Reference value: 1 ˆ The calibration slope β is calculated as follows using IPD: y k Bernoulli(p k ) logit(p k ) = α + β LP k ˆ When missing, we can derive β from reported event counts across j risk strata (e.g. as presented in calibration tables): O j Binom(N j, p O,j ) logit(p O,j ) = η + β logit(p E,j ) Vigo, July 10, 2017 TPA Debray 13/23

Extracted risk estimates for the Framingham Risk Score 0.5 0.4 Observed risk 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 Expected risk The diagonal line indicates perfect calibration. Risk estimates were reported for 5 years follow-up (dashed lines), 7.5 years follow-up (dotted lines) and 10 years follow-up (full lines).

Statistical framework - data extraction #validations 24 20 10 0 21 21 19 10 11 2 0 0 16 10 4 2 11 11 0 0 analysed reported study authors approximated cstat SE.cstat OE slope For 10 studies, calibration performance was only available for < 10 years follow-up. Vigo, July 10, 2017 TPA Debray 15/23

Statistical framework - meta-analysis models Meta-analysis of the c-statistic ˆ Need to apply logit transformation logit(c i ) N ( µ discr, Var (logit(c i )) + τdiscr 2 ) ˆ Use delta method to derive Var (logit(c i )) from SE(c i ) ˆ For Bayesian models, we propose weakly informative priors: µ discr N (0, 10 6 ) τdiscr Unif(0, 2) τdiscr Student-t(0, 0.5 2, 3) T [0, 10] based on empirical data from 26 meta-analyses (with ˆτ discr ranging from 0 to 0.50). Vigo, July 10, 2017 TPA Debray 16/23

Statistical framework - meta-analysis models Meta-analysis of the c-statistic Estimation K Summary 95% CI 95% PI REML 21 0.69 0.66 0.71 0.59 0.77 Bayesian (Unif) 24 0.69 0.66 0.71 0.59 0.78 Bayesian (Student-t) 24 0.69 0.66 0.71 0.59 0.78 For 3 studies, we did not have information on c i but could nevertheless approximate SE(c i ). Vigo, July 10, 2017 TPA Debray 17/23

Statistical framework - meta-analysis models Meta-analysis of the total O:E ratio We can use different models to account for sampling variability: Option 1 ln(o:e) i N ( µ cal.oe, Var (ln(o:e) i ) + τcal.oe 2 ) Option 2 O i Binom (N i, p O,i ) E i Binom (N i, p E,i ) ln (p O,i /p E,i ) N ( µ cal.oe, τcal.oe 2 ) Option 3 O i Poisson (E i exp(η i )) η i N ( µ cal.oe, τcal.oe 2 ) For all models, the interpretation of µ cal.oe and τ cal.oe is the same. Vigo, July 10, 2017 TPA Debray 18/23

Statistical framework - meta-analysis models Meta-analysis of the total O:E ratio (c ed) ˆ For Bayesian models, we propose weakly informative priors: µ cal.oe N (0, 10 6 ) τ cal.oe Unif(0, 2) τcal.oe Student-t(0, 1.5 2, 3) T [0, 10] based on empirical data from 16 meta-analyses (with ˆτ cal.oe ranging from 0 to 1.39). ˆ Possible to extrapolate event rates at time l to time t using: ( ) t ln(1 pl ) p t = 1 S KM,t = 1 exp l Straightforward to integrate in Bayesian estimation framework Vigo, July 10, 2017 TPA Debray 19/23

Statistical framework - meta-analysis models Meta-analysis of the total O:E ratio Estimation K Summary 95% CI 95% PI REML 1 6 0.56 0.28 1.16 0.09 3.62 Bayesian 1 (Unif) 6 0.61 0.19 1.08 0.00 2.84 Bayesian 1 (Student-t) 6 0.61 0.20 1.07 0.00 2.63 ML 3 6 0.56 0.25 1.26 0.03 11.29 Bayesian 3 (Unif) 7 0.60 0.19 1.09 0.00 2.91 Bayesian 3 (Student-t) 7 0.60 0.18 1.05 0.00 2.67 When applying extrapolation, we have 10 additional studies for meta-analysis (similar results). Vigo, July 10, 2017 TPA Debray 20/23

Statistical framework - meta-analysis models Meta-analysis of the calibration slope ˆ No transformations needed ˆ Rely on binomial approximation O ij Binom(N ij, p O,ij ) logit(p O,ij ) = α i + β i logit(p E,ij ) β i N (µ cal.slope, τ 2 cal.slope ) ˆ For Bayesian models, we propose weakly informative priors: µcal.slope N (0, 10 6 ) τcal.slope Unif(0, 2) τ cal.slope Student-t(0, 1.5 2, 3) T [0, 10] Vigo, July 10, 2017 TPA Debray 21/23

Statistical framework - meta-analysis models Meta-analysis of the calibration slope Estimation K Summary 95% CI 95% PI ML 3 1.03 0.90 1.16 0.20 1.87 Bayesian 3 1.05 0.47 1.64-0.01 2.22 Bayesian 3 1.05 0.51 1.65-0.06 2.17 When applying extrapolation, we have 8 additional studies for meta-analysis (similar results but smaller intervals). Vigo, July 10, 2017 TPA Debray 22/23

Meta-analysis of external validation studies - Conclusion Final remarks ˆ Substantial efforts often needed to restore missing information ˆ Bayesian estimation methods recommended to fully propagate uncertainty arising from data restoration ˆ Development of R package metamisc to assist in data preparation and meta-analysis ˆ Presence of statistical heterogeneity most likely ˆ Straightforward extension to meta-regression and multivariate meta-analysis Vigo, July 10, 2017 TPA Debray 23/23