Computer Models for Medical Diagnosis and Prognostication

Similar documents
Lessons learned From The National PCI Registry

Module Overview. What is a Marker? Part 1 Overview

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers

BMI 541/699 Lecture 16

Safety of Single- Versus Multi-vessel Angioplasty for Patients with AMI and Multi-vessel CAD

RISK PREDICTION MODEL: PENALIZED REGRESSIONS

ROC (Receiver Operating Characteristic) Curve Analysis

It s hard to predict!

A Risk Score to Predict In-Hospital Mortality for Percutaneous Coronary Interventions

Modeling and Risk Prediction in the Current Era of Interventional Cardiology

Abstract Background: Methods: Results: Conclusions:

Revascularization after Drug-Eluting Stent Implantation or Coronary Artery Bypass Surgery for Multivessel Coronary Disease

Supplemental Material

Risk modeling for Breast-Specific outcomes, CVD risk, and overall mortality in Alliance Clinical Trials of Breast Cancer

CVD risk assessment using risk scores in primary and secondary prevention

Sensitivity, specicity, ROC

Coronary Artery Stenosis. Insight from MAIN-COMPARE Study

Logistic Regression Predicting the Chances of Coronary Heart Disease. Multivariate Solutions

Statistics, Probability and Diagnostic Medicine

Relationship between body mass index, coronary disease extension and clinical outcomes in patients with acute coronary syndrome

Various performance measures in Binary classification An Overview of ROC study

PROMUS Element Experience In AMC

CARDIOVASCULAR RISK ASSESSMENT ADDITION OF CHRONIC KIDNEY DISEASE AND RACE TO THE FRAMINGHAM EQUATION PAUL E. DRAWZ, MD, MHS

Coronary Plaque Sealing: The DEFER Study and more...

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models

Journal of the American College of Cardiology Vol. 42, No. 10, by the American College of Cardiology Foundation ISSN /03/$30.

Interventional Cardiology

Hayden Smith, PhD, MPH /\ v._

Declaration of conflict of interest. Nothing to disclose

Critical reading of diagnostic imaging studies. Lecture Goals. Constantine Gatsonis, PhD. Brown University

Net Reclassification Risk: a graph to clarify the potential prognostic utility of new markers

Development, validation and application of risk prediction models

Percutaneous Intervention of Unprotected Left Main Disease

Assessment of performance and decision curve analysis

Improved Hepatic Fibrosis Grading Using Point Shear Wave Elastography and Machine Learning

An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

Influence of Hypertension and Diabetes Mellitus on. Family History of Heart Attack in Male Patients

Management of cardiovascular disease - coronary interventions -

LIBERTY 360 Study. 15-Jun-2018 Data 1. Olinic Dm, et al. Int Angiol. 2018;37:

New insights in stent thrombosis: Platelet function monitoring. Franz-Josef Neumann Herz-Zentrum Bad Krozingen

VCU Pauley Heart Center: A 2009 US News Top 50 Heart and Heart Surgery Hospital

Periprocedural Myocardial Infarction and Clinical Outcome In Bifurcation Lesion

Unprotected LM intervention

Between Coronary Angiography and Fractional Flow Reserve

Screening (Diagnostic Tests) Shaker Salarilak

FFR in Multivessel Disease

Systematic reviews of prognostic studies 3 meta-analytical approaches in systematic reviews of prognostic studies

High-sensitivity Troponin T Predicts Recurrent Cardiovascular Events in Patients with Stable Coronary Heart Disease: KAROLA Study 8 Year FU

PERFORMANCE MEASURES

Clinical Seminar. Which Diabetic Patient is a Candidate for Percutaneous Coronary Intervention - European Perspective

Final Clinical and Angiographic Results From a Nationwide Registry of FIREBIRD Sirolimus- Eluting Stent: Firebird In China (FIC) Registry (PI R. Gao)

Clinical Lessons from BMC2-PCI

Identifying Deviations from Usual Medical Care using a Statistical Approach

A simple screening score for diabetes for the Korean population

CLINICAL CONSEQUENCES OF THE

Background. Drug-eluting stents (DES) are increasingly used in complex patients with a high clinical or lesion-related risk of adverse events.

Worksheet for Structured Review of Physical Exam or Diagnostic Test Study

FFR vs. icecg in Coronary Bifurcations (FIESTA) - preliminary results. Dobrin Vassilev MD, PhD National Heart Hospital Sofia, Bulgaria

Discrimination and Reclassification in Statistics and Study Design AACC/ASN 30 th Beckman Conference

Biost 590: Statistical Consulting

Left Main PCI. Integrated Use of IVUS and FFR. Seung-Jung Park, MD, PhD

Cardiology Department Coimbra Hospital and Medical School Portugal

Machine learning II. Juhan Ernits ITI8600

The MAIN-COMPARE Study

FRACTIONAL FLOW RESERVE: STANDARD OF CARE

New Insight about FFR and IVUS MLA

Critical Appraisal of Risk Adjusted Analysis and Public Reporting of Outcomes in Cardiac Surgery

NCVD-PCI Registry. Percutaneous Coronary Intervention (PCI) Registry, MALAYSIA c/o National Heart Association of Malaysia

Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data

high SYNTAX Score? I Sheiban Division of Cardiology Interventional Card. University of Turin Turin / Italy

2010, Metzler Helfried

Pathology of percutaneous interventions (PCI) in coronary arteries. Allard van der Wal, MD.PhD; Pathologie AMC, Amsterdam, NL

Do We Need Platelet Function Assays?

Cardiovascular diseases are the leading cause of morbidity and mortality

SYNTAX score before decision making! Corrado Tamburino, MD, PhD

FFR vs icecg in Coronary Bifurcations FIESTA ClinicalTrials.gov Identifier: NCT

Part [1.0] Introduction to Development and Evaluation of Dynamic Predictions

Development of a risk score to identify patients with type 2 diabetes mellitus and multivessel coronary artery disease who can defer bypass surgery

What do the guidelines say?

Systematic reviews of prognostic studies: a meta-analytical approach

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

Technical Notes for PHC4 s Report on CABG and Valve Surgery Calendar Year 2005

Richard Grocott Mason

Direct Thrombin Inhibitors for PCI Pharmacology: Role of Bivalirudin in High-Risk PCI

COMPARE Trial Elvin Kedhi Maasstad Ziekenhuis Rotterdam The Netherlands

LM stenting - Cypher

Resolute in Bifurcation Lesions: Data from the RESOLUTE Clinical Program

UNIVERSITY OF OKLAHOMA INTERVENTIONAL CARDIOLOGY FELLOWSHIP CURRICULUM

Impact of Chronic Kidney Disease on Long-Term Outcome in Coronary Bypass Candidates Treated with Percutaneous Coronary Intervention

Myocardial revascularisation in multivessel disease: The value of risks scores

ROLE OF CORONARY PRESSURE & FFR IN MULTIVESSEL DISEASE

Journal of the American College of Cardiology Vol. 39, No. 10, by the American College of Cardiology Foundation ISSN /02/$22.

PCI for Left Anterior Descending Artery Ostial Stenosis

Inter-regional differences and outcome in unstable angina

Alex versus Xience Registry Preliminary report

Culprit Lesion Remodeling and Long-term (> 5years) Prognosis in Patients with Acute Coronary Syndrome

Model-free machine learning methods for personalized breast cancer risk prediction -SWISS PROMPT

Zheng Yao Sr. Statistical Programmer

Supplementary Table S1: Proportion of missing values presents in the original dataset

Although the prevalence and incidence of type 2 diabetes mellitus

Transcription:

Computer Models for Medical Diagnosis and Prognostication Lucila Ohno-Machado, MD, PhD Division of Biomedical Informatics Clinical pattern recognition and predictive models Evaluation of binary classifiers (calibration) Ethical implications for clinical practice Source: DOE

Computer- Based Models for Medical Diagnosis and Prognostication Lucila Ohno-Machado, MD, PhD Division of Biomedical Informatics University of California San Diego

Risk Assessment Popular risk calculators Gail Model (Breast cancer) Framingham Risk Calculator (CVD) APACHE (ICU mortality) Use of individual estimates Prophylaxis for breast cancer Cholesterol management guidelines Continuation of life support

22%

16%

APACHE II Mortality in intensive care units (ICUs) 12 physiologic predictors

Individualized Genome How many individual genotypes are needed to predict disease?

Logistic Regression p i = ( β + β x 1+ e 1 0 i i ) p=1 pi log 1 p i = β 0 + β x i i x x

Coronary Angioplasty and Stenting

Risk of death in angioplasty National average of deaths after angioplasty is 2%, which is stated in the informed consent. "Informed consent and good clinical practice require a discussion of risks and benefits Alexander et al, 52th ACC meeting Less than 10% of the patients have an estimated risk of death around 2%. Are we lying to the other 90%?

Dataset: Attributes Collected History Presentation Angiographic Procedural Operator/Lab age acute MI occluded number lesions annual volume gender primary lesion type multivessel device experience diabetes rescue (A,B1,B2,C) number stents daily volume iddm CHF class graft lesion stent types (8) lab device history CABG angina class vessel treated closure device experience baseline creatinine cardiogenic shock ostial gp 2b3a antagonists unscheduled case CRI failed CABG dissection post ESRD rotablator hyperlipidemia atherectomy angiojet max pre stenosis max post stenosis no reflow Data Source: Medical Record Clinician Derived Other Resnic et al, J Am Col Card 2001; Matheny et al, J Biomed Inf 2005

Study Population: Descriptive Statistics Development Set Validation Set Cases 2,804 1,460 Women 909 (32.4%) 433 (29.7%) Age > 74yrs 595 (21.2%) 308 (22.5%) Acute MI 250 (8.9%) 144 (9.9%) Primary 156 (5.6%) 95 (6.5%) Shock 62 (2.2%) 20 (1.4%) Class 3/4 CHF 176 (6.3%) 80 (5.5%) gp IIb/IIIa antagonist 1,005 (35.8%) 777 (53.2%) p=.066 p=.340 p=.311 p=.214 p=.058 p=.298 p<.001 Death 67 (2.4%) 24 (1.6%) Death, MI, CABG (MACE) 177 (6.3%) 96 (6.6%) p=.110 p=.739

Multivariate Models Logistic Regression Model Odds p-value Ratio Age > 74yrs 2.51 0.02 B2/C Lesion 2.12 0.05 Acute MI 2.06 0.13 Class 3/4 CHF 8.41 0.00 Left main PCI 5.93 0.03 IIb/IIIa Use 0.57 0.20 Stent Use 0.53 0.12 Cardiogenic Shock 7.53 0.00 Unstable Angina 1.70 0.17 Tachycardic 2.78 0.04 Chronic Renal Insuf. 2.58 0.06 Prognostic Risk Score Model beta coefficient Risk Value 0.921 2 0.752 1 0.724 1 2.129 4 1.779 3-0.554-1 -0.626-1 2.019 4 0.531 1 1.022 2 0.948 2 Artificial Neural Network

Logistic Regression, Score, and Neural Networks 1.00 Validation Set: 1460 Cases 0.90 0.80 0.70 Sensitivity 0.60 0.50 0.40 LR Score ann 0.30 0.20 0.10 ROC = 0.50 ROC Area LR: 0.840 Score: 0.855 ann: 0.835 0.00 0.00 0.20 0.40 0.60 0.80 1.00 1 - Specificity

Risk Score of Death Unadjusted Overall Mortality Rate = 2.1% 3000 60% Number of Cases 2500 2000 1500 1000 62% Number of Cases 26% Mortality Risk 21.5% 53.6% 50% 40% 30% 20% 12.4% 500 0 7.6% 0.4% 1.4% 2.2% 2.9% 1.6% 1.3% 0 to 2 3 to 4 5 to 6 7 to 8 9 to 10 >10 Risk Score Category 10% 0%

External Validations for CVD Models Model/Cohort AKA Year Published External Validations Framingham Risk Score FRS Dawber et al, 1951 8 1 Framingham Risk Score FRS Kannel et al, 1976 9 4 Framingham Risk Score FRS Anderson et al, 1991 2 29 Glostrup Glostrup Schroll et al, 1992 10 1 European Society of Cardiology ESC Pyorala et al, 1994 11 1 Framingham Risk Score FRS Wilson et al, 1998 12 32 Framingham Risk Score for ATP III FRS ATP III ATP III, 2001 13 5 Framingham Risk Score FRS D Agostino et al, 2001 14 9 UK Prospective Diabetes Study UKPDS Stevens et al, 2001 15 1 Framingham Point System FPS ATP III, 2002 1 2 Prospective Cardiovascular Munster Study PROCAM Assman et al, 2002 16 6 Finnish Diabetes Risk Score FINDRISC Lindstrom et al, 2003 17 6 Systematic Coronary Risk Evaluation SCORE Conroy et al, 2003 18 8 Diabetes Epidemiology: Collaborative Analysis of Diagnostic Criteria in Europe ASSessing cardiovascular risk using SIGN guidelines DECODE Balkau et al, 2004 19 1 ASSIGN SIGN, 2007 20 2 Total 108

Predicted / Observed Framingham models tested on European populations

Predicted / Observed Framingham models tested on European populations European models tested on North American populations

Questions Which model is right? True probability would be the gold-standard What is the true probability? Are the models adequate in discrimination and calibration?

Your Risk this program shows the estimated health risks of people with your same age, gender, and risk factor levels x p=1

this means that 5 of 100 people with this level of risk will have a heart attack or die

Patients like you Input space Output space people with your same age, gender, and risk factor levels me people with this level of risk

Patients like you height me gender

Patients like you

Patients like you height 1 me 0 1 gender

Patients like you height risk 1 2 0 1 me gender

Evaluation of Predictive Models Error Discrimination Area under ROC Calibration Plot of groups: observed vs expected Hosmer-Lemeshow statistic

Discrimination of Binary Outcomes Estimate and Observed outcome ( gold standard, true ) Estimate True 0.3 0 0.2 0 0.5 1 0.1 0 Classification into category 0 or 1 is based on thresholded estimates (e.g., if estimate > 0.5 then consider positive )

threshold normal Disease True Negative (TN) FN FP True Positive (TP) 0 e.g. 0.5 1.0

nl D Sens = TP/TP+FN Spec = TN/TN+FP nl TN FN PPV = TP/TP+FP D FP TP NPV = TN/TN+FN - + Accuracy = TN +TP nl D

Sensitivity = 50/50 = 1 Specificity = 40/50 = 0.8 threshold nl nl 40 D 0 40 D 10 50 60 50 50 nl disease TN TP FP 0.0 0.4 1.0

Sensitivity = 40/50 =.8 Specificity = 45/50 =.9 threshold nl nl 45 D 10 50 D 5 40 50 50 50 nl disease TN TP FN FP 0.0 0.6 1.0

Sensitivity = 30/50 =.6 Specificity = 1 threshold nl nl 50 D 20 70 D 0 30 30 50 50 nl disease TN TP FN 0.0 0.7 1.0

Threshold 0.7 Threshold 0.4 nl D nl D 40 0 10 50 50 50 40 60 1 Threshold 0.6 nl D nl D 45 10 5 40 50 50 50 50 Sensitivity ROC curve nl D nl D 50 0 20 30 70 30 0 1 - Specificity 1 50 50

1 ROC curve 0 1 - Specificity 1 Sensitivity All Thresholds

Areas Under the ROC curve concordance index measure adequacy of risk ranking (#concordant pairs + ½ #ties) /all pairs do not measure adequacy of risk estimates (collective nor individual) Discordant Pairs

Predicted / Observed Framingham models tested on European populations

Calibration D = 0 Measures how close the average estimate is to the observed proportion Goodness-of-fit Hosmer- Lemeshow statistics

Calibration D = 0 D = 1 HL = 1 3 D= 0 l= 1 π Dl o Dl π Dl 2

Risk Score of Death Unadjusted Overall Mortality Rate = 2.1% 3000 60% Number of Cases 2500 2000 1500 1000 62% Number of Cases 26% Mortality Risk 21.5% 53.6% 50% 40% 30% 20% 12.4% 500 0 7.6% 0.4% 1.4% 2.2% 2.9% 1.6% 1.3% 0 to 2 3 to 4 5 to 6 7 to 8 9 to 10 >10 Risk Score Category 10% 0%

Interventional Cardiology Models Validation 5278 patients from BWH (2001-2004) (external validation set) Comparisons use Areas under the ROC curve (AUC) and the Hosmer-Lemeshow goodness-of-fit statistic (deciles)

Calibration Are predictions obtained from external models good for individual counseling?

APACHE II Mortality in intensive care units (ICUs) 12 physiologic predictors

Summary of all comparison studies in terms of discrimination (AUC)

Summary of HL-GOF H and C statistics. X ² values and degrees of freedom are listed where available, p values are listed otherwise.

Standardized mortality ratio in different study comparisons

3 1.4 x2 1.2 1 0.8 0.6 0.4 0.2 0-0.5-0.2 0 0.5 1 1.5 Cluster 1, Y=0 Cluster 1, Y=1 Cluster 2, Y=0 Cluster 2, Y=1 Cluster 3, Y=0 Cluster 3, Y=1 Cluster 4, Y=0 Cluster 4, Y=1-0.4 x1 Simulated data set Clusters 1 to 4 centered on (0,0), (0,1), (1,0), and (1,1), with Gaussian noise True probability for clusters 1 to 4: 0.01, 0.40, 0.60, and 0.99

3 2 1.4 1.2 4 1.4 1.2 LR x2 1 0.8 0.6 0.4 1 0.2 3 0-0.5-0.2 0 0.5 1 1.5 Cluster 1, Y=0 Cluster 1, Y=1 Cluster 2, Y=0 Cluster 2, Y=1 Cluster 3, Y=0 Cluster 3, Y=1 Cluster 4, Y=0 Cluster 4, Y=1 x2 1 0.8 0.6 0.4 0.2 0-0.5-0.2 0 0.5 1 1.5-0.4 x1-0.4 x1 Simulated data set Clusters 1 to 4 centered on (0,0), (0,1), (1,0), and (1,1), with Gaussian noise True probability for clusters 1 to 4: 0.01, 0.40, 0.60, and 0.99

x2 1.4 1.2 1 ANN 0.8 0.6 0.4 0.2 0-0.5-0.2 0 0.5 1 1.5 Cluster 1, Y=0 Cluster 1, Y=1 Cluster 2, Y=0 Cluster 2, Y=1 Cluster 3, Y=0 Cluster 3, Y=1 Cluster 4, Y=0 Cluster 4, Y=1-0.4 x1

40 Calibration Plot LR Neural Network 35 Sum of squared errors 52.363 50.894 30 Mean squared error 0.130 0.127 Expected 25 20 15 Neural Network Logistic Regression Cross-entropy error 154.543 150.838 Mean cross-entropy error 0.386 0.377 Sum of residuals 103.226 100.412 10 Mean residual 0.2580 0.251 5 AUC 0.889 0.895 0 0 10 20 30 40 Observed HL-C 6.437 11.773 p 0.598 0.161 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 Proportion of events Average estimate Neural Net Average estimate Logistic Reg 1 2 3 4 Clusters LR Neural Network Cluster 2 min (GS: 0.4).20.43 Cluster 2 max.80.58 Cluster 3 min (GS: 0.6).29.65 Cluster 3 max.85.73

Genotype genome transcription RNA transcriptome Source: DOE Protein translation proteome Phenotype physical exam, imaging Physiology tests Metabolites laboratory

Will we ever achieve individualized risk assessment? If so, how can we evaluate it? Acknowledgments NIH, Komen Foundation Fred Resnic, Michael Matheny