An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

Similar documents
NIH Public Access Author Manuscript Comput Cardiol (2010). Author manuscript; available in PMC 2014 March 25.

Predictive Models for Healthcare Analytics

INTRODUCTION TO MACHINE LEARNING. Decision tree learning

TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS)

The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care

In-hospital Intensive Care Unit Mortality Prediction Model

Sequential Modeling for Mortality Prediction in the ICU. Tejas G. Sundaresan

Predicting Breast Cancer Survival Using Treatment and Patient Factors

IDENTIFYING SEPSIS IN THE PREHOSPITAL SETTING

Postanalytical phase

A new measure of acute physiological derangement for patients with exacerbations of obstructive airways disease: The COPD and Asthma Physiology Score

Statistics, Probability and Diagnostic Medicine

An Improved Algorithm To Predict Recurrence Of Breast Cancer

Supplementary materials

Critical care resources are often provided to the too well and as well as. to the too sick. The former include the patients admitted to an ICU

Screening (Diagnostic Tests) Shaker Salarilak

During the past two decades,

An Introduction to ROC curves. Mark Whitehorn. Mark Whitehorn

Chemistry Reference Ranges and Critical Values

Chemistry Reference Ranges and Critical Values

Web Only Supplement. etable 1. Comorbidity Attributes. Comorbidity Attributes Hypertension a

Prof. Dr. Iman Riad Mohamed Abdel Aal

Review. Imagine the following table being obtained as a random. Decision Test Diseased Not Diseased Positive TP FP Negative FN TN

Prediction of mortality by pediatric risk of mortality (PRISM III) score in teriary care rural hospital in India

Zheng Yao Sr. Statistical Programmer

Early lactate clearance rate is an indicator of Outcome in severe sepsis and septic shock

Written Guidelines for Laboratory Testing in Intensive Care - Still Effective After 3 Years

Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data

Tables of Normal Values (As of February 2005)

FAILURE OF NONINVASIVE VENTILATION FOR DE NOVO ACUTE HYPOXEMIC RESPIRATORY FAILURE: ROLE OF TIDAL VOLUME

SEPTIC SHOCK PREDICTION FOR PATIENTS WITH MISSING DATA. Joyce C Ho, Cheng Lee, Joydeep Ghosh University of Texas at Austin

Unsupervised Pattern Discovery in Sparsely Sampled Clinical Time Series

Computer Models for Medical Diagnosis and Prognostication

Sepsis 3 & Early Identification. Disclosures. Objectives 9/19/2016. David Carlbom, MD Medical Director, HMC Sepsis Program

Diagnostic tests, Laboratory tests

Ryan Salter, Michael Bailey, Rinaldo Bellomo, Glenn Eastwood, Niklas Nielsen, David Pilcher, Alistair Nichol, Manoj Saxena, Yahya Shehabi, Paul Young

Delta Check Calculation Guide

The Good News. More storage capacity allows information to be saved Economic and social forces creating more aggregation of data

Receiver operating characteristic

Appendix. Supplementary figures and tables

Prediction of micrornas and their targets

Comparison of Different Scoring System in Predicting the Severity and Prognosis of Acute Pancreatitis

MASTER S THESIS. Ali El Adi. Deep neural networks to forecast cardiac and respiratory deterioration of intensive care patients

Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes

Who should get a Biomarker Assessment? A focus on Biomarkers you may have at your hospital and risk scores

An Empirical and Formal Analysis of Decision Trees for Ranking

Organ Data. Chapter 5

Biomarkers for ARDS not so simple. John Laffey. Critical Illness and Injury Research Centre St Michael s Hospital, University of Toronto, CANADA

Protein Structure & Function. University, Indianapolis, USA 3 Department of Molecular Medicine, University of South Florida, Tampa, USA

Clinical Significance of Plasma Ammonia in Patients with Generalized Convulsion

OXYGENATION AND ACID- BASE EVALUATION. Chapter 1

Supplementary Online Content

EXTRACORPOREAL MEMBRANE OXYGENATION (ECMO) REFERRAL FORM

Supplementary Online Content

Biomarker adaptive designs in clinical trials

Individual Study Table Referring to Part of the Dossier. Use only) Name of Finished Product:

Sepsis: Mitigating Denials Amid Definition Disparity

Understanding Blood Tests

Predicting Breast Cancer Survivability Rates

Staging Sepsis for the Emergency Department: Physician

OUTCOME PREDICTION IS IMportant

Copyright 2016 Aftab Hassan

Data-Mining-Based Coronary Heart Disease Risk Prediction Model Using Fuzzy Logic and Decision Tree

Module Overview. What is a Marker? Part 1 Overview

Comparing Multifunctionality and Association Information when Classifying Oncogenes and Tumor Suppressor Genes

Identifikation von Risikofaktoren in der koronaren Herzchirurgie

Admissions with severe sepsis in adult, general critical care units in England, Wales and Northern Ireland

Predicting Intensive Care Unit (ICU) Length of Stay (LOS) Via Support Vector Machine (SVM) Regression

Schedule of Accreditation issued by United Kingdom Accreditation Service 2 Pine Trees, Chertsey Lane, Staines-upon-Thames, TW18 3HR, UK

3 papers from ED. counting sepsis sepsis 3 wet or dry?

Ammonia level at admission predicts in-hospital mortality for patients with alcoholic hepatitis

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India

Comparison of VACUETTE Heparin Gel Tubes for Common Chemistry Analytes

PREDICTION OF BREAST CANCER USING STACKING ENSEMBLE APPROACH

Modelling and Application of Logistic Regression and Artificial Neural Networks Models

Acute Stroke with Alteplase Administration Order Set

MEDICAL SCORING FOR BREAST CANCER RECURRENCE. Nurul Husna bt Jamian UiTM (Perak), Tapah Campus

A Predictive Chronological Model of Multiple Clinical Observations T R A V I S G O O D W I N A N D S A N D A M. H A R A B A G I U

Improved Hepatic Fibrosis Grading Using Point Shear Wave Elastography and Machine Learning

Diagnosis of Breast Cancer Using Ensemble of Data Mining Classification Methods

NEW RCPCH REFERENCE RANGES-

The Impact of a Mouthwash Program on the Risk of Nosocomial Pneumonia at the ICU

Supplemental Information

Mid Infrared Milk Testing for Evaluation of Health Status in Dairy Cows METABOLIC STATUS. Use of Biomarkers for Negative Energy Balance and Ketosis

Sepsis Wave II Webinar Series. Sepsis Reassessment

Supplementary Table 1. Criteria for selection of normal control individuals among healthy volunteers

SUPPLEMENTARY DATA. Supplementary Table S1. Clinical characteristics of the study subjects.*

Derivative-Free Optimization for Hyper-Parameter Tuning in Machine Learning Problems

Complete Medical History

Supplementary Appendix

The NEPHROCHECK Test System Training Program. The NEPHROCHECK Training Program US: Astute Medical, Inc PN Rev D 2014/09/16

Schedule of Accreditation issued by United Kingdom Accreditation Service 2 Pine Trees, Chertsey Lane, Staines-upon-Thames, TW18 3HR, UK

BMI 541/699 Lecture 16

Medicine 2.0: Using Machine Learning to Transform Medical Practice and Discovery

Introduction to diagnostic accuracy meta-analysis. Yemisi Takwoingi October 2015

Routine Clinic Lab Studies

Demographics, Microbiology and Outcome in Necrotizing Soft Tissue Infections

BREAST CANCER EPIDEMIOLOGY MODEL:

Med Chem 535P ~ Diagnostic Medicinal Chemistry. General Comments

Transcription:

An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework Soumya GHOSE, Jhimli MITRA 1, Sankalp KHANNA 1 and Jason DOWLING 1 1. The Australian e-health and Research Centre, Commonwealth Scientific and Industrial Research Organization (CSIRO ), Australia HEALTH AND BIO-SECURITY FLAGSHIP

Critical Care Statistics Australia/New Zealand Source : ANZICS CORE Annual Report 2012 2013 2 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

ICU Risk/Mortality Prediction System SAPS II is a severity of disease classification system (Le Gall, Lemeshow, Saulnier, 1993). Its name stands for "Simplified Acute Physiology Score", and is one of several ICU scoring systems. SAPS II was designed to measure the severity of disease for patients admitted to Intensive care units aged 15 or more. 24 hours after admission to the ICU, the measurement has been completed and resulted in an integer point score between 0 and 163 and a predicted mortality between 0% and 100%. No new score can be calculated during the stay. If a patient is discharged from the ICU and readmitted, a new SAPS II score can be calculated. The parameters are: Age, Heart Rate, Systolic Blood Pressure, Temperature, Glasgow Coma Scale, Mechanical Ventilation or CPAP, PaO2, FiO2, Urine Output, Blood Urea Nitrogen, Sodium, Potassium, Bicarbonate, Bilirubin, White Blood Cell, Chronic diseases, and Type of admission 3 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

Critical Care Statistics Australia/New Zealand Source : ANZICS CORE Annual Report 2012 2013 4 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

Physionet Challenge In 2012, an academic challenge, Physionet [1], prompted several research attempts to model and predict the risk of inpatient mortality of ICU patients. (Public dataset available online). Parameters : Blood Pressure - Invasive (diastolic, mean, systolic), Blood Pressure - Non-invasive (diastolic), Blood Pressure - Non-invasive (mean), Blood Pressure - Noninvasive (systolic), Albumin, Alkaline phosphate, Alkaline transaminase, Aspartate transaminase, Bilirubin, Blood urea nitrogen, Cholesterol, Creatinine, Fractional inspired oxygen, Glasgow Coma Score, Glucose, Serum bicarbonate, Hematocrit, Heart rate Serum potassium, Lactate, Serum magnesium, Mechanical ventilation, Serum sodium PaCo2, PaO2, ph, Platelets, Respiration rate, SaO2, Temperature, Troponin-I, Troponin- T, Urine output, WBC, and Weight. Snapshot : Five static variables and thirty-seven time series variables (recorded for vital signs) analysed over a period of 48 hours. (Not all variables were recorded for all patients and not all recorded variables were sampled in equal interval.) The dataset comprised of information related to 4000 ICU stays of adult patients who were admitted to cardiac, medical, surgical and trauma ICUs. SAPS score was provided for baseline comparison. [1] Physionet Challeng, http://physionet.org/challenge/2012/, accessed on [14th March, 2015]. 5 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

ICU Risk/Mortality Prediction Machine Learning Training Data Feature Extraction Supervised Learning Prediction Model Final Results Test Feature Data Extraction Training Data Physionet Challenge Feature Extraction Gradient Features Supervised Learning Random Forest Prediction Model 6 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

ICU Risk/Mortality Prediction Feature Extraction Watch the gradient Extracted Features : - Data for every time series was further sampled at every second hour for the entire 48-hour period. The mean, maximum, minimum values and the standard deviation of each of these intervals within the 48-hour period, for each time series attribute and the static variables were concatenated to create the feature vector for a particular patient. 7 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

8 ICU Risk/Mortality Prediction Random Forest 4 9 1 2 3 5 v < leaf nodes split nodes 6 7 10 11 12 13 < ICCV 2009 tutuorial 14 15 16 17 category c 8 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

ICU Risk/Mortality Prediction Random Forest Try several lines, chosen at random Keep line that best separates data information gain Recurse y feature vectors are x, y coordinates: v = [x, y] T split functions are lines with parameters a, b: f n (v) = ax + by threshold determines intercepts: t n four classes: purple, blue, red, green x 9 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

ICU Risk/Mortality Prediction Random Forest Try several lines, chosen at random y Keep line that best separates data information gain Recurse x 10 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework 10

ICU Risk/Mortality Prediction Random Forest Try several lines, chosen at random y Keep line that best separates data information gain Recurse x 11 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework 11

Physionet Challenge - Validation Dataset size 4000 patients with outcome. A ten-fold cross-validation was employed and common evaluation metrics like the true positive rate (TPR, or Sensitivity), false positive rate (FPR), positive predictive value (PPV), negative predictive value (NPV) and accuracy were computed to evaluate the performance of the classifier. Receiver Operating Characteristic (ROC) curve analysis was employed to measure the performance of the models, with the c-statistic (or AUC), representing the area under the ROC curve, also used as a measure of discrimination and model performance. Ten fold cross validation implies out of 4000 patients the model was trained with 3600 patients data and validated with 400 patients. Physionet Challeng, http://physionet.org/challenge/2012/, accessed on [14th March, 2015]. 12 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

Physionet Challenge - Results True Positive Rate (TPR or Sensitivity)= False Positive Rate (FPR) = Positive Predictive Value (PPV) = Negative Predictive Value (NPV) = Accuracy = TP TN TP Random guessing score :- 0.139 SAPS score for the dataset :- 0.296 TP/( TP FN ) FP /( FP TN) TP/( TP FP) TN /( TN FN ) /( TN FP FN ) TPR FPR PPV NPV Accuracy Random Forest 0.78 0.5 0.8 0.87 0.87 Physionet Challeng, http://physionet.org/challenge/2012/, accessed on [14th March, 2015]. 13 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

Physionet Challenge - Results Physionet challenge score = minimum (TPR, PPV) Model Score Logistic regression 0.44 Cluster analysis 0.39 SVM 0.71 Bayesian Ensemble 0.30 Random forest (proposed) 0.78 Physionet Challeng, http://physionet.org/challenge/2012/, accessed on [14th March, 2015]. 14 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

Physionet Challenge - Results Mean AUC = 0.79 (10 fold cross-validation) Physionet Challeng, http://physionet.org/challenge/2012/, accessed on [14th March, 2015]. 15 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

Conclusions An automatic ICU mortality risk prediction system has been proposed with a random forest classifier. The method outperforms the traditional SAPS-1 scoring method often used in hospitals. The prediction model validated with a large dataset. It performs better than some of the state-of-theart predictive models like logistic regression and SVM 16 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

Thank You. Questions? 17 An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework