Detection of Informative Disjunctive Patterns in Support of Clinical

Similar documents
T-Cube Web Interface for Real-time Biosurveillance in Sri Lanka and Tamil Nadu

CDR Kimberly Elenberg, Office of Food Defense and Emergency Response, Food Safety Inspection Service, United States Department of Agriculture

Chapter 1. Introduction

To review examples where joint analyses of food safety

Detecting Anomalous Patterns of Care Using Health Insurance Claims

Predictive Analytics for Retention in HIV Care

ABSTRACT I. INTRODUCTION. Mohd Thousif Ahemad TSKC Faculty Nagarjuna Govt. College(A) Nalgonda, Telangana, India

Interactive Analysis of Multidimensional Data. for Adverse Event Detection

Pythia WEB ENABLED TIMED INFLUENCE NET MODELING TOOL SAL. Lee W. Wagenhals Alexander H. Levis

Machine Learning and Event Detection for the Public Good

Does Machine Learning. In a Learning Health System?

AN INFORMATION VISUALIZATION APPROACH TO CLASSIFICATION AND ASSESSMENT OF DIABETES RISK IN PRIMARY CARE

Visual and Decision Informatics (CVDI)

Predicting Breast Cancer Survival Using Treatment and Patient Factors

A Deep Learning Approach to Identify Diabetes

CS 4365: Artificial Intelligence Recap. Vibhav Gogate

Recent advances in non-experimental comparison group designs

Statistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes.

How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection

TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS)

Statement of research interest

A Comparison of Collaborative Filtering Methods for Medication Reconciliation

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Discovering Meaningful Cut-points to Predict High HbA1c Variation

The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care


Key points and Q&A. A framework for malaria elimination (2017) Global Malaria Programme. Rationale for updated guidance.

Applied Machine Learning, Lecture 11: Ethical and legal considerations; domain effects and domain adaptation

T. R. Golub, D. K. Slonim & Others 1999

CHINESE INFORMATION SYSTEM FOR INFECTIOUS DISEASES CONTROL AND PREVENTION. Center for Public Health Surveillance and Information Services, China CDC

Youth Using Behavioral Health Services. Making the Transition from the Child to Adult System

A Survey on Prediction of Diabetes Using Data Mining Technique

Data Mining in Bioinformatics Day 7: Clustering in Bioinformatics

A Framework for Utilizing Lab Test Results for Clinical Prediction of ICU Patients

Infectious Diseases New York City Department of Health and Mental Hygiene, Division of Disease Control (Bureau of HIV/AIDS Prevention and Control)

An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

GLYCEMIC CONTROL SURVEY

In-hospital Intensive Care Unit Mortality Prediction Model

OHSU. Update in Sepsis

On the Combination of Collaborative and Item-based Filtering

Text mining for lung cancer cases over large patient admission data. David Martinez, Lawrence Cavedon, Zaf Alam, Christopher Bain, Karin Verspoor

Predicting the Effect of Diabetes on Kidney using Classification in Tanagra

Quicker Diagnostic Testing of Cardiac Conditions in the ER

Improving Type 2 Diabetes Therapy Adherence and Persistence in the United States

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

Lecture #4: Overabundance Analysis and Class Discovery

Uses of the NIH Collaboratory Distributed Research Network

Lecture 2: Foundations of Concept Learning

Approved Care Model for Project 3gi: Integration of Palliative Care into the PCMH Model

Following the health of half a million participants

Scientific Journal of Informatics Vol. 3, No. 2, November p-issn e-issn

Social Determinants of Health

Automated Medical Diagnosis using K-Nearest Neighbor Classification

In-House Chronic Pain and Opioid Use Reporting Guide

CS148 - Building Intelligent Robots Lecture 5: Autonomus Control Architectures. Instructor: Chad Jenkins (cjenkins)

ANALYSIS AND DETECTION OF BRAIN TUMOUR USING IMAGE PROCESSING TECHNIQUES

Knowledge Discovery and Data Mining. Testing. Performance Measures. Notes. Lecture 15 - ROC, AUC & Lift. Tom Kelsey. Notes

4. Model evaluation & selection

Addiction and Dependence

Predicting Potential Domestic Violence Re-offenders Using Machine Learning. Rajhas Balaraman Supervisor : Dr. Timothy Graham

Artificial Intelligence For Homeopathic Remedy Selection

PubH 7405: REGRESSION ANALYSIS. Propensity Score

Disulphide Connectivity Prediction in Proteins Based on Secondary Structures and Cysteine Separation

Statistical modeling for prospective surveillance: paradigm, approach, and methods

Learning Classifier Systems (LCS/XCSF)

DETERMINING IBD TRIGGER FOODS USING MACHINE LEARNING AND PYTHON

International Journal of Pharma and Bio Sciences A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS ABSTRACT

Innovative Risk and Quality Solutions for Value-Based Care. Company Overview

Predicting the Onset of Tachycardia for Patients in Intensive Care Units

Not all NLP is Created Equal:

Utilizing Posterior Probability for Race-composite Age Estimation

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017

Assignment 2: Experimental Design

Identifying Deviations from Usual Medical Care using a Statistical Approach

American Addiction Centers Outcomes Study Long-Term Outcomes Among Residential Addiction Treatment Clients. Centerstone Research Institute

Improving Type 2 Diabetes Therapy Compliance and Persistence in Brazil

Best Practice Models for Opioid Treatment Programs Motivated Stepped Care: An Adaptive Treatment Approach. Van L. King, MD

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

Data complexity measures for analyzing the effect of SMOTE over microarrays

The Regression-Discontinuity Design

Improving Type 2 Diabetes Therapy Adherence and Persistence in Mexico

Managing Diabetes and Hyperglycemia Safely in the Complex Hospital Setting

Lessons Learned from Meaningful Use Implementation. Cynthia L. Bero, MPH Partners HealthCare May 19, 2011

Reveal Relationships in Categorical Data

Part 1: Job Details and Approval

Predicting Kidney Cancer Survival from Genomic Data

Progress Report: Health Care Quality Indicators Project. Meeting of OECD Health Data National Correspondents, 8 October, 2009, Paris

Data Mining Approaches for Diabetes using Feature selection

Using Bayesian Networks to Analyze Expression Data. Xu Siwei, s Muhammad Ali Faisal, s Tejal Joshi, s

Five Mistakes and Omissions That Increase Your Risk of Workplace Violence

Conditional Outlier Detection for Clinical Alerting

Framework for Defining Evidentiary Standards for Biomarker Qualification: Minimal Residual Disease (MRD) in Multiple Myeloma (MM)

Multiclass Classification of Cervical Cancer Tissues by Hidden Markov Model

Final Project Report Sean Fischer CS229 Introduction

37 th ANNUAL JP MORGAN HEALTHCARE CONFERENCE

Does it Matter How You Ask? Question Wording and Males' Reporting of Contraceptive Use at Last Sex

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

COLLOIDAL SILVER: WHERE DOES IT GO WHEN YOU DRINK IT? HOW LONG DOES IT STAY THERE?

Assigning B cell Maturity in Pediatric Leukemia Gabi Fragiadakis 1, Jamie Irvine 2 1 Microbiology and Immunology, 2 Computer Science

Increasing Dialysis Patients Physical Activity and Well-Being

Transcription:

Detection of Informative Disjunctive Patterns in Support of Clinical Artur Informatics Dubrawski, Ph.D. M.Eng. Maheshkumar (Robin) Sabhnani, M.Sc. The Auton Lab School of Computer Science Carnegie Mellon University www.autonlab.org awd@cs.cmu.edu

Detection of Informative Disjunctive Patterns in Support of Clinical Informatics 1. Brief introduction 2. Patterns in Clinical Data and their representation 3. Example applications: A. Discovering patterns of care correlated with increased probability of readmission to an Intensive Care Unit B. Microarray analysis in cancer research C. Over-densities in outpatient diagnoses 4. Summary Slide 2 of 24

Carnegie Mellon University is in Pittsburgh, PA, USA Slide 3 of 24

CMU Auton Lab: Research and applications Central topic of our research: scalable, self-adaptive analytic systems with real life impact Working on 1+ sponsored projects. Current and past funding from NSF, DARPA, DHS, HSARPA, ONR, AFRL, NASA, USDA, FDA, CDC, a few Fortune 1 companies, and a number smaller corporate & academic sponsors and partners Deliverables Algorithms for fast and scalable statistical machine ~2 people: 2 regular+3 affiliated learning and analytics faculty, 1 post-doc, 6 analysts and Software for embedding in programmers, 6 PhD students; production systems plus a few interns; led by Artur Software available for Dubrawski and Jeff Schneider download at www.autonlab.org Slide 4 of 24

Interactiveanalytics Astrophysics Learning Locomotion Saving sea turtles x Bio-surveillance UnitedNations CTBTO Foodsafety Fleetprognostics Nuclear threatdetection Slide 5 of 24 Safety of agriculture

Patterns in Clinical Data A pattern in clinical care may be e.g. predictive of a specific outcome: If ( X is followed ) then Prob( Outcome = )=P One goal of Clinical Informatics is to automate searching through historical records of treatment for such predictive patterns Solutions typically employ any subset of: Data Mining, Machine Learning, Probability, Statistics The key tradeoffs: Slide 6 of 24

Patterns in Clinical Data If ( X is followed ) then Prob( Outcome = ) = P One representation of X: X : ( A = ai AND B = bk ) where A, B attributes of data, and ai, bk their values Example: The patient received 12 doses of heparin throughout their stay at ICU (A= 1-12 doses ) and their stay lasted at most 5 but more than 3 days (B= (3,5] ), and A and B represent dosage of heparin and length of stay, respectively That statement X was conjunctive A conjunctive-disjunctive statement would look Slide 7 of 24

Motivation of this talk Many current approaches to Clinical Data Analysis rely on conjunctive statements to represent relationships between multiple dimensions of data This talk argues for conjunctive-disjunctive representation of patterns as a more powerful alternative Our proposed framework develops models that are conjunctive with respect to features of data and disjunctive with respect to their values (we assume the features are discrete) Slide 8 of 24

Example application: ICU readmissions Problem statement: ICU readmissions are undesirable A readmission occurs if a recently discharged ICU patients returns within 48 hours (common definition) Research question: Can the records of treatments help identify who of the current patients are at the elevated risk of readmission? Plausible solution: Mine available data for combinations of features and values that correlate with increased readmission rates observed among past patients Slide 9 of 24

Our data and Approach 29,441 admissions to ICUs 189 (.64%) resulted in readmission Available features of data (more than 9, overall): Demographics: age, gender 64 Medications: Average volume of medication per day when it was administered (numeric) 5,34 IOs: Average volume of fluid intake/outlet per hour during ICU stay (numeric) 4,133 Vitals: was the particular vital measured during the stay (binary) Approach: Use this data to train classifiers of the binary outcome: {ReadmissionOccurred, NoReadmission} Slide 1 of 24

Predicting readmissions with Decision Trees Lift = Rate of readmissions in the selected pattern / Prior rate of readmissions (=.64%) It reflects predictive utility of a pattern. We are interested in Lifts > 1. 3 Decision Tree 2.5 ift L 2 1.5 1.5.2.4.6.8 Proportion of positives explained Slide 11 of 24 1 The decision tree was able to reliably identify 3 conjunctive patterns of varying utilities (lifts) and data coverages (recalls)

Predicting readmissions with Decision Trees Lift = Rate of readmissions in the selected pattern / Prior rate of readmissions (=.64%) It reflects predictive utility of a pattern. We are interested in Lifts > 1. 3 Health Care Proxy = D5W 25ml + 5mcgkgmin Nitroprusside >= 6.583 Decision Tree #positives : 7 #negatives : 434 Lift : 2.52 2.5 ift L 2 1.5 Health Care Proxy = 1 Albumin 25% >= 12.5 #positives : 4 #negatives : 292.5 Lift : 2.14 Health Care Proxy = 1 Po Intake < 15.2 #positives : 47 #negatives : 4628 Lift : 1.59.2.4.6.8 Proportion of positives explained Slide 12 of 24 1 The decision tree was able to reliably identify 3 conjunctive patterns of varying utilities (lifts) and data coverages (recalls)

A relatively simple disjunctive algorithm The Subset Search algorithm: Iterate through all features in data, picking one at a time: 1. 2. 3. 4. If the feature is numeric, discretize it, keeping distribution of positive examples in data uniform Find a subset of values of the feature that maximizes the lift and selects the minimum count of positive examples in data Check the usefulness of this set of values: It must increase the combined lift of the developed conjunctive-disjunctive pattern, if it were added to it, by the amount greater than some threshold 5. If it is useful, add it to the learned pattern 6. If the resultant lift is sufficiently high, terminate algorithm 7. 8. Remove the current feature from consideration and recursively execute the algorithm picking another feature Return the current feature to the set of selectable features and continue The Subset Search algorithm relatively quickly learns from data predictive rules that have the conjunctive-disjunctive form Slide 13 of 24

Predicting readmissions with Subset Search Lift = Rate of readmissions in the selected pattern / Prior rate of readmissions (=.64%) It reflects predictive utility of a pattern. We25are interested in Lifts > 1. 2 ift L 15 1 5 Subset Search Decision Tree Using varying sets of parameters, the Subset Search algorithm found a few patterns of superior utility to those identified.2.4.6.8 1 by the decision Proportion of positives explained tree Slide 14 of 24

Predicting readmissions with Subset Search Lift = Rate of readmissions in the selected Propofol : [.], (265., Age : (., 86.]pattern 348.75], (535.25, 595.], Pre-Admission / Prior rate of readmissions (=.64%)Intake : [, (895., 2267.5] 665], (14, 3], [5,.9% Normal Saline 1ml : 55] It reflects predictive utility of a pattern. (., 1.] Sterile Water 1ml : Cath Lab Intake : [., [.], (66.67, 68.75] We are interested in Lifts > 1.: [., 8.83], 285.] D5W 25 Pre-admission Output : [, (1.88, 357.14] Subset Search 46], (685, 7] #positives : 139 Decision Tree #positives : 96 #negatives : 1625 2 #negatives : 9292 Lift : 1.33 Lift : 1.6 15 ift L Morphine Sulfate : [., 3.] varying OR CrystalloidUsing : [., 36.] PE 2 FS PO : sets [.,of1.] Urine Out Condom Cath : [., parameters, the 25.91] #positives : 189 Subset Search #negatives : 27327 algorithm found Lift : 1.7 1 5 a few patterns of superior utility to those identified.2.4.6.8 1 by the decision Proportion of positives explained tree Slide 15 of 24

Predicting readmissions with Subset Search ift L TF residual : (19.5, 26.39], (39.65, 5.72] Lift = Rate of readmissions in the selected pattern Pre-admission intake :. Priorbolus rate:of(., readmissions (=.64%) Free /water 237.86] Urine out forey : [., 129.69] #positives 11 It reflects: predictive utility of a pattern. #negatives : 57 Lift :are 23 interested in Lifts > 1..9% normal saline (1ml) + 1Uhr Insulin : We25 (2.54, 2.74], (3.2,3.4], (5.1, 9.35] Subset Search (12.67, 222.2] Insulin : [,69], DSW 25ml + 25Uhr Heparin : [., 7.68] Decision Tree 2 D5W : [., 8.79], (1., 1.59] #positives : 2 #negatives : 445 15 Lift : 6.72 1 5 Using varying sets of D5W : (., 8.29], (16.37, 357.14] parameters, the OR Crystalloid : Search [., Subset 2.] algorithm found a D5W 25ml + 4mcgkgmin Levophed-k : (., 4.75] few patterns of Urine Out Void : [., 23.] superior utility to #positives : 39 those identified #negatives : 1747.2.4.6.8 1 Lift : 3.28 by the decision Proportion of positives explained tree Slide 16 of 24

So, what has just happened? We have seen how much benefit can be attained by relaxing constraints about the form of the patterns in Clinical Data Our Subset Search algorithm was able to find patterns of substantially greater utility than the simpler conjunctive scheme used by the decision tree But what are the down-sides? Greater complexity of the resulting patterns carries the greater risk of over-fitting (we need to carefully test for that) Computation costs to learn models from data may be higher than for the alternative conjunctive models However, if we are brave enough, we may consider aiming at an even more formidable task: Learning OVERLAPPING conjunctive-disjunctive patterns Slide 17 of 24

Example application: Microarray analysis Problem statement: In cancer research we often want to identify bi-clusters of genes and samples with elevated activation levels They can help design effective therapies samples Research challenge: The bi-clusters may overlap, with additive effects on activation s n e g Standard solution: Iteratively find the best bicluster, remove its effect from data, repeat It is often sub-optimal Slide 18 of 24

Our data and Approach Two real-world tumor data sets: 1. Breast cancer data (13,666 genes, 117 samples) 2. Lung cancer data (12,625 genes, 125 samples) Competing algorithms:. Large Average Submatrix algorithm (LAS) [Shabalin et al. Finding large average submatrices in high dimensional data. Annals of Statistics, 3(3):985 112, 29]. Find the best fitting cluster in data via hill climbing optimization in feature-value space, using random restarts to avoid local maxima. Remove the effect of the best fitting cluster from data and repeat. Disjunctive Anomaly Detection algorithm (DAD) [Sabhnani et al. Detection of disjunctive anomalous patterns in multidimensional data. Advances in Disease Surveillance, 29.].. Iteratively find the next best fitting cluster in data using round-robin hill climbing optimization with random restarts Each time the new cluster is found, refit all clusters added so far using non-negative leastsquare regression, then continue Slide 19 of 24

Detecting bi-clusters in cancer microarrays -2.5E+6 LAS DAD LAS-alpha -2.1E+6-2.15E+6-2.2E+6 d h e lik g o L Breast tumor data -2.25E+6 As we hoped for, the algorithm designed to handle overlapping patterns did better -2.3E+6 5 1 15 2 25 3 35 4 K -2.5E+6 LAS DAD Much better LAS-alpha The level of accuracy of disjunctive representation that -2.1E+6-2.15E+6-2.2E+6 LAS algorithm achieves using 4 clusters, Lung tumor data d h e lik g o L -2.25E+6-2.3E+6 5 1 15 2 25 3 35 4 DAD accomplishes using fewer than 15 clusters K Slide 2 of 24

Where else do we see the need for models capable of representing multiple disjunctive overlapping patterns? In Sri Lanka! Weekly Epidemiological Reports show an increase in disease rates in Gampaha in summer 28 25 gampaha 2 ts n u o C 15 1 5 6/1 6/8 6/15 6/22 6/29 7/6 7/13 7/2 7/27 8/3 8/1 8/17 8/24 8/31 Slide 21 of 24

Where else do we see the need for models capable of representing multiple disjunctive overlapping patterns? In Sri Lanka! Weekly Epidemiological Reports show an increase in disease rates in Gampaha in summer 28 2 food poisoning in gampaha leptospirosis in gampaha 15 ts n u o C More details in an upcoming Auton Lab paper by Sabhnani et al. Apparently, it did not involve just one escalating disease pattern: 25 gampaha 1 5 6/1 6/8 6/15 6/22 6/29 7/6 7/13 7/2 7/27 8/3 8/1 8/17 8/24 8/31 Slide 22 of 24

Summary Disjunctive patterns are present in many types of data encountered in Clinical Informatics Ability to discover them can be quite useful in a range of applications We have examined in-patient and out-patient data, as well as clinical study data There is a need and room for new computational approaches to handle such patterns Current, popular algorithms are predominantly conjunctive, and/or use a greedy search-then-eliminate strategy In our preliminary comparative studies, we have observed improved accuracies and/or lower complexities of the disjunctive models when compared against well performing competitors Challenges to overcome before moving fast-forward remain Computational scalability is a formidable one Slide 23 of 24 This material is based upon work supported by the National Science Foundation, grant NSF IIS-

Summary Disjunctive patterns are present in many types of data encountered in Clinical Informatics Ability to discover them can be quite useful in a range of applications We have examined in-patient and out-patient data, as well as clinical study data There is a need and room for new computational approaches to handle such patterns Current, popular algorithms are predominantly conjunctive, and/or a greedy search-theneliminate strategy In our preliminary comparative studies, we have observed improved accuracies and/or lower complexities of the disjunctive models when compared against well performing competitors Challenges to overcome before moving fast-forward remain Computational scalability is a formidable one By the way, if you are interested in real-time bio-surveillance place in Sri Lanka, take a look at this movie: http://www.autonlab.org/autonweb/library/videos.html Slide 24 of 24 taking This material is based upon work supported by the National Science Foundation, grant NSF IIS-