Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports

Similar documents
Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports

Multi-modal Patient Cohort Identification from EEG Report and Signal Data

Inferring Clinical Correlations from EEG Reports with Deep Neural Learning

arxiv: v1 [cs.lg] 4 Feb 2019

Motivation: Attention: Focusing on specific parts of the input. Inspired by neuroscience.

ELECTROENCEPHALOGRAPHIC SLOWING: A PRIMARY SOURCE OF ERROR IN AUTOMATIC SEIZURE DETECTION

Text mining for lung cancer cases over large patient admission data. David Martinez, Lawrence Cavedon, Zaf Alam, Christopher Bain, Karin Verspoor

DeepASL: Enabling Ubiquitous and Non-Intrusive Word and Sentence-Level Sign Language Translation

A Predictive Chronological Model of Multiple Clinical Observations T R A V I S G O O D W I N A N D S A N D A M. H A R A B A G I U

Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials

OPTIMIZING CHANNEL SELECTION FOR SEIZURE DETECTION

Formulating Emotion Perception as a Probabilistic Model with Application to Categorical Emotion Classification

Intelligent Machines That Act Rationally. Hang Li Toutiao AI Lab

A Probabilistic Reasoning Method for Predicting the Progression of Clinical Findings from Electronic Medical Records

Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods

Intelligent Machines That Act Rationally. Hang Li Bytedance AI Lab

Convolutional Neural Networks for Text Classification

The Impact of Belief Values on the Identification of Patient Cohorts

Extraction of Adverse Drug Effects from Clinical Records

Audiovisual to Sign Language Translator

Rumor Detection on Twitter with Tree-structured Recursive Neural Networks

Rolls,E.T. (2016) Cerebral Cortex: Principles of Operation. Oxford University Press.

Deep Learning based Information Extraction Framework on Chinese Electronic Health Records

Automatic Context-Aware Image Captioning

Artificial Neural Networks (Ref: Negnevitsky, M. Artificial Intelligence, Chapter 6)

Language to Logical Form with Neural Attention

Medical Knowledge Attention Enhanced Neural Model. for Named Entity Recognition in Chinese EMR

Learning the Fine-Grained Information Status of Discourse Entities

An Edge-Device for Accurate Seizure Detection in the IoT

PMR5406 Redes Neurais e Lógica Fuzzy. Aula 5 Alguns Exemplos

Learning Convolutional Neural Networks for Graphs

Assessing Functional Neural Connectivity as an Indicator of Cognitive Performance *

Recognizing Scenes by Simulating Implied Social Interaction Networks

Standardize and Optimize. Trials and Drug Development

Biceps Activity EMG Pattern Recognition Using Neural Networks

5th Mini-Symposium on Cognition, Decision-making and Social Function: In Memory of Kang Cheng

Direct memory access using two cues: Finding the intersection of sets in a connectionist model

Incorporation of Imaging-Based Functional Assessment Procedures into the DICOM Standard Draft version 0.1 7/27/2011

A HMM-based Pre-training Approach for Sequential Data

Sentiment Analysis of Reviews: Should we analyze writer intentions or reader perceptions?

Machine Reading for Question Answering: from symbolic to neural computation. Jianfeng Gao, Rangan Majumder and Bill Dolan Microsoft AI & R 7/17/2017

ERA: Architectures for Inference

SVM-based Discriminative Accumulation Scheme for Place Recognition

AMERICAN BOARD OF CLINICAL NEUROPHYSIOLOGY

GIANT: Geo-Informative Attributes for Location Recognition and Exploration

A Descriptive Delta for Identifying Changes in SNOMED CT

Why did the network make this prediction?

M.Sc. in Cognitive Systems. Model Curriculum

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil

Distillation of Knowledge from the Research Literatures on Alzheimer s Dementia

Applying Data Mining for Epileptic Seizure Detection

CROWD ANALYTICS VIA ONE SHOT LEARNING AND AGENT BASED INFERENCE. Peter Tu, Ming-Ching Chang, Tao Gao. GE Global Research

CHAPTER 6 HUMAN BEHAVIOR UNDERSTANDING MODEL

AI Support for Communication Disabilities. Shaun Kane University of Colorado Boulder

A Framework for Conceptualizing, Representing, and Analyzing Distributed Interaction. Dan Suthers

Highly Accurate Brain Stroke Diagnostic System and Generative Lesion Model. Junghwan Cho, Ph.D. CAIDE Systems, Inc. Deep Learning R&D Team

Graphical Induction of Qualified Medical Knowledge

arxiv: v2 [cs.cv] 19 Dec 2017

On the Use of Brainprints as Passwords

A hierarchical modeling approach to data analysis and study design in a multi-site experimental fmri study

Semantic Structure of the Indian Sign Language

Association of Neurophysiological Technologists of Australia Inc (also known as ANTA)

A Novel Capsule Neural Network Based Model For Drowsiness Detection Using Electroencephalography Signals

COMP9444 Neural Networks and Deep Learning 5. Convolutional Networks

Bundles of Synergy A Dynamical View of Mental Function

BayesOpt: Extensions and applications

Active User Affect Recognition and Assistance

Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains

University of Canberra. This thesis is available in print format from the University of Canberra Library.

CS-E Deep Learning Session 4: Convolutional Networks

3D Deep Learning for Multi-modal Imaging-Guided Survival Time Prediction of Brain Tumor Patients

Amy Kruse, Ph.D. Strategic Analysis, Inc. LCDR Dylan Schmorrow USN Defense Advanced Research Projects Agency

What Effect Do Schemas Have On The Recall Of

Lesson 6 Learning II Anders Lyhne Christensen, D6.05, INTRODUCTION TO AUTONOMOUS MOBILE ROBOTS

Schema-Driven Relationship Extraction from Unstructured Text

Artificial Neural Networks

A Multichannel Deep Belief Network for the Classification of EEG Data

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China

Classification of EEG signals in an Object Recognition task

Object recognition and hierarchical computation

Epilepsy and Neuropsychology

Exploiting deduction and abduction services for information retrieval. Ralf Moeller Hamburg University of Technology

Deep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations

Word Length Processing via Region-to-Region Connectivity

Annotation and Retrieval System Using Confabulation Model for ImageCLEF2011 Photo Annotation

Neuroinformatics. Ilmari Kurki, Urs Köster, Jukka Perkiö, (Shohei Shimizu) Interdisciplinary and interdepartmental

EARLY STAGE DIAGNOSIS OF LUNG CANCER USING CT-SCAN IMAGES BASED ON CELLULAR LEARNING AUTOMATE

Keywords Artificial Neural Networks (ANN), Echocardiogram, BPNN, RBFNN, Classification, survival Analysis.

How can Natural Language Processing help MedDRA coding? April Andrew Winter Ph.D., Senior Life Science Specialist, Linguamatics

Semi-Automatic Construction of Thyroid Cancer Intervention Corpus from Biomedical Abstracts

Jia Jia Tsinghua University 26/09/2017

Annotating Temporal Relations to Determine the Onset of Psychosis Symptoms

Recognition of sign language gestures using neural networks

Object Detectors Emerge in Deep Scene CNNs

Timing and the cerebellum (and the VOR) Neurophysiology of systems 2010

Cognitive Neuroscience History of Neural Networks in Artificial Intelligence The concept of neural network in artificial intelligence

arxiv: v1 [cs.ai] 5 Oct 2018 October 8, 2018

Automatic Detection of Epileptic Seizures in EEG Using Machine Learning Methods

Transcription:

Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports Ramon Maldonado, BS, Travis Goodwin, PhD Sanda M. Harabagiu, PhD The University of Texas at Dallas Human Language Technology Research Institute http://www.hlt.utdallas.edu/~{ramon, travis, sanda}

Conflicts There are no conflicts of interest

Outline 1. Introduction 2. The data 3. Memory-Augmented Active Deep Learning 4. EEG-RelNet for long distance relation detection 5. Experimental Results 6. Conclusion

Introduction Clinical electroencephalography (EEG) is the most important investigation in the diagnosis and management of epilepsies. As more clinical EEG becomes available, the interpretation of EEG signals can be improved by providing neurologists with results of search for patients that exhibit similar EEG characteristics. MERCuRY (Multi-modal ElectroencephalogRam patient Cohort discovery) - Goodwin & Harabagiu (2016) 1 for cohort identification

Introduction QUERY: Patients with shifting arrhythmic delta suspected of underlying cerebrovascular disease EXAMPLE RECORD: CLINICAL HISTORY: 55 y/o man admitted for change in mental status, with a past medical history of GI bleed, anemia, encephalopathy, and others. MEDICATIONS: Pantoprazole, Folic Acid, Carvedilol DESCRIPTION OF THE RECORD: The background EEG is characterized by slowing and disorganization. There is prominent shifting arrhythmic delta activity more prominent in the left mid to anterior temporal region. Photic stimulation generates scant driving. IMPRESSION: Abnormal EEG due to: 1. Marked background slowing and disorganization; 2. Some arrhythmic delta activity CLINICAL CORRELATION: These findings are supportive of a bihemispheric disturbance of cerebral function. These are nonspecific findings which can be seen in a toxic and metabolic encephalopathy and/or underlying cerebrovascular disease

Introduction Current state-of-the-art relation identification methods require arguments to be in the same sentence Our method operates on pairs of concepts from the same EEG reports not constrained to appear in the same sentence/section Accurate relations enable higher quality Medical Knowledge Embeddings (MKE) (Maldonado et al., 2017) Capture knowledge from a large corpus of EEG reports using deep learning Represent concepts and relations as low-dimensional vectors Vector representation helps resolve semantic heterogeneity from differences in terminology

Introduction EEG-RelNet Extension of RelNet from Bansal, Neelakantan, & McCallum (2017) Processes entire EEG report to model cross-sentence relations Memory-augmented neural network with abstract memory cells Concept memory cells Relation memory cells Memories implicitly model current knowledge about medical concepts and their relations in the EEG report

Outline 1. Introduction 2. The data 3. Memory-Augmented Active Deep Learning 4. EEG-RelNet for long distance relation detection 5. Experimental Results 6. Conclusion

The Data EEG reports from Temple University Hospital (TUH) 25,000 reports from 15,000 patients collected over 12 years Sections: 1. Clinical History: Lists past and current medical problems, symptoms, signs, and treatments as well as significant medical events. 2. Medications 3. Introduction: depiction of the techniques used for the EEG 4. Description: a complete and objective description of the EEG, noting all observed activity, patterns, and events 5. Impression: states whether the EEG test is normal or abnormal and, if abnormal, lists the abnormalities in order of importance 6. Clinical Correlation: explains what the EEG findings mean in terms of clinical interpretation

The Data - Preprocessing Multi-Task Active Deep Learning system for concept detection & attribute classification (Maldonado et. al 2017) EEG activities, EEG events, medical problems, treatments Medical concept attributes Modality Polarity Medical Concept Type 16 EEG activity attributes (morphology, dispersal, location, etc.) 365,218 concepts with 3,062,846 attributes Normalization UMLS for medical problems and treatments 10 EEG event types EEG activities are normalized by morphology attribute

Outline 1. Introduction 2. The data 3. Memory-Augmented Active Deep Learning 4. EEG-RelNet for long distance relation detection 5. Experimental Results 6. Conclusion

Memory-Augmented Active Deep Learning The purpose of the Memory-Augmented Active Deep Learning (MAADL) system is to automatically identify relations between pairs of medical concepts in EEG reports, regardless of how distant they are. The MAADL Paradigm consists of 5 steps: STEP 1: The development of an annotation schema STEP 2: Annotation of initial training data STEP 3: Design of deep learning method capable of learning from the data STEP 4: Development of sampling methods for MAADL STEP 5: Usage of the Active Learning system involving: STEP 5.a: Accepting/Editing annotations of sampled examples STEP 5.b: Re-training the deep learning model

MAADL Annotation Schema Three relation types: EVIDENCES, EVOKES, TREATMENT-FOR 1 EVIDENCES 2 EVOKES EEG event EEG activity medical problem treatment medical problem EEG event EEG activity medical problem treatment EEG activity 3 TREATMENT FOR treatment medical problem

Memory-Augmented Active Deep Learning EEG Reports Deep Learning-Based Identification of medical concepts + their attributes Automatically recognized Relations between medical concepts Active Learning Loop STEP 4 EEG Report Annotation SAMPLING STEP 1 STEP 2 Initial Training Data Manual Annotation of Relations between Medical Concepts. Relation types: EVIDENCES EVOKES TREATMENT-FOR EEG Reports with Seed Annotations of Relations between Medical Concepts STEP 3 Deep Learning-Based Recognition of Relations EEG-RelNet Relational Memories STEP 5 Validation/ Editing of Sampled Annotations of Relations in EEG Reports Re-Training Data

Outline 1. Introduction 2. The data 3. Memory-Augmented Active Deep Learning 4. EEG-RelNet for long distance relation detection 5. Experimental Results 6. Conclusion

EEG-RelNet EEG-RelNet Operates on the full text of an EEG report Considers each pair of medical concepts identified in the report Detects EVIDENCES, EVOKES, and TREATMENT-FOR relations between any pair of concepts 3 modules Input Encoding Module encodes the report into concept- and sentence-level embedding vectors Dynamic Relational Memory maintains and updates 2 sets of hidden states (memories) to accumulate information about each concept and potential relation Output Module feeds the updated memories to prediction layer

EEG-RelNet Input Encoding Module Learn an embedding c i R N for each medical concept c i in the report c i = W C c, a 1 c i,, a A c i c is the normalized concept name embedding a 1 c i,, a A c i are the attribute embeddings for concept c i Learn an embedding s i R N for each sentence s i by combining word embeddings e j for each word w j in sentence s i s i = σ j m f j e j f 1, f m is a learned positional mask e j = c i if word w j corresponds to concept c i

EEG-RelNet Dynamic Relational Memory (DRM) Module To account for long-distance relations, we use the DRM Module to keep track of the interactions between the medical concepts in each report The DRM for a report with d concepts consists of: d Concept Memories which accumulate information about each concept d(d 1) Relation Memories which accumulate information about the potential relations between each pair of concepts

EEG-RelNet c Concept Memory Cell Relation Memory Cell c Concept Memory Cell c Concept Memory Cell c c Concept Memory Cell Concept Memory Cell Relation Memory Cell Relation Memory Cell Relation Memory Cell Relation Memory Cell c c Concept Memory Cell Concept Memory Cell Relational Memory c c Concept Memory Cell Concept Memory Cell Relational Memory s s s

EEG-RelNet Concept Memory h i is updated as follows: i c = s k, h i + c i i + i h i = φ W u h i + W v c i + W s s k h i h i + i c h i c i i s Relation Memory r i is updated as follows: c c ij = i j s k, r ij ij + i r ij = φ W A r ij + W B s k r ij r ij + ij r ij j ij ij s

EEG-RelNet Output Module Identifies the relation (if any) between any pair of medical concepts from the report using the updated DRM after processing each sentence in the report To determine the relation between concepts c i and c j 1. We pass the concept memories h i and h j along with the relation memory r ij to a fully connected PReLU layer: q ij = φ W q h i, h j, r ij ; 2. Then softmax: R ij = softmax φ W z q ij. 3. The predicted relation type is R ij = argmax t R t ij

Outline 1. Introduction 2. The data 3. Memory-Augmented Active Deep Learning 4. EEG-RelNet for long distance relation detection 5. Experimental Results 6. Conclusion

Experimental Results To measure the impact of the EEG-RelNet architecture, we compare against: 1. EEG-RelNet_NRM: a configuration of EEG-RelNet with no relation memories (i.e. q ij = φ W q h i, h j ) 2. EEG-RelNet_NA: a configuration of EEG-RelNet that ignores medical concept attributes (i.e. c i = c) 3. A Heuristic baseline: a high-recall, rule-based method from our previous work using concept type and section information to relate concepts deterministically We use 5-fold cross validation on the full set of 140 manually annotated EEG reports selected via active learning containing 1513 relations between 3691 medical concepts

Experimental Results The performance of the MAADL system for detecting relations between medical concepts in EEG reports

Experimental Results Active Learning Learning Curves shown for the first 100 EEG reports annotated and evaluated with F 1 measure.

Experimental Results - Discussion Best performance for EVOKES relations 0.8371 vs 0.7116 (TREATMENT-FOR), 0.6939 (EVIDENCES) EVOKES always involves an EEG Activity EEG activities have a richer representation Context is often clearer for EVOKES relations Could use existing ontologies to inform medical problem and treatment concepts (e.g. TREATMENT-FOR(Lamictal, seizure)) TREATMENT-FOR relations have high recall and low precision Most common treatments (medications) have uninformative context Improves markedly as AL progresses

Outline 1. Introduction 2. The data 3. Memory-Augmented Active Deep Learning 4. EEG-RelNet for long distance relation detection 5. Experimental Results 6. Conclusion

Conclusion In this work, we presented a novel active deep learning framework for identifying relations between medical concepts in EEG reports. EEG-RelNet: a novel deep neural architecture capable of inferring relations between medical concepts not mentioned in the same sentence or section To our knowledge, this is the first method capable of detecting relations between distant medical concepts in medical narratives Updates a set of memories that accumulate information about each concept and potential relation in the EEG report Experimental results show promise Future work Incorporate outside knowledge to improve representation of medical problems/treatments Use MAADL system to improve Medical Knowledge Embeddings for epilepsy

Acknowledgements Research reported in this publication was supported by the National Human Genome Research Institute of the National Institutes of Health under award number 1U01HG008468. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Questions