Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials

Size: px
Start display at page:

Download "Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials"

Transcription

1 Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials Riccardo Miotto and Chunhua Weng Department of Biomedical Informatics Columbia University, New York City

2 Acknowledgments The research was supported by two grants: 1. R01 LM Bridging the semantic gap between clinical research eligibility criteria and clinical data from The National Library of Medicine (PI: Weng) 2. UL1 TR Clinical and Translational Science Award to Columbia University (PI: Ginsberg) 2

3 The Top 5 Challenges for Clinical Research 1. Recruitment 2. Recruitment 3. Recruitment 4. Recruitment 5. Recruitment 3

4 The Opportunity Samson W. Tu, Carol A. Kemper, Nancy M. Lane, Robert W. Carlson, and Mark A. Musen. A methodology for determining patients eligibility for clinical trials. Journal of Methods of Information in Medicine, 32(4): , EHRs contains rich information for identifying eligible patients for clinical trials As of 2015, EHRs have become pervasive in medicine 4

5 Our Initial Plan To derive a computable representation of the clinical trial eligibility criteria and to align it with patient EHR records Free-text Eligibility Criteria Text-based Knowledge Acquisition Semantic Alignment Potentially Eligible Patients 5

6 and Attempts 6

7 and Attempts 7

8 and Attempts 8

9 and Attempts 9

10 and Attempts 10

11 and Attempts 11

12 Data quality issues (Weiskopf, JAMIA 2013) Completeness Concordance Correctness Plausibility Currency and Challenges Data representation heterogeneity: structured, unstructured Lack of common information models Concept granularity discrepancies even using the same T Incomplete knowledge, imprecise disease classification Workflow and practical issues: recent visit, belief in research, etc. 12

13 An Alternative To derive a computable representation of the clinical trial eligibility criteria target patient and to align it with patient EHR records find patients similar to the target 13

14 Our Conceptual Framework 1. A set of eligible patients or clinical trial participants is manually identified their EHRs are aggregated to derive the target patient 2. The target patient is applied to any unseen patient of a clinical data warehouse to check the eligibility status 3. For each patient the framework returns a relevance score the higher the score, the more likely the patient is eligible for the trial 4. Patients can be be ranked by their relevance scores potentially eligible patients are at the top of the list manual identification performed by the investigator can be done quickly 14

15 Example Related Work 15

16 Methods: comparison to related work State-of-the-art: train a binary classifier to determine if a patient is eligible or not the classifier is the target patient Limitation of the state-of-the-art training a binary classifier requires a list of participants and also a list of ineligible patients so that finding ineligible patients can be as difficult as laborious as finding eligible patients Alternative Approach generate the target patient by modeling only the trial participants Advantages Rely only on a patient representation using comprehensive data without formally representing eligibility criteria 16

17 A Pilot Study Study Goal to show the feasibility of using only minimal trial participants to discover new potentially eligible patients Our implementation favored a simple design to ensure a focused and correct evaluation more complicated implementations are more likely to introduce mistakes in the process Flexible customization on how to: process and summarize patient EHR data represent the clinical trial participants discover potential eligible patients 17

18 Patient EHR Processing (1) EHR data types medication orders, ICD-9 diagnosis, laboratory results, clinical notes Each participant is represented by four vectors, one per data type Medication orders count the presence of each code in every participant Laboratory results count the presence of each test if results were categorical or expressed with different scales average test result values if expressed using the same unit measure 18

19 Patient EHR Processing (2) ICD9 codes count the presence of each code in every participant Clinical notes extract relevant tags limited presence of stop words, matching to UMLS use UMLS to normalize tags aggregate synonyms and semantically similar concepts under the same tag remove negated tags remove temporally consecutive similar notes topic modeling unsupervised inference process that captures patterns of word co-occurrences within a heterogeneous set of notes to define topics represent each note as a a vector of topic probabilities 19

20 Target Patient (1) Participant EHR representations are aggregated to derive the target patient 20

21 Target Patient (2) Target patient for each data type, retain only the concepts frequently shared by the participants motivated by the small number of participants available for some trials a concepts is frequent when appears in at least 60% of the participants average concept occurrences over all participants Target patient represented by four vectors aggregated common patterns among the trial s participants same structure of a regular patient representation favor direct comparisons with patient data 21

22 Finding New Eligible Patients EHR data of unseen patients are matched to the target patient of a clinical trial Relevance score indicating the patient eligible likelihood pairwise cosine similarity within each data type aggregating the scores using a weighted linear combination w-comb The higher the relevance score, the more likely is the patient eligible List of patients sorted by score the most likely candidates at the top of the list to speed up manual review 22

23 Evaluation: study design Dataset 13 clinical trials from Columbia University 262 unique participants additional 30k patients extracted at random from the data warehouse 2-fold cross validation half participants to derive the target patient half participants plus the 30k random patients to test Ranking experiment for each trial rank all the patients in the test set by their relevance score with the corresponding target patient measure at which point of the list the participants in the test set are ranked 23

24 Evaluation: 13 multi-site trials 24

25 Evaluation: 13 multi-site trials 25

26 Evaluation: comparisons w-comb obtains the best results take advantage of the heterogeneous data types AMIA 2014 Annual Symposium, November

27 Evaluation: AUC Area under the ROC Curve =

28 Evaluation: results Precision-at-5 for each trial in each fold every trial has at least one potentially eligible patient within the top 5 position of the ranking list (P5 0.2) 28

29 Conclusions EHR data of clinical trial participants can be used to recommend new eligible patients ranking results consistent among multiple trials of different medical conditions satisfactory results regardless the number of participants used for training Potential applicative scenarios self-standing tool constantly monitoring a clinical data warehouse and alerting investigators when a new potentially eligible patient is identified to be used on request to rank all patients in a data warehouse to find eligible patients for a specific trial component to integrate approaches processing eligibility criteria AMIA 2014 Annual Symposium, November

30 Ongoing Works Patient representation test more sophisticated techniques to represent EHR data model laboratory results accounting for the temporal trends of the values model diagnosis and medications using a well-chosen probability distribution to handle the incompleteness problem of EHR data Improve the target patient statistical model that can be trained by only estimating the distributions of participants associated with each trial e.g., hidden Markov models, mixture models Extend the experimental framework more and diverse clinical trials more patients in the dataset Tackle the complexity of trial design, e.g., the need for > 1 target model 30

31 Chunhua Weng, Ph.D.

A Method for Analyzing Commonalities in Clinical Trial Target Populations

A Method for Analyzing Commonalities in Clinical Trial Target Populations A Method for Analyzing Commonalities in Clinical Trial Target Populations Zhe (Henry) He 1, Simona Carini 2, Tianyong Hao 1, Ida Sim 2, and Chunhua Weng 1 1 Department of Biomedical Informatics, Columbia

More information

Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials

Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials Journal of the American Miotto R, Weng Medical C. J Am Med Informatics Inform Assoc 2015;0:1 10. Association doi:10.1093/jamia/ocu050, Advance Research Access and Applications published March 13, 2015

More information

Comparing ICD9-Encoded Diagnoses and NLP-Processed Discharge Summaries for Clinical Trials Pre-Screening: A Case Study

Comparing ICD9-Encoded Diagnoses and NLP-Processed Discharge Summaries for Clinical Trials Pre-Screening: A Case Study Comparing ICD9-Encoded Diagnoses and NLP-Processed Discharge Summaries for Clinical Trials Pre-Screening: A Case Study Li Li, MS, Herbert S. Chase, MD, Chintan O. Patel, MS Carol Friedman, PhD, Chunhua

More information

Semantic Alignment between ICD-11 and SNOMED-CT. By Marcie Wright RHIA, CHDA, CCS

Semantic Alignment between ICD-11 and SNOMED-CT. By Marcie Wright RHIA, CHDA, CCS Semantic Alignment between ICD-11 and SNOMED-CT By Marcie Wright RHIA, CHDA, CCS World Health Organization (WHO) owns and publishes the International Classification of Diseases (ICD) WHO was entrusted

More information

Conditional Outlier Detection for Clinical Alerting

Conditional Outlier Detection for Clinical Alerting Conditional Outlier Detection for Clinical Alerting Milos Hauskrecht, PhD 1, Michal Valko, MSc 1, Iyad Batal, MSc 1, Gilles Clermont, MD, MS 2, Shyam Visweswaran MD, PhD 3, Gregory F. Cooper, MD, PhD 3

More information

Understanding and Reducing Clinical Data Biases. Daniel Fort

Understanding and Reducing Clinical Data Biases. Daniel Fort Understanding and Reducing Clinical Data Biases Daniel Fort Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy under the Executive Committee of the Graduate School

More information

Using Probabilistic Methods to Optimize Data Entry in Accrual of Patients to Clinical Trials

Using Probabilistic Methods to Optimize Data Entry in Accrual of Patients to Clinical Trials Using Probabilistic Methods to Optimize Data Entry in Accrual of Patients to Clinical Trials Bhavesh D. Goswami, Lawrence O. Hall, Dmitry B. Goldgof, Eugene Fink, and Jeffrey P. Krischer bgoswami@csee.usf.edu,

More information

Temporal Knowledge Representation for Scheduling Tasks in Clinical Trial Protocols

Temporal Knowledge Representation for Scheduling Tasks in Clinical Trial Protocols Temporal Knowledge Representation for Scheduling Tasks in Clinical Trial Protocols Chunhua Weng, M.S., 1 Michael Kahn, M.D., Ph.D., 2 & John Gennari, Ph.D. 1 1 Biomedical and Health Informatics, University

More information

Using Probabilistic Methods to Optimize Data Entry in Accrual of Patients to Clinical Trials

Using Probabilistic Methods to Optimize Data Entry in Accrual of Patients to Clinical Trials Using Probabilistic Methods to Optimize Data Entry in Accrual of Patients to Clinical Trials Bhavesh D. Goswami 1, Lawrence O. Hall 1, Dmitry B. Goldgof 1, Eugene Fink 2, Jeffrey P. Krischer 1 bgoswami@csee.usf.edu,

More information

A Comparison of Collaborative Filtering Methods for Medication Reconciliation

A Comparison of Collaborative Filtering Methods for Medication Reconciliation A Comparison of Collaborative Filtering Methods for Medication Reconciliation Huanian Zheng, Rema Padman, Daniel B. Neill The H. John Heinz III College, Carnegie Mellon University, Pittsburgh, PA, 15213,

More information

Predictive and Similarity Analytics for Healthcare

Predictive and Similarity Analytics for Healthcare Predictive and Similarity Analytics for Healthcare Paul Hake, MSPA IBM Smarter Care Analytics 1 Disease Progression & Cost of Care Health Status Health care spending Healthy / Low Risk At Risk High Risk

More information

How preferred are preferred terms?

How preferred are preferred terms? How preferred are preferred terms? Gintare Grigonyte 1, Simon Clematide 2, Fabio Rinaldi 2 1 Computational Linguistics Group, Department of Linguistics, Stockholm University Universitetsvagen 10 C SE-106

More information

A Deep Learning Approach to Identify Diabetes

A Deep Learning Approach to Identify Diabetes , pp.44-49 http://dx.doi.org/10.14257/astl.2017.145.09 A Deep Learning Approach to Identify Diabetes Sushant Ramesh, Ronnie D. Caytiles* and N.Ch.S.N Iyengar** School of Computer Science and Engineering

More information

Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports

Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports Ramon Maldonado, BS, Travis Goodwin, PhD Sanda M. Harabagiu, PhD The University

More information

A Novel Temporal Similarity Measure for Patients Based on Irregularly Measured Data in Electronic Health Records

A Novel Temporal Similarity Measure for Patients Based on Irregularly Measured Data in Electronic Health Records ACM-BCB 2016 A Novel Temporal Similarity Measure for Patients Based on Irregularly Measured Data in Electronic Health Records Ying Sha, Janani Venugopalan, May D. Wang Georgia Institute of Technology Oct.

More information

Handling Partial Preferences in the Belief AHP Method: Application to Life Cycle Assessment

Handling Partial Preferences in the Belief AHP Method: Application to Life Cycle Assessment Handling Partial Preferences in the Belief AHP Method: Application to Life Cycle Assessment Amel Ennaceur 1, Zied Elouedi 1, and Eric Lefevre 2 1 University of Tunis, Institut Supérieur de Gestion de Tunis,

More information

Pilot Study: Clinical Trial Task Ontology Development. A prototype ontology of common participant-oriented clinical research tasks and

Pilot Study: Clinical Trial Task Ontology Development. A prototype ontology of common participant-oriented clinical research tasks and Pilot Study: Clinical Trial Task Ontology Development Introduction A prototype ontology of common participant-oriented clinical research tasks and events was developed using a multi-step process as summarized

More information

Data Structures vs. Study Results:

Data Structures vs. Study Results: Data Structures vs. Study Results: Confessions of a failed epidemiologist who had an informatics epiphany CG Chute, MD DrPH, Bloomberg Distinguished Professor of Health Informatics April 7, 2015 1 Chris

More information

AN ONTOLOGICAL APPROACH TO REPRESENTING AND REASONING WITH TEMPORAL CONSTRAINTS IN CLINICAL TRIAL PROTOCOLS

AN ONTOLOGICAL APPROACH TO REPRESENTING AND REASONING WITH TEMPORAL CONSTRAINTS IN CLINICAL TRIAL PROTOCOLS AN ONTOLOGICAL APPROACH TO REPRESENTING AND REASONING WITH TEMPORAL CONSTRAINTS IN CLINICAL TRIAL PROTOCOLS Ravi D. Shankar, Susana B. Martins, Martin J. O Connor and Amar K. Das Stanford Medical Informatics,

More information

Innovative Risk and Quality Solutions for Value-Based Care. Company Overview

Innovative Risk and Quality Solutions for Value-Based Care. Company Overview Innovative Risk and Quality Solutions for Value-Based Care Company Overview Meet Talix Talix provides risk and quality solutions to help providers, payers and accountable care organizations address the

More information

A Framework for Conceptualizing, Representing, and Analyzing Distributed Interaction. Dan Suthers

A Framework for Conceptualizing, Representing, and Analyzing Distributed Interaction. Dan Suthers 1 A Framework for Conceptualizing, Representing, and Analyzing Distributed Interaction Dan Suthers Work undertaken with Nathan Dwyer, Richard Medina and Ravi Vatrapu Funded in part by the U.S. National

More information

Statement of research interest

Statement of research interest Statement of research interest Milos Hauskrecht My primary field of research interest is Artificial Intelligence (AI). Within AI, I am interested in problems related to probabilistic modeling, machine

More information

MAY 1, 2001 Prepared by Ernest Valente, Ph.D.

MAY 1, 2001 Prepared by Ernest Valente, Ph.D. An Explanation of the 8 and 30 File Sampling Procedure Used by NCQA During MAY 1, 2001 Prepared by Ernest Valente, Ph.D. Overview The National Committee for Quality Assurance (NCQA) conducts on-site surveys

More information

A Real-Time Screening Alert Improves Patient Recruitment Efficiency

A Real-Time Screening Alert Improves Patient Recruitment Efficiency A Real-Time Screening Alert Improves Patient Recruitment Efficiency Chunhua Weng, PhD 1, Candido Batres, MD 2, Tomas Borda, MD 2, Nicole G. Weiskopf, BA 1 Adam B. Wilcox, PhD 1, J Thomas Bigger, MD 2,

More information

A NOVEL VARIABLE SELECTION METHOD BASED ON FREQUENT PATTERN TREE FOR REAL-TIME TRAFFIC ACCIDENT RISK PREDICTION

A NOVEL VARIABLE SELECTION METHOD BASED ON FREQUENT PATTERN TREE FOR REAL-TIME TRAFFIC ACCIDENT RISK PREDICTION OPT-i An International Conference on Engineering and Applied Sciences Optimization M. Papadrakakis, M.G. Karlaftis, N.D. Lagaros (eds.) Kos Island, Greece, 4-6 June 2014 A NOVEL VARIABLE SELECTION METHOD

More information

Asthma Surveillance Using Social Media Data

Asthma Surveillance Using Social Media Data Asthma Surveillance Using Social Media Data Wenli Zhang 1, Sudha Ram 1, Mark Burkart 2, Max Williams 2, and Yolande Pengetnze 2 University of Arizona 1, PCCI-Parkland Center for Clinical Innovation 2 {wenlizhang,

More information

Comparing Decision Support Methodologies for Identifying Asthma Exacerbations

Comparing Decision Support Methodologies for Identifying Asthma Exacerbations MEDINFO 2007 K. Kuhn et al. (Eds) IOS Press, 2007 2007 The authors. All rights reserved. Comparing Decision Support Methodologies for Identifying Asthma Exacerbations Judith W Dexheimer a, Laura E Brown

More information

Schema-Driven Relationship Extraction from Unstructured Text

Schema-Driven Relationship Extraction from Unstructured Text Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 2007 Schema-Driven Relationship Extraction from Unstructured Text Cartic

More information

Jia Jia Tsinghua University 26/09/2017

Jia Jia Tsinghua University 26/09/2017 Jia Jia jjia@tsinghua.edu.cn Tsinghua University 26/09/2017 Stage 1: Online detection of mental health problems Stress Detection via Harvesting Social Media Detecting Stress Based on Social Interactions

More information

An Ontology for Healthcare Quality Indicators: Challenges for Semantic Interoperability

An Ontology for Healthcare Quality Indicators: Challenges for Semantic Interoperability 414 Digital Healthcare Empowering Europeans R. Cornet et al. (Eds.) 2015 European Federation for Medical Informatics (EFMI). This article is published online with Open Access by IOS Press and distributed

More information

Building a Diseases Symptoms Ontology for Medical Diagnosis: An Integrative Approach

Building a Diseases Symptoms Ontology for Medical Diagnosis: An Integrative Approach Building a Diseases Symptoms Ontology for Medical Diagnosis: An Integrative Approach Osama Mohammed, Rachid Benlamri and Simon Fong* Department of Software Engineering, Lakehead University, Ontario, Canada

More information

A Brief Introduction to Bayesian Statistics

A Brief Introduction to Bayesian Statistics A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon

More information

A HMM-based Pre-training Approach for Sequential Data

A HMM-based Pre-training Approach for Sequential Data A HMM-based Pre-training Approach for Sequential Data Luca Pasa 1, Alberto Testolin 2, Alessandro Sperduti 1 1- Department of Mathematics 2- Department of Developmental Psychology and Socialisation University

More information

Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains

Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains Dr. Sascha Losko, Dr. Karsten Wenger, Dr. Wenzel Kalus, Dr. Andrea Ramge, Dr. Jens Wiehler,

More information

Identifying Deviations from Usual Medical Care using a Statistical Approach

Identifying Deviations from Usual Medical Care using a Statistical Approach Identifying Deviations from Usual Medical Care using a Statistical Approach Shyam Visweswaran, MD, PhD 1, James Mezger, MD, MS 2, Gilles Clermont, MD, MSc 3, Milos Hauskrecht, PhD 4, Gregory F. Cooper,

More information

Clinical decision support (CDS) and Arden Syntax

Clinical decision support (CDS) and Arden Syntax Clinical decision support (CDS) and Arden Syntax Educational material, part 1 Medexter Healthcare Borschkegasse 7/5 A-1090 Vienna www.medexter.com www.meduniwien.ac.at/kpa (academic) Better care, patient

More information

PGT: Measuring Mobility Relationship using Personal, Global and Temporal Factors

PGT: Measuring Mobility Relationship using Personal, Global and Temporal Factors PGT: using Personal, Global and Temporal Factors Hongjian Wang, Zhenhui Li, Wang-Chien Lee Penn State University ICDM 2014 Shenzhen Measure the mobility relationship strength Given trajectories of two

More information

A Descriptive Delta for Identifying Changes in SNOMED CT

A Descriptive Delta for Identifying Changes in SNOMED CT A Descriptive Delta for Identifying Changes in SNOMED CT Christopher Ochs, Yehoshua Perl, Gai Elhanan Department of Computer Science New Jersey Institute of Technology Newark, NJ, USA {cro3, perl, elhanan}@njit.edu

More information

Learning Optimal Individualized Treatment Rules from Electronic Health Record Data

Learning Optimal Individualized Treatment Rules from Electronic Health Record Data 2016 IEEE International Conference on Healthcare Informatics Learning Optimal Individualized Treatment Rules from Electronic Health Record Data Yuanjia Wang, Peng Wu, Ying Liu Department of Biostatistics

More information

S10: Data Quality Assessment and Control Framework for Secondary Use of Healthcare Data

S10: Data Quality Assessment and Control Framework for Secondary Use of Healthcare Data S10: Data Quality Assessment and Control Framework for Secondary Use of Healthcare Data Presented by: Meredith Nahm Zozus Duke University October 7, 2014 1:00 pm - 1:45 pm 1 What do we mean by Secondary

More information

Leveraging Electronic Health Data in a Multinational Clinical Trial: Early Learnings from the HARMONY- OUTCOMES EHR Ancillary Study

Leveraging Electronic Health Data in a Multinational Clinical Trial: Early Learnings from the HARMONY- OUTCOMES EHR Ancillary Study Leveraging Electronic Health Data in a Multinational Clinical Trial: Early Learnings from the HARMONY- OUTCOMES EHR Ancillary Study Emily O Brien, PhD Assistant Professor Lesley Curtis, PhD Professor Department

More information

Problem-Oriented Patient Record Summary: An Early Report on a Watson Application

Problem-Oriented Patient Record Summary: An Early Report on a Watson Application Problem-Oriented Patient Record Summary: An Early Report on a Watson Application Murthy Devarakonda, Dongyang Zhang, Ching-Huei Tsou, Mihaela Bornea IBM Research and Watson Group Yorktown Heights, NY Abstract

More information

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 DETAILED COURSE OUTLINE Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 Hal Morgenstern, Ph.D. Department of Epidemiology UCLA School of Public Health Page 1 I. THE NATURE OF EPIDEMIOLOGIC

More information

The openehr approach. Background. Approach. Heather Leslie, July 17, 2014

The openehr approach. Background. Approach. Heather Leslie, July 17, 2014 Heather Leslie, July 17, 2014 The openehr approach Background The open source openehr 1 architecture specifications 2 have evolved as the result of over 20 years of research and international implementations,

More information

Discovering Meaningful Cut-points to Predict High HbA1c Variation

Discovering Meaningful Cut-points to Predict High HbA1c Variation Proceedings of the 7th INFORMS Workshop on Data Mining and Health Informatics (DM-HI 202) H. Yang, D. Zeng, O. E. Kundakcioglu, eds. Discovering Meaningful Cut-points to Predict High HbAc Variation Si-Chi

More information

THE DYNAMICS OF MOTIVATION

THE DYNAMICS OF MOTIVATION 92 THE DYNAMICS OF MOTIVATION 1. Motivation is a highly dynamic construct that is constantly changing in reaction to life experiences. 2. Needs and goals are constantly growing and changing. 3. As individuals

More information

SNOMED CT and Orphanet working together

SNOMED CT and Orphanet working together SNOMED CT and Orphanet working together Ian Green Business Services Executive, IHTSDO Dr. Romina Armando INSERM Session outline What is Orphanet? Rare disorders Orphanet nomenclature Mappings to other

More information

Not all NLP is Created Equal:

Not all NLP is Created Equal: Not all NLP is Created Equal: CAC Technology Underpinnings that Drive Accuracy, Experience and Overall Revenue Performance Page 1 Performance Perspectives Health care financial leaders and health information

More information

A Data Quality Assessment Guideline for Electronic Health Record Data Reuse

A Data Quality Assessment Guideline for Electronic Health Record Data Reuse egems Generating Evidence & Methods to improve patient outcomes A Data Quality Assessment Guideline for Electronic Health Record Data Reuse Nicole G. Weiskopf, PhD; i Suzanne Bakken; ii,iii RN PhD; George

More information

TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS)

TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS) TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS) AUTHORS: Tejas Prahlad INTRODUCTION Acute Respiratory Distress Syndrome (ARDS) is a condition

More information

Rumor Detection on Twitter with Tree-structured Recursive Neural Networks

Rumor Detection on Twitter with Tree-structured Recursive Neural Networks 1 Rumor Detection on Twitter with Tree-structured Recursive Neural Networks Jing Ma 1, Wei Gao 2, Kam-Fai Wong 1,3 1 The Chinese University of Hong Kong 2 Victoria University of Wellington, New Zealand

More information

The AAA statement on Ethnography and Institutional Review Boards (2004) provides a useful working definition:

The AAA statement on Ethnography and Institutional Review Boards (2004) provides a useful working definition: Ethnographic Research and IRB-SBS Protocols INTRODUCTION The Department of Anthropology has drafted the following guidelines intended to help inform the process of IRB oversight of ethnographic research

More information

Deep Patient: Predict the Medical Future of Patients with Artificial Intelligence and EHRs

Deep Patient: Predict the Medical Future of Patients with Artificial Intelligence and EHRs Deep Patient: Predict the Medical Future of Patients with Artificial Intelligence and EHRs Riccardo Miotto, Ph.D. New York, NY Institute for Next Generation Healthcare Dept. of Genetics and Genomic Sciences

More information

An Evolutionary Approach to the Representation of Adverse Events

An Evolutionary Approach to the Representation of Adverse Events Medical Informatics in a United and Healthy Europe K.-P. Adlassnig et al. (Eds.) IOS Press, 2009 2009 European Federation for Medical Informatics. All rights reserved. doi:10.3233/978-1-60750-044-5-537

More information

Methodological Issues in Measuring the Development of Character

Methodological Issues in Measuring the Development of Character Methodological Issues in Measuring the Development of Character Noel A. Card Department of Human Development and Family Studies College of Liberal Arts and Sciences Supported by a grant from the John Templeton

More information

POST GRADUATE DIPLOMA IN BIOETHICS (PGDBE) Term-End Examination June, 2016 MHS-014 : RESEARCH METHODOLOGY

POST GRADUATE DIPLOMA IN BIOETHICS (PGDBE) Term-End Examination June, 2016 MHS-014 : RESEARCH METHODOLOGY No. of Printed Pages : 12 MHS-014 POST GRADUATE DIPLOMA IN BIOETHICS (PGDBE) Term-End Examination June, 2016 MHS-014 : RESEARCH METHODOLOGY Time : 2 hours Maximum Marks : 70 PART A Attempt all questions.

More information

What Helps Where And Why? Semantic Relatedness for Knowledge Transfer

What Helps Where And Why? Semantic Relatedness for Knowledge Transfer What Helps Where And Why? Semantic Relatedness for Knowledge Transfer Marcus Rohrbach 1,2 Michael Stark 1,2 György Szarvas 1 Iryna Gurevych 1 Bernt Schiele 1,2 1 Department of Computer Science, TU Darmstadt

More information

Electronic Health Record Summarization over Heterogeneous and Irregularly Sampled Clinical. Data. Rimma Pivovarov

Electronic Health Record Summarization over Heterogeneous and Irregularly Sampled Clinical. Data. Rimma Pivovarov Electronic Health Record Summarization over Heterogeneous and Irregularly Sampled Clinical Data Rimma Pivovarov Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy

More information

READ-BIOMED-SS: ADVERSE DRUG REACTION CLASSIFICATION OF MICROBLOGS USING EMOTIONAL AND CONCEPTUAL ENRICHMENT

READ-BIOMED-SS: ADVERSE DRUG REACTION CLASSIFICATION OF MICROBLOGS USING EMOTIONAL AND CONCEPTUAL ENRICHMENT READ-BIOMED-SS: ADVERSE DRUG REACTION CLASSIFICATION OF MICROBLOGS USING EMOTIONAL AND CONCEPTUAL ENRICHMENT BAHADORREZA OFOGHI 1, SAMIN SIDDIQUI 1, and KARIN VERSPOOR 1,2 1 Department of Computing and

More information

Sentiment Analysis of Reviews: Should we analyze writer intentions or reader perceptions?

Sentiment Analysis of Reviews: Should we analyze writer intentions or reader perceptions? Sentiment Analysis of Reviews: Should we analyze writer intentions or reader perceptions? Isa Maks and Piek Vossen Vu University, Faculty of Arts De Boelelaan 1105, 1081 HV Amsterdam e.maks@vu.nl, p.vossen@vu.nl

More information

Non-Pharmacological Strategies to Ameliorate Symptoms of Alzheimer s Disease and Related Dementia (NPSASA)

Non-Pharmacological Strategies to Ameliorate Symptoms of Alzheimer s Disease and Related Dementia (NPSASA) Non-Pharmacological Strategies to Ameliorate Symptoms of Alzheimer s Disease and Related Dementia (NPSASA) Competition objectives: The Alzheimer s Association is launching a new initiative to stimulate

More information

Naturalistic Driving Performance During Secondary Tasks

Naturalistic Driving Performance During Secondary Tasks University of Iowa Iowa Research Online Driving Assessment Conference 2007 Driving Assessment Conference Jul 11th, 12:00 AM Naturalistic Driving Performance During Secondary Tasks James Sayer University

More information

Motivation: Attention: Focusing on specific parts of the input. Inspired by neuroscience.

Motivation: Attention: Focusing on specific parts of the input. Inspired by neuroscience. Outline: Motivation. What s the attention mechanism? Soft attention vs. Hard attention. Attention in Machine translation. Attention in Image captioning. State-of-the-art. 1 Motivation: Attention: Focusing

More information

An Intelligent Writing Assistant Module for Narrative Clinical Records based on Named Entity Recognition and Similarity Computation

An Intelligent Writing Assistant Module for Narrative Clinical Records based on Named Entity Recognition and Similarity Computation An Intelligent Writing Assistant Module for Narrative Clinical Records based on Named Entity Recognition and Similarity Computation 1,2,3 EMR and Intelligent Expert System Engineering Research Center of

More information

Introduction to Computational Neuroscience

Introduction to Computational Neuroscience Introduction to Computational Neuroscience Lecture 5: Data analysis II Lesson Title 1 Introduction 2 Structure and Function of the NS 3 Windows to the Brain 4 Data analysis 5 Data analysis II 6 Single

More information

Identifying Parkinson s Patients: A Functional Gradient Boosting Approach

Identifying Parkinson s Patients: A Functional Gradient Boosting Approach Identifying Parkinson s Patients: A Functional Gradient Boosting Approach Devendra Singh Dhami 1, Ameet Soni 2, David Page 3, and Sriraam Natarajan 1 1 Indiana University Bloomington 2 Swarthmore College

More information

Using Bayesian Networks to Analyze Expression Data. Xu Siwei, s Muhammad Ali Faisal, s Tejal Joshi, s

Using Bayesian Networks to Analyze Expression Data. Xu Siwei, s Muhammad Ali Faisal, s Tejal Joshi, s Using Bayesian Networks to Analyze Expression Data Xu Siwei, s0789023 Muhammad Ali Faisal, s0677834 Tejal Joshi, s0677858 Outline Introduction Bayesian Networks Equivalence Classes Applying to Expression

More information

REDUCING CLINICAL NOISE FOR BODY

REDUCING CLINICAL NOISE FOR BODY REDUCING CLINICAL NOISE FOR BODY MASS INDEX MEASURES DUE TO UNIT AND TRANSCRIPTION ERRORS IN THE ELECTRONIC HEALTH RECORD March 28, 2017 Dana C. Crawford, PhD Associate Professor Epidemiology and Biostatistics

More information

Efficient AUC Optimization for Information Ranking Applications

Efficient AUC Optimization for Information Ranking Applications Efficient AUC Optimization for Information Ranking Applications Sean J. Welleck IBM, USA swelleck@us.ibm.com Abstract. Adequate evaluation of an information retrieval system to estimate future performance

More information

Implicit Attitude. Brian A. Nosek. University of Virginia. Mahzarin R. Banaji. Harvard University

Implicit Attitude. Brian A. Nosek. University of Virginia. Mahzarin R. Banaji. Harvard University 1 Implicit Attitude Brian A. Nosek University of Virginia Mahzarin R. Banaji Harvard University Contact Information Brian Nosek 102 Gilmer Hall; Box 400400 Department of Psychology University of Virginia

More information

Leveraging Standards for Effective Visualization of Early Efficacy in Clinical Trial Oncology Studies

Leveraging Standards for Effective Visualization of Early Efficacy in Clinical Trial Oncology Studies PharmaSUG 2018 - Paper DV-26 Leveraging Standards for Effective Visualization of Early Efficacy in Clinical Trial Oncology Studies Kelci Miclaus and Lili Li, JMP Life Sciences SAS Institute ABSTRACT Solid

More information

A Decision-Theoretic Approach to Evaluating Posterior Probabilities of Mental Models

A Decision-Theoretic Approach to Evaluating Posterior Probabilities of Mental Models A Decision-Theoretic Approach to Evaluating Posterior Probabilities of Mental Models Jonathan Y. Ito and David V. Pynadath and Stacy C. Marsella Information Sciences Institute, University of Southern California

More information

Research Questions and Survey Development

Research Questions and Survey Development Research Questions and Survey Development R. Eric Heidel, PhD Associate Professor of Biostatistics Department of Surgery University of Tennessee Graduate School of Medicine Research Questions 1 Research

More information

A Framework for Optimal Cancer Care Pathways in Practice

A Framework for Optimal Cancer Care Pathways in Practice A to Guide Care Cancer Care A for Care in Practice SUPPORTING CONTINUOUS IMPROVEMENT IN CANCER CARE Developed by the National Cancer Expert Reference Group to support the early adoption of the A to Guide

More information

Chapter 1. Introduction

Chapter 1. Introduction Chapter 1 Introduction 1.1 Motivation and Goals The increasing availability and decreasing cost of high-throughput (HT) technologies coupled with the availability of computational tools and data form a

More information

Using a Guideline-Centered Approach for the Design of a Clinical Decision Support System to Promote Smoking Cessation

Using a Guideline-Centered Approach for the Design of a Clinical Decision Support System to Promote Smoking Cessation Using a Guideline-Centered Approach for the Design of a Clinical Decision Support System to Promote Smoking Cessation Richard N. Shiffman, MD, MCIS George Michel, MS Abdelwaheb Essaihi, MD Theodore W.

More information

M.Sc. in Cognitive Systems. Model Curriculum

M.Sc. in Cognitive Systems. Model Curriculum M.Sc. in Cognitive Systems Model Curriculum April 2014 Version 1.0 School of Informatics University of Skövde Sweden Contents 1 CORE COURSES...1 2 ELECTIVE COURSES...1 3 OUTLINE COURSE SYLLABI...2 Page

More information

A Predictive Chronological Model of Multiple Clinical Observations T R A V I S G O O D W I N A N D S A N D A M. H A R A B A G I U

A Predictive Chronological Model of Multiple Clinical Observations T R A V I S G O O D W I N A N D S A N D A M. H A R A B A G I U A Predictive Chronological Model of Multiple Clinical Observations T R A V I S G O O D W I N A N D S A N D A M. H A R A B A G I U T H E U N I V E R S I T Y O F T E X A S A T D A L L A S H U M A N L A N

More information

Problem solving therapy

Problem solving therapy Introduction People with severe mental illnesses such as schizophrenia may show impairments in problem-solving ability. Remediation interventions such as problem solving skills training can help people

More information

Bayesian and Frequentist Approaches

Bayesian and Frequentist Approaches Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law

More information

A Lexical-Ontological Resource forconsumerheathcare

A Lexical-Ontological Resource forconsumerheathcare A Lexical-Ontological Resource forconsumerheathcare Elena Cardillo FBK-IRST, Via Sommarive 18, 38123 Trento, Italy cardillo@fbk.eu Abstract. In Consumer Healthcare Informatics it is still difficult for

More information

Allergen immunotherapy for the treatment of allergic rhinitis and/or asthma

Allergen immunotherapy for the treatment of allergic rhinitis and/or asthma ril 2014 Allergen immunotherapy for the treatment of allergic rhinitis and/or asthma FINAL COMPREHENSIVE RESEARCH PLAN June 2015 Study Team: Systematic Review Unit FINAL COMPREHENSIVE RESEARCH PLAN: Systematic

More information

Answers to end of chapter questions

Answers to end of chapter questions Answers to end of chapter questions Chapter 1 What are the three most important characteristics of QCA as a method of data analysis? QCA is (1) systematic, (2) flexible, and (3) it reduces data. What are

More information

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text Anthony Nguyen 1, Michael Lawley 1, David Hansen 1, Shoni Colquist 2 1 The Australian e-health Research Centre, CSIRO ICT

More information

NIH Public Access Author Manuscript Stud Health Technol Inform. Author manuscript; available in PMC 2010 February 28.

NIH Public Access Author Manuscript Stud Health Technol Inform. Author manuscript; available in PMC 2010 February 28. NIH Public Access Author Manuscript Published in final edited form as: Stud Health Technol Inform. 2009 ; 150: 537 541. An Evolutionary Approach to the Representation of Adverse Events Werner Ceusters

More information

How many speakers? How many tokens?:

How many speakers? How many tokens?: 1 NWAV 38- Ottawa, Canada 23/10/09 How many speakers? How many tokens?: A methodological contribution to the study of variation. Jorge Aguilar-Sánchez University of Wisconsin-La Crosse 2 Sample size in

More information

SPICE: Semantic Propositional Image Caption Evaluation

SPICE: Semantic Propositional Image Caption Evaluation SPICE: Semantic Propositional Image Caption Evaluation Presented to the COCO Consortium, Sept 2016 Peter Anderson1, Basura Fernando1, Mark Johnson2 and Stephen Gould1 1 Australian National University 2

More information

Classification and Statistical Analysis of Auditory FMRI Data Using Linear Discriminative Analysis and Quadratic Discriminative Analysis

Classification and Statistical Analysis of Auditory FMRI Data Using Linear Discriminative Analysis and Quadratic Discriminative Analysis International Journal of Innovative Research in Computer Science & Technology (IJIRCST) ISSN: 2347-5552, Volume-2, Issue-6, November-2014 Classification and Statistical Analysis of Auditory FMRI Data Using

More information

Statistical Considerations for Research Design. Analytic Goal. Analytic Process

Statistical Considerations for Research Design. Analytic Goal. Analytic Process Statistical Considerations for Research Design John E. Orav, Ph.D. Analytic Goal Usually, to compare an outcome between groups of research subjects, where one of the groups has received a treatment, intervention,

More information

FDA Workshop NLP to Extract Information from Clinical Text

FDA Workshop NLP to Extract Information from Clinical Text FDA Workshop NLP to Extract Information from Clinical Text Murthy Devarakonda, Ph.D. Distinguished Research Staff Member PI for Watson Patient Records Analytics Project IBM Research mdev@us.ibm.com *This

More information

MISSING DATA ESTIMATION FOR CANCER DIAGNOSIS SUPPORT

MISSING DATA ESTIMATION FOR CANCER DIAGNOSIS SUPPORT MISSING DATA ESTIMATION FOR CANCER DIAGNOSIS SUPPORT Witold Jacak (a), Karin Proell (b) (a) Department of Software Engineering Upper Austria University of Applied Sciences Hagenberg, Softwarepark 11, Austria

More information

Bottom-Up Model of Strategy Selection

Bottom-Up Model of Strategy Selection Bottom-Up Model of Strategy Selection Tomasz Smoleń (tsmolen@apple.phils.uj.edu.pl) Jagiellonian University, al. Mickiewicza 3 31-120 Krakow, Poland Szymon Wichary (swichary@swps.edu.pl) Warsaw School

More information

Abstract. Background. Objective

Abstract. Background. Objective Molecular epidemiology of clinical tissues with multi-parameter IHC Poster 237 J Ruan 1, T Hope 1, J Rheinhardt 2, D Wang 2, R Levenson 1, T Nielsen 3, H Gardner 2, C Hoyt 1 1 CRi, Woburn, Massachusetts,

More information

The Meta on Meta-Analysis. Presented by Endia J. Lindo, Ph.D. University of North Texas

The Meta on Meta-Analysis. Presented by Endia J. Lindo, Ph.D. University of North Texas The Meta on Meta-Analysis Presented by Endia J. Lindo, Ph.D. University of North Texas Meta-Analysis What is it? Why use it? How to do it? Challenges and benefits? Current trends? What is meta-analysis?

More information

Multi-modal Patient Cohort Identification from EEG Report and Signal Data

Multi-modal Patient Cohort Identification from EEG Report and Signal Data Multi-modal Patient Cohort Identification from EEG Report and Signal Data Travis R. Goodwin and Sanda M. Harabagiu The University of Texas at Dallas Human Language Technology Research Institute http://www.hlt.utdallas.edu

More information

Text mining for lung cancer cases over large patient admission data. David Martinez, Lawrence Cavedon, Zaf Alam, Christopher Bain, Karin Verspoor

Text mining for lung cancer cases over large patient admission data. David Martinez, Lawrence Cavedon, Zaf Alam, Christopher Bain, Karin Verspoor Text mining for lung cancer cases over large patient admission data David Martinez, Lawrence Cavedon, Zaf Alam, Christopher Bain, Karin Verspoor Opportunities for Biomedical Informatics Increasing roll-out

More information

Discussion Meeting for MCP-Mod Qualification Opinion Request. Novartis 10 July 2013 EMA, London, UK

Discussion Meeting for MCP-Mod Qualification Opinion Request. Novartis 10 July 2013 EMA, London, UK Discussion Meeting for MCP-Mod Qualification Opinion Request Novartis 10 July 2013 EMA, London, UK Attendees Face to face: Dr. Frank Bretz Global Statistical Methodology Head, Novartis Dr. Björn Bornkamp

More information

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018 Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this

More information

Patient Subtyping via Time-Aware LSTM Networks

Patient Subtyping via Time-Aware LSTM Networks Patient Subtyping via Time-Aware LSTM Networks Inci M. Baytas Computer Science and Engineering Michigan State University 428 S Shaw Ln. East Lansing, MI 48824 baytasin@msu.edu Fei Wang Healthcare Policy

More information

Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection

Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection Author's response to reviews Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection Authors: Jestinah M Mahachie John

More information