Binding Ontologies and Coding Systems to Electronic Health Records and Message

Similar documents
When and Why to use a Classifier?

Investigating implementing CEN with HL7 V3 and SNOMED CT Final Report

CIMI Modeling Architecture, Methodology & Style Guide

Stepwise Knowledge Acquisition in a Fuzzy Knowledge Representation Framework

Deliberating on Ontologies: The Present Situation. Simon Milton Department of Information Systems, The University of Melbourne

Causal Knowledge Modeling for Traditional Chinese Medicine using OWL 2

An introduction to case finding and outcomes

HL7 Cross-Paradigm Specification: CIMI Logical Models, Release 1

An explicit specification of a conceptualization

What Is A Knowledge Representation? Lecture 13

Chapter 02 Developing and Evaluating Theories of Behavior

Building a Diseases Symptoms Ontology for Medical Diagnosis: An Integrative Approach

The openehr approach. Background. Approach. Heather Leslie, July 17, 2014

Role Representation Model Using OWL and SWRL

Representation of Part-Whole Relationships in SNOMED CT

What can Ontology Contribute to an Interdisciplinary Project?

Foundations of AI. 10. Knowledge Representation: Modeling with Logic. Concepts, Actions, Time, & All the Rest

LECTURE 5: REACTIVE AND HYBRID ARCHITECTURES

The Logic of Categorization

Using the CEN/ISO Standard for Categorial Structure to Harmonise the Development of WHO International Terminologies

A Fuzzy Logic System to Encode Emotion-Related Words and Phrases

From where does the content of a certain geo-communication come? semiotics in web-based geo-communication Brodersen, Lars

INTERVIEWS II: THEORIES AND TECHNIQUES 5. CLINICAL APPROACH TO INTERVIEWING PART 1

Automatic generation of MedDRA terms groupings using an ontology

Defined versus Asserted Classes: Working with the OWL Ontologies. NIF Webinar February 9 th 2010

Semantic Interoperability for Health Network. Deliverable 4.4: Report on interface specifications between semantic artefacts

Quality requirements for EHR Archetypes

Using Scripts to help in Biomedical Text Interpretation

Plato's Ambisonic Garden

Artificial Intelligence Programming Probability

Theory, Models, Variables

Why Do It the Hard Way? The Case for an Expressive Description Logic for SNOMED

Clinical Observation Modeling

Semantic Alignment between ICD-11 and SNOMED-CT. By Marcie Wright RHIA, CHDA, CCS

To conclude, a theory of error must be a theory of the interaction between human performance variability and the situational constraints.

Semantic Interoperability for Health Network. Deliverable 4.2: Ontology / Information models covering the heart failure use case

PRSB e-discharge summary phase 2 Medications and medical devices information model

Group Assignment #1: Concept Explication. For each concept, ask and answer the questions before your literature search.

On the diversity principle and local falsifiability

observational studies Descriptive studies

Opposition principles and Antonyms in Medical Terminological Systems: Structuring Diseases Description with Explicit Existential Quantification

PARADIGMS, THEORY AND SOCIAL RESEARCH

Data driven Ontology Alignment. Nigam Shah

Lecture 2: Foundations of Concept Learning

Survey of Knowledge Base Content

A Lexical-Ontological Resource forconsumerheathcare

Student Minds Turl Street, Oxford, OX1 3DH

Combining Archetypes with Fast Health Interoperability Resources in Future-proof Health Information Systems

Thursday, July 14, Monotonicity

Intelligence as the Tests Test It

Chapter 2 What Knowledge Is

Denny Borsboom Jaap van Heerden Gideon J. Mellenbergh

Contribution of Clinical Archetypes, and the Challenges, towards Achieving Semantic Interoperability for EHRs

Martha C. Nussbaum Creating Capabilities. The Human Development Approach Cambrigde Mass.-London, The Belknap Press of Harvard University Press, 2011

Hypothesis-Driven Research

Durkheim. Durkheim s fundamental task in Rules of the Sociological Method is to lay out

A Content Model for the ICD-11 Revision

CHAPTER 4 Designing Studies

Knowledge Representation defined. Ontological Engineering. The upper ontology of the world. Knowledge Representation

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604

What Happened to Bob? Semantic Data Mining of Context Histories

On the constructive nature of natural language quantifiers. Allan Ramsay School of Computer Science, University of Manchester, Manchester M13 9PL, UK

ORC: an Ontology Reasoning Component for Diabetes

Comments on David Rosenthal s Consciousness, Content, and Metacognitive Judgments

Heiner Oberkampf. DISSERTATION for the degree of Doctor of Natural Sciences (Dr. rer. nat.)

III. WHAT ANSWERS DO YOU EXPECT?

Audio: In this lecture we are going to address psychology as a science. Slide #2

Accepted Manuscript. Use of Ontology Structure and Bayesian Models to Aid the Crowdsourcing of ICD-11 Sanctioning Rules

Experimental Design. Dewayne E Perry ENS C Empirical Studies in Software Engineering Lecture 8

Behaviorism: An essential survival tool for practitioners in autism

OBSERVATION-BASED PROACTIVE COMMUNICATION IN MULTI-AGENT TEAMWORK

Foundations for a Science of Social Inclusion Systems

Welcome to next lecture in the class. During this week we will introduce the concepts of risk and hazard analysis and go over the processes that

Remarks on Bayesian Control Charts

Functionalist theories of content

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text

computation and interpretation of indices of reliability. In

Kaiser Permanente Convergent Medical Terminology (CMT)

An OWL Ontology of Norms and Normative Judgements

Volker Halbach, Axiomatic Theories of Truth, Cambridge: Cambridge. University Press, 2011, ix+365pp., $85.00, ISBN

Research Scholar, Department of Computer Science and Engineering Man onmaniam Sundaranar University, Thirunelveli, Tamilnadu, India.

EER Assurance Criteria & Assertions IAASB Main Agenda (June 2018)

Underlying Theory & Basic Issues

ICS 606. Intelligent Autonomous Agents 1. Intelligent Autonomous Agents ICS 606 / EE 606 Fall Reactive Architectures

Including Quantification in Defeasible Reasoning for the Description Logic EL

Ontology of Observing: The Biological Foundations of Self-Consciousness and of The Physical Domain of Existence

Appearance properties

Chapter 02. Basic Research Methodology

Using Semantic Web technologies for Clinical Trial Recruitment

Authoring SNOMED CT Generic Authoring Principles. Penni Hernandez and Cathy Richardson Senior Terminologists

Cambridge Pre-U 9773 Psychology June 2013 Principal Examiner Report for Teachers

Hypertension encoded in GLIF

SkillBuilder Shortcut: Levels of Evidence

Response to the ASA s statement on p-values: context, process, and purpose

Spirituality in the Workplace

AMIA 2005 Symposium Proceedings Page - 166

School of Nursing, University of British Columbia Vancouver, British Columbia, Canada

Answers to end of chapter questions

GE SLO: Ethnic-Multicultural Studies Results

Transcription:

Binding Ontologies and Coding Systems to Electronic Health Records and Message Alan Rector 1, Rahil Qamar 1 & Tom Marley 2 1 University of Manchester 2 University of Salford with acknowledgements to the Semantic Mining Network of Excellence, the UK Connecting for Health programme, and the Terminfo Group rector@cs.man.ac.uk www.co-ode.org www.semanticmining.org/

Plan of the paper The use case and requirements Theoretical background Engineering background Issues in using OWL Summary

Medical IT s odd organisational structure Separate / independent development Medical Ontologies / Terminologies SNOMED, GALEN, NCI thesaurus, potentiual OBO Disease Ontology, etc. Medical information models HL7 messages OpenEHR Archetypes A Common Manifestation of the Oddity The value set problem Even after the ontology is designed, the EHR/Message still has to define the valid value sets in terms of it Related problems Relation of Simple Knowledge Organisation System (SKOS) and other thesauri with ontologies Classes as Values Semantic Web Best Practice Working Group (SWBP) (Noy et al)

What this paper is not about Whether any specific coding system, ontology, or information model is appropriate, true, good, bad, or otherwise The methods presented here are pertinent even with a perfect ontology and a fully adequate and well aligned information model Although they are more obvious with ill formed ontologies and information systems...and do allow us to cope with imperfect ontologies and suboptimal record structures There is another paper to be written on how best to construct ontologies and information models to fit together This is not it We have tested with HL7 and SNOMED becaue that gave us independently developed test cases & NHS paid This implies neither endorsement nor criticism of either

Requirements summary Explicit Code Binding Interface ( CBI ) interface What codes can be used when and where Expressivity Meets requirements on next slide Compositional coding systems (SNOMED-CT) Deal with compositional coding systems Part of general methodology for progressively constraining information models Factoring information models into re-usable submodels Use of standard languages with well defined semantics Formally specified independently testable constraints

Expressivity for the Code Binding Interface Any enumerated list of codes (without their subcodes A code and all its subcodes All the subcodes of a code but not the parent code Any boolean combination of the above

There is no first order solution to these requirements You cannot use a first order expression about the members of classes to select a class and exclude its subclasses Unless you really mean to exclude the individual in the subclasses which in general you do not Diabetes Type 1 Diabetes Diabetes Type 2 Metabolic disorder

Pointed out the obvious Data structures and what they carry information about are different Information models and ontologies are at different levels Information structures are meta to ontologies The purpose of modelling an ontology is to represent our conceptualisation of the world The question is does allow us to make correct predictions about our observations of the world The purpose of modelling an information structure is to specify valid data structures structures to carry information about that world To constrain the data structures to just those which a given software system can process

Data structures and what they carry information about have different characteristics Example: All persons have a sex However not all data structures about people have a field for sex Information structures are intrinsically closed Valid structures can be exhaustively and completely described (up to recursion) Ontologies are intrinsically open We can never describe the world completely

Fundamental task for EHRs and Messages Begin with logical statements about the world Our best efforts to represent what we believe is true Encode those statements in data structures for transmission Provably valid for the software in hand Decode data structures faithfully back to logical statements about the world With only well defined loss of information

From logical statements about the world to information structures and back again (possibly with loss) Logical statements about the world: All diabetes are metabolic diseases John has diabetes & it is brittle and long standing John has diabetes and it is brittle Valid Specifications for data structures Valid diabetic data structures have: a topic of code for diabetes, a diagnosis code that is diabetes or one of its subcodes, and a brittleness code that is one of the subcodes for diabetic brittlenes and nothing else

Caveat: Do not confuse higher order knowledge about the world and meta knowledge about the representation Higher order knowledge about the classes themselves Endangered species Darwin described Galapagos Finches Information about the software artifact Editorial The representation of species in this ontology was authored by Alan Rector on the basis of defintions in the OED, Wordnet, and UMLS Structural The identifier for the class Diabetes in this representation is 12345 In this representation system the immediate primitive superclass of Diabetes is metabolic disorder the inferred immediate superclasses are Chronic disease, Cardiovascular risk factor, what we are concerned with in this paper

Representing Information Models and Codes: Basic approach An information model can the thought of of as A logical theory of classes of information structures The instances of the classes are concrete data structures - EHRs, messages, etc - carrying data about specific patients, tests, organisations, cases of disease,...

Simple example of a class of information structures for Diabetes Three fields - one example of each case Field Valid Code Set Topic Exact code for diabetes Diagnosis The code for diabetes or any kind of diabetes Brittleness Any of the subcodes of brittleness but not brittleness itself

ontology representations and codes The instances of ontology represent (conceptualisations of) things in the world Cases of diabetes Patients Insulin metabolism islet cells The instances in data structures are data items in human artefacts Information structures of associations and attributes, elements, etc. Individual codes represent representations of classes in given representation of an ontology What is true of a given logical / mathematical / software artefact

Model of Meaning & Model of Data structure Model of Meaning/ ontology Diabetes Type 1 Diabetes Diabetes Type 2 Metabolic disorder Model of data structures n Information Model Diabetes Observation topic Participation Observation ONLY is_subcode_of is_subcode_of code_for_ metabolic_disorder code_for_diabetes Model of codes in Information Model Type 1 Diabetes Observation ONLY code_for_ diabetes_type_1 is_subcode_of code_for_ diabetes_type_2 diagnosis

Software Engineering Issues Clean testable, maintainable The Code Binding Interfaced Placholder codes Classses of codes used in the the information model to express constraints Only a member of this class of placeholder codes can be used here But not defined in the information model Defer binding A notion partly taken from Beale s Archetype Definition Language s Ontology section, which would better be called a Code Binding Section And whose semantics are weak - we propose here a methodology for a stronger semantics 17

Binding using placeholders Code Binding Interface

Outline of Process Ontologies Transform Information Models Coding System Import Binding Module Reasoner Validate Applications Data Structures

Doing it in OWL Why use OWL? Treat OWL as a logic constraint language with tools and community Can be used to represent ontologies Can also be used to represent constraints data structures Generic with well defined semantics: It means what it means Forces distinctions often left implicit to be made explicit The good news: Can be explicit about which expressions are open or closed Can be explicit about layering and metamodels The bad news: Must be explicit about which expressions are open or closed Must be explicit about layering and meta models NB using some constructs from OWL 1.1 but included in all current OWL reasoners 20

Key issues in OWL Open World Reasoning Negation means provably false Expressions not present are merely under-specified OWL, Descriptin Logics, FoL False means cannot be proved true What is missing is false Databases, logic programming, most rules systems, SQL, etc. Absence of the Unique Name Assumption Two entities are different only if we say they are different Means that it is easy to say that two things are logically equivalent In most database, logic programming, systems, etc. if two things have different names they are different Therefore: Owl often requires additional explicit axioms (since you have the freedom to do either, you must specify which) Closure axioms when closed world reasoning is required Distinguishing axioms when things are known to be different 21

Example definition of a class of data structures CLASS Diabetic_data_structure has_attr EXACTLY 1 Topic, has_attr EXACTLY 1 Diagnosis, has_attr EXACTLY 1 Brittleness, 22

Example definition of a class of data structures with placeholders CLASS Diabetic_data_structure has_attr EXACTLY 1 Topic, has_attr SOME (Topic & has_code SOME Placeholder_diabetes_only_code), has_attr EXACTLY 1 Diagnosis, has_attr SOME (Diagnosis & has_code SOME Placeholder_diabetes_or_sub), has_attr EXACTLY 1 Brittleness, has_attr SOME (Brittleness & has_code ONLY Placeholder_diab_brittleness_sub). 23

Example definition of a class of data structures with placeholders and closure axiom CLASS Diabetic_data_structure has_attr EXACTLY 1 Topic, has_attr SOME (Topic & has_code SOME Placeholder_diabetes_only_code), has_attr EXACTLY 1 Diagnosis, has_attr SOME (Diagnosis & has_code SOME Placeholder_diabetes_or_sub), has_attr EXACTLY 1 Brittleness, has_attr SOME (Brittleness & has_code ONLY Placeholder_diab_brittleness_sub), has_attr ONLY (Topic OR Diagnosis OR Brittleness). 24

The Code Binding Interface A set of equivalence axioms The code for diabetes (only) Placeholder_diabetes_only_code {code_for_diabetes} The code for diabetes or any of its subcodes Placeholder_diabetes_or_sub {code_for_diabetes} OR is_sub_of VALUE code_for_diabetes The code for any subcode of diabetic_brittleness Placeholder_diab_brittleness_sub is_sub_of VALUE code_for_diabetic_brittleness 25

A fragment of the ontology and the derived coding system CLASS Diabetes Metabolic_disorder, has_quality EXACTLY 1 Brittleness. CLASS Diabetes_type_1 Diabetes, is_caused_by_some (Damage AND has_locus SOME Pancreatic_islet_cells).... INDIVIDUAL code_for_diabetes CODE, has_sub VALUE code_for_diabetes_type_1, has_sub VALUE code_for_diabetes_type_2, has_sub ONLY {code_for_diabetes_type_1, code_for_diabetes_type_2}. 26

A fragment of the ontologym and the derived coding system Note: Not everything in the ontology is necessarily represented in the coding system In this coding system causation by pancreatic islet cell defect is ignored in We could introduce additional features in the coding system not from the ontology Whether or not this would be a good idea We could deliberately create a more complex relationship for has sub A reasonable way to account systematicaly for broader than/narrower than 27

Summary Ontologies represent (our conceptualisation of) the world The criteria for correctness is our predictions of observations of the world Data Structures are used to convey information about (our conceptualisation of) the world They criteria for correctness is validity for use in multiple information systems Codes are data structures which represent the representations in ontological artefacts They are meta to the ontology Ontologies & coding systems will continue to be developed independently Formal interfaces and deferred binding are therefore required between them EVEN WERE THEY PERFECT The mechanism of placeholder codes is presented (in part) here is sufficient to form such a Code Binding Interface 28