Semantic Clarification of the Representation of Procedures and Diseases in SNOMED CT

Similar documents
Representation of Part-Whole Relationships in SNOMED CT

A Descriptive Delta for Identifying Changes in SNOMED CT

Building a Diseases Symptoms Ontology for Medical Diagnosis: An Integrative Approach

AMIA 2005 Symposium Proceedings Page - 166

Auditing SNOMED Relationships Using a Converse Abstraction Network

A Description Logics Approach to Clinical Guidelines and Protocols

A Study on a Comparison of Diagnostic Domain between SNOMED CT and Korea Standard Terminology of Medicine. Mijung Kim 1* Abstract

Semantic Alignment between ICD-11 and SNOMED-CT. By Marcie Wright RHIA, CHDA, CCS

Is the International Classification for Patient Safety a Classification?

or

With Semantic Analysis from Noun Phrases to SNOMED CT and Classification Codes H. R. Straub, M. Duelli, Semfinder AG, Kreuzlingen, Switzerland

Automatic generation of MedDRA terms groupings using an ontology

The Semantics of Intention Maintenance for Rational Agents

Ontological Queries Supporting Decision Process in KaSeA System

Temporal Primitives, an Alternative to Allen Operators

SNOMED CT: Who Needs to Know What?

The Significance of SNODENT

Grounding Ontologies in the External World

Biomedical resources for text mining

Effect of (OHDSI) Vocabulary Mapping on Phenotype Cohorts

Authoring SNOMED CT Generic Authoring Principles. Penni Hernandez and Cathy Richardson Senior Terminologists

Role Representation Model Using OWL and SWRL

Deliberating on Ontologies: The Present Situation. Simon Milton Department of Information Systems, The University of Melbourne

To what entities does an ICD-9-CM code refer? A realist approach.

An introduction to case finding and outcomes

Binding Ontologies and Coding Systems to Electronic Health Records and Message

A Unified View of Consequence Relation, Belief Revision and Conditional Logic

Can SNOMED CT replace ICD-coding?

PLANNING THE RESEARCH PROJECT

Operationalizing Prostate Cancer Clinical Pathways: An Ontological Model to Computerize, Merge and Execute Institution-Specific Clinical Pathways

The openehr approach. Background. Approach. Heather Leslie, July 17, 2014

Answers to end of chapter questions

Toward a Diagnosis Driven (Dental) Profession through Controlled Terminology

Stepwise Knowledge Acquisition in a Fuzzy Knowledge Representation Framework

Using SNOMED CT Codes for Coding Information in Electronic Health Records for Stroke Patients

Oncology Ontology in the NCI Thesaurus

Abstract. Introduction and Background. by Hari Nandigam MD, MSHI, and Maxim Topaz, PhD, RN, MA

Chapter 2. Knowledge Representation: Reasoning, Issues, and Acquisition. Teaching Notes

Using the CEN/ISO Standard for Categorial Structure to Harmonise the Development of WHO International Terminologies

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text

Identifying anatomical concepts associated with ICD10 diseases

Issues in Integrating Epidemiology and Research Information in Oncology: Experience with ICD-O3 and the NCI Thesaurus

Subjective Consciousness: A Self-Representational Theory

INTERVIEWS II: THEORIES AND TECHNIQUES 5. CLINICAL APPROACH TO INTERVIEWING PART 1

Copyright Warning & Restrictions

CIMI Modeling Architecture, Methodology & Style Guide

IAASB Exposure Draft, Proposed ISAE 3000 (Revised), Assurance Engagements Other Than Audits or Reviews of Historical Financial Information

USER GUIDE. SNOMED RT Systematized Nomenclature of Medicine Reference Terminology VERSION 1.0

The Logic of Categorization

Knowledge Representation of Traditional Chinese Acupuncture Points Using the UMLS and a Terminology Model

Clinical terms and ICD 11

Opposition principles and Antonyms in Medical Terminological Systems: Structuring Diseases Description with Explicit Existential Quantification

Formalizing UMLS Relations using Semantic Partitions in the context of task-based Clinical Guidelines Model

CSC2130: Empirical Research Methods for Software Engineering

Thinking the environment aurally An enactive approach to auditory-architectural research and design

A Framework Ontology for Computer-based Patient Record Systems

Matching GP terms to the ICD-10-AM index

Copyright Warning & Restrictions

Evaluating OWL 2 Reasoners in the context of Clinical Decision Support in Lung Cancer Treatment Selection

A Difference that Makes a Difference: Welfare and the Equality of Consideration

Analyzing the Semantics of Patient Data to Rank Records of Literature Retrieval

Including Quantification in Defeasible Reasoning for the Description Logic EL

An Evolutionary Approach to the Representation of Adverse Events

NIH Public Access Author Manuscript Stud Health Technol Inform. Author manuscript; available in PMC 2010 February 28.

Cultural Introspection: Findings of a Pilot Study

Bayesian (Belief) Network Models,

A Content Model for the ICD-11 Revision

Perception Lie Paradox: Mathematically Proved Uncertainty about Humans Perception Similarity

Fuzzy-set Qualitative Comparative Analysis Summary 1

Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials

Estimating and Analysing Coordination in Medical Terminologies

An Ontology for Healthcare Quality Indicators: Challenges for Semantic Interoperability

Research Ethics and Philosophies

SNOMED Clinical Terms Fundamentals

APPLYING ONTOLOGY AND SEMANTIC WEB TECHNOLOGIES TO CLINICAL AND TRANSLATIONAL STUDIES

Foundations for a Science of Social Inclusion Systems

COURSE: NURSING RESEARCH CHAPTER I: INTRODUCTION

Group Assignment #1: Concept Explication. For each concept, ask and answer the questions before your literature search.

The Regression-Discontinuity Design

Kaiser Permanente Convergent Medical Terminology (CMT)

Representation Theorems for Multiple Belief Changes

Observations on the Use of Ontologies for Autonomous Vehicle Navigation Planning

Research Scholar, Department of Computer Science and Engineering Man onmaniam Sundaranar University, Thirunelveli, Tamilnadu, India.

FIT ISO STANDARDS COMMITTEE POSITION PAPER ON THE REMOVAL OF THE NOTES TO SECTION OF ISO 17100

Why Do It the Hard Way? The Case for an Expressive Description Logic for SNOMED

What Is A Knowledge Representation? Lecture 13

Ontological Representation of Laboratory Test Observables: Challenges and Perspectives in the SNOMED CT Observable Entity Model Adoption

What can Ontology Contribute to an Interdisciplinary Project?

Interface Terminologies: Facilitating Direct Entry of Clinical Data into Electronic Health Record Systems

How preferred are preferred terms?

Research Methodology in Social Sciences. by Dr. Rina Astini

A Lexical-Ontological Resource forconsumerheathcare

Plan Recognition through Goal Graph Analysis

Temporal Knowledge Representation for Scheduling Tasks in Clinical Trial Protocols

UMLS and phenotype coding

UNIT 5 - Association Causation, Effect Modification and Validity

Closing Plenary April 2019 Business Meeting. Kelly Kuru April 10, 2019

ERA: Architectures for Inference

Conceptual and Empirical Arguments for Including or Excluding Ego from Structural Analyses of Personal Networks

Prototypes Vs Exemplars in Concept Representation

Transcription:

773 Semantic Clarification of the Representation of Procedures and Diseases in SNOMED CT Stefan Schulz a, Udo Hahn b, Jeremy Rogers c a Department of Medical Informatics, Freiburg University, Germany b Language And Information Engineering Lab, Jena University, Germany c School of Computer Science, Manchester University, United Kingdom Abstract SNOMED CT is emerging as a reference terminology for the entire health care process. It claims to be founded on logic-based modeling principles. In this work we analyze a special encoding scheme for SNOMED disease and procedure entities, the so-called relationship groups which had been devised in order to avoid ambiguities in entity definitions. We show that these artifacts may represent hidden mereological relations. We also report discrepancies encountered between the defined semantics of many SNOMED CT entity terms and their intuitive meaning, and inconsistencies detected between the definition of some complex composed entities and the definition of their top-level parents. As a result we formulate recommendations for improvements of SNOMED CT. Keywords: SNOMED, Knowledge Representation, Logic 1. Introduction SNOMED Clinical Terms (SNOMED CT) is a huge clinical terminology constructed by merging, expanding, and restructuring the previous SNOMED version RT and the Clinical Terms Version 3 (former Read Codes). SNOMED CT contains around 364,000 concepts, 984,000 terms and 1.45 million defined relationships between concepts 1. In the coming years it will be deployed for routine usage in several countries (U.S., U.K., Denmark), and it is intensively being analyzed by medical terminologists and decision makers in many other countries. SNOMED CT is a concept-oriented controlled vocabulary which has been designed according to previously suggested criteria for computer-based medical terminologies [1]. SNOMED CT is described as a clinical reference terminology, i.e. a set of concepts and relationships that provides a common reference point for comparison and aggregation of data about the entire health care process [2]. SNOMED CT concepts belong to multiple is-a hierarchies and are related with one another by various semantic relationships such as is-a, has-associated topography, has-action, has-associated morphology. Interestingly, SNOMED CT explicitly encodes part-of relationships only between 1 http://www.snomed.org

774 anatomic entities [3]. There is no relation with which to describe partonomy between diseases or procedure entities. 2. Relationship groups 2 in SNOMED SNOMED NT and CT follow a formal semantics based on KRSS, an early description logic [4], and have used a terminological classifier for terminology development [5,6]. In the UMLS distribution, however, the relational format of the MRREL table requires SNOMED CT content to be compressed into the common OAV (object attribute value triplets) format. For a logic-based model such a representation is ambiguous because it (i) obscures which attribute-value pairs are sufficient for the entity3 definitions, (ii) lacks role quantifications, and (iii) does not indicate whether sets of related attribute-value pairs should be interpreted as disjunction, conjunction or optional. Table 1 gives an example: Table 1. Example of OAV (object attribute-value) format of SNOMED CT entities such as distributed via the UMLS Metathesaurus SNOMED Concept 1 SNOMED Relationship SNOMED Concept 2 Renal glomerular disease has_finding_site Kidney Renal glomerular disease has_onset Gradual onset Renal glomerular disease has_onset Sudden onset Since 2004, the UMLS Rich Release Format (RRF) has encoded further information in the MRSAT table, including which entities are defined or primitive, which OAV triplets are optional (qualifiers) and which are restrictions, and which relationship group each belongs to. The purpose of relationship groups such as introduced by SNOMED is best explained by an example: Removal of foreign body from the stomach by incision involves Stomach structure and Digestive structure as values of the attribute has_procedure_site, and both Incision and Removal as values of the attribute has_method. However, such a simplistic representation is ambiguous: it could also be interpreted as Removal of the stomach and incision of a foreign body. Relationship groups declare associations between sets of OAV triplets (see Table 2). Although each group has an integer value, this does not imply any temporal or other ordering between groups. The SNOMED CT Technical Implementation and Technical Reference makes the following statement about relationship groups: Relationships, for a concept that are logically associated with each other. The Relationship group field in the Relationships Table is used to group these rows together for a concept. As relationship groups occur in about 17,000 disease entities and 13,000 procedure entities, according to [5], this phenomenon constitutes a major issue in SNOMED CT. In [5], Spackman et al. propose a description logics representation for relationship groups, in which they are expressed by an anonymous relation, named rg. From an ontological point of view, the proposed solution is, however, rather obscure. In the following we therefore explore the possible semantics of SNOMED CT relationship groups. We show that some basic assumptions of SNOMED CT are ontologically problematic, and we propose a 2 also called role groups [5] 3 Many ontological assumptions of SNOMED CT are still unclear. E.g., different things like Foot, Absent Foot, Football (qualifier value), Europe, Love, mmol, Yin excess, Kiel Classification, are concepts in SNOMED CT. Hence we use for the sake of neutrality the term entity for what SNOMED names concept.

775 solution for clarification which will be mostly compatible with the current SNOMED CT architecture. Table 2. Entries in the SNOMED CT core relationships table for the entity 64550003: Removal of foreign body from the stomach by incision, using three relationship groups SNOMED Concept 1 SNOMED Relationship SNOMED Concept 2 RG Removal of Foreign Access Open Approach 0 Body from the Stomach Is A Removal of foreign body from 0 by Incision digestive system Is A Removal of foreign body from 0 stomach Is A Incision of stomach 0 Method Removal - action 1 Direct Morphology Foreign body 1 Procedure site-indirect Digestive structure 1 Method Incision - action 2 Procedure site Stomach Structure 2 3. Ontological Analysis of Relationship Groups We refer to the same parsimonious variant of description logics as used by [5]. Entity names are characterized by initial capital letters. They can be joined by the AND operator. As an example, the expression AcuteDigestiveSystemDisorder AND AcuteInflammatoryDisease denotes inflammatory diseases of the digestive system, i.e. the intersection of entities subsumed by AcuteDigestiveSystemDisorder with all those subsumed by the entity AcuteInflammatoryDisease (or the set of all entities subsumed by both). Relation symbols begin with lower case, e.g. hasassociatedmorphology. Roles are formed by a quantifier (here only the existential quantifier,, is used), a relation symbol, followed by a dot and an entity symbol. For example, hasassociatedmorphology.inflammation denotes the entity whose instantiation is the set of all individuals related to an instance of Inflammation by the relation hasassociatedmorphology. We can therefore rewrite the role group 1 and 2 entries in Table 2: (1) RemovalOfForeignBodyFromTheStomachByIncision IMPLIES rg.( hasproceduresite.stomachstructure AND hasmethod.incisionaction) AND rg.( hasproceduresite. DigestiveStructure AND hasdirectmorphology.foreignbody AND hasmethod.removalaction) Let us now look at the parent entities, RemovalOfForeignBodyFromDigestiveSystem and IncisionOfStomach 4. The first is an is-a descendent of RemovalProcedure, and the latter an is-a descendent of IncisionProcedure. Consequently, all instances of the entity RemovalOfForeignBodyFromTheStomachByIncision are instances of both IncisionProcedure and RemovalProcedure and as a result must therefore inherit the properties of both Incision and Removal. This is hardly imaginable: In this case, objects 4 As contained in the SNOMED CT sources from the UMLS, or visualized by the SNOMED CT Browser at http://snomed.vetmed.vt.edu/sct/menu.cfm

776 would be equally incised and removed. In a strict upper level ontology Incision and Removal are expected to be mutually exclusive. In reality, the surgeon first performs the incision and then the removal: Incision and Removal are two separate sub-procedures and so are properly not parents but parts 5 of the entity RemovalOfForeignBodyFromTheStomachByIncision. Fig. 1 gives a graphic outline of this procedure which begins with the incision of the wall of the stomach, followed by the removal action and the closure of the wound (the latter is not mentioned in the procedure definition). These time-dependent sub-procedures stand to the main procedure in a part-of relationship. This is concordant with the commonly accepted mereological (part-whole) view of actions and processes, which are, according to [8] characterized by time-dependent parts. Having this in mind it seems straightforward to re-interpret the relationship group attribute rg in (1) as the mereological primitive has-part: RemovalOfForeignBodyFromTheStomachByIncision IMPLIES has-part.( hasproceduresite.stomachstructure AND hasmethod.incisionaction) AND has-part.( hasproceduresite. DigestiveStructure AND hasdirectmorphology.foreignbody AND hasmethod.removalaction) (2) Similarly, for the parent entities we obtain: IncisionOfStomach IMPLIES has-part.( hasproceduresite.stomachstructure AND hasmethod.incisionaction) (3) RemovalOfForeignBodyFromDigestiveSystem IMPLIES has-part.( hasproceduresite. DigestiveStructure AND hasdirectmorphology.foreignbody AND hasmethod.removalaction) (4) Looking up the SNOMED CT hierarchy, we obtain exactly these definitions after replacing rg by has-part. At a first glance this seems strange, since the main rationale for relationship groups, viz. the avoidance of ambiguities, makes no sense, here. Entity names such as Incision of Stomach, suggest definitions without the has-part role: IncisingAStomach IMPLIES hasproceduresite.stomachstructure AND hasmethod.incisionaction (5) Fig. 1. Graphical Representation of the Process Removal of Foreign Body from the Stomach by Incision 5 Entity A has B as part is equivalent to the DL expression A IMPLIES has-part.b,

777 The semantic difference is the following: Whereas IncisingAStomach denotes the atomic procedure of performing an incision onto a stomach, SNOMED s IncisionOfStomach subsumes any complex procedure during which an incision of stomach is being performed. Analogously, RemovalOfForeignBodyFromDigestiveSystem subsumes any complex procedure during which a foreign body is extracted from the digestive system. Looking still higher up the SNOMED CT hierarchy, IncisionOfStomach is a child entity of IncisionProcedure, which is itself related to an Incision action by the relation has_method: IncisionProcedure IMPLIES rg.( hasmethod.incisionaction) (6) As indicated above, the semantics of rg may be improved to derive: IncisionProcedure IMPLIES has-part.( hasmethod.incisionaction) (7) An IncisionProcedure is, therefore, any procedure which has a part characterized by the enactment of an Incision. Only this broader definition justifies IncisionProcedure being the ancestor of nearly one thousand entities: so many distinct flavours of incision do not exist, but more than one thousand surgical procedures have an incision as part of their description. We have taken our example from the procedure branch of SNOMED CT. We could have used, as well, numerous examples from the disease / disorders branch, e.g. Acute Perforated Appendicitis, which is subsumed by both Inflammation and Perforation (in the above sense). However, there may be situations in SNOMED where the translation of rg as has-part would not be correct. For example, within the current SNOMED content, it is possible to construct the post-coordinated composition of an urgent swab of the left eye, by adding the urgent and left qualifiers as appropriate. The flattened (not role grouped) representation that would result would be: Entity 1 Relationship Entity2 isa Specimen from Conjunctiva specimensourcetopography Conjunctival Structure Urgent Swab of the Left Eye specimenprocedure Taking of Swab priority Urgent laterality Left Here it would then be impossible to tell whether the attribute Left should be applied to the swab or the eye. So, we might want to formalize: UrgentSwabOfLeftEye IMPLIES Specimen from Conjunctiva AND rg.( specimensourcetopography.conjunctivalstructure AND Laterality.Left) AND rg.( specimenprocedure.takingoffswab AND.Priority.Urgent) (8) 4. Conclusion Our analysis of relationship groups in SNOMED CT revealed weaknesses which motivated us to make some recommendations which would improve SNOMED CT in clarity and which would remove inconsistencies from the terminology. These suggestions

778 would encompass only minor modifications of the SNOMED CT architecture: Rename the relationship group attribute rg by has-part or has-subprocess where it appears between a complex process and its subprocesses (i.e. especially in the disease and procedure chapters of SNOMED CT). Make a clearer distinction between atomic entities (such as IncisionAction) and those entities which have atomic entities as parts (such as IncisionProcess). The present entity names are misleading. Finally, one has to take into account, there are scenarios in which the use of relationship groups seem adequate, without, however, corresponding to a mereological relation. A more detailed ontological inquiry of these cases is still due. 5. Acknowledgments This work was supported by the EU Network of Excellence Semantic Interoperability and Data Mining in Biomedicine (NoE 507505), cf. http://www.semanticmining.org. We also thank Ulrike Sattler (Manchester, UK) and Kent Spackman (Portland, OR, U.S.), for their helpful discussions. 6. References [1] Cimino JJ. Desiderata for controlled medical vocabularies in the twenty-first century. Methods of Information in Medicine, 1998: 37(4/5):394-403. [2] Spackman KA, Campbell K, and Cote RA. SNOMED RT: A reference terminology for health care. In Daniel R. Masys, editor, AMIA'97 - Proceedings of the 1997 AMIA Annual Fall Symposium, pp. 640-644. Philadelphia, PA: Hanley & Belfus, 1997. [3] Spackman KA and Reynoso G. Examining SNOMED from the perspective of formal ontological principles: Some preliminary analysis and observations. In Hahn U, Schulz S, and Cornet S, editors, KR-MED 2004 - Proceedings of the 1st International Workshop on Formal Biomedical Knowledge Representation, Collocated with the 9th International Conference on the Principles of Knowledge Representation and Reasoning (KR 2004), pp. 81-87. Whistler, B.C., Canada, June 1, 2004. Bethesda, MD: American Medical Informatics Association (AMIA), 2004. Published via http://ceur-ws.org/vol-102/. [4] Baader F, Calvanese D, McGuinness D, Nardi D, Patel-Schneider P. The Description Logic Handbook. Theory, Implementation and Applications. Cambridge, U.K. Cambridge University Press. [5] Spackman KA, Dionne R, Mays E, and Weis J. Role grouping as an extension to the description logic of ONTYLOG, motivated by entity modeling in SNOMED. In Kohane IS, editor, AMIA 2002 - Proceedings of the Annual Symposium of the American Medical Informatics Association, pp. 712-716. Philadelphia, PA: Hanley & Belfus, 2002. [6] Spackman KA and Campbell KE. Compositional entity representation using SNOMED: Towards further convergence of clinical terminologies. In Chute CG, editor, AMIA'98 - Proceedings of the 1998 AMIA Annual Fall Symposium, pp. 740-744. Philadelphia, PA: Hanley & Belfus, 1998. [7] Schulz S and Hahn U. Medical knowledge reengineering: Converting major portions of the UMLS into a terminological knowledge base. International Journal of Medical Informatics 2001: 64(2/3):207-221. [8] Simons P. Parts: A Study in Ontology. Oxford: Clarendon Press, 1987. Address for correspondence PD Dr. med. Stefan Schulz, Abteilung Medizinische Informatik, Universtätsklinikum Freiburg Stefan-Meier-Str. 26, D-79106 Freiburg (Germany), stschulz@uni-freiburg.de, http://www.imbi.unifreiburg.de/medinf/~schulz.htm