When and Why to use a Classifier?

Similar documents
Binding Ontologies and Coding Systems to Electronic Health Records and Message

Representation of Part-Whole Relationships in SNOMED CT

What Is A Knowledge Representation? Lecture 13

Causal Knowledge Modeling for Traditional Chinese Medicine using OWL 2

Clinical Observation Modeling

An explicit specification of a conceptualization

Kaiser Permanente Convergent Medical Terminology (CMT)

Why Do It the Hard Way? The Case for an Expressive Description Logic for SNOMED

ORC: an Ontology Reasoning Component for Diabetes

Heiner Oberkampf. DISSERTATION for the degree of Doctor of Natural Sciences (Dr. rer. nat.)

AMIA 2005 Symposium Proceedings Page - 166

An introduction to case finding and outcomes

CLINICIAN-LED E-HEALTH RECORDS. (AKA GETTING THE LITTLE DATA RIGHT) Dr Heather Leslie Ocean Informatics/openEHR Foundation

Ontology-Based Personalized System to Support Patients at Home. Mukasine Angelique. Supervisors Professor Rune Fensli Jan Pettersen Nytun

Time-Oriented Question Answering from Clinical Narratives Using Semantic-Web Techniques

Authoring SNOMED CT Generic Authoring Principles. Penni Hernandez and Cathy Richardson Senior Terminologists

Investigating implementing CEN with HL7 V3 and SNOMED CT Final Report

Harvard-MIT Division of Health Sciences and Technology HST.952: Computing for Biomedical Scientists. Data and Knowledge Representation Lecture 6

Using the CEN/ISO Standard for Categorial Structure to Harmonise the Development of WHO International Terminologies

Exploiting deduction and abduction services for information retrieval. Ralf Moeller Hamburg University of Technology

Defined versus Asserted Classes: Working with the OWL Ontologies. NIF Webinar February 9 th 2010

Isolated systolic hypertension in older patients diuretics (preferred), dihydropyridines

Citation for published version (APA): Geus, A. F. D., & Rotterdam, E. P. (1992). Decision support in aneastehesia s.n.

Clinical Genome Knowledge Base and Linked Data technologies. Aleksandar Milosavljevic

How do Categories Work?

Clinical Decision Support Systems. 朱爱玲 Medical Informatics Group 24 Jan,2003

International Journal of Software and Web Sciences (IJSWS)

Deriving an Abstraction Network to Support Quality Assurance in OCRe

Research on Construction and SWRL Reasoning of Ontology of Maize Diseases

Building a Diseases Symptoms Ontology for Medical Diagnosis: An Integrative Approach

Inferencing in Artificial Intelligence and Computational Linguistics

Foundations of AI. 10. Knowledge Representation: Modeling with Logic. Concepts, Actions, Time, & All the Rest

AN ONTOLOGICAL APPROACH TO REPRESENTING AND REASONING WITH TEMPORAL CONSTRAINTS IN CLINICAL TRIAL PROTOCOLS

What Happened to Bob? Semantic Data Mining of Context Histories

Combining Archetypes with Fast Health Interoperability Resources in Future-proof Health Information Systems

Weighted Ontology and Weighted Tree Similarity Algorithm for Diagnosing Diabetes Mellitus

The openehr approach. Background. Approach. Heather Leslie, July 17, 2014

Use of GELLO v.1.x, GLIF 3.5, SNOMED CT and EN archetypes

Mentors, Coaches and the Empowerment Factor Why Functional Fluency is Important

A Descriptive Delta for Identifying Changes in SNOMED CT

Hypertension encoded in GLIF

APPLYING ONTOLOGY AND SEMANTIC WEB TECHNOLOGIES TO CLINICAL AND TRANSLATIONAL STUDIES

Safe and Economic Re-Use of Ontologies: A Logic-Based Methodology and Tool Support

You can use this app to build a causal Bayesian network and experiment with inferences. We hope you ll find it interesting and helpful.

Chapter 2. Knowledge Representation: Reasoning, Issues, and Acquisition. Teaching Notes

HL7 Cross-Paradigm Specification: CIMI Logical Models, Release 1

Role Representation Model Using OWL and SWRL

IE 5203 Decision Analysis Lab I Probabilistic Modeling, Inference and Decision Making with Netica

Lecture 9: Lab in Human Cognition. Todd M. Gureckis Department of Psychology New York University

Programming with Goals (3)

SAGE. Nick Beard Vice President, IDX Systems Corp.

Application of the Commission's Recommendations for the Protection of People in

CIMI Modeling Architecture, Methodology & Style Guide

#HealthForAll ichc2017.org

BeliefOWL: An Evidential Representation in OWL ontology

How to code rare diseases with international terminologies?


Lecture 2: Foundations of Concept Learning

Knowledge Representation defined. Ontological Engineering. The upper ontology of the world. Knowledge Representation

Development of a methodology and an ontological schema for medical terminology

Types of Modifications

SFHOH4 SQA Unit Code H4PX 04 Contribute to the production of dental images

WHAT ARE DECISION MAKERS BARRIERS, FACILITATORS AND EVIDENCE NEEDS REGARDING PEER SUPPORT WORKING? EXECUTIVE SUMMARY

ICRP 128 ICRP ICRP ICRP 1928

Goal Conflict Concerns. Marc

Research Scholar, Department of Computer Science and Engineering Man onmaniam Sundaranar University, Thirunelveli, Tamilnadu, India.

Clinical decision support (CDS) and Arden Syntax

Support for Cognitive Science Experiments

An Interval-Based Representation of Temporal Knowledge

Measuring impact. William Parienté UC Louvain J PAL Europe. povertyactionlab.org

HL7/LOINC Clinical Document Ontology Implementation Guide. (US Realm) Draft Standard for Trial Use

An Ontology for Healthcare Quality Indicators: Challenges for Semantic Interoperability

IHE Quality, Research and Public Health Technical Framework Supplement. Early Hearing Care Plan (EHCP) Trial Implementation

ORC: an Ontology Reasoning Component for Diabetes

Managing Values in Science and Risk Assessment

CONCEPTUAL FRAMEWORK, EPISTEMOLOGY, PARADIGM, &THEORETICAL FRAMEWORK

or

ADVANCED VBA FOR PROJECT FINANCE Near Future Ltd. Registration no

RESEARCH PROJECT. Comparison of searching tools and outcomes of different providers of the Medline database (OVID and PubMed).

My experiences with Haskell as a person with Asperger s Syndrome

Auditing SNOMED Relationships Using a Converse Abstraction Network

Lesson Overview. Human Genetic Disorders. Lesson Overview Human Genetic Disorders

APPROVAL SHEET. Uncertainty in Semantic Web. Doctor of Philosophy, 2005

Discovering Identity Problems: A Case Study

Case scenarios: Patient Group Directions

Grounding Ontologies in the External World

Closing Plenary April 2019 Business Meeting. Kelly Kuru April 10, 2019

Why Is it so Hard? A. L. Rector Clinical Terminology: 1. Introduction. 1.1 The Problem of Medical Terminology

OWL Ontology for Solar UV Exposure and Human Health

The Human Behaviour-Change Project

Intelligent Agents. CmpE 540 Principles of Artificial Intelligence

Cognition, ontologies and description logics

Overview. cis32-spring2003-parsons-lect15 2

Whatever you think you can do or believe you can do, begin it. Action has magic, grace and power in it. - Goethe

Ontology-Based Modeling of Clinical Reasoning in Traditional Chinese Medicine

Overview EXPERT SYSTEMS. What is an expert system?

Representing clinical concepts

Inference Inspector: Improving the Verification of Ontology Authoring Actions

Visual book review 1 Safe and Sound, AI in hazardous applications by John Fox and Subrata Das

Transcription:

When and Why to use a Classifier? Alan Rector with acknowledgement to Jeremy Rogers, Pieter Zanstra,, & the GALEN Consortium Nick Drummond, Matthew Horridge, Hai Wang in CO-ODE/ ODE/HyOntUSE Information Management Group Dept of Computer Science, U Manchester Holger Knublauch, Ray Fergerson, and the Protégé-Owl Team rector@cs.man.ac.uk co-ode ode-admin@cs.man.ac.uk www.co-ode.org ode.org protege.stanford.org rd.org www.opengalen.org OpenGALEN 1

Reasons to classify (1) Managing Compositional ontologies / Terminologies Conceptual Lego Managing combinatorial explosions - the exploding bicycle Empowering users Just in time ontologies Give the users the Lego set with limited connectors Organising polyhierarchies / Modularizing ontologies Normalising ontologies Multiaxial indexing of resources Providing multiple views - Reorganising the ontology by new abstractions Constraining ontologies & schemas Enforcing constraints Imposing policies For clinical Statements with SNOMED entries in request mode context is request

Reasons to classify(2) Matching instances against classes Resource/service discovery Self-describing storage Archetypes & templates Providing a skeleton for default reasoning & Prototypes (but not to do the reasoning itself) Molluscs typically have shells Cephalopods are kinds of Molluscs but typically do not have shells Nautiloids are kinds of Cephalopods but typically do have shells» Nautilus ancestor are kinds of Nautiloids but do (did) not have shells Biology is full of exceptions

Classification is about Classes Classification works for Organising & constraining classes / schemas Identifying the classes to which an instance definitely belongs Or those to which it cannot belong Classification is open world Negation as unsatisfiability not == impossible ( unsatisfiable ) Databases, logic programming, PAL, queries etc are closed world Negation as failure not == cannot be found

Reasons not to Classify To query large number of instances Open world ( A-Box ) reasoning does not work over large numbers of instances If the question is closed world E.g. Drugs licensed for treatment of asthma If the query requires non-dl reasoning E.g. numerical, optimisation, probabilistic, Would like to have a more powerful hybrid reasoner For Metadata and Higher Order Information Classifiers are strictly first order A few things can be kluged If there are complex defaults and exceptions Prototypical Knowledge E.g. Molluscs typically have shells NB Simple exceptions can be handled, but requires care

Use instead To query large numbers of instances OR If the query is closed world Queries / constraints over databases Instance stores / triple stores / Rules DL-programming JESS, Algernon, Prolog, Belief revision / non-monotonic reasoning If query requires Non DL Reasoning Hybrid reasoners or?swrl? No good examples at the moment For defaults and Exceptions & Prototypical Knowledge Traditional frame systems More expressive default structure than Protégé Exceptions for classes as well as instances» Over-riding rather than narrowing

Classification to build Ontologies: Conceptual Lego gene hand protein extremity cell body expression chronic acute Lung inflammation infection abnormal normal ischaemic deletion polymorphism bacterial

Logic-based Ontologies: Conceptual Lego SNPolymorphism of CFTRGene causing Defect in MembraneTransport of ChlorideIon causing Increase in Viscosity of Mucus in CysticFibrosis Hand which is anatomically normal

Species Genes Protein Linking taxonomies: Conceptual Lego Normalisation CFTRGene in humans Function Disease Protein coded by (CFTRgene & in humans) Membrane transport mediated by (Protein coded by (CFTRgene in humans)) Disease caused by (abnormality in (Membrane transport mediated by (Protein coded by (CTFR gene & in humans))))

Conceptual Lego and Normalisation Practical Example

Take a Few Simple Concepts & Properties

Combine them in Descriptions which can be simple. Sickle cell disease is a disease caused some sickling haemoglobin

or which can be as complex as you like Cytstic fibrosisis is caused by some non- normal ion transport that is the function of a protein coded for by a CFTR gene

Add some definitions Diseases linked to CFTR Genes

We have built a simple tree easy to maintain

Let the classifier organise it

If you want more abstractions, just add new definitions (re-use existing data) Diseases linked to abnormal proteins

And let the classifier work again

And again For a view based on species Diseases linked genes described in the mouse

And let classifier check consistency (My first try wasn t)

Normalising (untangling) Ontologies Structure Structure Part-whole Function Function Part-whole

Untangling and Enrichment Using a classifier to make life easier Substance - Protein - - ProteinHormone - - - Insulin -Steroid - - SteroidHormone ---Cortisol - Hormone - -ProteinHormone - - - Insulin - - SteroidHormone ---Cortisol - Catalyst - - Enzyme ---ATPase -Substance -- Protein - - - Insulin ---ATPase -Steroid - - Cortisol Hormone ProteinHormone SteroidHomone Catalyst Enzyme - PhsioloicRole --HormoneRole - - CatalystRole Substance & playsrole-somevaluesfrom HormoneRole Protein & playsrole somevaluesfrom HormoneRole Steroid & playsrole somevaluesfrom HormoneRole Substance & playsrole somevaluesfrom CatalystRole Protein & playsrole somevaluesfrom CatalystRole Insulin playsrole somevaluesfrom HormoneRole Cortisol playsrole somevaluesfrom HormoneRole ATPase playsrole somevaluesfrom CatalystRole Substance - Protein --ProteinHormone - - - Insulin --Enzyme ---ATPase -Steroid --SteroidHomone^ ---Cortisol -Hormone --ProteinHormone^ - - - Insulin^ --SteroidHormone^ ---Cortisol^ - Catalyst --Enzyme^ ---ATPase^

Normalisation & Quality Assurance Humans recognise errors of commision easily Miss errors of omission Classifiers convert errors of omission to errors of commission Inadequate definitions create orphans BodyPart Parts of Heart Ventricle CardiacSeptum Classifiers flag errors of commision Over definition leads to inconsistency (unsatisfiability) Pneumonia located in the brain

Enforcing constraints & policies A class with both necessary & sufficient and additional necessary conditions acts as a rule The Unit testing Framework supports checking that rules are enforced

A Probe class to check a constraint All Tests Passed

Skeleton for Defaults & Exceptions beta blocker use of beta blocker in asthma contraindication asthma serious cardioselective cardioselective beta blocker use of cardioselective beta blocker in asthma contraindication Experience: Normalised ontologists lead to clean default inheritance mild

When to Classify but isn t having a classifier an intollerable overhead for the applications? It depends on the life cycle you choose Life cycles Pre-coordination Just in time coordination Post Coordination

Pre-coordination If the terms can be enumerated in advance Authors Asserted form Sources Classifier Compiler Classified form binary ( high level language intermediate representations ) Overwhelming the dominant pattern today. binary can be OWL Light RDF(S), XML Schema, OBO, Application Users

Commit Results to a Pre-Coordinated Ontology Assert ( Commit ) changes inferred by classifier

Post coordination When there are combinatorially many potential terms Lazy classification on demand kernel model API Externally available resource Big on the outside: small kernel on the inside Avoid the exploding bicycle

Post Coordination Authors Asserted form Sources (& intermediate representations) Asserted Ontology Store Terminology Services Classifier Compiler Application But requires a classifier available at application time Users

Authors A Compromise Just in Time Coordination Asserted form Sources (& intermediate representations) Asserted Ontology Store Terminology Services Classifier Compiler Pre-coordinated Cache Only the occasional new notion Requires classification - need not be real time Application Users

Summary Why Classify Managing Compositional ontologies / Terminologies Conceptual Lego Empowering users - just in time classification Providing views & deferring decisions on abstractions Quality assurance Constraining concepts & schemas Providing a skeleton for default reasoning & Prototypes When to classify Pre-coordination Just-in-time coordination Post-coordination

Summary: When to Classify? Applications do not need a classifier to benefit from classification Pre-coordination If concepts/terms can be predicted When classifier is not available at run time When we must fit with legacy applications Post-coordination When a a few concepts are needed from a large potential set When a classifier is available and time cost is acceptable When applications can be built or adapted to take advantage

Skeleton for Fractal tailoring forms for clinical trials Hypertension Idiopathic Hypertension` In our company s studies In Phase 2 studies Idiopathic Hypertension in our co s Phase 2 study

More on Views A problem in the digital anatomist To an anatomist, the Pericardium and the Heart are separate organs To a clinician they are part of the same organ A disease of the pericardium counts as a kind of heart disease

Represent context and views by variant properties Organ is_part_of OrganPart Pericardium is_clinically_part_of Heart CardiacValve is_structurally_part_of Disease of (Heart or part-of-heart) Disease of Pericardium

Protégé-OWL alternative views Disorder of Clinical heart Disorder of heart of any part of the heart (including clinical and functional parts) Disorder of FMA heart Disorder of heart or any structural part of the heart