Lecture 3: Bayesian Networks 1

Similar documents
IE 5203 Decision Analysis Lab I Probabilistic Modeling, Inference and Decision Making with Netica

Reasoning with Bayesian Belief Networks

A Taxonomy of Decision Models in Decision Analysis

3. L EARNING BAYESIAN N ETWORKS FROM DATA A. I NTRODUCTION

Bayes Theorem Application: Estimating Outcomes in Terms of Probability

Artificial Intelligence Programming Probability

Implementation of Perception Classification based on BDI Model using Bayesian Classifier

TEACHING YOUNG GROWNUPS HOW TO USE BAYESIAN NETWORKS.

CS 4365: Artificial Intelligence Recap. Vibhav Gogate

Bayesian Networks Representation. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University

Bayesian Belief Network Based Fault Diagnosis in Automotive Electronic Systems

Prof. Carlos Bana e Costa

Outline. What s inside this paper? My expectation. Software Defect Prediction. Traditional Method. What s inside this paper?

MS&E 226: Small Data

A Tutorial on Bayesian Networks for System Health Management

Expert Systems. Artificial Intelligence. Lecture 4 Karim Bouzoubaa

DECISION ANALYSIS WITH BAYESIAN NETWORKS

The Development and Application of Bayesian Networks Used in Data Mining Under Big Data

Neuro-Inspired Statistical. Rensselaer Polytechnic Institute National Science Foundation

Artificial Intelligence Lecture 7

How Does Analysis of Competing Hypotheses (ACH) Improve Intelligence Analysis?

Bayesian Perception & Decision for Intelligent Mobility

Lesson 87 Bayes Theorem

Causal Protein Signalling Networks

High-level Vision. Bernd Neumann Slides for the course in WS 2004/05. Faculty of Informatics Hamburg University Germany

Application of Bayesian Network Model for Enterprise Risk Management of Expressway Management Corporation

Expert System for Medical Diagnosis of Hypertension and Anaemia

Progress in Risk Science and Causality

Bayesian Networks in Medicine: a Model-based Approach to Medical Decision Making

MTAT Bayesian Networks. Introductory Lecture. Sven Laur University of Tartu

CS343: Artificial Intelligence

CISC453 Winter Probabilistic Reasoning Part B: AIMA3e Ch

A Bayesian Network Analysis of Eyewitness Reliability: Part 1

Representation and Analysis of Medical Decision Problems with Influence. Diagrams

in Engineering Prof. Dr. Michael Havbro Faber ETH Zurich, Switzerland Swiss Federal Institute of Technology

Prentice Hall. A Survey of Mathematics with Applications, 7th Edition Mississippi Mathematics Framework 2007 Revised,

Causal Knowledge Modeling for Traditional Chinese Medicine using OWL 2

Durham Research Online

Artificial Intelligence

A Bayesian Network Model for Analysis of the Factors Affecting Crime Risk

Introducing Bayesian Networks

How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection

Bugbears or Legitimate Threats? (Social) Scientists Criticisms of Machine Learning. Sendhil Mullainathan Harvard University

Bayesian networks for decision making under uncertainty How to combine data, evidence, opinion and guesstimates to make decisions

Supplementary notes for lecture 8: Computational modeling of cognitive development

Semiotics and Intelligent Control

Bayesian Concepts: An Introduction

Knowledge Based Systems

Reasoning with Uncertainty. Reasoning with Uncertainty. Bayes Rule. Often, we want to reason from observable information to unobservable information

Machine Learning-based Inference Analysis for Customer Preference on E-Service Features. Zi Lui, Zui Zhang:, Chenggang Bai-', Guangquan Zhang:

How do you design an intelligent agent?

Probabilistic Graphical Models: Applications in Biomedicine

Posture Monitor. User Manual. Includes setup, guidelines and troubleshooting information for your Posture Monitor App

ERA: Architectures for Inference

Phone Number:

Spatial Cognition for Mobile Robots: A Hierarchical Probabilistic Concept- Oriented Representation of Space

Pythia WEB ENABLED TIMED INFLUENCE NET MODELING TOOL SAL. Lee W. Wagenhals Alexander H. Levis

MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1. Lecture 27: Systems Biology and Bayesian Networks

Problem Set and Review Questions 2

Chapter 2. Knowledge Representation: Reasoning, Issues, and Acquisition. Teaching Notes

Four Example Application Domains

P(HYP)=h. P(CON HYP)=p REL. P(CON HYP)=q REP

Bayes Linear Statistics. Theory and Methods

Evaluation of Bayesian Networks Used for Diagnostics 1

VIDEO SURVEILLANCE AND BIOMEDICAL IMAGING Research Activities and Technology Transfer at PAVIS

Bayesian (Belief) Network Models,

Using Dynamic Bayesian Networks for a Decision Support System Application to the Monitoring of Patients Treated by Hemodialysis

CPS331 Lecture: Coping with Uncertainty; Discussion of Dreyfus Reading

LSP 121. LSP 121 Math and Tech Literacy II. Topics. Risk Analysis. Risk and Error Types. Greg Brewster, DePaul University Page 1

CS324-Artificial Intelligence

Cost-Utility Analysis (CUA) Explained

Intelligent Agents. Philipp Koehn. 16 February 2017

CREATING (??) A SAFETY CULTURE. Robert L Carraway, Darden School of Business 2013 Air Charter Safety Symposium February 2013

Causal learning in an imperfect world

Lecture 9 Internal Validity

Knowledge Discovery for Clinical Decision Support

A Decision-Theoretic Approach to Evaluating Posterior Probabilities of Mental Models

Introduction to Stocks & Flows

NON-INTRUSIVE REAL TIME HUMAN FATIGUE MODELLING AND MONITORING

Scenes Categorization based on Appears Objects Probability

Continuous/Discrete Non Parametric Bayesian Belief Nets with UNICORN and UNINET

Rethinking Cognitive Architecture!

observational studies Descriptive studies

Bayesian Inference Bayes Laplace

An assistive application identifying emotional state and executing a methodical healing process for depressive individuals.

Model calibration and Bayesian methods for probabilistic projections

Representing Variable Source Credibility in Intelligence Analysis with Bayesian Networks

Decisions and Dependence in Influence Diagrams

Hypothesis-Driven Research

A Biostatistics Applications Area in the Department of Mathematics for a PhD/MSPH Degree

A probabilistic method for food web modeling

Collaborative Project of the 7th Framework Programme. WP6: Tools for bio-researchers and clinicians

Artificial Doctors In A Human Era

Lecture 2 Agents & Environments (Chap. 2) Outline

Using Probabilistic Reasoning to Develop Automatically Adapting Assistive

How to Escape the Statistical Purgatory of Stress Testing. Riccardo Rebonato Oxford, PIMCO

Macro Programming through Bayesian Networks: Distributed Inference and Anomaly Detection

Chronic Obstructive Pulmonary Disease, shortened to COPD, is an umbrella term for a group of conditions which cause long-term damage to the airways.

AGENT-BASED SYSTEMS. What is an agent? ROBOTICS AND AUTONOMOUS SYSTEMS. Today. that environment in order to meet its delegated objectives.

An Introduction to Bayesian Networks and their Contemporary Applications. Table of Contents

Transcription:

Lecture 3: Bayesian Networks 1 Jingpeng Li 1 Content Reminder from previous lecture: Bayes theorem Bayesian networks Why are they currently interesting? Detailed example from medical diagnostics Bayesian networks and decision making What are Bayesian networks used for? More real-world applications Summary: Bayesian networks References 2 1

Bayes theorem For any two events, A and B: p(b A) = p(a B) x p(b) / p(a) 'p(a)' as "the probability of A", and 'p(a B)' as "the probability of A given that B has occurred". Allows us to update all the probabilities in a Bayesian Network, when any one piece of information changes. Simple example: Winter in Stirling. Rain: 50%, Cloudy: 80% (sometimes it is cloudy without rain). We know that If it rains it is cloudy (i.e. p(c R) = 1.0) What are the chances of rain given that it is cloudy? Bayes: p(r C) = p(r)p(c R)/p(C) = 0.5 x 1.0 / 0.8 = 0.625 = 5/8 So 5/8 of the time, in Stirling during winter, if it is cloudy then it is rainy 3 Bayesian Networks What are they? Bayesian nets are a network-based framework for representing and analysing models involving uncertainty What are they used for? Intelligent decision aids, data fusion, intelligent diagnostic aids, automated free text understanding, data mining Where did they come from? Cross fertilization of ideas between the artificial intelligence, decision analysis, and statistic communities 4 2

Why are Bayesian Networks interesting nowadays? Development of propagation algorithms followed by availability of easy to use commercial software Growing number of creative applications: dementia diagnosis, cancer care symptom modelling, likelihood of car purchase How are they different from other: Knowledge-based systems tools: uncertainty is handled in mathematically rigorous yet efficient and simple way Probabilistic analysis tools: network representation of problems, use of Bayesian statistics, and the synergy between these 5 Example from medical diagnostics Visit to Asia Smoking Patient Information Tuberculosis Lung Cancer Bronchitis Diseases Tuberculosis or Cancer XRay Result Dyspnea Diagnostic Tests Network represents a knowledge structure that models the relationship between diseases, their causes and effects, patient information and diagnostic tests 6 3

Characteristics of the links in a BN The direction of the link arrows roughly corresponds to causality The nodes higher up in the diagram tend to influence those below The links may from loops, but they may not form cycles This does not limits the modeling power of the network. We need to be careful when building BNs Avoiding cycles makes possible very fast update algorithms. 7 Example from medical diagnosis Knowledge from medical text books and public statistics to incorporate in a Bayesian network model 30% of the UK population smokes Lung cancer can be found in about 70 people per 100,000 TB occurs in about 10 people per 100,000. Bronchitis can be found in about 800 people per 100,000. Dyspnea can be found in about 10% of people, but most of that is due to asthma and causes other than TB, lung cancer, or bronchitis 8 4

A first attempt using only text book information and public statistics. We need to gather more data specific for a clinic treating respiratory diseases 9 Data related to patients with respiratory conditions 50% of patients smoke. 1% have TB 5.5% have lung cancer 45% have some form of mild or chronic bronchitis This would represent a new patient that has been referred to the clinic from whom we do not have yet any additional knowledge As more knowledge is gathered for each particular patient, the probabilities in the network will automatically adjust. 10 5

New patient arrives. We gather specific information about her. She tells us that she is often short of breath (dyspnea). We add this to the network (Dyspnea Present = 100) The probabilities for all three illnesses has increased. Why is this? Most likely disease now is Bronchitis, but we need more information! 11 We ask her if she has been to Asia recently. Surprisingly, she answers "yes The chances of TB has increased from 2% to 9% The chances of lung cancer and bronchitis have decreased. Why is this? We need more information for an accurate diagnosis 12 6

We ask if the patient smokes, the answer is yes The current best hypothesis is still Bronchitis To be sure, we order a diagnostic X-Ray 13 If the X-Rays is normal: this strongly confirms the possibility of a Bronchitis, and reduces the likelihood of TB or lung cancer 14 7

If the X-Rays is Abnormal: Big difference. Bronchitis is still the most probable of the three separate illnesses, but it is less than the combination hypothesis of TB or Lung Cancer. We can decided to conduct further test (blood, tissue, etc.) 15 Bayesian networks and decision theory Decision theory: interdisciplinary subject. Science of decision making. Example of decisions Shall I bring the umbrella today? I am looking for a house to buy. Shall I buy this one? There is the notion of utility that the user wants to maximise (wellbeing, leisure time, money, increased survival, etc.) BNs can be extended to compute utility, given the degree of knowledge we have on a situation BNs have become very popular in business and civic decision making The idea is to find a plan that maximises the expected utility Decision networks: produce the best decision for the user 16 8

What are Bayesian networks used for? In any situation where modeling an uncertain reality is involved (i.e. probabilities are present) Decision support: when ever it is helpful to make intelligent, justifiable, quantifiable decisions that will maximise the changes of a desirable outcome Application areas: Diagnosis Prediction Risk assessment and Finance Sensor fusion Monitoring and alerting 17 Diagnosis Typical example of a Diagnostic networks The diagnosis can be medical or mechanical Many industrial applications are for determining component failure (nuclear, airline, construction industry) Top nodes: predispositions which influence the likelihood of diseases Second and third layer (potentially more layers): diseases or internal conditions and failure states Last layer: nodes of observables Links: correspond to causation 18 9

Applications of Bayesian networks Prediction BN present casual chains (i.e cause-effect relationships between parent and child nodes Given evidence of past events, run the BN to see what the most likely future outcomes will be Examples Weather forecasting Stock market prediction Ecological modeling Financial risk management Difficult decisions need to be made, where often all the factors influencing a case are unknown BN are able to make intelligent, quantifiable decisions whatever information is available. Examples: Bank officers Insurance underwriters Investment advisors 19 Applications of Bayesian networks Sensor fusion Data from various sources must be integrated to arrive to an interpretation Example: Data from various cameras, different angles and resolution BN are robust to missing data, they combine information well A single sensor has limited information, the combination of all the sensors produce a better interpretation Robot vision, sonar image Monitoring and alerting An extension to diagnosing a system Deciding when to send an alert that the system is not working properly There are high costs for both: when the system fails, and when sending a false alarm Example: Vista used by the NASA. Interpreted live telemetry and provided advice on possible failures of the propulsion system 20 10

More real-world applications Industrial Processor Fault Diagnosis - by Intel Auxiliary Turbine Diagnosis - GEMS by GE Diagnosis of space shuttle propulsion systems - VISTA by NASA/Rockwell Situation assessment for nuclear power plant - NRC Military Automatic Target Recognition - MITRE Autonomous control of unmanned underwater vehicle - Lockheed Martin Assessment of Intent Medical Diagnosis Internal Medicine Pathology diagnosis - Intellipath by Chapman & Hall Breast Cancer Manager with Intellipath Commercial Financial Market Analysis Information Retrieval Software troubleshooting and advice - Windows 95 & Office 97 Pregnancy and Child Care - Microsoft Software debugging - American Airlines SABRE online reservation system 21 Interesting properties of BNs Probabilities need not be exact to be useful Approximate probabilities, even subjective ones, give good results BNs are quite robust to imperfect knowledge Combinations of several strands of imperfect knowledge can produce strong conclusions Causal Conditional Probabilities are easier to estimate than the reverse People are better at estimating probabilities "in the forward direction Example: doctors can give probability estimates for "if the patient has lung cancer, what are the chances their X-ray will be abnormal?", rather than the reverse, "if the X-ray is abnormal, what are the chances of lung cancer being the cause?" 22 11

Summary: Bayesian Networks Represent joint probability distribution as a directed graph: structure for representing knowledge about uncertain variables computational architecture for computing the impact of evidence on beliefs Knowledge structure: variables are depicted as nodes arcs represent probabilistic dependence between variables conditional probabilities encode the strength of the dependencies Computational architecture: computes posterior probabilities given evidence about selected nodes exploits probabilistic independence for efficient computation 23 References Netica Tutorial: An introduction Tutorial on Bayes Nets by Norsys Software Corp. Asia example Lauritzen, Steffen L. and David J. Spiegelhalter (1988) "Local computations with probabilities on graphical structures and their application to expert systems" in Journal Royal Statistics Society B, 50(2), 157-194. Other tutorials What Are "Bayesian Network Models? by Bruce G. Marcot Tutorial: Introduction to Belief Networks by Amos Storkey Bayesian Networks Models in Ecology by Bruce G. Marcot 24 12