Bayesian Networks Representation. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University

Similar documents
CS 4365: Artificial Intelligence Recap. Vibhav Gogate

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China

EECS 433 Statistical Pattern Recognition

Bayesian (Belief) Network Models,

Introduction to Bayesian Networks

ERA: Architectures for Inference

A Taxonomy of Decision Models in Decision Analysis

Lecture 3: Bayesian Networks 1

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

CISC453 Winter Probabilistic Reasoning Part B: AIMA3e Ch

How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection

Data mining for Obstructive Sleep Apnea Detection. 18 October 2017 Konstantinos Nikolaidis

A Tutorial on Bayesian Networks for System Health Management

Introduction to Computational Neuroscience

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016

Application of Bayesian Network Model for Enterprise Risk Management of Expressway Management Corporation

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India

Using Bayesian Networks to Analyze Expression Data. Xu Siwei, s Muhammad Ali Faisal, s Tejal Joshi, s

Using Bayesian Networks for Daily Activity Prediction

Introducing Bayesian Networks

IE 5203 Decision Analysis Lab I Probabilistic Modeling, Inference and Decision Making with Netica

You can use this app to build a causal Bayesian network and experiment with inferences. We hope you ll find it interesting and helpful.

in Engineering Prof. Dr. Michael Havbro Faber ETH Zurich, Switzerland Swiss Federal Institute of Technology

A Comparison of Collaborative Filtering Methods for Medication Reconciliation

Signal Detection Theory and Bayesian Modeling

DECISION ANALYSIS WITH BAYESIAN NETWORKS

Intelligent Systems. Discriminative Learning. Parts marked by * are optional. WS2013/2014 Carsten Rother, Dmitrij Schlesinger

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Progress in Risk Science and Causality

Reasoning with Bayesian Belief Networks

AClass: A Simple, Online Probabilistic Classifier. Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL

MS&E 226: Small Data

An Introduction to Bayesian Networks and their Contemporary Applications. Table of Contents

Recognizing Scenes by Simulating Implied Social Interaction Networks

Lecture 9 Internal Validity

Hybrid HMM and HCRF model for sequence classification

Part I - Multiple Choice. 10 points total.

You can use this app to build a causal Bayesian network and experiment with inferences. We hope you ll find it interesting and helpful.

Durham Research Online

Identifying the Zygosity Status of Twins Using Bayes Network and Estimation- Maximization Methodology

ParkDiag: A Tool to Predict Parkinson Disease using Data Mining Techniques from Voice Data

AALBORG UNIVERSITY. Prediction of the Insulin Sensitivity Index using Bayesian Network. Susanne G. Bøttcher and Claus Dethlefsen

Bayesian Face Recognition Using Gabor Features

Artificial Intelligence Programming Probability

Lecture 13: Finding optimal treatment policies

Learning from data when all models are wrong

Explaining Inferences in Bayesian Networks

Prognostic Prediction in Patients with Amyotrophic Lateral Sclerosis using Probabilistic Graphical Models

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

From data to models: incorporating uncertainty into decision support systems. Outline. Probabilistic vs Mechanistic models.

A gentle introduction to the mathematics of biosurveillance: Bayes Rule and Bayes Classifiers

Predicting Group Emotion in Kindergarten Classes by Modular Bayesian Networks

Automatic Extraction of Synoptic Data. George Cernile Artificial Intelligence in Medicine AIM

Argument Analysis Using Bayesian Networks

High-level Vision. Bernd Neumann Slides for the course in WS 2004/05. Faculty of Informatics Hamburg University Germany

Expert System for Medical Diagnosis of Hypertension and Anaemia

Causal Reasoning Engine: An Explanation-Based Approach to Syndromic Surveillance

A Bayesian Network Model for Analysis of Detection Performance in Surveillance Systems

The Development and Application of Bayesian Networks Used in Data Mining Under Big Data

3. L EARNING BAYESIAN N ETWORKS FROM DATA A. I NTRODUCTION

GeNIeRate: An Interactive Generator of Diagnostic Bayesian Network Models

Model Uncertainty in Classical Conditioning

Rethinking Cognitive Architecture!

Mapping FMEA into Bayesian Networks

Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California

Spatial Cognition for Mobile Robots: A Hierarchical Probabilistic Concept- Oriented Representation of Space

Classifying instantaneous cognitive states from fmri data

Artificial Intelligence Lecture 7

Sequential Decision Making

A Bayesian Rule Generation Framework for Omic Biomedical Data Analysis. Jonathan Llyle Lustgarten

Machine Learning-based Inference Analysis for Customer Preference on E-Service Features. Zi Lui, Zui Zhang:, Chenggang Bai-', Guangquan Zhang:

Bayesian Networks for the Human Element Leroy A. "Jack" Jackson and MAJ Jonathan K. Alt, US Army Training and Doctrine Command

Vision as Bayesian inference: analysis by synthesis?

Learning and Using Bayesian Networks for Diagnosis and User Profiling

Macro Programming through Bayesian Networks: Distributed Inference and Anomaly Detection

BREAST CANCER EPIDEMIOLOGY MODEL:

Causal Inference in Statistics A primer, J. Pearl, M Glymur and N. Jewell

Bayesian Concepts in Software Testing: An Initial Review

Introduction to Deep Reinforcement Learning and Control

Approximate Inference in Bayes Nets Sampling based methods. Mausam (Based on slides by Jack Breese and Daphne Koller)

COMBINATION DEEP BELIEF NETWORKS AND SHALLOW CLASSIFIER FOR SLEEP STAGE CLASSIFICATION.

Fusing with Context: A Bayesian Approach to Combining Descriptive Attributes

Bayesian Inference Bayes Laplace

COMP9444 Neural Networks and Deep Learning 5. Convolutional Networks

Oscillatory Neural Network for Image Segmentation with Biased Competition for Attention

Bayesian Belief Network Based Fault Diagnosis in Automotive Electronic Systems

Lecture 11: Clustering to discover disease subtypes and stages

Bayesian Networks for Modeling Emotional State and Personality: Progress Report

Representation and Analysis of Medical Decision Problems with Influence. Diagrams

Machine Learning in Dose-Response Assessment:

A Bayesian Network Model of Knowledge-Based Authentication

Prediction of Successful Memory Encoding from fmri Data

Introduction. Patrick Breheny. January 10. The meaning of probability The Bayesian approach Preview of MCMC methods

Data analysis and binary regression for predictive discrimination. using DNA microarray data. (Breast cancer) discrimination. Expression array data

Bayesian Networks in Medicine: a Model-based Approach to Medical Decision Making

Comparing Decision Support Methodologies for Identifying Asthma Exacerbations

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

Machine Learning and Event Detection for the Public Good

Human Activities: Handling Uncertainties Using Fuzzy Time Intervals

Transcription:

Bayesian Networks Representation Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University March 16 th, 2005

Handwriting recognition Character recognition, e.g., kernel SVMs r r r a c r rr z c c b

Webpage classification Company home page vs Personal home page vs Univeristy home page vs

Handwriting recognition 2

Webpage classification 2

Today Bayesian networks One of the most exciting advancements in statistical AI in the last 10-15 years Generalizes naïve Bayes and logistic regression classifiers Compact representation for exponentially-large probability distributions Exploit conditional independencies

Causal structure Suppose we know the following: The flu causes sinus inflammation Allergies cause sinus inflammation Sinus inflammation causes a runny nose Sinus inflammation causes headaches How are these connected?

Possible queries Flu Allergy Inference Sinus Most probable explanation Headache Nose Active data collection

Car starts BN 18 binary attributes Inference P(BatteryAge Starts=f) 2 18 terms, why so fast? Not impressed? HailFinder BN more than 3 54 = 58149737003040059690390169 terms

Factored joint distribution - Preview Flu Allergy Sinus Headache Nose

Number of parameters Flu Allergy Sinus Headache Nose

Key: Independence assumptions Flu Allergy Sinus Headache Nose Knowing sinus separates the variables from each other

(Marginal) Independence Flu and Allergy are (marginally) independent Flu = t Flu = f More Generally: Allergy = t Allergy = f Flu = t Flu = f Allergy = t Allergy = f

Conditional independence Flu and Headache are not (marginally) independent Flu and Headache are independent given Sinus infection More Generally:

The independence assumption Flu Headache Sinus Allergy Nose Local Markov Assumption: A variable X is independent of its non-descendants given its parents

Explaining away Local Markov Assumption: A variable X is independent of its non-descendants given its parents Flu Allergy Sinus Headache Nose

Naïve Bayes revisited Local Markov Assumption: A variable X is independent of its non-descendants given its parents

What about probabilities? Conditional probability tables (CPTs) Flu Allergy Sinus Headache Nose

Joint distribution Flu Allergy Sinus Headache Nose Why can we decompose? Markov Assumption!

Real Bayesian networks applications Diagnosis of lymph node disease Speech recognition Microsoft office and Windows http://www.research.microsoft.com/research/dtg/ Study Human genome Robot mapping Robots to identify meteorites to study Modeling fmri data Anomaly detection Fault dianosis Modeling sensor network data

A general Bayes net Set of random variables Directed acyclic graph Encodes independence assumptions CPTs Joint distribution:

Another example Variables: B Burglar E Earthquake A Burglar alarm N Neighbor calls R Radio report Both burglars and earthquakes can set off the alarm If the alarm sounds, a neighbor may call An earthquake may be announced on the radio

Another example Building the BN B Burglar E Earthquake A Burglar alarm N Neighbor calls R Radio report

Defining a BN Given a set of variables and conditional independence assumptions Choose an ordering on variables, e.g., X 1,, X n For i = 1 to n Add X i to the network Define parents of X i, Pa Xi, in graph as the minimal subset of {X 1,,X i-1 } such that local Markov assumption holds X i independent of rest of {X 1,,X i-1 }, given parents Pa Xi Define/learn CPT P(X i Pa Xi )

How many parameters in a BN? Discrete variables X 1,, X n Graph Defines parents of X i, Pa Xi CPTs P(X i Pa Xi )

Defining a BN 2 We may not know conditional independence assumptions and even variables Given a set of variables and conditional independence assumptions Choose an ordering on variables, e.g., X 1,, X n For i = 1 to n Add X i to the network There are good orderings and bad ones A bad ordering may need more parents per variable must Define parents of X i, Pa Xi, in graph as the minimal learn more parameters subset of {X 1,,X i-1 } such that local Markov assumption holds X i independent of rest of {X 1,,X i-1 }, given parents Pa Xi Define/learn CPT P(X i Pa Xi ) How???

Learning the CPTs Data For each discrete variable X i x (1) x (m)

Learning Bayes nets Known structure Unknown structure Fully observable data Missing data

Queries in Bayes nets Given BN, find: Probability of X given some evidence, P(X e) Most probable explanation, max x1,,x n P(x 1,,x n e) Most informative query Learn more about these next class

What you need to know Bayesian networks A compact representation for large probability distributions Not an algorithm Semantics of a BN Conditional independence assumptions Representation Variables Graph CPTs Why BNs are useful Learning CPTs from fully observable data Play with applet!!!

Acknowledgements JavaBayes applet http://www.pmr.poli.usp.br/ltd/software/javabayes/ho me/index.html