High-level Vision. Bernd Neumann Slides for the course in WS 2004/05. Faculty of Informatics Hamburg University Germany

Similar documents
High-level Vision. Bernd Neumann Slides for the course in WS 2003/04. Faculty of Informatics Hamburg University Germany

Spatial Cognition for Mobile Robots: A Hierarchical Probabilistic Concept- Oriented Representation of Space

Artificial Intelligence Lecture 7

M.Sc. in Cognitive Systems. Model Curriculum

Bayesian Perception & Decision for Intelligent Mobility

Inferencing in Artificial Intelligence and Computational Linguistics

EEL-5840 Elements of {Artificial} Machine Intelligence

CS343: Artificial Intelligence

Rethinking Cognitive Architecture!

Recognizing Scenes by Simulating Implied Social Interaction Networks

CS 4365: Artificial Intelligence Recap. Vibhav Gogate

ERA: Architectures for Inference

Chapter 2. Knowledge Representation: Reasoning, Issues, and Acquisition. Teaching Notes

Facial Event Classification with Task Oriented Dynamic Bayesian Network

Grounding Ontologies in the External World

Ontology-based Realtime Activity Monitoring Using Beam Search

Development of methods for the analysis of movement and orientation behaviour in wayfinding tasks based on the case study mirror maze

Lecture 2 Agents & Environments (Chap. 2) Outline

Clinical decision support (CDS) and Arden Syntax

CISC453 Winter Probabilistic Reasoning Part B: AIMA3e Ch

Natural Scene Statistics and Perception. W.S. Geisler

Lecture 3: Bayesian Networks 1

VIDEO SURVEILLANCE AND BIOMEDICAL IMAGING Research Activities and Technology Transfer at PAVIS

The Development and Application of Bayesian Networks Used in Data Mining Under Big Data

Vorlesung Grundlagen der Künstlichen Intelligenz

Psychology 466: Judgment & Decision Making

Computer Aided Investigation: Visualization and Analysis of data from Mobile communication devices using Formal Concept Analysis.

Affective Game Engines: Motivation & Requirements

Outline. What s inside this paper? My expectation. Software Defect Prediction. Traditional Method. What s inside this paper?

Evaluating the Causal Role of Unobserved Variables

CHAPTER 6 HUMAN BEHAVIOR UNDERSTANDING MODEL

in Engineering Prof. Dr. Michael Havbro Faber ETH Zurich, Switzerland Swiss Federal Institute of Technology

Stepwise Knowledge Acquisition in a Fuzzy Knowledge Representation Framework

What is Artificial Intelligence? A definition of Artificial Intelligence. Systems that act like humans. Notes

Support system for breast cancer treatment

CS148 - Building Intelligent Robots Lecture 5: Autonomus Control Architectures. Instructor: Chad Jenkins (cjenkins)

What Is A Knowledge Representation? Lecture 13

Models for Inexact Reasoning. Imprecision and Approximate Reasoning. Miguel García Remesal Department of Artificial Intelligence

A Description Logics Approach to Clinical Guidelines and Protocols

382C Empirical Studies in Software Engineering. Lecture 1. Introduction. Dewayne E Perry ENS present, Dewayne E Perry

Lecture 9 Internal Validity

Group Assignment #1: Concept Explication. For each concept, ask and answer the questions before your literature search.

Risk Mediation in Association Rules:

Learning Deterministic Causal Networks from Observational Data

Exploiting deduction and abduction services for information retrieval. Ralf Moeller Hamburg University of Technology

Observations on the Use of Ontologies for Autonomous Vehicle Navigation Planning

Lecture 5- Hybrid Agents 2015/2016

Causal Knowledge Modeling for Traditional Chinese Medicine using OWL 2

Context, information s meaning, and perception

Thinking and Intelligence

A Framework for Conceptualizing, Representing, and Analyzing Distributed Interaction. Dan Suthers

N. Laskaris, S. Fotopoulos, A. Ioannides

An exploration of the predictors of instruction following in an academic environment

Hypothesis-Driven Research

Methodological Issues in Measuring the Development of Character

LECTURE 5: REACTIVE AND HYBRID ARCHITECTURES

CS 771 Artificial Intelligence. Intelligent Agents

Team-Project Geometry and Probability for motion and action

Perceptual Anchoring with Indefinite Descriptions

POND-Hindsight: Applying Hindsight Optimization to POMDPs

Research Methodologies

Hierarchical Task Network Planning with Common-Sense Reasoning for Multiple-People Behaviour Analysis

Analysis A step in the research process that involves describing and then making inferences based on a set of data.

Chapter 7. Mental Representation

Probabilistic Models of the Cortex: Stat 271. Alan L. Yuille. UCLA.

Agents and Environments

Reinforcement Learning

Visual book review 1 Safe and Sound, AI in hazardous applications by John Fox and Subrata Das

Expert judgements in risk and reliability analysis

Computational Cognitive Science

COMP329 Robotics and Autonomous Systems Lecture 15: Agents and Intentions. Dr Terry R. Payne Department of Computer Science

Instructor s Test Bank. Social Research Methods

Intelligent Agents. Soleymani. Artificial Intelligence: A Modern Approach, Chapter 2

International Journal of Computer Engineering and Applications, Volume XI, Issue IX, September 17, ISSN

Agents and Environments

KECERDASAN BUATAN 3. By Sirait. Hasanuddin Sirait, MT

CDR Kimberly Elenberg, Office of Food Defense and Emergency Response, Food Safety Inspection Service, United States Department of Agriculture

Mechanical rationality, decision making and emotions

I. Introduction. Armin Falk IZA and University of Bonn April Falk: Behavioral Labor Economics: Psychology of Incentives 1/18

APPROVAL SHEET. Uncertainty in Semantic Web. Doctor of Philosophy, 2005

Artificial Intelligence CS 6364

Difference to Inference 1. Running Head: DIFFERENCE TO INFERENCE. interactivity. Thomas E. Malloy. University of Utah, Salt Lake City, Utah

How the brain works. ABSTRACT This paper analyses the associative capabilities of the brain and takes the consequences of that capability.

Implicit Information in Directionality of Verbal Probability Expressions

Modeling State Space Search Technique for a Real World Adversarial Problem Solving

Intelligent Agents. BBM 405 Fundamentals of Artificial Intelligence Pinar Duygulu Hacettepe University. Slides are mostly adapted from AIMA

MSc Neuroimaging for Clinical & Cognitive Neuroscience

Design Methodology. 4th year 1 nd Semester. M.S.C. Madyan Rashan. Room No Academic Year

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China

Integrating Cognition, Perception and Action through Mental Simulation in Robots

A behaviour analysis of Theory of Mind: from interpretation to application

School of Nursing, University of British Columbia Vancouver, British Columbia, Canada

Concepts and Categories

COMPUTATIONAL IMAGERY

Gifted/Talented Students from Poverty-Varied Faces

DEVELOPING THE RESEARCH FRAMEWORK Dr. Noly M. Mascariñas

Computational Models for Belief Revision, Group Decisions and Cultural Shifts

Module 1. Introduction. Version 1 CSE IIT, Kharagpur

Bayesian Networks Representation. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University

Survey of Knowledge Base Content

Transcription:

High-level Vision Bernd Neumann Slides for the course in WS 2004/05 Faculty of Informatics Hamburg University Germany neumann@informatik.uni-hamburg.de http://kogs-www.informatik.uni-hamburg.de 1 Contents (1) Introduction to High-level Vision What is Computer Vision? What is High-level Vision? Problem Areas and State-of-the-art Historical Examples Relational Matching Representing Scenes as Relational Structures Matching Taxonomical and Compositional Hierarchies Description Logics for Scene Representation Syntax of RACER Concrete Domains Reasoning Services of RACER Representing Aggregates Logics of Scene Interpretation Finite Model Construction Partial Model Construction Navigating in Hallucination Space 2 1

Contents (2) Scene Interpretation as Configuration Structure-based Configuration Configuration Tool KONWERK Scene Interpretation with KONWERK Project Interlude: Modelling Table-Laying Scenes Performing Scene Interpretation Experiments Probabilistic Models Bayesian Networks Hidden Markov Models Probabilistic Inferences in Knowledge Bases Learning Learning Probability Distributions EM Algorithm 3 Website The website for this course can be reached via http://kogs-www.informatik.uni-hamburg.de/~neumann/hbd-ws-2004/ You will find PDF copies of the slides and possibly other useful information related to the course. The website will be updated each week on Friday. 4 2

Introduction: What is High-level Vision? 5 What is Computer Vision? Computer Vision is the academic discipline dealing with task-oriented reconstruction and interpretation of a scene by means of images. scene: image: reconstruction and interpretation: task-oriented: section of the real world stationary (3D) or moving (4D) view of a scene projection, density image (2D) depth image (2 1/2D) image sequence (3D) computer-internal scene description quantitative + qualitative + symbolic for a purpose, to fulfill a particular task context-dependent, supporting actions of an agent 6 3

What Is High-level Vision? High-level vision is the task of "understanding" a scene beyond singleobject recognition. Typical examples are traffic scene understanding for driver assistance, inferring user intentions in smart-room scenarios, recognizing team behavior in robocup games, discovering criminal acts in monitoring tasks. Characteristics: Interpretations involve several objects and occurrences. Interpretations depend on temporal and spatial relations between parts of a scene Interpretations describe the scene in qualitative terms, omitting geometric details. Interpretations include inferred facts, unobservable in the scene. Interpretations are based on conceptual knowledge and experience about the world. "Scene understanding" means roughly the same as "high-level vision". Examples for High-level Vision (1) high-level vision means understanding every-day occurrences Garbage collection in Hamburg (1 frame of a sequence) We want to recognize parts, activities, intentions, spatial & temporal relations 4

Examples for High-level Vision (2) High-level vision is silent movie understanding Buster Keaton in "The Navigator" We want to recognize episodes, the "story", emotions, funnyness Some application scenarios for high-level vision street traffic observations (long history) cameras monitoring parking lots, railway platforms, supermarkets, nuclear power plants,... video archiving and retrieval soccer commentator smart room cameras autonomous robot applications (e.g. robot watchmen, playmate for children ) 5

Characteristics of High-level Scene Interpretation Tasks interpretations typically involve several interrelated objects spatial and temporal relations are important interpretations may build on common sense knowledge application scenarios are highly diverse domains may be very large learning and adaptation may be required reliability and complexity management may become important issues economical application development requires generic approach Cognitive Computer Vision High-level vision is strongly related to "cognitive vision", a term created for vision comparable to human vision: Cognitive computer vision is concerned with integration and control of vision systems using explicit but not necessarily symbolic models of context, situation and goal-directed behaviour. Cognitive vision implies functionalities for knowledge representation, learning, reasoning about events & structures, recognition and categorization, and goal specification, all of which are concerned with the semantics of the relationship between the visual agent and its environment. Topics of cognitive vision: integration and control explicit models not necessarily symbolic context situation goal-directed behaviour knowledge representation learning reasoning recognition categorization goal specification visual agent 6

Multidisciplinary Contributions to Cognitive Vision Cognitive Vision research requires multidisciplinary efforts and escape from traditional research community boundaries. Knowledge Representation & Reasoning KR languages logic-based reasoning services default theories reasoning about actions & change Description Logics spatial and temporal calculi Robotics planning, goal-directed behaviour manipulation sensor integration navigation localization, mapping, SLAM integrative architectures Computer Vision object recognition, tracking bottom-up image analysis geometry and shape hypothesize-and-test control probabilistic methods Cognitive Vision Cognitive Science psychophysical models neural models conceptual spaces qualitative representations naive physics Learning & Data Mining concept learning inductive generalization clustering knowledge discovery Natural Language high-level concepts qualitative descriptions NL scene descriptions communication Uncertain Reasoning Bayesian nets, belief nets decision & estimation causality probabilistic learning Basic Structure of High-level Scene Interpretation high-level scene interpretations task context scene models geometrical scene description image sequences of dynamic scenes 7

Representation Levels for High-level Scene Interpretation symbolic scene descriptions symbolic domain pictorial scene representations metric domain metric feature representations map-type feature representations signal domain raw image sequence Context and Task Dependence Interpretations may depend on - domain context - spatial context - temporal context - intentional context - task context - communicative context - focus of attention - a priori probabilities Constructing an interpretation is not a mapping from image data into interpretation space. 8

Signal-symbol Problems (1) Mapping from quantitative into qualitative representations Example: "along" "Citytour" (Wahlster 87) Mapping from qualitative into quantitative representations Example: "abbiegen" "typicality fields" (Mohnhaupt & Neumann 91) Signal-symbol Problems (2) Symbol grounding road1 danger1 gully1 hole1 cover1 What does a symbol refer to in the real world? 9

Signal-symbol Problems (3) Grounded symbolic reasoning Deductions from symbolic knowledge about a scene should not only be correct w.r.t. to the underlying logic but also w.r.t. to common sense. Examples: (implies (and house (some near lake)) mosquito-house) (instance house1 house) (instance lake1 lake) (related house1 lake1 near) (instance house1 (not (mosquito-house))) => inconsistent! (instance house1 house) (instance cup1 cup) (related house1 cup1 inside) => inconsistent??? Uncertainty Problems (1) Fuzzyness of concepts Many high-level concepts have unsharp boundaries. "behind" "overtake" "meet" => mapping into logical propositions may be problematic Fuzzy set theory offers "degree of applicability" Probability theory offers statistical measures for language use rear 0.3 0.3 1.0 0.7 0.7 Fuzzy definition of behind 10

Uncertainty Problems (2) Uncertainty of data Example: Object boundaries house boundary is not discernable Image interpretation is fundamentally ill-defined Uncertainty Problems (3) Exploring multiple hypotheses Answers from several disciplines: graph matching heuristic search optimization theory logic theories probability & utility theory case-based reasoning neural networks particle physics (and others) Mixed bottom-up and top-down interpretation strategies have been rarely explored 11

Uncertainty Problems (4) Cultural clash betweeen logical and probabilistic reasoning Probabilistic methods are nor yet seamlessly integrated with logical calculi Interesting recent developments: First-order probabilistic inference (Poole 03) Probabilistic relational models (http://dags.stanford.edu/prms/) Example for reasoning in image interpretation: (from Kanade s invited lecture at IJCAI-03: "Computer Vision: AI or Non-AI Problem?") car on left side of street (uncertain orientation of car) japanese signs => left-hand traffic orientation of car resolved A State-of-the-art Example of Scene Interpretation recognizing assaults recognizing thefts at a phonebooth 12