CS148 - Building Intelligent Robots Lecture 5: Autonomus Control Architectures. Instructor: Chad Jenkins (cjenkins)

Similar documents
Unmanned autonomous vehicles in air land and sea

(c) KSIS Politechnika Poznanska

Behavior Architectures

Agent-Based Systems. Agent-Based Systems. Michael Rovatsos. Lecture 5 Reactive and Hybrid Agent Architectures 1 / 19

38. Behavior-Based Systems

ICS 606. Intelligent Autonomous Agents 1. Intelligent Autonomous Agents ICS 606 / EE 606 Fall Reactive Architectures

On Three Layer Architectures (Erann Gat) Matt Loper / Brown University Presented for CS296-3

LECTURE 5: REACTIVE AND HYBRID ARCHITECTURES

Robotics Summary. Made by: Iskaj Janssen

DYNAMICISM & ROBOTICS

Lecture 5- Hybrid Agents 2015/2016

Part I Part 1 Robotic Paradigms and Control Architectures

Module 1. Introduction. Version 1 CSE IIT, Kharagpur

AIR FORCE INSTITUTE OF TECHNOLOGY

An Overview on Soft Computing in Behavior Based Robotics

A brief comparison between the Subsumption Architecture and Motor Schema Theory in light of Autonomous Exploration by Behavior

Affective Action Selection and Behavior Arbitration for Autonomous Robots

Robot Behavior Genghis, MIT Callisto, GATech

CS 771 Artificial Intelligence. Intelligent Agents

How do you design an intelligent agent?

An Abstract Behavior Representation for Robust, Dynamic Sequencing in a Hybrid Architecture

Robot Learning Letter of Intent

EEL-5840 Elements of {Artificial} Machine Intelligence

MOBILE & SERVICE ROBOTICS RO OBOTIC CA 01. Supervision and control

Intelligent Agents. Philipp Koehn. 16 February 2017

Putting Minsky and Brooks Together. Bob Hearn MIT AI Lab

A Modular Hierarchical Behavior-Based Architecture

Artificial Intelligence

Motion Control for Social Behaviours

Artificial Intelligence Agents and Environments 1

An Escalation Model of Consciousness

Embodiment in GLAIR: A Grounded Layered Architecture. with Integrated Reasoning for Autonomous Agents. Henry Hexmoor. Johan Lammens.

Semiotics and Intelligent Control

Agents and Environments

Pavlovian, Skinner and other behaviourists contribution to AI

Artificial Intelligence Lecture 7

EICA: Combining Interactivity with Autonomy for Social Robots

Introduction to Artificial Intelligence 2 nd semester 2016/2017. Chapter 2: Intelligent Agents

Artificial Intelligence

Chapter 2: Intelligent Agents

Self-organized and Evolvable Cognitive Architecture for Intelligent Agents and Multi-Agent Systems

CS 331: Artificial Intelligence Intelligent Agents

Intelligent Agents. CmpE 540 Principles of Artificial Intelligence

CS 331: Artificial Intelligence Intelligent Agents

Cognitive Neuroscience History of Neural Networks in Artificial Intelligence The concept of neural network in artificial intelligence

M.Sc. in Cognitive Systems. Model Curriculum

Intelligent Agents. Instructor: Tsung-Che Chiang

Agents. Environments Multi-agent systems. January 18th, Agents

Rational Agents (Ch. 2)

CS343: Artificial Intelligence

Intelligent Agents. Instructor: Tsung-Che Chiang

A method to define agricultural robot behaviours

Institute of Psychology C.N.R. - Rome. Using emergent modularity to develop control systems for mobile robots

CS 331: Artificial Intelligence Intelligent Agents. Agent-Related Terms. Question du Jour. Rationality. General Properties of AI Systems

Bundles of Synergy A Dynamical View of Mental Function

Topological Considerations of Memory Structure

ERA: Architectures for Inference

Agents. This course is about designing intelligent agents Agents and environments. Rationality. The vacuum-cleaner world

Princess Nora University Faculty of Computer & Information Systems ARTIFICIAL INTELLIGENCE (CS 370D) Computer Science Department

Intelligent Agents. Soleymani. Artificial Intelligence: A Modern Approach, Chapter 2

Thesis Rehabilitation robotics (RIA) Robotics for Bioengineering Forefront research at PRISMA Lab and ICAROS Center

COMP150 Behavior-Based Robotics

KECERDASAN BUATAN 3. By Sirait. Hasanuddin Sirait, MT

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to publication record in Explore Bristol Research PDF-document

Overview. What is an agent?

Dynamic Control Models as State Abstractions

Introduction and Historical Background. August 22, 2007

Deliberating on Ontologies: The Present Situation. Simon Milton Department of Information Systems, The University of Melbourne

Intelligent Agents. BBM 405 Fundamentals of Artificial Intelligence Pinar Duygulu Hacettepe University. Slides are mostly adapted from AIMA

CS324-Artificial Intelligence

A Hierarchical Architecture for Behavior-Based Robots

ENVIRONMENTAL REINFORCEMENT LEARNING: A Real-time Learning Architecture for Primitive Behavior Refinement

Foundations of AI. 10. Knowledge Representation: Modeling with Logic. Concepts, Actions, Time, & All the Rest

Agents and Environments

Artificial Intelligence. Intelligent Agents

AS robots interactions with humans become increasingly

Artificial Intelligence CS 6364

NEURAL SYSTEMS FOR INTEGRATING ROBOT BEHAVIOURS

COMP329 Robotics and Autonomous Systems Lecture 15: Agents and Intentions. Dr Terry R. Payne Department of Computer Science

Toward A Cognitive Computer Vision System

Intelligent Autonomous Agents. Ralf Möller, Rainer Marrone Hamburg University of Technology

Lecture 6. Perceptual and Motor Schemas

Development of Action Server and Client for Pick and Place with PR2 Robot

Intelligent Agents. Chapter 2 ICS 171, Fall 2009

What is AI? The science of making machines that:

Dr. Mustafa Jarrar. Chapter 2 Intelligent Agents. Sina Institute, University of Birzeit

AGENT-BASED SYSTEMS. What is an agent? ROBOTICS AND AUTONOMOUS SYSTEMS. Today. that environment in order to meet its delegated objectives.

Lecture 13: Finding optimal treatment policies

Rational Agents (Ch. 2)

Gaze Control in a Multiple-Task Active-Vision System


Behavior-based Robotics And The Reactive Paradigm A Survey

Selecting Behaviors using Fuzzy Logic

Reactivity and Deliberation in Decision-Making Systems

Mike Davies Director, Neuromorphic Computing Lab Intel Labs

Integrating Cognition, Emotion and Autonomy

Web-Mining Agents Cooperating Agents for Information Retrieval

Defect Removal. RIT Software Engineering

Reactive agents and perceptual ambiguity

High-level Vision. Bernd Neumann Slides for the course in WS 2004/05. Faculty of Informatics Hamburg University Germany

Transcription:

Lecture 5 Control Architectures Slide 1 CS148 - Building Intelligent Robots Lecture 5: Autonomus Control Architectures Instructor: Chad Jenkins (cjenkins)

Lecture 5 Control Architectures Slide 2 Administrivia How are the labs/projects going? Standard track on Thursday First part of lab: demonstrate robot implementation Assignment of Lab/Project 2 Advanced track Setting up individual project discussion meetings Assignment of Lab 3 on Thursday articulated structure in Gazebo

Lecture 5 Control Architectures Slide 3 Autonomous controllers, in actuality Increasing complexity in DOF requires more sophisticated controllers Objective: reward function trajectory Sub-goals (Actions) Configurations (Desireds) Controller Task-level Controller Pruning or Behavior Indexing x d Motor-level Controller u Commands y Observed states Autonomous control architectures (this lecture) Kinematic states to actuator forces (previous lecture) Robot Dynamics x x States Robot (Plant)

Lecture 5 Control Architectures Slide 4 Task-level Control Objective Sensing (y) Task-level Controller Sub-goals (u) Produce robot control commands using observations from robot sensing (y) to meet task-level objectives to maintain constraint validity control system makes decisions autonomously write code to keep human in the control loop Robot control is the means by which the sensing and action of a robot are coordinated The infinitely many possible robot control programs all fall along a well-defined control spectrum

Lecture 5 Control Architectures Slide 5 Traditional AI approach to robot control What methods of decision making and AI are used for chess playing? Would these methods work for robot control? Deep Blue/IBM Would they work for a robot used to play chess? Max Planck Institute-Tübingen

Lecture 5 Control Architectures Slide 6 Modular task-specific control Implement individual controllers to achieve subgoals Coordinate controllers through a finite state machine A example room cleaning controller Pick up trash object Pre Post Throw away object Pre Post Pre Post Enter Room Pre Post Survey room for trash

Lecture 5 Control Architectures Slide 8 Robot control spectrum Arkin/ Behavior-Based Robotics

Lecture 5 Control Architectures Slide 9 Thinking versus acting Thinking/Deliberating requires (a lot of) correct information involves planning (looking into the future) flexible for increasing complexity slow, speed decreases with complexity Acting/Reaction innate/built-in or learned (from looking into the past) limited flexibility for increasing complexity fast, regardless of complexity

Lecture 5 Control Architectures Slide 10 Robot control approaches Deliberative (Planner-based) Control Think hard, act later. Reactive Control Don t think, (re)act. Hybrid Control Think and act separately & concurrently. Behavior-Based Control Think the way you act.

Lecture 5 Control Architectures Slide 11 A Brief History Deliberative Control (late 70s) Comp. Schema Theory (early 80s) Subsumption Architecture (mid 80s) Situated Automata (mid 80s) Behavior-Based Systems (late 80s) Hybrid Systems (late 80s/early 90s)

Lecture 5 Control Architectures Slide 12 Deliberative control summary Reasoning only, based on internal models Relies heavily on symbolic representations and world models Hierarchical in structure, typically rigid Communication and control in predetermined and predictable ways Capable of learning and prediction Too slow for real-time response

Lecture 5 Control Architectures Slide 13 Reactive robot control Reactive control is a technique for tightly coupling perception and action, typically in the context of motor behaviors, to produce a timely robotic response in dynamic and unstructured worlds No world models, persisting state, history, or lookahead/search/planning are used Systems are collections of reactive rules Can be quite powerful

Lecture 5 Control Architectures Slide 15 Subsumption Architecture Rodney Brooks 1986, MIT AI Lab A method for structuring reactive systems Bottom-up design/construction Layered sets of reactive rules (implemented as AFSMs) Tight sense-act feedback rules inside the layers

Lecture 5 Control Architectures Slide 16 System decomposition Deliberative: sense/plan/act sense the world plan over possibilities act on plan Subsumption: each module reacts to sensing each module commands actuators module interaction

Lecture 5 Control Architectures Slide 17 Subsumption components Each layer presents a competence (behavior) Higher/newer layers subsume or use lower/older layers Minimal inter-layer communication Minimal use of state Disadvantage: originally designed by hand inhibitor level 3 inputs AFSM outputs level 2 suppressor level 1

Lecture 5 Control Architectures Slide 18 Reactive control summary Highly effective for dynamic domains where fast reaction is critical Can be learned; most of reinforcement learning is aimed at learning reactive policies! When designed, requires a priori enumeration of relevant situations/conditions Doesn t look into the past or the future Excellent and ubiquitous substrate for both hybrid and behavior-based systems

Lecture 5 Control Architectures Slide 19 Hybrid Robot Control Combining reactive and deliberative control Usually called three-layer systems The major challenge is the middle layer, which must coordinate the other two, which operate on very different time-scales and representations (signals v. symbols) Currently one of the two dominant control paradigms in robotics

Lecture 5 Control Architectures Slide 20 Hybrid Robot Control Can take the best of the reactive and deliberative properties May also suffer from the worst of both Designing the middle layer/glue is extremely difficult, and usually special-purpose Layer coordination is an important question, just like behavior coordination in BBC Not best suited for all problems and domains (e.g., multi-robot control)

Lecture 5 Control Architectures Slide 22 Principal interface strategies Selection: Planning is viewed as configuration Advising: Planning is viewed as advice giving Adaptation: Planning is viewed as adaptation of controller Postponing: Planning is viewed as a least commitment process

Lecture 5 Control Architectures Slide 23 Behavior-Based Control An alternative to hybrid control Has the same expressiveness properties as hybrid control Historically grew out of reactive systems, but not constrained Can contain reactive components, just like hybrid systems The key difference is in the deliberative component

Lecture 5 Control Architectures Slide 24 Behavior-Based Control, cont. Behavior-based control systems: are networks of behaviors using uniform representation and similar time-scale respond in real-time (i.e., are reactive) are not stateless (i.e., not merely reactive) utilize distributed representations allow for a variety of behavior coordination mechanisms

Lecture 5 Control Architectures Slide 25 Hybrid vs. Behavior-based Deliberative planners rely heavily on world models can readily integrate world knowledge have broader perspective and scope Behavior-based systems afford modular development real-time robust performance in dynamic world provide for incremental growth tightly coupled with incoming sensory data

Lecture 5 Control Architectures Slide 26 What are behaviors? Behaviors are processes, dynamical systems building blocks for control, representation, and learning in BBC observable, time-extended robot-environment interactions coupling sensing & action control laws/processes that exploit system dynamics to achieve/maintain specific goals take inputs from sensors or other beh s send outputs to effectors or other beh s

Lecture 5 Control Architectures Slide 27 The basis behavior approach Use a small basis set of additive beh s hand-coded, learned, or evolved Based on dual constraints top-down (task) + bottom-up (robot & environ.) Combined with general operators arbitration and fusion Principle applied to a variety of problems coordination & learning in robots teams & humanoids

Lecture 5 Control Architectures Slide 28 Behavior representation Direct feedback loops/control laws mapping sensors to effectors Schemas sensory or motor Procedures any combination: sensory, motor, sensory to motor, or representational (behavior to behavior) Logic programs

Lecture 5 Control Architectures Slide 29 Maja J Mataric, "Environment Learning Using a Distributed Representation", Proceedings, IEEE International Conference on Robotics and Automation, Cincinnati, May 1990, 402-406. Example of representation A network of behaviors representing spatial landmarks, used for path planning by message-passing (Mataric 90)

Lecture 5 Control Architectures Slide 30 Behavior coordination The general action selection problem Two options: arbitration -> selecting among behaviors fusion -> combining behaviors Arbitration is simpler and much more prevalent, also lends itself to learning mechanisms Various control architectures use a mixture of the two options at different levels

Lecture 5 Control Architectures Slide 31 Behavior arbitration Priority-based Subsumption architecture State-based discrete event systems Bayesian decision theory Winner-take-all spreading of activation

Lecture 5 Control Architectures Slide 32 Behavior fusion Voting DAMN (e.g., No hands across America) Fuzzy (formalized voting) Decision theoretic Superposition (linear combinations) potential fields motor schemas dynamical systems

Lecture 5 Control Architectures Slide 33 Example of behavior coordination Fusion: flocking (formations) Arbitration: foraging (search, coverage)

Lecture 5 Control Architectures Slide 34 Behavior-Based Control summary Alternative to hybrid systems; encourages uniform time-scale and representation throughout the system Scalable and robust Behaviors are reusable; behavior libraries Facilitates learning Requires a clever means of distributing representation and any potentially time-extended computation

Lecture 5 Control Architectures Slide 35 Task-level control (summary) Objective Sensing (y) Task-level Controller Sub-goals (u) Produce control to meet objectives, maintain constraints Planning: exhaustive search across control commands intractable due to exponential search space Reactive systems: modules that react to current situation inflexible to accomplishing long-term objectives

Lecture 5 Control Architectures Slide 36 Task-level control (summary) Objective Sensing (y) Task-level Controller Sub-goals (u) Produce control to meet objectives, maintain constraints Planning: exhaustive search across control commands intractable due to exponential search space Task-specific controllers Hybrid systems: plan over a set of reactive behaviors Behavior-based systems: control through interacting behaviors Probablistic road maps: graph of valid configurations Reactive systems: modules that react to current situation inflexible to accomplishing long-term objectives

Lecture 5 Control Architectures Slide 37 Additional references R. Arkin, Behavior-Based Robotics