Pavlovian, Skinner and other behaviourists contribution to AI

Similar documents
Module 1. Introduction. Version 1 CSE IIT, Kharagpur

Behavior Architectures

Learning and Adaptive Behavior, Part II

Lesson 6 Learning II Anders Lyhne Christensen, D6.05, INTRODUCTION TO AUTONOMOUS MOBILE ROBOTS

Dr. Braj Bhushan, Dept. of HSS, IIT Guwahati, INDIA

CS148 - Building Intelligent Robots Lecture 5: Autonomus Control Architectures. Instructor: Chad Jenkins (cjenkins)

(c) KSIS Politechnika Poznanska

Unmanned autonomous vehicles in air land and sea

Agent-Based Systems. Agent-Based Systems. Michael Rovatsos. Lecture 5 Reactive and Hybrid Agent Architectures 1 / 19

Chapter 5: How Do We Learn?

Agents. Environments Multi-agent systems. January 18th, Agents

Robot Learning Letter of Intent

An Artificial Synaptic Plasticity Mechanism for Classical Conditioning with Neural Networks

LECTURE 5: REACTIVE AND HYBRID ARCHITECTURES

ICS 606. Intelligent Autonomous Agents 1. Intelligent Autonomous Agents ICS 606 / EE 606 Fall Reactive Architectures

Robotics Summary. Made by: Iskaj Janssen

Psychology, Ch. 6. Learning Part 1

Semiotics and Intelligent Control

acquisition associative learning behaviorism B. F. Skinner biofeedback

How do you design an intelligent agent?

EEL-5840 Elements of {Artificial} Machine Intelligence

DYNAMICISM & ROBOTICS

Learning. Learning: Problems. Chapter 6: Learning

Reactive agents and perceptual ambiguity

Bronze statue of Pavlov and one of his dogs located on the grounds of his laboratory at Koltushi Photo taken by Jackie D. Wood, June 2004.

ERA: Architectures for Inference

Introduction and Historical Background. August 22, 2007

Intelligent Agents. Philipp Koehn. 16 February 2017

Part I Part 1 Robotic Paradigms and Control Architectures

Oscillatory Neural Network for Image Segmentation with Biased Competition for Attention

Bundles of Synergy A Dynamical View of Mental Function

Embodied Intelligence

Evolution of Plastic Sensory-motor Coupling and Dynamic Categorization

Artificial Neural Networks (Ref: Negnevitsky, M. Artificial Intelligence, Chapter 6)

DEPARTMENT OF ARTS AND EDUCATION COURSE OUTLINE WINTER PY1040 (B3): Basic Psychological Processes 3 (3-0-0) 45 Hours for 15 Weeks

KECERDASAN BUATAN 3. By Sirait. Hasanuddin Sirait, MT

A Reinforcement Learning Approach Involving a Shortest Path Finding Algorithm

Artificial intelligence (and Searle s objection) COS 116: 4/29/2008 Sanjeev Arora

Artificial Intelligence

Psychological Hodgepodge. Mr. Mattingly Psychology

Introduction to Artificial Intelligence 2 nd semester 2016/2017. Chapter 2: Intelligent Agents

Cognitive Neuroscience History of Neural Networks in Artificial Intelligence The concept of neural network in artificial intelligence

DEFINITION. Learning is the process of acquiring knowledge (INFORMATIN ) and new responses. It is a change in behavior as a result of experience

Evolving Imitating Agents and the Emergence of a Neural Mirror System

Behavioural Psychology

Vorlesung Grundlagen der Künstlichen Intelligenz

Classical Conditioning. Learning. Classical conditioning terms. Classical Conditioning Procedure. Procedure, cont. Important concepts

Artificial Intelligence. Intelligent Agents

Dikran J. Martin Introduction to Psychology

Classical and Instrumental Conditioning. Lecture 8

Psychological Foundations of Curriculum. Kevin Thompson

A Motivation-Based Action-Selection-Mechanism Involving Temporal Prediction

The Advantages of Evolving Perceptual Cues

acquisition associative learning behaviorism A type of learning in which one learns to link two or more stimuli and anticipate events

Neuroplasticity:. Happens in at least 3 ways: - - -

Grounding Ontologies in the External World

Time Experiencing by Robotic Agents

BRAIN PLASTICITY. Neuroplasticity:. Happens in at least 3 ways: - - -

Affective Action Selection and Behavior Arbitration for Autonomous Robots

Developmental Psychology and Learning SOCIAL EDUCATION COURSE Academic year 2014/2015

Integrating Cognition, Emotion and Autonomy

Dynamic Sensory Gating Mechanism. data. Conditional Reactive Hierarchical Memory

APPROACHES IN PSYCHOLOGY

Chapter 1. Give an overview of the whole RL problem. Policies Value functions. Tic-Tac-Toe example

1. A type of learning in which behavior is strengthened if followed by a reinforcer or diminished if followed by a punisher.

Sparse Coding in Sparse Winner Networks

Cognitive Robotics. Outline

Learning Theories Reviewed

LEARNING. Learning. Type of Learning Experiences Related Factors

Artificial Intelligence Lecture 7

Lecture 5- Hybrid Agents 2015/2016

Artificial Intelligence Agents and Environments 1

M.Sc. in Cognitive Systems. Model Curriculum

Biologically-Inspired Human Motion Detection

Chapter 1: Behaviorism and the beginning of cognitive science

DEPARTMENT OF ARTS AND EDUCATION COURSE OUTLINE FALL 2015 PY 1040 (C2) BASIC PSYCHOLOGICAL PROCESSES 3 (3-0-0) 45 Hours

Modeling Individual and Group Behavior in Complex Environments. Modeling Individual and Group Behavior in Complex Environments

Challenges of Embodied Intelligence

Learning the Meaning of Neural Spikes Through Sensory-Invariance Driven Action

Lecture 5: Learning II. Major Phenomenon of Classical Conditioning. Contents

Conditioning and Concept Formation in Embodied Agents

Classical Conditioning Classical Conditioning - a type of learning in which one learns to link two stimuli and anticipate events.

Challenges and opportunities for humanmachine collaboration at beyond human scales

Unit 06 - Overview. Click on the any of the above hyperlinks to go to that section in the presentation.

Oxford Foundation for Theoretical Neuroscience and Artificial Intelligence

Myers PSYCHOLOGY. (7th Ed) Chapter 8. Learning. James A. McCubbin, PhD Clemson University. Worth Publishers

NEOCORTICAL CIRCUITS. specifications

Classical conditioning or classical learning theory

Unit 6 Learning.

AU B. Sc.(Hon's) (Fifth Semester) Esamination, Introduction to Artificial Neural Networks-IV. Paper : -PCSC-504

Associative Learning

Artificial Intelligence

An Escalation Model of Consciousness

38. Behavior-Based Systems

A Neurally-Inspired Model for Detecting and Localizing Simple Motion Patterns in Image Sequences

A brief comparison between the Subsumption Architecture and Motor Schema Theory in light of Autonomous Exploration by Behavior

Chapter 5: Learning and Behavior Learning How Learning is Studied Ivan Pavlov Edward Thorndike eliciting stimulus emitted

Introduction to Computational Neuroscience

3/7/2010. Theoretical Perspectives


Transcription:

Pavlovian, Skinner and other behaviourists contribution to AI Witold KOSIŃSKI Dominika ZACZEK-CHRZANOWSKA Polish Japanese Institute of Information Technology, Research Center Polsko Japońska Wyższa Szko la Technik Komputerowych ul. Koszykowa 86, 02-008 Warszawa wkos@pjwstk.edu.pl mado@pjwstk.edu.pl PJWSTK Avatar Page 1 of 12

1. Abstract Abstract Page 2 of 12 A version of the definition of intelligent behaviour will be supplied in the context of real and artificial systems. Short presentation of principles of learning, starting with Pavlovian s conditioned response (reflex) through reinforced response and operant conditioning of Thorndike and Skinner will be given. Some tools of artificial intelligence that act according to those principle will be presented. An attempt will be made to show how some simple rules for behaviour modifications can lead to a complex intelligent behaviour. Contents Pavlov -a Nobel price winner Intelligence : visions of main behaviourists resume Critics and propositions for solution Pavlov[1906]

Page 3 of 12 2. 1. Abilities to: reasoning imagination insight judgement Binet test, IQ test 2. Three fundamental cognitive processes: abstraction learning dealing with novelty. 3. Ability to profit from experiments ability to behave adaptively, to function successfully within particular environments.

Page 4 of 12 3. Adaptive changes of behaviour = learning Behaviour is considered intelligent when it can be seen to be adaptive. Critics: there are many behavioural changes that one would like to call learning although they are not at all adaptive. We call behaviour intelligence only when we see how that behaviour is adaptive. intelligence is in the eyes of the observer [Brooks 1991] Principle of learning

4. Biological (animal) learning Abstract Page 5 of 12 Stimuli response associations make the behaviour unstable (depends on the place, initial position A 0 ). Stimulus approach associations make the behaviour stable (the goal stimulus S 0 can be approached from many directions, places A i, i = 1, 2,...) Place approach associations make the behaviour stable (more advanced learning: it requires the ability to use a configuration of stimuli to identify a place to approach). Response chain makes the behaviour unstable (more than a simple S R, single stimulus S 0 triggers a whole sequence of responses: R 0, R 1, R 2,...). Stimulus approach chain makes the behaviour stable (sequence of stimuli:s 0, S 1, S 2,... are approached in order). Place approach association can be also linked in chains (the same stimuli (landmarks) can be used many times to locate different places).

Page 6 of 12 S R S associations, where S and S are situations, one can make inference that if it is in S and performs response R it will end up in S (it is a form of expectation learning and correspond to Tolman s postulate about animals learning). S R S associations, where a stimulus S is followed by a response R with a reinforcement, stimulus, S (S gets more intense as a goal is approched). S S associations are examples of the classical conditioning of Pavlov s dog ( in which the bell has been associated with food which in turn activates salivation) =stimulus substitution theory of conditioning Hebb s theory. Balkenius [1994] Skinner, 1999, Sahrkey and Ziemke[1999]

5. Abstract Page 7 of 12 How to match relevant animal learning theories with AI research: Rule based systems are very often similar to S-R associations. Example: look-up tables (LUT) in both AI and control. (Notice some inputs may be not stored) Samuel s checkers program, 1959, is similar to reinforcement learning described by a set of S R S associations. Most AI planning systems, based on acquisition of knowledge about the world, make use of representations similar to S R S associations of the form: precondition action outcome where the outcome of one rule is the precondition for the next. Planning is a search for a sequence of rules that leads from the start to the goal.

Page 8 of 12 Recent critics: Classical AI systems lack generalization capabilities: complete systems cannot be made from studies of isolated modulus. Classical AI systems lack robustness and cannot perform in real time, and run on sequential machines. Classical AI systems are goal based and organized hierarchically; their processing is done centrally. Real world differs from virtual ones: it has its own dynamics. Virtual world has states with complete information on them, they are static. The frame problem appears, i.e. how models of parts of the real world can be kept in tune with the real world as it is changing, and how to determine which changes in the world are relevant to a given situation without having to test all possible changes.

6. Suggested solutions Abstract Page 9 of 12 Complete system (agent) must possess the architecture: 1. with direct coupling of perception to action, 2. with dynamic interaction with the environment, 3. with intrinsic mechanisms to cope with resource limitations and incomplete knowledge, 4. with decentralized processing, Complete system has to be the autonomous agent (selfsufficient agent, equipped with the appropriate learning mechanism, with its own history, adaptive), Complete system has to be the situated agent (it acquires information about its environment only through its sensors and interacts with the world on its own), Complete system has to be embodied ( it must interact with its environment, is continuously subjected to physical forces, to energy dissipation, to damage, to any influence in the environment).

Page 10 of 12 7. Neural network architecture embedded in robot Robot Example: rule of Hebb to modify the weights w ij of an artificial neural network (ANN) w ij (t + 1) = w ij (t) + ηo j a i where η is the so-called learning rate, o j the output of the node (PE) j, and a i the activation of the node i.

Page 11 of 12 8. Complete system = system equipped with AI Complete system is behaviour based, not goal based. Complete system includes sensors and effectors. Sensory signals (stimuli) should be mapped (relatively) directly to effector motors (responses). Complete system is equipped with a large number of parallel processes connected (only loosely) to one another. This leads to embodied cognitive sciences and to embodied intelligence introduced by Rodney Brooks [1991] and the subsumption architecture. Since complete (i.e. intelligent) systems are behaviour based the behaviourists contributions are obvious.

Acknowledgement. The first author (W.K.) would like to thanks Mrs. and Mr. A. Adamkiewicz for the inspired discussions on psychology during his stay in their Dom na L akach in Izby, Beskid Niski Mountains, in the summer 2003. The help of Dr. Piotr Gol abek in collecting materials used in the preparation of the paper is highly acknowledged. Page 12 of 12