Intelligent Machines That Act Rationally. Hang Li Bytedance AI Lab

Similar documents
Intelligent Machines That Act Rationally. Hang Li Toutiao AI Lab

KECERDASAN BUATAN 3. By Sirait. Hasanuddin Sirait, MT

EEL-5840 Elements of {Artificial} Machine Intelligence

Image Captioning using Reinforcement Learning. Presentation by: Samarth Gupta

Artificial Intelligence

Introduction to Deep Reinforcement Learning and Control

CS324-Artificial Intelligence

Motivation: Attention: Focusing on specific parts of the input. Inspired by neuroscience.

Artificial Intelligence

Intelligent Agents. CmpE 540 Principles of Artificial Intelligence

Sparse Coding in Sparse Winner Networks

M.Sc. in Cognitive Systems. Model Curriculum

CS343: Artificial Intelligence

Artificial Intelligence. Admin.

Inferring Clinical Correlations from EEG Reports with Deep Neural Learning

Grounding Ontologies in the External World

ICS 606. Intelligent Autonomous Agents 1. Intelligent Autonomous Agents ICS 606 / EE 606 Fall Reactive Architectures

Agent-Based Systems. Agent-Based Systems. Michael Rovatsos. Lecture 5 Reactive and Hybrid Agent Architectures 1 / 19

CS 771 Artificial Intelligence. Intelligent Agents

Intelligent Agents. Outline. Agents. Agents and environments

LECTURE 5: REACTIVE AND HYBRID ARCHITECTURES

Functionalism. (1) Machine Functionalism

ISSN (PRINT): , (ONLINE): , VOLUME-5, ISSUE-5,

Connect with amygdala (emotional center) Compares expected with actual Compare expected reward/punishment with actual reward/punishment Intuitive

Artificial Intelligence Lecture 7

Definitions. The science of making machines that: This slide deck courtesy of Dan Klein at UC Berkeley

Vorlesung Grundlagen der Künstlichen Intelligenz

Artificial Neural Networks (Ref: Negnevitsky, M. Artificial Intelligence, Chapter 6)

Agents and Environments

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Marcus Hutter Canberra, ACT, 0200, Australia

Innate and Learned Emotion Network

Computational Cognitive Neuroscience

Artificial Intelligence

Semi-Supervised Disentangling of Causal Factors. Sargur N. Srihari

What is AI? The science of making machines that:

Knowledge Based Systems

Katsunari Shibata and Tomohiko Kawano

ERA: Architectures for Inference

Between-word regressions as part of rational reading

AI and Philosophy. Gilbert Harman. Tuesday, December 4, Early Work in Computational Linguistics (including MT Lab at MIT)

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China


Brain Mechanisms Explain Emotion and Consciousness. Paul Thagard University of Waterloo

Foundations of Artificial Intelligence

Oscillatory Neural Network for Image Segmentation with Biased Competition for Attention

Dynamic Rule-based Agent

Reinforcement Learning

Open-Domain Chatting Machine: Emotion and Personality

Cognitive Neuroscience History of Neural Networks in Artificial Intelligence The concept of neural network in artificial intelligence

The Somatic Marker Hypothesis: Human Emotions in Decision-Making

arxiv: v1 [cs.ai] 5 Oct 2018 October 8, 2018

Fodor on Functionalism EMILY HULL

COMP329 Robotics and Autonomous Systems Lecture 15: Agents and Intentions. Dr Terry R. Payne Department of Computer Science

Reactivity and Deliberation in Decision-Making Systems

Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports

Application of artificial intelligence in military aptitude tests

Learning Objectives. Learning Objectives 17/03/2016. Chapter 4 Perspectives on Consumer Behavior

AI and Philosophy. Gilbert Harman. Thursday, October 9, What is the difference between people and other animals?

Contents. Foundations of Artificial Intelligence. Agents. Rational Agents

Vision as Bayesian inference: analysis by synthesis?

SPICE: Semantic Propositional Image Caption Evaluation

An expert system (Autodoc) for diagnosing diseases and prescribing medication using visual basic.net

Formulating Emotion Perception as a Probabilistic Model with Application to Categorical Emotion Classification

Agents. This course is about designing intelligent agents Agents and environments. Rationality. The vacuum-cleaner world

Conceptual Spaces. A Bridge Between Neural and Symbolic Representations? Lucas Bechberger

Artificial intelligence (and Searle s objection) COS 116: 4/29/2008 Sanjeev Arora

Human cogition. Human Cognition. Optical Illusions. Human cognition. Optical Illusions. Optical Illusions

Introduction and Historical Background. August 22, 2007

Intelligent Agents. Russell and Norvig: Chapter 2

Lecture 2 Agents & Environments (Chap. 2) Outline

What is Artificial Intelligence? A definition of Artificial Intelligence. Systems that act like humans. Notes

Artificial Neural Networks

CS221 / Autumn 2017 / Liang & Ermon. Lecture 19: Conclusion

Collaborative Problem Solving: Operationalizing Trauma Informed Care

[1] provides a philosophical introduction to the subject. Simon [21] discusses numerous topics in economics; see [2] for a broad economic survey.

Optical Illusions 4/5. Optical Illusions 2/5. Optical Illusions 5/5 Optical Illusions 1/5. Reading. Reading. Fang Chen Spring 2004

III Semester M.Tech. (IT) Examination, Dec. 2010/Jan LOGIC AND FUNCTIONAL PROGRAMMING PART-A

Chapter 2: Intelligent Agents

CS148 - Building Intelligent Robots Lecture 5: Autonomus Control Architectures. Instructor: Chad Jenkins (cjenkins)

Dr. Mustafa Jarrar. Chapter 2 Intelligent Agents. Sina Institute, University of Birzeit

Module 1. Introduction. Version 1 CSE IIT, Kharagpur

Introduction to Artificial Intelligence 2 nd semester 2016/2017. Chapter 2: Intelligent Agents

Damasio s error. Aaron Sloman. (In The Philosophy Magazine 2004)

Introduction to Medical Computing

Domain Adversarial Training for Accented Speech Recognition

Learning to Identify Irrelevant State Variables

Lecture 17: The Cognitive Approach

A HMM-based Pre-training Approach for Sequential Data

Building Intelligent Chatting Machines: Emotion, Personality, and Commonsense Knowledge

Recognizing Ambiguity

Modeling Individual and Group Behavior in Complex Environments. Modeling Individual and Group Behavior in Complex Environments

Lectures I & II : Stimuli and Responses, Environment and Behavior: Where is the Mind? Overview of the Development of Mind and Consciousness.

EECS 433 Statistical Pattern Recognition

Highly Accurate Brain Stroke Diagnostic System and Generative Lesion Model. Junghwan Cho, Ph.D. CAIDE Systems, Inc. Deep Learning R&D Team

Bayesian and Frequentist Approaches

Intelligent Agents. Soleymani. Artificial Intelligence: A Modern Approach, Chapter 2

Affective Computing Ana Paiva & João Dias. Lecture 1. Course Presentation

A Scoring Policy for Simulated Soccer Agents Using Reinforcement Learning

Transcription:

Intelligent Machines That Act Rationally Hang Li Bytedance AI Lab

Four Definitions of Artificial Intelligence Building intelligent machines (i.e., intelligent computers) Thinking humanly Acting humanly Thinking rationally Acting rationally

The Four Approaches to AI Four approaches represent different motivations and views Four approaches need different theoretical foundations and methodologies If the goal is to build intelligent tools for humans, then taking the rational act approach is most appropriate Computers that act rationally also think rationally, but act is key We consider the rational act approach here

Talk Outline Intelligent Machines That Act Rationally Humans Rational Behavior Architecture of Rationally Acting Machines Reinforcement Learning Our Work: Learning for Paraphrase Generation Summary

Characteristics of Machines Acting Rationally 1. Interaction with environment: acting as agent in an environment 2. Goal-oriented: acting to achieve a goal 3. Evaluation criterion: to evaluate performance of goal achievement 4. Functionalism: given an input, generate an output, not to mimic humans 5. Performance: comparable to or better than humans

Notes on Computers Acting Rationally Easier to make progress than the other approaches Because of not enough understanding of human brains mechanism Becoming main stream of AI research now Due to the advancement of machine learning technologies Doing better than humans in various tasks does not mean realization of human intelligence

Talk Outline Intelligent Machines That Act Rationally Humans Rational Behavior Architecture of Rationally Acting Machines Reinforcement Learning Our Work: Learning for Paraphrase Generation Summary

Question: how do humans think and act rationally?

Rene Descartes View on Rationality Dualism: body and mind are separated Rationalism: rational thinking should only rely on reason, not on sensibility Reason = power of mind to think logically Rene Descartes

Emotion is Essential in Human Thinking Three levels of mind Reason, emotion, and body function jointly participate in thinking We are not necessarily thinking machines; we are feeling machines that think Antonio Damasio Reason, Decision Making Emotion, Feeling Body Function

Experiment: Emotion and Judgement Setting Subjects are divided into two groups Taken into a room First group is asked to recall experiences of social exclusion and second group is asked to recall experiences of social inclusion Subjects are then asked to estimate temperature of room Finding: average temperature of first group is lower than average temperature of second group Conclusion: emotion affects judgment

Rational Thinking Needs Emotion Descartes s dualism is wrong Rational think needs not only reason, but also emotion Patients who suffer from damages in emotion capability (obitofrontal cortex) cannot think and act rationally

Experiment: Emotion and Rationality Setting Four decks of cards Subject is asked to continuously pick up cards; depending on result, he can either win or lose money Goal for subject is to win the game Two safe decks: probability of winning is larger, amount of money in each win is small Two risky decks: probability of losing is larger, amount of money in each win is large

Experiment: Emotion and Rationality Observation Ordinary people pick up cards from risky decks at beginning, but gradually realize risk and move to safe decks Patients who lost emotion capability do not feel pain of loss and continuously pick up cards from risky decks, until going bankrupt Conclusion Emotion does affect rational behavior

To Build Computers Acting Rationally Mainly include reason capability, and also emotion function and body function, depending on tasks Example: emotion is important factor for dialogue systems Example: self-charging is important feature for sweeping robots I like you

Talk Outline Intelligent Machines That Act Rationally Humans Rational Behavior Architecture of Rationally Acting Machines Reinforcement Learning Our Work: Learning for Paraphrase Generation Summary

An Architecture of Rationally Acting Machines Decision Making Inference Understanding Learning Perception Central Executive Unit Percept External Sensor Short Term Memory Long Term Memory Action Actuator Internal Sensor Feeling

Notes on Architecture The architecture is defined at function level which can be shared by computers and humans Signals are from both environment and body Sensors transform inputs in different modalities into the same representations The use of long term memory and short term memory makes it possible to continuously acquire information and knowledge

Talk Outline Intelligent Machines That Act Rationally Humans Rational Behavior Architecture of Rationally Acting Machines Reinforcement Learning Our Work: Learning for Paraphrase Generation Summary

Machine Learning for Rational Behavior Machine learning Supervised learning Unsupervised learning Reinforcement learning Machine learning, particularly reinforcement learning, learns models for rational acts (recognition, understanding, inference, decision making) All formalized as optimization problems

Data: Model: Policy function: Value function: Reinforcement Learning -! = ($, &, ', $ ) *+, 3 $ 4 $, &, 5($, &) Model-based Learning: Model-free Learning: 0 $ : 0 $ = 3 & $ or & = 0($). $ : E( 7 *+, 8 * 5 $ *, & * 0) 3 $ 4 $, &, 5($, &).($) 0($). $ 0 $ or 0($) Agent $ ' & Environment

Reinforcement Learning Learned models are used in sequential decision process Reinforcement learning also includes partially observed Markov decision process, hierarchical reinforcement learning, inverse reinforcement learning Learning is formalized as optimization

Talk Outline Intelligent Machines That Act Rationally Humans Rational Behavior Architecture of Rationally Acting Machines Reinforcement Learning Our Work: Learning for Paraphrase Generation Summary

Not only learn how to act rationally but also learn how to evaluate rationality (i.e., to learn both policy function and reward function in reinforcement learning)

Joint Work with Zichao Li Xin Jiang Lifeng Shang

Task: Paraphrase Generation Ambiguity and variability are nature of natural languages Ambiguity: same expression represents different meanings. Variability: different expressions represent same meaning Paraphrase auto generation: creates a number of synonymous expressions given an expression distance between sun and earth Paraphrase Generator how far is sun from earth? how many miles is sun from earth?

Previous Work: Sequence to Sequence Model as Generator how far is sun from earth? Generator: Sequence-to-Sequence Model distance between sun and earth Supervised Learning Training data: paraphrase pairs Loss function: cross entropy Reinforcement Learning Training data: paraphrase pairs Reward: BLEU or ROUGE score

Main Challenge It is challenging to define a semantic similarity measure to guide the training of generator The evaluation measure needs to judgment whether two sentences are semantically similar Previously, lexical measures such as BLEU and ROUGE are used, which are not ideal Ref: the Iraqi weapons are to be handed over to the army with two weeks MT: in two weeks Iraq s weapons will give to the army 1-gram precision: 6/10 2-gram precision: 3/9 3-gram precision: 3/8 ( ) 1 3 3 BLEU = Õ n =1 p æ 6 3 2 ö n BLEU = ç * * = 0. 368 è10 9 8 ø 1 3

New Framework: Generator and Evaluator Using machine learning to learn both generator and evaluator Learning of evaluator is equally important Evaluator: Deep Matching Model distance between sun and earth Generator: Sequence-to-Sequence Model distance between earth and sun

Learning of Evaluator: Supervised Learning 1. Train evaluator with positive and negative examples 2. Train generator with positive examples and evaluator! " #, % " #! & #, % & #! ' #, % ' # Positive examples! " #, % " #! & #, % & #! ' #, % ' #! " (, % " (! & (, % & (! ) (, % ) ( Generator: Sequence-to-Sequence Model Positive examples Evaluator: Deep Matching Model Negative examples Reinforcement Learning

Learning of Evaluator: Inverse Reinforcement Learning 1. Train generator with positive examples 2. Train evaluator with positive examples and generated examples 3. Repeat above! " #, % " #! ' #, % ' #! ( #, % ( # Inverse Reinforcement Learning! " #, % " &! ' #, % ' &! ( #, % ( & Positive examples and generated examples Evaluator: Deep Matching Model! " #, % " #! ' #, % ' #! ) #, % ) # Positive examples Generator: Sequence-to-Sequence Model Reinforcement Learning

Inverse Reinforcement Learning Find (,! & #$% ' # ( * #,! & #$% ' # ( * #,,, Find 8,! 3 4 5, 7 8 5, 4! 3 4 5, 7 8 5, 4, 7 9 :;< (>) =! 3 4 5, 7 ABC 0, 1 + 8 G(5, H4) 8 G 5, 4 J K ( H4, 4) Reinforcement Learning (Policy Gradient) Find,,! & #$% ' # ( * #,! & #$% ' # ( * #,,, Find 7,! 3 4 5, 7 8 5, 4! 3 4 5, 7 8 5, 4, 7 L M 9 ;< (7) =! 3(N K,M) L M log R 4 5, 7 8 G (5, 4)

Experimental Results Model ROUGE-1 BLEU-1 BLEU-2 METEOR Generator: Seq2Seq Generator: Seq2Seq with Copy +Evaluator Rouge 58.50 30.70 35.91 25.16 61.40 35.12 39.90 29.63 62.97 36.60 41.39 29.64 +Evaluator: SL 63.69 37.61 42.85 31.78 +Evaluator: IRL 63.63 37.33 42.68 31.62

AlphaGo Related Work Monte Carlo Tree Search Policy Network: to choose actions Value Network: to evaluate actions GAN (Generative Adversarial Networks) Generator: capturing data distribution Discriminator: deciding whether data is from generator or true distribution

Talk Outline Intelligent Machines That Act Rationally Humans Rational Behavior Architecture of Rationally Acting Machines Reinforcement Learning Our Work: Learning for Paraphrase Generation Summary

Summary Machines that act rationally Agent in environment, goal-oriented, evaluation criterion, functionalism, better performances Becoming main approach to AI Humans rational behavior not only needs reason but also emotion Reinforcement learning is useful for machines acting rationally Framework for paraphrase generation, including learning of generator and evaluator

References Stuart Russell and Peter Norvig, Artificial Intelligence: A Modern Approach, Third Edition, Prentice Hall, 2009. Anthony Damasio, Descartes' Error: Emotion, Reason, and the Human Brain, Penguin Books, 2005. Benjamin K. Bergen, Louder Than Words: The New Science of How the Mind Makes Meaning, Basic Books, 2012. Shum, He, and Li, From Eliza to Xiaoice, Challenges and Opportunities with Social Chatbots, 2018. Zichao Li, Xin Jiang, Lifeng Shang, Hang Li, Paraphrase Generation Using Deep Reinforcement Learning, arxiv:1711.00279, 2017.

Thank you!