THE FORMATION OF HABITS The implicit supervision of the basal ganglia

Similar documents
Instrumental Conditioning VI: There is more than one kind of learning

GBME graduate course. Chapter 43. The Basal Ganglia

Category Learning in the Brain

Teach-SHEET Basal Ganglia

THE BRAIN HABIT BRIDGING THE CONSCIOUS AND UNCONSCIOUS MIND

Gangli della Base: un network multifunzionale

Anatomy of the basal ganglia. Dana Cohen Gonda Brain Research Center, room 410

COGNITIVE SCIENCE 107A. Motor Systems: Basal Ganglia. Jaime A. Pineda, Ph.D.

Computational cognitive neuroscience: 8. Motor Control and Reinforcement Learning

Making Things Happen 2: Motor Disorders

BIOMED 509. Executive Control UNM SOM. Primate Research Inst. Kyoto,Japan. Cambridge University. JL Brigman

This is a repository copy of Goal-directed and habitual control in the basal ganglia: implications for Parkinson's disease.

Basal Ganglia. Introduction. Basal Ganglia at a Glance. Role of the BG

Large Scale modeling of the Basal Ganglia

Cortical and basal ganglia contributions to habit learning and automaticity

The Globus Pallidus Pars Interna in Goal-Oriented and Routine Behaviors: Resolving a Long-Standing Paradox

ISIS NeuroSTIC. Un modèle computationnel de l amygdale pour l apprentissage pavlovien.

Research Article A Biologically Inspired Computational Model of Basal Ganglia in Action Selection

Degree of freedom problem

Reinforcement Learning and Dimensionality Reduction: a model in Computational Neuroscience

KINE 4500 Neural Control of Movement. Lecture #1:Introduction to the Neural Control of Movement. Neural control of movement

Habits, action sequences and reinforcement learning

A Neurocomputational Model of Dopamine and Prefrontal Striatal Interactions during Multicue Category Learning by Parkinson Patients

Basal ganglia Sujata Sofat, class of 2009

THE BRAIN HABIT BRIDGING THE CONSCIOUS AND UNCONSCIOUS MIND. Mary ET Boyle, Ph. D. Department of Cognitive Science UCSD

Brain anatomy and artificial intelligence. L. Andrew Coward Australian National University, Canberra, ACT 0200, Australia

This Lecture: Psychology of Memory and Brain Areas Involved

Artificial Neural Networks (Ref: Negnevitsky, M. Artificial Intelligence, Chapter 6)

3 A Tripartite Cognitive Architecture. 4 Broad Points

For more information about how to cite these materials visit

Organization of the nervous system 2

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

Modelling Decision Making under Uncertainty Machine Learning and Neural Population Techniques. Elaine Duffin

Decision neuroscience seeks neural models for how we identify, evaluate and choose

Systems Neuroscience November 29, Memory

Basal Ganglia Anatomy, Physiology, and Function. NS201c

A Computational Model of Complex Skill Learning in Varied-Priority Training

Contributions of the striatum to learning, motivation, and performance: an associative account

Reinforcement Learning, Conflict Monitoring, and Cognitive Control: An Integrative Model of Cingulate-Striatal Interactions and the ERN

Implications of a Dynamic Causal Modeling Analysis of fmri Data. Andrea Stocco University of Washington, Seattle

Learning Working Memory Tasks by Reward Prediction in the Basal Ganglia

CSE511 Brain & Memory Modeling Lect 22,24,25: Memory Systems

Basal Ganglia George R. Leichnetz, Ph.D.

A Role for Dopamine in Temporal Decision Making and Reward Maximization in Parkinsonism

On the nature of Rhythm, Time & Memory. Sundeep Teki Auditory Group Wellcome Trust Centre for Neuroimaging University College London

Evaluating the roles of the basal ganglia and the cerebellum in time perception

Reinforcement learning and the brain: the problems we face all day. Reinforcement Learning in the brain

Modeling the sensory roles of noradrenaline in action selection

A behavioral investigation of the algorithms underlying reinforcement learning in humans

By Carrot or by Stick: Cognitive Reinforcement Learning in Parkinsonism

Emotion Explained. Edmund T. Rolls

A. General features of the basal ganglia, one of our 3 major motor control centers:

The Neuroscience of Addiction: A mini-review

A. General features of the basal ganglia, one of our 3 major motor control centers:

Behavioral Neuroscience: Fear thou not. Rony Paz

Learning Objectives.

Brain Dynamics, Reward, & Mocap

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

marijuana and the teen brain MARY ET BOYLE, PH. D. DEPARTMENT OF COGNITIVE SCIENCE UCSD

Choosing the Greater of Two Goods: Neural Currencies for Valuation and Decision Making

2. Which of the following is not an element of McGuire s chain of cognitive responses model? a. Attention b. Comprehension c. Yielding d.

Hold your horses: A dynamic computational role for the subthalamic nucleus in decision making

A computational model of integration between reinforcement learning and task monitoring in the prefrontal cortex

A quick overview of LOCEN-ISTC-CNR theoretical analyses and system-level computational models of brain: from motivations to actions

Neuroscience of Consciousness II

The Contribution of the Medial Prefrontal Cortex, Orbitofrontal Cortex, and Dorsomedial Striatum to Behavioral Flexibility

NSCI 324 Systems Neuroscience

Data Analysis. Memory and Awareness in Fear Conditioning. Delay vs. Trace Conditioning. Discrimination and Reversal. Complex Discriminations

The Basal Ganglia Do Not Select Reach Targets but Control the Urgency of Commitment

Multitasking. Multitasking. Intro Problem Solving in Computer Science McQuain

Exploring Reflections and Conversations of Breaking Unconscious Racial Bias. Sydney Spears Ph.D., LSCSW

Huntington s Disease COGS 172

Online Journal Club-Article Review

The basal ganglia optimize decision making over general perceptual hypotheses

Behavioral Neuroscience: Fear thou not. Rony Paz

Psych3BN3 Topic 4 Emotion. Bilateral amygdala pathology: Case of S.M. (fig 9.1) S.M. s ratings of emotional intensity of faces (fig 9.

Modeling the interplay of short-term memory and the basal ganglia in sequence processing

NEURAL CONTROL OF MOVEMENT: ENGINEERING THE RHYTHMS OF THE BRAIN

The Motor Systems. What s the motor system? Plan

The human brain. of cognition need to make sense gives the structure of the brain (duh). ! What is the basic physiology of this organ?

Cognitive Neuroscience History of Neural Networks in Artificial Intelligence The concept of neural network in artificial intelligence

Attention: Neural Mechanisms and Attentional Control Networks Attention 2

Effects of lesions of the nucleus accumbens core and shell on response-specific Pavlovian i n s t ru mental transfer

Chapter 5: How Do We Learn?

A Dual Process Reinforcement Learning Account for Sequential Decision Making and Skill Learning

THE AMYGDALA AND REWARD

Hold Your Horses: A Dynamic Computational Role for the Subthalamic Nucleus in Decision Making

Ch 8. Learning and Memory

Two-Point Threshold Experiment

Cost-Sensitive Learning for Biological Motion

THE PREFRONTAL CORTEX. Connections. Dorsolateral FrontalCortex (DFPC) Inputs

Hebbian Plasticity for Improving Perceptual Decisions

Jonas Rose bird brains

How do individuals with congenital blindness form a conscious representation of a world they have never seen? brain. deprived of sight?

Neurophysiology of systems

Learning. Learning: Problems. Chapter 6: Learning

Objectives. Objectives Continued 8/13/2014. Movement Education and Motor Learning Where Ortho and Neuro Rehab Collide

The Nature of the Potential

Basal Ganglia. Steven McLoon Department of Neuroscience University of Minnesota

Distinct valuation subsystems in the human brain for effort and delay

Transcription:

THE FORMATION OF HABITS The implicit supervision of the basal ganglia MEROPI TOPALIDOU 12e Colloque de Société des Neurosciences Montpellier May 1922, 2015

THE FORMATION OF HABITS The implicit supervision of the basal ganglia MEROPI TOPALIDOU 12e Colloque de Société des Neurosciences Montpellier May 1922, 2015

THE FORMATION OF HABITS The implicit supervision of the basal ganglia MEROPI TOPALIDOU 12e Colloque de Société des Neurosciences Montpellier May 1922, 2015

GoalDirected Actions VS Habits Belin et al. (2008), Yin (2008), Foerde & Shohamy (2011), Doll et al. (2012)

GoalDirected Actions VS Habits initiation of response is under direct control of the current value of outcome Belin et al. (2008), Yin (2008), Foerde & Shohamy (2011), Doll et al. (2012)

GoalDirected Actions VS Habits initiation of response is under direct control of the current value of outcome Belin et al. (2008), Yin (2008), Foerde & Shohamy (2011), Doll et al. (2012)

GoalDirected Actions VS Habits initiation of response is under direct control of the current value of outcome sensitive to devaluation of the outcome Belin et al. (2008), Yin (2008), Foerde & Shohamy (2011), Doll et al. (2012)

GoalDirected Actions VS Habits initiation of response is under direct control of the current value of outcome direct initiation of responding by stimulus and/or context presentation sensitive to devaluation of the outcome Belin et al. (2008), Yin (2008), Foerde & Shohamy (2011), Doll et al. (2012)

GoalDirected Actions VS Habits initiation of response is under direct control of the current value of outcome direct initiation of responding by stimulus and/or context presentation sensitive to devaluation of the outcome Belin et al. (2008), Yin (2008), Foerde & Shohamy (2011), Doll et al. (2012)

GoalDirected Actions VS Habits initiation of response is under direct control of the current value of outcome sensitive to devaluation of the outcome direct initiation of responding by stimulus and/or context presentation resistant to devaluation of the outcome Belin et al. (2008), Yin (2008), Foerde & Shohamy (2011), Doll et al. (2012)

GoalDirected Actions VS Habits initiation of response is under direct control of the current value of outcome sensitive to devaluation of the outcome direct initiation of responding by stimulus and/or context presentation resistant to devaluation of the outcome behavior adjusts to reflect the new value of the outcome that the action would obtain Belin et al. (2008), Yin (2008), Foerde & Shohamy (2011), Doll et al. (2012)

GoalDirected Actions VS Habits initiation of response is under direct control of the current value of outcome sensitive to devaluation of the outcome direct initiation of responding by stimulus and/or context presentation resistant to devaluation of the outcome behavior adjusts to reflect the new value of the outcome that the action would obtain habits persist even if the reward becomes less attractive or if the action is not necessary to earn the reward. Belin et al. (2008), Yin (2008), Foerde & Shohamy (2011), Doll et al. (2012)

Cortex Basal Ganglia Novel behaviors require attention and flexible thinking and therefore are dependent on cortex, whereas automatic behaviors has been assumed to be primarily mediated by subcortical structures. Much evidence suggests however, that subcortical structures, such as the striatum, make significant contributions to initial learning. More recently, evidence has been accumulating that neurons in the associative striatum are selectively activated during early learning, whereas those in the sensori striatum are more active after automaticity has developed. At the same time, other recent reports suggest that automatic behaviors are striatum and dopamineindependent, and may be mediated entirely within cortex. Resolving this apparent conflict should be a major goal of future research. These ideas led to the theory that dominated the 20th century: Novel behaviors require attention and flexible thinking and therefore are dependent on cortex, whereas automatic behaviors require neither of these and so are not mediated primarily by cortex. Instead, it has long been assumed that automatic behaviors are primarily mediated by subcortical structures.

Cortex Basal Ganglia Goal Directed actions go here Cortex leads decision once learned Habits go there BG teach cortex during learning phase Daw, Niv & Dayan (2005) Ashby, Turner & Horvitz (2010) Novel behaviors require attention and flexible thinking and therefore are dependent on cortex, whereas automatic behaviors has been assumed to be primarily mediated by subcortical structures. Much evidence suggests however, that subcortical structures, such as the striatum, make significant contributions to initial learning. More recently, evidence has been accumulating that neurons in the associative striatum are selectively activated during early learning, whereas those in the sensori striatum are more active after automaticity has developed. At the same time, other recent reports suggest that automatic behaviors are striatum and dopamineindependent, and may be mediated entirely within cortex. Resolving this apparent conflict should be a major goal of future research. These ideas led to the theory that dominated the 20th century: Novel behaviors require attention and flexible thinking and therefore are dependent on cortex, whereas automatic behaviors require neither of these and so are not mediated primarily by cortex. Instead, it has long been assumed that automatic behaviors are primarily mediated by subcortical structures.

Outline Experiment Computational model Results

Experimental setup Two monkeys, simple twoarmed bandit task with P=0.75 and P=0.25. Habitual condition (known stimuli pair, same every day) Novel condition (unfamiliar stimuli pair, new every day) Habitual Condition Trial start Cue presentation Go signal Decision Reward Trial stop Prelearned cues 0.75 0.25 Novel Condition Novel cues (every day) 0.75 0.25 1.0s 1.5s 1.0s 1.5s 1.0s 1.5s Time Piron et al. (submitted)

Experimental results 1.0 Mean success rate 0.8 0.6 0.4 0.2 0.0 saline HC 0 20 40 60 80 100 120 Number of trials NC Mean success rate 1.0 0.8 0.6 0.4 0.2 0.0 * HC NC Saline Mean of first 25 trials * HC NC Saline Mean of last 25 trials Piron et al. (submitted)

Experimental results Muscimol injection in GPi disrupts learning in novel conditions (NC) but performances remains intact (but slower) in habitual conditions (HC). Mean success rate 1.0 0.8 0.6 0.4 0.2 0.0 saline muscimol HC 0 20 40 60 80 100 120 Number of trials NC Mean success rate 1.0 0.8 0.6 0.4 0.2 0.0 * * * * HC NC HC NC Saline Muscimol Mean of last 25 trials Piron et al. (submitted)

Experimental conclusion If habits were stored in basal ganglia, monkeys would not achieve peak performances in muscimol conditions for familiar stimuli. If habits were learned in cortex, monkeys would be able to reach peak performances in muscimol conditions for unfamiliar stimuli. 1.0 Mean success rate 0.8 0.6 0.4 0.2 0.0 saline muscimol HC 0 20 40 60 80 100 120 Number of trials NC Piron et al. (submitted)

Computational model Neural Network Neuron Rate model Two segregated loops: Cognitive loop allows to choose a shape Motor loop allows to reach a shape External current 2 External current + 1 + Cortex + External current Cortex associative (4x4 units) Cortex GPe HYPERDIRECT PATHWAY Striatum DIRECT PATHWAY Striatum associative (4x4 units) 3 Striatum GPe INDIRECT PATHWAY STN GPi GPi STN Thalamus Thalamus Topalidou et al. (in prep.)

Corticobasal competition Cognitive decision has to intervene in decision. Cortical decision Thanks to lateral competition, cortex can take a decision without interaction with BG. External current External current External current + 1 + Cortex 2 + Cortex associative (4x4 units) Cortex CorticoBasal decision GPe HYPERDIRECT PATHWAY Striatum DIRECT PATHWAY Striatum associative (4x4 units) 3 Striatum GPe INDIRECT PATHWAY STN GPi GPi STN Thalamus Thalamus Topalidou et al. (in prep.)

Acting is learning Learning occurs at three different places simultaneously. 1 & 2 Hebbian learning 3 Reinforcement learning Cortex learns to reproduce previous repertories, regardless of whether or not are appropriate (HL). Fast basal ganglia trialanderror learning (RL) biases slow cortical one (HL) ensuring that the correct behavior is produced. Hélie et al. (2014) External current 2 External current + 1 + GPe INDIRECT PATHWAY Cortex HYPERDIRECT PATHWAY STN Striatum DIRECT PATHWAY + GPi Thalamus External current Cortex associative (4x4 units) Striatum associative (4x4 units) GPi Thalamus Striatum Cortex 3 STN GPe Topalidou et al. (in prep.)

Computational results Intact model peak performances on familiar conditions can learn novel conditions Lesioned model (GPi) peak performances on familiar conditions cannot learn novel conditions External current + External current External current (Monkey results) + 1 + GPe INDIRECT PATHWAY Cortex HYPERDIRECT PATHWAY STN 2 Striatum DIRECT PATHWAY GPi Thalamus Cortex associative (4x4 units) Striatum associative (4x4 units) GPi Thalamus Striatum Cortex 3 STN GPe Mean success rate 1.0 0.8 0.6 0.4 0.2 0.0 saline muscimol HC 0 20 40 60 80 100 120 Number of trials NC Topalidou et al. (in prep.)

Sensitivity to reward devaluation

Conclusion The acquisition and the expression of habits are two entangled processes that can be dissociated experimentally. This experimental dissociation sheds light on the nature of the interaction between the basal ganglia and the cortex and their respective role in the initial formation and the later expression of habits. The model suggests that the basal ganglia implicitly supervises the cortex where habits are actually stored, but the cortex cannot learn them on its own. In the future, the model will be tested in different protocols in order to ensure the accuracy of its predictions.

Acknowledgements Nicolas Rougier T. Boraud C. Piron D. Kase A. Leblois

Acknowledgements Nicolas Rougier T. Boraud C. Piron D. Kase A. Leblois

Acknowledgements Nicolas Rougier T. Boraud C. Piron D. Kase A. Leblois

Acknowledgements Nicolas Rougier T. Boraud C. Piron D. Kase A. Leblois

Acknowledgements Nicolas Rougier T. Boraud C. Piron D. Kase A. Leblois

Reaction period (ms) 550 500 450 400 0 * * * * HC NC HC NC Saline Muscimol