Shadowing and Blocking as Learning Interference Models

Similar documents
Modulators of Spike Timing-Dependent Plasticity

Modeling Depolarization Induced Suppression of Inhibition in Pyramidal Neurons

Cellular Neurobiology BIPN140

Part 11: Mechanisms of Learning

Basics of Computational Neuroscience: Neurons and Synapses to Networks

Evaluating the Effect of Spiking Network Parameters on Polychronization

Solving the distal reward problem with rare correlations

Neuromorphic computing

VS : Systemische Physiologie - Animalische Physiologie für Bioinformatiker. Neuronenmodelle III. Modelle synaptischer Kurz- und Langzeitplastizität

Sample Lab Report 1 from 1. Measuring and Manipulating Passive Membrane Properties

Supplementary Figure 1. Basic properties of compound EPSPs at

Reinforcement Learning in a Neurally Controlled Robot Using Dopamine Modulated STDP

SUPPLEMENTARY INFORMATION

Theme 2: Cellular mechanisms in the Cochlear Nucleus

Efficient Emulation of Large-Scale Neuronal Networks

Bursting dynamics in the brain. Jaeseung Jeong, Department of Biosystems, KAIST

Beyond Vanilla LTP. Spike-timing-dependent-plasticity or STDP

Recognition of English Characters Using Spiking Neural Networks

Spiking Inputs to a Winner-take-all Network

Rolls,E.T. (2016) Cerebral Cortex: Principles of Operation. Oxford University Press.

Synaptic plasticityhippocampus. Neur 8790 Topics in Neuroscience: Neuroplasticity. Outline. Synaptic plasticity hypothesis

Neuron Phase Response

STDP enhances synchrony in feedforward network

MOLECULAR BIOLOGY OF DRUG ADDICTION. Sylvane Desrivières, SGDP Centre

Chapter 2: Cellular Mechanisms and Cognition

A general error-based spike-timing dependent learning rule for the Neural Engineering Framework

Mike Davies Director, Neuromorphic Computing Lab Intel Labs

Information Processing During Transient Responses in the Crayfish Visual System

Memory Systems II How Stored: Engram and LTP. Reading: BCP Chapter 25

Acetylcholine again! - thought to be involved in learning and memory - thought to be involved dementia (Alzheimer's disease)

Realization of Visual Representation Task on a Humanoid Robot

Computational cognitive neuroscience: 2. Neuron. Lubica Beňušková Centre for Cognitive Science, FMFI Comenius University in Bratislava

Synaptic Transmission: Ionic and Metabotropic

Bioscience in the 21st century

Dynamics of Hodgkin and Huxley Model with Conductance based Synaptic Input

Implementing Spike-Timing-Dependent Plasticity on SpiNNaker Neuromorphic Hardware

Synaptic Plasticity and the NMDA Receptor

Solving the distal reward problem with rare correlations

File name: Supplementary Information Description: Supplementary Figures, Supplementary Table and Supplementary References

SUPPLEMENTARY INFORMATION. Supplementary Figure 1

BIPN140 Lecture 12: Synaptic Plasticity (II)

Learning = an enduring change in behavior, resulting from experience.

Dynamic Stochastic Synapses as Computational Units

Technischer Bericht TUM. Institut für Informatik. Technische Universität München

Active Control of Spike-Timing Dependent Synaptic Plasticity in an Electrosensory System

The Role of Mitral Cells in State Dependent Olfactory Responses. Trygve Bakken & Gunnar Poplawski

Signal detection in networks of spiking neurons with dynamical synapses

Running PyNN Simulations on SpiNNaker

Resonant synchronization of heterogeneous inhibitory networks

Synaptic Integration

Independent Component Analysis in Neuron Model

Solving the Distal Reward Problem through Linkage of STDP and Dopamine Signaling

Simplified Model of Machado-Joseph Disease in Comparison to Parkinson s Disease and Normal Models

1) Drop off in the Bi 150 box outside Baxter 331 or to the head TA (jcolas).

Multi compartment model of synaptic plasticity

2Lesson. Outline 3.3. Lesson Plan. The OVERVIEW. Lesson 3.3 Why does applying pressure relieve pain? LESSON. Unit1.2

A developmental learning rule for coincidence. tuning in the barn owl auditory system. Wulfram Gerstner, Richard Kempter J.

Synaptic Plasticity and Connectivity Requirements to Produce Stimulus-Pair Specific Responses in Recurrent Networks of Spiking Neurons

Modeling Excitatory and Inhibitory Chemical Synapses

Cellular Bioelectricity

NERVOUS SYSTEM 1 CHAPTER 10 BIO 211: ANATOMY & PHYSIOLOGY I

Consciousness as representation formation from a neural Darwinian perspective *

CHAPTER 44: Neurons and Nervous Systems

Quantal Analysis Problems

Spike-Timing Theory of Working Memory

Ultrastructural Contributions to Desensitization at the Cerebellar Mossy Fiber to Granule Cell Synapse

Different inhibitory effects by dopaminergic modulation and global suppression of activity

Neural response time integration subserves. perceptual decisions - K-F Wong and X-J Wang s. reduced model

BIPN 140 Problem Set 6

The storage and recall of memories in the hippocampo-cortical system. Supplementary material. Edmund T Rolls

Information processing at single neuron level*

arxiv: v1 [q-bio.nc] 25 Apr 2017

Model neurons!!!!synapses!

Computing with Spikes in Recurrent Neural Networks

BIPN 140 Problem Set 6

Chapter 5: Learning and Behavior Learning How Learning is Studied Ivan Pavlov Edward Thorndike eliciting stimulus emitted

Effects of Inhibitory Synaptic Current Parameters on Thalamocortical Oscillations

Intro. Comp. NeuroSci. Ch. 9 October 4, The threshold and channel memory

Axon initial segment position changes CA1 pyramidal neuron excitability

B C. i) At what region(s) on the drawing above would you expect a high density of voltage-gated sodium channels?

Neurons! John A. White Dept. of Bioengineering

Modeling of Hippocampal Behavior

In Vitro Analog of Operant Conditioning in Aplysia

1) Drop off in the Bi 150 box outside Baxter 331 or to the head TA (jcolas).

Heterogeneous networks of spiking neurons: self-sustained activity and excitability

Temporal coding in the sub-millisecond range: Model of barn owl auditory pathway

Levodopa vs. deep brain stimulation: computational models of treatments for Parkinson's disease

Timing and the cerebellum (and the VOR) Neurophysiology of systems 2010

Computational Psychiatry: Tasmia Rahman Tumpa

Portions from Chapter 6 CHAPTER 7. The Nervous System: Neurons and Synapses. Chapter 7 Outline. and Supporting Cells

Computational Approaches in Cognitive Neuroscience

Relative contributions of cortical and thalamic feedforward inputs to V2

Hierarchical dynamical models of motor function

Lecture 22: A little Neurobiology

Cerebral Cortex. Edmund T. Rolls. Principles of Operation. Presubiculum. Subiculum F S D. Neocortex. PHG & Perirhinal. CA1 Fornix CA3 S D

Input-speci"c adaptation in complex cells through synaptic depression

Memory, Attention, and Decision-Making

Reading Neuronal Synchrony with Depressing Synapses

Designing Behaviour in Bio-inspired Robots Using Associative Topologies of Spiking-Neural-Networks

Ameen Alsaras. Ameen Alsaras. Mohd.Khatatbeh

Transcription:

Shadowing and Blocking as Learning Interference Models Espoir Kyubwa Dilip Sunder Raj Department of Bioengineering Department of Neuroscience University of California San Diego University of California San Diego La Jolla, CA 92092 La Jolla, CA 92092 Abstract The link between reward and reward-eliciting actions may be impaired by increased sensitivity to background noise during the waiting period (Shadowing) and may also be interfered simultaneous presentation of two redundant inputs. Clinical studies have shown acute Schizophrenia patients lack the ability to blocking (learning interference). The model presented in this study aims to illustrate the effects of these interferences on the reinforced learning model and to identify the parameters that contribute to this perturbation. Based on our model, blocking may result due to decreased in response to coincidental firing or depletion of eligibility tracer with time while shadowing may result due to increased in sensitivity to background noise. Understanding the model of learning interference may provide insight into diagnosis of acute Schizophrenia.

1 Introduction Classical Pavolian conditioning large-scale behaviour as well as neural networks relies upon the reward consistently coming seconds after the reward triggering actions. The distal-reward problems is the problem encountered when trying to link the reward to the reward-eliciting action as it comes second after, when the pattern of activity has changed. Various models have been hypothesised, but the most effective is the coincident-activity detector reliant upon a slow decaying synaptic tag that occurs when post-synaptic activity succeeds pre-synaptic activity (Izhikevich 2007). Figure 1: Neuron B however has synaptic connections onto Neuron A and Neuron D has connections onto B. As a result if the activity of Neuron B consistently precedes that of Neuron A then a direction connection forms between B and C, bypassing C; similar case occurs with Neuron D as well. Classical conditioning relies upon the neuronal system being able to consistently update its associations, allowing the first event (firing) in the resulting cascade to be directly associated with the final outcome. Blocking on the other hand is impaired learning of an association between a conditioned stimulus and unconditioned stimulus (Jones, Gray et al. 1990). This occurs due to simultaneous activity of another condition a stimulus that is already associated with the unconditioned stimuli. The blocking effect and lack thereof through dopamine super-sensitisation has been extensively demonstrated and studied in rate models (Crider and Blockel et al. 1986). It has essentially been found that the blocking effect is not as significant during incidental learning; as more attention is paid to non-salient aspects of the information.

Figure 2: Neuronal Model for Blocking, impaired association between B and C. Neuron A and C are already synaptically connected, therefore when B and A fire simultaneously before C, only A is strengthened. Neuron B is instead blocked, as it is ignored as redundant information from a parallel channel. Clinical studies with acute Schizophrenia patients revealed an absence of the blocking effect (Hemsley & Richardson, 1980). This is due to the fact that patients are more aware of and distracted by non-salient aspects of the information, making it hard for them to distinguish between relevant and redundant information (Jones, Gray et al. 1992). Shadowing however is a reduction in the significance of the reward eliciting activity due to an increase in the sensitivity to background noise. Parkinson (1972) showed that there is impaired short-term memory during multi-tasking activity. The increase in sensitivity leads to a greater proportion of the synapses being potentiated and increasing in synaptic strength. As a result, it is difficult to discern the actual stimuli (reward-eliciting behaviour) among all the noise. Figure 3 illustrates this concept; since both neuron A and B synaptic connection to neuron C result increase synaptic strength during high dopamine level, then neuron C can not determine which connection results in dopamine increase.

Figure 3: Neuronal model for Shadowing. Increases in the sensitivity to background noise leads to both Neuron A and B synaptic connections increasing in their synaptic connections with Neuron C. As a result it becomes significantly more difficult to discern the stimuli from Neuron A, when B is potentiated as well. Aims There was two distinct parts to this investigation, each aimed at investigating various aspects of learning interference in the presence of background noise. The first part involved modelling blocking of redundant information from a parallel stream, as a means to generate more efficient transmission. The other section involved modelling shadowing of the reward-eliciting stimuli by introducing jitter into the system and making it more sensitive to background noise. Therefore the overall aims of this investigation were to explore various modes of learning interference. 2 Method In order to model blocking and shadowing, a model described by Izhikevich (2006) was implemented. All neurons are modeled as spiking neuron with one gating variable u and one voltage variable v, in the equations below. v = 0.04v 2 + 5v +140 u + I current u = a(bv u) if v = 30mv then neuron fires The synaptic connection between neurons and the modulation of synaptic strength is modeled using the phenomenological model of Dopamine (DA)

modulation of Spike Timing Plasticity (STDP). The state of each blocked synapse can be described by its synaptic strength (s), and eligibility trace (c) in the following equation: c = c + STDP(τ )δ(t t pre/ post ) τ c s = c(d d 0 )*be at d = d + DA(t) τ d τ = t post t pre Where d(t) describes the extracellular concentration of dopamine, d_0 is the tonic extracellular concentration of dopamine and the parameters a and b are constants that were used to decay the blocked synapses. In the simulation a= 0.01, b=1.5 and d_0= 0.01. For non-blocked neurons s = c(d d 0 ) As described in Izhikevich (2007), the Dirac function provides the stepincrease in variable c(t) when pre proceeds post synaptic neuron firing. c(t) changes by the amount specified by STDP within the time interval. When preproceed postsynaptic neuron firing then STDP results in potentiation of the postsynaptic neuron and decreased synaptic strength in the blocked neurons or increase synaptic strength in the non-blocked neuron. To model shadowing, the time constant for the eligibility trace was increased to increase the probability both STDP occurring during high level of dopamine concentration extracellular. This will increase the random strengthening of the synapse in the neurons that fire randomly. c c = + STDP(τ )δ(t t pre/ post ) τ shadowing τ shadowing > τ c This simulation of this model utilizes a network of 100 spiking neurons with 10 sypnases per neuron as described by Izhikevich (2006). 80% of the neurons were excitatory and 20% per inhibitory neurons with random poisonlike spiking behavior. The solutions to the differential equations of each neuron when synaptic strength is increased are shown in figure 1 of Izhikevich (2007).

Figure 4: Solutions to the differential equation of the neuro network. 3 Results From the 100 neuron simulated for 1500 sec using Izhikevich (2007) reinforcement model as the control group, we see that when pre synaptic neuron fires before the post synaptic neuron, STDP occurs and eligibility tracer concentration pulsed as shown in Figure 5 as the green curve. If dopamine concentration is high during a high eligibility concentration as indicated by the red x in figure 5, the synaptic strength increase in a multiplicative effect of eligibility tracer and dopamine concentration as indicated by the blue curve in figure 5. The first simulation was to compare the blocking network model with the control model. As shown in Figure 5, the synaptic strength of the control increases with every coincidental firing with dopamine and eligibility tracer on, while in the blocking network the designated blocked neurons show a decreased response in the synaptic strength increase and a very small time constant. There is a decrease in synaptic strength response to coincidental reward driven by presence of dopamine and eligibility tracer with time in the network with blocking as indicated by Figure 5. For shadowing, the histogram was used to assess the decrease of shadowing with time. In the control at time zero, Figure 6A, the histogram of classical reinforcement learning begins with majority of one synaptic strength and some Gaussian distribution during the waiting period for reward. However with time, this histogram is transformed as the synaptic strength of the coincidental firing neurons are increased reflecting the increase in eligibility trace during high dopamine concentration, Figure 6B. The variance of the Gaussian distribution increase within the network as the network increases in sampling. In the shadowing simulation, there is a dramatic difference in the synaptic strength histogram. As expected there the distribution moves from a Gaussian

distribution of random noise with one neuron at the maximum synaptic strength to a uniform distribution where most neurons have high synaptic strength as indicated by Figure 7. Figure 5: A. Synaptic plasticity or increase in synaptic strength B Blocking. (Neuron blocking, or decreased in synaptic strength with time. Synaptic strength of neurons is indicated by blue, red indicates the dopamine injection and green indicate the eligibility tracer, which may be negative or positive. Figure 6: A. Random (Bolzmann distribution) firing at time zero. B Classical. With increased sampling rate, only one neuron exhibit synaptic plasticity with maximum synaptic strength. Red dot indicates the neuron of interest.

Figure 6: A. Random (Bolzmann distribution) at time zero. B.Shadowing With increased sampling (after 1.5 sec simulation) the Bolzmann distribution evolved close to a uniform distribution with more than 60 percent of the neuron exhibits synaptic plasticity with maximum synaptic strength. 5 Discussion Blocking and shadowing were both successfully modelled in a 100 neuronal network in Matlab, as various forms of learning interference. Blocking was found to cause redundant information in the form of coincidental firing of a secondary neuron to be suppressed. Shadowing on the other hand caused the original signal to be masked, making it harder to discern in the presence of a great number of synapses all increasing in synaptic strength. Therefore blocking and shadowing are opposite forms of learning interference and both were successfully modelled within this investigation. Blocking of a secondary neuron with coincidental firing was modelled in this investigation through the introduction of a secondary type of synapse that decayed in synaptic strength. This is primarily due to the introduction of an exponential decay factor.. The modification of the synaptic strengths and currents for the specific neuron led to a consistent decrease in the synaptic strength over a period of time as expected. Initially the synapse responds to the activity to the same extent as the non-blocked one, however over a period of time there is desensitisation and this was clearly seen with the low synaptic strengths. Physiologically, the decrease in the response to incidental firing may be due to the depletion of eligibility tracer with time or impaired assembly of eligibility tracer machineries. Shadowing however was modelled by introducing a greater extent of jitter and degree of variability within the system, by increasing the coincidence interval by an order of magnitude. A greater number of random firings as a result had the required configuration of pre-synaptic before post-synaptic leading to them being potentiated and increasing in synaptic strength in the presence of dopamine injection. The overall effect was a greater number of synapses

being potentiated, making the original stimulus and resulting synapse connection harder to discern; effectively shadowing the actual learning. The validity of this computational model for the learning interference conditions in animal models is still unverified. Therefore experimental studies on rat hippocampus region with stimuli paradigms aimed to elicit blocking and shadowing through coincidental activity would be a great future direction. This would test whether the simplification of the biological processed underlying synaptic plasticity at both the pre-synaptic and post-synaptic terminal, as a simple synaptic tag is valid. This is essential as many neurobiology studies have demonstrated that synaptic plasticity has many different forms that relies upon very distinct molecular mechanism that have unique time courses as well (Zurcker and Regehr 2002). Understanding the model of learning interference may provide insight into diagnosis of acute Schizophrenia and also understanding the mechanism of learning. The next step would be to actually introduce a larger number of gating variables within the neuronal network and analyse the resulting membrane voltage changes rather than just the synaptic strength. This would allow an effective dynamic analysis of how the membrane currents and the firing rates of the population of neuron change within these learning interference models. Overall however this investigation has been relatively successful in modelling both blocking and shadowing and their resulting effects on synaptic strength changes for a population of 100 neurons.

6 References [1] Jones, S.H., Gray, J.A. & Hemsley, D.R. (1990). The Kamin blocking effect, incidental learning and psychoticism. British Journal of Psychology 81(1): 95-110 [2] Jones, S.H., Gray, J.A. & Hemsley, D.R. (1990). Loss of the Kamin blocking effect in acute but not chronic schizophrenics. Biological Psychiatry 32(9): 739-755 [3] Crider, A., Blockel,L., and Solomon, P.R. (1986). A selective attention deficit in the rat following induced dopamine supersensitivity. Behavioural Neuroscience 100(3):315-319 [4] Izhikevich, E.M. (2007). Solving the Distal Reward Problem through Linkage of STDP and Dopamine Signalling. Oxford University Press 17(1): 2443-2452 [5] Izhikevich, E.M. (2007). Polychronization: Computation With Spikes. Neural Computation 18:245-282 [6] Zucker, R.S. and Regehr, W.G. (2002). Short-term synaptic plasticity. Annual. Review of Physiology 64(1):355-405 [7] Parkinson, Stanley R. 1972 Short-term memory while shadowing: Multiple-item recall of visually and of aurally presented letters. Journal of Experimental Psychology. 2: 256-265.