Becoming symbol-minded. Judy S. DeLoache

Similar documents
Supporting Text. PFC No Gate SRN-PFC

Biologically-Inspired Control in Problem Solving. Thad A. Polk, Patrick Simen, Richard L. Lewis, & Eric Freedman

Cell Responses in V4 Sparse Distributed Representation

Prefrontal cortex. Executive functions. Models of prefrontal cortex function. Overview of Lecture. Executive Functions. Prefrontal cortex (PFC)

2 Summary of Part I: Basic Mechanisms. 4 Dedicated & Content-Specific

Lateral Inhibition Explains Savings in Conditioning and Extinction

Task Switching. Higher-Level Cognition Oct 7, 2008

Cerebral Cortex. Edmund T. Rolls. Principles of Operation. Presubiculum. Subiculum F S D. Neocortex. PHG & Perirhinal. CA1 Fornix CA3 S D

Supplementary materials for: Executive control processes underlying multi- item working memory

3 The Range of Frontal Functions. 5 Top-down vs bottom-up PFC. 1 Higher Level Cognition: What s Missing Planning, problem solving, reasoning, complex

Impaired Cognitive Flexibility and Intact Cognitive Control in Autism: A Computational Cognitive Neuroscience Approach

Computational Explorations in Cognitive Neuroscience Chapter 7: Large-Scale Brain Area Functional Organization

Artificial Neural Networks (Ref: Negnevitsky, M. Artificial Intelligence, Chapter 6)

3 A Tripartite Cognitive Architecture. 4 Broad Points

A model to explain the emergence of reward expectancy neurons using reinforcement learning and neural network $

Memory, Attention, and Decision-Making

A Neural Model of Context Dependent Decision Making in the Prefrontal Cortex

Different inhibitory effects by dopaminergic modulation and global suppression of activity

Rolls,E.T. (2016) Cerebral Cortex: Principles of Operation. Oxford University Press.

The storage and recall of memories in the hippocampo-cortical system. Supplementary material. Edmund T Rolls

Introduction to Computational Neuroscience

Geography of the Forehead

How to Build a Cognitive Task

A neural circuit model of decision making!

Realization of Visual Representation Task on a Humanoid Robot

Target-to-distractor similarity can help visual search performance

Learning Working Memory Tasks by Reward Prediction in the Basal Ganglia

196 IEEE TRANSACTIONS ON AUTONOMOUS MENTAL DEVELOPMENT, VOL. 2, NO. 3, SEPTEMBER 2010

Neural codes PSY 310 Greg Francis. Lecture 12. COC illusion

Cortical Organization. Functionally, cortex is classically divided into 3 general types: 1. Primary cortex:. - receptive field:.

(Visual) Attention. October 3, PSY Visual Attention 1

Computational & Systems Neuroscience Symposium

A Computational Model of Prefrontal Cortex Function

Modeling Individual and Group Behavior in Complex Environments. Modeling Individual and Group Behavior in Complex Environments

Brain Mechanisms Explain Emotion and Consciousness. Paul Thagard University of Waterloo

Representational Switching by Dynamical Reorganization of Attractor Structure in a Network Model of the Prefrontal Cortex

Representation 1. Discussion Question. Roskies: Downplaying the Similarities of Neuroimages to Photographs

Choosing the Greater of Two Goods: Neural Currencies for Valuation and Decision Making

Psychology Perception

Neural response time integration subserves. perceptual decisions - K-F Wong and X-J Wang s. reduced model

Emotion Explained. Edmund T. Rolls

A model of dual control mechanisms through anterior cingulate and prefrontal cortex interactions

The Integration of Features in Visual Awareness : The Binding Problem. By Andrew Laguna, S.J.

Lesson 6 Learning II Anders Lyhne Christensen, D6.05, INTRODUCTION TO AUTONOMOUS MOBILE ROBOTS

FAILURES OF OBJECT RECOGNITION. Dr. Walter S. Marcantoni

Is Cognitive Science Special? In what way is it special? Cognitive science is a delicate mixture of the obvious and the incredible

A HMM-based Pre-training Approach for Sequential Data

Why is dispersion of memory important*

Selective Attention. Modes of Control. Domains of Selection

A Neural Network Model of Naive Preference and Filial Imprinting in the Domestic Chick

Resistance to forgetting associated with hippocampus-mediated. reactivation during new learning

Dynamic Social Simulation with Multi-Agents having Internal Dynamics

A general error-based spike-timing dependent learning rule for the Neural Engineering Framework

Reactive agents and perceptual ambiguity

Supplementary Figure 1. Localization of face patches (a) Sagittal slice showing the location of fmri-identified face patches in one monkey targeted

Intro to Cognitive Neuroscience. Numbers and Math

A Dynamic Neural Network Model of Sensorimotor Transformations in the Leech

Supplementary Figure 1. Recording sites.

Selective bias in temporal bisection task by number exposition

Theoretical Neuroscience: The Binding Problem Jan Scholz, , University of Osnabrück

THE PREFRONTAL CORTEX. Connections. Dorsolateral FrontalCortex (DFPC) Inputs

Selective Attention. Inattentional blindness [demo] Cocktail party phenomenon William James definition

Mental Imagery. What is Imagery? What We Can Imagine 3/3/10. What is nature of the images? What is the nature of imagery for the other senses?

The Function and Organization of Lateral Prefrontal Cortex: A Test of Competing Hypotheses

Working Memory (Goal Maintenance and Interference Control) Edward E. Smith Columbia University

Early Learning About the World From Symbolic Media

Neuron, Volume 63 Spatial attention decorrelates intrinsic activity fluctuations in Macaque area V4.

Learning in neural networks

Introduction to Computational Neuroscience

Cognitive Control as Alternation of Activation and Suppression in the Stroop Task

Shadowing and Blocking as Learning Interference Models

Memory retention the synaptic stability versus plasticity dilemma

How has Computational Neuroscience been useful? Virginia R. de Sa Department of Cognitive Science UCSD

Perception & Attention

Hierarchical dynamical models of motor function

Inferring Clinical Correlations from EEG Reports with Deep Neural Learning

EDGE DETECTION. Edge Detectors. ICS 280: Visual Perception

Biologically-Inspired Human Motion Detection

A Drift Diffusion Model of Proactive and Reactive Control in a Context-Dependent Two-Alternative Forced Choice Task

Differences of Face and Object Recognition in Utilizing Early Visual Information

REINFORCEMENT LEARNING OF DIMENSIONAL ATTENTION FOR CATEGORIZATION JOSHUA L. PHILLIPS

Overview of the visual cortex. Ventral pathway. Overview of the visual cortex

Consciousness as representation formation from a neural Darwinian perspective *

CPSC81 Final Paper: Facial Expression Recognition Using CNNs

Category Learning in the Brain

Brain Based Change Management

Vision and Action. 10/3/12 Percep,on Ac,on 1

Behaviorism: Laws of the Observable

Sparse Coding in Sparse Winner Networks

Why is our capacity of working memory so large?

Modelling the Stroop Effect: Dynamics in Inhibition of Automatic Stimuli Processing

Lecture 1: Neurons. Lecture 2: Coding with spikes. To gain a basic understanding of spike based neural codes

Time Experiencing by Robotic Agents

Exploring the Functional Significance of Dendritic Inhibition In Cortical Pyramidal Cells

Tasks of Executive Control TEC. Interpretive Report. Developed by Peter K. Isquith, PhD, Robert M. Roth, PhD, Gerard A. Gioia, PhD, and PAR Staff

Cognition in Parkinson's Disease and the Effect of Dopaminergic Therapy

DYNAMIC APPROACH to the PATHOLOGICAL BRAIN

A model of the interaction between mood and memory

Hebbian Plasticity for Improving Perceptual Decisions

Transcription:

Symbols and Rules

Becoming symbol-minded Judy S. DeLoache

Symbols def: A symbol is something that someone intends to represent something other than istelf. This is among the looser definitions for symbols. It places an emphasis on intention and use.

Illustrative examples Symbols Represent Things (Priessler and Carey) 18 24 mo. Shown picture of whisk, associated with the word Will associate with object alone, or abject and picture Symbols are General Baby Signs at 11 mo. grater vocabulary at 3 years 13 18 mo. accept non-verbal labels, 20 mo. show verbal bias

Symbols are intentional Infants and toddlers are sensitive to intentions 68 Review TRENDS in Cognitive Sciences Vol.8 No.2 February 2004 Older Children take their own intentions very seriously. A balloon A lollipop The experimenter The child Figure 1. When asked to draw a balloon and a lollipop, 4-year-olds produced depicted objects. Simi and moving objects on The manual explor arises from uncertain difference between re understand the signi investigate. This inte that the more a depict more infants explore exploration and black [30]. Thus, because in of pictures, they some if they were real objec This conclusion h Yonas and his colleag 9-month-old infants w patches, and textured gent definition of gras their study made ver depicted entities. From the infants did manua did not respond to t Inspired by this repor

[29,30]. Instead, they behave like the infant in Figure 2a. a Symbol They manually represents explore something the images, other than frequently itself feeling, rubbing and striking the picture surface. Sometimes they even grasp at the pictures as if trying to pick up the i A e e o r Figure 2. (a) Nine-month-olds often manually explore realistic photographs, revealing Nine that mo. theygrasp do not atfully realistic understand color the photographs. critical difference between a picture and its referent. This child is making grasping motions at a highly realistic color photograph of an object. (b) This 9-month-old boy is leaning over preparing to put his lips on the nipple of the bottle. He apparently recognizes the content of the photograph, but does not appreciate how the depicted bottle differs from a real one. Reproduced with permission from [29]. r a a v v F e i v t b r

Information about a hidden object via symbolic representation 2.5-year-olds fail, 3-year-olds succeed Dual Representation Increasing salience of model as object decreases success for 3 yo Decreasing salience as object increases success for 2.5 yo something viewing via window shrinking machine

Prefrontal cortex and flexible cognitive control: Rules without symbols Rougier, Noelle, Braver, Cohen, O Reilly

Mechanisms of the PFC embodied in a neural network model result in a self-organized abstract rule-like PFC representation that supports flexible cognitive control.

Properties of the PFC informing the model 1. PFC can maintain representations to be held in working memory 2. Representations are adaptively maintained by switching between active maintenance and rapid updating Unexpected reward stabilizes the PFC Lack of expected reward destabilizes PFC Controlled by dopamine from the VTA (ventral tegmental area) or without dopamine from the basal ganglia 3. PFC modulates processing via extensive connections to other cortical areas

Fig. 1. Model and example stimuli. (a) The model with the complete PFC system. Stimuli are presented in two possible locations (left, right). Rows represent different stimulus dimensions (e.g., color, size, shape, etc., labeled A E for simplicity), and columns represent different features (red, orange green, and blue; small, medium, etc., numbered 1 4). Other inputs include a task input indicating current task to perform (NF, name feature; MF, match feature; SF, smaller feature; LF, larger feature), and, for the instructed condition (used to control for lack of maintenance in non-pfc networks), a cue to the currently relevant dimension. Output responses are generated over the response layer, which has units for the different stimulus features, plus a No unit to signal nonmatch in the matching task. The hidden layers represent posterior cortical pathways associated with different types of inputs (e.g., visual and verbal). The AG unit is the adaptive gating unit, providing a temporal differences (TD) based dynamic gating signal to the PFC context layer. The weights into the AG unit learn via the TD mechanism, whereas all other weights learn using the Leabra algorithm that combines standard Hebbian and error-driven learning mechanisms, together with k-winners-take-all inhibitory competition within layers and point-neuron activation dynamics (26) (also see supporting information). (b) Example stimuli and correct responses for one of the tasks (NF) across three trials where the current rule is to focus on the Shape dimension (the same rule was blocked over 200 trials to allow networks plenty of time to adapt to each rule). The corresponding input and target patterns for the network are shown below each trial, with the unit meanings given by the legend in the lower left. The network must maintain the current dimension rule to perform correctly. ically intact and frontally damaged people on benchmark tasks of cognitive control. Methods sults, as described in supporting information; the basic case was task switching every block of 25 trials, with dimension switching after two iterations through all of the tasks). These conditions were designed to simulate simple forms of real-world learning experience

Fig. 2. Representations (synaptic weights) that developed in four different network configurations. (a) Posterior cortex only (no PFC) trained on all tasks. (b) PFC without the adaptive gating mechanism (all tasks). (c) Full PFC trained only on task pairs (name feature and match feature in this case). (d) Full PFC (all tasks). Each image shows the weights from the hidden units (a) or PFC (b d) to the response layer. Larger squares correspond to units (all 30 in the PFC and a random and representative

Fig. 3. Generalization and learning results. (a) Crosstask generalization results (% correct on task-novel stimuli) for the full PFC network and a variety of control networks, with either only two tasks (Task Pairs) or all four tasks (All Tasks) used during training (n 10 for each network, error bars are standard errors). Overall, the full PFC model generalizes substantially better than the other models, and this interacts with the level of training such that performance on the All Tasks condition is substantially better than the Task Pairs condition (with no differences in numbers of training trials or training stimuli). With one feature left out of training for each of four dimensions, training represented only 31.6% (324) of the total possible stimulus inputs (1,024); the 85% generalization performance on the remaining test items therefore represents good productive abilities. The other networks are: Posterior, a single large hidden unit layer between inputs and response, a simple model of posterior cortex without any special active maintenance abilities; P Rec, posterior full recurrent connectivity among hidden units, allows hidden layer to maintain information over time via attractor dynamics; P Self, posterior self-recurrent connections from hidden units to themselves, allows individual units to maintain activations over time; SRN, simple recurrent network, with a context layer that is a copy of the hidden layer on the prior step, a widely used form of temporal maintenance; SRN-PFC, an SRN context layer applied to the PFC layer in the full model (identical to the full PFC model except for this difference), tests for role of separated hidden layers; NoGate, the full PFC model without the AG adaptive gating unit. (b) The correlation of generalization performance with the extent to which the units distinctly and orthogonally encode stimulus dimensions for the networks shown in Fig. 2. This was computed by comparing each unit s pattern of weights to the set of five orthogonal, complete dimensional target patterns (i.e., the A dimension target pattern has a 1 for each A feature, and 0s for the features in all other dimensions, etc.). A numeric value between 0 and 1, where 1 represents a completely orthogonal and complete dimensional representation was computed for unit i as: di maxk wi tk k wi tk ; where tk is the dimensional target pattern k, and wi is the weight vector for unit i, and wi tk represents the normalized dot product of the two vectors (i.e., the cosine). This value was then averaged across all units in the layer and then correlated with that network s generalization performance. (c) Relative stability of PFC and hidden layer (posterior cortex) in the model, as indexed by Euclidean distance between weight states at the end of subsequent epochs (epoch 2,000 trials). The PFC takes longer to stabilize (i.e., exhibits greater levels of weight change across epochs) than the posterior cortex. For PFC, within-pfc recurrent weights were used. For Hidden, weights from

gical task of the full imulated ating the ct effects of color d perfornt word ata from lated by (a) with cy of the er. In the le feature e conflict features on cue inas to be (RT) was r of cycles n the retiplied by msec). (b) a 30% leits in the ompared patients ons (six of al PFC) and matched controls (Ctrl) (data in seconds to complete a block of trials; model cycles were transformed as RT cycles 5.5 30 ict Word reading conditions were not run on the human subjects). The main effect of damage is an overall slowing of color naming, on that the PFC provides top-down support to this weaker pathway via abstract dimensional representations. (c) Performance in a monstrating the classic pattern of increasing perseveration with increased PFC damage (% of units removed, posttraining). Perseverantial productions of feature names corresponding to the previously relevant dimension after a switch. Clearly, the simulated PFC is critical ng. (d) WCST results (perseverations) for the three different training conditions used by ref. 28 (128 is the standard case plotted before,

Language, embodiment, and the cognitive niche Andy Clarke

Words as targets in the material world Example of Sheba the chimpanzee The simple act of labeling creates for the learner a new realm of perceptible objects upon which to target her more basic capacities of statistical and associative learning. The actual presence of tags or labels... alters the computational burdens involved in certain kinds of learning and problem-solving

Words as constituents of hybrid thoughts: numbers Dehaene et al. suggest that precise mathematical thought relies on... 1-ness 2-ness 3-ness more-than-that-ness Rough Magnitude comparisons learnt capacity to use specific number words of a language and appreciate that each distinct number represents a distinct quantity

Thinking about thinking Linguaform reason Thinking about the thoughts and beliefs of others Being self evaluative To formulate thoughts in words is to create an object available to ourselves and to others, and, as an object, it is the kind of thing we can have thoughts about. Stable attendable structure to which subsequent thinking can attach.

A stabilizing force in neural networks? Neural networks are fluid and need stabilizing The models can be strongly context sensitive Words, instead of being cues, might be like encountered items and act directly on mental states