Networks and Hierarchical Processing: Object Recognition in Human and Computer Vision

Similar documents
Object recognition and hierarchical computation

An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns

using deep learning models to understand visual cortex

Just One View: Invariances in Inferotemporal Cell Tuning

Cognitive Modelling Themes in Neural Computation. Tom Hartley

Adventures into terra incognita

Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition

Object Detectors Emerge in Deep Scene CNNs

Shape Representation in V4: Investigating Position-Specific Tuning for Boundary Conformation with the Standard Model of Object Recognition

A quantitative theory of immediate visual recognition

Modeling the Deployment of Spatial Attention

Invariant Recognition Shapes Neural Representations of Visual Input

Visual Categorization: How the Monkey Brain Does It

Eccentricity Dependent Deep Neural Networks: Modeling Invariance in Human Vision

IN this paper we examine the role of shape prototypes in

Perception & Attention

PHY3111 Mid-Semester Test Study. Lecture 2: The hierarchical organisation of vision

Reading Assignments: Lecture 5: Introduction to Vision. None. Brain Theory and Artificial Intelligence

Probabilistic Models of the Cortex: Stat 271. Alan L. Yuille. UCLA.

Primary Visual Pathways (I)

CS294-6 (Fall 2004) Recognizing People, Objects and Actions Lecture: January 27, 2004 Human Visual System

Neural representation of action sequences: how far can a simple snippet-matching model take us?

Multi-attention Guided Activation Propagation in CNNs

The Visual System. Cortical Architecture Casagrande February 23, 2004

A Theory of Object Recognition: Computations and Circuits in the Feedforward Path of the Ventral Stream in Primate Visual Cortex

Object vision (Chapter 4)

A Theory of Object Recognition: Computations and Circuits in the Feedforward Path of the Ventral Stream in Primate Visual Cortex

Lateral Geniculate Nucleus (LGN)

Neural Mechanisms Underlying Visual Object Recognition

A Detailed Look at Scale and Translation Invariance in a Hierarchical Neural Model of Visual Object Recognition

Neuromorphic convolutional recurrent neural network for road safety or safety near the road

Photoreceptors Rods. Cones

Attentive Stereoscopic Object Recognition

Information and neural computations

Object recognition in cortex: Neural mechanisms, and possible roles for attention

Flexible, High Performance Convolutional Neural Networks for Image Classification

Chapter 3: 2 visual systems

A quantitative theory of immediate visual recognition

Continuous transformation learning of translation invariant representations

A Hubel Wiesel Model of Early Concept Generalization Based on Local Correlation of Input Features


B657: Final Project Report Holistically-Nested Edge Detection

THE ENCODING OF PARTS AND WHOLES

Goals. Visual behavior jobs of vision. The visual system: overview of a large-scale neural architecture. kersten.org

LISC-322 Neuroscience Cortical Organization

Perception & Attention. Perception is the best understood cognitive function, esp in terms of underlying biological substrates.

Bottom-up and top-down processing in visual perception

EDGE DETECTION. Edge Detectors. ICS 280: Visual Perception

Convolutional and LSTM Neural Networks

arxiv: v2 [cs.cv] 19 Dec 2017

Invariant Pattern Recognition using Bayesian Inference on Hierarchical Sequences

Early Stages of Vision Might Explain Data to Information Transformation

Prof. Greg Francis 7/31/15

COGS 101A: Sensation and Perception

How Does Our Visual System Achieve Shift and Size Invariance?

The Eye. Cognitive Neuroscience of Language. Today s goals. 5 From eye to brain. Today s reading

Position invariant recognition in the visual system with cluttered environments

How Does Our Visual System Achieve Shift and Size Invariance?

Nonlinear processing in LGN neurons

A Neurally-Inspired Model for Detecting and Localizing Simple Motion Patterns in Image Sequences

Plasticity of Cerebral Cortex in Development

Sparse Coding in Sparse Winner Networks

Stephen Grossberg, Jeffrey Markowitz, and Yongqiang Cao

2/3/17. Visual System I. I. Eye, color space, adaptation II. Receptive fields and lateral inhibition III. Thalamus and primary visual cortex

COMP9444 Neural Networks and Deep Learning 5. Convolutional Networks

CS-E Deep Learning Session 4: Convolutional Networks

arxiv: v1 [q-bio.nc] 12 Jun 2014

Cell Responses in V4 Sparse Distributed Representation

Introduction to Computational Neuroscience

A Bayesian inference theory of attention: neuroscience and algorithms Sharat Chikkerur, Thomas Serre, and Tomaso Poggio

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil

From feedforward vision to natural vision: The impact of free viewing and clutter on monkey inferior temporal object representations

1 Perception& Attention. 5 The Retina. 3 Overview of the Visual System. 6 LGN of the Thalamus. 4 Two Streams: Ventral what vs.

Recurrent Refinement for Visual Saliency Estimation in Surveillance Scenarios

Identify these objects

The Visual System s Internal Model of the World

Evaluation of a Shape-Based Model of Human Face Discrimination Using fmri and Behavioral Techniques

Convolutional and LSTM Neural Networks

Neuronal responses to plaids

Deep Neural Networks: A New Framework for Modeling Biological Vision and Brain Information Processing

Gabriel Kreiman Children s Hospital, Harvard Medical School

M Cells. Why parallel pathways? P Cells. Where from the retina? Cortical visual processing. Announcements. Main visual pathway from retina to V1

OPTO 5320 VISION SCIENCE I

An Introduction to Biologically-Inspired Visual Recognition

Chapter 5: Perceiving Objects and Scenes

A quantitative theory of immediate visual recognition

COGS 101A: Sensation and Perception

Recognizing Complex Mental States With Deep Hierarchical Features For Human-Robot Interaction

arxiv: v1 [stat.ml] 23 Jan 2017

Hierarchical Convolutional Features for Visual Tracking

Competing Frameworks in Perception

Competing Frameworks in Perception

Canonical Cortical Circuits

arxiv: v3 [stat.ml] 28 Oct 2017

Biologically Motivated Local Contextual Modulation Improves Low-Level Visual Feature Representations

Computational Principles of Cortical Representation and Development

Elias B Issa, PhD. Curriculum Vitae 08/2016. RESEARCH INTEREST The neural mechanisms and computational principles underlying high-level vision

Overview of the visual cortex. Ventral pathway. Overview of the visual cortex

Attentional Masking for Pre-trained Deep Networks

Transcription:

Networks and Hierarchical Processing: Object Recognition in Human and Computer Vision Guest&Lecture:&Marius&Cătălin&Iordan&& CS&131&8&Computer&Vision:&Foundations&and&Applications& 01&December&2014

1. Processing Pathways in the Human Visual System what and where pathways building features in the ventral stream 2. Hierarchical Pattern Recognition Systems early stages: small scale neural network injecting neuroscience knowledge into design 3. Third Age of the Neural Network: Modern Deep Nets extremely large scale ( data + computation ) sudden, huge performance boost for recognition Roadmap 23

From Retina to Cortex world retina (compression) LGN V1 visual cortex (expansion) Processing Pathways review 7

The Flow of Information IT IT Weiner & Grill-Spector (2012) Processing Pathways review 35

V1 Retina Van Essen (1991) Processing Pathways review 36

Specialization: What and Where Pathways monkey lesion studies Mishkin & Ungerleider 1982 Processing Pathways what and where pathways 37

Specialization: What and Where Pathways monkey lesion studies lesion where pathway: difficulty in spatial reasoning lesion what pathway: difficulty in object recognition Mishkin & Ungerleider 1982 Processing Pathways what and where pathways 37

Specialization: What and Where Pathways monkey lesion studies lesion where pathway: difficulty in spatial reasoning lesion what pathway: difficulty in object recognition Mishkin & Ungerleider 1982 Processing Pathways what and where pathways 37

Object Recognition: The What Pathway DiCarlo & Cox (2007) Processing Pathways building features in the ventral stream 38

Object Recognition: The What Pathway V1 IT Freeman & Simoncelli (2011), Tanaka (1997) Processing Pathways building features in the ventral stream 39

Object Recognition: Building Features and Invariance visual processing is done in stages each area performs a transformation on its inputs invariance is built gradually across many successive steps DiCarlo & Cox (2007) Processing Pathways key points 10 3

2. Hierarchical Pattern Recognition Systems neuroscience-inspired computer vision Hierarchical Computation 11 3

Neocognitron: Neural Network Layer 2 Layer 1 Cell Layer 3 Hierarchical Computation neocognitron 12 3

Neocognitron: Neural Network Fukushima (1988) Hierarchical Computation neocognitron 13 3

Neocognitron: Neural Network input layer S-cell layer C-cell layer Fukushima (1988) Hierarchical Computation neocognitron 13 3

Neocognitron: S-Cell C-cell layer / input layer S-cell layer C-cell layer LEARNING SUM THRESHOLD Fukushima (1988) Hierarchical Computation neocognitron 14 3

Neocognitron: S-Cell figure courtesy of A. Alahi Hierarchical Computation neocognitron 15 3

Neocognitron: C-Cell Pooling building position invariance Fukushima (1988) Hierarchical Computation neocognitron 16 3

Neocognitron: Network input S-cell C-cell S-cell C-cell S-cell C-cell S-cell C-cell Fukushima (1988) Hierarchical Computation neocognitron 17 3

Neocognitron: Network input S-cell C-cell S-cell C-cell S-cell C-cell S-cell C-cell Fukushima (1988) Hierarchical Computation neocognitron 17 3

Neocognitron: Network input S-cell C-cell S-cell C-cell S-cell C-cell S-cell C-cell Fukushima (1988) Hierarchical Computation neocognitron 17 3

Neocognitron: Network input S-cell C-cell S-cell C-cell S-cell C-cell S-cell C-cell Fukushima (1988) Hierarchical Computation neocognitron 17 3

Neocognitron: Robust Results Fukushima (1988) Hierarchical Computation neocognitron 18 3

Neocognitron biologically inspired hierarchical processing pipeline invariance is built gradually across many successive steps simple neural network solves complicated non-linear problem input layer S-cell layer C-cell layer Fukushima (1988) Hierarchical Computation neocognitron key points 19 3

Feed-Forward Object Recognition Model Hubel & Wiesel (1962), Riesenhuber & Poggio (1999), Serre et al. (2007) Hierarchical Computation feed-forward model 20 3

Feed-Forward Object Recognition Model Hubel & Wiesel (1962), Riesenhuber & Poggio (1999), Serre et al. (2007) Hierarchical Computation feed-forward model 20 3

Feed-Forward Object Recognition Model task: is there an animal in the picture? Serre et al. (2007) Hierarchical Computation feed-forward model 21 3

Feed-Forward Object Recognition Model task: is there an animal in the picture? Serre et al. (2007) Hierarchical Computation feed-forward model 22 3

Feed-Forward Object Recognition Model task: is there an animal in the picture? Serre et al. (2007) Hierarchical Computation feed-forward model 22 3

Feed-Forward Object Recognition Model task: is there an animal in the picture? Serre et al. (2007) Hierarchical Computation feed-forward model 24 3

Feed-Forward Object Recognition Model biologically-inspired processing pipeline patches receptive fields building invariance hierarchical processing major drawback? no feedback Serre et al. (2007) Hierarchical Computation feed-forward model key points 25 3

Discussion

Discussion how closely should we aim to copy human vision? is reverse-engineering human vision a self-imposed limitation? perfect recognition vs. visual understanding?

3. Modern Neural Networks extremely large scale data and computation Modern Neural Networks 27 3

Deep Convolutional Neural Network 650,000 cells 60,000,000 parameters input convolution pooling convolution pooling fully-connected Krizhevsky et al. (2013) Modern Neural Networks deep CNN 28 3

Deep Convolutional Neural Network 650,000 cells 60,000,000 parameters input convolution pooling convolution pooling fully-connected Krizhevsky et al. (2013) Modern Neural Networks deep CNN 28 3

Deep Convolutional Neural Network Want to know more about state-of-the-art neural networks? CS 231N http://vision.stanford.edu/cs231n/ Winter QT 2015 Fei-Fei Li & Andrej Karpathy Modern Neural Networks CS 231N 29 3

Object Recognition: Building Features and Invariance visual processing is done in stages each area performs a transformation on its inputs invariance is built gradually across many successive steps DiCarlo & Cox (2007) Processing Pathways key points 30 3

Feed-Forward Object Recognition Model biologically-inspired processing pipeline patches receptive fields building invariance hierarchical processing major drawbacks? no feedback much less complex than human vision Serre et al. (2007) Hierarchical Computation feed-forward model key points 31 3

Deep Convolutional Neural Network extremely large scale data and computation sudden, huge performance boost for recognition if you want to know more, take CS 231N in Winter QT Krizhevsky et al. (2013) Modern Neural Networks deep CNN key points 32 3

End-Quarter Feedback Forms