Action Recognition. Computer Vision Jia-Bin Huang, Virginia Tech. Many slides from D. Hoiem

Size: px
Start display at page:

Download "Action Recognition. Computer Vision Jia-Bin Huang, Virginia Tech. Many slides from D. Hoiem"

Transcription

1 Action Recognition Computer Vision Jia-Bin Huang, Virginia Tech Many slides from D. Hoiem

2 This section: advanced topics Convolutional neural networks in vision Action recognition Vision and Language 3D Scenes and Context

3 What is an action? Action: a transition from one state to another Who is the actor? How is the state of the actor changing? What (if anything) is being acted on? How is that thing changing? What is the purpose of the action (if any)?

4 How do we represent actions? Categories Walking, hammering, dancing, skiing, sitting down, standing up, jumping Poses Nouns and Predicates <man, swings, hammer> <man, hits, nail, w/ hammer>

5 What is the purpose of action recognition? To describe

6 What is the purpose of action recognition? To predict

7 What is the purpose of action recognition? To understand the intention and motivation Predicting Motivations of Actions by Leveraging Text, CVPR 2016

8 How can we identify actions? Motion Pose Held Objects Nearby Objects

9 Representing Motion Optical Flow with Motion History Bobick Davis 2001

10 Representing Motion Space-Time Volumes Blank et al. 2005

11 Representing Motion Optical Flow with Split Channels optical flow split into pos/neg channels blurred pos/neg flow Efros et al. 2003

12 Representing Motion Tracked Points Matikainen et al. 2009

13 Representing Motion Space-Time Interest Points Moving corner Ball hits wall Corner detectors in space-time Balls collide Balls collide (different scale) Laptev 2005

14 Representing Motion Space-Time Interest Points Laptev 2005

15 Examples of Action Recognition Systems Feature-based classification Recognition using pose and objects

16 Action recognition as classification Retrieving actions in movies, Laptev and Perez, 2007

17 Remember image categorization Training Images Training Image Features Training Labels Classifier Training Trained Classifier Test Image Image Features Testing Trained Classifier Prediction Outdoor

18 Remember spatial pyramids. Compute histogram in each spatial bin

19 Features for Classifying Actions 1. Spatio-temporal pyramids Image Gradients Optical Flow

20 Features for Classifying Actions 2. Spatio-temporal interest points Corner detectors in space-time Descriptors based on Gaussian derivative filters over x, y, time

21 Classification Boosted stubs for pyramids of optical flow, gradient Nearest neighbor for STIP

22 Searching the video for an action 1. Detect keyframes using a trained HOG detector in each frame 2. Classify detected keyframes as positive (e.g., drinking ) or negative ( other )

23 Accuracy in searching video With keyframe detection Without keyframe detection

24 Talk on phone Get out of car Learning realistic human actions from movies, Laptev et al. 2008

25 Approach Space-time interest point detectors Descriptors HOG, HOF Pyramid histograms (3x3x2) SVMs with Chi-Squared Kernel Interest Points Spatio-Temporal Binning

26 Results

27 Action Recognition using Pose and Objects Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities, B. Yao and Li Fei-Fei, 2010 Slide Credit: Yao/Fei-Fei

28 Human-Object Interaction Holistic image based classification Integrated reasoning Human pose estimation Head Torso Slide Credit: Yao/Fei-Fei

29 Human-Object Interaction Holistic image based classification Integrated reasoning Human pose estimation Object detection Tennis racket Slide Credit: Yao/Fei-Fei

30 Human-Object Interaction Holistic image based classification Integrated reasoning Human pose estimation Object detection Action categorization Tennis racket Head Torso Activity: Tennis Forehand Slide Credit: Yao/Fei-Fei

31 Human pose estimation & Object detection Human pose estimation is challenging. Difficult part appearance Self-occlusion Image region looks like a body part Felzenszwalb & Huttenlocher, 2005 Ren et al, 2005 Ramanan, 2006 Ferrari et al, 2008 Yang & Mori, 2008 Andriluka et al, 2009 Eichner & Ferrari, 2009 Slide Credit: Yao/Fei-Fei

32 Human pose estimation & Object detection Human pose estimation is challenging. Felzenszwalb & Huttenlocher, 2005 Ren et al, 2005 Ramanan, 2006 Ferrari et al, 2008 Yang & Mori, 2008 Andriluka et al, 2009 Eichner & Ferrari, 2009 Slide Credit: Yao/Fei-Fei

33 Human pose estimation & Object detection Facilitate Given the object is detected. Slide Credit: Yao/Fei-Fei

34 Human pose estimation & Object detection Small, low-resolution, partially occluded Object detection is challenging Image region similar to detection target Viola & Jones, 2001 Lampert et al, 2008 Divvala et al, 2009 Vedaldi et al, 2009 Slide Credit: Yao/Fei-Fei

35 Human pose estimation & Object detection Object detection is challenging Viola & Jones, 2001 Lampert et al, 2008 Divvala et al, 2009 Vedaldi et al, 2009 Slide Credit: Yao/Fei-Fei

36 Human pose estimation & Object detection Facilitate Given the pose is estimated. Slide Credit: Yao/Fei-Fei

37 Human pose estimation & Object detection Mutual Context Slide Credit: Yao/Fei-Fei

38 Mutual Context Model Representation A: Activity A O: Tennis forehand Croquet shot Volleyball smash Object Human pose H H: Tennis racket Croquet mallet Volleyball O P 1 P 2 Body parts P N f O f1 f2 fn Intra-class variations More than one H for each A; Unobserved during training. Image evidence P: l P : location; θ P : orientation; s P : scale. f: Shape context. [Belongie et al, 2002] Slide Credit: Yao/Fei-Fei

39 Learning Results Cricket defensive shot Cricket bowling Croquet shot Slide Credit: Yao/Fei-Fei

40 Learning Results Tennis forehand Tennis serve Volleyball smash Slide Credit: Yao/Fei-Fei

41 Model Inference I The learned models Slide Credit: Yao/Fei-Fei

42 Model Inference I The learned models Head detection Compositional Inference Torso detection Layout of the object and body parts. * * A1, H1, O1, P1, n n [Chen et al, 2007] Tennis racket detection Slide Credit: Yao/Fei-Fei

43 Model Inference I The learned models Output * * A1 H1 O1 P1, * * A, H, O, P,,,, n n K K K K n n Slide Credit: Yao/Fei-Fei

44 Dataset and Experiment Setup Sport data set: 6 classes 180 training (supervised with object and part locations) & 120 testing images Tasks: Object detection; Pose estimation; Cricket defensive shot Cricket bowling Croquet shot Activity classification. Tennis forehand Tennis serve Volleyball smash [Gupta et al, 2009] Slide Credit: Yao/Fei-Fei

45 Dataset and Experiment Setup Sport data set: 6 classes 180 training (supervised with object and part locations) & 120 testing images Tasks: Object detection; Pose estimation; Cricket defensive shot Cricket bowling Croquet shot Activity classification. Tennis forehand Tennis serve Volleyball smash [Gupta et al, 2009] Slide Credit: Yao/Fei-Fei

46 Object Detection Results Valid region Cricket bat Cricket ball Sliding window [Andriluka et al, 2009] Pedestrian context [Dalal & Triggs, 2006] Our Method Precision Recall Croquet mallet Tennis racket Volleyball Precision Recall Slide Credit: Yao/Fei-Fei

47 Precision Background clutter Small object Object Detection Results 1 Our Method 1 Our 1Method 0.8 Pedestrian as context Scanning 0.8 window detector Pedestrian as context Our Method Sliding window Pedestrian context Scanning Our window method detector 0.8 Pedestrian as context Scanning window detector Recall Recall Recall Precision Precision Precision Recall Cricket ball Volleyball Precision Recall 59 Slide Credit: Yao/Fei-Fei

48 Dataset and Experiment Setup Sport data set: 6 classes 180 training & 120 testing images Tasks: Object detection; Pose estimation; Cricket defensive shot Cricket bowling Croquet shot Activity classification. Tennis forehand Tennis serve Volleyball smash [Gupta et al, 2009] Slide Credit: Yao/Fei-Fei

49 Human Pose Estimation Results Method Torso Upper Leg Lower Leg Upper Arm Lower Arm Head Ramanan, 2006 Andriluka et al, 2009 Our full model Slide Credit: Yao/Fei-Fei

50 Human Pose Estimation Results Method Torso Upper Leg Lower Leg Upper Arm Lower Arm Head Ramanan, 2006 Andriluka et al, 2009 Our full model Tennis serve model Our estimation result Andriluka et al, 2009 Volleyball smash model Our estimation Andriluka result et al, 2009 Slide Credit: Yao/Fei-Fei

51 Human Pose Estimation Results Method Torso Upper Leg Lower Leg Upper Arm Lower Arm Head Ramanan, 2006 Andriluka et al, 2009 Our full model One pose per class Estimation result Estimation result Estimation result Estimation result Slide Credit: Yao/Fei-Fei

52 Dataset and Experiment Setup Sport data set: 6 classes 180 training & 120 testing images Tasks: Object detection; Pose estimation; Cricket defensive shot Cricket bowling Croquet shot Activity classification. Tennis forehand Tennis serve Volleyball smash [Gupta et al, 2009] Slide Credit: Yao/Fei-Fei

53 Activity Classification Results Classification accuracy % 78.9% 52.5% Cricket shot Tennis forehand 0.5 Our model Our model Gupta et al, 2009 Gupta et al, 2009 Bag-of- Words Bag-of-words SIFT+SVM Slide Credit: Yao/Fei-Fei

54 Motion features Dense Trajectory Action Recognition by Dense Trajectories, CVPR 2011 Action Recognition with Improved Trajectories, ICCV 2013

55 Video classification with CNNs Large-scale Video Classification with Convolutional Neural Networks, CVPR 2014

56 Video classification with CNNs Large-scale Video Classification with Convolutional Neural Networks, CVPR 2014

57 Two-stream CNN Two-Stream Convolutional Networks for Action Recognition in Videos, NIPS 2014

58 3D Convolutional Networks Learning Spatiotemporal Features with 3D Convolutional Networks, ICCV 2015

59 Action recognition -> Semantic role Labeling

60

61 Take-home messages Action recognition is an open problem. How to define actions? How to infer them? What are good visual cues? How do we incorporate higher level reasoning?

62 Take-home messages Some work done, but it is just the beginning of exploring the problem. So far Actions are mainly categorical (could be framed in terms of effect or intent) Most approaches are classification using simple features (spatial-temporal histograms of gradients or flow, s-t interest points, SIFT in images) Just a couple works on how to incorporate pose and objects Not much idea of how to reason about long-term activities or to describe video sequences

Recognising Human-Object Interaction via Exemplar based Modelling

Recognising Human-Object Interaction via Exemplar based Modelling 2013 IEEE International Conference on Computer Vision Recognising Human-Object Interaction via Exemplar based Modelling Jian-Fang Hu, Wei-Shi Zheng, Jianhuang Lai, Shaogang Gong, and Tao Xiang School of

More information

Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition

Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition Stefan Mathe, Cristian Sminchisescu Presented by Mit Shah Motivation Current Computer Vision Annotations subjectively

More information

Hierarchical Convolutional Features for Visual Tracking

Hierarchical Convolutional Features for Visual Tracking Hierarchical Convolutional Features for Visual Tracking Chao Ma Jia-Bin Huang Xiaokang Yang Ming-Husan Yang SJTU UIUC SJTU UC Merced ICCV 2015 Background Given the initial state (position and scale), estimate

More information

Beyond R-CNN detection: Learning to Merge Contextual Attribute

Beyond R-CNN detection: Learning to Merge Contextual Attribute Brain Unleashing Series - Beyond R-CNN detection: Learning to Merge Contextual Attribute Shu Kong CS, ICS, UCI 2015-1-29 Outline 1. RCNN is essentially doing classification, without considering contextual

More information

Video Saliency Detection via Dynamic Consistent Spatio- Temporal Attention Modelling

Video Saliency Detection via Dynamic Consistent Spatio- Temporal Attention Modelling AAAI -13 July 16, 2013 Video Saliency Detection via Dynamic Consistent Spatio- Temporal Attention Modelling Sheng-hua ZHONG 1, Yan LIU 1, Feifei REN 1,2, Jinghuan ZHANG 2, Tongwei REN 3 1 Department of

More information

Exemplar-based Action Recognition in Video

Exemplar-based Action Recognition in Video WILLEMS et al.: EXEMPLAR-BASED ACTION RECOGNITION IN VIDEO 1 Exemplar-based Action Recognition in Video Geert Willems 1 homes.esat.kuleuven.be/~gwillems Jan Hendrik Becker 1 homes.esat.kuleuven.be/~jhbecker

More information

Part 1: Bag-of-words models. by Li Fei-Fei (Princeton)

Part 1: Bag-of-words models. by Li Fei-Fei (Princeton) Part 1: Bag-of-words models by Li Fei-Fei (Princeton) Object Bag of words Analogy to documents Of all the sensory impressions proceeding to the brain, the visual experiences are the dominant ones. Our

More information

Classroom Data Collection and Analysis using Computer Vision

Classroom Data Collection and Analysis using Computer Vision Classroom Data Collection and Analysis using Computer Vision Jiang Han Department of Electrical Engineering Stanford University Abstract This project aims to extract different information like faces, gender

More information

Lecture 11 Visual recognition

Lecture 11 Visual recognition Lecture 11 Visual recognition Descriptors (wrapping up) An introduction to recognition Image classification Silvio Savarese Lecture 11-12-Feb-14 The big picture Feature Detection e.g. DoG Feature Description

More information

Putting Context into. Vision. September 15, Derek Hoiem

Putting Context into. Vision. September 15, Derek Hoiem Putting Context into Vision Derek Hoiem September 15, 2004 Questions to Answer What is context? How is context used in human vision? How is context currently used in computer vision? Conclusions Context

More information

Medical Image Analysis

Medical Image Analysis Medical Image Analysis 1 Co-trained convolutional neural networks for automated detection of prostate cancer in multiparametric MRI, 2017, Medical Image Analysis 2 Graph-based prostate extraction in t2-weighted

More information

MEMORABILITY OF NATURAL SCENES: THE ROLE OF ATTENTION

MEMORABILITY OF NATURAL SCENES: THE ROLE OF ATTENTION MEMORABILITY OF NATURAL SCENES: THE ROLE OF ATTENTION Matei Mancas University of Mons - UMONS, Belgium NumediArt Institute, 31, Bd. Dolez, Mons matei.mancas@umons.ac.be Olivier Le Meur University of Rennes

More information

Rich feature hierarchies for accurate object detection and semantic segmentation

Rich feature hierarchies for accurate object detection and semantic segmentation Rich feature hierarchies for accurate object detection and semantic segmentation Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik UC Berkeley Tech Report @ http://arxiv.org/abs/1311.2524! Detection

More information

Learning to Generate Long-term Future via Hierarchical Prediction. A. Motion-Based Pixel-Level Evaluation, Analysis, and Control Experiments

Learning to Generate Long-term Future via Hierarchical Prediction. A. Motion-Based Pixel-Level Evaluation, Analysis, and Control Experiments Appendix A. Motion-Based Pixel-Level Evaluation, Analysis, and Control Experiments In this section, we evaluate the predictions by deciles of motion similar to Villegas et al. (2017) using Peak Signal-to-Noise

More information

Active Deformable Part Models Inference

Active Deformable Part Models Inference Active Deformable Part Models Inference Menglong Zhu Nikolay Atanasov George J. Pappas Kostas Daniilidis GRASP Laboratory, University of Pennsylvania 3330 Walnut Street, Philadelphia, PA 19104, USA Abstract.

More information

Estimating scene typicality from human ratings and image features

Estimating scene typicality from human ratings and image features Estimating scene typicality from human ratings and image features Krista A. Ehinger (kehinger@mit.edu) Department of Brain & Cognitive Sciences, MIT, 77 Massachusetts Ave. Jianxiong Xiao (jxiao@csail.mit.edu)

More information

Long Term Activity Analysis in Surveillance Video Archives. Ming-yu Chen. June 8, 2010

Long Term Activity Analysis in Surveillance Video Archives. Ming-yu Chen. June 8, 2010 Long Term Activity Analysis in Surveillance Video Archives Ming-yu Chen Language Technologies Institute School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 Thesis Committee: Alexander

More information

DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS

DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS SEOUL Oct.7, 2016 DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS Gunhee Kim Computer Science and Engineering Seoul National University October

More information

EXEMPLAR-BASED HUMAN INTERACTION RECOGNITION: FEATURES AND KEY POSE SEQUENCE MODEL

EXEMPLAR-BASED HUMAN INTERACTION RECOGNITION: FEATURES AND KEY POSE SEQUENCE MODEL EXEMPLAR-BASED HUMAN INTERACTION RECOGNITION: FEATURES AND KEY POSE SEQUENCE MODEL by Bo Gao B.Eng., Xi an Jiaotong University, China, 2009 a Thesis submitted in partial fulfillment of the requirements

More information

CAN A SMILE REVEAL YOUR GENDER?

CAN A SMILE REVEAL YOUR GENDER? CAN A SMILE REVEAL YOUR GENDER? Antitza Dantcheva, Piotr Bilinski, Francois Bremond INRIA Sophia Antipolis, France JOURNEE de la BIOMETRIE 2017, Caen, 07/07/17 OUTLINE 1. Why Gender Estimation? 2. Related

More information

BLACK PEAR TRUST SUBJECT PLAN - PE

BLACK PEAR TRUST SUBJECT PLAN - PE Purpose of Study A high-quality physical education curriculum inspires all pupils to succeed and excel in competitive sport and other physically-demanding activities. It should provide opportunities for

More information

CONSTRAINED SEMI-SUPERVISED LEARNING ATTRIBUTES USING ATTRIBUTES AND COMPARATIVE

CONSTRAINED SEMI-SUPERVISED LEARNING ATTRIBUTES USING ATTRIBUTES AND COMPARATIVE CONSTRAINED SEMI-SUPERVISED LEARNING USING ATTRIBUTES AND COMPARATIVE ATTRIBUTES Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta The Robotics Institute Carnegie Mellon University SUPERVISION BIG-DATA

More information

Computational modeling of visual attention and saliency in the Smart Playroom

Computational modeling of visual attention and saliency in the Smart Playroom Computational modeling of visual attention and saliency in the Smart Playroom Andrew Jones Department of Computer Science, Brown University Abstract The two canonical modes of human visual attention bottomup

More information

KS4 Physical Education

KS4 Physical Education KS4 Physical Education Component of Fitness These icons indicate that teacher s notes or useful web addresses are available in the Notes Page. This icon indicates that the slide contains activities created

More information

Physical Education Department COURSE Outline:

Physical Education Department COURSE Outline: Course Title: Teacher(s) + e-mail: Cycle/Division: Grade Level: Physical Education Rami: Rami.r.a@greenwood.sch.ae ; Abeer: abeer@greenwood.sch.ae Middle School 7 A,B,E and F Credit Unit: 0.5 Duration:

More information

Biologically-Inspired Human Motion Detection

Biologically-Inspired Human Motion Detection Biologically-Inspired Human Motion Detection Vijay Laxmi, J. N. Carter and R. I. Damper Image, Speech and Intelligent Systems (ISIS) Research Group Department of Electronics and Computer Science University

More information

Object recognition and hierarchical computation

Object recognition and hierarchical computation Object recognition and hierarchical computation Challenges in object recognition. Fukushima s Neocognitron View-based representations of objects Poggio s HMAX Forward and Feedback in visual hierarchy Hierarchical

More information

Vision: Over Ov view Alan Yuille

Vision: Over Ov view Alan Yuille Vision: Overview Alan Yuille Why is Vision Hard? Complexity and Ambiguity of Images. Range of Vision Tasks. More 10x10 images 256^100 = 6.7 x 10 ^240 than the total number of images seen by all humans

More information

PE Scheme of Work Overview

PE Scheme of Work Overview PE Scheme of Work Overview Areas of PE curriculum Games inc. invasion, striking and fielding, net and wall Dance Gymnastics Athletics Outdoors and adventure inc. residential Swimming planned and monitored

More information

different dance form from among folk, social,

different dance form from among folk, social, Standard 1: The physically literate individual demonstrates proficiency in a variety of motor skills and movement patterns. S1.M1 Demonstrates correct rhythm and pattern for a different dance form from

More information

Crowd behavior representation: an attribute based approach

Crowd behavior representation: an attribute based approach DOI 10.1186/s40064-016-2786-0 RESEARCH Open Access Crowd behavior representation: an attribute based approach Hamidreza Rabiee 1, Javad Haddadnia 2* and Hossein Mousavi 3 *Correspondence: haddadnia@hsu.ac.ir

More information

ABSTRACT I. INTRODUCTION

ABSTRACT I. INTRODUCTION 2018 IJSRSET Volume 4 Issue 2 Print ISSN: 2395-1990 Online ISSN : 2394-4099 National Conference on Advanced Research Trends in Information and Computing Technologies (NCARTICT-2018), Department of IT,

More information

Human Activities: Handling Uncertainties Using Fuzzy Time Intervals

Human Activities: Handling Uncertainties Using Fuzzy Time Intervals The 19th International Conference on Pattern Recognition (ICPR), Tampa, FL, 2009 Human Activities: Handling Uncertainties Using Fuzzy Time Intervals M. S. Ryoo 1,2 and J. K. Aggarwal 1 1 Computer & Vision

More information

Shu Kong. Department of Computer Science, UC Irvine

Shu Kong. Department of Computer Science, UC Irvine Ubiquitous Fine-Grained Computer Vision Shu Kong Department of Computer Science, UC Irvine Outline 1. Problem definition 2. Instantiation 3. Challenge and philosophy 4. Fine-grained classification with

More information

Automatic Classification of Perceived Gender from Facial Images

Automatic Classification of Perceived Gender from Facial Images Automatic Classification of Perceived Gender from Facial Images Joseph Lemley, Sami Abdul-Wahid, Dipayan Banik Advisor: Dr. Razvan Andonie SOURCE 2016 Outline 1 Introduction 2 Faces - Background 3 Faces

More information

Competing Frameworks in Perception

Competing Frameworks in Perception Competing Frameworks in Perception Lesson II: Perception module 08 Perception.08. 1 Views on perception Perception as a cascade of information processing stages From sensation to percept Template vs. feature

More information

Competing Frameworks in Perception

Competing Frameworks in Perception Competing Frameworks in Perception Lesson II: Perception module 08 Perception.08. 1 Views on perception Perception as a cascade of information processing stages From sensation to percept Template vs. feature

More information

Dynamic Eye Movement Datasets and Learnt Saliency Models for Visual Action Recognition

Dynamic Eye Movement Datasets and Learnt Saliency Models for Visual Action Recognition Dynamic Eye Movement Datasets and Learnt Saliency Models for Visual Action Recognition Stefan Mathe 1,3 and Cristian Sminchisescu 2,1 1 Institute of Mathematics of the Romanian Academy (IMAR) 2 Faculty

More information

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering SUPPLEMENTARY MATERIALS 1. Implementation Details 1.1. Bottom-Up Attention Model Our bottom-up attention Faster R-CNN

More information

CS4495 Computer Vision Introduction to Recognition. Aaron Bobick School of Interactive Computing

CS4495 Computer Vision Introduction to Recognition. Aaron Bobick School of Interactive Computing CS4495 Computer Vision Introduction to Recognition Aaron Bobick School of Interactive Computing What does recognition involve? Source: Fei Fei Li, Rob Fergus, Antonio Torralba. Verification: is that

More information

Hierarchical Local Binary Pattern for Branch Retinal Vein Occlusion Recognition

Hierarchical Local Binary Pattern for Branch Retinal Vein Occlusion Recognition Hierarchical Local Binary Pattern for Branch Retinal Vein Occlusion Recognition Zenghai Chen 1, Hui Zhang 2, Zheru Chi 1, and Hong Fu 1,3 1 Department of Electronic and Information Engineering, The Hong

More information

(Visual) Attention. October 3, PSY Visual Attention 1

(Visual) Attention. October 3, PSY Visual Attention 1 (Visual) Attention Perception and awareness of a visual object seems to involve attending to the object. Do we have to attend to an object to perceive it? Some tasks seem to proceed with little or no attention

More information

Object Detectors Emerge in Deep Scene CNNs

Object Detectors Emerge in Deep Scene CNNs Object Detectors Emerge in Deep Scene CNNs Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba Presented By: Collin McCarthy Goal: Understand how objects are represented in CNNs Are

More information

Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags

Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman Department of Computer Science University of Texas at Austin {sjhwang,grauman}@cs.utexas.edu

More information

Fine-Grained Image Classification Using Color Exemplar Classifiers

Fine-Grained Image Classification Using Color Exemplar Classifiers Fine-Grained Image Classification Using Color Exemplar Classifiers Chunjie Zhang 1, Wei Xiong 1, Jing Liu 2, Yifan Zhang 2, Chao Liang 3, and Qingming Huang 1,4 1 School of Computer and Control Engineering,

More information

RECENT progress in computer visual recognition, in particular

RECENT progress in computer visual recognition, in particular IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, REVISED, AUGUST 2014. 1 Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition Stefan Mathe, Member,

More information

Geneva CUSD 304 Content-Area Curriculum Frameworks Grades 6-12 Physical Education & Health

Geneva CUSD 304 Content-Area Curriculum Frameworks Grades 6-12 Physical Education & Health Geneva CUSD 304 Content-Area Curriculum Frameworks Grades 6-12 Physical Education & Health Mission Statement Course Sequence (Grades 6-12) Physical Education and Health assists students of all abilities

More information

Nature Neuroscience: doi: /nn Supplementary Figure 1. Overlap between default mode network (DMN) and movie/recall maps.

Nature Neuroscience: doi: /nn Supplementary Figure 1. Overlap between default mode network (DMN) and movie/recall maps. Supplementary Figure 1 Overlap between default mode network (DMN) and movie/recall maps. We defined the DMN for each individual using the posterior medial cortex ROI as a seed for functional connectivity

More information

Supplemental Materials for Paper 113

Supplemental Materials for Paper 113 Supplemental Materials for Paper 113 August 19, 2012 In this document, we include the following items: 1. Concept definition: the full list of our 101 concepts including actions, scenes, and objects. 2.

More information

Change Blindness. The greater the lie, the greater the chance that it will be believed.

Change Blindness. The greater the lie, the greater the chance that it will be believed. Change Blindness The greater the lie, the greater the chance that it will be believed. (kurt@kloover.com) Department of Computer Science Rochester Institute of Technology 1 Definitions Seeing: the use

More information

On Laban Movement Analysis and The Eight Efforts

On Laban Movement Analysis and The Eight Efforts Grimes Somerset Theatre Factory On Laban Movement Analysis and The Eight Efforts What Is It? Laban technique was created by a Hungarian man named Rudolph Laban, who was interested in studying the language

More information

Physical Education Scope and Sequence: Grades

Physical Education Scope and Sequence: Grades The Scope and Sequence document represents an articulation of what students should know and be able to do. The document supports teachers in knowing how to help students achieve the goals of the standards

More information

RECENT progress in computer visual recognition, in particular

RECENT progress in computer visual recognition, in particular 1 Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition Stefan Mathe, Member, IEEE, Cristian Sminchisescu, Member, IEEE arxiv:1312.7570v1 [cs.cv] 29 Dec 2013 Abstract

More information

Multi-attention Guided Activation Propagation in CNNs

Multi-attention Guided Activation Propagation in CNNs Multi-attention Guided Activation Propagation in CNNs Xiangteng He and Yuxin Peng (B) Institute of Computer Science and Technology, Peking University, Beijing, China pengyuxin@pku.edu.cn Abstract. CNNs

More information

Describing Clothing by Semantic Attributes

Describing Clothing by Semantic Attributes Describing Clothing by Semantic Attributes Huizhong Chen 1, Andrew Gallagher 2,3, and Bernd Girod 1 Department of Electrical Engineering, Stanford University, Stanford, California 1 Kodak Research Laboratories,

More information

Fine-Grained Image Classification Using Color Exemplar Classifiers

Fine-Grained Image Classification Using Color Exemplar Classifiers Fine-Grained Image Classification Using Color Exemplar Classifiers Chunjie Zhang 1, Wei Xiong 1, Jing Liu 2, Yifan Zhang 2, Chao Liang 3, Qingming Huang 1, 4 1 School of Computer and Control Engineering,

More information

Predicting the location of interactees in novel human-object interactions

Predicting the location of interactees in novel human-object interactions In Proceedings of the Asian Conference on Computer Vision (ACCV), 2014. Predicting the location of interactees in novel human-object interactions Chao-Yeh Chen and Kristen Grauman University of Texas at

More information

Using the Soundtrack to Classify Videos

Using the Soundtrack to Classify Videos Using the Soundtrack to Classify Videos Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA dpwe@ee.columbia.edu http://labrosa.ee.columbia.edu/

More information

Expression Recognition for Severely Demented Patients in Music Reminiscence-Therapy

Expression Recognition for Severely Demented Patients in Music Reminiscence-Therapy Expression Recognition for Severely Demented Patients in Music Reminiscence-Therapy Antitza Dantcheva 1 *, Piotr Bilinski 2 *, Hung Thanh Nguyen 1, Jean-Claude Broutart 3, Francois Bremond 1 1 Inria, Sophia

More information

Physical Education Curriculum Map Pool/Canoeing Grade: 7

Physical Education Curriculum Map Pool/Canoeing Grade: 7 Pool/Canoeing Grade: 7 Participation in physical activity impacts wellness throughout a lifetime. Quality lifelong movement is based on scientific concepts/principles. How can physical activity choices

More information

SVM-based Discriminative Accumulation Scheme for Place Recognition

SVM-based Discriminative Accumulation Scheme for Place Recognition SVM-based Discriminative Accumulation Scheme for Place Recognition Andrzej Pronobis CAS/CVAP, KTH Stockholm, Sweden pronobis@csc.kth.se Óscar Martínez Mozos AIS, University Of Freiburg Freiburg, Germany

More information

Decoding the Representation of Code in the Brain: An fmri Study of Code Review and Expertise

Decoding the Representation of Code in the Brain: An fmri Study of Code Review and Expertise Decoding the Representation of Code in the Brain: An fmri Study of Code Review and Expertise Benjamin Floyd, Tyler Santander, University of Virginia University of Michigan University of Michigan Looking

More information

Multimodal interactions: visual-auditory

Multimodal interactions: visual-auditory 1 Multimodal interactions: visual-auditory Imagine that you are watching a game of tennis on television and someone accidentally mutes the sound. You will probably notice that following the game becomes

More information

Designing Caption Production Rules Based on Face, Text and Motion Detections

Designing Caption Production Rules Based on Face, Text and Motion Detections Designing Caption Production Rules Based on Face, Text and Motion Detections C. Chapdelaine *, M. Beaulieu, L. Gagnon R&D Department, Computer Research Institute of Montreal (CRIM), 550 Sherbrooke West,

More information

arxiv: v1 [cs.cv] 16 Jun 2012

arxiv: v1 [cs.cv] 16 Jun 2012 How important are Deformable Parts in the Deformable Parts Model? Santosh K. Divvala, Alexei A. Efros, Martial Hebert Carnegie Mellon University arxiv:1206.3714v1 [cs.cv] 16 Jun 2012 Abstract. The main

More information

Rolling Programme of Outcomes and Themes. PE- Foundation

Rolling Programme of Outcomes and Themes. PE- Foundation Rolling Programme of Outcomes and Themes PE- Foundation Year A 2015/16 Autumn Spring Summer Theme Outcomes Theme Outcomes Theme Outcomes Split into 2 half-terms only for group 1 Multi skills Circuits ABC

More information

Image Captioning using Reinforcement Learning. Presentation by: Samarth Gupta

Image Captioning using Reinforcement Learning. Presentation by: Samarth Gupta Image Captioning using Reinforcement Learning Presentation by: Samarth Gupta 1 Introduction Summary Supervised Models Image captioning as RL problem Actor Critic Architecture Policy Gradient architecture

More information

Center Cass School District 66

Center Cass School District 66 Center Cass School District 66 C URRICULUM GUIDE Physical Education Physical development programs offer students the opportunity to grow physically, emotionally, socially, and academically. This contributes

More information

Physical Education Skills Progression. Eden Park Primary School Academy

Physical Education Skills Progression. Eden Park Primary School Academy Physical Education Skills Progression Eden Park Primary School Academy In order to ensure broad and balanced coverage, we follow these principles: Within each phase, PE is taught twice weekly for a total

More information

Shu Kong. Department of Computer Science, UC Irvine

Shu Kong. Department of Computer Science, UC Irvine Ubiquitous Fine-Grained Computer Vision Shu Kong Department of Computer Science, UC Irvine Outline 1. Problem definition 2. Instantiation 3. Challenge 4. Fine-grained classification with holistic representation

More information

ITERATIVELY TRAINING CLASSIFIERS FOR CIRCULATING TUMOR CELL DETECTION

ITERATIVELY TRAINING CLASSIFIERS FOR CIRCULATING TUMOR CELL DETECTION ITERATIVELY TRAINING CLASSIFIERS FOR CIRCULATING TUMOR CELL DETECTION Yunxiang Mao 1, Zhaozheng Yin 1, Joseph M. Schober 2 1 Missouri University of Science and Technology 2 Southern Illinois University

More information

Social Group Discovery from Surveillance Videos: A Data-Driven Approach with Attention-Based Cues

Social Group Discovery from Surveillance Videos: A Data-Driven Approach with Attention-Based Cues CHAMVEHA ET AL.: SOCIAL GROUP DISCOVERY FROM SURVEILLANCE VIDEOS 1 Social Group Discovery from Surveillance Videos: A Data-Driven Approach with Attention-Based Cues Isarun Chamveha 1 isarunc@iis.u-tokyo.ac.jp

More information

Efficient Deep Model Selection

Efficient Deep Model Selection Efficient Deep Model Selection Jose Alvarez Researcher Data61, CSIRO, Australia GTC, May 9 th 2017 www.josemalvarez.net conv1 conv2 conv3 conv4 conv5 conv6 conv7 conv8 softmax prediction???????? Num Classes

More information

Convolutional Neural Networks (CNN)

Convolutional Neural Networks (CNN) Convolutional Neural Networks (CNN) Algorithm and Some Applications in Computer Vision Luo Hengliang Institute of Automation June 10, 2014 Luo Hengliang (Institute of Automation) Convolutional Neural Networks

More information

Using Deep Convolutional Networks for Gesture Recognition in American Sign Language

Using Deep Convolutional Networks for Gesture Recognition in American Sign Language Using Deep Convolutional Networks for Gesture Recognition in American Sign Language Abstract In the realm of multimodal communication, sign language is, and continues to be, one of the most understudied

More information

VENUS: A System for Novelty Detection in Video Streams with Learning

VENUS: A System for Novelty Detection in Video Streams with Learning VENUS: A System for Novelty Detection in Video Streams with Learning Roger S. Gaborski, Vishal S. Vaingankar, Vineet S. Chaoji, Ankur M. Teredesai Laboratory for Applied Computing, Rochester Institute

More information

Learning Spatiotemporal Gaps between Where We Look and What We Focus on

Learning Spatiotemporal Gaps between Where We Look and What We Focus on Express Paper Learning Spatiotemporal Gaps between Where We Look and What We Focus on Ryo Yonetani 1,a) Hiroaki Kawashima 1,b) Takashi Matsuyama 1,c) Received: March 11, 2013, Accepted: April 24, 2013,

More information

Complementary Computing for Visual Tasks: Meshing Computer Vision with Human Visual Processing

Complementary Computing for Visual Tasks: Meshing Computer Vision with Human Visual Processing Complementary Computing for Visual Tasks: Meshing Computer Vision with Human Visual Processing Ashish Kapoor, Desney Tan, Pradeep Shenoy and Eric Horvitz Microsoft Research, Redmond, WA 98052, USA University

More information

Emotion Recognition using a Cauchy Naive Bayes Classifier

Emotion Recognition using a Cauchy Naive Bayes Classifier Emotion Recognition using a Cauchy Naive Bayes Classifier Abstract Recognizing human facial expression and emotion by computer is an interesting and challenging problem. In this paper we propose a method

More information

Group Behavior Analysis and Its Applications

Group Behavior Analysis and Its Applications Group Behavior Analysis and Its Applications CVPR 2015 Tutorial Lecturers: Hyun Soo Park (University of Pennsylvania) Wongun Choi (NEC America Laboratory) Schedule 08:30am-08:50am 08:50am-09:50am 09:50am-10:10am

More information

Automated Embryo Stage Classification in Time-Lapse Microscopy Video of Early Human Embryo Development

Automated Embryo Stage Classification in Time-Lapse Microscopy Video of Early Human Embryo Development Automated Embryo Stage Classification in Time-Lapse Microscopy Video of Early Human Embryo Development Yu Wang, Farshid Moussavi, and Peter Lorenzen Auxogyn, Inc. 1490 O Brien Drive, Suite A, Menlo Park,

More information

Firearm Detection in Social Media

Firearm Detection in Social Media Erik Valldor and David Gustafsson Swedish Defence Research Agency (FOI) C4ISR, Sensor Informatics Linköping SWEDEN erik.valldor@foi.se david.gustafsson@foi.se ABSTRACT Manual analysis of large data sets

More information

Year 4 Physical Education Scheme of Work

Year 4 Physical Education Scheme of Work Year New National Curriculum Pupils should continue to apply and develop a broader range of skills, learning how to use them in different ways and to link them to make actions and sequences of movement.

More information

arxiv: v1 [cs.cv] 18 May 2016

arxiv: v1 [cs.cv] 18 May 2016 BEYOND CAPTION TO NARRATIVE: VIDEO CAPTIONING WITH MULTIPLE SENTENCES Andrew Shin, Katsunori Ohnishi, and Tatsuya Harada Grad. School of Information Science and Technology, The University of Tokyo, Japan

More information

(SAT). d) inhibiting automatized responses.

(SAT). d) inhibiting automatized responses. Which of the following findings does NOT support the existence of task-specific mental resources? 1. a) It is more difficult to combine two verbal tasks than one verbal task and one spatial task. 2. b)

More information

CROWD ANALYTICS VIA ONE SHOT LEARNING AND AGENT BASED INFERENCE. Peter Tu, Ming-Ching Chang, Tao Gao. GE Global Research

CROWD ANALYTICS VIA ONE SHOT LEARNING AND AGENT BASED INFERENCE. Peter Tu, Ming-Ching Chang, Tao Gao. GE Global Research CROWD ANALYTICS VIA ONE SHOT LEARNING AND AGENT BASED INFERENCE Peter Tu, Ming-Ching Chang, Tao Gao GE Global Research ABSTRACT For the purposes of inferring social behavior in crowded conditions, three

More information

An Attentional Framework for 3D Object Discovery

An Attentional Framework for 3D Object Discovery An Attentional Framework for 3D Object Discovery Germán Martín García and Simone Frintrop Cognitive Vision Group Institute of Computer Science III University of Bonn, Germany Saliency Computation Saliency

More information

Automatic Visual Mimicry Expression Analysis in Interpersonal Interaction

Automatic Visual Mimicry Expression Analysis in Interpersonal Interaction Automatic Visual Mimicry Expression Analysis in Interpersonal Interaction Xiaofan Sun x.f.sun@utwente.nl Anton Nijholt a.nijholt@utwente.nl Khiet P. Truong k.p.truong@utwente.nl Maja Pantic Department

More information

Generic object recognition May 17 th, 2018

Generic object recognition May 17 th, 2018 Generic object recognition May 17 th, 2018 Yong Jae Lee UC Davis Visual words Map high dimensional descriptors to tokens/words by quantizing the feature space Quantize via clustering, let cluster centers

More information

FEATURE EXTRACTION USING GAZE OF PARTICIPANTS FOR CLASSIFYING GENDER OF PEDESTRIANS IN IMAGES

FEATURE EXTRACTION USING GAZE OF PARTICIPANTS FOR CLASSIFYING GENDER OF PEDESTRIANS IN IMAGES FEATURE EXTRACTION USING GAZE OF PARTICIPANTS FOR CLASSIFYING GENDER OF PEDESTRIANS IN IMAGES Riku Matsumoto, Hiroki Yoshimura, Masashi Nishiyama, and Yoshio Iwai Department of Information and Electronics,

More information

IDENTIFICATION OF REAL TIME HAND GESTURE USING SCALE INVARIANT FEATURE TRANSFORM

IDENTIFICATION OF REAL TIME HAND GESTURE USING SCALE INVARIANT FEATURE TRANSFORM Research Article Impact Factor: 0.621 ISSN: 2319507X INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK IDENTIFICATION OF REAL TIME

More information

LOCOMOTION MOVEMENT SKILLS

LOCOMOTION MOVEMENT SKILLS MEMORIAL OVAL PRIMARY SCHOOL - Physical Education Continuum SACSA Band / Standard SACSA Outcomes Physical Activity & Participation Year Level STABILITY MOVEMENTS SKILLS LOCOMOTION MOVEMENT SKILLS MANIPULATION

More information

EDGE DETECTION. Edge Detectors. ICS 280: Visual Perception

EDGE DETECTION. Edge Detectors. ICS 280: Visual Perception EDGE DETECTION Edge Detectors Slide 2 Convolution & Feature Detection Slide 3 Finds the slope First derivative Direction dependent Need many edge detectors for all orientation Second order derivatives

More information

Fire P.I.T. Benefits of Fitness

Fire P.I.T. Benefits of Fitness Fire P.I.T Benefits of Fitness Fire PIT: Benefits of Fitness Benefits of Fitness The word health is often associated only with physical fitness, but there are other components of health. FITNESS = READINESS.

More information

Marking Period 3-5 week course- Activity Selection. Marking Period 1- One Marking Period dedicated to Health

Marking Period 3-5 week course- Activity Selection. Marking Period 1- One Marking Period dedicated to Health Week Week Week Week DEPARTMENT Physical Education COURSE: Grades 9 to 12 Marking Period 1- One Marking Period dedicated to Health Marking Period 3-5 week course- Activity Selection 1 N/A 21 Ultimate Frisbee

More information

Physical Education Curriculum Mapping Worksheet Benchmark 3

Physical Education Curriculum Mapping Worksheet Benchmark 3 Demonstrate motor skill competency in a variety of physical activities and motor skill proficiency in one physical activity Demonstrate movement principles (mechanics, Content: Tennis Skills Learned: Serve,

More information

arxiv: v1 [cs.cv] 26 Jul 2016

arxiv: v1 [cs.cv] 26 Jul 2016 Noname manuscript No. (will be inserted by the editor) Salient Object Subitizing Jianming Zhang Shugao Ma Mehrnoosh Sameki Stan Sclaroff Margrit Betke Zhe Lin Xiaohui Shen Brian Price Radomír Měch arxiv:67.755v

More information

Correspondence between the DRDP (2015) and the California Preschool Learning Foundations. Foundations (PLF)

Correspondence between the DRDP (2015) and the California Preschool Learning Foundations. Foundations (PLF) 1 Desired Results Developmental Profile (2015) [DRDP (2015)] Correspondence to California Foundations: Physical Development Health (PD-HLTH) and the Overall, the Physical Development Health (PD-HLTH) domain

More information

Correspondence between the DRDP (2015) and the California Preschool Learning Foundations. Foundations (PLF)

Correspondence between the DRDP (2015) and the California Preschool Learning Foundations. Foundations (PLF) 1 Desired Results Developmental Profile (2015) [DRDP (2015)] Correspondence to California Foundations: Physical Development Health (PD-HLTH) and the Overall, the Physical Development Health (PD-HLTH) domain

More information