Action Recognition. Computer Vision Jia-Bin Huang, Virginia Tech. Many slides from D. Hoiem
|
|
- Amos James
- 5 years ago
- Views:
Transcription
1 Action Recognition Computer Vision Jia-Bin Huang, Virginia Tech Many slides from D. Hoiem
2 This section: advanced topics Convolutional neural networks in vision Action recognition Vision and Language 3D Scenes and Context
3 What is an action? Action: a transition from one state to another Who is the actor? How is the state of the actor changing? What (if anything) is being acted on? How is that thing changing? What is the purpose of the action (if any)?
4 How do we represent actions? Categories Walking, hammering, dancing, skiing, sitting down, standing up, jumping Poses Nouns and Predicates <man, swings, hammer> <man, hits, nail, w/ hammer>
5 What is the purpose of action recognition? To describe
6 What is the purpose of action recognition? To predict
7 What is the purpose of action recognition? To understand the intention and motivation Predicting Motivations of Actions by Leveraging Text, CVPR 2016
8 How can we identify actions? Motion Pose Held Objects Nearby Objects
9 Representing Motion Optical Flow with Motion History Bobick Davis 2001
10 Representing Motion Space-Time Volumes Blank et al. 2005
11 Representing Motion Optical Flow with Split Channels optical flow split into pos/neg channels blurred pos/neg flow Efros et al. 2003
12 Representing Motion Tracked Points Matikainen et al. 2009
13 Representing Motion Space-Time Interest Points Moving corner Ball hits wall Corner detectors in space-time Balls collide Balls collide (different scale) Laptev 2005
14 Representing Motion Space-Time Interest Points Laptev 2005
15 Examples of Action Recognition Systems Feature-based classification Recognition using pose and objects
16 Action recognition as classification Retrieving actions in movies, Laptev and Perez, 2007
17 Remember image categorization Training Images Training Image Features Training Labels Classifier Training Trained Classifier Test Image Image Features Testing Trained Classifier Prediction Outdoor
18 Remember spatial pyramids. Compute histogram in each spatial bin
19 Features for Classifying Actions 1. Spatio-temporal pyramids Image Gradients Optical Flow
20 Features for Classifying Actions 2. Spatio-temporal interest points Corner detectors in space-time Descriptors based on Gaussian derivative filters over x, y, time
21 Classification Boosted stubs for pyramids of optical flow, gradient Nearest neighbor for STIP
22 Searching the video for an action 1. Detect keyframes using a trained HOG detector in each frame 2. Classify detected keyframes as positive (e.g., drinking ) or negative ( other )
23 Accuracy in searching video With keyframe detection Without keyframe detection
24 Talk on phone Get out of car Learning realistic human actions from movies, Laptev et al. 2008
25 Approach Space-time interest point detectors Descriptors HOG, HOF Pyramid histograms (3x3x2) SVMs with Chi-Squared Kernel Interest Points Spatio-Temporal Binning
26 Results
27 Action Recognition using Pose and Objects Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities, B. Yao and Li Fei-Fei, 2010 Slide Credit: Yao/Fei-Fei
28 Human-Object Interaction Holistic image based classification Integrated reasoning Human pose estimation Head Torso Slide Credit: Yao/Fei-Fei
29 Human-Object Interaction Holistic image based classification Integrated reasoning Human pose estimation Object detection Tennis racket Slide Credit: Yao/Fei-Fei
30 Human-Object Interaction Holistic image based classification Integrated reasoning Human pose estimation Object detection Action categorization Tennis racket Head Torso Activity: Tennis Forehand Slide Credit: Yao/Fei-Fei
31 Human pose estimation & Object detection Human pose estimation is challenging. Difficult part appearance Self-occlusion Image region looks like a body part Felzenszwalb & Huttenlocher, 2005 Ren et al, 2005 Ramanan, 2006 Ferrari et al, 2008 Yang & Mori, 2008 Andriluka et al, 2009 Eichner & Ferrari, 2009 Slide Credit: Yao/Fei-Fei
32 Human pose estimation & Object detection Human pose estimation is challenging. Felzenszwalb & Huttenlocher, 2005 Ren et al, 2005 Ramanan, 2006 Ferrari et al, 2008 Yang & Mori, 2008 Andriluka et al, 2009 Eichner & Ferrari, 2009 Slide Credit: Yao/Fei-Fei
33 Human pose estimation & Object detection Facilitate Given the object is detected. Slide Credit: Yao/Fei-Fei
34 Human pose estimation & Object detection Small, low-resolution, partially occluded Object detection is challenging Image region similar to detection target Viola & Jones, 2001 Lampert et al, 2008 Divvala et al, 2009 Vedaldi et al, 2009 Slide Credit: Yao/Fei-Fei
35 Human pose estimation & Object detection Object detection is challenging Viola & Jones, 2001 Lampert et al, 2008 Divvala et al, 2009 Vedaldi et al, 2009 Slide Credit: Yao/Fei-Fei
36 Human pose estimation & Object detection Facilitate Given the pose is estimated. Slide Credit: Yao/Fei-Fei
37 Human pose estimation & Object detection Mutual Context Slide Credit: Yao/Fei-Fei
38 Mutual Context Model Representation A: Activity A O: Tennis forehand Croquet shot Volleyball smash Object Human pose H H: Tennis racket Croquet mallet Volleyball O P 1 P 2 Body parts P N f O f1 f2 fn Intra-class variations More than one H for each A; Unobserved during training. Image evidence P: l P : location; θ P : orientation; s P : scale. f: Shape context. [Belongie et al, 2002] Slide Credit: Yao/Fei-Fei
39 Learning Results Cricket defensive shot Cricket bowling Croquet shot Slide Credit: Yao/Fei-Fei
40 Learning Results Tennis forehand Tennis serve Volleyball smash Slide Credit: Yao/Fei-Fei
41 Model Inference I The learned models Slide Credit: Yao/Fei-Fei
42 Model Inference I The learned models Head detection Compositional Inference Torso detection Layout of the object and body parts. * * A1, H1, O1, P1, n n [Chen et al, 2007] Tennis racket detection Slide Credit: Yao/Fei-Fei
43 Model Inference I The learned models Output * * A1 H1 O1 P1, * * A, H, O, P,,,, n n K K K K n n Slide Credit: Yao/Fei-Fei
44 Dataset and Experiment Setup Sport data set: 6 classes 180 training (supervised with object and part locations) & 120 testing images Tasks: Object detection; Pose estimation; Cricket defensive shot Cricket bowling Croquet shot Activity classification. Tennis forehand Tennis serve Volleyball smash [Gupta et al, 2009] Slide Credit: Yao/Fei-Fei
45 Dataset and Experiment Setup Sport data set: 6 classes 180 training (supervised with object and part locations) & 120 testing images Tasks: Object detection; Pose estimation; Cricket defensive shot Cricket bowling Croquet shot Activity classification. Tennis forehand Tennis serve Volleyball smash [Gupta et al, 2009] Slide Credit: Yao/Fei-Fei
46 Object Detection Results Valid region Cricket bat Cricket ball Sliding window [Andriluka et al, 2009] Pedestrian context [Dalal & Triggs, 2006] Our Method Precision Recall Croquet mallet Tennis racket Volleyball Precision Recall Slide Credit: Yao/Fei-Fei
47 Precision Background clutter Small object Object Detection Results 1 Our Method 1 Our 1Method 0.8 Pedestrian as context Scanning 0.8 window detector Pedestrian as context Our Method Sliding window Pedestrian context Scanning Our window method detector 0.8 Pedestrian as context Scanning window detector Recall Recall Recall Precision Precision Precision Recall Cricket ball Volleyball Precision Recall 59 Slide Credit: Yao/Fei-Fei
48 Dataset and Experiment Setup Sport data set: 6 classes 180 training & 120 testing images Tasks: Object detection; Pose estimation; Cricket defensive shot Cricket bowling Croquet shot Activity classification. Tennis forehand Tennis serve Volleyball smash [Gupta et al, 2009] Slide Credit: Yao/Fei-Fei
49 Human Pose Estimation Results Method Torso Upper Leg Lower Leg Upper Arm Lower Arm Head Ramanan, 2006 Andriluka et al, 2009 Our full model Slide Credit: Yao/Fei-Fei
50 Human Pose Estimation Results Method Torso Upper Leg Lower Leg Upper Arm Lower Arm Head Ramanan, 2006 Andriluka et al, 2009 Our full model Tennis serve model Our estimation result Andriluka et al, 2009 Volleyball smash model Our estimation Andriluka result et al, 2009 Slide Credit: Yao/Fei-Fei
51 Human Pose Estimation Results Method Torso Upper Leg Lower Leg Upper Arm Lower Arm Head Ramanan, 2006 Andriluka et al, 2009 Our full model One pose per class Estimation result Estimation result Estimation result Estimation result Slide Credit: Yao/Fei-Fei
52 Dataset and Experiment Setup Sport data set: 6 classes 180 training & 120 testing images Tasks: Object detection; Pose estimation; Cricket defensive shot Cricket bowling Croquet shot Activity classification. Tennis forehand Tennis serve Volleyball smash [Gupta et al, 2009] Slide Credit: Yao/Fei-Fei
53 Activity Classification Results Classification accuracy % 78.9% 52.5% Cricket shot Tennis forehand 0.5 Our model Our model Gupta et al, 2009 Gupta et al, 2009 Bag-of- Words Bag-of-words SIFT+SVM Slide Credit: Yao/Fei-Fei
54 Motion features Dense Trajectory Action Recognition by Dense Trajectories, CVPR 2011 Action Recognition with Improved Trajectories, ICCV 2013
55 Video classification with CNNs Large-scale Video Classification with Convolutional Neural Networks, CVPR 2014
56 Video classification with CNNs Large-scale Video Classification with Convolutional Neural Networks, CVPR 2014
57 Two-stream CNN Two-Stream Convolutional Networks for Action Recognition in Videos, NIPS 2014
58 3D Convolutional Networks Learning Spatiotemporal Features with 3D Convolutional Networks, ICCV 2015
59 Action recognition -> Semantic role Labeling
60
61 Take-home messages Action recognition is an open problem. How to define actions? How to infer them? What are good visual cues? How do we incorporate higher level reasoning?
62 Take-home messages Some work done, but it is just the beginning of exploring the problem. So far Actions are mainly categorical (could be framed in terms of effect or intent) Most approaches are classification using simple features (spatial-temporal histograms of gradients or flow, s-t interest points, SIFT in images) Just a couple works on how to incorporate pose and objects Not much idea of how to reason about long-term activities or to describe video sequences
Recognising Human-Object Interaction via Exemplar based Modelling
2013 IEEE International Conference on Computer Vision Recognising Human-Object Interaction via Exemplar based Modelling Jian-Fang Hu, Wei-Shi Zheng, Jianhuang Lai, Shaogang Gong, and Tao Xiang School of
More informationActions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition
Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition Stefan Mathe, Cristian Sminchisescu Presented by Mit Shah Motivation Current Computer Vision Annotations subjectively
More informationHierarchical Convolutional Features for Visual Tracking
Hierarchical Convolutional Features for Visual Tracking Chao Ma Jia-Bin Huang Xiaokang Yang Ming-Husan Yang SJTU UIUC SJTU UC Merced ICCV 2015 Background Given the initial state (position and scale), estimate
More informationBeyond R-CNN detection: Learning to Merge Contextual Attribute
Brain Unleashing Series - Beyond R-CNN detection: Learning to Merge Contextual Attribute Shu Kong CS, ICS, UCI 2015-1-29 Outline 1. RCNN is essentially doing classification, without considering contextual
More informationVideo Saliency Detection via Dynamic Consistent Spatio- Temporal Attention Modelling
AAAI -13 July 16, 2013 Video Saliency Detection via Dynamic Consistent Spatio- Temporal Attention Modelling Sheng-hua ZHONG 1, Yan LIU 1, Feifei REN 1,2, Jinghuan ZHANG 2, Tongwei REN 3 1 Department of
More informationExemplar-based Action Recognition in Video
WILLEMS et al.: EXEMPLAR-BASED ACTION RECOGNITION IN VIDEO 1 Exemplar-based Action Recognition in Video Geert Willems 1 homes.esat.kuleuven.be/~gwillems Jan Hendrik Becker 1 homes.esat.kuleuven.be/~jhbecker
More informationPart 1: Bag-of-words models. by Li Fei-Fei (Princeton)
Part 1: Bag-of-words models by Li Fei-Fei (Princeton) Object Bag of words Analogy to documents Of all the sensory impressions proceeding to the brain, the visual experiences are the dominant ones. Our
More informationClassroom Data Collection and Analysis using Computer Vision
Classroom Data Collection and Analysis using Computer Vision Jiang Han Department of Electrical Engineering Stanford University Abstract This project aims to extract different information like faces, gender
More informationLecture 11 Visual recognition
Lecture 11 Visual recognition Descriptors (wrapping up) An introduction to recognition Image classification Silvio Savarese Lecture 11-12-Feb-14 The big picture Feature Detection e.g. DoG Feature Description
More informationPutting Context into. Vision. September 15, Derek Hoiem
Putting Context into Vision Derek Hoiem September 15, 2004 Questions to Answer What is context? How is context used in human vision? How is context currently used in computer vision? Conclusions Context
More informationMedical Image Analysis
Medical Image Analysis 1 Co-trained convolutional neural networks for automated detection of prostate cancer in multiparametric MRI, 2017, Medical Image Analysis 2 Graph-based prostate extraction in t2-weighted
More informationMEMORABILITY OF NATURAL SCENES: THE ROLE OF ATTENTION
MEMORABILITY OF NATURAL SCENES: THE ROLE OF ATTENTION Matei Mancas University of Mons - UMONS, Belgium NumediArt Institute, 31, Bd. Dolez, Mons matei.mancas@umons.ac.be Olivier Le Meur University of Rennes
More informationRich feature hierarchies for accurate object detection and semantic segmentation
Rich feature hierarchies for accurate object detection and semantic segmentation Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik UC Berkeley Tech Report @ http://arxiv.org/abs/1311.2524! Detection
More informationLearning to Generate Long-term Future via Hierarchical Prediction. A. Motion-Based Pixel-Level Evaluation, Analysis, and Control Experiments
Appendix A. Motion-Based Pixel-Level Evaluation, Analysis, and Control Experiments In this section, we evaluate the predictions by deciles of motion similar to Villegas et al. (2017) using Peak Signal-to-Noise
More informationActive Deformable Part Models Inference
Active Deformable Part Models Inference Menglong Zhu Nikolay Atanasov George J. Pappas Kostas Daniilidis GRASP Laboratory, University of Pennsylvania 3330 Walnut Street, Philadelphia, PA 19104, USA Abstract.
More informationEstimating scene typicality from human ratings and image features
Estimating scene typicality from human ratings and image features Krista A. Ehinger (kehinger@mit.edu) Department of Brain & Cognitive Sciences, MIT, 77 Massachusetts Ave. Jianxiong Xiao (jxiao@csail.mit.edu)
More informationLong Term Activity Analysis in Surveillance Video Archives. Ming-yu Chen. June 8, 2010
Long Term Activity Analysis in Surveillance Video Archives Ming-yu Chen Language Technologies Institute School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 Thesis Committee: Alexander
More informationDEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS
SEOUL Oct.7, 2016 DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS Gunhee Kim Computer Science and Engineering Seoul National University October
More informationEXEMPLAR-BASED HUMAN INTERACTION RECOGNITION: FEATURES AND KEY POSE SEQUENCE MODEL
EXEMPLAR-BASED HUMAN INTERACTION RECOGNITION: FEATURES AND KEY POSE SEQUENCE MODEL by Bo Gao B.Eng., Xi an Jiaotong University, China, 2009 a Thesis submitted in partial fulfillment of the requirements
More informationCAN A SMILE REVEAL YOUR GENDER?
CAN A SMILE REVEAL YOUR GENDER? Antitza Dantcheva, Piotr Bilinski, Francois Bremond INRIA Sophia Antipolis, France JOURNEE de la BIOMETRIE 2017, Caen, 07/07/17 OUTLINE 1. Why Gender Estimation? 2. Related
More informationBLACK PEAR TRUST SUBJECT PLAN - PE
Purpose of Study A high-quality physical education curriculum inspires all pupils to succeed and excel in competitive sport and other physically-demanding activities. It should provide opportunities for
More informationCONSTRAINED SEMI-SUPERVISED LEARNING ATTRIBUTES USING ATTRIBUTES AND COMPARATIVE
CONSTRAINED SEMI-SUPERVISED LEARNING USING ATTRIBUTES AND COMPARATIVE ATTRIBUTES Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta The Robotics Institute Carnegie Mellon University SUPERVISION BIG-DATA
More informationComputational modeling of visual attention and saliency in the Smart Playroom
Computational modeling of visual attention and saliency in the Smart Playroom Andrew Jones Department of Computer Science, Brown University Abstract The two canonical modes of human visual attention bottomup
More informationKS4 Physical Education
KS4 Physical Education Component of Fitness These icons indicate that teacher s notes or useful web addresses are available in the Notes Page. This icon indicates that the slide contains activities created
More informationPhysical Education Department COURSE Outline:
Course Title: Teacher(s) + e-mail: Cycle/Division: Grade Level: Physical Education Rami: Rami.r.a@greenwood.sch.ae ; Abeer: abeer@greenwood.sch.ae Middle School 7 A,B,E and F Credit Unit: 0.5 Duration:
More informationBiologically-Inspired Human Motion Detection
Biologically-Inspired Human Motion Detection Vijay Laxmi, J. N. Carter and R. I. Damper Image, Speech and Intelligent Systems (ISIS) Research Group Department of Electronics and Computer Science University
More informationObject recognition and hierarchical computation
Object recognition and hierarchical computation Challenges in object recognition. Fukushima s Neocognitron View-based representations of objects Poggio s HMAX Forward and Feedback in visual hierarchy Hierarchical
More informationVision: Over Ov view Alan Yuille
Vision: Overview Alan Yuille Why is Vision Hard? Complexity and Ambiguity of Images. Range of Vision Tasks. More 10x10 images 256^100 = 6.7 x 10 ^240 than the total number of images seen by all humans
More informationPE Scheme of Work Overview
PE Scheme of Work Overview Areas of PE curriculum Games inc. invasion, striking and fielding, net and wall Dance Gymnastics Athletics Outdoors and adventure inc. residential Swimming planned and monitored
More informationdifferent dance form from among folk, social,
Standard 1: The physically literate individual demonstrates proficiency in a variety of motor skills and movement patterns. S1.M1 Demonstrates correct rhythm and pattern for a different dance form from
More informationCrowd behavior representation: an attribute based approach
DOI 10.1186/s40064-016-2786-0 RESEARCH Open Access Crowd behavior representation: an attribute based approach Hamidreza Rabiee 1, Javad Haddadnia 2* and Hossein Mousavi 3 *Correspondence: haddadnia@hsu.ac.ir
More informationABSTRACT I. INTRODUCTION
2018 IJSRSET Volume 4 Issue 2 Print ISSN: 2395-1990 Online ISSN : 2394-4099 National Conference on Advanced Research Trends in Information and Computing Technologies (NCARTICT-2018), Department of IT,
More informationHuman Activities: Handling Uncertainties Using Fuzzy Time Intervals
The 19th International Conference on Pattern Recognition (ICPR), Tampa, FL, 2009 Human Activities: Handling Uncertainties Using Fuzzy Time Intervals M. S. Ryoo 1,2 and J. K. Aggarwal 1 1 Computer & Vision
More informationShu Kong. Department of Computer Science, UC Irvine
Ubiquitous Fine-Grained Computer Vision Shu Kong Department of Computer Science, UC Irvine Outline 1. Problem definition 2. Instantiation 3. Challenge and philosophy 4. Fine-grained classification with
More informationAutomatic Classification of Perceived Gender from Facial Images
Automatic Classification of Perceived Gender from Facial Images Joseph Lemley, Sami Abdul-Wahid, Dipayan Banik Advisor: Dr. Razvan Andonie SOURCE 2016 Outline 1 Introduction 2 Faces - Background 3 Faces
More informationCompeting Frameworks in Perception
Competing Frameworks in Perception Lesson II: Perception module 08 Perception.08. 1 Views on perception Perception as a cascade of information processing stages From sensation to percept Template vs. feature
More informationCompeting Frameworks in Perception
Competing Frameworks in Perception Lesson II: Perception module 08 Perception.08. 1 Views on perception Perception as a cascade of information processing stages From sensation to percept Template vs. feature
More informationDynamic Eye Movement Datasets and Learnt Saliency Models for Visual Action Recognition
Dynamic Eye Movement Datasets and Learnt Saliency Models for Visual Action Recognition Stefan Mathe 1,3 and Cristian Sminchisescu 2,1 1 Institute of Mathematics of the Romanian Academy (IMAR) 2 Faculty
More informationBottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering SUPPLEMENTARY MATERIALS 1. Implementation Details 1.1. Bottom-Up Attention Model Our bottom-up attention Faster R-CNN
More informationCS4495 Computer Vision Introduction to Recognition. Aaron Bobick School of Interactive Computing
CS4495 Computer Vision Introduction to Recognition Aaron Bobick School of Interactive Computing What does recognition involve? Source: Fei Fei Li, Rob Fergus, Antonio Torralba. Verification: is that
More informationHierarchical Local Binary Pattern for Branch Retinal Vein Occlusion Recognition
Hierarchical Local Binary Pattern for Branch Retinal Vein Occlusion Recognition Zenghai Chen 1, Hui Zhang 2, Zheru Chi 1, and Hong Fu 1,3 1 Department of Electronic and Information Engineering, The Hong
More information(Visual) Attention. October 3, PSY Visual Attention 1
(Visual) Attention Perception and awareness of a visual object seems to involve attending to the object. Do we have to attend to an object to perceive it? Some tasks seem to proceed with little or no attention
More informationObject Detectors Emerge in Deep Scene CNNs
Object Detectors Emerge in Deep Scene CNNs Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba Presented By: Collin McCarthy Goal: Understand how objects are represented in CNNs Are
More informationReading Between The Lines: Object Localization Using Implicit Cues from Image Tags
Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman Department of Computer Science University of Texas at Austin {sjhwang,grauman}@cs.utexas.edu
More informationFine-Grained Image Classification Using Color Exemplar Classifiers
Fine-Grained Image Classification Using Color Exemplar Classifiers Chunjie Zhang 1, Wei Xiong 1, Jing Liu 2, Yifan Zhang 2, Chao Liang 3, and Qingming Huang 1,4 1 School of Computer and Control Engineering,
More informationRECENT progress in computer visual recognition, in particular
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, REVISED, AUGUST 2014. 1 Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition Stefan Mathe, Member,
More informationGeneva CUSD 304 Content-Area Curriculum Frameworks Grades 6-12 Physical Education & Health
Geneva CUSD 304 Content-Area Curriculum Frameworks Grades 6-12 Physical Education & Health Mission Statement Course Sequence (Grades 6-12) Physical Education and Health assists students of all abilities
More informationNature Neuroscience: doi: /nn Supplementary Figure 1. Overlap between default mode network (DMN) and movie/recall maps.
Supplementary Figure 1 Overlap between default mode network (DMN) and movie/recall maps. We defined the DMN for each individual using the posterior medial cortex ROI as a seed for functional connectivity
More informationSupplemental Materials for Paper 113
Supplemental Materials for Paper 113 August 19, 2012 In this document, we include the following items: 1. Concept definition: the full list of our 101 concepts including actions, scenes, and objects. 2.
More informationChange Blindness. The greater the lie, the greater the chance that it will be believed.
Change Blindness The greater the lie, the greater the chance that it will be believed. (kurt@kloover.com) Department of Computer Science Rochester Institute of Technology 1 Definitions Seeing: the use
More informationOn Laban Movement Analysis and The Eight Efforts
Grimes Somerset Theatre Factory On Laban Movement Analysis and The Eight Efforts What Is It? Laban technique was created by a Hungarian man named Rudolph Laban, who was interested in studying the language
More informationPhysical Education Scope and Sequence: Grades
The Scope and Sequence document represents an articulation of what students should know and be able to do. The document supports teachers in knowing how to help students achieve the goals of the standards
More informationRECENT progress in computer visual recognition, in particular
1 Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition Stefan Mathe, Member, IEEE, Cristian Sminchisescu, Member, IEEE arxiv:1312.7570v1 [cs.cv] 29 Dec 2013 Abstract
More informationMulti-attention Guided Activation Propagation in CNNs
Multi-attention Guided Activation Propagation in CNNs Xiangteng He and Yuxin Peng (B) Institute of Computer Science and Technology, Peking University, Beijing, China pengyuxin@pku.edu.cn Abstract. CNNs
More informationDescribing Clothing by Semantic Attributes
Describing Clothing by Semantic Attributes Huizhong Chen 1, Andrew Gallagher 2,3, and Bernd Girod 1 Department of Electrical Engineering, Stanford University, Stanford, California 1 Kodak Research Laboratories,
More informationFine-Grained Image Classification Using Color Exemplar Classifiers
Fine-Grained Image Classification Using Color Exemplar Classifiers Chunjie Zhang 1, Wei Xiong 1, Jing Liu 2, Yifan Zhang 2, Chao Liang 3, Qingming Huang 1, 4 1 School of Computer and Control Engineering,
More informationPredicting the location of interactees in novel human-object interactions
In Proceedings of the Asian Conference on Computer Vision (ACCV), 2014. Predicting the location of interactees in novel human-object interactions Chao-Yeh Chen and Kristen Grauman University of Texas at
More informationUsing the Soundtrack to Classify Videos
Using the Soundtrack to Classify Videos Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA dpwe@ee.columbia.edu http://labrosa.ee.columbia.edu/
More informationExpression Recognition for Severely Demented Patients in Music Reminiscence-Therapy
Expression Recognition for Severely Demented Patients in Music Reminiscence-Therapy Antitza Dantcheva 1 *, Piotr Bilinski 2 *, Hung Thanh Nguyen 1, Jean-Claude Broutart 3, Francois Bremond 1 1 Inria, Sophia
More informationPhysical Education Curriculum Map Pool/Canoeing Grade: 7
Pool/Canoeing Grade: 7 Participation in physical activity impacts wellness throughout a lifetime. Quality lifelong movement is based on scientific concepts/principles. How can physical activity choices
More informationSVM-based Discriminative Accumulation Scheme for Place Recognition
SVM-based Discriminative Accumulation Scheme for Place Recognition Andrzej Pronobis CAS/CVAP, KTH Stockholm, Sweden pronobis@csc.kth.se Óscar Martínez Mozos AIS, University Of Freiburg Freiburg, Germany
More informationDecoding the Representation of Code in the Brain: An fmri Study of Code Review and Expertise
Decoding the Representation of Code in the Brain: An fmri Study of Code Review and Expertise Benjamin Floyd, Tyler Santander, University of Virginia University of Michigan University of Michigan Looking
More informationMultimodal interactions: visual-auditory
1 Multimodal interactions: visual-auditory Imagine that you are watching a game of tennis on television and someone accidentally mutes the sound. You will probably notice that following the game becomes
More informationDesigning Caption Production Rules Based on Face, Text and Motion Detections
Designing Caption Production Rules Based on Face, Text and Motion Detections C. Chapdelaine *, M. Beaulieu, L. Gagnon R&D Department, Computer Research Institute of Montreal (CRIM), 550 Sherbrooke West,
More informationarxiv: v1 [cs.cv] 16 Jun 2012
How important are Deformable Parts in the Deformable Parts Model? Santosh K. Divvala, Alexei A. Efros, Martial Hebert Carnegie Mellon University arxiv:1206.3714v1 [cs.cv] 16 Jun 2012 Abstract. The main
More informationRolling Programme of Outcomes and Themes. PE- Foundation
Rolling Programme of Outcomes and Themes PE- Foundation Year A 2015/16 Autumn Spring Summer Theme Outcomes Theme Outcomes Theme Outcomes Split into 2 half-terms only for group 1 Multi skills Circuits ABC
More informationImage Captioning using Reinforcement Learning. Presentation by: Samarth Gupta
Image Captioning using Reinforcement Learning Presentation by: Samarth Gupta 1 Introduction Summary Supervised Models Image captioning as RL problem Actor Critic Architecture Policy Gradient architecture
More informationCenter Cass School District 66
Center Cass School District 66 C URRICULUM GUIDE Physical Education Physical development programs offer students the opportunity to grow physically, emotionally, socially, and academically. This contributes
More informationPhysical Education Skills Progression. Eden Park Primary School Academy
Physical Education Skills Progression Eden Park Primary School Academy In order to ensure broad and balanced coverage, we follow these principles: Within each phase, PE is taught twice weekly for a total
More informationShu Kong. Department of Computer Science, UC Irvine
Ubiquitous Fine-Grained Computer Vision Shu Kong Department of Computer Science, UC Irvine Outline 1. Problem definition 2. Instantiation 3. Challenge 4. Fine-grained classification with holistic representation
More informationITERATIVELY TRAINING CLASSIFIERS FOR CIRCULATING TUMOR CELL DETECTION
ITERATIVELY TRAINING CLASSIFIERS FOR CIRCULATING TUMOR CELL DETECTION Yunxiang Mao 1, Zhaozheng Yin 1, Joseph M. Schober 2 1 Missouri University of Science and Technology 2 Southern Illinois University
More informationSocial Group Discovery from Surveillance Videos: A Data-Driven Approach with Attention-Based Cues
CHAMVEHA ET AL.: SOCIAL GROUP DISCOVERY FROM SURVEILLANCE VIDEOS 1 Social Group Discovery from Surveillance Videos: A Data-Driven Approach with Attention-Based Cues Isarun Chamveha 1 isarunc@iis.u-tokyo.ac.jp
More informationEfficient Deep Model Selection
Efficient Deep Model Selection Jose Alvarez Researcher Data61, CSIRO, Australia GTC, May 9 th 2017 www.josemalvarez.net conv1 conv2 conv3 conv4 conv5 conv6 conv7 conv8 softmax prediction???????? Num Classes
More informationConvolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN) Algorithm and Some Applications in Computer Vision Luo Hengliang Institute of Automation June 10, 2014 Luo Hengliang (Institute of Automation) Convolutional Neural Networks
More informationUsing Deep Convolutional Networks for Gesture Recognition in American Sign Language
Using Deep Convolutional Networks for Gesture Recognition in American Sign Language Abstract In the realm of multimodal communication, sign language is, and continues to be, one of the most understudied
More informationVENUS: A System for Novelty Detection in Video Streams with Learning
VENUS: A System for Novelty Detection in Video Streams with Learning Roger S. Gaborski, Vishal S. Vaingankar, Vineet S. Chaoji, Ankur M. Teredesai Laboratory for Applied Computing, Rochester Institute
More informationLearning Spatiotemporal Gaps between Where We Look and What We Focus on
Express Paper Learning Spatiotemporal Gaps between Where We Look and What We Focus on Ryo Yonetani 1,a) Hiroaki Kawashima 1,b) Takashi Matsuyama 1,c) Received: March 11, 2013, Accepted: April 24, 2013,
More informationComplementary Computing for Visual Tasks: Meshing Computer Vision with Human Visual Processing
Complementary Computing for Visual Tasks: Meshing Computer Vision with Human Visual Processing Ashish Kapoor, Desney Tan, Pradeep Shenoy and Eric Horvitz Microsoft Research, Redmond, WA 98052, USA University
More informationEmotion Recognition using a Cauchy Naive Bayes Classifier
Emotion Recognition using a Cauchy Naive Bayes Classifier Abstract Recognizing human facial expression and emotion by computer is an interesting and challenging problem. In this paper we propose a method
More informationGroup Behavior Analysis and Its Applications
Group Behavior Analysis and Its Applications CVPR 2015 Tutorial Lecturers: Hyun Soo Park (University of Pennsylvania) Wongun Choi (NEC America Laboratory) Schedule 08:30am-08:50am 08:50am-09:50am 09:50am-10:10am
More informationAutomated Embryo Stage Classification in Time-Lapse Microscopy Video of Early Human Embryo Development
Automated Embryo Stage Classification in Time-Lapse Microscopy Video of Early Human Embryo Development Yu Wang, Farshid Moussavi, and Peter Lorenzen Auxogyn, Inc. 1490 O Brien Drive, Suite A, Menlo Park,
More informationFirearm Detection in Social Media
Erik Valldor and David Gustafsson Swedish Defence Research Agency (FOI) C4ISR, Sensor Informatics Linköping SWEDEN erik.valldor@foi.se david.gustafsson@foi.se ABSTRACT Manual analysis of large data sets
More informationYear 4 Physical Education Scheme of Work
Year New National Curriculum Pupils should continue to apply and develop a broader range of skills, learning how to use them in different ways and to link them to make actions and sequences of movement.
More informationarxiv: v1 [cs.cv] 18 May 2016
BEYOND CAPTION TO NARRATIVE: VIDEO CAPTIONING WITH MULTIPLE SENTENCES Andrew Shin, Katsunori Ohnishi, and Tatsuya Harada Grad. School of Information Science and Technology, The University of Tokyo, Japan
More information(SAT). d) inhibiting automatized responses.
Which of the following findings does NOT support the existence of task-specific mental resources? 1. a) It is more difficult to combine two verbal tasks than one verbal task and one spatial task. 2. b)
More informationCROWD ANALYTICS VIA ONE SHOT LEARNING AND AGENT BASED INFERENCE. Peter Tu, Ming-Ching Chang, Tao Gao. GE Global Research
CROWD ANALYTICS VIA ONE SHOT LEARNING AND AGENT BASED INFERENCE Peter Tu, Ming-Ching Chang, Tao Gao GE Global Research ABSTRACT For the purposes of inferring social behavior in crowded conditions, three
More informationAn Attentional Framework for 3D Object Discovery
An Attentional Framework for 3D Object Discovery Germán Martín García and Simone Frintrop Cognitive Vision Group Institute of Computer Science III University of Bonn, Germany Saliency Computation Saliency
More informationAutomatic Visual Mimicry Expression Analysis in Interpersonal Interaction
Automatic Visual Mimicry Expression Analysis in Interpersonal Interaction Xiaofan Sun x.f.sun@utwente.nl Anton Nijholt a.nijholt@utwente.nl Khiet P. Truong k.p.truong@utwente.nl Maja Pantic Department
More informationGeneric object recognition May 17 th, 2018
Generic object recognition May 17 th, 2018 Yong Jae Lee UC Davis Visual words Map high dimensional descriptors to tokens/words by quantizing the feature space Quantize via clustering, let cluster centers
More informationFEATURE EXTRACTION USING GAZE OF PARTICIPANTS FOR CLASSIFYING GENDER OF PEDESTRIANS IN IMAGES
FEATURE EXTRACTION USING GAZE OF PARTICIPANTS FOR CLASSIFYING GENDER OF PEDESTRIANS IN IMAGES Riku Matsumoto, Hiroki Yoshimura, Masashi Nishiyama, and Yoshio Iwai Department of Information and Electronics,
More informationIDENTIFICATION OF REAL TIME HAND GESTURE USING SCALE INVARIANT FEATURE TRANSFORM
Research Article Impact Factor: 0.621 ISSN: 2319507X INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK IDENTIFICATION OF REAL TIME
More informationLOCOMOTION MOVEMENT SKILLS
MEMORIAL OVAL PRIMARY SCHOOL - Physical Education Continuum SACSA Band / Standard SACSA Outcomes Physical Activity & Participation Year Level STABILITY MOVEMENTS SKILLS LOCOMOTION MOVEMENT SKILLS MANIPULATION
More informationEDGE DETECTION. Edge Detectors. ICS 280: Visual Perception
EDGE DETECTION Edge Detectors Slide 2 Convolution & Feature Detection Slide 3 Finds the slope First derivative Direction dependent Need many edge detectors for all orientation Second order derivatives
More informationFire P.I.T. Benefits of Fitness
Fire P.I.T Benefits of Fitness Fire PIT: Benefits of Fitness Benefits of Fitness The word health is often associated only with physical fitness, but there are other components of health. FITNESS = READINESS.
More informationMarking Period 3-5 week course- Activity Selection. Marking Period 1- One Marking Period dedicated to Health
Week Week Week Week DEPARTMENT Physical Education COURSE: Grades 9 to 12 Marking Period 1- One Marking Period dedicated to Health Marking Period 3-5 week course- Activity Selection 1 N/A 21 Ultimate Frisbee
More informationPhysical Education Curriculum Mapping Worksheet Benchmark 3
Demonstrate motor skill competency in a variety of physical activities and motor skill proficiency in one physical activity Demonstrate movement principles (mechanics, Content: Tennis Skills Learned: Serve,
More informationarxiv: v1 [cs.cv] 26 Jul 2016
Noname manuscript No. (will be inserted by the editor) Salient Object Subitizing Jianming Zhang Shugao Ma Mehrnoosh Sameki Stan Sclaroff Margrit Betke Zhe Lin Xiaohui Shen Brian Price Radomír Měch arxiv:67.755v
More informationCorrespondence between the DRDP (2015) and the California Preschool Learning Foundations. Foundations (PLF)
1 Desired Results Developmental Profile (2015) [DRDP (2015)] Correspondence to California Foundations: Physical Development Health (PD-HLTH) and the Overall, the Physical Development Health (PD-HLTH) domain
More informationCorrespondence between the DRDP (2015) and the California Preschool Learning Foundations. Foundations (PLF)
1 Desired Results Developmental Profile (2015) [DRDP (2015)] Correspondence to California Foundations: Physical Development Health (PD-HLTH) and the Overall, the Physical Development Health (PD-HLTH) domain
More information