Group Behavior Analysis and Its Applications

Similar documents
Hierarchical Convolutional Features for Visual Tracking

Video Saliency Detection via Dynamic Consistent Spatio- Temporal Attention Modelling

Predicting Primary Gaze Behavior using Social Saliency Fields

Predicting Primary Gaze Behavior using Social Saliency Fields

Attentional Masking for Pre-trained Deep Networks

Social Group Discovery from Surveillance Videos: A Data-Driven Approach with Attention-Based Cues

Neuro-Inspired Statistical. Rensselaer Polytechnic Institute National Science Foundation

Action Recognition. Computer Vision Jia-Bin Huang, Virginia Tech. Many slides from D. Hoiem

Computational modeling of visual attention and saliency in the Smart Playroom

Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition

DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS

An Attentional Framework for 3D Object Discovery

Human Activities: Handling Uncertainties Using Fuzzy Time Intervals

Medical Image Analysis

The University of Tokyo, NVAIL Partner Yoshitaka Ushiku

TURKISH SIGN LANGUAGE RECOGNITION USING HIDDEN MARKOV MODEL

Efficient Deep Model Selection

CONSTRAINED SEMI-SUPERVISED LEARNING ATTRIBUTES USING ATTRIBUTES AND COMPARATIVE

VIDEO SALIENCY INCORPORATING SPATIOTEMPORAL CUES AND UNCERTAINTY WEIGHTING

B657: Final Project Report Holistically-Nested Edge Detection

Active Deformable Part Models Inference

VIDEO SURVEILLANCE AND BIOMEDICAL IMAGING Research Activities and Technology Transfer at PAVIS

Unsupervised Learning of Micro-Action Exemplars using a Product Manifold

Predicting the location of interactees in novel human-object interactions

Learning Spatiotemporal Gaps between Where We Look and What We Focus on

AFOSR PI Meeting Dec 1-3, 2014 Program Director: Dr. Darema Dynamic Integration of Motion and Neural Data to Capture Human Behavior

Family Member Identification from Photo Collections

AUTOMATIC HUMAN BEHAVIOUR RECOGNITION AND EXPLANATION FOR CCTV VIDEO SURVEILLANCE

An Attention-based Activity Recognition for Egocentric Video

Joint Detection and Recounting of Abnormal Events by Learning Deep Generic Knowledge

Domain Generalization and Adaptation using Low Rank Exemplar Classifiers

Putting Context into. Vision. September 15, Derek Hoiem

Viewpoint Invariant Convolutional Networks for Identifying Risky Hand Hygiene Scenarios

Personal Space Augmented Reality Tool

Beyond R-CNN detection: Learning to Merge Contextual Attribute

American Speech-Language-Hearing Association Conference November 18-20, 2010, Philadelphia, PA

Finger spelling recognition using distinctive features of hand shape

STRATEGIC DIRECTIONS

An Experimental Analysis of Saliency Detection with respect to Three Saliency Levels

Theoretical Neuroscience: The Binding Problem Jan Scholz, , University of Osnabrück

Autism Spectrum Disorders: An update on research and clinical practices for SLPs

Computational Auditory Scene Analysis: An overview and some observations. CASA survey. Other approaches

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China

Computational Perception /785. Auditory Scene Analysis

The 29th Fuzzy System Symposium (Osaka, September 9-, 3) Color Feature Maps (BY, RG) Color Saliency Map Input Image (I) Linear Filtering and Gaussian

High-level Vision. Bernd Neumann Slides for the course in WS 2004/05. Faculty of Informatics Hamburg University Germany

A Multimodal Interface for Robot-Children Interaction in Autism Treatment

Validating the Visual Saliency Model

Travel Time-dependent Maximum Entropy Inverse Reinforcement Learning for Seabird Trajectory Prediction

Can Saliency Map Models Predict Human Egocentric Visual Attention?

Social Activity Measurement with Face Detection Using First-Person Video as a Lifelog

Shu Kong. Department of Computer Science, UC Irvine

AUTISM SPECTRUM DISORDER: DSM-5 DIAGNOSTIC CRITERIA. Lisa Joseph, Ph.D.

Saliency in Crowd. Ming Jiang, Juan Xu, and Qi Zhao

Eating and Drinking Recognition via Integrated Information of Head Directions and Joint Positions in a Group

Device Modeling as Prompting Strategy for Users of AAC Devices. Meher Banajee, Ph.D., CCC-SLP Nino Acuna, M.A. Hannah Deshotels, B.A.

AUTISM SPECTRUM DISORDERS

Saliency in Crowd. 1 Introduction. Ming Jiang, Juan Xu, and Qi Zhao

LSA64: An Argentinian Sign Language Dataset

Shu Kong. Department of Computer Science, UC Irvine

GIANT: Geo-Informative Attributes for Location Recognition and Exploration

DeepASL: Enabling Ubiquitous and Non-Intrusive Word and Sentence-Level Sign Language Translation

Automated Volumetric Cardiac Ultrasound Analysis

EEL 6586, Project - Hearing Aids algorithms

FEATURE EXTRACTION USING GAZE OF PARTICIPANTS FOR CLASSIFYING GENDER OF PEDESTRIANS IN IMAGES

An Evaluation of RGB-D Skeleton Tracking for Use in Large Vocabulary Complex Gesture Recognition

ASD Working Group Endpoints

Kinematic and Dynamic Adaptation of

TAGTEACH FOR SPECIAL NEEDS CHILDREN IN A GROUP SETTING. Kate Nicoll, LCSW Soul Friends, Inc.

Intensive Training. Early Childhood Intensive Training K-12 Intensive Training Building Your Future Intensive Training

Multimedia Augmented Reality With Picture Exchange Communication System for Autism Spectrum Disorder

Top 10 Tools to Build Your ASD Toolbox. Michelle Rigler EdD & Amy Rutherford M.Ed, LPC-MHSP University of Tennessee Chattanooga

Autism Spectrum Disorder What is it?

Words are not enough: Social Communication & Early Signs of Autism Spectrum Disorders

Prediction of Negative Symptoms of Schizophrenia from Facial Expressions and Speech Signals

Communication and ASD: Key Concepts for Educational Teams

HIWIN Thesis Award 2007

Kinect Based Edutainment System For Autistic Children

Discovering Important People and Objects for Egocentric Video Summarization

Expression Recognition. Mohammad Amanzadeh

Autism Spectrum Disorder What is it?

Supplementary Online Content

Motion Control for Social Behaviours

arxiv: v2 [cs.cv] 17 Apr 2015

Building a Home to Care for Your Clients: Part 2 COMMUNICATION TOOLBOX

Virtual Shapers & Movers: Form and Motion affect Sex Perception

Increasing Social Awareness in Adults with Autism Spectrum Disorders

References 1. Dunn, L. M., & Dunn, L. M. (2006). Peabody Picture Vocabulary Test, 4th edition. Minnesota: American Guidance Service. 2. Wilkinson, K.

Intensive Training. Early Childhood Intensive Training K-12 Intensive Training Building Your Future Intensive Training

Using Deep Convolutional Networks for Gesture Recognition in American Sign Language

The Autism Opportunity Designing for Community-Based Solutions

Neuromorphic convolutional recurrent neural network for road safety or safety near the road

Real Time Sign Language Processing System

CS221 / Autumn 2017 / Liang & Ermon. Lecture 19: Conclusion

Multi-attention Guided Activation Propagation in CNNs

The Nuts and Bolts of Diagnosing Autism Spectrum Disorders In Young Children. Overview

Biologically-Inspired Human Motion Detection

Local Image Structures and Optic Flow Estimation

Diagnosing Autism, and What Comes After. Natalie Roth, Ph. D. Clinical Psychologist, Alternative Behavior Strategies

Learned Region Sparsity and Diversity Also Predict Visual Attention

Transcription:

Group Behavior Analysis and Its Applications CVPR 2015 Tutorial Lecturers: Hyun Soo Park (University of Pennsylvania) Wongun Choi (NEC America Laboratory)

Schedule 08:30am-08:50am 08:50am-09:50am 09:50am-10:10am 10:10am-10:45am 10:45am-11:15am 11:15am-12:20pm 12:20pm-12:30pm Introduction Social statics Gaze/Head signals Body signals Invited talk Action Localization Ivan Laptev (INRIA) Coffee break Invited talk Capturing Subtle Social Behaviors in the Panoptic Studio Yaser Sheikh (CMU) Social dynamics Model based approach Data driven approach Summary and open problems

Species Hierarchy Proximity Synchronization Reciprocity Group Behavior

Species Hierarchy Proximity Synchronization Causality Mathematical Definition of Group Behaviors? Group Behavior

Granger Causality [Granger Econometrica69] A time series X could be considered to causally influence a time series Y if predictions of future values of Y based on the joint history of X and Y were more accurate than predictions based on Y alone. Y Y(t ˆ + t) Y(t + t) X Time Y(t + t) Y(t ˆ + t Y(0),,Y(t)) Y(t + t) Y(t ˆ + t Y(0),,Y(t),X(0),,X(t)) Prediction based on history of Y Prediction based on history of Y and X t t+ t

Mathematical Definition of Group Behaviors Behaviors of X and Y are a group behavior if spatial/temporal predictions of Y based on X and Y jointly are more accurate than predictions based on Y alone. X Y Y(t + t) Y(t ˆ + t Y(0),,Y(t)) Y(t + t) Y(t ˆ + t Y(0),,Y(t),X(0),,X(t))

Mathematical Definition of Group Behaviors Behaviors of X and Y are a group behavior if spatial/temporal predictions of Y based on X and Y jointly are more accurate than predictions based on Y alone. [Yang CVPR12] [Park ICCV13] [Alahi CVPR14]

Mathematical Definition of Group Behaviors Behaviors of X and Y are a group behavior if spatial/temporal predictions of Y based on X and Y jointly are more accurate than predictions based on Y alone. Not group behavior [Choi ECCV14]

Social Signals: A Realization of Group Behaviors

Nonverbal social signals 67% Vinciarelli et al., Social Signal Processing: Survey of an Emerging Domain, Image and Vision Computing, 2009. Hogan and Stubbs, Can t Get Through 8 Barriers to Communication, 2003.

Social Signal Processing noise Signals Transmitter Channel Receiver Signal reconstruction

Social Signal Processing noise Signals Transmitter Channel Receiver Signal reconstruction Social signals Visual observation Understanding

Social Signal Processing noise Signals Transmitter Channel Receiver Signal reconstruction Social signals Cameras Computational rep.

Challenges In Social Signal Processing

Challenge I: Micro Social Signals Social signals are too subtle to be detected by computer vision solutions, e.g., activity recognition frameworks. KTH Action Dataset

Challenge II: Signal Interdependency

Challenge III: Ambiguity JPL First-Person Interaction Dataset

Challenge IV: Scene Variability JPL First-Person Interaction Dataset

Broad Impact: Why Social Interactions?

Computational Behavioral Science Clinician Subject Autism Spectrum Disorder (ASD)

Multimodal Dyadic Behavior Dataset Gaze estimation http://www.cbi.gatech.edu/mmdb/

Collaborative Behavior Monitoring First Person Interaction Dataset [Park CVPR15]

http://gamma.cs.unc.edu/reach/crowdt/ Crowd Behavior Analysis

Education Assistance Collaboration Entertainment http://humanoid.itu.edu.tr/ Social Robotics

Social Space Design

Main Tasks

Social Interaction Detection / Joint Attention Localization Input: image and bounding boxes Output: groups with activity label Input: images from first person cameras Output: to localize joint attention in 3D Talking Talking

Space / Time Relationship Input: a video of social interactions Output: to find a causal relationship Input: image Output: proxemics label and skeletons Image Hand-Shoulder

Group Activity Detection/Recognition Input: video and box tracks Output: group activity labels in time Talking

Group Behavior Prediction / Anomaly Detection Input: a crowd video Output: individual tracking Input: pedestrian tracks Output: to detect abnormal behaviors

Social Role Discovery Input: videos with event labels, box tracks Output: groups with activity label Wedding Wedding Σ α Σ β α β Role1 Role2 Role3 Role1 Role3 Role2 s i ψ u i ψ p m i v

Measurement Tool

Critani et Cristani al., BMVC et al., 2011 BMVC 2011 Fathi et Fathi al., CVPR et al., 2012 CVPR 2012 Park et al., Park NIPS et 2012 al., NIPS 2012 Rodriguez et al., ICCV 2011 Lan et al., PAMI 2012 Chakraborty et al., CVPR 2013 Yang et al., CVPR 2011 Alahi et al., CVPR 2014 Choi et al., ECCV 2014 Li et al., ICCV 2013 Ryoo et al., CVPR 2013 Pusiol et al., CogSci 2014 Arev et al., SIGGRAPH 2014 Third person view Distance between eyes and camera First person view

Critani et Cristani al., BMVC et al., 2011 BMVC 2011 Fathi et Fathi al., CVPR et al., 2012 CVPR 2012 Park et al., Park NIPS et 2012 al., NIPS 2012 Noninvasiveness Measurement accuracy Third person view Distance between eyes and camera First person view

Scene dynamism Static scene Dynamic scene Rehg, CVPR13 Prabhaker, ECCV12 Prabhakar, CVPR12 Patron-Perez, BMVC10 Yang, CVPR12 Hoai, CVPR14 Dyadic interaction Lan, CVPR12 Ramanathan, CVPR13 Antic, ECCV14 Fathi, CVPR12 Choi, ECCV14 Park, NIPS12, ICCV13 Ding, ECCV10 Choi, ECCV12, CVPR14 Direkoglu, ECCV12 Cristani, BMVC11 Park, CVPR15 Arev, SIGGRAPH14 Wang, ECCV10 Gallagher, CVPR09 Crowd interaction Number of group members Rodriguez, ICCV11a, ICCVb Mehran, CVPR09 Alahi, CVPR14