Domain Generalization and Adaptation using Low Rank Exemplar Classifiers

Size: px
Start display at page:

Download "Domain Generalization and Adaptation using Low Rank Exemplar Classifiers"

Transcription

1 Domain Generalization and Adaptation using Low Rank Exemplar Classifiers 报告人 : 李文 苏黎世联邦理工学院计算机视觉实验室 李文 苏黎世联邦理工学院 9/20/2017 1

2 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Summary 李文 苏黎世联邦理工学院 9/20/2017 2

3 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Summary 李文 苏黎世联邦理工学院 9/20/2017 3

4 Domain Adaptation: Examples Examples Web <-> Consumer Sketch <-> Photo Synthetic <-> Real Fall <-> Winter K. Saenko, B. Kulis, M. Fritz and T. Darrell. Adapting Visual Category Models to New Domains. In ECCV, T. Kim M. Cha, H. Kim, J. Lee J. Kim. Learning to Discover Cross-Domain Relations with Generative Adversarial Networks. In ICCV 2017 Placeholder X. Peng, K. for Saenko. organisational Synthetic unit to Real name Adaptation / logo with Generative Correlation Alignment Networks. Arxiv , (edit J. in Hoffman, slide master D. Wang, via View F. Yu, > T. Slide Darrel. Master ) FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation. Arxiv , 李文 苏黎世联邦理工学院 9/20/

5 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) 李文 苏黎世联邦理工学院 9/20/2017 5

6 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) i.i.d assumption may not always hold Data collection bias is inevitable The real-world test data changes easily 李文 苏黎世联邦理工学院 9/20/2017 6

7 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) i.i.d assumption may not always hold Data collection bias is inevitable The real-world visual data varies a lot 李文 苏黎世联邦理工学院 9/20/2017 7

8 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) i.i.d assumption may not always hold Data collection bias is inevitable The real-world visual data varies a lot 李文 苏黎世联邦理工学院 9/20/2017 8

9 Dataset Bias Name that dataset! Caltech-101 LabelMe MSRC COIL-100 UIUC ImageNet Tiny 15 Scenes Corel Caltech-256 PASCAL 07 SUN09 (edit A. Torralba, in slide master and A. via Efros. View Unbiased > Slide Look Master ) at Dataset Bias. In CVPR 李文 苏黎世联邦理工学院 9/20/2017 9

10 Dataset Bias Name that dataset! Caltech LabelMe 7 MSRC 3 COIL UIUC 2 ImageNet 5 Tiny Images 4 15 Scenes 9 Corel 10 Caltech PASCAL 07 6 SUN09 8 Current vision datasets contains their own biases, regardless of their semantic categories. (edit A. Torralba, in slide master and A. via Efros. View Unbiased > Slide Look Master ) at Dataset Bias. In CVPR 李文 苏黎世联邦理工学院 9/20/

11 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) i.i.d assumption may not always hold Data collection bias is inevitable The real-world visual data varies a lot 李文 苏黎世联邦理工学院 9/20/

12 Dataset Bias Real-world objects vary a lot (edit A. Torralba, in slide master and A. via Efros. View Unbiased > Slide Look Master ) at Dataset Bias. In CVPR 李文 苏黎世联邦理工学院 9/20/

13 Dataset Bias Impact of Dataset Bias Cross-dataset Classification Performance Car Classification S L P I C M drop S % L % P % I % C % M % (edit A. Torralba, in slide master and A. via Efros. View Unbiased > Slide Look Master ) at Dataset Bias. In CVPR 李文 苏黎世联邦理工学院 9/20/

14 Dataset Bias Impact of Dataset Bias Cross-dataset Classification Performance Car Classification S L P I C M drop S % L % P % I % C % M % Dataset bias harms the crossdataset classification performance. (edit A. Torralba, in slide master and A. via Efros. View Unbiased > Slide Look Master ) at Dataset Bias. In CVPR 李文 苏黎世联邦理工学院 9/20/

15 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) i.i.d assumption may not always hold Data collection bias is inevitable The real-world visual data varies a lot Re-collect data? Cost money and time! 李文 苏黎世联邦理工学院 9/20/

16 Domain Adaptation: Problem Description Domain Adaptation bike: cup: Source Domain P s (x, y) P s (x, y) P t (x, y) Target Domain P t (x, y) 李文 苏黎世联邦理工学院 9/20/

17 Domain Adaptation: Problem Description (Unsupervised) Domain Adaptation n s + Source domain: *(x s i, y s i ) i=1 Target domain: *x t n i t i=1 + Data distribution mismatch: P s (x s, y s ) P t (x t, y t ) or P s (x s ) P t (x t ) Feature space and label space are consistent: x s, x t R D y s, y t L 李文 苏黎世联邦理工学院 9/20/

18 Domain Adaptation: Problem Description (Unsupervised) Domain Adaptation n s + Source domain: *(x s i, y s i ) i=1 Target domain: *x t n i t i=1 + Data distribution mismatch: P s (x s, y s ) P t (x t, y t ) or P s (x s ) P t (x t ) Feature space and label space are consistent: x s, x t R D y s, y t L Related Concepts: Transfer Learning Label space are different (cross-task) Heterogeneous Domain Adaptation Feature space are different (cross-feature) 李文 苏黎世联邦理工学院 9/20/

19 Domain Adaptation: Related Works Feature-Level Methods Strategy: Traditional methods P s (x s ) P t (x t ) P s (g(x s )) P t (x t ) For example, TCA, SGF, GFK, SA, DIP, based on subspace and manifold principles. 李文 苏黎世联邦理工学院 9/20/

20 Domain Adaptation: Related Works Feature-Level Methods Strategy: Traditional methods P s (x s ) P t (x t ) P s (g(x s )) P t (x t ) For example, TCA, SGF, GFK, SA, DIP, based on subspace and manifold principles. CNN based methods Top-down: DAN, JAN, GRL, DRCN, Bottom-Up: AdaBN, AutoDIAL source target CNN cls loss DA loss 李文 苏黎世联邦理工学院 9/20/

21 Domain Adaptation: Related Works Feature-Level Methods Strategy: Traditional methods P s (x s ) P t (x t ) P s (g(x s )) P t (x t ) For example, TCA, SGF, GFK, SA, DIP, based on subspace and manifold principles. CNN based methods Top-down: DAN, JAN, GRL, DRCN, Bottom-Up: AdaBN, AutoDIAL source target CNN loss Domain Adaptive Normalization 李文 苏黎世联邦理工学院 9/20/

22 Domain Adaptation: Related Works Feature-Level Methods Strategy: P s (x s ) P t (x t ) P s (g(x s )) P t (x t ) Traditional methods For example, TCA, SGF, GFK, SA, DIP, based on subspace and manifold principles. CNN based methods Top-down: DAN, JAN, GRL, DRCN, Bottom-Up: AdaBN, AutoDIAL Image-Level CycleGAN, DiscoGAN, DualGAN, UNIT 李文 苏黎世联邦理工学院 9/20/

23 Domain Adaptation: Related Works Instance-Level Methods Strategy: Methods KMM, DA-SVM Deep Methods Transductive DA, Associative DA P s (x s ) P t (x t ) g(x s )P s (x s ) P t (x t ) 李文 苏黎世联邦理工学院 9/20/

24 Domain Generalization What if we do not know about target domain? Multi-source domain generalization source-1 source target source-2? source-m Domain Adaptation Domain Generalization K. Muandet, D. Balduzzi, B. Scholkopf. Domain generalization via invariant feature representation. In ICML, 李文 苏黎世联邦理工学院 9/20/

25 Domain Generalization Problems Real world domains are not distinctive source-1 source-2? source-m 李文 苏黎世联邦理工学院 9/20/

26 Domain Generalization Problems A more common case, single but diverse source domain source-1 source-2? source domain source-m 李文 苏黎世联邦理工学院 9/20/

27 Domain Generalization Latent Domain Discovery Partition one source domain into multiple latent domains source domain latent-1 latent-2 latent-3 J. Hoffman, B. Kulis, T. Darrell, K. Saenko. Discovering Latent Domains For Multisource Domain Adaptation. In ECCV (edit in slide B. Gong, master K. via Grauman, View > and Slide F. Sha. Master ) Reshaping Visual Datasets for Domain Adaptation. In NIPS 2013 李文 苏黎世联邦理工学院 9/20/

28 Domain Generalization Latent Domain Discovery Partition one source domain into multiple latent domains source domain latent-1 latent-2 latent-3 How many latent domains? Non-trivial to disentangle correlated variances J. Hoffman, B. Kulis, T. Darrell, K. Saenko. Discovering Latent Domains For Multisource Domain Adaptation. In ECCV (edit in slide B. Gong, master K. via Grauman, View > and Slide F. Sha. Master ) Reshaping Visual Datasets for Domain Adaptation. In NIPS 2013 李文 苏黎世联邦理工学院 9/20/

29 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Evolving Domain Adaptation Conclusions and Future Work 李文 苏黎世联邦理工学院 9/20/

30 Low-Rank Exemplar Classifiers for Domain Generalization One-stage Approach source-1 source source-2 source-m domain adaptation domain generalization 李文 苏黎世联邦理工学院 9/20/

31 Low-Rank Exemplar Classifiers for Domain Generalization One-stage Approach source domain adaptation domain generalization 李文 苏黎世联邦理工学院 9/20/

32 Exemplar SVMs (E-SVMs) SVM vs E-SVMs Each exemplar SVM is trained using one positive samples and all negative samples SVM E-SVM 1 E-SVM 2 T. Malisiewicz, A. Gupta, A. Efros. Ensemble of Exemplar-SVMs for Object Detection and Beyond. In ICCV 2011 李文 苏黎世联邦理工学院 9/20/

33 Exemplar SVMs (E-SVMs) SVM vs E-SVMs Each exemplar SVM is trained using one positive samples and all negative samples bike, clean background front viewpoint bike, indoor background front viewpoint SVM E-SVM 1 E-SVM 2 李文 苏黎世联邦理工学院 9/20/

34 LRE-SVMs for Domain Generalization Ensemble Exemplar Classifiers Example Exemplar Classifiers Test Sample 李文 苏黎世联邦理工学院 9/20/

35 LRE-SVMs for Domain Generalization Ensemble Exemplar Classifiers Example Sensitive to noise Exemplar Classifiers Test Sample 李文 苏黎世联邦理工学院 9/20/

36 Exemplar SVMs (E-SVMs) Learning Objective Training data where loss is defined as Unify all exemplar SVMs 李文 苏黎世联邦理工学院 9/20/

37 Low Rank Exemplar SVMs (LRE-SVMs) Low Rank Property Using n E-SVMs to predict n positive training samples 李文 苏黎世联邦理工学院 9/20/

38 Low Rank Exemplar SVMs (LRE-SVMs) Low Rank Property Using n E-SVMs to predict n positive training samples G W R n n is low-rank 李文 苏黎世联邦理工学院 9/20/

39 Low Rank Exemplar SVMs (LRE-SVMs) Objective Adding a nuclear-norm based regularizer on G(W) Where G W = g ij 李文 苏黎世联邦理工学院 9/20/

40 Optimization Introducing an intermediate variable Alternating optimization Fix W, Update F Fix F, Update W Main computational cost! Placeholder J. Cai, for E. organisational J. Cands, and Z. unit Shen. name A singular / logo value thresholding algorithm for matrix completion, (edit in slide In SIAM master Journal via View on Optimization, > Slide Master ) vol. 20, no. 4, pp , 2010 李文 苏黎世联邦理工学院 9/20/

41 Low Rank Exemplar LS-SVMs (LRE-LSSVMs) Using least square SVM as the base classifier Loss function is changed Closed form solution 李文 苏黎世联邦理工学院 9/20/

42 Low Rank Exemplar LS-SVMs (LRE-LSSVMs) A fast solution exemplar n+m Cost only O((n + m) 2 ) 李文 苏黎世联邦理工学院 9/20/

43 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Evolving Domain Adaptation Conclusions and Future Work 李文 苏黎世联邦理工学院 9/20/

44 LRE-SVMs for Domain Generalization Ensemble Exemplar Classifiers Example Exemplar Classifiers Test Sample 李文 苏黎世联邦理工学院 9/20/

45 LRE-SVMs for Domain Adaptation Ensemble Exemplar Classifiers Similarity between E-SVM training data and the target domain f(mmd) Source Domain Target Domain 李文 苏黎世联邦理工学院 9/20/

46 LRE-SVMs for Domain Adaptation Learn A Unified Classifier where f is the prediction score obtained using ensemble E-SVMs Placeholder L. Duan, for D. organisational Xu, and I. W. Tsang. unit name Domain / logo adaptation from multiple sources: A domain-dependent regularization approach. (edit in In slide T-NNLS, master vol. via 23, View no. 3, > Slide pp , Master ) March 2012 李文 苏黎世联邦理工学院 9/20/

47 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Conclusions and Future Work 李文 苏黎世联邦理工学院 9/20/

48 Datasets Action Recognition IXMAS dataset: 5 actions from 5 different view points Mix several viewpoints, and leave remaining s for test Object Recognition Office-Caltech Dataset: 10 classes from 4 domains, Caltech-256, Amazon, DSLR, Webcam Mix several domains, and leave remaining's for test 李文 苏黎世联邦理工学院 9/20/

49 Domain Generalization Action Recognition Results (0,1)->(2,3,4) (2,3,4)->(0,1) (0,1,2,3)->4 SVM E-SVMs E-LSSVMs LRE-SVMs LRE-LSSVMs Summary Ensemble exemplar classifiers help domain generalization LRE-SVMs and LRE-LSSVMs improves E-SVMs and E-LSSVMs. 李文 苏黎世联邦理工学院 9/20/

50 Domain Generalization Experimental Comparisons 李文 苏黎世联邦理工学院 9/20/

51 LRE-SVMs v.s. LRE-LSSVMs Training Time IXMAS, (0,1) -> (2,3,4) Training Time (seconds) LRE-SVMS LRE-LSSVMs Summary LRE-LSSVM is more than 80 times faster than LRE-SVMs 李文 苏黎世联邦理工学院 9/20/

52 Low Rank Regularizer Visualization on the prediction matrix G(W) IXMAS, (0,1) -> (2,3,4) Clustering viewpoints, actors 李文 苏黎世联邦理工学院 9/20/

53 Low Rank Regularizer Visualization on the prediction matrix G(W) IXMAS, (0,1) -> (2,3,4) Clustering viewpoints, actors 李文 苏黎世联邦理工学院 9/20/

54 Domain Adaptation Action Recognition Using target domain information can further improves the accuracy (0,1)->(2,3,4) (2,3,4)->(0,1) (0,1,2,3)->4 SVM LRE-LSSVMs LRE-LSSVMs-DA 李文 苏黎世联邦理工学院 9/20/

55 Domain Adaptation: Object Recognition Comparison with deep domain adaptation approaches 图表标题 (A,C)->(D,W) (D,W)->(A,C) (C,D,W)->A SVM Ours DAN DAN+Ours GRL GRL+Ours 李文 苏黎世联邦理工学院 9/20/

56 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Evolving Domain Adaptation Summary 李文 苏黎世联邦理工学院 9/20/

57 Summary Summary Exploiting inner-domain structure helps cross-domain generalization ability LRE-SVMs and LRE-LSSVMs Learn locality domain property based on E-SVMs Exploit the low-rank structure of prediction matrix Complementary to current CNN based (global) domain adaptation methods. 李文 苏黎世联邦理工学院 9/20/

58 Future Trends Increasingly important in deep learning era large scale training data => high cost on annotation new principles: GRL, AdaBN/AutoDIAL, JAN, CycleGAN New issues: domain generalization, transfer learning, heterogeneous domain adaptation Task-oriented: synthetic to real adaptation, semantic segmentation, video recognition, image captioning, VQA 李文 苏黎世联邦理工学院 9/20/

59 Collaborators and References Collaborators Zheng Xu Li Niu Dengxin Dai Dong Xu Luc Van Gool References 1) W. Li, Z. Xu, D. Xu, D. Dai, and L. Van Gool. Domain Generalization and Adaptation using Low Rank Exemplar SVMs. In T-PAMI, 2017 (In Press) 2) Z. Xu, W. Li, L. Niu, and D. Xu. Exploiting Low-rank Structure from Latent Domains for Domain Generalization. In ECCV 李文 苏黎世联邦理工学院 9/20/

60 WebVision Challenge Learning from Web Data Flickr Images Google Images, Learning Representation CNN Categorization Detection Segmentation Dataset 1,000 categories, 2.4 million images, 50,000 validation images Challenge WebVision classification track Transfer learning track $10,000 cash prize! 李文 苏黎世联邦理工学院 9/20/

61 Thank you! 欢迎提问和指正! Wen Li - Learning from Web Data 62

DMAIN adaptation techniques, which aim to reduce the

DMAIN adaptation techniques, which aim to reduce the Domain Generalization and Adaptation using Low Rank Exemplar SVMs Wen Li, Zheng Xu, Dong Xu, Dengxin Dai, and Luc Van Gool Abstract Domain adaptation between diverse source and target domains is a challenging

More information

CONSTRAINED SEMI-SUPERVISED LEARNING ATTRIBUTES USING ATTRIBUTES AND COMPARATIVE

CONSTRAINED SEMI-SUPERVISED LEARNING ATTRIBUTES USING ATTRIBUTES AND COMPARATIVE CONSTRAINED SEMI-SUPERVISED LEARNING USING ATTRIBUTES AND COMPARATIVE ATTRIBUTES Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta The Robotics Institute Carnegie Mellon University SUPERVISION BIG-DATA

More information

Beyond R-CNN detection: Learning to Merge Contextual Attribute

Beyond R-CNN detection: Learning to Merge Contextual Attribute Brain Unleashing Series - Beyond R-CNN detection: Learning to Merge Contextual Attribute Shu Kong CS, ICS, UCI 2015-1-29 Outline 1. RCNN is essentially doing classification, without considering contextual

More information

Deep Networks and Beyond. Alan Yuille Bloomberg Distinguished Professor Depts. Cognitive Science and Computer Science Johns Hopkins University

Deep Networks and Beyond. Alan Yuille Bloomberg Distinguished Professor Depts. Cognitive Science and Computer Science Johns Hopkins University Deep Networks and Beyond Alan Yuille Bloomberg Distinguished Professor Depts. Cognitive Science and Computer Science Johns Hopkins University Artificial Intelligence versus Human Intelligence Understanding

More information

Object Recognition: Conceptual Issues. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman

Object Recognition: Conceptual Issues. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman Issues in recognition The statistical viewpoint Generative vs. discriminative methods

More information

Shu Kong. Department of Computer Science, UC Irvine

Shu Kong. Department of Computer Science, UC Irvine Ubiquitous Fine-Grained Computer Vision Shu Kong Department of Computer Science, UC Irvine Outline 1. Problem definition 2. Instantiation 3. Challenge 4. Fine-grained classification with holistic representation

More information

Shu Kong. Department of Computer Science, UC Irvine

Shu Kong. Department of Computer Science, UC Irvine Ubiquitous Fine-Grained Computer Vision Shu Kong Department of Computer Science, UC Irvine Outline 1. Problem definition 2. Instantiation 3. Challenge and philosophy 4. Fine-grained classification with

More information

Learning visual biases from human imagination

Learning visual biases from human imagination Learning visual biases from human imagination Carl Vondrick Hamed Pirsiavash Aude Oliva Antonio Torralba Massachusetts Institute of Technology University of Maryland, Baltimore County {vondrick,oliva,torralba}@mit.edu

More information

Automatic Beautification for Group-photo Facial Expressions using Novel Bayesian GANs

Automatic Beautification for Group-photo Facial Expressions using Novel Bayesian GANs Automatic Beautification for Group-photo Facial Expressions using Novel Bayesian GANs Ji Liu 1, Shuai Li 1,2, Wenfeng Song 1, Liang Liu 1, Hong Qin 3, and Aimin Hao 1 1 Beihang University, 2 Beihang University

More information

Multi-attention Guided Activation Propagation in CNNs

Multi-attention Guided Activation Propagation in CNNs Multi-attention Guided Activation Propagation in CNNs Xiangteng He and Yuxin Peng (B) Institute of Computer Science and Technology, Peking University, Beijing, China pengyuxin@pku.edu.cn Abstract. CNNs

More information

Object Detectors Emerge in Deep Scene CNNs

Object Detectors Emerge in Deep Scene CNNs Object Detectors Emerge in Deep Scene CNNs Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba Presented By: Collin McCarthy Goal: Understand how objects are represented in CNNs Are

More information

Network Dissection: Quantifying Interpretability of Deep Visual Representation

Network Dissection: Quantifying Interpretability of Deep Visual Representation Name: Pingchuan Ma Student number: 3526400 Date: August 19, 2018 Seminar: Explainable Machine Learning Lecturer: PD Dr. Ullrich Köthe SS 2018 Quantifying Interpretability of Deep Visual Representation

More information

Video Saliency Detection via Dynamic Consistent Spatio- Temporal Attention Modelling

Video Saliency Detection via Dynamic Consistent Spatio- Temporal Attention Modelling AAAI -13 July 16, 2013 Video Saliency Detection via Dynamic Consistent Spatio- Temporal Attention Modelling Sheng-hua ZHONG 1, Yan LIU 1, Feifei REN 1,2, Jinghuan ZHANG 2, Tongwei REN 3 1 Department of

More information

Active Deformable Part Models Inference

Active Deformable Part Models Inference Active Deformable Part Models Inference Menglong Zhu Nikolay Atanasov George J. Pappas Kostas Daniilidis GRASP Laboratory, University of Pennsylvania 3330 Walnut Street, Philadelphia, PA 19104, USA Abstract.

More information

Medical Image Analysis

Medical Image Analysis Medical Image Analysis 1 Co-trained convolutional neural networks for automated detection of prostate cancer in multiparametric MRI, 2017, Medical Image Analysis 2 Graph-based prostate extraction in t2-weighted

More information

Putting Context into. Vision. September 15, Derek Hoiem

Putting Context into. Vision. September 15, Derek Hoiem Putting Context into Vision Derek Hoiem September 15, 2004 Questions to Answer What is context? How is context used in human vision? How is context currently used in computer vision? Conclusions Context

More information

GIANT: Geo-Informative Attributes for Location Recognition and Exploration

GIANT: Geo-Informative Attributes for Location Recognition and Exploration GIANT: Geo-Informative Attributes for Location Recognition and Exploration Quan Fang, Jitao Sang, Changsheng Xu Institute of Automation, Chinese Academy of Sciences October 23, 2013 Where is this? La Sagrada

More information

DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS

DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS SEOUL Oct.7, 2016 DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS Gunhee Kim Computer Science and Engineering Seoul National University October

More information

Action Recognition. Computer Vision Jia-Bin Huang, Virginia Tech. Many slides from D. Hoiem

Action Recognition. Computer Vision Jia-Bin Huang, Virginia Tech. Many slides from D. Hoiem Action Recognition Computer Vision Jia-Bin Huang, Virginia Tech Many slides from D. Hoiem This section: advanced topics Convolutional neural networks in vision Action recognition Vision and Language 3D

More information

Hierarchical Convolutional Features for Visual Tracking

Hierarchical Convolutional Features for Visual Tracking Hierarchical Convolutional Features for Visual Tracking Chao Ma Jia-Bin Huang Xiaokang Yang Ming-Husan Yang SJTU UIUC SJTU UC Merced ICCV 2015 Background Given the initial state (position and scale), estimate

More information

arxiv: v1 [cs.cv] 17 Aug 2017

arxiv: v1 [cs.cv] 17 Aug 2017 Deep Learning for Medical Image Analysis Mina Rezaei, Haojin Yang, Christoph Meinel Hasso Plattner Institute, Prof.Dr.Helmert-Strae 2-3, 14482 Potsdam, Germany {mina.rezaei,haojin.yang,christoph.meinel}@hpi.de

More information

Differential Attention for Visual Question Answering

Differential Attention for Visual Question Answering Differential Attention for Visual Question Answering Badri Patro and Vinay P. Namboodiri IIT Kanpur { badri,vinaypn }@iitk.ac.in Abstract In this paper we aim to answer questions based on images when provided

More information

Convolutional Neural Networks (CNN)

Convolutional Neural Networks (CNN) Convolutional Neural Networks (CNN) Algorithm and Some Applications in Computer Vision Luo Hengliang Institute of Automation June 10, 2014 Luo Hengliang (Institute of Automation) Convolutional Neural Networks

More information

Family Member Identification from Photo Collections

Family Member Identification from Photo Collections 2015 IEEE Winter Conference on Applications of Computer Vision Family Member Identification from Photo Collections Qieyun Dai 1,2 Peter Carr 2 Leonid Sigal 2 Derek Hoiem 1 1 University of Illinois at Urbana-Champaign

More information

Attentional Masking for Pre-trained Deep Networks

Attentional Masking for Pre-trained Deep Networks Attentional Masking for Pre-trained Deep Networks IROS 2017 Marcus Wallenberg and Per-Erik Forssén Computer Vision Laboratory Department of Electrical Engineering Linköping University 2014 2017 Per-Erik

More information

Region Proposals. Jan Hosang, Rodrigo Benenson, Piotr Dollar, Bernt Schiele

Region Proposals. Jan Hosang, Rodrigo Benenson, Piotr Dollar, Bernt Schiele Region Proposals Jan Hosang, Rodrigo Benenson, Piotr Dollar, Bernt Schiele Who has read a proposal paper? 2 Who has read a proposal paper? Who knows what Average Recall is? 2 Who has read a proposal paper?

More information

Task-driven Webpage Saliency

Task-driven Webpage Saliency Task-driven Webpage Saliency Quanlong Zheng 1[0000 0001 5059 0078], Jianbo Jiao 1,2[0000 0003 0833 5115], Ying Cao 1[0000 0002 9288 3167], and Rynson W.H. Lau 1[0000 0002 8957 8129] 1 Department of Computer

More information

Part 1: Bag-of-words models. by Li Fei-Fei (Princeton)

Part 1: Bag-of-words models. by Li Fei-Fei (Princeton) Part 1: Bag-of-words models by Li Fei-Fei (Princeton) Object Bag of words Analogy to documents Of all the sensory impressions proceeding to the brain, the visual experiences are the dominant ones. Our

More information

Learning Saliency Maps for Object Categorization

Learning Saliency Maps for Object Categorization Learning Saliency Maps for Object Categorization Frank Moosmann, Diane Larlus, Frederic Jurie INRIA Rhône-Alpes - GRAVIR - CNRS, {Frank.Moosmann,Diane.Larlus,Frederic.Jurie}@inrialpes.de http://lear.inrialpes.fr

More information

Recognising Human-Object Interaction via Exemplar based Modelling

Recognising Human-Object Interaction via Exemplar based Modelling 2013 IEEE International Conference on Computer Vision Recognising Human-Object Interaction via Exemplar based Modelling Jian-Fang Hu, Wei-Shi Zheng, Jianhuang Lai, Shaogang Gong, and Tao Xiang School of

More information

Acquiring Visual Classifiers from Human Imagination

Acquiring Visual Classifiers from Human Imagination Acquiring Visual Classifiers from Human Imagination Carl Vondrick, Hamed Pirsiavash, Aude Oliva, Antonio Torralba Massachusetts Institute of Technology {vondrick,hpirsiav,oliva,torralba}@mit.edu White

More information

Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition

Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition Stefan Mathe, Cristian Sminchisescu Presented by Mit Shah Motivation Current Computer Vision Annotations subjectively

More information

Visual Saliency Based on Multiscale Deep Features Supplementary Material

Visual Saliency Based on Multiscale Deep Features Supplementary Material Visual Saliency Based on Multiscale Deep Features Supplementary Material Guanbin Li Yizhou Yu Department of Computer Science, The University of Hong Kong https://sites.google.com/site/ligb86/mdfsaliency/

More information

Vision: Over Ov view Alan Yuille

Vision: Over Ov view Alan Yuille Vision: Overview Alan Yuille Why is Vision Hard? Complexity and Ambiguity of Images. Range of Vision Tasks. More 10x10 images 256^100 = 6.7 x 10 ^240 than the total number of images seen by all humans

More information

When Saliency Meets Sentiment: Understanding How Image Content Invokes Emotion and Sentiment

When Saliency Meets Sentiment: Understanding How Image Content Invokes Emotion and Sentiment When Saliency Meets Sentiment: Understanding How Image Content Invokes Emotion and Sentiment Honglin Zheng1, Tianlang Chen2, Jiebo Luo3 Department of Computer Science University of Rochester, Rochester,

More information

Motivation: Attention: Focusing on specific parts of the input. Inspired by neuroscience.

Motivation: Attention: Focusing on specific parts of the input. Inspired by neuroscience. Outline: Motivation. What s the attention mechanism? Soft attention vs. Hard attention. Attention in Machine translation. Attention in Image captioning. State-of-the-art. 1 Motivation: Attention: Focusing

More information

Power SVM: Generalization with Exemplar Classification Uncertainty

Power SVM: Generalization with Exemplar Classification Uncertainty Power SVM: Generalization with Exemplar Classification Uncertainty Weiyu Zhang Stella X. Yu Shang-Hua Teng University of Pennsylvania UC Berkeley / ICSI University of Southern California zhweiyu@seas.upenn.edu

More information

Annotation and Retrieval System Using Confabulation Model for ImageCLEF2011 Photo Annotation

Annotation and Retrieval System Using Confabulation Model for ImageCLEF2011 Photo Annotation Annotation and Retrieval System Using Confabulation Model for ImageCLEF2011 Photo Annotation Ryo Izawa, Naoki Motohashi, and Tomohiro Takagi Department of Computer Science Meiji University 1-1-1 Higashimita,

More information

Fine-Grained Image Classification Using Color Exemplar Classifiers

Fine-Grained Image Classification Using Color Exemplar Classifiers Fine-Grained Image Classification Using Color Exemplar Classifiers Chunjie Zhang 1, Wei Xiong 1, Jing Liu 2, Yifan Zhang 2, Chao Liang 3, and Qingming Huang 1,4 1 School of Computer and Control Engineering,

More information

Chapter 2 Where Next in Object Recognition and how much Supervision Do We Need?

Chapter 2 Where Next in Object Recognition and how much Supervision Do We Need? Chapter 2 Where Next in Object Recognition and how much Supervision Do We Need? Sandra Ebert and Bernt Schiele Abstract Object class recognition is an active topic in computer vision still presenting many

More information

Generic object recognition May 17 th, 2018

Generic object recognition May 17 th, 2018 Generic object recognition May 17 th, 2018 Yong Jae Lee UC Davis Visual words Map high dimensional descriptors to tokens/words by quantizing the feature space Quantize via clustering, let cluster centers

More information

Rich feature hierarchies for accurate object detection and semantic segmentation

Rich feature hierarchies for accurate object detection and semantic segmentation Rich feature hierarchies for accurate object detection and semantic segmentation Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik UC Berkeley Tech Report @ http://arxiv.org/abs/1311.2524! Detection

More information

Fine-Grained Image Classification Using Color Exemplar Classifiers

Fine-Grained Image Classification Using Color Exemplar Classifiers Fine-Grained Image Classification Using Color Exemplar Classifiers Chunjie Zhang 1, Wei Xiong 1, Jing Liu 2, Yifan Zhang 2, Chao Liang 3, Qingming Huang 1, 4 1 School of Computer and Control Engineering,

More information

Incorporating Word Correlation Knowledge into Topic Modeling. Pengtao Xie. Joint work with Diyi Yang and Eric Xing

Incorporating Word Correlation Knowledge into Topic Modeling. Pengtao Xie. Joint work with Diyi Yang and Eric Xing Incorporating Word Correlation Knowledge into Topic Modeling Pengtao Xie Joint work with Diyi Yang and Eric Xing 1 Outline Motivation Incorporating Word Correlation Knowledge into Topic Modeling Experiments

More information

Group Behavior Analysis and Its Applications

Group Behavior Analysis and Its Applications Group Behavior Analysis and Its Applications CVPR 2015 Tutorial Lecturers: Hyun Soo Park (University of Pennsylvania) Wongun Choi (NEC America Laboratory) Schedule 08:30am-08:50am 08:50am-09:50am 09:50am-10:10am

More information

arxiv: v1 [cs.cv] 16 Jun 2012

arxiv: v1 [cs.cv] 16 Jun 2012 How important are Deformable Parts in the Deformable Parts Model? Santosh K. Divvala, Alexei A. Efros, Martial Hebert Carnegie Mellon University arxiv:1206.3714v1 [cs.cv] 16 Jun 2012 Abstract. The main

More information

Object Detection. Honghui Shi IBM Research Columbia

Object Detection. Honghui Shi IBM Research Columbia Object Detection Honghui Shi IBM Research 2018.11.27 @ Columbia Outline Problem Evaluation Methods Directions Problem Image classification: Horse (People, Dog, Truck ) Object detection: categories & locations

More information

Where should saliency models look next?

Where should saliency models look next? Where should saliency models look next? Zoya Bylinskii 1, Adrià Recasens 1, Ali Borji 2, Aude Oliva 1, Antonio Torralba 1, and Frédo Durand 1 1 Computer Science and Artificial Intelligence Laboratory Massachusetts

More information

Progressive Attention Guided Recurrent Network for Salient Object Detection

Progressive Attention Guided Recurrent Network for Salient Object Detection Progressive Attention Guided Recurrent Network for Salient Object Detection Xiaoning Zhang, Tiantian Wang, Jinqing Qi, Huchuan Lu, Gang Wang Dalian University of Technology, China Alibaba AILabs, China

More information

AClass: A Simple, Online Probabilistic Classifier. Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL

AClass: A Simple, Online Probabilistic Classifier. Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL AClass: A Simple, Online Probabilistic Classifier Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL AClass: A Simple, Online Probabilistic Classifier or How I learned to stop worrying

More information

Improving the Interpretability of DEMUD on Image Data Sets

Improving the Interpretability of DEMUD on Image Data Sets Improving the Interpretability of DEMUD on Image Data Sets Jake Lee, Jet Propulsion Laboratory, California Institute of Technology & Columbia University, CS 19 Intern under Kiri Wagstaff Summer 2018 Government

More information

IN this paper we examine the role of shape prototypes in

IN this paper we examine the role of shape prototypes in On the Role of Shape Prototypes in Hierarchical Models of Vision Michael D. Thomure, Melanie Mitchell, and Garrett T. Kenyon To appear in Proceedings of the International Joint Conference on Neural Networks

More information

Towards disentangling underlying explanatory factors Yoshua Bengio. July 13th, 2018 ICML 2018 Workshop on Learning with Limited Labels

Towards disentangling underlying explanatory factors Yoshua Bengio. July 13th, 2018 ICML 2018 Workshop on Learning with Limited Labels Towards disentangling underlying explanatory factors Yoshua Bengio July 13th, 2018 ICML 2018 Workshop on Learning with Limited Labels Generalizing Beyond i.i.d. Data Current ML theory is strongly dependent

More information

Neuro-Inspired Statistical. Rensselaer Polytechnic Institute National Science Foundation

Neuro-Inspired Statistical. Rensselaer Polytechnic Institute National Science Foundation Neuro-Inspired Statistical Pi Prior Model lfor Robust Visual Inference Qiang Ji Rensselaer Polytechnic Institute National Science Foundation 1 Status of Computer Vision CV has been an active area for over

More information

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Philipp Bucher Wednesday January 21, 2009 SIB graduate school course EPFL, Lausanne ChIP-seq against histone variants: Biological

More information

Efficient Boosted Exemplar-based Face Detection

Efficient Boosted Exemplar-based Face Detection Efficient Boosted Exemplar-based Face Detection Haoxiang Li, Zhe Lin, Jonathan Brandt, Xiaohui Shen, Gang Hua Stevens Institute of Technology Hoboken, NJ 07030 {hli18, ghua}@stevens.edu Adobe Research

More information

Assigning Relative Importance to Scene Elements

Assigning Relative Importance to Scene Elements Assigning Relative Importance to Scene Elements Igor L. O Bastos, William Robson Schwartz Smart Surveillance Interest Group, Department of Computer Science Universidade Federal de Minas Gerais, Minas Gerais,

More information

DEEP CONVOLUTIONAL ACTIVATION FEATURES FOR LARGE SCALE BRAIN TUMOR HISTOPATHOLOGY IMAGE CLASSIFICATION AND SEGMENTATION

DEEP CONVOLUTIONAL ACTIVATION FEATURES FOR LARGE SCALE BRAIN TUMOR HISTOPATHOLOGY IMAGE CLASSIFICATION AND SEGMENTATION DEEP CONVOLUTIONAL ACTIVATION FEATURES FOR LARGE SCALE BRAIN TUMOR HISTOPATHOLOGY IMAGE CLASSIFICATION AND SEGMENTATION Yan Xu1,2, Zhipeng Jia2,, Yuqing Ai2,, Fang Zhang2,, Maode Lai4, Eric I-Chao Chang2

More information

Computational modeling of visual attention and saliency in the Smart Playroom

Computational modeling of visual attention and saliency in the Smart Playroom Computational modeling of visual attention and saliency in the Smart Playroom Andrew Jones Department of Computer Science, Brown University Abstract The two canonical modes of human visual attention bottomup

More information

Toward a Taxonomy and Computational Models of Abnormalities in Images

Toward a Taxonomy and Computational Models of Abnormalities in Images Toward a Taxonomy and Computational Models of Abnormalities in Images Babak Saleh Dept. of Computer Science Rutgers University New Jersey, USA Ahmed Elgammal Dept. of Computer Science Rutgers University

More information

Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags

Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman Department of Computer Science University of Texas at Austin {sjhwang,grauman}@cs.utexas.edu

More information

Kai-Wei Chang UCLA. What It Takes to Control Societal Bias in Natural Language Processing. References:

Kai-Wei Chang UCLA. What It Takes to Control Societal Bias in Natural Language Processing. References: What It Takes to Control Societal Bias in Natural Language Processing Kai-Wei Chang UCLA References: http://kwchang.net Kai-Wei Chang (kwchang.net/talks/sp.html) 1 A father and son get in a car crash and

More information

Visual Scene Understanding

Visual Scene Understanding Visual Scene Understanding Aude Oliva Department of Brain and Cognitive Sciences Massachusetts Institute of Technology Website: http://cvcl.mit.edu PPA High-level Scene Representation I. Long-term Memory

More information

Automatic Prostate Cancer Classification using Deep Learning. Ida Arvidsson Centre for Mathematical Sciences, Lund University, Sweden

Automatic Prostate Cancer Classification using Deep Learning. Ida Arvidsson Centre for Mathematical Sciences, Lund University, Sweden Automatic Prostate Cancer Classification using Deep Learning Ida Arvidsson Centre for Mathematical Sciences, Lund University, Sweden Outline Autoencoders, theory Motivation, background and goal for prostate

More information

arxiv: v1 [cs.cv] 21 Nov 2018

arxiv: v1 [cs.cv] 21 Nov 2018 Rethinking ImageNet Pre-training Kaiming He Ross Girshick Piotr Dollár Facebook AI Research (FAIR) arxiv:1811.8883v1 [cs.cv] 21 Nov 218 Abstract We report competitive results on object detection and instance

More information

Flexible, High Performance Convolutional Neural Networks for Image Classification

Flexible, High Performance Convolutional Neural Networks for Image Classification Flexible, High Performance Convolutional Neural Networks for Image Classification Dan C. Cireşan, Ueli Meier, Jonathan Masci, Luca M. Gambardella, Jürgen Schmidhuber IDSIA, USI and SUPSI Manno-Lugano,

More information

Jia Jia Tsinghua University 26/09/2017

Jia Jia Tsinghua University 26/09/2017 Jia Jia jjia@tsinghua.edu.cn Tsinghua University 26/09/2017 Stage 1: Online detection of mental health problems Stress Detection via Harvesting Social Media Detecting Stress Based on Social Interactions

More information

Learned Region Sparsity and Diversity Also Predict Visual Attention

Learned Region Sparsity and Diversity Also Predict Visual Attention Learned Region Sparsity and Diversity Also Predict Visual Attention Zijun Wei 1, Hossein Adeli 2, Gregory Zelinsky 1,2, Minh Hoai 1, Dimitris Samaras 1 1. Department of Computer Science 2. Department of

More information

arxiv: v1 [cs.cv] 12 Dec 2016

arxiv: v1 [cs.cv] 12 Dec 2016 Text-guided Attention Model for Image Captioning Jonghwan Mun, Minsu Cho, Bohyung Han Department of Computer Science and Engineering, POSTECH, Korea {choco1916, mscho, bhhan}@postech.ac.kr arxiv:1612.03557v1

More information

Semi-Supervised Disentangling of Causal Factors. Sargur N. Srihari

Semi-Supervised Disentangling of Causal Factors. Sargur N. Srihari Semi-Supervised Disentangling of Causal Factors Sargur N. srihari@cedar.buffalo.edu 1 Topics in Representation Learning 1. Greedy Layer-Wise Unsupervised Pretraining 2. Transfer Learning and Domain Adaptation

More information

arxiv: v2 [cs.cv] 19 Dec 2017

arxiv: v2 [cs.cv] 19 Dec 2017 An Ensemble of Deep Convolutional Neural Networks for Alzheimer s Disease Detection and Classification arxiv:1712.01675v2 [cs.cv] 19 Dec 2017 Jyoti Islam Department of Computer Science Georgia State University

More information

Learning Spatiotemporal Gaps between Where We Look and What We Focus on

Learning Spatiotemporal Gaps between Where We Look and What We Focus on Express Paper Learning Spatiotemporal Gaps between Where We Look and What We Focus on Ryo Yonetani 1,a) Hiroaki Kawashima 1,b) Takashi Matsuyama 1,c) Received: March 11, 2013, Accepted: April 24, 2013,

More information

Vector Learning for Cross Domain Representations

Vector Learning for Cross Domain Representations Vector Learning for Cross Domain Representations Shagan Sah, Chi Zhang, Thang Nguyen, Dheeraj Kumar Peri, Ameya Shringi, Raymond Ptucha Rochester Institute of Technology, Rochester, NY 14623, USA arxiv:1809.10312v1

More information

A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes

A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes Renwu Gao 1(B), Faisal Shafait 2, Seiichi Uchida 3, and Yaokai Feng 3 1 Information Sciene and Electrical Engineering, Kyushu

More information

High-level Vision. Bernd Neumann Slides for the course in WS 2004/05. Faculty of Informatics Hamburg University Germany

High-level Vision. Bernd Neumann Slides for the course in WS 2004/05. Faculty of Informatics Hamburg University Germany High-level Vision Bernd Neumann Slides for the course in WS 2004/05 Faculty of Informatics Hamburg University Germany neumann@informatik.uni-hamburg.de http://kogs-www.informatik.uni-hamburg.de 1 Contents

More information

Experiment Presentation CS Chris Thomas Experiment: What is an Object? Alexe, Bogdan, et al. CVPR 2010

Experiment Presentation CS Chris Thomas Experiment: What is an Object? Alexe, Bogdan, et al. CVPR 2010 Experiment Presentation CS 3710 Chris Thomas Experiment: What is an Object? Alexe, Bogdan, et al. CVPR 2010 1 Preliminaries Code for What is An Object? available online Version 2.2 Achieves near 90% recall

More information

NMF-Density: NMF-Based Breast Density Classifier

NMF-Density: NMF-Based Breast Density Classifier NMF-Density: NMF-Based Breast Density Classifier Lahouari Ghouti and Abdullah H. Owaidh King Fahd University of Petroleum and Minerals - Department of Information and Computer Science. KFUPM Box 1128.

More information

Comparative object similarity for improved recognition with few or no examples

Comparative object similarity for improved recognition with few or no examples Comparative object similarity for improved recognition with few or no examples Gang Wang David Forsyth 2 Derek Hoiem 2 Dept. of Electrical and Computer Engineering 2 Dept. of Computer Science University

More information

A Review of Co-saliency Detection Technique: Fundamentals, Applications, and Challenges

A Review of Co-saliency Detection Technique: Fundamentals, Applications, and Challenges REGULAR PAPER 1 A Review of Co-saliency Detection Technique: Fundamentals, Applications, and Challenges Dingwen Zhang, Huazhu Fu, Junwei Han, Senior Member, IEEE, Feng Wu, Fellow, IEEE arxiv:1604.07090v2

More information

Automated Volumetric Cardiac Ultrasound Analysis

Automated Volumetric Cardiac Ultrasound Analysis Whitepaper Automated Volumetric Cardiac Ultrasound Analysis ACUSON SC2000 Volume Imaging Ultrasound System Bogdan Georgescu, Ph.D. Siemens Corporate Research Princeton, New Jersey USA Answers for life.

More information

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014 UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014 Exam policy: This exam allows two one-page, two-sided cheat sheets (i.e. 4 sides); No other materials. Time: 2 hours. Be sure to write

More information

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering SUPPLEMENTARY MATERIALS 1. Implementation Details 1.1. Bottom-Up Attention Model Our bottom-up attention Faster R-CNN

More information

arxiv: v2 [cs.cv] 17 Jun 2016

arxiv: v2 [cs.cv] 17 Jun 2016 Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions? Abhishek Das 1, Harsh Agrawal 1, C. Lawrence Zitnick 2, Devi Parikh 1, Dhruv Batra 1 1 Virginia Tech,

More information

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil CSE 5194.01 - Introduction to High-Perfomance Deep Learning ImageNet & VGG Jihyung Kil ImageNet Classification with Deep Convolutional Neural Networks Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton,

More information

Capturing human category representations by sampling in deep feature spaces

Capturing human category representations by sampling in deep feature spaces Capturing human category representations by sampling in deep feature spaces Joshua C. Peterson 1 (jpeterson@berkeley.edu) Jordan W. Suchow 1 (suchow@berkeley.edu) Krisha Aghi 2 (kaghi@berkeley.edu) Alexander

More information

Unsupervised Learning of Micro-Action Exemplars using a Product Manifold

Unsupervised Learning of Micro-Action Exemplars using a Product Manifold 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011 Unsupervised Learning of Micro-Action Exemplars using a Product Manifold Stephen O Hara and Bruce A. Draper Colorado

More information

Domain Adversarial Training for Accented Speech Recognition

Domain Adversarial Training for Accented Speech Recognition Domain Adversarial Training for Accented Speech Recognition Sining Sun [1-3], Ching-Feng Yeh [2], Mei-Yuh Hwang [2], Mari Ostendorf [3], Lei Xie [1] Northwestern Polytechnical University [1] Mobvoi AI

More information

Chapter 1. Introduction

Chapter 1. Introduction Chapter 1 Introduction 1.1 Motivation and Goals The increasing availability and decreasing cost of high-throughput (HT) technologies coupled with the availability of computational tools and data form a

More information

Efficient Deep Model Selection

Efficient Deep Model Selection Efficient Deep Model Selection Jose Alvarez Researcher Data61, CSIRO, Australia GTC, May 9 th 2017 www.josemalvarez.net conv1 conv2 conv3 conv4 conv5 conv6 conv7 conv8 softmax prediction???????? Num Classes

More information

The University of Tokyo, NVAIL Partner Yoshitaka Ushiku

The University of Tokyo, NVAIL Partner Yoshitaka Ushiku Recognize, Describe, and Generate: Introduction of Recent Work at MIL The University of Tokyo, NVAIL Partner Yoshitaka Ushiku MIL: Machine Intelligence Laboratory Beyond Human Intelligence Based on Cyber-Physical

More information

Visual interpretation in pathology

Visual interpretation in pathology 13 Visual interpretation in pathology Tissue architecture (alteration) evaluation e.g., for grading prostate cancer Immunohistochemistry (IHC) staining scoring e.g., HER2 in breast cancer (companion diagnostic

More information

Recommending Outfits from Personal Closet

Recommending Outfits from Personal Closet Recommending Outfits from Personal Closet Pongsate Tangseng 1, Kota Yamaguchi 2, and Takayuki Okatani 1,3 1 Tohoku University, Sendai, Japan 2 CyberAgent, Inc., Tokyo, Japan 3 RIKEN Center for AIP, Tokyo,

More information

DeepMiner: Discovering Interpretable Representations for Mammogram Classification and Explanation

DeepMiner: Discovering Interpretable Representations for Mammogram Classification and Explanation DeepMiner: Discovering Interpretable Representations for Mammogram Classification and Explanation Jimmy Wu 1, Bolei Zhou 1, Diondra Peck 2, Scott Hsieh 3, Vandana Dialani, MD 4 Lester Mackey 5, and Genevieve

More information

Cervical cytology intelligent diagnosis based on object detection technology

Cervical cytology intelligent diagnosis based on object detection technology Cervical cytology intelligent diagnosis based on object detection technology Meiquan Xu xumeiquan@126.com Weixiu Zeng Semptian Co., Ltd. Machine Learning Lab. zengweixiu@gmail.com Hunhui Wu 736886978@qq.com

More information

Towards The Deep Model: Understanding Visual Recognition Through Computational Models. Panqu Wang Dissertation Defense 03/23/2017

Towards The Deep Model: Understanding Visual Recognition Through Computational Models. Panqu Wang Dissertation Defense 03/23/2017 Towards The Deep Model: Understanding Visual Recognition Through Computational Models Panqu Wang Dissertation Defense 03/23/2017 Summary Human Visual Recognition (face, object, scene) Simulate Explain

More information

Affective Computing in 2018

Affective Computing in 2018 Affective Computing in 2018 Department of Computer Science and Technology Tsinghua University Jia Jia 13651399048 jjia@mail.tsinghua.edu.cn 1 Affective Computing Emotion is the driving force of user s

More information

Differentiating Tumor and Edema in Brain Magnetic Resonance Images Using a Convolutional Neural Network

Differentiating Tumor and Edema in Brain Magnetic Resonance Images Using a Convolutional Neural Network Original Article Differentiating Tumor and Edema in Brain Magnetic Resonance Images Using a Convolutional Neural Network Aida Allahverdi 1, Siavash Akbarzadeh 1, Alireza Khorrami Moghaddam 2, Armin Allahverdy

More information

Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images

Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images Sachin Mehta 1, Ezgi Mercan 1, Jamen Bartlett 2, Donald Weaver 2, Joann G. Elmore 1, and Linda Shapiro 1 1 University

More information