Domain Generalization and Adaptation using Low Rank Exemplar Classifiers
|
|
- Hector Watkins
- 5 years ago
- Views:
Transcription
1 Domain Generalization and Adaptation using Low Rank Exemplar Classifiers 报告人 : 李文 苏黎世联邦理工学院计算机视觉实验室 李文 苏黎世联邦理工学院 9/20/2017 1
2 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Summary 李文 苏黎世联邦理工学院 9/20/2017 2
3 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Summary 李文 苏黎世联邦理工学院 9/20/2017 3
4 Domain Adaptation: Examples Examples Web <-> Consumer Sketch <-> Photo Synthetic <-> Real Fall <-> Winter K. Saenko, B. Kulis, M. Fritz and T. Darrell. Adapting Visual Category Models to New Domains. In ECCV, T. Kim M. Cha, H. Kim, J. Lee J. Kim. Learning to Discover Cross-Domain Relations with Generative Adversarial Networks. In ICCV 2017 Placeholder X. Peng, K. for Saenko. organisational Synthetic unit to Real name Adaptation / logo with Generative Correlation Alignment Networks. Arxiv , (edit J. in Hoffman, slide master D. Wang, via View F. Yu, > T. Slide Darrel. Master ) FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation. Arxiv , 李文 苏黎世联邦理工学院 9/20/
5 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) 李文 苏黎世联邦理工学院 9/20/2017 5
6 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) i.i.d assumption may not always hold Data collection bias is inevitable The real-world test data changes easily 李文 苏黎世联邦理工学院 9/20/2017 6
7 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) i.i.d assumption may not always hold Data collection bias is inevitable The real-world visual data varies a lot 李文 苏黎世联邦理工学院 9/20/2017 7
8 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) i.i.d assumption may not always hold Data collection bias is inevitable The real-world visual data varies a lot 李文 苏黎世联邦理工学院 9/20/2017 8
9 Dataset Bias Name that dataset! Caltech-101 LabelMe MSRC COIL-100 UIUC ImageNet Tiny 15 Scenes Corel Caltech-256 PASCAL 07 SUN09 (edit A. Torralba, in slide master and A. via Efros. View Unbiased > Slide Look Master ) at Dataset Bias. In CVPR 李文 苏黎世联邦理工学院 9/20/2017 9
10 Dataset Bias Name that dataset! Caltech LabelMe 7 MSRC 3 COIL UIUC 2 ImageNet 5 Tiny Images 4 15 Scenes 9 Corel 10 Caltech PASCAL 07 6 SUN09 8 Current vision datasets contains their own biases, regardless of their semantic categories. (edit A. Torralba, in slide master and A. via Efros. View Unbiased > Slide Look Master ) at Dataset Bias. In CVPR 李文 苏黎世联邦理工学院 9/20/
11 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) i.i.d assumption may not always hold Data collection bias is inevitable The real-world visual data varies a lot 李文 苏黎世联邦理工学院 9/20/
12 Dataset Bias Real-world objects vary a lot (edit A. Torralba, in slide master and A. via Efros. View Unbiased > Slide Look Master ) at Dataset Bias. In CVPR 李文 苏黎世联邦理工学院 9/20/
13 Dataset Bias Impact of Dataset Bias Cross-dataset Classification Performance Car Classification S L P I C M drop S % L % P % I % C % M % (edit A. Torralba, in slide master and A. via Efros. View Unbiased > Slide Look Master ) at Dataset Bias. In CVPR 李文 苏黎世联邦理工学院 9/20/
14 Dataset Bias Impact of Dataset Bias Cross-dataset Classification Performance Car Classification S L P I C M drop S % L % P % I % C % M % Dataset bias harms the crossdataset classification performance. (edit A. Torralba, in slide master and A. via Efros. View Unbiased > Slide Look Master ) at Dataset Bias. In CVPR 李文 苏黎世联邦理工学院 9/20/
15 Visual Recognition System Visual Recognition System Labeled Training Images Learning Test Images Independently and identically distributed (i.i.d. assumption) i.i.d assumption may not always hold Data collection bias is inevitable The real-world visual data varies a lot Re-collect data? Cost money and time! 李文 苏黎世联邦理工学院 9/20/
16 Domain Adaptation: Problem Description Domain Adaptation bike: cup: Source Domain P s (x, y) P s (x, y) P t (x, y) Target Domain P t (x, y) 李文 苏黎世联邦理工学院 9/20/
17 Domain Adaptation: Problem Description (Unsupervised) Domain Adaptation n s + Source domain: *(x s i, y s i ) i=1 Target domain: *x t n i t i=1 + Data distribution mismatch: P s (x s, y s ) P t (x t, y t ) or P s (x s ) P t (x t ) Feature space and label space are consistent: x s, x t R D y s, y t L 李文 苏黎世联邦理工学院 9/20/
18 Domain Adaptation: Problem Description (Unsupervised) Domain Adaptation n s + Source domain: *(x s i, y s i ) i=1 Target domain: *x t n i t i=1 + Data distribution mismatch: P s (x s, y s ) P t (x t, y t ) or P s (x s ) P t (x t ) Feature space and label space are consistent: x s, x t R D y s, y t L Related Concepts: Transfer Learning Label space are different (cross-task) Heterogeneous Domain Adaptation Feature space are different (cross-feature) 李文 苏黎世联邦理工学院 9/20/
19 Domain Adaptation: Related Works Feature-Level Methods Strategy: Traditional methods P s (x s ) P t (x t ) P s (g(x s )) P t (x t ) For example, TCA, SGF, GFK, SA, DIP, based on subspace and manifold principles. 李文 苏黎世联邦理工学院 9/20/
20 Domain Adaptation: Related Works Feature-Level Methods Strategy: Traditional methods P s (x s ) P t (x t ) P s (g(x s )) P t (x t ) For example, TCA, SGF, GFK, SA, DIP, based on subspace and manifold principles. CNN based methods Top-down: DAN, JAN, GRL, DRCN, Bottom-Up: AdaBN, AutoDIAL source target CNN cls loss DA loss 李文 苏黎世联邦理工学院 9/20/
21 Domain Adaptation: Related Works Feature-Level Methods Strategy: Traditional methods P s (x s ) P t (x t ) P s (g(x s )) P t (x t ) For example, TCA, SGF, GFK, SA, DIP, based on subspace and manifold principles. CNN based methods Top-down: DAN, JAN, GRL, DRCN, Bottom-Up: AdaBN, AutoDIAL source target CNN loss Domain Adaptive Normalization 李文 苏黎世联邦理工学院 9/20/
22 Domain Adaptation: Related Works Feature-Level Methods Strategy: P s (x s ) P t (x t ) P s (g(x s )) P t (x t ) Traditional methods For example, TCA, SGF, GFK, SA, DIP, based on subspace and manifold principles. CNN based methods Top-down: DAN, JAN, GRL, DRCN, Bottom-Up: AdaBN, AutoDIAL Image-Level CycleGAN, DiscoGAN, DualGAN, UNIT 李文 苏黎世联邦理工学院 9/20/
23 Domain Adaptation: Related Works Instance-Level Methods Strategy: Methods KMM, DA-SVM Deep Methods Transductive DA, Associative DA P s (x s ) P t (x t ) g(x s )P s (x s ) P t (x t ) 李文 苏黎世联邦理工学院 9/20/
24 Domain Generalization What if we do not know about target domain? Multi-source domain generalization source-1 source target source-2? source-m Domain Adaptation Domain Generalization K. Muandet, D. Balduzzi, B. Scholkopf. Domain generalization via invariant feature representation. In ICML, 李文 苏黎世联邦理工学院 9/20/
25 Domain Generalization Problems Real world domains are not distinctive source-1 source-2? source-m 李文 苏黎世联邦理工学院 9/20/
26 Domain Generalization Problems A more common case, single but diverse source domain source-1 source-2? source domain source-m 李文 苏黎世联邦理工学院 9/20/
27 Domain Generalization Latent Domain Discovery Partition one source domain into multiple latent domains source domain latent-1 latent-2 latent-3 J. Hoffman, B. Kulis, T. Darrell, K. Saenko. Discovering Latent Domains For Multisource Domain Adaptation. In ECCV (edit in slide B. Gong, master K. via Grauman, View > and Slide F. Sha. Master ) Reshaping Visual Datasets for Domain Adaptation. In NIPS 2013 李文 苏黎世联邦理工学院 9/20/
28 Domain Generalization Latent Domain Discovery Partition one source domain into multiple latent domains source domain latent-1 latent-2 latent-3 How many latent domains? Non-trivial to disentangle correlated variances J. Hoffman, B. Kulis, T. Darrell, K. Saenko. Discovering Latent Domains For Multisource Domain Adaptation. In ECCV (edit in slide B. Gong, master K. via Grauman, View > and Slide F. Sha. Master ) Reshaping Visual Datasets for Domain Adaptation. In NIPS 2013 李文 苏黎世联邦理工学院 9/20/
29 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Evolving Domain Adaptation Conclusions and Future Work 李文 苏黎世联邦理工学院 9/20/
30 Low-Rank Exemplar Classifiers for Domain Generalization One-stage Approach source-1 source source-2 source-m domain adaptation domain generalization 李文 苏黎世联邦理工学院 9/20/
31 Low-Rank Exemplar Classifiers for Domain Generalization One-stage Approach source domain adaptation domain generalization 李文 苏黎世联邦理工学院 9/20/
32 Exemplar SVMs (E-SVMs) SVM vs E-SVMs Each exemplar SVM is trained using one positive samples and all negative samples SVM E-SVM 1 E-SVM 2 T. Malisiewicz, A. Gupta, A. Efros. Ensemble of Exemplar-SVMs for Object Detection and Beyond. In ICCV 2011 李文 苏黎世联邦理工学院 9/20/
33 Exemplar SVMs (E-SVMs) SVM vs E-SVMs Each exemplar SVM is trained using one positive samples and all negative samples bike, clean background front viewpoint bike, indoor background front viewpoint SVM E-SVM 1 E-SVM 2 李文 苏黎世联邦理工学院 9/20/
34 LRE-SVMs for Domain Generalization Ensemble Exemplar Classifiers Example Exemplar Classifiers Test Sample 李文 苏黎世联邦理工学院 9/20/
35 LRE-SVMs for Domain Generalization Ensemble Exemplar Classifiers Example Sensitive to noise Exemplar Classifiers Test Sample 李文 苏黎世联邦理工学院 9/20/
36 Exemplar SVMs (E-SVMs) Learning Objective Training data where loss is defined as Unify all exemplar SVMs 李文 苏黎世联邦理工学院 9/20/
37 Low Rank Exemplar SVMs (LRE-SVMs) Low Rank Property Using n E-SVMs to predict n positive training samples 李文 苏黎世联邦理工学院 9/20/
38 Low Rank Exemplar SVMs (LRE-SVMs) Low Rank Property Using n E-SVMs to predict n positive training samples G W R n n is low-rank 李文 苏黎世联邦理工学院 9/20/
39 Low Rank Exemplar SVMs (LRE-SVMs) Objective Adding a nuclear-norm based regularizer on G(W) Where G W = g ij 李文 苏黎世联邦理工学院 9/20/
40 Optimization Introducing an intermediate variable Alternating optimization Fix W, Update F Fix F, Update W Main computational cost! Placeholder J. Cai, for E. organisational J. Cands, and Z. unit Shen. name A singular / logo value thresholding algorithm for matrix completion, (edit in slide In SIAM master Journal via View on Optimization, > Slide Master ) vol. 20, no. 4, pp , 2010 李文 苏黎世联邦理工学院 9/20/
41 Low Rank Exemplar LS-SVMs (LRE-LSSVMs) Using least square SVM as the base classifier Loss function is changed Closed form solution 李文 苏黎世联邦理工学院 9/20/
42 Low Rank Exemplar LS-SVMs (LRE-LSSVMs) A fast solution exemplar n+m Cost only O((n + m) 2 ) 李文 苏黎世联邦理工学院 9/20/
43 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Evolving Domain Adaptation Conclusions and Future Work 李文 苏黎世联邦理工学院 9/20/
44 LRE-SVMs for Domain Generalization Ensemble Exemplar Classifiers Example Exemplar Classifiers Test Sample 李文 苏黎世联邦理工学院 9/20/
45 LRE-SVMs for Domain Adaptation Ensemble Exemplar Classifiers Similarity between E-SVM training data and the target domain f(mmd) Source Domain Target Domain 李文 苏黎世联邦理工学院 9/20/
46 LRE-SVMs for Domain Adaptation Learn A Unified Classifier where f is the prediction score obtained using ensemble E-SVMs Placeholder L. Duan, for D. organisational Xu, and I. W. Tsang. unit name Domain / logo adaptation from multiple sources: A domain-dependent regularization approach. (edit in In slide T-NNLS, master vol. via 23, View no. 3, > Slide pp , Master ) March 2012 李文 苏黎世联邦理工学院 9/20/
47 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Conclusions and Future Work 李文 苏黎世联邦理工学院 9/20/
48 Datasets Action Recognition IXMAS dataset: 5 actions from 5 different view points Mix several viewpoints, and leave remaining s for test Object Recognition Office-Caltech Dataset: 10 classes from 4 domains, Caltech-256, Amazon, DSLR, Webcam Mix several domains, and leave remaining's for test 李文 苏黎世联邦理工学院 9/20/
49 Domain Generalization Action Recognition Results (0,1)->(2,3,4) (2,3,4)->(0,1) (0,1,2,3)->4 SVM E-SVMs E-LSSVMs LRE-SVMs LRE-LSSVMs Summary Ensemble exemplar classifiers help domain generalization LRE-SVMs and LRE-LSSVMs improves E-SVMs and E-LSSVMs. 李文 苏黎世联邦理工学院 9/20/
50 Domain Generalization Experimental Comparisons 李文 苏黎世联邦理工学院 9/20/
51 LRE-SVMs v.s. LRE-LSSVMs Training Time IXMAS, (0,1) -> (2,3,4) Training Time (seconds) LRE-SVMS LRE-LSSVMs Summary LRE-LSSVM is more than 80 times faster than LRE-SVMs 李文 苏黎世联邦理工学院 9/20/
52 Low Rank Regularizer Visualization on the prediction matrix G(W) IXMAS, (0,1) -> (2,3,4) Clustering viewpoints, actors 李文 苏黎世联邦理工学院 9/20/
53 Low Rank Regularizer Visualization on the prediction matrix G(W) IXMAS, (0,1) -> (2,3,4) Clustering viewpoints, actors 李文 苏黎世联邦理工学院 9/20/
54 Domain Adaptation Action Recognition Using target domain information can further improves the accuracy (0,1)->(2,3,4) (2,3,4)->(0,1) (0,1,2,3)->4 SVM LRE-LSSVMs LRE-LSSVMs-DA 李文 苏黎世联邦理工学院 9/20/
55 Domain Adaptation: Object Recognition Comparison with deep domain adaptation approaches 图表标题 (A,C)->(D,W) (D,W)->(A,C) (C,D,W)->A SVM Ours DAN DAN+Ours GRL GRL+Ours 李文 苏黎世联邦理工学院 9/20/
56 Outline Problems Domain Adaptation and Domain Generalization Low Rank Exemplar Classifiers Low Rank Exemplar Classifiers (LRE-SVMs and LRE-LSSVMs) Domain Generalization and Adaptation Experiments Domain Generalization Domain Adaptation Evolving Domain Adaptation Summary 李文 苏黎世联邦理工学院 9/20/
57 Summary Summary Exploiting inner-domain structure helps cross-domain generalization ability LRE-SVMs and LRE-LSSVMs Learn locality domain property based on E-SVMs Exploit the low-rank structure of prediction matrix Complementary to current CNN based (global) domain adaptation methods. 李文 苏黎世联邦理工学院 9/20/
58 Future Trends Increasingly important in deep learning era large scale training data => high cost on annotation new principles: GRL, AdaBN/AutoDIAL, JAN, CycleGAN New issues: domain generalization, transfer learning, heterogeneous domain adaptation Task-oriented: synthetic to real adaptation, semantic segmentation, video recognition, image captioning, VQA 李文 苏黎世联邦理工学院 9/20/
59 Collaborators and References Collaborators Zheng Xu Li Niu Dengxin Dai Dong Xu Luc Van Gool References 1) W. Li, Z. Xu, D. Xu, D. Dai, and L. Van Gool. Domain Generalization and Adaptation using Low Rank Exemplar SVMs. In T-PAMI, 2017 (In Press) 2) Z. Xu, W. Li, L. Niu, and D. Xu. Exploiting Low-rank Structure from Latent Domains for Domain Generalization. In ECCV 李文 苏黎世联邦理工学院 9/20/
60 WebVision Challenge Learning from Web Data Flickr Images Google Images, Learning Representation CNN Categorization Detection Segmentation Dataset 1,000 categories, 2.4 million images, 50,000 validation images Challenge WebVision classification track Transfer learning track $10,000 cash prize! 李文 苏黎世联邦理工学院 9/20/
61 Thank you! 欢迎提问和指正! Wen Li - Learning from Web Data 62
DMAIN adaptation techniques, which aim to reduce the
Domain Generalization and Adaptation using Low Rank Exemplar SVMs Wen Li, Zheng Xu, Dong Xu, Dengxin Dai, and Luc Van Gool Abstract Domain adaptation between diverse source and target domains is a challenging
More informationCONSTRAINED SEMI-SUPERVISED LEARNING ATTRIBUTES USING ATTRIBUTES AND COMPARATIVE
CONSTRAINED SEMI-SUPERVISED LEARNING USING ATTRIBUTES AND COMPARATIVE ATTRIBUTES Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta The Robotics Institute Carnegie Mellon University SUPERVISION BIG-DATA
More informationBeyond R-CNN detection: Learning to Merge Contextual Attribute
Brain Unleashing Series - Beyond R-CNN detection: Learning to Merge Contextual Attribute Shu Kong CS, ICS, UCI 2015-1-29 Outline 1. RCNN is essentially doing classification, without considering contextual
More informationDeep Networks and Beyond. Alan Yuille Bloomberg Distinguished Professor Depts. Cognitive Science and Computer Science Johns Hopkins University
Deep Networks and Beyond Alan Yuille Bloomberg Distinguished Professor Depts. Cognitive Science and Computer Science Johns Hopkins University Artificial Intelligence versus Human Intelligence Understanding
More informationObject Recognition: Conceptual Issues. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman Issues in recognition The statistical viewpoint Generative vs. discriminative methods
More informationShu Kong. Department of Computer Science, UC Irvine
Ubiquitous Fine-Grained Computer Vision Shu Kong Department of Computer Science, UC Irvine Outline 1. Problem definition 2. Instantiation 3. Challenge 4. Fine-grained classification with holistic representation
More informationShu Kong. Department of Computer Science, UC Irvine
Ubiquitous Fine-Grained Computer Vision Shu Kong Department of Computer Science, UC Irvine Outline 1. Problem definition 2. Instantiation 3. Challenge and philosophy 4. Fine-grained classification with
More informationLearning visual biases from human imagination
Learning visual biases from human imagination Carl Vondrick Hamed Pirsiavash Aude Oliva Antonio Torralba Massachusetts Institute of Technology University of Maryland, Baltimore County {vondrick,oliva,torralba}@mit.edu
More informationAutomatic Beautification for Group-photo Facial Expressions using Novel Bayesian GANs
Automatic Beautification for Group-photo Facial Expressions using Novel Bayesian GANs Ji Liu 1, Shuai Li 1,2, Wenfeng Song 1, Liang Liu 1, Hong Qin 3, and Aimin Hao 1 1 Beihang University, 2 Beihang University
More informationMulti-attention Guided Activation Propagation in CNNs
Multi-attention Guided Activation Propagation in CNNs Xiangteng He and Yuxin Peng (B) Institute of Computer Science and Technology, Peking University, Beijing, China pengyuxin@pku.edu.cn Abstract. CNNs
More informationObject Detectors Emerge in Deep Scene CNNs
Object Detectors Emerge in Deep Scene CNNs Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba Presented By: Collin McCarthy Goal: Understand how objects are represented in CNNs Are
More informationNetwork Dissection: Quantifying Interpretability of Deep Visual Representation
Name: Pingchuan Ma Student number: 3526400 Date: August 19, 2018 Seminar: Explainable Machine Learning Lecturer: PD Dr. Ullrich Köthe SS 2018 Quantifying Interpretability of Deep Visual Representation
More informationVideo Saliency Detection via Dynamic Consistent Spatio- Temporal Attention Modelling
AAAI -13 July 16, 2013 Video Saliency Detection via Dynamic Consistent Spatio- Temporal Attention Modelling Sheng-hua ZHONG 1, Yan LIU 1, Feifei REN 1,2, Jinghuan ZHANG 2, Tongwei REN 3 1 Department of
More informationActive Deformable Part Models Inference
Active Deformable Part Models Inference Menglong Zhu Nikolay Atanasov George J. Pappas Kostas Daniilidis GRASP Laboratory, University of Pennsylvania 3330 Walnut Street, Philadelphia, PA 19104, USA Abstract.
More informationMedical Image Analysis
Medical Image Analysis 1 Co-trained convolutional neural networks for automated detection of prostate cancer in multiparametric MRI, 2017, Medical Image Analysis 2 Graph-based prostate extraction in t2-weighted
More informationPutting Context into. Vision. September 15, Derek Hoiem
Putting Context into Vision Derek Hoiem September 15, 2004 Questions to Answer What is context? How is context used in human vision? How is context currently used in computer vision? Conclusions Context
More informationGIANT: Geo-Informative Attributes for Location Recognition and Exploration
GIANT: Geo-Informative Attributes for Location Recognition and Exploration Quan Fang, Jitao Sang, Changsheng Xu Institute of Automation, Chinese Academy of Sciences October 23, 2013 Where is this? La Sagrada
More informationDEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS
SEOUL Oct.7, 2016 DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS Gunhee Kim Computer Science and Engineering Seoul National University October
More informationAction Recognition. Computer Vision Jia-Bin Huang, Virginia Tech. Many slides from D. Hoiem
Action Recognition Computer Vision Jia-Bin Huang, Virginia Tech Many slides from D. Hoiem This section: advanced topics Convolutional neural networks in vision Action recognition Vision and Language 3D
More informationHierarchical Convolutional Features for Visual Tracking
Hierarchical Convolutional Features for Visual Tracking Chao Ma Jia-Bin Huang Xiaokang Yang Ming-Husan Yang SJTU UIUC SJTU UC Merced ICCV 2015 Background Given the initial state (position and scale), estimate
More informationarxiv: v1 [cs.cv] 17 Aug 2017
Deep Learning for Medical Image Analysis Mina Rezaei, Haojin Yang, Christoph Meinel Hasso Plattner Institute, Prof.Dr.Helmert-Strae 2-3, 14482 Potsdam, Germany {mina.rezaei,haojin.yang,christoph.meinel}@hpi.de
More informationDifferential Attention for Visual Question Answering
Differential Attention for Visual Question Answering Badri Patro and Vinay P. Namboodiri IIT Kanpur { badri,vinaypn }@iitk.ac.in Abstract In this paper we aim to answer questions based on images when provided
More informationConvolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN) Algorithm and Some Applications in Computer Vision Luo Hengliang Institute of Automation June 10, 2014 Luo Hengliang (Institute of Automation) Convolutional Neural Networks
More informationFamily Member Identification from Photo Collections
2015 IEEE Winter Conference on Applications of Computer Vision Family Member Identification from Photo Collections Qieyun Dai 1,2 Peter Carr 2 Leonid Sigal 2 Derek Hoiem 1 1 University of Illinois at Urbana-Champaign
More informationAttentional Masking for Pre-trained Deep Networks
Attentional Masking for Pre-trained Deep Networks IROS 2017 Marcus Wallenberg and Per-Erik Forssén Computer Vision Laboratory Department of Electrical Engineering Linköping University 2014 2017 Per-Erik
More informationRegion Proposals. Jan Hosang, Rodrigo Benenson, Piotr Dollar, Bernt Schiele
Region Proposals Jan Hosang, Rodrigo Benenson, Piotr Dollar, Bernt Schiele Who has read a proposal paper? 2 Who has read a proposal paper? Who knows what Average Recall is? 2 Who has read a proposal paper?
More informationTask-driven Webpage Saliency
Task-driven Webpage Saliency Quanlong Zheng 1[0000 0001 5059 0078], Jianbo Jiao 1,2[0000 0003 0833 5115], Ying Cao 1[0000 0002 9288 3167], and Rynson W.H. Lau 1[0000 0002 8957 8129] 1 Department of Computer
More informationPart 1: Bag-of-words models. by Li Fei-Fei (Princeton)
Part 1: Bag-of-words models by Li Fei-Fei (Princeton) Object Bag of words Analogy to documents Of all the sensory impressions proceeding to the brain, the visual experiences are the dominant ones. Our
More informationLearning Saliency Maps for Object Categorization
Learning Saliency Maps for Object Categorization Frank Moosmann, Diane Larlus, Frederic Jurie INRIA Rhône-Alpes - GRAVIR - CNRS, {Frank.Moosmann,Diane.Larlus,Frederic.Jurie}@inrialpes.de http://lear.inrialpes.fr
More informationRecognising Human-Object Interaction via Exemplar based Modelling
2013 IEEE International Conference on Computer Vision Recognising Human-Object Interaction via Exemplar based Modelling Jian-Fang Hu, Wei-Shi Zheng, Jianhuang Lai, Shaogang Gong, and Tao Xiang School of
More informationAcquiring Visual Classifiers from Human Imagination
Acquiring Visual Classifiers from Human Imagination Carl Vondrick, Hamed Pirsiavash, Aude Oliva, Antonio Torralba Massachusetts Institute of Technology {vondrick,hpirsiav,oliva,torralba}@mit.edu White
More informationActions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition
Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition Stefan Mathe, Cristian Sminchisescu Presented by Mit Shah Motivation Current Computer Vision Annotations subjectively
More informationVisual Saliency Based on Multiscale Deep Features Supplementary Material
Visual Saliency Based on Multiscale Deep Features Supplementary Material Guanbin Li Yizhou Yu Department of Computer Science, The University of Hong Kong https://sites.google.com/site/ligb86/mdfsaliency/
More informationVision: Over Ov view Alan Yuille
Vision: Overview Alan Yuille Why is Vision Hard? Complexity and Ambiguity of Images. Range of Vision Tasks. More 10x10 images 256^100 = 6.7 x 10 ^240 than the total number of images seen by all humans
More informationWhen Saliency Meets Sentiment: Understanding How Image Content Invokes Emotion and Sentiment
When Saliency Meets Sentiment: Understanding How Image Content Invokes Emotion and Sentiment Honglin Zheng1, Tianlang Chen2, Jiebo Luo3 Department of Computer Science University of Rochester, Rochester,
More informationMotivation: Attention: Focusing on specific parts of the input. Inspired by neuroscience.
Outline: Motivation. What s the attention mechanism? Soft attention vs. Hard attention. Attention in Machine translation. Attention in Image captioning. State-of-the-art. 1 Motivation: Attention: Focusing
More informationPower SVM: Generalization with Exemplar Classification Uncertainty
Power SVM: Generalization with Exemplar Classification Uncertainty Weiyu Zhang Stella X. Yu Shang-Hua Teng University of Pennsylvania UC Berkeley / ICSI University of Southern California zhweiyu@seas.upenn.edu
More informationAnnotation and Retrieval System Using Confabulation Model for ImageCLEF2011 Photo Annotation
Annotation and Retrieval System Using Confabulation Model for ImageCLEF2011 Photo Annotation Ryo Izawa, Naoki Motohashi, and Tomohiro Takagi Department of Computer Science Meiji University 1-1-1 Higashimita,
More informationFine-Grained Image Classification Using Color Exemplar Classifiers
Fine-Grained Image Classification Using Color Exemplar Classifiers Chunjie Zhang 1, Wei Xiong 1, Jing Liu 2, Yifan Zhang 2, Chao Liang 3, and Qingming Huang 1,4 1 School of Computer and Control Engineering,
More informationChapter 2 Where Next in Object Recognition and how much Supervision Do We Need?
Chapter 2 Where Next in Object Recognition and how much Supervision Do We Need? Sandra Ebert and Bernt Schiele Abstract Object class recognition is an active topic in computer vision still presenting many
More informationGeneric object recognition May 17 th, 2018
Generic object recognition May 17 th, 2018 Yong Jae Lee UC Davis Visual words Map high dimensional descriptors to tokens/words by quantizing the feature space Quantize via clustering, let cluster centers
More informationRich feature hierarchies for accurate object detection and semantic segmentation
Rich feature hierarchies for accurate object detection and semantic segmentation Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik UC Berkeley Tech Report @ http://arxiv.org/abs/1311.2524! Detection
More informationFine-Grained Image Classification Using Color Exemplar Classifiers
Fine-Grained Image Classification Using Color Exemplar Classifiers Chunjie Zhang 1, Wei Xiong 1, Jing Liu 2, Yifan Zhang 2, Chao Liang 3, Qingming Huang 1, 4 1 School of Computer and Control Engineering,
More informationIncorporating Word Correlation Knowledge into Topic Modeling. Pengtao Xie. Joint work with Diyi Yang and Eric Xing
Incorporating Word Correlation Knowledge into Topic Modeling Pengtao Xie Joint work with Diyi Yang and Eric Xing 1 Outline Motivation Incorporating Word Correlation Knowledge into Topic Modeling Experiments
More informationGroup Behavior Analysis and Its Applications
Group Behavior Analysis and Its Applications CVPR 2015 Tutorial Lecturers: Hyun Soo Park (University of Pennsylvania) Wongun Choi (NEC America Laboratory) Schedule 08:30am-08:50am 08:50am-09:50am 09:50am-10:10am
More informationarxiv: v1 [cs.cv] 16 Jun 2012
How important are Deformable Parts in the Deformable Parts Model? Santosh K. Divvala, Alexei A. Efros, Martial Hebert Carnegie Mellon University arxiv:1206.3714v1 [cs.cv] 16 Jun 2012 Abstract. The main
More informationObject Detection. Honghui Shi IBM Research Columbia
Object Detection Honghui Shi IBM Research 2018.11.27 @ Columbia Outline Problem Evaluation Methods Directions Problem Image classification: Horse (People, Dog, Truck ) Object detection: categories & locations
More informationWhere should saliency models look next?
Where should saliency models look next? Zoya Bylinskii 1, Adrià Recasens 1, Ali Borji 2, Aude Oliva 1, Antonio Torralba 1, and Frédo Durand 1 1 Computer Science and Artificial Intelligence Laboratory Massachusetts
More informationProgressive Attention Guided Recurrent Network for Salient Object Detection
Progressive Attention Guided Recurrent Network for Salient Object Detection Xiaoning Zhang, Tiantian Wang, Jinqing Qi, Huchuan Lu, Gang Wang Dalian University of Technology, China Alibaba AILabs, China
More informationAClass: A Simple, Online Probabilistic Classifier. Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL
AClass: A Simple, Online Probabilistic Classifier Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL AClass: A Simple, Online Probabilistic Classifier or How I learned to stop worrying
More informationImproving the Interpretability of DEMUD on Image Data Sets
Improving the Interpretability of DEMUD on Image Data Sets Jake Lee, Jet Propulsion Laboratory, California Institute of Technology & Columbia University, CS 19 Intern under Kiri Wagstaff Summer 2018 Government
More informationIN this paper we examine the role of shape prototypes in
On the Role of Shape Prototypes in Hierarchical Models of Vision Michael D. Thomure, Melanie Mitchell, and Garrett T. Kenyon To appear in Proceedings of the International Joint Conference on Neural Networks
More informationTowards disentangling underlying explanatory factors Yoshua Bengio. July 13th, 2018 ICML 2018 Workshop on Learning with Limited Labels
Towards disentangling underlying explanatory factors Yoshua Bengio July 13th, 2018 ICML 2018 Workshop on Learning with Limited Labels Generalizing Beyond i.i.d. Data Current ML theory is strongly dependent
More informationNeuro-Inspired Statistical. Rensselaer Polytechnic Institute National Science Foundation
Neuro-Inspired Statistical Pi Prior Model lfor Robust Visual Inference Qiang Ji Rensselaer Polytechnic Institute National Science Foundation 1 Status of Computer Vision CV has been an active area for over
More informationComputational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq
Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Philipp Bucher Wednesday January 21, 2009 SIB graduate school course EPFL, Lausanne ChIP-seq against histone variants: Biological
More informationEfficient Boosted Exemplar-based Face Detection
Efficient Boosted Exemplar-based Face Detection Haoxiang Li, Zhe Lin, Jonathan Brandt, Xiaohui Shen, Gang Hua Stevens Institute of Technology Hoboken, NJ 07030 {hli18, ghua}@stevens.edu Adobe Research
More informationAssigning Relative Importance to Scene Elements
Assigning Relative Importance to Scene Elements Igor L. O Bastos, William Robson Schwartz Smart Surveillance Interest Group, Department of Computer Science Universidade Federal de Minas Gerais, Minas Gerais,
More informationDEEP CONVOLUTIONAL ACTIVATION FEATURES FOR LARGE SCALE BRAIN TUMOR HISTOPATHOLOGY IMAGE CLASSIFICATION AND SEGMENTATION
DEEP CONVOLUTIONAL ACTIVATION FEATURES FOR LARGE SCALE BRAIN TUMOR HISTOPATHOLOGY IMAGE CLASSIFICATION AND SEGMENTATION Yan Xu1,2, Zhipeng Jia2,, Yuqing Ai2,, Fang Zhang2,, Maode Lai4, Eric I-Chao Chang2
More informationComputational modeling of visual attention and saliency in the Smart Playroom
Computational modeling of visual attention and saliency in the Smart Playroom Andrew Jones Department of Computer Science, Brown University Abstract The two canonical modes of human visual attention bottomup
More informationToward a Taxonomy and Computational Models of Abnormalities in Images
Toward a Taxonomy and Computational Models of Abnormalities in Images Babak Saleh Dept. of Computer Science Rutgers University New Jersey, USA Ahmed Elgammal Dept. of Computer Science Rutgers University
More informationReading Between The Lines: Object Localization Using Implicit Cues from Image Tags
Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman Department of Computer Science University of Texas at Austin {sjhwang,grauman}@cs.utexas.edu
More informationKai-Wei Chang UCLA. What It Takes to Control Societal Bias in Natural Language Processing. References:
What It Takes to Control Societal Bias in Natural Language Processing Kai-Wei Chang UCLA References: http://kwchang.net Kai-Wei Chang (kwchang.net/talks/sp.html) 1 A father and son get in a car crash and
More informationVisual Scene Understanding
Visual Scene Understanding Aude Oliva Department of Brain and Cognitive Sciences Massachusetts Institute of Technology Website: http://cvcl.mit.edu PPA High-level Scene Representation I. Long-term Memory
More informationAutomatic Prostate Cancer Classification using Deep Learning. Ida Arvidsson Centre for Mathematical Sciences, Lund University, Sweden
Automatic Prostate Cancer Classification using Deep Learning Ida Arvidsson Centre for Mathematical Sciences, Lund University, Sweden Outline Autoencoders, theory Motivation, background and goal for prostate
More informationarxiv: v1 [cs.cv] 21 Nov 2018
Rethinking ImageNet Pre-training Kaiming He Ross Girshick Piotr Dollár Facebook AI Research (FAIR) arxiv:1811.8883v1 [cs.cv] 21 Nov 218 Abstract We report competitive results on object detection and instance
More informationFlexible, High Performance Convolutional Neural Networks for Image Classification
Flexible, High Performance Convolutional Neural Networks for Image Classification Dan C. Cireşan, Ueli Meier, Jonathan Masci, Luca M. Gambardella, Jürgen Schmidhuber IDSIA, USI and SUPSI Manno-Lugano,
More informationJia Jia Tsinghua University 26/09/2017
Jia Jia jjia@tsinghua.edu.cn Tsinghua University 26/09/2017 Stage 1: Online detection of mental health problems Stress Detection via Harvesting Social Media Detecting Stress Based on Social Interactions
More informationLearned Region Sparsity and Diversity Also Predict Visual Attention
Learned Region Sparsity and Diversity Also Predict Visual Attention Zijun Wei 1, Hossein Adeli 2, Gregory Zelinsky 1,2, Minh Hoai 1, Dimitris Samaras 1 1. Department of Computer Science 2. Department of
More informationarxiv: v1 [cs.cv] 12 Dec 2016
Text-guided Attention Model for Image Captioning Jonghwan Mun, Minsu Cho, Bohyung Han Department of Computer Science and Engineering, POSTECH, Korea {choco1916, mscho, bhhan}@postech.ac.kr arxiv:1612.03557v1
More informationSemi-Supervised Disentangling of Causal Factors. Sargur N. Srihari
Semi-Supervised Disentangling of Causal Factors Sargur N. srihari@cedar.buffalo.edu 1 Topics in Representation Learning 1. Greedy Layer-Wise Unsupervised Pretraining 2. Transfer Learning and Domain Adaptation
More informationarxiv: v2 [cs.cv] 19 Dec 2017
An Ensemble of Deep Convolutional Neural Networks for Alzheimer s Disease Detection and Classification arxiv:1712.01675v2 [cs.cv] 19 Dec 2017 Jyoti Islam Department of Computer Science Georgia State University
More informationLearning Spatiotemporal Gaps between Where We Look and What We Focus on
Express Paper Learning Spatiotemporal Gaps between Where We Look and What We Focus on Ryo Yonetani 1,a) Hiroaki Kawashima 1,b) Takashi Matsuyama 1,c) Received: March 11, 2013, Accepted: April 24, 2013,
More informationVector Learning for Cross Domain Representations
Vector Learning for Cross Domain Representations Shagan Sah, Chi Zhang, Thang Nguyen, Dheeraj Kumar Peri, Ameya Shringi, Raymond Ptucha Rochester Institute of Technology, Rochester, NY 14623, USA arxiv:1809.10312v1
More informationA Hierarchical Visual Saliency Model for Character Detection in Natural Scenes
A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes Renwu Gao 1(B), Faisal Shafait 2, Seiichi Uchida 3, and Yaokai Feng 3 1 Information Sciene and Electrical Engineering, Kyushu
More informationHigh-level Vision. Bernd Neumann Slides for the course in WS 2004/05. Faculty of Informatics Hamburg University Germany
High-level Vision Bernd Neumann Slides for the course in WS 2004/05 Faculty of Informatics Hamburg University Germany neumann@informatik.uni-hamburg.de http://kogs-www.informatik.uni-hamburg.de 1 Contents
More informationExperiment Presentation CS Chris Thomas Experiment: What is an Object? Alexe, Bogdan, et al. CVPR 2010
Experiment Presentation CS 3710 Chris Thomas Experiment: What is an Object? Alexe, Bogdan, et al. CVPR 2010 1 Preliminaries Code for What is An Object? available online Version 2.2 Achieves near 90% recall
More informationNMF-Density: NMF-Based Breast Density Classifier
NMF-Density: NMF-Based Breast Density Classifier Lahouari Ghouti and Abdullah H. Owaidh King Fahd University of Petroleum and Minerals - Department of Information and Computer Science. KFUPM Box 1128.
More informationComparative object similarity for improved recognition with few or no examples
Comparative object similarity for improved recognition with few or no examples Gang Wang David Forsyth 2 Derek Hoiem 2 Dept. of Electrical and Computer Engineering 2 Dept. of Computer Science University
More informationA Review of Co-saliency Detection Technique: Fundamentals, Applications, and Challenges
REGULAR PAPER 1 A Review of Co-saliency Detection Technique: Fundamentals, Applications, and Challenges Dingwen Zhang, Huazhu Fu, Junwei Han, Senior Member, IEEE, Feng Wu, Fellow, IEEE arxiv:1604.07090v2
More informationAutomated Volumetric Cardiac Ultrasound Analysis
Whitepaper Automated Volumetric Cardiac Ultrasound Analysis ACUSON SC2000 Volume Imaging Ultrasound System Bogdan Georgescu, Ph.D. Siemens Corporate Research Princeton, New Jersey USA Answers for life.
More informationUNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014
UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014 Exam policy: This exam allows two one-page, two-sided cheat sheets (i.e. 4 sides); No other materials. Time: 2 hours. Be sure to write
More informationBottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering SUPPLEMENTARY MATERIALS 1. Implementation Details 1.1. Bottom-Up Attention Model Our bottom-up attention Faster R-CNN
More informationarxiv: v2 [cs.cv] 17 Jun 2016
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions? Abhishek Das 1, Harsh Agrawal 1, C. Lawrence Zitnick 2, Devi Parikh 1, Dhruv Batra 1 1 Virginia Tech,
More informationCSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil
CSE 5194.01 - Introduction to High-Perfomance Deep Learning ImageNet & VGG Jihyung Kil ImageNet Classification with Deep Convolutional Neural Networks Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton,
More informationCapturing human category representations by sampling in deep feature spaces
Capturing human category representations by sampling in deep feature spaces Joshua C. Peterson 1 (jpeterson@berkeley.edu) Jordan W. Suchow 1 (suchow@berkeley.edu) Krisha Aghi 2 (kaghi@berkeley.edu) Alexander
More informationUnsupervised Learning of Micro-Action Exemplars using a Product Manifold
8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011 Unsupervised Learning of Micro-Action Exemplars using a Product Manifold Stephen O Hara and Bruce A. Draper Colorado
More informationDomain Adversarial Training for Accented Speech Recognition
Domain Adversarial Training for Accented Speech Recognition Sining Sun [1-3], Ching-Feng Yeh [2], Mei-Yuh Hwang [2], Mari Ostendorf [3], Lei Xie [1] Northwestern Polytechnical University [1] Mobvoi AI
More informationChapter 1. Introduction
Chapter 1 Introduction 1.1 Motivation and Goals The increasing availability and decreasing cost of high-throughput (HT) technologies coupled with the availability of computational tools and data form a
More informationEfficient Deep Model Selection
Efficient Deep Model Selection Jose Alvarez Researcher Data61, CSIRO, Australia GTC, May 9 th 2017 www.josemalvarez.net conv1 conv2 conv3 conv4 conv5 conv6 conv7 conv8 softmax prediction???????? Num Classes
More informationThe University of Tokyo, NVAIL Partner Yoshitaka Ushiku
Recognize, Describe, and Generate: Introduction of Recent Work at MIL The University of Tokyo, NVAIL Partner Yoshitaka Ushiku MIL: Machine Intelligence Laboratory Beyond Human Intelligence Based on Cyber-Physical
More informationVisual interpretation in pathology
13 Visual interpretation in pathology Tissue architecture (alteration) evaluation e.g., for grading prostate cancer Immunohistochemistry (IHC) staining scoring e.g., HER2 in breast cancer (companion diagnostic
More informationRecommending Outfits from Personal Closet
Recommending Outfits from Personal Closet Pongsate Tangseng 1, Kota Yamaguchi 2, and Takayuki Okatani 1,3 1 Tohoku University, Sendai, Japan 2 CyberAgent, Inc., Tokyo, Japan 3 RIKEN Center for AIP, Tokyo,
More informationDeepMiner: Discovering Interpretable Representations for Mammogram Classification and Explanation
DeepMiner: Discovering Interpretable Representations for Mammogram Classification and Explanation Jimmy Wu 1, Bolei Zhou 1, Diondra Peck 2, Scott Hsieh 3, Vandana Dialani, MD 4 Lester Mackey 5, and Genevieve
More informationCervical cytology intelligent diagnosis based on object detection technology
Cervical cytology intelligent diagnosis based on object detection technology Meiquan Xu xumeiquan@126.com Weixiu Zeng Semptian Co., Ltd. Machine Learning Lab. zengweixiu@gmail.com Hunhui Wu 736886978@qq.com
More informationTowards The Deep Model: Understanding Visual Recognition Through Computational Models. Panqu Wang Dissertation Defense 03/23/2017
Towards The Deep Model: Understanding Visual Recognition Through Computational Models Panqu Wang Dissertation Defense 03/23/2017 Summary Human Visual Recognition (face, object, scene) Simulate Explain
More informationAffective Computing in 2018
Affective Computing in 2018 Department of Computer Science and Technology Tsinghua University Jia Jia 13651399048 jjia@mail.tsinghua.edu.cn 1 Affective Computing Emotion is the driving force of user s
More informationDifferentiating Tumor and Edema in Brain Magnetic Resonance Images Using a Convolutional Neural Network
Original Article Differentiating Tumor and Edema in Brain Magnetic Resonance Images Using a Convolutional Neural Network Aida Allahverdi 1, Siavash Akbarzadeh 1, Alireza Khorrami Moghaddam 2, Armin Allahverdy
More informationY-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images
Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images Sachin Mehta 1, Ezgi Mercan 1, Jamen Bartlett 2, Donald Weaver 2, Joann G. Elmore 1, and Linda Shapiro 1 1 University
More information