DCASE 2016 CONVOLUTIONAL NEURAL NETWORKS FOR ACOUSTIC SCENE CLASSIFICATION

Size: px
Start display at page:

Download "DCASE 2016 CONVOLUTIONAL NEURAL NETWORKS FOR ACOUSTIC SCENE CLASSIFICATION"

Transcription

1 DCASE 2016 CONVOLUTIONAL NEURAL NETWORKS FOR ACOUSTIC SCENE CLASSIFICATION Michele Valenti 1 (valenti.michele.w@gmail.com), Aleksandr Diment 2, Giambattista Parascandolo 2, Stefano Squartini 1, Tuomas Virtanen 2 1 Università Politecnica delle Marche, Italy 2 Tampere University of Technology, Finland

2 DCASE 2016 CONVOLUTIONAL NEURAL NETWORKS FOR ACOUSTIC SCENE CLASSIFICATION Michele Valenti 1 (valenti.michele.w@gmail.com), Aleksandr Diment 2, Giambattista Parascandolo 2, Stefano Squartini 1, Tuomas Virtanen 2 1 Università Politecnica delle Marche, Italy 2 Tampere University of Technology, Finland

3 Outline Introduction Our system Training modes Results Challenge ranking

4 Introduction What is acoustic scene classification?

5 Introduction What is acoustic scene classification? Home Car Forest path Audio

6 Our system Overview Audio Feature extraction Sequence splitting CNN Scores averaging Label

7 Our system Audio Features Features Raw audio Log-mel spectrogram

8 Our system Features Sequence splitting Sequence Raw audio segment Log-mel spectrogram Sequence splitting

9 Our system Convolutional neural network Sequence

10 Our system Convolutional neural network Sequences CNN 128 Sequence Feature maps

11 Our system Convolutional neural network Sequences CNN 128 Batch normalization Sequence Feature maps

12 Our system Convolutional neural network Sequences CNN Sequence Feature maps Subsampled feature maps

13 Our system Convolutional neural network Sequences CNN Sequence Feature maps Subsampled feature maps New feature maps

14 Our system Convolutional neural network Sequences CNN Time shrinking Sequence Feature maps Subsampled feature maps New feature maps

15 Our system Sequences CNN Convolutional neural network Flattening Sequence Feature maps Subsampled feature maps New feature maps

16 Our system Sequences CNN Convolutional neural network Fully-connected softmax layer 256 Sequence Feature maps Subsampled feature maps New feature maps

17 Our system Sequences Convolutional neural network 128 Sequence Feature maps New Subsampled feature maps feature maps CNN

18 Our system Scores averaging Class prediction scores Prediction scores Scores averaging

19 Our system Prediction scores Scores averaging Scores averaging Class prediction scores! " Σ argmax File s class

20 Training

21 Training Cross-validation setup Training + validation Test Fold 1 Test Fold 2 Test Fold 3 Test Fold 4

22 Training Non-full training Training + validation Fold n Test Training Validation

23 Training Non-full training Training + validation Fold n Test Training Non-full training Validation

24 Training Non-full training Training Training + validation Fold n Test Training Validation Accuracies Validation Epochs

25 Training Non-full training Training Training + validation Fold n Test Training Validation Accuracies Convergence time Validation Epochs

26 Training Non-full training Training + validation Fold n Test Training Validation Training

27 Training Training + validation Fold n Test Non-full training Full training Training Training Validation

28 Results Test data Training + validation Test Fold 1 Test Fold 2 Test Fold 3 Test Fold 4

29 Results Sequence length 80 Non-full training Full training Accuracy (%) ,5 1, Sequence length (s)

30 Results Sequence length 80 Non-full training Full training Accuracy (%) ,5 1, Sequence length (s)

31 Results Sequence length 80 Non-full training Full training Accuracy (%) ,5 1, Sequence length (s)

32 Results Class accuracies Class Accuracy (%) Beach 75.6 Bus 76.9 Café/Restaurant 74.4 Car 91.0 City center 93.6 Forest path 96.2 Grocery store 88.5 Home 80.8 Class Accuracy (%) Library 66.6 Metro station 96.2 Office 97.4 Park 59.0 Residential area 73.1 Train 46.2 Tram 78.2

33 Results Class accuracies Class Accuracy (%) Beach 75.6 Bus 76.9 Café/Restaurant 74.4 Car 91.0 City center 93.6 Forest path 96.2 Grocery store 88.5 Home 80.8 Class Accuracy (%) Library 66.6 Metro station 96.2 Office 34.6% 97.4 Residential area Park 59.0 Residential area 73.1 Train 46.2 Tram % Bus

34 Results Other classifiers System Sequence length (s) Non-full training Accuracy (%) Full training Baseline GMM (MFCC) Two-layer CNN (MFCC) Two-layer MLP (log-mel) One-layer CNN (log-mel) Two-layer CNN (log-mel)

35 Challenge ranking Final training Extended training set Training + validation + test Evaluation set Secret challenge data

36 Challenge ranking Final training Extended training set Training + validation + test Evaluation set Secret challenge data New training New validation

37 Challenge ranking Final training Extended training set Training + validation + test Evaluation set Secret challenge data New training New validation 400 epochs convergence

38 Challenge ranking Final training Extended training set Training + validation + test Evaluation set Secret challenge data Final training for 400 epochs

39 Challenge ranking ,7 88,7 87,7 87,2 86,4 86,4 86,2 85,9 85,6 85,4 84,6 84,1 77,2 62,8

40 DCASE 2016 CONVOLUTIONAL NEURAL NETWORKS FOR ACOUSTIC SCENE CLASSIFICATION Michele Valenti 1 (valenti.michele.w@gmail.com), Aleksandr Diment 2, Giambattista Parascandolo 2, Stefano Squartini 1, Tuomas Virtanen 2 1 Università Politecnica delle Marche, Italy 2 Tampere University of Technology, Finland

41 Results Feature comparison System Sequence length (s) Non-full training Accuracy (%) Full training Two-layer CNN (MFCC) Two-layer CNN (log-mel)

Improved Acoustic Scene Classification with DNN and CNN

Improved Acoustic Scene Classification with DNN and CNN Please contact the conference organizers at dcasechallenge@gmail.com if you require an accessible file, as the files provided by ConfTool Pro to reviewers are filtered to remove author information, and

More information

Acoustic Scene Classification by Ensembling Gradient Boosting Machine and Convolutional Neural Networks

Acoustic Scene Classification by Ensembling Gradient Boosting Machine and Convolutional Neural Networks Acoustic Scee Classificatio by Esemblig Gradiet Boostig Machie ad Covolutioal Neural Networks DCASE 2017 Eduardo Foseca, Rog Gog, Dmitry Bogdaov, Olga Slizovskaia, Emilia Gomez ad Xavier Serra Outlie Itroductio

More information

Enhanced Feature Extraction for Speech Detection in Media Audio

Enhanced Feature Extraction for Speech Detection in Media Audio INTERSPEECH 2017 August 20 24, 2017, Stockholm, Sweden Enhanced Feature Extraction for Speech Detection in Media Audio Inseon Jang 1, ChungHyun Ahn 1, Jeongil Seo 1, Younseon Jang 2 1 Media Research Division,

More information

Attentional Masking for Pre-trained Deep Networks

Attentional Masking for Pre-trained Deep Networks Attentional Masking for Pre-trained Deep Networks IROS 2017 Marcus Wallenberg and Per-Erik Forssén Computer Vision Laboratory Department of Electrical Engineering Linköping University 2014 2017 Per-Erik

More information

Audio-Visual Speech Recognition Using Bimodal-Trained Bottleneck Features for a Person with Severe Hearing Loss

Audio-Visual Speech Recognition Using Bimodal-Trained Bottleneck Features for a Person with Severe Hearing Loss INTERSPEECH 2016 September 8 12, 2016, San Francisco, USA Audio-Visual Speech Recognition Using Bimodal-Trained Features for a Person with Severe Hearing Loss Yuki Takashima 1, Ryo Aihara 1, Tetsuya Takiguchi

More information

Computational modeling of visual attention and saliency in the Smart Playroom

Computational modeling of visual attention and saliency in the Smart Playroom Computational modeling of visual attention and saliency in the Smart Playroom Andrew Jones Department of Computer Science, Brown University Abstract The two canonical modes of human visual attention bottomup

More information

B657: Final Project Report Holistically-Nested Edge Detection

B657: Final Project Report Holistically-Nested Edge Detection B657: Final roject Report Holistically-Nested Edge Detection Mingze Xu & Hanfei Mei May 4, 2016 Abstract Holistically-Nested Edge Detection (HED), which is a novel edge detection method based on fully

More information

Immuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning

Immuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning Immuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning Yi-Hsiang Hsu, MD, SCD Sep 16, 2017 yihsianghsu@hsl.harvard.edu Director & Associate

More information

A Novel Capsule Neural Network Based Model For Drowsiness Detection Using Electroencephalography Signals

A Novel Capsule Neural Network Based Model For Drowsiness Detection Using Electroencephalography Signals A Novel Capsule Neural Network Based Model For Drowsiness Detection Using Electroencephalography Signals Luis Guarda Bräuning (1) Nicolas Astorga (1) Enrique López Droguett (1) Marcio Moura (2) Marcelo

More information

Object Detectors Emerge in Deep Scene CNNs

Object Detectors Emerge in Deep Scene CNNs Object Detectors Emerge in Deep Scene CNNs Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba Presented By: Collin McCarthy Goal: Understand how objects are represented in CNNs Are

More information

Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images

Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images Sachin Mehta 1, Ezgi Mercan 1, Jamen Bartlett 2, Donald Weaver 2, Joann G. Elmore 1, and Linda Shapiro 1 1 University

More information

Learning Convolutional Neural Networks for Graphs

Learning Convolutional Neural Networks for Graphs GA-65449 Learning Convolutional Neural Networks for Graphs Mathias Niepert Mohamed Ahmed Konstantin Kutzkov NEC Laboratories Europe Representation Learning for Graphs Telecom Safety Transportation Industry

More information

Final Report: Automated Semantic Segmentation of Volumetric Cardiovascular Features and Disease Assessment

Final Report: Automated Semantic Segmentation of Volumetric Cardiovascular Features and Disease Assessment Final Report: Automated Semantic Segmentation of Volumetric Cardiovascular Features and Disease Assessment Tony Lindsey 1,3, Xiao Lu 1 and Mojtaba Tefagh 2 1 Department of Biomedical Informatics, Stanford

More information

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil CSE 5194.01 - Introduction to High-Perfomance Deep Learning ImageNet & VGG Jihyung Kil ImageNet Classification with Deep Convolutional Neural Networks Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton,

More information

Acoustic Signal Processing Based on Deep Neural Networks

Acoustic Signal Processing Based on Deep Neural Networks Acoustic Signal Processing Based on Deep Neural Networks Chin-Hui Lee School of ECE, Georgia Tech chl@ece.gatech.edu Joint work with Yong Xu, Yanhui Tu, Qing Wang, Tian Gao, Jun Du, LiRong Dai Outline

More information

Using the Soundtrack to Classify Videos

Using the Soundtrack to Classify Videos Using the Soundtrack to Classify Videos Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA dpwe@ee.columbia.edu http://labrosa.ee.columbia.edu/

More information

Reading Emotions from Speech using Deep Neural Networks

Reading Emotions from Speech using Deep Neural Networks Reading Emotions from Speech using Deep Neural Networks Anusha Balakrishnan Stanford University Computer Science Department anusha@cs.stanford.edu Alisha Rege Stanford University Computer Science Department

More information

Single-Channel Sound Source Localization Based on Discrimination of Acoustic Transfer Functions

Single-Channel Sound Source Localization Based on Discrimination of Acoustic Transfer Functions 3 Single-Channel Sound Source Localization Based on Discrimination of Acoustic Transfer Functions Ryoichi Takashima, Tetsuya Takiguchi and Yasuo Ariki Graduate School of System Informatics, Kobe University,

More information

Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks

Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks Interspeech 2018 2-6 September 2018, Hyderabad Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks Shahin Amiriparian 1, Alice Baird 1, Sahib Julka 1,

More information

Elad Hoffer*, Itay Hubara*, Daniel Soudry

Elad Hoffer*, Itay Hubara*, Daniel Soudry Train longer, generalize better: closing the generalization gap in large batch training of neural networks Elad Hoffer*, Itay Hubara*, Daniel Soudry *Equal contribution Better models - parallelization

More information

ACUTE LEUKEMIA CLASSIFICATION USING CONVOLUTION NEURAL NETWORK IN CLINICAL DECISION SUPPORT SYSTEM

ACUTE LEUKEMIA CLASSIFICATION USING CONVOLUTION NEURAL NETWORK IN CLINICAL DECISION SUPPORT SYSTEM ACUTE LEUKEMIA CLASSIFICATION USING CONVOLUTION NEURAL NETWORK IN CLINICAL DECISION SUPPORT SYSTEM Thanh.TTP 1, Giao N. Pham 1, Jin-Hyeok Park 1, Kwang-Seok Moon 2, Suk-Hwan Lee 3, and Ki-Ryong Kwon 1

More information

EMOTION CLASSIFICATION: HOW DOES AN AUTOMATED SYSTEM COMPARE TO NAÏVE HUMAN CODERS?

EMOTION CLASSIFICATION: HOW DOES AN AUTOMATED SYSTEM COMPARE TO NAÏVE HUMAN CODERS? EMOTION CLASSIFICATION: HOW DOES AN AUTOMATED SYSTEM COMPARE TO NAÏVE HUMAN CODERS? Sefik Emre Eskimez, Kenneth Imade, Na Yang, Melissa Sturge- Apple, Zhiyao Duan, Wendi Heinzelman University of Rochester,

More information

Deep learning and non-negative matrix factorization in recognition of mammograms

Deep learning and non-negative matrix factorization in recognition of mammograms Deep learning and non-negative matrix factorization in recognition of mammograms Bartosz Swiderski Faculty of Applied Informatics and Mathematics Warsaw University of Life Sciences, Warsaw, Poland bartosz_swiderski@sggw.pl

More information

Visual interpretation in pathology

Visual interpretation in pathology 13 Visual interpretation in pathology Tissue architecture (alteration) evaluation e.g., for grading prostate cancer Immunohistochemistry (IHC) staining scoring e.g., HER2 in breast cancer (companion diagnostic

More information

Convolutional Neural Networks for Text Classification

Convolutional Neural Networks for Text Classification Convolutional Neural Networks for Text Classification Sebastian Sierra MindLab Research Group July 1, 2016 ebastian Sierra (MindLab Research Group) NLP Summer Class July 1, 2016 1 / 32 Outline 1 What is

More information

Flexible, High Performance Convolutional Neural Networks for Image Classification

Flexible, High Performance Convolutional Neural Networks for Image Classification Flexible, High Performance Convolutional Neural Networks for Image Classification Dan C. Cireşan, Ueli Meier, Jonathan Masci, Luca M. Gambardella, Jürgen Schmidhuber IDSIA, USI and SUPSI Manno-Lugano,

More information

Segmentation of Cell Membrane and Nucleus by Improving Pix2pix

Segmentation of Cell Membrane and Nucleus by Improving Pix2pix Segmentation of Membrane and Nucleus by Improving Pix2pix Masaya Sato 1, Kazuhiro Hotta 1, Ayako Imanishi 2, Michiyuki Matsuda 2 and Kenta Terai 2 1 Meijo University, Siogamaguchi, Nagoya, Aichi, Japan

More information

arxiv: v1 [cs.lg] 6 Oct 2016

arxiv: v1 [cs.lg] 6 Oct 2016 Combining Generative and Discriminative Neural Networks for Sleep Stages Classification Endang Purnama Giri 1,2, Mohamad Ivan Fanany 1, Aniati Murni Arymurthy 1, arxiv:1610.01741v1 [cs.lg] 6 Oct 2016 1

More information

COMP9444 Neural Networks and Deep Learning 5. Convolutional Networks

COMP9444 Neural Networks and Deep Learning 5. Convolutional Networks COMP9444 Neural Networks and Deep Learning 5. Convolutional Networks Textbook, Sections 6.2.2, 6.3, 7.9, 7.11-7.13, 9.1-9.5 COMP9444 17s2 Convolutional Networks 1 Outline Geometry of Hidden Unit Activations

More information

Automatic Classification of Perceived Gender from Facial Images

Automatic Classification of Perceived Gender from Facial Images Automatic Classification of Perceived Gender from Facial Images Joseph Lemley, Sami Abdul-Wahid, Dipayan Banik Advisor: Dr. Razvan Andonie SOURCE 2016 Outline 1 Introduction 2 Faces - Background 3 Faces

More information

A Deep Learning Approach for Subject Independent Emotion Recognition from Facial Expressions

A Deep Learning Approach for Subject Independent Emotion Recognition from Facial Expressions A Deep Learning Approach for Subject Independent Emotion Recognition from Facial Expressions VICTOR-EMIL NEAGOE *, ANDREI-PETRU BĂRAR *, NICU SEBE **, PAUL ROBITU * * Faculty of Electronics, Telecommunications

More information

Virtual Promenade: A New Serious Game for the Rehabilitation of Older Adults with Post-fall Syndrome

Virtual Promenade: A New Serious Game for the Rehabilitation of Older Adults with Post-fall Syndrome Virtual Promenade: A New Serious Game for the Rehabilitation of Older Adults with Post-fall Syndrome P. Wargnier, E. Phuong, K. Marivan, S. Benveniste, F. Bloch, S. Reingewirtz, G. Kemoun and A.-S. Rigaud

More information

Patch-based Head and Neck Cancer Subtype Classification

Patch-based Head and Neck Cancer Subtype Classification Patch-based Head and Neck Cancer Subtype Classification Wanyi Qian, Guoli Yin, Frances Liu, Advisor: Olivier Gevaert, Mu Zhou, Kevin Brennan Stanford University wqian2@stanford.edu, guoliy@stanford.edu,

More information

MR-Radiomics in Neuro-Oncology

MR-Radiomics in Neuro-Oncology Klinik für Stereotaxie und funktionelle Neurochirurgie Institut für Neurowissenschaften und Medizin MR-Radiomics in Neuro-Oncology M. Kocher Klinik für funktionelle Neurochirurgie und Stereotaxie Forschungszentrum

More information

Speech Enhancement Based on Deep Neural Networks

Speech Enhancement Based on Deep Neural Networks Speech Enhancement Based on Deep Neural Networks Chin-Hui Lee School of ECE, Georgia Tech chl@ece.gatech.edu Joint work with Yong Xu and Jun Du at USTC 1 Outline and Talk Agenda In Signal Processing Letter,

More information

arxiv: v2 [cs.cv] 19 Dec 2017

arxiv: v2 [cs.cv] 19 Dec 2017 An Ensemble of Deep Convolutional Neural Networks for Alzheimer s Disease Detection and Classification arxiv:1712.01675v2 [cs.cv] 19 Dec 2017 Jyoti Islam Department of Computer Science Georgia State University

More information

Efficient Deep Model Selection

Efficient Deep Model Selection Efficient Deep Model Selection Jose Alvarez Researcher Data61, CSIRO, Australia GTC, May 9 th 2017 www.josemalvarez.net conv1 conv2 conv3 conv4 conv5 conv6 conv7 conv8 softmax prediction???????? Num Classes

More information

Lung Nodule Segmentation Using 3D Convolutional Neural Networks

Lung Nodule Segmentation Using 3D Convolutional Neural Networks Lung Nodule Segmentation Using 3D Convolutional Neural Networks Research paper Business Analytics Bernard Bronmans Master Business Analytics VU University, Amsterdam Evert Haasdijk Supervisor VU University,

More information

UNOBTRUSIVE MONITORING OF SPEECH IMPAIRMENTS OF PARKINSON S DISEASE PATIENTS THROUGH MOBILE DEVICES

UNOBTRUSIVE MONITORING OF SPEECH IMPAIRMENTS OF PARKINSON S DISEASE PATIENTS THROUGH MOBILE DEVICES UNOBTRUSIVE MONITORING OF SPEECH IMPAIRMENTS OF PARKINSON S DISEASE PATIENTS THROUGH MOBILE DEVICES T. Arias-Vergara 1, J.C. Vásquez-Correa 1,2, J.R. Orozco-Arroyave 1,2, P. Klumpp 2, and E. Nöth 2 1 Faculty

More information

Sound, Mixtures, and Learning

Sound, Mixtures, and Learning Sound, Mixtures, and Learning Dan Ellis Laboratory for Recognition and Organization of Speech and Audio (LabROSA) Electrical Engineering, Columbia University http://labrosa.ee.columbia.edu/

More information

Applying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem

Applying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem Oral Presentation at MIE 2011 30th August 2011 Oslo Applying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem Kirsi

More information

Mosquito Larva Classification Method Based on Convolutional Neural Networks

Mosquito Larva Classification Method Based on Convolutional Neural Networks Mosquito Larva Classification Method Based on Convolutional Neural Networks A. Sanchez-Ortiz, A. Fierro-Radilla, A. Arista-Jalife, M. Cedillo-Hernandez, M. Nakano-Miyatake ESIME Culhuacan, Instituto Politécnico

More information

NEONATAL SEIZURE DETECTION USING CONVOLUTIONAL NEURAL NETWORKS. Alison O Shea, Gordon Lightbody, Geraldine Boylan, Andriy Temko

NEONATAL SEIZURE DETECTION USING CONVOLUTIONAL NEURAL NETWORKS. Alison O Shea, Gordon Lightbody, Geraldine Boylan, Andriy Temko 2017 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, SEPT. 25 28, 2017, TOKYO, JAPAN NEONATAL SEIZURE DETECTION USING CONVOLUTIONAL NEURAL NETWORKS Alison O Shea, Gordon Lightbody,

More information

SPEECH recordings taken from realistic environments typically

SPEECH recordings taken from realistic environments typically IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING 1 Coupled Dictionaries for Exemplar-based Speech Enhancement and Automatic Speech Recognition Deepak Baby, Student Member, IEEE, Tuomas Virtanen,

More information

Noise-Robust Speech Recognition Technologies in Mobile Environments

Noise-Robust Speech Recognition Technologies in Mobile Environments Noise-Robust Speech Recognition echnologies in Mobile Environments Mobile environments are highly influenced by ambient noise, which may cause a significant deterioration of speech recognition performance.

More information

Speech Enhancement Using Deep Neural Network

Speech Enhancement Using Deep Neural Network Speech Enhancement Using Deep Neural Network Pallavi D. Bhamre 1, Hemangi H. Kulkarni 2 1 Post-graduate Student, Department of Electronics and Telecommunication, R. H. Sapat College of Engineering, Management

More information

PMR5406 Redes Neurais e Lógica Fuzzy. Aula 5 Alguns Exemplos

PMR5406 Redes Neurais e Lógica Fuzzy. Aula 5 Alguns Exemplos PMR5406 Redes Neurais e Lógica Fuzzy Aula 5 Alguns Exemplos APPLICATIONS Two examples of real life applications of neural networks for pattern classification: RBF networks for face recognition FF networks

More information

ImageCLEF2018: Transfer Learning for Deep Learning with CNN for Tuberculosis Classification

ImageCLEF2018: Transfer Learning for Deep Learning with CNN for Tuberculosis Classification ImageCLEF2018: Transfer Learning for Deep Learning with CNN for Tuberculosis Classification Amilcare Gentili 1-2[0000-0002-5623-7512] 1 San Diego VA Health Care System, San Diego, CA USA 2 University of

More information

Network Dissection: Quantifying Interpretability of Deep Visual Representation

Network Dissection: Quantifying Interpretability of Deep Visual Representation Name: Pingchuan Ma Student number: 3526400 Date: August 19, 2018 Seminar: Explainable Machine Learning Lecturer: PD Dr. Ullrich Köthe SS 2018 Quantifying Interpretability of Deep Visual Representation

More information

Highly Accurate Brain Stroke Diagnostic System and Generative Lesion Model. Junghwan Cho, Ph.D. CAIDE Systems, Inc. Deep Learning R&D Team

Highly Accurate Brain Stroke Diagnostic System and Generative Lesion Model. Junghwan Cho, Ph.D. CAIDE Systems, Inc. Deep Learning R&D Team Highly Accurate Brain Stroke Diagnostic System and Generative Lesion Model Junghwan Cho, Ph.D. CAIDE Systems, Inc. Deep Learning R&D Team Established in September, 2016 at 110 Canal st. Lowell, MA 01852,

More information

An on-line VAD based on Multi-Normalisation Scoring (MNS) of observation likelihoods

An on-line VAD based on Multi-Normalisation Scoring (MNS) of observation likelihoods An on-line VAD based on Multi-Normalisation Scoring (MNS) of observation likelihoods Igor Odriozola, Inma Hernaez, Eva Navas Aholab Signal Processing Laboratory, University of the Basque Country (UPV/EHU),

More information

Incorporation of Imaging-Based Functional Assessment Procedures into the DICOM Standard Draft version 0.1 7/27/2011

Incorporation of Imaging-Based Functional Assessment Procedures into the DICOM Standard Draft version 0.1 7/27/2011 Incorporation of Imaging-Based Functional Assessment Procedures into the DICOM Standard Draft version 0.1 7/27/2011 I. Purpose Drawing from the profile development of the QIBA-fMRI Technical Committee,

More information

The Impact of Visual Saliency Prediction in Image Classification

The Impact of Visual Saliency Prediction in Image Classification Dublin City University Insight Centre for Data Analytics Universitat Politecnica de Catalunya Escola Tècnica Superior d Enginyeria de Telecomunicacions de Barcelona Eric Arazo Sánchez The Impact of Visual

More information

Recognition & Organization of Speech and Audio

Recognition & Organization of Speech and Audio Recognition & Organization of Speech and Audio Dan Ellis Electrical Engineering, Columbia University http://www.ee.columbia.edu/~dpwe/ Outline 1 2 3 4 5 Introducing Tandem modeling

More information

Visual Scene Understanding

Visual Scene Understanding Visual Scene Understanding Aude Oliva Department of Brain and Cognitive Sciences Massachusetts Institute of Technology Website: http://cvcl.mit.edu PPA High-level Scene Representation I. Long-term Memory

More information

Image Classification with TensorFlow: Radiomics 1p/19q Chromosome Status Classification Using Deep Learning

Image Classification with TensorFlow: Radiomics 1p/19q Chromosome Status Classification Using Deep Learning Image Classification with TensorFlow: Radiomics 1p/19q Chromosome Status Classification Using Deep Learning Charles Killam, LP.D. Certified Instructor, NVIDIA Deep Learning Institute NVIDIA Corporation

More information

Robust Neural Encoding of Speech in Human Auditory Cortex

Robust Neural Encoding of Speech in Human Auditory Cortex Robust Neural Encoding of Speech in Human Auditory Cortex Nai Ding, Jonathan Z. Simon Electrical Engineering / Biology University of Maryland, College Park Auditory Processing in Natural Scenes How is

More information

CPSC81 Final Paper: Facial Expression Recognition Using CNNs

CPSC81 Final Paper: Facial Expression Recognition Using CNNs CPSC81 Final Paper: Facial Expression Recognition Using CNNs Luis Ceballos Swarthmore College, 500 College Ave., Swarthmore, PA 19081 USA Sarah Wallace Swarthmore College, 500 College Ave., Swarthmore,

More information

Food/Non-food Image Classification and Food Categorization using Pre-Trained GoogLeNet Model

Food/Non-food Image Classification and Food Categorization using Pre-Trained GoogLeNet Model 1 Food/Non-food Image Classification and Food Categorization using Pre-Trained GoogLeNet Model Ashutosh Singla, Lin Yuan, and Touradj Ebrahimi lin.yuan@epfl.ch Outline 2 o Introduction o Image Dataset

More information

Binaural Hearing for Robots Introduction to Robot Hearing

Binaural Hearing for Robots Introduction to Robot Hearing Binaural Hearing for Robots Introduction to Robot Hearing 1Radu Horaud Binaural Hearing for Robots 1. Introduction to Robot Hearing 2. Methodological Foundations 3. Sound-Source Localization 4. Machine

More information

Speech Emotion Detection and Analysis

Speech Emotion Detection and Analysis Speech Emotion Detection and Analysis Helen Chan Travis Ebesu Caleb Fujimori COEN296: Natural Language Processing Prof. Ming-Hwa Wang School of Engineering Department of Computer Engineering Santa Clara

More information

HHS Public Access Author manuscript Med Image Comput Comput Assist Interv. Author manuscript; available in PMC 2018 January 04.

HHS Public Access Author manuscript Med Image Comput Comput Assist Interv. Author manuscript; available in PMC 2018 January 04. Discriminative Localization in CNNs for Weakly-Supervised Segmentation of Pulmonary Nodules Xinyang Feng 1, Jie Yang 1, Andrew F. Laine 1, and Elsa D. Angelini 1,2 1 Department of Biomedical Engineering,

More information

Cervix Cancer Classification using Colposcopy Images by Deep Learning Method

Cervix Cancer Classification using Colposcopy Images by Deep Learning Method Cervix Cancer Classification using Colposcopy Images by Deep Learning Method Vasudha, Ajay Mittal, Mamta Juneja University Institute of Engineering and Technology (UIET), Panjab University, Chandigarh,

More information

Immuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning

Immuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning Immuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning Yi-Hsiang Hsu, MD, SCD Sep 16, 2017 yihsianghsu@hsl.harvard.edu HSL GeneticEpi Center,

More information

Automatic Quality Assessment of Cardiac MRI

Automatic Quality Assessment of Cardiac MRI Automatic Quality Assessment of Cardiac MRI Ilkay Oksuz 02.05.2018 Contact: ilkay.oksuz@kcl.ac.uk http://kclmmag.org 1 Cardiac MRI Quality Issues Need for high quality images Wide range of artefacts Manual

More information

Image-Based Estimation of Real Food Size for Accurate Food Calorie Estimation

Image-Based Estimation of Real Food Size for Accurate Food Calorie Estimation Image-Based Estimation of Real Food Size for Accurate Food Calorie Estimation Takumi Ege, Yoshikazu Ando, Ryosuke Tanno, Wataru Shimoda and Keiji Yanai Department of Informatics, The University of Electro-Communications,

More information

Improving 3D Ultrasound Scan Adequacy Classification Using a Three-Slice Convolutional Neural Network Architecture

Improving 3D Ultrasound Scan Adequacy Classification Using a Three-Slice Convolutional Neural Network Architecture EPiC Series in Health Sciences Volume 2, 2018, Pages 152 156 CAOS 2018. The 18th Annual Meeting of the International Society for Computer Assisted Orthopaedic Surgery Health Sciences Improving 3D Ultrasound

More information

Working Group Meeting #3 DRAFT Summary September 27, 2017

Working Group Meeting #3 DRAFT Summary September 27, 2017 Working Group Meeting #3 DRAFT Summary September 27, 2017 Introduction The Chinatown Revitalization Plan Working Group met as a large group for the third time on September 27, 2017. Approximately 12 committee

More information

arxiv: v1 [cs.cv] 13 Mar 2018

arxiv: v1 [cs.cv] 13 Mar 2018 RESOURCE AWARE DESIGN OF A DEEP CONVOLUTIONAL-RECURRENT NEURAL NETWORK FOR SPEECH RECOGNITION THROUGH AUDIO-VISUAL SENSOR FUSION Matthijs Van keirsbilck Bert Moons Marian Verhelst MICAS, Department of

More information

REZUMAT TEZA DE DOCTORAT

REZUMAT TEZA DE DOCTORAT Investeşte în oameni! FONDUL SOCIAL EUROPEAN Programul Operaţional Sectorial Dezvoltarea Resurselor Umane 2007 2013 Axa prioritară: 1 Educaţia şi formarea profesională în sprijinul creşterii economice

More information

Factoid Question Answering

Factoid Question Answering Factoid Question Answering CS 898 Project June 12, 2017 Salman Mohammed David R. Cheriton School of Computer Science University of Waterloo Motivation Source: https://www.apple.com/newsroom/2017/01/hey-siri-whos-going-to-win-the-super-bowl/

More information

Deep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations

Deep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations Deep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations Andy Nguyen, M.D., M.S. Medical Director, Hematopathology, Hematology and Coagulation Laboratory,

More information

An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns

An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns 1. Introduction Vasily Morzhakov, Alexey Redozubov morzhakovva@gmail.com, galdrd@gmail.com Abstract Cortical

More information

LabROSA Research Overview

LabROSA Research Overview LabROSA Research Overview Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA dpwe@ee.columbia.edu http://labrosa.ee.columbia.edu/ 1.

More information

Introduction Related Work Dataset & Features

Introduction Related Work Dataset & Features Life Sciences: Predicting Image Categories using Brain Decoding Charles Akin-David ( aakindav@stanford.edu ) Aarush Selvan ( aselvan@stanford.edu ) Minymoh Anelone ( manelone@stanford.edu ) Final Report

More information

Acoustic Sensing With Artificial Intelligence

Acoustic Sensing With Artificial Intelligence Acoustic Sensing With Artificial Intelligence Bowon Lee Department of Electronic Engineering Inha University Incheon, South Korea bowon.lee@inha.ac.kr bowon.lee@ieee.org NVIDIA Deep Learning Day Seoul,

More information

Towards The Deep Model: Understanding Visual Recognition Through Computational Models. Panqu Wang Dissertation Defense 03/23/2017

Towards The Deep Model: Understanding Visual Recognition Through Computational Models. Panqu Wang Dissertation Defense 03/23/2017 Towards The Deep Model: Understanding Visual Recognition Through Computational Models Panqu Wang Dissertation Defense 03/23/2017 Summary Human Visual Recognition (face, object, scene) Simulate Explain

More information

Audiovisual to Sign Language Translator

Audiovisual to Sign Language Translator Technical Disclosure Commons Defensive Publications Series July 17, 2018 Audiovisual to Sign Language Translator Manikandan Gopalakrishnan Follow this and additional works at: https://www.tdcommons.org/dpubs_series

More information

Automated diagnosis of pneumothorax using an ensemble of convolutional neural networks with multi-sized chest radiography images

Automated diagnosis of pneumothorax using an ensemble of convolutional neural networks with multi-sized chest radiography images Automated diagnosis of pneumothorax using an ensemble of convolutional neural networks with multi-sized chest radiography images Tae Joon Jun, Dohyeun Kim, and Daeyoung Kim School of Computing, KAIST,

More information

Supplementary Online Content

Supplementary Online Content Supplementary Online Content Ting DS, Cheung CY-L, Lim G, et al. Development and validation of a deep learning system for diabetic retinopathy and related eye diseases using retinal images from multiethnic

More information

Machine Learning in Precision Medicine Coronary Health Prediction - Cardiac Events (Atherosclerosis) - Heart Transplant (Vasculopathy)

Machine Learning in Precision Medicine Coronary Health Prediction - Cardiac Events (Atherosclerosis) - Heart Transplant (Vasculopathy) Machine Learning in Precision Medicine Coronary Health Prediction - Cardiac Events (Atherosclerosis) - Heart Transplant (Vasculopathy) M. Sonka + IIBI, Charles University, IKEM, CKTCH The University of

More information

On the Use of Brainprints as Passwords

On the Use of Brainprints as Passwords 9/24/2015 2015 Global Identity Summit (GIS) 1 On the Use of Brainprints as Passwords Zhanpeng Jin Department of Electrical and Computer Engineering Department of Biomedical Engineering Binghamton University,

More information

Inferring Clinical Correlations from EEG Reports with Deep Neural Learning

Inferring Clinical Correlations from EEG Reports with Deep Neural Learning Inferring Clinical Correlations from EEG Reports with Deep Neural Learning Methods for Identification, Classification, and Association using EHR Data S23 Travis R. Goodwin (Presenter) & Sanda M. Harabagiu

More information

Dual Path Network and Its Applications

Dual Path Network and Its Applications Learning and Vision Group (NUS), ILSVRC 2017 - CLS-LOC & DET tasks Dual Path Network and Its Applications National University of Singapore: Yunpeng Chen, Jianan Li, Huaxin Xiao, Jianshu Li, Xuecheng Nie,

More information

CLASSIFICATION OF BUILDING NOISE TYPE/POSITION

CLASSIFICATION OF BUILDING NOISE TYPE/POSITION CLASSIFICATION OF BUILDING NOISE TYPE/POSITION VIA SUPERVISED LEARNING Anonymous authors Paper under double-blind review ABSTRACT This paper presents noise type/position classification of various impact

More information

A Comparison of Deep Neural Network Training Methods for Large Vocabulary Speech Recognition

A Comparison of Deep Neural Network Training Methods for Large Vocabulary Speech Recognition A Comparison of Deep Neural Network Training Methods for Large Vocabulary Speech Recognition LászlóTóth and Tamás Grósz MTA-SZTE Research Group on Artificial Intelligence Hungarian Academy of Sciences

More information

Audio-Visual Speech Recognition for a Person with Severe Hearing Loss Using Deep Canonical Correlation Analysis

Audio-Visual Speech Recognition for a Person with Severe Hearing Loss Using Deep Canonical Correlation Analysis Audio-Visual Speech Recognition for a Person with Severe Hearing Loss Using Deep Canonical Correlation Analysis Yuki Takashima 1, Tetsuya Takiguchi 1, Yasuo Ariki 1, Kiyohiro Omori 2 1 Graduate School

More information

Modulation and Top-Down Processing in Audition

Modulation and Top-Down Processing in Audition Modulation and Top-Down Processing in Audition Malcolm Slaney 1,2 and Greg Sell 2 1 Yahoo! Research 2 Stanford CCRMA Outline The Non-Linear Cochlea Correlogram Pitch Modulation and Demodulation Information

More information

Supplementary Online Content

Supplementary Online Content Supplementary Online Content Tschandl P, Rosendahl C, Akay BN, et al. Expert-level diagnosis of nonpigmented skin cancer by combined convolutional neural networks. JAMA Dermatol. Published online November

More information

Leukemia Blood Cell Image Classification Using Convolutional Neural Network

Leukemia Blood Cell Image Classification Using Convolutional Neural Network Leukemia Blood Cell Image Classification Using Convolutional Neural Network T. T. P. Thanh, Caleb Vununu, Sukhrob Atoev, Suk-Hwan Lee, and Ki-Ryong Kwon Abstract Acute myeloid leukemia is a type of malignant

More information

Robust Speech Detection for Noisy Environments

Robust Speech Detection for Noisy Environments Robust Speech Detection for Noisy Environments Óscar Varela, Rubén San-Segundo and Luis A. Hernández ABSTRACT This paper presents a robust voice activity detector (VAD) based on hidden Markov models (HMM)

More information

Big Image-Omics Data Analytics for Clinical Outcome Prediction

Big Image-Omics Data Analytics for Clinical Outcome Prediction Big Image-Omics Data Analytics for Clinical Outcome Prediction Junzhou Huang, Ph.D. Associate Professor Dept. Computer Science & Engineering University of Texas at Arlington Dept. CSE, UT Arlington Scalable

More information

Lecture 9: Speech Recognition: Front Ends

Lecture 9: Speech Recognition: Front Ends EE E682: Speech & Audio Processing & Recognition Lecture 9: Speech Recognition: Front Ends 1 2 Recognizing Speech Feature Calculation Dan Ellis http://www.ee.columbia.edu/~dpwe/e682/

More information

arxiv: v1 [cs.cv] 28 Feb 2018

arxiv: v1 [cs.cv] 28 Feb 2018 Brain Tumor Segmentation and Radiomics Survival Prediction: Contribution to the BRATS 2017 Challenge arxiv:1802.10508v1 [cs.cv] 28 Feb 2018 Fabian Isensee 1, Philipp Kickingereder 2, Wolfgang Wick 3, Martin

More information

arxiv: v2 [cs.cv] 8 Mar 2018

arxiv: v2 [cs.cv] 8 Mar 2018 Automated soft tissue lesion detection and segmentation in digital mammography using a u-net deep learning network Timothy de Moor a, Alejandro Rodriguez-Ruiz a, Albert Gubern Mérida a, Ritse Mann a, and

More information

Using Source Models in Speech Separation

Using Source Models in Speech Separation Using Source Models in Speech Separation Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA dpwe@ee.columbia.edu http://labrosa.ee.columbia.edu/

More information

Tandem acoustic modeling: Neural nets for mainstream ASR?

Tandem acoustic modeling: Neural nets for mainstream ASR? Tandem acoutic modeling: for maintream ASR? Dan Elli International Computer Science Intitute Berkeley CA dpwe@ici.berkeley.edu Outline 2 3 Tandem acoutic modeling Inide Tandem ytem: What going on? Future

More information

ARTIFICIAL INTELLIGENCE FOR DIGITAL PATHOLOGY. Kyunghyun Paeng, Co-founder and Research Scientist, Lunit Inc.

ARTIFICIAL INTELLIGENCE FOR DIGITAL PATHOLOGY. Kyunghyun Paeng, Co-founder and Research Scientist, Lunit Inc. ARTIFICIAL INTELLIGENCE FOR DIGITAL PATHOLOGY Kyunghyun Paeng, Co-founder and Research Scientist, Lunit Inc. 1. BACKGROUND: DIGITAL PATHOLOGY 2. APPLICATIONS AGENDA BREAST CANCER PROSTATE CANCER 3. DEMONSTRATIONS

More information