DCASE 2016 CONVOLUTIONAL NEURAL NETWORKS FOR ACOUSTIC SCENE CLASSIFICATION
|
|
- Moris Osborne
- 5 years ago
- Views:
Transcription
1 DCASE 2016 CONVOLUTIONAL NEURAL NETWORKS FOR ACOUSTIC SCENE CLASSIFICATION Michele Valenti 1 (valenti.michele.w@gmail.com), Aleksandr Diment 2, Giambattista Parascandolo 2, Stefano Squartini 1, Tuomas Virtanen 2 1 Università Politecnica delle Marche, Italy 2 Tampere University of Technology, Finland
2 DCASE 2016 CONVOLUTIONAL NEURAL NETWORKS FOR ACOUSTIC SCENE CLASSIFICATION Michele Valenti 1 (valenti.michele.w@gmail.com), Aleksandr Diment 2, Giambattista Parascandolo 2, Stefano Squartini 1, Tuomas Virtanen 2 1 Università Politecnica delle Marche, Italy 2 Tampere University of Technology, Finland
3 Outline Introduction Our system Training modes Results Challenge ranking
4 Introduction What is acoustic scene classification?
5 Introduction What is acoustic scene classification? Home Car Forest path Audio
6 Our system Overview Audio Feature extraction Sequence splitting CNN Scores averaging Label
7 Our system Audio Features Features Raw audio Log-mel spectrogram
8 Our system Features Sequence splitting Sequence Raw audio segment Log-mel spectrogram Sequence splitting
9 Our system Convolutional neural network Sequence
10 Our system Convolutional neural network Sequences CNN 128 Sequence Feature maps
11 Our system Convolutional neural network Sequences CNN 128 Batch normalization Sequence Feature maps
12 Our system Convolutional neural network Sequences CNN Sequence Feature maps Subsampled feature maps
13 Our system Convolutional neural network Sequences CNN Sequence Feature maps Subsampled feature maps New feature maps
14 Our system Convolutional neural network Sequences CNN Time shrinking Sequence Feature maps Subsampled feature maps New feature maps
15 Our system Sequences CNN Convolutional neural network Flattening Sequence Feature maps Subsampled feature maps New feature maps
16 Our system Sequences CNN Convolutional neural network Fully-connected softmax layer 256 Sequence Feature maps Subsampled feature maps New feature maps
17 Our system Sequences Convolutional neural network 128 Sequence Feature maps New Subsampled feature maps feature maps CNN
18 Our system Scores averaging Class prediction scores Prediction scores Scores averaging
19 Our system Prediction scores Scores averaging Scores averaging Class prediction scores! " Σ argmax File s class
20 Training
21 Training Cross-validation setup Training + validation Test Fold 1 Test Fold 2 Test Fold 3 Test Fold 4
22 Training Non-full training Training + validation Fold n Test Training Validation
23 Training Non-full training Training + validation Fold n Test Training Non-full training Validation
24 Training Non-full training Training Training + validation Fold n Test Training Validation Accuracies Validation Epochs
25 Training Non-full training Training Training + validation Fold n Test Training Validation Accuracies Convergence time Validation Epochs
26 Training Non-full training Training + validation Fold n Test Training Validation Training
27 Training Training + validation Fold n Test Non-full training Full training Training Training Validation
28 Results Test data Training + validation Test Fold 1 Test Fold 2 Test Fold 3 Test Fold 4
29 Results Sequence length 80 Non-full training Full training Accuracy (%) ,5 1, Sequence length (s)
30 Results Sequence length 80 Non-full training Full training Accuracy (%) ,5 1, Sequence length (s)
31 Results Sequence length 80 Non-full training Full training Accuracy (%) ,5 1, Sequence length (s)
32 Results Class accuracies Class Accuracy (%) Beach 75.6 Bus 76.9 Café/Restaurant 74.4 Car 91.0 City center 93.6 Forest path 96.2 Grocery store 88.5 Home 80.8 Class Accuracy (%) Library 66.6 Metro station 96.2 Office 97.4 Park 59.0 Residential area 73.1 Train 46.2 Tram 78.2
33 Results Class accuracies Class Accuracy (%) Beach 75.6 Bus 76.9 Café/Restaurant 74.4 Car 91.0 City center 93.6 Forest path 96.2 Grocery store 88.5 Home 80.8 Class Accuracy (%) Library 66.6 Metro station 96.2 Office 34.6% 97.4 Residential area Park 59.0 Residential area 73.1 Train 46.2 Tram % Bus
34 Results Other classifiers System Sequence length (s) Non-full training Accuracy (%) Full training Baseline GMM (MFCC) Two-layer CNN (MFCC) Two-layer MLP (log-mel) One-layer CNN (log-mel) Two-layer CNN (log-mel)
35 Challenge ranking Final training Extended training set Training + validation + test Evaluation set Secret challenge data
36 Challenge ranking Final training Extended training set Training + validation + test Evaluation set Secret challenge data New training New validation
37 Challenge ranking Final training Extended training set Training + validation + test Evaluation set Secret challenge data New training New validation 400 epochs convergence
38 Challenge ranking Final training Extended training set Training + validation + test Evaluation set Secret challenge data Final training for 400 epochs
39 Challenge ranking ,7 88,7 87,7 87,2 86,4 86,4 86,2 85,9 85,6 85,4 84,6 84,1 77,2 62,8
40 DCASE 2016 CONVOLUTIONAL NEURAL NETWORKS FOR ACOUSTIC SCENE CLASSIFICATION Michele Valenti 1 (valenti.michele.w@gmail.com), Aleksandr Diment 2, Giambattista Parascandolo 2, Stefano Squartini 1, Tuomas Virtanen 2 1 Università Politecnica delle Marche, Italy 2 Tampere University of Technology, Finland
41 Results Feature comparison System Sequence length (s) Non-full training Accuracy (%) Full training Two-layer CNN (MFCC) Two-layer CNN (log-mel)
Improved Acoustic Scene Classification with DNN and CNN
Please contact the conference organizers at dcasechallenge@gmail.com if you require an accessible file, as the files provided by ConfTool Pro to reviewers are filtered to remove author information, and
More informationAcoustic Scene Classification by Ensembling Gradient Boosting Machine and Convolutional Neural Networks
Acoustic Scee Classificatio by Esemblig Gradiet Boostig Machie ad Covolutioal Neural Networks DCASE 2017 Eduardo Foseca, Rog Gog, Dmitry Bogdaov, Olga Slizovskaia, Emilia Gomez ad Xavier Serra Outlie Itroductio
More informationEnhanced Feature Extraction for Speech Detection in Media Audio
INTERSPEECH 2017 August 20 24, 2017, Stockholm, Sweden Enhanced Feature Extraction for Speech Detection in Media Audio Inseon Jang 1, ChungHyun Ahn 1, Jeongil Seo 1, Younseon Jang 2 1 Media Research Division,
More informationAttentional Masking for Pre-trained Deep Networks
Attentional Masking for Pre-trained Deep Networks IROS 2017 Marcus Wallenberg and Per-Erik Forssén Computer Vision Laboratory Department of Electrical Engineering Linköping University 2014 2017 Per-Erik
More informationAudio-Visual Speech Recognition Using Bimodal-Trained Bottleneck Features for a Person with Severe Hearing Loss
INTERSPEECH 2016 September 8 12, 2016, San Francisco, USA Audio-Visual Speech Recognition Using Bimodal-Trained Features for a Person with Severe Hearing Loss Yuki Takashima 1, Ryo Aihara 1, Tetsuya Takiguchi
More informationComputational modeling of visual attention and saliency in the Smart Playroom
Computational modeling of visual attention and saliency in the Smart Playroom Andrew Jones Department of Computer Science, Brown University Abstract The two canonical modes of human visual attention bottomup
More informationB657: Final Project Report Holistically-Nested Edge Detection
B657: Final roject Report Holistically-Nested Edge Detection Mingze Xu & Hanfei Mei May 4, 2016 Abstract Holistically-Nested Edge Detection (HED), which is a novel edge detection method based on fully
More informationImmuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning
Immuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning Yi-Hsiang Hsu, MD, SCD Sep 16, 2017 yihsianghsu@hsl.harvard.edu Director & Associate
More informationA Novel Capsule Neural Network Based Model For Drowsiness Detection Using Electroencephalography Signals
A Novel Capsule Neural Network Based Model For Drowsiness Detection Using Electroencephalography Signals Luis Guarda Bräuning (1) Nicolas Astorga (1) Enrique López Droguett (1) Marcio Moura (2) Marcelo
More informationObject Detectors Emerge in Deep Scene CNNs
Object Detectors Emerge in Deep Scene CNNs Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba Presented By: Collin McCarthy Goal: Understand how objects are represented in CNNs Are
More informationY-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images
Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images Sachin Mehta 1, Ezgi Mercan 1, Jamen Bartlett 2, Donald Weaver 2, Joann G. Elmore 1, and Linda Shapiro 1 1 University
More informationLearning Convolutional Neural Networks for Graphs
GA-65449 Learning Convolutional Neural Networks for Graphs Mathias Niepert Mohamed Ahmed Konstantin Kutzkov NEC Laboratories Europe Representation Learning for Graphs Telecom Safety Transportation Industry
More informationFinal Report: Automated Semantic Segmentation of Volumetric Cardiovascular Features and Disease Assessment
Final Report: Automated Semantic Segmentation of Volumetric Cardiovascular Features and Disease Assessment Tony Lindsey 1,3, Xiao Lu 1 and Mojtaba Tefagh 2 1 Department of Biomedical Informatics, Stanford
More informationCSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil
CSE 5194.01 - Introduction to High-Perfomance Deep Learning ImageNet & VGG Jihyung Kil ImageNet Classification with Deep Convolutional Neural Networks Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton,
More informationAcoustic Signal Processing Based on Deep Neural Networks
Acoustic Signal Processing Based on Deep Neural Networks Chin-Hui Lee School of ECE, Georgia Tech chl@ece.gatech.edu Joint work with Yong Xu, Yanhui Tu, Qing Wang, Tian Gao, Jun Du, LiRong Dai Outline
More informationUsing the Soundtrack to Classify Videos
Using the Soundtrack to Classify Videos Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA dpwe@ee.columbia.edu http://labrosa.ee.columbia.edu/
More informationReading Emotions from Speech using Deep Neural Networks
Reading Emotions from Speech using Deep Neural Networks Anusha Balakrishnan Stanford University Computer Science Department anusha@cs.stanford.edu Alisha Rege Stanford University Computer Science Department
More informationSingle-Channel Sound Source Localization Based on Discrimination of Acoustic Transfer Functions
3 Single-Channel Sound Source Localization Based on Discrimination of Acoustic Transfer Functions Ryoichi Takashima, Tetsuya Takiguchi and Yasuo Ariki Graduate School of System Informatics, Kobe University,
More informationRecognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks
Interspeech 2018 2-6 September 2018, Hyderabad Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks Shahin Amiriparian 1, Alice Baird 1, Sahib Julka 1,
More informationElad Hoffer*, Itay Hubara*, Daniel Soudry
Train longer, generalize better: closing the generalization gap in large batch training of neural networks Elad Hoffer*, Itay Hubara*, Daniel Soudry *Equal contribution Better models - parallelization
More informationACUTE LEUKEMIA CLASSIFICATION USING CONVOLUTION NEURAL NETWORK IN CLINICAL DECISION SUPPORT SYSTEM
ACUTE LEUKEMIA CLASSIFICATION USING CONVOLUTION NEURAL NETWORK IN CLINICAL DECISION SUPPORT SYSTEM Thanh.TTP 1, Giao N. Pham 1, Jin-Hyeok Park 1, Kwang-Seok Moon 2, Suk-Hwan Lee 3, and Ki-Ryong Kwon 1
More informationEMOTION CLASSIFICATION: HOW DOES AN AUTOMATED SYSTEM COMPARE TO NAÏVE HUMAN CODERS?
EMOTION CLASSIFICATION: HOW DOES AN AUTOMATED SYSTEM COMPARE TO NAÏVE HUMAN CODERS? Sefik Emre Eskimez, Kenneth Imade, Na Yang, Melissa Sturge- Apple, Zhiyao Duan, Wendi Heinzelman University of Rochester,
More informationDeep learning and non-negative matrix factorization in recognition of mammograms
Deep learning and non-negative matrix factorization in recognition of mammograms Bartosz Swiderski Faculty of Applied Informatics and Mathematics Warsaw University of Life Sciences, Warsaw, Poland bartosz_swiderski@sggw.pl
More informationVisual interpretation in pathology
13 Visual interpretation in pathology Tissue architecture (alteration) evaluation e.g., for grading prostate cancer Immunohistochemistry (IHC) staining scoring e.g., HER2 in breast cancer (companion diagnostic
More informationConvolutional Neural Networks for Text Classification
Convolutional Neural Networks for Text Classification Sebastian Sierra MindLab Research Group July 1, 2016 ebastian Sierra (MindLab Research Group) NLP Summer Class July 1, 2016 1 / 32 Outline 1 What is
More informationFlexible, High Performance Convolutional Neural Networks for Image Classification
Flexible, High Performance Convolutional Neural Networks for Image Classification Dan C. Cireşan, Ueli Meier, Jonathan Masci, Luca M. Gambardella, Jürgen Schmidhuber IDSIA, USI and SUPSI Manno-Lugano,
More informationSegmentation of Cell Membrane and Nucleus by Improving Pix2pix
Segmentation of Membrane and Nucleus by Improving Pix2pix Masaya Sato 1, Kazuhiro Hotta 1, Ayako Imanishi 2, Michiyuki Matsuda 2 and Kenta Terai 2 1 Meijo University, Siogamaguchi, Nagoya, Aichi, Japan
More informationarxiv: v1 [cs.lg] 6 Oct 2016
Combining Generative and Discriminative Neural Networks for Sleep Stages Classification Endang Purnama Giri 1,2, Mohamad Ivan Fanany 1, Aniati Murni Arymurthy 1, arxiv:1610.01741v1 [cs.lg] 6 Oct 2016 1
More informationCOMP9444 Neural Networks and Deep Learning 5. Convolutional Networks
COMP9444 Neural Networks and Deep Learning 5. Convolutional Networks Textbook, Sections 6.2.2, 6.3, 7.9, 7.11-7.13, 9.1-9.5 COMP9444 17s2 Convolutional Networks 1 Outline Geometry of Hidden Unit Activations
More informationAutomatic Classification of Perceived Gender from Facial Images
Automatic Classification of Perceived Gender from Facial Images Joseph Lemley, Sami Abdul-Wahid, Dipayan Banik Advisor: Dr. Razvan Andonie SOURCE 2016 Outline 1 Introduction 2 Faces - Background 3 Faces
More informationA Deep Learning Approach for Subject Independent Emotion Recognition from Facial Expressions
A Deep Learning Approach for Subject Independent Emotion Recognition from Facial Expressions VICTOR-EMIL NEAGOE *, ANDREI-PETRU BĂRAR *, NICU SEBE **, PAUL ROBITU * * Faculty of Electronics, Telecommunications
More informationVirtual Promenade: A New Serious Game for the Rehabilitation of Older Adults with Post-fall Syndrome
Virtual Promenade: A New Serious Game for the Rehabilitation of Older Adults with Post-fall Syndrome P. Wargnier, E. Phuong, K. Marivan, S. Benveniste, F. Bloch, S. Reingewirtz, G. Kemoun and A.-S. Rigaud
More informationPatch-based Head and Neck Cancer Subtype Classification
Patch-based Head and Neck Cancer Subtype Classification Wanyi Qian, Guoli Yin, Frances Liu, Advisor: Olivier Gevaert, Mu Zhou, Kevin Brennan Stanford University wqian2@stanford.edu, guoliy@stanford.edu,
More informationMR-Radiomics in Neuro-Oncology
Klinik für Stereotaxie und funktionelle Neurochirurgie Institut für Neurowissenschaften und Medizin MR-Radiomics in Neuro-Oncology M. Kocher Klinik für funktionelle Neurochirurgie und Stereotaxie Forschungszentrum
More informationSpeech Enhancement Based on Deep Neural Networks
Speech Enhancement Based on Deep Neural Networks Chin-Hui Lee School of ECE, Georgia Tech chl@ece.gatech.edu Joint work with Yong Xu and Jun Du at USTC 1 Outline and Talk Agenda In Signal Processing Letter,
More informationarxiv: v2 [cs.cv] 19 Dec 2017
An Ensemble of Deep Convolutional Neural Networks for Alzheimer s Disease Detection and Classification arxiv:1712.01675v2 [cs.cv] 19 Dec 2017 Jyoti Islam Department of Computer Science Georgia State University
More informationEfficient Deep Model Selection
Efficient Deep Model Selection Jose Alvarez Researcher Data61, CSIRO, Australia GTC, May 9 th 2017 www.josemalvarez.net conv1 conv2 conv3 conv4 conv5 conv6 conv7 conv8 softmax prediction???????? Num Classes
More informationLung Nodule Segmentation Using 3D Convolutional Neural Networks
Lung Nodule Segmentation Using 3D Convolutional Neural Networks Research paper Business Analytics Bernard Bronmans Master Business Analytics VU University, Amsterdam Evert Haasdijk Supervisor VU University,
More informationUNOBTRUSIVE MONITORING OF SPEECH IMPAIRMENTS OF PARKINSON S DISEASE PATIENTS THROUGH MOBILE DEVICES
UNOBTRUSIVE MONITORING OF SPEECH IMPAIRMENTS OF PARKINSON S DISEASE PATIENTS THROUGH MOBILE DEVICES T. Arias-Vergara 1, J.C. Vásquez-Correa 1,2, J.R. Orozco-Arroyave 1,2, P. Klumpp 2, and E. Nöth 2 1 Faculty
More informationSound, Mixtures, and Learning
Sound, Mixtures, and Learning Dan Ellis Laboratory for Recognition and Organization of Speech and Audio (LabROSA) Electrical Engineering, Columbia University http://labrosa.ee.columbia.edu/
More informationApplying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem
Oral Presentation at MIE 2011 30th August 2011 Oslo Applying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem Kirsi
More informationMosquito Larva Classification Method Based on Convolutional Neural Networks
Mosquito Larva Classification Method Based on Convolutional Neural Networks A. Sanchez-Ortiz, A. Fierro-Radilla, A. Arista-Jalife, M. Cedillo-Hernandez, M. Nakano-Miyatake ESIME Culhuacan, Instituto Politécnico
More informationNEONATAL SEIZURE DETECTION USING CONVOLUTIONAL NEURAL NETWORKS. Alison O Shea, Gordon Lightbody, Geraldine Boylan, Andriy Temko
2017 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, SEPT. 25 28, 2017, TOKYO, JAPAN NEONATAL SEIZURE DETECTION USING CONVOLUTIONAL NEURAL NETWORKS Alison O Shea, Gordon Lightbody,
More informationSPEECH recordings taken from realistic environments typically
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING 1 Coupled Dictionaries for Exemplar-based Speech Enhancement and Automatic Speech Recognition Deepak Baby, Student Member, IEEE, Tuomas Virtanen,
More informationNoise-Robust Speech Recognition Technologies in Mobile Environments
Noise-Robust Speech Recognition echnologies in Mobile Environments Mobile environments are highly influenced by ambient noise, which may cause a significant deterioration of speech recognition performance.
More informationSpeech Enhancement Using Deep Neural Network
Speech Enhancement Using Deep Neural Network Pallavi D. Bhamre 1, Hemangi H. Kulkarni 2 1 Post-graduate Student, Department of Electronics and Telecommunication, R. H. Sapat College of Engineering, Management
More informationPMR5406 Redes Neurais e Lógica Fuzzy. Aula 5 Alguns Exemplos
PMR5406 Redes Neurais e Lógica Fuzzy Aula 5 Alguns Exemplos APPLICATIONS Two examples of real life applications of neural networks for pattern classification: RBF networks for face recognition FF networks
More informationImageCLEF2018: Transfer Learning for Deep Learning with CNN for Tuberculosis Classification
ImageCLEF2018: Transfer Learning for Deep Learning with CNN for Tuberculosis Classification Amilcare Gentili 1-2[0000-0002-5623-7512] 1 San Diego VA Health Care System, San Diego, CA USA 2 University of
More informationNetwork Dissection: Quantifying Interpretability of Deep Visual Representation
Name: Pingchuan Ma Student number: 3526400 Date: August 19, 2018 Seminar: Explainable Machine Learning Lecturer: PD Dr. Ullrich Köthe SS 2018 Quantifying Interpretability of Deep Visual Representation
More informationHighly Accurate Brain Stroke Diagnostic System and Generative Lesion Model. Junghwan Cho, Ph.D. CAIDE Systems, Inc. Deep Learning R&D Team
Highly Accurate Brain Stroke Diagnostic System and Generative Lesion Model Junghwan Cho, Ph.D. CAIDE Systems, Inc. Deep Learning R&D Team Established in September, 2016 at 110 Canal st. Lowell, MA 01852,
More informationAn on-line VAD based on Multi-Normalisation Scoring (MNS) of observation likelihoods
An on-line VAD based on Multi-Normalisation Scoring (MNS) of observation likelihoods Igor Odriozola, Inma Hernaez, Eva Navas Aholab Signal Processing Laboratory, University of the Basque Country (UPV/EHU),
More informationIncorporation of Imaging-Based Functional Assessment Procedures into the DICOM Standard Draft version 0.1 7/27/2011
Incorporation of Imaging-Based Functional Assessment Procedures into the DICOM Standard Draft version 0.1 7/27/2011 I. Purpose Drawing from the profile development of the QIBA-fMRI Technical Committee,
More informationThe Impact of Visual Saliency Prediction in Image Classification
Dublin City University Insight Centre for Data Analytics Universitat Politecnica de Catalunya Escola Tècnica Superior d Enginyeria de Telecomunicacions de Barcelona Eric Arazo Sánchez The Impact of Visual
More informationRecognition & Organization of Speech and Audio
Recognition & Organization of Speech and Audio Dan Ellis Electrical Engineering, Columbia University http://www.ee.columbia.edu/~dpwe/ Outline 1 2 3 4 5 Introducing Tandem modeling
More informationVisual Scene Understanding
Visual Scene Understanding Aude Oliva Department of Brain and Cognitive Sciences Massachusetts Institute of Technology Website: http://cvcl.mit.edu PPA High-level Scene Representation I. Long-term Memory
More informationImage Classification with TensorFlow: Radiomics 1p/19q Chromosome Status Classification Using Deep Learning
Image Classification with TensorFlow: Radiomics 1p/19q Chromosome Status Classification Using Deep Learning Charles Killam, LP.D. Certified Instructor, NVIDIA Deep Learning Institute NVIDIA Corporation
More informationRobust Neural Encoding of Speech in Human Auditory Cortex
Robust Neural Encoding of Speech in Human Auditory Cortex Nai Ding, Jonathan Z. Simon Electrical Engineering / Biology University of Maryland, College Park Auditory Processing in Natural Scenes How is
More informationCPSC81 Final Paper: Facial Expression Recognition Using CNNs
CPSC81 Final Paper: Facial Expression Recognition Using CNNs Luis Ceballos Swarthmore College, 500 College Ave., Swarthmore, PA 19081 USA Sarah Wallace Swarthmore College, 500 College Ave., Swarthmore,
More informationFood/Non-food Image Classification and Food Categorization using Pre-Trained GoogLeNet Model
1 Food/Non-food Image Classification and Food Categorization using Pre-Trained GoogLeNet Model Ashutosh Singla, Lin Yuan, and Touradj Ebrahimi lin.yuan@epfl.ch Outline 2 o Introduction o Image Dataset
More informationBinaural Hearing for Robots Introduction to Robot Hearing
Binaural Hearing for Robots Introduction to Robot Hearing 1Radu Horaud Binaural Hearing for Robots 1. Introduction to Robot Hearing 2. Methodological Foundations 3. Sound-Source Localization 4. Machine
More informationSpeech Emotion Detection and Analysis
Speech Emotion Detection and Analysis Helen Chan Travis Ebesu Caleb Fujimori COEN296: Natural Language Processing Prof. Ming-Hwa Wang School of Engineering Department of Computer Engineering Santa Clara
More informationHHS Public Access Author manuscript Med Image Comput Comput Assist Interv. Author manuscript; available in PMC 2018 January 04.
Discriminative Localization in CNNs for Weakly-Supervised Segmentation of Pulmonary Nodules Xinyang Feng 1, Jie Yang 1, Andrew F. Laine 1, and Elsa D. Angelini 1,2 1 Department of Biomedical Engineering,
More informationCervix Cancer Classification using Colposcopy Images by Deep Learning Method
Cervix Cancer Classification using Colposcopy Images by Deep Learning Method Vasudha, Ajay Mittal, Mamta Juneja University Institute of Engineering and Technology (UIET), Panjab University, Chandigarh,
More informationImmuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning
Immuno-Oncology Therapies and Precision Medicine: Personal Tumor-Specific Neoantigen Prediction by Machine Learning Yi-Hsiang Hsu, MD, SCD Sep 16, 2017 yihsianghsu@hsl.harvard.edu HSL GeneticEpi Center,
More informationAutomatic Quality Assessment of Cardiac MRI
Automatic Quality Assessment of Cardiac MRI Ilkay Oksuz 02.05.2018 Contact: ilkay.oksuz@kcl.ac.uk http://kclmmag.org 1 Cardiac MRI Quality Issues Need for high quality images Wide range of artefacts Manual
More informationImage-Based Estimation of Real Food Size for Accurate Food Calorie Estimation
Image-Based Estimation of Real Food Size for Accurate Food Calorie Estimation Takumi Ege, Yoshikazu Ando, Ryosuke Tanno, Wataru Shimoda and Keiji Yanai Department of Informatics, The University of Electro-Communications,
More informationImproving 3D Ultrasound Scan Adequacy Classification Using a Three-Slice Convolutional Neural Network Architecture
EPiC Series in Health Sciences Volume 2, 2018, Pages 152 156 CAOS 2018. The 18th Annual Meeting of the International Society for Computer Assisted Orthopaedic Surgery Health Sciences Improving 3D Ultrasound
More informationWorking Group Meeting #3 DRAFT Summary September 27, 2017
Working Group Meeting #3 DRAFT Summary September 27, 2017 Introduction The Chinatown Revitalization Plan Working Group met as a large group for the third time on September 27, 2017. Approximately 12 committee
More informationarxiv: v1 [cs.cv] 13 Mar 2018
RESOURCE AWARE DESIGN OF A DEEP CONVOLUTIONAL-RECURRENT NEURAL NETWORK FOR SPEECH RECOGNITION THROUGH AUDIO-VISUAL SENSOR FUSION Matthijs Van keirsbilck Bert Moons Marian Verhelst MICAS, Department of
More informationREZUMAT TEZA DE DOCTORAT
Investeşte în oameni! FONDUL SOCIAL EUROPEAN Programul Operaţional Sectorial Dezvoltarea Resurselor Umane 2007 2013 Axa prioritară: 1 Educaţia şi formarea profesională în sprijinul creşterii economice
More informationFactoid Question Answering
Factoid Question Answering CS 898 Project June 12, 2017 Salman Mohammed David R. Cheriton School of Computer Science University of Waterloo Motivation Source: https://www.apple.com/newsroom/2017/01/hey-siri-whos-going-to-win-the-super-bowl/
More informationDeep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations
Deep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations Andy Nguyen, M.D., M.S. Medical Director, Hematopathology, Hematology and Coagulation Laboratory,
More informationAn Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns
An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns 1. Introduction Vasily Morzhakov, Alexey Redozubov morzhakovva@gmail.com, galdrd@gmail.com Abstract Cortical
More informationLabROSA Research Overview
LabROSA Research Overview Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA dpwe@ee.columbia.edu http://labrosa.ee.columbia.edu/ 1.
More informationIntroduction Related Work Dataset & Features
Life Sciences: Predicting Image Categories using Brain Decoding Charles Akin-David ( aakindav@stanford.edu ) Aarush Selvan ( aselvan@stanford.edu ) Minymoh Anelone ( manelone@stanford.edu ) Final Report
More informationAcoustic Sensing With Artificial Intelligence
Acoustic Sensing With Artificial Intelligence Bowon Lee Department of Electronic Engineering Inha University Incheon, South Korea bowon.lee@inha.ac.kr bowon.lee@ieee.org NVIDIA Deep Learning Day Seoul,
More informationTowards The Deep Model: Understanding Visual Recognition Through Computational Models. Panqu Wang Dissertation Defense 03/23/2017
Towards The Deep Model: Understanding Visual Recognition Through Computational Models Panqu Wang Dissertation Defense 03/23/2017 Summary Human Visual Recognition (face, object, scene) Simulate Explain
More informationAudiovisual to Sign Language Translator
Technical Disclosure Commons Defensive Publications Series July 17, 2018 Audiovisual to Sign Language Translator Manikandan Gopalakrishnan Follow this and additional works at: https://www.tdcommons.org/dpubs_series
More informationAutomated diagnosis of pneumothorax using an ensemble of convolutional neural networks with multi-sized chest radiography images
Automated diagnosis of pneumothorax using an ensemble of convolutional neural networks with multi-sized chest radiography images Tae Joon Jun, Dohyeun Kim, and Daeyoung Kim School of Computing, KAIST,
More informationSupplementary Online Content
Supplementary Online Content Ting DS, Cheung CY-L, Lim G, et al. Development and validation of a deep learning system for diabetic retinopathy and related eye diseases using retinal images from multiethnic
More informationMachine Learning in Precision Medicine Coronary Health Prediction - Cardiac Events (Atherosclerosis) - Heart Transplant (Vasculopathy)
Machine Learning in Precision Medicine Coronary Health Prediction - Cardiac Events (Atherosclerosis) - Heart Transplant (Vasculopathy) M. Sonka + IIBI, Charles University, IKEM, CKTCH The University of
More informationOn the Use of Brainprints as Passwords
9/24/2015 2015 Global Identity Summit (GIS) 1 On the Use of Brainprints as Passwords Zhanpeng Jin Department of Electrical and Computer Engineering Department of Biomedical Engineering Binghamton University,
More informationInferring Clinical Correlations from EEG Reports with Deep Neural Learning
Inferring Clinical Correlations from EEG Reports with Deep Neural Learning Methods for Identification, Classification, and Association using EHR Data S23 Travis R. Goodwin (Presenter) & Sanda M. Harabagiu
More informationDual Path Network and Its Applications
Learning and Vision Group (NUS), ILSVRC 2017 - CLS-LOC & DET tasks Dual Path Network and Its Applications National University of Singapore: Yunpeng Chen, Jianan Li, Huaxin Xiao, Jianshu Li, Xuecheng Nie,
More informationCLASSIFICATION OF BUILDING NOISE TYPE/POSITION
CLASSIFICATION OF BUILDING NOISE TYPE/POSITION VIA SUPERVISED LEARNING Anonymous authors Paper under double-blind review ABSTRACT This paper presents noise type/position classification of various impact
More informationA Comparison of Deep Neural Network Training Methods for Large Vocabulary Speech Recognition
A Comparison of Deep Neural Network Training Methods for Large Vocabulary Speech Recognition LászlóTóth and Tamás Grósz MTA-SZTE Research Group on Artificial Intelligence Hungarian Academy of Sciences
More informationAudio-Visual Speech Recognition for a Person with Severe Hearing Loss Using Deep Canonical Correlation Analysis
Audio-Visual Speech Recognition for a Person with Severe Hearing Loss Using Deep Canonical Correlation Analysis Yuki Takashima 1, Tetsuya Takiguchi 1, Yasuo Ariki 1, Kiyohiro Omori 2 1 Graduate School
More informationModulation and Top-Down Processing in Audition
Modulation and Top-Down Processing in Audition Malcolm Slaney 1,2 and Greg Sell 2 1 Yahoo! Research 2 Stanford CCRMA Outline The Non-Linear Cochlea Correlogram Pitch Modulation and Demodulation Information
More informationSupplementary Online Content
Supplementary Online Content Tschandl P, Rosendahl C, Akay BN, et al. Expert-level diagnosis of nonpigmented skin cancer by combined convolutional neural networks. JAMA Dermatol. Published online November
More informationLeukemia Blood Cell Image Classification Using Convolutional Neural Network
Leukemia Blood Cell Image Classification Using Convolutional Neural Network T. T. P. Thanh, Caleb Vununu, Sukhrob Atoev, Suk-Hwan Lee, and Ki-Ryong Kwon Abstract Acute myeloid leukemia is a type of malignant
More informationRobust Speech Detection for Noisy Environments
Robust Speech Detection for Noisy Environments Óscar Varela, Rubén San-Segundo and Luis A. Hernández ABSTRACT This paper presents a robust voice activity detector (VAD) based on hidden Markov models (HMM)
More informationBig Image-Omics Data Analytics for Clinical Outcome Prediction
Big Image-Omics Data Analytics for Clinical Outcome Prediction Junzhou Huang, Ph.D. Associate Professor Dept. Computer Science & Engineering University of Texas at Arlington Dept. CSE, UT Arlington Scalable
More informationLecture 9: Speech Recognition: Front Ends
EE E682: Speech & Audio Processing & Recognition Lecture 9: Speech Recognition: Front Ends 1 2 Recognizing Speech Feature Calculation Dan Ellis http://www.ee.columbia.edu/~dpwe/e682/
More informationarxiv: v1 [cs.cv] 28 Feb 2018
Brain Tumor Segmentation and Radiomics Survival Prediction: Contribution to the BRATS 2017 Challenge arxiv:1802.10508v1 [cs.cv] 28 Feb 2018 Fabian Isensee 1, Philipp Kickingereder 2, Wolfgang Wick 3, Martin
More informationarxiv: v2 [cs.cv] 8 Mar 2018
Automated soft tissue lesion detection and segmentation in digital mammography using a u-net deep learning network Timothy de Moor a, Alejandro Rodriguez-Ruiz a, Albert Gubern Mérida a, Ritse Mann a, and
More informationUsing Source Models in Speech Separation
Using Source Models in Speech Separation Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA dpwe@ee.columbia.edu http://labrosa.ee.columbia.edu/
More informationTandem acoustic modeling: Neural nets for mainstream ASR?
Tandem acoutic modeling: for maintream ASR? Dan Elli International Computer Science Intitute Berkeley CA dpwe@ici.berkeley.edu Outline 2 3 Tandem acoutic modeling Inide Tandem ytem: What going on? Future
More informationARTIFICIAL INTELLIGENCE FOR DIGITAL PATHOLOGY. Kyunghyun Paeng, Co-founder and Research Scientist, Lunit Inc.
ARTIFICIAL INTELLIGENCE FOR DIGITAL PATHOLOGY Kyunghyun Paeng, Co-founder and Research Scientist, Lunit Inc. 1. BACKGROUND: DIGITAL PATHOLOGY 2. APPLICATIONS AGENDA BREAST CANCER PROSTATE CANCER 3. DEMONSTRATIONS
More information