DeepASL: Enabling Ubiquitous and Non-Intrusive Word and Sentence-Level Sign Language Translation

Size: px
Start display at page:

Download "DeepASL: Enabling Ubiquitous and Non-Intrusive Word and Sentence-Level Sign Language Translation"

Transcription

1 DeepASL: Enabling Ubiquitous and Non-Intrusive Word and Sentence-Level Sign Language Translation Biyi Fang Michigan State University ACM SenSys 17 Nov 6 th, 2017 Biyi Fang (MSU) Jillian Co (MSU) Mi Zhang (MSU) 1

2 Deep Learning is Changing our Lives Now Self-Driving Face Recognition Speech Recognition Play Go 2

3 Background American Sign Language (ASL) is the primary language used by deaf people to communicate with others. Unfortunately, very few people with normal hearing understand sign language. Existing communication approaches have key limitations in cost, availability or convenience. Sign Language Interpreter Write on Paper Type on Phone 3

4 Sign Language Translation Technology A S L Characteristics of Signs Hand Shape Hand Movement Relative Location of Two Hands Sensors Computational Models 4

5 Limitations of Existing Sign Language Translation Systems EMG + Motion [Wu et al. 2015] RGB Camera [Zafrulla et al. 2010] Kinect [Chai et al. 2013] intrusive constrained by lighting condition and privacy intrusive lack of resolution 5

6 Our Solution: DeepASL A deep learning-based sign language translation framework that enables ubiquitous and non-intrusive ASL translation at both word and sentence levels. 6

7 Leap Motion (Infrared Sensing) Design Choice 3D Skeleton Joint Data Skeleton Joint Bone Extended Bone Elbow 7

8 Comparison with Existing Sign Language Translation Systems Non- Intrusive Lighting Condition Privacy Preserving High Resolution EMG + Motion RGB Camera Kinect DeepASL 8

9 System Architecture of DeepASL Sentence-Level Translation Word-Level Translation ASL Characteristics Extraction 9

10 ASL Characteristics Extraction Hand Shape + Relative Location of Two Hands Right Hand Shape Hand Movement [0, 0, 0] Right Hand Movement Left Hand Shape Left Hand Movement 10

11 ASL Characteristics Organization Right Hand Shape Right Hand Movement Left Hand Shape Thank Left Hand Movement Fully Connected Softmax Low-Level ASL Characteristics Mid-Level Right/Left Hand Representation High-Level Single-Sign Representation Probability Distribution over Vocabulary 11

12 Similar ASL Differentiation Some signs share very similar characteristics at the beginning of their trajectories. Want What 12

13 Similar ASL Differentiation A bidirectional recurrent neural network (B-RNN) model is incorporated to capture both forward and backward representation of a sign. Output Layer y t 1 y t y t+1 Backward Layer h t 1 h t h t+1 Forward Layer h t 1 h t h t+1 Input Layer x t 1 x t x t+1 13

14 Sentence-Level ASL Translation DeepASL adopts a probabilistic framework based on Connectionist Temporal Classification (CTC) [Graves et al. 2006] for sentence-level ASL Inference How are you How_are_you How are you Insert blank symbols Remove blank symbols It eliminates the restriction of pre-segmenting the whole sentence into individual words, enabling end-to-end whole-sentence translation. 14

15 Performance on Word-Level ASL Translation ASL Word Dataset 56 ASL words 11 participants 6440 samples In total Performance Average 95% accuracy Worst-case 91% on participant #11 15

16 Necessity of Model Components Model Translation Accuracy Increase Note Baseline ± 3.1 % 5.1 % No hand shape information Baseline ± 2.4 % 5.0 % No hand movement information Baseline ± 3.4 % 3.4 % No hierarchical structure Baseline ± 1.7 % 0.8 % No bidirectional structure DeepASL 94.5 ± 2.4 % 16

17 Performance on Sentence-Level ASL Translation ASL Sentence Dataset 4-word sentence from 16 ASL words 100 sentences 866 samples in total Performance Average 16% Top-1 word error rate (WER) Average 4% Top-5 WER 17

18 Application#1: ASL Tutor ASL Tutor helps hearing parents of deaf children learn ASL. MyASLTutor Looked-up Word & Explanation ASL Visualization 18

19 Application#2: ASL Interpreter ASL Interpreter enables two-way communication between deaf and hearing majority. Deaf Person First-person point of view of the deaf person using Microsoft HoloLens AR headset 19

20 Video:

21 Conclusions DeepASL represents the first deep learning-based sign language translation framework that enables ubiquitous and non-intrusive ASL translation at both word and sentence levels. DeepASL achieves an average 94.5% translation accuracy over 56 commonly used ASL words, and an average 16.1% word error rate on translating 100 sentences. Take an initiative on ASL sign data crowdsourcing. We believe that, with the crowdsourced efforts, ASL translation technology can be significantly advanced. 21

22 Thank You Biyi Fang Michigan State University Web: fangbiyi.com 22

Experimental evaluation of the accuracy of the second generation of Microsoft Kinect system, for using in stroke rehabilitation applications

Experimental evaluation of the accuracy of the second generation of Microsoft Kinect system, for using in stroke rehabilitation applications Experimental evaluation of the accuracy of the second generation of Microsoft Kinect system, for using in stroke rehabilitation applications Mohammad Hossein Saadatzi 1 Home-based Stroke Rehabilitation

More information

Accessible Computing Research for Users who are Deaf and Hard of Hearing (DHH)

Accessible Computing Research for Users who are Deaf and Hard of Hearing (DHH) Accessible Computing Research for Users who are Deaf and Hard of Hearing (DHH) Matt Huenerfauth Raja Kushalnagar Rochester Institute of Technology DHH Auditory Issues Links Accents/Intonation Listening

More information

Sign Language Recognition with the Kinect Sensor Based on Conditional Random Fields

Sign Language Recognition with the Kinect Sensor Based on Conditional Random Fields Sensors 2015, 15, 135-147; doi:10.3390/s150100135 Article OPEN ACCESS sensors ISSN 1424-8220 www.mdpi.com/journal/sensors Sign Language Recognition with the Kinect Sensor Based on Conditional Random Fields

More information

Inferring Clinical Correlations from EEG Reports with Deep Neural Learning

Inferring Clinical Correlations from EEG Reports with Deep Neural Learning Inferring Clinical Correlations from EEG Reports with Deep Neural Learning Methods for Identification, Classification, and Association using EHR Data S23 Travis R. Goodwin (Presenter) & Sanda M. Harabagiu

More information

Real Time Sign Language Processing System

Real Time Sign Language Processing System Real Time Sign Language Processing System Dibyabiva Seth (&), Anindita Ghosh, Ariruna Dasgupta, and Asoke Nath Department of Computer Science, St. Xavier s College (Autonomous), Kolkata, India meetdseth@gmail.com,

More information

Sign Language Recognition using Kinect

Sign Language Recognition using Kinect Sign Language Recognition using Kinect Edon Mustafa 1, Konstantinos Dimopoulos 2 1 South-East European Research Centre, University of Sheffield, Thessaloniki, Greece 2 CITY College- International Faculty

More information

Academic Program / Discipline Area (for General Education) or Co-Curricular Program Area:

Academic Program / Discipline Area (for General Education) or Co-Curricular Program Area: PROGRAM LEARNING OUTCOME ASSESSMENT PLAN General Information Academic Year of Implementation: 2012 2013 Academic Program / Discipline Area (for General Education) or Co-Curricular Program Area: Pre-major

More information

TURKISH SIGN LANGUAGE RECOGNITION USING HIDDEN MARKOV MODEL

TURKISH SIGN LANGUAGE RECOGNITION USING HIDDEN MARKOV MODEL TURKISH SIGN LANGUAGE RECOGNITION USING HIDDEN MARKOV MODEL Kakajan Kakayev 1 and Ph.D. Songül Albayrak 2 1,2 Department of Computer Engineering, Yildiz Technical University, Istanbul, Turkey kkakajan@gmail.com

More information

Expression Recognition. Mohammad Amanzadeh

Expression Recognition. Mohammad Amanzadeh Expression Recognition Mohammad Amanzadeh Body Movement Movement is one of the most basic human skills that used for communicating and interacting with the environment Although we have an intuitive understanding

More information

TWO HANDED SIGN LANGUAGE RECOGNITION SYSTEM USING IMAGE PROCESSING

TWO HANDED SIGN LANGUAGE RECOGNITION SYSTEM USING IMAGE PROCESSING 134 TWO HANDED SIGN LANGUAGE RECOGNITION SYSTEM USING IMAGE PROCESSING H.F.S.M.Fonseka 1, J.T.Jonathan 2, P.Sabeshan 3 and M.B.Dissanayaka 4 1 Department of Electrical And Electronic Engineering, Faculty

More information

Image Captioning using Reinforcement Learning. Presentation by: Samarth Gupta

Image Captioning using Reinforcement Learning. Presentation by: Samarth Gupta Image Captioning using Reinforcement Learning Presentation by: Samarth Gupta 1 Introduction Summary Supervised Models Image captioning as RL problem Actor Critic Architecture Policy Gradient architecture

More information

Using Deep Convolutional Networks for Gesture Recognition in American Sign Language

Using Deep Convolutional Networks for Gesture Recognition in American Sign Language Using Deep Convolutional Networks for Gesture Recognition in American Sign Language Abstract In the realm of multimodal communication, sign language is, and continues to be, one of the most understudied

More information

arxiv: v1 [cs.cv] 13 Mar 2018

arxiv: v1 [cs.cv] 13 Mar 2018 RESOURCE AWARE DESIGN OF A DEEP CONVOLUTIONAL-RECURRENT NEURAL NETWORK FOR SPEECH RECOGNITION THROUGH AUDIO-VISUAL SENSOR FUSION Matthijs Van keirsbilck Bert Moons Marian Verhelst MICAS, Department of

More information

An Approach to Hand Gesture Recognition for Devanagari Sign Language using Image Processing Tool Box

An Approach to Hand Gesture Recognition for Devanagari Sign Language using Image Processing Tool Box An Approach to Hand Gesture Recognition for Devanagari Sign Language using Image Processing Tool Box Prof. Abhijit V. Warhade 1 Prof. Pranali K. Misal 2 Assistant Professor, Dept. of E & C Engineering

More information

Audiovisual to Sign Language Translator

Audiovisual to Sign Language Translator Technical Disclosure Commons Defensive Publications Series July 17, 2018 Audiovisual to Sign Language Translator Manikandan Gopalakrishnan Follow this and additional works at: https://www.tdcommons.org/dpubs_series

More information

CHINESE SIGN LANGUAGE RECOGNITION WITH ADAPTIVE HMM. Jihai Zhang, Wengang Zhou, Chao Xie, Junfu Pu, and Houqiang Li

CHINESE SIGN LANGUAGE RECOGNITION WITH ADAPTIVE HMM. Jihai Zhang, Wengang Zhou, Chao Xie, Junfu Pu, and Houqiang Li CHINESE SIGN LANGUAGE RECOGNITION WITH ADAPTIVE HMM Jihai Zhang, Wengang Zhou, Chao Xie, Junfu Pu, and Houqiang Li University of Science and Technology of China, Hefei, China {jihzhang, pjh}@mail.ustc.edu.cn,

More information

Gesture Recognition using Marathi/Hindi Alphabet

Gesture Recognition using Marathi/Hindi Alphabet Gesture Recognition using Marathi/Hindi Alphabet Rahul Dobale ¹, Rakshit Fulzele², Shruti Girolla 3, Seoutaj Singh 4 Student, Computer Engineering, D.Y. Patil School of Engineering, Pune, India 1 Student,

More information

Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports

Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports Ramon Maldonado, BS, Travis Goodwin, PhD Sanda M. Harabagiu, PhD The University

More information

Deep Learning for Lip Reading using Audio-Visual Information for Urdu Language

Deep Learning for Lip Reading using Audio-Visual Information for Urdu Language Deep Learning for Lip Reading using Audio-Visual Information for Urdu Language Muhammad Faisal Information Technology University Lahore m.faisal@itu.edu.pk Abstract Human lip-reading is a challenging task.

More information

Analysis of Recognition System of Japanese Sign Language using 3D Image Sensor

Analysis of Recognition System of Japanese Sign Language using 3D Image Sensor Analysis of Recognition System of Japanese Sign Language using 3D Image Sensor Yanhua Sun *, Noriaki Kuwahara**, Kazunari Morimoto *** * oo_alison@hotmail.com ** noriaki.kuwahara@gmail.com ***morix119@gmail.com

More information

LSA64: An Argentinian Sign Language Dataset

LSA64: An Argentinian Sign Language Dataset LSA64: An Argentinian Sign Language Dataset Franco Ronchetti* 1, Facundo Quiroga* 1, César Estrebou 1, Laura Lanzarini 1, and Alejandro Rosete 2 1 Instituto de Investigación en Informática LIDI, Facultad

More information

Hand Sign to Bangla Speech: A Deep Learning in Vision based system for Recognizing Hand Sign Digits and Generating Bangla Speech

Hand Sign to Bangla Speech: A Deep Learning in Vision based system for Recognizing Hand Sign Digits and Generating Bangla Speech Hand Sign to Bangla Speech: A Deep Learning in Vision based system for Recognizing Hand Sign Digits and Generating Bangla Speech arxiv:1901.05613v1 [cs.cv] 17 Jan 2019 Shahjalal Ahmed, Md. Rafiqul Islam,

More information

Building an Application for Learning the Finger Alphabet of Swiss German Sign Language through Use of the Kinect

Building an Application for Learning the Finger Alphabet of Swiss German Sign Language through Use of the Kinect Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch Year: 2014 Building an Application for Learning the Finger Alphabet of Swiss German

More information

ScienceDirect. Sign Language Unification: The Need for Next Generation Deaf Education

ScienceDirect. Sign Language Unification: The Need for Next Generation Deaf Education Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 48 (2015 ) 673 678 International Conference on Intelligent Computing, Communication & Convergence (ICCC-2015) (ICCC-2014)

More information

Sign Language in the Intelligent Sensory Environment

Sign Language in the Intelligent Sensory Environment Sign Language in the Intelligent Sensory Environment Ákos Lisztes, László Kővári, Andor Gaudia, Péter Korondi Budapest University of Science and Technology, Department of Automation and Applied Informatics,

More information

Sign Language to English (Slate8)

Sign Language to English (Slate8) Sign Language to English (Slate8) App Development Nathan Kebe El Faculty Advisor: Dr. Mohamad Chouikha 2 nd EECS Day April 20, 2018 Electrical Engineering and Computer Science (EECS) Howard University

More information

Recognition of sign language gestures using neural networks

Recognition of sign language gestures using neural networks Recognition of sign language gestures using neural s Peter Vamplew Department of Computer Science, University of Tasmania GPO Box 252C, Hobart, Tasmania 7001, Australia vamplew@cs.utas.edu.au ABSTRACT

More information

Performance Analysis of different Classifiers for Chinese Sign Language Recognition

Performance Analysis of different Classifiers for Chinese Sign Language Recognition IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 11, Issue 2, Ver. II (Mar-Apr.216), PP 47-54 www.iosrjournals.org Performance Analysis

More information

Efficient Deep Model Selection

Efficient Deep Model Selection Efficient Deep Model Selection Jose Alvarez Researcher Data61, CSIRO, Australia GTC, May 9 th 2017 www.josemalvarez.net conv1 conv2 conv3 conv4 conv5 conv6 conv7 conv8 softmax prediction???????? Num Classes

More information

Motivation: Attention: Focusing on specific parts of the input. Inspired by neuroscience.

Motivation: Attention: Focusing on specific parts of the input. Inspired by neuroscience. Outline: Motivation. What s the attention mechanism? Soft attention vs. Hard attention. Attention in Machine translation. Attention in Image captioning. State-of-the-art. 1 Motivation: Attention: Focusing

More information

Multi-Modality American Sign Language Recognition

Multi-Modality American Sign Language Recognition Rochester Institute of Technology RIT Scholar Works Presentations and other scholarship 9-2016 Multi-Modality American Sign Language Recognition Chenyang Zhang City College of New York Yingli Tian City

More information

HAND GESTURE RECOGNITION USING ADAPTIVE NETWORK BASED FUZZY INFERENCE SYSTEM AND K-NEAREST NEIGHBOR. Fifin Ayu Mufarroha 1, Fitri Utaminingrum 1*

HAND GESTURE RECOGNITION USING ADAPTIVE NETWORK BASED FUZZY INFERENCE SYSTEM AND K-NEAREST NEIGHBOR. Fifin Ayu Mufarroha 1, Fitri Utaminingrum 1* International Journal of Technology (2017) 3: 559-567 ISSN 2086-9614 IJTech 2017 HAND GESTURE RECOGNITION USING ADAPTIVE NETWORK BASED FUZZY INFERENCE SYSTEM AND K-NEAREST NEIGHBOR Fifin Ayu Mufarroha

More information

Skin cancer reorganization and classification with deep neural network

Skin cancer reorganization and classification with deep neural network Skin cancer reorganization and classification with deep neural network Hao Chang 1 1. Department of Genetics, Yale University School of Medicine 2. Email: changhao86@gmail.com Abstract As one kind of skin

More information

Implementation of image processing approach to translation of ASL finger-spelling to digital text

Implementation of image processing approach to translation of ASL finger-spelling to digital text Rochester Institute of Technology RIT Scholar Works Articles 2006 Implementation of image processing approach to translation of ASL finger-spelling to digital text Divya Mandloi Kanthi Sarella Chance Glenn

More information

N RISCE 2K18 ISSN International Journal of Advance Research and Innovation

N RISCE 2K18 ISSN International Journal of Advance Research and Innovation The Computer Assistance Hand Gesture Recognition system For Physically Impairment Peoples V.Veeramanikandan(manikandan.veera97@gmail.com) UG student,department of ECE,Gnanamani College of Technology. R.Anandharaj(anandhrak1@gmail.com)

More information

Developing a Game-Based Proprioception Reconstruction System for Patients with Ankle Sprain

Developing a Game-Based Proprioception Reconstruction System for Patients with Ankle Sprain Developing a Game-Based Proprioception Reconstruction System for Patients with Ankle Sprain Yu-Cheng Lin, Chung Shan Medical University, Taiwan Tzu-Fang Sheu, Providence University, Taiwan Hsiao Ping Lee,

More information

enterface 13 Kinect-Sign João Manuel Ferreira Gameiro Project Proposal for enterface 13

enterface 13 Kinect-Sign João Manuel Ferreira Gameiro Project Proposal for enterface 13 enterface 13 João Manuel Ferreira Gameiro Kinect-Sign Project Proposal for enterface 13 February, 2013 Abstract This project main goal is to assist in the communication between deaf and non-deaf people.

More information

Scalable ASL sign recognition using model-based machine learning and linguistically annotated corpora

Scalable ASL sign recognition using model-based machine learning and linguistically annotated corpora Boston University OpenBU Linguistics http://open.bu.edu BU Open Access Articles 2018-05-12 Scalable ASL sign recognition using model-based machine learning and linguistically annotated corpora Metaxas,

More information

Sign Language Gesture Classification Using Neural Networks

Sign Language Gesture Classification Using Neural Networks IberSPEECH 2018 21-23 November 2018, Barcelona, Spain Sign Language Gesture Classification Using Neural Networks Zuzanna Parcheta 1, Carlos-D. Martínez-Hinarejos 2 1 Sciling S.L., Carrer del Riu 321, Pinedo,

More information

A HMM-based Pre-training Approach for Sequential Data

A HMM-based Pre-training Approach for Sequential Data A HMM-based Pre-training Approach for Sequential Data Luca Pasa 1, Alberto Testolin 2, Alessandro Sperduti 1 1- Department of Mathematics 2- Department of Developmental Psychology and Socialisation University

More information

The Leap Motion controller: A view on sign language

The Leap Motion controller: A view on sign language The Leap Motion controller: A view on sign language Author Potter, Leigh-Ellen, Araullo, Jake, Carter, Lewis Published 2013 Conference Title The 25th Australian Computer-Human Interaction Conference DOI

More information

An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns

An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns 1. Introduction Vasily Morzhakov, Alexey Redozubov morzhakovva@gmail.com, galdrd@gmail.com Abstract Cortical

More information

arxiv: v1 [stat.ml] 23 Jan 2017

arxiv: v1 [stat.ml] 23 Jan 2017 Learning what to look in chest X-rays with a recurrent visual attention model arxiv:1701.06452v1 [stat.ml] 23 Jan 2017 Petros-Pavlos Ypsilantis Department of Biomedical Engineering King s College London

More information

3. MANUAL ALPHABET RECOGNITION STSTM

3. MANUAL ALPHABET RECOGNITION STSTM Proceedings of the IIEEJ Image Electronics and Visual Computing Workshop 2012 Kuching, Malaysia, November 21-24, 2012 JAPANESE MANUAL ALPHABET RECOGNITION FROM STILL IMAGES USING A NEURAL NETWORK MODEL

More information

Provost s Learning Innovation Grant for

Provost s Learning Innovation Grant for Provost s Learning Innovation Grant for 2010-2011 Elisabetta D'Amanda (CLA/Foreign Languages), Ann Marie Kuntz and Kathy Darroch (NTID/College Operations, Special Access Services) Project: "Integrated

More information

Interpreter Preparation (IPP) IPP 101 ASL/Non-IPP Majors. 4 Hours. Prerequisites: None. 4 hours weekly (3-1)

Interpreter Preparation (IPP) IPP 101 ASL/Non-IPP Majors. 4 Hours. Prerequisites: None. 4 hours weekly (3-1) Interpreter Preparation (IPP) IPP 101 ASL/Non-IPP Majors 4 hours weekly (3-1) This course is designed for students who have no knowledge of American Sign Language. The focus of this course will be on developing

More information

An Evaluation of RGB-D Skeleton Tracking for Use in Large Vocabulary Complex Gesture Recognition

An Evaluation of RGB-D Skeleton Tracking for Use in Large Vocabulary Complex Gesture Recognition An Evaluation of RGB-D Skeleton Tracking for Use in Large Vocabulary Complex Gesture Recognition Christopher Conly, Zhong Zhang, and Vassilis Athitsos Department of Computer Science and Engineering University

More information

Interpreter Preparation (IPP) IPP 101 ASL/Non-IPP Majors. 4 Hours. Prerequisites: None. 4 hours weekly (3-1)

Interpreter Preparation (IPP) IPP 101 ASL/Non-IPP Majors. 4 Hours. Prerequisites: None. 4 hours weekly (3-1) Interpreter Preparation (IPP) IPP 101 ASL/Non-IPP Majors Prerequisites: None 4 hours weekly (3-1) This course is designed for students who have no knowledge of American Sign Language. The focus of this

More information

Can Generic Neural Networks Estimate Numerosity Like Humans?

Can Generic Neural Networks Estimate Numerosity Like Humans? Can Generic Neural Networks Estimate Numerosity Like Humans? Sharon Y. Chen (syc2138@columbia.edu) 1, Zhenglong Zhou (zzhou34@jhu.edu) 2, Mengting Fang (mtfang@mail.bnu.edu.cn) 3, and James L. McClelland

More information

1. INTRODUCTION. Vision based Multi-feature HGR Algorithms for HCI using ISL Page 1

1. INTRODUCTION. Vision based Multi-feature HGR Algorithms for HCI using ISL Page 1 1. INTRODUCTION Sign language interpretation is one of the HCI applications where hand gesture plays important role for communication. This chapter discusses sign language interpretation system with present

More information

Exploring the Structure and Function of Brain Networks

Exploring the Structure and Function of Brain Networks Exploring the Structure and Function of Brain Networks IAP 2006, September 20, 2006 Yoonsuck Choe Brain Networks Laboratory Department of Computer Science Texas A&M University choe@tamu.edu, http://research.cs.tamu.edu/bnl

More information

SignInstructor: An Effective Tool for Sign Language Vocabulary Learning

SignInstructor: An Effective Tool for Sign Language Vocabulary Learning SignInstructor: An Effective Tool for Sign Language Vocabulary Learning Xiujuan Chai, Zhuang Liu, Yongjun Li, Fang Yin, Xilin Chen Key Lab of Intelligent Information Processing of Chinese Academy of Sciences(CAS),

More information

DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS

DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS SEOUL Oct.7, 2016 DEEP LEARNING BASED VISION-TO-LANGUAGE APPLICATIONS: CAPTIONING OF PHOTO STREAMS, VIDEOS, AND ONLINE POSTS Gunhee Kim Computer Science and Engineering Seoul National University October

More information

A Novel Capsule Neural Network Based Model For Drowsiness Detection Using Electroencephalography Signals

A Novel Capsule Neural Network Based Model For Drowsiness Detection Using Electroencephalography Signals A Novel Capsule Neural Network Based Model For Drowsiness Detection Using Electroencephalography Signals Luis Guarda Bräuning (1) Nicolas Astorga (1) Enrique López Droguett (1) Marcio Moura (2) Marcelo

More information

Detection and Recognition of Sign Language Protocol using Motion Sensing Device

Detection and Recognition of Sign Language Protocol using Motion Sensing Device Detection and Recognition of Sign Language Protocol using Motion Sensing Device Rita Tse ritatse@ipm.edu.mo AoXuan Li P130851@ipm.edu.mo Zachary Chui MPI-QMUL Information Systems Research Centre zacharychui@gmail.com

More information

CHARACTERISTICS OF STUDENTS WHO ARE: DEAF OR HARD OF HEARING

CHARACTERISTICS OF STUDENTS WHO ARE: DEAF OR HARD OF HEARING CHARACTERISTICS OF STUDENTS WHO ARE: DEAF OR HARD OF HEARING 1. In General: An estimated twenty one million Americans have some degree of hearing loss, mild to severe. Of the 60,000+ students identified

More information

Rumor Detection on Twitter with Tree-structured Recursive Neural Networks

Rumor Detection on Twitter with Tree-structured Recursive Neural Networks 1 Rumor Detection on Twitter with Tree-structured Recursive Neural Networks Jing Ma 1, Wei Gao 2, Kam-Fai Wong 1,3 1 The Chinese University of Hong Kong 2 Victoria University of Wellington, New Zealand

More information

Your Next Personal Trainer: Instant Evaluation of Exercise Form Brandon Garcia, Russell Kaplan, Aditya Viswanathan

Your Next Personal Trainer: Instant Evaluation of Exercise Form Brandon Garcia, Russell Kaplan, Aditya Viswanathan Your Next Personal Trainer: Instant Evaluation of Exercise Form Brandon Garcia, Russell Kaplan, Aditya Viswanathan Introduction Maintenance of a healthy lifestyle depends on regular exercise. However,

More information

A Deep Learning Approach to Identify Diabetes

A Deep Learning Approach to Identify Diabetes , pp.44-49 http://dx.doi.org/10.14257/astl.2017.145.09 A Deep Learning Approach to Identify Diabetes Sushant Ramesh, Ronnie D. Caytiles* and N.Ch.S.N Iyengar** School of Computer Science and Engineering

More information

Automatic Irish Sign Language Recognition

Automatic Irish Sign Language Recognition Automatic Irish Sign Language Recognition Irene Hernández B.Sc. A Dissertation Presented to the University of Dublin, Trinity College in partial fulfilment of the requirements for the degree of Master

More information

Object recognition and hierarchical computation

Object recognition and hierarchical computation Object recognition and hierarchical computation Challenges in object recognition. Fukushima s Neocognitron View-based representations of objects Poggio s HMAX Forward and Feedback in visual hierarchy Hierarchical

More information

Quality Assessment of Human Hand Posture Recognition System Er. ManjinderKaur M.Tech Scholar GIMET Amritsar, Department of CSE

Quality Assessment of Human Hand Posture Recognition System Er. ManjinderKaur M.Tech Scholar GIMET Amritsar, Department of CSE Quality Assessment of Human Hand Posture Recognition System Er. ManjinderKaur M.Tech Scholar GIMET Amritsar, Department of CSE mkwahla@gmail.com Astt. Prof. Prabhjit Singh Assistant Professor, Department

More information

Hierarchical Convolutional Features for Visual Tracking

Hierarchical Convolutional Features for Visual Tracking Hierarchical Convolutional Features for Visual Tracking Chao Ma Jia-Bin Huang Xiaokang Yang Ming-Husan Yang SJTU UIUC SJTU UC Merced ICCV 2015 Background Given the initial state (position and scale), estimate

More information

Two Themes. MobileASL: Making Cell Phones Accessible to the Deaf Community. Our goal: Challenges: Current Technology for Deaf People (text) ASL

Two Themes. MobileASL: Making Cell Phones Accessible to the Deaf Community. Our goal: Challenges: Current Technology for Deaf People (text) ASL Two Themes MobileASL: Making Cell Phones Accessible to the Deaf Community MobileASL AccessComputing Alliance Advancing Deaf and Hard of Hearing in Computing Richard Ladner University of Washington ASL

More information

PIB Ch. 18 Sequence Memory for Prediction, Inference, and Behavior. Jeff Hawkins, Dileep George, and Jamie Niemasik Presented by Jiseob Kim

PIB Ch. 18 Sequence Memory for Prediction, Inference, and Behavior. Jeff Hawkins, Dileep George, and Jamie Niemasik Presented by Jiseob Kim PIB Ch. 18 Sequence Memory for Prediction, Inference, and Behavior Jeff Hawkins, Dileep George, and Jamie Niemasik Presented by Jiseob Kim Quiz Briefly describe the neural activities of minicolumn in the

More information

Characterization of 3D Gestural Data on Sign Language by Extraction of Joint Kinematics

Characterization of 3D Gestural Data on Sign Language by Extraction of Joint Kinematics Human Journals Research Article October 2017 Vol.:7, Issue:4 All rights are reserved by Newman Lau Characterization of 3D Gestural Data on Sign Language by Extraction of Joint Kinematics Keywords: hand

More information

An approach for Brazilian Sign Language (BSL) recognition based on facial expression and k-nn classifier

An approach for Brazilian Sign Language (BSL) recognition based on facial expression and k-nn classifier An approach for Brazilian Sign Language (BSL) recognition based on facial expression and k-nn classifier Tamires Martins Rezende 1 Cristiano Leite de Castro 1 Sílvia Grasiella Moreira Almeida 2 1 The Electrical

More information

Deep Learning for Computer Vision

Deep Learning for Computer Vision Deep Learning for Computer Vision Lecture 12: Time Sequence Data, Recurrent Neural Networks (RNNs), Long Short-Term Memories (s), and Image Captioning Peter Belhumeur Computer Science Columbia University

More information

Neuromorphic convolutional recurrent neural network for road safety or safety near the road

Neuromorphic convolutional recurrent neural network for road safety or safety near the road Neuromorphic convolutional recurrent neural network for road safety or safety near the road WOO-SUP HAN 1, IL SONG HAN 2 1 ODIGA, London, U.K. 2 Korea Advanced Institute of Science and Technology, Daejeon,

More information

CS-E Deep Learning Session 4: Convolutional Networks

CS-E Deep Learning Session 4: Convolutional Networks CS-E4050 - Deep Learning Session 4: Convolutional Networks Jyri Kivinen Aalto University 23 September 2015 Credits: Thanks to Tapani Raiko for slides material. CS-E4050 - Deep Learning Session 4: Convolutional

More information

Convolutional Neural Networks for Text Classification

Convolutional Neural Networks for Text Classification Convolutional Neural Networks for Text Classification Sebastian Sierra MindLab Research Group July 1, 2016 ebastian Sierra (MindLab Research Group) NLP Summer Class July 1, 2016 1 / 32 Outline 1 What is

More information

arxiv: v1 [cs.hc] 20 Feb 2014

arxiv: v1 [cs.hc] 20 Feb 2014 arxiv:1402.5047v1 [cs.hc] 20 Feb 2014 Real-time Automatic Emotion Recognition from Body Gestures Stefano Piana stefano.piana@dist.unige.it Francesca Odone francesca.odone@unige.it ABSTRACT Although psychological

More information

Externalization of Cognition: from local brains to the Global Brain. Clément Vidal, Global Brain Institute

Externalization of Cognition: from local brains to the Global Brain. Clément Vidal, Global Brain Institute Externalization of Cognition: from local brains to the Global Brain Clément Vidal, Global Brain Institute clement.vidal@philosophons.com 1 Introduction Humans use tools. create, use and refine tools. extends

More information

Recognizing American Sign Language Gestures from within Continuous Videos

Recognizing American Sign Language Gestures from within Continuous Videos Recognizing American Sign Language Gestures from within Continuous Videos Yuancheng Ye 1, Yingli Tian 1,2,, Matt Huenerfauth 3, and Jingya Liu 2 1 The Graduate Center, City University of New York, NY,

More information

A Real-time Gesture Recognition System for Isolated Swedish Sign Language Signs

A Real-time Gesture Recognition System for Isolated Swedish Sign Language Signs A Real-time Gesture Recognition System for Isolated Swedish Sign Language Signs Kalin Stefanov KTH Royal Institute of Technology TMH Speech, Music and Hearing Stockholm, Sweden kalins@kth.se Jonas Beskow

More information

Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and

Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and private study only. The thesis may not be reproduced elsewhere

More information

Edge Based Grid Super-Imposition for Crowd Emotion Recognition

Edge Based Grid Super-Imposition for Crowd Emotion Recognition Edge Based Grid Super-Imposition for Crowd Emotion Recognition Amol S Patwardhan 1 1Senior Researcher, VIT, University of Mumbai, 400037, India ---------------------------------------------------------------------***---------------------------------------------------------------------

More information

What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for

What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for What you re in for Speech processing schemes for cochlear implants Stuart Rosen Professor of Speech and Hearing Science Speech, Hearing and Phonetic Sciences Division of Psychology & Language Sciences

More information

HOW AI WILL IMPACT SUBTITLE PRODUCTION

HOW AI WILL IMPACT SUBTITLE PRODUCTION HOW AI WILL IMPACT SUBTITLE PRODUCTION 1 WHAT IS AI IN A BROADCAST CONTEXT? 2 What is AI in a Broadcast Context? I m sorry, Dave. I m afraid I can t do that. Image By Cryteria [CC BY 3.0 (https://creativecommons.org/licenses/by/3.0)],

More information

Multimodal monitoring of Parkinson s and Alzheimer s patients using the ICT4LIFE platform

Multimodal monitoring of Parkinson s and Alzheimer s patients using the ICT4LIFE platform Multimodal monitoring of Parkinson s and Alzheimer s patients using the ICT4LIFE platform Federico Alvarez 1, Mirela Popa 2, Nicholas Vretos 3, Alberto Belmonte-Hernández 1, Stelios Asteriadis 2, Vassilis

More information

Neuro-Inspired Statistical. Rensselaer Polytechnic Institute National Science Foundation

Neuro-Inspired Statistical. Rensselaer Polytechnic Institute National Science Foundation Neuro-Inspired Statistical Pi Prior Model lfor Robust Visual Inference Qiang Ji Rensselaer Polytechnic Institute National Science Foundation 1 Status of Computer Vision CV has been an active area for over

More information

REPORT. PROJECT: TRANS2WORK - School-to-Work Transition for Higher education students with disabilities in Serbia, Bosnia & Herzegovina and Montenegro

REPORT. PROJECT: TRANS2WORK - School-to-Work Transition for Higher education students with disabilities in Serbia, Bosnia & Herzegovina and Montenegro REPORT PROJECT: TRANS2WORK - School-to-Work Transition for Higher education students with disabilities in Serbia, Bosnia & Herzegovina and Montenegro Document title: Report of dissemination at the World

More information

Available online at ScienceDirect. Procedia Technology 24 (2016 )

Available online at   ScienceDirect. Procedia Technology 24 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Technology 24 (2016 ) 1068 1073 International Conference on Emerging Trends in Engineering, Science and Technology (ICETEST - 2015) Improving

More information

A Cascaded Speech to Arabic Sign Language Machine Translator using Adaptation

A Cascaded Speech to Arabic Sign Language Machine Translator using Adaptation A Cascaded Speech to Sign Language Machine Translator using Adaptation Shereen A. Mohamed Department of Mathematics and Computer Science, Faculty of Science, Alexandria University, Mohamed A. Abdou Informatics

More information

Multi-attention Guided Activation Propagation in CNNs

Multi-attention Guided Activation Propagation in CNNs Multi-attention Guided Activation Propagation in CNNs Xiangteng He and Yuxin Peng (B) Institute of Computer Science and Technology, Peking University, Beijing, China pengyuxin@pku.edu.cn Abstract. CNNs

More information

A Randomized- Controlled Trial of Foundations for Literacy

A Randomized- Controlled Trial of Foundations for Literacy A Randomized- Controlled Trial of Foundations for Literacy Dr. Stacey Tucci Georgia Pathway July 2018 Texas Statewide Conference on Education of the Deaf (SWCED) Funded by US Dept of Education Institute

More information

MULTI-CHANNEL COMMUNICATION

MULTI-CHANNEL COMMUNICATION INTRODUCTION Research on the Deaf Brain is beginning to provide a new evidence base for policy and practice in relation to intervention with deaf children. This talk outlines the multi-channel nature of

More information

Sign Language Recognition using Convolutional Neural Networks

Sign Language Recognition using Convolutional Neural Networks Sign Language Recognition using Convolutional Neural Networks Lionel Pigou, Sander Dieleman, Pieter-Jan Kindermans, Benjamin Schrauwen Ghent University, ELIS, Belgium Abstract. There is an undeniable communication

More information

Hand Gesture Recognition: Sign to Voice System (S2V)

Hand Gesture Recognition: Sign to Voice System (S2V) Hand Gesture Recognition: Sign to Voice System (S2V) Oi Mean Foong, Tan Jung Low, and Satrio Wibowo Abstract Hand gesture is one of the typical methods used in sign language for non-verbal communication

More information

Indian Sign Language Alpha-Numeric Character Classification using Neural Network

Indian Sign Language Alpha-Numeric Character Classification using Neural Network International Journal of Latest Research in Engineering and Technology (IJLRET) ISSN: 2454-531 Volume 2 - Issue 6 June 216 PP. 1-8 Indian Sign Language Alpha-Numeric Character Classification using Neural

More information

ABSTRACT I. INTRODUCTION

ABSTRACT I. INTRODUCTION International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2017 IJSRCSEIT Volume 2 Issue 5 ISSN : 2456-3307 An Innovative Artificial Replacement to Facilitate

More information

Lions Hearing Center Of Michigan & Greater Metro Detroit Lions Club Deborah Love-Peel Scholarship For Deaf / Hard of Hearing Students

Lions Hearing Center Of Michigan & Greater Metro Detroit Lions Club Deborah Love-Peel Scholarship For Deaf / Hard of Hearing Students For Deaf / Hard of Hearing Full Name: Address: City: State: Zip Code: Phone/VP/text phone: E-mail: @ Alternate contact: Name of University: Major: Minor (Opt): Current grade level: freshman, sophomore,

More information

Member 1 Member 2 Member 3 Member 4 Full Name Krithee Sirisith Pichai Sodsai Thanasunn

Member 1 Member 2 Member 3 Member 4 Full Name Krithee Sirisith Pichai Sodsai Thanasunn Microsoft Imagine Cup 2010 Thailand Software Design Round 1 Project Proposal Template PROJECT PROPOSAL DUE: 31 Jan 2010 To Submit to proposal: Register at www.imaginecup.com; select to compete in Software

More information

Learning Period 3: 10/28-11/22

Learning Period 3: 10/28-11/22 Class: American Sign Language Instructor: Sarah Macedo Grade Level: 9 th -12 th Email: macedo_sarah@yahoo.com Location: Murrieta Learning Center Day/Time: Wednesday 2:00-3:00 (NOTE: we do not meet the

More information

Assessment: Course Four Column SPRING/SUMMER 2015

Assessment: Course Four Column SPRING/SUMMER 2015 Assessment: Course Four Column SPRING/SUMMER 2015 El Camino: s (HSA) - Sign Language Interpreter Training ECC: SLAN 101:Individualized American Sign Language Laboratory SLO #2 Vocabulary - Students will

More information

Collaborative Project of the 7th Framework Programme. WP6: Tools for bio-researchers and clinicians

Collaborative Project of the 7th Framework Programme. WP6: Tools for bio-researchers and clinicians G.A. nº 270086 Collaborative Project of the 7th Framework Programme WP6: Tools for bio-researchers and clinicians Deliverable 6.1: Design of Inference Engine v.1.0 31/10/2011 www.synergy-copd.eu Document

More information

A Comparison of Deaf, Hard of Hearing, and Hearing Young Adults Responses to a Health Risk Behavior Survey

A Comparison of Deaf, Hard of Hearing, and Hearing Young Adults Responses to a Health Risk Behavior Survey A Comparison of Deaf, Hard of Hearing, and Hearing Young Adults Responses to a Health Risk Behavior Survey Tamala David, MPA, MS, FNP Doctoral Student University of Rochester School of Nursing ASPH/CDC/PRC

More information

Glossary of Inclusion Terminology

Glossary of Inclusion Terminology Glossary of Inclusion Terminology Accessible A general term used to describe something that can be easily accessed or used by people with disabilities. Alternate Formats Alternate formats enable access

More information

Computational Cognitive Neuroscience

Computational Cognitive Neuroscience Computational Cognitive Neuroscience Computational Cognitive Neuroscience Computational Cognitive Neuroscience *Computer vision, *Pattern recognition, *Classification, *Picking the relevant information

More information