Implementation of Binary Mask Algorithm for Noise Reduction in Traffic Environment

Ms. Ankita A. Kotecha 1, Prof. Y. A. Sadawarte 2
1 M-Tech Scholar, Department of Electronics Engineering, B. D. College of Engineering, Maharashtra, India
2 Professor, Department of Electronics Engineering, B. D. College of Engineering, Maharashtra, India

Abstract: In this paper, we present a real-time implementation of the binary masking algorithm, which has been shown to significantly reduce noise and improve speech-in-noise intelligibility. The main goal is to implement the binary mask algorithm for noise reduction in a traffic environment. We tested our implementation on an FPGA platform, and a software simulation in MATLAB verified that it effectively reduces the noise: the measured noise reduction in the signal is 65%. The implementation also minimizes the propagation delay, down to 0.01 ms.

Index Terms: Binary mask, real-time systems, speech enhancement, cochlear implants (CIs), noise reduction, FIR filters.

I. INTRODUCTION

In a noisy environment, such as a busy street, a crowded restaurant, or heavy traffic, the normal human hearing system is able to discern and comprehend speech despite the multitude of intruding, competing sounds [1], [2]. Unfortunately, the ability to cope with such traffic noise is greatly diminished in individuals with sensorineural hearing loss [3], [4]. Current speech enhancement algorithms improve speech quality, but not necessarily intelligibility [8]. While hearing-impaired listeners do benefit from improved speech quality, communication problems persist if intelligibility is not improved. The ideal binary mask is one algorithm specifically shown to improve speech intelligibility.
In this paper, we describe our real-time implementation of the ideal binary mask (IBM) algorithm, realized in hardware on a field-programmable gate array (FPGA). The low latency, low power, and improved speech quality of our implementation make it practical for use in hearing aids and cochlear implants. Along with the binary masking algorithm, we use FIR filters, which can filter out up to 65% of the noise and improve the speech quality.

II. IDEAL BINARY MASKING ALGORITHM

A. Overview

The idea behind the binary mask (BM) algorithm is as follows. Speech is sparse in the time-frequency domain. If the noise is also sparse in this domain, then it very likely does not overlap with the speech. It is therefore easy to remove the noisy regions of the time-frequency plane by applying the appropriate binary mask, which leaves intact, noise-free speech. The algorithm can also be effective even if the noise is not sparse in the time-frequency domain: the overall signal-to-noise ratio (SNR) of the speech can be greatly improved by discarding those regions of the time-frequency plane whose SNR fails to exceed a specified threshold.

A practical implementation of the BM algorithm generally has three stages: spectral analysis, classification, and synthesis, as shown in Fig. 1.

Fig. 1 High level block diagram of the binary mask algorithm. [1]

In the first stage, spectral analysis, the fast Fourier transform (FFT) or a filter bank maps the original, noisy signal from the time domain to the time-frequency (TF) domain. In the second stage, classification, each TF unit is identified as belonging either to class 1 (clean speech, a.k.a. "target") or to class 0 (noise). In the third stage, synthesis, the TF-domain version of the original, noisy signal is multiplied by the binary mask, effectively removing all of the noise-containing portions of the signal. After the binary mask is applied, the TF units are recombined to form a speech signal that is clean (or at least of higher SNR than before).

B. Hardware module

Fig. 2 Hardware model of the project

Fig. 2 shows the hardware module of the project, which consists of an FPGA kit, a microphone, headphones, amplifiers, and a power supply section. The masking of the data is performed on the FPGA, which is a feasible platform for implementing the binary masking algorithm. The microphone supplies the audio signal to the circuit, and that signal is fed to the FPGA. The amplifier section amplifies the speech signal, and the headphones receive the output signal.

C. Flow chart of BM algorithm

The flow chart of the BM algorithm illustrates how the data is masked. The binary masking algorithm is efficient at masking the noise. After binary masking has been performed, the masked data is passed to the filter section.
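The three stages described in the overview (spectral analysis, classification, synthesis) can be illustrated with a minimal software sketch. This is not the authors' FPGA design: the function names (`dft`, `idft`, `frames`, `ideal_binary_mask`), the non-overlapping 32-sample frames, and the 0 dB threshold are all assumptions chosen for illustration. The sketch also assumes separate access to the clean speech and the noise, which is what makes the mask "ideal" and is normally available only in simulation.

```python
import cmath
import math

def dft(frame):
    """Naive discrete Fourier transform of one frame (O(N^2), fine for a sketch)."""
    n = len(frame)
    return [sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]

def idft(spectrum):
    """Inverse DFT; keeps only the real part because the inputs are real signals."""
    n = len(spectrum)
    return [(sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                 for k in range(n)) / n).real
            for t in range(n)]

def frames(signal, size):
    """Split a signal into non-overlapping frames, dropping any incomplete tail."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, size)]

def ideal_binary_mask(speech, noise, size=32, threshold_db=0.0):
    """Return (mask, enhanced) for a speech + noise mixture.

    mask[f][k] is 1 when time-frequency unit (frame f, bin k) is classified
    as target, and 0 when it is classified as noise.
    """
    mask, enhanced = [], []
    for s_frame, n_frame in zip(frames(speech, size), frames(noise, size)):
        # Stage 1: spectral analysis. The DFT is linear, so the spectrum of
        # the mixture is the sum of the speech and noise spectra.
        S, N = dft(s_frame), dft(n_frame)
        mixture = [a + b for a, b in zip(S, N)]
        # Stage 2: classification by local SNR against the threshold.
        row = []
        for s_bin, n_bin in zip(S, N):
            # The small constant avoids division by zero in empty bins.
            ratio = (abs(s_bin) ** 2 + 1e-12) / (abs(n_bin) ** 2 + 1e-12)
            row.append(1 if 10 * math.log10(ratio) > threshold_db else 0)
        mask.append(row)
        # Stage 3: synthesis. Apply the mask, then return to the time domain.
        enhanced.extend(idft([m * x for m, x in zip(row, mixture)]))
    return mask, enhanced
```

Calling `ideal_binary_mask(speech, noise)` with two equal-length lists of samples returns the per-frame mask and the resynthesized signal. A practical system would more likely use overlapping windowed frames and an FFT, and would have to estimate the mask from the noisy mixture alone.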

D. FIR filter

Fig. 3 Simple averaging FIR filter

Filtering is the process of adjusting a signal, for example by removing noise. Noise in a sound waveform appears as small but frequent changes in the amplitude of the signal. A simple logic circuit that performs noise filtering is an averaging finite impulse response (FIR) filter, whose schematic is shown in Fig. 3. An averaging filter removes noise from a sound by averaging the values of adjacent samples. In this particular case, it smooths out small deviations in the sound by averaging over 8 adjacent samples. With a low-quality microphone, this filter should remove the noise produced when speaking into the microphone, making the voice sound clearer.

E. Software simulation and results

Fig. 4 Software simulation of Module
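As a software illustration of the 8-sample averaging FIR filter from Section D, the following minimal sketch computes y[n] = (x[n] + x[n-1] + ... + x[n-7]) / 8. The function name `moving_average`, the equal tap weights, and the zero initial state (samples before the start of the signal are treated as 0) are assumptions, not details taken from the schematic in Fig. 3.

```python
def moving_average(samples, taps=8):
    """Causal averaging FIR filter with equal weights of 1/taps.

    Samples before the start of the signal are treated as zero, so the
    first (taps - 1) outputs ramp up from a partially filled window.
    """
    out = []
    window_sum = 0.0
    for n, x in enumerate(samples):
        window_sum += x                       # newest sample enters the window
        if n >= taps:
            window_sum -= samples[n - taps]   # oldest sample leaves the window
        out.append(window_sum / taps)
    return out
```

A constant input passes through unchanged once the window is full, while a rapidly alternating input such as +1, -1, +1, -1 averages to zero, which is the smoothing behavior described above. In fixed-point hardware, dividing by 8 can be done with a 3-bit right shift, one reason a power-of-two filter length is convenient.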

Fig. 5 (a) Noise present in the input waveform, (b) noise present in the output waveform

Fig. 4 shows the software simulation of the model, which was carried out in MATLAB. The results of the design are shown in Fig. 5, where the noise visible in the input waveform has been removed in the output waveform. Binary masking is feasible on the FPGA, and it significantly improves speech quality by reducing the noise.

III. CONCLUSION

The binary masking algorithm is an effective way to improve speech-in-noise intelligibility. This paper presented a real-time implementation of the binary masking algorithm for noise reduction in a traffic environment. The binary mask algorithm was found to significantly improve speech intelligibility and to minimize the algorithm latency. The algorithm can also be effective even if the noise is not sparse in the time-frequency domain: by discarding those regions of the time-frequency plane whose SNR fails to exceed a specified threshold, the overall signal-to-noise ratio (SNR) of the speech can be greatly improved. The main aims of this design are to improve speech intelligibility by reducing the noise and to minimize the algorithm latency. The design reduces the noise by up to 65%, which shows that implementing the IBM on an FPGA is feasible.

REFERENCES

[1] D. S. Brungart, P. S. Chang, B. D. Simpson, and D. Wang, "Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation," J. Acoust. Soc. Amer., vol. 120, no. 6, pp. 4007-4018, Dec. 2006.
[2] N. Madhu, A. Spriet, S. Jansen, R. Koning, and J. Wouters, "The potential for speech intelligibility improvement using the ideal binary mask and the ideal Wiener filter in single channel noise reduction systems: Application to auditory prostheses," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 1, pp. 63-72, Jan. 2013.
[3] N. Li and P. C. Loizou, "Effect of spectral resolution on the intelligibility of ideal binary masked speech," J. Acoust. Soc. Amer., vol. 123, no. 4, pp. 64, Apr. 2008.
[4] Z. Jin and D. Wang, "A supervised learning approach to monaural segregation of reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 625-638, May 2009.
[5] W. Cole, "Group Delay and Processing Delay in Hearing Aids: the Facts and the Fantasies," Audioscan, Dorchester, ON, Canada, Tech. Rep., 2005.
[6] Texas Instruments, "TMS320C5535, C5534, C5533, C5532 Fixed-Point Digital Signal Processors," Aug. 2011, datasheet, Rev. Mar. 2012.
[7] H. Peeters, C.-C. Lau, and F. Kuk, "Speech-in-noise potential of hearing aids with extended bandwidth," Hear. Rev., vol. 18, no. 3, pp. 28-36, Mar. 2011.
[8] J. Amores, N. Sebe, P. Radeva, T. Gevers, and A. Smeulders, "Boosting Contextual Information in Content-Based Image Retrieval," Proc. Workshop on Multimedia Information Retrieval, in conjunction with ACM Multimedia, 2004.
[9] J. Assfalg, A. Del Bimbo, and P. Pala, "Three-Dimensional Interfaces for Querying by Example in Content-Based Image Retrieval," IEEE Trans. Visualization and Computer Graphics, 8(4):305-318, 2002.
[10] K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D. M. Blei, and M. I. Jordan, "Matching Words and Pictures," Journal of Machine Learning Research, 3:1107-1135, 2003.