Performance Comparison of Speech Enhancement Algorithms Using Different Parameters

Similar documents
Speech Enhancement, Human Auditory system, Digital hearing aid, Noise maskers, Tone maskers, Signal to Noise ratio, Mean Square Error

Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phase Components

CHAPTER 1 INTRODUCTION

Adaptation of Classification Model for Improving Speech Intelligibility in Noise

Near-End Perception Enhancement using Dominant Frequency Extraction

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED

Echo Canceller with Noise Reduction Provides Comfortable Hands-free Telecommunication in Noisy Environments

Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation

FIR filter bank design for Audiogram Matching

Speech Enhancement Using Deep Neural Network

Fig. 1 High level block diagram of the binary mask algorithm.[1]

UvA-DARE (Digital Academic Repository) Perceptual evaluation of noise reduction in hearing aids Brons, I. Link to publication

SUPPRESSION OF MUSICAL NOISE IN ENHANCED SPEECH USING PRE-IMAGE ITERATIONS. Christina Leitner and Franz Pernkopf

Novel Speech Signal Enhancement Techniques for Tamil Speech Recognition using RLS Adaptive Filtering and Dual Tree Complex Wavelet Transform

International Journal of Scientific & Engineering Research, Volume 5, Issue 3, March ISSN

Combination of Bone-Conducted Speech with Air-Conducted Speech Changing Cut-Off Frequency

CURRENTLY, the most accurate method for evaluating

Speech Enhancement Based on Deep Neural Networks

Effects of Cochlear Hearing Loss on the Benefits of Ideal Binary Masking

Acoustic Signal Processing Based on Deep Neural Networks

ReSound NoiseTracker II

Hybrid Masking Algorithm for Universal Hearing Aid System

ADVANCES in NATURAL and APPLIED SCIENCES

NOISE REDUCTION ALGORITHMS FOR BETTER SPEECH UNDERSTANDING IN COCHLEAR IMPLANTATION- A SURVEY

Advanced Audio Interface for Phonetic Speech. Recognition in a High Noise Environment

Research Article The Acoustic and Peceptual Effects of Series and Parallel Processing

Extraction of Unwanted Noise in Electrocardiogram (ECG) Signals Using Discrete Wavelet Transformation

GENERALIZATION OF SUPERVISED LEARNING FOR BINARY MASK ESTIMATION

Codebook driven short-term predictor parameter estimation for speech enhancement

HCS 7367 Speech Perception

Speech quality evaluation of a sparse coding shrinkage noise reduction algorithm with normal hearing and hearing impaired listeners

Noise-Robust Speech Recognition in a Car Environment Based on the Acoustic Features of Car Interior Noise

LATERAL INHIBITION MECHANISM IN COMPUTATIONAL AUDITORY MODEL AND IT'S APPLICATION IN ROBUST SPEECH RECOGNITION

[Solanki*, 5(1): January, 2016] ISSN: (I2OR), Publication Impact Factor: 3.785

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds

SPEECH TO TEXT CONVERTER USING GAUSSIAN MIXTURE MODEL(GMM)

Speech recognition in noisy environments: A survey

HybridMaskingAlgorithmforUniversalHearingAidSystem. Hybrid Masking Algorithm for Universal Hearing Aid System

Modulation and Top-Down Processing in Audition

HCS 7367 Speech Perception

IMPROVING CHANNEL SELECTION OF SOUND CODING ALGORITHMS IN COCHLEAR IMPLANTS. Hussnain Ali, Feng Hong, John H. L. Hansen, and Emily Tobey

Binaural Integrated Active Noise Control and Noise Reduction in Hearing Aids

EEL 6586, Project - Hearing Aids algorithms

PCA Enhanced Kalman Filter for ECG Denoising

A Single-Channel Noise Reduction Algorithm for Cochlear-Implant Users

An active unpleasantness control system for indoor noise based on auditory masking

Modern Speech Enhancement Techniques in Text-Independent Speaker Verification

The role of periodicity in the perception of masked speech with simulated and real cochlear implants

BINAURAL DICHOTIC PRESENTATION FOR MODERATE BILATERAL SENSORINEURAL HEARING-IMPAIRED

USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES

White Paper: micon Directivity and Directional Speech Enhancement

A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPLANTS USING KALMAN WITH DRNL AND SSB TECHNIQUE

Chapter 4. MATLAB Implementation and Performance Evaluation of Transform Domain Methods

The Removal of Environmental Noise in Cellular Communications by Perceptual Techniques.

Sonic Spotlight. SmartCompress. Advancing compression technology into the future

Juan Carlos Tejero-Calado 1, Janet C. Rutledge 2, and Peggy B. Nelson 3

Implementation of Spectral Maxima Sound processing for cochlear. implants by using Bark scale Frequency band partition

Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners

The importance of phase in speech enhancement

Challenges in microphone array processing for hearing aids. Volkmar Hamacher Siemens Audiological Engineering Group Erlangen, Germany

The Use of a High Frequency Emphasis Microphone for Musicians Published on Monday, 09 February :50

What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for

Enhancement of Reverberant Speech Using LP Residual Signal

International Journal of Innovative Research in Advanced Engineering (IJIRAE) Volume 1 Issue 10 (November 2014)

FPGA IMPLEMENTATION OF COMB FILTER FOR AUDIBILITY ENHANCEMENT OF HEARING IMPAIRED

Speech Enhancement Using Temporal Masking in the FFT Domain

Voice Detection using Speech Energy Maximization and Silence Feature Normalization

Noise Cancellation using Adaptive Filters Algorithms

Parametric Optimization and Analysis of Adaptive Equalization Algorithms for Noisy Speech Signals

SRIRAM GANAPATHY. Indian Institute of Science, Phone: +91-(80) Bangalore, India, Fax: +91-(80)

Single channel noise reduction in hearing aids

Linguistic Phonetics Fall 2005

Gray Scale Image Edge Detection and Reconstruction Using Stationary Wavelet Transform In High Density Noise Values

Price list Tools for Research & Development

Perceptual Effects of Nasal Cue Modification

Issues faced by people with a Sensorineural Hearing Loss

Infant Hearing Development: Translating Research Findings into Clinical Practice. Auditory Development. Overview

Psychoacoustical Models WS 2016/17

Linguistic Phonetics. Basic Audition. Diagram of the inner ear removed due to copyright restrictions.

FREQUENCY. Prof Dr. Mona Mourad Dr.Manal Elbanna Doaa Elmoazen ALEXANDRIA UNIVERSITY. Background

2/16/2012. Fitting Current Amplification Technology on Infants and Children. Preselection Issues & Procedures

A study of adaptive WDRC in hearing aids under noisy conditions

Slow compression for people with severe to profound hearing loss

Robust Neural Encoding of Speech in Human Auditory Cortex

Various Methods To Detect Respiration Rate From ECG Using LabVIEW

Computational Perception /785. Auditory Scene Analysis

SPEECH ENHANCEMENT. Chunjian Li Aalborg University, Denmark

Perceptual Evaluation of Signal Enhancement Strategies

Performance evaluation of the various edge detectors and filters for the noisy IR images

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening

Investigating the effects of noise-estimation errors in simulated cochlear implant speech intelligibility

Quick detection of QRS complexes and R-waves using a wavelet transform and K-means clustering

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors

Investigating the effects of noise-estimation errors in simulated cochlear implant speech intelligibility

ADVANCES in NATURAL and APPLIED SCIENCES

Development of novel algorithm by combining Wavelet based Enhanced Canny edge Detection and Adaptive Filtering Method for Human Emotion Recognition

Speech Compression for Noise-Corrupted Thai Dialects

Evidence base for hearing aid features:

A Brain Computer Interface System For Auto Piloting Wheelchair

In-Ear Microphone Equalization Exploiting an Active Noise Control. Abstract

Transcription:

Performance Comparison of Speech Enhancement Algorithms Using Different Parameters Ambalika, Er. Sonia Saini Abstract In speech communication system, background noise degrades the information or speech signal. For minimizing the effect of background noise different speech enhancement techniques are used. Some speech enhancement techniques are Spectral Subtraction, Wiener Filtering, Two Step Decision Directed Approach, Perceptual Decision Directed Approach. This paper includes the study of these s, and compare the performance using different parameters i.e., signal to noise ratio(snr), Peak Signal to Noise Ratio(PSNR), Mean Square Error(MSE), Normalized Root Mean Square Error(NRMSE). From the results we conclude that the performance of PDD approach is better than other described s. Index Terms Speech enhancement, Decision Directed Approach, Noise Masking Threshold, SNR, PSNR, MSE. I. INTRODUCTION Speech plays a major role in speech communication. There are various type of noise present in the environment that degrades the signal. Background noise is a natural part of a conversation. As a result, speech becomes noisy signal. There is a need to improve the quality of speech signal in noisy conditions by developing speech enhancement s to minimize the effect of the background noise. Over the past year, many speech enhancement techniques are developed for this purpose, and these are modified as per the requirement. Some schemes are attempted to reduce the effect of musical residual noise by human auditory system. This auditory system is based on the fact that the human ear cannot perceive residual noise when this level falls below the noise masking threshold(nmt). Only the audible noise components are removed, this results in the reduction of speech distortion[1]. Manuscript received July, 2016. First Author name- Ambalika M.Tech student in electronics and communication department of Seth Jai Parkash Mukand Lal Institute of Engineering & Technology. Second Author name- Er. Sonia Saini Lecturer in electronics and communication of Seth Jai Parkash Mukand Lal Institute of Engineering & Technology. II. SPEECH ENHANCEMENT TECHNIQUES A. Spectral Subtraction Algorithm The spectral subtraction techniques is the most common and widely used method due to its simplicity and easy to implement. In this technique, estimate the magnitude of noise spectrum and subtract it from the magnitude of noisy spectrum in the absence of speech signal, when only noise is present. The subtraction process needs to be done carefully to avoid speech distortion. If too much is subtracted, then some speech information might be removed, whereas if too short is subtracted, then much of the interfering noise may remains[2]. Noisy Speech FFT Y(w)-N(w) IFFT N(w) Noise Spectrum (estimate Fig1. Spectral subtraction technique Enhanced Signal B. Wiener Filtering The wiener filter is same as the spectral subtraction in the way that it is derived and makes an attempt to reduce the mean-square error in the frequency domain. These filters involve linear estimation of a desired signal sequence from another related sequence. This method is widely used in the field of signal processing. Based on different application requirements, a wiener filter is designed to enhance or improve the signal for that very desired frequency response. In this method, the spectral properties of the original signal and noise should be known before the actual processing [3]. The gain function of WF [4] is given by 1932

Noisy Speech y(n) Estimate Ps(w) Estimate Pn(w) Enhanced signal x(n) Fig 2. Wiener filtering C. Two-Step Decision Directed Algorithm The decision-directed method is better able to minimize the effect of musical residual noise, it introduces a frame delay appeared from the interpolation for estimating the a priori SNR. Therefore, a decision-directed method is performed again to enhance the estimated a priori SNR by eliminating the frame delay. These procedures develop a two step decision directed (TSDD) [1]. The gain function of TSDD is given as: Fig.3 Clean and noisy speech signal Where is the posteriori SNR, and is the gain factor used to estimate a priori SNR[1]. D. Perceptual Decision Directed Approach Fig.4 PDD Output Signal This technique is modified version of decision directed approach. In this, the main objective is to calculate noise masking threshold(nmt), which is further used to estimate the perceptual gain factor. The spectral estimate of a speech signal multiplying a perceptual gain factor noisy spectrum Y(m, w)[5]. is obtained by with the Where Fig.5 Wiener Filter Output Signal III. EXPERIMENTAL RESULTS The above mentioned techniques of speech enhancement were applied to the noisy speech input are shown below using cellular noise: Fig.6 TSDD Output Signal 1933

Fig.7 Spectral Subtraction Output Signal Spectrogram: The spectrogram is a graphical display of the power spectrum of speech as a function of time. The spectrogram of these s are shown below: Fig.10 Spectrogram of TSDD Fig.8 Spectrogram of perceptual decision directed Fig.11 Spectrogram of Spectral subtraction IV. PERFORMANCE EVALUATION To test the performance of proposed speech enhancement system, the objective quality measurement tests, signal-to-noise ratio(snr), peak signal-to-noise ratio(psnr), mean square error(mse), normalized mean square error(nrmse) are used. Table 1 represents the performance comparison of s using cellular noise and fig.12 shows the graph for the evaluation of the s. Fig.9 Spectrogram of wiener filtering 1934

PDD 0.365 1.214 1.040 1.570 Wiener 0.317 1.486 1.042 1.556 TSDD 0.152 1.073 1.043 1.663 0.240 1.126 1.037 1.624 Fig.13- Graph for the evaluation of s Table 3 shows the performance comparison of s using babble noise and fig.14 shows the graph for the evaluation of the s. Table 3: Parameters measure for babble noise signal PDD 6.173 1.215 0.514 1.021 Wiener 3.289 1.214 0.516 1.023 Fig.12 Graph for the evaluation of s TSDD 1.321 1.213 0.517 1.024 Table 2 shows the performance comparison of s using F16-cockpit noise and fig.13 shows the graph for the evaluation of the s. Table 2: Parameters measure for F16-cockpit noise PDD 6.436 8.366 0.526 0.873 Wiener 2.566 8.361 0.526 0.874 TSDD 3.121 8.359 0.527 0.875 1.346 1.213 0.519 1.026 4.647 8.364 0.527 0.875 Fig.14- Graph for the evaluation of s V. CONCLUSION It can be seen from the performance parameters that spectral subtraction method is better for many applications and easy to implement only for stationary noise, but there are some 1935

drawbacks of spectral subtraction technique. Wiener filtering is used to provide optimal performance and reduce the mean square error. TSDD method based on decision directed approach which is used to estimate the a priori SNR and it is used twice to reduce frame delay and for better estimation. Perceptual decision directed technique is the modified version of TSDD and it is based on human auditory system. It gives the much better results than the above described s. Perceptual decision directed(pdd) is used to improve the perceptual gain factor using noise masking threshold. Based on the analysis of the results we concluded that all these s performed well according to the type of signal on different parameters. Based on SNR, PDD is much better than other s. A higher PSNR generally indicates that the reconstruction is of higher quality, in some cases it may not. PSNR should be greater and MSE must be minimize for better estimation of the signal. REFERENCES [12] Teddy Surya Gunawan, Perceptual Speech Enhancement Exploiting Temporal Masking Properties of Human Auditory System, Science Direct of Speech Communication, Vol. 52, pp. 381-393 (2010). [13] Philipos C. Loizou, Israel Cohen, Special issue on Speech Enhancement, Science Direct of Speech Communication, Vol. 49, pp 527-529 (2007). [14] Yi Hu, Philipos C. Loizou, Subjective comparison and evaluation of speech enhancement s, Science Direct of Speech Communication, Vol. 49, pp. 588-601 (2007). [15] [15] Nathalie Virag, Single Channel Speech Enhancement Based on Masking Properties of the Human Auditory System, IEEE Transactions on Speech and Audio Processing, Vol. 7, No. 2, pp. 126-137, March 1999. First Author Ambalika M.Tech student in electronics and communication department of Seth Jai Parkash Mukand Lal Institute of Engineering & Technology. Second Author Er. Sonia Saini Lecturer in electronics and communication of Seth Jai Parkash Mukand Lal Institute of Engineering & Technology. [1] Ching-Ta Lu, Enhancement of Single Channel Speech Using Perceptual Decision Directed Approach, Science Direct of Speech Communication, Vol 53, pp 495-507, 2011. [2] Philips C. Loizou, Speech Enhancement: Theory and Practice, 1 st edition, Boca Raton, FL.: CRC, 2007. [3] Jimish Dodia, Darshana Gowda, GUI Based Performance Analysis of Speech Enhancement Techniques, International Journal of Scientific and Research Publications, Volume 3, Issue 9, pp.1-7, September 2013. [4] Navneet Upadhyay, Abhijit Karmakar, Speech Enhancement using Spectral Subtraction-type Algorithms: A Comparison and Simulation Study, Science Direct, Eleventh International Multi-Conference on Information Processing-2015, vol. 54 pp. 574-584. [5] Deepa Dhanaskodi, Shanmugam Arumugam, Speech Enhancement Algorithm Using Sub band Two Step Decision Directed Approach with Adaptive Weighting factor and Noise Masking Threshold, Journal of Computer Science vol. 6, pp. 941-948, 2011. [6] Philipos C. Loizou, Gibak Kim, Reasons why Current Speech-Enhancement Algorithms do not Improve Speech Intelligibility and Suggested Solutions, IEEE Transactions on Audio, Speech, and Language Processing, Vol. 19, Issue 1, pp. 47-56, January 2011. [7] Anuradha R. Fukane, Shashikant L. Sahare, Noise estimation Algorithms for Speech Enhancement in highly non-stationary Environments, IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 2, pp. 39-44, March 2011. [8] Premananda B S and Dr. Uma B V, Speech Enhancement Algorithm to Reduce the Effect of Background Noise in Mobile Phones, International Journal of Wireless & Mobile Networks (IJWMN) Vol. 5, Issue 1, pp. 177-189, February 2013. [9] Milind U. Nemade1, Prof. Satish K. Shah, Performance Comparison of Single Channel Speech Enhancement Techniques for Personal Communication, International Journal of Innovative Research in Computer and Communication Engineering Vol. 1, Issue 1, pp. 67-76, March 2013. [10] Miss. Anuja Chougule, Dr. Mrs. V. V. Patil, Survey of Noise Estimation Algorithms for Speech Enhancement Using Spectral Subtraction, International Journal on Recent and Innovation Trends in Computing and Communication, Volume: 2 Issue-12, pp. 4156-4160, December 2014. [11] Yang Lu, Philipos C. Loizou, A geometric approach to spectral subtraction, Science Direct, Speech Communication, Vol. 50, pp. 453-466, January 2008. 1936