A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPLANTS USING KALMAN WITH DRNL AND SSB TECHNIQUE

Similar documents
An Auditory-Model-Based Electrical Stimulation Strategy Incorporating Tonal Information for Cochlear Implant

Implementation of Spectral Maxima Sound processing for cochlear. implants by using Bark scale Frequency band partition

What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair

Who are cochlear implants for?

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening

ADVANCES in NATURAL and APPLIED SCIENCES

IMPROVING CHANNEL SELECTION OF SOUND CODING ALGORITHMS IN COCHLEAR IMPLANTS. Hussnain Ali, Feng Hong, John H. L. Hansen, and Emily Tobey

Representation of sound in the auditory nerve

J Jeffress model, 3, 66ff

Hearing the Universal Language: Music and Cochlear Implants

Brad May, PhD Johns Hopkins University

CHAPTER 1 INTRODUCTION

A neural network model for optimizing vowel recognition by cochlear implant listeners

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors

ADVANCES in NATURAL and APPLIED SCIENCES

Design and Implementation of Speech Processing in Cochlear Implant

A Consumer-friendly Recap of the HLAA 2018 Research Symposium: Listening in Noise Webinar

University of Arkansas at Little Rock. remaining auditory neurons directly.

Acoustics, signals & systems for audiology. Psychoacoustics of hearing impairment

Modern cochlear implants provide two strategies for coding speech

FIR filter bank design for Audiogram Matching

Sonic Spotlight. SmartCompress. Advancing compression technology into the future

2/25/2013. Context Effect on Suprasegmental Cues. Supresegmental Cues. Pitch Contour Identification (PCI) Context Effect with Cochlear Implants

Fig. 1 High level block diagram of the binary mask algorithm.[1]

USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES

Simulations of high-frequency vocoder on Mandarin speech recognition for acoustic hearing preserved cochlear implant

Hearing. istockphoto/thinkstock

International Journal of Scientific & Engineering Research, Volume 5, Issue 3, March ISSN

The role of periodicity in the perception of masked speech with simulated and real cochlear implants

Acoustics Research Institute

The REAL Story on Spectral Resolution How Does Spectral Resolution Impact Everyday Hearing?

EEL 6586, Project - Hearing Aids algorithms

SOLUTIONS Homework #3. Introduction to Engineering in Medicine and Biology ECEN 1001 Due Tues. 9/30/03

Computational Perception /785. Auditory Scene Analysis

Implant Subjects. Jill M. Desmond. Department of Electrical and Computer Engineering Duke University. Approved: Leslie M. Collins, Supervisor

Hearing. PSYCHOLOGY (8th Edition, in Modules) David Myers. Module 14. Hearing. Hearing

Adaptation of Classification Model for Improving Speech Intelligibility in Noise

Juan Carlos Tejero-Calado 1, Janet C. Rutledge 2, and Peggy B. Nelson 3

Peter S Roland M.D. UTSouthwestern Medical Center Dallas, Texas Developments

Topic 4. Pitch & Frequency

Proceedings of Meetings on Acoustics

REVISED. The effect of reduced dynamic range on speech understanding: Implications for patients with cochlear implants

Discrete Signal Processing

1. Human auditory pathway.

Signals, systems, acoustics and the ear. Week 5. The peripheral auditory system: The ear as a signal processor

Speech conveys not only linguistic content but. Vocal Emotion Recognition by Normal-Hearing Listeners and Cochlear Implant Users

Development of novel algorithm by combining Wavelet based Enhanced Canny edge Detection and Adaptive Filtering Method for Human Emotion Recognition

BINAURAL DICHOTIC PRESENTATION FOR MODERATE BILATERAL SENSORINEURAL HEARING-IMPAIRED

Differential-Rate Sound Processing for Cochlear Implants

Benefit of Advanced Directional Microphones for Bilateral Cochlear Implant Users

An Auditory System Modeling in Sound Source Localization

Effects of Cochlear Hearing Loss on the Benefits of Ideal Binary Masking

Chapter 40 Effects of Peripheral Tuning on the Auditory Nerve s Representation of Speech Envelope and Temporal Fine Structure Cues

! Can hear whistle? ! Where are we on course map? ! What we did in lab last week. ! Psychoacoustics

Coding Strategies for Cochlear Implants Under Adverse Environments

HCS 7367 Speech Perception

Simulation of an electro-acoustic implant (EAS) with a hybrid vocoder

International Journal of Informative & Futuristic Research ISSN:

PSY 215 Lecture 10 Topic: Hearing Chapter 7, pages

The effect of wearing conventional and level-dependent hearing protectors on speech production in noise and quiet

Auditory gist perception and attention

Auditory Physiology PSY 310 Greg Francis. Lecture 29. Hearing

PSY 310: Sensory and Perceptual Processes 1

A TEMPORAL MODEL OF FREQUENCY DISCRIMINATION IN ELECTRIC HEARING

Auditory nerve. Amanda M. Lauer, Ph.D. Dept. of Otolaryngology-HNS

Temporal Adaptation. In a Silicon Auditory Nerve. John Lazzaro. CS Division UC Berkeley 571 Evans Hall Berkeley, CA

Role of F0 differences in source segregation

LANGUAGE IN INDIA Strength for Today and Bright Hope for Tomorrow Volume 10 : 6 June 2010 ISSN

Robust Neural Encoding of Speech in Human Auditory Cortex

Series Preface. Preface. Acknowledgments. Abbreviations

Electric and Acoustic Stimulation in the Same Ear

Advanced Audio Interface for Phonetic Speech. Recognition in a High Noise Environment

BORDERLINE PATIENTS AND THE BRIDGE BETWEEN HEARING AIDS AND COCHLEAR IMPLANTS

Chapter 1: Introduction to digital audio

Cochlear Implant The only hope for severely Deaf

Sonic Spotlight. Binaural Coordination: Making the Connection

Sound and Hearing. Decibels. Frequency Coding & Localization 1. Everything is vibration. The universe is made of waves.

Michael Dorman Department of Speech and Hearing Science, Arizona State University, Tempe, Arizona 85287

Bioscience in the 21st century

Combination of Bone-Conducted Speech with Air-Conducted Speech Changing Cut-Off Frequency

Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear

Hearing in the Environment

Research Article The Acoustic and Peceptual Effects of Series and Parallel Processing

Hearing Screening, Diagnostics and Intervention

Masking release and the contribution of obstruent consonants on speech recognition in noise by cochlear implant users

Providing Effective Communication Access

Topic 4. Pitch & Frequency. (Some slides are adapted from Zhiyao Duan s course slides on Computer Audition and Its Applications in Music)

whether or not the fundamental is actually present.

Speech Enhancement, Human Auditory system, Digital hearing aid, Noise maskers, Tone maskers, Signal to Noise ratio, Mean Square Error

ABSTRACT. Møller A (ed): Cochlear and Brainstem Implants. Adv Otorhinolaryngol. Basel, Karger, 2006, vol 64, pp

Deafness and hearing impairment

Adaptive dynamic range compression for improving envelope-based speech perception: Implications for cochlear implants

Voice Pitch Control Using a Two-Dimensional Tactile Display

A. SEK, E. SKRODZKA, E. OZIMEK and A. WICHER

UvA-DARE (Digital Academic Repository) Perceptual evaluation of noise reduction in hearing aids Brons, I. Link to publication

Performance Comparison of Speech Enhancement Algorithms Using Different Parameters

Transcription:

A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPLANTS USING KALMAN WITH DRNL AND SSB TECHNIQUE Rohini S. Hallikar 1, Uttara Kumari 2 and K Padmaraju 3 1 Department of ECE, R.V. College of Engineering, Bangalore, India rohini_hallikar@yahoo.co.in 2 Department of ECE, R.V. College of Engineering, Bangalore, India 3 JNT University, Kakinada, Andhra Pradesh, India ABSTRACT Cochlear implant devices are known to exist since a long time. The purpose of the present work is to develop a speech algorithm for obtaining robust speech. In this paper, the technique of cochlear implant is first introduced, followed by discussions of some of the existing techniques available for obtaining speech. The next section introduces a new technique for obtaining robust speech. The key feature of this technique lies in the use of the advantages of an integrated approach involving the use of an estimation technique such as a kalman filter with non linear filter bank strategy, using Dual Resonance Non Linear(DRNL) and Single Side Band(SSB) Encoding method. A comparative study of the proposed method with the existing method indicates that the proposed method performs well compared to the existing method. KEYWORDS Cochlear Implants, kalman, Dual Resonance Non Linear (DRNL) 1. INTRODUCTION Cochlear Implants importance lies in the fact that, these devices provide partial hearing for those who are profoundly deaf. Specialized techniques are required for speech processing and this involves the understanding of many disciplines such as signal processing, bioengineering, otolaryngology, physiology, speech science etc. One important objective of these signal processing techniques is, to derive electrical stimuli from the speech signal.[1] For cochlear implants to work satisfactorily, robust auditory nerve fibres is an important requirement. Auditory nerve fibres, once they are stimulated, they impart the associated neural impulses to the brain, which is interpreted as sound by the brain. The loudness of a sound is directly proportional to two important factors, rates of firing and number of nerve fibres that are activated. The implant conveys sound information to the brain. This information also conveys the loudness of the sound which further depends upon amplitude of the stimulus current and pitch of the sound. [2] 2. SIGNAL PROCESSING TECHNIQUES User s hearing potential is maximized by the use of cochlear implant speech coding techniques. Techniques such as Multipeak, spectral maxima, Spectral peak, continuous interleaved sampling (CIS) etc are the examples of existing techniques. Above mentioned techniques do facilitate the cochlear implant users, to obtain their hearing and then communicating in quiet. But these techniques are at a disadvantage when complex listening has to take place. [3] DOI : 10.5121/acij.2012.3409 71

In complex listening, the speech coding technique has to achieve a balance with respect to the quality of the hearing and reduction in the noise level. This arrangement effectively reduces the effect of noise and at the same time gives better signal restoration. Also the use of a nonlinear filter bank model gives a robust formant representation and speech perception in noise. A Dual Resonance non Linear (DRNL) model is comparatively simpler than other adaptive non-linear models of the basilar membrane [4]. To achieve these objectives a kalman filter based on the voice generation model and a DRNL with SSB filter are used. 3. PROPOSED TECHNIQUES The proposed method is shown in figure 1.Input from the microphone is fed to an estimation algorithm such as a kalman filter. The processed signal is given to a preemphasis stage. From the preemphasis stage the signal is fed to a DRNL filter block. Output of DRNL filter is given to a SSB encoder followed by an envelope detection stage to obtain the biphasic pulses. This is followed by the reconstruction of the input signal. The input waveform is then compared with the reconstructed one on the basis of the correlation coefficient. The above processing is done initially considering clean speech and later corrupting the signal with noise. The following representation gives the step by step representation of the proposed method. Step 1 Input from the microphone to be given to kalman filter. Step 2 The output of first step to be preemphasized. Step 3 The preemphasized signal is given to a DRNL filter block. Step 4 Output of the DRNL filter to be given to an SSB encoder Step 5 SSB encoded signal to be given to an envelope detector. Step 6 Get biphasic pulses from the envelope detector. These biphasic pulses are used to obtain the reconstructed signal. 4. RESULTS For various inputs of speech, correlation coefficients of the proposed method along with existing method were computed. The correlation coefficient between the input signal and reconstructed waveform was computed by using (1). r = i (x i - x ) (y i - y ) (1) -------------------------------------- [ i (x i - x ) 2 (y i - y ) 2 ]^(0.5) Where, r is the correlation coefficient between two symbols x and y, with x and y representing their means. Table 1. Comparison of correlation coefficient for two different methods Fs=10KHz for the speech signal news.wav Listening Proposed Method (1) DRNL(2) conditions Quiet 0.8026 0.7888 5 db babble noise 0.4713 0.4377 10 db babble 0.5475 0.4768 noise 72

Table 2. Comparison of correlation coefficient for two different methods Fs=20 KHz for the speech signal news.wav Listening conditions Proposed Method (1) DRNL(2) Quiet 0.7940 0.7658 5 db babble noise 0.4609 0.4136 10 db babble noise 0.5375 0.4562 Correlation 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 1 2 Quiet 5 db babble noise 10 db babble noise Fig 2: The performance of proposed method & DRNL for different listening conditions for news.wav using Fs=10 KHz Correlation Coefficient 0.8 0.6 0.4 0.2 0 1 2 Quiet 5 db babble noise 10 db babble noise Fig 3: The performance of proposed method with and DRNL methods for different listening conditions for news.wav using Fs=20 KHz Table 3: Comparative analysis of the improvement of proposed method over existing method for the speech signal news.wav Fs=10KHz Improvement of correlation coefficient of Proposed method over existing method(%) Quiet 1.749 5 db babble noise 7.6 10 db babble noise 14.828 73

Table 4: Comparative analysis of the improvement of proposed method over existing method for the speech signal news.wav Fs=20 KHz Improvement of correlation coefficient of Proposed method over existing method(%) Quiet 3.682 5 db babble noise 11.43 10 db babble noise 17.82 Table 3 and Table 4 convey the improvement of the proposed method over existing method. The proposed method showed good improvement over existing method for quiet as well as noisy conditions. 5. CONCLUSION The key contribution of this paper is the implementation of a new method which gives a superior performance in quiet and noisy conditions. REFERENCES [1] Philipos C. Loizou Signal-processing Techniques for cochlear Implants. A review of progress in deriving electrical stimuli from the speech signal, IEEE Engineering in medicine and Biology, pp.34-46, May/June 1999. [2] Philipos C. Loizou, Mimicking the human ear, An overview of signal-processing strategies for converting sound into electrical signals in cochlear implants, IEEE signal processing magazine pp 101-130,September 1998. [3] Hongyun Liu, Weidon Wang, Guangrong Liu, Zhengbo Zhang, An Improved Speech Coding Strategy For Cochlear Implants 3 rd International conference on Biomedical Engineering and Informatics, pp.1416-1419,ieee, 2010 [4] Mirko Marracci, Bernardo Tellini, Largo L. Lazzarino, Romano Giannetti., On the use of the Dual Resonance Non Linear filter for speech processing in a Hearing Aid Algorithm, Proceedings of IEEE International workshop on Medical Measurements and Applications, pp 521-524, 2011 [5] Hang Dong,Hong Sun, Speech Enhancement Based On Turbo Iteration, hal-00385393,version 1-19 May 2009 [6] Stephen So, Kamil K. W ojcicki, Kuldip K. Paliwal,, Single-channel speech enhancement using Kalman filtering in the modulation domain,annual Conference Of The International Speech Communication Association Interspeech, pp 993-996, 2010 [7] Strategies for representing speech information with cochlear implants, in cochlear implants: principles and practices, B. S. Wilson, J. K. Niparko, Ed, Lippincott Williams & Wilkins, Philadelphia, Pa, USA,., pp. 129-170,20007 [8] A psychoacoustic NofM -type speech coding strategy for cochlear implants, Waldo Nogueira, Andreas Buchner, Thomas Lenarz, Bernd Edler, published in the EURASIP Journal on Applied Signal Processing volume 30,Issue-5,pp.3044-3059, 2005 [9] A new speech coding for improving the quality of cochlear implant Chen wei-bing; Zhou linghong; Xiao zhong-ju; Chen guang-jie; Wang lin-jing BioMedical Engineering and Informatics, volume-1, pp. 478-482, 2008. [11] Single sideband encoder for music coding in cochlear implants. Kaibao Nie,Les Atlas and Jay Rubinstein,In Proceedings of ICASSP,pp. 4209-4212, 2008 74

Authors Advanced Computing: An International Journal ( ACIJ ), Vol.3, No.4, July 2012 Rohini S.Hallikar Completed B.E. (Electronics) degree from Dr.B.A.M.U Aurangabad, Maharashtra. Completed M.Tech (Digital Electronics & Communication), VTU, Belgaum. Currently working as Assistant Professor in department of Electronics and Communication, R.V.College of Engineering, Bangalore. 560059,.Karnataka, India Dr..M.Uttara Kumari Completed B.Tech(E&C) from Nagarjuna University. Completed M.E.(Signal Processing, Communication) from Bangalore University. Completed PhD from Andra University. Presently, working as the Head of the Department in department of Electronics and ommunication,r.v.college of Engineering, Bangalore. 560059,.Karnataka, India Dr. K. Padma raju Completed B.Tech (E & C) from Nagarjuna University.. Completed M.Tech(Electronic Instrumentation), National Institute of Technology Warangal. Completed PhD from Andra University. Completed Post Doc.Fellowship from Hoseo University South Korea. Currently Principal and Professor of Electronics and Communication Engineering and Jawaharlal Nehru Technological University Kakinada. Kakinada - 533 003, Andhra Pradesh, INDIA 75