Implementation of Spectral Maxima Sound processing for cochlear. implants by using Bark scale Frequency band partition

Similar documents
What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair

University of Arkansas at Little Rock. remaining auditory neurons directly.

An Auditory-Model-Based Electrical Stimulation Strategy Incorporating Tonal Information for Cochlear Implant

Who are cochlear implants for?

A neural network model for optimizing vowel recognition by cochlear implant listeners

SOLUTIONS Homework #3. Introduction to Engineering in Medicine and Biology ECEN 1001 Due Tues. 9/30/03

Design and Implementation of Speech Processing in Cochlear Implant

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair

Bioscience in the 21st century

Hearing the Universal Language: Music and Cochlear Implants

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening

ADVANCES in NATURAL and APPLIED SCIENCES

COM3502/4502/6502 SPEECH PROCESSING

Coding Strategies for Cochlear Implants Under Adverse Environments

International Journal of Information Technology Convergence and Services (IJITCS) Vol.2, No.5, October 2012

Signals, systems, acoustics and the ear. Week 5. The peripheral auditory system: The ear as a signal processor

1. Human auditory pathway.

A NOVEL METHOD FOR OBTAINING A BETTER QUALITY SPEECH SIGNAL FOR COCHLEAR IMPLANTS USING KALMAN WITH DRNL AND SSB TECHNIQUE

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds

Simulations of high-frequency vocoder on Mandarin speech recognition for acoustic hearing preserved cochlear implant

Modern cochlear implants provide two strategies for coding speech

Hearing Aids. Bernycia Askew

EEL 6586, Project - Hearing Aids algorithms

! Can hear whistle? ! Where are we on course map? ! What we did in lab last week. ! Psychoacoustics

752 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 51, NO. 5, MAY N. Lan*, K. B. Nie, S. K. Gao, and F. G. Zeng

Fig. 1 High level block diagram of the binary mask algorithm.[1]

HARDWARE SIMULATION OF SPEECH PROCESSOR FOR COCHLEAR IMPLANT

The Auditory Brainstem Implant. Manchester Royal Infirmary

Chapter 11: Sound, The Auditory System, and Pitch Perception

Speech (Sound) Processing

IMPROVING CHANNEL SELECTION OF SOUND CODING ALGORITHMS IN COCHLEAR IMPLANTS. Hussnain Ali, Feng Hong, John H. L. Hansen, and Emily Tobey

Cochlear Implant The only hope for severely Deaf

International Journal of Scientific & Engineering Research, Volume 5, Issue 3, March ISSN

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors

Simulation of an electro-acoustic implant (EAS) with a hybrid vocoder

Quick Guide - eabr with Eclipse

Adaptive dynamic range compression for improving envelope-based speech perception: Implications for cochlear implants

Hearing. PSYCHOLOGY (8th Edition, in Modules) David Myers. Module 14. Hearing. Hearing

Hearing. istockphoto/thinkstock

Exploring the parameter space of Cochlear Implant Processors for consonant and vowel recognition rates using normal hearing listeners

Acoustics, signals & systems for audiology. Psychoacoustics of hearing impairment

Hearing Sound. The Human Auditory System. The Outer Ear. Music 170: The Ear

Music 170: The Ear. Tamara Smyth, Department of Music, University of California, San Diego (UCSD) November 17, 2016

Sound and Hearing. Decibels. Frequency Coding & Localization 1. Everything is vibration. The universe is made of waves.

HEARING AND PSYCHOACOUSTICS

A. SEK, E. SKRODZKA, E. OZIMEK and A. WICHER

Michael Dorman Department of Speech and Hearing Science, Arizona State University, Tempe, Arizona 85287

FIR filter bank design for Audiogram Matching

Differential-Rate Sound Processing for Cochlear Implants

REVISED. The effect of reduced dynamic range on speech understanding: Implications for patients with cochlear implants

J Jeffress model, 3, 66ff

Discrete Signal Processing

Chapter 3. Sounds, Signals, and Studio Acoustics

Auditory Physiology Richard M. Costanzo, Ph.D.

History of Cochlear Implants and Auditory Brainstem Implants

PSY 215 Lecture 10 Topic: Hearing Chapter 7, pages

ENT 318 Artificial Organs Physiology of Ear

Monitoring of Cochlear Implants

whether or not the fundamental is actually present.

Spectrograms (revisited)

A truly remarkable aspect of human hearing is the vast

Speech conveys not only linguistic content but. Vocal Emotion Recognition by Normal-Hearing Listeners and Cochlear Implant Users

Mechanical Properties of the Cochlea. Reading: Yost Ch. 7

HCS 7367 Speech Perception

Cochlear implants. Carol De Filippo Viet Nam Teacher Education Institute June 2010

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED

Design and Implementation of Programmable Hearing Aid Using FPAA

Can You Hear Me Now?

1- Cochlear Impedance Telemetry

Glossary For Parents. Atresia: closure of the ear canal or absence of an ear opening.

Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear

Can You Hear Me Now? Learning Objectives 10/9/2013. Hearing Impairment and Deafness in the USA

CHAPTER 1 INTRODUCTION

ClaroTM Digital Perception ProcessingTM

College of Medicine Dept. of Medical physics Physics of ear and hearing /CH

DESIGN OF SMART HEARING AID

PSY 214 Lecture 16 (11/09/2011) (Sound, auditory system & pitch perception) Dr. Achtman PSY 214

Cochlear Implants. What is a Cochlear Implant (CI)? Audiological Rehabilitation SPA 4321

What is sound? Range of Human Hearing. Sound Waveforms. Speech Acoustics 5/14/2016. The Ear. Threshold of Hearing Weighting

CONVENTIONAL AND DIGITAL HEARING AIDS

Deafness and hearing impairment

Modulation and Top-Down Processing in Audition

An Auditory System Modeling in Sound Source Localization

AUDL GS08 and GAV1: 2013 Final exam page 1/13. You must complete all sections. Label all graphs. Show your work!

Implant Subjects. Jill M. Desmond. Department of Electrical and Computer Engineering Duke University. Approved: Leslie M. Collins, Supervisor

Brad May, PhD Johns Hopkins University

Removal of Baseline wander and detection of QRS complex using wavelets

Required Slide. Session Objectives

ABSTRACT. Møller A (ed): Cochlear and Brainstem Implants. Adv Otorhinolaryngol. Basel, Karger, 2006, vol 64, pp

Auditory System & Hearing

LIST OF FIGURES. Figure No. Title Page No. Fig. l. l Fig. l.2

International Journal of Informative & Futuristic Research ISSN:

Advanced Audio Interface for Phonetic Speech. Recognition in a High Noise Environment

research directions Cochlear implant G.M.CLARK FREQUENCY CODING ELECTRICAL RATE STIMULATION - PHYSIOLOGY AND PSYCHOPHYSICS Department ofotolaryngology

Temporal Analysis and Remote Monitoring of ECG Signal

Chapter 40 Effects of Peripheral Tuning on the Auditory Nerve s Representation of Speech Envelope and Temporal Fine Structure Cues

THE MECHANICS OF HEARING

Hearing. Juan P Bello

Transcription:

Implementation of Spectral Maxima Sound processing for cochlear implants by using Bark scale Frequency band partition Han xianhua 1 Nie Kaibao 1 1 Department of Information Science and Engineering, Shandong University, Jinan, China 250100 E_mail: hanxhua@ee.sdu.edu.cn Abstract A new method on the basis of Bark scale frequency-band partition was presented to improve the recognition performance of cochlear implants. In the nature of physics, it consists with human s cochlea filter properties. Also the mechanism of a cochlear implant and its spectral maxima sound processing (SMSP) strategy were presented. The time -frequency analyzing property of human's cochlea was analyzed. The theoretical determination of Bark scale frequency-band was also given with formula. Simulation results using Bark scale transform signal processing were discussed in detail and results show that the new method is feasible in speech processors for cochlear implants. Key words cochlear implant, Bark scale, SMSP strategy, speech signal processing I. INTRODUCTION Cochlear Implant is a hearing device restoring hearing ability to the deaf with electricity stimulation. It mainly acts as imitating the physiological function of peripheral acoustical nerve such as outer, middle and inner ear of normal people. The nucleus of surrounding acoustical system is cochlea in inner ear. Cochlea is a snail-shaped organ that translates sound energy into nerve impulses and then sends them to the brain for processing. Sound is transmitted in the form of traveling wave, so cochlea can be taken as theoretically that it is composed of a group of space-distributed band-filters, and the same long basilar membrane (about 1.5mm) in cochlea is equivalent to a filter of same bandwidth called Bark scale domain [1,2]. Multi-channel cochlear implant is designed based on cochlea s filtering properties for speech signal. Speech signal-processor deals with speech signal in segment band, and produces simulating signal of corresponding electrodes to excite different part acoustical nerves in cochlear. SMSP strategy is one of signal processing algorithms that was developed in the early 1990s cochlear implant. It divides speech signal into sub-bands using 16 band filters, and then detects out envelope information of each band signal and selects 6 or 8 maxima spectral to produce electrical stimulation current. In SMSP algorithm, the bandwidth of filters usually is logarithm-law or octave-law relation. However, this frequency-band partition of filters is not completely consistent with the nature of physics which is inherent in human ear. So the speech recognition performance of SMSP strategy should be improved. According to above analysis, this paper discusses the application of Bark scale transform in speech signal-processing of cochlear implants. We hope that the results would provide reference to the implementation of speech signal digital processing in cochlear implants, and at the same time, the relevant conclusion would be helpful for improving the speech recognizing ability of the deaf. SMSP STRATEGY A The mechanism of cochleae implant When sound signal transmits in the form of traveling wave in cochlea, the amplitude of its high frequency band is the strongest in cochlea s basal end, and the amplitude of its low frequency-band is the strongest in of cochlea s apical end. So the cochlea is a mechanical-frequency analyzer in the space.: This project is supported by the National Natural Science Foundation of China.(No.30000041). 0-7803-7211-5/01$10.00 2001 IEEE 1

Report Documentation Page Report Date 25 Oct 2001 Report Type N/A Dates Covered (from... to) - Title and Subtitle Implementation of Spectral Maxima Sound Processing for Cochlear Implants by Using Bark Scale Frequency Band Partition

Pre- amp The basal end of cochlea accepts high-frequency signal and the apical end accepts low-frequency signal. Acoustical frequency between the apical end and the basal end along the basilar membrane ranges from 16Hz to 20kHz Physiological experiments give further evidence that the frequency-analyzing feature of cochlea is similar to that of the same bandwidth filters in Bark scale domain. The frequency bandwidth of each filter is approximately equal to the length of basilar membrane in cochlea which includes about 1200 pieces of nerve fiber. Cochlear implant is the imitation of normal hearing man s cochlea. In artificial cochlea, a group of electrodes is implanted into cochlea s scala tympani along the basilar membrane. The speech processor outside of body produces electricity-stimulating signals corresponding as electrodes. The implanted circuit of generating stimulating-electricity current inner body produces stimulating-pulses. These pulses activate intact auditory nerves in cochlea. Then the exciting of auditory nerves near all electrodes is transmitted to speech center of pallium to induce hearing perceptual ability, and so restore primary language communication ability for the deaf. B Spectral maxima sound processing strategy in cochlear implants Unlike previous speech signal-processing algorithm developed for the Nucleus implant, SMSP strategy did not extract any features (e.g., F1, F2) from the speech waveform. Instead, it analyzed the speech signal a bank of 16 bandpass filters and a spectral maxima detector. Figure 1 shows the block Filter BPF1 BPF2 BPF16 Rect/ Maxima compression Selection Compress Select 6 or 8 Largest Ampli- -tude Figure1 The schematic of SMSP strategy Selected Amplitude Data Encoden RF diagram of the SMSP strategy in cochlea implant. In SMSP strategy, signal from the microphone is firstly pre-amplified and then sent through a bank of 16 bandpass filters with center frequencies ranging from 200 to 5,000 Hz. The output of each filter is rectified and low-pass filtered with cutoff frequency of 200 Hz. After computing all 16 filter outputs, the SMSP strategy selects, at certain time intervals, 6 or 8 largest filter outputs. The 6 or 8 amplitudes are finally logarithmically compressed, to fit the patient s electrical dynamic range, and after being modulated by bipolar pulse-sequences transmitted to the 6 or 8 selctected electrodes through a radio-frequency link. The term maxima refers to the largest filter amplitudes, which are not necessarily the spectral peaks 3 5. Electrical stimulation pulsatile current in SMSP strategy would be gained according as the varied amplitude of selected sub-band signal. The number of electrodes is 16, but only 6 or 8 electrodes is selected to been stimulated [6]. In SMSP strategy, how frequency-band partition should be made is critical for gaining effective parameters of stimulating electrodes, consequently the optimal recognition performance can be obtained. The frequency-bandwidth of filters in figure 1 is usually octave-law relation or logarithm-law relation. These band filters can be implemented in the form of digital or analog filter. At present, the implementation of filtering in cochlea implant is based on digital filter.[7,8]. The band-pass filters with frequency-band partition of speech signal in Bark scale domain consist with human s cochlea filter properties. The stimulating pulses which are derived from envelopes can effectively represent ones produced by nerve fibers on the same long basilar membrane of cochlea. This paper develops a new speech signal-processing method in cochlea implant where frequency-band partition of band-pass filters is based on Bark scale transform. BARK SCALE TRANSFORM AND ITS IMPLEMENTATION IN SMSP STRATEGY A Definition of Bark Scale transform The function of basilar membrane is similar to frequency-spectra analyzer. We can separate frequency of speech signal ranging from 20 to 2

16,000Hz into 24 frequency groups (critical band), and this partition of frequency groups is consistent with division of basilar membrane into many same long small parts. Each small part of basilar membrane is corresponding to a frequency group. The sound of these frequency on the same segment of basilar membrane may be evaluated as a whole by adding them together. So we can say that the corresponding relation between perceptual frequency of auditory organ and actual frequency of sound signal is nonlinearly mapped. So the concept of Bark scale can be introduced. Traunmullar presented the function relation between linear frequency and Bark scale frequency, i.e.: b=6.7asinh [ ( f 20 ) / 600 ] or b = 26.81f / (1960 + f ) Where b denotes Bark scale frequency frequency. Their relation is showed in figure 2. f is linear B the implementation of SMSP strategy with Bark scale transform In the nature of physics, the frequency-band partition in Bark scale domain in SMSP strategy consists with human s cochlea filter properties. So the band division of Bark scale domain can replace the partition of logarithm-law or octave-law in band-pass filters of SMSP algorithm. In experiments, firstly signal from the microphone is pre-amplified and then sent through a bank of 16 bandpass filters with Bark scale transform. Secondly the output of each filter is all-wave rectified and low-pass filtered with cutoff frequency of 200 Hz. After computing all 16 filtering outputs, the SMSP strategy selects, at certain intervals, 6 or 8 largest filtering outputs. The cosine wave whose frequency is center frequency corresponding to selected band-pass filter modulates Pre- amp Filter BPF1 BPF2 BPF16 Figure 3 The schematic of SMSP strategy using Bark scale band transform envelope signal. Finally the modulated signal after being added together directly drives loudhailer to sound. Figure 3 shows the block diagram of the SMSP strategy in this paper. In experiments, the performance of synthesized speech signal is estimated by normal people, so this algorithm takes no account of amplitude dynamic-range and omits compressing function. In SMSP strategy, the number of electrodes is 16. However Bark scale domain can be divided into 24 frequency-groups(clinical band) between 20 and 16,000Hz, so it s necessary that a few clinical bands should be combined together as bandwidth of band-pass filter according to sampling frequency of speech signal. Rect/ Maxima Selection Select 6 or 8 Largest Ampli- -tude Data Encoden Synthesized speech signal In this paper, we carried out experiments on 6 or 8 selected channels of SMSP strategies with Bark scale transform, and the center frequencies of corresponding band-pass filters. Table 1 shows center frequency, serial number of Bark scale and frequency-band in SMSP strategy. The sampling frequency of speech signal is 11,025Hz RESULTS AND DISCUSSION Figure2 The relation of Bark clinical band frequency band and linear frequency In our experiment, the input speech signal of Time Good (Fs=11.025kHz 8bit) is analyzed and processed with SMSP strategy using Bark scale transform, and then cosine signal whose frequency is the center frequency of corresponding selected band-pass filters modulates processed signal. Finally modulated signal is added to form synthesized speech signal. The frequency-cut of low-pass filter which 3

extracts envelope signal is 200Hz. Band-pass filters and low-pass demodulating filter are designed in the form of elliptical properties under Matlab environment. band-passed signal is all-wave rectified, i.e.: the negative amplitude value is reversed and the positive is not changed. In experiment, the original speech signal of Time and Good was processed with 6 or 8 selected channels presented SMSP strategy and the experimental results were obtained. In this paper, we only displays simulating results of 8 selected channels SMSP strategy. Figure 4 (a) shows original wave of Time and Good (b) shows synthesized speech wave processed with SMSP strategy. Figure 5 shows rectified envelope signal of Time with 8 selected maxima spectral in certain interval filtered with 16 bandpass elliptical filters. The center frequency and bandwidth of all band-pass filters are showed in table 1. We see from figure 4 that the synthesized speech signal using SMSP strategy with Bark scale transform basically is consistent with original signal. normal people can identify the sound which loudhailer utter with synthesized signal. In this signal-processing strategy, the combination of Bark scale serial numbers or a Bark scale forms bandwidth of band-pass filters. Envelope information of different band is obtain by filtering speech signal, and synthesized signal basicallyrepresents original speech. At the same time, this paper present a new method of frequency-band partition for SMSP strategy, and the speech signal-processing in cochlea implant consists with the physiological nature of cochlea. (a) Original Speech signal (b) synthesized signal figure 4 Original Speech signal and synthesized signal( Time and Good ) TABLE 1 The center frequency of bandpass filter in SMSP strategy and serial numbers of Bark scale domain center frequ- Frequency serial number of -ency Hz range Hz Bark scale domain 200 20 400 1 4 450 400 510 5 570 510 630 6 700 630 770 7 840 770 920 8 1000 920 1080 9 1170 1080 1270 10 1370 1270 1480 11 1600 1480 1720 12 1850 1720 2000 13 2150 2000 2320 14 2500 2320 2700 15 2900 2700 3150 16 3400 3150 3700 17 4000 3700 4400 18 5000 4400 5520 19 Figure 5 The processed 16 channels envelope signal of original speech time CONCLUSION This paper discusses the method of frequency-band partition in Bark scale domain is adopted in speech 4

signal-processing strategy of cochlea implant, and presents a new design scheme for cochlea implant. The experimental result shows that the frequency-band partition replacement of octave-law or logarithm-law in SMSP strategy with Bark scale domain is feasible. This frequency-band partition consists with human s cochlea filter properties. In this paper, we processed speech signal with SMSP strat egy under Matlab. The compressing function for adjusting envelope amplitude is taken out. Envelope signal is directly modulated and then drives loudhailer to sound after added together. Results show that the sound which synthesized signal drive loudhailer to emit can be identified in principle by normal people. REFERENCE [1] Yang Xingjun. Speech signal digital processing. Beijing,1995.8 Chapter 3. [2] FU Qing, Yi Ke-chu. " Bark Wavelet Transform of Speech and Its Application in Speech Recognition". Acta Electronica Sinica, 2000 10 vol 28 No 10 102 105 [3] Wilson B S, Lawson D T, Zerbi M, et al. New processing strategies in cochlear implantation. The American Journal of Otology, 1995, 16: 669 675 [4] wilson B S, Finley C C, Lawson D T, et al. Better speech recognition with cochlear implants. Nature, 1991, 352: 236 238 [5] Wilson B S, Finley C C, Lawson D T, et al. Design and evaluation of continuous interleaved sampling (CIS) processing strategy for multichannel cochlear implants. J of Rehab Research and development, 1993, 30: 110 116 [6] Loizou, P. (1999). "Introduction to cochlear implants," IEEE Engineering in Medicine and Biology Magazine,18(1), 32-42. [7] McDermott H J, Vandali A E, Richard J M, et al. A portable programmable digital sound processor for cochlear implant research. IEEE trans on Reha Eng, 1993, 194 100 [8] Fontaine R, Alary A, Mouine J, et al. A programmable speech processor for profoundly deaf people. The 17th Annual Conf of IEEE EMB. IEEE EMBS: Montreal, Canda, 1995 5