Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear

Similar documents
Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair

Who are cochlear implants for?

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair

What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening

Role of F0 differences in source segregation

2/25/2013. Context Effect on Suprasegmental Cues. Supresegmental Cues. Pitch Contour Identification (PCI) Context Effect with Cochlear Implants

The role of periodicity in the perception of masked speech with simulated and real cochlear implants

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds

Hearing the Universal Language: Music and Cochlear Implants

HCS 7367 Speech Perception

HCS 7367 Speech Perception

The development of a modified spectral ripple test

Spectrograms (revisited)

Linguistic Phonetics. Basic Audition. Diagram of the inner ear removed due to copyright restrictions.

Linguistic Phonetics Fall 2005

PLEASE SCROLL DOWN FOR ARTICLE

An Auditory-Model-Based Electrical Stimulation Strategy Incorporating Tonal Information for Cochlear Implant

Simulations of high-frequency vocoder on Mandarin speech recognition for acoustic hearing preserved cochlear implant

Topics in Linguistic Theory: Laboratory Phonology Spring 2007

Acoustics, signals & systems for audiology. Psychoacoustics of hearing impairment

Speech, Hearing and Language: work in progress. Volume 11

REVISED. The effect of reduced dynamic range on speech understanding: Implications for patients with cochlear implants

Topic 4. Pitch & Frequency

Auditory Scene Analysis

Lecture 3: Perception

Issues faced by people with a Sensorineural Hearing Loss

The functional importance of age-related differences in temporal processing

Hearing Research 242 (2008) Contents lists available at ScienceDirect. Hearing Research. journal homepage:

Topic 4. Pitch & Frequency. (Some slides are adapted from Zhiyao Duan s course slides on Computer Audition and Its Applications in Music)

Research Article The Relative Weight of Temporal Envelope Cues in Different Frequency Regions for Mandarin Sentence Recognition

Psychoacoustical Models WS 2016/17

11 Music and Speech Perception

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED

Static and Dynamic Spectral Acuity in Cochlear Implant Listeners for Simple and Speech-like Stimuli

Effects of Cochlear Hearing Loss on the Benefits of Ideal Binary Masking

A dissertation presented to. the faculty of. In partial fulfillment. of the requirements for the degree. Doctor of Philosophy. Ning Zhou.

Auditory scene analysis in humans: Implications for computational implementations.

Mandarin tone recognition in cochlear-implant subjects q

Hearing in the Environment

Consonant Perception test

Representation of sound in the auditory nerve

Spectral-Ripple Resolution Correlates with Speech Reception in Noise in Cochlear Implant Users

Systems Neuroscience Oct. 16, Auditory system. http:

Perception of Spectrally Shifted Speech: Implications for Cochlear Implants

SPHSC 462 HEARING DEVELOPMENT. Overview Review of Hearing Science Introduction

ABSTRACT. Møller A (ed): Cochlear and Brainstem Implants. Adv Otorhinolaryngol. Basel, Karger, 2006, vol 64, pp

Assessing Hearing and Speech Recognition

Twenty subjects (11 females) participated in this study. None of the subjects had

Production of Stop Consonants by Children with Cochlear Implants & Children with Normal Hearing. Danielle Revai University of Wisconsin - Madison

Sound localization psychophysics

Exploring the parameter space of Cochlear Implant Processors for consonant and vowel recognition rates using normal hearing listeners

Binaural Hearing. Why two ears? Definitions

752 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 51, NO. 5, MAY N. Lan*, K. B. Nie, S. K. Gao, and F. G. Zeng

Binaural Hearing. Steve Colburn Boston University

Effects of Age and Hearing Loss on the Processing of Auditory Temporal Fine Structure

Noise Susceptibility of Cochlear Implant Users: The Role of Spectral Resolution and Smearing

The REAL Story on Spectral Resolution How Does Spectral Resolution Impact Everyday Hearing?

9/29/14. Amanda M. Lauer, Dept. of Otolaryngology- HNS. From Signal Detection Theory and Psychophysics, Green & Swets (1966)

EVALUATION OF SPEECH PERCEPTION IN PATIENTS WITH SKI SLOPE HEARING LOSS USING ARABIC CONSTANT SPEECH DISCRIMINATION LISTS

Music Perception in Cochlear Implant Users

Frequency refers to how often something happens. Period refers to the time it takes something to happen.

Peter S Roland M.D. UTSouthwestern Medical Center Dallas, Texas Developments

What is sound? Range of Human Hearing. Sound Waveforms. Speech Acoustics 5/14/2016. The Ear. Threshold of Hearing Weighting

Demonstration of a Novel Speech-Coding Method for Single-Channel Cochlear Stimulation

COM3502/4502/6502 SPEECH PROCESSING

Spectral peak resolution and speech recognition in quiet: Normal hearing, hearing impaired, and cochlear implant listeners

Categorical Perception

The role of temporal fine structure in sound quality perception

Hearing. and other senses

Language Speech. Speech is the preferred modality for language.

Speech (Sound) Processing

PHYS 1240 Sound and Music Professor John Price. Cell Phones off Laptops closed Clickers on Transporter energized

Auditory gist perception and attention

Proceedings of Meetings on Acoustics

ACOUSTIC AND PERCEPTUAL PROPERTIES OF ENGLISH FRICATIVES

SPEECH PERCEPTION IN A 3-D WORLD

Infant Hearing Development: Translating Research Findings into Clinical Practice. Auditory Development. Overview

Chapter 11: Sound, The Auditory System, and Pitch Perception

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

J Jeffress model, 3, 66ff

Chapter 40 Effects of Peripheral Tuning on the Auditory Nerve s Representation of Speech Envelope and Temporal Fine Structure Cues

Does Wernicke's Aphasia necessitate pure word deafness? Or the other way around? Or can they be independent? Or is that completely uncertain yet?

Acoustic cues to tonal contrasts in Mandarin: Implications for cochlear implants

PATTERN ELEMENT HEARING AIDS AND SPEECH ASSESSMENT AND TRAINING Adrian FOURCIN and Evelyn ABBERTON

Hearing. Juan P Bello

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors

Adaptive dynamic range compression for improving envelope-based speech perception: Implications for cochlear implants

Simulation of an electro-acoustic implant (EAS) with a hybrid vocoder

Hearing Research 242 (2008) Contents lists available at ScienceDirect. Hearing Research. journal homepage:

Hearing Research 231 (2007) Research paper

NIH Public Access Author Manuscript J Hear Sci. Author manuscript; available in PMC 2013 December 04.

USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES

Comment by Delgutte and Anna. A. Dreyer (Eaton-Peabody Laboratory, Massachusetts Eye and Ear Infirmary, Boston, MA)

! Can hear whistle? ! Where are we on course map? ! What we did in lab last week. ! Psychoacoustics

Computational Perception /785. Auditory Scene Analysis

whether or not the fundamental is actually present.

Speech conveys not only linguistic content but. Vocal Emotion Recognition by Normal-Hearing Listeners and Cochlear Implant Users

Temporal Location of Perceptual Cues for Cantonese Tone Identification

Transcription:

The psychophysics of cochlear implants Stuart Rosen Professor of Speech and Hearing Science Speech, Hearing and Phonetic Sciences Division of Psychology & Language Sciences Prelude Envelope and temporal fine structure (TFS): What's all the fuss? Decomposing waveforms Spectral analysis... Decomposes a wave into a sum of sinusoids to give a spectrum This particular temporal analysis Decomposes a wave into the product of two (usually) complicated waves known as the envelope and the temporal fine structure (TFS). Modulating a wave carrier (TFS) x modulator (envelope) = amplitudemodulated wave

Can work this backwards = x Fine structure and envelope Temporal fine structure relatively fast reflects spectral components of sounds in the sound waveform, and periodicity (in some definitions) envelope is the slower stuff think of all waves as being made by multiplying an envelope against a carrier http://research.meei.harvard.edu/chimera/motivation.html 24 JAN 2010 Both (all 3?) kinds of temporal features preserved in the auditory nerve Joris et al. 2004 Everyone agrees that Slowish envelopes (<30 Hz or so) are really important for speech perception Distinguish two features Envelope variations that are highly correlated across frequency And those that are not. Slower features too (4 Hz modulations)

Correlated and uncorrelated (across frequency) envelope modulations Correlated envelopes in speech one source of cues to consonants Correlated envelopes in speech one source of cues to consonants Changing manner of articulation push ship vs. push chip

Spectral dynamics are encoded in uncorrelated across-channel envelope modulations Proof that slow envelopes are sufficient: Noise-excited vocoding preserves envelopes, destroys TFS Modulations < 10 Hz are most important So what s missing? TFS said to be important for Perception of pitch Intonation and tone Glimpsing in noises that vary in level Glimpsing in noises that vary in level An ability that allows a listener to tolerate higher levels of noises than would otherwise be possible

A 3-way partition: typical for NHLs at higher frequencies and CIs Periodicity in CIs encoded primarily as changes in within channel modulation rate envelope + periodicity + fine-structure envelope + periodicity envelope alone The representation of periodicity can be messy! NHLs do use TFS for pitch Types of Spectrogram Wide-band Narrow-band Auditory An auditory spectrogram looks like a wide-band spectrogram at high frequencies and a narrow-band spectrogram at low frequencies (but with more temporal structure).

No glimpsing opportunities: A steady-state background noise But noises are typically not steady Does TFS have a role in glimpsing? CI users do not appear to be able to glimpse, Nor do NHLs in simulation studies... So perhaps TFS (or some aspect of periodicity) is necessary Summarise Waveforms (after any filter bank/spectral analysis) can be decomposed into the product of An envelope (something slow) Or maybe two kinds of envelope A TFS (something fast) One limitation of CIs may be poor access to TFS information Also sometimes used as a code word for pitch perception hence necessary for music.

Restricted dynamic range means compression is crucial The psychophysics of electrical stimulation in the cochlea Absolute thresholds and maximum acceptable loudness levels Nelson et al. (1996) JASA Intensity jnds in electrical (opposed to acoustic) stimulation: 1) miss Weber s Law more 2) are typically smaller, but not by enough to offset reduced dynamic range. Acoustic/electrical loudness matches 4 different stimulation frequencies vs. contralateral 1.5 khz tone CI users here had 7-45 discriminable steps in the total dynamic range, compared to 83 in normal hearing Nelson et al. (1996) JASA Eddington et al. 1978 Ann Otol Rhinol Laryngol

Loudness grows much faster in electrical stimulation (hyperrecruitment!) Temporal resolution: gap detection Shannon 1993 Temporal resolution: modulation detection (100 Hz) Temporal resolution: TMTFs More dependent on level (as for intensity jnd s) More dependent on level Otherwise similar to normal listeners (dashed lines) Fu 2002 NeuroReport Shannon 1992 J Acoust Soc Amer

Relationships to performance with speech modulation detection thresholds measured at 100 Hz, at a number of levels (previous slide) Fu 2002 NeuroReport Perceiving variations in amount of activity across electrodes Essential for signaling of spectral shape Spectral shape is encoded by relatively slow level changes across electrodes Recall preservation of fast modulation rates not necessary for intelligibility in noisevocoded speech Slow level changes across channels Th-ee-z d- ay - s a ch-i-ck - en-l-e-g is a r-a- re d -i - sh Discrimination of rippled noise find the maximum ripple density at which it is possible to discriminate standard ripple noise from its inverted version This test is hypothesized to provide a direct measure of the ability of listeners to perceive the frequency locations of spectral peaks in a broadband acoustic signal. Henry et al. 2005 J Acoust Soc Am

Discrimination of rippled noise Relationships to performance with speech in quiet 12 hvd by 20 talkers 16 VCVs by 4 talkers Henry et al. 2005 J Acoust Soc Am Henry et al. 2005 J Acoust Soc Am Statistical interlude: The effect of outliers Statistical interlude: The effect of outliers r 2 =0.28 p<0.01 r 2 =0.09 p>0.15 r 2 =0.37 p<0.005 r 2 =0.33 p<0.006 vowels consonants

Relationships to performance with speech in noise better perfo ormance SRT determined for selection of one of 12 spondees Why is speech melody (voice pitch) important to hear? Contributes to speech intelligibility in all languages A good supplement to lipread information May play an important role in separating speech from background noises Appears to play a more crucial role for the young child developing language Crucial in so-called tone languages Won et al. 2005 JARO Pitch based on a purely temporal code Pitch based on a purely temporal code Best normal performance for normal listeners about 0.2 % over entire range Merzenich et al. 1973 Shannon 1993 limited to 300 Hz or so Merzenich et al. 1973

Melody recognition CI users classifying rise/fall contours on diphthongs 12 songs familiar to most people, synthesised with and without natural rhythm Kong et al. (2004) Green et al. 2004 J Acoust Soc Amer Tones in Mandarin Chinese mother hemp horse scold How important is the loss of voice pitch to understanding speech in quiet? Eliminating tonal contrasts from speech still leaves tone languages intelligible because no single acoustic feature is indispensable in any language. Here, we trade off spectral resolution against presence or absence of tone (voice pitch variations/speech melody).

Vocoding as a way to trade off spectral resolution and tone original speech source wave bandpass filtered speech (channel 3 of 4) envelope from above envelope x source wave bandpass filtered envelope x source wave 4-channel vocoding frequ uency (Hz) source function 4-channel Fx 4-channel Vx original speech strawberry Experimental design Three tone languages Mandarin, Cantonese & Yoruba Two non-tonal languages English & Lithuanian Presented to groups of native listeners varying numbers of channels (1, 2, 4, 8 and 16) presence or absence of informative tone time (s)

Examples of the stimuli VxNx FxNx 1 2 4 8 16 proportion of ke ey words correct Main results number of channels (log scale) Results for Mandarin (simulations) No use of linguistic tone in Mandarin by CI users from the PhD thesis of Yu-Ching Kuo, 2006

Conclusions Melody coded as periodicity in rapid within-channel patterns Th-ee-z d- ay - s a ch-i-ck - en-l-e-g is a r-a- re d -i - sh Variations in fundamental frequency contribute to intelligibility in all languages but they are considerably more important in tone languages Getting tone into cochlear implants could be worth as much as a doubling in the number of channels. The representation of melody can be messy! Perception of fundamental pitch in complex waves is very poor Lower harmonics cannot be resolved as in normal hearing Phase-locking seems different Mis-match between place of excitation and temporal pattern may be important