Spectro-temporal response fields in the inferior colliculus of awake monkey

Similar documents
USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES

Representation of sound in the auditory nerve

How is the stimulus represented in the nervous system?

Analysis of in-vivo extracellular recordings. Ryan Morrill Bootcamp 9/10/2014

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening

Analysis of spectro-temporal receptive fields in an auditory neural network

Characterizing Neurons in the Primary Auditory Cortex of the Awake Primate U sing Reverse Correlation

Neurobiology of Hearing (Salamanca, 2012) Auditory Cortex (2) Prof. Xiaoqin Wang

Coincidence Detection in Pitch Perception. S. Shamma, D. Klein and D. Depireux

Neural Recording Methods

Responses of Neurons in Cat Primary Auditory Cortex to Bird Chirps: Effects of Temporal and Spectral Context

Dynamics of Precise Spike Timing in Primary Auditory Cortex

Chapter 11: Sound, The Auditory System, and Pitch Perception

Spectrograms (revisited)

Task Difficulty and Performance Induce Diverse Adaptive Patterns in Gain and Shape of Primary Auditory Cortical Receptive Fields

Spectrotemporal sensitivities in rat auditory cortical neurons

Contribution of Inhibition to Stimulus Selectivity in Primary Auditory Cortex of Awake Primates

Processing in The Cochlear Nucleus

Robust Neural Encoding of Speech in Human Auditory Cortex

Structure and Function of the Auditory and Vestibular Systems (Fall 2014) Auditory Cortex (3) Prof. Xiaoqin Wang

Estimating sparse spectro-temporal receptive fields with natural stimuli

Analyzing Auditory Neurons by Learning Distance Functions

Comment by Delgutte and Anna. A. Dreyer (Eaton-Peabody Laboratory, Massachusetts Eye and Ear Infirmary, Boston, MA)

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds

LETTERS. Sustained firing in auditory cortex evoked by preferred stimuli. Xiaoqin Wang 1, Thomas Lu 1, Ross K. Snider 1 & Li Liang 1

Sound and Hearing. Decibels. Frequency Coding & Localization 1. Everything is vibration. The universe is made of waves.

Nonlinear Spectrotemporal Interactions Underlying Selectivity for Complex Sounds in Auditory Cortex

COM3502/4502/6502 SPEECH PROCESSING

Rhythm and Rate: Perception and Physiology HST November Jennifer Melcher

Over-representation of speech in older adults originates from early response in higher order auditory cortex

Systems Neuroscience Oct. 16, Auditory system. http:

Neural Representations of Temporally Asymmetric Stimuli in the Auditory Cortex of Awake Primates

Extra-Classical Tuning Predicts Stimulus-Dependent Receptive Fields in Auditory Neurons

Binaural Hearing. Steve Colburn Boston University

Theme 2: Cellular mechanisms in the Cochlear Nucleus

J Jeffress model, 3, 66ff

Auditory Cortical Responses Elicited in Awake Primates by Random Spectrum Stimuli

Adaptive Changes in Cortical Receptive Fields Induced by Attention to Complex Sounds

Signals, systems, acoustics and the ear. Week 5. The peripheral auditory system: The ear as a signal processor

Cortical Encoding of Auditory Objects at the Cocktail Party. Jonathan Z. Simon University of Maryland

Processing of the incomplete representation of the visual world

Towards a biological theory of phone5c percep5on

Auditory System & Hearing

Spectrotemporal Receptive Fields in the Lemniscal Auditory Thalamus and Cortex

Hearing in the Environment

The Central Auditory System

Lecture 3: Perception

FIRING PROPERTY OF INFERIOR COLLICULUS NEURONS AFFECTED BY FMR1 GENE MUTATION

The Structure and Function of the Auditory Nerve

Nature Neuroscience: doi: /nn Supplementary Figure 1. Trial structure for go/no-go behavior

Hearing. Juan P Bello

Acoustics Research Institute

FINE-TUNING THE AUDITORY SUBCORTEX Measuring processing dynamics along the auditory hierarchy. Christopher Slugocki (Widex ORCA) WAS 5.3.

Hearing Sound. The Human Auditory System. The Outer Ear. Music 170: The Ear

Music 170: The Ear. Tamara Smyth, Department of Music, University of California, San Diego (UCSD) November 17, 2016

Involvement of Monkey Inferior Colliculus in Spatial Hearing

ABR assesses the integrity of the peripheral auditory system and auditory brainstem pathway.

Neural correlates of the perception of sound source separation

Neural Representations of the Cocktail Party in Human Auditory Cortex

Network Models of Frequency Modulated Sweep Detection

Sensory-Motor Interaction in the Primate Auditory Cortex During Self-Initiated Vocalizations

CROSSMODAL PLASTICITY IN SPECIFIC AUDITORY CORTICES UNDERLIES VISUAL COMPENSATIONS IN THE DEAF "

Twenty subjects (11 females) participated in this study. None of the subjects had

Spectral-Temporal Receptive Fields of Nonlinear Auditory Neurons Obtained Using Natural Sounds

Stimulus-Dependent Auditory Tuning Results in Synchronous Population Coding of Vocalizations in the Songbird Midbrain

Toward Extending Auditory Steady-State Response (ASSR) Testing to Longer-Latency Equivalent Potentials

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED

Auditory nerve. Amanda M. Lauer, Ph.D. Dept. of Otolaryngology-HNS

Strength of Gamma Rhythm Depends on Normalization

Nonlinear processing in LGN neurons

EFFECTS OF TEMPORAL FINE STRUCTURE ON THE LOCALIZATION OF BROADBAND SOUNDS: POTENTIAL IMPLICATIONS FOR THE DESIGN OF SPATIAL AUDIO DISPLAYS

Sum of Neurally Distinct Stimulus- and Task-Related Components.

Topic 4. Pitch & Frequency

HCS 7367 Speech Perception

Cortical Encoding of Auditory Objects in the Cocktail Party Problem. Jonathan Z. Simon University of Maryland

The neural code for interaural time difference in human auditory cortex

LATERAL INHIBITION MECHANISM IN COMPUTATIONAL AUDITORY MODEL AND IT'S APPLICATION IN ROBUST SPEECH RECOGNITION

Neural Representations of the Cocktail Party in Human Auditory Cortex

I. INTRODUCTION. A. Extracellular physiological responses to AM stimuli.

Auditory Physiology Richard M. Costanzo, Ph.D.

Linguistic Phonetics. Basic Audition. Diagram of the inner ear removed due to copyright restrictions.

Auditory System & Hearing

Chapter 5. Summary and Conclusions! 131

Behavioral generalization

Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear

Neural Representations of Speech, and Speech in Noise, in Human Auditory Cortex

Auditory System. Barb Rohrer (SEI )

CS/NEUR125 Brains, Minds, and Machines. Due: Friday, April 14

THRESHOLD PREDICTION USING THE ASSR AND THE TONE BURST CONFIGURATIONS

Hearing Research 289 (2012) 1e12. Contents lists available at SciVerse ScienceDirect. Hearing Research

INFERRING AUDITORY NEURONAT RESPONSE PROPERTIES FROM NETWORK MODELS

Auditory Physiology PSY 310 Greg Francis. Lecture 30. Organ of Corti

The Neural Code for Auditory Space Depends on Sound Frequency and Head Size in an Optimal Manner

Structure and Function of the Auditory and Vestibular Systems (Fall 2014) Auditory Cortex (1) Prof. Xiaoqin Wang

PSY 215 Lecture 10 Topic: Hearing Chapter 7, pages

Neural Representations of the Cocktail Party in Human Auditory Cortex

Auditory Edge Detection: A Neural Model for Physiological and Psychoacoustical Responses to Amplitude Transients

Hearing II Perceptual Aspects

Introduction to Electrophysiology

Inhibitory Plasticity in a Lateral Band Improves Cortical Detection of Natural Vocalizations

Transcription:

3.6.QH Spectro-temporal response fields in the inferior colliculus of awake monkey Versnel, Huib; Zwiers, Marcel; Van Opstal, John Department of Biophysics University of Nijmegen Geert Grooteplein 655 EZ Nijmegen The Netherlands phone: +3--3669 fax: +3--3535 email: huibv@mbfys.kun.nl ABSTRACT The spectro-temporal response field (STRF) characterises the excitatory and inhibitory response properties of an auditory neuron, and it can be used to linearly predict responses to a wide range of sound stimuli. Using broadband sounds with spectro-temporal modulations ( ripples ), we determined STRFs from single-neuron recordings in the inferior colliculus of awake rhesus monkeys. A wide variety of ripple responses was found. For most neurons, responses to natural sounds could be well predicted on the basis of the STRF. These results suggest that spectro-temporal features are extracted at the level of the IC. INTRODUCTION Natural sounds as speech and music are generally broadband and characterized by complex dynamic spectral features. An important theme in auditory neuroscience concerns the neural processing of such sounds: how and where along the auditory pathway are features such as pitch, loudness and timbre represented? Auditory neurons have been classified on the basis of responses to simple stimuli such as tones and amplitude modulated (AM) broadband noise. Responses to tones provide spectral information: e.g. the neuron s frequency response yields its best frequency (BF) and tuning bandwidth. Responses to AM noise provide temporal information: e.g. the modulation transfer function (Langner and Schreiner, 9). However, these stimuli are very different from natural sounds, and the response features thus obtained are not sufficient to understand the mechanisms that underlie the processing of complex sounds. More appropriate stimuli should preferably integrate the spectral and temporal properties of sounds, yet allowing for a straightforward parameterization of these properties. One class of such stimuli is the so-called ripple : sinusoidal envelope modulations in the spectro-temporal domain, which are elementary D Fourier components (Kowalski et al. 996a, Klein et al., Depireux et al ). These stimuli are fully characterized by two parameters, one temporal and one spectral, and their spectro-temporal characteristics resemble those of natural sounds. From the neural responses to different ripples the spectrotemporal response field (STRF) can be derived. On the basis of the STRF and the spectrogram of an arbitrary stimulus, the response to that stimulus can be predicted by applying linear systems theory. A comparison of this linear prediction and the actual response yields information on the linearity of the neural processing. Most electrophysiological studies on the auditory system, including those using spectrotemporal stimuli, have been performed in anesthetized animals. It is conceivable that

anaesthesia affects the neural response characteristics. Therefore, we studied auditory processing of spectro-temporal sounds in awake rhesus monkeys. We performed extracellular single-unit recordings in the inferior colliculus (IC) to ripples, tones, and animal vocalizations. The distribution of STRFs was examined and the spectral dimension of STRFs was compared to the tuning curves obtained with tones. Finally, we examined whether neural responses to natural vocalizations could be predicted on the basis of STRFs. METHODS Recordings Single-neuron responses were recorded with glass-coated tungsten microelectrodes (impedance.5-.5 MΩ) in the IC of two adult male rhesus monkeys (macaca mulatta; weight ±7.5 kg). The chronic recording chamber (6 mm diameter) was placed over a trepane hole in the skull, centered above the midline, and mm posterior of the interaural line (Goossens and Van Opstal,, for details). Monkeys were rewarded with a drop of water for fixating a visual target at a straight ahead location for s during each trial. In monkey A, recordings were done in each hemisphere; in monkey B, in the right hemisphere only. Recording sites were evenly selected across an area of about mm. In a typical penetration, -3 units were recorded at any depth where acoustically evoked responses were encountered (no restriction to the central nucleus of IC). Prior to entering the IC the electrode passed through the superior colliculus. The BF of auditory neurons was found to increase in an orderly manner in dorsal-ventral direction, indicative for the IC (Langner and Schreiner, 9). Sound stimuli Stimuli were presented in the free field at the frontal central position at a distance of.9 m from the monkey. Pure tones were presented over a wide range of frequencies (3 59 Hz; /6 octave steps) at three sounds levels ( - 5-6 db SPL), with a duration of ms. Ripple stimuli consisted of a broadband complex of 6 components equally distributed (/octave) from.5 to khz. The envelopes were sinusoidally modulated in the spectro-temporal domain, such as shown in Fig.. In a typical experimental run, we presented all combinations of the ripple density Ω (- :.: cyc/oct), and the ripple velocity W (:: Hz), once. Note that Ω< corresponds to an upward direction of the spectral envelope (Fig., top), Ω> to a downward direction (Fig., middle), and that Ω= is equivalent to a pure amplitude modulation (Fig., bottom). The modulation depth was %, the sound level was 56 db SPL, and the duration was 5 ms. 6.5.5.5.5.5 Time (s) 6.5 W = Hz Ω = -. cyc/oct Direction: up W = Hz Ω =. cyc/oct Direction: down We presented 6 vocalizations, 3 macaque calls and 3 bird-song segments, which were obtained from other labs (www.wjh.harvard.edu/ ~mnkylab/media/rhesuscalls and www.mathlab.sunysb.edu/~tony/birds) and resampled to 5 khz. Artefacts and background noises were mostly removed. The calls were presented at 3 sound levels (-5-6 db SPL), durations varied from 3 to 6 ms, and the number of repetitions was 5..5.5.5.5 Time (s) 6.5.5.5.5.5 Time (s) W = Hz Ω = cyc/oct Fig.. Spectrograms of three ripple stimuli.

Ripple-data analysis A response window of 5 ms was used, from 5 ms after stimulus onset (thus avoiding transient onset responses) to stimulus offset. Period histograms (Fig. A) were constructed based on the period of the temporal modulation of the stimulus (inverse of W). A Fourier analysis was performed on the period histograms, and a magnitude A(W,Ω) and phase ϕ(w,ω) were derived (Fig. B), yielding a complete spectro-temporal transfer function T(W,Ω)=A(W, Ω) exp(iϕ(w,ω)). Inverse Fourier transformation of T(W,Ω) produces the STRF (Fig. C): STRF(t,x) = F - [T(w, Ω)] () with x the tonotopic distance in octaves. The spectral section of the STRF reflects B Magnitude A 3 6 Complete Ripple Transfer Function max Period Histograms - -.6 -. -. -.....6 Phase o 3 6 3 6 o - -.6-.-.-.....6 - -.6-.-. -.....6 Inverse Fourier Transformation frequency tuning, the temporal section STRF C reflects the impulse response. The.5 temporal response pattern of a cell to a sound stimulus, R(t), can be predicted by time-convolution and spectral integration with the stimulus spectrogram S(x,t):.5.5 R(t) =Σ x {STRF(x,t) t S(x,t)} () Three response parameters were 6 derived from the transfer function T: the cell s best ripple velocity, its best ripple density, and its direction Fig.. Analysis of ripple responses. preference D: D = (R up -R down )/(R up +R down ) (3) A best frequency (BF strf ) and latency were derived from the STRF. Frequency (octaves) excitation inhibition - o RESULTS Ripple responses Most neurons in IC followed the phase of the ripple modulation (e.g., see the period histograms in Fig. A). We found that these neurons were bandpass tuned to certain ripple velocities and densities (magnitude TFs in Figs B, 3). The ripple preference clearly varied between cells (compare Fig. 3A to 3B). Often the transfer function was symmetric: the upward-direction responses were similar to downward-direction responses, including the best ripple density and best ripple velocity (Figs. B, 3A,B). However, we also found cells with a clear preference for one direction (Fig. 3C), /9 cells showed such a preference (Fig. ). A substantial proportion

A B C D 3 6 3 6 3 6 3 6 Magnitude Transfer Function - -.6 -. -. -.....6 - -.6 -. -. -.....6 - -.6 -. -. -.....6 - -.6 -. -. -.....6 STRF 6 6 6 6 Tone Response Curve Spike Rate (/s) Fig. 3. The magnitude transfer function, STRF and tone response curve (frequency tuning) for four different IC neurons. A. Example of high best ripple velocity ( Hz) and low ripple density (. cyc/oct). D=.. BF STRF =. khz. B. Example of low best velocity ( Hz) and high best density (. cyc/oct). D=-.9. BF STRF =. khz. C. Example of asymmetric TF, direction preference is upward. D=.6. W b =3 Hz, Ω b =. cyc/oct. BF STRF =. khz. D. Example of onlyzero-ω response. W b =3 Hz, D=-.7. In all examples the spectral dimension of the STRF is comparable to the frequency tuning curve. BFs are similar, and even inhibitory regions agree, e.g. inhibition above BF in B. Note scale difference between STRF and tone response curves. of cells (33/9) preferred a ripple density of zero, i.e., a pure amplitude modulation (Figs. 3D,). When these cells responded to ripples with Ω, they did not follow the phase of the envelope. The STRFs generally showed excitation for a narrow range of frequencies with a latency of - ms so that a BF strf could easily be defined (Fig. 3A-C). Inhibition occurred about ms after excitation, and, in several cases the STRF showed inhibition to other frequencies, e.g., in Fig. 3B above BF strf. The frequency tuning given by the STRF was roughly similar to the frequency tuning obtained from responses to pure tones. In particular, BFs agreed for the majority of cells (5/66): the differences between their ripple and tone estimates of BF were less then.5 octave. Often, agreement was also found for the inhibitory sidebands, as in Fig. 3B. Broad frequency tuning coupled to zero ripple density preference was typically observed in cells with a broad tuning to tones (Fig. 3D). Note that temporal response properties given by the STRF are not reflected by the tone response curves. Figure shows that IC neurons were tuned to a wide range of ripple densities and ripple velocities. Further, symmetric responses (D ) as well as asymmetric (D ) were found and, 5 Spike Rate (/s) 5 5 Spike Rate (/s) 3 Spike Rate (/s) 6....

Fig.. Distributions of ripple response parameters over 9 IC neurons. A few of the cells with high best velocities (3 or Hz) which were additionally tested in a larger velocity range (- Hz; fixed density) sometimes had their best response above Hz. Number 3 Best Ripple Velocity N = 9 3 6 3 Hz Best Ripple Density N = 9 5 5....6 c/o Direction Preference N = 9 -. -... within the latter group, some cells preferred upward (D>) and others the downward direction (D<). Predictions of vocalization responses In 9 cells for which an STRF had been obtained, responses to the six different vocalizations were also recorded. Following Eqn., we predicted these responses using a time bin of.5 ms. Figure 5 shows two examples of predicted and measured responses. Almost all positive peaks in the response patterns were reproduced by the predictions, in particular in case A. Low-response segments corresponded to small deflections or inhibitory troughs of the predicted trace. The discrepancy between the predicted troughs and recorded near-zero response is a direct result of the straightforward linear prediction which does not account for half-wave rectification as exhibited by neural firings. The STRF in Fig 5B is interesting in that it reveals onset inhibition followed by excitation. Consequently, the vocalization response showed a large excitatory peak after stimulus offset, which was also predicted (albeit too early). Note that while most energy of the bird call was below BF strf, the prediction was fair. frequency (khz).5 A B STRF.5 frequency (khz) Stimulus Spectrogram Macaque scream Blackbird song Fig. 5. Prediction of responses to vocalizations on the basis of a cell s STRF and the stimulus spectrogram (Eqn ), for two different IC neurons. The spectrogram is shown over the frequency range of the STRF. The spontaneous spike rate (recorded before stimulus onset) has been subtracted from the response. A. Prediction with high correlation to response. B. Prediction with medium-high correlation to response. In general, for a given neuron, the best prediction is found for the vocalization which spectrum has the largest overlap with the STRF. Figure 6 shows for each cell the best correlation between prediction and response (out of six vocalizations). In cells, correlations were higher than in the example of Fig. 5B (R>.). In cells, predictions were poor (R<.5). Two of those cells had BFs above khz and were narrowly tuned, consequently, these cells did not respond to the vocalizations. The two other cells had BFs below Hz, and since the ripples lacked energy below 5 Hz, their STRF might not have been estimated well. The 5 cells with mediocre correlations (.5-.) had all nonmonotic rate-level curves (tested with broadband noise), and in four of those cells, the correlations were higher (>.) for lower levels (- or - Response Prediction R =.69 R =.

Fig. 6. Correlations of predictions with recorded responses to vocalizations for 9 IC neurons. For each neuron, the largest of 6 correlations found for 6 different vocalizations (at a high sound level) is shown. Correlation.5. Best db). Taken together, the linear prediction of IC cell responses was remarkably good for the majority of cells. Cases for which the prediction failed could be well explained. DISCUSSION This report presents two important findings on spectro-temporal representation of sounds in the monkey IC. First, many IC neurons show a distinct preference for specific spectro-temporal features, and these preferences vary substantially in both the temporal and spectral dimension across the IC. Second, responses to complex broadband sounds with distinct spectro-temporal features like the animal vocalizations used in our study can be linearly predicted on the basis of responses to ripples, which are the elementary spectro-temporal stimuli. We conclude from these two points that a spectro-temporal Fourier representation of the stimulus spectrum ( ripple analysis ), is already present at the level of the IC. The ripple analysis has previously been demonstrated in the auditory cortex (Kowalski et al. 996b; Depireux et al. 99). Other studies (e.g., Langner and Schreiner, 9) have shown that IC neurons are tuned to temporal modulations over a wide range of modulation frequencies ( velocities in this report), leading to the hypothesis of IC neurons as temporal filters. Our study suggests an extension of this notion to IC neurons as spectro-temporal filters. The auditory cortex might subsequently refine the analysis. The range of temporal, spectral and direction scales might be modified from IC to auditory cortex to enhance the animal s sensitivity to the acoustic range that dominates its environment and is most relevant to its survival. REFERENCES Depireux, D.A., Simon, J.Z. and Shamma, S.A. (99) Measuring the dynamics of neural responses in primary auditory cortex. Comments in Theoretical Biology 5: 9-. Depireux, D.A., Simon, J.Z., Klein, D.J. and Shamma, S.A. () Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex. J. Neurophysiol. 5: -3. Goossens, H.H.L.M. and Van Opstal, A.J. () Blink-perturbed saccades in monkey. II. Superior colliculus activity. J. Neurophysiol. 3: 33-35. Klein, D., Depireux, D., Simon, J. and Shamma, S.A. () Robust spectro-temporal reverse correlation for the auditory system: optimizing stimulus design. J. Comput Neurosci 9: 5-. Kowalski, N., Depireux, D.A. and Shamma, S.A. (996a) Analysis of dynamic spectra in ferret primary auditory cortex. I. Characteristics of single-unit responses to moving ripple spectra. J. Neurophysiol. 76: 353-353. Kowalski, N., Depireux, D.A. and Shamma, S.A. (996b) Analysis of dynamic spectra in ferret primary auditory cortex. II. Prediction of unit responses to arbitrary dynamic spectra. J. Neurophysiol. 76: 35-353. Langner, G. and Schreiner, C.E. (9) Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms. J. Neurophysiol. 6: 799-. Acknowledgements - Supported by the Netherlands Organization for Scientific Research, the Nijmegen Institute for Cognition and Information, and the University of Nijmegen