Lecture 4: Auditory Perception. Why study perception?

Size: px
Start display at page:

Download "Lecture 4: Auditory Perception. Why study perception?"

Transcription

1 EE E682: Speech & Audio Processing & Recognition Lecture 4: Auditory Perception Motivation: Why & how Auditory physiology Psychophysics: Detection & discrimination Pitch perception Speech perception Auditory organization & Scene analysis Dan Ellis <dpwe@ee.columbia.edu> Columbia University Dept. of Electrical Engineering Spring 26 E682 SAPR - Dan Ellis L4 - Perception Why study perception? Perception is messy: Can we avoid it? No! Audition provides the ground truth in audio - what is relevant and irrelevant - subjective importance of distortion (coding etc.) - (there could be other information in sound...) Some sounds are designed for audition - co-evolution of speech and hearing The auditory system is very successful - we would do extremely well to duplicate it We are now able to model complex systems - faster computers, bigger memories E682 SAPR - Dan Ellis L4 - Perception

2 How to study perception? Three different approaches: Analyze the example: physiology - dissection & nerve recordings Black box input/output: psychophysics - fit simple models of simple functions Information processing models - investigate and model complex functions - e.g. scene analysis, speech perception E682 SAPR - Dan Ellis L4 - Perception Outline Motivation Physiology - Outer, middle & inner ear - The Auditory Nerve and beyond - Models Psychophysics Pitch perception Speech perception Scene analysis E682 SAPR - Dan Ellis L4 - Perception

3 2 Physiology Processing chain from air to brain: Middle ear Auditory nerve Outer ear Inner ear (cochlea) Midbrain Cortex Study via: - anatomy - nerve recordings Signals flow in both directions E682 SAPR - Dan Ellis L4 - Perception Outer & middle ear Pinna Ear canal Middle ear bones Cochlea (inner ear) Eardrum (tympanum) Pinna horn - complex reflections give spatial (elevation) cues Ear canal - acoustic tube Middle ear - bones provide impedance matching E682 SAPR - Dan Ellis L4 - Perception

4 Inner ear: Cochlea Oval window (from ME bones) Travelling wave Basilar Membrane (BM) Cochlea 16 khz Resonant frequency 5 Hz Position 35mm Mechanical input from middle ear starts traveling wave moving down Basilar Membrane Varying stiffness and mass of BM gives results in continuous variation of resonant frequency At resonance, traveling wave energy is dissipated in BM vibration Frequency (Fourier) analysis E682 SAPR - Dan Ellis L4 - Perception Cochlea hair cells Ear converts sound to BM motion; Each point on BM corresponds to a frequency Cochlea Tectorial membrane Basilar membrane Inner Hair Cell (IHC) Auditory nerve Outer Hair Cell (OHC) Hair cells on BM convert motion into nerve impulses (firings) Inner Hair Cells detect motion Outer Hair Cells? Variable damping? [BM animation] E682 SAPR - Dan Ellis L4 - Perception

5 Inner Hair Cells IHCs convert BM vibration into nerve firings Human hear has ~35 IHCs; Each IHC has ~7 connections to Auditory Nerve Each nerve fires (somes) near peak displacement: Local BM displacement Typical nerve signal (mv) 5 / ms Histogram to get firing probability: Firing count Cycle angle E682 SAPR - Dan Ellis L4 - Perception Auditory nerve (AN) signals Single nerve measurements: Tone burst histogram Spike count 1 Frequency threshold db SPL Tone burst Time 1 ms 3 One fiber: ~ 25 db dynamic range 2 1 Hz 1 khz 1 khz (log) frequency (approx. constant-q) Rate vs. intensity Spikes/sec Intensity / db SPL Hearing dynamic range > 1 db Hard to measure: probe living ANs? E682 SAPR - Dan Ellis L4 - Perception

6 AN population response All the information the brain has about sound: - average rate & spike timings on 3, fibers freq / 8ve re 1 Hz Not unlike a (constant-q) spectrogram? p ( ) / ms E682 SAPR - Dan Ellis L4 - Perception Beyond the auditory nerve Ascending and descending Tonotopic? - modulation - position - source?? (from lloydwatts.com) E682 SAPR - Dan Ellis L4 - Perception

7 Periphery models IHC Sound Outer/middle ear filtering Cochlea filterbank IHC Modeled aspects: - outer/middle ear - cochlea filtering - hair cell transduction - efferent feedback? Result: neurogram / cochleagram channel SlaneyPatterson 12 chans/oct from 18 Hz, BBC1tmp (21218) / s E682 SAPR - Dan Ellis L4 - Perception Outline Motivation Physiology Psychophysics - Detection theory modeling - Intensity perception - Masking Pitch perception Speech perception Scene analysis E682 SAPR - Dan Ellis L4 - Perception

8 3 Psychophysics Physiology looks at the implementation; Psychology looks at the function/behavior Analyze audition as signal detection: - psychological tests reflect internal decisions - assume optimal decision process - infer nature of internal representations, noise,... lower bounds on more complex functions Different aspects to measure -, frequency, intensity - tones, complexes, noise - binaural - pitch, detuning p( ω O) E682 SAPR - Dan Ellis L4 - Perception Log(loudness rating) Basic psychophysics Relate physical and perceptual variables - e.g. intensity loudness frequency pitch Methodology: subject tests - just noticeable difference (jnd) - magnitude scaling e.g. adjust to twice as loud Results for Intensity vs. Loudness: Weber s law I α I log(l) = k log(i) Hartmann(1993) Classroom loudness scaling data 2.6 Textbook figure: L α I Power law fit: L α I.22 Sound level / db log2 ( L ) =.3log 2 ( I) log.3 1 I = log db = log = db Ú 1 E682 SAPR - Dan Ellis L4 - Perception

9 Loudness as a function of frequency Fletcher-Munson equal-loudness curves: Intensity / db SPL , freq / Hz Equivalent 1kHz Equivalent 1kHz 1 1 Hz 1 khz rapid loudness growth Intensity / db Intensity / db Hearing impairment: exaggerates E682 SAPR - Dan Ellis L4 - Perception Loudness as a function of bandwidth Same total energy, different distribution: freq mag mag I Same total energy I B I 1 B freq B 1 freq Loudness... but wider perceived as louder Critical Bandwidth B bandwidth - e.g. 2 chans at -6 db (not -1 db) Critical bands: independent freq. channels - ~ 25 total (4-6 / octave) [sndex] E682 SAPR - Dan Ellis L4 - Perception

10 Simultaneous masking A louder tone can mask the perception of a second tone nearby in frequency: masking absolute tone threshold Intensity / db Suggests an internal noise model: p(x I) p(x I) p(x I+ I) masked threshold log freq σ n internal noise decision variable x E682 SAPR - Dan Ellis L4 - Perception I Sequential masking Backward/forward in : simultaneous masking ~1 db masker envelope Intensity / db masked threshold backward masking ~5 ms forward masking ~1 ms - suggests temporal envelope of decision var. Time-frequency masking skirt : Masking tone intensity freq Masked threshold E682 SAPR - Dan Ellis L4 - Perception

11 What we do and don t hear A B X two-interval forced-choice : X = A or B? Timing: 2ms attack resolution, 2ms discrim - but: spectral splatter Tuning: ~ 1% discrimination - but: beats Spectrum: profile changes, formants - variable -frequency resolution Harmonic phase? Noisy signals & texture (Trace vs. categorical memory) E682 SAPR - Dan Ellis L4 - Perception Outline Motivation Physiology Psychophysics Pitch perception - Place models - Time models - Multiple cues & competition Speech perception Scene analysis E682 SAPR - Dan Ellis L4 - Perception

12 4 Pitch perception: A classic argument in psychophysics Harmonic complexes are a pattern on AN 7 freq. chan /s -.. but give a fused percept (ecological) What determines the pitch percept? - not the fundamental How is it computed? Two competing models: place and E682 SAPR - Dan Ellis L4 - Perception AN excitation Place model of pitch AN excitation pattern shows individual peaks Pattern matching method to find pitch: broader HF channels cannot resolve harmonics resolved harmonics Correlate with harmonic sieve : frequency channel Pitch strength frequency channel Support: Low harmonics are very important But: Flat-spectrum noise can carry pitch E682 SAPR - Dan Ellis L4 - Perception

13 Time model of pitch Timing information is preserved in AN down to ~ 1ms scale Extract periodicity by e.g. autocorrelation & combine across frequency chans: freq per-channel autocorrelation autocorrelation Summary autocorrelation common period (pitch) lag / ms But: HF gives weak pitch (in practice) E682 SAPR - Dan Ellis L4 - Perception Alternate & competing cues Pitch perception could rely on various cues - average excitation pattern - summary autocorrelation - more complex pattern matching Relying on just one cue is brittle - e.g. missing fundamental Perceptual system appears to use a flexible, opportunistic combination Optimal detector justification? argmax p( ω o) ω p( o ω) p( ω) = argmax ω p( o) = argmax po ( 1 ω) po ( 2 ω) p( ω) ω if o 1 and o 2 are conditionally independent E682 SAPR - Dan Ellis L4 - Perception

14 Outline Motivation Physiology Psychophysics Pitch perception Speech perception - The sounds of speech - Phoneme perception - Context and top-down influences Scene Analysis E682 SAPR - Dan Ellis L4 - Perception Speech perception Highly specialized function - subsequent to source organization? -.. but also can interact Kinds of speech sounds: glide nasal vowel fricative stop burst freq / Hz /s has a watch thin as a dime - vowels - glides - nasals - stops - fricatives... 2 level/db E682 SAPR - Dan Ellis L4 - Perception

15 Cues to phoneme perception Linguists describe speech with phonemes: h ε z has e a w t cl c ^ θ watch thin as a dime - phonemes define minimal word contrasts Acoustic-phoneticians describe phonemes by: formants & transitions I n I z I d a y bursts & onset s m freq transition stop burst vowel formants voicing onset E682 SAPR - Dan Ellis L4 - Perception Categorical perception (Some) speech sounds perceived categorically rather than analogically - e.g. stop-burst & timing: freq f b stop burst vowel formants burst freq f b / Hz T P K P 1 P i e ε a - tokens within category are hard to distinguish - category boundaries are very sharp Categories are learned for native tongue - merry / mary / marry c following vowel o u E682 SAPR - Dan Ellis L4 - Perception

16 Where is the information in speech? Articulation of high/low-pass filtered speech: high-pass low-pass Articulation / % freq / Hz - sums to more than 1... Speech message is highly redundant - e.g. constraints of language, context listeners can understand with very few cues E682 SAPR - Dan Ellis L4 - Perception Top-down influences: Phonemic restoration (Warren 197) freq / Hz 4 3 What if a noise burst obscures speech? / s - auditory system restores the missing phoneme... based on semantic context... even in retrospect! Subjects are typically unaware of which sounds are restored E682 SAPR - Dan Ellis L4 - Perception

17 freq / Hz freq / Hz A predisposition for speech: Sinewave replicas (Remez et al. 1994) Replace each formant with a single sinusoid: speech is (somewhat) intelligible - people hear both whistles and speech ( duplex ) - processed as speech despite un-speech-like What does it take to be speech? / s Speech Sines E682 SAPR - Dan Ellis L4 - Perception Simultaneous vowels Mix synthetic vowels with different f s: 1 Hz db 125 Hz freq + = Pitch difference helps (though not necessary): % both vowels correct / 1 4 / f (semitones) E682 SAPR - Dan Ellis L4 - Perception

18 Computational models of speech perception Various theoretical - practical models of speech comprehension, e.g. : Speech Phoneme recognition Lexical access Words Grammar constraints Open questions: - mechanism of phoneme classification - mechanism of lexical recall - mechanism of grammar constraints ASR is a practical implementation (?) E682 SAPR - Dan Ellis L4 - Perception Outline Motivation Physiology Psychophysics Pitch perception Speech perception Scene analysis - Events and sources - Fusion and streaming - Continuity & restoration - Simultaneous vowels E682 SAPR - Dan Ellis L4 - Perception

19 6 Auditory Organization frq/hz Detection model is huge simplification Real role of hearing is much more general: Recover useful information from outside world Sound organization into events and sources: 4 Voice /s Rumble Stab Research questions: - what determines perception of sources? - how do humans separate mixtures? - how much can we tell about a source? E682 SAPR - Dan Ellis L4 - Perception Auditory scene analysis: Simultaneous fusion Harmonics are distinct on AN, but perceived as one sound ( fused ): freq - depends on common onset - depends on harmonicity (common period) Methodologies: - ask subject how many objects - match attributes e.g. object pitch - manipulate higher level e.g. vowel identity E682 SAPR - Dan Ellis L4 - Perception

20 Sequential grouping: streaming Pattern / rhythm: property of set of objects - subsequent to fusion employs fused events? TRT: 6-15 ms frequency 1 khz f: 2 octaves Measure by relative timing judgments - cannot compare between streams Separate coherence and fusion boundaries Can interact and compete with fusion [sndex] E682 SAPR - Dan Ellis L4 - Perception Continuity & restoration Tone is interrupted by noise burst: What happened? freq? masking makes tone undetectable during noise Need to infer most probable real-world events - observation equally likely for either explanation - prior on continuous tone much higher choose Top-down influence on perceived events... pulsation threshold [sndex] E682 SAPR - Dan Ellis L4 - Perception

21 Models of auditory organization Psychological accounts suggest bottom-up: input mixture Front end signal features (maps) Object formation discrete objects Grouping rules Source groups freq onset period frq.mod - (Brown 1991) Complications in practice: - formation of separate elements - contradictory cues - influence of top-down constraints (context, expectations)... E682 SAPR - Dan Ellis L4 - Perception Summary Auditory perception provides the ground truth underlying audio processing Physiology specifies information available Psychophysics measures basic sensitivities Sound sources requires further organization Strong contextual effects in speech perception Sound Transduce Multiple represent'ns Scene analysis High-level recognition Parting thought: Is pitch central to communication? Why? E682 SAPR - Dan Ellis L4 - Perception

Lecture 3: Perception

Lecture 3: Perception ELEN E4896 MUSIC SIGNAL PROCESSING Lecture 3: Perception 1. Ear Physiology 2. Auditory Psychophysics 3. Pitch Perception 4. Music Perception Dan Ellis Dept. Electrical Engineering, Columbia University

More information

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds Hearing Lectures Acoustics of Speech and Hearing Week 2-10 Hearing 3: Auditory Filtering 1. Loudness of sinusoids mainly (see Web tutorial for more) 2. Pitch of sinusoids mainly (see Web tutorial for more)

More information

Required Slide. Session Objectives

Required Slide. Session Objectives Auditory Physiology Required Slide Session Objectives Auditory System: At the end of this session, students will be able to: 1. Characterize the range of normal human hearing. 2. Understand the components

More information

Linguistic Phonetics Fall 2005

Linguistic Phonetics Fall 2005 MIT OpenCourseWare http://ocw.mit.edu 24.963 Linguistic Phonetics Fall 2005 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 24.963 Linguistic Phonetics

More information

Acoustics, signals & systems for audiology. Psychoacoustics of hearing impairment

Acoustics, signals & systems for audiology. Psychoacoustics of hearing impairment Acoustics, signals & systems for audiology Psychoacoustics of hearing impairment Three main types of hearing impairment Conductive Sound is not properly transmitted from the outer to the inner ear Sensorineural

More information

Deafness and hearing impairment

Deafness and hearing impairment Auditory Physiology Deafness and hearing impairment About one in every 10 Americans has some degree of hearing loss. The great majority develop hearing loss as they age. Hearing impairment in very early

More information

Spectrograms (revisited)

Spectrograms (revisited) Spectrograms (revisited) We begin the lecture by reviewing the units of spectrograms, which I had only glossed over when I covered spectrograms at the end of lecture 19. We then relate the blocks of a

More information

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening AUDL GS08/GAV1 Signals, systems, acoustics and the ear Pitch & Binaural listening Review 25 20 15 10 5 0-5 100 1000 10000 25 20 15 10 5 0-5 100 1000 10000 Part I: Auditory frequency selectivity Tuning

More information

HCS 7367 Speech Perception

HCS 7367 Speech Perception Long-term spectrum of speech HCS 7367 Speech Perception Connected speech Absolute threshold Males Dr. Peter Assmann Fall 212 Females Long-term spectrum of speech Vowels Males Females 2) Absolute threshold

More information

COM3502/4502/6502 SPEECH PROCESSING

COM3502/4502/6502 SPEECH PROCESSING COM3502/4502/6502 SPEECH PROCESSING Lecture 4 Hearing COM3502/4502/6502 Speech Processing: Lecture 4, slide 1 The Speech Chain SPEAKER Ear LISTENER Feedback Link Vocal Muscles Ear Sound Waves Taken from:

More information

Hearing Sound. The Human Auditory System. The Outer Ear. Music 170: The Ear

Hearing Sound. The Human Auditory System. The Outer Ear. Music 170: The Ear Hearing Sound Music 170: The Ear Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego (UCSD) November 17, 2016 Sound interpretation in the auditory system is done by

More information

Music 170: The Ear. Tamara Smyth, Department of Music, University of California, San Diego (UCSD) November 17, 2016

Music 170: The Ear. Tamara Smyth, Department of Music, University of California, San Diego (UCSD) November 17, 2016 Music 170: The Ear Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego (UCSD) November 17, 2016 1 Hearing Sound Sound interpretation in the auditory system is done by

More information

! Can hear whistle? ! Where are we on course map? ! What we did in lab last week. ! Psychoacoustics

! Can hear whistle? ! Where are we on course map? ! What we did in lab last week. ! Psychoacoustics 2/14/18 Can hear whistle? Lecture 5 Psychoacoustics Based on slides 2009--2018 DeHon, Koditschek Additional Material 2014 Farmer 1 2 There are sounds we cannot hear Depends on frequency Where are we on

More information

Digital Speech and Audio Processing Spring

Digital Speech and Audio Processing Spring Digital Speech and Audio Processing Spring 2008-1 Ear Anatomy 1. Outer ear: Funnels sounds / amplifies 2. Middle ear: acoustic impedance matching mechanical transformer 3. Inner ear: acoustic transformer

More information

Auditory Scene Analysis: phenomena, theories and computational models

Auditory Scene Analysis: phenomena, theories and computational models Auditory Scene Analysis: phenomena, theories and computational models July 1998 Dan Ellis International Computer Science Institute, Berkeley CA Outline 1 2 3 4 The computational

More information

Linguistic Phonetics. Basic Audition. Diagram of the inner ear removed due to copyright restrictions.

Linguistic Phonetics. Basic Audition. Diagram of the inner ear removed due to copyright restrictions. 24.963 Linguistic Phonetics Basic Audition Diagram of the inner ear removed due to copyright restrictions. 1 Reading: Keating 1985 24.963 also read Flemming 2001 Assignment 1 - basic acoustics. Due 9/22.

More information

Topics in Linguistic Theory: Laboratory Phonology Spring 2007

Topics in Linguistic Theory: Laboratory Phonology Spring 2007 MIT OpenCourseWare http://ocw.mit.edu 24.91 Topics in Linguistic Theory: Laboratory Phonology Spring 27 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Chapter 11: Sound, The Auditory System, and Pitch Perception

Chapter 11: Sound, The Auditory System, and Pitch Perception Chapter 11: Sound, The Auditory System, and Pitch Perception Overview of Questions What is it that makes sounds high pitched or low pitched? How do sound vibrations inside the ear lead to the perception

More information

Systems Neuroscience Oct. 16, Auditory system. http:

Systems Neuroscience Oct. 16, Auditory system. http: Systems Neuroscience Oct. 16, 2018 Auditory system http: www.ini.unizh.ch/~kiper/system_neurosci.html The physics of sound Measuring sound intensity We are sensitive to an enormous range of intensities,

More information

PHYS 1240 Sound and Music Professor John Price. Cell Phones off Laptops closed Clickers on Transporter energized

PHYS 1240 Sound and Music Professor John Price. Cell Phones off Laptops closed Clickers on Transporter energized PHYS 1240 Sound and Music Professor John Price Cell Phones off Laptops closed Clickers on Transporter energized The Ear and Hearing Thanks to Jed Whittaker for many of these slides Ear anatomy substructures

More information

Topic 4. Pitch & Frequency

Topic 4. Pitch & Frequency Topic 4 Pitch & Frequency A musical interlude KOMBU This solo by Kaigal-ool of Huun-Huur-Tu (accompanying himself on doshpuluur) demonstrates perfectly the characteristic sound of the Xorekteer voice An

More information

ENT 318 Artificial Organs Physiology of Ear

ENT 318 Artificial Organs Physiology of Ear ENT 318 Artificial Organs Physiology of Ear Lecturer: Ahmad Nasrul Norali The Ear The Ear Components of hearing mechanism - Outer Ear - Middle Ear - Inner Ear - Central Auditory Nervous System Major Divisions

More information

Hearing. istockphoto/thinkstock

Hearing. istockphoto/thinkstock Hearing istockphoto/thinkstock Audition The sense or act of hearing The Stimulus Input: Sound Waves Sound waves are composed of changes in air pressure unfolding over time. Acoustical transduction: Conversion

More information

Computational Perception /785. Auditory Scene Analysis

Computational Perception /785. Auditory Scene Analysis Computational Perception 15-485/785 Auditory Scene Analysis A framework for auditory scene analysis Auditory scene analysis involves low and high level cues Low level acoustic cues are often result in

More information

Hearing. Juan P Bello

Hearing. Juan P Bello Hearing Juan P Bello The human ear The human ear Outer Ear The human ear Middle Ear The human ear Inner Ear The cochlea (1) It separates sound into its various components If uncoiled it becomes a tapering

More information

Sound Waves. Sensation and Perception. Sound Waves. Sound Waves. Sound Waves

Sound Waves. Sensation and Perception. Sound Waves. Sound Waves. Sound Waves Sensation and Perception Part 3 - Hearing Sound comes from pressure waves in a medium (e.g., solid, liquid, gas). Although we usually hear sounds in air, as long as the medium is there to transmit the

More information

HEARING AND PSYCHOACOUSTICS

HEARING AND PSYCHOACOUSTICS CHAPTER 2 HEARING AND PSYCHOACOUSTICS WITH LIDIA LEE I would like to lead off the specific audio discussions with a description of the audio receptor the ear. I believe it is always a good idea to understand

More information

PSY 215 Lecture 10 Topic: Hearing Chapter 7, pages

PSY 215 Lecture 10 Topic: Hearing Chapter 7, pages PSY 215 Lecture 10 Topic: Hearing Chapter 7, pages 189-197 Corrections: NTC 09-1, page 3, the Superior Colliculus is in the midbrain (Mesencephalon). Announcements: Movie next Monday: Case of the frozen

More information

L2: Speech production and perception Anatomy of the speech organs Models of speech production Anatomy of the ear Auditory psychophysics

L2: Speech production and perception Anatomy of the speech organs Models of speech production Anatomy of the ear Auditory psychophysics L2: Speech production and perception Anatomy of the speech organs Models of speech production Anatomy of the ear Auditory psychophysics Introduction to Speech Processing Ricardo Gutierrez-Osuna CSE@TAMU

More information

Hearing. and other senses

Hearing. and other senses Hearing and other senses Sound Sound: sensed variations in air pressure Frequency: number of peaks that pass a point per second (Hz) Pitch 2 Some Sound and Hearing Links Useful (and moderately entertaining)

More information

SPHSC 462 HEARING DEVELOPMENT. Overview Review of Hearing Science Introduction

SPHSC 462 HEARING DEVELOPMENT. Overview Review of Hearing Science Introduction SPHSC 462 HEARING DEVELOPMENT Overview Review of Hearing Science Introduction 1 Overview of course and requirements Lecture/discussion; lecture notes on website http://faculty.washington.edu/lawerner/sphsc462/

More information

Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear

Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear The psychophysics of cochlear implants Stuart Rosen Professor of Speech and Hearing Science Speech, Hearing and Phonetic Sciences Division of Psychology & Language Sciences Prelude Envelope and temporal

More information

SLHS 1301 The Physics and Biology of Spoken Language. Practice Exam 2. b) 2 32

SLHS 1301 The Physics and Biology of Spoken Language. Practice Exam 2. b) 2 32 SLHS 1301 The Physics and Biology of Spoken Language Practice Exam 2 Chapter 9 1. In analog-to-digital conversion, quantization of the signal means that a) small differences in signal amplitude over time

More information

Acoustics Research Institute

Acoustics Research Institute Austrian Academy of Sciences Acoustics Research Institute Modeling Modelingof ofauditory AuditoryPerception Perception Bernhard BernhardLaback Labackand andpiotr PiotrMajdak Majdak http://www.kfs.oeaw.ac.at

More information

ID# Exam 2 PS 325, Fall 2003

ID# Exam 2 PS 325, Fall 2003 ID# Exam 2 PS 325, Fall 2003 As always, the Honor Code is in effect and you ll need to write the code and sign it at the end of the exam. Read each question carefully and answer it completely. Although

More information

HCS 7367 Speech Perception

HCS 7367 Speech Perception Babies 'cry in mother's tongue' HCS 7367 Speech Perception Dr. Peter Assmann Fall 212 Babies' cries imitate their mother tongue as early as three days old German researchers say babies begin to pick up

More information

Signals, systems, acoustics and the ear. Week 5. The peripheral auditory system: The ear as a signal processor

Signals, systems, acoustics and the ear. Week 5. The peripheral auditory system: The ear as a signal processor Signals, systems, acoustics and the ear Week 5 The peripheral auditory system: The ear as a signal processor Think of this set of organs 2 as a collection of systems, transforming sounds to be sent to

More information

Topic 4. Pitch & Frequency. (Some slides are adapted from Zhiyao Duan s course slides on Computer Audition and Its Applications in Music)

Topic 4. Pitch & Frequency. (Some slides are adapted from Zhiyao Duan s course slides on Computer Audition and Its Applications in Music) Topic 4 Pitch & Frequency (Some slides are adapted from Zhiyao Duan s course slides on Computer Audition and Its Applications in Music) A musical interlude KOMBU This solo by Kaigal-ool of Huun-Huur-Tu

More information

Outline. 4. The Ear and the Perception of Sound (Psychoacoustics) A.1 Outer Ear Amplifies Sound. Introduction

Outline. 4. The Ear and the Perception of Sound (Psychoacoustics) A.1 Outer Ear Amplifies Sound. Introduction 4. The Ear and the Perception of Sound (Psychoacoustics) 1 Outline A. Structure of the Ear B. Perception of Loudness C. Perception of Pitch D. References Updated May 13, 01 Introduction 3 A. The Structure

More information

ID# Final Exam PS325, Fall 1997

ID# Final Exam PS325, Fall 1997 ID# Final Exam PS325, Fall 1997 Good luck on this exam. Answer each question carefully and completely. Keep your eyes foveated on your own exam, as the Skidmore Honor Code is in effect (as always). Have

More information

Issues faced by people with a Sensorineural Hearing Loss

Issues faced by people with a Sensorineural Hearing Loss Issues faced by people with a Sensorineural Hearing Loss Issues faced by people with a Sensorineural Hearing Loss 1. Decreased Audibility 2. Decreased Dynamic Range 3. Decreased Frequency Resolution 4.

More information

Auditory Physiology PSY 310 Greg Francis. Lecture 30. Organ of Corti

Auditory Physiology PSY 310 Greg Francis. Lecture 30. Organ of Corti Auditory Physiology PSY 310 Greg Francis Lecture 30 Waves, waves, waves. Organ of Corti Tectorial membrane Sits on top Inner hair cells Outer hair cells The microphone for the brain 1 Hearing Perceptually,

More information

The Structure and Function of the Auditory Nerve

The Structure and Function of the Auditory Nerve The Structure and Function of the Auditory Nerve Brad May Structure and Function of the Auditory and Vestibular Systems (BME 580.626) September 21, 2010 1 Objectives Anatomy Basic response patterns Frequency

More information

Frequency refers to how often something happens. Period refers to the time it takes something to happen.

Frequency refers to how often something happens. Period refers to the time it takes something to happen. Lecture 2 Properties of Waves Frequency and period are distinctly different, yet related, quantities. Frequency refers to how often something happens. Period refers to the time it takes something to happen.

More information

Auditory System. Barb Rohrer (SEI )

Auditory System. Barb Rohrer (SEI ) Auditory System Barb Rohrer (SEI614 2-5086) Sounds arise from mechanical vibration (creating zones of compression and rarefaction; which ripple outwards) Transmitted through gaseous, aqueous or solid medium

More information

PSY 214 Lecture # (11/9/2011) (Sound, Auditory & Speech Perception) Dr. Achtman PSY 214

PSY 214 Lecture # (11/9/2011) (Sound, Auditory & Speech Perception) Dr. Achtman PSY 214 PSY 214 Lecture 16 Topic: Sound, Auditory System & Speech Perception Chapter 11, pages 270-289 Corrections: None Announcements: CD is available outside Dr Achtman s office if you would like to see demonstrations

More information

ID# Exam 2 PS 325, Fall 2009

ID# Exam 2 PS 325, Fall 2009 ID# Exam 2 PS 325, Fall 2009 As always, the Skidmore Honor Code is in effect. At the end of the exam, I ll have you write and sign something to attest to that fact. The exam should contain no surprises,

More information

Intro to Audition & Hearing

Intro to Audition & Hearing Intro to Audition & Hearing Lecture 16 Chapter 9, part II Jonathan Pillow Sensation & Perception (PSY 345 / NEU 325) Fall 2017 1 Sine wave: one of the simplest kinds of sounds: sound for which pressure

More information

What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for

What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for What you re in for Speech processing schemes for cochlear implants Stuart Rosen Professor of Speech and Hearing Science Speech, Hearing and Phonetic Sciences Division of Psychology & Language Sciences

More information

HEARING. Structure and Function

HEARING. Structure and Function HEARING Structure and Function Rory Attwood MBChB,FRCS Division of Otorhinolaryngology Faculty of Health Sciences Tygerberg Campus, University of Stellenbosch Analyse Function of auditory system Discriminate

More information

J Jeffress model, 3, 66ff

J Jeffress model, 3, 66ff Index A Absolute pitch, 102 Afferent projections, inferior colliculus, 131 132 Amplitude modulation, coincidence detector, 152ff inferior colliculus, 152ff inhibition models, 156ff models, 152ff Anatomy,

More information

Receptors / physiology

Receptors / physiology Hearing: physiology Receptors / physiology Energy transduction First goal of a sensory/perceptual system? Transduce environmental energy into neural energy (or energy that can be interpreted by perceptual

More information

Hearing. Figure 1. The human ear (from Kessel and Kardon, 1979)

Hearing. Figure 1. The human ear (from Kessel and Kardon, 1979) Hearing The nervous system s cognitive response to sound stimuli is known as psychoacoustics: it is partly acoustics and partly psychology. Hearing is a feature resulting from our physiology that we tend

More information

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair Who are cochlear implants for? Essential feature People with little or no hearing and little conductive component to the loss who receive little or no benefit from a hearing aid. Implants seem to work

More information

Representation of sound in the auditory nerve

Representation of sound in the auditory nerve Representation of sound in the auditory nerve Eric D. Young Department of Biomedical Engineering Johns Hopkins University Young, ED. Neural representation of spectral and temporal information in speech.

More information

An Auditory-Model-Based Electrical Stimulation Strategy Incorporating Tonal Information for Cochlear Implant

An Auditory-Model-Based Electrical Stimulation Strategy Incorporating Tonal Information for Cochlear Implant Annual Progress Report An Auditory-Model-Based Electrical Stimulation Strategy Incorporating Tonal Information for Cochlear Implant Joint Research Centre for Biomedical Engineering Mar.7, 26 Types of Hearing

More information

Modulation and Top-Down Processing in Audition

Modulation and Top-Down Processing in Audition Modulation and Top-Down Processing in Audition Malcolm Slaney 1,2 and Greg Sell 2 1 Yahoo! Research 2 Stanford CCRMA Outline The Non-Linear Cochlea Correlogram Pitch Modulation and Demodulation Information

More information

BCS 221: Auditory Perception BCS 521 & PSY 221

BCS 221: Auditory Perception BCS 521 & PSY 221 BCS 221: Auditory Perception BCS 521 & PSY 221 Time: MW 10:25 11:40 AM Recitation: F 10:25 11:25 AM Room: Hutchinson 473 Lecturer: Dr. Kevin Davis Office: 303E Meliora Hall Office hours: M 1 3 PM kevin_davis@urmc.rochester.edu

More information

Auditory System & Hearing

Auditory System & Hearing Auditory System & Hearing Chapters 9 and 10 Lecture 17 Jonathan Pillow Sensation & Perception (PSY 345 / NEU 325) Spring 2015 1 Cochlea: physical device tuned to frequency! place code: tuning of different

More information

USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES

USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES Varinthira Duangudom and David V Anderson School of Electrical and Computer Engineering, Georgia Institute of Technology Atlanta, GA 30332

More information

16.400/453J Human Factors Engineering /453. Audition. Prof. D. C. Chandra Lecture 14

16.400/453J Human Factors Engineering /453. Audition. Prof. D. C. Chandra Lecture 14 J Human Factors Engineering Audition Prof. D. C. Chandra Lecture 14 1 Overview Human ear anatomy and hearing Auditory perception Brainstorming about sounds Auditory vs. visual displays Considerations for

More information

Lecture 6 Hearing 1. Raghav Rajan Bio 354 Neurobiology 2 January 28th All lecture material from the following links unless otherwise mentioned:

Lecture 6 Hearing 1. Raghav Rajan Bio 354 Neurobiology 2 January 28th All lecture material from the following links unless otherwise mentioned: Lecture 6 Hearing 1 All lecture material from the following links unless otherwise mentioned: 1. http://wws.weizmann.ac.il/neurobiology/labs/ulanovsky/sites/neurobiology.labs.ulanovsky/files/uploads/purves_ch12_ch13_hearing

More information

Auditory scene analysis in humans: Implications for computational implementations.

Auditory scene analysis in humans: Implications for computational implementations. Auditory scene analysis in humans: Implications for computational implementations. Albert S. Bregman McGill University Introduction. The scene analysis problem. Two dimensions of grouping. Recognition

More information

Binaural Hearing. Steve Colburn Boston University

Binaural Hearing. Steve Colburn Boston University Binaural Hearing Steve Colburn Boston University Outline Why do we (and many other animals) have two ears? What are the major advantages? What is the observed behavior? How do we accomplish this physiologically?

More information

Hearing II Perceptual Aspects

Hearing II Perceptual Aspects Hearing II Perceptual Aspects Overview of Topics Chapter 6 in Chaudhuri Intensity & Loudness Frequency & Pitch Auditory Space Perception 1 2 Intensity & Loudness Loudness is the subjective perceptual quality

More information

Who are cochlear implants for?

Who are cochlear implants for? Who are cochlear implants for? People with little or no hearing and little conductive component to the loss who receive little or no benefit from a hearing aid. Implants seem to work best in adults who

More information

SOLUTIONS Homework #3. Introduction to Engineering in Medicine and Biology ECEN 1001 Due Tues. 9/30/03

SOLUTIONS Homework #3. Introduction to Engineering in Medicine and Biology ECEN 1001 Due Tues. 9/30/03 SOLUTIONS Homework #3 Introduction to Engineering in Medicine and Biology ECEN 1001 Due Tues. 9/30/03 Problem 1: a) Where in the cochlea would you say the process of "fourier decomposition" of the incoming

More information

Using Source Models in Speech Separation

Using Source Models in Speech Separation Using Source Models in Speech Separation Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA dpwe@ee.columbia.edu http://labrosa.ee.columbia.edu/

More information

Thresholds for different mammals

Thresholds for different mammals Loudness Thresholds for different mammals 8 7 What s the first thing you d want to know? threshold (db SPL) 6 5 4 3 2 1 hum an poodle m ouse threshold Note bowl shape -1 1 1 1 1 frequency (Hz) Sivian &

More information

Computational Auditory Scene Analysis: An overview and some observations. CASA survey. Other approaches

Computational Auditory Scene Analysis: An overview and some observations. CASA survey. Other approaches CASA talk - Haskins/NUWC - Dan Ellis 1997oct24/5-1 The importance of auditory illusions for artificial listeners 1 Dan Ellis International Computer Science Institute, Berkeley CA

More information

3-D Sound and Spatial Audio. What do these terms mean?

3-D Sound and Spatial Audio. What do these terms mean? 3-D Sound and Spatial Audio What do these terms mean? Both terms are very general. 3-D sound usually implies the perception of point sources in 3-D space (could also be 2-D plane) whether the audio reproduction

More information

Outline. The ear and perception of sound (Psychoacoustics) A.1 Outer Ear Amplifies Sound. Introduction

Outline. The ear and perception of sound (Psychoacoustics) A.1 Outer Ear Amplifies Sound. Introduction The ear and perception of sound (Psychoacoustics) 1 Outline A. Structure of the Ear B. Perception of Pitch C. Perception of Loudness D. Timbre (quality of sound) E. References Updated 01Aug0 Introduction

More information

Binaural Hearing for Robots Introduction to Robot Hearing

Binaural Hearing for Robots Introduction to Robot Hearing Binaural Hearing for Robots Introduction to Robot Hearing 1Radu Horaud Binaural Hearing for Robots 1. Introduction to Robot Hearing 2. Methodological Foundations 3. Sound-Source Localization 4. Machine

More information

Speech Generation and Perception

Speech Generation and Perception Speech Generation and Perception 1 Speech Generation and Perception : The study of the anatomy of the organs of speech is required as a background for articulatory and acoustic phonetics. An understanding

More information

Sound, Mixtures, and Learning

Sound, Mixtures, and Learning Sound, Mixtures, and Learning Dan Ellis Laboratory for Recognition and Organization of Speech and Audio (LabROSA) Electrical Engineering, Columbia University http://labrosa.ee.columbia.edu/

More information

Human Acoustic Processing

Human Acoustic Processing Human Acoustic Processing Sound and Light The Ear Cochlea Auditory Pathway Speech Spectrogram Vocal Cords Formant Frequencies Time Warping Hidden Markov Models Signal, Time and Brain Process of temporal

More information

Hearing. PSYCHOLOGY (8th Edition, in Modules) David Myers. Module 14. Hearing. Hearing

Hearing. PSYCHOLOGY (8th Edition, in Modules) David Myers. Module 14. Hearing. Hearing PSYCHOLOGY (8th Edition, in Modules) David Myers PowerPoint Slides Aneeq Ahmad Henderson State University Worth Publishers, 2007 1 Hearing Module 14 2 Hearing Hearing The Stimulus Input: Sound Waves The

More information

Discrete Signal Processing

Discrete Signal Processing 1 Discrete Signal Processing C.M. Liu Perceptual Lab, College of Computer Science National Chiao-Tung University http://www.cs.nctu.edu.tw/~cmliu/courses/dsp/ ( Office: EC538 (03)5731877 cmliu@cs.nctu.edu.tw

More information

Music and Hearing in the Older Population: an Audiologist's Perspective

Music and Hearing in the Older Population: an Audiologist's Perspective Music and Hearing in the Older Population: an Audiologist's Perspective Dwight Ough, M.A., CCC-A Audiologist Charlotte County Hearing Health Care Centre Inc. St. Stephen, New Brunswick Anatomy and Physiology

More information

The Ear. The ear can be divided into three major parts: the outer ear, the middle ear and the inner ear.

The Ear. The ear can be divided into three major parts: the outer ear, the middle ear and the inner ear. The Ear The ear can be divided into three major parts: the outer ear, the middle ear and the inner ear. The Ear There are three components of the outer ear: Pinna: the fleshy outer part of the ear which

More information

Chapter 1: Introduction to digital audio

Chapter 1: Introduction to digital audio Chapter 1: Introduction to digital audio Applications: audio players (e.g. MP3), DVD-audio, digital audio broadcast, music synthesizer, digital amplifier and equalizer, 3D sound synthesis 1 Properties

More information

SPECIAL SENSES: THE AUDITORY SYSTEM

SPECIAL SENSES: THE AUDITORY SYSTEM SPECIAL SENSES: THE AUDITORY SYSTEM REVISION OF PHYSICS: WAVES A wave is an oscillation of power, sound waves have two main characteristics: amplitude, which is the maximum displacement or the power of

More information

Hearing: Physiology and Psychoacoustics

Hearing: Physiology and Psychoacoustics 9 Hearing: Physiology and Psychoacoustics Click Chapter to edit 9 Hearing: Master title Physiology style and Psychoacoustics The Function of Hearing What Is Sound? Basic Structure of the Mammalian Auditory

More information

Hearing I: Sound & The Ear

Hearing I: Sound & The Ear Hearing I: Sound & The Ear Overview of Topics Chapter 5 in Chaudhuri Philosophical Aside: If a tree falls in the forest and no one is there to hear it... Qualities of sound energy and sound perception

More information

Hearing in the Environment

Hearing in the Environment 10 Hearing in the Environment Click Chapter to edit 10 Master Hearing title in the style Environment Sound Localization Complex Sounds Auditory Scene Analysis Continuity and Restoration Effects Auditory

More information

The Production of Speech Speed of Sound Air (at sea level, 20 C) 343 m/s (1230 km/h) V=( T) m/s

The Production of Speech Speed of Sound Air (at sea level, 20 C) 343 m/s (1230 km/h) V=( T) m/s Overview E.M. Bakker The Physics of Sound The Production of Speech The Perception of Speech and Audio Frequency Masking Noise Masking Temporal Masking Phonetics and Phonology Analog and Digital Signals

More information

PSY 214 Lecture 16 (11/09/2011) (Sound, auditory system & pitch perception) Dr. Achtman PSY 214

PSY 214 Lecture 16 (11/09/2011) (Sound, auditory system & pitch perception) Dr. Achtman PSY 214 PSY 214 Lecture 16 Topic: Sound, auditory system, & pitch perception Chapter 11, pages 268-288 Corrections: None needed Announcements: At the beginning of class, we went over some demos from the virtual

More information

Auditory Physiology Richard M. Costanzo, Ph.D.

Auditory Physiology Richard M. Costanzo, Ph.D. Auditory Physiology Richard M. Costanzo, Ph.D. OBJECTIVES After studying the material of this lecture, the student should be able to: 1. Describe the morphology and function of the following structures:

More information

Automatic audio analysis for content description & indexing

Automatic audio analysis for content description & indexing Automatic audio analysis for content description & indexing Dan Ellis International Computer Science Institute, Berkeley CA Outline 1 2 3 4 5 Auditory Scene Analysis (ASA) Computational

More information

Auditory nerve. Amanda M. Lauer, Ph.D. Dept. of Otolaryngology-HNS

Auditory nerve. Amanda M. Lauer, Ph.D. Dept. of Otolaryngology-HNS Auditory nerve Amanda M. Lauer, Ph.D. Dept. of Otolaryngology-HNS May 30, 2016 Overview Pathways (structural organization) Responses Damage Basic structure of the auditory nerve Auditory nerve in the cochlea

More information

Loudness. Loudness is not simply sound intensity!

Loudness. Loudness is not simply sound intensity! is not simply sound intensity! Sound loudness is a subjective term describing the strength of the ear's perception of a sound. It is intimately related to sound intensity but can by no means be considered

More information

Mechanical Properties of the Cochlea. Reading: Yost Ch. 7

Mechanical Properties of the Cochlea. Reading: Yost Ch. 7 Mechanical Properties of the Cochlea CF Reading: Yost Ch. 7 The Cochlea Inner ear contains auditory and vestibular sensory organs. Cochlea is a coiled tri-partite tube about 35 mm long. Basilar membrane,

More information

11 Music and Speech Perception

11 Music and Speech Perception 11 Music and Speech Perception Properties of sound Sound has three basic dimensions: Frequency (pitch) Intensity (loudness) Time (length) Properties of sound The frequency of a sound wave, measured in

More information

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair Who are cochlear implants for? Essential feature People with little or no hearing and little conductive component to the loss who receive little or no benefit from a hearing aid. Implants seem to work

More information

21/01/2013. Binaural Phenomena. Aim. To understand binaural hearing Objectives. Understand the cues used to determine the location of a sound source

21/01/2013. Binaural Phenomena. Aim. To understand binaural hearing Objectives. Understand the cues used to determine the location of a sound source Binaural Phenomena Aim To understand binaural hearing Objectives Understand the cues used to determine the location of a sound source Understand sensitivity to binaural spatial cues, including interaural

More information

Hearing I: Sound & The Ear

Hearing I: Sound & The Ear Hearing I: Sound & The Ear Overview of Topics Chapter 5 in Chaudhuri Philosophical Aside: If a tree falls in the forest and no one is there to hear it... Qualities of sound energy and sound perception

More information

How Do Our Ears Work? Quiz

How Do Our Ears Work? Quiz The Marvelous Ear How Do Our Ears Work? Quiz 1. How do humans hear sounds? 2. How does human hearing work? Sketch and label the system. 3. Do you know any sensors that detect sound and how they might do

More information

Psychoacoustical Models WS 2016/17

Psychoacoustical Models WS 2016/17 Psychoacoustical Models WS 2016/17 related lectures: Applied and Virtual Acoustics (Winter Term) Advanced Psychoacoustics (Summer Term) Sound Perception 2 Frequency and Level Range of Human Hearing Source:

More information

Lecture 9: Speech Recognition: Front Ends

Lecture 9: Speech Recognition: Front Ends EE E682: Speech & Audio Processing & Recognition Lecture 9: Speech Recognition: Front Ends 1 2 Recognizing Speech Feature Calculation Dan Ellis http://www.ee.columbia.edu/~dpwe/e682/

More information

Healthy Organ of Corti. Loss of OHCs. How to use and interpret the TEN(HL) test for diagnosis of Dead Regions in the cochlea

Healthy Organ of Corti. Loss of OHCs. How to use and interpret the TEN(HL) test for diagnosis of Dead Regions in the cochlea 'How we do it' Healthy Organ of Corti How to use and interpret the TEN(HL) test for diagnosis of s in the cochlea Karolina Kluk¹ Brian C.J. Moore² Mouse IHCs OHCs ¹ Audiology and Deafness Research Group,

More information