Categorical Perception

Similar documents
Sensory Thresholds and Signal Detection. Lecture 13

Research methods in sensation and perception. (and the princess and the pea)

Introduction to Sensation and Perception

Perception Lecture 1

Three methods for measuring perception. Magnitude estimation. Steven s power law. P = k S n

Lecturer: Rob van der Willigen 11/9/08

Lecturer: Rob van der Willigen 11/9/08

Sensation, Part 1 Gleitman et al. (2011), Chapter 4

d). Draw the following neural circuits (using the notation taught in class) and then say what would happen if they were stimulated as specified.

SUPPLEMENTARY INFORMATION. Table 1 Patient characteristics Preoperative. language testing

Myers PSYCHOLOGY. (6th Ed) Chapter 5. Sensation

Psychophysics & a brief intro to the nervous system

ID# Exam 2 PS 325, Fall 2009

ID# Exam 2 PS 325, Fall 2003

Sensation and Perception: How the World Enters the Mind

Schwartz and Krantz, Sensation and Perception

ID# Final Exam PS325, Fall 1997

INTRODUCTION J. Acoust. Soc. Am. 103 (2), February /98/103(2)/1080/5/$ Acoustical Society of America 1080

1 Introduction. 1 Early Philosophy of Perception. 1 Early Philosophy of Perception (cont d)

Practice Test Questions

Auditory Scene Analysis

Consonant Perception test

SENSATION AND PERCEPTION KEY TERMS

Psychology Perception

Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear

Name Psychophysical Methods Laboratory

Types of questions. You need to know. Short question. Short question. Measurement Scale: Ordinal Scale

Confidence ratings in speech perception research: Evaluation of an efficient technique for discrimination testing,',

Congruency Effects with Dynamic Auditory Stimuli: Design Implications

Detection Theory: Sensitivity and Response Bias

Lecture Outline. The GIN test and some clinical applications. Introduction. Temporal processing. Gap detection. Temporal resolution and discrimination

Sound localization psychophysics

Empirical Formula for Creating Error Bars for the Method of Paired Comparison

9/29/14. Amanda M. Lauer, Dept. of Otolaryngology- HNS. From Signal Detection Theory and Psychophysics, Green & Swets (1966)

A temporally dynamic context effect that disrupts voice onset time discrimination of rapidly successive stimuli

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening

Chapter 1 Introduction. Welcome to Our World Thresholds and the Dawn of Psychophysics Sensory Neuroscience and the Biology of Perception

Fundamentals of Psychophysics

Speech Cue Weighting in Fricative Consonant Perception in Hearing Impaired Children

Unit 3 - Sensation & Perception. Chapter 4 Part 1: Intro. to S & P

Acoustics, signals & systems for audiology. Psychoacoustics of hearing impairment

INTRODUCTION. Electronic mail:

group by pitch: similar frequencies tend to be grouped together - attributed to a common source.

The role of periodicity in the perception of masked speech with simulated and real cochlear implants

Shaw - PSYC& 100 Lilienfeld et al (2014) - Chapter 4 Sensation and Perception: How we sense and conceptualize the world

Infant Hearing Development: Translating Research Findings into Clinical Practice. Auditory Development. Overview

THE ROLE OF VISUAL SPEECH CUES IN THE AUDITORY PERCEPTION OF SYNTHETIC STIMULI BY CHILDREN USING A COCHLEAR IMPLANT AND CHILDREN WITH NORMAL HEARING

PARAMETERS OF ECHOIC MEMORY

Prof. Greg Francis 7/7/08

PHGY 210,2,4 - Physiology SENSORY PHYSIOLOGY. Martin Paré

= add definition here. Definition Slide

Sensation and Perception

Tactile Communication of Speech

Hearing in the Environment

Spreadsheet signal detection

Auditory Physiology PSY 310 Greg Francis. Lecture 30. Organ of Corti

Sensation. I. Basic Concepts II. Characteristics of Sensory Systems III. The Visual System

Thomas D. Carrell Department of Communication Sciences, University of Nebraska, Lincoln, Nebraska 68538

Hearing. istockphoto/thinkstock

Computing discriminability and bias with the R software

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds

HCS 7367 Speech Perception

Detection Theory: Sensory and Decision Processes

An Update on Auditory Neuropathy Spectrum Disorder in Children

Hearing. and other senses

Prof. Greg Francis 5/23/08

Effects of Categorization and Discrimination Training on Auditory Perceptual Space

INTRODUCTION. Institute of Technology, Cambridge, MA Electronic mail:

The functional importance of age-related differences in temporal processing

Definition Slides. Sensation. Perception. Bottom-up processing. Selective attention. Top-down processing 11/3/2013

Psychology Research Process

Sound Localization PSY 310 Greg Francis. Lecture 31. Audition

Gick et al.: JASA Express Letters DOI: / Published Online 17 March 2008

Biological Psychology. Unit Two AD Mr. Cline Marshall High School Psychology

A common basis for auditory sensory storage in perception and immediate memory

Binaural Hearing. Why two ears? Definitions

EFFECTS OF RHYTHM ON THE PERCEPTION OF URGENCY AND IRRITATION IN 1 AUDITORY SIGNALS. Utrecht University

Spectrograms (revisited)

Sensation is the conscious experience associated with an environmental stimulus. It is the acquisition of raw information by the body s sense organs

Classic experiments in sensation and perception

PHONETIC AND AUDITORY TRADING RELATIONS BETWEEN ACOUSTIC CUES IN SPEECH PERCEPTION: PRELIMINARY RESULTS. Repp. Bruno H.

HCS 7367 Speech Perception

Hearing II Perceptual Aspects

Audiovisual speech perception in children with autism spectrum disorders and typical controls

A FRÖHLICH EFFECT IN MEMORY FOR AUDITORY PITCH: EFFECTS OF CUEING AND OF REPRESENTATIONAL GRAVITY. Timothy L. Hubbard 1 & Susan E.

Within-Consonant Perceptual Differences in the Hearing Impaired Ear

Language Speech. Speech is the preferred modality for language.

Mr. Silimperi Council Rock High School South Chapter 5 Sensation Sensation II

TESTING A NEW THEORY OF PSYCHOPHYSICAL SCALING: TEMPORAL LOUDNESS INTEGRATION

Cue-specific effects of categorization training on the relative weighting of acoustic cues to consonant voicing in English

Rajeev Raizada: Statement of research interests

Classical Psychophysical Methods (cont.)

Sound Waves. Sensation and Perception. Sound Waves. Sound Waves. Sound Waves

USING CUED SPEECH WITH SPECIAL CHILDREN Pamela H. Beck, 2002

Perceptual Plasticity in Spatial Auditory Displays

Sensation and Perception

Methods in Perception. Qualitative Methods in Perception. Qualitative Observation. Methods in Perception Research

Methods in Perception Research. Chapter 1: Fundamentals of Sensory Perception

Laboratory exercise: Adaptive tracks and psychometric functions

Study of perceptual balance for binaural dichotic presentation

Transcription:

Categorical Perception Discrimination for some speech contrasts is poor within phonetic categories and good between categories. Unusual, not found for most perceptual contrasts. Influenced by task, expectations, memory and quality of stimuli. 10/5/11 Speech Perception - 11 1

Categorical Perception In normal perception, a perceiver can discriminate many more stimuli than they can absolutely identify. Discrimination performance is generally proportional to the absolute magnitude of the stimulus. This is Weber s Law. In categorical perception, a perceiver can only discriminate between stimuli to the extent that they have different labels. 10/5/11 Speech Perception - 11 2

Basic Tasks Listeners perform two tasks with a stimulus continuum (e.g. a /ba/ - /pa/ VOT series): Identification and discrimination. 1. Identification - Present tokens in random order. Collect labeling responses. 2. ABX or AXB Discrimination - Each trial has three tokens. A and B are always different and X is either A or B. A and B are 1 or 2 or 3 stimuli apart (e.g. separated by 20 msec VOT). 10/5/11 Speech Perception - 11 3

ABX Trials A B X ISI ISI On a trial, stimuli A and B are always different. X is identical to either A or B. The ISI is typically short (50 to 500 ms). A and B change from trial to trial. The listeners task is to indicate if X was identical to A or to B. 10/5/11 Speech Perception - 11 4

ABX Trials - 2 To examine discrimination along a continuum, on different trials, comparison is made between A and B at different points along the continuum. For example, if our continuum is a VOT continuum from a 0 ms VOT /ba/ to a 60 ms VOT /pa/, we might compare discrimination for stimuli that are 20 msec VOT apart along the continuum. Our comparisons would be 0 and 20, 10 and 30, 20 and 40, 30 and 50, and 40 and 60 ms VOT. 10/5/11 Speech Perception - 11 5

ABX Trials - 3 The order of stimuli within a trial is counterbalanced. For each comparison of A and B, there are four trials: ABA, ABB, BAA, BAB. To ensure that we have stable data for each participant, we typically collect 20 or more labeling responses for each stimulus and 20 or more discrimination responses for each stimulus pair (A and B). 10/5/11 Speech Perception - 11 6

Basic Results Listeners can discriminate between two different tokens only to the extent that they give them different labels. For example, 40 and 60 msec VOT tokens are both labeled /pa/ on virtually all trials and listeners are at chance in discrimination for 40 and 60 msec pairs. The 20 and 40 msec tokens are in different categories (/ ba/ and /pa/) and listeners are near perfect in discriminating between them. 10/5/11 Speech Perception - 11 7

Percent /b/ & Percent Correct 100 75 50 25 0 0 10 20 30 40 50 60 labeling ABX obtained ABX predicted VOT (msec) 10/5/11 Speech Perception - 11 8

Prediction Formula 0.5 + 0.5 x (p(l 1 ) - p(l 2 )) 2 For any two tokens L 1 and L 2, find the difference in labeling, square it, multiply by.5 and add to.5 (chance). This is the predicted correct discrimination performance. If L 1 is 1.0 and L 2 is 0.0, the predicted discrimination performance is 1.0 while if L 1 and L 2 are the same, discrimination is predicted to be 0.5 (chance). 10/5/11 Speech Perception - 11 9

Further Data Categorical perception does not happen for all speech contrasts. Stop-consonants varying in place or VOT show it. Vowels do not. Putting the target in context makes perception more categorical. Making the targets shorter (vowels) makes them more categorical. Making the time interval between tokens in the ABX triad longer makes the results more categorical. 10/5/11 Speech Perception - 11 10

Further Data - 2 With training, listeners perform much better at within category discriminations (see Samuel, 1977). Other discrimination tasks (4IAX) yield better performance (Pisoni, 1973). What would happen if the stimuli were made more natural by co-varying multiple acoustic cues? 10/5/11 Speech Perception - 11 11

Further Issues Labeling data collected with one token presented at a time are not going to be exactly the same as when listeners hear three tokens in sequence. For this reason, some researchers have used an AXB task. In labeling and discrimination, listeners hear the same AXB sequences. In labeling, they label the middle (X) token. In discrimination, they indicate whether it matched the first (A) or third (B) token. 10/5/11 Speech Perception - 11 12

Further Issues - 2 Does categorical perception reflect memory limitations for rapidly changing stimuli? Does it reflect discontinuities in perception? Put another way, could there be specialized neural coding that introduces a threshold into how complex information such as VOT is coded. What does it tell us about the nature of speech perception? 10/5/11 Speech Perception - 11 13

An Aside Before going further, we need to introduce the concepts of Signal Detection Theory so that they can be used to help understand categorical perception. Signal Detection Theory is designed to separate the performance of a perceiver in a task into two components: the underlying sensory/perceptual coding processes and response (decision) processes that map internal coding onto an external response. To see why this is important, consider the performance of two listeners in a hearing test. 10/5/11 Speech Perception - 11 14

Tone Threshold Data Listeners Laura and Beth. Are their perceptual systems really as different as they appear? Percent "yes" 100 80 60 40 20 0 Laura Beth -5 0 5 10 15 20 25 30 Intensity (db) 10/5/11 Speech Perception - 11 15

Signal Detection Theory (SDT) The Problem - There is no way to tell, using a simple threshold task, whether differences in listener data (e.g. Laura and Beth) reflect differences in perception or their interpretation of the instructions. The Theory of Signal Detectability - TSD attempts to separate those factors that reflect decision processes (such as instructions, motivation) from those that reflect underlying perception. 10/5/11 Speech Perception - 11 16

Terminology Decision Criterion (Bias) - The rule that a listener uses to map their sensory/perceptual evidence onto the response alternatives. Should be influenced by motivation, interpretation of instructions, payoffs for various outcomes. Sensitivity - The underlying sensory and perceptual processes that transform the stimuli in the world into internal mental states. Influenced by the stimulus and basic perceptual processes. 10/5/11 Speech Perception - 11 17

Trial Structure In some psychophysical methods, a stimulus is presented on every trial. In principle, the subject could say yes on every trial and be correct. The problem is that we have no objective way of knowing whether a yes or no response was correct. We need to add trials on which yes and no responses are objectively correct. Put more generally, we need different trials with distinct, objectively correct, responses. 10/5/11 Speech Perception - 11 18

Trials with Different Correct Answers Signal S 2 S 1 Response Yes No Hit Miss False Alarm Correct Rejection 10/5/11 Speech Perception - 11 19

Laura and Beth Return to our example of the hearing threshold. We present the -5 db tone and silence to our two listeners. For Laura, we get 90% hits and 40% false alarms. For Beth, we get 50% hits and 6% false alarms. Laura clearly has a higher hit percentage than Beth, but she also has a higher percentage of false alarms. How do we make precise comparisons between these two individuals? 10/5/11 Speech Perception - 11 20

An Experiment According to the basic idea of a threshold, Laura has a lower threshold than Beth. If they are interpreting the instructions differently, then we should be able to change their performance by modifying the instructions. Offer Beth a monetary incentive for Hits. The payoffs are: Hit Win $10 Miss Lose $1 False Alarm Lose $1 Correct Rejection Win $1 With these payoffs, Beth has 98% hits and 70% false alarms. 10/5/11 Speech Perception - 11 21

Part 2 Next, we change the payoffs for Beth to: Hit Win $1 Miss Lose $1 False Alarm Lose $10 Correct Rejection Win $10 Now, she has 21% hits and 1% false alarms. The first payoff matrix is called liberal since it pays Beth to say yes a lot. The second is called conservative (pays Beth to say no a lot). Beth has changed her performance across the three conditions. As the hits increased, so did false alarms. As hits decreased, so did false alarms. 10/5/11 Speech Perception - 11 22

100 ROC Curve Laura Beth-L Percentage Hits 75 50 25 0 Beth Beth-C 0 25 50 75 100 Percentage False Alarms 10/5/11 Speech Perception - 11 23

Interpretation In the plot of Hits versus False Alarms, the three points for Beth s data and Laura s data seem to lie on the same smooth curve. Does this imply that the only difference between Beth and Laura is in their interpretation of the instructions, motivation, etc? To answer, we need a quantitative theory. 10/5/11 Speech Perception - 11 24

The Theory On an S 1 trial (signal absent or noise trial), the stimulus information gives rise to an internal percept of a particular magnitude. On different presentations of this trial, the internal magnitude of the sensation or percept will not always be the same (noise in the processing system). The sensation will be distributed normally. On S 2 trials (signal present or signal trials), the stimulus information will also give rise to an internal percept. This will differ from S 1 in magnitude on the internal perceptual dimension. The magnitude will be variable from trial to trial and will be distributed normally. 10/5/11 Speech Perception - 11 25

Criterion Probability Density S 1 S 2 Internal Percept 10/5/11 Speech Perception - 11 26

Theory and Data The graph shows Laura s data. A Hit occurs on an S 2 trial (distribution to the right) when the she says yes (to the right of the decision rule). The area under the S 2 distribution to the right of the criterion is the probability of a hit (shown in blue). A false alarm occurs on an S 1 trial (distribution on the left) when the person says yes (to the right of the decision rule). The area under the S 1 distribution to the right of the criterion is the probability of a false alarm (in red). 10/5/11 Speech Perception - 11 27

Beth s Decision Rules" L" N C Loudness (perceived intensity) 10/5/11 Speech Perception - 11 28

Laura s Decision Rule Beth s Decision Rule" N S + N Sensitivity Loudness (perceived intensity) 10/5/11 Speech Perception - 11 29

Sensitivity Using Hit and False Alarm data, we can compute two measures of listener performance. Assume that the mean of the S 1 distribution is at zero. Then, using the false alarm rate, we can determine where the listener has placed her criterion (also called ß or beta) along the dimension of of the internal percept. Using hit and false alarm rates, we can determine the distance between the means of the S 1 and S 2 distributions. This is our measure of sensitivity and is called d (d-prime). Beth and Laura have different criteria but the same sensitivity. 10/5/11 Speech Perception - 11 30

Summary of TSD Using TSD, we can assess a listener s performance in different conditions and see how both their decision rule and their sensitivity contribute to performance in different conditions. Perceivers can differ in sensitivity, criterion, or both. We can also compare a single individuals performance in different conditions to see if differences in performance are due to criterion changes, sensitivity differences or both. 10/5/11 Speech Perception - 11 31

A Memory Explanation One explanation for categorical perception is based on listeners having two memory codes: auditory and phonetic (Fujasaki & Kawashima). The auditory code retains stimulus details. It is short lived and easily overwritten by new sounds. The phonetic code (recode the auditory input into its language based segmental representation) is stable and easily held in working memory. It does not preserve acoustic detail. 10/5/11 Speech Perception - 11 32

Data Pisoni (1973, P&P, 13, 253-260; 1975, M&C, 3, 7-18) has reported the results of a series of studies investigating categorical perception. Lengthening the interval (ISI) in ABX triads makes perception more categorical. Going to a discrimination task (4IAX) that is less memory intensive than ABX makes perception less categorical. Transient, dynamic stimuli (e.g. stops varying in VOT or place of articulation) tend to be categorical. Steady-state vowels are not. Results are consistent with memory models. 10/5/11 Speech Perception - 11 33

A Perceptual Discontinuity Pastore (see Miller et al., 1976, JASA, 60, 410-417; Pastore et al., 1990, P&P, 48, 151-156) has explored an explanation based on psychophysical discontinuities. Basically, the auditory system has coding mechanisms for complex ( pattern ) properties of sound. For example, a set of neural mechanisms that detect synchrony versus asynchrony in information onset. This particular mechanism could underpin the coding of VOT. Stimuli that are coded by these mechanisms exhibit discontinuities in perception. 10/5/11 Speech Perception - 11 34

Data Pastore (see above), Pisoni (1977, JASA, 61, 1352-1361) have reported data showing categorical perception for nonspeech sounds that have properties similar to VOT. At first blush, the memory explanation will not work for these nonspeech data. However, if we change the two forms of memory to be auditory and abstract and note that the abstract does not preserve stimulus detail, then we have a memory model that will work for nonspeech. 10/5/11 Speech Perception - 11 35

The Role of TSD TSD can be used to investigate the perception of speech and explanations for categorical perception. Basically, both the memory and perceptual discontinuity explanations predict that sensitivity within a category will be low and between categories will be large. That is, the influence of auditory memory or a perceptual discontinuity should show up in changes in sensitivity. When TSD has been used to investigate phonetic continua, this is what has been found (see, e.g. Sawusch et al., 1980, P&P, 27, 421-434). 10/5/11 Speech Perception - 11 36

Summary The predictions of memory and perceptual discontinuity models are very similar and no resolution to this debate seems satisfactory at this time. For a wide ranging summary of views and issues, see S. Harnad (Ed.), 1987, Categorical perception, Cambridge Univ Press. 10/5/11 Speech Perception - 11 37