Temporal Location of Perceptual Cues for Cantonese Tone Identification

Similar documents
2/25/2013. Context Effect on Suprasegmental Cues. Supresegmental Cues. Pitch Contour Identification (PCI) Context Effect with Cochlear Implants

Normalizing talker variation in the perception of Cantonese level tones: Impact of speech and nonspeech contexts

ACOUSTIC ANALYSIS AND PERCEPTION OF CANTONESE VOWELS PRODUCED BY PROFOUNDLY HEARING IMPAIRED ADOLESCENTS

Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear

PERCEPTION OF UNATTENDED SPEECH. University of Sussex Falmer, Brighton, BN1 9QG, UK

Language Speech. Speech is the preferred modality for language.

Tone perception of Cantonese-speaking prelingually hearingimpaired children with cochlear implants

ACOUSTIC AND PERCEPTUAL PROPERTIES OF ENGLISH FRICATIVES

Optimal Filter Perception of Speech Sounds: Implications to Hearing Aid Fitting through Verbotonal Rehabilitation

Title: Repetition enhances the musicality of speech and tone stimuli to similar degrees

SPEECH ANALYSIS 3/3 PERCEPTION

The Influence of Linguistic Experience on the Cognitive Processing of Pitch in Speech and Nonspeech Sounds

Perception of American English can and can t by Japanese professional interpreters* 1

Although considerable work has been conducted on the speech

Differences of tone realization between younger and older speakers of. Nanjing dialect. Si Chen and Caroline Wiltshire

Consonant Perception test

Twenty subjects (11 females) participated in this study. None of the subjects had

Study on Effect of Voice Analysis Applying on Tone Training Software for Hearing Impaired People

Changes in the Role of Intensity as a Cue for Fricative Categorisation

Acoustic cues to tonal contrasts in Mandarin: Implications for cochlear implants

Does Wernicke's Aphasia necessitate pure word deafness? Or the other way around? Or can they be independent? Or is that completely uncertain yet?

Perception and Production of Mandarin Tones in Prelingually Deaf Children with Cochlear Implants

Communication with low-cost hearing protectors: hear, see and believe

Janet Doyle* Lena L. N. Wongt

Measuring Auditory Performance Of Pediatric Cochlear Implant Users: What Can Be Learned for Children Who Use Hearing Instruments?

Proceedings of Meetings on Acoustics

Role of F0 differences in source segregation

Assessing Hearing and Speech Recognition

International Forensic Science & Forensic Medicine Conference Naif Arab University for Security Sciences Riyadh Saudi Arabia

MULTI-CHANNEL COMMUNICATION

THE ROLE OF VISUAL SPEECH CUES IN THE AUDITORY PERCEPTION OF SYNTHETIC STIMULI BY CHILDREN USING A COCHLEAR IMPLANT AND CHILDREN WITH NORMAL HEARING

Relationship Between Tone Perception and Production in Prelingually Deafened Children With Cochlear Implants

Computational Perception /785. Auditory Scene Analysis

Auditory-Visual Integration of Sine-Wave Speech. A Senior Honors Thesis

Categorical Perception

Effect of tones on voice onset time (VOT) in Cantonese aspirated stops. Title. Lam, Chung-ling; 林松齡

11 Music and Speech Perception

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED

Gick et al.: JASA Express Letters DOI: / Published Online 17 March 2008

5. Which word refers to making

Perception of Talker Age by Young Adults with High-Functioning Autism. Cynthia G. Clopper, Kristin L. Rohrbeck & Laura Wagner

Mandarin tone recognition in cochlear-implant subjects q

Research Article The Relative Weight of Temporal Envelope Cues in Different Frequency Regions for Mandarin Sentence Recognition

Effect of Consonant Duration Modifications on Speech Perception in Noise-II

Speech Perception in Mandarin- and Cantonese- Speaking Children with Cochlear Implants: A Systematic Review

Speech Spectra and Spectrograms

Cue-specific effects of categorization training on the relative weighting of acoustic cues to consonant voicing in English

group by pitch: similar frequencies tend to be grouped together - attributed to a common source.

Jitter, Shimmer, and Noise in Pathological Voice Quality Perception

Assessment of auditory temporal-order thresholds A comparison of different measurement procedures and the influences of age and gender

Bimodal bilingualism: focus on hearing signers

Validation of Mandarin Speech Audiometry Materials in Singapore. Co-investigator: Soh Wanxian (Kimberly) Principal Investigator: Dr.

Use of Auditory Techniques Checklists As Formative Tools: from Practicum to Student Teaching

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening

SPEECHREADING OF WORDS AND SENTENCES BY NORMALLY HEARING AND HEARING IMPAIRED CHINESE SUBJECTS: THE ENHANCEMENT EFFECTS OF COMPOUND SPEECH PATTERNS

EFFECTS OF RHYTHM ON THE PERCEPTION OF URGENCY AND IRRITATION IN 1 AUDITORY SIGNALS. Utrecht University

THE INFLUENCE OF GROUPING & TEMPO

Speech, Hearing and Language: work in progress. Volume 11

Interjudge Reliability in the Measurement of Pitch Matching. A Senior Honors Thesis

Topics in Linguistic Theory: Laboratory Phonology Spring 2007

Lexical Tone Effects on Voice Onset Time in Cantonese

Evaluating the Clinical Effectiveness of EPG. in the Assessment and Diagnosis of Children with Intractable Speech Disorders

Best Practice Protocols

Feedback and feedforward control in apraxia of speech: Noise masking effects on fricative production.

Teaching Communication to Individuals with Autism. Laura Ferguson, M.Ed., BCBA

Outline.! Neural representation of speech sounds. " Basic intro " Sounds and categories " How do we perceive sounds? " Is speech sounds special?

Intelligibility of clear speech at normal rates for older adults with hearing loss

Effects of noise and filtering on the intelligibility of speech produced during simultaneous communication

City, University of London Institutional Repository

PROCEDURES TO ASSESS FOCUS. Phonology Project Technical Report No. 5. Joan Kwiatkowski. Lawrence D. Shriberg. September, 1997

TONE OCCURRENCES IN MBUBE

Auditory Scene Analysis. Dr. Maria Chait, UCL Ear Institute

Hearing the Universal Language: Music and Cochlear Implants

Speech Cue Weighting in Fricative Consonant Perception in Hearing Impaired Children

HCS 7367 Speech Perception

Education. Professional Experience and Employment History

Voice Pitch Control Using a Two-Dimensional Tactile Display

VERBAL OVERSHADOWING IN SPEAKER IDENTIFICATION: FACE AND VOICE

Lexical tone recognition in noise in normal-hearing children and prelingually deafened children with cochlear implants

Proceedings of Meetings on Acoustics

Anumber of studies have shown that the perception of speech develops. by Normal-Hearing and Hearing- Impaired Children and Adults

Suprasegmental Characteristics of Speech Produced during Simultaneous Communication by Inexperienced Signers

CURRICULUM VITAE G.H. Yeni-Komshian

! Introduction:! ! Prosodic abilities!! Prosody and Autism! !! Developmental profile of prosodic abilities for Portuguese speakers!

INTRODUCTION J. Acoust. Soc. Am. 103 (2), February /98/103(2)/1080/5/$ Acoustical Society of America 1080

The Deaf Brain. Bencie Woll Deafness Cognition and Language Research Centre

Curriculum Vitae of Chang Liu

Results. Dr.Manal El-Banna: Phoniatrics Prof.Dr.Osama Sobhi: Audiology. Alexandria University, Faculty of Medicine, ENT Department

Speech, Hearing and Language: work in progress. Volume 11

Tone Production in Mandarin-speaking Children with Cochlear Implants: A Preliminary Study

Problems decoding audio? Sounds like autism

Learning Process. Auditory Training for Speech and Language Development. Auditory Training. Auditory Perceptual Abilities.

ID# Exam 2 PS 325, Fall 2003

Toward an objective measure for a stream segregation task

Speech and Intelligibility Characteristics in Fragile X and Down Syndromes

SUPPLEMENTARY INFORMATION. Table 1 Patient characteristics Preoperative. language testing

The Perception of Affective Prosody in Children with Autism Spectrum Disorders and Typical Peers

PITCH is one of the most important information-bearing

A Senior Honors Thesis. Brandie Andrews

Speech Reading Training and Audio-Visual Integration in a Child with Autism Spectrum Disorder

Transcription:

Temporal Location of Perceptual Cues for Cantonese Tone Identification Zoe Wai-Man Lam, Kathleen Currie Hall and Douglas Pulleyblank Department of Linguistics University of British Columbia 1

Outline of the presentation 1. Background 2. Research question 3. Methodology 4. Results 5. Discussion 6. Conclusion 2

F0 variation in language Fundamental frequency (F0): the frequency of vocal fold vibration (in Hz) F0 variation takes place within a temporal window. Lexical tone: temporal window = duration of a syllable Intonation: temporal window = duration of an utterance 3

Acoustic vs. perceptual F0 is an acoustic term: the physical properties of the signal Pitch is a perceptual term: the hearer s perception of the signal Tone is a linguistic term: phonological categories to distinguish words (Yip 2002) Example: Yoruba has 3 level tones (H M L) In a disyllabic noun σ 1 σ 2 : F0 of σ 2 Perceived tone of σ 2 Falling Low tone Flat Mid tone (Hombert 1976) Acoustic cues perceptual cues 4

Phonemic inventory of Cantonese tones 1 2 3 5 6 4 6 lexical tones Tone 1 Tone 2 Tone 3 Tone 4 Tone 5 Tone 6 High level High rising Mid level Low falling* Low rising Low level 3 level tones (1, 3, 6) 3 contour tones (2, 4, 5) *Slope of Tone 4 is similar to that of a level tone phonetically 5

Previous studies on tonal production Khouw & Ciocca (2007) Focus on syllables produced in isolation Goal: investigate which parts of the vocalic segment provide F0 information that is most strongly related to the identification of Cantonese tones F0 measurements are made for eight consecutive sections of the whole vocalic segment for each syllable. Discriminant analysis shows that the most important correlates of tone identity are in the later part of the vocalic segment, specifically the 6th and 7th sections. 6 7 6

Previous studies on tonal production Wong (2007) Focuses on tonal coarticulatory effects in disyllabic sequences in a carrier sentence lau lau as target syllables Carryover effects are so strong that F0 transition from the tone of σ 1 to σ 2 takes up 50% of σ 2, resulting in a great magnitude of F0 variation across different tonal contexts 7

Research question Do native Cantonese speakers rely particularly on the perceptual cues from the latter portion of the tone for identification? Our hypothesis 8

Stimulus preparation 4 naïve native speakers: 2 male, 2 female; age range 19-28 Carrier sentence jau5 go3 giu3 σ 1 σ 2 ge3 je5 havecl call of thing There is something called. σ 1 σ 2 is any disyllabic combination of the following: si se fu 1 high level 55 詩 Poetry 些 Some 呼 To exhale 2 high rising 25 史 History 寫 To write 苦 Bitter 3 mid level 33 試 To try* 瀉 Diarrhea 富 Wealth 4 low falling 21 時 Time 蛇 Snake 符 To match 5 low rising 23 市 Market* 社 Society 婦 Woman/wife 6 low level 22 是 Right 射 To shoot 負 To load 1152 sentence tokens, divided into 8 groups of stimuli 9

Four stimulus types Control stimuli Type A: Not manipulated jau5 go3 giu3 σ 1 σ 2 ge3 je5 Type B: σ 2 not affected, but there is some disruption elsewhere jau5 go3 giu3 σ 1 σ 2 ge3 je5 10

Four stimulus types Type C: 1 st half of the tone ofσ 2 zeroed out jau5 go3 giu3 σ 1 σ 2 ge3 je5 Type D: 2 nd half of the tone ofσ 2 zeroed out jau5 go3 giu3 σ 1 σ 2 ge3 je5 Zeroing out done by a Praat script Boundaries of the vocalic portion of the syllable were marked by the first author manually 11

Four predictions Type A Type B Type C Type D σ 2 Based on the hypothesis that the second half of the tone contains the crucial perceptual cues for tone identification, we predict that: 1. Type A = Type B 2. Type A = Type C 3. Type A > Type D 4. Type C > Type D 12

Perception study: Procedures Participants were presented with one sentence on the screen in each trial. They listened to an audio stimulus at the same time. They were always asked what σ 2 was. They were instructed to respond by pushing 1,2,3,4,5 or 6 on the keyboard. 13

Perception study: Procedures Practice set 12 trials, using the syllable jau, produced by the first author All stimuli in the practice set were not manipulated (i.e. belong to Type A). Feedback was provided to the participants during the practice period. Data of participants who failed to get at least 9 correct responses were excluded from analysis. Actual experiment 144 trials per participant, using si, se or fu syllables Randomized, not blocked according to stimulus type Participants were told that it s normal if some parts of the recording are not clear. They just needed to try their best to choose an answer. Feedback was NOT provided. 14

Perception study: The participants 63 participants from the UBC community participated in the study, who were either paid or given course credit. The data of 39 participants were excluded from the analysis. Not a speaker of the Hong Kong variety of Cantonese (e.g. Malaysia) Got fewer than 9 correct answers out of 12 Qs in the practice set The data of 24 participants were included in the analysis. Got at least 9 correct answers out of 12 Qs in the practice set Self-rated 6 or 7 out of 7 for their Cantonese proficiency Able to read Traditional Chinese characters, had spent at least 14 years in Hong Kong Subject 107 and 501 have never lived in Hong Kong, but their parents are from Hong Kong; results similar to other participants Age: 18-36 15

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Results (Version 1) Are these predictions correct? 1. Type A = Type B Yes 2. Type A = Type C No 3. Type A > Type D Yes 4. Type C > Type D No 16

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Results (Version 2) Are these predictions correct? 1. Type A = Type B Yes 2. Type A = Type C 3. Type A > Type D Yes only for contour tones but not level tones 4. Type C > Type D 17

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Type A Prediction = Type C 18

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Level tones: Type A (control) vs Type C (1 st half removed) Prediction: Type A = Type C 19

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Contour tones: Type A (control) vs Type C (1 st half removed) Prediction: Type A = Type C 20

Type A Prediction > Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Type D 21

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Level tones: Type A (control) vs Type D (2 nd half removed) Prediction: Type A > Type D 22

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Contour tones: Type A (control) vs Type D (2nd half removed) Prediction: Type A > Type D 23

Type C Prediction > Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Type D 24

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Level tones: Type C (1 st half removed) vs Type D (2 nd half removed) Prediction: Type C > Type D 25

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Contour tones: Type C (1 st half removed) vs Type D (2 nd half removed) Prediction: Type C > Type D 26

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Data analysis A logistic mixed effects model with percentage correct as the dependent variable was fit with fixed effects for Stimulus Type (A, B, C, D), Tone Type (Level, Contour) and their interactions. The random effect structure was as maximally specified as possible with random effects for Subject and Talker, and by-subject random slopes for the interaction of Stimulus Type and Tone Type. 27

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half 28

Type C = cannot hear the 1 st half Type D = cannot hear the 2 nd half Discussion Listener s bias When hearing Type C à tend to choose a contour tone When hearing Type D à tend to choose a level tone Type A Type B Type C Type D σ 2 p < 0.001 p = 0.03 29

Discussion Tone 4 patterns with other contour tones despite being phonetically similar to level tones. 1 2 3 5 6 4 30

Conclusion Listeners rely on perceptual cues from different parts of the syllable, depending on the tone type. This half is needed for the identification of level tones. This half is needed for the identification of contour tones. 31

Acknowledgements This project was supported by a research grant from Social Sciences and Humanities Research Council of Canada (F10-04735). We would like to thank Molly Babel and participants of the research seminar Attention and Salience at the University of British Columbia in 2014-2015, who offered invaluable comments throughout the process. All errors are ours. 32