Shaheen N. Awan 1, Nancy Pearl Solomon 2, Leah B. Helou 3, & Alexander Stojadinovic 2

Size: px
Start display at page:

Download "Shaheen N. Awan 1, Nancy Pearl Solomon 2, Leah B. Helou 3, & Alexander Stojadinovic 2"

Transcription

1 Shaheen N. Awan 1, Nancy Pearl Solomon 2, Leah B. Helou 3, & Alexander Stojadinovic 2 1 Bloomsburg University of Pennsylvania; 2 Walter Reed National Military Medical Center; 3 University of Pittsburgh

2 Dr. S. N. Awan is a consultant to KayPentax (Montvale, NJ) for the development of commercial computer software including cepstral analysis of continuous speech algorithms. KayPentax licenses the algorithms that form the basis of the Analysis of Dysphonia in Speech & Voice (ADSV) program from Dr. Awan. The views expressed in this presentation are those of the authors and do not reflect official policy of the United States Army, Department of Defense, or US Government.

3 Time-based perturbation measures have at least two key limitations: 1. Difficulty Analyzing More Severely Dysphonic Vowel Samples 2. Lack of Validity of Traditional Perturbation Measures In the Analysis of Continuous Speech:

4 In contrast to traditional perturbation analyses, spectralbased acoustic measures have shown the ability to characterize the voice signal by extracting characteristics such as the fundamental frequency (F 0 ) and the relative amplitude of harmonics vs. noise (de Krom, 1993) without the necessity of identifying cycle boundaries. Spectral-based methods analyze frames of data rather than cycles. *** Spectral/cepstral measures are able to provide valid and reliable correlates of vocal quality in continuous speech contexts.***

5 The cepstrum (a Fourier transform of the power spectrum of the voice signal) can graphically display the extent to which the dominant rahmonic (an anagram of harmonic a cepstral peak often associated with the vocal fundamental frequency) is individualized and emerges out of the background noise level. Measures of the relative amplitude of the cepstral peak in relation to extraneous cepstral components have been reported to provide an effective method for quantifying the severity of the dysphonic voice (Awan, Roy, & Dromey, 2009; Awan & Roy, 2005; Awan & Roy, 2006; Heman-Ackah, Michael, & Goding, 2002; Wolfe & martin, 1997; Hillenbrand, Cleveland, and Erickson,1994).

6 Examples of the log spectrum of a voiced signal and subsequent cepstrum (from Papamichalis, 1987)

7 Typical cepstrum for a normal male sustained vowel production. The cepstral peak is circled the cepstral peak corresponds to the fundamental frequency ( Hz) and a quefrency (x-axis value in time) of approx ms.

8 Analysis of Dysphonia in Speech and Voice (ADSV) from KayPENTAX is the first commercial program of its kind, allowing for voice quality assessment of sustained vowel and continuous speech samples in normal and mild-to-severely dysphonic voices. This program provides several key spectral and cepstral measures of the voice sample along with a graphic display of how these values change over time. The program also incorporates the Cepstral Spectral Index of Dysphonia (CSID) a multifactorial estimate of vocal severity that correlates with the VAS (%) severity scale used in the CAPE-V.

9 A E B C F D ADSV main screen - spectral/cepstral analyses of a normal female voice sample ("We were away a year ago"). Analysis windows include: (A) Sound spectrogram; (B) sound wave; (C) Low/High spectral ratio (L/H Ratio) over time; (D) Cepstral peak prominence (CPP) over time; (E) focused spectral analysis per data frame; (F) focused cepstral analysis per data frame.

10

11

12

13 In 2010, Awan, Roy, Jetté, Meltzner, & Hillman reported that an algorithm incorporating measures from cepstral and spectral analyses was able to produce estimates of dysphonia severity that strongly correlated with auditory-perceptual judgments of dysphonia severity. Using measures of the cepstral peak prominence (CPP), a ratio of low vs. high frequency spectral energy (L/H ratio), and the respective standard deviations of these measures, Awan et al. (2010) reported: R = 0.81 between acoustic and auditory-perceptual estimates of dysphonia severity in CAPE-V sentences, R = 0.96 between acoustic and auditory-perceptual estimates of dysphonia severity in sustained vowel productions.

14 All CAPE-V Sentences Combined Sustained Vowel Dysphonia Severity (100 pt. VAS) Dysphonia Severity (100 pt. VAS) Normal Mild Moderate Severe 0 Normal Mild Moderate Severe Group (n=128; 32 subjects per group) Group (n=32; 8 subjects per group) Estimated Rating Listener Rating Estimated Rating Listener Rating

15 "How hard did he hit him?" "We were away a year ago." Dysphonia Severity (100 pt. VAS) Dysphonia Severity (100 pt. VAS) Normal Mild Moderate Severe 0 Normal Mild Moderate Severe Group (n=32; 8 subjects per group) Group (n=32; 8 subjects per group) Estimated Rating Listener Rating Estimated Rating Listener Rating "We eat eggs at Easter." "Peter will keep at the peak." Dysphonia Severity (100 pt. VAS) Dysphonia Severity (100 pt. VAS) Normal Mild Moderate Severe 0 Normal Mild Moderate Severe Group (n=32; 8 subjects per group) Group (n=32; 8 subjects per group) Estimated Rating Listener Rating Estimated Rating Listener Rating

16 Because any research finding (strong or weak) may simply be a reflection of the particular sample of subjects being studied, it is essential that the results of any single study be replicated with alternative samples. In this way, the external validity (i.e., the ability to reproduce results with alternative subjects and in settings outside of the original study) of a particular research finding can be established. The goal of the present study was to assess the external validity of the acoustic algorithm and analysis methods reported in Awan et al. (2010) with a completely new and independent set of normal and disordered CAPE-V samples and associated listener judgments.

17 Samples were obtained from previously recorded voices of patients scheduled for partial or total thyroidectomy. Perceptual and acoustic analyses were conducted for subjects pre- and postthyroidectomy. CAPE-V sentences and sustained vowel samples were elicited from each subject at comfortable pitch and loudness levels.

18 3 experienced SLPs rated CAPE-V samples Custom automated program Blinded, randomized order, blocked for subject Over headphones in sound-treated booth A single rating for vowel and sentences combined. Separate sessions for male and female samples Accompanied by an anchor for moderate severity Median ratings (% of 100-mm line, labeled for severity) Severity Roughness Breathiness Strain

19

20 2-s center of each vowel /ɑ/ trimmed for onset & offset CAPE-V sentences soft glottal attacks and voiceless to voiced transitions ( How hard did he hit him? ), the presence of possible voiced stoppages or spasms, and the ability to maintain consistent voicing ( We were away a year ago ), the presence of hard glottal attacks ( We eat eggs every Easter ), the ability to transition easily between voiceless stop-plosive production and vowel production ( Peter will keep at the peak ). Measures of the cepstral peak prominence (CPP) and the ratio of low vs. high frequency spectral energy (L/H Ratio), as well as the standard deviations for the aforementioned measures were obtained.

21 In addition to the aforementioned acoustic measures, the ADSV program produces an estimate of dysphonia severity called the CSID (Cepstral/Spectral Index of Dysphonia). Separate CSID values are provided for sustained vowels vs. individual speech samples. For the purposes of this study, the CSID values for the vowel and each analyzed sentence were averaged to provide a single acoustic estimate of dysphonia.

22 Samples of 40 voices (20 normal [mean CAPE-V severity = 8.52%, SD = 4.90] and 20 dysphonic samples [mean CAPE-V severity = 30.84%, SD = 13.69]) were selected from the initial corpus of data: These voices reflected a relatively wide range of perceived dysphonia severity (CAPE-V Range = 1 to 65). Allowed focus on the ability to discriminate normal/typical voices from those judged to have mild-to-moderate degrees of dysphonia severity. Equal distribution of males and females.

23 Statistical evaluation of the CSID values (i.e., acoustically estimated dysphonia severity) vs. auditory-perceptual dysphonia severities (i.e., CAPE-V ratings) revealed the following: No significant difference between CSID vs. auditoryperceptual dysphonia severity in normal subjects. No significant difference between CSID vs. auditoryperceptual dysphonia severity in disordered subjects. Significant differences in dysphonia severity between normal vs. disordered subjects whether estimated via acoustic analyses (CSID; t (38) = -5.01, p <.001) or via auditory-perceptual judgment (t (38) = -6.87, p <.001).

24 Subjects Mean Cepstral/Spectral Index of Dysphonia (CSID) Mean Auditory- Perceptual Rating (CAPE-V) 20 Normal Voices 5.72 (SD = 9.97) 8.52 (SD = 4.90) 20 Disordered Voices (SD = 17.71) (SD = 13.69)

25 Across all 40 subjects, a strong and significant correlation between CSID values and CAPE-V auditory-perceptual ratings of dysphonia severity was observed: (r = 0.85; r 2 = 0.73; p <.001).

26

27 CAPE-V Severity Rating CSID: Vowel /ɑ/ CSID: Easy Onset Sentence CSID: All Voiced Sentence CSID: Glottal Sentence CSID: Plosive Sentence r = 0.67 r = 0.79 r = 0.82 r = 0.78 r = 0.78 All Pearson s r correlations significant at p <.01.

28 The results of this study indicate that the acoustic algorithm reported by Awan et al. (2010) and incorporated in the CSID is externally valid and an effective correlate of perceived dysphonia CAPE-V severity. Results are actually somewhat stronger than observed in Awan et al. (2010) for the multifactor CSID measure (sample characteristics differed in that the voices in the current study tended to be in the normal to moderate range of severity) - as in Awan et al. (2010), the all voiced sentence was observed to provide the best individual correlate of dysphonia severity.

29 In this study, spectral/cepstral measures from sentences correlated best with perceived dysphonia severity in contrast to Awan et al s (2010) sample in which measures from vowels provided the strongest correlate. Dysphonia may be more prominent in vowels vs. sentences, or vice versa, in different cases therefore, both samples are necessary.

30 The CSID provides an objective, multivariate measure of dysphonia severity that is effective in both sustained vowel and continuous speech contexts. The objective, automatic estimation of dysphonia severity in continuous speech and vowel samples is a potentially valuable and easily communicated method of categorizing voice and voice change without requiring multiple trained judges.

31 Awan, S. N., & Roy, N. (2005). Acoustic prediction of voice type in women with functional dysphonia. Journal of Voice, 19(2), Awan, S. N., & Roy, N. (2006). Toward the development of an objective index of dysphonia severity: a four-factor acoustic model. Clinical Linguistics & Phonetics, 20(1), Awan, S. N., Roy, N., & Dromey, C. (2009). Estimating dysphonia severity in continuous speech: application of a multi-parameter spectral/cepstral model. Clinical Linguistics & Phonetics, 23(11), Awan, S. N., Roy, N., Jetté, M. E., Meltzner, G. S., & Hillman, R. E. (2010). Quantifying dysphonia severity using a spectral/cepstral-based acoustic index: Comparisons with auditory-perceptual judgments from the CAPE-V. Clinical Linguistics & Phonetics, 24(9), de Krom, G. (1993). A cepstrum-based technique for determining a harmonicsto-noise ratio in speech signals. Journal of Speech and Hearing Research, 36(2), Heman-Ackah, Y. D., Michael, D. D., & Goding, G. S., Jr. (2002). The relationship between cepstral peak prominence and selected parameters of dysphonia. Journal of Voice, 16(1), Hillenbrand, J., Cleveland, R. A., & Erickson, R. L. (1994). Acoustic correlates of breathy vocal quality. Journal of Speech and Hearing Research, 37(4), Papamichalis, P.E. (1987). Practical Approaches to Speech Coding. Englewood Cliffs, NJ: Prentice-Hall. Wolfe, V., & Martin, D. (1997). Acoustic correlates of dysphonia: type and severity. Journal of Communication Disorders, 30(5),

Jitter, Shimmer, and Noise in Pathological Voice Quality Perception

Jitter, Shimmer, and Noise in Pathological Voice Quality Perception ISCA Archive VOQUAL'03, Geneva, August 27-29, 2003 Jitter, Shimmer, and Noise in Pathological Voice Quality Perception Jody Kreiman and Bruce R. Gerratt Division of Head and Neck Surgery, School of Medicine

More information

Differential specificity of acoustic measures to listener perception of voice quality

Differential specificity of acoustic measures to listener perception of voice quality Boston University OpenBU Theses & Dissertations http://open.bu.edu Boston University Theses & Dissertations 2015 Differential specificity of acoustic measures to listener perception of voice quality Calabrese,

More information

Visi-Pitch IV is the latest version of the most widely

Visi-Pitch IV is the latest version of the most widely APPLICATIONS Voice Disorders Motor Speech Disorders Voice Typing Fluency Selected Articulation Training Hearing-Impaired Speech Professional Voice Accent Reduction and Second Language Learning Importance

More information

Overview. Acoustics of Speech and Hearing. Source-Filter Model. Source-Filter Model. Turbulence Take 2. Turbulence

Overview. Acoustics of Speech and Hearing. Source-Filter Model. Source-Filter Model. Turbulence Take 2. Turbulence Overview Acoustics of Speech and Hearing Lecture 2-4 Fricatives Source-filter model reminder Sources of turbulence Shaping of source spectrum by vocal tract Acoustic-phonetic characteristics of English

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Speech Communication Session 4aSCb: Voice and F0 Across Tasks (Poster

More information

Emerging Scientist: Challenges to CAPE-V as a Standard

Emerging Scientist: Challenges to CAPE-V as a Standard Perspectives of the ASHA Special Interest Groups SIG 3, Vol. 1(Part 2), 2016, Copyright 2016 American Speech-Language-Hearing Association Emerging Scientist: Challenges to CAPE-V as a Standard Kathleen

More information

Speech (Sound) Processing

Speech (Sound) Processing 7 Speech (Sound) Processing Acoustic Human communication is achieved when thought is transformed through language into speech. The sounds of speech are initiated by activity in the central nervous system,

More information

Hearing in the Environment

Hearing in the Environment 10 Hearing in the Environment Click Chapter to edit 10 Master Hearing title in the style Environment Sound Localization Complex Sounds Auditory Scene Analysis Continuity and Restoration Effects Auditory

More information

A COMPARISON OF SUSTAINED VOWEL AND CONNECTED SPEECH PRODUCTION IN HYPOFUNCTIONAL AND NORMAL VOICES. Emily A. Lambert

A COMPARISON OF SUSTAINED VOWEL AND CONNECTED SPEECH PRODUCTION IN HYPOFUNCTIONAL AND NORMAL VOICES. Emily A. Lambert A COMPARISON OF SUSTAINED VOWEL AND CONNECTED SPEECH PRODUCTION IN HYPOFUNCTIONAL AND NORMAL VOICES by Emily A. Lambert Bachelor of Science in Communication Sciences & Disorders, 2008 University of Texas

More information

Polly Lau Voice Research Lab, Department of Speech and Hearing Sciences, The University of Hong Kong

Polly Lau Voice Research Lab, Department of Speech and Hearing Sciences, The University of Hong Kong Perception of synthesized voice quality in connected speech by Cantonese speakers Edwin M-L. Yiu a) Voice Research Laboratory, Department of Speech and Hearing Sciences, The University of Hong Kong, 5/F

More information

METHOD. The current study was aimed at investigating the prevalence and nature of voice

METHOD. The current study was aimed at investigating the prevalence and nature of voice CHAPTER - 3 METHOD The current study was aimed at investigating the prevalence and nature of voice problems in call center operators with the following objectives: To determine the prevalence of voice

More information

Auditory scene analysis in humans: Implications for computational implementations.

Auditory scene analysis in humans: Implications for computational implementations. Auditory scene analysis in humans: Implications for computational implementations. Albert S. Bregman McGill University Introduction. The scene analysis problem. Two dimensions of grouping. Recognition

More information

Effects of Subglottic Stenosis and Cricotracheal Resection on Voice Production in Women

Effects of Subglottic Stenosis and Cricotracheal Resection on Voice Production in Women Brigham Young University BYU ScholarsArchive All Theses and Dissertations 2016-03-01 Effects of Subglottic Stenosis and Cricotracheal Resection on Voice Production in Women Lisa Marie Mattei Brigham Young

More information

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED Francisco J. Fraga, Alan M. Marotta National Institute of Telecommunications, Santa Rita do Sapucaí - MG, Brazil Abstract A considerable

More information

Critical Review: Are laryngeal manual therapies effective in improving voice outcomes of patients with muscle tension dysphonia?

Critical Review: Are laryngeal manual therapies effective in improving voice outcomes of patients with muscle tension dysphonia? Critical Review: Are laryngeal manual therapies effective in improving voice outcomes of patients with muscle tension dysphonia? María López M.Cl.Sc (SLP) Candidate University of Western Ontario: School

More information

Voice Detection using Speech Energy Maximization and Silence Feature Normalization

Voice Detection using Speech Energy Maximization and Silence Feature Normalization , pp.25-29 http://dx.doi.org/10.14257/astl.2014.49.06 Voice Detection using Speech Energy Maximization and Silence Feature Normalization In-Sung Han 1 and Chan-Shik Ahn 2 1 Dept. of The 2nd R&D Institute,

More information

ACOUSTIC AND PERCEPTUAL PROPERTIES OF ENGLISH FRICATIVES

ACOUSTIC AND PERCEPTUAL PROPERTIES OF ENGLISH FRICATIVES ISCA Archive ACOUSTIC AND PERCEPTUAL PROPERTIES OF ENGLISH FRICATIVES Allard Jongman 1, Yue Wang 2, and Joan Sereno 1 1 Linguistics Department, University of Kansas, Lawrence, KS 66045 U.S.A. 2 Department

More information

be investigated online at the following web site: Synthesis of Pathological Voices Using the Klatt Synthesiser. Speech

be investigated online at the following web site: Synthesis of Pathological Voices Using the Klatt Synthesiser. Speech References [1] Antonanzas, N. The inverse filter program developed by Norma Antonanzas can be investigated online at the following web site: www.surgery.medsch.ucla.edu/glottalaffairs/software_of_the_boga.htm

More information

Topics in Linguistic Theory: Laboratory Phonology Spring 2007

Topics in Linguistic Theory: Laboratory Phonology Spring 2007 MIT OpenCourseWare http://ocw.mit.edu 24.91 Topics in Linguistic Theory: Laboratory Phonology Spring 27 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Interjudge Reliability in the Measurement of Pitch Matching. A Senior Honors Thesis

Interjudge Reliability in the Measurement of Pitch Matching. A Senior Honors Thesis Interjudge Reliability in the Measurement of Pitch Matching A Senior Honors Thesis Presented in partial fulfillment of the requirements for graduation with distinction in Speech and Hearing Science in

More information

SoundRecover2 the first adaptive frequency compression algorithm More audibility of high frequency sounds

SoundRecover2 the first adaptive frequency compression algorithm More audibility of high frequency sounds Phonak Insight April 2016 SoundRecover2 the first adaptive frequency compression algorithm More audibility of high frequency sounds Phonak led the way in modern frequency lowering technology with the introduction

More information

ABSTRACT REPEATABILITY OF AERODYNAMIC MEASUREMENTS OF VOICE. by Courtney Rollins Garrison

ABSTRACT REPEATABILITY OF AERODYNAMIC MEASUREMENTS OF VOICE. by Courtney Rollins Garrison ABSTRACT REPEATABILITY OF AERODYNAMIC MEASUREMENTS OF VOICE by Courtney Rollins Garrison The purpose of this study was to compare baseline aerodynamic measurements of voice to measurements repeated after

More information

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds Hearing Lectures Acoustics of Speech and Hearing Week 2-10 Hearing 3: Auditory Filtering 1. Loudness of sinusoids mainly (see Web tutorial for more) 2. Pitch of sinusoids mainly (see Web tutorial for more)

More information

Computational Perception /785. Auditory Scene Analysis

Computational Perception /785. Auditory Scene Analysis Computational Perception 15-485/785 Auditory Scene Analysis A framework for auditory scene analysis Auditory scene analysis involves low and high level cues Low level acoustic cues are often result in

More information

Speech Spectra and Spectrograms

Speech Spectra and Spectrograms ACOUSTICS TOPICS ACOUSTICS SOFTWARE SPH301 SLP801 RESOURCE INDEX HELP PAGES Back to Main "Speech Spectra and Spectrograms" Page Speech Spectra and Spectrograms Robert Mannell 6. Some consonant spectra

More information

Topic 4. Pitch & Frequency

Topic 4. Pitch & Frequency Topic 4 Pitch & Frequency A musical interlude KOMBU This solo by Kaigal-ool of Huun-Huur-Tu (accompanying himself on doshpuluur) demonstrates perfectly the characteristic sound of the Xorekteer voice An

More information

HCS 7367 Speech Perception

HCS 7367 Speech Perception Long-term spectrum of speech HCS 7367 Speech Perception Connected speech Absolute threshold Males Dr. Peter Assmann Fall 212 Females Long-term spectrum of speech Vowels Males Females 2) Absolute threshold

More information

UKnowledge. University of Kentucky

UKnowledge. University of Kentucky University of Kentucky UKnowledge Theses and Dissertations--Rehabilitation Sciences Rehabilitation Sciences 2016 INVESTIGATING THE EFFICACY OF VOCAL FUNCTION EXERCISES IN IMPROVING VOCAL FUNCTION IN ADULTS

More information

Discrete Signal Processing

Discrete Signal Processing 1 Discrete Signal Processing C.M. Liu Perceptual Lab, College of Computer Science National Chiao-Tung University http://www.cs.nctu.edu.tw/~cmliu/courses/dsp/ ( Office: EC538 (03)5731877 cmliu@cs.nctu.edu.tw

More information

Consonant Perception test

Consonant Perception test Consonant Perception test Introduction The Vowel-Consonant-Vowel (VCV) test is used in clinics to evaluate how well a listener can recognize consonants under different conditions (e.g. with and without

More information

Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear

Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear The psychophysics of cochlear implants Stuart Rosen Professor of Speech and Hearing Science Speech, Hearing and Phonetic Sciences Division of Psychology & Language Sciences Prelude Envelope and temporal

More information

Linguistic Phonetics. Basic Audition. Diagram of the inner ear removed due to copyright restrictions.

Linguistic Phonetics. Basic Audition. Diagram of the inner ear removed due to copyright restrictions. 24.963 Linguistic Phonetics Basic Audition Diagram of the inner ear removed due to copyright restrictions. 1 Reading: Keating 1985 24.963 also read Flemming 2001 Assignment 1 - basic acoustics. Due 9/22.

More information

2/25/2013. Context Effect on Suprasegmental Cues. Supresegmental Cues. Pitch Contour Identification (PCI) Context Effect with Cochlear Implants

2/25/2013. Context Effect on Suprasegmental Cues. Supresegmental Cues. Pitch Contour Identification (PCI) Context Effect with Cochlear Implants Context Effect on Segmental and Supresegmental Cues Preceding context has been found to affect phoneme recognition Stop consonant recognition (Mann, 1980) A continuum from /da/ to /ga/ was preceded by

More information

Linguistic Phonetics Fall 2005

Linguistic Phonetics Fall 2005 MIT OpenCourseWare http://ocw.mit.edu 24.963 Linguistic Phonetics Fall 2005 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 24.963 Linguistic Phonetics

More information

PATTERN ELEMENT HEARING AIDS AND SPEECH ASSESSMENT AND TRAINING Adrian FOURCIN and Evelyn ABBERTON

PATTERN ELEMENT HEARING AIDS AND SPEECH ASSESSMENT AND TRAINING Adrian FOURCIN and Evelyn ABBERTON PATTERN ELEMENT HEARING AIDS AND SPEECH ASSESSMENT AND TRAINING Adrian FOURCIN and Evelyn ABBERTON Summary This paper has been prepared for a meeting (in Beijing 9-10 IX 1996) organised jointly by the

More information

Speech Cue Weighting in Fricative Consonant Perception in Hearing Impaired Children

Speech Cue Weighting in Fricative Consonant Perception in Hearing Impaired Children University of Tennessee, Knoxville Trace: Tennessee Research and Creative Exchange University of Tennessee Honors Thesis Projects University of Tennessee Honors Program 5-2014 Speech Cue Weighting in Fricative

More information

Spectrograms (revisited)

Spectrograms (revisited) Spectrograms (revisited) We begin the lecture by reviewing the units of spectrograms, which I had only glossed over when I covered spectrograms at the end of lecture 19. We then relate the blocks of a

More information

PERCEPTUAL MEASUREMENT OF BREATHY VOICE QUALITY

PERCEPTUAL MEASUREMENT OF BREATHY VOICE QUALITY PERCEPTUAL MEASUREMENT OF BREATHY VOICE QUALITY By SONA PATEL A THESIS PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER

More information

Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners

Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners Jan Rennies, Eugen Albertin, Stefan Goetze, and Jens-E. Appell Fraunhofer IDMT, Hearing, Speech and

More information

Quarterly Progress and Status Report. Effect on LTAS of vocal loudness variation

Quarterly Progress and Status Report. Effect on LTAS of vocal loudness variation Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Effect on LTAS of vocal loudness variation Nordenberg, M. and Sundberg, J. journal: TMH-QPSR volume: 45 number: 1 year: 2003 pages:

More information

Language Speech. Speech is the preferred modality for language.

Language Speech. Speech is the preferred modality for language. Language Speech Speech is the preferred modality for language. Outer ear Collects sound waves. The configuration of the outer ear serves to amplify sound, particularly at 2000-5000 Hz, a frequency range

More information

SLHS 1301 The Physics and Biology of Spoken Language. Practice Exam 2. b) 2 32

SLHS 1301 The Physics and Biology of Spoken Language. Practice Exam 2. b) 2 32 SLHS 1301 The Physics and Biology of Spoken Language Practice Exam 2 Chapter 9 1. In analog-to-digital conversion, quantization of the signal means that a) small differences in signal amplitude over time

More information

11 Music and Speech Perception

11 Music and Speech Perception 11 Music and Speech Perception Properties of sound Sound has three basic dimensions: Frequency (pitch) Intensity (loudness) Time (length) Properties of sound The frequency of a sound wave, measured in

More information

International Forensic Science & Forensic Medicine Conference Naif Arab University for Security Sciences Riyadh Saudi Arabia

International Forensic Science & Forensic Medicine Conference Naif Arab University for Security Sciences Riyadh Saudi Arabia SPECTRAL EDITING IN SPEECH RECORDINGS: CHALLENGES IN AUTHENTICITY CHECKING Antonio César Morant Braid Electronic Engineer, Specialist Official Forensic Expert, Public Security Secretariat - Technical Police

More information

Juan Carlos Tejero-Calado 1, Janet C. Rutledge 2, and Peggy B. Nelson 3

Juan Carlos Tejero-Calado 1, Janet C. Rutledge 2, and Peggy B. Nelson 3 PRESERVING SPECTRAL CONTRAST IN AMPLITUDE COMPRESSION FOR HEARING AIDS Juan Carlos Tejero-Calado 1, Janet C. Rutledge 2, and Peggy B. Nelson 3 1 University of Malaga, Campus de Teatinos-Complejo Tecnol

More information

Auditory Scene Analysis

Auditory Scene Analysis 1 Auditory Scene Analysis Albert S. Bregman Department of Psychology McGill University 1205 Docteur Penfield Avenue Montreal, QC Canada H3A 1B1 E-mail: bregman@hebb.psych.mcgill.ca To appear in N.J. Smelzer

More information

Topic 4. Pitch & Frequency. (Some slides are adapted from Zhiyao Duan s course slides on Computer Audition and Its Applications in Music)

Topic 4. Pitch & Frequency. (Some slides are adapted from Zhiyao Duan s course slides on Computer Audition and Its Applications in Music) Topic 4 Pitch & Frequency (Some slides are adapted from Zhiyao Duan s course slides on Computer Audition and Its Applications in Music) A musical interlude KOMBU This solo by Kaigal-ool of Huun-Huur-Tu

More information

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening AUDL GS08/GAV1 Signals, systems, acoustics and the ear Pitch & Binaural listening Review 25 20 15 10 5 0-5 100 1000 10000 25 20 15 10 5 0-5 100 1000 10000 Part I: Auditory frequency selectivity Tuning

More information

Combination of Bone-Conducted Speech with Air-Conducted Speech Changing Cut-Off Frequency

Combination of Bone-Conducted Speech with Air-Conducted Speech Changing Cut-Off Frequency Combination of Bone-Conducted Speech with Air-Conducted Speech Changing Cut-Off Frequency Tetsuya Shimamura and Fumiya Kato Graduate School of Science and Engineering Saitama University 255 Shimo-Okubo,

More information

Twenty subjects (11 females) participated in this study. None of the subjects had

Twenty subjects (11 females) participated in this study. None of the subjects had SUPPLEMENTARY METHODS Subjects Twenty subjects (11 females) participated in this study. None of the subjects had previous exposure to a tone language. Subjects were divided into two groups based on musical

More information

Voice Evaluation. Area of Concern:

Voice Evaluation. Area of Concern: Name : Lilly Tulip File Number : 1002 Age : xx years D.O.B. : 9-9-1999 Address : 9999 9 th St NW D.O.E. : 11-30-2012 Minot, ND 59992 Referral : Dr. Andrew Hetland Phone : (701) 899-9999 Code : 784.40 Area

More information

Creative Commons: Attribution 3.0 Hong Kong License

Creative Commons: Attribution 3.0 Hong Kong License Title Effect of feedback on the effectiveness of a paired comparison perceptual voice rating training program Other Contributor(s) University of Hong Kong. Author(s) Law, Tsz-ying Citation Issued Date

More information

A COMPARISON OF PSYCHOPHYSICAL METHODS FOR THE EVALUATION OF ROUGH VOICE QUALITY

A COMPARISON OF PSYCHOPHYSICAL METHODS FOR THE EVALUATION OF ROUGH VOICE QUALITY A COMPARISON OF PSYCHOPHYSICAL METHODS FOR THE EVALUATION OF ROUGH VOICE QUALITY By STACIE NOELLE CUMMINGS A THESIS PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT

More information

Credits. Learner objectives and outcomes. Outline. Why care about voice quality? I. INTRODUCTION

Credits. Learner objectives and outcomes. Outline. Why care about voice quality? I. INTRODUCTION Credits Perceptual assessment of voice - past, present and future Jody Kreiman, PhD University of California, Los Angeles The ideas presented here were developed in collaboration with Bruce Gerratt (UCLA).

More information

INTRODUCTION J. Acoust. Soc. Am. 103 (2), February /98/103(2)/1080/5/$ Acoustical Society of America 1080

INTRODUCTION J. Acoust. Soc. Am. 103 (2), February /98/103(2)/1080/5/$ Acoustical Society of America 1080 Perceptual segregation of a harmonic from a vowel by interaural time difference in conjunction with mistuning and onset asynchrony C. J. Darwin and R. W. Hukin Experimental Psychology, University of Sussex,

More information

The Pennsylvania State University. The Graduate School. College of Health and Human Development THE EFFECT OF EXPERIENCE AND THE RELATIONSHIP AMONG

The Pennsylvania State University. The Graduate School. College of Health and Human Development THE EFFECT OF EXPERIENCE AND THE RELATIONSHIP AMONG i The Pennsylvania State University The Graduate School College of Health and Human Development THE EFFECT OF EXPERIENCE AND THE RELATIONSHIP AMONG SUBJECTIVE AND OBJECTIVE MEASURES OF VOICE QUALITY A

More information

It is important to understand as to how do we hear sounds. There is air all around us. The air carries the sound waves but it is below 20Hz that our

It is important to understand as to how do we hear sounds. There is air all around us. The air carries the sound waves but it is below 20Hz that our Phonetics. Phonetics: it is a branch of linguistics that deals with explaining the articulatory, auditory and acoustic properties of linguistic sounds of human languages. It is important to understand

More information

Overview. Each session is presented with: Text and graphics to summarize the main points Narration, video and audio Reference documents Quiz

Overview. Each session is presented with: Text and graphics to summarize the main points Narration, video and audio Reference documents Quiz TAVS Online Certification Course 2014 Advanced Brain Technologies All Rights Reserved Overview Each session is presented with: Text and graphics to summarize the main points Narration, video and audio

More information

Effects of speaker's and listener's environments on speech intelligibili annoyance. Author(s)Kubo, Rieko; Morikawa, Daisuke; Akag

Effects of speaker's and listener's environments on speech intelligibili annoyance. Author(s)Kubo, Rieko; Morikawa, Daisuke; Akag JAIST Reposi https://dspace.j Title Effects of speaker's and listener's environments on speech intelligibili annoyance Author(s)Kubo, Rieko; Morikawa, Daisuke; Akag Citation Inter-noise 2016: 171-176 Issue

More information

Dr Sabah Mohammed Hassan. Consultant Phoniatrician

Dr Sabah Mohammed Hassan. Consultant Phoniatrician Consultant Phoniatrician King Abul Aziz University Hospital King Saud University Non-organic (Functional)Voice Disorders Introduction Etiological Classification Of Voice Disorders: 1-Organic. 2- Non-Organic

More information

The role of periodicity in the perception of masked speech with simulated and real cochlear implants

The role of periodicity in the perception of masked speech with simulated and real cochlear implants The role of periodicity in the perception of masked speech with simulated and real cochlear implants Kurt Steinmetzger and Stuart Rosen UCL Speech, Hearing and Phonetic Sciences Heidelberg, 09. November

More information

Voice Outcomes Following Treatment of Benign Midmembranous Vocal Fold Lesions Using a Nomenclature Paradigm

Voice Outcomes Following Treatment of Benign Midmembranous Vocal Fold Lesions Using a Nomenclature Paradigm The Laryngoscope VC 2015 The American Laryngological, Rhinological and Otological Society, Inc. Voice Outcomes Following Treatment of Benign Midmembranous Vocal Fold Lesions Using a Nomenclature Paradigm

More information

Research Article Measurement of Voice Onset Time in Maxillectomy Patients

Research Article Measurement of Voice Onset Time in Maxillectomy Patients e Scientific World Journal, Article ID 925707, 4 pages http://dx.doi.org/10.1155/2014/925707 Research Article Measurement of Voice Onset Time in Maxillectomy Patients Mariko Hattori, 1 Yuka I. Sumita,

More information

Voice Analysis in Individuals with Chronic Obstructive Pulmonary Disease

Voice Analysis in Individuals with Chronic Obstructive Pulmonary Disease ORIGINAL ARTICLE Voice Analysis in Individuals with Chronic 10.5005/jp-journals-10023-1081 Obstructive Pulmonary Disease Voice Analysis in Individuals with Chronic Obstructive Pulmonary Disease 1 Anuradha

More information

A Study on the Degree of Pronunciation Improvement by a Denture Attachment Using an Weighted-α Formant

A Study on the Degree of Pronunciation Improvement by a Denture Attachment Using an Weighted-α Formant A Study on the Degree of Pronunciation Improvement by a Denture Attachment Using an Weighted-α Formant Seong-Geon Bae 1 1 School of Software Application, Kangnam University, Gyunggido, Korea. 1 Orcid Id:

More information

2012, Greenwood, L.

2012, Greenwood, L. Critical Review: How Accurate are Voice Accumulators for Measuring Vocal Behaviour? Lauren Greenwood M.Cl.Sc. (SLP) Candidate University of Western Ontario: School of Communication Sciences and Disorders

More information

Auditory gist perception and attention

Auditory gist perception and attention Auditory gist perception and attention Sue Harding Speech and Hearing Research Group University of Sheffield POP Perception On Purpose Since the Sheffield POP meeting: Paper: Auditory gist perception:

More information

Speech Generation and Perception

Speech Generation and Perception Speech Generation and Perception 1 Speech Generation and Perception : The study of the anatomy of the organs of speech is required as a background for articulatory and acoustic phonetics. An understanding

More information

Reference: Mark S. Sanders and Ernest J. McCormick. Human Factors Engineering and Design. McGRAW-HILL, 7 TH Edition. NOISE

Reference: Mark S. Sanders and Ernest J. McCormick. Human Factors Engineering and Design. McGRAW-HILL, 7 TH Edition. NOISE NOISE NOISE: It is considered in an information-theory context, as that auditory stimulus or stimuli bearing no informational relationship to the presence or completion of the immediate task. Human ear

More information

HCS 7367 Speech Perception

HCS 7367 Speech Perception Babies 'cry in mother's tongue' HCS 7367 Speech Perception Dr. Peter Assmann Fall 212 Babies' cries imitate their mother tongue as early as three days old German researchers say babies begin to pick up

More information

Acoustics of Speech and Environmental Sounds

Acoustics of Speech and Environmental Sounds International Speech Communication Association (ISCA) Tutorial and Research Workshop on Experimental Linguistics 28-3 August 26, Athens, Greece Acoustics of Speech and Environmental Sounds Susana M. Capitão

More information

A Determination of Drinking According to Types of Sentence using Speech Signals

A Determination of Drinking According to Types of Sentence using Speech Signals A Determination of Drinking According to Types of Sentence using Speech Signals Seong-Geon Bae 1, Won-Hee Lee 2 and Myung-Jin Bae *3 1 School of Software Application, Kangnam University, Gyunggido, Korea.

More information

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair Who are cochlear implants for? Essential feature People with little or no hearing and little conductive component to the loss who receive little or no benefit from a hearing aid. Implants seem to work

More information

CLUTTERING SEVERITY ASSESSMENT: TALKING TIME DETERMINATION DURING ORAL READING

CLUTTERING SEVERITY ASSESSMENT: TALKING TIME DETERMINATION DURING ORAL READING CLUTTERING SEVERITY ASSESSMENT: TALKING TIME DETERMINATION DURING ORAL READING Klaas Bakker Su Jin Lee Missouri State University, Springfield, MO Florence Myers Lawrence Raphael Adelphi University, Garden

More information

Temporal offset judgments for concurrent vowels by young, middle-aged, and older adults

Temporal offset judgments for concurrent vowels by young, middle-aged, and older adults Temporal offset judgments for concurrent vowels by young, middle-aged, and older adults Daniel Fogerty Department of Communication Sciences and Disorders, University of South Carolina, Columbia, South

More information

USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES

USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES Varinthira Duangudom and David V Anderson School of Electrical and Computer Engineering, Georgia Institute of Technology Atlanta, GA 30332

More information

Binaural processing of complex stimuli

Binaural processing of complex stimuli Binaural processing of complex stimuli Outline for today Binaural detection experiments and models Speech as an important waveform Experiments on understanding speech in complex environments (Cocktail

More information

Sound Waves. Sensation and Perception. Sound Waves. Sound Waves. Sound Waves

Sound Waves. Sensation and Perception. Sound Waves. Sound Waves. Sound Waves Sensation and Perception Part 3 - Hearing Sound comes from pressure waves in a medium (e.g., solid, liquid, gas). Although we usually hear sounds in air, as long as the medium is there to transmit the

More information

group by pitch: similar frequencies tend to be grouped together - attributed to a common source.

group by pitch: similar frequencies tend to be grouped together - attributed to a common source. Pattern perception Section 1 - Auditory scene analysis Auditory grouping: the sound wave hitting out ears is often pretty complex, and contains sounds from multiple sources. How do we group sounds together

More information

The Effects of Sleep Deprivation on the Acoustic and Perceptual Characteristics of Voice

The Effects of Sleep Deprivation on the Acoustic and Perceptual Characteristics of Voice University of Colorado, Boulder CU Scholar Speech, Language, and Hearing Sciences Graduate Theses & Dissertations Speech, Language and Hearing Sciences Spring 1-1-2017 The Effects of Sleep Deprivation

More information

Learning Process. Auditory Training for Speech and Language Development. Auditory Training. Auditory Perceptual Abilities.

Learning Process. Auditory Training for Speech and Language Development. Auditory Training. Auditory Perceptual Abilities. Learning Process Auditory Training for Speech and Language Development Introduction Demonstration Perception Imitation 1 2 Auditory Training Methods designed for improving auditory speech-perception Perception

More information

of the Consensus Auditory-Perceptual Evaluation The Consensus Auditory-Perceptual Evaluation of Voice: Development of a Standardized Clinical Protocol

of the Consensus Auditory-Perceptual Evaluation The Consensus Auditory-Perceptual Evaluation of Voice: Development of a Standardized Clinical Protocol Clinical Focus Consensus Auditory-Perceptual Evaluation of Voice: Development of a Standardized Clinical Protocol Gail B. Kempster Rush University, Chicago Bruce R. Gerratt University of California, Los

More information

Congruency Effects with Dynamic Auditory Stimuli: Design Implications

Congruency Effects with Dynamic Auditory Stimuli: Design Implications Congruency Effects with Dynamic Auditory Stimuli: Design Implications Bruce N. Walker and Addie Ehrenstein Psychology Department Rice University 6100 Main Street Houston, TX 77005-1892 USA +1 (713) 527-8101

More information

The Phonetic Characteristics in Patients of Bilateral Vocal Fold Paralysis Without Tracheotomy

The Phonetic Characteristics in Patients of Bilateral Vocal Fold Paralysis Without Tracheotomy Original Article Clinical and Experimental Otorhinolaryngology Vol. 10, No. 3: 272-277, September 2017 https://doi.org/10.21053/ceo.2016.00626 pissn 1976-8710 eissn 2005-0720 The Phonetic Characteristics

More information

Why? Speech in Noise + Hearing Aids = Problems in Noise. Recall: Two things we must do for hearing loss: Directional Mics & Digital Noise Reduction

Why? Speech in Noise + Hearing Aids = Problems in Noise. Recall: Two things we must do for hearing loss: Directional Mics & Digital Noise Reduction Directional Mics & Digital Noise Reduction Speech in Noise + Hearing Aids = Problems in Noise Why? Ted Venema PhD Recall: Two things we must do for hearing loss: 1. Improve audibility a gain issue achieved

More information

Perception of American English can and can t by Japanese professional interpreters* 1

Perception of American English can and can t by Japanese professional interpreters* 1 Research Note JAITS Perception of American English can and can t by Japanese professional interpreters* 1 Kinuko TAKAHASHI Tomohiko OOIGAWA (Doctoral Program in Linguistics, Graduate School of Foreign

More information

Putting the focus on conversations

Putting the focus on conversations Putting the focus on conversations A three-part whitepaper series Moving to a new technology platform presents hearing instrument manufacturers with opportunities to make changes that positively impact

More information

whether or not the fundamental is actually present.

whether or not the fundamental is actually present. 1) Which of the following uses a computer CPU to combine various pure tones to generate interesting sounds or music? 1) _ A) MIDI standard. B) colored-noise generator, C) white-noise generator, D) digital

More information

What Is the Difference between db HL and db SPL?

What Is the Difference between db HL and db SPL? 1 Psychoacoustics What Is the Difference between db HL and db SPL? The decibel (db ) is a logarithmic unit of measurement used to express the magnitude of a sound relative to some reference level. Decibels

More information

Effects of Cochlear Hearing Loss on the Benefits of Ideal Binary Masking

Effects of Cochlear Hearing Loss on the Benefits of Ideal Binary Masking INTERSPEECH 2016 September 8 12, 2016, San Francisco, USA Effects of Cochlear Hearing Loss on the Benefits of Ideal Binary Masking Vahid Montazeri, Shaikat Hossain, Peter F. Assmann University of Texas

More information

A Prospective Crossover Trial of Botulinum Toxin Chemodenervation Versus Injection Augmentation for Essential Voice Tremor

A Prospective Crossover Trial of Botulinum Toxin Chemodenervation Versus Injection Augmentation for Essential Voice Tremor The Laryngoscope VC 2017 The American Laryngological, Rhinological and Otological Society, Inc. A Prospective Crossover Trial of Botulinum Toxin Chemodenervation Versus Injection Augmentation for Essential

More information

EXPLORATORY ANALYSIS OF SPEECH FEATURES RELATED TO DEPRESSION IN ADULTS WITH APHASIA

EXPLORATORY ANALYSIS OF SPEECH FEATURES RELATED TO DEPRESSION IN ADULTS WITH APHASIA EXPLORATORY ANALYSIS OF SPEECH FEATURES RELATED TO DEPRESSION IN ADULTS WITH APHASIA Stephanie Gillespie 1, Elliot Moore 1, Jacqueline Laures-Gore 2, Matthew Farina 2 1 Georgia Institute of Technology,

More information

Speech and Voice Impairments in Individuals with PD. Jessica E. Huber, Ph.D. Associate Professor Purdue University

Speech and Voice Impairments in Individuals with PD. Jessica E. Huber, Ph.D. Associate Professor Purdue University Speech and Voice Impairments in Individuals with PD Jessica E. Huber, Ph.D. Associate Professor Purdue University jhuber@purdue.edu Parkinson s Disease Progressive disorder which affects about 1 million

More information

Gick et al.: JASA Express Letters DOI: / Published Online 17 March 2008

Gick et al.: JASA Express Letters DOI: / Published Online 17 March 2008 modality when that information is coupled with information via another modality (e.g., McGrath and Summerfield, 1985). It is unknown, however, whether there exist complex relationships across modalities,

More information

ACOUSTIC ANALYSIS AND PERCEPTION OF CANTONESE VOWELS PRODUCED BY PROFOUNDLY HEARING IMPAIRED ADOLESCENTS

ACOUSTIC ANALYSIS AND PERCEPTION OF CANTONESE VOWELS PRODUCED BY PROFOUNDLY HEARING IMPAIRED ADOLESCENTS ACOUSTIC ANALYSIS AND PERCEPTION OF CANTONESE VOWELS PRODUCED BY PROFOUNDLY HEARING IMPAIRED ADOLESCENTS Edward Khouw, & Valter Ciocca Dept. of Speech and Hearing Sciences, The University of Hong Kong

More information

Closing and Opening Phase Variability in Dysphonia Adrian Fourcin(1) and Martin Ptok(2)

Closing and Opening Phase Variability in Dysphonia Adrian Fourcin(1) and Martin Ptok(2) Closing and Opening Phase Variability in Dysphonia Adrian Fourcin() and Martin Ptok() () University College London 4 Stephenson Way, LONDON NW PE, UK () Medizinische Hochschule Hannover C.-Neuberg-Str.

More information

Systems Neuroscience Oct. 16, Auditory system. http:

Systems Neuroscience Oct. 16, Auditory system. http: Systems Neuroscience Oct. 16, 2018 Auditory system http: www.ini.unizh.ch/~kiper/system_neurosci.html The physics of sound Measuring sound intensity We are sensitive to an enormous range of intensities,

More information

Voice Pitch Control Using a Two-Dimensional Tactile Display

Voice Pitch Control Using a Two-Dimensional Tactile Display NTUT Education of Disabilities 2012 Vol.10 Voice Pitch Control Using a Two-Dimensional Tactile Display Masatsugu SAKAJIRI 1, Shigeki MIYOSHI 2, Kenryu NAKAMURA 3, Satoshi FUKUSHIMA 3 and Tohru IFUKUBE

More information

Role of F0 differences in source segregation

Role of F0 differences in source segregation Role of F0 differences in source segregation Andrew J. Oxenham Research Laboratory of Electronics, MIT and Harvard-MIT Speech and Hearing Bioscience and Technology Program Rationale Many aspects of segregation

More information