Integral and Diagnostic Speech-Quality Measurement: State of the Art, Problems, and New Approaches

Size: px
Start display at page:

Download "Integral and Diagnostic Speech-Quality Measurement: State of the Art, Problems, and New Approaches"

Transcription

1 Integral and Diagnostic Speech-Quality Measurement: State of the Art, Problems, and New Approaches U. Heute *, S. Möller **, A. Raake ***, K. Scholz *, M. Wältermann ** * Institute for Circuit and System Theory, Christian-Albrechts-University, Kaiserstr. 2, D Kiel, Germany, {uh,ks}@tf.uni-kiel.de, ** Institute of Communication Acoustics, Ruhr-University, D Bochum, Germany, {sebastian.moeller,marcel.waeltermann}@rub.de, *** LIMSI - CNRS, Université de Paris-Sud, F Orsay, France, alexander.raake@limsi.fr The user s overall impression of a speech signal which has been influenced by some system can be described in terms of the integral quality. There are well-defined auditory methods to assess integral quality: Mostly, an absolute-category rating in a listening-only situation is used, resulting in the meanopinion score (MOS). For this integral quality index, proposals exist for instrumental measurements, yielding MOS estimates. Such estimates have problems with distortions not taken into account during the model development. Furthermore, they do not allow to characterize the quality of speech signals. A different approach, trying to overcome these problems, aims at quality attributes, concerning distinct distortions, thus allowing for a system diagnosis, and together forming an integral-quality-impression model also able to cope with future degradations. Earlier work into that direction had problems, too. Avoiding some weaknesses and re-defining more suitable attributes, the diagnostic approach is re-visited. First results are reported, and further work is outlined. 1 Introduction The quality attributed to some speech-transmission, speech-enhancement, or, more generally, speechprocessing system by a user is the result of a perception and a judgment process. Both depend on factors not directly visible (or audible) in the speech signal, so that users alone, i.e., human subjects, can give valid quality statements, after all. Such statements will often concern the overall, i.e., integral quality. There are well-defined auditory methods to assess integral quality: Mostly, an absolutecategory rating (ACR) in a listening-only situation is used, resulting in the mean-opinion score (MOS). Numerous other methods exist, using different scales or more refined test situations. Smaller quality degradations may better be judged in a paired comparison test, because the differential sensitivity of the ear is higher than the absolute one. This is relevant in our context, because we would like to discriminate relatively small deteriorations, like those in a telephone transmission situation. Unfortunately, auditory experiments with human test subjects are expensive and time-consuming. System developers, thus, would like to get information on quality directly from the speech signal, without asking human listeners. For the integral quality, proposals exist for efficient instrumental, signal-based measurements, yielding MOS estimates, like the ITU-T standard P.862 [12] or the TOSQA model [3][4]. Models relying on signalbased measurements have problems with distortions not taken into account during the model development. Furthermore, the MOS estimates they provide do not allow to characterize the quality of speech signals: They might provide the same MOS for two signals which are perceived to sound differently in an auditory test. A different approach has been well-known that overcomes these problems: Quality attributes can be aimed at which - concern detailed and distinct distortion effects, - thus, allow for a system diagnosis, and - together may form an integral-quality-impression model - which is able to cope also with future degradations, not yet taken into account when the model was developed. Quality attributes can be investigated both on an auditory and on a speech-signal level. On the auditory level, perceptual dimensions have to be identified and their importance for integral quality has to be determined; on the speech-signal level, instrumental measures for the quality attributes have to be found and combined to provide an integral-quality estimate. Earlier work on quality attributes on both levels [1][5] has been known to have problems, too. Avoiding some weaknesses and re-defining more suitable attributes, the diagnostic approach is re-visited, including an integral-quality measure to be finally derived. New, important types of degradations, like time-varying, bursty data-packet loss in internet transmission, are to be included. In Sec. 2 and 3, present auditory and instrumental measurements for integral quality are motivated and explained. Corresponding procedures towards diagnostic quality estimation are presented in Sec. 4, motivating an improved attribute selection outlined in 1695

2 Sec. 5. Sec. 6 shows results of first experimental investigations in order to identify underlying perceptual attributes. The necessities of further work are summarized in Sec Auditory Integral-Quality Measurement Speech quality is an important factor for the acceptance (i.e. after all: the economical success) of a communication system. For valid judgments, users have to be asked for their impressions after using the communication system. Involving human subjects gives rise to the common notation as subjective tests ; in fact, measuring quality involves a perception and a judgment process which can only take place in a human acting subject. Still, a properly-designed test quite rigidly prescribes the choice of speech material, speakers, and listeners in order to objectivate the test results, i.e. to make them generalizable. Because of this, the term auditory measurement is preferred although the above-named points of choice always leave some subjectivity. A communication background would actually require two-way, i.e., conversation tests as a model. Because of their excessive time consumption, conversation tests with simplifications have been devised [17]. Still, listening-only tests (LOT) are preferred when new communication systems are to be designed, e.g., for the selection of a best candidate for a new standard system by the International Telecommunication Union / Telecommunication Sector (ITU-T). Even in an LOT the expenditure is quite high; so the possibility of direct (e.g., paired) system comparisons is often discarded. Instead, each system is independently graded on an absolute scale between 5 = excellent and 1 = poor after a listener s impression of a speech sample, and the judgments of many listeners in this ACR are averaged, yielding a MOS value for the system under test. The procedure for the choice of speech material and subjects as well as the evaluation have been defined in international standards [10]. 3 Instrumental Integral-Quality Estimation Even in its form as an ACR-LOT, auditory evaluations may be too expensive, especially too costly for frequent tests during a system development in a laboratory. Therefore, it is attractive to search for signal-based features measurable purely instrumentally such that the LOT-MOS result can be estimated reliably. The basis is, usually, a signal comparison it should be noted that, here, the quite poor LOT model of the communication situation is modeled itself, and in a not really consistent way. There are two distinct approaches for the comparison: One, due to Atal [20], relates the system-output signal to an error signal, i.e., the difference between input and output signals (see Fig. 1); the other one, due to Gersho [24], compares the input and output signals directly (see Fig. 2). The first idea generates more or less refined and extended variants of the classical SNR. The latter approach led to a series of advanced techniques (e.g., PSQM in [2][11], q C in [6], SQET in [7]), two of which are available commercially (TOSQA in [4]) or even standardized (PESQ in [12]). Their common idea is the measurement of distances (or similarities) between the input signal and the output or noise signal. The comparison is performed on a perception-based representation, e.g. using non-linearly transformed inner-ear excitations on a warped (Bark-) frequency scale. The methods differ in measurement details (like their determination of spectra), but also in the amount of psychoacoustics involved. Fig. 3 shows SQET as an example. PESQ, e.g., ignores loudness compression laws which are known from psychoacoustics for many years. Figure 1: Comparison between input and error signal for MOS estimation, according to Hauenstein [7], type

3 Figure 2: Comparison between input and output signal for MOS estimation, according to Hauenstein [7], type 1 Investigations in [7], however, have indicated that indeed as much knowledge as available and realizable should be employed: In the view of a generally applicable measure (i.e., outside the system-training world ), this is preferable to any combination of magical factors optimized blindly with respect to a maximum correlation between real and estimated MOS values. The latter two problems can be observed also in the best available measures. Fig. 4 shows auditory MOS values and their estimates using the PESQ or the TOSQA model. In some cases, instrumental measures predict the same MOS where indeed the auditory ratings differ quite largely, resulting in vertical groups of the corresponding test conditions (see ellipse in the right panel). In other cases, differences in MOS are predicted where the auditory MOS remains the same, leading to horizontal groups (ellipse in the left panel). Diagnostic information can be obtained through auditory tests (see Sec. 4.1) as well as through instrumental measurements (see Sec. 4.2). When using auditory tests, attributes can either be pre-defined, or they result from the test itself. Both methods will be discussed in the following paragraphs. 4.1 Auditory Diagnostic Measures Figure 3: Speech-quality evaluation tool (SQET, [8]) 4 Diagnostic Quality Measures While for a system selection, a single-value (MOS-) result is useful, it is not desirable during the systemdevelopment phase: More information about the influence of system parameters on sound perception is needed. Some distortions may change in a counteracting way such that the MOS remains the same. Even more, in instrumental measurement, a constant average distance may lead to an unchanged MOS estimate where in fact a listener would weight the distortions differently and change a judgment; on the other hand, the distance may change while the listener feels that, over all, the quality is the same. Barnwell s group [18] used the diagnostic acceptability measure (DAM, [22]) for very detailed investigations: Listeners were asked for their impression on both integral quality and details in terms of L = 10 attributes like brightness, noisiness, naturalness, etc. It was also tried to mimic human judgment by combining weighted attribute judgments into an integral-quality estimate. The run of such tests, however, requires huge resources, because listeners have to be trained for judging the attributes. While the DAM relies on predefined attributes, a multi-dimensional analysis [15][16][23] assumes only a fixed number L of dimensions: Listeners evaluate the perceived distances between system variants (in a comparison test), and N L systems are then interpreted as points on an L-dimensional map. Here, similarities and differences between the systems become visible, and from system knowledge, names may be attributed to the L axes. The dimensionality has to be selected as a compromise between an optimal fit of the distance judgments and the interpretability of the resulting axes. 1697

4 Figure 4: MOS values and their estimations MOS for two versions of enhanced Bark-spectral distances, both with high correlations: a) TOSQA, b) PESQ 4.2 Instrumental Diagnostic Measures Quality attributes which have been determined in an auditory test may also be predicted with the help of instrumental measures. For example, Barnwell s group investigated the correlation between the auditory DAM results and measured, known or varied quality indicators like the global or segmental SNR, a frequency-dependent SNR, Lp-norms of (LPC-) spectral distances, etc. The results show relatively loose relations even with optimized estimation parameters (like the value p in a norm), unless the class of systems was strongly restricted. Much higher correlations were achieved with a complicated, provisional combined measure [18]. Halka et al. [5] followed a different way: Using an improved noise-loading measurement [21], the linear distortion was described by an average frequency response jω H ( e ), the non-linearities by an average yy e noise spectrum jω Φnn ( e ), and the output signal by its average power density jω Φ ( ), with a speechlike random process as an input signal rather than the commonly used speech-signal samples. Then, these measured functions were mapped onto features highly correlated with the main factors of an auditory attribute analysis [1], and a combined measure for the integral quality was derived. A total correlation of some 95% was achieved, but with unexplained outliers. Later work with inclusion of foreign-type systems showed that more than just a parameter adjustment or additional sub-features were needed [7]. 5 Improved Attribute Approach The above-named direct integral-quality estimators (PESQ, TOSQA, SQET, etc.) outperform our earlier attribute-matching approach by far. After a careful analysis, the disadvantages are seen - not in the use of a speech-model signal, - not in the detour via attributes: this does fit the human judgment process; - indeed in the extraction of quality information from system descriptions averaged a priori rather than from instantaneous behaviors with an averaging at the end (as in the direct procedures); - indeed in the comparison between output and error / distortion signals rather than input and output signals (as in the direct procedures: Gersho s approach is, other than in audio quality, more appropriate than Atal s); - indeed in the use of pre-defined attributes which include clearness and, especially, also naturalness, i.e., actually no detail attributes but rather composite features almost like quality itself. We aim now at - more suitable attributes with 1698

5 - better separation of perceptual details, - i.e.: orthogonality such that, e.g., naturalness is replaced by a decomposition of un-natural effects, - model systems for such effects, and - immediate measuring procedures for these new attributes. In a first step, we try to identify perceptual attributes for a large number of degradations which are common in modern telephone transmission. In a second step, these attributes will be predicted by adequate signalbased measures. From the auditorily or instrumentally determined attribute values, indices for integral quality will be derived as a final step. The instrumental models as well as real-world test systems are run and evaluated on a real-time simulator able to simulate a duplex speech communication with large variability [17][19]. 6 First Results In order to identify attributes which characterize a wide range of modern telecommunication systems, a multidimensional scaling experiment has been carried out [23]. 14 test subjects had to rate the similarity of stimuli which were presented in pairs, on a continuous scale. The stimuli correspond to 14 different circuit conditions reflecting typical degradations, namely those associated with - codecs, - linear distortions and bandwidth limitations, - terminal equipment such as hands-free terminals (HFTs), - time-varying packet loss in IP networks, - additive background noise, and - noise-reduction algorithms. The test was carried out with source signals from two speakers (1m, 1f), resulting in 2 14 stimuli and comparisons to be performed by each listener. The similarity judgments were mapped to an L- dimensional space, using the INDSCAL algorithm which accounts for individual differences of the weighting of perceptual dimensions by each test subject [14]. L = 4 was found to be a good compromise between the goodness of fit (S-stress value, explained variance) and the interpretability of the derived dimensions. Following an informal listening of the stimuli in their ranking with respect to the dimensions by several expert listeners, the extracted dimensions have been labeled as follows: 1. Interruptedness : This dimension is mainly loaded by packet loss, as well as by musical tones stemming from imperfect noise reduction. 2. Clearness /directness : This dimension shows high loadings for the HFT stimuli; it seems to be linked to peaks and valleys in the amplitude spectrum. 3. Frequency content : This dimension is characterized by the presence or absence of high and low frequencies; it seems to be speakerdependent. 4. Noisiness : Stimuli which show high loadings on this dimension are the background-noise and circuit-noise conditions. Some of these dimensions have already been found in earlier experiments, e.g. interruptedness [15]. Thus, they seem to be context-independent. Interestingly, the codec conditions all group in a cluster of the perceptual space. Earlier investigations which focused on such stimuli [1] thus cover only a small fraction of the perceptual space of modern telecommunication systems. 7 Summary and Outlook In the present paper, different methods for the auditory measurement and the instrumental estimation of speech quality were discussed. In particular, we focussed on such measures which provide information on different perceptive quality dimensions, and not just integral quality values. We argued that instrumental measures which rely on quality attributes may provide more diagnostic and more generic information on quality. For a set of transmission conditions which are typical for modern telecommunication situations, relevant dimensions were derived in an auditory multidimensional scaling experiment. Four dimensions could be extracted; the one which explained most of the auditory data was labelled interruptedness. Apparently, time-varying characteristics have a strong impact on the perception of the transmitted speech samples, and it can be expected that their impact on quality is also very high. Interestingly, these characteristics seem to form a distinct perceptual dimension; this could be a result of the judgment procedure where only one rating was solicited after listening to an entire sample pair. In future experiments, it may be helpful to collect time-varying ratings for the time-varying stimuli, e.g. using a slider method which has been proposed in [13]. The question which could not be answered so far is how the individual quality attributes contribute to integral quality. On the one hand, the relationship between integral quality and individual quality attributes can be investigated auditorily, using the external preference mapping procedure applied in [15]. On the other hand, it will be interesting to see whether the same relationships also hold for instrumental estimates of individual quality attributes. Developing 1699

6 instrumental predictors for individual quality attributes is our ultimate aim, because we think that they will make quality prediction more robust against changes in the physical system, and open to new types of degradations. It is desirable that an instrumental predictor can not only estimate the time-average rating of a certain quality attribute but also its time-variations in case of stimuli with time-varying characteristics. Such predictors may work with real speech signals or with artificial speech-like signals. In order to develop such predictors, tools may be helpful which allow adjusting individual quality attributes. The work for the above aims is based on grants of Deutsche Forschungsgemeinschaft (DFG grants He 4465 and MO 1038); this is gratefully acknowledged. References [1] Bappert, V., Blauert, J.: Auditory Quality Evaluation of Speech-Coding Systems. Acta Acust., vol. 2 (1994), pp [2] Beerends, J.G., Stemerdink, J.A.: A Perceptual Speech-Quality Measure Based on a Psychoacoustic Sound Representation. J. Audio Eng. Soc., vol. 42 (1994), pp [3] Berger, J.: Instrumental Methods for Speech- Quality Estimation Models of Auditory Tests. Doctoral dissertation (in German), Univ. Kiel, [4] Berger, J.: TOSQA Telecommunication Objective Speech-Quality Assessment. ITU-T, Contr. COM-12-34, Geneva, [5] Halka, U., Heute, U.: A New Approach to Speech- Quality Measures Based on Attribute Matching. Speech Comm., vol. 11 (1992), pp [6] Hansen, M.: Assessment and Prediction of Speech Transmission Quality with an Auditory Processing Model. Doctoral dissertation, Univ. Oldenburg, [7] Hauenstein, M.: Psychoacoustically Motivated Measures for Instrumental Speech-Quality Judgment. Doctoral dissertation (in German), Univ. Kiel, [8] Hauenstein, M.: Homepage. [9] ITU-T Handbook on Telephonometry. ITU-T, Geneva, [10] ITU-T Rec. P.800: Methods for Subjective Determination of Transmission Quality. ITU-T, Geneva, [11] ITU-T Rec. P.861: Objective Quality Measurement of Telephone-band Speech Codecs. ITU-T, Geneva, [12] ITU-T Rec. P.862: Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for End-to-end Speech Quality Assessment of Narrowband Telephone Networks and Speech Codecs. ITU-T, Geneva, [13] ITU-T Rec. P.880: Continuous Evaluation of Time-varying Speech Quality. ITU-T, Geneva, [14] Kruskal, J., Wish, M.: Multidimensional Scaling. Quantitative Applications in the Social Sciences, Vol (E.M. Uslaner, ed.), Sage, Newbury Park CA, [15] Mattila, V.-V.: Perceptual Dimensions of Speech Quality in Mobile Communications. PhD thesis, Tampere Univ. Techn., [16] McDermott B.J.: Multidimensional Analyses of Circuit Quality Judgments. JASA, vol. 45 (1969), pp [17] Möller, S.: Assessment and Prediction of Speech Quality in Telecommunications. Kluwer Academic Publ., Boston MA, [18] Quackenbush, S.R., Barnwell, T.P., Clement, M.A.: Objective Measures of Speech Quality. Prentice Hall, Englewood Cliffs, [19] Raake, A.: Assessment and Parametric Modelling of Speech Quality in Voice-over-IP Networks. Doctoral dissertation (unpubl.), Ruhr-Univ. Bochum, [20] Schroeder, M.R., Atal, B.S., Hall, J.L.: Optimizing Digital Speech Coders by Exploiting Masking Properties of the Human Ear. JASA, vol. 66 (1979), pp [21] Schüssler, H.W., Dong, Y.: A New Method for Measuring the Performance of Weakly Nonlinear and Noisy Systems. Frequenz, vol. 44 (1990), pp [22] Voiers, W.D.: Diagnostic Acceptability Measure for Speech Communication Systems. Proceed. IEEE ICASSP, Munich, 1977, pp [23] Wältermann, M.: Determination of Relevant Quality Dimensions in Speech Transmission via Modern Telecommunication Links. Diploma thesis (in German), Ruhr-Univ., Bochum, [24] Wang, S., Sekey, A., Gersho, A.: Auditory Distortion Measure for Speech Coding. Proceed. IEEE ICASSP, 1991, pp

An active unpleasantness control system for indoor noise based on auditory masking

An active unpleasantness control system for indoor noise based on auditory masking An active unpleasantness control system for indoor noise based on auditory masking Daisuke Ikefuji, Masato Nakayama, Takanabu Nishiura and Yoich Yamashita Graduate School of Information Science and Engineering,

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION CHAPTER 1 INTRODUCTION 1.1 BACKGROUND Speech is the most natural form of human communication. Speech has also become an important means of human-machine interaction and the advancement in technology has

More information

Loudness Processing of Time-Varying Sounds: Recent advances in psychophysics and challenges for future research

Loudness Processing of Time-Varying Sounds: Recent advances in psychophysics and challenges for future research Loudness Processing of Time-Varying Sounds: Recent advances in psychophysics and challenges for future research Emmanuel PONSOT 1 ; Patrick SUSINI 1 ; Sabine MEUNIER 2 1 STMS lab (Ircam, CNRS, UPMC), 1

More information

HCS 7367 Speech Perception

HCS 7367 Speech Perception Long-term spectrum of speech HCS 7367 Speech Perception Connected speech Absolute threshold Males Dr. Peter Assmann Fall 212 Females Long-term spectrum of speech Vowels Males Females 2) Absolute threshold

More information

USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES

USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES USING AUDITORY SALIENCY TO UNDERSTAND COMPLEX AUDITORY SCENES Varinthira Duangudom and David V Anderson School of Electrical and Computer Engineering, Georgia Institute of Technology Atlanta, GA 30332

More information

SUPPRESSION OF MUSICAL NOISE IN ENHANCED SPEECH USING PRE-IMAGE ITERATIONS. Christina Leitner and Franz Pernkopf

SUPPRESSION OF MUSICAL NOISE IN ENHANCED SPEECH USING PRE-IMAGE ITERATIONS. Christina Leitner and Franz Pernkopf 2th European Signal Processing Conference (EUSIPCO 212) Bucharest, Romania, August 27-31, 212 SUPPRESSION OF MUSICAL NOISE IN ENHANCED SPEECH USING PRE-IMAGE ITERATIONS Christina Leitner and Franz Pernkopf

More information

Audio Quality Assessment

Audio Quality Assessment Audio Quality Assessment ECE 511 Guest Lecture Michael Lester Purdue University Audio Quality Assessment Audio can have many difference purposes Audio is a signal which is a carrier of information We can

More information

Psychoacoustical Models WS 2016/17

Psychoacoustical Models WS 2016/17 Psychoacoustical Models WS 2016/17 related lectures: Applied and Virtual Acoustics (Winter Term) Advanced Psychoacoustics (Summer Term) Sound Perception 2 Frequency and Level Range of Human Hearing Source:

More information

SoundRecover2 the first adaptive frequency compression algorithm More audibility of high frequency sounds

SoundRecover2 the first adaptive frequency compression algorithm More audibility of high frequency sounds Phonak Insight April 2016 SoundRecover2 the first adaptive frequency compression algorithm More audibility of high frequency sounds Phonak led the way in modern frequency lowering technology with the introduction

More information

Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation

Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation Aldebaro Klautau - http://speech.ucsd.edu/aldebaro - 2/3/. Page. Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation ) Introduction Several speech processing algorithms assume the signal

More information

An Auditory System Modeling in Sound Source Localization

An Auditory System Modeling in Sound Source Localization An Auditory System Modeling in Sound Source Localization Yul Young Park The University of Texas at Austin EE381K Multidimensional Signal Processing May 18, 2005 Abstract Sound localization of the auditory

More information

Sonic Spotlight. SmartCompress. Advancing compression technology into the future

Sonic Spotlight. SmartCompress. Advancing compression technology into the future Sonic Spotlight SmartCompress Advancing compression technology into the future Speech Variable Processing (SVP) is the unique digital signal processing strategy that gives Sonic hearing aids their signature

More information

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening AUDL GS08/GAV1 Signals, systems, acoustics and the ear Pitch & Binaural listening Review 25 20 15 10 5 0-5 100 1000 10000 25 20 15 10 5 0-5 100 1000 10000 Part I: Auditory frequency selectivity Tuning

More information

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED Francisco J. Fraga, Alan M. Marotta National Institute of Telecommunications, Santa Rita do Sapucaí - MG, Brazil Abstract A considerable

More information

Auditory principles in speech processing do computers need silicon ears?

Auditory principles in speech processing do computers need silicon ears? * with contributions by V. Hohmann, M. Kleinschmidt, T. Brand, J. Nix, R. Beutelmann, and more members of our medical physics group Prof. Dr. rer.. nat. Dr. med. Birger Kollmeier* Auditory principles in

More information

Evaluation of Perceived Spatial Audio Quality

Evaluation of Perceived Spatial Audio Quality Evaluation of Perceived Spatial Audio Quality Jan BERG 1 School of Music, Luleå University of Technology Piteå, Sweden ABSTRACT 1 The increased use of audio applications capable of conveying enhanced spatial

More information

Performance Study of Objective Speech Quality Measurement for Modern Wireless-VoIP Communications

Performance Study of Objective Speech Quality Measurement for Modern Wireless-VoIP Communications Performance Study of Objective Speech Quality Measurement for Modern Wireless-VoIP Communications Tiago H. Falk and Wai-Yip Chan Abstract Wireless-VoIP communications introduce perceptual degradations

More information

CURRENTLY, the most accurate method for evaluating

CURRENTLY, the most accurate method for evaluating IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 16, NO. 1, JANUARY 2008 229 Evaluation of Objective Quality Measures for Speech Enhancement Yi Hu and Philipos C. Loizou, Senior Member,

More information

Noise-Robust Speech Recognition in a Car Environment Based on the Acoustic Features of Car Interior Noise

Noise-Robust Speech Recognition in a Car Environment Based on the Acoustic Features of Car Interior Noise 4 Special Issue Speech-Based Interfaces in Vehicles Research Report Noise-Robust Speech Recognition in a Car Environment Based on the Acoustic Features of Car Interior Noise Hiroyuki Hoshino Abstract This

More information

COM 12-C 039. STUDY GROUP 12 Original: English TELECOMMUNICATION STANDARDIZATION SECTOR. Question(s): 7/12 Geneva, January 2017 CONTRIBUTION

COM 12-C 039. STUDY GROUP 12 Original: English TELECOMMUNICATION STANDARDIZATION SECTOR. Question(s): 7/12 Geneva, January 2017 CONTRIBUTION INTERNATIONAL TELECOMMUNICATION UNION TELECOMMUNICATION STANDARDIZATION SECTOR STUDY PERIOD 2017-2020 STUDY GROUP 12 Original: English Question(s): 7/12 Geneva, 10-19 January 2017 Source: Title: Purpose:

More information

Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners

Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners Jan Rennies, Eugen Albertin, Stefan Goetze, and Jens-E. Appell Fraunhofer IDMT, Hearing, Speech and

More information

Auditory Scene Analysis

Auditory Scene Analysis 1 Auditory Scene Analysis Albert S. Bregman Department of Psychology McGill University 1205 Docteur Penfield Avenue Montreal, QC Canada H3A 1B1 E-mail: bregman@hebb.psych.mcgill.ca To appear in N.J. Smelzer

More information

LATERAL INHIBITION MECHANISM IN COMPUTATIONAL AUDITORY MODEL AND IT'S APPLICATION IN ROBUST SPEECH RECOGNITION

LATERAL INHIBITION MECHANISM IN COMPUTATIONAL AUDITORY MODEL AND IT'S APPLICATION IN ROBUST SPEECH RECOGNITION LATERAL INHIBITION MECHANISM IN COMPUTATIONAL AUDITORY MODEL AND IT'S APPLICATION IN ROBUST SPEECH RECOGNITION Lu Xugang Li Gang Wang Lip0 Nanyang Technological University, School of EEE, Workstation Resource

More information

HEARING AND PSYCHOACOUSTICS

HEARING AND PSYCHOACOUSTICS CHAPTER 2 HEARING AND PSYCHOACOUSTICS WITH LIDIA LEE I would like to lead off the specific audio discussions with a description of the audio receptor the ear. I believe it is always a good idea to understand

More information

Jitter, Shimmer, and Noise in Pathological Voice Quality Perception

Jitter, Shimmer, and Noise in Pathological Voice Quality Perception ISCA Archive VOQUAL'03, Geneva, August 27-29, 2003 Jitter, Shimmer, and Noise in Pathological Voice Quality Perception Jody Kreiman and Bruce R. Gerratt Division of Head and Neck Surgery, School of Medicine

More information

Issues faced by people with a Sensorineural Hearing Loss

Issues faced by people with a Sensorineural Hearing Loss Issues faced by people with a Sensorineural Hearing Loss Issues faced by people with a Sensorineural Hearing Loss 1. Decreased Audibility 2. Decreased Dynamic Range 3. Decreased Frequency Resolution 4.

More information

Infant Hearing Development: Translating Research Findings into Clinical Practice. Auditory Development. Overview

Infant Hearing Development: Translating Research Findings into Clinical Practice. Auditory Development. Overview Infant Hearing Development: Translating Research Findings into Clinical Practice Lori J. Leibold Department of Allied Health Sciences The University of North Carolina at Chapel Hill Auditory Development

More information

What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for

What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for What you re in for Speech processing schemes for cochlear implants Stuart Rosen Professor of Speech and Hearing Science Speech, Hearing and Phonetic Sciences Division of Psychology & Language Sciences

More information

Measurements that simulate human hearing psychoacoustics in ITU-T P.863

Measurements that simulate human hearing psychoacoustics in ITU-T P.863 Wireless Measurements that simulate human hearing psychoacoustics in ITU-T P.863 Jürgen Fälchle stock.adobe.com Even though telephony now utilizes only a small part of network resources, it remains a core

More information

EFFECTS OF TEMPORAL FINE STRUCTURE ON THE LOCALIZATION OF BROADBAND SOUNDS: POTENTIAL IMPLICATIONS FOR THE DESIGN OF SPATIAL AUDIO DISPLAYS

EFFECTS OF TEMPORAL FINE STRUCTURE ON THE LOCALIZATION OF BROADBAND SOUNDS: POTENTIAL IMPLICATIONS FOR THE DESIGN OF SPATIAL AUDIO DISPLAYS Proceedings of the 14 International Conference on Auditory Display, Paris, France June 24-27, 28 EFFECTS OF TEMPORAL FINE STRUCTURE ON THE LOCALIZATION OF BROADBAND SOUNDS: POTENTIAL IMPLICATIONS FOR THE

More information

Effects of speaker's and listener's environments on speech intelligibili annoyance. Author(s)Kubo, Rieko; Morikawa, Daisuke; Akag

Effects of speaker's and listener's environments on speech intelligibili annoyance. Author(s)Kubo, Rieko; Morikawa, Daisuke; Akag JAIST Reposi https://dspace.j Title Effects of speaker's and listener's environments on speech intelligibili annoyance Author(s)Kubo, Rieko; Morikawa, Daisuke; Akag Citation Inter-noise 2016: 171-176 Issue

More information

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds Hearing Lectures Acoustics of Speech and Hearing Week 2-10 Hearing 3: Auditory Filtering 1. Loudness of sinusoids mainly (see Web tutorial for more) 2. Pitch of sinusoids mainly (see Web tutorial for more)

More information

LIST OF FIGURES. Figure No. Title Page No. Fig. l. l Fig. l.2

LIST OF FIGURES. Figure No. Title Page No. Fig. l. l Fig. l.2 LIST OF FIGURES Figure No. Title Page No. Fig. l. l Fig. l.2 Fig. l.3 A generic perceptual audio coder. 3 Schematic diagram of the programme of the present work on perceptual audio coding. 18 Schematic

More information

Speech Compression for Noise-Corrupted Thai Dialects

Speech Compression for Noise-Corrupted Thai Dialects American Journal of Applied Sciences 9 (3): 278-282, 2012 ISSN 1546-9239 2012 Science Publications Speech Compression for Noise-Corrupted Thai Dialects Suphattharachai Chomphan Department of Electrical

More information

Psychometric Properties of the Mean Opinion Scale

Psychometric Properties of the Mean Opinion Scale Psychometric Properties of the Mean Opinion Scale James R. Lewis IBM Voice Systems 1555 Palm Beach Lakes Blvd. West Palm Beach, Florida jimlewis@us.ibm.com Abstract The Mean Opinion Scale (MOS) is a seven-item

More information

UvA-DARE (Digital Academic Repository) Perceptual evaluation of noise reduction in hearing aids Brons, I. Link to publication

UvA-DARE (Digital Academic Repository) Perceptual evaluation of noise reduction in hearing aids Brons, I. Link to publication UvA-DARE (Digital Academic Repository) Perceptual evaluation of noise reduction in hearing aids Brons, I. Link to publication Citation for published version (APA): Brons, I. (2013). Perceptual evaluation

More information

Perceptual Audio Evaluation

Perceptual Audio Evaluation Perceptual Audio Evaluation Theory, method and application Søren Bech & Nick Zacharov What are we talking about? Søren Bech Bang & Olufsen sbe@bang-olufsen.dk Nick Zacharov DELTA, SenseLab NVZ@delta.dk

More information

SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Telemeeting assessment

SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Telemeeting assessment I n t e r n a t i o n a l T e l e c o m m u n i c a t i o n U n i o n ITU-T P.1310 TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU (03/2017) SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS,

More information

Near-End Perception Enhancement using Dominant Frequency Extraction

Near-End Perception Enhancement using Dominant Frequency Extraction Near-End Perception Enhancement using Dominant Frequency Extraction Premananda B.S. 1,Manoj 2, Uma B.V. 3 1 Department of Telecommunication, R. V. College of Engineering, premanandabs@gmail.com 2 Department

More information

ITU-T. FG AVA TR Version 1.0 (10/2013) Part 3: Using audiovisual media A taxonomy of participation

ITU-T. FG AVA TR Version 1.0 (10/2013) Part 3: Using audiovisual media A taxonomy of participation International Telecommunication Union ITU-T TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU FG AVA TR Version 1.0 (10/2013) Focus Group on Audiovisual Media Accessibility Technical Report Part 3: Using

More information

ClaroTM Digital Perception ProcessingTM

ClaroTM Digital Perception ProcessingTM ClaroTM Digital Perception ProcessingTM Sound processing with a human perspective Introduction Signal processing in hearing aids has always been directed towards amplifying signals according to physical

More information

Speech recognition in noisy environments: A survey

Speech recognition in noisy environments: A survey T-61.182 Robustness in Language and Speech Processing Speech recognition in noisy environments: A survey Yifan Gong presented by Tapani Raiko Feb 20, 2003 About the Paper Article published in Speech Communication

More information

Congruency Effects with Dynamic Auditory Stimuli: Design Implications

Congruency Effects with Dynamic Auditory Stimuli: Design Implications Congruency Effects with Dynamic Auditory Stimuli: Design Implications Bruce N. Walker and Addie Ehrenstein Psychology Department Rice University 6100 Main Street Houston, TX 77005-1892 USA +1 (713) 527-8101

More information

FREQUENCY COMPOSITION : A NEW APPROACH TO FREQUENCY- LOWERING

FREQUENCY COMPOSITION : A NEW APPROACH TO FREQUENCY- LOWERING FREQUENCY COMPOSITION : A NEW APPROACH TO FREQUENCY- LOWERING -lowering has come a long way. Over time, the technique has evolved from a controversial feature to one that is gaining more and more acceptance.

More information

Institute of Communication Acoustics Chair of Information Technology. Ruhr-Universität Bochum

Institute of Communication Acoustics Chair of Information Technology. Ruhr-Universität Bochum Institute of Communication Acoustics Chair of Information Technology Ruhr-Universität Bochum Institute of Communication Acoustics Institute of Communication Acoustics Ruhr-Universität Bochum 2 Institute

More information

Keywords. 1. Introduction. Peter POČTA 1, Jan HOLUB 2, Helena VLČKOVÁ 1, Zuzana POLKOVÁ 1.

Keywords. 1. Introduction. Peter POČTA 1, Jan HOLUB 2, Helena VLČKOVÁ 1, Zuzana POLKOVÁ 1. RADIOENGINEERING, VOL. 19, NO. 1, APRIL 2010 79 Impact of Different Active-Speech-Ratios on PESQ s Predictions in Case of Independent and Dependent Losses (in Presence of Receiver-Side Comfort-Noise) Peter

More information

Binaural Hearing. Steve Colburn Boston University

Binaural Hearing. Steve Colburn Boston University Binaural Hearing Steve Colburn Boston University Outline Why do we (and many other animals) have two ears? What are the major advantages? What is the observed behavior? How do we accomplish this physiologically?

More information

HearIntelligence by HANSATON. Intelligent hearing means natural hearing.

HearIntelligence by HANSATON. Intelligent hearing means natural hearing. HearIntelligence by HANSATON. HearIntelligence by HANSATON. Intelligent hearing means natural hearing. Acoustic environments are complex. We are surrounded by a variety of different acoustic signals, speech

More information

Implementation of Spectral Maxima Sound processing for cochlear. implants by using Bark scale Frequency band partition

Implementation of Spectral Maxima Sound processing for cochlear. implants by using Bark scale Frequency band partition Implementation of Spectral Maxima Sound processing for cochlear implants by using Bark scale Frequency band partition Han xianhua 1 Nie Kaibao 1 1 Department of Information Science and Engineering, Shandong

More information

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair

Essential feature. Who are cochlear implants for? People with little or no hearing. substitute for faulty or missing inner hair Who are cochlear implants for? Essential feature People with little or no hearing and little conductive component to the loss who receive little or no benefit from a hearing aid. Implants seem to work

More information

Audiogram+: The ReSound Proprietary Fitting Algorithm

Audiogram+: The ReSound Proprietary Fitting Algorithm Abstract Hearing instruments should provide end-users with access to undistorted acoustic information to the degree possible. The Resound compression system uses state-of-the art technology and carefully

More information

Discrete Signal Processing

Discrete Signal Processing 1 Discrete Signal Processing C.M. Liu Perceptual Lab, College of Computer Science National Chiao-Tung University http://www.cs.nctu.edu.tw/~cmliu/courses/dsp/ ( Office: EC538 (03)5731877 cmliu@cs.nctu.edu.tw

More information

Binaural Hearing. Why two ears? Definitions

Binaural Hearing. Why two ears? Definitions Binaural Hearing Why two ears? Locating sounds in space: acuity is poorer than in vision by up to two orders of magnitude, but extends in all directions. Role in alerting and orienting? Separating sound

More information

HCS 7367 Speech Perception

HCS 7367 Speech Perception Babies 'cry in mother's tongue' HCS 7367 Speech Perception Dr. Peter Assmann Fall 212 Babies' cries imitate their mother tongue as early as three days old German researchers say babies begin to pick up

More information

Individualizing Noise Reduction in Hearing Aids

Individualizing Noise Reduction in Hearing Aids Individualizing Noise Reduction in Hearing Aids Tjeerd Dijkstra, Adriana Birlutiu, Perry Groot and Tom Heskes Radboud University Nijmegen Rolph Houben Academic Medical Center Amsterdam Why optimize noise

More information

EEL 6586, Project - Hearing Aids algorithms

EEL 6586, Project - Hearing Aids algorithms EEL 6586, Project - Hearing Aids algorithms 1 Yan Yang, Jiang Lu, and Ming Xue I. PROBLEM STATEMENT We studied hearing loss algorithms in this project. As the conductive hearing loss is due to sound conducting

More information

ReSound NoiseTracker II

ReSound NoiseTracker II Abstract The main complaint of hearing instrument wearers continues to be hearing in noise. While directional microphone technology exploits spatial separation of signal from noise to improve listening

More information

Characterizing individual hearing loss using narrow-band loudness compensation

Characterizing individual hearing loss using narrow-band loudness compensation Characterizing individual hearing loss using narrow-band loudness compensation DIRK OETTING 1,2,*, JENS-E. APPELL 1, VOLKER HOHMANN 2, AND STEPHAN D. EWERT 2 1 Project Group Hearing, Speech and Audio Technology

More information

Hearing in the Environment

Hearing in the Environment 10 Hearing in the Environment Click Chapter to edit 10 Master Hearing title in the style Environment Sound Localization Complex Sounds Auditory Scene Analysis Continuity and Restoration Effects Auditory

More information

Slow compression for people with severe to profound hearing loss

Slow compression for people with severe to profound hearing loss Phonak Insight February 2018 Slow compression for people with severe to profound hearing loss For people with severe to profound hearing loss, poor auditory resolution abilities can make the spectral and

More information

WIDEXPRESS. no.30. Background

WIDEXPRESS. no.30. Background WIDEXPRESS no. january 12 By Marie Sonne Kristensen Petri Korhonen Using the WidexLink technology to improve speech perception Background For most hearing aid users, the primary motivation for using hearing

More information

Categorical Perception

Categorical Perception Categorical Perception Discrimination for some speech contrasts is poor within phonetic categories and good between categories. Unusual, not found for most perceptual contrasts. Influenced by task, expectations,

More information

INTEGRATING THE COCHLEA S COMPRESSIVE NONLINEARITY IN THE BAYESIAN APPROACH FOR SPEECH ENHANCEMENT

INTEGRATING THE COCHLEA S COMPRESSIVE NONLINEARITY IN THE BAYESIAN APPROACH FOR SPEECH ENHANCEMENT INTEGRATING THE COCHLEA S COMPRESSIVE NONLINEARITY IN THE BAYESIAN APPROACH FOR SPEECH ENHANCEMENT Eric Plourde and Benoît Champagne Department of Electrical and Computer Engineering, McGill University

More information

Annoyance and Hearing Aids Donald J. Schum, Ph.D. Vice President of Audiology & Professional Relations for Oticon, Inc

Annoyance and Hearing Aids Donald J. Schum, Ph.D. Vice President of Audiology & Professional Relations for Oticon, Inc Annoyance and Hearing Aids Donald J. Schum, Ph.D. Vice President of Audiology & Professional Relations for Oticon, Inc Abstract: Listening to amplified speech through hearing aids all day can lead to fatigue

More information

SUPPLEMENTARY INFORMATION. Table 1 Patient characteristics Preoperative. language testing

SUPPLEMENTARY INFORMATION. Table 1 Patient characteristics Preoperative. language testing Categorical Speech Representation in the Human Superior Temporal Gyrus Edward F. Chang, Jochem W. Rieger, Keith D. Johnson, Mitchel S. Berger, Nicholas M. Barbaro, Robert T. Knight SUPPLEMENTARY INFORMATION

More information

Instrumental and Perceptual Evaluation of Hearing Devices Methods and Applications

Instrumental and Perceptual Evaluation of Hearing Devices Methods and Applications Instrumental and Perceptual Evaluation of Hearing Devices Methods and Applications Christoph Völker Disputation Oldenburg, 26.8.2016 Evaluation of Hearing Devices Table of Contents 1) Introduction 2) Comparing

More information

SOUND QUALITY EVALUATION OF INTERIOR VEHICLE NOISE USING AN EFFICIENT PSYCHOACOUSTIC METHOD

SOUND QUALITY EVALUATION OF INTERIOR VEHICLE NOISE USING AN EFFICIENT PSYCHOACOUSTIC METHOD SOUND QUALITY EVALUATION OF INTERIOR VEHICLE NOISE USING AN EFFICIENT PSYCHOACOUSTIC METHOD Markus Bodden 1, Ralf Heinrichs 2, Angela Linow 2 1 Ingenieurbüro Dr. Bodden Herthastr. 29 D-45131 Essen, Germany

More information

Study of perceptual balance for binaural dichotic presentation

Study of perceptual balance for binaural dichotic presentation Paper No. 556 Proceedings of 20 th International Congress on Acoustics, ICA 2010 23-27 August 2010, Sydney, Australia Study of perceptual balance for binaural dichotic presentation Pandurangarao N. Kulkarni

More information

Speech Enhancement Using Deep Neural Network

Speech Enhancement Using Deep Neural Network Speech Enhancement Using Deep Neural Network Pallavi D. Bhamre 1, Hemangi H. Kulkarni 2 1 Post-graduate Student, Department of Electronics and Telecommunication, R. H. Sapat College of Engineering, Management

More information

Juan Carlos Tejero-Calado 1, Janet C. Rutledge 2, and Peggy B. Nelson 3

Juan Carlos Tejero-Calado 1, Janet C. Rutledge 2, and Peggy B. Nelson 3 PRESERVING SPECTRAL CONTRAST IN AMPLITUDE COMPRESSION FOR HEARING AIDS Juan Carlos Tejero-Calado 1, Janet C. Rutledge 2, and Peggy B. Nelson 3 1 University of Malaga, Campus de Teatinos-Complejo Tecnol

More information

PESQ and 3SQM measurement of voice quality over live 3G networks

PESQ and 3SQM measurement of voice quality over live 3G networks PESQ and 3SQM measurement of voice quality over live 3G networks Mohammad Goudarzi, Lingfen Sun, Emmanuel Ifeachor Centre of Signal Processing and Multimedia Communications School of Computing, Communications

More information

Combating the Reverberation Problem

Combating the Reverberation Problem Combating the Reverberation Problem Barbara Shinn-Cunningham (Boston University) Martin Cooke (Sheffield University, U.K.) How speech is corrupted by reverberation DeLiang Wang (Ohio State University)

More information

! Can hear whistle? ! Where are we on course map? ! What we did in lab last week. ! Psychoacoustics

! Can hear whistle? ! Where are we on course map? ! What we did in lab last week. ! Psychoacoustics 2/14/18 Can hear whistle? Lecture 5 Psychoacoustics Based on slides 2009--2018 DeHon, Koditschek Additional Material 2014 Farmer 1 2 There are sounds we cannot hear Depends on frequency Where are we on

More information

Hearing. Figure 1. The human ear (from Kessel and Kardon, 1979)

Hearing. Figure 1. The human ear (from Kessel and Kardon, 1979) Hearing The nervous system s cognitive response to sound stimuli is known as psychoacoustics: it is partly acoustics and partly psychology. Hearing is a feature resulting from our physiology that we tend

More information

Ambiguity in the recognition of phonetic vowels when using a bone conduction microphone

Ambiguity in the recognition of phonetic vowels when using a bone conduction microphone Acoustics 8 Paris Ambiguity in the recognition of phonetic vowels when using a bone conduction microphone V. Zimpfer a and K. Buck b a ISL, 5 rue du Général Cassagnou BP 734, 6831 Saint Louis, France b

More information

Hearing the Universal Language: Music and Cochlear Implants

Hearing the Universal Language: Music and Cochlear Implants Hearing the Universal Language: Music and Cochlear Implants Professor Hugh McDermott Deputy Director (Research) The Bionics Institute of Australia, Professorial Fellow The University of Melbourne Overview?

More information

Analysis of Speech Recognition Techniques for use in a Non-Speech Sound Recognition System

Analysis of Speech Recognition Techniques for use in a Non-Speech Sound Recognition System Analysis of Recognition Techniques for use in a Sound Recognition System Michael Cowling, Member, IEEE and Renate Sitte, Member, IEEE Griffith University Faculty of Engineering & Information Technology

More information

BINAURAL DICHOTIC PRESENTATION FOR MODERATE BILATERAL SENSORINEURAL HEARING-IMPAIRED

BINAURAL DICHOTIC PRESENTATION FOR MODERATE BILATERAL SENSORINEURAL HEARING-IMPAIRED International Conference on Systemics, Cybernetics and Informatics, February 12 15, 2004 BINAURAL DICHOTIC PRESENTATION FOR MODERATE BILATERAL SENSORINEURAL HEARING-IMPAIRED Alice N. Cheeran Biomedical

More information

Codebook driven short-term predictor parameter estimation for speech enhancement

Codebook driven short-term predictor parameter estimation for speech enhancement Codebook driven short-term predictor parameter estimation for speech enhancement Citation for published version (APA): Srinivasan, S., Samuelsson, J., & Kleijn, W. B. (2006). Codebook driven short-term

More information

Performance Comparison of Speech Enhancement Algorithms Using Different Parameters

Performance Comparison of Speech Enhancement Algorithms Using Different Parameters Performance Comparison of Speech Enhancement Algorithms Using Different Parameters Ambalika, Er. Sonia Saini Abstract In speech communication system, background noise degrades the information or speech

More information

Assistive Listening Technology: in the workplace and on campus

Assistive Listening Technology: in the workplace and on campus Assistive Listening Technology: in the workplace and on campus Jeremy Brassington Tuesday, 11 July 2017 Why is it hard to hear in noisy rooms? Distance from sound source Background noise continuous and

More information

Advanced Audio Interface for Phonetic Speech. Recognition in a High Noise Environment

Advanced Audio Interface for Phonetic Speech. Recognition in a High Noise Environment DISTRIBUTION STATEMENT A Approved for Public Release Distribution Unlimited Advanced Audio Interface for Phonetic Speech Recognition in a High Noise Environment SBIR 99.1 TOPIC AF99-1Q3 PHASE I SUMMARY

More information

Digital hearing aids are still

Digital hearing aids are still Testing Digital Hearing Instruments: The Basics Tips and advice for testing and fitting DSP hearing instruments Unfortunately, the conception that DSP instruments cannot be properly tested has been projected

More information

Acoustics, signals & systems for audiology. Psychoacoustics of hearing impairment

Acoustics, signals & systems for audiology. Psychoacoustics of hearing impairment Acoustics, signals & systems for audiology Psychoacoustics of hearing impairment Three main types of hearing impairment Conductive Sound is not properly transmitted from the outer to the inner ear Sensorineural

More information

Voice Detection using Speech Energy Maximization and Silence Feature Normalization

Voice Detection using Speech Energy Maximization and Silence Feature Normalization , pp.25-29 http://dx.doi.org/10.14257/astl.2014.49.06 Voice Detection using Speech Energy Maximization and Silence Feature Normalization In-Sung Han 1 and Chan-Shik Ahn 2 1 Dept. of The 2nd R&D Institute,

More information

Who are cochlear implants for?

Who are cochlear implants for? Who are cochlear implants for? People with little or no hearing and little conductive component to the loss who receive little or no benefit from a hearing aid. Implants seem to work best in adults who

More information

The role of periodicity in the perception of masked speech with simulated and real cochlear implants

The role of periodicity in the perception of masked speech with simulated and real cochlear implants The role of periodicity in the perception of masked speech with simulated and real cochlear implants Kurt Steinmetzger and Stuart Rosen UCL Speech, Hearing and Phonetic Sciences Heidelberg, 09. November

More information

Sound Preference Development and Correlation to Service Incidence Rate

Sound Preference Development and Correlation to Service Incidence Rate Sound Preference Development and Correlation to Service Incidence Rate Terry Hardesty a) Sub-Zero, 4717 Hammersley Rd., Madison, WI, 53711, United States Eric Frank b) Todd Freeman c) Gabriella Cerrato

More information

Auditory scene analysis in humans: Implications for computational implementations.

Auditory scene analysis in humans: Implications for computational implementations. Auditory scene analysis in humans: Implications for computational implementations. Albert S. Bregman McGill University Introduction. The scene analysis problem. Two dimensions of grouping. Recognition

More information

ELL 788 Computational Perception & Cognition July November 2015

ELL 788 Computational Perception & Cognition July November 2015 ELL 788 Computational Perception & Cognition July November 2015 Module 8 Audio and Multimodal Attention Audio Scene Analysis Two-stage process Segmentation: decomposition to time-frequency segments Grouping

More information

Local Image Structures and Optic Flow Estimation

Local Image Structures and Optic Flow Estimation Local Image Structures and Optic Flow Estimation Sinan KALKAN 1, Dirk Calow 2, Florentin Wörgötter 1, Markus Lappe 2 and Norbert Krüger 3 1 Computational Neuroscience, Uni. of Stirling, Scotland; {sinan,worgott}@cn.stir.ac.uk

More information

Lecture 3: Perception

Lecture 3: Perception ELEN E4896 MUSIC SIGNAL PROCESSING Lecture 3: Perception 1. Ear Physiology 2. Auditory Psychophysics 3. Pitch Perception 4. Music Perception Dan Ellis Dept. Electrical Engineering, Columbia University

More information

The role of low frequency components in median plane localization

The role of low frequency components in median plane localization Acoust. Sci. & Tech. 24, 2 (23) PAPER The role of low components in median plane localization Masayuki Morimoto 1;, Motoki Yairi 1, Kazuhiro Iida 2 and Motokuni Itoh 1 1 Environmental Acoustics Laboratory,

More information

Topic 4. Pitch & Frequency

Topic 4. Pitch & Frequency Topic 4 Pitch & Frequency A musical interlude KOMBU This solo by Kaigal-ool of Huun-Huur-Tu (accompanying himself on doshpuluur) demonstrates perfectly the characteristic sound of the Xorekteer voice An

More information

New measurement methods for signal adaptive hearing aids

New measurement methods for signal adaptive hearing aids SIEMENS Josef Chalupper 1 New measurement methods for signal adaptive hearing aids Josef Chalupper Siemens Audiologische Technik, Erlangen SIEMENS Josef Chalupper 2 Motivation modern hearing aids signal

More information

How high-frequency do children hear?

How high-frequency do children hear? How high-frequency do children hear? Mari UEDA 1 ; Kaoru ASHIHARA 2 ; Hironobu TAKAHASHI 2 1 Kyushu University, Japan 2 National Institute of Advanced Industrial Science and Technology, Japan ABSTRACT

More information

Research Article Performance Study of Objective Speech Quality Measurement for Modern Wireless-VoIP Communications

Research Article Performance Study of Objective Speech Quality Measurement for Modern Wireless-VoIP Communications Hindawi Publishing Corporation EURASIP Journal on Audio, Speech, and Music Processing Volume 9, Article ID 438, pages doi:.55/9/438 Research Article Performance Study of Objective Speech Quality Measurement

More information

Linguistic Phonetics Fall 2005

Linguistic Phonetics Fall 2005 MIT OpenCourseWare http://ocw.mit.edu 24.963 Linguistic Phonetics Fall 2005 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 24.963 Linguistic Phonetics

More information

Healthy Organ of Corti. Loss of OHCs. How to use and interpret the TEN(HL) test for diagnosis of Dead Regions in the cochlea

Healthy Organ of Corti. Loss of OHCs. How to use and interpret the TEN(HL) test for diagnosis of Dead Regions in the cochlea 'How we do it' Healthy Organ of Corti How to use and interpret the TEN(HL) test for diagnosis of s in the cochlea Karolina Kluk¹ Brian C.J. Moore² Mouse IHCs OHCs ¹ Audiology and Deafness Research Group,

More information

2/16/2012. Fitting Current Amplification Technology on Infants and Children. Preselection Issues & Procedures

2/16/2012. Fitting Current Amplification Technology on Infants and Children. Preselection Issues & Procedures Fitting Current Amplification Technology on Infants and Children Cindy Hogan, Ph.D./Doug Sladen, Ph.D. Mayo Clinic Rochester, Minnesota hogan.cynthia@mayo.edu sladen.douglas@mayo.edu AAA Pediatric Amplification

More information