Speech Intelligibility Measurements in Auditorium

Similar documents
Ambiguity in the recognition of phonetic vowels when using a bone conduction microphone

HCS 7367 Speech Perception

Vol. 125 (2014) ACTA PHYSICA POLONICA A No. 4-A

Communication quality for students with a hearing impairment: An experiment evaluating speech intelligibility and annoyance

IMPACT OF ACOUSTIC CONDITIONS IN CLASSROOM ON LEARNING OF FOREIGN LANGUAGE PRONUNCIATION

Frequency refers to how often something happens. Period refers to the time it takes something to happen.

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED

HCS 7367 Speech Perception

Acoustical Quality Assessment of Lecture halls at Lund University, Sweden

The development of a modified spectral ripple test

Technical Discussion HUSHCORE Acoustical Products & Systems

Proceedings of Meetings on Acoustics

Issues faced by people with a Sensorineural Hearing Loss

Speech intelligibility in simulated acoustic conditions for normal hearing and hearing-impaired listeners

Speech Cue Weighting in Fricative Consonant Perception in Hearing Impaired Children

Hearing. Juan P Bello

Providing Effective Communication Access

IS THERE A STARTING POINT IN THE NOISE LEVEL FOR THE LOMBARD EFFECT?

Although considerable work has been conducted on the speech

A case study of construction noise exposure for preserving worker s hearing in Egypt

Consonant Perception test

A. SEK, E. SKRODZKA, E. OZIMEK and A. WICHER

Best Practice Protocols

Classroom acoustics for vocal health of elementary school teachers

Jitter, Shimmer, and Noise in Pathological Voice Quality Perception

Hearing Lectures. Acoustics of Speech and Hearing. Auditory Lighthouse. Facts about Timbre. Analysis of Complex Sounds

CLASSROOM AMPLIFICATION: WHO CARES? AND WHY SHOULD WE? James Blair and Jeffery Larsen Utah State University ASHA, San Diego, 2011

Perceptual Effects of Nasal Cue Modification

Systems for Improvement of the Communication in Passenger Compartments

UNIVERSITY OF MILAN'S AUDITORIA: ANALYSIS AND COMPARISON

Combating the Reverberation Problem

Rating Airborne Sound Insulation in Terms of the Annoyance. and Loudness of Transmitted Speech and Music Sounds

Reverberation Time. Jason A. Galster. Dissertation. Submitted to the Faculty of the. Graduate School of Vanderbilt University

Intelligibility of narrow-band speech and its relation to auditory functions in hearing-impaired listeners

Speech Privacy Systems

What Is the Difference between db HL and db SPL?

Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners

The effect of binaural processing techniques on speech quality ratings of assistive listening devices in different room acoustics conditions

Effects of Aircraft Noise on Student Learning

Combination of Bone-Conducted Speech with Air-Conducted Speech Changing Cut-Off Frequency

How the LIVELab helped us evaluate the benefits of SpeechPro with Spatial Awareness

Background noise level to determine the speech privacy in open plan offices

Eliminating Acoustical Barriers to Learning in Classrooms

THE MECHANICS OF HEARING

Overview. Acoustics of Speech and Hearing. Source-Filter Model. Source-Filter Model. Turbulence Take 2. Turbulence

Assistive Listening Technology: in the workplace and on campus

Wireless Technology - Improving Signal to Noise Ratio for Children in Challenging Situations

Sound localization psychophysics

Four important facts:

Psychoacoustical Models WS 2016/17

Impact of the ambient sound level on the system's measurements CAPA

Contents THINK ACOUSTICS FIRST NOT LAST WHO BENEFITS FROM IMPROVED ACOUSTICS?

Percentiles Study Group. Elk test for percentile analysis

BINAURAL DICHOTIC PRESENTATION FOR MODERATE BILATERAL SENSORINEURAL HEARING-IMPAIRED

Changes in the Role of Intensity as a Cue for Fricative Categorisation

Effects of noise and filtering on the intelligibility of speech produced during simultaneous communication

Noise-Robust Speech Recognition in a Car Environment Based on the Acoustic Features of Car Interior Noise

SPEECH PERCEPTION IN A 3-D WORLD

Speech Spectra and Spectrograms

CHAPTER 1 INTRODUCTION

Evidence base for hearing aid features:

Effects of speaker's and listener's environments on speech intelligibili annoyance. Author(s)Kubo, Rieko; Morikawa, Daisuke; Akag

The Use of a High Frequency Emphasis Microphone for Musicians Published on Monday, 09 February :50

The effect of wearing conventional and level-dependent hearing protectors on speech production in noise and quiet

Trading Directional Accuracy for Realism in a Virtual Auditory Display

Masked Perception Thresholds of Low Frequency Tones Under Background Noises and Their Estimation by Loudness Model

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Pitch & Binaural listening

Two Modified IEC Ear Simulators for Extended Dynamic Range

Wolfgang Probst DataKustik GmbH, Gilching, Germany. Michael Böhm DataKustik GmbH, Gilching, Germany.

Elements of Effective Hearing Aid Performance (2004) Edgar Villchur Feb 2004 HearingOnline

Juan Carlos Tejero-Calado 1, Janet C. Rutledge 2, and Peggy B. Nelson 3

A field study of school teachers noise exposure, teachers speech levels and duration of speech during classroom teaching

! Can hear whistle? ! Where are we on course map? ! What we did in lab last week. ! Psychoacoustics

Linguistic Phonetics Fall 2005

Hearing. and other senses

Review of Methods for Quantifying Tonalness in Noise. Quantifying Tonalness

Effects of sound masking on workers a case study in a landscaped office

Speech segregation in rooms: Effects of reverberation on both target and interferer

1706 J. Acoust. Soc. Am. 113 (3), March /2003/113(3)/1706/12/$ Acoustical Society of America

Speech (Sound) Processing

Computational Perception /785. Auditory Scene Analysis

Speech Recognition in Noise for Hearing- Impaired Subjects : Effects of an Adaptive Filter Hearing Aid

Survey on sound environment in classrooms during school hours for hearing impaired students

ACOUSTIC ANALYSIS AND PERCEPTION OF CANTONESE VOWELS PRODUCED BY PROFOUNDLY HEARING IMPAIRED ADOLESCENTS

Challenges in microphone array processing for hearing aids. Volkmar Hamacher Siemens Audiological Engineering Group Erlangen, Germany

Proceedings of Meetings on Acoustics

UvA-DARE (Digital Academic Repository) Perceptual evaluation of noise reduction in hearing aids Brons, I. Link to publication

The role of low frequency components in median plane localization

ACOUSTIC AND PERCEPTUAL PROPERTIES OF ENGLISH FRICATIVES

Linguistic Phonetics. Basic Audition. Diagram of the inner ear removed due to copyright restrictions.

NOISE SURVEY. EQUIPMENT: Sound Level Meter Octave Band Analyzer Dosimeters Sound Calibrator Survey Sheets Tape measure SP 10

Evidence based selection of hearing aids and features

Power Instruments, Power sources: Trends and Drivers. Steve Armstrong September 2015

CLASSROOM HEARING AND SOUND ENHANCEMENT. A White Paper of Audio Enhancement Co.

HOW TO USE THE SHURE MXA910 CEILING ARRAY MICROPHONE FOR VOICE LIFT

Colin Cobbing ARM Acoustics

Speech Recognition Experiment in Natural Quiet Background Noise

CONTRIBUTION OF DIRECTIONAL ENERGY COMPONENTS OF LATE SOUND TO LISTENER ENVELOPMENT

WIDEXPRESS. no.30. Background

A Basic Study on possibility to improve stage acoustics by active method

Transcription:

Vol. 118 (2010) ACTA PHYSICA POLONICA A No. 1 Acoustic and Biomedical Engineering Speech Intelligibility Measurements in Auditorium K. Leo Faculty of Physics and Applied Mathematics, Technical University of Gdańsk G. Narutowicza 27/12, 80-233 Gdańsk, Poland Speech intelligibility was measured in Auditorium Novum on Technical University of Gdansk (seating capacity 408, volume 3300 m 3 ). Articulation tests were conducted; STI and Early Decay Time EDT coefficients were measured. Negative noise contribution to speech intelligibility was taken into account. Subjective measurements and objective tests reveal high speech intelligibility at most seats in auditorium. Correlation was found between spatial differences in responses to articulation tests and EDT for low frequencies divided by EDT for high frequencies. PACS numbers: 43.55 Hy 1. Introduction 1.1. Acoustical parameters in auditorium During the process of building Auditorium Novum at Technical University of Gdańsk, concern of acoustical quality of the hall arouse. Taking the primary function of auditorium into account, high speech intelligibility was judged to be the most desirable parameter through the audience area in the hall [1]. To achieve high speech intelligibility in auditorium, a number of acoustic parameters have to be considered, for instance, reverberation time RT60 should have the optimum length, starting from minimum value 0.7 s for room of volume 283 m 3. Then it can linearly increase with the logarithm of room volume up to 1.1 s with deviations less than 10% [2]. This value prevents masking of speech sounds by late reflections [3]. On the other hand, it supports enough sound energy through the hall. Late reflections are not only important in speech intelligibility process but also much attention has to be paid to early reflections, as they play crucial role in the process of transporting intelligible speech from the speaker to listeners. 1.2. Mechanism of loss of speech intelligibility in relation to early reflections The nature of speech sounds determines the mechanism of loss of speech intelligibility in enclosed space. Vowels and consonants convey different sound energy. The average level of consonants is 10 12 db lower than the level of vowels. Thus, consonants can be easily masked by vowels in reverberant space. Moreover, the most energy of consonants is at higher frequencies, vowels are represented at lower and middle frequency bands [5]. Dividing audible spectrum into 10 equal intelligibility parts, one can see that frequencies up to around corresponding author; e-mail: krzysztof.leo@gmail.com Fig. 1. Three sound decay curves for different rooms with different RT60 length. Although values of RT60 differ significantly, the rooms share the same shape of decay curve for early reflections. Speech intelligibility is equal in each of these rooms [4]. 250 Hz and above 7.5 khz do not add to intelligibility of speech [6]. Sounds of frequencies below 250 Hz act just as maskers for higher frequencies speech sounds. Finally, when comparing average length of vowels and consonants, consonants are shorter than vowels, i.e. 20 ms compared to 90 ms. Vowels are stationary signals, with strong harmonic content, consonants are often non- -stationary in character (e.g. plosives), rather without harmonic content, sometimes with strong noise content (e.g. fricatives). To prevent the masking process of consonants by vowels, enough sound energy from the speaker to listeners has to be transported in a time range corresponding to average length of speech sound. Otherwise, the listener will hear elements of speech overlapping each other, which will deteriorate intelligibility. This critical time interval was found in the area of around 50 ms [5]. This is already known process of masking elements of speech sounds by other elements of speech. What should be moreover considered in a reverberant space? Is there an additional mechanism of deteriorating speech intelli- (110)

Speech Intelligibility Measurements in Auditorium 111 Fig. 2. Sound level vs. time for speech phrase at low mid and high frequencies. Solid line in front of the speaker, dotted line behind speaker [5]. gibility by early reflections? When strong early reflections are decaying, they are interfering with direct sound. Speech signal becomes distorted by discrete, delayed signals. This process is likely dependent on the relative length of the speech sounds and decay of early reflections. High frequency speech sounds are more likely to be misunderstood in conditions of given EDT than lower speech sounds. When high frequency elements of speech are being distorted by early reflections, lower frequency sounds remain still distinctive to the listener. This is mainly due to the fact of non-stationary character of consonants comparing to the stationary vowels. Thus speech intelligibility is likely to be dependent on the shape of EDT curve. The slope of EDT in the direction of higher frequencies curve should correspond to the better speech intelligibility. Flat EDT curve should mean poorer intelligibility. 1.3. Speech Transmission Index (STI) in relation to the results of articulation tests In order to check validity of articulation tests, additional STI measurements were performed (see point 3.2). Figure 3 compares various types of intelligibility tests in relation to the STI coefficient (data taken from the literature). These results were obtained for the hall with listeners, for many languages. On the Fig. 3 one can see marked values of the STI with corresponding with intelligibility scores. This area is in special interest of this work. 2. Method 2.1. Measurements Articulation tests in which 101 participants (aged 22 24) took part, were carried. Listeners were asked to cover the whole seating area uniformly. Two syllabic lists, consisting of 100 nonsense elements (logatoms) were read. The lists were balanced for Polish language. First list was Fig. 3. Typical relations between the STI and intelligibility scores for various types of tests, among them is logatoms test [7]. Range of STI values measured in a hall and corresponding intelligibility scores is marked. read without air-conditioning noise, the second one with fans being switched on. Objective measurement system consisted of sound source a gunshot, omni directional measurement microphone, sound pressure level meter and PC with sound card. Measurements were evaluated without audience. Eighteen measuring points were determined. 2.2. Analysis Three impulse responses from each point were added in time domain and averaged output impulse response was analyzed. STI values were calculated and RT60 and EDT in octave bands from impulse responses were obtained. EDT was measured for early decay, from 0 db to 10 db (±3 db). 3. Results 3.1. Articulation test Articulation tests revealed high speech intelligibility in the hall. Average intelligibility is 88.5% in the absence of disturbing noise from air-conditioning system and 80.8% in presence of noise. Figures 4 and 5 present spatial differences of intelligibility. Numbers indicate Percentage Speech Articulation (PSA). Contours of equal speech intelligibility were drawn in order to show areas of better and poorer speech intelligibility. Averaging of 3 values of PSA in the radius of approximately 2 m from each measurement point was done (p1 to p18 indication on figures). Values written under measurement point are three PSA averaged. Averaged PSA values in different seats in the hall are shown in the Table I. Differences of speech intelligibility through the hall were not very large. Maximum difference between average responses in best intelligibility rated place (p6) and worse place (p18) is 8.4% (noise OFF) and 9.7% (noise ON). Level and spectrum of noise in the hall was measured in the absence of audience in the middle of the hall (p2). Figure 6 shows spectrum of background noise in

112 K. Leo TABLE I STI values and corresponding average Percentage Speech Articulation, NOISE OFF. Fig. 4. left noise OFF, Fig. 5 right noise ON. Speech intelligibility equal contours and eighteen measurement points with averaged PSA around sixteen of them. Points on gray background are six survey points shown on EDT plots. Letter S marks position of the sound source. the presence and absence of disturbing noise from air- -conditioning system. There are large differences in level of noise at low frequencies. Measuring point STI Averaged PSA [%] 1 0.54 2 0.52 88 3 0.56 88 4 0.57 84 5 0.55 92.7 6 0.51 92.7 7 0.6 90 8 0.6 91 9 0.58 91.3 10 0.58 88.7 11 0.57 87 12 0.55 88 13 0.56 89.3 14 0.55 15 0.62 91.7 16 0.61 90.2 17 0.59 89 18 0.59 84.3 in Fig. 3. However this region is not far below the average speech intelligibility measured by articulation test 88.5%. 3.3. RT60 curves Fig. 6. Spectrum of background noise in presence and absence of air-conditioning noise. The total level of noise with air-conditioning off was: 33.9 dba, 30.1 dba, 30.3 dba at points 1, 2, 3 and with air-conditioning on was: 45.4 dba, 44.8 dba 43.7 dba. There were not large differences between levels of background noise noted through the hall, apart from 2 4 db rise in the front of the hall (p1). This is probably caused by part of air-conditioning system located over the stage always being switched on. RT60 values do not vary significantly through hall. Figure 7 shows RT60 from six measurement points in octave bands. RT60 value shows more discrepancies in lower bands (63 Hz and 125 Hz) bands than in higher bands. 3.2. STI values STI values calculated from impulse responses of the hall do not depend on the speech intelligibility scores measured in the hall. Table shows STI coefficients measured from averaged impulse responses in time domain to compare them with averaged PSA. Not all values of STI correspond to values of averaged PSA in the way shown on the Fig. 3. STI is not enough precise to measure small differences of PSA. Region of STI marked by minimum and maximum values 0.51 and 0.57 correlates with the area 73% 88% of PSA Fig. 7. RT60 curves for 6 measurement points. Points indicated with gray background and indicators p1, p2, p3, p4, p5 and p6 on the Fig. 4. 3.4. EDT curves EDT values for octave bands were obtained in order to find better range of differences in that parameter. Values

Speech Intelligibility Measurements in Auditorium 113 are plotted in Fig. 9 and in the condition of absence of noise are plotted in Fig. 10. Fig. 8. EDT values for 4 measurement points. Points indicated with indicators p4, p6, p9, p16 on the Fig. 4. of EDT in octave bands show differences, when different measuring points are compared. Figure 8 shows that in seats with poorer intelligibility (point 4, 18) EDT curves are falling slower to the direction of higher frequencies. In points with better intelligibility (point 6, 9) EDT curves are falling more rapidly towards higher frequencies. To measure the ratio of EDT slope towards high frequencies, a coefficient EDT s (for EDT slope) was proposed. (EDT s = EDT for 63 Hz plus EDT for 125 Hz divided by EDT for 4 khz plus EDT for 8 khz). Values of EDT s with corresponding averaged PSA are shown in Table II. TABLE II EDT s coefficient with corresponding, averaged PSA in 16 measurement points in the presence and absence of noise. Measuring point Averaged PSA [%] noise ON Averaged PSA [%] noise OFF EDT s 1 2.25 2 81 88 2.47 3 77.7 87.7 2.33 4 75 84.3 1.44 5 85.7 92.7 3.4 6 86.7 92.7 5.2 7 86 90 4.2 8 80 91 5.1 9 76.3 91.3 4.9 10 81 88.7 4 11 84 87 1.9 12 86 88 2.6 13 83.3 89.3 3.5 14 1.4 15 83.3 91.7 4.9 16 78.5 90.2 5.7 17 78.7 89 3.1 18 77 84.3 1.3 Values of coefficient EDT s at sixteen points against 3 PSA averaged in the condition of presence of the noise Fig. 9. EDT s coefficient against corresponding, averaged PSA in sixteen measurement points for condition without disturbing noise. Fig. 10. EDT s coefficient against corresponding, averaged PSA in 16 measurement points for condition with disturbing noise. Dotted line presents tendency obtained without background noise, solid line shows estimated tendency with background noise. Solid line was drawn using equation y = 4.95 ln(edt s)+75 comparing to the dotted line using equation y = 4.95 ln(edt s) + 83.335. Correlation (R 2 = 0.81) has been found between the level of EDT s and average PSA in the condition without noise. This fact suggests the dependence of Percentage Syllable Articulation on EDT curve slope. The mechanism of deteriorating speech intelligibility by early reflections, and dependence of intelligibility on the shape of EDT probably occurred in measured auditorium. Noise highly, and randomly deteriorated intelligibility in the hall. Although weak correlation was found in the presence of disturbing noise from ventilating fans, correlation between the EDT s (ratio of EDT for low frequencies to the EDT for high frequencies) with speech intelligibility scores changed into observable rising tendency. Logarithmic equations used to model this phenomena show differences in values of their constants. In condition without noise logarithmic model describes phenomena much better than with background noise. 4. Conclusions PSA articulation tests were conveyed in the auditorium of seating capacity 406 and volume 3300 m 3.

114 K. Leo A map of different level of speech intelligibility was constructed. STI measurements showed no significant differences through the audience area, although the mean value of PSA corresponds to mean STI value. The need of more precise objective measure of speech intelligibility arouse. Logarithmic correlation (R 2 = 0.81) was found between EDT s and averaged PSA. The model of deteriorating speech intelligibility in connection to different profiles of EDT curve versus frequency was proposed. References [1] M. Barron, Proc. Institute of Acoustics, 8 Part 3, 371 (1986). [2] V.O. Knudsen, Acoustical designing in Architecture, Ch. 9, Acoustical Society of America, Los Angeles 1988. [3] H. Kurtovič, Acoustica 33, 32 (1975). [4] H.J. Steeneken, T. Houtgast, in: Rasti, Technical review, Brüel & Kjær 1985, p. 13. [5] M. Barron, Auditorium Acoustics and Architectural Design, Chapman & Hall/Routledge, London 1993, Ch. 6, 7. [6] H.K. Dunn, S.D. White, J. Acoust. Soc. Am. 11, 278 (1940). [7] T. Houtgast, H.J. Steeneken, in: Rasti, Technical review, Brüel & Kjær 1985, p. 3.