Results of Vocoder Evaluation in Japan

Similar documents
A Sound Track to Reading

Version 1.1 Edition date 07 February 2018 ELPAC. English Language Proficiency for Aeronautical Communication ELPAC paper 1 test specifications

Assistive Listening Technology: in the workplace and on campus

Lindsay De Souza M.Cl.Sc AUD Candidate University of Western Ontario: School of Communication Sciences and Disorders

Bone Conduction Microphone: A Head Mapping Pilot Study

Hearing loss and travel: Assessing the hearing needs of travelers at airports

THE LISTENING QUESTIONNAIRE TLQ For Parents and Teachers of Students Ages 7 through 17 Years

PSYCHOMETRIC VALIDATION OF SPEECH PERCEPTION IN NOISE TEST MATERIAL IN ODIA (SPINTO)

MedRx HLS Plus. An Instructional Guide to operating the Hearing Loss Simulator and Master Hearing Aid. Hearing Loss Simulator

The University of Western Ontario Plurals Test v1.4

TIPS FOR TEACHING A STUDENT WHO IS DEAF/HARD OF HEARING

Effects of speaker's and listener's environments on speech intelligibili annoyance. Author(s)Kubo, Rieko; Morikawa, Daisuke; Akag

The following information relates to NEC products offered under our GSA Schedule GS-35F- 0245J and other Federal Contracts.

Development of Communication Support System using Lip Reading

Advanced Audio Interface for Phonetic Speech. Recognition in a High Noise Environment

Supporting Features. Criteria. Remarks and Explanations

DRAFT. 7 Steps to Better Communication. When a loved one has hearing loss. How does hearing loss affect communication?

Supporting Features Remarks and Explanations

Accessibility Standards Mitel MiVoice 8528 and 8568 Digital Business Telephones

Audioconference Fixed Parameters. Audioconference Variables. Measurement Method: Physiological. Measurement Methods: PQ. Experimental Conditions

aiic Essential do s and dont s when using simultaneous conference interpreters on TV

I. Language and Communication Needs

Research Article Measurement of Voice Onset Time in Maxillectomy Patients

한국어 Hearing in Noise Test(HINT) 문장의개발

Criteria Supporting Features Remarks and Explanations

Note: This document describes normal operational functionality. It does not include maintenance and troubleshooting procedures.

5. Which word refers to making

Ear Exam and Hearing Tests

Kristina M. Blaiser, PhD, CCC-SLP Assistant Professor, Utah State University Director, Sound Beginnings

TOWN OF FAIRFIELD PUBLIC HEALTH NURSING. MANUAL: School Health APPROVED BY: Board of Health School Medical Advisor

Speech perception of hearing aid users versus cochlear implantees

Noise hazards associated with the call centre industry V. PLANEAU

Air conduction hearing thresholds of young and older Japanese adults for pure tones from 125 Hz to 16 khz

Audioconference Fixed Parameters. Audioconference Variables. Measurement Method: Perceptual Quality. Measurement Method: Physiological

New CSA Noise Standards and Noise Control

IEC/TC 29 Electroacoustics. Status: September 2010

See what they say with Captioned Telephone

Ambiguity in the recognition of phonetic vowels when using a bone conduction microphone

An active unpleasantness control system for indoor noise based on auditory masking

Communications Accessibility with Avaya IP Office

Echo Canceller with Noise Reduction Provides Comfortable Hands-free Telecommunication in Noisy Environments

TWO CHANNEL CLINICAL AUDIOMETER AUDIOSTAR PRO

WIDEXPRESS. no.30. Background

Visi-Pitch IV is the latest version of the most widely

Roger TM Plug in, turn on and teach. Dynamic SoundField

GSI AUDIOSTAR PRO CLINICAL TWO-CHANNEL AUDIOMETER

Welcome to Your Audiogram

Vision Painting Inc. Safety Management System

Can. Your Baby Hear You. Your Baby Passed The Hearing Screening. New York State Department of Health

HearPhones. hearing enhancement solution for modern life. As simple as wearing glasses. Revision 3.1 October 2015

Strategic Promotion of Ageing Research Capacity

SUMMARY TABLE VOLUNTARY PRODUCT ACCESSIBILITY TEMPLATE

Hearing Evaluation: Diagnostic Approach

Comfort Contego. Easy-to-use hearing product for a more active life

Effects of Setting Thresholds for the MED- EL Cochlear Implant System in Children

Listening to Learn: Strategies and Modifications for Children with Hearing Loss in the Classroom Environment

Prosody Rule for Time Structure of Finger Braille

Avaya Model 9611G H.323 Deskphone

Perception of American English can and can t by Japanese professional interpreters* 1

at SS8 and AGS on 20, 22 April, 2015 by Junichi Yoshio

An Overview of Simultaneous Remote Interpretation By Telephone. 8/22/2017 Copyright 2017 ZipDX LLC

GSI 61 CLINICAL TWO-CHANNEL AUDIOMETER

Avaya IP Office 10.1 Telecommunication Functions

REFERRAL AND DIAGNOSTIC EVALUATION OF HEARING ACUITY. Better Hearing Philippines Inc.

Hearing Conservation Program

Effects of noise and filtering on the intelligibility of speech produced during simultaneous communication

Voluntary Product Accessibility Template (VPAT)

2/25/2013. Context Effect on Suprasegmental Cues. Supresegmental Cues. Pitch Contour Identification (PCI) Context Effect with Cochlear Implants

HI-FIDELITY BROUGHT TO YOU BY TUNZ

Comparison of speech intelligibility between normal headsets and bone conduction hearing devices at call center

Perception of Hearing Loss by Graduate Students of Speech-Language Pathology

Audiology Assessment. 0c. Exam type:

S everal studies indicate that the identification/recognition. Identification Performance by Right- and Lefthanded Listeners on Dichotic CV Materials

Avaya 3904 Digital Deskphone Voluntary Product Accessibility Template (VPAT)

DTERM SP30 and SP350. Not applicable

Effects of Aircraft Noise on Student Learning

HEARING LOSS AT 3 KILOHERTZ AND THE CHABA "PROPOSED CLINICAL TEST OF SPEECH DISCRIMINATION IN NOISE" C. K. Myers and Cynthia Angermeier

Best Practice Protocols

Meeting a Person With Hearing and Vision Loss

April 23, Roger Dynamic SoundField & Roger Focus. Can You Decipher This? The challenges of understanding

NOAH Sound Equipment Guideline

Voice Pitch Control Using a Two-Dimensional Tactile Display

Cisco Accessibility Conformance Report VPAT Version 2.1

Roger Dynamic SoundField A new era in classroom amplification

Critical Review: What are the objective and subjective outcomes of fitting a conventional hearing aid to children with unilateral hearing impairment?

Everything you need to stay connected

Trouble hearing? Do you have trouble hearing in some situations?

Sennheiser. ActiveGard Technology. Your investment in. Sound Safety WHITE PAPER

ASR. ISSN / Audiol Speech Res 2017;13(3): / RESEARCH PAPER

HyperSound Tinnitus Module USER GUIDE (an optional feature of the HyperSound Clear 500P Directed Audio Solution)

Hearing Protection Systems

INTRODUCTION TO PURE (AUDIOMETER & TESTING ENVIRONMENT) TONE AUDIOMETERY. By Mrs. Wedad Alhudaib with many thanks to Mrs.

Hearing Conservation Program

Citation 音声科学研究 = Studia phonologica (1991),

EDITORIAL POLICY GUIDANCE HEARING IMPAIRED AUDIENCES

HEARING CONSERVATION PROGRAM

Summary Table: Voluntary Product Accessibility Template

Unit III Verbal and Non-verbal Communication

Transcription:

AMCP/WGD10-WP9 AERONAUTICAL MOBILE COMMUNICATIONS PANEL (AMCP) Working Group D Hawaii, U.S.A, January 19-28, 1999 Agenda Item : Vocoder Selection Results of Vocoder Evaluation in Japan Presented by T. Fujimori (Prepared by T. Fujimori and M.Ueno, ENRI) SUMMARY This paper provides the whole results of the vocoder evaluation in Japan which was conducted by ENRI(Electronic Navigation Research Institute) in October - November 1998. The evaluation consisted of MOS test (Japanese accented English and Japanese Languages) and Articulation test (Japanese Language). Four vocoders for evaluation were labelled A,B,C and D by FAA-TC (FAA s William J. Huges Technical Center). (A, C, D : candidate vocoders, B : industry standard vocoder as a performance benchmark) The results indicated that voice quality of Vocoder C was slightly higher than that of Vocoder A, and Vocoder D was the lesser performing equipment in all test conditions.

1. Introduction ENRI conducted the vocoder evaluation in Japan in October - November 1998. The purposes of this evaluation were to obtain basic data for studying acceptability of digital voice in air traffic environment in Japan, and to contribute to ICAO s vocoder selection activity by conducting MOS test for Japanese language(i.e. the test case #1 specified in Vocoder Selection Criteria which was developed at the WG-D/9 meeting). Firstly, the source tapes were produced and sent to FAA-TC on May 1998. Then, these tapes were processed through all four vocoders for the evaluation by FAA-TC and returned to Japan on September 7, 1998. At this time, four vocoders were labelled A,B,C and D by FAA-TC. (A, C, D : candidate vocoders, B : industry standard vocoder as a performance benchmark) It was also informed from FAA-TC that it is unable to inject errors into Vocoder B (therefore, only zero Bit Error Rate has been included in Vocoder B). The evaluation consisted of MOS test (Japanese accented English and Japanese Languages) and Articulation test(japanese Language). MOS test was carried out with participation of 97 air traffic controllers on October 12-28, 1998 and Articulation test was done by trained listeners on October 29 - November 6,1998. The mean values of test data were calculated by the end of November 1998. On FAA s request, ENRI sent the result of Japanese MOS test as the test case #1 for vocoder selection to FAA-TC by E-mail on December, 1998. (Refer to Appendix) The whole results of the evaluation are reported in this paper. 2. Test Tape Construction The source tapes for MOS test were produced by playing background noise(quiet, jet, prop and helicopter) while speakers read each script. On the other hand, the source tapes for Articulation test were produced by means of mixing with speech and background noise. Because all syllables used for Articulation test must be recorded at a constant level and level meter shall be used during recording, it is impossible to use a speaker for noise reproduction. All syllables were recorded without noise and then background noise was superimposed. The audiometric booth was used for recording(see Figure 1). Recording condition of source tapes for each test is shown in Table 1. The combinations between noise condition and microphone for recording were derived from a typical operational environment. The source tapes were processed through all four vocoders by FAA-TC. The test tapes were constructed by editing the various sentences prepared as speech samples in the processed tapes. Test MOS test Articulation test Table1. Recording Condition of Source Tapes NoiseCondit ion Microphone Speaker Sentence SONY ECM-9035 6 - English Jet [79dB/A] SONY ECM-9035 Air Traffic (DAM-ATC) Prop [89dB/A] Sigtronics S-0 Controllers - Japanese Heli [93dB/A] David Clark model-1 M- [3 males, 3 females] (ATC / General) SONY ECM-9035 Jet [79dB/A] SONY ECM-9035 - Japanese Prop [89dB/A] Sigtronics S-0 Trained Listeners (100 Syllables) Heli [93dB/A] David Clark model-1 M- [2 males, 2 females] Military[97dB/A] SONY ECM-9035 1

Audiometric Booth DAT#1 for Recording Headphone Amplifier for side tone Speaker #1 Display Sound Level Meter Speaker #2 Avionics Headset Speaker # DAT#2 for Noise Speaker #5 (Ceiling) Speaker #3 Figure 1 Source Tape Recording System 3. Test method and procedure 3.1 Test Contents The evaluation consisted of two separated methods, MOS test and Articulation test. MOS test was carried out with participation of 97 Japanese active Air Traffic Controllers (71 males, 26 females) from TACC (Tokyo Area Control Center), TIA (Tokyo International Airport) and ENRI. Articulation test was done by four trained listeners. A summary of test contents is given in Table 2 and participants of MOS test in Table 3. Table2. Summary of the Test Contents Test Test Conditions Participants Vocoder Language 2 [Japanese accented English, Japanese] 97 MOS test Articulation test Speaker 6 [3 males, 3 females] Bit Rate 1 [800bps] Background Noise [, Jet, Prop, Heli] Bit Error Rate 2 [0.1%, 2%] (Vocoder B : 0% only) Vocoder [A, B, C, D] Language 1 [Japanese] Speaker [2 males, 2 females] Bit Rate 1 [800bps] Background Noise 5 [, Jet, Prop, Heli, Military] Bit Error Rate 2 [0.1%, 2%] (Vocoder B : 0% only) Air Traffic Controllers Trained listeners Table 3. Summary of the MOS Test Participants 2

Belonging Test dates Number of participants TACC Oct. 12 28 (1998) 75 [ 5 males, 21 females ] TIA Oct. 19 23 (1998) 20 [ 15 males, 5 females ] ENRI Oct. 15 (1998) 2 [ 2 males ] Total 97 [ 71 males, 26 females ] 3.2 Test System Both tests were conducted in the audiometric booth at NTT-AT(Nippon Telegraph and Telephone Advanced Technology Corporation) in Tokyo (Shown in Figure 2 and Figure 3). Listening Room PC DAT Player Equalizer Amplifier Receivers Receiver Keypad for scoring Figure 2. Listening System Figure 3. Listening Room 3.3 MOS Test Procedure 3.3.1 MOS test The participants listen to a sentence one by one via a single-ear standard receiver in a fourseated audiometric booth, and give his/her opinion of the speech from a view point of Listeningeffort scale(shown in table ) with pressing an appropriate number of out of five buttons on a keypad. Each button is labeled with a description of the scale. The sentence lists are made up in random order for each group of four people. Four people in a same group evaluate same speech samples. The next group evaluates different speech samples in a different order. One participant listens to 336 sentences, that is one sentence per test condition. Table. Listening-effort Scale Effort required to understand the meaning of sentences Score Complete relaxation possible; no effort required 5 Attention necessary; no appreciable effort required Moderate effort required 3 Considerable effort required 2 No meaning understand with any feasible effort 1 3.3.2 Questionnaire on acceptability of digital voice 3

After the MOS test, the participated air traffic controllers were asked to answer the following questionnaire to obtain a reference data how they accept the digital voice quality for the Air Traffic Control environment. Although it is a simple approach, it helps to know the acceptable level for digital voice use among Japanese air traffic controllers. Questionnaire Do you think which level and over in the Listening-effort scale is acceptable as voice quality for use in Air Traffic Control environment? Please mark with a check. Complete relaxation possible; no effort required Attention necessary; no appreciable effort required Moderate effort required Considerable effort required No meaning understand with any feasible effort 3. Articulation Test Procedure Participants listen to a syllable one by one via a single-ear standard receiver in a four-seated audiometric booth, and type the syllable received with he keypad which has keys assigned all vowel and consonant in Japanese. Each participant listens to 100 syllables list (see Table 5) for every test condition in random way. The monosyllable articulation value (Sav) is defined by a percentage of the total number of received elements out of 195 monosyllables Here, Japanese 100syllables are separated into 195 monosyllables. According to ITU-T Handbook on Telephonometry, Sentence articulation value is considered to equal nearly 100% if Sav is above 80%. And it is also known that the value of satisfaction over communication declines rapidly if Sav is below 80%. Therefore, we tentatively established Sav is above 80% as the satisfied value in this articulation test. Table 5. Japanese Syllables List (syllables: 100, monosyllables:195) NO 1 2 3 5 1 re pa ro pya bya kyo o do mi ryu 2 kya te go nya ra gya ru a pu kyu 3 pyu me ri sya ga to pyo ma sa nyu hu byu hi hyo ze zi su myo se da 5 e gyu gu mya ge ya bi byo ti zyo 6 nyo zu ku ho tya Mu mo rya gi ka 7 ni gyo bu bo pe tyo zo Ke i hya 8 zya u ryo he tyu ko tu So zyu ba 9 myu ta syo ha za pi de no si be 10 nu wa yu ne ki po syu na hyu yo. Results

.1 MOS value Means for MOS test are presented in Tables 6 and 7. Graphical representations of these means are shown in Figure, 5, 6 and 7. Table 6. Means for MOS test under Japanese accented English language Bit Error Rate Background Vocoder (BER) Noise A C D B(Ref.) QUIET.3.1 3.06.23 BER 0.1% JET 3.71 3.89 2.2 3.6 PROP 3.9.15 2.9 3.75 HELI 3.12 3.26 1.92 2.95 QUIET.1.3 2.79.23 BER 2% JET 3.62 3.86 2.08 3.6 PROP 3.82.07 2.10 3.75 HELI 2.95 3.16 1.63 2.95 Table 7. Means for MOS test under Japanese language Bit Error Rate Background Vocoder (BER) Noise A C D B(Ref.) QUIET.5.60 3.11.20 BER 0.1% JET.09.23 2.70 3.67 PROP.1.26 2.6 3.57 HELI 3.15 3.31 2.07 2.88 QUIET.25. 2.80.20 BER 2% JET 3.80.09 2.31 3.67 PROP 3.77 3.99 2.5 3.57 HELI 2.96 3.20 1.66 2.88 (Note : The above MOS values under BER 0.1%& PROP noise condition are the result of MOS test as the test case #1 in Vocoder Selection Criteria developed in the WG-D/9 meeting. Refer Appendix, too.) [Comments] (1) The MOS values for Vocoder A and Vocoder C are higher than those for Vocoder B (performance benchmark) although Vocoder A and Vocoder C have Bit Error. (Vocoder B has no Bit Error.) (2) The MOS values for Vocoder C is slightly higher than those for Vocoder A in any test conditions. (3) The MOS values for Vocoder D is significantly lower than those for the other Vocoders. () With regard to some data that the MOS value for Prop noise (89dB/A) is higher than that for Jet noise (79dB/A), it seems that the effect of differences of characteristics among microphones in producing the source tapes appeared sensitively. The characteristics of Sigtronics S-0 microphone used in Prop noise were found to surpass that of SONY ECM- 9035 microphone used in Jet noise by the extra Articulation test. (See Figure 8 of page 9) 5

Japanese Accented English with BER0.1% Condition 5 MOS value 3 2 Jet Prop Heli 1 Vocoder A Vocoder C Vocoder D Vocoder B (Ref.) Figure. MOS value for Japanese accented English with BER 0.1% Japanese Accented English with BER 2% Condition 5 MOS value 3 2 Jet Prop Heli 1 Vocoder A Vocoder C Vocoder D Vocoder B (Ref.) Figure 5. MOS value for Japanese accented English with BER 2% 6

Japanese with BER 0.1% Condition 5 MOS value 3 2 Jet Prop Heli 1 Vocoder A Vocoder C Vocoder D Vocoder B (Ref.) Figure 6. MOS value for Japanese with BER 0.1% (Note : The above MOS values under PROP noise condition are the result of MOS test as the test case #1 in Vocoder Selection Criteria developed in the WG-D/9 meeting. Refer Appendix, too.) Japanese with BER 2% Condition 5 MOS value 3 2 Jet Prop Heli 1 Vocoder A Vocoder C Vocoder D Vocoder B (Ref.) Figure 7. MOS value for Japanese with BER 2% 7

.2 Response to Questionnaire on acceptability of digital voice The response to the questionnaire is shown in Table 8. [Comments] Table 8. Response to the questionnaire Listening-effort scale Number of poll 5 : Complete relaxation possible; no effort required (%) : Attention necessary; no appreciable effort required 58 (60%) 3 : Moderate effort required 33 (3%) 2 : Considerable effort required 2 (2%) 1 : No meaning understand with any feasible effort 0 (0%) Total 97(100%) Mean Score 3.66 (1) 60% of all participants selected (i.e. Attention necessary; no appreciable effort required) and 3% chose 3 (i.e. Moderate effort required). The mean score is 3.66. (2) According to the above result, MOS=3.66 can be temporarily assumed as acceptable level for use of digital voice in Air Traffic Control Environment. (3) It is known by experiences that the value of MOS=3.5 can be considered as a measure of voice quality in construction of a digital telecommunication network. This fact supports MOS=3.66 as acceptable level for use of digital voice..3 Articulation value Means for Articulation test (Monosyllable Articulation value) are presented in Table 9 and graphical representations are shown in Figure 9 and 10. Bit Error Rate (BER) BER 0.1% BER 2% Table 9. Monosyllable Articulation value (%) Background Vocoder Noise A C D B(Ref.) QUIET 86.0 86.7 81.5 82.9 JET 80.9 82.9 77.1 77. PROP 82.6 8.6 76.3 79.3 HELI 77. 80.9 72.2 72.2 MIL 70.2 72.1 67.1 66.2 QUIET 8.1 85. 78.7 82.9 JET 79.8 82.5 73. 77. PROP 80.6 85.0 71.5 79.3 HELI 73.8 79.3 68.6 72.2 MIL 66.7 70.0 62.6 66.2 8

[Comments] (1) The monosyllable articulation values for Vocoder A and Vocoder C are higher than those for standard Vocoder B even under the condition of BER 2%. (2) The monosyllable articulation values for Vocoder C is slightly higher than those for Vocoder A under all the test conditions. (3) The monosyllable articulation values for Vocoder D is significantly lower than those for the other Vocoders under all the test conditions. () With the exception of the background noise of Military, the monosyllable articulation values for Vocoder C are above 80%(i.e. tentative satisfied value). (5) The monosyllable articulation values for Vocoder A are above 80% under the noise condition of Jet and Prop. (6) Some monosyllable articulation values for Prop noise is higher those for Jet noise as well as the results of MOS test. It also seems to depend on the characteristics of microphones used for recording speech sentences. (See Figure 8) Comparsion of Microphone Characteristic (Test Condition : Vocoder A, BER0.1%, ) Monosyllable Articulation Valu 100 90 80 70 SONY Sigtronics David Clark Figure 8. Comparison of Microphone Characteristic 9

Articulation Test (BER 0.1%) Monosyllable Articulation Valu 100 90 80 70 60 50 Vocoder A Vocoder b Vocoder c Vocoder B (Ref.) Jet Prop Heli Military Figure 9. Monosyllable Articulation value for Japanese with BER 0.1% Articulation Test (BER 2%) Monosyllable Articulation Valu 100 90 80 70 60 50 Vocoder A Vocoder b Vocoder c Vocoder B (Ref.) Jet Prop Heli Military Figure 10. Monosyllable Articulation value for Japanese with BER 2% 5. Data Analysis (Analysis of Variance) 10

5.1 MOS test An Analysis of Variance (ANOVA) was conducted to ascertain the relevance of the independent variables (i.e. Vocoder, Language, BER, Sex of speaker and Background noise) in the MOS test. The results are significant with 99% confidence (a=0.01 level). The results of Analysis of Variance are shown in Table 10. However, the data for Vocoder B were excepted from these data since it is the performance benchmark vocoder and its BER is 0% only. Table 10. Analysis of Variance for MOS test Independent Mean DF Anova SS Variable Square F Result Vocoder 2 139.78 69.89 550.02 Significant Language 1 1.29 1.29 10.19 Significant BER 1 3.39 3.39 26.72 Significant Sex of speaker 1 19.67 19.67 15.81 Significant Background Noise 3 53.21 17.7 139.59 Significant [Comments] (1) The dependent measure of MOS test varied significantly by all independent variables. (2) As to the language, the MOS values for messages spoken in Japanese are slightly higher than that spoken in Japanese accented English, One possible reason for it is that the DAM- ATC sentence lists, which were developed by FAA and used in Japanese accented English MOS test, included some FIX names and phraseology unfamiliar to Japanese air traffic controllers. (3) As to the sex of speaker, the MOS values for messages spoken by the male are significantly higher than that spoken by the female. This result is clearly due to the difference of speech level, since the speech levels were left to speakers when recording speech sentences for MOS test. (Refer to Appendix) 5.2 Articulation test ANOVA was conducted to ascertain the relevance of the independent variables (i.e. Vocoder, BER, Sex of speaker, Background noise) in the Articulation test. The results are significant with 99% confidence (a =0.01 level). The results of ANOVA are in Table 11. However the data for Vocoder B (performance benchmark) were excepted from these data. [Comments] Table 11. Analysis of Variance for Articulation test Independent Mean DF Anova SS F Result Variable Square Vocoder 2 1365.5 682.7 12.8 Significant BER 1 189.7 189.7 3.7 Not Significant Sex of speaker 1 71.7 71.7 1.30 Not Significant Background Noise 3573.0 893.2 16.32 Significant 11

(1) The dependent measure of articulation test varied significantly by factors of Vocoder and Background noise. (2) In contrast with ANOVA for MOS test, the sex of speaker is resulted in Not significant. This is simply because that the speech level was equalized when producing the source tapes for Articulation test. Consequently, it is considered that there are no differences in the speech quality for messages spoken by male and female. 6. Conclusion The results indicate the followings. (1) The voice quality of Vocoder C is slightly higher than that for Vocoder A under any test condition. (2) The voice quality for Vocoder D is significantly lower than that for the other Vocoders. (3) With the exception of Helicopter noise, MOS values for Vocoder C are beyond the acceptable level of 3.66 which was assumed based on the result of questionaire to the participated air traffic controllers. () The monosyllable articulation values for Vocoder C are above 80%(i.e. tentative satisfied value) under BER 0.1% & all the background noise conditions except Military noise. 7. Recommendation WG-D is invited to consider these results in the vocoder selection process with respect to standardizing a.8 kbps vocoder for use in VDL Mode 3 air/ground communication system. [References] (1) Vocoder Evaluation Plan in Japan (ICAO AMCP WG-D/8 WP-23, Oberpfaffenhaffen, Germany, December, 1997) (2) Overview of Vocoder evaluation program in Japan (ICAO AMCP WG-D/9 WP-5, Ottawa, Canada, September, 1998) (3) ITU-T Handbook on Telephonometry () ITU-T Recommendation P.800 12

Appendix Result of Japanese MOS Test for Vocoder Selection (The test case #1 specified in the Vocoder Selection Criteria) Mean scores (MOS value) for each vocoder are presented in Table 1 and a graphical representation of MOS value is shown in Figure 1. Table1. MOS value for each vocoder(japanese Language) Sex of Speaker A Vocoder B C D Male.39.07.0 2.81 Female 3.90 3.07.13 2.8 Mean.1 3.57.26 2.6 (Vocoder A,C,D : BER 0.1% / Vocoder B : BER 0%) Figure 1. Graphical representation of MOS value Prop Noise Condition (Japanese) 5 MOS value 3 2 Male Female Mean 1 A B C D VOCODER (Vocoder A,C,D : BER 0.1% / Vocoder B : BER 0%) The MOS value for Vocoder C is slightly higher than that for Vocoder A. The value for Vocoder D is significantly lower than those for the other Vocoders. With regard to a fact that the MOS value for male voice is higher than that for female voice, it seems that the effect of differences of speech level between male and female appeared sensitively. 13