Voice analysis for detecting patients with Parkinson s disease using the hybridization of the best acoustic features

Size: px
Start display at page:

Download "Voice analysis for detecting patients with Parkinson s disease using the hybridization of the best acoustic features"

Transcription

1 International Journal on Electrical Engineering and Informatics - Volume 8, umber, March 206 Voice analysis for detecting patients with Parkinson s disease using the hybridization of the best acoustic features Achraf BEBA, Abdelilah JILBAB 2 and Ahmed HAMMOUCH 3,2,3 Laboratoire de Recherche en Génie Electrique, Ecole ormale Supérieure de l Enseignement Technique, Mohammed V University, Rabat, Morocco achraf.benba@um5s.net.ma Abstract: Parkinson's disease (PD) is a neurodegenerative disorder of unknown etiology. It causes, during its course, vocal impairment in approximately 90% of patients. PD patients suffer from hypokinetic dysarthria, which manifests in all aspects of speech production: respiration, phonation, articulation, nasality and prosody. To evaluate these, clinicians have adopted perceptual methods, based on acoustic cues, to distinguish different disease states. In order to improve these evaluations, we used a variety of voice samples comprising the numbers from to 0, four rhymed sentences, nine Turkish words plus the sustained vowels a, o, and u. Samples were collected from 40 people, 20 with PD. We used the method of Leave- One-Subject-Out (LOSO) validation with a K earest eighbor (k-) classifier with its different types of kernels, (i.e.; RBF, Linear, polynomial and MLP). The best result obtained was 82.5% using two diffirent voice samples; - the 4 th acoustic features along with the 7 th voice samples; 2- The 3 th and the 5 th acoustic features along with 20 th voice sample. Keywords: Voice analysis, Parkinson s disease, acoustic features, UCI Machine learning, Leave One Subject Out, Support Vector Machines.. Introduction Parkinson s disease (PD) is the second most common neurological disorder after Alzheimer's disease. It causes, during its course, a variety of symptoms. These include difficulty walking, talking, thinking or completing other simple tasks [] [2] [3]. Such neurological diseases profoundly affect the lives of patients and their families []. PD is generally seen in people over the age of 50. For most elderly people who are suffering from the disease, physical visits to the clinic for diagnosis, monitoring, and treatment are difficult and complicated [4] [5]. Parkinson s disease causes vocal impairment for approximately 90% of patients [6]. Vocal disorders do not appear abruptly. They are the result of a slow process whose early stages may be unnoticed. For this reason, the development of early diagnosis and tele-monitoring systems with accurate, reliable, and unbiased predictive models, are crucial for patients and research [] [7]. These will allow practitioners and patients to act faster and better understand the disease. There are recent studies for the detection of voice disorders with machine learning tools using acoustic measurements (features) of dysphonia. These include fundamental frequency or pitch of vocal oscillation (F0); absolute sound pressure level (indicating the relative loudness of speech); jitter which represents the cycle-to-cycle variation of fundamental frequency; shimmer which represents the extent of variation in speech amplitude from cycle to cycle; and harmonicity which represents the degree of acoustic periodicity [] [8] [9]. Studies have shown variations in all these measurements for distinguish person with PD from healthy controls [0], which shows that these acoustic parameters could be useful to evaluate speech disorders []. Voice impairments can be detected by using acoustic features extracted from people with PD. Other measurements, such as complex nonlinear aperiodicity, turbulent, aero-acoustic, and non-gaussian randomness of the sound could be useful to increase the efficacy of voice disorder diagnosis systems [8]. Received: October 29 th, 204. Accepted: February 2 nd, 206 DOI: /ijeei

2 Achraf BEBA, et al. As for disorders, Little et al. [] aimed to discriminate healthy people from people with PD by detecting dysphonia. In their study, sustained vowel a phonations were recorded from 3 subjects, of whom 23 were diagnosed with PD. They then selected ten highly uncorrelated measures, and found four that, in combination, lead to overall correct classification performance of 9.4%, using a kernel Support Vector Machine (SVM). Betul Erdogdu Sakar et al [5] analyzed multiple types of sound recordings collected from people with Parkinson s disease. The extracted features were fed into SVM and k-earest eighbor (k-) classifiers for PD diagnosis by using a leave-one-subject-out (LOSO) cross-validation scheme and summarized Leave-One-Out. To distinguish healthy subjects from PWP, most studies use SVM classification [] [5]. Success of the diagnostic system is measured with true positive (TP), true negative (T), false positive (FP) and false negative (F) rates. In this work, we used a dataset that was published in the UCI machine learning archive on June 204 and which was used by Betul Erdogdu Sakar et al [5]. In their study, multiple voice samples per subject were collected during the pronunciation of numbers from to 0, four rhymed sentences, nine words in Turkish language along with sustained vowels a, o, and u from 40 people, 20 with Parkinson s disease. In this paper we used the same database as [5]. The main idea of our work is to show the effectiveness of using, not multiple types of voice recording as [5], but each type of voice recording independently in order to improve the classification accuracy. For classification, we used k-, and to evaluate the success of the models in discriminating healthy subjects from people with Parkinson s disease, we calculated accuracy, specificity, sensitivity scores [5]. 2. Data acquisition The database of this study (Table I) was downloaded from the UCI machine learning archive, and was used in [5]. It consists of 20 Parkinsonian patients (6 female, 4 male) and 20 healthy subjects (0 female, 0 male) (Figureure ) who visited the Department of eurology in the Cerrahpasa Faculty of Medicine, Istanbul University. The test group consisted of patients suffering from Parkinson's disease for 0 to 6 years. The age of PD patients varied between 43 and 77 years (mean: 64.86, standard deviation: 8.97). The age of healthy subjects ranged between 45 and 83 years (mean: 62.55, standard deviation: 0.79). For each individual, 26 voice samples including sustained vowels, numbers, words, and short sentences were recorded. The voice samples were selected by a group of neurologists from a set of speaking exercises that aim to improve voice performance. All samples were recorded by a Trust MC-500 microphone with a frequency range between 50 Hz and 3 khz. The microphone was set to 96 khz, 30 db and placed at a distance of 0 cm from the subject, who then read or repeated the specified words or texts. In the UCI machine learning archive dataset [], there are 26 voice samples, with multiple types of voice recordings. The dataset is organized in a way that the columns represent the 26 features, and the rows represent the 26 types of voice samples for each individual, for instance the first three samples represent the sustained vowels /a/, /o/ and /u/, respectively. The samples from 4 to 3 represent numbers from to 0. The 4 th to the 7 th voice sample represent short sentences, while the rest of samples represent individual words. This makes a metrics of 040x26 (40x26 =040). 09

3 Voice analysis for detecting patients with Parkinson s disease using the hybridization Figure. Waveform of a voice sample belonging to a healthy individual (top) and a PWP (bottom). Table. Structure of the database Database (040x26) 26 Acoustic features (see Table I) which contains (F0, Jitter, Shimmer, HR) Subjects Samples Table 2. Time-Frequency-Based Features given by Pratt acoustic analysis software Groups Features umber of feature Jitter Parameters Shimmer Parameters Harmonicity Pitch Parameters Pulses Parameters Voicing Parameters Jitter (local) (%) Jitter (local, absolute) (s) 2 Jitter (rap) (%) 3 Jitter (ppq5) (%) 4 Jitter (ddp) (%) 5 Shimmer (local) (%) 6 Shimmer (local, db) (db) 7 Shimmer (apq3) (%) 8 Shimmer (apq5) (%) 9 Shimmer (apq) (%) 0 Shimmer (dda) (%) Mean autocorrelation (AC) 2 Mean HR 3 Mean HR 4 Median pitch (Hz) 5 Mean pitch (Hz) 6 Standard deviation (Hz) 7 Minimum pitch (Hz) 8 Maximum pitch (Hz) 9 umber of pulses 20 umber of periods 2 Mean period (s) 22 Standard deviation of period (s) 23 Fraction of locally unvoiced pitch frames (%) 24 umber of voice breaks 25 Degree of voice breaks (%) 26 0

4 Achraf BEBA, et al. 3. Methodology In their study, Betul Erdogdu Sakar et al [5] used a classification with Leave-One-Subject- Out (LOSO) validation scheme, in which all the 26 voice samples of one individual were left out to be used for validation as if it was an unseen individual, and the rest of the samples are used for training [5]. According to their method, if the majority of the voice samples of a test individual are classified as unhealthy, then the individual is classified as positive [5]. In their study, they presented another classification with Summarized Leave-One-Out (s-loo). The aim of using this method was to compare the success of conventional Leave-One-Subject-Out validation with an unbiased Leave-One-Out (LOO) [5]. In this method, the feature values of the 26 voice samples of each individual are summarized using central tendency and dispersion metrics such as mean, median, trimmed mean (0% and 25% removed), standard deviation, interquartile range, mean absolute deviation, and a novel form of dataset consisting of samples is formed where is the number of individuals [5]. The purpose of summarizing the voice samples of individuals is to minimize the effect of variations between different voice samples of a subject [5]. The best classification accuracy achieved in their research was 77.50% using s-loo with linear kernel of SVM and the best results of 000 runs of selecting a random voice samples from each individual was 85% with the same SVM kernel [5] which is not a steady results, because it needs 000 runs maybe less maybe more and also they used random voice samples and not the same sample for all subjects. The methodology of our method is described in the next paragraphs: A. Feature Extraction Dysarthria is the set of voice illnesses related with turbulences of muscular control of the speech organs. Dysarthria includes all malfunctions related to breathing, phonation, articulation, nasalization and prosody. These deficits can be measured and detected by analyzing various features of voice. In this study each subject was asked to read or say predetermined 26 items comprising numbers from to 0, four rhymed sentences, nine words in Turkish language plus sustained vowels a, o, and u. A total of 040 recordings each represented with by a 26 dimensional feature vector was calculated (Table II) along with a binary PD-score. A PD-score of zero indicates that the feature vector belongs to a person with PD and a score of one indicates that it belongs to a healthy subject. To extract features from voice samples, Praat acoustic analysis software [2] was used. A group of 26 linear and timefrequency based features were extracted from each voice sample.. Fundamental Frequency The fundamental frequency is the number of opening and closing cycles of the vocal folds per second. 2. Voice breaks ormal voices can easily maintain phonation when saying sustained vowel /a/. Some pathological voices have difficulty with it. This can be measured in different ways [2]. Fraction of locally unvoiced pitch frames (FLUPF) represents the fraction of pitch frames that are considered as unvoiced (the MDVP calls it DUV) [2]. umber of voice breaks is defined as the number of times distances between consecutive pulses that are longer than.25 divided by the pitch floor [2]. Degree of voice breaks is computed as the total duration of the breaks between the voiced parts of the signal, divided by the total duration of the analyzed part of the signal (the MDVP calls it DVB). [2]. 3. Jitter measurement Jitter (absolute) represents the cycle-to-cycle variation of fundamental frequency. It is computed as the average absolute difference between consecutive periods, expressed as [5] [6]:

5 Voice analysis for detecting patients with Parkinson s disease using the hybridization Jitter ( absolu) T i T i () Where T i is the extracted F 0 (fundamental frequency) period lengths and is the number of extracted F 0 periods. Jitter (relative) is defined as the average absolute difference between consecutive periods, divided by the average period. It is expressed as a percentage [5] [6]: Jitter i ( relative ) T T T i (2) Jitter (rap) represents the Relative Average Perturbation, computed as the average absolute difference between a period and the average of it and its two neighbors, divided by the average period [5] [6]. Jitter (ppq5) represents the five-point Period Perturbation Quotient, defined as the average absolute difference between a period and the average of it and its four closest neighbors, divided by the average period [5] [6]. 4. Shimmer measurement Shimmer (db) represents the variability of the peak-to-peak amplitude in decibels, computed as the average absolute base-0 logarithm of the difference between the amplitudes of consecutive periods, multiplied by 20 [5] [6]: Shimmer ( db) A 20log Ai (3) where A i is the extracted peak-to-peak amplitude data, and is the number of extracted fundamental frequency periods F 0. Shimmer (relative) is expressed as the average absolute difference between the amplitudes of consecutive periods, divided by the average amplitude, expressed as a percentage [5] [6]: Shimmer( relative ) i A A A i (4) Shimmer (apq) represents the -point Amplitude Perturbation Quotient, defined as the average absolute difference between the amplitude of a period, and the average of the amplitudes of it, and its ten closest neighbors, divided by the average amplitude [5]. 5. Harmonicity Harmonics-to-oise Ratio (HR) is expressed as the degree of acoustic periodicity [2]. B. Classification with Leave-One-Subject-Out (LOSO) 2

6 Achraf BEBA, et al. Instead of using conventional bootstrapping or LOO validation methods [3] [4] (which consist of sparing some samples of an individual in the training phase and some for the testing phase, creating an artificial overlap between the training and test sets) or s-loo (which summarizes the feature values of voice samples of each subject by using central tendency and dispersion metrics [5]), we use a LOSO validation scheme. That is, we left out all the samples of one individual to be used for validation as if it were an unseen individual, and trained a classifier on the rest of the samples. In addition, we compared two methods of using LOSO validation: first with all the 26 voice samples of each individual; then, using LOSO for each voice sample independently. Evaluation Metrics In order to measure the success of our classifiers and select the best acoustic features, we used the evaluation metrics accuracy, sensitivity and specificity. Accuracy is the ratio of correctly classified instances to total instances [5]: TP T Accuracy TP T FP F (5) Where TP is true positive (Healthy subjects who was correctly classified), T is true negative (Patients with PD who was correctly classified), FP is false positive (Patients with PD who was incorrectly classified) and F is false negative (Healthy subjects who were incorrectly classified). Accuracy represents the success of the classifier to discriminate between the two groups, Sensitivity represents the accuracy of detecting Healthy subjects and Specificity represents the accuracy of detecting the other patients with PD [5] [7] [8] [9] [20]: TP Sensitivity TP F (6) T Specifisit y T FP (7) 4. Experimental result In this study, we replicated the work in [5]. Firstly, we used LOSO method with all the 26 voice samples at the same time of each individual as can be seen in Table I. These 26 voice samples contain sustained vowels, numbers, words, and short sentences. For example; in the first step of LOSO method, we left out the 26 voice samples of the first subject to be used for validation, and we trained our classifier on the other voice samples of the other 39 subjects (all subjects except the first one). And then we decide if it is TP, T, FP or F. In the second step, we left out the 26 voice samples of the second subject to be used for validation, and we trained our classifier on the other voice samples of the other 39 subjects (all subjects except the second one) then we decide if it is TP, T, FP or F. This method is repeated for all until the 40 th subject. Then we calculated the accuracy sensitivity and specificity. The best obtained classification accuracy was % using k- with k=3 as shown in the Table III. This is somewhat lower than that what was obtained in [5] using the entire 26 voice sample at the same time. By using s-loo, they got 85% as maximum classification accuracy after 000 runs which is not a steady result, because it needs 000 runs maybe less maybe more and also they used random voice samples and not the same sample for all subjects. In order to build a real system to diagnosis the disease we should use the same recordings for all subjects and not different ones. The best obtained results using same recording was 77.50% using s-loo with linear kernel of SVM and 82.4% using s-loo with RBF kernel of SVM for the test dataset which is not use in this study. 3

7 Voice analysis for detecting patients with Parkinson s disease using the hybridization Table 3. Classification results using LOSO with all 26 voice samples k- Accuracy % Sensitivity % Specificity % Figure 2. The classification accuracies using Leave-One-Subject-Out (LOSO) validation scheme for the 20 th voice sample. The best results was achieved for the 3 th and the 5 nd voice samples (which correspond to Jitter (rap) and Jitter (ddp) successively) using LOSO with Linear k- (k = 3) Figure 3. The classification accuracies using Leave-One-Subject-Out (LOSO) validation scheme for the 7 th voice sample. The best results was achieved for the 4 th (which correspond to Jitter (ppq5)) using LOSO with Linear k- (k = 3) In our study, instead of using all the 26 voice samples at the same time, we used LOSO for every voice sample independently. That is we used LOSO for the first voice sample of each subject and we calculated the accuracy, and then for the second voice sample and so on until the 26 th voice sample. As can be seen from Figureure 2 and Figureure 3 and Table II, the best result obtained was 82.5% using two different voice samples; - the 4 th acoustic features (Jitter (ppq5)) along with the 7 th voice samples (short sentence); 2- The 3 th (Jitter (rap)) and the 5 th (Jitter (ddp)) acoustic features along with 20 th voice sample (word). These samples correspond 4

8 Achraf BEBA, et al. to sentences and words. From these results, it is clear that Jitter measurements and the running speech contain more information about the state of a subject as pathological or healthy. 5. Conclusion The characteristic disorders of Parkinson s disease are the result of a slow process whose early stages may go unnoticed. For this reason, we used a variety of voice samples per subject with multiple types of voice recordings, in order to develop method for early diagnosis and to build predictive tele-diagnosis and tele-monitoring models. The representations of acoustic signals have allowed us to examine the effect of Parkinsonism on the phonological system. However, during the various stages of processing, the results show that some voice samples are not reliable indicator of the state of a subject. Indeed, for Parkinson s patients some parameters do not confirm the presence of the disease and do not distinguish them from healthy subjects. The purpose of this study is to show the effectiveness of using each type of voice recording and each acoustic feature independently. The best result obtained was 82.5% using running speech samples along with jitter measurement by k- and LOSO validation scheme with k=3. 6. Acknowledgment The authors would like to thank Thomas R. Przybeck and Daniel Wood, United States Peace Corps Volunteers (Morocco ), and all of the participants involved in this study. 7. References: []. Little, Max A., et al. "Suitability of dysphonia measurements for telemonitoring of Parkinson's disease." Biomedical Engineering, IEEE Transactions on 56.4 (2009): [2]. Ishihara, L., and C. Brayne. "A systematic review of depression and mental illness preceding Parkinson's disease." Acta eurologica Scandinavica 3.4 (2006): [3]. Jankovic, Joseph. "Parkinson s disease: clinical features and diagnosis."journal of eurology, eurosurgery & Psychiatry 79.4 (2008): [4]. Huse, Daniel M., et al. "Burden of illness in Parkinson's disease." Movement disorders 20. (2005): [5]. Sakar, Betul Erdogdu, et al. "Collection and Analysis of a Parkinson Speech Dataset With Multiple Types of Sound Recordings." Biomedical and Health Informatics, IEEE Journal of 7.4 (203): [6]. S. B. O Sullivan and T. J. Schmitz, Parkinson disease, in Physical Rehabilitation, 5th ed. Philadelphia, PA, USA: F. A. Davis Company, 2007, pp , pp [7]. Ruggiero, C., R. Sacile, and M. Giacomini. "Home telecare." Journal of Telemedicine and Telecare 5. (999): -7. [8]. Little, Max A., et al. "Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection." BioMedical Engineering OnLine 6. (2007): 23. [9]. Rahn III, Douglas A., et al. "Phonatory impairment in Parkinson's disease: evidence from nonlinear dynamic analysis and perturbation analysis." Journal of Voice 2. (2007): [0]. Zwirner, Petra, Thomas Murry, and Gayle E. Woodson. "Phonatory function of neurologically impaired patients." Journal of communication disorders 24.4 (99): []. UCI Machine Learning Repository, Parkinson speech dataset with multiple types of sound recording data set. Available : [ archive.ics.uci.edu/ml/datasets/parkinson+speech+dataset+with++multiple+types+of+s ound+recordings] [2]. Boersma, Paul. "Praat, a system for doing phonetics by computer." Glot international 5.9/0 (2002):

9 Voice analysis for detecting patients with Parkinson s disease using the hybridization [3]. Efron, Bradley. "Bootstrap methods: another look at the jackknife." The annals of Statistics (979): -26. [4]. Reunanen, Juha. "Overfitting in making comparisons between variable selection methods." The Journal of Machine Learning Research 3 (2003): [5]. Farrús, Mireia, Javier Hernando, and Pascual Ejarque. "Jitter and shimmer measurements for speaker recognition." ITERSPEECH [6]. Shirvan, R. Arefi, and E. Tahami. "Voice analysis for detecting Parkinson's disease using genetic algorithm and K classification method." Biomedical Engineering (ICBME), 20 8th Iranian Conference of. IEEE, 20. [7]. BEBA, Achraf, Abdelilah JILBAB, and Ahmed HAMMOUCH. "Voice analysis for detecting persons with Parkinson s disease using MFCC and VQ." The 204 International Conference on Circuits, Systems and Signal Processing [8]. Benba, Achraf, Abdelilah Jilbab, and Ahmed Hammouch. "Hybridization of best acoustic cues for detecting persons with Parkinson's disease." Complex Systems (WCCS), 204 Second World Conference on. IEEE, 204. [9]. Benba, Achraf, Abdelilah Jilbab, and Ahmed Hammouch. "Voiceprint analysis using Perceptual Linear Prediction and Support Vector Machines for detecting persons with Parkinson s disease." the 3rd International Conference on Health Science and Biomedical Systems (HSBS'4), ovember , Florence, Italie. [20]. Benba, Achraf, Abdelilah Jilbab, and Ahmed Hammouch. "Voice analysis for detecting persons with Parkinson s disease using PLP and VQ." Journal of Theoretical & Applied Information Technology 70.3 (204). disorders. BEBA Achraf, received the Master degree in Electrical Engineering from Ecole ormale Supérieure de l Enseignement Technique ESET, Rabat Mohammed V University, Morocco, in 203 he is a research student of Sciences and Technology of the Engineer in Ecole ationale Supérieure d Informatique et d Analyse des Systèmes ESIAS, Research Laboratory in Electrical Engineering LRGE, Research Team in Computer and Telecommunication ERIT at ESET, Mohammed V University, Rabat, Morocco. His interests are in Signal processing for detection neurological Abdelilah JILBAB is a teacher at the Ecole ormale Supérieure de l Enseignement Technique de Rabat, Morocco; He acquired his PhD in Computer and Telecommunication from Mohammed V-Agdal University, Rabat, Morocco in February His thesis is concerned with the Filtering illegal sites on the Internet: Contribution to the type of image recognition based on the Principle of Maximum Entropy. Since 2003 he is a member of the laboratory LRIT (Unit associated with the CRST, FSR, Mohammed V University, Rabat, Morocco). Ahmed HAMMOUCH received the master degree and the PhD in Automatic, Electrical, Electronic by the Haute Alsace University of Mulhouse (France) in 993 and the PhD in Signal and Image Processing by the Mohammed V Agdal University of Rabat (Morocco) in From 993 to 203 he was professor in the Mohammed V-Souissi University in Morocco. Since 2009 he manages the Research Laboratory in Electronic Engineering. He is an author of several papers in international journals and conferences. His domains of interest include multimedia data processing and telecommunications. He is currently head of Department for Scientific and Technical Affairs in ational Center for Scientific and Technical Research in Rabat (Morocco). 6

MACHINE LEARNING BASED APPROACHES FOR PREDICTION OF PARKINSON S DISEASE

MACHINE LEARNING BASED APPROACHES FOR PREDICTION OF PARKINSON S DISEASE Abstract MACHINE LEARNING BASED APPROACHES FOR PREDICTION OF PARKINSON S DISEASE Arvind Kumar Tiwari GGS College of Modern Technology, SAS Nagar, Punjab, India The prediction of Parkinson s disease is

More information

Performance of Gaussian Mixture Models as a Classifier for Pathological Voice

Performance of Gaussian Mixture Models as a Classifier for Pathological Voice PAGE 65 Performance of Gaussian Mixture Models as a Classifier for Pathological Voice Jianglin Wang, Cheolwoo Jo SASPL, School of Mechatronics Changwon ational University Changwon, Gyeongnam 64-773, Republic

More information

Parkinson s Disease Diagnosis by k-nearest Neighbor Soft Computing Model using Voice Features

Parkinson s Disease Diagnosis by k-nearest Neighbor Soft Computing Model using Voice Features Parkinson s Disease Diagnosis by k-nearest Neighbor Soft Computing Model using Voice Features Chandra Prakash Rathore 1, Rekh Ram Janghel 2 1 Department of Information Technology, Dr. C. V. Raman University,

More information

Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes

Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes J. Sujatha Research Scholar, Vels University, Assistant Professor, Post Graduate

More information

International Journal of Pure and Applied Mathematics

International Journal of Pure and Applied Mathematics International Journal of Pure and Applied Mathematics Volume 118 No. 9 2018, 253-257 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu CLASSIFICATION

More information

Telephone Based Automatic Voice Pathology Assessment.

Telephone Based Automatic Voice Pathology Assessment. Telephone Based Automatic Voice Pathology Assessment. Rosalyn Moran 1, R. B. Reilly 1, P.D. Lacy 2 1 Department of Electronic and Electrical Engineering, University College Dublin, Ireland 2 Royal Victoria

More information

Nonlinear signal processing and statistical machine learning techniques to remotely quantify Parkinson's disease symptom severity

Nonlinear signal processing and statistical machine learning techniques to remotely quantify Parkinson's disease symptom severity Nonlinear signal processing and statistical machine learning techniques to remotely quantify Parkinson's disease symptom severity A. Tsanas, M.A. Little, P.E. McSharry, L.O. Ramig 9 March 2011 Project

More information

Sound Texture Classification Using Statistics from an Auditory Model

Sound Texture Classification Using Statistics from an Auditory Model Sound Texture Classification Using Statistics from an Auditory Model Gabriele Carotti-Sha Evan Penn Daniel Villamizar Electrical Engineering Email: gcarotti@stanford.edu Mangement Science & Engineering

More information

Detection of Parkinson S Disease by Speech Analysis

Detection of Parkinson S Disease by Speech Analysis Detection of Parkinson S Disease by Speech Analysis Ch.Rajanikanth 1, A.Amardeep 2, B.Kishore 3, Shaik Azeez 4 Student, Department of ECE, Lendi Institute of Engineering And Technology, Vizianagaram, India

More information

Evaluation of the neurological state of people with Parkinson s disease using i-vectors

Evaluation of the neurological state of people with Parkinson s disease using i-vectors INTERSPEECH 2017 August 20 24, 2017, Stockholm, Sweden Evaluation of the neurological state of people with Parkinson s disease using s N. Garcia 1, J. R. Orozco-Arroyave 1,2, L. F. D Haro 3, Najim Dehak

More information

UNOBTRUSIVE MONITORING OF SPEECH IMPAIRMENTS OF PARKINSON S DISEASE PATIENTS THROUGH MOBILE DEVICES

UNOBTRUSIVE MONITORING OF SPEECH IMPAIRMENTS OF PARKINSON S DISEASE PATIENTS THROUGH MOBILE DEVICES UNOBTRUSIVE MONITORING OF SPEECH IMPAIRMENTS OF PARKINSON S DISEASE PATIENTS THROUGH MOBILE DEVICES T. Arias-Vergara 1, J.C. Vásquez-Correa 1,2, J.R. Orozco-Arroyave 1,2, P. Klumpp 2, and E. Nöth 2 1 Faculty

More information

A Review on Dysarthria speech disorder

A Review on Dysarthria speech disorder A Review on Dysarthria speech disorder Miss. Yogita S. Mahadik, Prof. S. U. Deoghare, 1 Student, Dept. of ENTC, Pimpri Chinchwad College of Engg Pune, Maharashtra, India 2 Professor, Dept. of ENTC, Pimpri

More information

Jitter, Shimmer, and Noise in Pathological Voice Quality Perception

Jitter, Shimmer, and Noise in Pathological Voice Quality Perception ISCA Archive VOQUAL'03, Geneva, August 27-29, 2003 Jitter, Shimmer, and Noise in Pathological Voice Quality Perception Jody Kreiman and Bruce R. Gerratt Division of Head and Neck Surgery, School of Medicine

More information

2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media,

2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising

More information

2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media,

2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising

More information

METHOD. The current study was aimed at investigating the prevalence and nature of voice

METHOD. The current study was aimed at investigating the prevalence and nature of voice CHAPTER - 3 METHOD The current study was aimed at investigating the prevalence and nature of voice problems in call center operators with the following objectives: To determine the prevalence of voice

More information

Accurate telemonitoring of Parkinson s disease progression by non-invasive speech tests

Accurate telemonitoring of Parkinson s disease progression by non-invasive speech tests Non-invasive telemonitoring of Parkinson s disease, Tsanas et al. 1 Accurate telemonitoring of Parkinson s disease progression by non-invasive speech tests Athanasios Tsanas*, Max A. Little, Member, IEEE,

More information

Voice Pathology Recognition and Classification using Noise Related Features

Voice Pathology Recognition and Classification using Noise Related Features Voice Pathology Recognition and Classification using Noise Related Features HAMDI Rabeh 1, HAJJI Salah 2, CHERIF Adnane 3 Faculty of Mathematical, Physical and Natural Sciences of Tunis, University of

More information

Acoustic analysis of occlusive weakening in Parkinsonian French speech

Acoustic analysis of occlusive weakening in Parkinsonian French speech Acoustic analysis of occlusive weakening in Parkinsonian French speech Danielle Duez To cite this version: Danielle Duez. Acoustic analysis of occlusive weakening in Parkinsonian French speech. International

More information

Using VOCALAB For Voice and Speech Therapy

Using VOCALAB For Voice and Speech Therapy Using VOCALAB For Voice and Speech Therapy Anne MENIN-SICARD, Speech Therapist in Toulouse, France Etienne SICARD Professeur at INSA, University of Toulouse, France June 2014 www.vocalab.org www.gerip.com

More information

Hierarchical Classification of Emotional Speech

Hierarchical Classification of Emotional Speech 1 Hierarchical Classification of Emotional Speech Zhongzhe Xiao 1, Emmanuel Dellandrea 1, Weibei Dou 2, Liming Chen 1 1 LIRIS Laboratory (UMR 5205), Ecole Centrale de Lyon, Department of Mathematic and

More information

COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION

COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION Journal of Engineering Science and Technology Vol. 11, No. 9 (2016) 1221-1233 School of Engineering, Taylor s University COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION

More information

Combination of Bone-Conducted Speech with Air-Conducted Speech Changing Cut-Off Frequency

Combination of Bone-Conducted Speech with Air-Conducted Speech Changing Cut-Off Frequency Combination of Bone-Conducted Speech with Air-Conducted Speech Changing Cut-Off Frequency Tetsuya Shimamura and Fumiya Kato Graduate School of Science and Engineering Saitama University 255 Shimo-Okubo,

More information

EMOTION CLASSIFICATION: HOW DOES AN AUTOMATED SYSTEM COMPARE TO NAÏVE HUMAN CODERS?

EMOTION CLASSIFICATION: HOW DOES AN AUTOMATED SYSTEM COMPARE TO NAÏVE HUMAN CODERS? EMOTION CLASSIFICATION: HOW DOES AN AUTOMATED SYSTEM COMPARE TO NAÏVE HUMAN CODERS? Sefik Emre Eskimez, Kenneth Imade, Na Yang, Melissa Sturge- Apple, Zhiyao Duan, Wendi Heinzelman University of Rochester,

More information

Reviewing the connection between speech and obstructive sleep apnea

Reviewing the connection between speech and obstructive sleep apnea DOI 10.1186/s12938-016-0138-5 BioMedical Engineering OnLine RESEARCH Open Access Reviewing the connection between speech and obstructive sleep apnea Fernando Espinoza Cuadros 1*, Rubén Fernández Pozo 1,

More information

A Judgment of Intoxication using Hybrid Analysis with Pitch Contour Compare (HAPCC) in Speech Signal Processing

A Judgment of Intoxication using Hybrid Analysis with Pitch Contour Compare (HAPCC) in Speech Signal Processing A Judgment of Intoxication using Hybrid Analysis with Pitch Contour Compare (HAPCC) in Speech Signal Processing Seong-Geon Bae #1 1 Professor, School of Software Application, Kangnam University, Gyunggido,

More information

Parkinson s Disease : A Case Study

Parkinson s Disease : A Case Study Parkinson s Disease : A Case Study Ms. Ahilya V.Salunkhe 1, Prof. Ms. Pallavi S.Deshpande 2 1,2 Department of Electronics & Telecommunication Engineering,Smt. Kashibai Navale College of Engineering, Vadgaon

More information

EMOTIONS are one of the most essential components of

EMOTIONS are one of the most essential components of 1 Hidden Markov Model for Emotion Detection in Speech Cyprien de Lichy, Pratyush Havelia, Raunaq Rewari Abstract This paper seeks to classify speech inputs into emotion labels. Emotions are key to effective

More information

Interjudge Reliability in the Measurement of Pitch Matching. A Senior Honors Thesis

Interjudge Reliability in the Measurement of Pitch Matching. A Senior Honors Thesis Interjudge Reliability in the Measurement of Pitch Matching A Senior Honors Thesis Presented in partial fulfillment of the requirements for graduation with distinction in Speech and Hearing Science in

More information

SPEECH TO TEXT CONVERTER USING GAUSSIAN MIXTURE MODEL(GMM)

SPEECH TO TEXT CONVERTER USING GAUSSIAN MIXTURE MODEL(GMM) SPEECH TO TEXT CONVERTER USING GAUSSIAN MIXTURE MODEL(GMM) Virendra Chauhan 1, Shobhana Dwivedi 2, Pooja Karale 3, Prof. S.M. Potdar 4 1,2,3B.E. Student 4 Assitant Professor 1,2,3,4Department of Electronics

More information

Gender Based Emotion Recognition using Speech Signals: A Review

Gender Based Emotion Recognition using Speech Signals: A Review 50 Gender Based Emotion Recognition using Speech Signals: A Review Parvinder Kaur 1, Mandeep Kaur 2 1 Department of Electronics and Communication Engineering, Punjabi University, Patiala, India 2 Department

More information

Speech (Sound) Processing

Speech (Sound) Processing 7 Speech (Sound) Processing Acoustic Human communication is achieved when thought is transformed through language into speech. The sounds of speech are initiated by activity in the central nervous system,

More information

Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear

Prelude Envelope and temporal fine. What's all the fuss? Modulating a wave. Decomposing waveforms. The psychophysics of cochlear The psychophysics of cochlear implants Stuart Rosen Professor of Speech and Hearing Science Speech, Hearing and Phonetic Sciences Division of Psychology & Language Sciences Prelude Envelope and temporal

More information

POSSIBILITIES OF AUTOMATED ASSESSMENT OF /S/

POSSIBILITIES OF AUTOMATED ASSESSMENT OF /S/ Fricative consonants are produced by narrowing a supraglottal part of the vocal tract and such constriction generates a typical turbulence noise. In the case of the consonant /s/ the narrowed part occurs

More information

A Study on the Degree of Pronunciation Improvement by a Denture Attachment Using an Weighted-α Formant

A Study on the Degree of Pronunciation Improvement by a Denture Attachment Using an Weighted-α Formant A Study on the Degree of Pronunciation Improvement by a Denture Attachment Using an Weighted-α Formant Seong-Geon Bae 1 1 School of Software Application, Kangnam University, Gyunggido, Korea. 1 Orcid Id:

More information

Prosodic Analysis of Speech and the Underlying Mental State

Prosodic Analysis of Speech and the Underlying Mental State Prosodic Analysis of Speech and the Underlying Mental State Roi Kliper 1, Shirley Portuguese, and Daphna Weinshall 1 1 School of Computer Science and Engineering, Hebrew University of Jerusalem, 91904

More information

An Improved Algorithm To Predict Recurrence Of Breast Cancer

An Improved Algorithm To Predict Recurrence Of Breast Cancer An Improved Algorithm To Predict Recurrence Of Breast Cancer Umang Agrawal 1, Ass. Prof. Ishan K Rajani 2 1 M.E Computer Engineer, Silver Oak College of Engineering & Technology, Gujarat, India. 2 Assistant

More information

Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation

Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation Aldebaro Klautau - http://speech.ucsd.edu/aldebaro - 2/3/. Page. Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation ) Introduction Several speech processing algorithms assume the signal

More information

Online Speaker Adaptation of an Acoustic Model using Face Recognition

Online Speaker Adaptation of an Acoustic Model using Face Recognition Online Speaker Adaptation of an Acoustic Model using Face Recognition Pavel Campr 1, Aleš Pražák 2, Josef V. Psutka 2, and Josef Psutka 2 1 Center for Machine Perception, Department of Cybernetics, Faculty

More information

Comparative Analysis of Vocal Characteristics in Speakers with Depression and High-Risk Suicide

Comparative Analysis of Vocal Characteristics in Speakers with Depression and High-Risk Suicide International Journal of Computer Theory and Engineering, Vol. 7, No. 6, December 205 Comparative Analysis of Vocal Characteristics in Speakers with Depression and High-Risk Suicide Thaweewong Akkaralaertsest

More information

EMOTION DETECTION FROM VOICE BASED CLASSIFIED FRAME-ENERGY SIGNAL USING K-MEANS CLUSTERING

EMOTION DETECTION FROM VOICE BASED CLASSIFIED FRAME-ENERGY SIGNAL USING K-MEANS CLUSTERING EMOTION DETECTION FROM VOICE BASED CLASSIFIED FRAME-ENERGY SIGNAL USING K-MEANS CLUSTERING Nazia Hossain 1, Rifat Jahan 2, and Tanjila Tabasum Tunka 3 1 Senior Lecturer, Department of Computer Science

More information

EXPLORATORY ANALYSIS OF SPEECH FEATURES RELATED TO DEPRESSION IN ADULTS WITH APHASIA

EXPLORATORY ANALYSIS OF SPEECH FEATURES RELATED TO DEPRESSION IN ADULTS WITH APHASIA EXPLORATORY ANALYSIS OF SPEECH FEATURES RELATED TO DEPRESSION IN ADULTS WITH APHASIA Stephanie Gillespie 1, Elliot Moore 1, Jacqueline Laures-Gore 2, Matthew Farina 2 1 Georgia Institute of Technology,

More information

Linear classification in speech-based objective differential diagnosis of parkinsonism

Linear classification in speech-based objective differential diagnosis of parkinsonism Linear classification in speech-based objective differential diagnosis of parkinsonism Gongfeng Li, Khalid Daoudi, Jiri Klempir, Jan Rusz To cite this version: Gongfeng Li, Khalid Daoudi, Jiri Klempir,

More information

Analysis of Emotion Recognition using Facial Expressions, Speech and Multimodal Information

Analysis of Emotion Recognition using Facial Expressions, Speech and Multimodal Information Analysis of Emotion Recognition using Facial Expressions, Speech and Multimodal Information C. Busso, Z. Deng, S. Yildirim, M. Bulut, C. M. Lee, A. Kazemzadeh, S. Lee, U. Neumann, S. Narayanan Emotion

More information

ACOUSTIC AND PERCEPTUAL PROPERTIES OF ENGLISH FRICATIVES

ACOUSTIC AND PERCEPTUAL PROPERTIES OF ENGLISH FRICATIVES ISCA Archive ACOUSTIC AND PERCEPTUAL PROPERTIES OF ENGLISH FRICATIVES Allard Jongman 1, Yue Wang 2, and Joan Sereno 1 1 Linguistics Department, University of Kansas, Lawrence, KS 66045 U.S.A. 2 Department

More information

Speech Compression for Noise-Corrupted Thai Dialects

Speech Compression for Noise-Corrupted Thai Dialects American Journal of Applied Sciences 9 (3): 278-282, 2012 ISSN 1546-9239 2012 Science Publications Speech Compression for Noise-Corrupted Thai Dialects Suphattharachai Chomphan Department of Electrical

More information

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training. Supplementary Figure 1 Behavioral training. a, Mazes used for behavioral training. Asterisks indicate reward location. Only some example mazes are shown (for example, right choice and not left choice maze

More information

Speech Generation and Perception

Speech Generation and Perception Speech Generation and Perception 1 Speech Generation and Perception : The study of the anatomy of the organs of speech is required as a background for articulatory and acoustic phonetics. An understanding

More information

What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for

What you re in for. Who are cochlear implants for? The bottom line. Speech processing schemes for What you re in for Speech processing schemes for cochlear implants Stuart Rosen Professor of Speech and Hearing Science Speech, Hearing and Phonetic Sciences Division of Psychology & Language Sciences

More information

11 Music and Speech Perception

11 Music and Speech Perception 11 Music and Speech Perception Properties of sound Sound has three basic dimensions: Frequency (pitch) Intensity (loudness) Time (length) Properties of sound The frequency of a sound wave, measured in

More information

Results. Dr.Manal El-Banna: Phoniatrics Prof.Dr.Osama Sobhi: Audiology. Alexandria University, Faculty of Medicine, ENT Department

Results. Dr.Manal El-Banna: Phoniatrics Prof.Dr.Osama Sobhi: Audiology. Alexandria University, Faculty of Medicine, ENT Department MdEL Med-EL- Cochlear Implanted Patients: Early Communicative Results Dr.Manal El-Banna: Phoniatrics Prof.Dr.Osama Sobhi: Audiology Alexandria University, Faculty of Medicine, ENT Department Introduction

More information

ParkDiag: A Tool to Predict Parkinson Disease using Data Mining Techniques from Voice Data

ParkDiag: A Tool to Predict Parkinson Disease using Data Mining Techniques from Voice Data ParkDiag: A Tool to Predict Parkinson Disease using Data Mining Techniques from Voice Data Tarigoppula V.S. Sriram 1, M. Venkateswara Rao 2, G.V. Satya Narayana 3 and D.S.V.G.K. Kaladhar 4 1 CSE, Raghu

More information

Colon cancer subtypes from gene expression data

Colon cancer subtypes from gene expression data Colon cancer subtypes from gene expression data Nathan Cunningham Giuseppe Di Benedetto Sherman Ip Leon Law Module 6: Applied Statistics 26th February 2016 Aim Replicate findings of Felipe De Sousa et

More information

Quick detection of QRS complexes and R-waves using a wavelet transform and K-means clustering

Quick detection of QRS complexes and R-waves using a wavelet transform and K-means clustering Bio-Medical Materials and Engineering 26 (2015) S1059 S1065 DOI 10.3233/BME-151402 IOS Press S1059 Quick detection of QRS complexes and R-waves using a wavelet transform and K-means clustering Yong Xia

More information

Classification of EEG signals in an Object Recognition task

Classification of EEG signals in an Object Recognition task Classification of EEG signals in an Object Recognition task Iacob D. Rus, Paul Marc, Mihaela Dinsoreanu, Rodica Potolea Technical University of Cluj-Napoca Cluj-Napoca, Romania 1 rus_iacob23@yahoo.com,

More information

Particle Swarm Optimization Supported Artificial Neural Network in Detection of Parkinson s Disease

Particle Swarm Optimization Supported Artificial Neural Network in Detection of Parkinson s Disease IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 5, Ver. VI (Sep. - Oct. 2016), PP 24-30 www.iosrjournals.org Particle Swarm Optimization Supported

More information

Enhanced Detection of Lung Cancer using Hybrid Method of Image Segmentation

Enhanced Detection of Lung Cancer using Hybrid Method of Image Segmentation Enhanced Detection of Lung Cancer using Hybrid Method of Image Segmentation L Uma Maheshwari Department of ECE, Stanley College of Engineering and Technology for Women, Hyderabad - 500001, India. Udayini

More information

General Soundtrack Analysis

General Soundtrack Analysis General Soundtrack Analysis Dan Ellis oratory for Recognition and Organization of Speech and Audio () Electrical Engineering, Columbia University http://labrosa.ee.columbia.edu/

More information

Heart Murmur Recognition Based on Hidden Markov Model

Heart Murmur Recognition Based on Hidden Markov Model Journal of Signal and Information Processing, 2013, 4, 140-144 http://dx.doi.org/10.4236/jsip.2013.42020 Published Online May 2013 (http://www.scirp.org/journal/jsip) Heart Murmur Recognition Based on

More information

Ambiguity in the recognition of phonetic vowels when using a bone conduction microphone

Ambiguity in the recognition of phonetic vowels when using a bone conduction microphone Acoustics 8 Paris Ambiguity in the recognition of phonetic vowels when using a bone conduction microphone V. Zimpfer a and K. Buck b a ISL, 5 rue du Général Cassagnou BP 734, 6831 Saint Louis, France b

More information

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED

FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED FREQUENCY COMPRESSION AND FREQUENCY SHIFTING FOR THE HEARING IMPAIRED Francisco J. Fraga, Alan M. Marotta National Institute of Telecommunications, Santa Rita do Sapucaí - MG, Brazil Abstract A considerable

More information

Noise-Robust Speech Recognition Technologies in Mobile Environments

Noise-Robust Speech Recognition Technologies in Mobile Environments Noise-Robust Speech Recognition echnologies in Mobile Environments Mobile environments are highly influenced by ambient noise, which may cause a significant deterioration of speech recognition performance.

More information

Audio-based Emotion Recognition for Advanced Automatic Retrieval in Judicial Domain

Audio-based Emotion Recognition for Advanced Automatic Retrieval in Judicial Domain Audio-based Emotion Recognition for Advanced Automatic Retrieval in Judicial Domain F. Archetti 1,2, G. Arosio 1, E. Fersini 1, E. Messina 1 1 DISCO, Università degli Studi di Milano-Bicocca, Viale Sarca,

More information

Analysis of EEG Signal for the Detection of Brain Abnormalities

Analysis of EEG Signal for the Detection of Brain Abnormalities Analysis of EEG Signal for the Detection of Brain Abnormalities M.Kalaivani PG Scholar Department of Computer Science and Engineering PG National Engineering College Kovilpatti, Tamilnadu V.Kalaivani,

More information

Shaheen N. Awan 1, Nancy Pearl Solomon 2, Leah B. Helou 3, & Alexander Stojadinovic 2

Shaheen N. Awan 1, Nancy Pearl Solomon 2, Leah B. Helou 3, & Alexander Stojadinovic 2 Shaheen N. Awan 1, Nancy Pearl Solomon 2, Leah B. Helou 3, & Alexander Stojadinovic 2 1 Bloomsburg University of Pennsylvania; 2 Walter Reed National Military Medical Center; 3 University of Pittsburgh

More information

Running-speech MFCC are better markers of Parkinsonian speech deficits than vowel phonation and diadochokinetic

Running-speech MFCC are better markers of Parkinsonian speech deficits than vowel phonation and diadochokinetic Running-speech MFCC are better markers of Parkinsonian speech deficits than vowel phonation and diadochokinetic Taha Khan School of Technology and Business Studies, Computer Engineering, Dalarna University,

More information

2013), URL

2013), URL Title Acoustical analysis of voices produced by Cantonese patients of unilateral vocal fold paralysis: acoustical analysis of voices by Cantonese UVFP Author(s) Yan, N; Wang, L; Ng, ML Citation The 2013

More information

A Determination of Drinking According to Types of Sentence using Speech Signals

A Determination of Drinking According to Types of Sentence using Speech Signals A Determination of Drinking According to Types of Sentence using Speech Signals Seong-Geon Bae 1, Won-Hee Lee 2 and Myung-Jin Bae *3 1 School of Software Application, Kangnam University, Gyunggido, Korea.

More information

Computational Perception /785. Auditory Scene Analysis

Computational Perception /785. Auditory Scene Analysis Computational Perception 15-485/785 Auditory Scene Analysis A framework for auditory scene analysis Auditory scene analysis involves low and high level cues Low level acoustic cues are often result in

More information

The impact of numeration on visual attention during a psychophysical task; An ERP study

The impact of numeration on visual attention during a psychophysical task; An ERP study The impact of numeration on visual attention during a psychophysical task; An ERP study Armita Faghani Jadidi, Raheleh Davoodi, Mohammad Hassan Moradi Department of Biomedical Engineering Amirkabir University

More information

HCS 7367 Speech Perception

HCS 7367 Speech Perception Babies 'cry in mother's tongue' HCS 7367 Speech Perception Dr. Peter Assmann Fall 212 Babies' cries imitate their mother tongue as early as three days old German researchers say babies begin to pick up

More information

HCS 7367 Speech Perception

HCS 7367 Speech Perception Long-term spectrum of speech HCS 7367 Speech Perception Connected speech Absolute threshold Males Dr. Peter Assmann Fall 212 Females Long-term spectrum of speech Vowels Males Females 2) Absolute threshold

More information

PCA Enhanced Kalman Filter for ECG Denoising

PCA Enhanced Kalman Filter for ECG Denoising IOSR Journal of Electronics & Communication Engineering (IOSR-JECE) ISSN(e) : 2278-1684 ISSN(p) : 2320-334X, PP 06-13 www.iosrjournals.org PCA Enhanced Kalman Filter for ECG Denoising Febina Ikbal 1, Prof.M.Mathurakani

More information

An Investigation of MDVP Parameters for Voice Pathology Detection on Three Different Databases

An Investigation of MDVP Parameters for Voice Pathology Detection on Three Different Databases INTERSPEECH 2015 An Investigation of MDVP Parameters for Voice Pathology Detection on Three Different Databases Ahmed Al-nasheri 1, Zulfiqar Ali 1,2, Ghulam Muhammad 1, and Mansour Alsulaiman 1 1 Digital

More information

ECG Beat Recognition using Principal Components Analysis and Artificial Neural Network

ECG Beat Recognition using Principal Components Analysis and Artificial Neural Network International Journal of Electronics Engineering, 3 (1), 2011, pp. 55 58 ECG Beat Recognition using Principal Components Analysis and Artificial Neural Network Amitabh Sharma 1, and Tanushree Sharma 2

More information

M. Shahidur Rahman and Tetsuya Shimamura

M. Shahidur Rahman and Tetsuya Shimamura 18th European Signal Processing Conference (EUSIPCO-2010) Aalborg, Denmark, August 23-27, 2010 PITCH CHARACTERISTICS OF BONE CONDUCTED SPEECH M. Shahidur Rahman and Tetsuya Shimamura Graduate School of

More information

Correlating Speech and Voice Features of Transgender Women with Ratings of Femininity and Gender

Correlating Speech and Voice Features of Transgender Women with Ratings of Femininity and Gender University of Rhode Island DigitalCommons@URI Open Access Master's Theses 2018 Correlating Speech and Voice Features of Transgender Women with Ratings of Femininity and Gender Kimberly Dahl University

More information

PERCEPTION OF UNATTENDED SPEECH. University of Sussex Falmer, Brighton, BN1 9QG, UK

PERCEPTION OF UNATTENDED SPEECH. University of Sussex Falmer, Brighton, BN1 9QG, UK PERCEPTION OF UNATTENDED SPEECH Marie Rivenez 1,2, Chris Darwin 1, Anne Guillaume 2 1 Department of Psychology University of Sussex Falmer, Brighton, BN1 9QG, UK 2 Département Sciences Cognitives Institut

More information

Visi-Pitch IV is the latest version of the most widely

Visi-Pitch IV is the latest version of the most widely APPLICATIONS Voice Disorders Motor Speech Disorders Voice Typing Fluency Selected Articulation Training Hearing-Impaired Speech Professional Voice Accent Reduction and Second Language Learning Importance

More information

Heart Abnormality Detection Technique using PPG Signal

Heart Abnormality Detection Technique using PPG Signal Heart Abnormality Detection Technique using PPG Signal L.F. Umadi, S.N.A.M. Azam and K.A. Sidek Department of Electrical and Computer Engineering, Faculty of Engineering, International Islamic University

More information

Linguistic Phonetics Fall 2005

Linguistic Phonetics Fall 2005 MIT OpenCourseWare http://ocw.mit.edu 24.963 Linguistic Phonetics Fall 2005 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 24.963 Linguistic Phonetics

More information

Critical Review: The Effects of Subthalamic Nucleus Deep Brain Stimulation on Speech Production in Parkinson s Disease

Critical Review: The Effects of Subthalamic Nucleus Deep Brain Stimulation on Speech Production in Parkinson s Disease Critical Review: The Effects of Subthalamic Nucleus Deep Brain Stimulation on Speech Production in Parkinson s Disease Marie McIntosh M.Cl.Sc SLP Candidate University of Western Ontario: School of Communication

More information

A Sleeping Monitor for Snoring Detection

A Sleeping Monitor for Snoring Detection EECS 395/495 - mhealth McCormick School of Engineering A Sleeping Monitor for Snoring Detection By Hongwei Cheng, Qian Wang, Tae Hun Kim Abstract Several studies have shown that snoring is the first symptom

More information

Polly Lau Voice Research Lab, Department of Speech and Hearing Sciences, The University of Hong Kong

Polly Lau Voice Research Lab, Department of Speech and Hearing Sciences, The University of Hong Kong Perception of synthesized voice quality in connected speech by Cantonese speakers Edwin M-L. Yiu a) Voice Research Laboratory, Department of Speech and Hearing Sciences, The University of Hong Kong, 5/F

More information

INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY A Medical Decision Support System based on Genetic Algorithm and Least Square Support Vector Machine for Diabetes Disease Diagnosis

More information

Assessing Functional Neural Connectivity as an Indicator of Cognitive Performance *

Assessing Functional Neural Connectivity as an Indicator of Cognitive Performance * Assessing Functional Neural Connectivity as an Indicator of Cognitive Performance * Brian S. Helfer 1, James R. Williamson 1, Benjamin A. Miller 1, Joseph Perricone 1, Thomas F. Quatieri 1 MIT Lincoln

More information

Implementation of Spectral Maxima Sound processing for cochlear. implants by using Bark scale Frequency band partition

Implementation of Spectral Maxima Sound processing for cochlear. implants by using Bark scale Frequency band partition Implementation of Spectral Maxima Sound processing for cochlear implants by using Bark scale Frequency band partition Han xianhua 1 Nie Kaibao 1 1 Department of Information Science and Engineering, Shandong

More information

Voice Detection using Speech Energy Maximization and Silence Feature Normalization

Voice Detection using Speech Energy Maximization and Silence Feature Normalization , pp.25-29 http://dx.doi.org/10.14257/astl.2014.49.06 Voice Detection using Speech Energy Maximization and Silence Feature Normalization In-Sung Han 1 and Chan-Shik Ahn 2 1 Dept. of The 2nd R&D Institute,

More information

Applying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem

Applying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem Oral Presentation at MIE 2011 30th August 2011 Oslo Applying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem Kirsi

More information

Role of F0 differences in source segregation

Role of F0 differences in source segregation Role of F0 differences in source segregation Andrew J. Oxenham Research Laboratory of Electronics, MIT and Harvard-MIT Speech and Hearing Bioscience and Technology Program Rationale Many aspects of segregation

More information

Applying Data Mining for Epileptic Seizure Detection

Applying Data Mining for Epileptic Seizure Detection Applying Data Mining for Epileptic Seizure Detection Ying-Fang Lai 1 and Hsiu-Sen Chiang 2* 1 Department of Industrial Education, National Taiwan Normal University 162, Heping East Road Sec 1, Taipei,

More information

www.optimalsp.com.au What may happen to breathing, voice and swallow in PD? Muscles of breathing, voice and speech can be subject to the same disease processes as other muscle groups in PD Slowness, stiffness,

More information

Research Article Automatic Speaker Recognition for Mobile Forensic Applications

Research Article Automatic Speaker Recognition for Mobile Forensic Applications Hindawi Mobile Information Systems Volume 07, Article ID 698639, 6 pages https://doi.org//07/698639 Research Article Automatic Speaker Recognition for Mobile Forensic Applications Mohammed Algabri, Hassan

More information

Vital Responder: Real-time Health Monitoring of First- Responders

Vital Responder: Real-time Health Monitoring of First- Responders Vital Responder: Real-time Health Monitoring of First- Responders Ye Can 1,2 Advisors: Miguel Tavares Coimbra 2, Vijayakumar Bhagavatula 1 1 Department of Electrical & Computer Engineering, Carnegie Mellon

More information

Timing Patterns of Speech as Potential Indicators of Near-Term Suicidal Risk

Timing Patterns of Speech as Potential Indicators of Near-Term Suicidal Risk International Journal of Multidisciplinary and Current Research Research Article ISSN: 2321-3124 Available at: http://ijmcr.com Timing Patterns of Speech as Potential Indicators of Near-Term Suicidal Risk

More information

Comparative Accuracy of a Diagnostic Index Modeled Using (Optimized) Regression vs. Novometrics

Comparative Accuracy of a Diagnostic Index Modeled Using (Optimized) Regression vs. Novometrics Comparative Accuracy of a Diagnostic Index Modeled Using (Optimized) Regression vs. Novometrics Ariel Linden, Dr.P.H. and Paul R. Yarnold, Ph.D. Linden Consulting Group, LLC Optimal Data Analysis LLC Diagnostic

More information

Empirical Mode Decomposition based Feature Extraction Method for the Classification of EEG Signal

Empirical Mode Decomposition based Feature Extraction Method for the Classification of EEG Signal Empirical Mode Decomposition based Feature Extraction Method for the Classification of EEG Signal Anant kulkarni MTech Communication Engineering Vellore Institute of Technology Chennai, India anant8778@gmail.com

More information

Effective Diagnosis of Alzheimer s Disease by means of Association Rules

Effective Diagnosis of Alzheimer s Disease by means of Association Rules Effective Diagnosis of Alzheimer s Disease by means of Association Rules Rosa Chaves (rosach@ugr.es) J. Ramírez, J.M. Górriz, M. López, D. Salas-Gonzalez, I. Illán, F. Segovia, P. Padilla Dpt. Theory of

More information

Automatic Detection of Epileptic Seizures in EEG Using Machine Learning Methods

Automatic Detection of Epileptic Seizures in EEG Using Machine Learning Methods Automatic Detection of Epileptic Seizures in EEG Using Machine Learning Methods Ying-Fang Lai 1 and Hsiu-Sen Chiang 2* 1 Department of Industrial Education, National Taiwan Normal University 162, Heping

More information

III. PROPOSED APPROACH FOR PREDICTING SPIROMETRY READINGS FROM COUGH AND WHEEZE

III. PROPOSED APPROACH FOR PREDICTING SPIROMETRY READINGS FROM COUGH AND WHEEZE Automatic Prediction of Spirometry Readings from Cough and Wheeze for Monitoring of Asthma Severity Achuth Rao MV 1, Kausthubha NK 1, Shivani Yadav 3, Dipanjan Gope 2, Uma Maheswari Krishnaswamy 4, Prasanta

More information