Literature DB >> 26522263

Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model.

Zulfiqar Ali1, Irraivan Elamvazuthi2, Mansour Alsulaiman3, Ghulam Muhammad3.   

Abstract

BACKGROUND AND
OBJECTIVE: Automatic voice pathology detection using sustained vowels has been widely explored. Because of the stationary nature of the speech waveform, pathology detection with a sustained vowel is a comparatively easier task than that using a running speech. Some disorder detection systems with running speech have also been developed, although most of them are based on a voice activity detection (VAD), that is, itself a challenging task. Pathology detection with running speech needs more investigation, and systems with good accuracy (ACC) are required. Furthermore, pathology classification systems with running speech have not received any attention from the research community. In this article, automatic pathology detection and classification systems are developed using text-dependent running speech without adding a VAD module.
METHOD: A set of three psychophysics conditions of hearing (critical band spectral estimation, equal loudness hearing curve, and the intensity loudness power law of hearing) is used to estimate the auditory spectrum. The auditory spectrum and all-pole models of the auditory spectrums are computed and analyzed and used in a Gaussian mixture model for an automatic decision.
RESULTS: In the experiments using the Massachusetts Eye & Ear Infirmary database, an ACC of 99.56% is obtained for pathology detection, and an ACC of 93.33% is obtained for the pathology classification system. The results of the proposed systems outperform the existing running-speech-based systems. DISCUSSION: The developed system can effectively be used in voice pathology detection and classification systems, and the proposed features can visually differentiate between normal and pathological samples. Copyright Â
© 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  All-pole model; Auditory spectrum; GMM; Running speech; Voice pathology classification; Voice pathology detection

Mesh:

Year:  2015        PMID: 26522263     DOI: 10.1016/j.jvoice.2015.08.010

Source DB:  PubMed          Journal:  J Voice        ISSN: 0892-1997            Impact factor:   2.009


  3 in total

1.  Continuous Speech for Improved Learning Pathological Voice Disorders.

Authors:  Syu-Siang Wang; Chi-Te Wang; Chih-Chung Lai; Yu Tsao; Shih-Hau Fang
Journal:  IEEE Open J Eng Med Biol       Date:  2022-02-14

2.  A Generic Deep Learning Based Cough Analysis System From Clinically Validated Samples for Point-of-Need Covid-19 Test and Severity Levels.

Authors:  Javier Andreu-Perez; Humberto Perez-Espinosa; Eva Timonet; Mehrin Kiani; Manuel I Giron-Perez; Alma B Benitez-Trinidad; Delaram Jarchi; Alejandro Rosales-Perez; Nick Gatzoulis; Orion F Reyes-Galaviz; Alejandro Torres-Garcia; Carlos A Reyes-Garcia; Zulfiqar Ali; Francisco Rivas
Journal:  IEEE Trans Serv Comput       Date:  2021-02-23       Impact factor: 11.019

3.  Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms.

Authors:  Tamer A Mesallam; Mohamed Farahat; Khalid H Malki; Mansour Alsulaiman; Zulfiqar Ali; Ahmed Al-Nasheri; Ghulam Muhammad
Journal:  J Healthc Eng       Date:  2017-10-19       Impact factor: 2.682

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.