Literature DB >> 29567049

Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach.

Shih-Hau Fang1, Yu Tsao2, Min-Jing Hsiao1, Ji-Ying Chen1, Ying-Hui Lai3, Feng-Chuan Lin4, Chi-Te Wang5.   

Abstract

OBJECTIVES: Computerized detection of voice disorders has attracted considerable academic and clinical interest in the hope of providing an effective screening method for voice diseases before endoscopic confirmation. This study proposes a deep-learning-based approach to detect pathological voice and examines its performance and utility compared with other automatic classification algorithms.
METHODS: This study retrospectively collected 60 normal voice samples and 402 pathological voice samples of 8 common clinical voice disorders in a voice clinic of a tertiary teaching hospital. We extracted Mel frequency cepstral coefficients from 3-second samples of a sustained vowel. The performances of three machine learning algorithms, namely, deep neural network (DNN), support vector machine, and Gaussian mixture model, were evaluated based on a fivefold cross-validation. Collective cases from the voice disorder database of MEEI (Massachusetts Eye and Ear Infirmary) were used to verify the performance of the classification mechanisms.
RESULTS: The experimental results demonstrated that DNN outperforms Gaussian mixture model and support vector machine. Its accuracy in detecting voice pathologies reached 94.26% and 90.52% in male and female subjects, based on three representative Mel frequency cepstral coefficient features. When applied to the MEEI database for validation, the DNN also achieved a higher accuracy (99.32%) than the other two classification algorithms.
CONCLUSIONS: By stacking several layers of neurons with optimized weights, the proposed DNN algorithm can fully utilize the acoustic features and efficiently differentiate between normal and pathological voice samples. Based on this pilot study, future research may proceed to explore more application of DNN from laboratory and clinical perspectives.
Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Neoplasm; Nodule; Polyp; Spasmodic dysphonia; Sulcus

Mesh:

Year:  2018        PMID: 29567049     DOI: 10.1016/j.jvoice.2018.02.003

Source DB:  PubMed          Journal:  J Voice        ISSN: 0892-1997            Impact factor:   2.009


  8 in total

1.  Continuous Speech for Improved Learning Pathological Voice Disorders.

Authors:  Syu-Siang Wang; Chi-Te Wang; Chih-Chung Lai; Yu Tsao; Shih-Hau Fang
Journal:  IEEE Open J Eng Med Biol       Date:  2022-02-14

2.  Using SincNet for Learning Pathological Voice Disorders.

Authors:  Chao-Hsiang Hung; Syu-Siang Wang; Chi-Te Wang; Shih-Hau Fang
Journal:  Sensors (Basel)       Date:  2022-09-02       Impact factor: 3.847

3.  Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery.

Authors:  Rytis Maskeliūnas; Audrius Kulikajevas; Robertas Damaševičius; Kipras Pribuišis; Nora Ulozaitė-Stanienė; Virgilijus Uloza
Journal:  Cancers (Basel)       Date:  2022-05-11       Impact factor: 6.575

4.  Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning.

Authors:  Norah Saleh Alghamdi; Mohammed Zakariah; Vinh Truong Hoang; Mohammad Mamun Elahi
Journal:  Comput Math Methods Med       Date:  2022-04-04       Impact factor: 2.238

5.  Computer-aided diagnosis of external and middle ear conditions: A machine learning approach.

Authors:  Michelle Viscaino; Juan C Maass; Paul H Delano; Mariela Torrente; Carlos Stott; Fernando Auat Cheein
Journal:  PLoS One       Date:  2020-03-12       Impact factor: 3.240

6.  Reduction of the Multipath Propagation Effect in a Hydroacoustic Channel Using Filtration in Cepstrum.

Authors:  Agnieszka Czapiewska; Andrzej Luksza; Ryszard Studanski; Andrzej Zak
Journal:  Sensors (Basel)       Date:  2020-01-29       Impact factor: 3.576

7.  Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks.

Authors:  Adrián Vázquez-Romero; Ascensión Gallardo-Antolín
Journal:  Entropy (Basel)       Date:  2020-06-20       Impact factor: 2.524

8.  Ambulatory Phonation Monitoring With Wireless Microphones Based on the Speech Energy Envelope: Algorithm Development and Validation.

Authors:  Chi-Te Wang; Ji-Yan Han; Shih-Hau Fang; Ying-Hui Lai
Journal:  JMIR Mhealth Uhealth       Date:  2020-12-03       Impact factor: 4.773

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.