Literature DB >> 36081092

Using SincNet for Learning Pathological Voice Disorders.

Chao-Hsiang Hung1, Syu-Siang Wang1, Chi-Te Wang2, Shih-Hau Fang1.   

Abstract

Deep learning techniques such as convolutional neural networks (CNN) have been successfully applied to identify pathological voices. However, the major disadvantage of using these advanced models is the lack of interpretability in explaining the predicted outcomes. This drawback further introduces a bottleneck for promoting the classification or detection of voice-disorder systems, especially in this pandemic period. In this paper, we proposed using a series of learnable sinc functions to replace the very first layer of a commonly used CNN to develop an explainable SincNet system for classifying or detecting pathological voices. The applied sinc filters, a front-end signal processor in SincNet, are critical for constructing the meaningful layer and are directly used to extract the acoustic features for following networks to generate high-level voice information. We conducted our tests on three different Far Eastern Memorial Hospital voice datasets. From our evaluations, the proposed approach achieves the highest 7%-accuracy and 9%-sensitivity improvements from conventional methods and thus demonstrates superior performance in predicting input pathological waveforms of the SincNet system. More importantly, we intended to give possible explanations between the system output and the first-layer extracted speech features based on our evaluated results.

Entities:  

Keywords:  SincNet; classification; convolutional neural network; pathological voice; sinc functions

Mesh:

Year:  2022        PMID: 36081092      PMCID: PMC9460101          DOI: 10.3390/s22176634

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.847


  8 in total

1.  From Local Explanations to Global Understanding with Explainable AI for Trees.

Authors:  Scott M Lundberg; Gabriel Erion; Hugh Chen; Alex DeGrave; Jordan M Prutkin; Bala Nair; Ronit Katz; Jonathan Himmelfarb; Nisha Bansal; Su-In Lee
Journal:  Nat Mach Intell       Date:  2020-01-17

2.  Approximated and User Steerable tSNE for Progressive Visual Analytics.

Authors:  Nicola Pezzotti; Boudewijn P F Lelieveldt; Laurens Van Der Maaten; Thomas Hollt; Elmar Eisemann; Anna Vilanova
Journal:  IEEE Trans Vis Comput Graph       Date:  2016-05-19       Impact factor: 4.579

3.  Convolutional Neural Networks for Pathological Voice Detection.

Authors:  Huiyi Wu; John Soraghan; Anja Lowit; Gaetano Di Caterina
Journal:  Annu Int Conf IEEE Eng Med Biol Soc       Date:  2018-07

Review 4.  A Survey on Machine Learning Approaches for Automatic Detection of Voice Disorders.

Authors:  Sarika Hegde; Surendra Shetty; Smitha Rai; Thejaswi Dodderi
Journal:  J Voice       Date:  2018-10-11       Impact factor: 2.009

5.  Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach.

Authors:  Shih-Hau Fang; Yu Tsao; Min-Jing Hsiao; Ji-Ying Chen; Ying-Hui Lai; Feng-Chuan Lin; Chi-Te Wang
Journal:  J Voice       Date:  2018-03-19       Impact factor: 2.009

6.  Continuous Speech for Improved Learning Pathological Voice Disorders.

Authors:  Syu-Siang Wang; Chi-Te Wang; Chih-Chung Lai; Yu Tsao; Shih-Hau Fang
Journal:  IEEE Open J Eng Med Biol       Date:  2022-02-14

7.  Prevalence of voice disorders in teachers and the general population.

Authors:  Nelson Roy; Ray M Merrill; Susan Thibeault; Rahul A Parsa; Steven D Gray; Elaine M Smith
Journal:  J Speech Lang Hear Res       Date:  2004-04       Impact factor: 2.297

8.  Using Ambulatory Voice Monitoring to Investigate Common Voice Disorders: Research Update.

Authors:  Daryush D Mehta; Jarrad H Van Stan; Matías Zañartu; Marzyeh Ghassemi; John V Guttag; Víctor M Espinoza; Juan P Cortés; Harold A Cheyne; Robert E Hillman
Journal:  Front Bioeng Biotechnol       Date:  2015-10-16
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.