Using SincNet for Learning Pathological Voice Disorders.

Chao-Hsiang Hung¹, Syu-Siang Wang¹, Chi-Te Wang², Shih-Hau Fang¹.

Abstract

Deep learning techniques such as convolutional neural networks (CNNs) have been successfully applied to identify pathological voices. However, a major disadvantage of these models is their lack of interpretability: they do not explain the predicted outcomes. This drawback is a bottleneck for the adoption of voice-disorder classification and detection systems, especially during the pandemic period. In this paper, we propose replacing the first layer of a commonly used CNN with a series of learnable sinc functions to develop an explainable SincNet system for classifying or detecting pathological voices. The sinc filters act as a front-end signal processor in SincNet: they make the first layer meaningful and directly extract the acoustic features from which the subsequent network layers generate high-level voice information. We conducted our tests on three different Far Eastern Memorial Hospital voice datasets. In our evaluations, the proposed approach improves accuracy by up to 7% and sensitivity by up to 9% over conventional methods, demonstrating the superior performance of the SincNet system in predicting pathological voices from input waveforms. More importantly, we offer possible explanations of the relationship between the system output and the speech features extracted by the first layer, based on our evaluation results.
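
To make the idea of a sinc-based first layer concrete, the following is a minimal NumPy sketch of band-pass filters defined only by a low and a high cutoff frequency, applied directly to a raw waveform. It is not the authors' implementation: the function names, band edges, kernel size, sample rate, and the choice of a Hamming window are all assumptions made for illustration, and the cutoffs are fixed here, whereas in a SincNet-style layer they would be learned during training.

```python
import numpy as np

def sinc_bandpass(f_low_hz, f_high_hz, kernel_size=101, sample_rate=16000):
    """Windowed band-pass FIR kernel set by two cutoff frequencies.

    All parameter names and defaults are illustrative assumptions.
    """
    # Normalised cutoffs in cycles per sample.
    f1 = f_low_hz / sample_rate
    f2 = f_high_hz / sample_rate
    # Symmetric time axis centred on zero (kernel_size is assumed odd).
    n = np.arange(kernel_size) - (kernel_size - 1) / 2
    # The difference of two low-pass sinc filters is a band-pass filter.
    # Note that np.sinc(x) computes sin(pi * x) / (pi * x).
    kernel = 2 * f2 * np.sinc(2 * f2 * n) - 2 * f1 * np.sinc(2 * f1 * n)
    # Taper the truncated kernel to reduce ripple (Hamming is an assumption).
    return kernel * np.hamming(kernel_size)

def sinc_layer(waveform, bands, kernel_size=101, sample_rate=16000):
    """Filter a 1-D waveform with one sinc band-pass kernel per band.

    Returns an array of shape (len(bands), len(waveform)).
    """
    kernels = [sinc_bandpass(lo, hi, kernel_size, sample_rate) for lo, hi in bands]
    return np.stack([np.convolve(waveform, k, mode="same") for k in kernels])

if __name__ == "__main__":
    sample_rate = 16000
    t = np.arange(sample_rate) / sample_rate
    tone = np.sin(2 * np.pi * 1000 * t)  # synthetic 1 kHz test tone, not real voice data
    features = sinc_layer(tone, [(800, 1200), (3000, 4000)])
    print((features ** 2).sum(axis=1))  # energy per band
```

Because each filter is described by just two frequencies, its pass band can be read off directly, which is what makes a first layer of this kind easier to interpret than unconstrained convolution kernels.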

Keywords: SincNet; classification; convolutional neural network; pathological voice; sinc functions

Year: 2022 PMID: 36081092 PMCID: PMC9460101 DOI: 10.3390/s22176634

Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
