| Literature DB >> 33937404 |
Sidra Abid Syed1, Munaf Rashid2, Samreen Hussain3, Hira Zahid4.
Abstract
Diagnosis on the basis of a computerized acoustic examination may play an incredibly important role in early diagnosis and in monitoring and even improving effective pathological speech diagnostics. Various acoustic metrics test the health of the voice. The precision of these parameters also has to do with algorithms for the detection of speech noise. The idea is to detect the disease pathology from the voice. First, we apply the feature extraction on the SVD dataset. After the feature extraction, the system input goes into the 27 neuronal layer neural networks that are convolutional and recurrent neural network. We divided the dataset into training and testing, and after 10 k-fold validation, the reported accuracies of CNN and RNN are 87.11% and 86.52%, respectively. A 10-fold cross-validation is used to evaluate the performance of the classifier. On a Linux workstation with one NVidia Titan X GPU, program code was written in Python using the TensorFlow package.Entities:
Year: 2021 PMID: 33937404 PMCID: PMC8062167 DOI: 10.1155/2021/6635964
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Characteristics of SVD dataset.
| Dataset | SVD | ||
|---|---|---|---|
| Characteristics | Language | Sampling frequency | Text |
| German | 50 KHz | Vowel /a/ | |
| (1) Vowel /i/ | |||
| (2) Vowel /u/ | |||
| (3) Sentence | |||
Figure 1Architecture of CNN [22].
Figure 2Architecture of LSTM-RNN [25].
Figure 3Proposed in-layer model of CNN.
Figure 4Proposed in-layer model of LSTM-RNN.
Accuracy of RNN and CNN at 10-fold verification.
| Algorithm | Validation | Accuracy |
|---|---|---|
| CNN | 10-fold | 87.11% |
| LSTM-RNN | 10-fold | 86.52% |
Figure 5Accuracy and error evaluation of CNN in training and testing phase.
Figure 6Accuracy and error evaluation of RNN in training and testing phase.
Figure 7Confusion matrix of CNN.
Figure 8Confusion matrix of LSTM-RNN.