| Literature DB >> 35444694 |
Mohammed Usman1, Vinit Kumar Gunjan2, Mohd Wajid3, Mohammed Zubair1, Kazy Noor-E-Alam Siddiquee4.
Abstract
The use of speech as a biomedical signal for diagnosing COVID-19 is investigated using statistical analysis of speech spectral features and classification algorithms based on machine learning. It is established that spectral features of speech, obtained by computing the short-time Fourier Transform (STFT), get altered in a statistical sense as a result of physiological changes. These spectral features are then used as input features to machine learning-based classification algorithms to classify them as coming from a COVID-19 positive individual or not. Speech samples from healthy as well as "asymptomatic" COVID-19 positive individuals have been used in this study. It is shown that the RMS error of statistical distribution fitting is higher in the case of speech samples of COVID-19 positive speech samples as compared to the speech samples of healthy individuals. Five state-of-the-art machine learning classification algorithms have also been analyzed, and the performance evaluation metrics of these algorithms are also presented. The tuning of machine learning model parameters is done so as to minimize the misclassification of COVID-19 positive individuals as being COVID-19 negative since the cost associated with this misclassification is higher than the opposite misclassification. The best performance in terms of the "recall" metric is observed for the Decision Forest algorithm which gives a recall value of 0.7892.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35444694 PMCID: PMC9014833 DOI: 10.1155/2022/6093613
Source DB: PubMed Journal: Comput Intell Neurosci
Attributes of data used in this study.
| Age group (years) | No. of recordings | Sampling rate (sps) | Quantization depth (bits) | Audio bit rate (kbps) | Audio format | Other parameters measured | |
|---|---|---|---|---|---|---|---|
| Healthy | 25–45 | 84 | 16000 | 16 | 256 | .wav | Heart rate, SpO2 |
| COVID+ | 32–57 | 22 |
Figure 1Speech preprocessing.
Figure 2Biological parameters correlated to speech and COVID-19 symptoms.
Figure 3Block diagram of the methodology used.
Evaluation metrics for binary classification.
| Evaluation metric | Definition | Notations |
|---|---|---|
| Binary classification | ||
| Precision (PRE) | PRE = | tp–Total no. of true positive samples |
| Recall (REC) | REC = | tn–Total no. of true negative samples |
| Accuracy (ACC) | ACC = | fp–Total no. of false positive samples |
| F1-score |
| fn – Total no. of false negative samples |
| Area under RoC curve | (AUC) AUC =∫01RoC | RoC - receiver operating characteristic curve |
Figure 4Statistical properties of speech STFT coefficients of a healthy person (without COVID-19).
Figure 5Statistical properties of speech STFT coefficients of an infected person (with COVID-19).
Performance metrics for binary classification algorithms.
| Classification algorithms | Optimal | Performance metrics | ||||
|---|---|---|---|---|---|---|
| ACC | PRE | REC | F1 score | AUC | ||
| BDT | No. of Leaves: 16 | 0.724 (0.048) | 0.714 (0.037) | 0.7037 (0.063) | 0.7088 (0.052) | 0.717 (0.053) |
| DF | Random split Count: 128 | 0.7317 (0.021) | 0.7421 (0.017) | 0.7892 (0.081) | 0.7649 (0.025) | 0.755 (0.017) |
| NN | Learning rate: 0.001 | 0.711 (0.031) | 0.7271 (0.043) | 0.7188 (0.018) | 0.7229 (0.029) | 0.7616 (0.095) |
| LoR | Optimization Tolerance: 1e-06 | 0.6741 (0.019) | 0.6805 (0.024) | 0.6161 (0.027) | 0.6467 (0.019) | 0.6874 (0.065) |
| SVM | Lambda – 0.001 | 0.694 (0.017) | 0.673 (0.074) | 0.6027 (0.019) | 0.6359 (0.011) | 0.6619 (0.037) |
Average RMS error of the fitted LD distributions.
| Category | Average RMS error of fitted LD |
|---|---|
| Without COVID-19 | 0.00354 |
| With COVID-19 | 0.01271 |
Test evaluation for binary classification
| Test sample | Actual class | Predicted class | ||||
|---|---|---|---|---|---|---|
| BDT | DF | NN | LoR | SVM | ||
| 1 | 0 | 0 | 1 | 0 | 0 | 1 |
| 2 | 1 | 1 | 1 |
| 1 | 1 |
| 3 | 1 | 1 | 1 | 1 |
|
|
| 4 | 0 | 1 | 0 | 1 | 1 | 1 |
| 5 | 1 | 1 | 1 |
| 1 | 1 |
| 6 | 0 | 0 | 0 | 0 | 0 | 0 |
| 7 | 1 |
|
| 1 | 1 | 1 |
| 8 | 1 | 1 | 1 | 1 |
|
|
| 9 | 1 |
|
|
| 1 | 1 |
| 10 | 0 | 1 | 0 | 0 | 1 | 1 |
| 11 | 1 |
| 1 | 1 | 1 | 1 |
| 12 | 0 | 0 | 1 | 1 | 0 | 0 |
| 13 | 0 | 1 | 0 | 1 | 1 | 0 |
| 14 | 1 | 1 | 1 | 1 |
| 1 |
| 15 | 0 | 0 | 0 | 0 | 0 | 1 |
| 16 | 0 | 1 | 1 | 0 | 0 | 1 |
| 17 | 1 | 1 | 1 |
|
|
|
| 18 | 1 |
| 1 | 1 | 1 | 1 |
| 19 | 0 | 0 | 0 | 1 | 0 | 1 |
| 20 | 1 | 1 | 1 | 1 | 0 | 1 |
Figure 6Devices used for measuring biomedical parameters along with speech for future analysis.
Figure 7Possible applications of the proposed research.