| Literature DB >> 36247809 |
Musatafa Abbas Abbood Albadr1, Sabrina Tiun1, Masri Ayob1, Fahad Taha Al-Dhief2.
Abstract
COVID-19 (coronavirus disease 2019) is an ongoing global pandemic caused by severe acute respiratory syndrome coronavirus 2. Recently, it has been demonstrated that the voice data of the respiratory system (i.e., speech, sneezing, coughing, and breathing) can be processed via machine learning (ML) algorithms to detect respiratory system diseases, including COVID-19. Consequently, many researchers have applied various ML algorithms to detect COVID-19 by using voice data from the respiratory system. However, most of the recent COVID-19 detection systems have worked on a limited dataset. In other words, the systems utilize cough and breath voices only and ignore the voices of the other respiratory system, such as speech and vowels. In addition, another issue that should be considered in COVID-19 detection systems is the classification accuracy of the algorithm. The particle swarm optimization-extreme learning machine (PSO-ELM) is an ML algorithm that can be considered an accurate and fast algorithm in the process of classification. Therefore, this study proposes a COVID-19 detection system by utilizing the PSO-ELM as a classifier and mel frequency cepstral coefficients (MFCCs) for feature extraction. In this study, respiratory system voice samples were taken from the Corona Hack Respiratory Sound Dataset (CHRSD). The proposed system involves thirteen different scenarios: breath deep, breath shallow, all breath, cough heavy, cough shallow, all cough, count fast, count normal, all count, vowel a, vowel e, vowel o, and all vowels. The experimental results demonstrated that the PSO-ELM was capable of attaining the highest accuracy, reaching 95.83%, 91.67%, 89.13%, 96.43%, 92.86%, 88.89%, 96.15%, 96.43%, 88.46%, 96.15%, 96.15%, 95.83%, and 82.89% for breath deep, breath shallow, all breath, cough heavy, cough shallow, all cough, count fast, count normal, all count, vowel a, vowel e, vowel o, and all vowel scenarios, respectively. The PSO-ELM is an efficient technique for the detection of COVID-19 utilizing voice data from the respiratory system.Entities:
Keywords: Mel frequency cepstral coefficients; Particle swarm optimization-extreme learning machine
Year: 2022 PMID: 36247809 PMCID: PMC9554849 DOI: 10.1007/s12559-022-10063-x
Source DB: PubMed Journal: Cognit Comput ISSN: 1866-9956 Impact factor: 4.890
Summary of the previous works on COVID-19 detection by using respiratory system voice data
| [ | Coswara dataset | SRMSF | ResNet18 | Cough | 0.72 AUC | 1. The output results are not encouraging and need more improvement 2. Only one scenario has been conducted while other scenarios such as breath, counting fast, counting normal, and vowels (i.e., i, e, and o) have been ignored |
| [ | Collected from YouTube | MFBF | SVM | Speech | 88.60% accuracy | |
| [ | Own collected dataset | MFCC | ResNet50 | Cough | 94.2% specificity | |
| [ | Crowdsourced dataset | Several handcrafted features | LR | Cough and breath | 80.00% precision for cough and 69.00% precision for breath | 1. The output results are not encouraging and need more improvement 2. Only two scenarios have been conducted while other scenarios such as counting fast, counting normal, and vowels (i.e., i, e, and o) have been ignored |
| [ | ESC-50 dataset and COVID-19 | MSF | CNN | Speech, cough, and overall | Accuracy of 92.00%, 92.85%, and 88.00% for speech, cough, and overall, respectively | 1. The output results are not encouraging and need more improvement 2. More scenarios such as counting fast, counting normal, and vowels (i.e., i, e, and o) have been ignored |
| [ | Collected by CMU | MFCC-VMO-RP-RQA | XGBoost | Cough and vowel “ah” | 97.00% accuracy for cough and 99.00% accuracy for vowel “ah” | More scenarios such as counting fast, counting normal, and vowels (i.e., i, e, and o) have been ignored |
| [ | Own collected dataset | MFCC | LSTM | Cough, breath, and speech | Accuracy of 97.00% for cough, 98.20% for breath, and 88.20% for speech |
Fig. 1Block diagram of the proposed COVID-19 detection system
Description of the whole dataset
| Class | Number of samples | Class | Number of samples | Class | Number of samples | ||
|---|---|---|---|---|---|---|---|
| Healthy | 39 | Healthy | 39 | Healthy | 78 | 1 | |
| COVID-19 | 39 | COVID-19 | 39 | COVID-19 | 78 | 2 | |
| Class | Number of samples | Class | Number of samples | Class | Number of samples | ||
| Healthy | 45 | Healthy | 45 | Healthy | 90 | 1 | |
| COVID-19 | 45 | COVID-19 | 45 | COVID-19 | 90 | 2 | |
| Class | Number of samples | Class | Number of samples | Class | Number of samples | ||
| Healthy | 42 | Healthy | 45 | Healthy | 87 | 1 | |
| COVID-19 | 42 | COVID-19 | 45 | COVID-19 | 87 | 2 | |
| Class | Number of samples | Class | Number of samples | ||||
| Healthy | 43 | Healthy | 43 | 1 | |||
| COVID-19 | 43 | COVID-19 | 43 | 2 | |||
| Class | Number of samples | Class | Number of samples | ||||
| Healthy | 41 | Healthy | 127 | 1 | |||
| COVID-19 | 41 | COVID-19 | 127 | 2 | |||
Fig. 2The process of MFCC feature extraction
The value of the MFCC variables that have been used in this study
| 44,100 Hz | |
| 25 ms | |
| 10 ms | |
| 1103 | |
| 441 | |
| Number of MFCC features | 13 |
Fig. 3PSO-ELM algorithm flowchart
The ELM and PSO parameters
| Parameters | Values | Parameters | Values |
|---|---|---|---|
| Input weights and biases | Population (particles) | Contain positions and velocities | |
| Output weights | Position | Generated randomly at the beginning, in the range of [− 1, 1] for input weights and [0, 1] for biases | |
| Input weight | In the range of [− 1, 1] | Velocity | Start with zero values, and it is limited to the range of [− 2, 2] |
| Biases values | In the range of [0, 1] | 50 | |
| Input neurons number ( | Input attributes | 0.7289, 1.496, 1.496 | |
| Hidden neurons number ( | 100–600, with 50 increment step | 100 | |
| Output neurons | Class values | Best particle position | |
| Activation function | Sigmoid | Best position of all particles | |
The dataset that was used in deep breath, shallow breath, and all breath scenarios
| Class | Number of all samples | Number of training samples | Number of testing samples | Label |
|---|---|---|---|---|
| Healthy | 39 | 27 | 12 | 1 |
| COVID-19 | 39 | 27 | 12 | 2 |
| Healthy | 39 | 27 | 12 | 1 |
| COVID-19 | 39 | 27 | 12 | 2 |
| Healthy | 78 | 55 | 23 | 1 |
| COVID-19 | 78 | 55 | 23 | 2 |
Fig. 4The ROC of PSO-ELM best results for the deep breath, shallow breath, and all breath scenarios
The dataset that was utilized in heavy cough, shallow cough, and all cough scenarios
| Class | Number of all samples | Number of training samples | Number of testing samples | Label |
|---|---|---|---|---|
| Healthy | 45 | 31 | 14 | 1 |
| COVID-19 | 45 | 31 | 14 | 2 |
| Healthy | 45 | 31 | 14 | 1 |
| COVID-19 | 45 | 31 | 14 | 2 |
| Healthy | 90 | 63 | 27 | 1 |
| COVID-19 | 90 | 63 | 27 | 2 |
Fig. 5The ROC of PSO-ELM best results for the heavy cough, shallow cough, and all cough scenarios
The dataset that was used in count fast, count normal, and all count scenarios
| Class | Number of all samples | Number of training samples | Number of testing samples | Label |
|---|---|---|---|---|
| Healthy | 42 | 29 | 13 | 1 |
| COVID-19 | 42 | 29 | 13 | 2 |
| Healthy | 45 | 31 | 14 | 1 |
| COVID-19 | 45 | 31 | 14 | 2 |
| Healthy | 87 | 61 | 26 | 1 |
| COVID-19 | 87 | 61 | 26 | 2 |
Fig. 6The ROC of PSO-ELM best results for the count fast, count normal, and all count scenarios
The dataset that was utilized in vowel a, vowel e, vowel o, and all vowel scenarios
| Class | Number of all samples | Number of training samples | Number of testing samples | Label |
|---|---|---|---|---|
| Healthy | 43 | 30 | 13 | 1 |
| COVID-19 | 43 | 30 | 13 | 2 |
| Healthy | 43 | 30 | 13 | 1 |
| COVID-19 | 43 | 30 | 13 | 2 |
| Healthy | 41 | 29 | 12 | 1 |
| COVID-19 | 41 | 29 | 12 | 2 |
| Healthy | 127 | 89 | 38 | 1 |
| COVID-19 | 127 | 89 | 38 | 2 |
Fig. 7The ROC of PSO-ELM best results for vowel a, vowel e, vowel o, and all vowel scenarios
The best results of the proposed PSO-ELM in all scenarios
| Hidden neurons | TP | TN | FP | FN | Accuracy | Precision | Recall | Specificity | F-measure | G-mean | Execution time (s) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 150 | 11 | 12 | 0 | 1 | 95.83 | 100.00 | 91.67 | 100.00 | 95.65 | 95.74 | 119.8567 |
| 300 | 10 | 12 | 0 | 2 | 91.67 | 100.00 | 83.33 | 100.00 | 90.91 | 91.29 | 119.5190 |
| 250 | 22 | 19 | 4 | 1 | 89.13 | 84.62 | 95.65 | 82.61 | 89.80 | 89.96 | 132.7839 |
| 400 | 14 | 13 | 1 | 0 | 96.43 | 93.33 | 100.00 | 92.86 | 96.55 | 96.61 | 128.3579 |
| 550 | 14 | 12 | 2 | 0 | 92.86 | 87.50 | 100.00 | 85.71 | 93.33 | 93.54 | 127.9549 |
| 200 | 26 | 22 | 5 | 1 | 88.89 | 83.87 | 96.30 | 81.48 | 89.66 | 89.87 | 176.8013 |
| 150 | 12 | 13 | 0 | 1 | 96.15 | 100.00 | 92.31 | 100.00 | 96.00 | 96.08 | 124.2523 |
| 350 | 13 | 14 | 0 | 1 | 96.43 | 100.00 | 92.86 | 100.00 | 96.30 | 96.36 | 125.7110 |
| 150 | 24 | 22 | 4 | 2 | 88.46 | 85.71 | 92.31 | 84.62 | 88.89 | 88.95 | 174.0120 |
| 100 | 13 | 12 | 1 | 0 | 96.15 | 92.86 | 100.00 | 92.31 | 96.30 | 96.36 | 107.1898 |
| 100 | 13 | 12 | 1 | 0 | 96.15 | 92.86 | 100.00 | 92.31 | 96.30 | 96.36 | 108.1326 |
| 150 | 12 | 11 | 1 | 0 | 95.83 | 92.31 | 100.00 | 91.67 | 96.00 | 96.08 | 106.6515 |
| 200 | 28 | 35 | 3 | 10 | 82.89 | 90.32 | 73.68 | 92.11 | 81.16 | 81.58 | 198.1815 |
The best experimental results of the basic ELM, NN, and RF techniques in all scenarios
| Technique | TP | TN | FP | FN | Accuracy | Precision | Recall | Specificity | F-measure | G-mean | Execution time (s) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ELM | 6 | 10 | 2 | 6 | 66.67 | 75.00 | 50.00 | 83.33 | 60.00 | 61.24 | 0.5236 |
| NN | 8 | 7 | 5 | 4 | 62.50 | 61.54 | 66.67 | 58.33 | 64.00 | 64.05 | 21.5222 |
| RF | 6 | 9 | 3 | 6 | 62.50 | 66.67 | 50.00 | 75.00 | 57.14 | 57.74 | 3.3139 |
| ELM | 7 | 10 | 2 | 5 | 70.83 | 77.78 | 58.33 | 83.33 | 66.67 | 67.36 | 0.5218 |
| NN | 10 | 6 | 6 | 2 | 66.67 | 62.50 | 83.33 | 50.00 | 71.43 | 72.17 | 24.5918 |
| RF | 9 | 7 | 5 | 3 | 66.67 | 64.29 | 75.00 | 58.33 | 69.23 | 69.44 | 3.3163 |
| ELM | 18 | 12 | 11 | 5 | 65.22 | 62.07 | 78.26 | 52.17 | 69.23 | 69.70 | 0.9417 |
| NN | 16 | 11 | 12 | 7 | 58.70 | 57.14 | 69.57 | 47.83 | 62.75 | 63.05 | 33.4789 |
| RF | 17 | 11 | 12 | 6 | 60.87 | 58.62 | 73.91 | 47.83 | 65.38 | 65.82 | 3.8154 |
| ELM | 11 | 10 | 4 | 3 | 75.00 | 73.33 | 78.57 | 71.43 | 75.86 | 75.91 | 0.6609 |
| NN | 11 | 6 | 8 | 3 | 60.71 | 57.89 | 78.57 | 42.86 | 66.67 | 67.45 | 22.4395 |
| RF | 10 | 9 | 5 | 4 | 67.86 | 66.67 | 71.43 | 64.29 | 68.97 | 69.01 | 3.6511 |
| ELM | 8 | 10 | 4 | 6 | 64.29 | 66.67 | 57.14 | 71.43 | 61.54 | 61.72 | 0.6421 |
| NN | 6 | 11 | 3 | 8 | 60.71 | 66.67 | 42.86 | 78.57 | 52.17 | 53.45 | 22.6169 |
| RF | 7 | 9 | 5 | 7 | 57.14 | 58.33 | 50.00 | 64.29 | 53.85 | 54.01 | 3.7370 |
| ELM | 19 | 16 | 11 | 8 | 64.81 | 63.33 | 70.37 | 59.26 | 66.67 | 66.76 | 1.0641 |
| NN | 18 | 14 | 13 | 9 | 59.26 | 58.06 | 66.67 | 51.85 | 62.07 | 62.22 | 34.6208 |
| RF | 19 | 14 | 13 | 8 | 61.11 | 59.38 | 70.37 | 51.85 | 64.41 | 64.64 | 4.1569 |
| ELM | 8 | 10 | 3 | 5 | 69.23 | 72.73 | 61.54 | 76.92 | 66.67 | 66.90 | 0.5920 |
| NN | 4 | 11 | 2 | 9 | 57.69 | 66.67 | 30.77 | 84.62 | 42.11 | 45.29 | 28.5440 |
| RF | 9 | 8 | 5 | 4 | 65.38 | 64.29 | 69.23 | 61.54 | 66.67 | 66.71 | 4.3687 |
| ELM | 11 | 8 | 6 | 3 | 67.86 | 64.71 | 78.57 | 57.14 | 70.97 | 71.30 | 0.6178 |
| NN | 14 | 6 | 8 | 0 | 71.43 | 63.64 | 100.00 | 42.86 | 77.78 | 79.77 | 25.1147 |
| RF | 10 | 8 | 6 | 4 | 64.29 | 62.50 | 71.43 | 57.14 | 66.67 | 66.82 | 4.4511 |
| ELM | 16 | 17 | 9 | 10 | 63.46 | 64.00 | 61.54 | 65.38 | 62.75 | 62.76 | 1.0476 |
| NN | 16 | 17 | 9 | 10 | 63.46 | 64.00 | 61.54 | 65.38 | 62.75 | 62.76 | 37.9394 |
| RF | 12 | 19 | 7 | 14 | 59.62 | 63.16 | 46.15 | 73.08 | 53.33 | 53.99 | 4.8791 |
| ELM | 6 | 11 | 2 | 7 | 65.38 | 75.00 | 46.15 | 84.62 | 57.14 | 58.83 | 0.4631 |
| NN | 7 | 8 | 5 | 6 | 57.69 | 58.33 | 53.85 | 61.54 | 56.00 | 56.04 | 26.5335 |
| RF | 8 | 6 | 7 | 5 | 53.85 | 53.33 | 61.54 | 46.15 | 57.14 | 57.29 | 3.3466 |
| ELM | 7 | 9 | 4 | 6 | 61.54 | 63.64 | 53.85 | 69.23 | 58.33 | 58.54 | 0.4873 |
| NN | 9 | 8 | 5 | 4 | 65.38 | 64.29 | 69.23 | 61.54 | 66.67 | 66.71 | 30.6789 |
| RF | 5 | 10 | 3 | 8 | 57.69 | 62.50 | 38.46 | 76.92 | 47.62 | 49.03 | 3.1672 |
| ELM | 10 | 5 | 7 | 2 | 62.50 | 58.82 | 83.33 | 41.67 | 68.97 | 70.01 | 0.4789 |
| NN | 9 | 4 | 8 | 3 | 54.17 | 52.94 | 75.00 | 33.33 | 62.07 | 63.01 | 22.5454 |
| RF | 7 | 7 | 5 | 5 | 58.33 | 58.33 | 58.33 | 58.33 | 58.33 | 58.33 | 3.1421 |
| ELM | 25 | 22 | 16 | 13 | 61.84 | 60.98 | 65.79 | 57.89 | 63.29 | 63.34 | 1.5370 |
| NN | 22 | 21 | 17 | 16 | 56.58 | 56.41 | 57.89 | 55.26 | 57.14 | 57.15 | 53.8420 |
| RF | 29 | 17 | 21 | 9 | 60.53 | 58.00 | 76.32 | 44.74 | 65.91 | 0.6653 | 4.5728 |
The best experimental results of the LSTM and XGBoost approaches in all scenarios
| Technique | TP | TN | FP | FN | Accuracy | Precision | Recall | Specificity | F-measure | G-mean | Execution time (s) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| LSTM | 8 | 11 | 1 | 4 | 79.17 | 88.89 | 66.67 | 91.67 | 76.19 | 76.98 | 116.448 |
| XGBoost | 10 | 7 | 5 | 2 | 70.83 | 66.67 | 83.33 | 58.33 | 74.07 | 74.54 | 63.124 |
| LSTM | 8 | 9 | 3 | 4 | 70.83 | 72.73 | 66.67 | 75.00 | 69.57 | 69.63 | 113.005 |
| XGBoost | 8 | 9 | 3 | 4 | 70.83 | 72.73 | 66.67 | 75.00 | 69.57 | 69.63 | 65.930 |
| LSTM | 17 | 11 | 12 | 6 | 60.87 | 58.62 | 73.91 | 47.83 | 65.38 | 65.82 | 128.850 |
| XGBoost | 14 | 15 | 8 | 9 | 63.04 | 63.63 | 60.87 | 65.22 | 62.22 | 62.24 | 85.102 |
| LSTM | 10 | 11 | 3 | 4 | 75.00 | 76.92 | 71.43 | 78.57 | 74.07 | 74.12 | 112.323 |
| XGBoost | 10 | 8 | 6 | 4 | 64.29 | 62.50 | 71.43 | 57.14 | 66.67 | 66.82 | 55.048 |
| LSTM | 7 | 9 | 5 | 7 | 57.14 | 58.33 | 50.00 | 64.29 | 53.85 | 54.01 | 114.679 |
| XGBoost | 13 | 8 | 6 | 1 | 75.00 | 68.42 | 92.86 | 57.14 | 78.79 | 79.71 | 58.063 |
| LSTM | 17 | 18 | 9 | 10 | 64.81 | 65.38 | 62.96 | 66.67 | 64.15 | 64.16 | 127.551 |
| XGBoost | 16 | 18 | 9 | 11 | 62.96 | 64.00 | 59.26 | 66.67 | 61.54 | 61.58 | 82.043 |
| LSTM | 8 | 9 | 4 | 5 | 65.38 | 66.67 | 61.54 | 69.23 | 64.00 | 64.05 | 117.751 |
| XGBoost | 9 | 8 | 5 | 4 | 65.38 | 64.29 | 69.23 | 61.54 | 66.67 | 66.71 | 60.707 |
| LSTM | 10 | 8 | 6 | 4 | 64.29 | 62.50 | 71.43 | 57.14 | 66.67 | 66.82 | 115.631 |
| XGBoost | 8 | 10 | 4 | 6 | 64.29 | 66.67 | 57.14 | 71.43 | 61.54 | 61.72 | 58.609 |
| LSTM | 11 | 23 | 3 | 15 | 65.38 | 78.57 | 42.31 | 88.46 | 55.00 | 57.66 | 126.962 |
| XGBoost | 15 | 16 | 10 | 11 | 59.62 | 60.00 | 57.69 | 61.54 | 58.82 | 58.83 | 90.167 |
| LSTM | 6 | 10 | 3 | 7 | 61.54 | 66.67 | 46.15 | 76.92 | 54.55 | 55.47 | 101.736 |
| XGBoost | 8 | 7 | 6 | 5 | 57.69 | 57.14 | 61.54 | 53.85 | 59.26 | 59.30 | 61.645 |
| LSTM | 6 | 11 | 2 | 7 | 65.38 | 75.00 | 46.15 | 84.62 | 57.14 | 58.83 | 103.035 |
| XGBoost | 6 | 10 | 3 | 7 | 61.54 | 66.67 | 46.15 | 76.92 | 54.55 | 55.47 | 60.859 |
| LSTM | 7 | 8 | 4 | 5 | 62.50 | 63.64 | 58.33 | 66.67 | 60.87 | 60.93 | 98.829 |
| XGBoost | 7 | 9 | 3 | 5 | 66.67 | 70.00 | 58.33 | 75.00 | 63.64 | 63.90 | 59.786 |
| LSTM | 26 | 20 | 18 | 12 | 60.53 | 59.09 | 68.42 | 52.63 | 63.41 | 63.59 | 133.452 |
| XGBoost | 22 | 24 | 14 | 16 | 60.53 | 61.11 | 57.89 | 63.16 | 59.46 | 59.48 | 120.183 |
The comparison of accuracy between the proposed PSO-ELM and other previous works
| PSO-ELM | 89.13% | PSO-ELM | 88.89% |
| SVM in [ | 81.50% | SVM [ | 85.70% |
| RF [ | 75.17% | RF [ | 70.69% |
| CNN [ | 70.37% | CNN [ | 88.48% |
| RF [ | 86.79% | Ensemble model [ | 77.10% |
| Technique | Accuracy | Technique | Accuracy |
| PSO-ELM | 95.83% | PSO-ELM | 91.67% |
| SVM [ | 62.30% | SVM in [ | 62.20% |
| BI-ATGRU [ | 94.50% | ||
| Technique | Accuracy | Technique | Accuracy |
| PSO-ELM | 96.43% | PSO-ELM | 92.86% |
| SVM [ | 72.30% | SVM [ | 74.10% |
| Technique | Accuracy | Technique | Accuracy |
| PSO-ELM | 96.15% | PSO-ELM | |
| SVM [ | 73.50% | SVM [ | 72.50% |
| Technique | Accuracy | Technique | Accuracy |
| PSO-ELM | 96.15% | PSO-ELM | 96.15% |
| SVM [ | 59.30% | SVM [ | 68.20% |
| Technique | |||
| PSO-ELM | 95.83% | ||
| SVM [ | 69.20 | ||