| Literature DB >> 35126765 |
Ezz El-Din Hemdan1, Walid El-Shafai2,3, Amged Sayed4.
Abstract
Today, there is a level of panic and chaos dominating the entire world due to the massive outbreak in the second wave of COVID-19 disease. As the disease has numerous symptoms ranging from a simple fever to the inability to breathe, which may lead to death. One of these symptoms is a cough which is considered one of the most common symptoms for COVID-19 disease. Recent research shows that the cough of a COVID-19 patient has distinct features that are different from other diseases. Consequently, the cough sound can be detected and classified to be used as a preliminary diagnosis of the COVID-19, which will help in reducing the spreading of that disease. The artificial intelligence (AI) engine can diagnose COVID-19 diseases by executing differential analysis of its inherent characteristics and comparing it to other non-COVID-19 coughs. However, the diagnosis of a COVID-19 infection by cough alone is an extremely challenging multidisciplinary problem. Therefore, this paper proposes a hybrid framework for efficiently COVID-19 detection and diagnosis using various ML algorithms from cough audio signals. The accuracy of this framework is improved with the utilization of the genetic algorithm with the ML techniques. We also assess the proposed system called CR19 for diagnosis on metrics such as precision, recall, F-measure. The results proved that the hybrid (GA-ML) technique provides superior results based on different evaluation metrics compared with ML approaches such as LR, LDA, KNN, CART, NB, and SVM. The proposed framework achieve an accuracy equal to 92.19%, 94.32%, 97.87%, 92.19%, 91.48%, and 93.61% in compared with the ML are 90.78, 92.90, 95.74, 87.94, 81.56, and 92.198 for LR, LDA, KNN, CART, NB, and SVM respectively. The proposed framework will efficiently help the physicians provide a proper medical decision regarding the COVID-19 analysis, thereby saving more lives. Therefore, this CR19 framework can be a clinical decision assistance tool used to channel clinical testing and treatment to those who need it the most, thereby saving more lives.Entities:
Keywords: AI; Automated diagnosis; COVID-19; Classification; Cough; GA-ML technique; Genetic algorithm; Machine learning
Year: 2022 PMID: 35126765 PMCID: PMC8803577 DOI: 10.1007/s12652-022-03732-0
Source DB: PubMed Journal: J Ambient Intell Humaniz Comput
Fig. 1Reported cases in some countries (WHO Nov 28th 2020)
Fig. 2Flowchart of genetic algorithm
Summary of contribution and limitations for some existing work in COVID-19 diagnosis using cough signals
| Work | Contribution | Limitations |
|---|---|---|
| Imran et al. ( | They provided a system called AI4COVID-19. This system is an AI-enabled preliminary diagnosis for COVID-19 from cough samples via an app | They focused only on using cough samples via an app. So they can use other acquisition methods for collecting and testing their proposed system for better performance analysis |
| Pal and Sankarasubbu ( | An interpretable and COVID-19 diagnosis AI framework is devised and developed based on the cough sounds features and symptoms metadata | They worked on a medical dataset containing symptoms and demographic data of 30,000 audio segments, 328 cough sounds from 150 patients. So, more datasets can be acceptable for testing their work |
| Pahar et al. ( | A machine learning-based COVID-19 cough classifier which can discriminate COVID-19 coughs recorded on a smartphone | Still, the performance of their model needs to be improved by performing feature selection and other preprocessing processes |
| Deshpande et al. ( | They provided an overview of research on human audio signals using AI techniques to screen, diagnose, monitor, and spread awareness about COVID-19 | They have to work in provide more comparative analysis of existing machine and deep learning models with their performance in COVID-19 coughs from different datasets |
| Trivedy et al. ( | The offered the design and development of a low-cost, portable, smartphone-enabled spirometer with an automatic disease classification using CNN | They have to use more number of the dataset and more experimental investigation in evaluations the proposed system with more algorithms |
| Melek ( | The offered a system for the diagnosis of COVID-19 coughs based on the radial basis function (RBF) kernel function of SVM and the MFCC method | Their work focused only on using the SVM with the MFCC method, so they can be extended to evaluate their system with more models and features extraction methods for cough audio signals |
| Grant et al. ( | They proposed an approach to analyzing sounds to unobtrusively detect COVID-19 based on MFCCs and RASTA-PLP features with classifiers RF and DNN | They worked only on the MFCCs and RASTA-PLP features so more features can be extracted from the audio signals plus testing with more number of classifiers over different datasets |
| Andreu-Perez et al. ( | They developed a generic method based on EMD with subsequent classification based on a tensor of audio features and DeepCough classifier | They proposed a web tool and underpinning algorithm, so it’s better to develop a mobile-based app besides the web tool to empower their approach |
| Laguarta et al. ( | They built a data collection pipeline of COVID-19 cough recordings through their website to train their MIT open voice model through using CNN-based models | They worked only on CNN-based models to test their collected data with other machine learning models and compare them with existing proposed models |
Fig. 3Proposed CR19 framework to diagnose COVID-19 in cough audio signals
Fig. 4Proposed remote monitoring framework for Early COVID-19 detection and diagnosis
Confusion matrix
| Predicted negative | Predicted positive | |
|---|---|---|
| Actual negative | ||
| Actual positive |
Fig. 5Precision, recall, and F-measure comparison for positive cases of six different classifiers
Fig. 6Precision, recall, and F-measure comparison for negative cases of six different classifiers
Fig. 7Confusion matrix of all machine learning algorithms in the proposed framework
Fig. 8Confusion matrix of hybrid GA-ML algorithms in the proposed framework
Fig. 9Precision, recall, and F-measure comparison for positive cases of hybrid GA-ML for different classifiers
Fig. 10Precision, recall, and F-measure comparison for negative cases of hybrid GA-ML for different classifiers
Fig. 11The accuracy of classifier for ML and GA-ML framework
Fig. 12The percentage of increasing accuracy GA-ML framework in compared with classical ML framework
Comparative study between ML and proposed hybrid GA-ML for COVID-19 detection based cough data as positive or negative status
| Metrics | Patient status | Algorithms | |||||
|---|---|---|---|---|---|---|---|
| LR/GA-LR | LDA/GA-LDA | KNN/GA-KNN | CART/GA-CART | NB/GA-NB | SVM/GA-SVM | ||
| Precision | Negative | 0.91/0.92 | 0.94/0.95 | 0.97/0.98 | 0.92/0.94 | 0.94/0.94 | 0.92/0.93 |
| Positive | 0.80/1.00 | 0.80/0.90 | 0.86/1.00 | 0.45/0.73 | 0.32/0.67 | 1.00/1.00 | |
| Recall | Negative | 0.99/1.00 | 0.98/0.99 | 0.98/1.00 | 0.95/0.98 | 0.85/0.97 | 1.00/1.00 |
| Positive | 0.25/0.31 | 0.50/0.56 | 0.75/0.81 | 0.31/0.50 | 0.56/0.50 | 0.31/0.44 | |
| F1-score | Negative | 0.95/0.96 | 0.96/0.97 | 0.98/0.99 | 0.93/0.96 | 0.89/0.95 | 0.96/0.97 |
| Positive | 0.38/0.48 | 0.62/0.69 | 0.80/0.90 | 0.37/0.59 | 0.41/0.57 | 0.48/0.61 | |
Comparative analysis between the proposed hybrid GA-ML and existing work for COVID-19 detection based cough
| Work | Dataset | Classifier | Results |
|---|---|---|---|
| Imran et al. ( | ESC-50 | Multi-class classifier (CNN, SVM, and binary classifier) | Accuracy is 92.64 |
| Verde et al. ( | Dataset collected by Cambridge University | ResNet | AUC is 84.6 |
| Pahar et al. ( | Coswara | CNN, LSTM, ResNet50, and LSTM + SFS | Accuracy are 73.02, 73.78, 74.58, 92.91 |
| Grant et al. ( | Crowdsourced | Random forest + DNN | AUC of 79.38 for detecting COVID-19 via speech sound analysis, and 75.75 for detecting COVID-19 via breathing sound analysis |
| Proposed CR19 framework | Coswara | Hybrid GA-ML (GA-LR, GA-LDR,GA-KNN,GA-CART,GA-NB, GA-SVM) | Accuracy are 90.78, 92.90, 95.74, 87.94, 81.56, and 92.198 |