| Literature DB >> 35399333 |
Mehrdad Rostami1, Mourad Oussalah1,2.
Abstract
Several Artificial Intelligence-based models have been developed for COVID-19 disease diagnosis. In spite of the promise of artificial intelligence, there are very few models which bridge the gap between traditional human-centered diagnosis and the potential future of machine-centered disease diagnosis. Under the concept of human-computer interaction design, this study proposes a new explainable artificial intelligence method that exploits graph analysis for feature visualization and optimization for the purpose of COVID-19 diagnosis from blood test samples. In this developed model, an explainable decision forest classifier is employed to COVID-19 classification based on routinely available patient blood test data. The approach enables the clinician to use the decision tree and feature visualization to guide the explainability and interpretability of the prediction model. By utilizing this novel feature selection phase, the proposed diagnosis model will not only improve diagnosis accuracy but decrease the execution time as well.Entities:
Keywords: COVID-19; Decision forest; Disease diagnosis; Explainable artificial intelligence; Feature selection; Human-computer interaction
Year: 2022 PMID: 35399333 PMCID: PMC8985417 DOI: 10.1016/j.imu.2022.100941
Source DB: PubMed Journal: Inform Med Unlocked ISSN: 2352-9148
Fig. 1Schematic diagram of artificial intelligence approaches for related COVID-19 outbreak tasks.
Outlining the reviewed machine learning-based models in COVID-19 pandemic related tasks.
| Paper | Technique | Task | Data type | Accuracy | Explainability |
|---|---|---|---|---|---|
| Mahdy et al. [ | SVM | Covid-19 lung image classification | X-ray image | High | Low |
| Yu et al. [ | Decision Tree | Severity detection of COVID-19 paediatric cases | Chest radiography and CT images | Medium | High |
| Too and Mirjalili [ | KNN | Prediction of the death and recovery conditions | The patients' information (Gender, Age, Country, etc.) and their symptoms | Medium | Medium |
| Song et al. [ | Time-dependent model parameters. | forecasting the dynamic spread of COVID-19 | Daily reported cases in China and the United States | High | Low |
| Kumar and Kumar [ | Fuzzy clustering and time series model | Prediction of COVID-19 infected cases and deaths | Daily reported cases in India | Medium | Low |
| Cobre et al. [ | KNN, Neural Networks, Partial Least Squares Discriminant Analysis, etc. | Diagnosis and prediction of COVID-19 severity | Biochemical, hematological, and urinary biomarkers | Medium | Low |
| Arvind et al. [ | Sliding-window approach | Prediction of intubation among hospitalized patients | laboratory and vitals data COVID-19+ patients | Medium | Low |
| Pahar et al. [ | Residual neural networks | Classification of COVID-19 cough | Coughing sounds recorded during or after the acute phase of COVID-19 | Medium | Low |
| Ebinger et al. [ | Logistic regression, SVM, KNN, etc. | Prediction of duration of hospitalization in COVID-19 patients | Electronic health record data from COVID-19 patients | Medium | Low |
| Zhang et al. [ | Least absolute shrinkage and selection operator regression and least absolute shrinkage and selection operator neural network models. | Identification and validation of prognostic factors in COVID-19 patients | Demographic data including, clinical data including and outcome (28-day mortality) | Medium | Low |
| Gulati et al. [ | Linear SVC, Perceptron, Passive Aggressive, Logistic Regression, etc. | Sentiment classification of discussion related to COVID-19 pandemic | Tweets related to COVID-19 pandemic | Medium | Low |
| Singh et al. [ | Ensemble Support Vector Machine | COVID-19 detection | Lung tomography scan data | High | Low |
| Wu et al. [ | Joint Classification and Segmentation | COVID-19 diagnosis | Chest CT images | Medium | Medium |
| Yang et al. [ | Decision Tree | Death outcome prediction | Medical records (demographics, clinical characteristics, and laboratory test results) | Medium | High |
| Lella and Pja [ | Deep Convolutional Neural Network | Diagnosis of COVID-19 disease | Human respiratory sounds such as voice, dry cough, and breath, | High | Low |
| Qayyum et al. [ | Depth-wise deep learning | Detection and diagnosis of COVID-19 infection | Lungs X-rays images | High | Low |
| Roy et al. [ | Spatial Transformer Networks-based Deep learning | Classification and Localization of COVID-19 Markers | Lung ultrasonography (LUS) images. | High | Low |
| Shamsi et al. [ | Deep transfer learning | Diagnosis of COVID-19 | Chest X-ray and CT images | High | Low |
| Islam et al. [ | Deep Convolutional Neural Network and LSTM | Detection of COVID-19 | X-ray images | High | Low |
| Hall et al. [ | Deep Convolutional Neural Network | Detection of COVID-19 | Chest x-rays | High | Low |
| Ahmadian et al. [ | Deep Neuroevolution | Diagnosis of COVID-19 | Chest x-rays | High | Low |
Fig. 2Explainable artificial intelligence approach for COVID-19 diagnosis.
Fig. 3Flowchart of the proposed model.
Nomenclature and parameters of the developed COVID-19 diagnosis model.
| Symbol | Description |
|---|---|
| Feature node | |
| Feature vector | |
| Feature graph | |
| Link between original features | |
| Number of initial features | |
| Similarity between features | |
| Feature Vector | |
| Average of feature vector | |
| Set of dataset samples | |
| Average of all the calculated similarities | |
| Variance of all the calculated similarities | |
| Importance Value of feature | |
| Fisher Score of features | |
| Node Centrality of feature | |
| Set of all classes in a dataset (i.e. Positive and Negative) | |
| Number of patterns on the class | |
| Average of feature vector | |
| Variance of feature vector | |
| Laplacian Energy | |
| Average value of all the samples related to the feature | |
| Set of all classes in a dataset (i.e. Positive and Negative) | |
| Number of samples on the class | |
| Variance and average of feature | |
| Average of feature | |
| Decision tree | |
| Set of rules in Decision tree | |
| Number of trees in decision forest |
Fig. 4Flow pseudo-code of the developed model.
Average performance, standard deviation (shown in parenthesis) and p-value of different predications model based on 10-fold validation in 30 independent runs.
| Method | Accuracy | F1-Score | Sensitivity | Specificity | AUROC |
|---|---|---|---|---|---|
| XGBoost | 87.71 (1.34) | 71.45 (1.42) | 67.52 (1.38) | 90.82 (1.26) | 89.36 (1.24) |
| SVM | 84.79 (1.31) | 72.48(1.71) | 67.01 (1.36) | 88.96 (0.67) | 87.69 (1.31) |
| MLP | 85.25 (1.29) | 69.92 (1.37) | 62.21 (1.03) | 88.17 (1.43) | 88.38 (1.36) |
| XDT | 88.41 (0.59) | 76.17 (0.67) | 67.21 (0.82) | 91.02 (0.69) | 90.62 (1.03) |
| P-value | 0.0034218 | 0.0037548 | 0.003295 | 0.004606 | 0.004438 |
Fig. 5Boxplot of 10-fold validation in 30 independent runs.
Number of times the best results are achieved by different prediction models in 30 independent runs.
| Method | Accuracy | F1-Score | Sensitivity | Specificity | AUROC |
|---|---|---|---|---|---|
| XGBoost | 2 | 1 | 1 | 2 | 2 |
| SVM | 0 | 1 | 0 | 0 | 1 |
| MLP | 1 | 0 | 1 | 1 | 0 |
| XDT | 2 | 1 | 1 | 1 | 2 |
Normalized confusion matrices for different COVID-19 diagnosis model.
| Method | True-Negative | True-Positive | False-Negative | False-Positive |
|---|---|---|---|---|
| XGBoost | 92.18 | 66.87 | 8.71 | 34.71 |
| SVM | 89.79 | 67.36 | 11.27 | 37.21 |
| MLP | 90.71 | 65.57 | 10.01 | 38.37 |
| XDT | 91.38 | 69.81 | 9.75 | 32.56 |
| 92.99 | 71.98 | 8.12 | 30.72 |
Fig. 6The ROC Curve for the developed model.
The selected features sorted based on their importance.
| Number | Feature |
|---|---|
| 1 | PLT |
| 2 | EOS |
| 3 | MPV |
| 4 | CRP |
| 5 | AST |
| 6 | CREAT |
| 7 | WBC |
| 8 | MONO |
| 9 | LYM |
| 10 | RBC |
| 11 | NEU |
| 12 | NA |
| 13 | ALT |
| 14 | HCT |
| 15 | HGB |
| 16 | RWD |
| 17 | UREA |
| 18 | K+ |
| 19 | MCV |
| 20 | MCH |
| 21 | MCHC |
Fig. 7Part of explainable tree for COVID-19 diagnosis.
Fig. 8Average classification accuracy of different feature selection methods on various classifiers.
Fig. 9Average execution time (in ms) of different feature selection approaches over 30 independent runs.
Characteristics of comparative transforming techniques.
| Paper | Year | Technique |
|---|---|---|
| Rubén et al. [ | 2020 | Counterfactual Sets |
| Sagi et al. [ | 2020 | Rule Conjunctions |
| Sagi et al. [ | 2021 | Filtering of Conjunction Sets |
| Neto et al. [ | 2021 | Explainable Matrix–Visualization |
Average performance and standard deviation (shown in parenthesis) of transforming techniques.
| Method | Accuracy | F1-Score | Sensitivity | Specificity |
|---|---|---|---|---|
| Counterfactual Sets | 87.81 (1.24) | 72.32 (2.31) | 68.51 (2.41) | 91.86 (1.93) |
| Rule Conjunctions | 84.79 (2.31) | 75.31(1.72) | 69.51 (3.31) | 88.82 (3.12) |
| Filtering of Conjunction Sets | 85.12 (3.27) | 69.17 (3.12) | 51.15 (1.28) | 89.13 (2.84) |
| Matrix–Visualization | 88.19 (2.51) | 76.13 (2.13) | 68.21 (2.81) | 91.54 (1.71) |
| p-value | 0.0034784 | 0.0037691 | 0.0039877 | 0.004897 |
Fig. 10Average performance (in %) over 30 independent runs, with different values for Accuracy, F1-Score, Sensitivity and Specificity measures.
Fig. 11Average performance (in %) over 30 independent runs, with different values for Accuracy, F1-Score, Sensitivity and Specificity measures.
Average ranks of the different COVID-19 prediction models on different measures.
| Measure | Compared COVID-19 diagnosis models | ||||
|---|---|---|---|---|---|
| XGBoost | SVM | MLP | XDT | Proposed Model | |
| Accuracy | 4.79 | 4.06 | 2.68 | 2.10 | 1.34 |
| F1-Score | 4.79 | 3.93 | 2.79 | 2.17 | 1.31 |
| Sensitivity | 4.68 | 3.89 | 2.82 | 2.31 | 1.27 |
| Specificity | 4.62 | 4.06 | 2.72 | 2.20 | 1.37 |
| AUROC | 4.58 | 4.03 | 2.79 | 2.27 | 1.32 |
The results of the Friedman statistics test.
| Measure | |||||
|---|---|---|---|---|---|
| Accuracy | F1-Score | Sensitivity | Specificity | AUROC | |
| Chi-Square | 10.3858 | 13.9218 | 10.2319 | 15.0436 | 15.0321 |
| df | 4 | 4 | 4 | 4 | 4 |
| Asymp.Sig (p-value) | 0.0034218 | 0.0037548 | 0.003295 | 0.004606 | 0.004438 |