| Literature DB >> 35035108 |
Ivanoe De Falco1, Giuseppe De Pietro1, Giovanna Sannino1.
Abstract
In medical practice, all decisions, as for example the diagnosis based on the classification of images, must be made reliably and effectively. The possibility of having automatic tools helping doctors in performing these important decisions is highly welcome. Artificial Intelligence techniques, and in particular Deep Learning methods, have proven very effective on these tasks, with excellent performance in terms of classification accuracy. The problem with such methods is that they represent black boxes, so they do not provide users with an explanation of the reasons for their decisions. Confidence from medical experts in clinical decisions can increase if they receive from Artificial Intelligence tools interpretable output under the form of, e.g., explanations in natural language or visualized information. This way, the system outcome can be critically assessed by them, and they can evaluate the trustworthiness of the results. In this paper, we propose a new general-purpose method that relies on interpretability ideas. The approach is based on two successive steps, the former being a filtering scheme typically used in Content-Based Image Retrieval, whereas the latter is an evolutionary algorithm able to classify and, at the same time, automatically extract explicit knowledge under the form of a set of IF-THEN rules. This approach is tested on a set of chest X-ray images aiming at assessing the presence of COVID-19.Entities:
Keywords: Chest X-ray images; Classification; Covid-19 disease; Evolutionary algorithms; Interpretable machine learning
Year: 2022 PMID: 35035108 PMCID: PMC8741589 DOI: 10.1007/s00521-021-06806-w
Source DB: PubMed Journal: Neural Comput Appl ISSN: 0941-0643 Impact factor: 5.102
Fig. 1The method. Input: X-ray radiography (CXR) images. 1st step: the feature extraction through the use of a filter detailed in 3.1 - each image is processed in order to extract 64 features that will constitute an item of the data set. 2nd step: the data set creation - to each item of 64 features it is associated the class (1: Covid-19 or 2: Normal). 3rd step: the DEREx classifier, which also provides the explicit knowledge extraction under the form of IF-THEN rules. Output: the classification - each X-ray image is classified
The distribution of the considered images (items) over the two classes: Covid-19 and Normal
| #items | |
|---|---|
| Covid-19 | 3,616 |
| Normal | 10,192 |
| Total | 13,808 |
Fig. 2Examples of items from the two classes. Top panes: normal. Bottom panes: Covid-19
The order of the 29 attributes in the encoding
| 1 | Color correlogram: black bin |
| 2 to 8 | Color correlogram: gray bins |
| 9 | Color correlogram: white bin |
| 10–16 | Mean values of the seven texture moment attributes |
| 17–23 | Variation values of the seven texture moment attributes |
| 24 | First-order R color moment |
| 25 | First-order G color moment |
| 26 | First-order B color moment |
| 27 | Second-order R color moment |
| 28 | Second-order G color moment |
| 29 | Second-order B color moment |
The Parameter Setting for DEREx
| Pop_Size | 50 |
| Max_Gens | 500 |
| Cr_Ratio | 0.3 |
| Mut_F | 0.7 |
| DE_Algo | DE/rand-to-best/1/bin |
| N_Max_Rules | 2 |
| Rule_Thr | 0.0 |
| Lit_Thr | 0.95 |
| Class 1 | |
| Class 2 |
The distribution of the items over the training and testing sets. Each item is composed of 29 attributes and one class (Covid-19 or Normal)
| #Items | Percentage (%) | Covid-19 | Normal | |
|---|---|---|---|---|
| Training | 9665 | 70 | 2541 | 7124 |
| Testing | 4143 | 30 | 1075 | 3068 |
| Total | 13,808 | 100 | 3616 | 10,192 |
Confusion Matrices of the Best Rule Set. Covid-19 is class 1, instead Normal is class 2
| Real class | Train set | Test set | Whole data set | |||
|---|---|---|---|---|---|---|
| Predicted class | Predicted class | Predicted class | ||||
| Covid-19 | Normal | Covid-19 | Normal | Covid-19 | Normal | |
| Covid-19 | 1,574 | 967 | 670 | 405 | 2,244 | 1,372 |
| Normal | 955 | 6,169 | 396 | 2,672 | 1,351 | 8,841 |
| 80.11% | 0.486 | 80.67% | 0.496 | 80.28% | 0.489 | |
The results obtained by the algorithms
| Average | Best | Std. dev | ||||
|---|---|---|---|---|---|---|
| A | M | A | M | A | M | |
| BN | 77.51 | 78.61 | 0.627 | |||
| NB | 76.62 | 0.372 | 77.41 | 0.397 | 0.516 | |
| RBF | 0.410 | 0.442 | 0.016 | |||
| SVM | 78.55 | 0.361 | 79.39 | 0.391 | ||
| AB | 76.73 | 0.341 | 79.07 | 0.419 | 1.034 | 0.045 |
| OR | 76.31 | 0.317 | 77.76 | 0.354 | 0.935 | 0.023 |
| DEREx | 2.094 | 0.022 | ||||
For each column, the best value achieved is shown in bold, and the second best is reported in italic