| Literature DB >> 34025211 |
Alexandra A de Souza1, Danilo Candido de Almeida2, Thiago S Barcelos1, Rodrigo Campos Bortoletto1, Roberto Munoz3, Helio Waldman4, Miguel Angelo Goes2, Leandro A Silva5.
Abstract
The pandemic caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) which is related to new coronavirus disease (COVID-19) has mobilized several scientifics to explore clinical data using soft-computing approaches. In the context of machine learning, previous studies have explored supervised algorithms to predict and support diagnosis based on several clinical parameters from patients diagnosed with and without COVID-19. However, in most of them the decision is based on a "black-box" method, making it impossible to discover the variable relevance in decision making. Hence, in this study, we introduce a non-supervised clustering analysis with neural network self-organizing maps (SOM) as a strategy of decision-making. We propose to identify potential variables in routine blood tests that can support clinician decision-making during COVID-19 diagnosis at hospital admission, facilitating rapid medical intervention. Based on SOM features (visual relationships between clusters and identification of patterns and behaviors), and using linear discriminant analysis , it was possible to detect a group of units of the map with a discrimination power around 83% to SARS-CoV-2-positive patients. In addition, we identified some variables in admission blood tests (Leukocytes, Basophils, Eosinophils, and Red cell Distribution Width) that, in combination had strong influence in the clustering performance, which could assist a possible clinical decision. Thus, although with limitations, we believe that SOM can be used as a soft-computing approach to support clinician decision-making in the context of COVID-19.Entities:
Keywords: Covid-19 diagnostic; SARS-CoV-2; Self-organizing maps
Year: 2021 PMID: 34025211 PMCID: PMC8127503 DOI: 10.1007/s00500-021-05810-5
Source DB: PubMed Journal: Soft comput ISSN: 1432-7643 Impact factor: 3.732
Fig. 1Flowchart plot of process view. Adaptation of the process view structure proposed by De Souza et al. (2019). The Kohonen objects have a set of information resulting from application SOM algorithm such as: dataset, grid of the map and your units, intra-distance of each unit, among others that will be detailed in subsection 3.3
SOM algorithm input and output
|
| |
|---|---|
|
| |
Adaptation of the pseudocode shown by da Silva et al. (2017)
Fig. 2Weight of each attribute in the composition of each unit of the map. Overview map of training stage in SOM process different colors represents distinct variables in the dataset
SOM function parametrization
| Parameter | Configuration |
|---|---|
| Data | 599 samples |
| topo | Hexagonal |
| toroidal | False |
| rlen | 14.000 |
| alpha | For 14 variables: 0.05 with linear reduction up to 0.01 |
| For 4 variables: 0.08 with linear reduction up to 0.03 | |
| dist.fcts |
Fig. 3Establishment of Information Needs. It can be observed the general feature of “Vector Visualization color Map” with 14 variables (a) and the Maps comparing between SARS-CoV-2 Negatives (b) and Positives (c) patients with clustering distinction
Fig. 4Scatter plot distribution of variables in the blood test applied in SOM analysis: Leukocytes, Basophils, Eosinophils e Red blood Cell Distribution Width (RDW) presented different behavior in positive and negative SARS-CoV-2 groups
Fig. 5Establishment of Information Needs II. It can be observed the general feature of “Vector Visualization color Map” with 4 variables a pre-selected in training I and the Maps comparing between SARS-CoV-2 Negatives b and Positives c patients with evident segregation
Fig. 6Color heatmaps distribution. It is observed at different colors in the heatmaps the distribution of the attributes (that represented the 4 variables in the blood test) in SOM training II
Fig. 7Topological ordinal distribution of units of map. Each units of the map were separately identified and it is observed the distribution of SARS-CoV-2 negative and positive patients at each unit in the map
Positive x negative tests on units of map
| Unit map | Negative tests | Positive tests | Positive tests (%) |
|---|---|---|---|
| 5 | 33 | 6 | 15 |
| 12 | 35 | 5 | 12.5 |
| 15 | 17 | 24 | 60 |
| 20 | 19 | 5 | 20 |
| 25 | 236 | 37 | 13 |
| Others | 178 | 4 | 1.5 |
| Total | 518 | 81 | 13.5 |
Confusion matrix LDA units of map 15, 20, 25
| Units of map | ||||||||
|---|---|---|---|---|---|---|---|---|
| 15, 20, 25 | 15 | 20 | 25 | |||||
| Negative | Positive | Negative | Positive | Negative | Positive | Negative | Positive | |
| Negative | 250 | 22 | 10 | 7 | 16 | 3 | 167 | 69 |
| Positive | 38 | 28 | 11 | 13 | 2 | 3 | 8 | 29 |