| Literature DB >> 36205834 |
Adel Thaljaoui1, Salim El Khediri2,3, Emna Benmohamed4,5, Abdulatif Alabdulatif6, Abdullah Alourani1.
Abstract
The coronavirus infection continues to spread rapidly worldwide, having a devastating impact on the health of the global population. To fight against COVID-19, we propose a novel autonomous decision-making process which combines two modules in order to support the decision-maker: (1) Bayesian Networks method-based data-analysis module, which is used to specify the severity of coronavirus symptoms and classify cases as mild, moderate, and severe, and (2) autonomous decision-making module-based association rules mining method. This method allows the autonomous generation of the adequate decision based on the FP-growth algorithm and the distance between objects. To build the Bayesian Network model, we propose a novel data-based method that enables to effectively learn the network's structure, namely, MIGT-SL algorithm. The experimentations are performed over pre-processed discrete dataset. The proposed algorithm allows to correctly generate 74%, 87.5%, and 100% of the original structure of ALARM, ASIA, and CANCER networks. The proposed Bayesian model performs well in terms of accuracy with 96.15% and 94.77%, respectively, for binary and multi-class classification. The developed decision-making model is evaluated according to its utility in solving the decisional problem, and its accuracy of proposing the adequate decision is about 97.80%.Entities:
Keywords: Autonomous decision-making; Bayesian networks; Bayesian network’s structure learning based on data approach; COVID-19; Variable approach
Year: 2022 PMID: 36205834 PMCID: PMC9540074 DOI: 10.1007/s11517-022-02677-y
Source DB: PubMed Journal: Med Biol Eng Comput ISSN: 0140-0118 Impact factor: 3.079
Fig. 1Flowchart of the description of the autonomous orientation method
Fig. 2Autonomous Decision-Making (ADM) process
Fig. 3Schematic representation of the MIGT-SL algorithm
Fig. 4An example of MIGT-SL algorithm application on Asia network: a the Asia original network, b dependencies between nodes, c edges orientation
Sample view of the COVID-19 dataset
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | … | |
|---|---|---|---|---|---|---|---|---|
| Fever | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Tiredness | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Dry-cough | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Difficulty breathing | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Sore throat | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
None Symptoms | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Pains | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Nasal congestion | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Runny nose | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Diarrhea | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Age | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Gender | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Severity | 1 | 1 | 1 | 2 | 2 | 2 | 3 |
Discretized variables
| Variable | Corresponding columns | Discrete values |
|---|---|---|
| Age | Age_0–9, age_10–19, age_20–24, age_25–59, age_60 + | 1: [0,9]; 2: [ |
| Fever | Fever | 0: No fever 1: Fever |
| Dry cough | Dry cough | 0: No dry-cough 1: Dry-cough |
| Difficulty in breathing | Difficulty in breathing | 0: No breath problem 1: difficulty in breathing |
| Tiredness | Tiredness | 0: No tiredness 1: Tiredness |
| Sore throat | Sore throat | 0: No sore throat 1: Sore throat |
| None symptoms | None symptoms | 0: With symptoms 1: None symptoms |
| Pains | Pains | 0: No pains 1: Pains |
| Nasal congestion | Nasal congestion | 0: No nasal congestion 1: Nasal congestion |
| Runny nose | Runny nose | 0: No runny nose 1: Runny nose |
| Diarrhea | Diarrhea | 0: No diarrhea 1: Diarrhea |
| Gender | Gender_female, gender_male, gender_transgender | 1: Male 2: Female 3: Transgender |
| Severity | Severity_moderate, severity_none, severity_severe | 1: Mild 2: moderate 3: severe |
Fig. 5Instances of rules
Fig. 6Autonomous Decision-Making (ADM) process: first module
Fig. 7Probability distributions within severity class 3 (severe)
Fig. 8Autonomous decision-making (ADM) process: second module
Frequent factors with its occurrence
| Variable | Occurrence |
|---|---|
| Severity class | 100% |
| Age | 98.75% |
| Difficulty in breathing | 92 0.82% |
| Disease class, age | 87.50% |
| Age, non-symptom | 70.18% |
| Age, severity class, sore throat | 63.32% |
| Pains, tiredness, fever | 60.34% |
| Fever, severity class | 59.66% |
Fig. 9Decision selection procedure
Experimental results of MIGT-SL algorithm for ASIA, ALARM, and CANCER networks
| 500 | 1000 | 2000 | 3000 | 5000 | 10,000 | ||
|---|---|---|---|---|---|---|---|
| ASIA | CE | 7 | 7 | 7 | 7 | 7 | 7 |
| DE | 0 | 0 | 0 | 0 | 0 | 0 | |
| RE | 1 | 1 | 1 | 1 | 1 | 1 | |
| AE | 0 | 0 | 0 | 0 | 1 | 1 | |
| SD | 1 | 1 | 1 | 1 | 2 | 2 | |
| Orig BIC score | − 1216.11 | − 2357.42 | − 4594.40 | − 6840.03 | − 11,409.44 | − 22,406.41 | |
| BIC score | − 1216.11 | − 2357.42 | − 4594.40 | − 6840.03 | − 11,409.44 | − 22,406.41 | |
| ALARM | CE | 34 | 34 | 34 | 34 | 33 | 33 |
| DE | 11 | 11 | 11 | 11 | 12 | 12 | |
| RE | 1 | 1 | 1 | 1 | 1 | 1 | |
| AE | 4 | 4 | 4 | 4 | 4 | 4 | |
| SD | 16 | 16 | 16 | 16 | 17 | 17 | |
| Orig BIC score | − 6357.61 | − 11,116.05 | − 20,633.39 | − 29,889.48 | − 48,593.10 | − 95,290.65 | |
| BIC score | − 6823.90 | − 12,429.44 | − 23,520.28 | − 34,651.23 | − 56,987.10 | − 112,504.4 | |
| CANCER | CE | 4 | 4 | 4 | 4 | 4 | |
| DE | 0 | 0 | 0 | 0 | 0 | ||
| RE | 0 | 0 | 0 | 0 | 0 | ||
| AE | 0 | 0 | 0 | 0 | 0 | ||
| SD | 0 | 0 | 0 | 0 | 0 | ||
| Orig BIC score | − 2115.88 | − 4268.59 | − 6361.50 | − 10,592.53 | − 21,222.33 | ||
| BIC score | − 2115.88 | − 4268.59 | − 6361.50 | − 10,592.53 | − 21,222.33 |
Structures comparisons among five algorithms on ASIA network
| MIGT-SL algorithm | Wand and Liu algorithm | Ko and Kim method | Tabar et al. method | Ai method | ||
|---|---|---|---|---|---|---|
| 1000 | CE | 5 | 4 | 4 | ||
| DE | 1 | 0 | 0 | 1 | ||
| RE | 0 | 3 | 4 | 2 | ||
| AE | 0 | 1 | 0 | 3 | ||
| SD | 1 | 4 | 4 | 6 | ||
| 2000 | CE | - | 5 | 5 | 4 | |
| DE | - | 0 | 0 | 1 | ||
| RE | - | 3 | 3 | 2 | ||
| AE | - | 1 | 0 | 3 | ||
| SD | - | 4 | 3 | 6 | ||
| 3000 | CE | - | 5 | 5 | 4 | |
| DE | 1 | 0 | 0 | 1 | ||
| RE | 0 | 3 | 3 | 2 | ||
| AE | 0 | 1 | 0 | 3 | ||
| SD | 1 | 4 | 3 | 6 | ||
| 5000 | CE | 5 | 5 | 4 | ||
| DE | 0 | 0 | 1 | |||
| RE | 3 | 3 | 1 | |||
| AE | 1 | 1 | 3 | |||
| SD | 4 | 4 | 6 | |||
| 10,000 | CE | 5 | 6 | 5 | ||
| DE | 0 | 0 | 1 | |||
| RE | 3 | 3 | 1 | |||
| AE | 1 | 1 | 3 | |||
| SD | 4 | 4 | 5 |
Structures comparisons among five algorithms on ALARM network
| MIGT-SL algorithm | Wang and Liu algorithm | Ko and Kim method | Tabar et al. method | Ai method | ||
|---|---|---|---|---|---|---|
| 1000 | CE | - | 38 | 38 | 23 | |
| DE | 11 | 2 | 4 | 2 | 3 | |
| RE | 1 | 4 | 8 | 28 | ||
| AE | 2 | 9 | 4 | 34 | ||
| SD | 16 | 6 | 17 | 14 | 59 | |
| 2000 | CE | - | 39 | 39 | 23 | |
| DE | 11 | 2 | 2 | 1 | 3 | |
| RE | 0 | 4 | 8 | 21 | ||
| AE | 2 | 9 | 4 | 34 | ||
| SD | 16 | 4 | 15 | 13 | 55 | |
| 3000 | ||||||
| CE | 34 | - | - | - | - | |
| DE | 11 | 1 | - | - | - | |
| RE | 1 | 0 | - | - | - | |
AE SD | 4 16 | 1 3 | - - | - - | - - | |
| 5000 | CE | 33 | - | 40 | 41 | 24 |
| DE | 12 | 1 | 2 | 1 | 2 | |
| RE | 0 | 4 | 8 | 21 | ||
| AE | 1 | 13 | 7 | 32 | ||
| SD | 17 | 2 | 19 | 16 | 55 | |
| 10,000 | CE | 33 | - | 40 | 41 | 24 |
| DE | 12 | - | 2 | 1 | 2 | |
| RE | - | 4 | 8 | 20 | ||
| AE | - | 15 | 7 | 30 | ||
| SD | - | 21 | 16 | 52 |
Comparison of the BIC scores difference
| HC algorithm | K2 algorithm | HCbo + C algorithm | Improved K2 algorithm | MIGT-SL algorithm | ||
|---|---|---|---|---|---|---|
| ASIA | 500 | 4 | 2.07 | 0 | - | |
| 2000 | 22 | 3.6 | 1 | - | ||
| 5000 | 64 | 4.04 | 7 | - | ||
| 10,000 | 42 | 4.1 | 0 | 204.63 | ||
| ALARM | ||||||
| 500 | 500 | 224.07 | 50 | - | 466.29 | |
| 2000 | 1050 | 221.88 | 115 | - | 2886.89 | |
| 5000 | 3680 | 1372.79 | 182 | - | 8394 | |
| 10,000 | 3020 | 3284.51 | 123 | 648.03 | 17,213.75 | |
| CANCER | ||||||
| 500 | 7 | 0 | 7 | - | ||
| 2000 | 7 | 11.48 | 6 | - | ||
| 5000 | 14 | 19.97 | 3 | - | ||
| 10,000 | 11 | 44.53 | 5 | 0 |
Fig. 10Multiclass ROC curve
Fig. 11BN model evaluation
Confusion matrix
| Actual decision | ||||
|---|---|---|---|---|
| Self-isolation | Hospitalization | Hospitalization and treatments | ||
| Assigned decision | Self-isolation | 94 | 7 | 5 |
| Hospitalization | 4 | 125 | 225 | |
| Hospitalization and treatments | 6 | 371 | 163 | |
Comparison with seven similar works
| Datasets | Model | Classification | Accuracy | |
|---|---|---|---|---|
| Yao et al. [ | 137 confirmed COVID-19 cases collected from the Tongji Hospital | Logistic regression classifier (LR) | Binary classification: severe, mild/moderate | 0.73 |
| Random forest classifier (RF) | Binary classification: severe, mild/moderate | 0.791 | ||
| K nearest neighbor classifier (KNN) | Binary classification: severe, mild/moderate | 0.78 | ||
| The boosting-based classifier (AdaBoost) | Binary classification: severe, mild/moderate | 0.713 | ||
| Support Vector Machine classifier (SVM) | Binary classification: severe, mild/moderate | 0.7926 | ||
| SVM-RBF classifier | Binary classification: severe, mild/moderate | 0.9091 | ||
| SVM (using LibSVM) | [ | SVM | Multi-class classification: mild, moderate, severe | 0.9325 |
| Binary classification: severe, mild/moderate | 0.9546 | |||
| Our proposal | [ | Discrete BN model | Multi-class classification: mild, moderate, severe | |
| Binary classification: severe, mild/moderate |