| Literature DB >> 35003242 |
Khalid Twarish Alhamazani1, Jalawi Alshudukhi1, Saud Aljaloud1, Solomon Abebaw2.
Abstract
Chronic kidney disease (CKD) is a global health issue with a high rate of morbidity and mortality and a high rate of disease progression. Because there are no visible symptoms in the early stages of CKD, patients frequently go unnoticed. The early detection of CKD allows patients to receive timely treatment, slowing the disease's progression. Due to its rapid recognition performance and accuracy, machine learning models can effectively assist physicians in achieving this goal. We propose a machine learning methodology for the CKD diagnosis in this paper. This information was completely anonymized. As a reference, the CRISP-DM® model (Cross industry standard process for data mining) was used. The data were processed in its entirety in the cloud on the Azure platform, where the sample data was unbalanced. Then the processes for exploration and analysis were carried out. According to what we have learned, the data were balanced using the SMOTE technique. Four matching algorithms were used after the data balancing was completed successfully. Artificial intelligence (AI) (logistic regression, decision forest, neural network, and jungle of decisions). The decision forest outperformed the other machine learning models with a score of 92%, indicating that the approach used in this study provides a good baseline for solutions in the production.Entities:
Mesh:
Year: 2021 PMID: 35003242 PMCID: PMC8739929 DOI: 10.1155/2021/3941978
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Business understanding tasks.
Figure 2Study tasks and understanding of the data.
Statistics of the variables.
| . | Average | Std.dev | Minimum | 25% | 50% |
|---|---|---|---|---|---|
| Municipality | 30328.27 | 28219.83 | 5001.00 | 8001.00 | 13001.00 |
| Age | 73.51 | 12.01 | 1.00 | 67.00 | 75.00 |
| HTA | 1.00 | 0.00 | 1.00 | 1.00 | 1.00 |
| Albunia | 2.50 | 1.50 | 0.00 | 1.00 | 3.00 |
| Sugar | 2.00 | 1.22 | 0.00 | 1.00 | 2.00 |
| UREA_SANGRE | 143.39 | 86.16 | 1.00 | 70.00 | 139.00 |
| Creatinine | 5.78 | 4.89 | 0.10 | 1.70 | 3.80 |
| Sodium | 134.95 | 20.23 | 100.00 | 117.00 | 135.00 |
| Potassium | 4.75 | 1.30 | 2.50 | 3.60 | 4.80 |
| Hemoglobin | 14.40 | 1.31 | 12.10 | 13.40 | 14.30 |
| DM | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Weight | 77.75 | 10.00 | 60.00 | 70.50 | 77.20 |
| Stadium | 3.14 | 0.44 | 3.00 | 3.00 | 3.00 |
Stadium classification.
| Stadium | Description |
|---|---|
| 1 | Kidney damage with normal or high filtration |
| 2 | Kidney damage with mild decrease in kidney function |
| 3 | Kidney damage with a moderate decrease in kidney function |
| 4 | Kidney damage with a severe decrease in kidney function |
| 5 | Renal failure |
Figure 3Image data preparation experiment.
Figure 4Modeling.
Figure 5Image data division experiment.
Figure 6Image model training experiment.
Figure 7Evaluation (obtaining results).
Performance comparison between models.
| Multiclass red neuronal | Decision forest multiclass | Multiclass logistic regression | Multiclass jungle of decisions | |
|---|---|---|---|---|
| General accuracy | 0.784 | 0.831 | 0.773 | 0.835 |
| Medium accuracy | 0.892 | 0.887 | 0.886 | 0.890 |
| Micro-precision averaged | 0.784 | 0.831 | 0.773 | 0.835 |
| Macro-accuracy averaged | 0.446 | 0.691 | 0.374 | 0.817 |
| Micro-sensitivity averaged | 0.784 | 0.831 | 0.773 | 0.835 |
| Macro-sensitivity averaged | NA | 0.617 | NA | 0.537 |
| Artificial neural networks matrix | ||||
| Current class | 3 | 4 | 5 | |
| 3 | 0.971 | 0 | 0.029 | |
| 4 | 0.972 | 0.001 | 0.028 | |
| 5 | 0.829 | 0 | 0.17 | |
| Decision forest matrix | ||||
| Current class | 3 | 4 | 5 | |
| 3 | 0.939 | 0.033 | 0.027 | |
| 4 | 0.494 | 0.417 | 0.089 | |
| 5 | 0.406 | 0.1 | 0.494 | |
| Logistic regression matrix | ||||
| Current class | 3 | 4 | 5 | |
| 3 | 0.94 | 0.014 | 0.046 | |
| 4 | 0.799 | 0.121 | 0.08 | |
| 5 | 0.595 | 0.035 | 0.37 | |
| Jungle matrix of decisions | ||||
| Current class | 3 | 4 | 5 | |
| 3 | 0.987 | 0 | 0.013 | |
| 4 | 0.778 | 0.153 | 0.069 | |
| 5 | 0.507 | 0.023 | 0.471 | |
Final accuracy and completeness values.
| Accuracy | Completeness | |
|---|---|---|
| Decision forests | 0.922 | 0.921 |
| Neural networks | 0.806 | 0.800 |
| Logistic regression | 0.689 | 0.689 |
| Decision jungle | 0.754 | 0.750 |