| Literature DB >> 35054287 |
Vijendra Singh1, Vijayan K Asari2, Rajkumar Rajasekaran3.
Abstract
Diabetes and high blood pressure are the primary causes of Chronic Kidney Disease (CKD). Glomerular Filtration Rate (GFR) and kidney damage markers are used by researchers around the world to identify CKD as a condition that leads to reduced renal function over time. A person with CKD has a higher chance of dying young. Doctors face a difficult task in diagnosing the different diseases linked to CKD at an early stage in order to prevent the disease. This research presents a novel deep learning model for the early detection and prediction of CKD. This research objectives to create a deep neural network and compare its performance to that of other contemporary machine learning techniques. In tests, the average of the associated features was used to replace all missing values in the database. After that, the neural network's optimum parameters were fixed by establishing the parameters and running multiple trials. The foremost important features were selected by Recursive Feature Elimination (RFE). Hemoglobin, Specific Gravity, Serum Creatinine, Red Blood Cell Count, Albumin, Packed Cell Volume, and Hypertension were found as key features in the RFE. Selected features were passed to machine learning models for classification purposes. The proposed Deep neural model outperformed the other four classifiers (Support Vector Machine (SVM), K-Nearest Neighbor (KNN), Logistic regression, Random Forest, and Naive Bayes classifier) by achieving 100% accuracy. The proposed approach could be a useful tool for nephrologists in detecting CKD.Entities:
Keywords: chronic kidney disease; feature selection; machine learning; recursive feature elimination; support vector machine
Year: 2022 PMID: 35054287 PMCID: PMC8774382 DOI: 10.3390/diagnostics12010116
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Comparative Accuracy analysis of the diagnosis of Chronic Diseases from literature.
| Chronic Diseases Diagnosis | Model | Accuracy Achieved (%) | Reference |
|---|---|---|---|
| Kidney Renal Failure | Artificial Neural Networks | 91.9%–94.2% | [ |
| Diabetic Kidney Disease | Convolutional Model | 71% | [ |
| Chronic Kidney Disease | Neural Network Classifier | 95% | [ |
| Breast Cancer | SBSP + NN | 98.57% | [ |
| Hepatitis Disease | FDA and SVM | 96.77% | [ |
| Breast Cancer | SVM + ReliefF | 92.3% | [ |
| Chronic Kidney Disease | 99% | [ | |
| Chronic Renal Failure | Fisher Discriminatory Analysis and SVM | 96.7% | [ |
| Chronic Renal Failure | 94% | [ | |
| Chronic Kidney Disease | SVM, | 99.7% | [ |
| Chronic Kidney Disease | Logistic Regression, Decision Tree, Naïve Bayes, and Random Forests | 93% | [ |
| Chronic Kidney Disease | 97.8% | [ | |
| Chronic Kidney Disease | SVM, | 99.1% | [ |
| Chronic Kidney Disease | Convolutional Neural Networks | 95.7% | [ |
| Chronic Kidney Disease | SVM, Random Forest, and Gradient Boosting | 99% | [ |
| Chronic Kidney Disease | Logistic regression, | 99.7% | [ |
| Chronic Kidney Disease | XGBoost | 95.8% | [ |
| Chronic Kidney Disease | SVM | 98.5% | [ |
| Chronic Kidney Disease | Softmax Regression | 98% | [ |
Characteristics of the UCI CKD data.
| Features | Specification | Value |
|---|---|---|
| AGE | AGE (IN YEARS) | 0–90 |
| AL | ALBUMIN | 0–5 |
| ANE | ANAEMIA | NO, YES |
| APPET | APPETITE | POOR, GOOD |
| BA | BACTERIA | PRESENT, NOTPRESENT |
| BGR | BLOOD GLUCOSE RANDOM | 0–490 |
| BP | BLOOD PRESSURE | 0–180 |
| BU | BLOOD UREA | 0–391 |
| CAD | CORONARY ARTERY DISEASE | NO, YES |
| CLASS | CLASS | NOTCKD, CKD |
| DM | DIABETES MELLITUS | NO, YES |
| HEMO | HAEMOGLOBIN | 0–17.8 |
| HTN | HYPERTENSION | NO, YES |
| PC | PUS CELL | NORMAL, ABNORMAL |
| PCC | PUS CELL CLUMPS | PRESENT, NOTPRESENT |
| PCV | PACKED CELL VOLUME | 0–54 |
| PE | PEDAL EDEMA | NO, YES |
| POT | POTASSIUM | 0–47 |
| RBC | RED BLOOD CELLS | NORMAL, ABNORMAL |
| RC | RED BLOOD CELL COUNT | 0–8 |
| SC | SERUM CREATININE | 0–76 |
| SG | SPECIFIC GRAVITY | 0–1.025 |
| SOD | SODIUM | 0–163 |
| SU | SUGAR | 0–5 |
| WC | WHITE BLOOD CELL COUNT | 0–26,400 |
Figure 1Support Vector Machine.
Figure 2K-Nearest Neighbor.
Figure 3Decision trees.
Figure 4Random Forest.
Figure 5A framework of the proposed model.
Figure 6Pseudo-Code of the proposed model.
Figure 7Layers architecture of the proposed model deep neural network.
Experimental setup details.
| Resource | Specification |
|---|---|
| Processor | Intel Core i5 Gen7 |
| Random access memory | 16 GB |
| Graphics processing unit | 4 GB |
| Language | Python |
Hyper-parameter settings.
| Hyper-Parameter | Setting |
|---|---|
| Epochs | 850 |
| Batch size | 15 |
| Dropout rate | 0.5 to 0.1 |
| Activation Function | relu |
| Activation output layer | sigmoid |
| Optimizer | Adam |
| Loss | binary_crossentropy |
Figure 8Confusion matrices of the Proposed model.
Comparative analysis of the proposed model with existing classification techniques on CKD data set.
| Method | Accuracy | Recall | Precision | F-Measure |
|---|---|---|---|---|
| Logistic Regression | 0.99 | 1.0 | 0.98 | 0.99 |
| 0.92 | 0.88 | 0.98 | 0.92 | |
| Naïve bayes | 0.95 | 0.92 | 1.00 | 0.95 |
| Support Vector Machines | 0.92 | 0.87 | 0.96 | 0.92 |
| Decision Tree | 0.97 | 0.95 | 1.00 | 0.97 |
| Proposed Model | 1.00 | 1.00 | 1.00 | 1.00 |
Comparative analysis of the proposed model with existing models from the literature on the UCI data set.
| Authors | Model | Accuracy (%) |
|---|---|---|
| Elhoseny et al. [ | Ant Colony-based Optimization Classifier | 95 |
| Vasquez-Morales et al. [ | Neural network | 95 |
| M Senan et al. [ | 98.33 | |
| Krishnamurthy et al. [ | Convolutional Neural Networks | 95.4 |
| Polat, H et al. [ | Support Vector Machine | 98.5 |
| Sarah A. et al. [ | SAE and Softmax Regression | 98 |
| Proposed Model | Deep Neural Network | 100 |
Figure 9Accuracy graphical representation for the UCI CKD data set.
Figure 10Accuracy graphical representation for the UCI CKD data set.
The most critical risk factors from CKD data.
| Risk Factor Name |
|---|
| Hemoglobin |
| Serum Creatinine |
| Red Blood Cell Count |
| Packed Cell Volume |
| Albumin |
| Specific Gravity |
| Hypertension |
Figure 11Important features selected by RFE.
Figure 12ROC/AUC of Proposed model.
Figure 13ROC/AUC of Logistic Regression.
Figure 14ROC/AUC of Decision tree.
Figure 15ROC/AUC of SVM.
Figure 16ROC/AUC of KNN.
Figure 17ROC/AUC of Naïve bayes.