| Literature DB >> 35372240 |
Osama Rabie1, Daniyal Alghazzawi1, Junaid Asghar2, Furqan Khan Saddozai3, Muhammad Zubair Asghar3.
Abstract
Background and Objective: According to the WHO, diabetes mellitus is a long-term condition marked by high blood sugar levels. The consequences might be far-reaching. According to current increases in mortality, diabetes has risen to number 10 among the leading causes of mortality worldwide. When used to predict diabetes using unbalanced datasets from testing, machine learning (ML) classifiers and established approaches for encoding categorical data have exhibited a broad variety of surprising outcomes. Early studies also made use of an artificial neural network to extract features without obtaining a grasp of the sequence information.Entities:
Keywords: decision support system; deep learning; diabetes prediction; disease diagnoses; disease diagnosis
Mesh:
Year: 2022 PMID: 35372240 PMCID: PMC8970706 DOI: 10.3389/fpubh.2022.861062
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Summary of selected works.
|
|
|
|
|
|---|---|---|---|
| Butt et al. ( | Randomized forest, multilayer perceptron, and regression models (LR) | MLP outperforms similar learners with an accuracy of 86% | Dimensionality reduction techniques not applied |
| Gupta et al. ( | Support vector machine | Support vector machine did better than the naive Bayes model. | Lack of current comparability and consistency |
| Qawqzeh et al. ( | Regression model | 92% accuracy | Lack of comparison with the current methods |
| Pethunachiyar ( | SVM classifier | Outperformed Naive Bayes, decision trees, and neural nets | There is no discussion of parameter estimation |
| Choubey et al. ( | SVM, KNN, and NB | 91% accuracy | Performance overhead due to incorporation of extensive feature engineering |
| Zhou et al. ( | DTP model for glycemic control diagnosis | Promising results | Execution time needs to be further reduced |
| Garca-Ordás et al. ( | Deep learning approach (CNN) | 92.31% accuracy | Ensemble learning technique required for more better results |
| Alam et al. ( | Association rules | 92% accuracy | More effective preprocessing |
| Naz and Ahuja ( | Multilayer Perceptron, Logistic Regression, and Deep Learning | 90% accuracy | A pipeline of classifiers can produce efficient results |
| Yuvaraj and SriPreethaa ( | Machine learning techniques in hadoop clusters | Outperformed baseline methods | Poor selection of predictors |
| Hasan et al. ( | k-nearest neighbor, Decision Trees, Randomized Forest, Xgboost, Bayesian Network, Gradient boosting, and Lstm | LSTM exhibited better results (92% accuracy) | Parameter selection is affected by lack of efficient proper preprocessing techniques |
Figure 1Overview of the proposed system for predicting diabetes disease.
Figure 2Parameters used to predict the likelihood of diabetes.
Figure 3BILSTM-based for diabetes prediction system.
Figure 4Diabetes disease classification using the softmax function.
Methodology of the proposed diabetes disease prediction model.
| A. Data Input: As a csv file, import the diabetes disease labeled dataset. |
Figure 5Data entry form for diabetes prediction.
Figure 6Create a model screen by loading training data.
Figure 7Diabetes prediction interface.
BILSTM deep learning models' accuracy, recall, and f1-score.
|
|
|
|
|
|---|---|---|---|
| BILSTM-1 | 0.74 | 0.75 | 0.74 |
| BILSTM-2 | 0.83 | 0.77 | 0.75 |
| BILSTM-3 | 0.85 | 0.82 | 0.80 |
| BILSTM-4 | 0.86 | 0.84 | 0.83 |
| BILSTM-5 | 0.89 | 0.85 | 0.84 |
| BILSTM-6 | 0.88 | 0.88 | 0.87 |
| BILSTM-7 | 0.89 | 0.89 | 0.88 |
| BILSTM-8 | 0.90 | 0.89 | 0.89 |
| BILSTM-9 | 0.92 | 0.90 | 0.90 |
| BILSTM-10 | 0.93 | 0.92 | 0.92 |
Performance of the BILSTM models with and without balancing data.
|
|
|
|
|---|---|---|
| Accuracy (%) | 82 | 93.07 |
| Precision (%) | 82 | 93 |
| Recall (%) | 82 | 92 |
| F1-score | 88 | 92 |
The BILSTM models' accuracy, test loss, and training time.
|
|
|
|
|
|---|---|---|---|
| BILSTM1 | 81.23 | 0.78 | 18 |
| BILSTM2 | 82.21 | 0.86 | 5 |
| BILSTM3 | 83.71 | 1.04 | 17 |
| BILSTM4 | 84.67 | 1.13 | 15 |
| BILSTM5 | 85.98 | 0.92 | 6 |
| BILSTM6 | 87.47 | 0.91 | 13 |
| BILSTM7 | 88.21 | 1.11 | 11 |
| BILSTM8 | 88.36 | 0.80 | 13 |
| BILSTM9 | 89.23 | 0.91 | 15 |
| BILSTM10 | 92.15 | 0.82 | 10 |
Machine learning classifiers vs. proposed model (BILSTM).
|
|
|
|
|
|
|---|---|---|---|---|
| KNN | 78 | 79 | 80 | 79 |
| DT | 81 | 80 | 80 | 80 |
| SVM | 82 | 79 | 81 | 79 |
| NB | 72 | 70 | 70 | 70 |
| Proposed (BILSTM) | 93.07 | 93 | 92 | 92 |
BILSTM vs. other DL models.
|
|
|
|
|
|
|---|---|---|---|---|
| LSTM | 83.36 | 84 | 83 | 83 |
| CNN | 83.22 | 81 | 80 | 81 |
| RNN | 81.11 | 80 | 81 | 80 |
| Proposed (BILSTM) | 93.07 | 93 | 92 | 92 |
Comparison of the BILSTM model with other studies.
|
|
|
|
|---|---|---|
| Butt et al. ( | Predicting diabetic illness with machine learning | Acc: 85% |
| Gupta et al. ( | Predicting diabetic illness with machine learning | Acc. 88% |
| Proposed (BILSTM) | Predicting diabetes with deep learning (BILSTM) | 93.07 |
The human expert's prognosis vs. the suggested system's.
|
|
|
|
|---|---|---|
| 1 | Diabetes = yes | Diabetes = yes |
| 2 | Diabetes = yes | Diabetes = yes |
| 3 | Diabetes = no | Diabetes = no |
| 4 | Diabetes = yes | Diabetes = no |
| 5 | Diabetes = yes | Diabetes = yes |
| 6 | Diabetes = yes | Diabetes = yes |
| 7 | Diabetes = yes | Diabetes = yes |
| 8 | Diabetes = no | Diabetes = no |
| 9 | Diabetes = yes | Diabetes = yes |
| 10 | Diabetes = no | Diabetes = no |
| 11 | Diabetes = yes | Diabetes = yes |
| 12 | Diabetes = yes | Diabetes = yes |
The external validation of the proposed method.
|
|
|
|
|
|---|---|---|---|
| Dataset 2 | 0.723 (KNN) | 0.718 (KNN) | 0.751(KNN) |
| Dataset 3 | 0.718 (KNN) | 0.817 (KNN) | 0.781 (KNN) |