| Literature DB >> 34884099 |
Khalid Mahmood Aamir1, Laiba Sarfraz1, Muhammad Ramzan1,2, Muhammad Bilal3, Jana Shafi4, Muhammad Attique5.
Abstract
Diabetes is a fatal disease that currently has no treatment. However, early diagnosis of diabetes aids patients to start timely treatment and thus reduces or eliminates the risk of severe complications. The prevalence of diabetes has been rising rapidly worldwide. Several methods have been introduced to diagnose diabetes at an early stage, however, most of these methods lack interpretability, due to which the diagnostic process cannot be explained. In this paper, fuzzy logic has been employed to develop an interpretable model and to perform an early diagnosis of diabetes. Fuzzy logic has been combined with the cosine amplitude method, and two fuzzy classifiers have been constructed. Afterward, fuzzy rules have been designed based on these classifiers. Lastly, a publicly available diabetes dataset has been used to evaluate the performance of the proposed fuzzy rule-based model. The results show that the proposed model outperforms existing techniques by achieving an accuracy of 96.47%. The proposed model has demonstrated great prediction accuracy, suggesting that it can be utilized in the healthcare sector for the accurate diagnose of diabetes.Entities:
Keywords: classification; diabetes; diabetes prediction; fuzzy logic; fuzzy rule-based system
Mesh:
Year: 2021 PMID: 34884099 PMCID: PMC8659829 DOI: 10.3390/s21238095
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Summary of machine learning techniques for diabetes detection.
| Sr. No. | Reference | Year | Methodology | Finding and Results |
|---|---|---|---|---|
| 1 | Sisodia et al. [ | 2018 | Naive Bayes, SVM, and DT | NB classifier outperformed the other classifiers with an accuracy of 76.30%. |
| 2 | Naz et al. [ | 2020 | Artificial Neural Network (ANN), Bayes, Decision Tree, and Deep Learning | Deep Learning (DL) attained the highest 98.07% accuracy |
| 3 | Khanam et al. [ | 2021 | SVM, DT, k-Nearest Neighbours (kNN), Random Forest (RF), Logistic Regression (LR), AdaBoost (AB), and Neural Network (NN) | Neural Network (NN) outperformed the other techniques and reached an accuracy of 88.6% on the Pima Indians Diabetes (PID) dataset |
| 4 | Hasan et al. [ | 2020 | Weighted ensemble model of kNN, DT, RF, AB, NB, and XGBoost | The results demonstrated that the proposed ensemble classifier achieved 78.9% sensitivity, 93.4% specificity, and 95% AUC |
| 5 | Singh et al. [ | 2021 | Ensemble model of DT, RF, SVM, XGBoost, and NN | The model demonstrated an accuracy of 95%. |
| 6 | Pradhan et al. [ | 2020 | Artificial neural network | With 70% training data and 30% testing data, the model achieved an accuracy of 85.09% |
| 7 | Kannadasan et al. [ | 2019 | Deep Neural Network (DNN) | The results demonstrated that the classifier achieved an accuracy of 86.26%. |
| 8 | Maniruzzaman et al. [ | 2017 | Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA), and Naive Bayes (NB) | The results demonstrated that the model achieved an accuracy of 81.97%. |
| 9 | Azad et al. [ | 2021 | Synthetic Minority Over-sampling Technique (SMOTE), Genetic Algorithm (GA), and Decision Tree (DT) | The model was tested on the Pima Indians Diabetes (PID) dataset and achieved an accuracy of 82.1256%. |
| 10 | Kumari et al. [ | 2021 | k-Nearest Neighbours (kNN) | The model achieved an accuracy of 92.28% on the diabetes dataset |
| 11 | Abokhzam et al. [ | 2021 | Machine Learning grid-based Random Forest | The model achieved an accuracy of 95.7%. |
Summary of fuzzy logic techniques for diabetes detection.
| Sr. No. | Reference | Year | Methodology | Finding and Results |
|---|---|---|---|---|
| 1 | Siva et al. [ | 2019 | Fuzzy rules and grey wolf optimization (GWO) algorithm | The classification was carried out by using optimal rules. The proposed model obtained an accuracy of 81%. |
| 2 | Cheruku et al. [ | 2018 | Rough Set Theory (RST) and the Bat Optimization Algorithm | The system gave an accuracy of 85.33%. |
| 3 | Singh et al. [ | 2019 | Fuzzy rule miner (ANT FDCSM). | The results demonstrated 87.7% accuracy, 92.2% sensitivity, and 80.3% specificity. |
| 4 | Lukmanto et al. [ | 2019 | Fuzzy support vector machine | The results demonstrated that the system achieved an accuracy of 89.02% in predicting diabetes. |
| 5 | Sharma et al. [ | 2021 | Mediative Fuzzy Logic | The researchers generated optimal fuzzy rules for diabetes prediction. |
| 6 | Thungrut et al. [ | 2019 | Fuzzy genetic algorithm | The system showed 87.40% accuracy, 86.82% sensitivity, and 88% specificity. |
| 7 | Zhang et al. [ | 2019 | Parallel ensemble fuzzy classifier | The finding demonstrated that the FP-TSK-FW is effective in the classification of diabetes. |
| 8 | Mujawar et al. [ | 2019 | Fuzzy expert system | The results demonstrated 84% prediction accuracy. |
| 9 | Chen et al. [ | 2019 | Neuro-fuzzy | The proposed model gave 75.67% accuracy on the selected dataset. |
| 10 | Mansourypoor et al. [ | 2017 | Fuzzy rule-based system | The researchers used two datasets to test RLEFRBS performance: the Pima Indian Diabetes dataset and the Biosat Diabetes dataset, and these datasets gave 82.5% and 96.5% accuracy, respectively. |
| 11 | Vaishali et al. [ | 2017 | Multiple objective evolutionary fuzzy classifier | With 70% training and 30% testing data, the classifier achieved an accuracy of 83.0435%. |
| 12 | Geman et al. [ | 2017 | Adaptive Neuro-Fuzzy Inference System | The proposed system demonstrated accuracy for training data is 85.35%, and testing data is 84.27%. |
| 13 | Bhuvaneswari et al. [ | 2018 | Temporal fuzzy ant miner tree | The proposed system achieved an 83.7% accuracy. |
| 14 | Deshmukh et al. [ | 2018 | Fuzzy CNN | The results demonstrated that the fuzzified CNN approach outperformed the traditional NN approach and achieved an accuracy of 95%. |
Description of the dataset attributes.
| Sr. No. | Parameters of Dataset | Description of Parameters | Normal Range |
|---|---|---|---|
| 1 | Number of pregnancies | Number of times the person gets pregnant | 0–17 |
| 2 | Plasma glucose concentration | Represents the concentration of glucose in a person’s body. | 0–199 |
| 3 | Diastolic blood pressure | Represents the diastolic blood pressure in (mm Hg) | 0–122 |
| 4 | Triceps skinfold thickness | Represents triceps skinfold thickness in (mm) | 0–99 |
| 5 | Serum insulin | Represent 2 h serum insulin in (µU/mL) | 0–846 |
| 6 | Body mass index | It is a value derived from the weight and height of a person (weight in kg/(height in m)2). | 0–67.1 |
| 7 | Diabetes pedigree function | It represents the history of diabetes associated with a particular person. | 0.0078–2.42 |
| 8 | Age | Age of the person in years | 21–81 |
| 9 | Class variable | It represented two classes: diabetic and non-diabetic | Yes/No |
Figure 1The framework of the proposed system.
Dataset division into training and testing.
| Class 0 | Class 1 | |
|---|---|---|
|
| 500 | 268 |
| Training | 250 | 150 |
| Testing | 250 | 118 |
Figure 2Fuzzy membership values for variables.
Figure 3Demonstrates the threshold values of both classifiers for the training phase. (a) shows the threshold values for classifier 1 and (b) shows the threshold values for classifier 2.
Figure 4MFs. (a–c) shows MFs for class 1 while (d,e) shows MFs for class 2.
Figure 5MFs for classifier 1.
Figure 6MFs for classifier 2.
Description of the confusion matrix.
| Predicted Yes | Predicted No | |
|---|---|---|
|
| True Positive (TP) | False Negative (FN) |
|
| False Positive (FP) | True Negative (TN) |
Confusion matrix for both fuzzy classifiers.
| Predicted Yes | Predicted No | ||
|---|---|---|---|
| Classifier 1 |
| 113 | 5 |
|
| 8 | 242 | |
| Classifier 2 |
| 106 | 12 |
|
| 5 | 245 |
Comparison of our fuzzy classifiers with other fuzzy techniques.
| Sr. No. | Reference | Methods | Classification Accuracy |
|---|---|---|---|
| 1 | Siva et al. [ | Fuzzy rules and grey wolf optimization (GWO) algorithm | 81% |
| 2 | Cheruku et al. [ | Rough Set Theory (RST) and the Bat Optimization Algorithm | 85.33% |
| 3 | Singh et al. [ | Fuzzy rule miner (ANT FDCSM). | 87.7% |
| 4 | Lukmanto et al. [ | Fuzzy support vector machine | 89.02% |
| 5 | Thungrut et al. [ | Fuzzy genetic algorithm | 87.40% |
| 6 | Mujawar et al. [ | Fuzzy expert system | 84% |
| 7 | Chen et al. [ | Neuro-fuzzy | 75.67% |
| 8 | Mansourypoor et al. [ | Fuzzy rule-based system | 82.5% and 96.5% |
| 9 | Vaishali et al. [ | Multiple objective evolutionary fuzzy classifier | 83.0435% |
| 10 | Geman et al. [ | Adaptive Neuro-Fuzzy Inference System | For training data, 85.35% and testing data, is 84.27% |
| 11 | Bhuvaneswari et al. [ | Temporal fuzzy ant miner tree | 83.7% |
| 12 | Deshmukh et al. [ | Fuzzy CNN | 95% |
| 13 | Fuzzy classifier 1 | Fuzzy | 96.47% |
| 14 | Fuzzy classifier 2 | Fuzzy | 95.38% |
The performance measures for both classifiers.
| Performance Measure | Percentage | |
|---|---|---|
| Classifier 1 | Accuracy | 96.47% |
| Recall | 95.76% | |
| Precision | 93.39% | |
| F-measure | 94.56% | |
| Classifier 2 | Accuracy | 95.38% |
| Recall | 89.83% | |
| Precision | 95.50% | |
| F-measure | 92.58% |