| Literature DB >> 35449835 |
Roshi Saxena1, Sanjay Kumar Sharma1, Manali Gupta1, G C Sampada2.
Abstract
Diabetes is a chronic disease characterized by a high amount of glucose in the blood and can cause too many complications also in the body, such as internal organ failure, retinopathy, and neuropathy. According to the predictions made by WHO, the figure may reach approximately 642 million by 2040, which means one in a ten may suffer from diabetes due to unhealthy lifestyle and lack of exercise. Many authors in the past have researched extensively on diabetes prediction through machine learning algorithms. The idea that had motivated us to present a review of various diabetic prediction models is to address the diabetic prediction problem by identifying, critically evaluating, and integrating the findings of all relevant, high-quality individual studies. In this paper, we have analysed the work done by various authors for diabetes prediction methods. Our analysis on diabetic prediction models was to find out the methods so as to select the best quality researches and to synthesize the different researches. Analysis of diabetes data disease is quite challenging because most of the data in the medical field are nonlinear, nonnormal, correlation structured, and complex in nature. Machine learning-based algorithms have been ruled out in the field of healthcare and medical imaging. Diabetes mellitus prediction at an early stage requires a different approach from other approaches. Machine learning-based system risk stratification can be used to categorize the patients into diabetic and controls. We strongly recommend our study because it comprises articles from various sources that will help other researchers on various diabetic prediction models.Entities:
Mesh:
Year: 2022 PMID: 35449835 PMCID: PMC9018179 DOI: 10.1155/2022/8100697
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 3.822
Comparative analysis of diabetes prediction using machine learning methods.
| S. no. | Method name | Number of datasets used | Name of the dataset | Data size | Speed | Does it rank features | CV protocol used | Evaluation parameters taken | Classifier used | Feature selection method | Number of features used | Classification accuracy | Year in which paper was published | Temporal interval |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Kamrul Hasan | 1 | PIDD | 768 | Slow | Yes | 5 | Sn, Sp, and AUC | KNN, DT, RF, MLP, AB, XB, and NB | PCA, ICA, and CRB | 6 | 78.9% | April 2020 | 2020-2021 |
| 2 | Quan Zou | 2 | Luzhou and PIDD | 68994, 768 | Slow | Yes | 5 | Sn, Sp, ACC, and MCC | J48, RF, and NN | PCA and mrMR | 11, 7 | 80.84% | November 2018 | 2018-2019 |
| 3 | Nishith Kumar | 1 | PIDD | 768 | Fast | No | 5 and 10 | Sn, Sp, ACC, PPV, and NPV | GPC, LDA, QDA, and NB | Kernels | All | 81.97% | December 2017 | 2016-2017 |
| 4 | Maniruzzuman | 1 | NHANES | 9858 | Slow | No | 2, 5, and 10 | Sn, ACC, PPV, NPV, FM, and AUC | NB, DT, RF, and AB | LR | All | 92.75% | January 2020 | 2020-2021 |
| 5 | V. Jackins | 1 | PIDD | 768 | Fast | Yes | None | ACC | NB and RF | CRB | 4 | 74.46% | November 2020 | 2020-2021 |
| 6 | N. Sneha | 1 | PIDD | 2500 | Slow | Yes | None | Sn, Sp, ACC, PPV, NPV, PLR, NLR, and DP | SVM, RF, NB, DT, and KNN | CRB | 11 | 82.3% | February 2019 | 2018-2019 |
| 7 | S. Mohapatra | 1 | PIDD | 768 | Fast | No | None | ACC, TP, and TN | MLP | None | All | 77.5% | September 2019 | 2018-2019 |
| 8 | D. Sisodia | 1 | PIDD | 768 | Fast | No | None | Recall, precision, and ACC | NB, SVM, and DT | None | All | 76.3% | December 2018 | 2018-2019 |
| 9 | Orabi | 1 | Egyptian National Research Centre | Not mentioned | Slow | No | None | ACC | DT | Not mentioned | 9 | 84% | 2016 | 2016-2017 |
| 10 | O. M. Alade | 1 | PIDD | 768 | Fast | No | None | ACC | NN | None | All | Only prediction was done | December 2017 | 2016-2017 |