| Literature DB >> 35634091 |
Sasmita Padhy1, Sachikanta Dash2, Sidheswar Routray3, Sultan Ahmad4, Jabeen Nazeer4, Afroj Alam5.
Abstract
Nowadays, there is a growing need for Internet of Things (IoT)-based mobile healthcare applications that help to predict diseases. In recent years, several people have been diagnosed with diabetes, and according to World Health Organization (WHO), diabetes affects 346 million individuals worldwide. Therefore, we propose a noninvasive self-care system based on the IoT and machine learning (ML) that analyses blood sugar and other key indicators to predict diabetes early. The main purpose of this work is to develop enhanced diabetes management applications which help in patient monitoring and technology-assisted decision-making. The proposed hybrid ensemble ML model predicts diabetes mellitus by combining both bagging and boosting methods. An online IoT-based application and offline questionnaire with 15 questions about health, family history, and lifestyle were used to recruit a total of 10221 people for the study. For both datasets, the experimental findings suggest that our proposed model outperforms state-of-the-art techniques.Entities:
Mesh:
Year: 2022 PMID: 35634091 PMCID: PMC9132636 DOI: 10.1155/2022/2389636
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Health monitoring in IoT hospitals.
Possible answers for different features.
| Sl. No | Features | Possible answers |
|---|---|---|
| 1 | Age | More than 18 years |
| 2 | Gender | Male 643, Female 429 |
| 3 | Family history | Yes/No |
| 4 | Physical_Activity | 1. More_than_one_hour |
| 5 | Urination_Frequency | 1. Frequently |
| 6 | Junc_Food_Consumption | Yes/No |
| 7 | Blood_Pressure | 1. Normal |
| 8 | BMI | Numeric |
| 9 | Diabetes | Yes/No |
| 10 | Reglar_Intake_Of_Medicne | Yes/No |
| 11 | No_Of_Pregnancies | Numeric |
| 12 | Smoking | Yes/No |
| 13 | Alcohol_Consumption | Yes/No |
| 14 | Hour_Of_Sleep | Numeric |
| 15 | Stress | Yes/No |
Figure 2Screenshot of the collected dataset.
Figure 3Sensors and services for diabetes monitoring.
Figure 4Flow of the proposed model.
Matrix of confusion for different classification methods.
| Dataset/Models | Logistic regression | K-nearest neighbor | Support vector machine | Proposed model |
|---|---|---|---|---|
| PIMA dataset | [[138 12] [42 41]] | [[132 20] [39 44]] | [[143 9] [47 36]] | [[128 20] [36 45]] |
| Collected dataset | [[192 24] [15 87]] | [[164 38] [46 74]] | [[195 15] [24 93]] | [[224 3] [4 95]] |
Comparison of statistical measurement for various classification techniques.
| Logistic regression | K-nearest neighbor | Support vector machine | Proposed model | |||||
|---|---|---|---|---|---|---|---|---|
| Collected dataset | Pima dataset | Collected dataset | Pima dataset | Collected dataset | Pima dataset | Collected dataset | Pima dataset | |
| Accuracy | 0.872 | 0.744 | 0.739 | 0.708 | 0.888 | 0.744 | 0.984 | 0.750 |
| Error | 0.127 | 0.255 | 0.261 | 0.291 | 0.112 | 0.255 | 0.016 | 0.250 |
| Sensitivity | 0.923 | 0.775 | 0.778 | 0.748 | 0.898 | 0.775 | 0.987 | 0.789 |
| Specificity | 0.764 | 0.666 | 0.702 | 0.603 | 0.816 | 0.666 | 0.916 | 0.661 |
| Precision | 0.885 | 0.856 | 0.816 | 0.832 | 0.933 | 0.856 | 0.991 | 0.840 |
| F-measure | 0.903 | 0.813 | 0.797 | 0.787 | 0.915 | 0.813 | 0.989 | 0.813 |
| MCC | 0.732 | 0.416 | 0.503 | 0.331 | 0.764 | 0.416 | 0.963 | 0.436 |
| Kappa | 0.727 | 0.470 | 0.516 | 0.419 | 0.713 | 0.466 | 0.922 | 0.488 |
| AUC | 0.908 | 0.765 | 0.916 | 0.815 | 0.893 | 0.771 | 0.984 | 0.978 |
Figure 5(a) ROC curve with AUC for PIMA dataset. (b) ROC curve with AUC for the collected dataset.
Importance of parameters.
| Count | Mean | Std | min | 0.25 | 0.5 | 0.75 | Max | |
|---|---|---|---|---|---|---|---|---|
| Age∗∗∗ | 10221 | 32.9026 | 11.09011 | 21.00000 | 24.00000 | 29.50000 | 40.00000 | 66.00000 |
| Gender∗∗ | 10221 | 0.50649 | 0.50158 | 0.00000 | 0.00000 | 1.00000 | 1.00000 | 1.00000 |
| family_history∗∗∗ | 10221 | 0.29220 | 0.45626 | 0.00000 | 0.00000 | 0.00000 | 1.00000 | 1.00000 |
| physical_activity∗∗∗ | 10221 | 1.38311 | 0.78547 | 0.00000 | 1.00000 | 2.00000 | 2.00000 | 2.00000 |
| Regular_intake_of_medicine∗∗∗ | 10221 | 0.26623 | 0.44343 | 0.00000 | 0.00000 | 0.00000 | 1.00000 | 1.00000 |
| No_of_pregnancies∗∗ | 10221 | 1.88311 | 3.08686 | 0.00000 | 0.00000 | 0.00000 | 3.00000 | 14.00000 |
| Smoking | 10221 | 0.07792 | 0.26892 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 1.00000 |
| alcohol_consumption | 10221 | 0.33766 | 0.47445 | 0.00000 | 0.00000 | 0.00000 | 1.00000 | 1.00000 |
| hour_of_sleep | 10221 | 6.59740 | 0.91836 | 6.00000 | 6.00000 | 6.00000 | 8.00000 | 8.00000 |
| Stress | 10221 | 0.44805 | 0.57214 | 0.00000 | 0.00000 | 0.00000 | 1.00000 | 2.00000 |
| Urination_frequency | 10221 | 0.40909 | 0.49327 | 0.00000 | 0.00000 | 0.00000 | 1.00000 | 1.00000 |
| Junc_food_consumption | 10221 | 0.52597 | 0.50095 | 0.00000 | 0.00000 | 1.00000 | 1.00000 | 1.00000 |
| Blood_pressure∗∗ | 10221 | 0.86363 | 0.80900 | 0.00000 | 0.00000 | 1.00000 | 2.00000 | 2.00000 |
| BMI | 10221 | 32.3701 | 7.40424 | 0.00000 | 28.00000 | 33.00000 | 36.00000 | 67.00000 |
| Diabetes | 10221 | 0.44155 | 0.49819 | 0.00000 | 0.00000 | 0.00000 | 1.00000 | 1.00000 |
Figure 6Correlation matrix.
Figure 7Comparison of traditional classification models and proposed model when implemented on the collected dataset and PIMA dataset.
Comparison study of our suggested work with the current state of the art in terms of accuracy.
| Author(Year) | Method | Accuracy (%) |
|---|---|---|
| Swapna et al. [ | Deep learning algorithms | 95.7 |
| Bhatia et al. [ | Fuzzy cognitive maps | 96 |
| Samant et al. [ | Improvised random forest technique | 89.66 |
| Sisodia et al. [ | Modified machine learning algorithms | 76.3 |
| Wu et al. [ | Improved data mining techniques | 95.42 |
| Our approach | IoT-based hybrid ensemble machine learning model |
|