| Literature DB >> 30700010 |
Vanessa Alcalá-Rmz1, Laura A Zanella-Calzada2, Carlos E Galván-Tejada3, Alejandra García-Hernández4, Miguel Cruz5, Adan Valladares-Salgado6, Jorge I Galván-Tejada7, Hamurabi Gamboa-Rosales8.
Abstract
Diabetes is a chronic and noncommunicable but preventable disease that is affecting the Mexican population at worrying levels, being the first place in prevalence worldwide. Early diabetes detection has become important to prevent other health conditions that involve low organ yield until the patient death. Based on this problem, this work proposes the architecture of an Artificial Neural Network (ANN) for the automated classification of healthy patients from diabetics patients. The analysis was performed used a set of 19 para-clinical features to determine the health status of the patients. The developed model was evaluated through a statistical analysis based on the calculation of the loss function, accuracy, area under the curve (AUC) and receiving operating characteristics (ROC) curve. The results obtained present statistically significant values, with accuracy of 0.94 and AUC values of 0.98. Based on these results, it is possible to conclude that the ANN implemented in this work can classify patients with presence of diabetes from controls with significant accuracy, presenting preliminary results for the development of a diagnostic tool that can be supportive for health specialists.Entities:
Keywords: Artificial Neural Network; computer-aided diagnosis; net reclassification improvement; statistical analysis; type 2 diabetes
Mesh:
Year: 2019 PMID: 30700010 PMCID: PMC6388177 DOI: 10.3390/ijerph16030381
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Features description.
| Feature | Description |
|---|---|
| Age | Patient age at the time of analysis. |
| Gender | Patient gender (0—male/1—female). |
| Education | Studies concluded by the patient, (1—elementary school/2—secondary school, |
| Weight | Patient weight in kilograms. |
| Height | Patient height in centimeters. |
| Waist | Patient waist perimeter in centimeters. |
| Hip Perimeter | Patient hip perimeter in centimeters. |
| BMI | Body Mass Index based on weight and height of a patient. |
| WHR | Waist Hip-Ratio based on the circumference of the waist to that of the hips. |
| SBP | Systolic Blood Pressure based on the pressure in the blood vessels when the heart beats. |
| DBP | Diastolic Blood Pressure based on the pressure in the blood vessels when the heart rests |
| Glucose | Blood glucose levels in terms of milligrams. |
| MMO Glucose | Blood glucose levels in terms of a molar concentration. |
| Insulin | Patient insulin in the blood. |
| HOMA | Homeostatic Model Assessment based on insulin resistance and beta-cell function. |
| Cholesterol | Fat-like substance that is found in all cells in the patient body. |
| LDL | Stands of low-density lipoprotein in the patient body. |
| HDL | Stands for high-density lipoprotein in the patient body. |
| TR | Triglycerides based on a type of fat (lipids) found in the patient body. |
| Output | Diabetes status (0—control/1—case). |
Figure 1Graphic representation of the Artificial Neural Network (ANN) implemented.
Accuracy and loss function values using different number of epochs.
| Epochs | Accuracy | Loss Function | Processing Time (s) |
|---|---|---|---|
| 10 | 0.93 | 0.21 | 1.59 |
| 50 | 0.94 | 0.20 | 5.12 |
| 100 | 0.96 | 0.21 | 9.35 |
| 150 | 0.94 | 0.23 | 13.96 |
| 200 | 0.93 | 0.25 | 18.23 |
| 300 | 0.93 | 0.30 | 27.18 |
| 500 | 0.94 | 0.29 | 44.12 |
| 1000 | 0.93 | 0.39 | 86.99 |
Accuracy, loss function and processing time with different number of layers and neurons.
| Layers Dense/Dropout | Neurons | Accuracy | Loss Function | Processing Time (s) |
|---|---|---|---|---|
| 2/0 | 19 > 2 | 0.94 | 0.20 | 3.83 |
| 2/1 | 19 > 0.5 > 2 | 0.94 | 0.20 | 4.37 |
| 3/1 | 19 > 100 > 0.5 > 2 | 0.96 | 0.20 | 4.83 |
| 3/2 | 19 > 0.25 > 100 > 0.5 > 2 | 0.95 | 0.19 | 5.20 |
| 4/1 | 19 > 100 > 0.5 > 500 > 2 | 0.97 | 0.25 | 6.59 |
| 4/2 | 19 > 0.25 > 100 > 0.5 > 500 > 2 | 0.96 | 0.22 | 6.97 |
| 4/3 | 19 > 0.25 > 100 > 0.5 > 500 > 0.25 > 2 | 0.96 | 0.23 | 7.63 |
| 5/1 | 19 > 100 > 0.5 > 500 > 100 > 2 | 0.98 | 0.31 | 8.17 |
| 5/2 | 19 > 0.25 > 100 > 0.5 > 500 > 100 > 2 | 0.96 | 0.21 | 8.62 |
| 5/3 | 19 > 0.25 > 100 > 0.5 > 500 > 0.25 > 100 > 2 | 0.96 | 0.22 | 9.21 |
|
|
|
|
|
|
1 Structure selected for the Artificial Neural Network (ANN).
Figure 2Accuracy behavior.
Figure 3Loss function behavior.
Figure 4Receiving operating characteristics (ROC) curves obtained with the average performance of the ANN. AUC: area under the curve.
Accuracy presented in related works.
| Work | Description | Accuracy |
|---|---|---|
| Ndaba et al. [ | Diabetes classification based on a regression ANN | 86.00% |
| Soltani et al. [ | Diabetes diagnosis based on a probabilistic ANN | 89.56% |
| Sejdinović et al. [ | Diabetes classification on an ANN | 93.90% |
| Chen et al. [ | Diabetes classification model based on boosting algorithms | 95.30% |