| Literature DB >> 34296003 |
Lara Lama1, Oskar Wilhelmsson1, Erik Norlander1, Lars Gustafsson1, Anton Lager2, Per Tynelius2, Lars Wärvik1, Claes-Göran Östenson3.
Abstract
AIMS: To study if machine learning methodology can be used to detect persons with increased type 2 diabetes or prediabetes risk among people without known abnormal glucose regulation.Entities:
Keywords: Individual healthcare plan; Interpretable machine learning; Machine learning; Risk screening; SHAP; Type 2 diabetes
Year: 2021 PMID: 34296003 PMCID: PMC8282976 DOI: 10.1016/j.heliyon.2021.e07419
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Participants of the epidemiological study.
| Women | Men | Total | |
|---|---|---|---|
| All investigated | 4821 | 3128 | 7949 |
| NGT | 4551 | 2803 | 7354 |
| IFG | 124 | 143 | 267 |
| IGT | 70 | 76 | 146 |
| IFG + IGT | 25 | 40 | 65 |
| Prediabetes (all) | 219 | 259 | 478 |
| T2D (new) | 51 | 66 | 117 |
| All investigated | 3318 | 2360 | 5678 |
| NGT | 2817 | 1660 | 4477 |
| IFG | 237 | 331 | 568 |
| IGT | 94 | 100 | 194 |
| IFG + IGT | 68 | 98 | 166 |
| Prediabetes (all) | 399 | 529 | 928 |
| T2D (new) | 54 | 87 | 141 |
| T2D (incident) | 48 | 84 | 132 |
| T2D (all) | 102 | 171 | 273 |
| All investigated | 2019 | 1323 | 3342 |
| NGT | 1308 | 712 | 2020 |
| IFG | 394 | 336 | 730 |
| IGT | 83 | 76 | 159 |
| IFG + IGT | 138 | 110 | 248 |
| Prediabetes (all) | 615 | 522 | 1137 |
| T2D (new) | 96 | 89 | 185 |
| T2D (earlier) | 153 | 237 | 285 |
| T2D (all) | 230 | 326 | 556 |
The table summarizes the number (n) of individuals participating in the three investigations with a mean of 8–12 years apart and how they were diagnosed according to glucose tolerance.
NGT, normal glucose tolerance, IFG, impaired fasting glucose, IGT, impaired glucose tolerance, Prediabetes, sum of those with IFG, IGT and IFG + IGT, T2D new, diagnosed at the investigation and T2D incident, reported and ascertained diagnosed during interval between two investigations, e.g. the baseline and the Follow-up #1.
Factors included in analysis.
| Heredity i.e. family history of diabetes |
| High age |
| High waist-hip ratio |
| High BMI |
| Systolic blood pressure increased |
| Diastolic blood pressure increased |
| Low physical activity |
| Male gender |
| Exercise |
| Higher socioeconomic strata |
| Lower age |
| Tobacco use, cigarettes |
| Snus, oral moist tobacco |
| Chest pain, angina |
| General health |
| Psychologic distress: |
| Depression |
| Nervousness |
| Fatigue |
| Lethargy |
| Insomnia |
| Coffee |
Figure 1(A). The figure shows the score, S, vs. AUC (Area Under the Curve on Validation Set). (B) The feature importance shown as a split violin plot with the SHAPt values for feature values above (red) and below (blue) the mean values.
Figure 2The difference in SHAP values for different BMI (body mass index, expressed as kg/m2) in relation to diabetes heredity. Feature values larger or smaller than the mean of the feature are depicted in red or blue color, respectively.
Figure 3A SHAP force plot for a person in the data set with a higher risk than average to develop type II diabetes. Features depicted in red color represent higher risk, while features in blue color lower risk of diabetes.
Figure 4The circular bar plot shows the feature values for a person in the data set with higher than average diabetes risk. The features are colorized according to SHAP values, i.e. red indicates increased, blue decreased and grey non-significant effect on the risk.