| Literature DB >> 31471957 |
Woo Jung Kim1,2,3, Ji Min Sung4, David Sung5, Myeong-Hun Chae6, Suk Kyoon An2,7, Kee Namkoong2,7, Eun Lee2,7, Hyuk-Jae Chang4,8.
Abstract
BACKGROUND: With the increase in the world's aging population, there is a growing need to prevent and predict dementia among the general population. The availability of national time-series health examination data in South Korea provides an opportunity to use deep learning algorithm, an artificial intelligence technology, to expedite the analysis of mass and sequential data.Entities:
Keywords: deep learning; dementia; proportional hazards models
Year: 2019 PMID: 31471957 PMCID: PMC6743261 DOI: 10.2196/13139
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Study design and sample selection. (A) All-cause dementia; (B) Alzheimer dementia.
Figure 2Conceptual diagram showing longitudinal data collection. DL-R: deep learning model with repeated measurements; HR-B: hazards regression model with baseline data only; HR-R: hazards regression model with repeated measurements; max: maximum; min: minimum.
Figure 3Expression of the ranking of risk factors.
Characteristics of the development datasets from the National Health Insurance Service-Health Screening Cohort (40-79 years of age).
| Variable | All-cause dementia (n=43,648) | Alzheimer dementia (n=20,026) | ||
| Baseline | Repeated measurement | Baseline | Repeated measurement | |
| Duration of follow-up (years), mean (SD) | 9.11 (2.27) | —a | 9.20 (2.21) | — |
| Number of periodic health examinations (n), mean (SD) | 4.72 (2.34) | — | 4.76 (2.33) | — |
| Age (years), mean (SD) | 57.97 (10.61) | — | 58.34 (10.70) | — |
| Sex (female), n (%) | 22,374 (51.26) | — | 10,465 (52.26) | — |
| Body mass index (kg/m2), mean (SD) | 23.98 (3.06) | 23.90 (2.89) | 23.95 (3.09) | 23.86 (2.92) |
| Systolic blood pressure (mm Hg), mean (SD) | 129.45 (18.85) | 128.64 (13.34) | 129.17 (18.74) | 128.45 (13.20) |
| Diastolic blood pressure (mm Hg), mean (SD) | 80.16 (11.81) | 79.04 (7.85) | 79.85 (11.76) | 78.83 (7.75) |
| Fasting plasma glucose (mg/dL), mean (SD) | 100.56 (38.08) | 102.17 (26.46) | 100.54 (38.82) | 102.03 (25.96) |
| Total cholesterol (mg/dL), mean (SD) | 201.79 (39.14) | 199.82 (30.70) | 201.64 (38.67) | 199.64 (30.75) |
| Smoking, n (%) | 9192 (21.06) | 8302 (19.02) | 4088 (20.41) | 3667 (18.31) |
| No exercise, n (%) | 16,492 (37.78) | 24,530 (56.20) | 7522 (37.56) | 11,233 (56.09) |
| Cardiovascular disease, n (%) | 5061 (11.60) | 23,463 (53.76) | 2273 (11.35) | 10,516 (52.51) |
| Diabetes, n (%) | 2708 (6.20) | 7092 (16.25) | 1271 (6.35) | 3317 (16.56) |
| Hypertension, n (%) | 5447 (12.48) | 17,517 (40.13) | 2442 (12.19) | 7919 (39.54) |
| Psychiatric disorder, n (%) | 2308 (5.29) | 17,088 (39.15) | 1064 (5.31) | 7941 (39.65) |
| Neurological disorder, n (%) | 5920 (13.56) | 28,105 (64.39) | 2720 (13.58) | 12,854 (64.19) |
aNot applicable.
Characteristics of the validation datasets from the National Health Insurance Service-Health Screening Cohort (40-79 years of age).
| Variable | All-cause dementia (n=95,969) | Alzheimer dementia (n=93,009) | ||
| Baseline | Repeated measurement | Baseline | Repeated measurement | |
| Duration of follow-up (years), mean (SD) | 10.39 (1.44) | —a | 10.48 (1.31) | — |
| Number of periodic health examinations, mean (SD) | 5.66 (2.56) | — | 5.71 (2.56) | — |
| Age (years), mean (SD) | 52.53 (9.35) | — | 52.22 (9.17) | — |
| Sex (female), n (%) | 43,786 (45.63) | — | 42,178 (45.35) | — |
| Body mass index (kg/m2), mean (SD) | 24.04 (2.96) | 24.02 (2.80) | 24.04 (2.96) | 24.02 (2.80) |
| Systolic blood pressure (mm Hg), mean (SD) | 126.80 (18.06) | 126.34 (12.45) | 126.68 (18.00) | 126.19 (12.38) |
| Diastolic blood pressure (mm Hg), mean (SD) | 79.51 (11.67) | 78.49 (7.60) | 79.50 (11.66) | 78.44 (7.58) |
| Fasting plasma glucose (mg/dL), mean (SD) | 97.76 (33.23) | 100.02 (22.11) | 97.65 (33.06) | 99.96 (22.07) |
| Total cholesterol (mg/dL), mean (SD) | 200.43 (38.45) | 199.26 (29.48) | 200.35 (38.34) | 199.25 (29.32) |
| Smoking, n (%) | 23,104 (24.07) | 20,438 (21.30) | 22,593 (24.29) | 19,966 (21.47) |
| No exercise, n (%) | 41,179 (42.91) | 62,805 (65.44) | 40,125 (43.14) | 61,415 (66.03) |
| Cardiovascular disease, n (%) | 6,629 (6.91) | 39,848 (41.52) | 6,188 (6.65) | 38,004 (40.86) |
| Diabetes, n (%) | 3744 (3.90) | 13,116 (13.67) | 3472 (3.73) | 12,537 (13.48) |
| Hypertension, n (%) | 7609 (7.93) | 34,033 (35.46) | 7120 (7.66) | 32,720 (35.18) |
| Psychiatric disorder, n (%) | 3209 (3.34) | 28,723 (29.93) | 2992 (3.22) | 27,321 (29.37) |
| Neurological disorder, n (%) | 8755 (9.12) | 52,773 (54.99) | 8283 (8.91) | 50,572 (54.37) |
| Event rate, n (%) | 5456 (5.69) | 5456 (5.69) | 2503 (2.69) | 2503 (2.69) |
aNot applicable.
Figure 4Summary of hazard ratios and 95% confidence intervals in the hazards regression models (40-79 years of age). BMI: body mass index; DBP: diastolic blood pressure; FG: fasting plasma glucose; SBP: systolic blood pressure; TC: total cholesterol.
Comparison of the models’ performance to predict all-cause dementia and Alzheimer dementia in individuals aged 40 to 79 years.
| Performance variable | All-cause dementiaa,b | Alzheimer dementiaa,b | ||||
| HR-Bc | HR-Rd | DL-Re | HR-B | HR-R | DL-R | |
| Discrimination (performance) | 0.84 (0.83-0.85) | 0.87 (0.86-0.88) | 0.90 (0.90-0.90) | 0.87 (0.86-0.88) | 0.90 (0.88-0.91) | 0.91 (0.91-0.91) |
| Sensitivity (%) | 80.41 (79.35-81.46) | 80.17 (79.11-81.23) | 83.50 (82.52-84.49) | 82.90 (81.43-84.38) | 80.54 (78.99-82.09) | 87.62 (86.32-88.91) |
| Specificity (%) | 73.23 (72.94-73.52) | 77.88 (77.61-78.15) | 79.88 (79.61-80.14) | 75.86 (75.58-76.13) | 81.25 (80.99-81.5) | 78.66 (78.40-78.93) |
| Accuracy (%) | 73.64 (73.36-73.92) | 78.01 (77.75-78.27) | 80.08 (79.83-80.33) | 76.04 (75.77-76.32) | 81.23 (80.98-81.48) | 78.91 (78.64-79.17) |
| Positive predictive value (%) | 15.33 (14.91-15.75) | 17.93 (17.45-18.41) | 20.01 (19.49-20.53) | 8.67 (8.32-9.03) | 10.62 (10.18-11.06) | 10.20 (9.79-10.60) |
| Negative predictive value (%) | 98.41 (98.32-98.51) | 98.49 (98.40-98.58) | 98.77 (98.69-98.85) | 99.38 (99.32-99.44) | 99.34 (99.28-99.40) | 99.57 (99.52-99.61) |
aValues in parentheses indicate 95% CIs.
bDiscrimination performance of the HR-B model and the HR-R model is based on C-statistics and that of the DL-R model is based on the area under the receiver operating characteristic curve.
cHR-B: hazard regression model with baseline data.
dHR-R: hazard regression model with repeated measurements.
eDL-R: deep learning model with repeated measurements.
Comparison of the hazard regression model with repeated measurements and the deep learning model with repeated measurements using validation datasets from the National Health Insurance Service-Health Screening Cohort (40-79 years of age).
| Performance index | All-cause dementia | Alzheimer dementia | |
| DL-Ra,b versus HR-Rc | DL-Ra versus HR-R | ||
|
|
|
| |
|
| Difference between AUCsd | 0.034 (0.029-0.039)e | 0.024 (0.018-0.031)e |
|
| Absolute IDIf | 0.334 | 0.423 |
|
| Relative IDI | 3.200 | 5.351 |
|
|
|
| |
|
| Patients move to higher, n (%) | 4163 (76.30) | 2163 (86.42) |
|
| Patients move to lower, n (%) | 0 (0.00) | 0 (0.00) |
|
| Controls move to higher, n (%) | 17,664 (19.52) | 19,231 (21.25) |
|
| Controls move to lower, n (%) | 0 (0.00) | 0 (0.00) |
|
| NRIg (%) | 56.79h | 65.17h |
aNRIs were calculated to determine the improvement in the performance of each model to identify individuals whose risk of dementia was more than 50%.
bDL-R: deep learning model with repeated measurements.
cHR-R: hazard regression model with repeated measurements.
dAUC: area under the receiver operating characteristic curve.
eDifference between AUCs was significant with P<.001.
fIDI: integrated discrimination improvement.
gNRI: net reclassification improvement.
hNRI was significant with P<.001.
Figure 5Calibration plots for each model (40-79 years of age). DL-R: deep learning model with repeated measurements; HR-B: hazards regression model with baseline data only; HR-R: hazards regression model with repeated measurements.