| Literature DB >> 34913981 |
Charlotte James1,2, Janice M Ranson1,2, Richard Everson2,3,4, David J Llewellyn1,2,4.
Abstract
Importance: Machine learning algorithms could be used as the basis for clinical decision-making aids to enhance clinical practice. Objective: To assess the ability of machine learning algorithms to predict dementia incidence within 2 years compared with existing models and determine the optimal analytic approach and number of variables required. Design, Setting, and Participants: This prognostic study used data from a prospective cohort of 15 307 participants without dementia at baseline to perform a secondary analysis of factors that could be used to predict dementia incidence. Participants attended National Alzheimer Coordinating Center memory clinics across the United States between 2005 and 2015. Analyses were conducted from March to May 2021. Exposures: 258 variables spanning domains of dementia-related clinical measures and risk factors. Main Outcomes and Measures: The main outcome was incident all-cause dementia diagnosed within 2 years of baseline assessment.Entities:
Mesh:
Year: 2021 PMID: 34913981 PMCID: PMC8678688 DOI: 10.1001/jamanetworkopen.2021.36553
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Sample Characteristics
| Characteristic | Participants, No. (%) | |
|---|---|---|
| No incident dementia (n = 13 379) | Incident dementia (n = 1568) | |
| Age, mean (SD), y | 72 (9.8) | 75 (9.4) |
| Sex | ||
| Men | 5376 (39) | 802 (51) |
| Women | 8363 (61) | 766 (49) |
| Native English speaker | 12 823 (93) | 1471 (94) |
| Education, mean (SD), y | 15.5 (3.2) | 15.3 (3.3) |
| Dependent living | 927 (7) | 625 (40) |
| CDR sum, median (IQR) | 0.0 (0.0-0.5) | 1.5 (1.0-2.5) |
| Total MMSE score, mean (SD) | 28.5 (1.8) | 26.2 (2.7) |
Abbreviations: CDR, Clinical Dementia Rating; MMSE, Mini-Mental State Examination.
Performance Measures for the Prediction of Incident All-Cause Dementia Over 2 Years
| Performance measures | Mean (SD) | |||||
|---|---|---|---|---|---|---|
| Existing models | Machine learning models | |||||
| BDSI | CAIDE | LR | SVM | RF | XGB | |
| Overall accuracy | 0.83 (0.01) | 0.76 (0.01) | 0.92 (0.01) | 0.92 (0.01) | 0.92 (0.01) | 0.92 (0.01) |
| Sensitivity | 0.37 (0.03) | 0.18 (0.02) | 0.47 (0.05) | 0.47 (0.05) | 0.31 (0.05) | 0.45 (0.05) |
| Specificity | 0.88 (0.01) | 0.82 (0.00) | 0.97 (0.01) | 0.97 (0.01) | 0.98 (0.00) | 0.97 (0.01) |
| Positive predictive value | 0.23 (0.02) | 0.10 (0.01) | 0.62 (0.05) | 0.64 (0.05) | 0.68 (0.07) | 0.66 (0.06) |
| Negative predictive value | 0.92 (0.00) | 0.90 (0.01) | 0.94 (0.01) | 0.94 (0.01) | 0.93 (0.01) | 0.94 (0.01) |
| Area under the curve | 0.72 (0.01) | 0.52 (0.02) | 0.92 (0.01) | 0.91 (0.01) | 0.92 (0.01) | 0.92 (0.01) |
Abbreviations: BDSI, Brief Dementia Screening Indicator; CAIDE, Cardiovascular Risk Factors, Aging and Incidence of Dementia; LR, logistic regression; RF, random forest; SVM, support vector machine; XGB, gradient-boosted trees.
Given values are for recommended thresholds.
Values are for a decision threshold of 0.5.
Figure 1. Receiver Operating Characteristic Curves
BDSI indicates Brief Dementia Screening Indicator; CAIDE, Cardiovascular Risk Factors, Aging and Incidence of Dementia; LR, logistic regression; RF, random forest; SVM, support vector machine; and XGB, gradient-boosted trees.
Figure 2. Area Under the Curve (AUC) vs the Number of Variables Used for Training for 4 Machine Learning Models
Values were obtained by bootstrapping the validation set. Lines indicate the median; shaded regions, IQR; LR, logistic regression; RF, random forest; SVM, support vector machine; XGB, gradient-boosted trees.
Diagnostic Stability and Model Predictions Among Patients Who Were Initially Diagnosed With Dementia Within 2 Years of Their Baseline Assessment
| Diagnosis status | Patients, No. (%) | |||||
|---|---|---|---|---|---|---|
| BDSI | CAIDE | LR | SVM | RF | XGB | |
|
| ||||||
| Consistently diagnosed, model predicted to develop dementia | 536 (37.3) | 243 (16.9) | 694 (48.3) | 689 (47.9) | 477 (33.2) | 666 (46.3) |
| Diagnosis reversed, model predicted to stay dementia-free | 91 (70.0) | 97 (74.6) | 92 (70.8) | 93 (71.5) | 109 (83.8) | 98 (75.4) |
|
| ||||||
| Consistently diagnosed, model predicted to stay dementia-free | 902 (62.7) | 1195 (83.1) | 744 (51.7) | 749 (52.1) | 961 (66.8) | 772 (53.7) |
| Diagnosis reversed, model predicted to develop dementia | 39 (30.0) | 33 (25.4) | 38 (29.2) | 37 (28.5) | 21 (16.2) | 32 (24.6) |
Abbreviations: BDSI, Brief Dementia Screening Indicator; CAIDE, Cardiovascular Risk Factors, Aging and Incidence of Dementia; LR, logistic regression; RF, random forest; SVM, support vector machine; XGB, gradient-boosted trees.
Patients considered as having their diagnosis reversed were initially diagnosed with dementia within 2 years of their baseline visit whose diagnosis was subsequently reversed to mild cognitive impairment or cognitively unimpaired within 2 years of further follow-up suggesting probable misdiagnosis.