| Literature DB >> 35563811 |
Honghuang Lin1,2, Jayandra J Himali1,3,4,5, Claudia L Satizabal1,5, Alexa S Beiser1,3,4, Daniel Levy1,6, Emelia J Benjamin1,3,4, Mitzi M Gonzales1,5, Saptaparni Ghosh1,4, Ramachandran S Vasan1,4, Sudha Seshadri1,4,5, Emer R McGrath1,7.
Abstract
Blood biomarkers for dementia have the potential to identify preclinical disease and improve participant selection for clinical trials. Machine learning is an efficient analytical strategy to simultaneously identify multiple candidate biomarkers for dementia. We aimed to identify important candidate blood biomarkers for dementia using three machine learning models. We included 1642 (mean 69 ± 6 yr, 53% women) dementia-free Framingham Offspring Cohort participants attending examination, 7 who had available blood biomarker data. We developed three machine learning models, support vector machine (SVM), eXtreme gradient boosting of decision trees (XGB), and artificial neural network (ANN), to identify candidate biomarkers for incident dementia. Over a mean 12 ± 5 yr follow-up, 243 (14.8%) participants developed dementia. In multivariable models including all 38 available biomarkers, the XGB model demonstrated the strongest predictive accuracy for incident dementia (AUC 0.74 ± 0.01), followed by ANN (AUC 0.72 ± 0.01), and SVM (AUC 0.69 ± 0.01). Stepwise feature elimination by random sampling identified a subset of the nine most highly informative biomarkers. Machine learning models confined to these nine biomarkers showed improved model predictive accuracy for dementia (XGB, AUC 0.76 ± 0.01; ANN, AUC 0.75 ± 0.004; SVM, AUC 0.73 ± 0.01). A parsimonious panel of nine candidate biomarkers were identified which showed moderately good predictive accuracy for incident dementia, although our results require external validation.Entities:
Keywords: biomarkers; blood biomarkers; dementia; machine learning; risk prediction
Mesh:
Substances:
Year: 2022 PMID: 35563811 PMCID: PMC9100323 DOI: 10.3390/cells11091506
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 7.666
List of blood-based biomarkers with assay performance characteristics.
| Variable | Min | 25% Quantile | Median | 75% Quantile | Max | Missing Rate | Inter-Assay CV | Intra-Assay CV | LLOQ |
|---|---|---|---|---|---|---|---|---|---|
| Aß42/40 * | 0.1 | 0.2 | 0.3 | 0.3 | 1.2 | 1 | - | - | - |
| Adiponectin (ug/mL) | 0.9 | 5.9 | 9 | 14.1 | 59.9 | 16.8 | 9.6 | 6.2 | 0.08 |
| ApoA1 (pg/mL) | 8.2 × 107 | 7.8 × 108 | 9.1 × 108 | 1.0 × 109 | 3.3 × 109 | 0.7 | 7.3 | 11.8 | 3.4 × 105 |
| ApoB (pg/mL) | 2.4 × 108 | 6.1 × 108 | 7.1 × 108 | 8.4 × 108 | 3.6 × 109 | 0.7 | 13.4 | 6.6 | 2.6 × 106 |
| BDNF (pg/mL) | 1713.7 | 1.7 × 104 | 2.3 × 104 | 2.9 × 104 | 5.6 × 104 | 8.2 | 7.6 | 4.8 | 312 |
| BNP (pg/mL) | 10.1 | 132 | 275 | 548.8 | 8530 | 1 | 7.2 | 10.3 | 9.7 |
| CD14 (pg/mL) | 7.66 × 106 | 1.44 × 107 | 1.65 × 107 | 1.92 × 107 | 4.24 × 107 | 0.3 | 14.5 | 3.6 | 5.8 × 104 |
| CD40L (ng/mL) | 0.1 | 0.5 | 1.1 | 3.3 | 29.5 | 0.2 | 14.1 | 4.9 | 0.005 |
| Clusterin (pg/mL) | 1.54 × 106 | 4.44 × 107 | 5.07 × 107 | 5.88 × 107 | 1.23 × 108 | 0.4 | 12.6 | 9.1 | 1.5 × 104 |
| CRP mg/L | 0.2 | 1.2 | 2.6 | 5.5 | 99.8 | 0.2 | 5.3 | 3.2 | 0.2 |
| Cystatin C * (mg/L) | 0.6 | 0.9 | 1 | 1.1 | 7 | 1.6 | 3.3 | 2.4 | 0.3 |
| FGF-23 (pg/mL) | 19 | 56 | 69 | 89 | 434 | 13.6 | 13.4 | 5.5 | 18.7 |
| Fibrinogen (mg/dL) | 204 | 343 | 387 | 436 | 763 | 0.4 | 4.4 | 1.1 | 90 |
| GDF-15 (pg/mL) | 249 | 602 | 770 | 1030 | 2.1 × 104 | 0.3 | 2.9 | 2.3 | 40 |
| HbA1c (%) | 1.7 | 5.2 | 5.6 | 6.1 | 14.6 | 10.1 | <2.5 | <2.5 | 0 |
| HDL-C * (mg/dL) | 17 | 40 | 50.5 | 63 | 136 | 0.1 | 2.8 | 0.9 | 17 |
| Homocysteine * (umol/L) | 3.3 | 6.9 | 8.4 | 10.3 | 84.3 | 0.1 | 7 | 4.5 | 3.2 |
| ICAM-1 (ng/mL) | 29 | 217 | 247.1 | 290 | 1327.5 | 0.1 | 6 | 3.9 | <0.4 |
| IGF-1 (ng/mL) | 24.3 | 84.8 | 105.1 | 129.6 | 377.3 | 8.9 | 4.5 | 3.4 | 23.5 |
| IGFBP-1 (pg/mL) | 1000 | 5545 | 1.0 × 104 | 2.0 × 104 | 1.7 × 105 | 2.4 | 5.4 | 2.5 | 979 |
| IGFBP-2 * (pg/mL) | 1.79 × 106 | 8.24 × 106 | 1.22 × 107 | 1.77 × 107 | 9.31 × 107 | 0.8 | 8.7 | 6 | 1.6 × 106 |
| IGFBP-3 (pg/mL) | 9.4 × 104 | 1.8 × 105 | 2.2 × 105 | 2.6 × 105 | 6.2 × 105 | 0.5 | 18 | 4.4 | 272 |
| IL-6 (pg/mL) | 0.6 | 2.1 | 3.2 | 4.8 | 104.4 | 0.4 | 9 | 3.7 | <0.7 |
| Insulin (pmol/L) | 14.9 | 59.2 | 80.4 | 111.6 | 1296 | 1.2 | 6.1 | 3.9 | 12 |
| Leptin * (pg/mL) | 413 | 2290 | 5425 | 11875 | 129000 | 5.9 | 7 | 3.2 | 397 |
| MCP-1 * (pg/mL) | 34.5 | 267.1 | 328.4 | 398.3 | 2139.8 | 2 | 11.1 | 3.8 | 5.7 |
| MMP-9 (pg/mL) | 1.7 × 104 | 3.8 × 104 | 4.8 × 104 | 6.4 × 104 | 6.1 × 105 | 0.5 | 10 | 3.9 | 243 |
| MPO (ng/mL) | 4.9 | 27.4 | 38.9 | 58.7 | 332.1 | 3.1 | NR | 3.2 | 0.2 |
| OPG (pmol/L) | 0.6 | 5 | 6 | 7.1 | 26.9 | 0.3 | NR | 3.7 | 0.1 |
| PAI-1 * (pg/mL) | 4600 | 1.4 × 104 | 1.9 × 104 | 2.6 × 104 | 1.2 × 105 | 0.5 | 10.8 | 3.6 | 449 |
| P-selectin (ng/mL) | 2.5 | 29.4 | 37.3 | 46.8 | 194.9 | 0.2 | NR | 3.2 | <0.5 |
| Resistin (ng/dL) | 1.2 | 10.3 | 13.2 | 17.3 | 110 | 16.2 | 11 | 4.6 | 0.2 |
| TC (mg/dL) | 83 | 174 | 197 | 219 | 357 | 0 | 1.5 | 0.7 | 20 |
| TNF-a * (pg/dL) | 0.3 | 1 | 1.3 | 1.7 | 20.9 | 22.7 | 11.3 | 7.6 | 0.1 |
| TNFR-2 (pg/mL) | 681.6 | 1814.8 | 2170.1 | 2665.2 | 8383.4 | 2.4 | NR | 2.3 | 0.2 |
| VEGF (pg/mL) | 15.3 | 162.9 | 288.6 | 459.7 | 1728.4 | 8.4 | 14.7 | 4.3 | 9.2 |
| Vitamin B12 (pg/mL) | 54.7 | 323.1 | 411.2 | 522.9 | 2931.2 | 0.1 | 10 | 8.5 | 34.6 |
| Vitamin D (ng/mL) | 3.1 | 14.8 | 19.4 | 24.2 | 58.5 | 45.8 | 8.5 | NR | 2.2 |
| T-tau (pg/mL) | 0.8 | 3.3 | 4.1 | 5 | 17 | 30.6 | 7.5 | 4.1 | 0.01 |
* The nine most informative biomarkers are highlighted in bold. Abbreviations: LLOQ, lower limit of quantification; CV, coefficient of variation; NR = not reported (for some biomarkers, no inter-assay CV is reported as assays were run at the same time/using the same plate); Aß42/40, ß-amyloid 42/ß-amyloid 40; Apo A1, Apolipoprotein A-1; ApoB, Apolipoprotein B; BDNF, brain-derived neurotrophic factor; BNP, brain natriuretic peptide; CD14, monocyte differentiation antigen; CD40L, cluster of differentiation 40 ligand; CRP, C-reactive protein; FGF23, fibroblast growth factor 23; GDF-15, growth differentiation factor 15; HbA1c, glycosylated hemoglobin; HDL-C, high-density lipoprotein cholesterol; ICAM-1, intercellular cell-adhesion molecule-1; IGF-1, insulin-like growth factor 1, pg/mL; IGFBP, insulin-like growth factor-binding protein; IL-6, interleukin-6; MCP-1, monocyte chemotactic protein-1; MMP-9, matrix metallopeptidase 9; MPO, myeloperoxidase; OPG, osteoprotegerin; PAI-1, plasminogen activator inhibitor 1; TC, total cholesterol; TNF-α; tumor necrosis factor-α; TNFR-2, tumor necrosis factor receptor-2;VEGF, vascular endothelial growth factor; T-tau, total tau.
Baseline descriptives.
| Dementia | No Dementia | ||
|---|---|---|---|
| Women | 144 (59.3) | 721 (51.5) | 0.03 |
| Age, years | 72 ± 6 | 68 ± 6 | <0.001 |
| MMSE (median, IQR) | 28 (27–29) | 29 (28–30) | <0.001 |
| Current smoker | 22 (9.1) | 118 (8.4) | 0.75 |
| Body mass index (BMI), kg/m2 | 27.6 ± 5.0 | 28.1 ± 5.1 | 0.14 |
| Total cholesterol, mg/dL | 195 ± 38 | 199 ± 36 | 0.11 |
| HDL cholesterol, mg/dL | 52 ± 17 | 53 ± 17 | 0.25 |
| Prevalent CVD | 62 (25.5) | 250 (17.9) | 0.005 |
| Prevalent stroke | 17 (7.0) | 57 (4.1) | 0.06 |
| Hypertension treatment | 125 (51.4) | 599 (42.8) | 0.01 |
| Diabetes | 14 (5.8) | 151 (10.8) | 0.01 |
| Systolic blood pressure, mmHg | 136± 20 | 132 ± 20 | 0.001 |
Baseline characteristics were measured at examination cycle 7 and are presented separately for individuals who subsequently developed dementia during follow-up, and those who remained dementia-free. Values are reported as mean ± standard deviation for continuous variables and n (%) for categorical variables.
Figure 1(a) Prediction of dementia using 38 biomarkers. (b) Prediction of dementia using the 9 most informative biomarkers.