| Literature DB >> 32835819 |
Ann-Marie G de Lange1, Melis Anatürk2, Sana Suri2, Tobias Kaufmann3, James H Cole4, Ludovica Griffanti2, Enikő Zsoldos2, Daria E A Jensen2, Nicola Filippini2, Archana Singh-Manoux5, Mika Kivimäki6, Lars T Westlye7, Klaus P Ebmeier8.
Abstract
Brain age is becoming a widely applied imaging-based biomarker of neural aging and potential proxy for brain integrity and health. We estimated multimodal and modality-specific brain age in the Whitehall II (WHII) MRI cohort using machine learning and imaging-derived measures of gray matter (GM) morphology, white matter microstructure (WM), and resting state functional connectivity (FC). The results showed that the prediction accuracy improved when multiple imaging modalities were included in the model (R2 = 0.30, 95% CI [0.24, 0.36]). The modality-specific GM and WM models showed similar performance (R2 = 0.22 [0.16, 0.27] and R2 = 0.24 [0.18, 0.30], respectively), while the FC model showed the lowest prediction accuracy (R2 = 0.002 [-0.005, 0.008]), indicating that the FC features were less related to chronological age compared to structural measures. Follow-up analyses showed that FC predictions were similarly low in a matched sub-sample from UK Biobank, and although FC predictions were consistently lower than GM predictions, the accuracy improved with increasing sample size and age range. Cardiovascular risk factors, including high blood pressure, alcohol intake, and stroke risk score, were each associated with brain aging in the WHII cohort. Blood pressure showed a stronger association with white matter compared to gray matter, while no differences in the associations of alcohol intake and stroke risk with these modalities were observed. In conclusion, machine-learning based brain age prediction can reduce the dimensionality of neuroimaging data to provide meaningful biomarkers of individual brain aging. However, model performance depends on study-specific characteristics including sample size and age range, which may cause discrepancies in findings across studies.Entities:
Keywords: Brain age prediction; Cardiovascular risk; Machine learning; Multimodal MRI
Mesh:
Year: 2020 PMID: 32835819 PMCID: PMC8121758 DOI: 10.1016/j.neuroimage.2020.117292
Source DB: PubMed Journal: Neuroimage ISSN: 1053-8119 Impact factor: 6.556
Sample demographics. Age range (mean age ± standard deviation = 69.71 ± 5.07), percentage male (M) and female (F) participants, percentage with white (W) and non-white (NW) ethnic background, and percentage with educational qualifications U = university degree, PG = post-graduate / masters / PhD, Pr = Professional qualifications, A = A levels or equivalent, O = O levels or equivalent, N = No qualifications.
| N | Age range | Sex % | Ethnicity % | Educational qualification % |
|---|---|---|---|---|
| 610 | 60.34 - 84.58 | M81 | F19 | W94 | NW6 | U27 | PG22 | Pr12 | A18 | O14 | C5 | N3 |
Fig. 1The correlations (Pearson’s r) between brain age deltas of the multimodal model (MM), the gray matter (GM), white matter (WM), and functional connectivity (FC) models, and the external gray matter model (Ext. GM) based on a separate training sample, indicating the amount of shared variance explained by the models. The delta values were first corrected for age-bias as described in Section 2.3.5, and the corrected deltas were used in the correlation analysis.
Fig. 3The associations (β ± standard error) between standardized measures of brain age delta and blood pressure, alcohol intake, and Framingham stroke risk score for each of the brain-age models. The analyses included age as a covariate. The vertical gray line indicates β = 0. MM = multimodal, GM = gray matter, WM = white matter, FC = functional connectivity, Ext. GM = external gray matter model (gray matter predictions based on the external UK Biobank training set (Section 2.3.5).
Selected thresholds (TH) based on hierarchical clustering on the Spearman rank-order correlation. The models were trained on 70% of the data and applied to 100 bootstrapped test sets generated from the remaining 30%. Number (N) of features and average R2 ± SD are shown before and after the feature reduction; full = all imaging features included, reduced = selected features included based on cluster threshold. MM = multimodal, GM = gray matter, WM = white matter, FC = functional connectivity.
| Model | N features | TH | N features | ||
|---|---|---|---|---|---|
| MM | 5019 | 0.40 ± 0.05 | 9 | 97 | 0.34 ± 0.05 |
| GM | 1118 | 0.32 ± 0.05 | 4 | 156 | 0.27 ± 0.05 |
| WM | 246 | 0.31 ± 0.05 | 5 | 19 | 0.31 ± 0.05 |
| FC | 3655 | -0.03 ± 0.04 | 7 | 25 | -0.03 ± 0.03 |
Age distribution for the high-quality training sub-samples for each modality.
| Model | N | Mean age ± SD | Age range |
|---|---|---|---|
| Multimodal | 389 | 69.71 ± 5.13 | 60.56 - 84.58 |
| Gray matter | 400 | 69.76 ± 4.97 | 60.56 - 84.58 |
| White matter | 396 | 69.78 ± 5.17 | 61.56 - 84.58 |
| F. connectivity | 393 | 69.69 ± 5.12 | 60.56 - 84.58 |
Mean ± standard deviation of each of the clinical variables.
| N | Systolic BP | Diastolic BP | Alcohol intake | Stroke Risk score |
|---|---|---|---|---|
| 582 | 140.43 ± 16.61 | 77.14 ± 10.54 | 14.82 ± 13.54 | 11.04 ± 6.53 |
The correlations (r) between predicted age and chronological age, R2, root mean square error (RMSE), and mean absolute error (MAE) for each of the brain age models. 95% confidence intervals are indicated in square brackets. MM = multimodal, GM = gray matter, WM = white matter, FC = functional connectivity, Ext. GM = external gray matter model, which represents the gray matter model trained on the external UK Biobank training set (Section 2.3.5). RMSE and MAE are reported in years, and represent the values estimated before and after age-bias correction as described in 2.3.5 and shown in SI Fig. 7.
| Model | R2 | RMSE | MAE | RMSE | MAE | |
|---|---|---|---|---|---|---|
| MM | 0.55 [0.49, 0.60] | 0.30 [0.24, 0.36] | 4.25 | 3.37 | 2.32 | 1.85 |
| GM | 0.46 [0.40, 0.52] | 0.22 [0.16, 0.27] | 4.49 | 3.60 | 2.18 | 1.73 |
| WM | 0.49 [0.43, 0.55] | 0.24 [0.18, 0.30] | 4.43 | 3.51 | 2.49 | 1.95 |
| FC | 0.04 [-0.04, 0.12] | 0.002 [-0.005, 0.008] | 5.16 | 4.18 | 1.19 | 0.90 |
| Ext. GM | 0.45 [0.38, 0.51] | 0.20 [0.14, 0.26] | 11.61 | 10.69 | 2.51 | 2.07 |
The R2 values ± standard deviation based on training sets with and without low-quality data applied to 100 bootstrapped test sets generated from 30% of the full sample. 95% confidence intervals are indicated in square brackets. Z represents the difference in r values expressed in standard deviations, accounting for the correlated samples (Eq. (1)). MM = multimodal, GM = gray matter, WM = white matter, FC = functional connectivity.
| Model | % LQD removed | ||||
|---|---|---|---|---|---|
| MM | 0.228 ± 0.060 | 0.250 ± 0.063 | -1.416 | 0.157 | 8.90 |
| GM | 0.224 ± 0.065 | 0.271 ± 0.065 | -2.586 | 0.009 | 6.32 |
| WM | 0.203 ± 0.066 | 0.184 ± 0.062 | 1.097 | 0.273 | 7.26 |
| FC | -0.011 ± 0.040 | -0.019 ± 0.039 | 0.299 | 0.765 | 7.96 |
The correlations (r) between predicted age and chronological age, R2, root mean square error (RMSE), and mean absolute error (MAE) for the brain age models based on WHII and UK Biobank (UKB) sub-samples matched on sample size (N) and age range. 95% confidence intervals are indicated in square brackets. RMSE and MAE are reported in years. SD = standard deviation.
| Model | N | Mean age ± SD | R2 | RMSE | MAE | ||
|---|---|---|---|---|---|---|---|
| WHII | 567 | 68.88 ± 4.22 | 0.06 [-0.02, 0.14] | 0.004 [-0.006, 0.014] | 4.27 | 3.48 | |
| UKB | 567 | 67.07 ± 4.61 | 0.04 [-0.04, 0.12] | 0.002 [-0.005, 0.008] | 4.69 | 3.85 |
Fig. 2Prediction accuracy (y-axis) for functional connectivity (left plot) and gray matter (right plot) for two different sample sizes (N) across five different age ranges. The sample of 610 (red) resembles the size of the WHII dataset, while the sample of 1782 represents the maximum number of participants available with the smallest age range, keeping N stable across age ranges.
Relationships between (standardized) clinical variables and brain age delta for each modality. p-values are reported before and after FDR correction. Corrected p-values below 0.05 are marked with an asterisk. MM = multimodal, GM = gray matter, WM = white matter, FC = functional connectivity, Ext. GM = external gray matter model gray matter predictions based on the external UK Biobank training set (Section 2.3.5).
| Model | SE | ||||
|---|---|---|---|---|---|
| MM | 0.047 | 0.023 | 2.053 | 0.041 | 0.077 |
| GM | -0.003 | 0.021 | -0.135 | 0.892 | 0.940 |
| WM | 0.074 | 0.024 | 3.121 | 0.002 | 0.013* |
| FC | 0.022 | 0.010 | 2.282 | 0.023 | 0.067 |
| Ext.GM | 0.011 | 0.023 | 0.478 | 0.633 | 0.744 |
| MM | 0.026 | 0.023 | 1.137 | 0.256 | 0.366 |
| GM | -0.003 | 0.020 | -0.166 | 0.868 | 0.940 |
| WM | 0.053 | 0.024 | 2.271 | 0.024 | 0.067 |
| FC | 0.010 | 0.010 | 1.058 | 0.291 | 0.388 |
| Ext. GM | 0.017 | 0.023 | 0.729 | 0.466 | 0.583 |
| MM | 0.072 | 0.023 | 3.163 | 0.002 | 0.013* |
| GM | 0.052 | 0.020 | 2.582 | 0.010 | 0.040* |
| WM | 0.041 | 0.024 | 1.754 | 0.080 | 0.133 |
| FC | 0.010 | -0.025 | 0.980 | 0.980 | |
| Ext. GM | 0.064 | 0.023 | 2.766 | 0.006 | 0.029* |
| MM | 0.057 | 0.027 | 2.126 | 0.034 | 0.075 |
| GM | 0.049 | 0.024 | 2.036 | 0.042 | 0.077 |
| WM | 0.102 | 0.028 | 3.688 | 0.005* | |
| FC | 0.016 | 0.011 | 1.417 | 0.157 | 0.242 |
| Ext. GM | 0.060 | 0.027 | 2.181 | 0.030 | 0.074 |
Differences between the gray matter (GM) and white matter (WM) associations with clinical variables. P-values are reported before and after FDR correction. BP = blood pressure.
| GM | WM | ||||
|---|---|---|---|---|---|
| BP systolic | -0.003 ± 0.021 | 0.074 ± 0.024 | 2.988 | 0.003 | 0.011 |
| BP diastolic | -0.003 ± 0.020 | 0.053 ± 0.020 | 2.214 | 0.027 | 0.054 |
| Alcohol intake | 0.052 ± 0.020 | 0.052 ± 0.020 | -0.435 | 0.664 | 0.664 |
| Stroke risk score | 0.049 ± 0.024 | 0.102 ± 0.028 | 1.778 | 0.076 | 0.100 |