| Literature DB >> 32614505 |
Karteek Popuri1, Da Ma1, Lei Wang2, Mirza Faisal Beg1.
Abstract
Biomarkers for dementia of Alzheimer's type (DAT) are sought to facilitate accurate prediction of the disease onset, ideally predating the onset of cognitive deterioration. T1-weighted magnetic resonance imaging (MRI) is a commonly used neuroimaging modality for measuring brain structure in vivo, potentially providing information enabling the design of biomarkers for DAT. We propose a novel biomarker using structural MRI volume-based features to compute a similarity score for the individual's structural patterns relative to those observed in the DAT group. We employed ensemble-learning framework that combines structural features in most discriminative ROIs to create an aggregate measure of neurodegeneration in the brain. This classifier is trained on 423 stable normal control (NC) and 330 DAT subjects, where clinical diagnosis is likely to have the highest certainty. Independent validation on 8,834 unseen images from ADNI, AIBL, OASIS, and MIRIAD Alzheimer's disease (AD) databases showed promising potential to predict the development of DAT depending on the time-to-conversion (TTC). Classification performance on stable versus progressive mild cognitive impairment (MCI) groups achieved an AUC of 0.81 for TTC of 6 months and 0.73 for TTC of up to 7 years, achieving state-of-the-art results. The output score, indicating similarity to patterns seen in DAT, provides an intuitive measure of how closely the individual's brain features resemble the DAT group. This score can be used for assessing the presence of AD structural atrophy patterns in normal aging and MCI stages, as well as monitoring the progression of the individual's brain along with the disease course.Entities:
Keywords: Alzheimer's disease; cross-database independent validation; dementia of Alzheimer's type; dementia score; disease progression; ensemble learning; longitudinal diagnostic stratification; magnetic resonance imaging; probabilistic classifier; prognosis prediction
Mesh:
Substances:
Year: 2020 PMID: 32614505 PMCID: PMC7469784 DOI: 10.1002/hbm.25115
Source DB: PubMed Journal: Hum Brain Mapp ISSN: 1065-9471 Impact factor: 5.399
Stratification of ADNI images and associated demographic, clinical, and biomarker details
| Dementia trajectory | Group name | Clinical diagnosis at imaging | Clinical progression | Subjects [M:F] | Images [1.5 T:3 T] | Age | CSF |
|---|---|---|---|---|---|---|---|
| DAT− | sNC:Stable NC | NC | NC → NC | 197:226 | 929:939 | 76.25 (6.22) | 0.38 (0.27) |
| DAT− | uNC:Unstable NC | NC | NC → MCI | 37:23 | 175:59 | 78.12 (4.89) | 0.45 (0.22) |
| DAT− | sMCI:Stable MCI | MCI | NC → MCI or MCI → MCI | 315:220 | 845:1350 | 74.58 (7.73) | 0.51 (0.42) |
| DAT+ | pNC:Progressive NC | NC | NC → MCI → DAT | 11:13 | 108:10 | 77.27 (4.23) | 0.71 (0.40) |
| DAT+ | pMCI:Progressive MCI | MCI | NC → MCI → DAT or MCI → DAT | 188:133 | 754:274 | 75.43 (7.20) | 0.79 (0.50) |
| DAT+ | eDAT:Early DAT | DAT | NC → MCI → DAT or MCI → DAT | 171:122 | 569:181 | 76.69 (6.87) | 0.77 (0.43) |
| DAT+ | sDAT:Stable DAT | DAT | DAT → DAT | 182:148 | 603:372 | 75.70 (7.80) | 0.85 (0.45) |
Note: The stratification was based on two criteria, clinical diagnosis of subjects at the time of MRI image acquisition and their longitudinal clinical progression. Each image is assigned a membership of the form “prefixGroup”, where “Group” is the clinical diagnosis at imaging visit, and “prefix” signals past or future clinical diagnoses. For example, an image is designated as pNC if the subject was assigned a NC diagnosis at that particular imaging visit, but the subject converts to DAT at a future timepoint. The eDAT images are associated with the diagnosis of DAT, but the subject had received NC or MCI status during previous ADNI visits (conversion within ADNI window). Whereas, the sDAT images belong to subjects with a consistent clinical diagnosis of DAT throughout the ADNI study window, hence these individuals have progressed to DAT prior to their ADNI recruitment. DAT: not on the DAT trajectory and will not get a DAT diagnosis in the ADNI window.
The mean (standard deviation) age and CSF measure values within each group are given CSF measures were only available for a subset of images in each of the groups: four sNC (573), uNC (79), sMCI (634), pNC (41), pMCI (315), eDAT (154), sDAT (329).
CSF, cerebrospinal fluid; DAT, dementia of Alzheimer's type; MCI, mild cognitive impairment; NC, normal controls; t‐tau: total tau, Aβ1‐42: beta amyloid 1–42.
DAT+: On DAT trajectory, that is, at some point in time, these subjects will be clinically diagnosed as DAT. DAT−: not on the DAT trajectory and will not get a DAT diagnosis in the ADNI window.
Baseline sNC: N = 423, Age: 73.87 (5.78), MMSE: 29.06 (1.15), CSF: 0.39 (0.28). Follow‐up sNC: N = 1,445, Age: 76.95 (6.17), MMSE: 29.02 (1.25), CSF: 0.37 (0.26).
Baseline sDAT: N = 330, Age: 74.93 (7.83), MMSE: 23.16 (2.06), CSF: 0.84 (0.44). Follow‐up sDAT: N = 645, Age: 76.08 (7.76), MMSE: 20.94 (4.61), CSF: 0.88 (0.45).
Demographics summary of each stratified groups for all the independent validation dataset
| Group name | ||||||||
|---|---|---|---|---|---|---|---|---|
| Dataset | Clinical measure | sNC | uNC | sMCI | pNC | pMCI | eDAT | sDAT |
| AIBL |
Subjects [M:F] Images [1.5 T:3 T] |
140:179 174:447 |
8:7 21:5 |
37:33 24:76 |
4:1 0:9 |
11:10 10:17 | 10:10 2:29 |
30:42 22:80 |
| Age (years) | 73.45 (6.69) | 72.73 (7.48) | 75.97 (7.09) | 73.22 (4.97) | 77.78 (6.57) | 79.45 (6.30) | 73.79 (8.17) | |
| OASIS‐1 |
Subjects [M:F] Images [1.5 T:3 T] |
119:197 336:0 | 31:39 70:0 | 10:20 30:0 | ||||
| Age (years) | 43.80 (23.75) | 76.21 (7.19) | 78.03 (6.91) | |||||
| OASIS‐2 |
Subjects [M:F] Images [1.5 T:3 T] |
20:50 183:0 |
4:9 17:0 |
27:24 104:0 |
7:6 13:0 |
7:6 16:0 |
5:6 26:0 | |
| Age (years) | 76.89 (8.13) | 79.34 (7.35) | 78.07 (6.89) | 72.69 (4.57) | 74.33 (4.16) | 76.31 (8.13) | ||
| MIRIAD |
Subjects [M:F] Images [1.5 T:3 T] |
12:11 243:0 |
19:27 465:0 | |||||
| Age (years) | 69.86 (6.94) | 69.56 (6.86) | ||||||
Note: Not all datasets contain all the stratified subgroups.
FIGURE 1The schematic diagram of the group stratification. Each image is assigned a membership in the form of “prefixGroup”, where “Group” is the clinical diagnosis at imaging visit, and “prefix” signals past or future clinical diagnoses. The beginning of each arrow marks the point when the participant entered the ADNI study. Each box represents the stratified group that is assigned to an image for the current visit of the subject. The green boxes represent the stratified groups that belong to the DAT− trajectory, while the red boxes represent the stratified groups that belong to the DAT+ trajectory. The sNC and sDAT groups are enclosed with red border indicating that their baseline images are used as the training dataset. Noted that some of the DAT− participants may switch over to DAT+ in future follow‐ups
FIGURE 2Visualization of the ROI volumes taken from the ADNI database (total of 7,168 images). Top panel: Raw volumes before covariate harmonization; bottom panel: w‐score of the raw volume with respect to the sNC group, after harmonization. Each column represent one subjects’ image, and each row represents one ROI in the brain
Most discriminative ROIs determined by the ensemble classification model
| ROI Name | Frequency (%) [left | right] | ROI Name | Frequency (%) [left | right] |
|---|---|---|---|
| Accumbens‐area | 100.00 | 100.00 | Temporal pole | 100.00 | 100.00 |
| Amygdala | 100.00 | 100.00 | Lateral occipital | 96.00 | 100.00 |
| Banks sts | 100.00 | 100.00 | Isthmus cingulate | 93.00 | 95.00 |
| Entorhinal | 100.00 | 100.00 | Lateral orbitofrontal | 94.00 | 77.00 |
| Fusiform | 100.00 | 100.00 | Insula | 56.00 | 92.00 |
| Hippocampus | 100.00 | 100.00 | Putamen | 20.00 | 96.00 |
| Inferior–lateral‐ventricle | 100.00 | 100.00 | Pars orbitalis | 25.00 | 87.00 |
| Inferior parietal | 100.00 | 100.00 | Third‐ventricle | 100 |
| Inferior temporal | 100.00 | 100.00 | Thalamus | 31.00 | 66.00 |
| Lateral‐ventricle | 100.00 | 100.00 | Caudal middle frontal | 80.00 | 3.00 |
| Middle temporal | 100.00 | 100.00 | Posterior cingulate | 66.00 | 2.00 |
| Para hippocampal | 100.00 | 100.00 | Medial orbitofrontal | 0.00 | 42.00 |
| Precuneus | 100.00 | 100.00 | Lingual | 31.00 | 2.00 |
| Rostral middle frontal | 100.00 | 100.00 | Pars triangularis | 21.00 | 0.00 |
| Superior frontal | 100.00 | 100.00 | Postcentral | 19.00 | 2.00 |
| Superior parietal | 100.00 | 100.00 | Precentral | 3.00 | 0.00 |
| Superior temporal | 100.00 | 100.00 | Transverse temporal | 0.00 | 1.00 |
| Supramarginal | 100.00 | 100.00 |
Note: The ROIs are listed in descending order of their total (left and right averaged) selection frequency.
FIGURE 3Visualization of the cortical ROIs chosen by the ensemble classification model as being the most discriminative for sNC versus sDAT. The ROIs are colored in decreasing order of their D‐statistic, a measure of separation between the ECDFs of the sNC and the sDAT volume w‐score measures
FIGURE 4MRDATS distribution among the sNC and sDAT images and classification performance obtained in assigning images to either the DAT− or DAT+ trajectory using a binarizing 0.5 MRDATS threshold. The top panel presents the MRDATS on the baseline images used for training the ensemble model. The bottom panel shows ensemble model predictions on the follow‐up images of the sNC and sDAT individuals. The follow‐up images were not part of training the MRDATS computation. The (number of images: mean MRDATS) is shown for each subgroup. Balanced accuracy is the mean of the sensitivity and specificity measures
FIGURE 5The top panel shows the MRDATS distribution among independent validation images/subjects taken from the ADNI database. The bottom panel shows the MRDATS distribution among independent validation databases namely the AIBL, OASIS‐1, OASIS‐2, and MIRIAD databases. The classification performance was obtained by determining dementia trajectories (DAT− or DAT+) for each image using a 0.5 MRDATS threshold. The MRDATS histograms corresponding to the DAT− (sNC, uNC, sMCI) and the DAT+ (pNC, pMCI, eDAT, sDAT) trajectories are stacked together respectively. The (number of images: mean MRDATS) for each group is shown
FIGURE 6Pearson correlation between CSF t‐tau/Aβ1‐42 and MRDATS across different stratified groups in the ADNI database. The CSF t‐tau/Aβ1‐42 measures were only available for a subset of images and their numbers are shown in parentheses. A previously published 64 threshold of 0.52 was used to differentiate the low‐risk (t‐tau/Aβ1‐42 ≤ 0.52) from the high‐risk (t‐tau/Aβ1‐42 > 0.52) group. The solid dots represent the data with CSF t‐tau/Aβ1‐42 measurement equal or above the 0.52 threshold, and the hollow dots represent the CSF t‐tau/Aβ1‐42 measurement below the 0.52 threshold. The statistical significance threshold for correlation coefficient (r) was set at p < .05. First row: Correlation for the DAT− groups. Second and third row: Correlation for the DAT+ groups. Significant correlation between the MRI DAT score and the t‐tau/Aβ1‐42 were observed in the sNC, sMC group along the DAT− trajectory, and in the eDAT and sDAT in the DAT+ trajectory. Note that most of the sNC images with small t‐tau/Aβ1‐42 also show small MRDATS, whereas those from sDAT subgroup with high t‐tau/Aβ1‐42 also show high MRDATS
FIGURE 7Visualization of the w‐score of raw volume from 9,587 images combined across ADNI, AIBL, OASIS, and MIRIAD databases. The red arrows point to the line of 0.5 cutoff for the MRDATS for that subgroup. Each column consists of FreeSurfer‐derived ROI volume‐based w‐scores from one subject. Each row represents one ROI in the brain across all images. Within each stratification subgroup, the images are sorted from left to right according to their MRDATS. On the vertical axis, the ROIs are sorted according to the separation of the sample distribution between the sNC and sDAT groups, calculated as the D statistic score of the K–S test. Note that the patterns for MRDATS greater than 0.5 resemble the demented (DAT+) patterns, and those less than 0.5 resemble the nondemented (DAT−) patterns. The thick red vertical line indicate the separation of different stratification groups, while the thin black vertical line with red arrow on top indicate the 0.5 cutoff point of MRDATS that separate the DAT+/DAT− patterns
FIGURE 8The empirical cumulative distribution function (ECDF) of the w‐score feature from the 91 most discriminative ROI volumes, sorted by the separation of the sample distribution between the sNC and sDAT groups, calculated as the D statistic score of the K–S test. For each panel, the x‐axis is the w‐score in range [−4 to 4] and the y‐axis is the ECDF of the w‐score in the range [0, 1]. The lower right panel shows the MRDATS for each stratification group pooling all databases (ADNI, AIBL, OASIS Cross‐sectional, OASIS Longitudinal, MIRIAD). Note the sDAT volume w‐scores are clustered around lower values (leftward ECDFs) indicative of atrophy and reduced ROI volume relative to the sNC volume w‐scores which show higher values (rightward ECDFs). This trend is reversed for the lateral ventricles which are enlarged in AD (rightward ECDFs). The sDAT MRDATS ECDFs in lower right panel are clustered toward higher values (rightward ECDFs) whereas the sNC MRDATS ECDFs are clustered towards lower values (leftward ECDFs). The separation between the ECDFs indicates the extent of separation between these stratification subgroups for that measure, and the MRDATS with a D‐statistic of 0.8 shows a greater separation of the sNC/sDAT ECDFs as compared to all the raw ROI volume ECDF separations
Summary of classification performance
| ADNI | AIBL | OASIS‐1 | OASIS‐2 | MIRIAD | |
|---|---|---|---|---|---|
| MRDATS | |||||
| sNC | 0.200 (1445) | 0.209 (621) | 0.178 (336) | 0.288 (183) | 0.133 (243) |
| uNC | 0.350 (234) | 0.283 (26) | 0.313 (17) | ||
| sMCI | 0.384 (2195) | 0.478 (100) | 0.617 (70) | 0.581 (104) | |
| pNC | 0.329 (118) | 0.791 (9) | |||
| pMCI | 0.678 (1028) | 0.677 (27) | 0.673 (13) | ||
| eDAT | 0.867 (750) | 0.878 (31) | 0.865 (16) | ||
| sDAT | 0.884 (645) | 0.831 (102) | 0.839 (30) | 0.848 (26) | 0.847 (465) |
| Accuracy | |||||
| sNC | 0.864 (1445) | 0.878 (621) | 0.938 (336) | 0.792 (183) | 0.951 (243) |
| uNC | 0.714 (234) | 0.846 (26) | 0.765 (17) | ||
| sMCI | 0.654 (2195) | 0.580 (100) | 0.343 (70) | 0.413 (104) | |
| pNC | 0.229 (118) | 0.889 (9) | |||
| pMCI | 0.718 (1028) | 0.741 (27) | 0.615 (13) | ||
| eDAT | 0.907 (750) | 0.903 (31) | 1.000 (16) | ||
| sDAT | 0.932 (645) | 0.902 (102) | 0.900 (30) | 0.923 (26) | 0.903 (465) |
Note: Top: Predicted MRDATS of each stratified group across all the independent unseen test images in each dataset; bottom: The classification accuracy with the proposed ensemble‐learning‐based classification (using a 0.5 threshold). The number in the bracket shows the corresponding sample number in each stratified group.
The effect of time to conversion (TTC) on MRDATS score for the converters in the DAT+ trajectory (i.e., pNC and pMCI groups)
| pNC | pMCI | |
|---|---|---|
| MRDATS | ||
| TTC (years) | 0.362 (127) | 0.678 (1,068) |
| 0–1 | 0.537 (1) | 0.770 (264) |
| 1–2 | 0.479 (12) | 0.712 (358) |
| 2–3 | 0.472 (13) | 0.669 (200) |
| 3–4 | 0.440 (19) | 0.593 (111) |
| 4–5 | 0.386 (15) | 0.545 (51) |
| 5–6 | 0.358 (22) | 0.573 (27) |
| 6–7 | 0.285 (13) | 0.395 (22) |
| 7–8 | 0.306 (16) | 0.477 (19) |
| 8–9 | 0.213 (10) | 0.398 (12) |
| 9–10 | 0.123 (6) | 0.293 (4) |
| Accuracy | ||
| TTC (years) | 0.276 (127) | 0.717 (1,068) |
| 0–1 | 1.000 (1) | 0.826 (264) |
| 1–2 | 0.417 (12) | 0.757 (358) |
| 2–3 | 0.385 (13) | 0.730 (200) |
| 3–4 | 0.368 (19) | 0.586 (111) |
| 4–5 | 0.267 (15) | 0.549 (51) |
| 5–6 | 0.273 (22) | 0.556 (27) |
| 6–7 | 0.231 (13) | 0.364 (22) |
| 7–8 | 0.250 (16) | 0.474 (19) |
| 8–9 | 0.000 (10) | 0.417 (12) |
| 9–10 | 0.000 (6) | 0.250 (4) |
Note: The left two columns show the MRDATS, and the right two columns show the corresponding classification accuracy. The number in the brackets shows the number of images in the particular subgroup.
Comparison of sMCI versus pMCI classification performance obtained using MRDATS with the state‐of‐the‐art MRI based classification methods
| Study | Feature type | Images [sMCI:pMCI] | Time to conversion | Evaluation scheme | AUC | Sensitivity | Specificity | Balanced | |
|---|---|---|---|---|---|---|---|---|---|
| Accuracy | Accuracy | ||||||||
| Zhang et al. ( | Volume + PET | 50:38 | 0–6 months | 10‐fold CV | 0.745 | 0.658 | 0.78 | 0.727 | – |
| Zhang et al. ( | Volume + PET | 50:38 | 0–6 months | 10‐fold CV | 0.768 | 0.79 | 0.789 | 0.784 | – |
| MRDATS | Volume | 2,469:77 | 0–6 months | INDEP.VAL | 0.803 | 0.805 | 0.62 | 0.712 | 0.625 |
| Sørensen et al. ( | Volume | 192:41 | 0–1 year | 10‐fold CV | 0.705 | – | – | – | – |
| Sørensen et al. ( | Texture | 192:41 | 0–1 year | 10‐fold CV | 0.74 | – | – | – | – |
| Sørensen et al. ( | Volume + Texture | 192:41 | 0–1 year | 10‐fold CV | 0.739 | – | – | – | – |
| Khan et al. ( | Volume (hippocampal subfields) | 90:357 | 0–1 year | 7‐fold CV | – | 0.81 | 0.48 | 0.55 | – |
| MRDATS | Volume | 2,469:272 | 0–1 year | INDEP.VAL | 0.788 | 0.824 | 0.632 | 0.728 | 0.651 |
| Suk et al. (2017) | Volume | 226:167 | 0–1.5 years | 10‐fold CV | 0.754 | 0.709 | 0.788 | 0.749 | 0.748 |
| Zhu et al. (2015) | Volume | 56:43 | 0–1.5 years | 5‐fold CV | 0.814 | 0.48 | 0.928 | 0.704 | 0.718 |
| MRDATS | Volume | 2,469:482 | 0–1.5 years | INDEP.VAL | 0.769 | 0.788 | 0.62 | 0.704 | 0.647 |
| Sørensen et al. ( | Volume | 140:93 | 0–2 years | 10‐fold CV | 0.672 | – | – | – | – |
| Sørensen et al. ( | Texture | 140:93 | 0–2 years | 10‐fold CV | 0.742 | – | – | – | – |
| Sørensen et al. ( | Volume + Texture | 140:93 | 0–2 years | 10‐fold CV | 0.72 | – | – | – | – |
| Chincarini et al. ( | Intensity + Texture | 103:38 | 0–2 years | INDEP.VAL | 0.74 | 0.76 | 0.68 | 0.72 | – |
| Chincarini et al. ( | Intensity + Texture | 103:38 | 0–2 years | INDEP.VAL | 0.76 | 0.75 | 0.66 | 0.71 | – |
| Cheng et al. ( | Volume | 56:43 | 0–2 years | 10‐fold CV | 0.764 | 0.734 | 0.721 | 0.728 | 0.734 |
| MRDATS | Volume | 2,469:627 | 0–2 years | INDEP.VAL | 0.76 | 0.783 | 0.62 | 0.701 | 0.653 |
| Young et al. ( | Volume | 96/47 | 0–3 years | INDEP.VAL | 0.643 | 0.532 | 0.698 | 0.615 | 0.643 |
| Huang et al. ( | Longitudinal intensity | 61:70 | 0–3 years | 10‐fold CV | 0.812 | 0.865 | 0.782 | 0.824 | 0.794 |
| Liu et al. ( | Texture | 239:38 | 0–3 years | INDEP.VAL | 0.776 | 0.421 | 0.824 | 0.623 | 0.769 |
| Lu et al. ( | Volume | 753:409 | 0–3 years | INDEP.VAL | – | 0.7327 | 0.7618 | 0.7473 | 0.7544 |
| MRDATS | Volume | 2,469:826 | 0–3 years | INDEP.VAL | 0.751 | 0.777 | 0.62 | 0.698 | 0.659 |
| Coupe et al. ( | Hippocampal grade | 309:37 | 0–7 years | LOOCV | 0.73 | 0.649 | 0.735 | 0.692 | – |
| MRDATS | Volume | 2,469:1033 | 0–7 years | INDEP.VAL | 0.729 | 0.733 | 0.62 | 0.676 | 0.653 |
Abbreviations: CV, cross‐validation; INDEP.VAL, independent validation; LOOCV, leave‐one‐out cross‐validation.
Trained with sMCI/pMCI.
Average normalized FDG‐PET intensities.
Longitudinal features from multiple time points.
Trained with NC/AD.
The Hippocampal grade is calculated based on the image similarity between the test image and the template image.