| Literature DB >> 34095821 |
Damiano Archetti1, Alexandra L Young2,3, Neil P Oxtoby3, Daniel Ferreira4,5, Gustav Mårtensson4, Eric Westman4, Daniel C Alexander3, Giovanni B Frisoni6,7, Alberto Redolfi1.
Abstract
Alzheimer's disease (AD) is a neurodegenerative disorder which spans several years from preclinical manifestations to dementia. In recent years, interest in the application of machine learning (ML) algorithms to personalized medicine has grown considerably, and a major challenge that such models face is the transferability from the research settings to clinical practice. The objective of this work was to demonstrate the transferability of the Subtype and Stage Inference (SuStaIn) model from well-characterized research data set, employed as training set, to independent less-structured and heterogeneous test sets representative of the clinical setting. The training set was composed of MRI data of 1043 subjects from the Alzheimer's disease Neuroimaging Initiative (ADNI), and the test set was composed of data from 767 subjects from OASIS, Pharma-Cog, and ViTA clinical datasets. Both sets included subjects covering the entire spectrum of AD, and for both sets volumes of relevant brain regions were derived from T1-3D MRI scans processed with Freesurfer v5.3 cross-sectional stream. In order to assess the predictive value of the model, subpopulations of subjects with stable mild cognitive impairment (MCI) and MCIs that progressed to AD dementia (pMCI) were identified in both sets. SuStaIn identified three disease subtypes, of which the most prevalent corresponded to the typical atrophy pattern of AD. The other SuStaIn subtypes exhibited similarities with the previously defined hippocampal sparing and limbic predominant atrophy patterns of AD. Subject subtyping proved to be consistent in time for all cohorts and the staging provided by the model was correlated with cognitive performance. Classification of subjects on the basis of a combination of SuStaIn subtype and stage, mini mental state examination and amyloid-β1-42 cerebrospinal fluid concentration was proven to predict conversion from MCI to AD dementia on par with other novel statistical algorithms, with ROC curves that were not statistically different for the training and test sets and with area under curve respectively equal to 0.77 and 0.76. This study proves the transferability of a SuStaIn model for AD from research data to less-structured clinical cohorts, and indicates transferability to the clinical setting.Entities:
Keywords: SuStain model; alzheiemer’s disease; inter-cohort validation; patient staging; patient subtyping
Year: 2021 PMID: 34095821 PMCID: PMC8173213 DOI: 10.3389/fdata.2021.661110
Source DB: PubMed Journal: Front Big Data ISSN: 2624-909X
Characteristics of the data sets selected.
| Data Set | Full name | Description | Categories | |
|---|---|---|---|---|
| Training Set | ADNI-1 | Alzheimer’s Disease Neuroimaging Initiative – 1 | The Alzheimer’s Disease Neuroimaging Initiative | CN MCI |
| AD | ||||
| SMC | ||||
| ADNI-GO | Alzheimer’s Disease Neuroimaging Initiative – Grand Opportunities | MCI | ||
| SMC | ||||
| ADNI-2 | Alzheimer’s Disease Neuroimaging Initiative – 2 | CN | ||
| MCI | ||||
| AD | ||||
| SMC | ||||
| Test Set | OASIS | Open Access Series of Imaging Studies | OASIS | CN |
| MCI | ||||
| AD | ||||
| PharmaCog (E-ADNI) | Prediction of cognitive properties of new drug candidates for neurodegenerative diseases in early clinical development | PharmaCog is an industry-academic (Innovative Medicines Initiative – IMI) European project aimed at identifying biomarkers sensitive to symptomatic and disease modifying effects of drugs for Alzheimer’s disease | MCI | |
| AD | ||||
| ViTA | Vienna Transdanube Aging | ViTA is a population-based cohort-study of all 75-years old inhabitants of a geographically defined area of Vienna | CN | |
| MCI | ||||
| AD |
AD, Alzheimer’s disease; CN, cognitively normal; MCI, mild cognitive impairment; SMC; subjective memory complaints.
Demographic, clinical, genetic and biological characteristics of the training and test sets.
| N | Age (years) | Sex (M/F) | Education (years) | MMSE (raw score) | Aβ1-42 (positive/negative) |
| ||
|---|---|---|---|---|---|---|---|---|
| Training set | CN | 335 | 73.5 ± 5.9 | 46%/54% | 16.3 ± 2.6 | 29.1 ± 1.2 | 40%/60% | 27%/73% |
| MCI | 537 | 72.0 ± 7.2 | 59%/41% | 16.0 ± 2.8 | 27.7 ± 1.8 | 66%/34% | 51%/49% | |
| AD | 171 | 73.4 ± 8.2 | 54%/46% | 15.5 ± 2.7 | 23.4 ± 2.0 | 95%/5% | 73%/27% | |
| Total | 1043 | 72.7 ± 7.0 | 54%/46% | 16.04 ± 2.7 | 27.4 ± 2.5 | 62%/38% | 46%/54% | |
| sMCI | 271 | 72.3 ± 7.1 | 58%/42% | 16.1 ± 2.8 | 28.0 ± 1.7 | 56%/44% | 42%/58% | |
| pMCI | 205 | 73.1 ± 6.8 | 59%/41% | 15.8 ± 2.8 | 27.2 ± 1.8 | 87%/13% | 64%/36% | |
| Test Set | CN | 440 | 54 ± 25* | 37%/63%* | 8.4 ± 5.7* | 29.0 ± 1.2 | NA | 2%/7% |
| MCI | 283 | 72.3 ± 7.6 | 46%/54* | 9.2 ± 5.1* | 26.3 ± 2.6* | 34%/17% | 19%/32%* | |
| AD | 44 | 77.3 ± 7.4* | 34%/66%* | 5.8 ± 5.3* | 21.7 ± 3.8* | NA | 0%/5% | |
| Total | 767 | 62 ± 21* | 40%/60%* | 8.6 ± 5.4* | 27.2 ± 3.0 | 12%/6% | 8%/16%* | |
| sMCI | 152 | 71.2 ± 7.5 | 47%/53% | 11.5 ± 4.2* | 26.7 ± 2.2* | 46%/25% | 25%/43% | |
| pMCI | 39 | 69.8 ± 6.4* | 49%/51% | 11.7 ± 3.9* | 25.7 ± 2.4* | 44%/5% | 33%/26% |
Values from CN, MCI and AD contribute to the totals, MCI subpopulations of pMCIs and sMCIs are reported as well. Values marked with * on the test set are significantly different (p-value of ANOVA for continuous variables and chi-square for discrete variables <0.05) from the corresponding values from training set. Abbreviations: M, male; F, female; N, number.
FIGURE 1SuStaIn model built on the basis of volumetric biomarkers of the training set.(A) Z-score progression patterns for each subtype. Color shades indicate the probability of a Z-score to increment, “N” indicates the number of subjects from the training set assigned to each subtype (B) Representations of early stages for each subtype.
FIGURE 2AVRA vs. SuStaIn subtypes of AD. Pie graphs represent the percentage of AVRA subtypes subjects for each SuStaIn subtype. Regional atrophy in AVRA was measured with the MTA, PA and GCA-F scales based on T1-3D weighted images; below, visual examples of the SuStaIn atrophy subtypes are shown.
FIGURE 3Hippocampal volume of subjects from Subtypes 1 and 2 labeled with minimal atrophy according to AVRA scores. Hippocampal volumes were averaged between right and left hemisphere for simpler representation.
Number and percentage of subjects from each diagnostic category assigned to each subtype.
| Subtype 1 | Subtype 2 | Subtype 3 | |||||
|---|---|---|---|---|---|---|---|
| N | Average Stage | N | Average Stage | N | Average Stage | ||
| Training Set | CN | 96 (54%) | 3 ± 3 | 74 (41%) | 3 ± 3 | 9 (5%) | 4 ± 1 |
| MCI | 243 (62%) | 5 ± 4 | 128 (33%) | 5 ± 5 | 22 (5%) | 7 ± 4 | |
| AD | 126 (79%) | 8 ± 5 | 26 (16%) | 9 ± 6 | 7 (5%) | 11 ± 5 | |
| sMCI | 111 (59%) | 4 ± 4 | 67 (36%) | 4 ± 4 | 10 (5%) | 8 ± 5 | |
| pMCI | 116 (69%) | 6 ± 5 | 44 (26%) | 7 ± 6 | 8 (5%) | 7 ± 4 | |
| Test Set | CN | 303 (86%) | 5 ± 4 | 37 (11%) | 5 ± 4 | 9 (3%) | 5 ± 2 |
| MCI | 185 (78%) | 7 ± 6 | 49 (21%) | 6 ± 4 | 3 (1%) | 9 ± 2 | |
| AD | 41 (95%) | 9 ± 7 | 1 (2.5%) | 12 | 1 (2.5%) | 9 | |
| sMCI | 83 (68%) | 7 ± 6 | 35 (29%) | 5 ± 4 | 4 (3%) | 8 ± 3 | |
| pMCI | 32 (84%) | 9 ± 6 | 6 (16%) | 11 ± 4 | 0 (0%) | NA | |
Descriptive statistics of the demographic, clinical, biological and genetic variables of subjects for each subtype
| Age (years) | Sex (M/F) | Education (years) | MMSE (raw score) | Aβ1-42 (positive/negative) |
| ||
|---|---|---|---|---|---|---|---|
| Training Set | Subtype 1 | 72.5 ± 7.2a | 48%/52%a | 15.9 ± 2.7a | 26.6 ± 2.6a | 72%/28%a | 48%/52% |
| Subtype 2 | 73.8 ± 6.7a | 84%/16%a,b | 16.4 ± 2.7a | 27.9 ± 2.0a,b | 53%/46%a,b | 44%/56% | |
| Subtype 3 | 74.8 ± 6.2 | 61%/39%b | 15.9 ± 3.0 | 26.5 ± 2.6b | 79%/21b | 42%/58% | |
| Test Set | Subtype 1 | 60 ± 24c | 41%/59%a | 8.5 ± 5.4a | 26.7 ± 3.3 | 3%/10% | 8%/14% |
| Subtype 2 | 63 ± 17b | 64%/36%a | 10.4 ± 5.6a | 27.4 ± 2.2b | 31%/15% | 24%/25% | |
| Subtype 3 | 74.4 ± 5.7c,b | 46%/54% | 7.9 ± 6.1 | 25.7 ± 4.7b | 15%/0% | 0%/15% |
Values marked with aindicate significant differences (p-value < 0.05) between Subtype 1 and Subtype 2 values in the same set; values marked with cindicate significant differences (p-value < 0.05) between Subtype 1 and Subtype 3 values in the same set; values marked with bindicate significant differences (p-value < 0.05) between Subtype 2 and Subtype 3 values in the same set.
FIGURE 4Longitudinal subtype consistency for training set subjects (left) and test set subjects (right) over a 12-months follow-up period.
FIGURE 5Plot of Cognitive performance measured by Mini Mental State Examination (MMSE) vs. the estimated disease stage subjects from the training (left) and test (right) sets for each subgroup. Coefficients of determination (R2) of the linear regression of MMSE score vs. disease stage are reported. The x-axes are only reported up to stage 25 of 39 as no subjects were staged beyond.