| Literature DB >> 34344910 |
Walter H L Pinaya1,2,3, Cristina Scarpazza4,5, Rafael Garcia-Dias4, Sandra Vieira4, Lea Baecker4, Pedro F da Costa6,7, Alberto Redolfi8, Giovanni B Frisoni9,10, Michela Pievani9, Vince D Calhoun11, João R Sato12, Andrea Mechelli4.
Abstract
Normative modelling is an emerging method for quantifying how individuals deviate from the healthy populational pattern. Several machine learning models have been implemented to develop normative models to investigate brain disorders, including regression, support vector machines and Gaussian process models. With the advance of deep learning technology, the use of deep neural networks has also been proposed. In this study, we assessed normative models based on deep autoencoders using structural neuroimaging data from patients with Alzheimer's disease (n = 206) and mild cognitive impairment (n = 354). We first trained the autoencoder on an independent dataset (UK Biobank dataset) with 11,034 healthy controls. Then, we estimated how each patient deviated from this norm and established which brain regions were associated to this deviation. Finally, we compared the performance of our normative model against traditional classifiers. As expected, we found that patients exhibited deviations according to the severity of their clinical condition. The model identified medial temporal regions, including the hippocampus, and the ventricular system as critical regions for the calculation of the deviation score. Overall, the normative model had comparable cross-cohort generalizability to traditional classifiers. To promote open science, we are making all scripts and the trained models available to the wider research community.Entities:
Mesh:
Year: 2021 PMID: 34344910 PMCID: PMC8333350 DOI: 10.1038/s41598-021-95098-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographic information for the subjects from the UK Biobank dataset, the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset, and the Australian Imaging Biomarkers and Lifestyle Study of Ageing (AIBL) dataset. We used ANOVA test and the chi‐square test of homogeneity to test for significant differences in age and sex between healthy controls and patients. Abbreviations: HC = healthy control; EMCI = early mild cognitive impairment; LMCI = late mild cognitive impairment; AD = Alzheimer’s disease; MCI = mild cognitive impairment; SD = standard deviation.
| UK BIOBANK | ADNI | p | AIBL | p | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| HC | EMCI | LMCI | AD | HC | MCI | AD | ||||
| 0.87 | 0.28 | |||||||||
| Mean ± SD | 61.6 ± 7.0 | 66.6 ± 3.7 | 66.4 ± 4.2 | 66.2 ± 5.1 | 66.6 ± 5.4 | 68.2 ± 3.2 | 68.2 ± 3.6 | 67.3 ± 5.2 | ||
| Range | [47, 73] | [56, 72] | [56, 73] | [56, 73] | [56, 73] | [60, 73] | [56, 73] | [55, 73] | ||
| 0.25 | 0.15 | |||||||||
| Men | 5180 (47) | 90 (42) | 72 (45) | 40 (49) | 36 (56) | 113 (43) | 19 (41) | 17 (45) | ||
| Women | 5854 (53) | 122 (58) | 87 (55) | 42 (51) | 28 (44) | 149 (57) | 27 (59) | 21 (55) | ||
Demographic information for the subjects from the Alzheimer’s Disease Repository Without Borders (ARWiBo) dataset, the Open Access Series of Imaging Studies: Cross-Sectional (OASIS-1) dataset, and the Minimal Interval Resonance Imaging in Alzheimer's Disease (MIRIAD) dataset. We used ANOVA test and the chi‐square test of homogeneity to test for significant differences in age and sex between healthy controls and patients. Abbreviations: HC = healthy control; AD = Alzheimer’s disease; MCI = mild cognitive impairment; SD = standard deviation.
| ARWiBo | p | OASIS-1 | p | MIRIAD | p | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| HC | MCI | AD | HC | AD | HC | AD | ||||
| 0.16 | 0.07 | 0.71 | ||||||||
| Mean ± SD | 65.1 ± 4.4 | 66.4 ± 5.7 | 65.1 ± 6.0 | 68.2 ± 3.8 | 69.7 ± 3.1 | 66.7 ± 4.1 | 66.2 ± 4.6 | |||
| Range | [57, 73] | [47, 73] | [50, 73] | [61, 73] | [62, 73] | [59, 73] | [56, 73] | |||
| 0.70 | 0.30 | 0.76 | ||||||||
| Men | 86 (40) | 23 (34) | 14 (38) | 11 (27) | 14 (38) | 7 (39) | 13 (43) | |||
| Women | 129 (60) | 44 (66) | 23 (62) | 30 (73) | 23 (62) | 11 (61) | ||||
Figure 1Structure of the normative model based on adversarial autoencoders. In this configuration, the subject data is inputted into the encoder and then mapped to the latent code. This latent code is fed to the decoder with the demographic data, and then the decoder generates a reconstruction of the original data. During the training of the model, the discriminator predicts if its input data came from the latent code or if it was randomly sampled from the chosen prior distribution (e.g. Gaussian distribution). Based on these predictions, the adversarial autoencoder forces the encoder to produce a latent code similar to the prior distribution selected. Since the model is trained on healthy controls data, it is expected that it can reconstruct similar data relatively well, yielding a small reconstruction error. However, the model is expected to generate a high error when processing data affected by unseen underlying mechanisms, e.g. pathological mechanisms.
Figure 2Mean value of the observed deviation calculated by the autoencoder for each group. The square marker indicates the mean value and the horizontal bars indicates the 95% confidence interval calculated using the percentile method on the bootstrap analysis. Abbreviations: AD = Alzheimer’s disease; EMCI = early mild cognitive impairment; LMCI = late mild cognitive impairment; MCI = mild cognitive impairment; HC = healthy controls; ADNI = Alzheimer’s Disease Neuroimaging Initiative; AIBL = Australian Imaging Biomarkers and Lifestyle Study of Ageing; ARWiBo = Alzheimer's Disease Repository Without Borders; OASIS-1 = Open Access Series of Imaging Studies: Cross-Sectional; MIRIAD = Minimal Interval Resonance Imaging in Alzheimer's Disease.
Figure 3Discriminative performance of the normative approach. The solid line indicates the mean receiver operating characteristic curve across the bootstrap iterations with the shaded area indicating the 95% confidence interval calculated using the percentile method on the bootstrap analysis. The dashed line indicates the chance level. Abbreviations: AD = Alzheimer’s disease; AUC-ROC = area under the receiver operating characteristic curve; EMCI = early mild cognitive impairment; LMCI = late mild cognitive impairment; MCI = mild cognitive impairment; HC = healthy controls; ADNI = Alzheimer’s Disease Neuroimaging Initiative; AIBL = Australian Imaging Biomarkers and Lifestyle Study of Ageing; ARWiBo = Alzheimer's Disease Repository Without Borders; OASIS-1 = Open Access Series of Imaging Studies: Cross-Sectional; MIRIAD = Minimal Interval Resonance Imaging in Alzheimer's Disease.
Figure 4Brain regions deviations. The square marker indicates the mean effect size (Cliff’s delta) between the healthy control group and the respective patient groups. The horizontal bars indicate the 95% confidence interval calculated using the percentile method on the bootstrap analysis. Only the regions with a mean effect size significantly different from zero are presented. Abbreviations: AD = Alzheimer’s disease; AUC-ROC = area under the receiver operating characteristic curve; EMCI = early mild cognitive impairment; LMCI = late mild cognitive impairment; MCI = mild cognitive impairment; HC = healthy controls; ADNI = Alzheimer’s Disease Neuroimaging Initiative; AIBL = Australian Imaging Biomarkers and Lifestyle Study of Ageing; ARWiBo = Alzheimer's Disease Repository Without Borders; OASIS-1 = Open Access Series of Imaging Studies: Cross-Sectional; MIRIAD = Minimal Interval Resonance Imaging in Alzheimer's Disease.
Generalization performance of the classifiers for the classification between HC and patients with Alzheimer’s disease. In this table, the rows indicate the dataset where the classifier is trained and the columns indicate the dataset where the performance was tested. The area under the receiver operating characteristic curve is shown with the upper and lower bound of its 95% confidence interval. Performance significantly different from the normative approach calculated using the confidence interval of the difference between the approach across the bootstrap scheme is indicated by “*”.
| ADNI | AIBL | ARWiBo | OASIS-1 | MIRIAD | |
|---|---|---|---|---|---|
| ADNI | – | 0.89 [0.93, 0.83] * | 0.88 [0.81, 0.93] | 0.84 [0.76, 0.90] * | 0.98 [0.95, 1.00] |
| AIBL | 0.88 [0.82, 0.93] * | – | 0.89 [0.93, 0.83] * | 0.86 [0.80, 0.91] * | 0.98 [0.94, 1.00] |
| ARWiBo | 0.88 [0.81, 0.92] * | 0.88 [0.83, 0.92] * | – | 0.80 [0.72, 0.86] | 0.96 [0.91, 0.99] |
| OASIS-1 | 0.90 [0.84, 0.93] * | 0.89 [0.83, 0.93] * | 0.86 [0.77, 0.92] | – | 0.96 [0.91, 1.00] |
| MIRIAD | 0.89 [0.83, 0.93] * | 0.87 [0.80, 0.91] * | 0.82 [0.73, 0.91] | 0.83 [0.74, 0.89] | – |
Generalization performance of the classifiers for the classification between HC and patients with mild cognitive impairment. In this table, the rows indicate the dataset where the classifier is trained and the columns indicate the dataset where the performance was measured. The area under the receiver operating characteristic curve is shown with the upper and lower bound of its 95% confidence interval. No case had a performance significantly different from the normative approach calculated using the confidence interval of the difference between the approach across the bootstrap scheme.
| AIBL | ARWiBo | |
|---|---|---|
| AIBL | – | 0.61 [0.54, 0.67] |
| ARWiBo | 0.59 [0.53, 0.65] | – |