| Literature DB >> 31077314 |
Angela Tam1,2, Christian Dansereau1,3, Yasser Iturria-Medina4, Sebastian Urchs1,4, Pierre Orban1,5,6, Hanad Sharmarke1, John Breitner2,7, Pierre Bellec1,8.
Abstract
BACKGROUND: Clinical trials in Alzheimer's disease need to enroll patients whose cognition will decline over time, if left untreated, in order to demonstrate the efficacy of an intervention. Machine learning models used to screen for patients at risk of progression to dementia should therefore favor specificity (detecting only progressors) over sensitivity (detecting all progressors), especially when the prevalence of progressors is low. Here, we explore whether such high-risk patients can be identified using cognitive assessments and structural neuroimaging by training machine learning tools in a high-specificity regime.Entities:
Keywords: Alzheimer's disease; cognition; machine learning; mild cognitive impairment; neuroimaging
Mesh:
Year: 2019 PMID: 31077314 PMCID: PMC6511068 DOI: 10.1093/gigascience/giz055
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Summary of objectives, experiments, and main findings
| Specific objectives | Experiments | Main findings |
|---|---|---|
| (i) Identify subtypes of brain atrophy patterns | We used unsupervised clustering on atrophy maps generated from structural images in patients with AD and cognitively normal participants | Seven distinct patterns of atrophy were identified, some of which were strongly associated with a diagnosis of AD (Fig. |
| (iia) Replicate previous findings from works that used cognitive and structural features to predict progression to AD from MCI | A linear SVM that was optimized for accuracy was trained on the following features: (i) structural atrophy patterns, (ii) multi-domain cognitive assessments, and (iii) a combination of both | The SVM based on cognitive features had higher predictive value than the structural MRI signature, similar to previous findings [ |
| (iib) Train a model in a high-specificity regime to identify high-confidence AD participants with a high-risk signature | We used a 2-stage algorithm to ensure that we were maximizing specificity over sensitivity. We trained on the following features: (i) structural atrophy patterns, (ii) multi-domain cognitive assessments, and (iii) a combination of both | The 2-stage algorithm resulted in a model that achieved high specificity and high PPV, with reduced sensitivity (Fig. |
| (iii) Assess whether the high-risk signature generated by the 2-stage algorithm can identify progressors in participants with MCI within a 3-year period | We measured PPV, specificity, sensitivity, and accuracy of the model in predicting progressors in 2 separate MCI cohorts | The model achieved high specificity and high PPV, again at the cost of sensitivity and accuracy (Figs |
| (iv) Test the performance of the 2-stage algorithm against standard algorithms | We compared the ROC performance of the 2-stage algorithm against standard algorithms (e.g., KNN, GNB, SVM with an RBF kernel) | The performance of the 2-stage algorithm did not differ from standard algorithms, in terms of area under an ROC curve, but was the only one to operate in a high-specificity, low-sensitivity regime (Fig. |
| (v) Validate whether this high-risk signature represents a prodromal phase of AD | We compared cognitive decline, Aβ, and τ burden in tagged high-risk individuals against those who were not | Tagged high-risk individuals experienced sharper cognitive decline and higher levels of Aβ and τ than non-tagged individuals (Fig. |
| (vi) Assess the complementarity of cognitive and structural measures | We examined whether there was overlap in the participants who were identified by the 3 high-risk signatures | The majority of participants who were identified by the multimodal high-risk signature had been identified as such by the unimodal cognitive and unimodal structural signatures. The unimodal cognitive signature identified the majority of all high-risk participants (Fig. |
Figure 1:Subtyping procedure and resulting subtypes. (a) A hierarchical clustering procedure identified 7 subtypes, or subgroups, of individuals with similar patterns of grey matter topography within the ADNI1 cohort of CN and AD participants (top). A measure of spatial similarity, called subtype weight, between a single individual's GMV map and the average of a given subtype was calculated for all individuals and all subtypes (bottom). (b) Maps of the 7 subtypes showing the distribution of grey matter across all voxels relative to the average. CN* and AD* denote significant associations between the subtype weights and diagnoses of CN or AD, respectively.
Demographic information for post−quality control participants in ADNI1 and ADNI2
| Parameter | CN | sMCI | pMCI | AD |
|---|---|---|---|---|
| ADNI1 | ||||
| No. | 205 | 88 | 147 | 165 |
| Age, y, mean ± SD | 76.1 ± 5.0 | 74.0 ± 7.6 | 74.3 ± 7.1 | 75.4 ± 7.5 |
| Female sex, % | 51.7 | 40.9 | 40.8 | 51.5 |
| APOE4+, % | 27.8 | 37.5 | 68.7 | 65.4 |
| ADAS13 score, mean ± SD | 9.5 ± 4.3 | 14.3 ± 5.5 | 21.3 ± 5.3 | 28.6 ± 7.1 |
| MMSE score, mean ± SD | 29.1 ± 1.0 | 27.7 ± 1.7 | 26.7 ± 1.7 | 23.4 ± 2.0 |
| ADNI2 | ||||
| No. | 188 | 180 | 55 | 89 |
| Age, y, mean ± SD | 72.8 ± 6.1 | 70.8 ± 7.3 | 72.1 ± 7.1 | 74.4 ± 7.8 |
| Female sex, % | 54.0 | 47.8 | 49.1 | 46.1 |
| APOE4+, % | 29.4 | 35.6 | 65.4 | 71.3 |
| ADAS13 score, mean ± SD | 9.1 ± 4.2 | 11.8 ± 5.3 | 21.4 ± 6.5 | 31.6 ± 8.7 |
| MMSE score, mean ± SD | 29.1 ± 1.1 | 28.4 ± 1.6 | 27.3 ± 1.9 | 23.1 ± 2.3 |
ADAS13: Alzheimer's Disease Assessment Scale—Cognitive subscale (13 items); MMSE: Mini Mental State Examination; SD: standard deviation.
Figure 2:Accuracy, specificity, sensitivity, and PPV for different classifiers: linear SVM, HPS, and the linear SVM thresholded at 0.95 (SVM 0.95), for the classifications of patients with AD and CN individuals and patients with MCI who progress to AD (pMCI) and stable MCI (sMCI) in ADNI1 and ADNI2. VBM represents the model trained with VBM subtypes, COG represents the model trained with baseline cognitive scores, and VCOG represents the model trained with both VBM subtypes and cognition. PPV was adjusted [PPV (adj)] for a prevalence of 33.6% pMCI in a sample of MCI participants for both ADNI1 and ADNI2 MCI cohorts. Significant differences are denoted by * for P < 0.05 and ** for P < 0.001.
Figure 3:ROC curves for various machine learning algorithms with different features (VBM for VBM subtypes only, COG for cognitive features only, VCOG for a combination of VBM subtypes and cognitive features). Algorithms included an SVM with a radial basis function kernel (RBF SVM), K nearest neighbours (KNN), random forest (RF), Gaussian naive Bayes (GNB), an SVM with a linear kernel representing the first stage (Linear SVM) of the 2-stage predictive model, and the 2-stage HPS. TPR: true-positive rate; FPR: false-positive rate; AUC: area under the curve.
Figure 4:Characteristics of participants with MCI with the VCOG signature in ADNI1 and ADNI2. We show the percentage of participants with MCI who (a) progressed to dementia and were (b) APOE4 carriers, (c) female, (d) positive for Aβ measured by a cut-off of 192 pg/mL in the CSF [38], and (e) positive for τ measured by a cut-off of 93 pg/mL in the CSF [38] in each classification (high confidence, low confidence, and negative). (f) Age and (g) cognitive trajectories, measured by the Alzheimer's Disease Assessment Scale-Cognitive subscale with 13 items (ADAS13), across the 3 classes. Significant differences are denoted by asterisks for family-wise error rate-corrected P < 0.05.
Figure 5:Coefficients of the high-confidence prediction (a) VCOG HPS model, (b) COG HPS model, (c) VBM HPS model. ADAS13 = Alzheimer's Disease Assessment Scale—Cognitive; MEM = ADNI-MEM score; EXEC = ADNI-EF score; BNT = Boston Naming Test; CLOCK = clock drawing test; VBM 1–7 = VBM subtype weights.
Figure 6:Venn diagram depicting the number of participants with MCI labelled as high confidence by the VBM, COG, and VCOG HPS models in ADNI1 and ADNI2.