| Literature DB >> 28811553 |
Asha Singanamalli1, Haibo Wang2, Anant Madabhushi3.
Abstract
The introduction of mild cognitive impairment (MCI) as a diagnostic category adds to the challenges of diagnosing Alzheimer's Disease (AD). No single marker has been proven to accurately categorize patients into their respective diagnostic groups. Thus, previous studies have attempted to develop fused predictors of AD and MCI. These studies have two main limitations. Most do not simultaneously consider all diagnostic categories and provide suboptimal fused representations using the same set of modalities for prediction of all classes. In this work, we present a combined framework, cascaded multiview canonical correlation (CaMCCo), for fusion and cascaded classification that incorporates all diagnostic categories and optimizes classification by selectively combining a subset of modalities at each level of the cascade. CaMCCo is evaluated on a data cohort comprising 149 patients for whom neurophysiological, neuroimaging, proteomic and genomic data were available. Results suggest that fusion of select modalities for each classification task outperforms (mean AUC = 0.92) fusion of all modalities (mean AUC = 0.54) and individual modalities (mean AUC = 0.90, 0.53, 0.71, 0.73, 0.62, 0.68). In addition, CaMCCo outperforms all other multi-class classification methods for MCI prediction (PPV: 0.80 vs. 0.67, 0.63).Entities:
Mesh:
Substances:
Year: 2017 PMID: 28811553 PMCID: PMC5558022 DOI: 10.1038/s41598-017-03925-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of related previous work.
| Previous Work | Modalities | Methods | N | Classes and Performance |
|---|---|---|---|---|
| Gray | Baseline T1w MRI, FDG PET, CSF | Joint embedding of manifolds constructed using random forest based similarity measure | 147 | AD/HC (Acc: 89% +/−0.7), MCI/HC (Acc: 74.6% +/−0.8), pMCI/sMCI (Acc: 58% +/−0.9) |
| Zhang | Baseline T1w MRI, FDG PET, CSF | Kernel combination method embedded with support vector machine classifier | 202 | AD/HC (Acc: 93.2%, Sen: 93%, Spec: 93.3%), MCI/HC (Acc: 76.4%, Sen: 81.8%, Spec: 66%; 91.5% pMCI and 73.4% sMCI classified as MCI) |
| Hinrichs | Baseline and longitudinal T1w MRI, FDG PET, cognitive measures; Baseline CSF, ApoE | Multi-kernel learning framework with support vector machine classifier | 233 | AD/HC (Acc: 92.4%, Sen: 86.7%, Spec: 96.6%, AUC: 0.977), pMCI/rMCI (AUC: 0.97), pMCI/sMCI (AUC: 0.77) |
| Westman | Baseline T1w MRI, CSF | Orthogonal partial least squares (OPLS) | 369 | AD/HC (Acc: 91.8%, Sen: 88.5%, Spec: 94.6%, AUC: 0.958), MCI/HC (Acc: 77.6%, Sen: 72.8%, Spec: 84.7%, AUC: 0.876), pMCI/sMCI using AD/HC model (Acc: 58.6%, 65.8%, 66.4%, 66.1%, AUC: 0.594, 0.647, 0.610, 0.578 for conversion within 12, 18, 24 and 36 months, respectively) |
| Da | Baseline T1w MRI, Cognitive scores, CSF, ApoE | SVM Classification of concatenated features | 432, 381 | AD/HC (T1w MRI AUC: 0.98), sMCI/pMCI Kalpan Meier analysis |
| Davatzikos | Baseline T1w MRI (SPARE-AD), CSF | SVM Classification of concatenated features; pMCI and sMCI categorization based on global CDR score change at follow-up (6–36 months) | 239 | sMCI/pMCI (T1w MRI AUC: 0.734, T1w MRI + CSF AUC: 0.671) |
| Suk | Baseline T1w MRI, FDG PET | Joint feature representation of image patches using Deep Boltzman Machine (DBM) | 194, 305, 204 | AD/HC (Acc: 95.35%, AUC: 0.9877), MCI/HC (Acc: 85.67%, AUC: 0.88), pMCI/sMCI (Acc: 75.92%, AUC: 0.747) |
| Zhu | T1w MRI, FDG PET, CSF | Feature selection method and regression to predict clinical variables in addition to class labels | 202 | AD/HC (Acc: 95.9%, AUC: 98.8), MCI/HC (Acc: 82.0%, AUC: 87.0), sMCI/pMCI (Acc: 72.6%, AUC: 78.8%) |
| Liu | Baseline T1w MRI, FDG PET | Fused data representation of image patches using stacked autoencoder for multiclass classification | 331 | AD/HC (Multiclass Precision: 59.1 +/− 19.7, 52.2 +/− 11.8, 40.2 +/− 14.4, 64.1 +/− 15.24 for HC, sMCI, pMCI and AD, Acc: 53.8 +/− 4.8, Sen: 52.1 +/− 11.8, Spe:87 +/− 9.6) |
Figure 1The cascade and the modalities for fusion at each level of the cascade were determined on training set and validated on independent testing set. Neurophysiological test scores (ADAS-Cog) are fused with CSF proteomics and APOE at the first level of the cascade to identify healthy controls (HC). At the second level, ADAS-Cog scores are combined with PET to distinguish between patients with Alzheimer’s Disease (AD) and mild cognitive impairment (MCI).
Summary of Notations.
| Symbol | Description |
|---|---|
|
| subjects, total number of subjects |
|
| modalities, total number of modalities; |
|
| features, total number of features in each modality; |
|
| total number of features over all modalities; |
|
| data matrix containing features from modality |
|
| concatenated data matrix containing all features from all modalities |
|
| weight vector for modality |
|
| weight matrix for modality |
|
| concatenated weight vector over all modalities |
|
| weight matrix for all modalities |
|
| label matrix |
|
| classes, total number of classes |
|
| notation used in sMVCCA to denote |
|
| data vector of selected modalities |
|
| total number of features over modalities in |
|
| concatenated data matrix containing all features from a subset of modalities [ |
|
| dimensionality of the fused data subspace |
Figure 2Cascaded multiview canonical correlation analysis (CaMCCo) algorithm for constructing the joint multimodal data fusion and multiclass classification framework.
Clinical and demographic information of the 149 ADNI subjects considered in this study, selected based on the availability of imaging, non-imaging and clinical metrics at baseline.
| Diagnosis | N(F/M) | Age | MMSE Score |
|---|---|---|---|
| AD | 52 (16/24) | 75.1 +/− 8.1 | 23.8 +/− 2.0 |
| MCI | 71 (17/37) | 74.1 +/− 7.2 | 27.1 +/− 1.7 |
| HC | 26 (10/14) | 74.9 +/− 7.3 | 28.6 +/− 1.4 |
| Total | 149 (43/75) | 74.2 +/− 7.2 | 26.3 +/− 2.6 |
The dataset was split into independent training set with 60 cases (40%) and a holdout validation set with 89 cases (60%). Note that gender information was unavailable for a subset of the data, as a result of which N does not equal to the sum of females (F) and males (M).
Summary of features considered in this study from each modality.
| Modality | Features | Description | Number |
|---|---|---|---|
| Neurophysiologic Exam | Modified ADAS-Cog score[ | Score based on cognitive test assessing memory, praxis, orientation, word recall and recognition | 1 |
| T1w MRI | Volumetric Measurements | Volumetric measures of atlas based segmented brain regions | 327 |
| FDG PET | Hippocampal Glucose Metabolism[ | Pons normalized left and right hippocampal glucose metabolism | 2 |
| CSF Proteomics | t-tau, A | Markers of neuronal degeneration, plaque formation and tau hyperphosphorylation[ | 3 |
| Plasma Proteomics | Adiponectin, Insulin, Fibrinogen etc[ | Concentrations of signaling proteins in blood, measured by multiplex immunoassay panel | 146 |
| ApoE Genotype | ApoE alleles 1 & 2 | Combination of allele forms | 1 |
Summary of notations used to refer to comparative strategies evaluated in this work.
| Symbol | Description |
|---|---|
|
| |
|
| Cascaded Classifier |
|
| One Shot Classifier |
|
| One-vs-All Classifier |
|
| Binary Classifier |
|
| |
|
| Supervised Multiview CCA data fusion approach |
|
| Principal component analysis of concatenated features as baseline fusion approach |
|
| |
|
| Multimodal dataset comprising all modalities considered in this study |
|
| Multimodal dataset comprising select subset of all available modalities |
|
| Unimodal dataset containing quantitative attributes extracted from MRI |
|
| Unimodal dataset containing quantitative attributes extracted from PET |
|
| Unimodal dataset containing proteomic measurements from CSF |
|
| Unimodal dataset containing plasma proteomic data |
|
| Unimodal dataset containing APOE data |
|
| Unimodal dataset containing ADAS-Cog scores |
|
| |
|
| Cascaded classification of single modality MR data |
|
| Cascaded classification of single modality PET data |
|
| Cascaded classification of single modality CSF data |
|
| Cascaded classification of single modality Plasma Proteomics data |
|
| Cascaded classification of single modality APOE data |
|
| Cascaded classification of single modality ADAS-Cog data |
|
| Classifier resulting from CaMCCo framework, which is comprised of cascaded classifier, sMVCCA data fusion and modality selection |
|
| Cascaded classifier in combination with sMVCCA based data fusion method to combine all modalities |
|
| Cascaded classifier with PCA reduced representation of data concatenated from all modalities |
|
| Cascaded classifier with PCA reduced representation of data concatenated from selected subset of modalities |
|
| One-vs-all classifier constructed from sMVCCA fused data from selected modalities. |
|
| One shot classifier constructed from sMVCCA fused data from selected modalities. |
|
| Binary classifier constructed from sMVCCA fused data from selected modalities. |
Figure 3Performance of single and multi modality cascaded classifiers. Area under the ROC curve (AUC) for prediction of (a) healthy control (HC) from all cognitive impairments, and (b) mild cognitive impairment (MCI) from Alzheimer’s Disease (AD).
Performance of multiclass classification strategies – one shot classifier (OSC), one vs. all (OVA), cascaded classifier in CaMCCo – upon fusion of modalities chosen from training set for each classification task.
| ACC | BACC | AUC | SEN | SPEC | PPV | ||
|---|---|---|---|---|---|---|---|
| CN |
| 0.69 | 0.63 | 0.96 | 0.34 | 0.92 | 0.75 |
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
| |
| MCI |
| 0.69 | 0.69 | 0.77 | 0.68 |
| 0.67 |
|
| 0.68 | 0.68 | 0.77 | 0.78 | 0.59 | 0.63 | |
|
|
|
|
|
| 0.69 |
| |
| AD |
| 0.69 | 0.67 | 0.84 | 0.53 | 0.82 | 0.69 |
|
|
|
|
|
|
|
| |
|
| 0.80 | 0.78 | 0.89 | 0.69 | 0.88 | 0.80 |
The highest accuracy (ACC), balanced accuracy (BACC), area under the ROC curve (AUC), sensitivity (SEN), specificity (SPE) and positive predictive value (PPV) achieved for each class are shown in bold. These results indicate that although the performance of both and are comparable for CN and AD classification, outperforms all other methods for MCI classification.
Performance of the combined fusion () and modality selection () modules of CaMCCo for binary classification ().
| ACC | BACC | AUC | SEN | SPEC | PPV | ||
|---|---|---|---|---|---|---|---|
| AD vs. HC | ADAS - Cog |
|
|
|
|
|
|
| T1w MRI | 0.63 | 0.55 | 0.65 | 0.86 | 0.24 | 0.66 | |
| PET | 0.80 | 0.78 | 0.87 | 0.86 | 0.71 | 0.83 | |
| CSF | 0.91 | 0.89 | 0.97 | 0.97 | 0.82 | 0.90 | |
| PP | 0.59 | 0.47 | 0.51 | 0.93 | 0.00 | 0.61 | |
| APOE | 0.76 | 0.70 | 0.86 | 0.93 | 0.47 | 0.75 | |
|
| 0.87 | 0.85 |
| 0.93 | 0.76 | 0.87 | |
| MCI vs. HC | ADAS-Cog | 0.79 | 0.75 | 0.84 | 0.85 | 0.65 | 0.85 |
| T1w MRI | 0.69 | 0.49 | 0.53 | 0.98 | 0.00 | 0.70 | |
| PET | 0.72 | 0.67 | 0.76 | 0.80 | 0.53 | 0.80 | |
| CSF | 0.83 | 0.77 | 0.92 | 0.90 | 0.65 | 0.86 | |
| PP | 0.71 | 0.52 | 0.54 | 0.98 | 0.06 | 0.71 | |
| APOE | 0.83 | 0.71 | 0.84 |
| 0.41 | 0.80 | |
|
|
|
|
| 0.93 |
|
|
shows improvement over individual modality classifiers for MCI vs. HC, particularly in terms of achieving the both high sensitivity and specificity. For AD vs. HC, several individual modalities have sufficiently high classification performance and thereby leaving no room for further improvement with .