| Literature DB >> 29950585 |
Xuemei Ding1,2, Magda Bucholc3, Haiying Wang4, David H Glass4, Hui Wang4, Dave H Clarke5, Anthony John Bjourson6, Le Roy C Dowey7,8, Maurice O'Kane7, Girijesh Prasad3, Liam Maguire3, KongFatt Wong-Lin9.
Abstract
There is currently a lack of an efficient, objective and systemic approach towards the classification of Alzheimer's disease (AD), due to its complex etiology and pathogenesis. As AD is inherently dynamic, it is also not clear how the relationships among AD indicators vary over time. To address these issues, we propose a hybrid computational approach for AD classification and evaluate it on the heterogeneous longitudinal AIBL dataset. Specifically, using clinical dementia rating as an index of AD severity, the most important indicators (mini-mental state examination, logical memory recall, grey matter and cerebrospinal volumes from MRI and active voxels from PiB-PET brain scans, ApoE, and age) can be automatically identified from parallel data mining algorithms. In this work, Bayesian network modelling across different time points is used to identify and visualize time-varying relationships among the significant features, and importantly, in an efficient way using only coarse-grained data. Crucially, our approach suggests key data features and their appropriate combinations that are relevant for AD severity classification with high accuracy. Overall, our study provides insights into AD developments and demonstrates the potential of our approach in supporting efficient AD diagnosis.Entities:
Mesh:
Year: 2018 PMID: 29950585 PMCID: PMC6021389 DOI: 10.1038/s41598-018-27997-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Proposed hybrid computational framework. CAIM: class-attribute interdependence maximization. 10-fold CV: 10-fold cross validation. SMOTE: synthetic minority over-sampling technique. Correlation analysis validates usage of clinical dementia rating (CDR) as an index of AD severity. The CAIM algorithm is used to discretize the considered features with respect to CDR. Entropy-based feature selection with 10-fold CV is applied to a model development set to identify features most relevant for predicting AD severity. SMOTE technique is implemented to balance unbalanced disease classes in the model development set in order to avoid inflated performance estimates. 10-fold CV is used to evaluate the capability of various scoring functions of BNs and determine the BN with the optimal predictive performance. The trained BN models are evaluated on an independent test set partitioned from the original data. Prior knowledge from domain experts is used to provide constraints in structure learning (see Methods section for more details).
Figure 2Strong correlation between clinical diagnosis and Clinical Dementia Rating (CDR) categories. Vertical bars: healthy control (HC); horizontal bars: MCI; diagonal brick bars: AD; Cor.: correlation coefficient; CI: confidence interval. CDR scores reflect 5 categories: normal controls (CDR = 0), very mild (CDR = 0.5), mild (CDR = 1), moderate (CDR = 2), and severe (CDR = 3) patients. Clinical diagnosis contains 3 categories: HC, MCI, and AD. The data distribution along with the Cor., 95% CI, and p-value showed a significant correlation between clinical diagnosis and CDR.
Figure 3Optimized Bayesian network (BN) structure via 10-fold CV with probabilistic dependencies among predisposing factors, psychological/functional assessments, and AD severity. ApoE: apolipoprotein E genotype; GM: grey matter volume; CSF: cerebrospinal volume; PiB-PET: Pittsburgh compound B - positron-electron tomography; CDR: clinical dementia rating; MMSE: mini-mental state examination; LMIR/LMDR: logical memory immediately/delayed recall. BN is constructed based on the complete data. The thickness of the arrows represents the strength of the probabilistic influence between features. CDR is directly influenced by neuroimaging-based CSF, GM, and PiB-PET, while indirectly influenced by age and ApoE. The probabilistic influences between CDR and psychological/functional assessments are much stronger than those between predisposing indicators/biomarkers and CDR.
Figure 4Bayesian networks based on the complete data at different times. (A) Group 1 data at BL (197 participants). (B) Group 1 data, assessments conducted at least once during the M18-54 time interval (130 participants). (C) Group 2 longitudinal data at BL (133 participants) including time-evolved features. (D) Group 2 longitudinal data, assessments conducted at least once during the M18-54 time interval including time-evolved features. The thickness of arrows represents the strength of the probabilistic influences between variables. As the Group 2 set uses longitudinal data focuses, the ApoE feature was disregarded due to its unchanging nature.
Figure 5Classification accuracy (AUC) of individual predisposing indicators and biomarkers and their combinations with respect to CDR. Circled markers: BN models constructed using individual as well as combinations of predisposing factors/biomarkers without ApoE. Squared markers: BN models constructed using individual as well as combinations of predisposing factors/biomarkers with ApoE. The incorporation of ApoE into the BN structure generally improved the model performance.
The AUC and MCA performance of the BN models constructed based on the combination of all predisposing factors with cognitive/functional assessments.
| Features | AUC | MCA |
|---|---|---|
| All predisposing factors (Age + GM + CSF + PiB-PET + ApoE) | 0.81 | 0.72, 95%CI [0.59, 0.83] |
| All predisposing factors + MMSE | 0.86 | 0.76, 95%CI [0.63, 0.86] |
| All predisposing factors + LMIR | 0.85 | 0.73, 95%CI [0.60, 0.84] |
| All predisposing factors + LMDR | 0.83 | 0.69, 95%CI [0.54, 0.79] |
| All predisposing factors + MMSE + LMIR | 0.89 | 0.81, 95%CI [0.69, 0.90] |
| All predisposing factors + MMSE + LMDR | 0.91 | 0.80, 95%CI [0.67, 0.89] |
| All predisposing factors + LMIR + LMDR | 0.87 | 0.75, 95%CI [0.62, 0.85] |
| All predisposing factors + MMSE + LMIR + LMDR | 0.91 | 0.80, 95%CI [0.67, 0.89] |