| Literature DB >> 32714766 |
Dan Jin1,2, Bo Zhou3, Ying Han4, Jiaji Ren1,2, Tong Han5, Bing Liu1,2,6, Jie Lu7, Chengyuan Song8, Pan Wang9, Dawei Wang10, Jian Xu11, Zhengyi Yang1,2, Hongxiang Yao12, Chunshui Yu13, Kun Zhao14, Max Wintermark15, Nianming Zuo1,2, Xinqing Zhang4, Yuying Zhou9, Xi Zhang3, Tianzi Jiang1,2,6, Qing Wang10, Yong Liu1,2,6,16.
Abstract
Precision medicine for Alzheimer's disease (AD) necessitates the development of personalized, reproducible, and neuroscientifically interpretable biomarkers, yet despite remarkable advances, few such biomarkers are available. Also, a comprehensive evaluation of the neurobiological basis and generalizability of the end-to-end machine learning system should be given the highest priority. For this reason, a deep learning model (3D attention network, 3DAN) that can simultaneously capture candidate imaging biomarkers with an attention mechanism module and advance the diagnosis of AD based on structural magnetic resonance imaging is proposed. The generalizability and reproducibility are evaluated using cross-validation on in-house, multicenter (n = 716), and public (n = 1116) databases with an accuracy up to 92%. Significant associations between the classification output and clinical characteristics of AD and mild cognitive impairment (MCI, a middle stage of dementia) groups provide solid neurobiological support for the 3DAN model. The effectiveness of the 3DAN model is further validated by its good performance in predicting the MCI subjects who progress to AD with an accuracy of 72%. Collectively, the findings highlight the potential for structural brain imaging to provide a generalizable, and neuroscientifically interpretable imaging biomarker that can support clinicians in the early diagnosis of AD.Entities:
Keywords: Alzheimer's disease; computer‐aided diagnosis; neurobiological basis; neuroscientifically interpretable biomarkers; structural magnetic resonance imaging
Year: 2020 PMID: 32714766 PMCID: PMC7375255 DOI: 10.1002/advs.202000675
Source DB: PubMed Journal: Adv Sci (Weinh) ISSN: 2198-3844 Impact factor: 16.806
Figure 1Schematic of the data analysis pipeline. A) The architecture of the 3D attention network (3DAN). In the attention mechanism module, each voxel i of the H × W × D‐dimensional feature maps F was weighted by the H × W × D‐dimensional attention map M. The trainable attention map M was independent of the channel of the features and was only related to the spatial position. B) The attention score map (left: in‐house database, right: ADNI database) was generated by the attention mechanism module of the 3DAN model, indicating the discriminative power of various brain regions for AD diagnosis. C) To test the robustness and generalizability of the 3DAN model, cross validations were performed using two completely independent databases (an in‐house database and the ADNI database) (Details can be found in Table 1). D) Investigation of the association between the classification output and clinical measures [that is the cognitive function measured by Mini‐Mental State Examination (MMSE), CSF beta‐amyloid (Aβ), CSF tau, and polygenic risk scores (PGRS)] in the AD and MCI groups.
Classification performance of the proposed 3DAN method in the AD and NC classification tasks
| Training set | Testing set | ACC | SEN | SPE | AUC | |
|---|---|---|---|---|---|---|
|
| In‐house( | ADNI( | 0.861 | 0.881 | 0.846 | 0.912 |
|
| ADNI | In‐house | 0.870 | 0.789 | 0.961 | 0.913 |
|
| In‐house leave‐center‐outcross‐validation | 0.909 | 0.869 | 0.957 | 0.940 | |
|
| ADNI 10‐foldcross‐validation | 0.921 | 0.890 | 0.944 | 0.941 | |
Abbreviations: ACC = accuracy; SEN = sensitivity; SPE = specificity; AUC = area under the curve of the receiver operating characteristic.
Figure 2Diagnostic performance of the ROC curves for AD/NC classification (A–D) and pMCI/sMCI classification (E,F). A) ROC curve for the classifier that was trained on the in‐house database and tested on the ADNI database; B) ROC curve for the classifier that was trained on the ADNI database and tested on the in‐house database; C) ROC curve for the classifier that was trained and tested on the in‐house database with leave‐center‐out cross‐validation (CV); D) ROC curve for the classifier that was trained and tested on the ADNI database with tenfold cross‐validation; E) ROC curve of the pMCI/sMCI classification with tenfold cross‐validation on the ADNI database; F) violin plots for the distributions of the pMCI/sMCI classifications.
Figure 3A) Mean attention score map derived from the in‐house database (left) and the ADNI database (right) and the correlation between these two attention maps (middle). Brighter colors indicate that the region is more discriminative for AD classification. The regions whose attention scores were in the top 30% (82/273) are displayed. The correlation figure indicates the replicability of the in‐house and ADNI databases. B) Correlation analysis between the mean attention score and the t statistic score of the gray matter volume between the NC and AD for the 273 ROIs in the Brainnetome Atlas for the in‐house database (left) and ADNI database (right). C) Correlation maps between the attention score for the regions and the MMSE scores with FDR correction (p < 0.05) in the in‐house database (left) and the ADNI database (right) and the relationship between the two correlation maps (middle). The correlation figure indicates the replicability of the in‐house and ADNI databases. D) Correlation between classification accuracy and the mean attention score of K groups of regions. The abscissa value of each point in the scatter plots represents the mean attention score of [273/K] brain regions in each group, and the ordinate value of each point in the scatter plots represents the classification accuracy based on the images of [273/K] brain regions in each group. At each K, the fact that higher attention scores are associated with higher classification accuracy reflects the effectiveness of the attention mechanism (Details of the method can be found in Figure S4, Supporting Information).
Figure 4Correlations between the class scores and the MMSE scores in the in‐house database (A) and the ADNI database (B). Correlations between the class scores and the CSF Aβ (n = 472) (C), CSF tau (n = 472), (D) and polygenetic risk factors (n = 321) (E) of individual subjects in the ADNI database. F) Correlation between the classification output and the length of time before conversion to AD of the pMCI individuals in the ADNI database.
Comparison of the classification performance of the proposed 3DAN method with other methods for AD diagnosis in Strategies 1 and 2 in Table 1
| Method | Training: In‐house, Testing: ADNI | Training: ADNI, Testing: In‐house | ||||||
|---|---|---|---|---|---|---|---|---|
| ACC | SEN | SPE | AUC | ACC | SEN | SPE | AUC | |
| 3DAN | 0.861 | 0.881 | 0.846 | 0.912 | 0.870 | 0.789 | 0.961 | 0.913 |
| ResNet | 0.853 | 0.863 | 0.846 | 0.907 | 0.860 | 0.759 | 0.974 | 0.910 |
| VBM | 0.712 | 0.947 | 0.538 | 0.907 | 0.821 | 0.667 | 0.996 | 0.908 |
| ROI‐AAL | 0.720 | 0.947 | 0.551 | 0.885 | 0.811 | 0.651 | 0.991 | 0.888 |
| ROI‐BNA | 0.744 | 0.960 | 0.584 | 0.901 | 0.813 | 0.651 | 0.996 | 0.894 |
Abbreviations: ACC = accuracy; SEN = sensitivity; SPE = specificity; AUC = area under the curve of the receiver operating characteristic; BNA = Brainnetome Atlas; AAL = anatomical automatic labeling; ROI = region of interest; VBM = voxel‐based morphometric.