| Literature DB >> 34676625 |
Hao Guan1, Chaoyue Wang1, Jian Cheng2, Jing Jing3, Tao Liu2,4.
Abstract
Structural magnetic resonance imaging (sMRI) can capture the spatial patterns of brain atrophy in Alzheimer's disease (AD) and incipient dementia. Recently, many sMRI-based deep learning methods have been developed for AD diagnosis. Some of these methods utilize neural networks to extract high-level representations on the basis of handcrafted features, while others attempt to learn useful features from brain regions proposed by a separate module. However, these methods require considerable manual engineering. Their stepwise training procedures would introduce cascading errors. Here, we propose the parallel attention-augmented bilinear network, a novel deep learning framework for AD diagnosis. Based on a 3D convolutional neural network, the framework directly learns both global and local features from sMRI scans without any prior knowledge. The framework is lightweight and suitable for end-to-end training. We evaluate the framework on two public datasets (ADNI-1 and ADNI-2) containing 1,340 subjects. On both the AD classification and mild cognitive impairment conversion prediction tasks, our framework achieves competitive results. Furthermore, we generate heat maps that highlight discriminative areas for visual interpretation. Experiments demonstrate the effectiveness of the proposed framework when medical priors are unavailable or the computing resources are limited. The proposed framework is general for 3D medical image analysis with both efficiency and interpretability.Entities:
Keywords: Alzheimer's disease; convolutional neural network; early diagnosis; structural MRI; visual attention
Mesh:
Year: 2021 PMID: 34676625 PMCID: PMC8720194 DOI: 10.1002/hbm.25685
Source DB: PubMed Journal: Hum Brain Mapp ISSN: 1065-9471 Impact factor: 5.038
FIGURE 1Architecture of the proposed parallel attention‐augmented bilinear network (pABN). “pA” refers to the parallel attention‐augmented blocks
FIGURE 2A flowchart showing the modified double attention block used in the present study. Specifically, “m” represents number of channels of the convolutional layer, and “7,128” represents the product of the spatial dims of the feature map (i.e., dhw = 18 × 22 × 18 = 7,128); “MatMul” represents matrix multiplication; “OutPro” represents outer product
Demographic characteristics of the studied subjects
| Dataset | Category | Gender (F/M) | Age (±SD) | Education (±SD) | MMSE (±SD) |
|---|---|---|---|---|---|
| ADNI‐1 | NC | 110/112 | 76.0 ± 4.9 | 15.9 ± 2.9 | 29.1 ± 1.0 |
| sMCI | 69/123 | 74.7 ± 7.5 | 15.6 ± 3.2 | 27.3 ± 1.8 | |
| pMCI | 66/90 | 74.5 ± 6.9 | 15.8 ± 2.9 | 26.7 ± 1.7 | |
| AD | 97/96 | 75.7 ± 7.7 | 14.7 ± 3.2 | 23.3 ± 2.1 | |
| ADNI‐2 | NC | 95/77 | 72.9 ± 6.1 | 16.6 ± 2.6 | 29.1 ± 1.2 |
| sMCI | 95/114 | 71.5 ± 7.3 | 16.3 ± 2.7 | 28.2 ± 1.7 | |
| pMCI | 19/22 | 71.8 ± 7.2 | 15.9 ± 3.4 | 26.8 ± 1.7 | |
| AD | 66/89 | 74.9 ± 8.0 | 15.8 ± 2.9 | 23.1 ± 2.1 |
Abbreviations: MMSE, mini‐mental state examination; NC, normal control; pMCI, progressive MCI; sMCI, stable mild cognitive impairment (MCI; Folstein, Folstein, & McHugh, 1975).
Results for AD classification (AD vs. NC) using baseline sMRI. The models were trained on the ADNI‐1 dataset and evaluated on the ADNI‐2 dataset
| Model | Params | ACC × 100% (std.) | SEN × 100% (std.) | SPE × 100% (std.) | AUC (std.) |
|---|---|---|---|---|---|
| 3D ResNet | 3.2011 M | 84.10 (0.33) | 76.13 (2.97) | 91.28 (2.27) | 0.9209 (0.0051) |
| nH‐FCN | 3.1286 M | 86.36 (0.24) | 85.94 (1.49) | 86.75 (1.78) | 0.9255 (0.0024) |
| a. BB + GAP | 0.7103 M | 76.94 (0.63) | 70.32 (4.72) | 82.91 (4.54) | 0.8384 (0.0073) |
| b. BB + 1 × A2‐block + GAP | 0.7224 M | 88.13 (1.12) | 84.26 (2.99) | 91.63 (1.86) | 0.9293 (0.0062) |
| c. BB + 1 × A2‐block + Bili | 0.7244 M | 88.62 (0.49) | 84.26 (1.94) | 92.56 (1.35) | 0.9291 (0.0037) |
| d. BB + 2 × A2‐block + Bili | 0.7756 M | 88.81 (0.37) | 85.42 (2.02) | 91.86 (1.33) | 0.9271 (0.0044) |
| e. BB + pA‐blocks (S) + Bili | 0.7367 M | 89.66 (0.23) | 86.58 (1.25) | 92.44 (0.97) | 0.9303 (0.0076) |
| f. BB + pA‐blocks (S‐48) + Bili | 0.7515 M | 88.93 (0.94) | 84.39 (2.78) |
| 0.9282 (0.0035) |
| g. BB + pA‐blocks (A) + Bili | 0.7510 M |
|
| 92.44 (1.22) |
|
Notes: “BB” refers to the backbone network; “GAP” refers to global average pooling; “Bili” refers to bilinear pooling; “(S‐48)” refers to the symmetric pA‐blocks with 48 output channels; “(S)” refers to the symmetric pA‐blocks with 32 output channels; “(A)” refers to the asymmetric pA‐blocks. “Params” refers to the number of parameters (weights, in millions); “std.” refers to standard deviation.
Significantly different from the non‐bold values (p <.05, t‐test).
Implemented from scratch and tested under the same experimental settings.
Results for AD classification (AD vs. NC) on the ADNI‐1 dataset, with the models trained on the ADNI‐2 dataset
| Model | ACC × 100% (std.) | SEN × 100% (std.) | SPE × 100% (std.) | AUC (std.) |
|---|---|---|---|---|
| 3D ResNet | 77.59 (0.97) | 81.76 (1.68) | 73.96 (2.91) | 0.8627 (0.0127) |
| nH‐FCN | 84.05 (0.78) | 87.05 (2.15) | 81.44 (2.47) | 0.9088 (0.0019) |
| BB + pA‐blocks (S) + Bili | 86.84 (0.45) | 84.97 (1.35) |
| 0.9209 (0.0054) |
| BB + pA‐blocks (A) + Bili |
|
| 85.59 (1.61) |
|
Notes: “BB” refers to the backbone network; “Bili” refers to bilinear pooling; “(S)” refers to the symmetric pA‐blocks; “(A)” refers to the asymmetric pA‐blocks; “std.” refers to standard deviation.
Significantly different from the non‐bold values (p < .05, t‐test).
Implemented from scratch and tested under the same experimental settings.
Results for MCI conversion prediction (pMCI vs. sMCI) using baseline sMRI
| Model | ACC × 100% (std.) | SEN × 100% (std.) | SPE × 100% (std.) | AUC (std.) |
|---|---|---|---|---|
| a. BB + pA‐blocks (S) + Bili (scratch) | 73.92 (1.46) | 48.78 (9.51) | 78.85 (3.12) | 0.6991 (0.0063) |
| b. BB + pA‐blocks (A) + Bili (scratch) | 74.96 (2.72) | 52.19 (10.9) | 79.43 (5.13) | 0.7139 (0.0142) |
| c. BB + pA‐blocks (S) + Bili (FC) | 76.32 (0.16) | 54.64 (1.20) | 80.57 (0.24) | 0.7493 (0.0009) |
| d. BB + pA‐blocks (A) + Bili (FC) | 75.04 (0.32) |
| 78.09 (0.47) | 0.7750 (0.0003) |
| 3D ResNet | 79.12 (0.69) | 43.90 (3.45) |
| 0.7240 (0.0118) |
| nH‐FCN | 78.48 (0.64) | 52.20 (2.49) | 83.64 (1.06) | 0.7620 (0.0067) |
| e. BB + pA‐blocks (S) + Bili (pA + FC) | 77.36 (0.60) | 53.17 (0.98) | 82.11 (0.78) | 0.7471 (0.0024) |
| f. BB + pA‐blocks (A) + Bili (pA + FC) |
| 54.64 (1.20) | 84.12 (0.93) |
|
Notes: “BB” refers to the backbone network; “Bili” refers to bilinear pooling; “(S)” refers to the symmetric pA‐blocks; “(A)” refers to the asymmetric pA‐blocks; “std.” refers to standard deviation; “(scratch)” refers to the models trained from scratch; “(FC)” refers to the AD classification models with the last fully connected (FC) layer fine‐tuned; “(pA + FC)” refers to the AD classification models with both the pA‐blocks and the FC layer fine‐tuned.
Significantly different from the non‐bold values (p < .05, t‐test).
Implemented from scratch and tested under the same experimental settings.
FIGURE 3Heat maps highlighting discriminative regions of AD classification (subjects with AD). The rows correspond to heat maps generated using different feature maps. The areas with warmer colors have higher discriminative contributions
FIGURE 4Heat maps highlighting discriminative regions of MCI conversion prediction (subjects with pMCI). The rows correspond to heat maps generated using different feature maps. The areas with warmer colors have higher discriminative contributions
A summary of state‐of‐the‐art sMRI‐based studies for AD classification and MCI conversion prediction
| Reference | Methodology | Feature engineering or prior knowledge | Subjects | AD versus NC | pMCI versus sMCI | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NC | sMCI | pMCI | AD | ACC (%) | SEN (%) | SPE (%) | AUC | ACC (%) | SEN (%) | SPE (%) | AUC | |||
| Eskildsen et al. ( | Cortical thickness + LDA | Feature engineering | 226 | 227 | 161 | 194 | 84.5 | 79.4 | 88.9 | 0.905 | 67.3 | 65.8 | 68.3 | 0.685 |
| Liu et al. ( | Gray matter density map + ensemble SVM | Feature engineering | 128 | 117 | 117 | 97 | 93.1 | 94.9 | 90.5 | 0.958 | 79.3 | 88.0 | 75.5 | 0.834 |
| Zhang et al. ( | Morphological feature + SVM | Feature engineering | 229 | ‐ | ‐ | 199 | 83.7 | 80.9 | 86.7 | ‐ | ‐ | ‐ | ‐ | ‐ |
| Moradi et al. ( | Gray matter density map + semi‐supervised classifier | Feature engineering | 231 | 100 | 164 | 200 | ‐ | ‐ | ‐ | ‐ | 74.7 | 88.9 | 51.6 | 0.766 |
| Suk et al. ( | sMRI patches + deep Boltzmann machine + SVM | ‐ | 101 | 128 | 76 | 93 | 92.4 | 91.5 | 94.6 | 0.970 | 72.4 | 36.7 | 91.0 | 0.734 |
| Lin, Tong, et al. ( | sMRI patches + CNN + PCA + Lasso + ELM | Prior knowledge | 229 | 139 | 169 | 188 | 88.8 | ‐ | ‐ | ‐ | 76.9 | 81.7 | 71.2 | 0.829 |
| Liu et al. ( | sMRI patches + CNN | Feature engineering | 452 | 465 | 205 | 404 | 91.1 | 88.1 | 93.5 | 0.959 | 76.9 | 42.1 | 82.4 | 0.776 |
| Lian et al. ( | sMRI patches + CNN | Prior knowledge | 429 | 465 | 205 | 358 | 90.3 | 82.4 | 96.5 | 0.951 | 80.9 | 52.6 | 85.4 | 0.781 |
| Khvostikov et al. ( | Hippocampal sMRI + CNN | Prior knowledge | 58 | ‐ | ‐ | 48 | 85.0 | 88.0 | 90.0 | ‐ | ‐ | ‐ | ‐ | ‐ |
| Korolev et al. ( | Whole brain sMRI + CNN | ‐ | 61 | ‐ | ‐ | 50 | 80.0 | ‐ | ‐ | 0.870 | ‐ | ‐ | ‐ | ‐ |
| Spasov et al. ( | Whole brain sMRI + CNN | ‐ | 184 | 228 | 181 | 192 | ‐ | ‐ | ‐ | ‐ | 72.0 | 63.0 | 81.0 | 0.790 |
| Ours | Whole brain sMRI + CNN | ‐ | 394 | 401 | 197 | 348 | 90.7 | 88.8 | 92.4 | 0.936 | 79.3 | 54.6 | 84.1 | 0.776 |