| Literature DB >> 35326275 |
Mingfeng Jiang1, Bin Yan1, Yang Li1, Jucheng Zhang2, Tieqiang Li3, Wei Ke4.
Abstract
Automatic and accurate classification of Alzheimer's disease is a challenging and promising task. Fully Convolutional Network (FCN) can classify images at the pixel level. Adding an attention mechanism to the Fully Convolutional Network can effectively improve the classification performance of the model. However, the self-attention mechanism ignores the potential correlation between different samples. Aiming at this problem, we propose a new method for image classification of Alzheimer's disease based on the external-attention mechanism. The external-attention module is added after the fourth convolutional block of the fully convolutional network model. At the same time, the double normalization method of Softmax and L1 norm is introduced to obtain a better classification performance and richer feature information of the disease probability map. The activation function Softmax can increase the degree of fitting of the neural network to the training set, which transforms linearity into nonlinearity, thereby increasing the flexibility of the neural network. The L1 norm can avoid the attention map being affected by especially large (especially small) eigenvalues. The experiments in this paper use 550 three-dimensional MRI images and use five-fold cross-validation. The experimental results show that the proposed image classification method for Alzheimer's disease, combining the external-attention mechanism with double normalization, can effectively improve the classification performance of the model. With this method, the accuracy of the MLP-A model is 92.36%, the accuracy of the MLP-B model is 98.55%, and the accuracy of the fusion model MLP-C is 98.73%. The classification performance of the model is higher than similar models without adding any attention mechanism, and it is better than other comparison methods.Entities:
Keywords: Alzheimer’s disease; double normalization; external-attention mechanism; fully convolutional network; image classification
Year: 2022 PMID: 35326275 PMCID: PMC8946519 DOI: 10.3390/brainsci12030319
Source DB: PubMed Journal: Brain Sci ISSN: 2076-3425
Detailed information of the experimental dataset.
| Dataset | ADNI | |
|---|---|---|
| Research object | AD | NC |
| Number of samples | 307 | 243 |
| Average age | 76.3 (57–92) | 79.4 (65–87) |
| Gender (Male/Female) | 180/127 | 101/142 |
| Average MMSE | 22.8 (19–27) | 28.6 (26–30) |
Figure 1The flowchart of image preprocessing.
Figure 2Diagram of the self-attention mechanism.
Figure 3The feature maps in the self-attention module.
Figure 4Diagram of the external-attention mechanism.
Figure 5The feature maps in the external-attention module.
Figure 6The FCN model’s framework.
Figure 7Randomly sampling 3D-MRI image’s patches for training the FCN Model.
Figure 8The CNN model’s framework.
The parameter settings of CNN model.
| Input Layer | Detailed Description | Output Size |
|---|---|---|
| Input | (1, 182, 218, 182) | |
| 3D convolutional layer1 | channel 20, kernel 7, stride 2, padding 0 | (20, 88, 106, 88) |
| 3D maxpool layer1 | kernel 3, stride 2, padding 0 | (20, 43, 52, 43) |
| 3D batch normalization1 | eps = 1 × 10−5, momentum = 0.1, affine = True | |
| Leaky ReLU1; Dropout1 | Negative slope = 0.01; | |
| 3D convolutional layer2 | channel 40, kernel 4, stride 1, padding 0 | (40, 40, 49, 40) |
| 3D maxpool layer2 | kernel 2, stride 2, padding 0 | (40, 20, 24, 20) |
| 3D batch normalization2 | eps = 1 × 10−5, momentum = 0.1, affine = True | |
| Leaky ReLU2; Dropout2 | Negative slope = 0.01; | |
| 3D convolutional layer3 | channel 80, kernel 3, stride 1, padding 0 | (80, 18, 22, 18) |
| 3D maxpool layer3 | kernel 2, stride 2, padding 0 | (80, 9, 11, 9) |
| 3D batch normalization3 | eps = 1 × 10−5, momentum = 0.1, affine = True | |
| Leaky ReLU3; Dropout3 | Negative slope = 0.01; | |
| 3D convolutional layer4 | channel 160, kernel 3, stride 1, padding 0 | (160, 7, 9, 7) |
| 3D maxpool layer4 | kernel 2, stride 1, padding 0 | (160, 6, 8, 6) |
| 3D batch normalization4 | eps = 1 × 10−5, momentum = 0.1, affine = True | |
| Leaky ReLU4; Dropout4 | Negative slope = 0.01; | |
| Flatten | (46, 80) | |
| Dropout5 | ||
| Fully connected layer1 | channel 30 | (30) |
| Leaky ReLU5; Dropout6 | Negative slope = 0.01; | |
| Fully connected layer2 | channel 2 | (2) |
Figure 9The MLP model’s framework.
The parameter settings of FCN model.
| Input Layer | Detailed Description | Output Patch Size |
|---|---|---|
| Input | (1, 47, 47, 47) | |
| 3D convolutional layer1 | channel 20, kernel 4, stride 1, padding 0 | (20, 44, 44, 44) |
| 3D maxpool layer1 | kernel 2, stride 1, padding 0 | (20, 43, 43, 43) |
| 3D batch normalization1 | eps = 1 × 10−5, momentum = 0.1, affine = True | |
| Leaky ReLU1; Dropout1 | Negative slope = 0.01; | |
| 3D convolutional layer2 | channel 40, kernel 4, stride 1, padding 0 | (40, 40, 40, 40) |
| 3D maxpool layer2 | kernel 2, stride 2, padding 0 | (40, 20, 20, 20) |
| 3D batch normalization2 | eps = 1 × 10−5, momentum = 0.1, affine = True | |
| Leaky ReLU2; Dropout2 | Negative slope = 0.01; | |
| 3D convolutional layer3 | channel 80, kernel 3, stride 1, padding 0 | (80, 18, 18, 18) |
| 3D maxpool layer3 | kernel 2, stride 2, padding 0 | (80, 9, 9, 9) |
| 3D batch normalization3 | eps = 1 × 10−5, momentum = 0.1, affine = True | |
| Leaky ReLU3; Dropout3 | Negative slope = 0.01; | |
| 3D convolutional layer4 | channel 160, kernel 3, stride 1, padding 0 | (160, 7, 7, 7) |
| 3D maxpool layer4 | kernel 2, stride 1, padding 0 | (160, 6, 6, 6) |
| 3D batch normalization4 | eps = 1 × 10−5, momentum = 0.1, affine = True | |
| Leaky ReLU4; Dropout4 | Negative slope = 0.01; | |
| Fully connected layer1 | channel 30, kernel 6, stride 1, padding 0 | (30, 1, 1, 1) |
| Leaky ReLU5; Dropout5 | Negative slope = 0.01; | |
| Fully connected layer2 | channel 2, kernel 1, stride 1, padding 0 | (2, 1, 1, 1) |
Figure 10(A) The MCC value can show the overall classification performance of the FCN model. From the MCC heatmap, it can be observed that some locations have higher MCC values (that is, these locations have higher classification accuracy). The MLP model uses these specific locations as a region of interest (ROI). (B–D) represents the MCC value of the FCN model in the individual axial, coronal and sagittal directions.
Figure 11(A) The disease probability map generated by the FCN model highlights the brain regions at high risk of Alzheimer’s disease. The first two samples were clinically diagnosed as patients with Alzheimer’s disease, and the latter two samples were clinically confirmed as normal cognitive persons. (B–D) shows the axial, coronal and sagittal disease probability map of patients who are clinically diagnosed with Alzheimer’s disease. Red indicates that the risk of Alzheimer’s disease is >0.5, and blue indicates <0.5.
Figure 12(a) The changes of the FCN models’ accuracy; (b) The changes of the MLP models’ accuracy.
Figure 13(a) No attention mechanism is added to models; (b) Models add self-attention mechanism; (c) Models add external-attention mechanism and Softmax; (d) Models add external-attention mechanism and double normalization.
Figure 14(a) Models without attention mechanism; (b) Models add self-attention mechanism; (c) Models add external-attention mechanism and Softmax; (d) Models add external-attention mechanism and double normalization.
The classification performance of the MLP models without any attention module.
| FCN | Accu | Sens | Spec | F1 | MCC |
|---|---|---|---|---|---|
| MLP-A | 0.8685 ± 0.0140 | 0.8444 ± 0.0356 | 0.8943 ± 0.0481 | 0.8693 ± 0.0131 | 0.7408 ± 0.0288 |
| MLP-B | 0.9688 ± 0.0103 | 0.9526 ± 0.0221 | 0.9642 ± 0.0175 | 0.9691 ± 0.0104 | 0.9485 ± 0.0197 |
| MLP-C | 0.9728 ± 0.0143 | 0.9643 ± 0.0180 | 0.9674 ± 0.0139 | 0.9757 ± 0.0131 | 0.9534 ± 0.0177 |
The experimental results of the CNN model and the MLP fusion model.
| Accu | Sens | Spec | F1 | MCC | |
|---|---|---|---|---|---|
| CNN | 0.8636 ± 0.0237 | 0.8875 ± 0.0153 | 0.8696 ± 0.0364 | 0.8287 ± 0.0488 | 0.7549 ± 0.0368 |
| MLP fusion model | 0.9188 ± 0.0221 | 0.9439 ± 0.0340 | 0.8918 ± 0.0280 | 0.9232 ± 0.0215 | 0.8389 ± 0.0446 |
The classification performance of the FCN model.
| Accu | MCC | |
|---|---|---|
| FCN | 0.58449 ± 0.0129 | 0.16132 ± 0.0264 |
| FCN + SA | 0.59604 ± 0.0052 | 0.18586 ± 0.0076 |
| FCN + EA + Softmax | 0.61012 ± 0.0105 | 0.20895 ± 0.0090 |
| FCN + EA + Double normalization | 0.61815 ± 0.0069 | 0.21327 ± 0.0138 |
The classification performance of the MLP models after adding the self-attention module.
| FCN + SA | Accu | Sens | Spec | F1 | MCC |
|---|---|---|---|---|---|
| MLP-A | 0.8947 ± 0.0103 | 0.8696 ± 0.0162 | 0.9107 ± 0.0134 | 0.8846 ± 0.0123 | 0.7711 ± 0.0189 |
| MLP-B | 0.9745 ± 0.0101 | 0.9649 ± 0.0248 | 0.9689 ± 0.0220 | 0.9751 ± 0.0100 | 0.9531 ± 0.0193 |
| MLP-C | 0.9788 ± 0.0113 | 0.9704 ± 0.0241 | 0.9712 ± 0.0152 | 0.9792 ± 0.0113 | 0.9584 ± 0.0219 |
The classification performance of the MLP models after adding the external-attention module and double normalization.
| FCN + EA + | Accu | Sens | Spec | F1 | MCC |
|---|---|---|---|---|---|
| MLP-A | 0.9236 ± 0.0193 | 0.9260 ± 0.0141 | 0.9356 ± 0.0163 | 0.9292 ± 0.0182 | 0.8492 ± 0.0180 |
| MLP-B | 0.9855 ± 0.0045 | 0.9902 ± 0.0080 | 0.9796 ± 0.0129 | 0.9869 ± 0.0040 | 0.9707 ± 0.0090 |
| MLP-C | 0.9873 ± 0.0069 | 0.9929 ± 0.0041 | 0.9828 ± 0.0185 | 0.9889 ± 0.0076 | 0.9749 ± 0.0130 |
The classification performance of the MLP models after adding the external-attention module and Softmax.
| FCN + EA + Softmax | Accu | Sens | Spec | F1 | MCC |
|---|---|---|---|---|---|
| MLP-A | 0.9124 ± 0.0112 | 0.9116 ± 0.0111 | 0.9234 ± 0.0156 | 0.9111 ± 0.0100 | 0.8253 ± 0.0140 |
| MLP-B | 0.9828 ± 0.0147 | 0.9867 ± 0.0089 | 0.9701 ± 0.0246 | 0.9836 ± 0.0129 | 0.9686 ± 0.0296 |
| MLP-C | 0.9851 ± 0.0099 | 0.9906 ± 0.0072 | 0.9741 ± 0.0204 | 0.9876 ± 0.0088 | 0.9721 ± 0.0198 |
Compare with the classification models of other researchers.
| Author | Type of Dataset | Methods | Number of Samples | Accuracy |
|---|---|---|---|---|
| Liu S et al. [ | MRI | Stacked auto-encoder (SAE) + region-level engineered features | 180 AD/204 NC | 0.79 |
| Shi J et al. [ | MRI | Deep Polynomial Network (DPN) | 51 AD/52 NC | 0.9076 |
| Tomassini S et al. [ | MRI | Based on long short-term memory network (LSTM) | 213 AD/214 NC | 0.86 |
| Ullah H et al. [ | MRI | Deep Convolutional Network (3D-CNN) | 416 (AD + NC) | 0.8025 |
| Hinrichs C et al. [ | MRI | SVM + Linear Program boost (LP) + voxel-level engineered features | 183 (AD + NC) | 0.82 |
| Suk H I et al. [ | MRI | Deep Boltzmann Machine | 93 AD/101 NC | 0.9238 |
| PET | 0.9220 | |||
| Our proposed methods | MRI | FCN + SA | 307 AD/243 NC | 0.9788 |
| FCN + EA + softmax | 0.9851 | |||
| FCN + EA + double normalization | 0.9873 |