| Literature DB >> 35684871 |
Nadiah A Baghdadi1, Amer Malki2, Hossam Magdy Balaha3, Mahmoud Badawy3, Mostafa Elhosseini2,3.
Abstract
Alzheimer's disease (AD) is a chronic disease that affects the elderly. There are many different types of dementia, but Alzheimer's disease is one of the leading causes of death. AD is a chronic brain disorder that leads to problems with language, disorientation, mood swings, bodily functions, memory loss, cognitive decline, mood or personality changes, and ultimately death due to dementia. Unfortunately, no cure has yet been developed for it, and it has no known causes. Clinically, imaging tools can aid in the diagnosis, and deep learning has recently emerged as an important component of these tools. Deep learning requires little or no image preprocessing and can infer an optimal data representation from raw images without prior feature selection. As a result, they produce a more objective and less biased process. The performance of a convolutional neural network (CNN) is primarily affected by the hyperparameters chosen and the dataset used. A deep learning model for classifying Alzheimer's patients has been developed using transfer learning and optimized by Gorilla Troops for early diagnosis. This study proposes the A3C-TL-GTO framework for MRI image classification and AD detection. The A3C-TL-GTO is an empirical quantitative framework for accurate and automatic AD classification, developed and evaluated with the Alzheimer's Dataset (four classes of images) and the Alzheimer's Disease Neuroimaging Initiative (ADNI). The proposed framework reduces the bias and variability of preprocessing steps and hyperparameters optimization to the classifier model and dataset used. Our strategy, evaluated on MRIs, is easily adaptable to other imaging methods. According to our findings, the proposed framework was an excellent instrument for this task, with a significant potential advantage for patient care. The ADNI dataset, an online dataset on Alzheimer's disease, was used to obtain magnetic resonance imaging (MR) brain images. The experimental results demonstrate that the proposed framework achieves 96.65% accuracy for the Alzheimer's Dataset and 96.25% accuracy for the ADNI dataset. Moreover, a better performance in terms of accuracy is demonstrated over other state-of-the-art approaches.Entities:
Keywords: Alzheimer; artificial gorilla troops optimizer (GTO); convolutional neural network (CNN); deep learning (DL); metaheuristic optimization
Mesh:
Year: 2022 PMID: 35684871 PMCID: PMC9185328 DOI: 10.3390/s22114250
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1The anticipated number of US people above 65 with AD from 2020 to 2060.
Figure 2The major signs and symptoms of dementia.
Figure 3The characteristics of well-known AD datasets.
Figure 4The gorilla natural behaviors flowchart.
Figure 5The suggested AC-TL-GTO framework.
Figure 6Samples from each used dataset in the current study.
Figure 7The data conversion and cleaning steps applied on the ADNI dataset.
The solution indexing with the hyperparameters definitions.
| Element Index | Hyperparameter Definition |
|---|---|
| 1 | Loss function |
| 2 | Batch size |
| 3 | Dropout ratio |
| 4 | TL learning ratio |
| 5 | Weights (i.e., parameters) optimizer |
| 6 | Dimension scaling technique |
| 7 | Apply data augmentation or not |
| 8 | Rotation value (in case of data augmentation is applied) |
| 9 | Width shift value (in case of data augmentation is applied) |
| 10 | Height shift value (in case of data augmentation is applied) |
| 11 | Shear value (in case of data augmentation is applied) |
| 12 | Zoom value (in case of data augmentation is applied) |
| 13 | Horizontal flipping flag (in case of data augmentation is applied) |
| 14 | Vertical flipping flag (in case of data augmentation is applied) |
| 15 | Brightness changing range (in case of data augmentation is applied) |
The common experiments configurations.
| Configuration | Specifications |
|---|---|
| Apply Dataset Shuffling? | Yes (Random) |
| Input Image Size | |
| Hyperparameters Metaheuristic Optimizer | Artificial Gorilla Troops Optimizer (GTO) |
| Train Split Ratio | 85% to 15% (i.e., 85% for training (and validation) and 15% for testing) |
| Size of Population | 10 |
| Number of Iterations | 10 |
| Number of Epochs | 5 |
| Output Activation Function | SoftMax |
| Pretrained Models | DenseNet201, MobileNet, MobileNetV2, MobileNetV3Small, MobileNetV3Large, VGG16, VGG19, and Xception |
| Pretrained Parameters Initializers | ImageNet |
| Losses Range | Categorical Crossentropy, Categorical Hinge, KLDivergence, Poisson, Squared Hinge, and Hinge |
| Parameters Optimizers Range | Adam, NAdam, AdaGrad, AdaDelta, AdaMax, RMSProp, SGD, Ftrl, SGD Nesterov, RMSProp Centered, and Adam AMSGrad |
| Dropout Range | |
| Batch Size Range | |
| Pretrained Model Learn Ratio Range | |
| Scaling Techniques | Normalize, Standard, Min-Max, and Max-Abs |
| Apply Data Augmentation (DA) | |
| DA Rotation Range | |
| DA Width Shift Range | |
| DA Height Shift Range | |
| DA Shear Range | |
| DA Zoom Range | |
| DA Horizontal Flip Range | |
| DA Vertical Flip Range | |
| DA Brightness Range | |
| Scripting Language | Python |
| Python Major Packages | Tensorflow, Keras, Pydicom, NumPy, OpenCV, and Matplotlib |
| Working Environment | Google Colab with GPU (i.e., Intel(R) Xeon(R) CPU @ 2.00 GHz, Tesla T4 16 GB GPU, CUDA v.11.2, and 12 GB RAM) |
The confusion matrix (i.e., TP, TN, FP, and FN) for each pretrained CNN model using the “Alzheimer’s Dataset (4 class of Images)” dataset.
| Model Name | DenseNet201 | MobileNet | MobileNetV2 | MobileNetV3Small | MobileNetV3Large | VGG16 | VGG19 | Xception | Best Score | Worst Score |
|---|---|---|---|---|---|---|---|---|---|---|
| TP | 9097 | 12,360 | 7055 | 5642 | 4307 | 4317 | 7258 | 6324 | 12,360 | 4307 |
| TN | 35,317 | 37,953 | 37,792 | 34,055 | 36,976 | 36,823 | 33,922 | 35,897 | 37,953 | 33,922 |
| FP | 2963 | 423 | 608 | 4333 | 1424 | 1577 | 4478 | 2503 | 423 | 4478 |
| FN | 3663 | 432 | 5745 | 7154 | 8493 | 8483 | 5542 | 6476 | 432 | 8493 |
The best performance metrics reported by the “Alzheimer’s Dataset (4 class of Images)” dataset.
| Model Name | DenseNet201 | MobileNet | MobileNetV2 | MobileNetV3Small | MobileNetV3Large | VGG16 | VGG19 | Xception | Best |
|---|---|---|---|---|---|---|---|---|---|
| Loss | 0.891 | 0.094 | 0.702 | 1.079 | 0.926 | 0.795 | 1.102 | 0.798 | 0.094 |
| Accuracy | 71.76% | 96.65% | 66.63% | 50.20% | 57.19% | 63.57% | 60.12% | 63.78% | 96.65% |
| F1 | 71.45% | 96.65% | 56.92% | 49.20% | 39.04% | 42.59% | 58.37% | 57.90% | 96.65% |
| Precision | 71.62% | 96.69% | 63.06% | 56.68% | 52.84% | 66.83% | 60.66% | 71.19% | 96.69% |
| Recall (Sensitivity) | 71.29% | 96.62% | 55.12% | 44.09% | 33.65% | 33.73% | 56.70% | 49.41% | 96.62% |
| Specificity | 92.26% | 98.90% | 98.42% | 88.71% | 96.29% | 95.89% | 88.34% | 93.48% | 98.90% |
| AUC* | 92.52% | 99.75% | 90.26% | 80.36% | 84.91% | 84.09% | 86.17% | 88.47% | 99.75% |
| IoU* | 79.89% | 96.39% | 70.62% | 59.72% | 61.72% | 62.56% | 67.18% | 64.98% | 96.39% |
| Dice | 81.06% | 96.96% | 73.96% | 63.97% | 66.52% | 66.37% | 70.82% | 69.47% | 96.96% |
| Cosine Similarity | 75.09% | 97.21% | 75.99% | 61.93% | 68.73% | 68.12% | 68.24% | 72.25% | 97.21% |
| Youden Index | 63.55% | 95.52% | 53.53% | 32.80% | 29.94% | 29.62% | 45.04% | 42.89% | 95.52% |
| NPV * | 90.96% | 98.88% | 88.19% | 82.70% | 81.98% | 81.58% | 86.11% | 84.81% | 98.88% |
| MCC * | 63.11% | 95.54% | 54.89% | 35.85% | 34.92% | 37.58% | 45.86% | 48.89% | 95.54% |
| FBeta | 71.35% | 96.63% | 55.75% | 45.94% | 35.41% | 36.65% | 57.32% | 52.43% | 96.63% |
| FNR * | 0.287 | 0.034 | 0.449 | 0.559 | 0.664 | 0.663 | 0.433 | 0.506 | 0.034 |
| FDR * | 0.284 | 0.033 | 0.129 | 0.433 | 0.169 | 0.268 | 0.393 | 0.288 | 0.033 |
| Fallout | 0.077 | 0.011 | 0.016 | 0.113 | 0.037 | 0.041 | 0.117 | 0.065 | 0.011 |
| CC * | 0.891 | 0.094 | 0.702 | 1.079 | 0.926 | 1.086 | 0.973 | 0.798 | 0.094 |
| KLD * | 0.891 | 0.094 | 0.702 | 1.079 | 0.924 | 1.086 | 0.973 | 0.798 | 0.094 |
| Categorical Hinge | 0.506 | 0.090 | 0.593 | 0.926 | 0.852 | 0.795 | 0.802 | 0.776 | 0.090 |
| Hinge | 0.892 | 0.773 | 0.945 | 1.020 | 1.001 | 1.002 | 0.969 | 0.979 | 0.773 |
| Squared Hinge | 0.993 | 0.786 | 1.039 | 1.173 | 1.129 | 1.138 | 1.102 | 1.096 | 0.786 |
| Poisson | 0.473 | 0.274 | 0.426 | 0.520 | 0.481 | 0.521 | 0.493 | 0.449 | 0.274 |
| Logcosh Error | 0.045 | 0.006 | 0.044 | 0.071 | 0.060 | 0.063 | 0.061 | 0.055 | 0.006 |
| MAE * | 0.142 | 0.023 | 0.195 | 0.270 | 0.251 | 0.252 | 0.219 | 0.229 | 0.023 |
| Mean IoU | 0.389 | 0.424 | 0.375 | 0.375 | 0.375 | 0.375 | 0.375 | 0.375 | 0.375 |
| MSE * | 0.101 | 0.013 | 0.094 | 0.153 | 0.128 | 0.135 | 0.133 | 0.117 | 0.013 |
| MSLE * | 0.051 | 0.006 | 0.046 | 0.075 | 0.063 | 0.066 | 0.066 | 0.057 | 0.006 |
| RMSE * | 0.318 | 0.113 | 0.306 | 0.391 | 0.358 | 0.368 | 0.365 | 0.341 | 0.113 |
* AUC: Area Under Curve, IoU: Intersection over Union, NPV: Negative Predictive Value, MCC: Matthews Correlation Coefficient, FNR: False Negative Rate, FDR: False Discovery Rate, CC: Categorical Crossentropy, KLD: Kullback Leibler Divergence, MAE: Mean Absolute Error, MSE: Mean Squared Error, MSLE: Mean Squared Logarithmic Error, RMSE: Root Mean Squared Error.
The best hyperparameters for each pretrained CNN model using the “Alzheimer’s Dataset (4 class of Images)” dataset.
| Model Name | DenseNet201 | MobileNet | MobileNetV2 | MobileNetV3Small | MobileNetV3Large | VGG16 | VGG19 | Xception |
|---|---|---|---|---|---|---|---|---|
| Loss | Categorical | Categorical | Categorical | Categorical | Categorical | Categorical Hinge | Squared Hinge | Categorical Crossentropy |
| Batch Size | 44 | 12 | 20 | 28 | 4 | 16 | 40 | 40 |
| Dropout | 0.13 | 0.24 | 0.6 | 0.52 | 0.33 | 0.05 | 0.06 | 0.22 |
| TL Learn Ratio | 97 | 65 | 74 | 54 | 92 | 32 | 52 | 6 |
| Optimizer | SGD | SGD | SGD Nesterov | NAdam | RMSProp | AdaGrad | SGD | NAdam |
| Scaling Technique | Standardization | Min-Max | Standardization | Normalization | Max-Abs | Normalization | Standardization | Normalization |
| Apply Augmentation | Yes | No | Yes | Yes | Yes | Yes | Yes | No |
| Rotation Range | 13 | N/A | 5 | 33 | 4 | 4 | 21 | N/A |
| Width Shift Range | 0.05 | N/A | 0.25 | 0.05 | 0.17 | 0.07 | 0.03 | N/A |
| Height Shift Range | 0.07 | N/A | 0.23 | 0 | 0.03 | 0.02 | 0.13 | N/A |
| Shear Range | 0.2 | N/A | 0 | 0.1 | 0.02 | 0 | 0.07 | N/A |
| Zoom Range | 0.17 | N/A | 0 | 0.1 | 0.02 | 0.02 | 0.23 | N/A |
| Horizontal Flip | Yes | N/A | No | No | Yes | Yes | No | N/A |
| Vertical Flip | Yes | N/A | Yes | Yes | No | Yes | Yes | N/A |
| Brightness Range | 0.54–0.8 | N/A | 0.5–1.51 | 1.83–2.0 | 0.92–1.53 | 0.63–0.82 | 0.81–1.93 | N/A |
Figure 8Graphical summary of the performance metrics of the “Alzheimer’s Dataset (4 class of Images)” dataset.
Figure 9The confusion matrices using the “Alzheimer’s Dataset (4 class of Images)” dataset.
The confusion matrix (i.e., TP, TN, FP, and FN) for each pretrained CNN model using the “Alzheimer’s Disease Neuroimaging Initiative (ADNI)” dataset.
| Model Name | DenseNet201 | MobileNet | MobileNetV2 | MobileNetV3Small | MobileNetV3Large | VGG16 | VGG19 | Xception | Best Score | Worst Score |
|---|---|---|---|---|---|---|---|---|---|---|
| TP | 14,057 | 14,348 | 14,187 | 10,680 | 11,019 | 11,292 | 12,350 | 14,365 | 14,365 | 10,680 |
| TN | 29,351 | 29,407 | 29,439 | 27,402 | 28,027 | 27,944 | 28,425 | 29,511 | 29,511 | 27,402 |
| FP | 601 | 593 | 513 | 2598 | 1973 | 2016 | 1495 | 489 | 489 | 2598 |
| FN | 919 | 652 | 789 | 4320 | 3981 | 3688 | 2610 | 635 | 635 | 4320 |
The best performance metrics reported by the “Alzheimer’s Disease Neuroimaging Initiative (ADNI)” dataset.
| Model Name | DenseNet201 | MobileNet | MobileNetV2 | MobileNetV3Small | MobileNetV3Large | VGG16 | VGG19 | Xception | Best |
|---|---|---|---|---|---|---|---|---|---|
| Loss | 0.171 | 0.112 | 0.122 | 0.544 | 0.495 | 0.448 | 0.347 | 0.106 | 0.106 |
| Accuracy | 94.74% | 95.81% | 95.63% | 75.82% | 79.97% | 80.81% | 85.98% | 96.25% | 96.25% |
| F1 | 94.80% | 95.84% | 95.58% | 75.04% | 77.95% | 79.56% | 85.61% | 96.22% | 96.22% |
| Precision | 95.86% | 96.03% | 96.51% | 80.22% | 84.82% | 84.84% | 89.18% | 96.72% | 96.72% |
| Recall (Sensitivity) | 93.86% | 95.65% | 94.73% | 71.20% | 73.46% | 75.38% | 82.55% | 95.77% | 95.77% |
| Specificity | 97.99% | 98.02% | 98.29% | 91.34% | 93.42% | 93.27% | 95.00% | 98.37% | 98.37% |
| AUC * | 99.44% | 99.59% | 99.63% | 92.01% | 93.68% | 94.80% | 96.97% | 99.68% | 99.68% |
| IoU * | 89.56% | 96.02% | 93.45% | 76.22% | 75.11% | 76.57% | 81.72% | 94.85% | 96.02% |
| Dice | 91.78% | 96.58% | 94.69% | 79.58% | 79.15% | 80.52% | 84.91% | 95.78% | 96.58% |
| Cosine Similarity | 95.45% | 96.61% | 96.35% | 81.40% | 83.62% | 84.92% | 88.73% | 96.79% | 96.79% |
| Youden Index | 91.86% | 93.68% | 93.02% | 62.54% | 66.88% | 68.65% | 77.56% | 94.14% | 94.14% |
| NPV * | 97.02% | 97.84% | 97.42% | 86.62% | 87.94% | 88.50% | 91.68% | 97.91% | 97.91% |
| MCC * | 92.36% | 93.77% | 93.47% | 64.59% | 69.60% | 70.91% | 79.17% | 97.91% | 97.91% |
| FBeta | 94.22% | 95.72% | 95.06% | 72.62% | 75.08% | 76.95% | 83.72% | 95.94% | 95.94% |
| FNR * | 0.061 | 0.043 | 0.053 | 0.288 | 0.265 | 0.246 | 0.174 | 0.042 | 0.042 |
| FDR * | 0.041 | 0.040 | 0.035 | 0.198 | 0.152 | 0.152 | 0.108 | 0.033 | 0.033 |
| Fallout | 0.020 | 0.020 | 0.017 | 0.087 | 0.066 | 0.067 | 0.050 | 0.016 | 0.016 |
| CC * | 0.171 | 0.112 | 0.122 | 0.544 | 0.495 | 0.448 | 0.347 | 0.106 | 0.106 |
| KLD * | 0.171 | 0.112 | 0.122 | 0.544 | 0.495 | 0.448 | 0.347 | 0.106 | 0.106 |
| Categorical Hinge | 0.225 | 0.099 | 0.147 | 0.545 | 0.553 | 0.529 | 0.409 | 0.120 | 0.099 |
| Hinge | 0.749 | 0.701 | 0.720 | 0.871 | 0.875 | 0.861 | 0.818 | 0.709 | 0.701 |
| Squared Hinge | 0.777 | 0.721 | 0.742 | 0.976 | 0.970 | 0.949 | 0.884 | 0.728 | 0.721 |
| Poisson | 0.390 | 0.371 | 0.374 | 0.515 | 0.498 | 0.483 | 0.449 | 0.369 | 0.369 |
| Logcosh Error | 0.013 | 0.009 | 0.010 | 0.049 | 0.045 | 0.041 | 0.031 | 0.009 | 0.009 |
| MAE * | 0.082 | 0.034 | 0.053 | 0.204 | 0.209 | 0.195 | 0.151 | 0.042 | 0.034 |
| Mean IoU | 0.333 | 0.414 | 0.337 | 0.333 | 0.333 | 0.333 | 0.333 | 0.347 | 0.333 |
| MSE * | 0.028 | 0.020 | 0.022 | 0.105 | 0.095 | 0.088 | 0.066 | 0.019 | 0.019 |
| MSLE * | 0.014 | 0.010 | 0.011 | 0.052 | 0.047 | 0.043 | 0.033 | 0.009 | 0.009 |
| RMSE * | 0.168 | 0.142 | 0.148 | 0.324 | 0.308 | 0.296 | 0.257 | 0.139 | 0.139 |
* AUC: Area Under Curve, IoU: Intersection over Union, NPV: Negative Predictive Value, MCC: Matthews Correlation Coefficient, FNR: False Negative Rate, FDR: False Discovery Rate, CC: Categorical Crossentropy, KLD: Kullback Leibler Divergence, MAE: Mean Absolute Error, MSE: Mean Squared Error, MSLE: Mean Squared Logarithmic Error, RMSE: Root Mean Squared Error.
The best hyperparameters for each pretrained CNN model using the “Alzheimer’s Disease Neuroimaging Initiative (ADNI)” dataset.
| Model Name | DenseNet201 | MobileNet | MobileNetV2 | MobileNetV3Small | MobileNetV3Large | VGG16 | VGG19 | Xception |
|---|---|---|---|---|---|---|---|---|
| Loss | Categorical Crossentropy | Categorical Crossentropy | KLDivergence | KLDivergence | KLDivergence | KLDivergence | KLDivergence | KLDivergence |
| Batch Size | 32 | 40 | 36 | 20 | 12 | 28 | 44 | 40 |
| Dropout | 0.1 | 0.23 | 0.3 | 0.34 | 0.13 | 0.11 | 0.02 | 0.3 |
| TL Learn Ratio | 37 | 28 | 41 | 53 | 94 | 71 | 99 | 39 |
| Optimizer | AdaGrad | AdaMax | SGD Nesterov | AdaMax | AdaGrad | AdaGrad | SGD Nesterov | AdaMax |
| Scaling Technique | Standardization | Standardization | Min-Max | Standardization | Normalization | Min-Max | Standardization | Max-Abs |
| Apply Augmentation | No | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| Rotation Range | N/A | 44 | 12 | 15 | 28 | 3 | 12 | 27 |
| Width Shift Range | N/A | 0.2 | 0.17 | 0.13 | 0.09 | 0.16 | 0.22 | 0.17 |
| Height Shift Range | N/A | 0.23 | 0.16 | 0.08 | 0.25 | 0.23 | 0.23 | 0.12 |
| Shear Range | N/A | 0.07 | 0.17 | 0.06 | 0.25 | 0.23 | 0.08 | 0.21 |
| Zoom Range | N/A | 0.22 | 0.19 | 0.14 | 0.06 | 0.03 | 0.08 | 0.1 |
| Horizontal Flip | N/A | Yes | Yes | Yes | No | No | No | Yes |
| Vertical Flip | N/A | No | Yes | Yes | Yes | Yes | Yes | No |
| Brightness Range | N/A | 0.56–0.68 | 1.09–1.48 | 0.85–1.67 | 1.48–2.0 | 0.52–1.34 | 1.23–1.51 | 0.53–1.82 |
Figure 10Graphical summary of the performance metrics of the “Alzheimer’s Disease Neuroimaging Initiative (ADNI)” dataset.
Figure 11The confusion matrices using the “Alzheimer’s Disease Neuroimaging Initiative (ADNI)” dataset.
Figure 12Graphical summary of the performed work in the current study concerning the hyperparameters selection process.
Comparison between the suggested approach and related studies.
| Study | Year | Approach | Best Metric(s) |
|---|---|---|---|
| Islam and Zhang [ | 2017 | DL | 73.75% Accuracy |
| Zhang et al. [ | 2019 | Voxel-based Morphometry | 96% Accuracy |
| Martinez et al. [ | 2019 | DL + Autoencoders | 95% Accuracy |
| Saratxaga et al. [ | 2021 | DL | 93% Balanced Accuracy |
| Raees et al. [ | 2021 | DL | 90% Accuracy |
| Buvaneswari et al. [ | 2021 | DL | 95% Accuracy |
| Katabathula et al. [ | 2021 | 3D DL | 92.5% Accuracy |
| Current Study (A | 2022 | Hybrid (GTO + DL) | 96.65% Accuracy for “Alzheimer’s Dataset (4 class of Images)” and 96.25% Accuracy “Alzheimer’s Disease Neuroimaging Initiative (ADNI)” |