| Literature DB >> 35785142 |
Pranjal Bhardwaj1, Prajjwal Gupta1, Thejineaswar Guhan2, Kathiravan Srinivasan1.
Abstract
Early diagnosis of retinal diseases such as diabetic retinopathy has had the attention of many researchers. Deep learning through the introduction of convolutional neural networks has become a prominent solution for image-related tasks such as classification and segmentation. Most tasks in image classification are handled by deep CNNs pretrained and evaluated on imagenet dataset. However, these models do not always translate to the best result on other datasets. Devising a neural network manually from scratch based on heuristics may not lead to an optimal model as there are numerous hyperparameters in play. In this paper, we use two nature-inspired swarm algorithms: particle swarm optimization (PSO) and ant colony optimization (ACO) to obtain TDCN models to perform classification of fundus images into severity classes. The power of swarm algorithms is used to search for various combinations of convolutional, pooling, and normalization layers to provide the best model for the task. It is observed that TDCN-PSO outperforms imagenet models and existing literature, while TDCN-ACO achieves faster architecture search. The best TDCN model achieves an accuracy of 90.3%, AUC ROC of 0.956, and a Cohen's kappa score of 0.967. The results were compared with the previous studies to show that the proposed TDCN models exhibit superior performance.Entities:
Mesh:
Year: 2022 PMID: 35785142 PMCID: PMC9246601 DOI: 10.1155/2022/3571364
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.809
Search goals and performance bounds used in this work.
| Metric |
|
|
|---|---|---|
| Accuracy | ≥74.8 | 1.5% |
| AUC ROC | ≥0.91 | 0.2 |
| Cohen's kappa | ≥0.776 | 0.02 |
ConvNet layer abbreviations used in this work.
| Abbreviation | Layer |
|---|---|
| C2D | Conv2D (convolutional layer with 2-dimensional filters) |
| BN | Batch normalization |
| DO | Dropout |
| MP | Max pooling |
| AP | Average pooling |
| DE | Dense or fully connected |
| F | Flatten |
Figure 1TDCN-ACO search using ant colony optimization.
Figure 2TDCN-PSO search using particle swarm optimization.
Hyperparameter setting for TDCN-PSO search.
| Category | Hyperparameter | Value |
|---|---|---|
| Particle swarm optimization | Number of runs ( | 5 |
| Number of iterations ( | 12 | |
| Swarm size ( | 20 | |
| Cg | 0.5 | |
| CNN architecture initialization | Minimum number of outputs from a Conv layer | 3 |
| Maximum number of outputs from a Conv layer | 256 | |
| Minimum number of neurons in a FC layer | 1 | |
| Maximum number of neurons in a FC layer | 300 | |
| Minimum size of a Conv kernel | 3 × 3 | |
| Maximum size of a Conv kernel | 7 × 7 | |
| Minimum number of layers | 3 | |
| Maximum number of layers | 20 | |
| Dropout rate | 0.5 | |
| Training | No. of epochs for the global best | 100 |
| No. of epochs for particle evaluation | 1 | |
| Bath normalize layer outputs | Yes | |
| Probability | Probability of convolutional layer | 0.7 |
| Probability of pooling layer | 0.15 | |
| Probability of fully connected layer | 0.15 |
Classes in APTOS 2019 dataset.
| Class ID | Class name | Number of samples |
|---|---|---|
| 0 | No DR | 1805 |
| 1 | Mild | 370 |
| 2 | Moderate | 999 |
| 3 | Severe | 193 |
| 4 | Proliferative DR | 295 |
Comparison of TDCN models with imagenet models and literature.
| Model | Accuracy | AUC ROC | Cohen's kappa |
|---|---|---|---|
| Inception | 73.2 | 0.91 | 0.738 |
| Xception | 74.8 | 0.87 | 0.772 |
| Resnet50 | 73.8 | 0.89 | 0.776 |
| Shaban.et al. [ | 88 | 0.930 | 0.910 |
| S. Kassani et al. [ | 83.09 | 0.950 | 0.892 |
| Taufiqurrahman et al. [ | 85 | 0.820 | 0.925 |
| Bodapati et al. [ | 84.31 | 0.970 | 0.758 |
| TDCN-ACO | 78.4 | 0.907 | 0.795 |
| TDCN-PSO | 90.3 | 0.956 | 0.967 |
Figure 3Size comparison of TDCN and imagenet models.
Figure 4Pixel histogram for imagenet samples.
Figure 5Pixel histogram for APTOS samples.
TDCN-ACO results with respect to different hyperparameter settings.
| Image | Ant | Accuracy | ROC AUC | Kappa |
|---|---|---|---|---|
| 32 | 8 | 75.7 | 0.89 | 0.710 |
| 32 | 16 | 77 | 0.905 | 0.776 |
| 64 | 8 | 76.8 | 0.888 | 0.776 |
| 64 | 16 | 78.4 | 0.907 | 0.795 |
Figure 6(a) TDCN-ACO AUC ROC curve, (b) TDCN-ACO confusion matrix, (c) TDCN-PSO AUC ROC curve, and (d) TDCN-PSO confusion matrix.
Figure 7(a) Run vs. gbest accuracy, (b) run 5 gbest train accuracy, and (c) run 5 gbest validation accuracy.