| Literature DB >> 35336464 |
Zhe Jiang1,2, Chen Zhao1,2, Haiyan Wang1,3.
Abstract
Underwater target classification has been an important topic driven by its general applications. Convolutional neural network (CNN) has been shown to exhibit excellent performance on classifications especially in the field of image processing. However, when applying CNN and related deep learning models to underwater target classifications, the problems, including small sample size of underwater target and low complexity requirement, impose a great challenge. In this paper, we have proposed the modified DCGAN model to augment data for targets with small sample size. The data generated from the proposed model help to improve classification performance under imbalanced category conditions. Furthermore, we have proposed the S-ResNet model to obtain good classification accuracy while significantly reducing complexity of the model, and achieve a good tradeoff between classification accuracy and model complexity. The effectiveness of proposed models is verified through measured data from sea trial and lake tests.Entities:
Keywords: convolutional neural network; generative adversarial network; underwater target classification
Mesh:
Year: 2022 PMID: 35336464 PMCID: PMC8950804 DOI: 10.3390/s22062293
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Construction of GAN.
Figure 2Block diagram of proposed underwater target classification method.
Figure 3Construction of generative model.
Specific structure of the generative model.
| Kernel | Stride | Padding | BN | Activation Function | Output | |
|---|---|---|---|---|---|---|
| Input |
| 1 | 0 | Y | ReLU |
|
| Conv1 |
| 2 | 1 | Y | ReLU |
|
| Conv2 |
| 2 | 1 | Y | ReLU |
|
| Conv3 |
| 2 | 1 | Y | ReLU |
|
| Conv4 |
| 3 | 1 | Y | Tanh |
|
Figure 4Construction of discriminative model.
Specific structure of the discriminative model.
| Kernel | Stride | Padding | BN | Activation Function | Output | |
|---|---|---|---|---|---|---|
| Input |
| 3 | 1 | Y | Leaky-ReLU |
|
| Conv1 |
| 2 | 1 | Y | Leaky-ReLU |
|
| Conv2 |
| 2 | 1 | Y | Leaky-ReLU |
|
| Conv3 |
| 2 | 1 | Y | Leaky-ReLU |
|
| Conv4 |
| 1 | 0 | Y | Sigmoid |
Figure 5The construction of S-ResNet classification model.
Figure 6The fire module of S-ResNet classification model.
Specific structure of the S-ResNet classification model.
| Input-Size | Kernel-Size | Depth |
|
|
| |
|---|---|---|---|---|---|---|
| Input |
|
| 1 | |||
| Conv1 |
|
| 1 | |||
| Conv2 |
|
| 1 | |||
| Conv3 |
|
| 1 | |||
| Maxpool |
|
| 0 | |||
| fire1 |
| 2 | 32 | 64 | 64 | |
| fire2 |
| 2 | 32 | 64 | 64 | |
| fire3 |
| 2 | 48 | 96 | 96 | |
| Maxpool4 |
|
| 0 | |||
| fire4 |
| 2 | 48 | 96 | 96 | |
| fire5 |
| 2 | 64 | 128 | 128 | |
| fire6 |
| 2 | 64 | 128 | 128 | |
| fire7 |
| 2 | 128 | 256 | 256 | |
| Maxpool8 |
|
| 0 | |||
| fire8 |
| 2 | 128 | 256 | 256 | |
| Conv4 |
|
| 1 | |||
| GAP |
|
| 0 |
Figure 7Five types of different targets.
Hyperparameter settings of modified DCGAN model.
| Hyperparameter | Value |
|---|---|
| Input-dimension | 96 |
| Batch-size | 64 |
| Epoch | 100 |
| Adam | 0.5, 0.999 |
| Learning-rate |
|
| Leaky ReLU |
|
Figure 8Generated data through proposed modified DCGAN.
Figure 9Generated data through standard DCGAN.
Comparison between modified and standard DCGAN on IS and FID Indicators.
| Model | FID | IS |
|---|---|---|
| Modified DCGAN |
|
|
| Standard DCGAN |
|
|
Number of samples in dataset with data augmentation.
| Train Set | Test Set | Total | |
|---|---|---|---|
| Target 1 | 929 | 400 | 1329 |
| Target 2 | 832 | 353 | 1185 |
| Target 3 | 890 | 387 | 1277 |
| Target 4 | 548 | 232 | 780 |
| Target 5 | 434 | 126 | 560 |
Hyperparameter settings of S-ResNet classification model.
| Hyperparameter | Value |
|---|---|
| Batch-size | 32 |
| Epoch | 30 |
| SGDM |
|
| Learning-rate |
|
| Leaky ReLU |
|
Figure 10The confusion matrix with data augmentation.
The classification performance with data augmentation.
| Target 1 | Target 2 | Target 3 | Target 4 | Target 5 | |
|---|---|---|---|---|---|
| Test Accuracy |
| ||||
| Precision |
|
|
|
|
|
| Recall |
|
|
|
| 1 |
| F1-score |
|
|
|
|
|
The classification performance without data augmentation.
| Target 1 | Target 2 | Target 3 | Target 4 | Target 5 | |
|---|---|---|---|---|---|
| Test Accuracy |
| ||||
| Precision |
|
|
|
|
|
| Recall |
|
|
|
|
|
| F1-score |
|
|
|
|
|
Figure 11Classification accuracy comparisons with and without data augmentation.
Accuracy comparisons of S-ResNet with different algorithms.
| S-ResNet | Decision Tree | KNN | Random Forest | Multiclassification SVM | |
|---|---|---|---|---|---|
| Target 1 |
|
|
|
|
|
| Target 2 |
|
|
|
|
|
| Target 3 |
|
|
|
|
|
| Target 4 |
|
|
|
|
|
| Target 5 |
|
|
|
|
|
| Overall |
|
|
|
|
|
Comparisons of classification performance with different models.
| Accuracy | Parameter (M) | Flops (G) | Epoch (s) | |
|---|---|---|---|---|
| S-ResNet |
|
|
| 50 |
| SqueezeNet |
|
|
| 44 |
| ResNet-18 |
|
|
|
|
| ResNet-34 |
|
|
|
|
| ResNet-50 |
|
|
| 72 |
| VGG-16 |
|
|
|
|
| DenseNet |
|
|
| 70 |
| AlexNet |
|
|
|
|