| Literature DB >> 34883801 |
Jun Mao1, Change Zheng1, Jiyan Yin2, Ye Tian1, Wenbin Cui3.
Abstract
Training a deep learning-based classification model for early wildfire smoke images requires a large amount of rich data. However, due to the episodic nature of fire events, it is difficult to obtain wildfire smoke image data, and most of the samples in public datasets suffer from a lack of diversity. To address these issues, a method using synthetic images to train a deep learning classification model for real wildfire smoke was proposed in this paper. Firstly, we constructed a synthetic dataset by simulating a large amount of morphologically rich smoke in 3D modeling software and rendering the virtual smoke against many virtual wildland background images with rich environmental diversity. Secondly, to better use the synthetic data to train a wildfire smoke image classifier, we applied both pixel-level domain adaptation and feature-level domain adaptation. The CycleGAN-based pixel-level domain adaptation method for image translation was employed. On top of this, the feature-level domain adaptation method incorporated ADDA with DeepCORAL was adopted to further reduce the domain shift between the synthetic and real data. The proposed method was evaluated and compared on a test set of real wildfire smoke and achieved an accuracy of 97.39%. The method is applicable to wildfire smoke classification tasks based on RGB single-frame images and would also contribute to training image classification models without sufficient data.Entities:
Keywords: adversarial training; deep learning; domain adaptation; synthetic images; wildfire smoke classification
Mesh:
Substances:
Year: 2021 PMID: 34883801 PMCID: PMC8659729 DOI: 10.3390/s21237785
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Sample images of virtual smoke.
Figure 2Sample images of virtual wildland background.
Figure 3Samples of synthetic images.
Figure 4Samples of test images: (a) real smoke; (b) real non-smoke.
Figure 5CycleGAN-based pixel-level domain adaptation architecture.
Figure 6Pre-training phase for feature-level domain adaptation. The input source images in this phase are all labeled.
Figure 7The adversarial adaptation phase of feature-level domain adaptation.
Image datasets for pixel-level domain adaptation.
| Synthetic Images | Real Images | |
|---|---|---|
| Smoke images | 2000 | 1800 |
Image datasets for feature-level domain adaptation.
| Smoke Images | Non-Smoke Images | |
|---|---|---|
| Source images | 5000 | 5000 |
| Target images | 5000 | 5000 |
Testing set.
| Real Smoke Images | Real Non-Smoke Images | |
|---|---|---|
| Test set | 520 | 520 |
Figure 8Test phase.
Quantitative ablation results of the different domain adaptation architecture.
| CD | ED | MD | |
|---|---|---|---|
| ResNet-50 w/source images | 0.6348 | 0.3371 | 0.4712 |
| ResNet-50 w/target images | 0.6597 | 0.1988 | 0.5420 |
| ResNet-50 w/PDA | 0.7042 | 0.2989 | 0.2764 |
| ResNet-50 w/FDA(only Deep CORAL) | 0.7918 | 0.1053 | 0.1291 |
| ResNet-50 w/FDA(only ADDA) | 0.8569 | 0.1765 | 0.1138 |
| ResNet-50 w/FDA(ADDA+DeepCORAL) | 0.9242 | 0.0815 | 0.0655 |
| ResNet-50 w/PDA+FDA(ADDA+DeepCORAL) |
|
|
|
Bold indicates that the results shown in this line are the best.
Figure 9Confusion matrix of the proposed model.
Figure 10Samples of four cases: (a) true positive; (b) true negative; (c) false positive; (d) false negative.
Figure 11Examples of positive samples and negative samples: (a) wildland images of the cloud-smoke hybrid scenario; (b) non-smoke wildland images containing clouds or fog.
Comparison of model performance in the cloud-smoke hybrid scenario.
| CD | ED | MD | |
|---|---|---|---|
| ResNet-50 | 0.6598 | 0.2812 | 0.2157 |
| ResNet-50 w/PDA+FDA(ADDA+DeepCORAL) | 0.9382 | 0.0477 | 0.0534 |
Average recognition time for a single image on different devices.
| Average Recognition Time (s) | Average Recognition Time (s) | Average Recognition Time (s) | |
|---|---|---|---|
| GPU | 0.0041 | 0.0039 | 0.0038 |
| CPU | 0.0595 | 0.0586 | 0.0580 |