| Literature DB >> 35626467 |
Xiaole Ma1,2, Zhihai Wang1, Shaohai Hu1,2, Shichao Kan3.
Abstract
The methods based on the convolutional neural network have demonstrated its powerful information integration ability in image fusion. However, most of the existing methods based on neural networks are only applied to a part of the fusion process. In this paper, an end-to-end multi-focus image fusion method based on a multi-scale generative adversarial network (MsGAN) is proposed that makes full use of image features by a combination of multi-scale decomposition with a convolutional neural network. Extensive qualitative and quantitative experiments on the synthetic and Lytro datasets demonstrated the effectiveness and superiority of the proposed MsGAN compared to the state-of-the-art multi-focus image fusion methods.Entities:
Keywords: generative adversarial network; multi-focus image fusion; multi-scale decomposition
Year: 2022 PMID: 35626467 PMCID: PMC9140435 DOI: 10.3390/e24050582
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.738
Figure 1Multi-focus source image: (a) right-focused image; (b) left-focused image.
Figure 2The framework of image fusion based on a GAN.
Figure 3The framework of the multi-focus image fusion based on a multi-scale GAN.
Figure 4The framework of the proposed sub-image fusion based on a GAN.
The details of the generator and discriminator.
| Layer | Convolution | Normalization | Activation | ||
|---|---|---|---|---|---|
|
| Encoder | En_1 | Conv(3,64,3,1,1) | BatchNorm2d | Leaky ReLU |
| Conv(64,64,3,1,1) | BatchNorm2d | Leaky ReLU | |||
| En_2 | Conv(64,128,4,2,1 | BatchNorm2d | Leaky ReLU | ||
| Conv(128,128,3,1,1) | BatchNorm2d | Leaky ReLU | |||
| En_3 | Conv(128,256,4,2,1) | BatchNorm2d | Leaky ReLU | ||
| Conv(256,256,3,1,1) | BatchNorm2d | Leaky ReLU | |||
| En_4 | Conv(256,512,4,2,1) | BatchNorm2d | Leaky ReLU | ||
| Conv(512,512,3,1,1) | BatchNorm2d | Leaky ReLU | |||
| En_5 | Conv(512,512,4,2,1) | BatchNorm2d | Leaky ReLU | ||
| Conv(512,512,3,1,1) | BatchNorm2d | Leaky ReLU | |||
| Feature Fusion | FF | Concat(f1, f2) | - | - | |
| Decoder | De_1 | ConvT (512,512,4,2,1) | - | Leaky ReLU | |
| Conv(512, 512,3,1,1)*2 | BatchNorm2d | Leaky ReLU | |||
| De_2 | ConvT (512,256,4,2,1) | - | Leaky ReLU | ||
| Conv(256,256,3,1,1)*2 | BatchNorm2d | Leaky ReLU | |||
| De_3 | ConvT (256,128,4,2,1) | - | Leaky ReLU | ||
| Conv(128,128,3,1,1)*2 | BatchNorm2d | Leaky ReLU | |||
| De_4 | ConvT(128,64,4,2,1) | - | Leaky ReLU | ||
| Conv(64,64,3,1,1)*2 | BatchNorm2d | Leaky ReLU | |||
| De_5 | Conv(64,3,3,1,1) | - | Tanh | ||
|
| D_1 | Conv(3,64,3,1,1) | BatchNorm2d | Leaky ReLU | |
| Conv(64,64,3,1,1) | BatchNorm2d | Leaky ReLU | |||
| D_2 | Conv(64,128,4,2,1) | BatchNorm2d | Leaky ReLU | ||
| Conv(128,128,3,1,1) | BatchNorm2d | Leaky ReLU | |||
| D_3 | Conv(128,256,4,2,1) | BatchNorm2d | Leaky ReLU | ||
| Conv(256,256,3,1,1) | BatchNorm2d | Leaky ReLU | |||
| D_4 | Conv(256,512,4,2,1) | BatchNorm2d | Leaky ReLU | ||
| Conv(512,512,3,1,1) | BatchNorm2d | Leaky ReLU | |||
| D_5 | Conv(512,512,4,2,1) | BatchNorm2d | Leaky ReLU | ||
| Conv(512,512,3,1,1) | BatchNorm2d | Leaky ReLU |
Figure 5The synthetic multi-focus images: (a) multi-focus source image; (b) focus area detection map.
Figure 6The multi-focus images in the Lytro dataset.
Figure 7The fused images of “Globe”: (a) source image A; (b) source image B; (c) DWT; (d) NSCT; (e) MST-SR; (f) IFCNN; (g) ECNN; (h) MsCNN; (i) proposed.
Figure 8The fused images of “Heart”: (a) source image A; (b) source image B; (c) DWT; (d) NSCT; (e) MST-SR; (f) IFCNN; (g) ECNN; (h) MsCNN; (i) proposed.
Figure 9The fused images of “Zoo”: (a) source image A; (b) source image B; (c) DWT; (d) NSCT; (e) MST-SR; (f) IFCNN; (g) ECNN; (h) MsCNN; (i) proposed.
The average metric values of the fused images of Lytro dataset.
| Information Theory Based Metrics | Image Feature Based Metrics | Human Perception Inspired Fusion | ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| DWT [ | 0.8860 | 0.3747 | 0.8274 | 0.6483 | 0.7836 | 0.7073 |
| NSCT [ | 0.9521 | 0.3782 | 0.8317 | 0.6310 | 0.7689 | 0.7238 |
| MST-SR [ | 0.9594 | 0.3852 | 0.8309 | 0.6905 | 0.8247 | 0.7515 |
| ECNN [ | 0.8877 | 0.3796 | 0.8272 | 0.6396 | 0.7842 | 0.7129 |
| IFCNN [ | 0.8580 | 0.3759 | 0.8258 | 0.6195 | 0.7665 | 0.6868 |
| MsCNN | 0.9602 | 0.3829 | 0.8309 | 0.6503 | 0.7877 | 0.7504 |
| Proposed | 0.9945 | 0.3861 | 0.8329 | 0.6727 | 0.8028 | 0.7654 |