| Literature DB >> 34072625 |
Qiang Yu1,2, Feiqiang Liu1,2, Long Xiao3, Zitao Liu4, Xiaomin Yang1,2.
Abstract
Deep-learning (DL)-based methods are of growing importance in the field of single image super-resolution (SISR). The practical application of these DL-based models is a remaining problem due to the requirement of heavy computation and huge storage resources. The powerful feature maps of hidden layers in convolutional neural networks (CNN) help the model learn useful information. However, there exists redundancy among feature maps, which can be further exploited. To address these issues, this paper proposes a lightweight efficient feature generating network (EFGN) for SISR by constructing the efficient feature generating block (EFGB). Specifically, the EFGB can conduct plain operations on the original features to produce more feature maps with parameters slightly increasing. With the help of these extra feature maps, the network can extract more useful information from low resolution (LR) images to reconstruct the desired high resolution (HR) images. Experiments conducted on the benchmark datasets demonstrate that the proposed EFGN can outperform other deep-learning based methods in most cases and possess relatively lower model complexity. Additionally, the running time measurement indicates the feasibility of real-time monitoring.Entities:
Keywords: convolutional neural networks; deep learning; environment research; image super-resolution; lightweight model; real-time
Mesh:
Year: 2021 PMID: 34072625 PMCID: PMC8199203 DOI: 10.3390/ijerph18115890
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Visualization of some feature maps generated by the first residual block of RCAN. The feature map pairs connected by curves have strong similarity.
Concise description of symbols in the paper.
| Symbols | Description |
|---|---|
|
| The extracted primary features |
|
| LR images |
|
| Function of FEM |
|
| Convolution layer |
|
| Function of |
|
| Output of |
|
| Concatenation operation in channel-wise |
|
| |
|
| Fused features |
|
| Function of reconstruction block |
|
| Bicubic interpolation |
|
| Function of proposed EFGN |
|
| SR image |
|
| HR image |
|
| Input of |
|
| |
|
| |
|
| Refined features of |
Figure 2Network structure of proposed efficient feature generating network (EFGN).
Figure 3(a) The architecture of SIRU. (b) An illustration of efficient feature generating block.
Figure 4Trade-off between the model complexity and reconstruction performance with scale factor ×4 on Set5. The number after each method indicates the computational costs.
Figure 5The trade-off between the running time and PSNR with scale factor ×4 on Set5.
Results of different parameter setting in SIRU on Set5 dataset with scaling factor 4.
| EFGN_S | EFGN_B | EFGN_L | |
|---|---|---|---|
| Parameters | 464 K | 960 K | 1603 K |
| PSNR | 32.00 | 32.13 | 32.18 |
Figure 6Converge analysis of EFGN with different number of LRM. The parameter of each model is also listed. The results are evaluated on Set5 dataset with ×4 scale.
Figure 7The structure of modified SIRU. 64, 128, 256 represent the channel numbers of feature maps.
Investigations of SIRU scheme and EFGB module for scale factor on Set5 dataset. w/ and w/o represent with and without, respectively.
| Model | SIRU | EFGB | Parameters | PSNR |
|---|---|---|---|---|
| EFGN_NS | w/o | w/ | 1303 K | 32.08 |
| EFGN_NE | w/ | w/o | 887 K | 31.98 |
| EFGN | w/ | w/ | 960 K | 32.13 |
Figure 8Visualization of the feature map generated by the first EFGB in the first LRM.
Quantitative results for scale factor , and on benchmarks. Best and second best results are bold and underlined.
| Method | Scale | Params | MAC | Set5 PSNR/SSIM | Set14 PSNR/SSIM | B100 PSNR/SSIM | Urban100 PSNR/SSIM |
|---|---|---|---|---|---|---|---|
| Bicubic | 2 | - | - | 33.65/0.9299 | 30.34/0.8688 | 29.56/0.8431 | 26.88/0.8403 |
| SRCNN [ | 2 | 57 K | 52.7 G | 36.66/0.9542 | 32.45/0.9067 | 31.36/0.8879 | 29.50/0.8946 |
| FSRCNN [ | 2 | 12 K | 6.0 G | 37.00/0.9558 | 32.63/0.9088 | 31.53/0.8920 | 29.88/0.9020 |
| VDSR [ | 2 | 665 K | 612.6 G | 37.53/0.9587 | 33.03/0.9124 | 31.90/0.8960 | 30.76/0.9140 |
| DRCN [ | 2 | 1774 K | 17,974.3 G | 37.63/0.9588 | 33.04/0.9118 | 31.85/0.8942 | 30.75/0.9133 |
| LapSRN [ | 2 | 813 K | 29.9 G | 37.52/0.9591 | 32.99/0.9124 | 31.80/0.8952 | 30.41/0.9103 |
| DRRN [ | 2 | 297 K | 6796.9 G | 37.74/0.9591 | 33.23/0.9136 | 32.05/0.8973 | 31.23/0.9188 |
| MemNet [ | 2 | 677 K | 2662.4 G | 37.78/0.9597 | 33.28/0.9142 | 32.08/0.8978 | 31.31/0.9195 |
| IDN 1 [ | 2 | 579 K | 124.6 G | 31.95/0.9266 | |||
| CARN [ | 2 | 1592 K | 222.8 G | 37.76/0.9590 | 33.52/0.9166 | 32.09/0.8978 | 31.92/0.9256 |
| EFGN(Ours) | 2 | 939 K | 216 G | ||||
| Bicubic | 3 | - | - | 30.39/0.8682 | 27.55/0.7742 | 27.21/0.7385 | 24.46/0.7349 |
| SRCNN [ | 3 | 57 K | 52.7 G | 32.75/0.9090 | 29.30/0.8215 | 28.41/0.7863 | 26.24/0.7989 |
| FSRCNN [ | 3 | 12 K | 5.0 G | 33.18/0.9140 | 29.37/0.8240 | 28.53/0.7910 | 26.43/0.8080 |
| VDSR [ | 3 | 665 K | 612.6 G | 33.66/0.9213 | 29.77/0.8314 | 28.82/ 0.7976 | 27.14/0.8279 |
| DRCN [ | 3 | 1774 K | 17,974.3 G | 33.82/0.9226 | 29.76/0.8311 | 28.80/0.7963 | 27.15/0.8276 |
| DRRN [ | 3 | 297 K | 6796.9 G | 34.03/0.9244 | 29.96/0.8349 | 28.95/0.8004 | 27.53/0.8378 |
| MemNet [ | 3 | 677 K | 2662.4 G | 34.09/0.9248 | 30.00/0.8350 | 28.96/0.8001 | 27.56/0.8376 |
| IDN 1 [ | 3 | 588 K | 56.3 G | 34.24/ | 30.27/ | 29.03/ | |
| CARN [ | 3 | 1592 K | 118.8 G | 27.38/0.8404 | |||
| EFGN(Ours) | 3 | 948 K | 96.7 G | ||||
| Bicubic | 4 | - | - | 28.42/0.8104 | 26.00/0.7027 | 25.96/0.6675 | 23.14/0.6577 |
| SRCNN [ | 4 | 57 K | 52.7 G | 30.48/0.8628 | 27.50/0.7513 | 26.90/0.7101 | 24.52/0.7221 |
| FSRCNN [ | 4 | 12 K | 4.6 G | 30.71/0.8657 | 27.59/0.7535 | 26.98/0.7150 | 24.62/0.7280 |
| VDSR [ | 4 | 665 K | 612.6 G | 31.35/0.8838 | 28.01/0.7674 | 27.29/0.7251 | 25.18/0.7524 |
| DRCN [ | 4 | 1774 K | 17,974.3 G | 31.53/0.8854 | 28.02/0.7670 | 27.23/0.7233 | 25.14/0.7510 |
| LapSRN [ | 4 | 813 K | 149.4 G | 31.54/0.8852 | 28.09/0.7700 | 27.32/0.7275 | 25.21/0.7562 |
| DRRN [ | 4 | 297 K | 6796.9 G | 31.68/0.8888 | 28.21/0.7720 | 27.38/0.7284 | 25.44/0.7638 |
| MemNet [ | 4 | 677 K | 2662.4 G | 31.74/0.8893 | 28.26/0.7723 | 27.40/0.7281 | 25.50/0.7630 |
| IDN 1 [ | 4 | 600 K | 32.3 G | 31.99/0.8928 | 28.52/0.7794 | 27.52/0.7339 | 25.92/0.7801 |
| CARN [ | 4 | 1592 K | 90.9 G | ||||
| EFGN(Ours) | 4 | 960 K | 55.2 G |
1 IDN refers to the results given by LatticeNet [37].
Figure 9Qualitative comparison of EFGN with other deep learning-based methods on SISR.
Figure 10Images from ImageNet CLS-LOC validation dataset.
ResNet-50 object recognition performance. The original images are served as baseline. Best results are shown in bold.
| Metric | Bicubic | RCAN [ | EFGN | Baseline |
|---|---|---|---|---|
| Top-1 error | 0.366 | 0.344 |
| 0.238 |
| Top-5 error | 0.143 | 0.136 |
| 0.066 |
Figure 11Visual comparison of RCAN with our proposed EFGN at ×4 super-resolution on images from ImageNet.