| Literature DB >> 31547250 |
Grigorios Tsagkatakis1,2, Anastasia Aidini3,4, Konstantina Fotiadou5,6, Michalis Giannopoulos7,8, Anastasia Pentari9,10, Panagiotis Tsakalides11,12.
Abstract
Deep Learning, and Deep Neural Networks in particular, have established themselves as the new norm in signal and data processing, achieving state-of-the-art performance in image, audio, and natural language understanding. In remote sensing, a large body of research has been devoted to the application of deep learning for typical supervised learning tasks such as classification. Less yet equally important effort has also been allocated to addressing the challenges associated with the enhancement of low-quality observations from remote sensing platforms. Addressing such channels is of paramount importance, both in itself, since high-altitude imaging, environmental conditions, and imaging systems trade-offs lead to low-quality observation, as well as to facilitate subsequent analysis, such as classification and detection. In this paper, we provide a comprehensive review of deep-learning methods for the enhancement of remote sensing observations, focusing on critical tasks including single and multi-band super-resolution, denoising, restoration, pan-sharpening, and fusion, among others. In addition to the detailed analysis and comparison of recently presented approaches, different research avenues which could be explored in the future are also discussed.Entities:
Keywords: convolutional neural networks; deep learning; denoising; earth observations; fusion; generative adversarial networks; pan-sharpening; satellite imaging; super-resolution
Year: 2019 PMID: 31547250 PMCID: PMC6767260 DOI: 10.3390/s19183929
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1A typical CNN architecture for remote sensing image enhancement featuring convolutional and nonlinear activation layers with residual and skip connections.
Figure 2Hourglass shaped CNN architecture.
Figure 3Typical structure of a single-layer autoencoder: This one-hidden layer structure learns the best possible compressed representation, so that the output is as close as possible to the input.
Figure 4A (conditional) Generative Adversarial Network architecture composed of a Generator and a discriminator network for image quality enhancement.
Listing of representative super-resolution approaches.
| Methods | Observation Type | Approach |
|---|---|---|
| [ | Single-Image/band | single-scale CNN |
| [ | Single-Image/band | multiscale CNN |
| [ | Single Image | GAN |
| [ | HS/MS | multiscale CNN (LPN) |
| [ | HS/MS | multiple loss CNN |
| [ | HS/MS | band-specific CNN |
| [ | HS/MS | 3D-CNN |
| [ | Video frames | CNN variants |
Relative performance gains (with respect to PSNR) for different DNN-based single image super-resolution methods compared to bicubic interpolation for scale factors ×2, ×3 and ×24.
| Method | Dataset | Performance Gain | ||
|---|---|---|---|---|
|
|
|
| ||
| LGCNet [ | UC Merced | +9% | +7% | +5% |
| WMCNN [ | RSSCN7 | +8% | +3% | |
| [ | UCMERCED/RSSCN7/NWPU-RESIS45 | +9% | +7% | |
| TGAN [ | UC Merced (airplanes) pre-trained on the DIV2K | +16% | ||
| DMCN [ | UC Merced | +10% | +8% | +7% |
| MRNN [ | NWPU-RESISC45 | +13% | ||
| FRDBPN [ | UC Merced | +5% | ||
| EEGAN [ | Kaggle | +14% | 4% | +13% |
Relative performance gains (with respect to mean PSNR) for different DNN-based multispectral super-resolution methods compared to bicubic interpolation for scale factors , and .
| Method | Dataset | Performance Gain | ||
|---|---|---|---|---|
|
|
|
| ||
| [ | UC Merced | +2% | ||
| MLFN [ | UC Merced | +11% | ||
| SSIN [ | UC Merced | +8% | +6% | +4% |
| 3D-FCNN [ | UC Merced | +9% | +5% | +3% |
Figure 5An exemplary CNN architecture for MS/HS super-resolution.
Listing of representative pan-sharpening approaches.
| Method | Approach |
|---|---|
| [ | AE variants |
| [ | CNN based on SRCNN architecture |
| [ | CNN with residual connections |
| [ | Inception-like CNN for multiscale feature extraction |
| [ | Target-adapted CNN |
| [ | Two-stream CNN architecture |
| [ | CNN-based pixel-level regression over multiple scales |
| [ | Two CNNs associated with spectral and spectral dimension |
| [ | Two-stream GAN-CNN architecture |
Relative performance gains (with respect to SAM) for different DNN pan-sharpening methods compared to non-DNN MTF-GLP [118] method and baseline DNN-based PNN [76] method.
| Method | Dataset | Baseline Approaches | |
|---|---|---|---|
| MTF-GLP | PNN | ||
| PUNET [ | Ikonos/WorldView-2 | - | +15%/+16% |
| DRPNN [ | QuickBird/WorldView-2 | +35%/+25% | +8%/+5% |
| MSDCNN [ | QuickBird/WorldView-2 | +35%/+26% | +7%/+5% |
| RSIFNN [ | Gaofen-1/QuickBird | - | +11%/+8% |
| L1-RL-FT [ | WorldView-2/WorldView-3 | +38%/+29% | +14%/+37% |
| BDPN [ | QuickBird/WorldView-3 | - | +25%/+ 11% |
| [ | WorldView-2 | - | +33% |
| [ | Pleiades/WorldView-2 | +50%/+26% | +46%/+20% |
| PNN [ | WorldView-2/Ikonos | +25%/+21% | - |
| PanNet [ | WorldView-3 | - | +15% |
| PSGAN [ | QuickBird/GF-1 | - | +18%/+35% |
Figure 6An exemplary two-stream GAN-CNN architecture for pan-sharpening.
Major approaches in DNN-based remote sensing observation restoration.
| Method | Observation Type | Approach |
|---|---|---|
| [ | HSI + Gaussian noise | CNN with trainable activation function |
| [ | HSI + Gaussian noise | Residual CNN with key band selection |
| [ | Aerial RGB + Gaussian noise | Multiscale CNN for learning residual noise component |
| [ | HSI + Gaussian noise | Multiscale 3D CNN using neighboring bands |
| [ | HSI + Gaussian/Poisson noise | Multiscale 3D CNN w/atrous layers |
| [ | HSI + mixed noise | 2D CNN with residual connection |
| [ | Noise & Blur | GAN architecture |
| [ | Missing spatial measurements | CNN-based fusion using auxiliary observations |
| [ | Missing temporal measurements | CNN for recovery via temporal prediction |
| [ | Cloud occlusions | GAN |
| [ | Missing MS observations | cGAN using multitemporal MS and SAR observation |
Relative performance gains (with respect to mean PSNR) for different DNN denoising methods compared to non-DNN BM4D [139] method.
| Method | Dataset | Comparison to BM4D [ |
|---|---|---|
| HDnTSDL [ | Washington DC Mall (HYDICE) | +12%/+10%/+13% (Gaussian w/ |
| SSDRN [ | PAVIA (ROSIS) | +2%/+3%/+4% (Gaussian w/ |
| SSGN [ | Pavia (ROSIS), | +20%/+17% |
| [ | Washington DC Mall (HYDICE) | +6%/+8%/+11% |
| [ | Pavia University (ROSIS), Indian Pines (AVIRIS) | +1%/+4% |
Listing on key approaches in DNN-based observation fusion.
| Method | Inputs | Objective | Approach |
|---|---|---|---|
| [ | MS and HS | spatial/spectral resolution | 3D-CNN using concatenated observations |
| [ | MS and HS | spatial/spectral resolution | Fusion using two-stream CNNs |
| [ | MS and HS | spatial/temporal resolution | CNN sub-networks fusion |
| [ | MS and SAR | NDVI estimation | CNN using concatenated observations |
| [ | RGB and SAR | registration | GAN-based framework |
Relative performance gains (with respect to PSNR) for three DNN-based MS/HS fusion approaches.
| Method | Dataset | |||
|---|---|---|---|---|
| Botswana | Washington | Pavia | ||
| PFCN + GDL [ | 35.36 | 38.38 | 41.80 | |
| 3D-CNN [ | 34.03 | 36.48 | 39.93 | |
| PNN [ | 30.30 | 28.71 | 36.51 | |