| Literature DB >> 35109930 |
Aram You1, Jin Kuk Kim2,3, Ik Hee Ryu2,3, Tae Keun Yoo4,5.
Abstract
BACKGROUND: Recent advances in deep learning techniques have led to improved diagnostic abilities in ophthalmology. A generative adversarial network (GAN), which consists of two competing types of deep neural networks, including a generator and a discriminator, has demonstrated remarkable performance in image synthesis and image-to-image translation. The adoption of GAN for medical imaging is increasing for image generation and translation, but it is not familiar to researchers in the field of ophthalmology. In this work, we present a literature review on the application of GAN in ophthalmology image domains to discuss important contributions and to identify potential future research directions.Entities:
Keywords: Data augmentation; Deep learning; Domain transfer; Generative adversarial network; Ophthalmology image
Year: 2022 PMID: 35109930 PMCID: PMC8808986 DOI: 10.1186/s40662-022-00277-3
Source DB: PubMed Journal: Eye Vis (Lond) ISSN: 2326-0254
Fig. 1An illustration of a basic architecture of GAN (vanilla GAN) for retinal image synthesis. The generator transforms a noise vector from the distribution into a synthesized retinal image . The discriminator distinguishes the synthetic and real retinal images based on the distributions of and , respectively. The generated image samples form a distribution , which is desired to be an approximation of from real image sample, after successful training
The characteristics of typical GAN variant techniques and examples of general tasks in general medicine and ophthalmology fields
| GAN Techniques | Dataset | Characteristics | Task examples in general medicine | Task examples in ophthalmology |
|---|---|---|---|---|
| Deep convolutional GAN (DCGAN) | Images from one domain | Improved image quality using deep convolutional layers | Augmentation of CT images [ | Fundus photographs synthesis [ |
| Wasserstein GAN (WGAN) | Images from one domain | Using Wasserstein distance as a loss function | Augmentation of CT images [ Removing artifacts in CT [ | Anomaly detection [ OCT segmentation [ |
| Progressively growing GAN (PGGAN) | Images from one domain (generally high resolution) | High resolution & realistic image generation | X-ray image synthesis [ Data augmentation for cytological images [ | Data augmentation for fundus photography, retinal OCT, and ocular images [ Super-resolution of fundus photographs [ |
| StyleGAN | Images of one domain or multiple domains (unpaired images) | Disentanglement of representations (mapping features to low dimensions) | Augmentation of CT and MRI images in specific conditions [ Skin image synthesis [ | None |
| Conditional GAN (vector input models) | Images annotated by conditional variables | Image synthesis conditioned to specific variables | Super-resolution guided by a conditional variable [ | Data augmentation for retinal OCT [ Post-intervention (orbital decompression) prediction [ |
| Conditional GAN (Pix2pix and other image input models) | Paired images of two domains or classes. (Training samples should be aligned) | Supervised learning for image-to-image translation | Super-resolution for fluorescence microscopy images [ Domain transfer (CT → PET) [ Segmentation of lungs from chest X-ray [ CT image synthesis [ | Domain transfer (fundus photography → angiography) [ Retinal OCT segmentation [ Retinal vessel segmentation [ Data augmentation for fundus photography and corneal topography [ |
| Super-resolution GAN (SRGAN) | Low- and high-resolution image pairs | Adopting perceptual loss to generate super-resolved realistic images | Super-resolution for dental X-ray [ | Super-resolution for optic disc photography [ |
| Cycle-consistent GAN (CycleGAN) | Unpaired images of two domains or classes | Adopting a cycle consistency for domain transfer without any paired dataset | Manipulating breast imaging [ Data augmentation for CT and throat images [ Segmentation for cardiac ultrasound [ | Denoising for fundus photography and OCT [ Domain transfer (Ultra-widefield retinal images → classic fundus photography) [ |
| StarGAN | Unpaired images of multiple domains or classes | A single network to achieve translation of multiple domains | Domain transfer between MRI contrasts [ | None |
CT = computed tomography; GAN = generative adversarial network; MRI = magnetic resonance imaging; OCT = optical coherence tomography; PET = positron emission tomography
Fig. 2Examples of applications of GAN in ophthalmology image domains. a Post-intervention prediction for decompression surgery for thyroid ophthalmopathy [15] and anti-vascular endothelial growth factor (VEGF) therapy for neovascular age-related macular degeneration [66]. b Denoising in fundus photography [53] and peripapillary optical coherence tomography (OCT) [16]. c Super-resolution for optic nerve head photography [56]. d Domain transfer for fundus photography to angiography [62] and ultra-widefield to classic fundus photography (re-analysis in this work) [63]. e Data augmentation for ocular surface images [46] and anterior segment OCT [82]. f Segmentation for corneal sub basal nerves in in vivo confocal microscopy images [37]. Most images were generated according to publicly available datasets and the methods of each study (some cases are based on our own dataset)
Fig. 3Number of studies that were reviewed in this work grouped according to tasks and image domains. a Study objectives in the application of GAN. b Ophthalmology image domains for the use of GAN. If one study deals with two issues, it was reviewed and double-counted appropriately
Summary of literature review for image segmentation task using GAN in ophthalmology imaging domains
| Publication | Basic technique | Domain | Target | Summary |
|---|---|---|---|---|
| Iqbal et al. [ | Conditional GAN | Fundus photography | Retinal vessels | The framework achieved retinal vessels image segmentation from a small training set |
| Wang et al. [ | Conditional GAN (using PatchGAN) | Fundus photography | Optic disc and optic cup | Unsupervised domain adaptation for joint optic disc and cup segmentation using a patch-based discriminator |
| Son et al. [ | Conditional GAN (using U-Net as a generator) | Fundus photography | Retinal vessels | The GAN model segmented retinal vasculature and the optic disc using a dataset, which consisted of fundoscopic images and manual segmentation of the vessels |
| Rammy et al. [ | Conditional GAN | Fundus photography | Retinal vessels | Patch-based GAN with additional loss function to learn thin and thick vessel segmented retinal vessels with the enhanced performance |
| Park et al. [ | Conditional GAN | Fundus photography | Retinal vessels | The proposed GAN model with a multi-kernel pooling and false negative loss could segment retinal blood vessels more robustly |
| Heisler et al. [ | Pix2Pix (conditional GAN) | Peripapillary retinal OCT | Nerve fiber layer, Bruch’s membrane, choroid-sclera boundary | The use of a generative adversarial network and unlabeled data can improve the performance of segmentation for the 3D morphometric analysis of glaucomatous optic nerve head volumes |
| Yang et al. [ | Conditional GAN (topological structure-constrained) | Fundus photography | Retinal vessels | The topological structure-constrained proposed GAN model identified retinal arteries and veins via segmentation from the complex background of retinal images |
| Yang et al. [ | Conditional GAN | Fundus photography | Retinal vessels | To separate blood vessels from fundus image, the model could detect more tiny vessels and locate the edge of blood vessels more accurately |
| Zhao et al. [ | Conditional GAN (with a large receptive field) | Fundus photography | Retinal vessels | The proposed retinal vessel segmentation algorithm using GAN with a large receptive field could capture large-scale high-level semantic vessel features |
| Kadambi et al. [ | Wasserstein GAN | Fundus photography | Optic disc and optic cup | WGAN-based domain adaptation showed a better performance than baseline models for the joint optic disc-and-cup segmentation in fundus images |
| Bian et al. [ | Conditional GAN (using U-Net as a generator) | Fundus photography | Optic disc and optic cup | The proposed method successfully performed optic disc and cup segmentation. The cup-to-disc ratio was automatically calculated with a good performance |
| Khan et al. [ | Conditional GAN | Meibography infrared images | Meibomian gland | The proposed GAN automatically identified the area of the meibomian glands and outperformed the state-of-art methods |
| Yildiz et al. [ | Conditional GAN | In vivo corneal confocal microscopy images | Sub-basal nerves | Automatic segmentation of sub-basal nerves in in vivo confocal microscopy images was performed using the U-Net and GAN-based techniques as a diagnostic tool for corneal diseases |
| Zhou et al. [ | Conditional GAN (using U-Net as a baseline architecture) | Fundus photography | Retinal vessels | The GAN model strengthened retinal vessel segmentation in the low-contrast background using a symmetric equilibrium GAN (U-Net-based), multi-scale features refine blocks and attention mechanism |
GAN = generative adversarial network; OCT = optical coherence tomography; WGAN = Wasserstein GAN
Summary of literature review for data augmentation task using GAN in ophthalmology imaging domains
| Publication | Basic technique | Domain | Summary |
|---|---|---|---|
| Diaz-Pinto et al. [ | DCGAN | Peripapillary fundus photography (optic disc photo) | DCGAN was able to generate high-quality synthetic optic disc images |
| Burlina et al. [ | PGGAN | Fundus photography | The GAN technique was used to synthesize high-resolution realistic fundus images serving as proxy data sets for use by retinal specialists and deep learning models |
| Zheng et al. [ | PGGAN | Retinal OCT (spectral domain) | The image quality of real images |
| Zhou et al. [ | Conditional GAN | Fundus photography | To generate a large amount of balanced training data, the GAN model synthesized high-resolution diabetic retinopathy fundus images which can be manipulated with arbitrary grading and lesion information |
| Wang et al. [ | Multi-channel GAN (modified vanilla GAN) | Fundus photography | The model generated a series of sub-fundus images corresponding to the scattering diabetic retinopathy features and made full use of both labeled and unlabeled data |
| He et al. [ | Label smoothing GAN (modified vanilla GAN) | Retinal OCT | The GAN model generated the synthetic unlabeled images from limited OCT training samples, and the mixing of the synthetic images and real images can be used as training data to improve the classification performance |
| Yoo et al. [ | CycleGAN | Retinal OCT | GAN generated OCT images of rare diseases from normal OCT images and increased the accuracy of diagnosing rare retinal diseases with few-shot classification |
| Kugelman et al. [ | Conditional GAN | Retinal OCT (patch level) | GAN was feasible to generate patches that are visually indistinguishable from their real variants and improved the segmentation performance |
| Zheng et al. [ | PGGAN | Anterior Segment OCT | The synthetic OCT images generated by GAN appeared to be of good quality, according to the glaucoma specialists, and the deep learning model for angle-closure detection was improved using both synthetic and real images |
| Yoo et al. [ | CycleGAN, PGGAN | Ocular surface image | To improve the diagnostic accuracy, GAN was adopted to perform data augmentation of ocular surface images with conjunctival melanoma |
| Abdelmotaal et al. [ | Pix2pix | Corneal topography (Scheimpflug images) | The synthesized images showed plausible subjectively- and objectively-assessed quality. Training deep learning with a combination of real and synthesized images showed better classification performance to detect keratoconus |
GAN = generative adversarial network; DCGAN = deep convolutional GAN; OCT = optical coherence tomography; PGGAN = progressively growing GAN
Summary of literature review for image enhancement (denoising and super-resolution) tasks using GAN in ophthalmology imaging domains
| Publication | Basic technique | Domain | Target | Summary |
|---|---|---|---|---|
| Halupka et al. [ | Modified Wasserstein GAN + perceptual loss (conditional GAN) | Retinal OCT (spectral domain) | Removing speckle noise | The GAN was used to reduce speckle artifacts in retinal OCT images. The method improved the image quality metrics for OCT |
| Mahapatra et al. [ | PGGAN with a conditional design | Fundus photography | Super-resolution | Image super-resolution using multi-stage PGGAN outperforms competing methods and baseline GANs. The super-resolved images can be used for landmark and pathology detection |
| Huang et al. [ | Conditional GAN | Retinal OCT | Super-resolution and removing noise | The GAN model effectively suppressed speckle noise and super-resolved OCT images at different scales |
| Ouyang et al. [ | Conditional GAN | Anterior Segment OCT | Removing speckle noise | The model removed undesired specular artifacts and speckle-noise patterns to improve the visualization of corneal and limbal OCT images |
| Yoo et al. [ | CycleGAN | Fundus photography | Removing artifacts and noise | The GAN model removed the artifacts automatically in a fundus photograph without matching paired images |
| Cheong et al. [ | DeshadowGAN (modified conditional GAN with perceptual loss) | Peripapillary retinal OCT (spectral domain) | Removing vessel shadow artifacts | The GAN model using manually masked artifact images and perceptual loss function removed blood vessel shadow artifacts from OCT images of the optic nerve head |
| Chen et al. [ | Conditional GAN | Peripapillary retinal OCT (spectral domain) | Removing speckle noise | The GAN model was designed for speckle noise reduction in OCT images and preserved the textural details found in OCT |
| Das et al. [ | CycleGAN | Retinal OCT | Super-resolution and removing noise | To achieve denoising and super-resolution, adversarial learning with cycle consistency was used without requiring aligned low–high resolution pairs |
| Ha et al. [ | Enhanced super-resolution GAN (SRGAN) | Peripapillary fundus photography (optic disc photo) | Super-resolution | The GAN approach was capable of 4-times up-scaling and enhancement of anatomical details using contrast, color, and brightness improvement |
| Yuhao et al. [ | CycleGAN | Fundus photography | Removing artifacts and noise | The developed model dehazed cataractous retinal images through unpaired clear retinal images and cataract images |
GAN = generative adversarial network; OCT = optical coherence tomography; PGGAN = progressively growing GAN
Summary of literature review for domain transfer task using GAN in ophthalmology imaging domains
| Publication | Basic technique | Domain | Summary |
|---|---|---|---|
| Costa et al. [ | Conditional GAN | Vessel image → Fundus photography | The study proposed a vessel network to retinal image translation framework producing simplified vessel tree and realistic retinal images by estimating latent space. Autoencoder was used to synthesize new retinal vessel images apart from training of GAN |
| Zhao et al. [ | Conditional GAN | Vessel image → Fundus photography | Retinal image synthesis can be effectively learned in a data-driven fashion from a relatively small sample size using a conditional GAN architecture |
| Yu et al. [ | Pix2pix (with ResU-net generator) (conditional GAN) | Vessel image → Fundus photography | To enlarge training datasets for facilitating medical image analysis, the multiple-channels-multiple-landmarks (MCML) was developed to synthesize color fundus images from a combination of vessel and optic disc masked images |
| Wu et al. [ | Conditional GAN | Volumetric retinal OCT → Fundus autofluorescence | The en-face OCT images were synthesized from volumetric retinal OCT by restricted summed voxel projection. The fundus autofluorescence images were generated from en-face OCT images using GAN to identify the geographic atrophy region |
| Tavakkoli et al. [ | Conditional GAN | Fundus photography → Fluorescein angiography | The proposed GAN produced anatomically accurate fluorescein angiography images that were indistinguishable from real angiograms |
| Yoo et al. [ | CycleGAN | Ultra-widefield fundus photography → Fundus photography | Ultra-widefield images were successfully translated into traditional fundus photography-style images by CycleGAN, and the main structural information of the retina and optic nerve was retained |
| Ju et al. [ | CycleGAN | Fundus photography → Ultra-widefield fundus photography | The CycleGAN model transferred the color fundus photographs to ultra-widefield images to introduce additional data for existing limited ultra-widefield images. The proposed method was adopted for diabetic retinopathy grading and lesion detection |
| Lazaridis et al. [ | Wasserstein GAN + perceptual loss (conditional GAN) | Time-domain OCT → spectral-domain OCT | Time-domain OCT was converted to synthetic spectral-domain OCT using GAN. The model improved the statistical power of the measurements when compared with those derived from the original OCT |
GAN = generative adversarial network; OCT = optical coherence tomography
Summary of literature review for post-intervention prediction task using GAN in ophthalmology imaging domains
| Publication | Basic technique | Domain | Intervention | Summary |
|---|---|---|---|---|
| Yoo et al. [ | Conditional GAN, CycleGAN | Periorbital facial images | Orbital decompression surgery | The developed model transformed preoperative facial input images into predicted postoperative images for orbital decompression for thyroid-associated ophthalmopathy |
| Liu et al. [ | Pix2pix (conditional GAN) | Retinal OCT | Intravitreal anti-vascular endothelial growth factor injection | The model generated individualized post-therapeutic OCT images that could predict the short-term response of treatment for age-related macular degeneration |
| Lee et al. [ | Conditional GAN (multi-channel inputs) | Retinal OCT (with fluorescein angiography and indocyanine green angiography) | Intravitreal anti-vascular endothelial growth factor injection | The trained model generated post-treatment optical coherence tomography (OCT) images of neovascular age-related macular degeneration |
GAN = generative adversarial network; OCT = optical coherence tomography
Summary of literature review for feature extraction task using GAN in ophthalmology imaging domains
| Publication | Basic technique | Domain | Target | Summary |
|---|---|---|---|---|
| Schlegl et al. [ | f-AnoGAN (Wasserstein GAN + latent space mapping) | Retinal OCT | Intra-retinal fluid detection (OCT anomaly detection) | The GAN based unsupervised learning of healthy training data was trained with fast mapping from images to encodings in the latent space. Anomalies were detected via a combined anomaly score based on an image reconstruction error |
| Xie et al. [ | Conditional GAN (with attention encoder and multi-branch structure) | Ultra-widefield fundus photography (scanning laser ophthalmoscopy) | Features for retinal diseases | The GAN based on the attention encoder and multi-branch structure was used to extract features for retinal disease detection. The discriminator in GAN was modified to build the classifier to detect the disease images |
GAN = generative adversarial network; OCT = optical coherence tomography
Fig. 4Examples of problems encountered using GAN techniques. a Mode collapse where the generator produces limited varieties of samples. b Spatial deformity due to small training images without spatial alignment. c Unintended changes due to the difference of data distribution between two domains. d Checker-board artifacts in synthetic images. All of the images were generated according to publicly available datasets and the standard GAN methods