| Literature DB >> 35418566 |
Olaide N Oyelade1, Absalom E Ezugwu2, Mubarak S Almutairi3, Apu Kumar Saha4, Laith Abualigah5,6, Haruna Chiroma7.
Abstract
Deep learning (DL) models are becoming pervasive and applicable to computer vision, image processing, and synthesis problems. The performance of these models is often improved through architectural configuration, tweaks, the use of enormous training data, and skillful selection of hyperparameters. The application of deep learning models to medical image processing has yielded interesting performance, capable of correctly detecting abnormalities in medical digital images, making them surpass human physicians. However, advancing research in this domain largely relies on the availability of training datasets. These datasets are sometimes not publicly accessible, insufficient for training, and may also be characterized by a class imbalance among samples. As a result, inadequate training samples and difficulty in accessing new datasets for training deep learning models limit performance and research into new domains. Hence, generative adversarial networks (GANs) have been proposed to mediate this gap by synthesizing data similar to real sample images. However, we observed that benchmark datasets with regions of interest (ROIs) for characterizing abnormalities in breast cancer using digital mammography do not contain sufficient data with a fair distribution of all cases of abnormalities. For instance, the architectural distortion and breast asymmetry in digital mammograms are sparsely distributed across most publicly available datasets. This paper proposes a GAN model, named ROImammoGAN, which synthesizes ROI-based digital mammograms. Our approach involves the design of a GAN model consisting of both a generator and a discriminator to learn a hierarchy of representations for abnormalities in digital mammograms. Attention is given to architectural distortion, asymmetry, mass, and microcalcification abnormalities so that training distinctively learns the features of each abnormality and generates sufficient images for each category. The proposed GAN model was applied to MIAS datasets, and the performance evaluation yielded a competitive accuracy for the synthesized samples. In addition, the quality of the images generated was also evaluated using PSNR, SSIM, FSIM, BRISQUE, PQUE, NIQUE, FID, and geometry scores. The results showed that ROImammoGAN performed competitively with state-of-the-art GANs. The outcome of this study is a model for augmenting CNN models with ROI-centric image samples for the characterization of abnormalities in breast images.Entities:
Mesh:
Year: 2022 PMID: 35418566 PMCID: PMC9008034 DOI: 10.1038/s41598-022-09929-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Illustration of the generator and discriminator[11].
A summary of related studies, their approaches, and application as compared with what is obtained in this study.
| Author and reference | Approach and domain of application | Comparison with this study |
|---|---|---|
| Chen et al.[ | InfoGAN: based on unsupervised learning which maximizes mutual information in a small subset of latent variables. Applied to writing styles using MNIST | RoiMammoGAN: used semi-supervised learning. Applied to breast images from mammograms using MIAS |
| Zhang et al.[ | StackGAN: stacked two GANs on each other. Applied to generating photorealistic images using CUB and Oxford-102 datasets | RoiMammoGAN: one GAN model sufficiently and accurately achieved our aim. Applied to breast images from mammograms using MIAS |
| Wang and Gupta[ | S2-GAN: composes of a style and structure GANS. Applied for generating structure and style in 2D images | RoiMammoGAN: learns the pattern and structure of abnormalities in medical images. Applied to breast images from mammograms using MIAS |
| Nguyen et al.[ | PPGNs: used probabilistic interpretation and performance gradient to generate realistic and high-resolution images | RoiMammoGAN: combined the Adam gradient algorithm and performance increment to generate images. Applied to breast images from mammograms using MIAS |
| Neff[ | WGAN: improved loss function performance using Wasserstein | RoiMammoGAN: a combination of RELU and LeakyRELU were used for computing loss function |
| Arjovsky et al.[ | WGAN: aimed to stabilize learning pattern and reducing mode collapse in GAN | RoiMammoGAN: architectural composition showed that this model overcome mode collapse |
| Gulrajani et al.[ | Used a penalization mechanism for norm gradient to overcome clipping weights. Used CIFAR-10 and LSUN bedrooms | RoiMammoGAN: the challenge of clipping weights was eliminated in our model. Applied to breast images from mammograms using MIAS |
| Mao et al.[ | LSGAN: least squares loss function was used to curtail the vanishing gradients problem | RoiMammoGAN: kernel sizes of |
| Ilya et al.[ | AdaGAN: addition of component through iterative procedure to avoid training problem | RoiMammoGAN: to eliminate complexity of learning features during training, staged-class-based learning was applied |
| Odena et al.[ | CGAN: uses label conditioning to generate high resolution images | RoiMammoGAN: adopted the label conditioning strategy in addition to label flipping |
| Pana et al | SalGAN: designed as a data-driven metric-based saliency prediction method and trained with an adversarial loss function | RoiMammoGAN: the concept of saliency map was not considered in the study |
| Wang et al | SRGAN: high resolution focused GAN model | RoiMammoGAN: also a high-resolution focused GAN model |
| Wu et al.[ | ciGAN: used for contextual in-filling for synthesizing lesions. Applied to mammogram patches | RoiMammoGAN: uses the class label to condition the learning and training process. Applied to mammogram ROIs |
| Guibas et al | two-stage pipeline and pair-based GAN for medical image synthesis | RoiMammoGAN: one-stage and single-based GAN for breast cancer mammography image synthesis |
| Kazuhiro et al | DCGAN: based on deep convolutional. Applied to magnetic resonance (MR) images | RoiMammoGAN: based on deep convolutional-transpose network. Applied to breast images from mammograms using MIAS |
| Brock et al | BigGAN: class-conditioning and orthogonal regularization was used in the generator to achieve fidelity and variety | RoiMammoGAN: class-label guided approach was used |
| Chen et al.[ | Combined conditional and unconditional GANs with adversarial training and self-supervision | RoiMammoGAN: based on conditional GAN with adversarial training and semi-supervision |
| Shaham et al | SinGAN: unconditional GAN capable of learning from a single natural image without an accompanying label | RoiMammoGAN: learns from batch of images using conditional GAN approach |
| Wu et al | GP-GAN: leverage the strengths of the classical gradient-based for GAN | RoiMammoGAN: Adam gradient-based approach was used |
| Yi et al | A GAN can help explore and discover the underlying structure of medical images | RoiMammoGAN: can detect the structure of abnormalities in a digital mammogram (medical images) |
| Wu et al | U-net-based GAN was designed to generate lesions on mammograms | RoiMammoGAN: was also designed to generate lesions on mammograms |
| Oyelade and Ezugwu[ | ArchGAN: capable of synthesizing mammograms with only architectural distortion | RoiMammoGAN: an advanced model of the ArchGAN |
Figure 2Illustration of generator and discriminator[11].
Figure 3An illustration of how GAN is trained[52].
Figure 4The proposed ROImammoGAN framework with the image preprocessing component and the GAN consisting of a generator and discriminator.
Figure 5The generator (G) proposed in the ROImammoGAN and accepts input of size 100-dimension.
Figure 6The discriminator () accepts inputs of either a real image from ⅆ or fake from G and outputs the probability [between 0.0 and 1.0], indicating when an input is either real or fake.
Generator architecture: we adopted the input noise vector of dimensionality 100 drawn from a zero-mean Gaussian distribution.
| Input projection | Layer1 | Layer2 | Layer3 | Layer4 | Layer5 | Layer6 | |
|---|---|---|---|---|---|---|---|
| Type | Fully connected | Fractionally strided convolution | Fractionally strided convolution | Fractionally strided convolution | Fractionally strided convolution | Fractionally strided convolution | Fractionally strided convolution |
| Input | [1 × 100] | [4 × 4 × 1024] | [8 × 8 × 512] | [16 × 16 × 256] | [32 × 32 × 128] | [64 × 64 × 64] | [128 × 128 × 32] |
| Output | [4 × 4 × 1024] | [8 × 8 × 512] | [16 × 16 × 256] | [32 × 32 × 128] | [64 × 64 × 64] | [128 × 128 × 32] | [64 × 64 × 2] |
| Activation | ReLU | ReLU | ReLU | ReLU | ReLU | ReLU | TanH |
| Batch norm | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| Stride | – | 2 | 2 | 2 | 2 | 1 | – |
| Padding | – | same | Same | Same | Same | Same | Same |
| Kernel Size | – | 5 | 5 | 5 | 5 | 5 | 5 |
| Kernels | – | 1024 | 512 | 256 | 128 | 64 | 32 |
Minibatch Size: 32, Optimizer: Adaptive Moment Estimation (Adam) (η = 0.00001, β1 = 0.5, β2 = 0.999). All weights were initialized using the normal distribution initializer.
Discriminator architecture: Minibatch Size: 32; Optimizer: Adam (η = 0.0001, β1 = 0.5, β2 = 0.999).
| Layer1 | Layer2 | Layer3 | Layer4 | Layer5 | Output | |
|---|---|---|---|---|---|---|
| Type | Convolution | Convolution | Convolution | Convolution | Convolution | Full Con |
| Input | [32 × 32 × 2] | [64 × 64 × 64] | [32 × 32 × 128] | [16 × 16 × 256] | [8 × 8 × 512] | [4 × 4 × 1024] |
| Output | [64 × 64 × 64] | [32 × 32 × 128] | [16 × 16 × 256] | [8 × 8 × 512] | [4 × 4 × 1024] | [1] |
| Activation | LeakyReLU | LeakyReLU | LeakyReLU | LeakyReLU | LeakyReLU | Sigmoid |
| Batch norm | Yes | Yes | Yes | Yes | Yes | – |
| Stride | 2 | 2 | 2 | 1 | 1 | – |
| Padding | Same | Same | Same | Same | Same | – |
| Kernel Size | 5 | 5 | 5 | 5 | 5 | – |
| Kernels | 64 | 128 | 256 | 512 | 1024 | – |
Description of some benchmarked datasets used for experimentation.
| Database | No. of patients | No. of images | Cases of abnormalities | Description |
|---|---|---|---|---|
| MIAS | 161 | 322 (MLO view of images) | All forms of abnormalities (32 shows architectural distortion) | Digitised to 50 micron pixel edge, and reduced to is 200 micron pixel edge and padded/clipped so that all the images are 1024 × 1024 |
| Images include radiologist's truth-markings | ||||
| DDSM | 2620 | 10,480 (MLO and CC view of images) | All forms of abnormalities (approximately 137 shows architectural distortion) | The database has some associated patient information (like age at the time of study) and image information (like spatial resolution) |
| Images are marked with ground truth information about the locations and types of suspicious regions |
Description of datasets used for experimentation.
| Dataset | Total no. of samples/ROIs | No. of samples/ROIs with abnormalities | |
|---|---|---|---|
| Benchmark dataset | DDSM + CBIS-DDSM | The dataset contains 55,890 of which 14% are positive and the remaining 86% negative | 7824 |
| MIAS | 5136 ROIs | 536 |
Figure 7(a) Sample digital breast images with abnormalities characterized by architectural distortion from the MIAS dataset were drawn from a random batch of images during training. (b) Sample digital breast images with abnormalities characterized by asymmetry from the MIAS dataset were drawn from a random batch of images during training. (c) Sample digital breast images with abnormalities characterized by microcalcification from the MIAS dataset were drawn from a random batch of images during training. (d) Sample digital breast images with abnormalities characterized by mass from the MIAS dataset were drawn from a random batch of images during training.
Figure 8Training output on the proposed GAN model for architectural distortion showing how the generator learns in synthesizing images similar to real samples with architectural distortion.
Figure 9Training output on the proposed GAN model for asymmetry showing how the generator learns in synthesizing images similar to real samples with asymmetry.
Figure 10Training output on the proposed GAN model for microclacification showing how the generator learns in synthesizing images similar to real samples with microclacification.
Figure 11Training output on the proposed GAN model for mass showing how the generator learns in synthesizing images similar to real samples with architectural distortion mass.
Figure 12A plot of the combination of the real and generated (fake) accuracies of (a) architectural distortion, (b) asymmetry, (c) microclacification, and (d) mass during training as evaluated under 100%.
Figure 13(a) Sample images generated during the training of inputs with architectural distortion abnormality. (b) Sample images generated during the training of inputs with asymmetrical abnormalities. (c) Sample images generated during the training of inputs with microclacification abnormalities. (d) Sample images generated during the training of inputs with mass abnormality.
Quantitative comparison of the image quality analysis of ten (10) randomly selected synthesized images with architectural distortion (AD) for metrics ranging in the categories of reference-based, nonreference-based, and feature-based.
| Images | PSNR | SSIM | DSSIM | MSE | FSIM | BRISQUE | PQUE | NIQE | GS | FID |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 27.97 | 0.04 | 0.48 | 103.73 | 0.90 | 27.05 | 22.65 | 25.81 | 0.00 | 343,130.76 |
| 2 | 28.00 | 0.04 | 0.48 | 103.07 | 0.88 | 17.79 | 23.17 | 37.07 | 0.00 | 321,157.06 |
| 3 | 27.94 | 0.04 | 0.48 | 104.41 | 0.89 | 29.11 | 23.04 | 30.40 | 0.00 | 413,143.07 |
| 4 | 27.83 | 0.03 | 0.48 | 107.18 | 0.89 | 15.33 | 22.70 | 28.10 | 0.00 | 429,359.88 |
| 5 | 27.98 | 0.04 | 0.48 | 103.47 | 0.89 | 28.32 | 23.67 | 25.31 | 0.00 | 330,390.83 |
| 6 | 28.00 | 0.05 | 0.48 | 103.09 | 0.88 | 16.62 | 23.71 | 27.50 | 0.00 | 339,042.88 |
| 7 | 27.93 | 0.05 | 0.47 | 104.61 | 0.87 | 34.25 | 24.23 | 22.30 | 0.00 | 425,196.46 |
| 8 | 27.99 | 0.04 | 0.48 | 103.22 | 0.87 | 34.27 | 23.26 | 27.56 | 0.00 | 308,565.14 |
| 9 | 28.02 | 0.06 | 0.47 | 102.61 | 0.90 | 18.04 | 23.01 | 27.48 | 0.00 | 260,963.64 |
| 10 | 28.04 | 0.05 | 0.47 | 102.22 | 0.87 | 37.61 | 24.91 | 24.45 | 0.00 | 335,339.00 |
| Average | 27.97 | 0.04 | 0.48 | 103.76 | 0.88 | 25.84 | 23.44 | 27.60 | 0.00 | 350,628.87 |
Quantitative comparison of the image quality analysis of ten (10) randomly selected synthesized images with asymmetry (ASY) for metrics ranging in the categories of reference-based, nonreference-based, and feature-based.
| Images | PSNR | SSIM | DSSIM | MSE | FSIM | BRISQUE | PQUE | NIQE | GS | FID |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 27.60 | 0.74 | 0.13 | 112.95 | 0.77 | 128.64 | 59.77 | 53.52 | 0.00 | 261,029.43 |
| 2 | 27.68 | 0.71 | 0.15 | 111.05 | 0.79 | 99.24 | 62.88 | 53.10 | 0.00 | 366,081.45 |
| 3 | 27.25 | 0.77 | 0.11 | 122.58 | 0.80 | 106.95 | 61.86 | 70.46 | 0.00 | 225,235.34 |
| 4 | 27.72 | 0.80 | 0.10 | 109.92 | 0.76 | 122.02 | 62.92 | 64.81 | 0.00 | 206,818.55 |
| 5 | 27.67 | 0.73 | 0.14 | 111.30 | 0.75 | 126.44 | 59.49 | 54.14 | 0.00 | 303,943.84 |
| 6 | 27.38 | 0.73 | 0.13 | 118.84 | 0.81 | 126.58 | 62.47 | 54.82 | 0.00 | 278,837.33 |
| 7 | 28.72 | 0.66 | 0.17 | 87.29 | 0.83 | 101.89 | 57.70 | 46.08 | 0.00 | 380,301.12 |
| 8 | 27.51 | 0.75 | 0.13 | 115.31 | 0.74 | 113.56 | 62.98 | 50.48 | 0.00 | 281,589.40 |
| 9 | 27.65 | 0.75 | 0.13 | 111.68 | 0.75 | 125.46 | 58.68 | 80.70 | 0.00 | 280,155.78 |
| 10 | 27.70 | 0.79 | 0.11 | 110.34 | 0.77 | 105.03 | 61.94 | 58.10 | 0.00 | 221,111.01 |
| Average | 27.69 | 0.74 | 0.13 | 111.13 | 0.78 | 115.58 | 61.07 | 58.62 | 0.00 | 280,510.32 |
Quantitative comparison of the image quality analysis of ten (10) randomly selected synthesized images with microcalcification (CALC) for metrics ranging in the categories of reference-based, nonreference-based, and feature-based.
| Images | PSNR | SSIM | DSSIM | MSE | FSIM | BRISQUE | PQUE | NIQE | GS | FID |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 27.74 | 0.05 | 0.48 | 109.44 | 0.84 | 18.63 | 19.56 | 34.02 | 0.00 | 1,147,952.54 |
| 2 | 28.11 | 0.05 | 0.47 | 100.46 | 0.86 | 21.52 | 16.54 | 44.94 | 0.00 | 1,168,311.50 |
| 3 | 27.75 | 0.05 | 0.47 | 109.27 | 0.89 | 22.43 | 12.99 | 33.25 | 0.00 | 1,608,844.19 |
| 4 | 27.79 | 0.03 | 0.48 | 108.27 | 0.87 | 32.13 | 20.36 | 38.35 | 0.00 | 961,768.18 |
| 5 | 27.67 | 0.05 | 0.47 | 111.16 | 0.85 | 17.46 | 14.22 | 32.89 | 0.00 | 1,186,538.32 |
| 6 | 27.87 | 0.05 | 0.48 | 106.21 | 0.89 | 10.95 | 15.37 | 24.84 | 0.00 | 973,001.32 |
| 7 | 28.36 | 0.05 | 0.48 | 94.89 | 0.78 | 13.00 | 15.14 | 25.14 | 0.00 | 1,368,235.44 |
| 8 | 27.79 | 0.05 | 0.47 | 108.17 | 0.86 | 20.40 | 15.44 | 31.20 | 0.00 | 1,211,028.80 |
| 9 | 28.43 | 0.05 | 0.47 | 93.38 | 0.79 | 22.52 | 13.48 | 37.85 | 0.00 | 1,578,621.76 |
| 10 | 27.84 | 0.06 | 0.47 | 106.80 | 0.90 | 24.81 | 20.36 | 29.34 | 0.00 | 867,607.44 |
| Average | 27.93 | 0.05 | 0.48 | 104.81 | 0.85 | 20.39 | 16.35 | 33.18 | 0.00 | 1,207,190.95 |
Quantitative comparison of the image quality analysis of ten (10) randomly selected synthesized images with mass (MS) for metrics ranging in the categories of reference-based, nonreference-based, and feature-based.
| Images | PSNR | SSIM | DSSIM | MSE | FSIM | BRISQUE | PQUE | NIQE | GS | FID |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 7.61 | 0.35 | 0.33 | 1.13+04 | 0.84 | 52.10 | 69.89 | 150.22 | 0.00 | 446,829.53 |
| 2 | 10.36 | 0.38 | 0.31 | 1.13+04 | 0.80 | 52.10 | 69.89 | 150.22 | 0.00 | 481,668.03 |
| 3 | 10.53 | 0.39 | 0.31 | 1.13+04 | 0.84 | 69.52 | 69.96 | 122.82 | 0.00 | 429,579.98 |
| 4 | 8.97 | 0.37 | 0.32 | 1.13+04 | 0.83 | 52.10 | 69.89 | 150.22 | 0.00 | 462,492.35 |
| 5 | 8.34 | 0.36 | 0.32 | 1.13+04 | 0.87 | 52.10 | 69.89 | 150.22 | 0.00 | 443,899.24 |
| 6 | 8.37 | 0.36 | 0.32 | 1.13+04 | 0.83 | 52.10 | 69.89 | 150.22 | 0.00 | 465,602.81 |
| 7 | 8.54 | 0.36 | 0.32 | 1.13+04 | 0.87 | 52.10 | 69.89 | 150.22 | 0.00 | 497,179.28 |
| 8 | 5.21 | 0.30 | 0.35 | 1.13+04 | 0.90 | 52.10 | 69.89 | 150.22 | 0.00 | 571,124.48 |
| 9 | 4.11 | 0.25 | 0.38 | 1.13+04 | 0.88 | 52.10 | 69.89 | 150.22 | 0.00 | 580,729.34 |
| 10 | 10.54 | 0.39 | 0.31 | 1.13+04 | 0.87 | 69.52 | 69.96 | 122.82 | 0.00 | 448,607.70 |
| Average | 8.26 | 0.35 | 0.32 | 1.13+04 | 0.85 | 55.59 | 69.91 | 144.74 | 0.00 | 482,771.27 |
Figure 14Boxplot showing the distribution of values obtained for ten randomly selected samples of architectural distortion in computational metrics PSNR, SSIM, FSIM, BRISQUE, PQUE, and NIQE.
Figure 15Boxplot showing the distribution of values obtained for ten randomly selected samples of asymmetry in computational metrics PSNR, SSIM, FSIM, BRISQUE, PQUE, and NIQE.
Figure 16Boxplot showing the distribution of values obtained for ten randomly selected samples of microcalcification in computational metrics PSNR, SSIM, MSE, FSIM, BRISQUE, PQUE, and NIQE.
Figure 17Boxplot showing the distribution of values obtained for ten randomly selected samples of mass in computational metrics PSNR, SSIM, MSE, FSIM, BRISQUE, PQUE, and NIQE.
Figure 18Sample image outputs in 1000 iterations with architectural distortion synthesized using the fully trained generator.
Figure 19Plot of accuracy and loss values for testing the trained model on samples with (a) architectural distortion, (b) asymmetry, and (c) microcalcification.
Comparison of the performance of GAN proposed in this study with state-of-the-art GANs using metrics of reference-based category.
| Authors and references | GAN model | SSIM | DSSIM | PSNR | MSE |
|---|---|---|---|---|---|
| [ | Conditional GAN (cGAN) | 0.8960 | 0.05 | 23.65 | 313.2 |
| [ | Peceptual GAN | 0.9071 | 0.05 | 24.20 | 287 |
| [ | Style-content (SC-GAN) | 0.9046 | 0.05 | 24.12 | 282.8 |
| [ | MedGAN | 0.9160 | 0.04 | 24.62 | 264.8 |
| This study | ROImammoGAN | 0.8000 | 0.10 | 27.72 | 109.92 |
Figure 20A graphical comparison of the performance of the proposed GAN model in this study compared with similar state-of-the-art medical image GANs: (a) SSIM, (b) PSNR, and (c) MSE.