| Literature DB >> 34841040 |
Saman Motamed1,2, Patrik Rogalla3, Farzad Khalvati1,2,4.
Abstract
Successful training of convolutional neural networks (CNNs) requires a substantial amount of data. With small datasets, networks generalize poorly. Data Augmentation techniques improve the generalizability of neural networks by using existing training data more effectively. Standard data augmentation methods, however, produce limited plausible alternative data. Generative Adversarial Networks (GANs) have been utilized to generate new data and improve the performance of CNNs. Nevertheless, data augmentation techniques for training GANs are underexplored compared to CNNs. In this work, we propose a new GAN architecture for augmentation of chest X-rays for semi-supervised detection of pneumonia and COVID-19 using generative models. We show that the proposed GAN can be used to effectively augment data and improve classification accuracy of disease in chest X-rays for pneumonia and COVID-19. We compare our augmentation GAN model with Deep Convolutional GAN and traditional augmentation methods (rotate, zoom, etc.) on two different X-ray datasets and show our GAN-based augmentation method surpasses other augmentation methods for training a GAN in detecting anomalies in X-ray images.Entities:
Keywords: Data augmentation; Disease detection; Generative adversarial networks; Medical Imaging; Semi-supervised learning
Year: 2021 PMID: 34841040 PMCID: PMC8607740 DOI: 10.1016/j.imu.2021.100779
Source DB: PubMed Journal: Inform Med Unlocked ISSN: 2352-9148
Fig. 1IAGAN’s generator architecture.
Fig. 2IAGAN’s generator specific architecture breakdown.
Fig. 3Discriminator architecture.
Fig. 4Pneumonia and COVID-19 sample images from COVIDx dataset with class consistent annotations.
Fig. 5Generator’s output during training.
IAGAN augmentation.
| Normal | Pneumonia | COVID-19 | |
|---|---|---|---|
| (Training/Test) | (Training/Test) | (Training/Test) | |
| Dataset I | 0/500 | 3,765/500 | N/A |
| Augmented Dataset I | 0/500 | 19,360/500 | N/A |
| Dataset II | 7,477/589 | 4,700/589 | 0/589 |
| Augmented Dataset II | 48,708/589 | 48,708/589 | 0/589 |
DCGAN augmentation.
| Normal | Pneumonia | COVID-19 | |
|---|---|---|---|
| (Train/Test) | (Train/Test) | (Train/Test) | |
| Augmented Dataset I | 0/500 | 15,060/500 | N/A |
| Augmented Dataset II | 29,908/589 | 18,800/589 | 0/589 |
Fig. 6Traditional augmentation output sample.
Traditional augmentation.
| Normal | Pneumonia | COVID-19 | |
|---|---|---|---|
| (Train/Test) | (Train/Test) | (Train/Test) | |
| Augmented Dataset I | 0/500 | 33,885/500 | N/A |
| Augmented Dataset II | 67,293/589 | 42,300/589 | 0/589 |
AUC and -value for datasets I and II.
| No augmentation | IAGAN | DCGAN | Traditional augmentation | |
|---|---|---|---|---|
| Dataset I | 0.87 | 0.87 (p = 0.5) | 0.88 (p = 0.08) | |
| Dataset II | 0.74 | 0.75 (p = 0.43) | 0.75 (p = 0.57) |
Sensitivity, Specificity and Accuracy for datasets I and II, respectively.
| Model (Datasets I/II) | Sensitivity | Specificity | Accuracy |
|---|---|---|---|
| No augmentation | 0.80/0.67 | 0.81/0.68 | 0.80/0.67 |
| IAGAN | 0.80/ | ||
| DCGAN | 0.80/0.67 | 0.81/0.67 | 0.80/0.67 |
| Traditional augmentation | 0.80/0.68 | 0.81/0.68 | 0.80/0.68 |
Fig. 7IAGAN’s generator output at different epochs of the model training with random generated input images.