| Literature DB >> 33286824 |
Huan Zhao1, Tingting Li1, Yufeng Xiao1, Yu Wang1.
Abstract
Generative adversarial networks (GANs), which are a promising type of deep generative network, have recently drawn considerable attention and made impressive progress. However, GAN models suffer from the well-known problem of mode collapse. This study focuses on this challenge and introduces a new model design, called the encoded multi-agent generative adversarial network (E-MGAN), which tackles the mode collapse problem by introducing the variational latent representations learned from a variable auto-encoder (VAE) to a multi-agent GAN. The variational latent representations are extracted from training data to replace the random noise input of the general multi-agent GANs. The generator in E-MGAN employs multiple generators and is penalized by a classifier. This integration guarantees that the proposed model not only enhances the quality of generated samples but also improves the diversity of generated samples to avoid the mode collapse problem. Moreover, extensive experiments are conducted on both a synthetic dataset and two large-scale real-world datasets. The generated samples are visualized for qualitative evaluation. The inception score (IS) and Fréchet inception distance (FID) are adopted to measure the performance of the model for quantitative assessment. The results confirmed that the proposed model achieves outstanding performances compared to other state-of-the-art GAN variants.Entities:
Keywords: diversity; generative adversarial networks; mode collapsing; multi-agent generator; quality; variable auto-encoder; variational latent representations
Year: 2020 PMID: 33286824 PMCID: PMC7597127 DOI: 10.3390/e22091055
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1An illustration of the E-MGAN’s architecture.
Notation used in the model of E-MGAN.
| Notation | Definition | Notation | Definition |
|---|---|---|---|
|
| Real samples |
| Real data distribution |
|
| Random prior variable |
| Random prior distribution |
|
| Mean and variance of latent feature representations |
| Latent feature distribution |
|
| Latent feature representations |
| Output of the multi-agent generator |
|
| Number of generators in multi-agent generator. |
| Kullback–Leibler (KL) divergence |
|
| Loss of Kullback–Leibler (KL) divergence |
| Shannon entropy |
|
| Reconstruction error |
| Cross–entropy |
|
| Reconstructed (generated) samples |
| Sum function of vector elements |
|
| Generated data mode of |
| Generated sample distribution |
|
| Weight of |
| Probability that |
|
| Value function of the classifier |
| Jensen–Shannon divergence |
|
| Value function of the discriminator |
| Probability that x is a real sample |
|
| Value function of the multi-agent generator |
Figure 2An inside view of the multi-agent generator .
Figure 3Samples generated by MGAN (in the upper row) and our proposed E-MGAN (in the bottom row) trained on the 2D synthetic dataset. The red points are real samples, while the blue points are generated samples.
Figure 4Generated samples of proposed E-MGAN and MGAN trained on CIFAE-10.
Inception scores on CIFAR-10 and STL-10 datasets. All the results are made in an unsupervised manner. The higher the IS value, the better the quality of generated samples is. A dash (“–”) indicates unavailable data.
| Model | CIFAR-10 | STL-10 |
|---|---|---|
| Real data |
|
|
| WGAN [ |
| – |
| MIX+WGAN [ |
| – |
| mproved-GAN [ |
| – |
| ALI [ |
| – |
| BEGAN [ |
| – |
| MAGAN [ |
| – |
| GMAN [ |
| – |
| DCGAN [ |
|
|
| DFM [ |
|
|
| D2GAN [ |
|
|
| MGAN [ |
|
|
|
|
|
|
Figure 5Samples generated by WGAN (in the upper row) and our proposed E-MGAN (in the bottom row) trained on the 2D synthetic dataset. The red points are real samples, while the blue points are generated samples.
Figure 6Samples on the CIFAR-10 dataset. On the left are real data sampled from the CIFAR-10 dataset, and on the right are samples generated by E-MGAN trained on CIFAR-10 dataset.
Figure 7Samples generated by the proposed E-MGAN at different epochs trained on STL-10.
FIDs of different models on CIFAR-10. The lower the FID value, the better the diversity of generated samples is.
| Model | FID |
|---|---|
| DCGAN [ |
|
| DCGAN + TTUR [ |
|
| WGAN-GP [ |
|
| WGAN-GP + TTUR [ |
|
| MGAN [ |
|
|
|
|