| Literature DB >> 33824572 |
Syed Khurram Jah Rizvi1,2, Muhammad Ajmal Azad3, Muhammad Moazam Fraz1,4.
Abstract
The survey paper summarizes the recent applications and developments in the domain of Generative Adversarial Networks (GANs) i.e. a back propagation based neural network architecture for generative modeling. GANs is one of the most highlighted research avenue due to its synthetic data generation capabilities and benefits of representations comprehended irrespective of the application. While several reviews for GANs in the arena of image processing have been conducted by present but none have given attention on the review of GANs over multi-disciplinary domains. Therefore, in this survey, use of GAN in multidisciplinary applications areas and its implementation challenges have been done by conducting a rigorous search for journal/research article related to GAN and in this regard five renowned journal databases i.e. "ACM Digital Library"," Elsevier", "IEEE Explore", "Science Direct", "Springer" and proceedings of best domain specific conference are considered. By employing hybrid research methodology and article inclusion and exclusion criteria, 100 research articles are considered encompassing 23 application domains for the survey. In this paper applications of GAN in various practical domain and their implementation challenges its associated advantages and disadvantages have been discussed. For the first time a survey of this type have been done where GAN with wide range of application and its associated advantages and disadvantages issue have been reviewed. Finally, this article presents several diversified prominent developing trends in the respective research domain which will provide a visionary perspective regarding ongoing GANs related research and eventually help to develop an intuition for problem solving using GANs. © CIMNE, Barcelona, Spain 2021.Entities:
Year: 2021 PMID: 33824572 PMCID: PMC8017345 DOI: 10.1007/s11831-021-09543-4
Source DB: PubMed Journal: Arch Comput Methods Eng ISSN: 1134-3060 Impact factor: 7.302
Comparison of survey works for application of gans in multidisciplinary domains
| Paper author | Year | Areas covered | Focus applications | Multidisciplinary domains | Remarks |
|---|---|---|---|---|---|
| Wu [ | 2017 | 1 | Image processing and translation | Not covered | The research paper presents a survey of GANs application in image fusion and associated manipulations |
| Wang [ | 2017 | 3 | Computer vision, natural language processing and malware analysis | Very limited coverage | The paper covers GANs’ background, fundamental concepts, implementation approaches followed by applications |
| Zhang [ | 2017 | 1 | Image synthesis | Not covered | The article presents an overview about adversarial training framework and survey of related work |
| Saifuddin Hitawala [ | 2018 | 1 | Image processing | Not covered | The article study image processing model as well as its variation and their comparative analysis |
| Qiantong Xu [ | 2018 | 2 | Image processing | Not covered | It presents numerous several representative evaluation parameters and challenges related to the evaluation of those parameters |
| Zhaoqing. Pan [ | 2019 | 2 | Natural language processing and computer vision | Not covered | Discussed basic theory including the differences among different generative models as well as derived models |
| Zhengwei Wang[ | 2020 | 1 | Computer vision | Not covered | Conducted a review of GAN-variants and challenges in computer vision |
| Jie Gui [ | 2020 | 4 | Computer vision, medical and data science | Limited coverage | Covered motivations, mathematical representations and few applications |
| Proposed work | 2020 | 23 | Multidisciplinary and cross domain | Covered | We made an extensive survey of, 5 different research databases and discussed application of GAN over a portfolio of 22 diversified and multidisciplinary domains |
Fig. 1Search and selection methodology
Manuscript inclusion and exclusion criteria
| Inclusion | Exclusion |
|---|---|
| Criteria | |
| The articles publishing language is English | Articles published in any language except English |
| Studies published in and after 2016 | Articles published before 2016 |
| Work focusing GAN based framework in an applied domain | Work focusing in an applied domain but not using GAN |
| Articles published in open access journals | Articles with restricted access |
Application domains
| Image processing | Unmanned aerial vehicles (UAV’s) |
|---|---|
| Speech recognition | Simulation and modeling |
| Genetic engineering | Market prediction and forecasting |
| Drug discovery | NLP—natural language processing (NLP) |
| Health | Architectural designing |
| Fault prediction | Road Network generation and path planning |
| Agriculture | Testing and validation |
| Music | Software designing and development |
| Weather forecasting | Fake audio, video and image generation |
| Sports | Text generation |
| Internet of things (IoT) | Malware detection |
Fig. 2Article screening year distribution
Overview Index for GAN based applications
| Application | Paper | Year | Description |
|---|---|---|---|
| Image processing | Shrivastava et al. [ | 2017 | Simulated and unsupervised learning for synthetic image generation Use image as input instead of random vector |
| Yang et al. [ | 2017 | Conditional GAN for reconstruction of compressed or corrupted sensing magnetic resonance imaging(CS-MRI) from random under sampled data Generate image with better textures and edges | |
| Cho et al. [ | 2019 | Style transfer approach for image transformation Achieved efficiency and image quality | |
| Dong et al. [ | 2017 | Image translation as well as intra-domain image translation | |
| Zhang et al. [ | 2016 | Conditional augmentation technique for generation of low resolution image from text followed by noise removal Achieved image smoothness and improved quality | |
| Reed et al. [ | 2016 | Synthetic image generation from text description Advantage: image generation with high quality and with multiple objects and variable background | |
| Speech recognition | Sriram et al. [ | 2017 | Data driven approach that introduced invariance in encoder embedding with no specialized pre-processing Achieved scalability and robustness |
| Donahue et al. [ | 2017 | Enhancement of contaminated speed by additive and reverberant noise by employing GAN with Log-mel filter banks instead of wavelet Attained performance and robustness | |
| Pascual et al. [ | 2017 | Application of GAN at waveform level | |
| Unmanned aerial vehicles (UAV’s) | Wang et al. [ | 2018 | Noise filtering to avoid information loss during remote sending Benefit: image de-noising |
| Qiuhong et al. [ | 2019 | High volume data redundancy by applying compression | |
| NLP—natural language processing (NLP) | Li et al. [ | 2018 | Text regression model for association of text data and social outcome Advantage: data analysis with limited labelling |
| Lin et al. [ | 2017 | Rank-Gan for data analysis and quality assessment using rank metric | |
| Qian et al. [ | 2018 | Event factuality identification using Ac-GAN by learning syntactic inform and address imbalance among factuality values Advantage: reduced reliance over annotated text | |
| Health | Che et al. [ | 2017 | Hergan for synthetic health data generation with limited electronic health record (HER) |
| Hwang et al. [ | 2017 | Disease prediction using AC-GAN and stacked auto-encoder | |
| Rezaei et al. [ | 2018 | Semantic segmentation and disease classification by selective weighted loss Advantage: address Data imbalance | |
| Fake audio, video and image generation | Choi et al. [ | 2018 | Stargan for fake image generation by using deep CNN Achieved high classification accuracy |
| Nataraj et al. [ | 2019 | Detection of fake images using co-occurrence matrices along with deep learning Achieved good generalization and very high classification accuracy | |
| Agriculture | Suarez et al. [ | 2017 | Strength assessment of vegetation against normalized difference vegetation index (NDI) by applying Conditional GAN |
| Barth et al. [ | 2017 | Cyclegan for gap reduction between synthetic and empirical image data set Advantage: ease of translation of color and textures | |
| Music | Yang et al. [ | 2017 | Midinet—generation of musical notes by using CNN GAN Comparison of midinet was also made with Google’s melodyrnn from scratch Advantage: combine existing melodies as well as generate melodies from multiple channels |
| Dong et al. [ | 2018 | Misegan—generates symbolic music i.e. piano-rolls of five tracks and four bars i.e. Bass, drums, guitar, piano and strings Proposed three models known as call jamming model, composer model and hybrid model for music generation | |
| Yu et al. [ | 2019 | Simultaneous generation of lyrics-conditioned melody and association alignment between syllables of given lyrics by using conditional deep Lstm generator and discriminator Deep generative model for generation of melody and notes of predicted melody | |
| Weather forecasting | Chen et al. [ | 2018 | Scenario generation used for weather forecasting, however errors become more pronounced when the typhoons move into deep sea Advantage: generates wind patterns and weather forecasts based on historic data |
| Ruttgers et al. [ | 2018 | Predict track of typhoons by using satellite image. If information about surface pressure, velocity and sea surface temperature are added the results can become more accurate Advantage: predict the typhoon center as well as the movement of clouds with certain margins for error | |
| Sports | Jiao et al. [ | 2018 | Distinguishes correct performed golf swings Achieved accuracy and precision both in identification as well as classification of golf swings |
| Deverall et al. [ | 2017 | Conditional GAN for designing athletic shoes based on google gnet Achieved shoes categorization according to their physical attributes as well as functional type | |
| Internet of things (IoT) | Wang et al. [ | 2018 | Use of Bayesian methods for Radio Frequency (RF) sensing for IoT Advantage: overcome limitation of limited data availability by introducing an offline stage |
| Zhao et al. [ | 2018 | Individual identity authentication by applying open-categorical classification model based on gan (occ-gan) Advantage: better results are achieved than other methods like one-class support vector machine (oc-svm) and one-versus-rest support vector machine (ovr-svm) | |
| Genetic engineering | Dizaji et al. [ | 2018 | Gene expression profiling by using semi-supervised GAN for expression inference Use landmark genes instead of whole gene expressions |
| Simulation and modeling | Hassouni et al. [ | 2018 | Generating realistic simulation environments that simulates daily activities of users Advantage: generate realistic sensory data that related to daily activities of users |
| Pöpperl et al. [ | 2019 | Synthetic ultrasonic signal simulation using conditional gans (cgans) Advantage: real like data augmentation for automotive ultrasonic and also adaptive to external influences | |
| Market prediction and forecasting | Tian et al. [ | 2019 | A technique for predicting the consumption of energy Advantage: outperforms the standard approaches i.e. information diffusion technology (idt), the heuristic mega-trend-diffusion (hmtd) technology and the bootstrap technique Advantage: scalable to perform forecast for demand of electricity and the traffic supply |
| Luo et al. [ | 2018 | A technique for predicting the prices of the crude oil using adaptive scales continuous wavelet transform (as-cwt) Advantage: more accurate forecasts as compare to naive forecast (nf) model and other nonlinear models i.e. deep belief networks (dbns) | |
| Drug discovery | Zhavoronkov et al. [ | 2019 | Drug discovery using generative modelling. i.e. generative tensorial reinforcement learning Useful for the discovery of new micro molecule kinase inhibitors and DNA damage response (DDR1) inhibitors |
| Architectural designing | Zheng et al. [ | 2018 | Floor plan image identification and creation Floor design images get translated into programmatic patches of colors |
| Wang et al. [ | 2019 | Double P-buried layers MISFET (DP-MISFET) is proposed Simulated and characteristics are analysed by the Sentaurus TCAD tool | |
| Road network generation and path planning | Albert et al. [ | 2018 | Novel technique to simulate real like urban designs fine-tuned with urban land-use inventory Advantage: synthetic urban pattern is formulated to qualitatively regenerate the spatial structures perceived in urban designs |
| Mohammadi et al. [ | 2018 | Precise and reliable paths for navigation software including wayfinding for disabled people, route identification for evacuations, robotic navigations for autonomous vehicles Advantage: high accuracy of the classification task with high quality of the generated paths is achieved | |
| Testing and validation | Zhang et al. [ | 2018 | Unsupervised model for automatic verification and validation of the consistent behavior of autonomous vehicle driving systems Real time validation is also achieved |
| Segura et al. [ | 2016 | Metamorphic verification and validation approach for identifying unusual behaviors of autonomous vehicle systems along with input validation | |
| Zhihui Li et al. [ | 2019 | Create fuzzing data using Wasserstein GANs (wgans) Advantage: does not require specification of input data Significant for testing of industrial control systems (icss) | |
| Software designing and development | Li et al. [ | 2019 | Layoutgan—Wireframe designing i.e. layouts generation of relational graphic elements to wireframe images by modelling geometric relations of different types of two dimensional elements Advantage: introduction of wireframe rendering layer which produce a set of relational graphic controls |
| Liu et al. [ | 2018 | Treegan for source code generation Advantage: syntax-aware sequence generation | |
| Fault prediction | Gao et al. [ | 2019 | ASM1D-GAN a model to identify the faults by extracting features related to faults from real fault samples and create the similar one Advantage: integration of data creation and fault determination |
| Zhou et al. [ | 2019 | Synthesize vibrational fault samples using a technique of global optimization Advantage: feature extraction of feature using limited number of samples and its effective representation using auto-encoder Filter the non-compliant synthetic samples which are not useful for reliable fault diagnosis | |
| Zheng et al. [ | 2019 | Gan-fp utilizes multiple GANs to create training samples and an inference network in parallel to predict failures for newly crafted samples Improved performance as well as significant socio-economic impact | |
| Text generation | Subramanian et al. [ | 2018 | Ability to create sentence outlines using an adversarial model which learns the distribution of sentences in a hidden space persuaded by sentence encoder Advantage: produce real like samples with multinomial sampling |
| Liang et al. [ | 2017 | Create useful distractors Advantage: achieves comparable performance to a frequently used word2vec-based method for the Wiki dataset | |
| Malware detection | Dahl et al. [ | 2013 | Employ random projections to decrease the dimension of the original latent space Achieved improved classification results |
| Grosse et al. [ | 2016 | To craft real offensive adversarial attacks Introduced additional constraints in the adversarial sample crafting (i) continuous, differentiable input domains are replaced by discrete, often binary inputs; and (ii) the loose condition of leaving visual appearance unchanged is replaced by requiring equivalent functional behavior | |
| Arjovsky et al. [ | 2017 | IDSGAN used to generate malware attacks which can bypass the different intrusion detection systems (IDS) Achieved high degree of evasion against IDS | |
| Heusel et al. [ | 2017 | Framework to target portable executable (PE) anti malware systems in an offensive way Advantage: proved to be an effective model to identify the vulnerabilities of the anti-malware systems | |
| Arjovsky et al. [ | 2017 | Model to generate malware instances for Black-Box Attacks Based on GAN | |
| Gulrajani et al. [ | 2019 | Adversarial sample generation to launch attack against malware classifiers | |
| Singh et al. [ | 2019 | Generative model for malware images that could be used to boost classifier’s performance by performing data augmentation Advantage: leveraged to generate malware images which would alleviate the problem of public sharing of the dataset | |
| Odena et al. [ | 2016 | Class-conditional image synthesis model to segregate generated samples to their respective malware category without any manual intervention | |
| Anderson et al. [ | 2016 | Model to bypass a detector of web domain generation algorithm | |
| Rigaki et al. [ | 2018 | To adapt malware communication to force misclassification of new generation Intrusion Prevention Systems (IPS) Advantage: effective at modifying malware traffic in order to remain undetectable | |
| Labaca et al. [ | 2019 | GAN to inject automatic byte-level perturbations into PE files to fool the classifier | |
| Kawai et al. [ | 2020 | Bypass malware defenders by adding benign to the original malicious code Advantage: resolve the problem of creating an huge collection of APIs to bypass the detectors | |
| BlockChain | Zheng [ | 2020 | GANs based technology for exchange of secret key which also overcome the block chain problems of security, recovery of lost key and communication inefficiency Advantage: a new avenue is opened the exchange of secret key which us reliable and adaptive as well as efficient |
Fig. 3Problems of GAN