Literature DB >> 29457155

VIGAN: Missing View Imputation with Generative Adversarial Networks.

Chao Shang1, Aaron Palmer1, Jiangwen Sun1, Ko-Shin Chen1, Jin Lu1, Jinbo Bi1.   

Abstract

In an era when big data are becoming the norm, there is less concern with the quantity but more with the quality and completeness of the data. In many disciplines, data are collected from heterogeneous sources, resulting in multi-view or multi-modal datasets. The missing data problem has been challenging to address in multi-view data analysis. Especially, when certain samples miss an entire view of data, it creates the missing view problem. Classic multiple imputations or matrix completion methods are hardly effective here when no information can be based on in the specific view to impute data for such samples. The commonly-used simple method of removing samples with a missing view can dramatically reduce sample size, thus diminishing the statistical power of a subsequent analysis. In this paper, we propose a novel approach for view imputation via generative adversarial networks (GANs), which we name by VIGAN. This approach first treats each view as a separate domain and identifies domain-to-domain mappings via a GAN using randomly-sampled data from each view, and then employs a multi-modal denoising autoencoder (DAE) to reconstruct the missing view from the GAN outputs based on paired data across the views. Then, by optimizing the GAN and DAE jointly, our model enables the knowledge integration for domain mappings and view correspondences to effectively recover the missing view. Empirical results on benchmark datasets validate the VIGAN approach by comparing against the state of the art. The evaluation of VIGAN in a genetic study of substance use disorders further proves the effectiveness and usability of this approach in life science.

Entities:  

Keywords:  autoencoder; cycle-consistent; domain mapping; generative adversarial networks; missing data; missing view

Year:  2018        PMID: 29457155      PMCID: PMC5813842          DOI: 10.1109/BigData.2017.8257992

Source DB:  PubMed          Journal:  Proc IEEE Int Conf Big Data


  11 in total

1.  Missing value estimation methods for DNA microarrays.

Authors:  O Troyanskaya; M Cantor; G Sherlock; P Brown; T Hastie; R Tibshirani; D Botstein; R B Altman
Journal:  Bioinformatics       Date:  2001-06       Impact factor: 6.937

Review 2.  Missing value imputation for gene expression data: computational techniques to recover missing data from available information.

Authors:  Alan Wee-Chung Liew; Ngai-Fong Law; Hong Yan
Journal:  Brief Bioinform       Date:  2010-12-14       Impact factor: 11.622

3.  Reducing the dimensionality of data with neural networks.

Authors:  G E Hinton; R R Salakhutdinov
Journal:  Science       Date:  2006-07-28       Impact factor: 47.728

4.  The criminality of new drug users in Glasgow.

Authors:  R Hammersley; A Forsyth; T Lavelle
Journal:  Br J Addict       Date:  1990-12

5.  Multiview matrix completion for multilabel image classification.

Authors: 
Journal:  IEEE Trans Image Process       Date:  2015-04-09       Impact factor: 10.856

6.  Spectral Regularization Algorithms for Learning Large Incomplete Matrices.

Authors:  Rahul Mazumder; Trevor Hastie; Robert Tibshirani
Journal:  J Mach Learn Res       Date:  2010-03-01       Impact factor: 3.654

7.  Reliability of DSM-IV diagnostic criteria using the semi-structured assessment for drug dependence and alcoholism (SSADDA).

Authors:  Amira Pierucci-Lagha; Joel Gelernter; Grace Chan; Albert Arias; Joseph F Cubells; Lindsay Farrer; Henry R Kranzler
Journal:  Drug Alcohol Depend       Date:  2007-06-27       Impact factor: 4.492

8.  Genome-wide association study of opioid dependence: multiple associations mapped to calcium and potassium pathways.

Authors:  Joel Gelernter; Henry R Kranzler; Richard Sherva; Ryan Koesterer; Laura Almasy; Hongyu Zhao; Lindsay A Farrer
Journal:  Biol Psychiatry       Date:  2013-10-19       Impact factor: 13.382

9.  Multimodal Deep Autoencoder for Human Pose Recovery.

Authors:  Chaoqun Hong; Jun Yu; Jian Wan; Dacheng Tao; Meng Wang
Journal:  IEEE Trans Image Process       Date:  2015-10-07       Impact factor: 10.856

10.  An Effective Method to Identify Heritable Components from Multivariate Phenotypes.

Authors:  Jiangwen Sun; Henry R Kranzler; Jinbo Bi
Journal:  PLoS One       Date:  2015-12-14       Impact factor: 3.240

View more
  4 in total

1.  A Survey on Multi-View Clustering.

Authors:  Guoqing Chao; Shiliang Sun; Jinbo Bi
Journal:  IEEE Trans Artif Intell       Date:  2021-04-05

Review 2.  Proposed Requirements for Cardiovascular Imaging-Related Machine Learning Evaluation (PRIME): A Checklist: Reviewed by the American College of Cardiology Healthcare Innovation Council.

Authors:  Partho P Sengupta; Sirish Shrestha; Béatrice Berthon; Emmanuel Messas; Erwan Donal; Geoffrey H Tison; James K Min; Jan D'hooge; Jens-Uwe Voigt; Joel Dudley; Johan W Verjans; Khader Shameer; Kipp Johnson; Lasse Lovstakken; Mahdi Tabassian; Marco Piccirilli; Mathieu Pernot; Naveena Yanamala; Nicolas Duchateau; Nobuyuki Kagiyama; Olivier Bernard; Piotr Slomka; Rahul Deo; Rima Arnaout
Journal:  JACC Cardiovasc Imaging       Date:  2020-09

3.  A Survey of Unsupervised Deep Domain Adaptation.

Authors:  Garrett Wilson; Diane J Cook
Journal:  ACM Trans Intell Syst Technol       Date:  2020-07-05       Impact factor: 4.654

4.  A Benchmark for Data Imputation Methods.

Authors:  Sebastian Jäger; Arndt Allhorn; Felix Bießmann
Journal:  Front Big Data       Date:  2021-07-08
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.