Literature DB >> 31106307

Generalized Bayesian Factor Analysis for Integrative Clustering with Applications to Multi-Omics Data.

Eun Jeong Min1, Changgee Chang1, Qi Long1.   

Abstract

Integrative clustering is a clustering approach for multiple datasets, which provide different views of a common group of subjects. It enables analyzing multi-omics data jointly to, for example, identify the subtypes of diseases, cells, and so on, capturing the complex underlying biological processes more precisely. On the other hand, there has been a great deal of interest in incorporating the prior structural knowledge on the features into statistical analyses over the past decade. The knowledge on the gene regulatory network (pathways) can potentially be incorporated into many genomic studies. In this paper, we propose a novel integrative clustering method which can incorporate the prior graph knowledge. We first develop a generalized Bayesian factor analysis (GBFA) framework, a sparse Bayesian factor analysis which can take into account the graph information. Our GBFA framework employs the spike and slab lasso (SSL) prior to impose sparsity on the factor loadings and the Markov random field (MRF) prior to encourage smoothing over the adjacent factor loadings, which establishes a unified shrinkage adaptive to the loading size and the graph structure. Then, we use the framework to extend iCluster+, a factor analysis based integrative clustering approach. A novel variational EM algorithm is proposed to efficiently estimate the MAP estimator for the factor loadings. Extensive simulation studies and the application to the NCI60 cell line dataset demonstrate that the propose method is superior and delivers more biologically meaningful outcomes.

Entities:  

Keywords:  Generalized Bayesian Factor Analysis; High Dimensional Data; Integrative Analysis; Integrative Clustering; Markov Random Field (MRF); NCI60; Network Information; Omics Data; Spike and Slab Lasso (SSL); Structural Information; Variational EM Algorithm

Year:  2019        PMID: 31106307      PMCID: PMC6521881          DOI: 10.1109/DSAA.2018.00021

Source DB:  PubMed          Journal:  Proc Int Conf Data Sci Adv Anal


  4 in total

1.  Knowledge-Guided Biclustering via Sparse Variational EM Algorithm.

Authors:  Changgee Chang; Jihwan Oh; Eun Jeong Min; Qi Long
Journal:  10th IEEE Int Conf Big Knowl (2019)       Date:  2019-12-30

2.  GRIA: Graphical Regularization for Integrative Analysis.

Authors:  Changgee Chang; Jihwan Oh; Qi Long
Journal:  Proc SIAM Int Conf Data Min       Date:  2020

3.  An ensemble of the iCluster method to analyze longitudinal lncRNA expression data for psoriasis patients.

Authors:  Suyan Tian; Chi Wang
Journal:  Hum Genomics       Date:  2021-04-20       Impact factor: 4.639

4.  Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data.

Authors:  Eun Jeong Min; Qi Long
Journal:  BMC Bioinformatics       Date:  2020-04-15       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.