Literature DB >> 34290493

Knowledge-Guided Biclustering via Sparse Variational EM Algorithm.

Changgee Chang1, Jihwan Oh1, Eun Jeong Min1, Qi Long1.   

Abstract

A biclustering in the analysis of a gene expression data matrix, for example, is defined as a set of biclusters where each bicluster is a group of genes and a group of samples for which the genes are differentially expressed. Although many data mining approaches for biclustering exist in the literature, only few are able to incorporate prior knowledge to the analysis, which can lead to great improvements in terms of accuracy and interpretability, and all are limited in handling discrete data types. We propose a generalized biclustering approach that can be used for integrative analysis of multi-omics data with different data types. Our method is capable of utilizing biological information that can be represented by graph such as functional genomics and functional proteomics and accommodating a combination of continuous and discrete data types. The proposed method builds on a generalized Bayesian factor analysis framework and a variational EM approach is used to obtain parameter estimates, where the latent quantities in the loglikelihood are iteratively imputed by their conditional expectations. The biclusters are retrieved via the sparse estimates of the factor loadings and the conditional expectation of the latent factors. In order to obtain the sparse conditional expectation of the latent factors, a novel sparse variational EM algorithm is used. We demonstrate the superiority of our method over several existing biclustering methods in extensive simulation experiements and in integrative analysis of multi-omics data.

Keywords:  Bayesian latent factor model; biclustering; integrative multi-omics analysis; variational EM algorithm

Year:  2019        PMID: 34290493      PMCID: PMC8291726          DOI: 10.1109/icbk.2019.00012

Source DB:  PubMed          Journal:  10th IEEE Int Conf Big Knowl (2019)


  25 in total

1.  Extracting conserved gene expression motifs from gene expression data.

Authors:  T M Murali; Simon Kasif
Journal:  Pac Symp Biocomput       Date:  2003

2.  A systematic comparison and evaluation of biclustering methods for gene expression data.

Authors:  Amela Prelić; Stefan Bleuler; Philip Zimmermann; Anja Wille; Peter Bühlmann; Wilhelm Gruissem; Lars Hennig; Lothar Thiele; Eckart Zitzler
Journal:  Bioinformatics       Date:  2006-02-24       Impact factor: 6.937

3.  Network-constrained regularization and variable selection for analysis of genomic data.

Authors:  Caiyan Li; Hongzhe Li
Journal:  Bioinformatics       Date:  2008-03-01       Impact factor: 6.937

4.  Generalized Bayesian Factor Analysis for Integrative Clustering with Applications to Multi-Omics Data.

Authors:  Eun Jeong Min; Changgee Chang; Qi Long
Journal:  Proc Int Conf Data Sci Adv Anal       Date:  2019-02-04

Review 5.  Targeting apoptosis pathways in glioblastoma.

Authors:  Günter Eisele; Michael Weller
Journal:  Cancer Lett       Date:  2011-01-26       Impact factor: 8.679

6.  The Ras-related protein, Rap1A, mediates thrombin-stimulated, integrin-dependent glioblastoma cell proliferation and tumor growth.

Authors:  Jacqueline Sayyah; Alena Bartakova; Nekeisha Nogal; Lawrence A Quilliam; Dwayne G Stupack; Joan Heller Brown
Journal:  J Biol Chem       Date:  2014-05-01       Impact factor: 5.157

7.  A fully Bayesian latent variable model for integrative clustering analysis of multi-type omics data.

Authors:  Qianxing Mo; Ronglai Shen; Cui Guo; Marina Vannucci; Keith S Chan; Susan G Hilsenbeck
Journal:  Biostatistics       Date:  2018-01-01       Impact factor: 5.899

8.  Incorporating predictor network in penalized regression with application to microarray data.

Authors:  Wei Pan; Benhuai Xie; Xiaotong Shen
Journal:  Biometrics       Date:  2009-07-23       Impact factor: 2.571

9.  INCORPORATING BIOLOGICAL INFORMATION INTO LINEAR MODELS: A BAYESIAN APPROACH TO THE SELECTION OF PATHWAYS AND GENES.

Authors:  Francesco C Stingo; Yian A Chen; Mahlet G Tadesse; Marina Vannucci
Journal:  Ann Appl Stat       Date:  2011-09-01       Impact factor: 2.083

10.  Subclass mapping: identifying common subtypes in independent disease data sets.

Authors:  Yujin Hoshida; Jean-Philippe Brunet; Pablo Tamayo; Todd R Golub; Jill P Mesirov
Journal:  PLoS One       Date:  2007-11-21       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.