Literature DB >> 25984253

A BAYESIAN NONPARAMETRIC MIXTURE MODEL FOR SELECTING GENES AND GENE SUBNETWORKS.

Yize Zhao1, Jian Kang1, Tianwei Yu1.   

Abstract

It is very challenging to select informative features from tens of thousands of measured features in high-throughput data analysis. Recently, several parametric/regression models have been developed utilizing the gene network information to select genes or pathways strongly associated with a clinical/biological outcome. Alternatively, in this paper, we propose a nonparametric Bayesian model for gene selection incorporating network information. In addition to identifying genes that have a strong association with a clinical outcome, our model can select genes with particular expressional behavior, in which case the regression models are not directly applicable. We show that our proposed model is equivalent to an infinity mixture model for which we develop a posterior computation algorithm based on Markov chain Monte Carlo (MCMC) methods. We also propose two fast computing algorithms that approximate the posterior simulation with good accuracy but relatively low computational cost. We illustrate our methods on simulation studies and the analysis of Spellman yeast cell cycle microarray data.

Entities:  

Keywords:  Dirichlet process mixture; density estimation; feature selection; ising priors; microarray data

Year:  2014        PMID: 25984253      PMCID: PMC4433170          DOI: 10.1214/14-AOAS719

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  27 in total

1.  Emergence of scaling in random networks

Authors: 
Journal:  Science       Date:  1999-10-15       Impact factor: 47.728

Review 2.  Structural maintenance of chromosomes (SMC) proteins: conserved molecular properties for multiple biological functions.

Authors:  A V Strunnikov; R Jessberger
Journal:  Eur J Biochem       Date:  1999-07

3.  Bayesian Joint Modeling of Multiple Gene Networks and Diverse Genomic Data to Identify Target Genes of a Transcription Factor.

Authors:  Peng Wei; Wei Pan
Journal:  Ann Appl Stat       Date:  2012-01-01       Impact factor: 2.083

4.  A Markov random field model for network-based analysis of genomic data.

Authors:  Zhi Wei; Hongzhe Li
Journal:  Bioinformatics       Date:  2007-05-05       Impact factor: 6.937

5.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions.

Authors:  Ioannis Xenarios; Lukasz Salwínski; Xiaoqun Joyce Duan; Patrick Higney; Sul-Min Kim; David Eisenberg
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

6.  A hierarchical semiparametric model for incorporating intergene information for analysis of genomic data.

Authors:  Long Qu; Dan Nettleton; Jack C M Dekkers
Journal:  Biometrics       Date:  2012-09-20       Impact factor: 2.571

7.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors:  P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal:  Mol Biol Cell       Date:  1998-12       Impact factor: 4.138

8.  Incorporating predictor network in penalized regression with application to microarray data.

Authors:  Wei Pan; Benhuai Xie; Xiaotong Shen
Journal:  Biometrics       Date:  2009-07-23       Impact factor: 2.571

9.  Pathway Commons, a web resource for biological pathway data.

Authors:  Ethan G Cerami; Benjamin E Gross; Emek Demir; Igor Rodchenkov; Ozgün Babur; Nadia Anwar; Nikolaus Schultz; Gary D Bader; Chris Sander
Journal:  Nucleic Acids Res       Date:  2010-11-10       Impact factor: 16.971

10.  Saccharomyces Genome Database: the genomics resource of budding yeast.

Authors:  J Michael Cherry; Eurie L Hong; Craig Amundsen; Rama Balakrishnan; Gail Binkley; Esther T Chan; Karen R Christie; Maria C Costanzo; Selina S Dwight; Stacia R Engel; Dianna G Fisk; Jodi E Hirschman; Benjamin C Hitz; Kalpana Karra; Cynthia J Krieger; Stuart R Miyasato; Rob S Nash; Julie Park; Marek S Skrzypek; Matt Simison; Shuai Weng; Edith D Wong
Journal:  Nucleic Acids Res       Date:  2011-11-21       Impact factor: 16.971

View more
  6 in total

1.  Bayesian network feature finder (BANFF): an R package for gene network feature selection.

Authors:  Zhou Lan; Yize Zhao; Jian Kang; Tianwei Yu
Journal:  Bioinformatics       Date:  2016-08-08       Impact factor: 6.937

2.  Detecting subnetwork-level dynamic correlations.

Authors:  Yan Yan; Shangzhao Qiu; Zhuxuan Jin; Sihong Gong; Yun Bai; Jianwei Lu; Tianwei Yu
Journal:  Bioinformatics       Date:  2016-09-25       Impact factor: 6.937

3.  forgeNet: a graph deep neural network model using tree-based ensemble classifiers for feature graph construction.

Authors:  Yunchuan Kong; Tianwei Yu
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

4.  A BAYESIAN NONPARAMETRIC MIXTURE MODEL FOR SELECTING GENES AND GENE SUBNETWORKS.

Authors:  Yize Zhao; Jian Kang; Tianwei Yu
Journal:  Ann Appl Stat       Date:  2014-06       Impact factor: 2.083

5.  DNLC: differential network local consistency analysis.

Authors:  Jianwei Lu; Yao Lu; Yusheng Ding; Qingyang Xiao; Linqing Liu; Qingpo Cai; Yunchuan Kong; Yun Bai; Tianwei Yu
Journal:  BMC Bioinformatics       Date:  2019-12-24       Impact factor: 3.169

6.  K-Profiles: A Nonlinear Clustering Method for Pattern Detection in High Dimensional Data.

Authors:  Kai Wang; Qing Zhao; Jianwei Lu; Tianwei Yu
Journal:  Biomed Res Int       Date:  2015-08-03       Impact factor: 3.411

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.