Literature DB >> 21133881

Network-based auto-probit modeling for protein function prediction.

Xiaoyu Jiang1, David Gold, Eric D Kolaczyk.   

Abstract

Predicting the functional roles of proteins based on various genome-wide data, such as protein-protein association networks, has become a canonical problem in computational biology. Approaching this task as a binary classification problem, we develop a network-based extension of the spatial auto-probit model. In particular, we develop a hierarchical Bayesian probit-based framework for modeling binary network-indexed processes, with a latent multivariate conditional autoregressive Gaussian process. The latter allows for the easy incorporation of protein-protein association network topologies-either binary or weighted-in modeling protein functional similarity. We use this framework to predict protein functions, for functions defined as terms in the Gene Ontology (GO) database, a popular rigorous vocabulary for biological functionality. Furthermore, we show how a natural extension of this framework can be used to model and correct for the high percentage of false negative labels in training data derived from GO, a serious shortcoming endemic to biological databases of this type. Our method performance is evaluated and compared with standard algorithms on weighted yeast protein-protein association networks, extracted from a recently developed integrative database called Search Tool for the Retrieval of INteracting Genes/proteins (STRING). Results show that our basic method is competitive with these other methods, and that the extended method-incorporating the uncertainty in negative labels among the training data-can yield nontrivial improvements in predictive accuracy.
© 2010, The International Biometric Society.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 21133881      PMCID: PMC3116961          DOI: 10.1111/j.1541-0420.2010.01519.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  13 in total

1.  Assessment of prediction accuracy of protein function from protein--protein interaction data.

Authors:  H Hishigaki; K Nakai; T Ono; A Tanigami; T Takagi
Journal:  Yeast       Date:  2001-04       Impact factor: 3.239

2.  A statistical framework for genomic data fusion.

Authors:  Gert R G Lanckriet; Tijl De Bie; Nello Cristianini; Michael I Jordan; William Stafford Noble
Journal:  Bioinformatics       Date:  2004-05-06       Impact factor: 6.937

3.  An integrated probabilistic model for functional prediction of proteins.

Authors:  Minghua Deng; Ting Chen; Fengzhu Sun
Journal:  J Comput Biol       Date:  2004       Impact factor: 1.479

4.  Exploiting indirect neighbours and topological weight to predict protein function from protein-protein interactions.

Authors:  Hon Nian Chua; Wing-Kin Sung; Limsoon Wong
Journal:  Bioinformatics       Date:  2006-04-21       Impact factor: 6.937

5.  A network of protein-protein interactions in yeast.

Authors:  B Schwikowski; P Uetz; S Fields
Journal:  Nat Biotechnol       Date:  2000-12       Impact factor: 54.908

6.  Predicting protein function from protein/protein interaction data: a probabilistic approach.

Authors:  Stanley Letovsky; Simon Kasif
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

7.  Prediction of protein function using protein-protein interaction data.

Authors:  Minghua Deng; Kui Zhang; Shipra Mehta; Ting Chen; Fengzhu Sun
Journal:  J Comput Biol       Date:  2003       Impact factor: 1.479

Review 8.  Network-based prediction of protein function.

Authors:  Roded Sharan; Igor Ulitsky; Ron Shamir
Journal:  Mol Syst Biol       Date:  2007-03-13       Impact factor: 11.429

9.  STRING: known and predicted protein-protein associations, integrated and transferred across organisms.

Authors:  Christian von Mering; Lars J Jensen; Berend Snel; Sean D Hooper; Markus Krupp; Mathilde Foglierini; Nelly Jouffre; Martijn A Huynen; Peer Bork
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

10.  Probabilistic protein function prediction from heterogeneous genome-wide data.

Authors:  Naoki Nariai; Eric D Kolaczyk; Simon Kasif
Journal:  PLoS One       Date:  2007-03-28       Impact factor: 3.240

View more
  3 in total

1.  Poly-dipeptides encoded by the C9ORF72 repeats block global protein translation.

Authors:  Kohsuke Kanekura; Takuya Yagi; Alexander J Cammack; Jana Mahadevan; Masahiko Kuroda; Matthew B Harms; Timothy M Miller; Fumihiko Urano
Journal:  Hum Mol Genet       Date:  2016-02-29       Impact factor: 6.150

Review 2.  Review of biological network data and its applications.

Authors:  Donghyeon Yu; Minsoo Kim; Guanghua Xiao; Tae Hyun Hwang
Journal:  Genomics Inform       Date:  2013-12-31

3.  TarNet: An Evidence-Based Database for Natural Medicine Research.

Authors:  Ruifeng Hu; Guomin Ren; Guibo Sun; Xiaobo Sun
Journal:  PLoS One       Date:  2016-06-23       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.