Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Network-based auto-probit modeling for protein function prediction.

Literature DB >> 21133881

Network-based auto-probit modeling for protein function prediction.

Xiaoyu Jiang¹, David Gold, Eric D Kolaczyk.

Abstract

Predicting the functional roles of proteins based on various genome-wide data, such as protein-protein association networks, has become a canonical problem in computational biology. Approaching this task as a binary classification problem, we develop a network-based extension of the spatial auto-probit model. In particular, we develop a hierarchical Bayesian probit-based framework for modeling binary network-indexed processes, with a latent multivariate conditional autoregressive Gaussian process. The latter allows for the easy incorporation of protein-protein association network topologies-either binary or weighted-in modeling protein functional similarity. We use this framework to predict protein functions, for functions defined as terms in the Gene Ontology (GO) database, a popular rigorous vocabulary for biological functionality. Furthermore, we show how a natural extension of this framework can be used to model and correct for the high percentage of false negative labels in training data derived from GO, a serious shortcoming endemic to biological databases of this type. Our method performance is evaluated and compared with standard algorithms on weighted yeast protein-protein association networks, extracted from a recently developed integrative database called Search Tool for the Retrieval of INteracting Genes/proteins (STRING). Results show that our basic method is competitive with these other methods, and that the extended method-incorporating the uncertainty in negative labels among the training data-can yield nontrivial improvements in predictive accuracy.

Entities: Disease Gene Species

Mesh：

Substances：
Proteins

Year: 2010 PMID： 21133881 PMCID： PMC3116961 DOI： 10.1111/j.1541-0420.2010.01519.x

Source DB: PubMed Journal: Biometrics ISSN： 0006-341X Impact factor: 2.571

13 in total

Network-based auto-probit modeling for protein function prediction.

1. Assessment of prediction accuracy of protein function from protein--protein interaction data.

2. A statistical framework for genomic data fusion.

3. An integrated probabilistic model for functional prediction of proteins.

4. Exploiting indirect neighbours and topological weight to predict protein function from protein-protein interactions.

5. A network of protein-protein interactions in yeast.

6. Predicting protein function from protein/protein interaction data: a probabilistic approach.

7. Prediction of protein function using protein-protein interaction data.

Review 8. Network-based prediction of protein function.

9. STRING: known and predicted protein-protein associations, integrated and transferred across organisms.

10. Probabilistic protein function prediction from heterogeneous genome-wide data.

1. Poly-dipeptides encoded by the C9ORF72 repeats block global protein translation.

Review 2. Review of biological network data and its applications.

3. TarNet: An Evidence-Based Database for Natural Medicine Research.