Literature DB >> 27318205

NegGOA: negative GO annotations selection using ontology structure.

Guangyuan Fu1, Jun Wang1, Bo Yang2, Guoxian Yu3.   

Abstract

MOTIVATION: Predicting the biological functions of proteins is one of the key challenges in the post-genomic era. Computational models have demonstrated the utility of applying machine learning methods to predict protein function. Most prediction methods explicitly require a set of negative examples-proteins that are known not carrying out a particular function. However, Gene Ontology (GO) almost always only provides the knowledge that proteins carry out a particular function, and functional annotations of proteins are incomplete. GO structurally organizes more than tens of thousands GO terms and a protein is annotated with several (or dozens) of these terms. For these reasons, the negative examples of a protein can greatly help distinguishing true positive examples of the protein from such a large candidate GO space.
RESULTS: In this paper, we present a novel approach (called NegGOA) to select negative examples. Specifically, NegGOA takes advantage of the ontology structure, available annotations and potentiality of additional annotations of a protein to choose negative examples of the protein. We compare NegGOA with other negative examples selection algorithms and find that NegGOA produces much fewer false negatives than them. We incorporate the selected negative examples into an efficient function prediction model to predict the functions of proteins in Yeast, Human, Mouse and Fly. NegGOA also demonstrates improved accuracy than these comparing algorithms across various evaluation metrics. In addition, NegGOA is less suffered from incomplete annotations of proteins than these comparing methods.
AVAILABILITY AND IMPLEMENTATION: The Matlab and R codes are available at https://sites.google.com/site/guoxian85/neggoa CONTACT: gxyu@swu.edu.cn SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27318205     DOI: 10.1093/bioinformatics/btw366

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  SGFSC: speeding the gene functional similarity calculation based on hash tables.

Authors:  Zhen Tian; Chunyu Wang; Maozu Guo; Xiaoyan Liu; Zhixia Teng
Journal:  BMC Bioinformatics       Date:  2016-11-04       Impact factor: 3.169

2.  NoGOA: predicting noisy GO annotations using evidences and sparse representation.

Authors:  Guoxian Yu; Chang Lu; Jun Wang
Journal:  BMC Bioinformatics       Date:  2017-07-21       Impact factor: 3.169

3.  Novel comparison of evaluation metrics for gene ontology classifiers reveals drastic performance differences.

Authors:  Ilya Plyusnin; Liisa Holm; Petri Törönen
Journal:  PLoS Comput Biol       Date:  2019-11-04       Impact factor: 4.475

4.  Predicting functions of maize proteins using graph convolutional network.

Authors:  Guangjie Zhou; Jun Wang; Xiangliang Zhang; Maozu Guo; Guoxian Yu
Journal:  BMC Bioinformatics       Date:  2020-12-16       Impact factor: 3.169

5.  Prediction of Human Phenotype Ontology terms by means of hierarchical ensemble methods.

Authors:  Marco Notaro; Max Schubach; Peter N Robinson; Giorgio Valentini
Journal:  BMC Bioinformatics       Date:  2017-10-12       Impact factor: 3.169

6.  Benchmarking gene ontology function predictions using negative annotations.

Authors:  Alex Warwick Vesztrocy; Christophe Dessimoz
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

Review 7.  Automatic Gene Function Prediction in the 2020's.

Authors:  Stavros Makrodimitris; Roeland C H J van Ham; Marcel J T Reinders
Journal:  Genes (Basel)       Date:  2020-10-27       Impact factor: 4.096

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.