Literature DB >> 32470107

Detecting Gene Ontology misannotations using taxon-specific rate ratio comparisons.

Xiaoqiong Wei1,2, Chengxin Zhang2, Peter L Freddolino2,3, Yang Zhang2,3.   

Abstract

MOTIVATION: Many protein function databases are built on automated or semi-automated curations and can contain various annotation errors. The correction of such misannotations is critical to improving the accuracy and reliability of the databases.
RESULTS: We proposed a new approach to detect potentially incorrect Gene Ontology (GO) annotations by comparing the ratio of annotation rates (RAR) for the same GO term across different taxonomic groups, where those with a relatively low RAR usually correspond to incorrect annotations. As an illustration, we applied the approach to 20 commonly studied species in two recent UniProt-GOA releases and identified 250 potential misannotations in the 2018-11-6 release, where only 25% of them were corrected in the 2019-6-3 release. Importantly, 56% of the misannotations are 'Inferred from Biological aspect of Ancestor (IBA)' which is in contradiction with previous observations that attributed misannotations mainly to 'Inferred from Sequence or structural Similarity (ISS)', probably reflecting an error source shift due to the new developments of function annotation databases. The results demonstrated a simple but efficient misannotation detection approach that is useful for large-scale comparative protein function studies.
AVAILABILITY AND IMPLEMENTATION: https://zhanglab.ccmb.med.umich.edu/RAR. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2020        PMID: 32470107      PMCID: PMC7751014          DOI: 10.1093/bioinformatics/btaa548

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  18 in total

1.  A family of Salmonella virulence factors functions as a distinct class of autoregulated E3 ubiquitin ligases.

Authors:  Cindy M Quezada; Stuart W Hicks; Jorge E Galán; C Erec Stebbins
Journal:  Proc Natl Acad Sci U S A       Date:  2009-03-09       Impact factor: 11.205

2.  Structure and Protein Interaction-Based Gene Ontology Annotations Reveal Likely Functions of Uncharacterized Proteins on Human Chromosome 17.

Authors:  Chengxin Zhang; Xiaoqiong Wei; Gilbert S Omenn; Yang Zhang
Journal:  J Proteome Res       Date:  2018-10-16       Impact factor: 4.466

3.  Identification of plakoglobin domains required for association with N-cadherin and alpha-catenin.

Authors:  P A Sacco; T M McGranahan; M J Wheelock; K R Johnson
Journal:  J Biol Chem       Date:  1995-08-25       Impact factor: 5.157

4.  Quality of computationally inferred gene ontology annotations.

Authors:  Nives Skunca; Adrian Altenhoff; Christophe Dessimoz
Journal:  PLoS Comput Biol       Date:  2012-05-31       Impact factor: 4.475

5.  InterProScan 5: genome-scale protein function classification.

Authors:  Philip Jones; David Binns; Hsin-Yu Chang; Matthew Fraser; Weizhong Li; Craig McAnulla; Hamish McWilliam; John Maslen; Alex Mitchell; Gift Nuka; Sebastien Pesseat; Antony F Quinn; Amaia Sangrador-Vegas; Maxim Scheremetjew; Siew-Yit Yong; Rodrigo Lopez; Sarah Hunter
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

6.  UniProt: a worldwide hub of protein knowledge.

Authors: 
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

7.  Estimating the annotation error rate of curated GO database sequence annotations.

Authors:  Craig E Jones; Alfred L Brown; Ute Baumann
Journal:  BMC Bioinformatics       Date:  2007-05-22       Impact factor: 3.169

8.  Exploring inconsistencies in genome-wide protein function annotations: a machine learning approach.

Authors:  Carson Andorf; Drena Dobbs; Vasant Honavar
Journal:  BMC Bioinformatics       Date:  2007-08-03       Impact factor: 3.169

9.  Annotation error in public databases: misannotation of molecular function in enzyme superfamilies.

Authors:  Alexandra M Schnoes; Shoshana D Brown; Igor Dodevski; Patricia C Babbitt
Journal:  PLoS Comput Biol       Date:  2009-12-11       Impact factor: 4.475

10.  Understanding how and why the Gene Ontology and its annotations evolve: the GO within UniProt.

Authors:  Rachael P Huntley; Tony Sawford; Maria J Martin; Claire O'Donovan
Journal:  Gigascience       Date:  2014-03-18       Impact factor: 6.524

View more
  3 in total

1.  Functions of Essential Genes and a Scale-Free Protein Interaction Network Revealed by Structure-Based Function and Interaction Prediction for a Minimal Genome.

Authors:  Chengxin Zhang; Wei Zheng; Micah Cheng; Gilbert S Omenn; Peter L Freddolino; Yang Zhang
Journal:  J Proteome Res       Date:  2021-01-04       Impact factor: 4.466

Review 2.  The emerging potential of microbiome transplantation on human health interventions.

Authors:  Howard Junca; Dietmar H Pieper; Eva Medina
Journal:  Comput Struct Biotechnol J       Date:  2022-01-19       Impact factor: 7.271

3.  Tissue-specific transcriptome profiles identify functional differences key to understanding whole plant response to life in variable salinity.

Authors:  Mitchell W Booth; Martin F Breed; Gary A Kendrick; Philipp E Bayer; Anita A Severn-Ellis; Elizabeth A Sinclair
Journal:  Biol Open       Date:  2022-08-23       Impact factor: 2.643

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.