Literature DB >> 11485799

Intrinsic errors in genome annotation.

D Devos1, A Valencia.   

Abstract

Genome sequencing is usually followed by routine annotation of protein function based on the assumption that similar sequences will have similar functions. Here, we introduce a simple calculation to estimate the magnitude of any possible annotation errors. We counted the number of discrepancies in the annotation of well-established sets of similar proteins and extrapolated these values to the pairs of similar sequences used for the annotation of different microbial genomes. We conclude that the number of potential errors in the prediction of detailed functions is higher than is usually believed.

Mesh:

Year:  2001        PMID: 11485799     DOI: 10.1016/s0168-9525(01)02348-4

Source DB:  PubMed          Journal:  Trends Genet        ISSN: 0168-9525            Impact factor:   11.639


  98 in total

1.  Genome-wide analysis of core cell cycle genes in Arabidopsis.

Authors:  Klaas Vandepoele; Jeroen Raes; Lieven De Veylder; Pierre Rouzé; Stephane Rombauts; Dirk Inzé
Journal:  Plant Cell       Date:  2002-04       Impact factor: 11.277

2.  Search and retrieve. Large-scale data generation is becoming increasingly important in biological research. But how good are the tools to make sense of the data?

Authors:  Alfonso Valencia
Journal:  EMBO Rep       Date:  2002-05       Impact factor: 8.807

3.  A primordial RNA modification enzyme: the case of tRNA (m1A) methyltransferase.

Authors:  Martine Roovers; Johan Wouters; Janusz M Bujnicki; Catherine Tricot; Victor Stalon; Henri Grosjean; Louis Droogmans
Journal:  Nucleic Acids Res       Date:  2004-01-22       Impact factor: 16.971

4.  Sequence conserved for subcellular localization.

Authors:  Rajesh Nair; Burkhard Rost
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

5.  The PredictProtein server.

Authors:  Burkhard Rost; Jinfeng Liu
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

6.  Congruent evolution of different classes of non-coding DNA in prokaryotic genomes.

Authors:  Igor B Rogozin; Kira S Makarova; Darren A Natale; Alexey N Spiridonov; Roman L Tatusov; Yuri I Wolf; Jodie Yin; Eugene V Koonin
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

7.  PANDORA: keyword-based analysis of protein sets by integration of annotation sources.

Authors:  Noam Kaplan; Avishay Vaaknin; Michal Linial
Journal:  Nucleic Acids Res       Date:  2003-10-01       Impact factor: 16.971

8.  The PredictProtein server.

Authors:  Burkhard Rost; Guy Yachdav; Jinfeng Liu
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

9.  LEON: multiple aLignment Evaluation Of Neighbours.

Authors:  Julie D Thompson; Véronique Prigent; Olivier Poch
Journal:  Nucleic Acids Res       Date:  2004-02-24       Impact factor: 16.971

10.  Assignment of pterin deaminase activity to an enzyme of unknown function guided by homology modeling and docking.

Authors:  Hao Fan; Daniel S Hitchcock; Ronald D Seidel; Brandan Hillerich; Henry Lin; Steven C Almo; Andrej Sali; Brian K Shoichet; Frank M Raushel
Journal:  J Am Chem Soc       Date:  2013-01-02       Impact factor: 15.419

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.