| Literature DB >> 15608204 |
M Louise Riley1, Thorsten Schmidt, Christian Wagner, Hans-Werner Mewes, Dmitrij Frishman.
Abstract
The PEDANT genome database (http://pedant.gsf.de) contains pre-computed bioinformatics analyses of publicly available genomes. Its main mission is to provide robust automatic annotation of the vast majority of amino acid sequences, which have not been subjected to in-depth manual curation by human experts in high-quality protein sequence databases. By design PEDANT annotation is genome-oriented, making it possible to explore genomic context of gene products, and evaluate functional and structural content of genomes using a category-based query mechanism. At present, the PEDANT database contains exhaustive annotation of over 1,240,000 proteins from 270 eubacterial, 23 archeal and 41 eukaryotic genomes.Entities:
Mesh:
Year: 2005 PMID: 15608204 PMCID: PMC539973 DOI: 10.1093/nar/gki019
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Growth of the number of annotated genomes in the PEDANT database since 1998. *Number as of September 1, 2004.
Figure 2Number of annotated genomes (A) and protein sequences (B) in different genome categories.
Figure 3An illustration of the functional and structural content of the PEDANT database. The figure shows the percentage of protein sequences associated with PFAM sequence motifs, SCOP structural domains and MIPS functional categories, as well as any combinations of these three attributes.
Figure 4The FunCat distribution of all 334 genomes in PEDANT. Here, the relative amounts of proteins that are assigned to one or more of six general FunCat classes are shown. Since proteins can be assigned to more than one functional category, the total fraction exceeds 100%.