| Literature DB >> 18940859 |
Mathias C Walter1, Thomas Rattei, Roland Arnold, Ulrich Güldener, Martin Münsterkötter, Karamfilka Nenova, Gabi Kastenmüller, Patrick Tischler, Andreas Wölling, Andreas Volz, Norbert Pongratz, Ralf Jost, Hans-Werner Mewes, Dmitrij Frishman.
Abstract
The PEDANT genome database provides exhaustive annotation of nearly 3000 publicly available eukaryotic, eubacterial, archaeal and viral genomes with more than 4.5 million proteins by a broad set of bioinformatics algorithms. In particular, all completely sequenced genomes from the NCBI's Reference Sequence collection (RefSeq) are covered. The PEDANT processing pipeline has been sped up by an order of magnitude through the utilization of precalculated similarity information stored in the similarity matrix of proteins (SIMAP) database, making it possible to process newly sequenced genomes immediately as they become available. PEDANT is freely accessible to academic users at http://pedant.gsf.de. For programmatic access Web Services are available at http://pedant.gsf.de/webservices.jsp.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18940859 PMCID: PMC2686588 DOI: 10.1093/nar/gkn749
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The number of species from major taxonomic groups contained in the PEDANT genome database as of September 2008
| NCBI Taxonomy ID | Taxonomic group | Number of genomes |
|---|---|---|
| Cellular organisms | 861 | |
| 2157 | Archaea | 53 |
| 2 | Bacteria | 691 |
| 2759 | Eukaryota | 117 |
| Fungi | ||
| Metazoa | ||
| Viridiplantae | ||
| Other | ||
| Viruses | ||
| Total | 2942 |
Other groups: Alveolata (2), Amoebozoa (1), Cryptophyta (1), Euglenozoa (5), Rhodophyta (1), Stramenopiles (2).
Figure 1.UML activity model of the PEDANT genome import and processing pipeline. Symbols according to the UML 2.0 specification (http://www.uml.org) for activity diagrams.