| Literature DB >> 17148486 |
M Louise Riley1, Thorsten Schmidt, Irena I Artamonova, Christian Wagner, Andreas Volz, Klaus Heumann, Hans-Werner Mewes, Dmitrij Frishman.
Abstract
The PEDANT genome database provides exhaustive annotation of 468 genomes by a broad set of bioinformatics algorithms. We describe recent developments of the PEDANT Web server. The all-new Graphical User Interface (GUI) implemented in Javatrade mark allows for more efficient navigation of the genome data, extended search capabilities, user customization and export facilities. The DNA and Protein viewers have been made highly dynamic and customizable. We also provide Web Services to access the entire body of PEDANT data programmatically. Finally, we report on the application of association rule mining for automatic detection of potential annotation errors. PEDANT is freely accessible to academic users at http://pedant.gsf.de.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17148486 PMCID: PMC1761421 DOI: 10.1093/nar/gkl1005
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Navigation tree and report page for the hypothetical secreted protein of Helicobacter pylori (gi_15645712). This figure also illustrates the application of association rule mining for finding potential annotation errors (see text). A number of domain assignments have been marked as suspicious. The InterPro domain IPR001440 and PFAM domain PF00515 were erroneously assigned to this gene product based on a weak similarity hit to a single TPR_1 repeat whereas the domain definition requires at least three repeat copies. Note that all other features marked as suspicious are in fact annotated correctly; the method does not detect annotation errors as such, but rather incompatible feature combinations which might include annotation errors.
Methods available in the Data Retrieval Service
| Methods | Description |
|---|---|
| getDatabases | Returns a string array of the pedant2 databases available for data retrieval |
| getProteinSequencesbyDb | Takes the database name as a parameter and returns all protein codes and sequences in the db as a 2D string array |
| getReportbyDbCodeandContig | Takes the database name, protein code and contig as parameters and returns the report data as a ReportItem bean |
| areCodesUniqueWithinDb | Takes the database name as a parameter and returns a Boolean |
| getRawHitsbyDbCodeandMethod | Takes the database name, protein code and method as parameters and returns the code and raw output for that method as a 2D string array |
| getRawHitsbyDbCodeContigandMethod | Takes the database name, protein code, contig and method as parameters and returns the code and raw output for that method as a 2D string array |
| getProteinCodesbyDb | Takes the database name as a parameter and returns the protein codes as a string array |
| getProteinCodesandContigsbyDb | Takes the database name as a parameter and returns the protein codes and contigs as a 2D string array |
| getMethodsbyDb | Takes the database name as a parameter and returns the available methods as a string array |