| Literature DB >> 19010804 |
Brigitte Waegele1, Irmtraud Dunger-Kaltenbach, Gisela Fobo, Corinna Montrone, H-Werner Mewes, Andreas Ruepp.
Abstract
UNLABELLED: Cross-mapping of gene and protein identifiers between different databases is a tedious and time-consuming task. To overcome this, we developed CRONOS, a cross-reference server that contains entries from five mammalian organisms presented by major gene and protein information resources. Sequence similarity analysis of the mapped entries shows that the cross-references are highly accurate. In total, up to 18 different identifier types can be used for identification of cross-references. The quality of the mapping could be improved substantially by exclusion of ambiguous gene and protein names which were manually validated. Organism-specific lists of ambiguous terms, which are valuable for a variety of bioinformatics applications like text mining are available for download. AVAILABILITY: CRONOS is freely available to non-commercial users at http://mips.gsf.de/genre/proj/cronos/index.html, web services are available at http://mips.gsf.de/CronosWSService/CronosWS?wsdl.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19010804 PMCID: PMC2638938 DOI: 10.1093/bioinformatics/btn590
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Quality of the mapping with and without ambiguous gene names and protein names. The sequence identity of the mapped entries from human (Swiss-Prot and RefSeq) was calculated and pooled in fractions of 0–5%, 5–10% sequence identity etc. The plot shows the fraction of the mapped entries plotted against the sequence identity of these entries.