| Literature DB >> 22544604 |
Samuel P Wein1, Richard G Côté, Marine Dumousseau, Florian Reisinger, Henning Hermjakob, Juan A Vizcaíno.
Abstract
The Protein Identifier Cross-Reference (PICR) service is a tool that allows users to map protein identifiers, protein sequences and gene identifiers across over 100 different source databases. PICR takes input through an interactive website as well as Representational State Transfer (REST) and Simple Object Access Protocol (SOAP) services. It returns the results as HTML pages, XLS and CSV files. It has been in production since 2007 and has been recently enhanced to add new functionality and increase the number of databases it covers. Protein subsequences can be Basic Local Alignment Search Tool (BLAST) against the UniProt Knowledgebase (UniProtKB) to provide an entry point to the standard PICR mapping algorithm. In addition, gene identifiers from UniProtKB and Ensembl can now be submitted as input or mapped to as output from PICR. We have also implemented a 'best-guess' mapping algorithm for UniProt. In this article, we describe the usefulness of PICR, how these changes have been implemented, and the corresponding additions to the web services. Finally, we explain that the number of source databases covered by PICR has increased from the initial 73 to the current 102. New resources include several new species-specific Ensembl databases as well as the Ensembl Genome ones. PICR can be accessed at http://www.ebi.ac.uk/Tools/picr/.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22544604 PMCID: PMC3394263 DOI: 10.1093/nar/gks338
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The updated PICR mapping algorithm. The first part of the algorithm deals with finding all UniParc entries that pertain to the source data, be it coming from protein sequences or identifiers. From there, the UniParc entries are used to build cross-references to the required target databases, based on the initial search criteria.
Figure 2.The updated PICR web interface. The integration of BLAST support to PICR required an additional step to the identifier mapping process. Users can select which BLAST hit to use as an entry point to the map-by-accession PICR mapping algorithm. The final result that is presented to the user will contain both the definition line for each sequence and the selected UniProt accession number, so that the user can keep track of what was originally submitted for mapping.