Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A database of unique protein sequence identifiers for proteome studies.

Literature DB >> 16858731

A database of unique protein sequence identifiers for proteome studies.

Abstract

In proteome studies, identification of proteins requires searching protein sequence databases. The public protein sequence databases (e.g., NCBInr, UniProt) each contain millions of entries, and private databases add thousands more. Although much of the sequence information in these databases is redundant, each database uses distinct identifiers for the identical protein sequence and often contains unique annotation information. Users of one database obtain a database-specific sequence identifier that is often difficult to reconcile with the identifiers from a different database. When multiple databases are used for searches or the databases being searched are updated frequently, interpreting the protein identifications and associated annotations can be problematic. We have developed a database of unique protein sequence identifiers called Sequence Globally Unique Identifiers (SEGUID) derived from primary protein sequences. These identifiers serve as a common link between multiple sequence databases and are resilient to annotation changes in either public or private databases throughout the lifetime of a given protein sequence. The SEGUID Database can be downloaded (http://bioinformatics.anl.gov/SEGUID/) or easily generated at any site with access to primary protein sequence databases. Since SEGUIDs are stable, predictions based on the primary sequence information (e.g., pI, Mr) can be calculated just once; we have generated approximately 500 different calculations for more than 2.5 million sequences. SEGUIDs are used to integrate MS and 2-DE data with bioinformatics information and provide the opportunity to search multiple protein sequence databases, thereby providing a higher probability of finding the most valid protein identifications.

Mesh：

Year: 2006 PMID： 16858731 DOI： 10.1002/pmic.200600032

Source DB: PubMed Journal: Proteomics ISSN： 1615-9853 Impact factor: 3.984

Keyword Cloud
Cited

11 in total

1. iRefScape. A Cytoscape plug-in for visualization and data mining of protein interaction data from iRefIndex.

Authors: Sabry Razick; Antonio Mora; Katerina Michalickova; Paul Boddie; Ian M Donaldson
Journal: BMC Bioinformatics Date: 2011-10-05 Impact factor: 3.169

2. Gene selection and cloning approaches for co-expression and production of recombinant protein-protein complexes.

Authors: György Babnigg; Robert Jedrzejczak; Boguslaw Nocek; Adam Stein; William Eschenfeldt; Lucy Stols; Norman Marshall; Alicia Weger; Ruiying Wu; Mark Donnelly; Andrzej Joachimiak
Journal: J Struct Funct Genomics Date: 2015-12-15

3. The M5nr: a novel non-redundant database containing protein sequences and annotations from multiple sources and associated tools.

Authors: Andreas Wilke; Travis Harrison; Jared Wilkening; Dawn Field; Elizabeth M Glass; Nikos Kyrpides; Konstantinos Mavrommatis; Folker Meyer
Journal: BMC Bioinformatics Date: 2012-06-21 Impact factor: 3.169

4. Improvements in the Protein Identifier Cross-Reference service.

Authors: Samuel P Wein; Richard G Côté; Marine Dumousseau; Florian Reisinger; Henning Hermjakob; Juan A Vizcaíno
Journal: Nucleic Acids Res Date: 2012-04-27 Impact factor: 16.971

5. The Booly aliasing resource: a database of grouped biological identifiers.

Authors: Long Hoang Do; Ethan Bier
Journal: Bioinformation Date: 2011-03-26

6. SeqRepo: A system for managing local collections of biological sequences.

Authors: Reece K Hart; Andreas Prlić
Journal: PLoS One Date: 2020-12-03 Impact factor: 3.240

7. MIRIAM Resources: tools to generate and resolve robust cross-references in Systems Biology.

Authors: Camille Laibe; Nicolas Le Novère
Journal: BMC Syst Biol Date: 2007-12-13

8. The Protein Identifier Cross-Referencing (PICR) service: reconciling protein identifiers across multiple source databases.

Authors: Richard G Côté; Philip Jones; Lennart Martens; Samuel Kerrien; Florian Reisinger; Quan Lin; Rasko Leinonen; Rolf Apweiler; Henning Hermjakob
Journal: BMC Bioinformatics Date: 2007-10-18 Impact factor: 3.169

9. iRefIndex: a consolidated protein interaction database with provenance.

Authors: Sabry Razick; George Magklaras; Ian M Donaldson
Journal: BMC Bioinformatics Date: 2008-09-30 Impact factor: 3.169

Review 10. Soil and leaf litter metaproteomics-a brief guideline from sampling to understanding.

Authors: Katharina M Keiblinger; Stephan Fuchs; Sophie Zechmeister-Boltenstern; Katharina Riedel
Journal: FEMS Microbiol Ecol Date: 2016-08-21 Impact factor: 4.194