| Literature DB >> 21932440 |
Johannes Griss1, María Martín, Claire O'Donovan, Rolf Apweiler, Henning Hermjakob, Juan Antonio Vizcaíno.
Abstract
The International Protein Index (IPI) database has been one of the most widely used protein databases in MS proteomics approaches. Recently, the closure of IPI in September 2011 was announced. Its recommended replacement is the new UniProt Knowledgebase (UniProtKB) "complete proteome" sets, launched in May 2011. Here, we analyze the consequences of IPI's discontinuation for human and mouse data, and the effect of its substitution with UniProtKB on two levels: (i) data already produced and (ii) newly performed experiments. To estimate the effect on existing data, we investigated how well IPI identifiers map to UniProtKB accessions. We found that 21% of human and 10% of mouse identifiers do not map to UniProtKB and would thus be "lost." To investigate the impact on new experiments, we compared the theoretical search space (i.e. the tryptic peptides) of both resources and found that it is decreased by 14.0% for human and 8.9% for mouse data through IPI's closure. An analysis on the experimental evidence for these "lost" peptides showed that the vast majority has not been identified in experiments available in the major proteomics repositories. It thus seems likely that the search space provided by UniProtKB is of higher quality than the one currently provided by IPI.Entities:
Mesh:
Year: 2011 PMID: 21932440 PMCID: PMC3556690 DOI: 10.1002/pmic.201100363
Source DB: PubMed Journal: Proteomics ISSN: 1615-9853 Impact factor: 3.984
Figure 1Schematic summary of the two analyses performed. The first (“protein analysis”) investigated the effect of the closure of IPI on stored data. The second analysis (“peptide analysis”) focused on the impact on newly performed experiments.
Figure 2Mapping result of IPI identifiers to UniProtKB accessions for the “logical” as well as the PICR mapping algorithm. This analysis represents the effect of the discontinuation of IPI on stored proteomics data. Both mapping approaches produced similar results for human and mouse data.
Figure 3Number of peptides retrieved from IPI that were not represented in the respective UniProtKB “complete proteome” build and their evidence in the three major proteomics repositories. The vast majority of peptides not found in UniProtKB have not been identified in the experiments from PRIDE, PeptideAtlas, and GPMDB.