| Literature DB >> 20615486 |
Juan Antonio Vizcaíno1, Joseph M Foster, Lennart Martens.
Abstract
Despite the fact that data deposition is not a generalised fact yet in the field of proteomics, several mass spectrometry (MS) based proteomics repositories are publicly available for the scientific community. The main existing resources are: the Global Proteome Machine Database (GPMDB), PeptideAtlas, the PRoteomics IDEntifications database (PRIDE), Tranche, and NCBI Peptidome. In this review the capabilities of each of these will be described, paying special attention to four key properties: data types stored, applicable data submission strategies, supported formats, and available data mining and visualization tools. Additionally, the data contents from model organisms will be enumerated for each resource. There are other valuable smaller and/or more specialized repositories but they will not be covered in this review. Finally, the concept behind the ProteomeXchange consortium, a collaborative effort among the main resources in the field, will be introduced.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20615486 PMCID: PMC2958306 DOI: 10.1016/j.jprot.2010.06.008
Source DB: PubMed Journal: J Proteomics ISSN: 1874-3919 Impact factor: 4.044
Main characteristics of the major proteomics repositories and databases.
| PRIDE | GPMDB | PeptideAtlas | Tranche | Peptidome | |
|---|---|---|---|---|---|
| Reprocessing of the data | No | Yes (X!Tandem and others) | Yes (TPP) | No | No |
| Editorial control | No | Yes | Yes | No | No |
| Level of annotation | Detailed | Very basic | Basic | Very basic | Detailed |
| Proteotypic peptides | No | Yes | Yes | No | No |
| Support for targeted approaches | No | Yes | Yes | No | No |
| Data submission | PRIDE Converter and FTP upload | Web interface | FTP upload | Web interface | FTP upload |
| Submission of raw (binary) data | No | No | Yes | Yes | No |
| Access to deposited data | Web interface, BioMart, web service, and direct download of files | Web interface and direct download of files | Web interface and direct download of files | Java Web Start application | Web interface and direct download of files |
Fig. 1(A) A pie chart representing the protein identification contributions of model organisms in PRIDE. * The asterisk represents 12 species: G. gallus, Zea mays, Synechocystis sp., C. elegans, E. coli, Xenopus laevis, Emiliania huxleyi, Macaca mulatta, N. tabacum, Ustilago maydis, Ashbya gossypii and Neurospora crassa. (B) A pie chart representing the peptide contributions of model organisms in PRIDE. * The asterisk represents 7 species: Synechocystis sp., G. gallus, E. coli, C. elegans, M. mulatta, E. huxleyi and N. tabacum.
Fig. 2Model organism representation by number of data depositions in PRIDE. * The asterisk represents 8 species: X. laevis, U. maydis, O. sativa, N. tabacum, N. crassa, M. mulatta, E. huxlei and A. gossypii.