Gelio Alves, Aleksey Y Ogurtsov, Yi-Kuo Yu.
Abstract
Querying MS/MS spectra against a database containing only proteotypic peptides reduces data analysis time due to reduction of database size. Despite the speed advantage, this search strategy is challenged by issues of statistical significance and coverage. The former requires separating systematically significant identifications from less confident identifications, while the latter arises when the underlying peptide is not present, due to single amino acid polymorphisms (SAPs) or post-translational modifications (PTMs), in the proteotypic peptide libraries searched. To address both issues simultaneously, we have extended RAId's knowledge database to include proteotypic information, utilized RAId's statistical strategy to assign statistical significance to proteotypic peptides, and modified RAId's programs to allow for consideration of proteotypic information during database searches. The extended database alleviates the coverage problem since all annotated modifications, even those that occur within proteotypic peptides, may be considered. Taking into account the likelihoods of observation, the statistical strategy of RAId provides accurate E-value assignments regardless of whether a candidate peptide is proteotypic or not. The advantage of including proteotypic information is evidenced by its superior retrieval performance when compared to regular database searches. Published by Elsevier B.V.
Year: 2010 PMID: 21055489 PMCID: PMC3186061 DOI: 10.1016/j.jprot.2010.10.005
Source DB: PubMed Journal: J Proteomics ISSN: 1874-3919 Impact factor: 4.044