| Literature DB >> 26418581 |
Michael R Shortreed1, Craig D Wenger1, Brian L Frey1, Gloria M Sheynkman1, Mark Scalf1, Mark P Keller1, Alan D Attie1, Lloyd M Smith1.
Abstract
Bottom-up proteomics database search algorithms used for peptide identification cannot comprehensively identify post-translational modifications (PTMs) in a single-pass because of high false discovery rates (FDRs). A new approach to database searching enables global PTM (G-PTM) identification by exclusively looking for curated PTMs, thereby avoiding the FDR penalty experienced during conventional variable modification searches. We identified over 2200 unique, high-confidence modified peptides comprising 26 different PTM types in a single-pass database search.Entities:
Keywords: G-PTM; Jurkat; Morpheus; PTM; acetylation; database search; phosphorylation; post-translational modification; proteomics
Mesh:
Substances:
Year: 2015 PMID: 26418581 PMCID: PMC4642219 DOI: 10.1021/acs.jproteome.5b00599
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 4.466
Figure 1Comparison of three proteomics database search strategies: no PTMs considered (FASTA), conventional variable modification for phosphorylation (FASTA vP), and consideration of a long list of residue- and protein-specific PTMs of numerous different types (G-PTM). Search space size increases by 67-fold for FASTA vP, but only by 10% for G-PTM.
Numbers of Target and Identified PTMs Using the G-PTM Search Strategy. These Data Are for Human Jurkat Cells; The Analogous Data for Mouse Are in Supplemental Table S5. The “Database PTM Positions” Column Shows That a Total of 22 540 Residue Positions from 104 PTM Types Were Included in the Human UniProt XML File. The G-PTM Search Identified Peptides with Modifications at 1969 of Those Positions, across the 26 Observed PTM Types Listed Here (See Supplementary Table S2 for Entire List). aETA Is an Abbreviation for “Ethanolamine”
Figure 2Database search results and measures of their confidences for PTM peptide assignments. Results are shown for the three search types for both human and mouse proteomics data sets. The tabulated results are given for “All” (unmodified and modified peptides) and for “Modified” only peptides. The FDR for the “Modified” peptides was calculated from the numbers of decoy and target identifications meeting the global 1% FDR cutoff (i.e., the FDR for “All” peptide identifications). The FDR values for FASTA vP are >1%, indicating substantially poorer confidence, whereas the FDRs are <1% for the G-PTM searches, indicating high confidence in these PTM peptide assignments. The PEP plots corroborate this result by showing higher error probabilities for phosphorylated peptides in the FASTA vP search but lower error probabilities for modified peptides identified with G-PTM.