| Literature DB >> 31198645 |
Jacob M Wozniak1,2, David J Gonzalez1,2.
Abstract
BACKGROUND: Mass-spectrometry-based proteomics is a prominent field of study that allows for the unbiased quantification of thousands of proteins from a particular sample. A key advantage of these techniques is the ability to detect protein post-translational modifications (PTMs) and localize them to specific amino acid residues. These approaches have led to many significant findings in a wide range of biological disciplines, from developmental biology to cancer and infectious diseases. However, there is a current lack of tools available to connect raw PTM site information to biologically meaningful results in a high-throughput manner. Furthermore, many of the available tools require significant programming knowledge to implement.Entities:
Keywords: Mass spectrometry; Motifs; PTMs; Post-translational modifications; Proteomics; R package
Year: 2019 PMID: 31198645 PMCID: PMC6555389 DOI: 10.7717/peerj.7046
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
PTMphinder functions and data.
|
| parses proteome database into a table with two columns: protein accession ID and sequence |
|
| localizes PTM sites in full length proteins and extracts surrounding motifs |
|
| extracts motif backgrounds from a parsed proteome database |
| Human_Uniprot_Example.txt | Uniprot database example for input into |
| Human_Uniprot_Parsed_Example.txt | parsed Uniprot database example for input into |
| phindPTMs_Input_Example.csv | phospho-proteomic dataset example for input into |
Figure 1Workflow of PTMphinder Package.
Blue boxes indicate PTMphinder functions while grey bocks indicate functions outside of PTMphinder. Pipeline simply requires the user’s proteomic dataset, a proteome database and minor, additional user-defined parameters.
phindPTMs input columns.
| Identifier | unique identifier for each modified peptide |
| Protein_ID | protein accession ID |
| Peptide_Seq | peptide sequence detected in experiment |
| Total_Sites | total number of modified sites on peptide |
| PTM_Loc | (potential) locations of PTM sites |
| PTM_Score | confidence scores of PTM localizations |
phindPTMs output columns.
| Identifier | unique identifier for each modified peptide |
| Protein_ID | protein accession ID |
| Pep_Loc | location of the PTMs in the identified peptide |
| Prot_Loc | location of the PTMs the full-length protein |
| PTM_Score | confidence scores of PTM localizations |
| Flank_Seq | flanking sequences extracted from PTMs |
| Ambiguity | ambiguity of PTMs based on input data |
| Prot_Seq | full-length protein from which motifs were extracted |
Figure 2Assessment of PTMphinder functionality.
(A) Results of running rmotif-x on flanking peptide sequences already extracted in a previous publication12. (B) Results of running rmotif-x on flanking peptide sequences extracted using PTMphinder from the raw phospho-proteomics data in the same publication. Motif enrichment scores are represented in the parentheses. Depictions of motifs were generated using WebLogo (https://weblogo.berkeley.edu/). (C) Time of motif extraction for various methods (n = 5 modified peptides).