| Literature DB >> 30934878 |
Mariella Hurtado Silva1, Iain J Berry2,3, Natalie Strange4, Steven P Djordjevic5, Matthew P Padula6.
Abstract
Methods for analyzing the terminal sequences of proteins have been refined over the previous decade; however, few studies have evaluated the quality of the data that have been produced from those methodologies. While performing global N-terminal labelling on bacteria, we observed that the labelling was not complete and investigated whether this was a common occurrence. We assessed the completeness of labelling in a selection of existing, publicly available N-terminomics datasets and empirically determined that amine-based labelling chemistry does not achieve complete labelling and potentially has issues with labelling amine groups at sequence-specific residues. This finding led us to conduct a thorough review of the historical literature that showed that this is not an unexpected finding, with numerous publications reporting incomplete labelling. These findings have implications for the quantitation of N-terminal peptides and the biological interpretations of these data.Entities:
Keywords: amine labelling; mass spectrometry; terminomics
Year: 2019 PMID: 30934878 PMCID: PMC6631386 DOI: 10.3390/proteomes7020011
Source DB: PubMed Journal: Proteomes ISSN: 2227-7382
Figure 1Schematic for the various methods for the production of proteoforms. Final protein products may be modified during transcription or during translation at the ribosome. Alternatively, nascent polypeptide chains may be modified after translation with a myriad of post-translational modifications. Once translated, the protein sequence can often require further modifications to perform a specialized function, generating proteoforms that vary from the original expressed protein [2,3,23]. Post-translational modifications such as phosphorylation, a reversible chemical addition to the protein, or proteolysis, a permanent hydrolysis event removing amino acid(s) from the polypeptide chain, cause important functional changes in proteins [24,25].
Figure 2Dimethylation reaction implemented in the UTS Proteomics Core Facility with examples of incomplete labelling. (A) Theoretical reductive dimethylation reaction shown, which will attach two methyl groups to every prime amine in a protein sample (N-terminus and lysine residues). However, preliminary experimental results of the dimethylation process indicate that the current method is not modifying all primary amine groups in complex samples. (B) Unpublished data of incomplete labelling occurring in a model protein sample (bovine serum albumin). The N-terminal sequence that was obtained indicated that the protein amino acid sequence began with aspartic acid (Letter symbol, D), which was confirmed with bovine serum albumin data from UniProt (accession number: P02769). Identical peptide sequences containing lysine residues have been sequenced by mass spectrometry; however, the first peptide, indicated in the red rectangle, has been identified with only the lysine dimethylated. (C) The second sequence from the same mass spectrometry experiment has a dimethyl label on both the N-terminal amine (the aspartic acid residue) and lysine residue.
Published reaction conditions of reductive methylation protocols.
| Reference (Year) | Reaction Conditions | Reactant Concentrations | Significant Observations |
|---|---|---|---|
| Friedman et al. [ | 4–16 h (room temperature) | ~11 mM NaBH3CN | Modification of lysine resides ranged from 40–90% using different aldehyde reagents, between protein molecules and different amino acid residues |
| Jentoft et al. [ | 2–24 h (22 °C) | 20 mM NaBH3CN | 80–90% dimethyl conversion of lysine residues with a 6 fold excess of formaldehyde |
| Hsu et al. [ | Sodium acetate buffer | ~22 mM NaBH3CN | Observation of immonium ion signal with dimethyl labelling |
| Krusemark et al. [ | 2 h, room temperature | 30 mM Pyridine-BH3 (reducing agent) | 4 model proteins containing various abundance of amine groups, dimethyl labelled to completeness |
| Boersema et al. [ | 1 h, room temperature | ~22 mM NaBH3CN | (protocol paper) |
| Kleifeld et al. [ | 4 h—overnight incubation at 37 °C | 20 mM NaBH3CN | (protocol paper) |
| Jhan et al. [ | 30 s–2h, room temperature | 1.4–85 mM NaBH3CN | Accessibility of primary amines on the protein greatly affects dimethylation efficiency |
Meta-analysis results of dimethyl labelling studies [72,73,74,75].
| PRIDE Dataset Identifier | FDR PEAKS Generated (%) | Duplicate Peptide Sequences Detected | Duplicate Sequences with Complete Labelling | Complete Labelling (%) | Duplicate Sequences with Partial Labelling | Partial Labelling (%) | Duplicate Sequences with No Dimethyl Label | Unlabeled (%) | Total Partial and Unlabeled Duplicate Sequences | Total Partial and Unlabeled (%) |
|---|---|---|---|---|---|---|---|---|---|---|
| PXD002785 | 1.7 | 6658 | 5454 | 81.92 | 1161 | 17.44 | 43 | 0.65 | 1204 | 18.08 |
| PRD000055 | 0.6 | 5395 | 5062 | 93.83 | 315 | 5.84 | 18 | 0.33 | 333 | 6.17 |
| PXD005920 | 1.5 | 3269 | 2847 | 87.09 | 404 | 12.36 | 18 | 0.55 | 422 | 12.91 |
| PXD003298 | 1.6 | 3531 | 2893 | 81.93 | 584 | 16.54 | 54 | 1.53 | 638 | 18.07 |
| PXD004654 | 3.0 | 6293 | 5498 | 87.37 | 715 | 11.36 | 80 | 1.27 | 795 | 12.63 |