| Literature DB >> 28894154 |
Iain J Berry1,2, Veronica M Jarocki1,2, Jessica L Tacchi1,2, Benjamin B A Raymond1,2, Michael Widjaja1,2, Matthew P Padula1,2, Steven P Djordjevic3,4.
Abstract
Proteolytic processing alters protein function. Here we present the first systems-wide analysis of endoproteolysis in the genome-reduced pathogen Mycoplasma hyopneumoniae. 669 N-terminal peptides from 164 proteins were identified, demonstrating that functionally diverse proteins are processed, more than half of which 75 (53%) were accessible on the cell surface. Multiple cleavage sites were characterised, but cleavage with arginine in P1 predominated. Putative functions for a subset of cleaved fragments were assigned by affinity chromatography using heparin, actin, plasminogen and fibronectin as bait. Binding affinity was correlated with the number of cleavages in a protein, indicating that novel binding motifs are exposed, and protein disorder increases, after a cleavage event. Glyceraldehyde 3-phosphate dehydrogenase was used as a model protein to demonstrate this. We define the rules governing methionine excision, show that several aminopeptidases are involved, and propose that through processing, genome-reduced organisms can expand protein function.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28894154 PMCID: PMC5593965 DOI: 10.1038/s41598-017-11296-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Sequence analysis of iMet removed N-terminal peptides.
| P1′ Residue | P2′ Residue | ||
|---|---|---|---|
| S | 16 | K | 16 |
| A | 14 | N | 9 |
| T | 4 | I | 4 |
| *N | 4 | Y | 4 |
| *K | 3 | A | 3 |
| *D | 1 | L | 3 |
| *F | 1 | Q | 3 |
| G | 1 | R | 2 |
| *L | 1 | S | 2 |
| V | 1 | V | 1 |
| *Q | 1 | ||
Distribution of amino acids in the P1′ and P2′ positions of iMet removed N-terminal peptides. *Denotes amino acids that deviate from typical NME.
N-terminal dimethylated peptides detected with both iMet present and with iMet removed.
| Gene names | Protein names | Sequence |
|---|---|---|
| MHJ_0127 | 50S ribosomal protein L21 | (M)FAIIKTGGR |
| MHJ_0078 | NADH oxidase | (M)KIISIGTNHAGTSFLR |
| MHJ_0365 | Putative ABC transporter permease protein | (M)KKDNENLDYQNADFEYEIKKIR |
| MHJ_0667 | Transcription elongation factor GreA | (M)KNIVDDKILLTQQKLEEIEKELEHLINVER |
| MHJ_0189 | 50S ribosomal protein L4 | (M)NKISEISIQSQKTENLVKFNANDDL PKSLFEQKEPHFQAIFDSILSER |
| MHJ_0029 | DNA topoisomerase (ATP-hydrolyzing) | (M)NKSLDSVINSQLEKILAEKFIR |
| MHJ_0181 | 30S ribosomal protein S17 | (M)NNLTLEKKAQTR |
| MHJ_0650 | 30S ribosomal protein S9 | (M)NQPELSYYGTGR |
| MHJ_0125 | Putative aminopeptidase | (M)SILEKMKKYCDIDGMSR |
| MHJ_0080 | Purine-nucleoside phosphorylase | (M)TAHIEAKKNEIAPIVLMPGDPLR |
| MHJ_0506 | Phosphate acetyltransferase | (M)TYQEYLQAR |
| MHJ_0110 | Adenine phosphoribosyltransferase | (M)QINLEKYIR |
| MHJ_0611 | Phosphocarrier protein HPr | (M)VSFSAVVIDKLGFHAR |
Figure 1rMHJ_0169 resolves as a monomer of its theoretical mass and shows activity with Co2+. (A) rMHJ_0169 resolved as a single band at ~30 kDa after SDS-PAGE. (B) Tryptic peptides identified by LC-MS/MS peptide map to MHJ_0619 (underlined and bold) and comprise 61% sequence coverage. (C) rMHJ_0169 was most efficient at cleaving Met-AMC at pH 6.3 and in the presence of 10 µM of Co2+.
Figure 2Amino acid sequences of peptide substrates and MALDI spectra of MAP activity against MAKEII peptide. (A) Synthetic peptide sequences and their masses with and without N-terminal methionine (iMet). (B) Control MALDI-TOF MS/MS spectra of MAKEII in the absence rMHJ_0169. A prominent peak was seen at 704.85 Da which equates to the peptides intact mass. (C) MALDI-TOF MS/MS spectra of MAKEII after 60 mins incubation with rMHJ_0169 and Co2+ cofactor at pH 7.3. Parent peak was absent (↓) and a new, prominent peak was seen at 573.68 Da, the mass of AKEII, indicating the removal of M by rMHJ_0169. *denotes a matrix artifact observed in both control and test.
Figure 3Comparison of iMet removal from peptides by GAP, LAP, and MAP. MAP was most efficient at removing iMet from MAKEII but had no activity against other tested peptides. LAP aminopeptidase activity was strongest in the order MKNIVD < MKKIAI < MAKEII < MKIISI. GAP aminopeptidase activity was strongest in the order MKIISI < MKNIVD < MAKEII < MEIKVD < MDKKLE. MEIKVD and MDKKLE were poor substrates for all tested aminopeptidases.
M. hyopneumoniae surface proteins, found with N-terminally labelled peptides, with bioinformatic analyses and affinity chromatography results.
| Gene names | Protein names | N-term | NME | Proteolytic Cleavages | Cytosolic Prediction | TMD | Hep | Act | Fib | Pla |
|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||
| MHJ_0133 | L-lactate dehydrogenase | ✓ | 7 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
| MHJ_0506 | Phosphate acetyltransferase | ✓ | ✓ | 4 | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0505 | Acetate kinase | 1 | ✓ | ✓ | ✓ | ✓ | ||||
| MHJ_0503 | Dihydrolipoamide acetyltransferase | ✓ | 5 | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0487 | Phosphoglycerate kinase | ✓ | 5 | ✓ | ✓ | ✓ | ✓ | |||
| MHJ_0426 | Transketolase | ✓ | 2 | ✓ | ✓ | ✓ | ||||
| MHJ_0349 | HIT-like protein | ✓ | 1 | ✓ | ✓ | ✓ | ||||
| MHJ_0242 | Enolase | ✓ | 6 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
| MHJ_0157 | Phosphopentomutase | ✓ | 1 | ✓ | ✓ | |||||
| MHJ_0122 | Pyruvate kinase | ✓ | 5 | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0112 | Pyruvate dehydrogenase | 13 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0111 | Pyruvate dehydrogenase E1-alpha subunit | 12 | ✓ | ✓ | ✓ | ✓ | ✓ | |||
| MHJ_0107 | ATP-dependent 6-phosphofructokinase | ✓ | 1 | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0031 | Glyceraldehyde-3-phosphate dehydrogenase | ✓ | 4 | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0145 | Trigger factor | 3 | ✓ | ✓ | ✓ | |||||
| MHJ_0631 | Putative 5’-nucleotidase | 1 | ✓ | ✓ | ✓ | ✓ | ||||
| MHJ_0071 | Adenine phosphoribosyltransferase | ✓ | ✓ | 4 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| MHJ_0436 | 3-hexulose-6-phosphate synthase | 1 | ✓ | |||||||
| MHJ_0078 | NADH oxidase | ✓ | 7 | ✓ | ✓ | ✓ | ✓ | ✓ | ||
|
| ||||||||||
| MHJ_0511 | 46 kDa surface antigen | 3 | ✓ | ✓ | ✓ | ✓ | ✓ | |||
| MHJ_0656 | Putative prolipoprotein p65 | 2 | ✓ | ✓ | ✓ | ✓ | ✓ | |||
| MHJ_0606 | ABC transporter xylose-binding lipoprotein | 2 | ✓ | ✓ | ✓ | ✓ | ✓ | |||
| MHJ_0363 | Putative lipoprotein | 4 | ✓ | ✓ | ✓ | ✓ | ||||
| MHJ_0362 | Putative lipoprotein | 1 | ✓ | ✓ | ✓ | ✓ | ✓ | |||
|
| ||||||||||
| MHJ_0663 | Putative adhesin like-protein P146 | 17 | ✓ | ✓ | ✓ | ✓ | ✓ | |||
| MHJ_0662 | Mhp683 homologue | ✓ | 22 | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0493 | Putative P216 surface protein | ✓ | 44 | ✓ | ✓ | ✓ | ✓ | |||
| MHJ_0195 | P102 | 1 | NC | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0194 | P97/P123 | 13 | NC | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0494 | Putative p76 membrane protein | 28 | NC | ✓ | ✓ | ✓ | ✓ | ✓ | ||
|
| ||||||||||
| MHJ_0659 | XAA-PRO aminopeptidase | 2 | ✓ | ✓ | ||||||
| MHJ_0525 | Lon protease | 4 | ✓ | ✓ | ✓ | ✓ | ||||
| MHJ_0522 | Oligoendopeptidase F | ✓ | 3 | ✓ | ✓ | |||||
| MHJ_0461 | Leucine aminopeptidase | ✓ | 2 | ✓ | ✓ | ✓ | ||||
| MHJ_0202 | ATP-dependent zinc metalloprotease FtsH | 5 | ✓ | ✓ | ✓ | ✓ | ||||
| MHJ_0125 | Glutamyl aminopeptidase | ✓ | 8 | ✓ | ✓ | ✓ | ||||
|
| ||||||||||
| MHJ_0577 | 30S ribosomal protein S4 | 9 | ✓ | ✓ | ✓ | ✓ | ||||
| MHJ_0287 | 30S ribosomal protein S6 | 6 | NC | ✓ | ✓ | ✓ | ||||
| MHJ_0190 | 50S ribosomal protein L3 | 6 | NC | ✓ | ✓ | |||||
| MHJ_0187 | 50S ribosomal protein L2 | ✓ | 8 | NC | ✓ | ✓ | ||||
| MHJ_0183 | 50S ribosomal protein L16 | ✓ | 1 | ✓ | ✓ | ✓ | ✓ | |||
| MHJ_0179 | 50S ribosomal protein L24 | 3 | NC | ✓ | ✓ | ✓ | ||||
| MHJ_0176 | 30S ribosomal protein S8 | 1 | ✓ | ✓ | ✓ | |||||
| MHJ_0175 | 50S ribosomal protein L6 | 4 | ✓ | ✓ | ✓ | |||||
| MHJ_0165 | 30S ribosomal protein S11 | 1 | NC | ✓ | ✓ | ✓ | ||||
| MHJ_0128 | 50S ribosomal protein L27 | 3 | NC | ✓ | ✓ | ✓ | ||||
| MHJ_0127 | 50S ribosomal protein L21 | ✓ | ✓ | 1 | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0051 | 30S ribosomal protein S2 | 1 | NC | ✓ | ✓ | ✓ | ||||
|
| ||||||||||
| MHJ_0535 | Ribosome recycling factor | 1 | ✓ | ✓ | ||||||
| MHJ_0667 | Transcription elongation factor GreA | ✓ | ✓ | 6 | ✓ | |||||
| MHJ_0618 | DNA-directed RNA polymerase subunit beta | ✓ | 3 | ✓ | ✓ | ✓ | ✓ | |||
| MHJ_0617 | DNA-directed RNA polymerase subunit beta | 4 | ✓ | ✓ | ✓ | |||||
| MHJ_0067 | Bacterial nucleoid DNA-binding protein | 1 | NC | ✓ | ✓ | |||||
|
| ||||||||||
| MHJ_0611 | Phosphocarrier protein HPr | ✓ | ✓ | 1 | ✓ | |||||
| MHJ_0469 | Phosphoenolpyruvate-protein phosphotransferase | 2 | ✓ | ✓ | ✓ | |||||
|
| ||||||||||
| MHJ_0227 | Periplasmic sugar-binding protein | 1 | NC | ✓ | ✓ | ✓ | ✓ | |||
|
| ||||||||||
| MHJ_0524 | Elongation factor Tu | ✓ | 26 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
| MHJ_0071 | Elongation factor G | 3 | ✓ | ✓ | ||||||
| MHJ_0064 | Chaperone protein DnaJ | ✓ | 3 | NC | ✓ | ✓ | ✓ | |||
| MHJ_0063 | Chaperone protein DnaK | ✓ | 20 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
| MHJ_0052 | Elongation factor Ts | ✓ | 1 | ✓ | ✓ | |||||
|
| ||||||||||
| MHJ_0504 | Dihydrolipoyl dehydrogenase | ✓ | 5 | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0380 | Putative thioredoxin | 4 | ✓ | ✓ | ||||||
| MHJ_0228 | Putative myo-inositol 2-dehydrogenase | ✓ | 1 | NC | ✓ | |||||
| MHJ_0222 | Putative myo-inositol 2-dehydrogenase | ✓ | 3 | ✓ | ✓ | |||||
| MHJ_0219 | Putative methylmalonate-semialdehyde dehydrogenase | 3 | ✓ | ✓ | ✓ | |||||
|
| ||||||||||
| MHJ_0486 | Uncharacterized protein | 2 | ✓ | ✓ | ✓ | |||||
| MHJ_0373 | Uncharacterized protein | 3 | ✓ | ✓ | ||||||
| MHJ_0369 | Uncharacterized protein | 2 | NC | ✓ | ✓ | ✓ | ✓ | |||
| MHJ_0326 | Uncharacterized protein | ✓ | 5 | NC | ✓ | ✓ | ✓ | ✓ | ||
| MHJ_0212 | Uncharacterized protein | 10 | NC | ✓ | ✓ | ✓ | ✓ | |||
| MHJ_0009 | Uncharacterized protein | 3 | ✓ | ✓ | ✓ | |||||
*N-term column indicates sequences detected beginning at the predicted Open Reading Frame starting with the iMet residue. NME indicates sequences were detected with the initiating methionine removed. Proteolytic Cleavage column indicates the number of N-terminal peptides mapped to sequences starting from any other position in the protein. Cytosolic protein prediction made by PSORTb, with mention made to proteins secreted by non-classical (NC) mechanisms predicted by SecretomeP. Presence of transmembrane domains (TMDs) was predicted by TMPred. Final four columns indicate whether the protein was found in heparin (Hep), actin (Act), fibronectin (Fib) and/or plasminogen (Pla) affinity chromatography elutions.
Specificity matrix of cleavage events present in surface exposed M. hyopneumoniae proteins.
| Amino acid | P4 | P3 | P2 | P1 | P1′ | P2′ | P3′ | P4′ |
|---|---|---|---|---|---|---|---|---|
| Glycine (G) | 18 | 23 | 22 | 8 | 18 | 19 | 19 | 12 |
| Alanine (A) | 24 | 23 | 32 | 3 | 19 | 32 | 26 | 24 |
| Proline (P) | 13 | 12 | 22 | 4 | 0 | 20 | 18 | 22 |
| Valine (V) | 21 | 17 | 23 | 6 | 23 | 19 | 20 | 24 |
| Leucine (L) | 35 | 35 | 31 | 4 | 25 | 36 | 38 | 28 |
| Isoleucine (I) | 26 | 32 | 27 | 1 | 21 | 31 | 20 | 24 |
| Methionine (M) | 10 | 6 | 2 | 1 | 4 | 7 | 3 | 2 |
| Phenylalanine (F) | 16 | 17 | 12 | 8 | 10 | 14 | 20 | 12 |
| Tyrosine (Y) | 5 | 11 | 9 | 1 | 13 | 16 | 10 | 8 |
| Tryptophan (W) | 1 | 4 | 4 | 2 | 2 | 2 | 1 | 3 |
| Serine (S) | 1 | 28 | 19 | 9 |
| 11 | 16 | 25 |
| Threonine (T) | 20 | 16 | 18 | 0 |
| 15 | 18 | 26 |
| Asparagine (N) | 21 | 19 | 18 | 30 | 24 | 18 | 24 | 21 |
| Glutamine (Q) | 13 | 20 | 23 | 7 | 19 | 18 | 15 | 19 |
| Aspartic Acid (D) | 21 | 17 | 21 | 2 | 6 | 9 | 20 | 14 |
| Glutamic Acid (E) | 29 | 15 | 13 | 3 | 12 | 15 | 21 | 29 |
| Lysine (K) | 31 | 32 | 25 | 11 |
| 37 | 36 | 32 |
| Arginine (R) | 15 | 3 | 6 |
| 8 | 10 | 4 | 1 |
| Histidine (H) | 1 | 3 | 5 | 3 | 12 | 4 | 3 | 7 |
Figure 4Cleavage site sequence logo for surface exposed M. hyopneumoniae proteins. The most prevalent proteolytic activity appears to be non-specific trypsin-like cleavages at arginine (R).
Figure 5Predicted functions of endoproteolytically cleaved surface proteins. The most common category of endoproteolytically cleaved proteins found on the surface of M. hyopneumoniae were metabolic (25%) and ribosomal proteins (17%), followed by those listed as uncharacterised (9%).
Figure 6Number of cleaved surface proteins within functional groups identified in affinity chromatography columns. Adhesins and lipoproteins were most frequently recovered from columns coupled with heparin, actin, fibronectin and plasminogen. Almost all metabolic proteins were retained during heparin-agarose chromatography and on columns coupled with plasminogen, but only ~50% were retained on columns coupled with actin and fibronectin. All proteases bound heparin, none bound actin, 17% bound fibronectin and 67% bound plasminogen. The vast majority of ribosomal proteins (92%) interacted with both actin and plasminogen but only 23% with heparin.
Dimethyl-labelled peptides identified in M. hyopneumoniae GAPDH.
| Fragment | Peptide Sequence | E-value |
|---|---|---|
| F2 | 156C↓157TTNALAPLVNALDKEFGINHGFMTTIHAYTADQR190↓L191 | 1.1E−2 |
| 158T↓159TNALAPLVNALDKEFGINHGFMTTIHAYTADQR190↓L191 | 3.0E−4 | |
| F3 | 180T↓181TIHAYTADQR190↓L191 | 4.8E−3 |
| F4 | 189Q↓190RLQDAPHGDLR200↓R201 | 0.01 |
| 190R↓191 LQDAPHGDLRR201↓A202 | 5.4E−4 |
Figure 7Bioinformatic analysis of full-length M. hyopneumoniae GAPDH and fragments. The top black line represents full-length M. hyopneumoniae GAPDH, and black lines beneath it represent fragments generated by cleavages (blue downward arrows) detected by dimethyl labelled peptides. Colored circles represent predicted P:P interaction sites (red circles for sites within conserved amino acids, blue circles for sites within unconserved amino acids). Green triangles represent predicted nucleotide binding sites. Blue boxes represent putative binding regions, and yellow zigzags represent disordered regions. Bioinformatic predictions show that the fragments become disordered post-cleavage and have additional interaction sites that are concentrated at the newly formed termini. F1 constitutes the NAD-binding domain.