| Literature DB >> 32443484 |
Laure Tonini1, Bashir Sadet1, Alexandre Stella1, David Bouyssié1, Jérôme Nigou1, Odile Burlet-Schiltz1, Michel Rivière1.
Abstract
To date, Mycobacterium tuberculosis (Mtb) remains the world's greatest infectious killer. The rise of multidrug-resistant strains stresses the need to identify new therapeutic targets to fight the epidemic. We previously demonstrated that bacterial protein-O-mannosylation is crucial for Mtb infectiousness, renewing the interest of the bacterial-secreted mannoproteins as potential drug-targetable virulence factors. The difficulty of inventorying the mannoprotein repertoire expressed by Mtb led us to design a stringent multi-step workflow for the reliable identification of glycosylated peptides by large-scale mass spectrometry-based proteomics. Applied to the differential analyses of glycoproteins secreted by the wild-type Mtb strain-and by its derived mutant invalidated for the protein-O-mannosylating enzyme PMTub-this approach led to the identification of not only most already known mannoproteins, but also of yet-unknown mannosylated proteins. In addition, analysis of the glycoproteome expressed by the isogenic recombinant Mtb strain overexpressing the PMTub gene revealed an unexpected mannosylation of proteins, with predicted or demonstrated functions in Mtb growth and interaction with the host cell. Since in parallel, a transient increased expression of the PMTub gene has been observed in the wild-type bacilli when infecting macrophages, our results strongly suggest that the Mtb mannoproteome may undergo adaptive regulation during infection of the host cells. Overall, our results provide deeper insights into the complexity of the repertoire of mannosylated proteins expressed by Mtb, and open the way to novel opportunities to search for still-unexploited potential therapeutic targets.Entities:
Keywords: glycoproteins; glycoproteomic; mannoproteins; mass-spectrometry; mycobacterium; protein-o-mannosyl transferase; tuberculosis
Mesh:
Substances:
Year: 2020 PMID: 32443484 PMCID: PMC7287972 DOI: 10.3390/molecules25102348
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Identification of the known mannoproteins deriving glycopeptides detected in the MtbWT culture filtrate (CF). (the putative O-glycosylation sites with the highest occupancy probability according to NetOGlyc 4.0 [42] predictions are indicated by underscored Ser or Thr residues; exp-mr; experimental mass; mr-delta: mass difference between the experimentally determined and the calculated masses; start-end: number of the glycopeptide starting and final amino acids in the protein sequence; var-mod: variable modification of the peptide; Hex: hexose; Ox: methionine oxidation; Ca: cysteine carbamidomethylation; Pr: cysteine propionamidation).
| Gene/Protein Description | Exp_Mr | Mr_Delta | Start-End | Score | Pep_Seq | Var_Mod | Ref. |
|---|---|---|---|---|---|---|---|
| RV0175 | 5175.19 | −0.023 | 100–144 | 16.97 | DCVAATQAPDAGAMSA | 3 Hex, Ox, Pr | [ |
| 3311.31 | 0.006 | 100–124 | 12.85 | DCVAATQAPDAGAMSA | 5 Hex, 2 Ox | ||
| Rv0432 | 3480.57 | 0.009 | 53–83 | 34.27 | SPAPSGLSGHDEESPGAQSLTSTLTAPDGTK | 3 Hex | [ |
| 3134.43 | 0.007 | 55–83 | 15.87 | APSGLSGHDEESPGAQSLTSTLTAPDGTK | 2 Hex | ||
| Rv1270c | 2356.01 | 0.004 | 34–51 | 17.75 | A | 4 Hex | [ |
| 2193.97 | 0.008 | 34–51 | 53.89 | ASDTAATASNGDAAMLLK | 3 Hex | ||
| 2047.91 | 0.006 | 34–51 | 82.16 | ASDTAATASNGDAAMLLK | 2 Hex, Ox | ||
| 2031.91 | 0.007 | 34–51 | 75.72 | ASDTAATASNGDAAMLLK | 2 Hex | ||
| 1885.85 | 0.006 | 34–51 | 79.85 | ASDTAATASNGDAAMLLK | 1 Hex, Ox | ||
| 1869.86 | 0.005 | 34–51 | 80.64 | ASDTAATASNGDAAMLLK | 1 Hex | ||
| RV1368 | 3110.46 | 0.010 | 42–67 | 40.23 | KP | 3 Hex | [ |
| 2948.41 | 0.007 | 42–67 | 45.9 | KPTTASSPSPGSPSPEAQQILQDSSK | 2 Hex | ||
| 2786.35 | 0.008 | 42–67 | 41.05 | KPTTASSPSPGSPSPEAQQILQDSSK | 1 Hex | ||
| 2359.11 | 0.003 | 46–67 | 11.61 | ASSPSPGSPSPEAQQILQDSSK | 1 Hex | ||
| RV1860 | 4563.39 | 0.012 | 278–321 | 22.07 | ALAESIRPLVAPPPAPAPAPAEPAPAPAPAGEVAP | 2 Hex | [ |
| 4401.33 | 0.010 | 278–321 | 29.05 | ALAESIRPLVAPPPAPAPAPAEPAPAPAPAGEVAPTPTTPTPQR | 1 Hex | ||
| 2263.07 | −0.012 | 309–325 | 20.43 | EVAPTPTTPTPQRTLPA | 3 Hex | ||
| RV2068c | 2747.30 | −0.017 | 25–47 | 17 | ARHASGARPAS | 3 Hex | [ |
| 2005.98 | 0.000 | 31–47 | 27.16 | ARPASTTLPAGADLADR | 2 Hex | ||
| 1934.94 | 0.001 | 32–47 | 28.64 | RPASTTLPAGADLADR | 2 Hex | ||
| 1778.85 | 0.005 | 33–47 | 20.07 | PASTTLPAGADLADR | 2 Hex | ||
| 1772.81 | 0.004 | 35–47 | 16.3 | STTLPAGADLADR | 3 Hex | ||
| 1610.75 | 0.000 | 35–47 | 19.21 | STTLPAGADLADR | 2 Hex | ||
| 1523.72 | 0.001 | 36–47 | 20.21 | TTLPAGADLADR | 2 Hex | ||
| RV2290 | 4998.26 | −0.006 | 47–85 | 13.09 | ATVMVEGHTHTISGVVECRTSPAVR | 6 Hex | [ |
| 1497.68 | 0.003 | 73–85 | 23.23 | TATPSESGTQTTR | 1 Hex | ||
| RV2799 | 2096.95 | 0.003 | 68–83 | 26.33 | 2 Hex, Pr | [ | |
| 2082.94 | 0.006 | 68–83 | 61.05 | SPIVATTDPSPFDPCR | 2 Hex, Ca | ||
| 1920.88 | 0.002 | 68–83 | 37.81 | SPIVATTDPSPFDPCR | 1 Hex, Ca | ||
| RV2873 | 7063.14 | −0.022 | 44–106 | 11.82 | AAPV | 6 Hex, 2Ox | [ |
| 4982.32 | 0.012 | 21–62 | 10.69 | FLAGCSSTKPVSQDTSPKPATSPAAPVTTAAMADPAADLIGR | 5 Hex, Ox | ||
| 1974.89 | 0.003 | 48–62 | 14.46 | TTAAMADPAADLIGR | 3 Hex, Ox | ||
| 1958.90 | 0.011 | 48–62 | 43.02 | TTAAMADPAADLIGR | 3 Hex | ||
| RV3491 | 2022.08 | 0.005 | 164–180 | 25.55 | QPFSLQLIGPPP | 1 Hex | [ |
| 1649.91 | 0.007 | 167–180 | 36.78 | SLQLIGPPPSPVQR | 1 Hex | ||
| RV3835 | 6066.10 | −0.001 | 362–415 | 20.03 | LNLPPIPLQLP | 3 Hex | [ |
| 1730.00 | −0.001 | 362–375 | 10.6 | LNLPPIPLQLPTPR | 1 Hex |
Identification of the novel glycopeptide detected in the MtbWT CF by the improved “neutral loss” post analytical MS2 data processing. (* and §: See Figure 1 and Figure S3 for detailed identification; for additional legend see Table 1).
| Gene / Protein Description | Exp_Mr | Mr_Delta | Start-End | Score | Sequence (Predicted Glycosylation Sites) | Var_Mod | #MS2 |
|---|---|---|---|---|---|---|---|
|
| 3615.74 | 0.007 | 38–74 | 47.14 | APAGP | 1 Hex | 11 |
| 3453.68 | −0.001 | 38–74 | 46.3 | APAGP | (1) | ||
| 3544.70 | 0.005 | 39–74 | 28.62 | PAGP | 1 Hex | 7 | |
| 3382.64 | −0.010 | 39–74 | 36.27 | PAGP | (1) | ||
|
| 2051.97 | 0.007 | 47–62 | 11.36 | 3 Hex | 1 | |
| 1889.92 | 0.006 | 47–62 | 18.28 | 2 Hex | 6 | ||
|
| 1115.61 | 0.001 | 228–236 | 13.65 | VQV | 1 Hex | 2 |
| 953.55 | −0.001 | 228–236 | 39.14 | VQV | (12) | ||
| RV1625c | 1658.73 | 0.010 | 28–37 | 16.01 | TQARAPTQHY | 3Hex, Da | 2 |
| RV1731 | 1632.82 | 0.004 | 2–14 | 26.43 | PAP | 1 Hex | 4 |
| RV1935c | 1497.68 | −0.001 | 62–73 | 16.82 | AFELAEADDTVR | 1 Hex | 2 |
| Rv2563 | 1705.82 | 0.004 | 188–202 | 12.89 | GQPTISSIGIDGMPR | 1 Hex, Ox | 3 |
| Rv3209 | 3813.80 | −0.002 | 156–186 | 12.78 | LYSRLNCSIVNTGAQTVVASTNNAIIA | 3 Hex, 3 Da, Pr | 2 |
Figure 1Description of the multi-parametric confirmation of the glycosylation of peptide T47-R62 of the lipoprotein LpqR (UniProtKB Access. # O53850; Mtb gene Rv0838) as an illustration of the approach used for high confidence identification of new Mtb glycopeptides from the LC–MS/MS (CID) data. (A) and (B) annotated CID mass spectra of the doubly charged precursor ions (M + 2H)2+ at 945.96 Da and 1026.99 Da displaying respectively the expected characteristic signature fragment ions M − 1622+, M − 3242+ or M − 1622+, M − 3242+ and M − 4862+ corresponding to the neutral loss of 1 to 3 hexose residues and supporting the identification of the di-substituted and tri-substituted glycoforms of the tryptic peptide T47-R62 of the LpqR lipoprotein; (C) sequence of the LpqR tryptic peptide T47-R62 reporting the peptide sequence fragment ions observed in the MS2; (D) extracted ion chromatogram (with of an ion mass tolerance of ±5 ppm) of the different glycoforms detected for the LpqR peptide T47-R62 validating the coherence of the elution time as inversely related to the glycosylation degree of the peptide; (E) NetOGlyc 4.0 predictions of the O-glycosylation sites of the LpqR sequence assessing the highest glycosylation probability of the Threonine triad T47-T49 and of the Serine S51 of the peptide T47-R62 (NetOGlyc: [42]). Additional data confirming the glycosylation of the Rv0315 and Rv1411c proteins are available in Supplementary Figure S3.
Figure 2Variability of the Mtb H37Rv secreted glyco-proteome determination based on glycopeptide characterization: Venn diagram comparing the sets of putative mannoproteins identified with our SDS–PAGE based MS shotgun glycoproteomic approach (duplicate analyses Mtb-C1 and Mtb-C2) and with the pre-analytical lectin based glycopeptide enrichment used by Smith et al. These different analyses identify a total of 23 putative Mtb glycoproteins, but with a rather limited core of 5 glycoproteins shared by the 3 samples and a similar proportion of sample specific candidates (between 3 and 4). 2 by 2 analyses show very comparable results and do not evidence major quantitative divergences between the two analytical approaches used. (Novel glycoproteins are noted in bold).
Glycosylated peptides identified in the independent analysis of the MtbWT CF. (Newly identified sequences are in bold; *: See Figure S4 for detailed identification and Table 1 for additional legend).
| Peptide Sequence | Exper. Mr | Expec. Mr | Modification | Score | Gene: Protein Description |
|---|---|---|---|---|---|
|
| 2915.33 | 2752.28 | Hex, Da | 19.64 | Rv0237: beta-glucosidase |
| APAGPTPAPAAPAAATGGLLFHDEFDGPAGSVPDPSK |
|
|
|
|
|
| SPAPSGLSGHDEESPGAQSLTSTLTAPDGTK |
|
|
|
|
|
|
| 2205.93 | 1719.76 | 3 Hex | 47.62 | Rv0928: phosphate-binding protein pstS 3 |
| ASDTAATASNGDAAMLLK |
|
|
|
|
|
| TLSGDLTTNPTAATGNVK |
|
|
|
|
|
| ASTTLPAGADLADR |
|
|
|
|
|
| TATPSESGTQTTR |
|
|
|
|
|
| GAPSTAGPCEIVPNGTPAPK | 2082.98 | 1862.91 | Hex, Ca, Da | 30.21 | Rv2394: gamma-glutamyltransferase |
| VPVSPTASDPALLAEIR | 2059.06 | 1734.94 | 2 Hex | 35.43 | Rv2945c: lipoprotein lPpx |
| QPFSLQLIGPPPSPVQR |
|
|
|
|
|
| GETTTAAGTTASPGAASGPK | 2380.04 | 1731.82 | 4 Hex | 36.99 | Rv3763: lipoprotein lpqH |
| LNLPPIPLQLPTPR |
|
|
|
|
|
Peptides glycoforms detected in cell–lysate protein extracts of the MtbWT and PMTub-overexpressing strain. Specific cell-associated glycosylated proteins are noted in bold and their occurrence in the MtbWT and /or in the complemented strain overexpressing the PMTub gene (Cp) are reported in the right column. (see Table 1 for additional legend).
| Gene | Protein Description | exp_mr | mr_Delta (ppm) | Start-End | Score | Peptide Sequence | Var_Mod | Wt/Cp |
|---|---|---|---|---|---|---|---|---|
|
|
| 1833.94 | 4.39 | 151–166 | 28.45 | (R)ELDTQSGAARPVVTTK(L) | Hex | Wt/Cp |
| 1671.88 | 2.09 | 151–166 | 54.66 | (R)ELDTQSGAARPVVTTK(L) | Wt/Cp | |||
|
|
| 1667.84 | 0.02 | 138–151 | 28.6 | (R)GLNDENPPTAIPLR(H) | Hex | Wt/Cp |
| 1505.78 | 0.57 | 138–151 | 49.36 | (R)GLNDENPPTAIPLR(H) | Wt/Cp | |||
| RV0432 | superoxide dismutase [Cu-Zn] | 2966.33 | −1.00 | 57–83 | 30.15 | (P)SGLSGHDEESPGAQSLTSTLTAPDGTK(V) | 2 Hex | Wt/Cp |
| 3480.56 | 0.06 | 53–83 | 18.28 | (G)SPAPSGLSGHDEESPGAQSLTSTLTAPDGTK(V) | 3 Hex | Wt/Cp | ||
|
|
| 1662.80 | −2.39 | 214–228 | 50.29 | (K)VTLSTTGNAQEPAGR(F) | Hex | -/Cp |
| 1501.74 | 0.11 | 214–228 | 76.54 | (K)VTLSTTGNAQEPAGR(F) | Da | -/Cp | ||
| RV1270c | lipoprotein lprA | 2356.01 | −0.18 | 34–51 | 75.21 | (K)ASDTAATASNGDAAMLLK(Q) | 4 Hex, Da | Wt/Cp |
| 2193.96 | 1.41 | 34–51 | 66.26 | (K)ASDTAATASNGDAAMLLK(Q) | 3 Hex, Da | -/Cp | ||
| 2030.93 | 2.89 | 34–51 | 79.32 | (K)ASDTAATASNGDAAMLLK(Q) | 2 Hex | Wt/Cp | ||
| 1869.85 | −0.08 | 34–51 | 62.35 | (K)ASDTAATASNGDAAMLLK(Q) | Hex, Da | Wt/Cp | ||
| 1868.87 | 3.90 | 34–51 | 67.93 | (K)ASDTAATASNGDAAMLLK(Q) | Hex | Wt/Cp | ||
| 1707.80 | −0.75 | 34–51 | 85.51 | (K)ASDTAATASNGDAAMLLK(Q) | Da | Wt/Cp | ||
| RV1368 | lipoprotein lprF | 3272.50 | −0.54 | 42–67 | 43.73 | (K)KPTTASSPSPGSPSPEAQQILQDSSK(A) | 4 Hex | Wt/Cp |
| 3110.45 | −0.19 | 42–67 | 47.88 | (K)KPTTASSPSPGSPSPEAQQILQDSSK(A) | 3 Hex | Wt/Cp | ||
| 2948.39 | −2.28 | 42–67 | 61.82 | (K)KPTTASSPSPGSPSPEAQQILQDSSK(A) | 2 Hex | Wt/Cp | ||
| 2786.34 | −0.74 | 42–67 | 89.33 | (K)KPTTASSPSPGSPSPEAQQILQDSSK(A) | Hex | Wt/Cp | ||
| 2624.29 | −0.54 | 42–67 | 72.17 | (K)KPTTASSPSPGSPSPEAQQILQDSSK(A) | Wt/Cp | |||
| RV1411c | lipoprotein lprG | 2084.00 | −1.07 | 75–92 | 67.17 | (K)TLSGDLTTNPTAATGNVK(L) | 2 Hex | -/Cp |
| 1921.95 | 0.15 | 75–92 | 96.46 | (K)TLSGDLTTNPTAATGNVK(L) | Hex | Wt/Cp | ||
| 1759.89 | −0.41 | 75–92 | 98.98 | (K)TLSGDLTTNPTAATGNVK(L) | Wt/Cp | |||
| 1277.66 | −0.19 | 228–236 | 11.3 | (K)VQVTKPPVS(-) | 2 Hex | Wt/- | ||
| 1115.61 | −1.23 | 228–236 | 24.38 | (K)VQVTKPPVS(-) | Hex | Wt/Cp | ||
| 953.55 | −2.24 | 228–236 | 40.29 | (K)VQVTKPPVS(-) | Wt/Cp | |||
|
|
| 3563.73 | −1.49 | 26–57 | 11 | (K)SQPAVAPTGDAAAATQVPAGQTVPAQLQFSAK(T) | 3 Hex | Wt/Cp |
| 3401.68 | −0.35 | 26–57 | 32.98 | (K)SQPAVAPTGDAAAATQVPAGQTVPAQLQFSAK(T) | 2 Hex | Wt/Cp | ||
| 3239.63 | 0.45 | 26–57 | 49.38 | (K)SQPAVAPTGDAAAATQVPAGQTVPAQLQFSAK(T) | Hex | -/Cp | ||
| 3077.58 | 0.03 | 26–57 | 42.18 | (K)SQPAVAPTGDAAAATQVPAGQTVPAQLQFSAK(T) | Wt/Cp | |||
|
|
| 1937.95 | −0.11 | 369–384 | 21.41 | (R)TSAPPPPPPADAVELR(V) | 2 Hex | Wt/- |
| 1613.84 | −2.36 | 369–384 | 22.97 | (R)TSAPPPPPPADAVELR(V) | Wt/Cp | |||
|
|
| 2889.32 | −1.73 | 55–81 | 30.26 | (R)LGEASGDLASDSPAMPRPTTVENDAAR(W) | Hex | Wt/- |
| 2727.28 | 0.69 | 55–81 | 69.44 | (R)LGEASGDLASDSPAMPRPTTVENDAAR(W) | Wt/Cp | |||
| RV2068c | beta-lactamase | 1778.84 | 0.05 | 33–47 | 13.4 | (R)PASTTLPAGADLADR(F) | 2 Hex | -/Cp |
| 1454.74 | 0.11 | 33–47 | 61.69 | (R)PASTTLPAGADLADR(F) | -/Cp | |||
|
|
| 1459.71 | −1.44 | 240–253 | 52.1 | (R)GEALPAGGTTATPR(P) | Hex | Wt/Cp |
| 1297.66 | −0.89 | 240–253 | 88.98 | (R)GEALPAGGTTATPR(P) | Wt/Cp | |||
| RV2799 | hypothetical protein | 2082.93 | 2.19 | 68–83 | 43.65 | (K)SPIVATTDPSPFDPCR(D) | 2 Hex, Ca | Wt/Cp |
|
|
| 2076.05 | −0.45 | 236–255 | 56.32 | (K)IKEIVGDVPGIDSAAATATS(-) | Hex | Wt/- |
| 1913.99 | −0.25 | 236–255 | 64.95 | (K)IKEIVGDVPGIDSAAATATS(-) | Wt/Cp | |||
| 1851.97 | −0.65 | 202–218 | 39.2 | (K)VDGLAAAVNVHATPTVR(V) | Hex | Wt/- | ||
| 1689.91 | −1.01 | 202–218 | 82.14 | (K)VDGLAAAVNVHATPTVR(V) | Wt/- | |||
|
|
| 2354.19 | 0.16 | 274–295 | 44.23 | (K)GISPLGCWPGSSALPTIAVPGR(-) | Hex, Ca | Wt/Cp |
| 2192.14 | 0.77 | 274–295 | 79.41 | (K)GISPLGCWPGSSALPTIAVPGR(-) | Ca | Wt/Cp | ||
| RV3491 | hypothetical protein | 2022.08 | −0.37 | 164–180 | 38.76 | (K)QPFSLQLIGPPPSPVQR(Y) | Hex | -/Cp |
| 1649.90 | 1.07 | 167–180 | 19.71 | (F)SLQLIGPPPSPVQR(Y) | Hex | Wt/Cp | ||
| 1487.84 | −3.18 | 167–180 | 32.07 | (F)SLQLIGPPPSPVQR(Y) | Wt/- | |||
|
|
| 1998.96 | 2.22 | 164–182 | 47.69 | (K)AGQGSVMVPISAGLATPHE(-) | Hex, Ox | -/Cp |
| 1982.96 | 0.11 | 164–182 | 56.18 | (K)AGQGSVMVPISAGLATPHE(-) | Hex | Wt/- | ||
| 1820.91 | −0.63 | 164–182 | 110.3 | (K)AGQGSVMVPISAGLATPHE(-) | Wt/Cp | |||
|
|
| 2219.11 | −0.33 | 195–215 | 64.2 | (K)VISISEASGAAPTHEAPAPPR(G) | Hex | Wt/Cp |
| 2057.06 | 3.52 | 195–215 | 91.34 | (K)VISISEASGAAPTHEAPAPPR(G) | Wt/Cp | |||
| 2454.19 | 0.84 | 99–120 | 28.12 | (R)VTVAPQYSNPEPAGTATITGYR(A) | Hex | Wt/- | ||
| 2292.14 | −0.79 | 99–120 | 93.23 | (R)VTVAPQYSNPEPAGTATITGYR(A) | Wt/- | |||
| RV3763 | lipoprotein lpqH | 3623.49 | 1.18 | 27–51 | 17.12 | (K)STTGSGETTTAAGTTASPGAASGPK(V) | 9 Hex | Wt/Cp |
| 3461.43 | 0.06 | 27–51 | 32.84 | (K)STTGSGETTTAAGTTASPGAASGPK(V) | 8 Hex | Wt/Cp | ||
| 3299.38 | 1.35 | 27–51 | 38.84 | (K)STTGSGETTTAAGTTASPGAASGPK(V) | 7 Hex | Wt/Cp | ||
| 3137.33 | −0.11 | 27–51 | 19.27 | (K)STTGSGETTTAAGTTASPGAASGPK(V) | 6 Hex | Wt/Cp | ||
| 2813.22 | 0.64 | 27–51 | 31.75 | (K)STTGSGETTTAAGTTASPGAASGPK(V) | 4 Hex | Wt/Cp | ||
| 2651.17 | 1.27 | 27–51 | 51.82 | (K)STTGSGETTTAAGTTASPGAASGPK(V) | 3 Hex | Wt/Cp | ||
| 2489.12 | 1.19 | 27–51 | 69.7 | (K)STTGSGETTTAAGTTASPGAASGPK(V) | 2 Hex | Wt/Cp | ||
| 2327.06 | 1.42 | 27–51 | 72.66 | (K)STTGSGETTTAAGTTASPGAASGPK(V) | Hex | -/Cp | ||
| 2165.00 | −1.59 | 27–51 | 111.1 | (K)STTGSGETTTAAGTTASPGAASGPK(V) | Wt/Cp | |||
| RV3835 | hypothetical protein | 2030.98 | 0.87 | 72–87 | 13.38 | (R)LAGYIASNPVPSTGAK(I) | 3 Hex | -/Cp |
| 1706.87 | 0.35 | 72–87 | 26.07 | (R)LAGYIASNPVPSTGAK(I) | Hex | -/Cp | ||
|
|
| 1978.95 | 0.12 | 454–470 | 43.68 | (K)DAALLEHDTLPADPSPR(K) | Hex | Wt/Cp |
| 1816.89 | −1.10 | 454–470 | 60.59 | (K)DAALLEHDTLPADPSPR(K) | Wt/Cp |
Figure 3Analysis of the cell-associated glycoproteome of MtbWT reveals the presence of envelope specific glycosylated proteins. (A) Venn diagram comparing the mannoprotein repertoire identified from the culture filtrate and cell–lysate protein extracts from two independent culture MtbWT (Mtb-C1; Mtb-C2). Hypothetical proteins are noted in italic, while essential proteins for in vivo and in vitro are in bold and bold-underlined, respectively; (B) Topographical analysis of the putative glycosylated peptide sequences of the newly identified envelope associated glycoproteins of MtbWT localizes the glycosylated motif systematically in an extracellular segment. (Protein sequences are symbolized by blue lines; gray zones correspond to putative intracellular domains according TMHMM 2.0; the membrane spanning peptides are represented by hatched motifs; putative lipoproteins are represented with 3 terminal membrane anchored acyl groups figured by curved lines; the putative O-glycosylation sites with the highest occupancy probability according to NetOGlyc 4.0 predictions are indicated by underscored Ser or Thr residues).
Figure 4Influence of the PMTub increased expression on the mannoprotein repertoire expressed by Mtb. (A): constitutive higher expression level of the PMTub Rv1002c gene in vitro in the complemented ΔRv1002c: pBlaF*Rv1002c strain relative to that of the MtbWT strain during the exponential (DO = 0.5) and the stationary (DO = 1) growth phases (see Table S4 for raw data). (B) Repertoire of glycoproteins identified in the secretome of the PMTub-overexpressing strain under the control of the pBlaF* promotor. Venn diagram showing the set of mannoproteins shared between the Mtb complemented strain expressing the PMTub gene (Cp) and the two independent analyses of the MtbWT strain secretome (Mtb-C1 (blue); Mtb-C2 (green)). (Novel glycoproteins are noted in bold while complemented strain specific proteins are in italic; Culture-F: culture filtrate; Cell-L: cell lysate).
New glycosylated peptides detected in the secretome of the Mtb complemented strain overexpressing the PMTub gene. (Hex-num: hexose number, for additional legend see Table 1).
| Gene | prot_desc | pep_exp_mr | start-pep_end | pep_score | pep_seq | pep_var_mod |
|---|---|---|---|---|---|---|
| RV0822c | hypothetical protein | 1637.79 | 522-534 | 16.01 | LGNTPSTPPTTTK | 2 Hex |
| 1475.74 | 522-534 | 35.88 | LGNTPSTPPTTTK | Hex | ||
| 1313.68 | 522-534 | 32.55 | LGNTPSTPPTTTK | |||
| RV1157c | hypothetical protein | 1294.64 | 211-224 | 23.21 | AAAPAPASAAPAPA | Hex |
| 1132.59 | 211-224 | 22.01 | AAAPAPASAAPAPA | |||
| 1365.68 | 211-225 | 10.82 | AAAPAPASAAPAPAA | Hex | ||
| RV2094c | TatA/E family twin arginine-targeting protein translocase | 1503.74 | 55-66 | 22.44 | SIETPTPVQSQR | Hex |
| 1341.69 | 55-66 | 52.68 | SIETPTPVQSQR | |||
| RV2220 | glutamine synthetase 1 | 1340.63 | 2-11 | 21.43 | TEKTPDDVFK | Hex |
| 1178.58 | 2-11 | 55.91 | TEKTPDDVFK | |||
| RV3244c | lipoprotein lpqB | 3156.58 | 29-53 | 30.03 | GTVERPVPSNLPKPSPGMDPDVLLR | 3 Hex |
| 2994.52 | 29-53 | 22.99 | GTVERPVPSNLPKPSPGMDPDVLLR | 2 Hex | ||
| RV3413c | hypothetical protein | 1646.78 | 285-299 | 30.05 | PAGQPAPETPVSPTH | Hex |
| 1484.73 | 285-299 | 50.33 | PAGQPAPETPVSPTH | |||
| RV3705A | hypothetical protein | 1860.89 | 115-129 | 15.18 | VGPTGPGPTTAPARP | 3 Hex |
| 1698.83 | 115-129 | 30.6 | VGPTGPGPTTAPARP | 2 Hex | ||
| RV3706c | hypothetical protein | 2981.43 | 81-106 | 8.08 | AVRPGPGPGGPGQVPSSVSPPATPAP | 4 Hex |
| 2819.37 | 81-106 | 25.94 | AVRPGPGPGGPGQVPSSVSPPATPAP | 3 Hex | ||
| 2657.32 | 81-106 | 26.69 | AVRPGPGPGGPGQVPSSVSPPATPAP | 2 Hex | ||
| 2910.39 | 82-106 | 16.99 | VRPGPGPGGPGQVPSSVSPPATPAP | 4 Hex | ||
| 2748.34 | 82-106 | 20.4 | VRPGPGPGGPGQVPSSVSPPATPAP | 3 Hex | ||
| 2586.28 | 82-106 | 25.37 | VRPGPGPGGPGQVPSSVSPPATPAP | 2 Hex |
Figure 5Regulation of the expression of the protein-O-mannosyl transferase PMTub (Rv1002c) gene in the MtbWT during the infection. Quantitative polymerase chain reaction (qPCR) measurement of the relative amount (compared to the housekeeping gene RpoB) of the Rv1002c amplicons in the MtbWT and in the complemented strain overexpressing constitutively the PMTub (Cp), during the infection of the murine alveolar macrophage cell line MH-S at a MOI of 10 (numerical data are available in Table S6).
Figure 6Topological distribution, essentiality and putative physiological and immunomodulatory properties of the Mtb mannoproteins identified. Mannoproteins exclusively detected in the CF and in the cell lysate (CL) extracts are depicted on the top and the bottom of the cartoon, respectively, while those detected in both fractions are represented in the intermediate space. The secreted mannoproteins detected in the CF must rather be involved in the interaction with the host immune response effectors while those more tightly associated to the envelope and detected in the CL must contribute to the bacterial fitness and to the adaptive response to the host microbicidal processes. The stars highlight the detection in the culture filtrate of glycopeptides arising from mannoproteins annotated as membrane proteins. Reported phosphorylated proteins are labeled with blue diamond and filled blue diamond when the identified glycopeptide matches with a phosphopeptide reported in literature.