| Literature DB >> 30814666 |
Alemayehu Godana Birhanu1,2, Solomon Abebe Yimer3,4, Shewit Kalayou5, Tahira Riaz3, Ephrem Debebe Zegeye6, Carol Holm-Hansen7, Gunnstein Norheim4,7, Abraham Aseffa8, Markos Abebe8, Tone Tønjum9,10.
Abstract
Multiple regulatory mechanisms including post-translational modifications (PTMs) confer complexity to the simpler genomes and proteomes of Mycobacterium tuberculosis (Mtb). PTMs such as glycosylation play a significant role in Mtb adaptive processes. The glycoproteomic patterns of clinical isolates of the Mycobacterium tuberculosis complex (MTBC) representing the lineages 3, 4, 5 and 7 were characterized by mass spectrometry. A total of 2944 glycosylation events were discovered in 1325 proteins. This data set represents the highest number of glycosylated proteins identified in Mtb to date. O-glycosylation constituted 83% of the events identified, while 17% of the sites were N-glycosylated. This is the first report on N-linked protein glycosylation in Mtb and in Gram-positive bacteria. Collectively, the bulk of Mtb glycoproteins are involved in cell envelope biosynthesis, fatty acid and lipid metabolism, two-component systems, and pathogen-host interaction that are either surface exposed or located in the cell wall. Quantitative glycoproteomic analysis revealed that 101 sites on 67 proteins involved in Mtb fitness and survival were differentially glycosylated between the four lineages, among which 64% were cell envelope and membrane proteins. The differential glycosylation pattern may contribute to phenotypic variabilities across Mtb lineages. The study identified several clinically important membrane-associated glycolipoproteins that are relevant for diagnostics as well as for drug and vaccine discovery.Entities:
Year: 2019 PMID: 30814666 PMCID: PMC6393673 DOI: 10.1038/s41598-019-39654-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Abundance of glycosylation and glycoproteins in MTBC. Venn diagram showing the number of glycosylation events identified among the four lineages (N = 2944) (A), and the number of glycoproteins identified among the four lineages (N = 1325) (B), the number of N- and O-glycosylation events identified (N = 2944) (C), and number of glycosylation events identified per individual glycoprotein (D).
Figure 2The 3D models, acetylation sites and representative spectra of LpqH (A,B) and AcpM (C,D). The glycosylation sites in LpqH are clustered in the N-terminus between residues 27–48 of the 159 amino acid protein (A). The four sites, T40, T41, S43 and S48 were located in this binding domain of LpqH. In AcpM, three sites in the helix, S41, S43 and T51, were found in the carrier protein (CP) domain profile (C). (he = hexose).
Figure 3Glycosylation motif analysis. The N- and O-glycosylation motif generated from the high confidence identification indicates a higher likelihood of basic R, hydrophobic P, A, V and L, interspersed with Polar G; and some hydrophilic S and T around the O-glycosylation site. In addition, the glycosylation motifs seem to cluster predominantly at the N-terminus as indicated by the black boxes on the left. The height of each amino acid indicates its relative frequency at that specific position.
Figure 4Gene Ontology analysis of Mtb glycoproteins. The gene ontology analysis showed that the majority of the glycoproteins identified were localized in the cell wall and plasma membrane while lipid homeostasis, fatty acid metabolism, and response to the host immune system were among the enriched biological processes.
Virulence-associated membrane-bound glycoproteins, proteins involved in regulation, antimicrobial resistance (AMR) and chaperone proteins identified in MTBC.
| Function | Process | Glycoproteins involved |
|---|---|---|
| Cell envelope synthesis | Mycolic acid synthesis | AcpM, MmaA1, MmaA2, MmaA3, PcaA, Pks13, FbpB and FbpC |
| PDIM synthesis and transport | PpsA, PpsC, PpsD, PpsE, PapA1, FadD26, FadD28, LppX, DrrC and MmpL7 | |
| PG synthesis | PBPs) PbpA (Rv0016c), PbpB (Rv2163c), PonA1 (Rv0050) and LdtA (Rv0116c), MurA, MurE, MurF, LprQ, FtsW, MviN, GlmS, GlmM, DacB1 and Wag31 | |
| Capsule biosynthesis | GlgM (Rv1212c), GlgB (Rv1326c), GlgE (Rv1327c), TreZ (Rv1562c) and MalQ (Rv1781c) | |
| AG | AftD (Rv0236c), DprE1, EmbC and EmbR | |
| Lipoglycans (LM, LAM and PI) | PimB (Rv2188c), EmbC, Rv1459c and Rv2181 | |
| Membrane transport proteins | Sec | SecA1, SecY, SecD, SecE2, SecF |
| Tat | TatB | |
| MmpL lipid transporters | MmpL1, MmpL3, MmpL4, MmpL5, MmpL6, MmpL8, MmpL9, MmpL10, MmpL11, MmpL12 and MmpL13b | |
| ATP-binding cassette (ABC) | DrrC, DppA, DppC, DppD, FecB, UgpC, UgpE, ProZ, CydD, MalQ, Rv2326c, Rv2041c, Rv1680, Rv3197, Rv0987, Rv1281c, Rv3092c, Rv1747, Rv1273c, Rv1739c, Rv2564, Rv0073 | |
| Type-VII secretion | EccA1, EccB1, EccCb1, EccA2, EccB2, EccC2, EccD3, EccB3, EccC4, EccB4, EccA5, EccC5, | |
| Others | CpnT, NanT, IrtA, IrtB, ArsC | |
| MCE family proteins | Mce2A, Mce2D, Mce1E/LprK, Mce3R, Mce1A, Mce4C, Mce1C, Mce3C, Mce2R, Mce1R, Mce1B, Mce2F, Mce2B, Mce3D and Apa* | |
| Regulatory proteins | DevS-DosT/DosR, PhoR, WhiB3, WhiB4, WhiB5, WhiB7, TcrA, PrrA/PrrB, MtrA/MtrB, KdpD, KdpC, MoxR3, NarL, EmbR, PdtaR, GlnB, Mce1R, KstR, BlaR, BlaI and OxyS, Rv1353c, Rv0890c, Rv3095, Rv0494, Rv0043c, RamB, Rv0081, Rv0339c | |
| Chaperones | GroS, DnaK, GroeL1, GroeL2, ClpB, ClpX, Hsp | |
| Role in AMR | BlaC, KatG, RpoC, KasA, AhpD, FadE24, AcpM, IniB, IniC, EthA, OpcA, Wag31, RpoB, EmbR, EmbC, FabG1, RpsL, Mdh, Ndh, Alr, MtrAB, Rv2994, Rv0194, LprG, GyrA and GyrB | |
| Potential drug targets | Mur enzymes, DrrC, PknD, MmpL3, GlgB, GlgE, Hpt, PbpA, PbpB, PonA1 and LdtA |
Figure 5Protein-protein interaction networks of identified glycoproteins generated by Cytoscape. Networks are involved in lipid Metabolism & cell wall synthesis (A,D,E,H,I,J), protein synthesis (B), host-pathogen interaction (C), chaperone proteins (G), regulators (F,K), phthiocerol dimycocerosate (PDIM) synthesis (L), (sugar) transporters (M) and DNA replication, repair and recombination (N).
Representative biological activities elicited by glycolipoproteins identified from MTBC.
| Role or function | Glycolipoprotein(s) |
|---|---|
| Antigenicity | LpqH, LprG, LppX |
| Adhesion and cell invasion | LpqH, LprG, LprK, LprN, LppA, LpqG, LppX, MCE |
| Required for growth | LpqH, LprK, SugA, LpqY, LppY, LpqB |
| Signal transduction | LprF, LprA, LprG, LppR, LppX, LpqB |
| Role in AMR | LprG, BlaC |
| Cell wall metabolism | PbpB, PbpA, PonA1, LprQ, LprK, LppW, LppX, LpqY, LpqB |
| ACB transport system | UgpE, UgpC, Rv2041c, LpqY, MalQ, DppA, FecB |
| Degradation | LpqP, LpqI, Rv2672, LpqL |
| Other enzymes and metabolic activities | GgtB, Rv0526, DsbF, LppZ, LpqD, SodB, Rv0526 |
| Unknown function | Rv3693, Rv0679c, LppG, LpqU, LpqJ, LppO |
Figure 6Profile of the 101 differentially glycosylated proteins in MTBC. Hierarchical clustering of differentially glycosylated proteins (A), proteins hyper-glycosylated in lineage 3 and lineage 4 strains (B), proteins hyper-glycosylated in lineage 3, lineage 4 and lineage 5 strains (C), proteins hyper-glycosylated in lineage 5 and lineage 7 strains (D), proteins hyper-glycosylated in lineage 7 strain (E).