| Literature DB >> 28970579 |
Raghu Bhagavat1, Heung-Bok Kim2, Chang-Yub Kim2, Thomas C Terwilliger2, Dolly Mehta1, Narayanaswamy Srinivasan3, Nagasuma Chandra4.
Abstract
Nucleoside tri-phosphates (NTP) form an important class of small molecule ligands that participate in, and are essential to a large number of biological processes. Here, we seek to identify the NTP binding proteome (NTPome) in M. tuberculosis (M.tb), a deadly pathogen. Identifying the NTPome is useful not only for gaining functional insights of the individual proteins but also for identifying useful drug targets. From an earlier study, we had structural models of M.tb at a proteome scale from which a set of 13,858 small molecule binding pockets were identified. We use a set of NTP binding sub-structural motifs derived from a previous study and scan the M.tb pocketome, and find that 1,768 proteins or 43% of the proteome can theoretically bind NTP ligands. Using an experimental proteomics approach involving dye-ligand affinity chromatography, we confirm NTP binding to 47 different proteins, of which 4 are hypothetical proteins. Our analysis also provides the precise list of binding site residues in each case, and the probable ligand binding pose. As the list includes a number of known and potential drug targets, the identification of NTP binding can directly facilitate structure-based drug design of these targets.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28970579 PMCID: PMC5624866 DOI: 10.1038/s41598-017-12471-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(A) Pie-chart showing the different tuberculist functional classes for M.tb proteins that were identified in the NTPome. It was indeed interesting to see a major portion of the hypotheticals that were identified, which could serve as a possible function annotation at the molecular level of NTP binding for these proteins. Same is with the PE/PPE family proteins which are not known to bind NTP. (B) Distribution of identified NTP motifs in the M.tb proteome. Although 24 motifs were seen across the different proteins in NTPome, there was a preference for certain motifs, namely 3PNL-like motif, 3QUR-like, 2ZKJ-like, 3KEU-like and 3SBX-like motifs. 3PNL, 3QUR, 2ZKJ, 3KEU and 3SBX refer to the PDB codes of the representative proteins in the study of identifying NTP motifs that was carried out previously, and the number in each sector refers to the total number of M.tb proteins identified for that particular motif.
Figure 2Pair-wise superpositions for 15 pairs of proteins that fall under the category of hypotheticals. The reference NTP motif is shown in red sticks and sites belonging to M.tb proteins are shown in blue stick representations in each panel, with the ligand shown in ball and stick model. It can be seen that there are not only identical residue matches between the sites in some cases, but also, similar geometrical orientations of the side chain of amino acid residues, further strengthening the possibility of NTP binding by these proteins. It has to be noted that the proteins in the pair do not share any relatedness in their sequences and folds.
Figure 3(A) 2D-gel image of M.tb cytosolic DLAC fraction eluted by ATP. The identified M.tb proteins are labeled on the representative protein spots with their Rv IDs. The Rv IDs of the rest of spots are listed in Table 1. This image shows part of original 2-D gel in the range of MW 100-10 kDa and pI 6–4.5, which covers most of the DLAC eluted proteins by ATP. (B) 1D SDS-PAGE of NTP ligands interaction analysis by DLAC using purified Rv2780. Each lane shows the fraction obtained during the process of DLAC including recombinant Rv2780 loaded on to Cibacron Blue F3GA dye resin (Rv2780 input), the fraction after washing with column buffer (wash) and Rv2780 protein eluted by five NTP ligands (Adenosine, ATP, ADP, AMP and GTP) showing the molecular weight marker on left. The Rv2780 band shown in each ligand lane indicates the interaction of applied ligand. From the darkness of bands –protein amount eluted by each ligand, the degree of each ligand’s interaction with Rv2780 can be also obtained.
showing the list of M.tb proteins that were verified for NTP binding using DLAC. The different NTP ligands that showed binding for each protein are mentioned in the last column. The four proteins belonging to the hypothetical category are shown in bold-face.
| Rv ID | Protein Name | NTP ligands identified |
|---|---|---|
| Rv0054 | Single-strand binding protein Ssb (helix-destabilizing protein) | GTP |
| Rv0119 | Probable fatty-acid-CoA ligase FadD7 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase) | AMP ADP GTP |
| Rv0350 | Probable chaperone protein DnaK (heat shock protein 70) (heat shock 70 kDa protein) (HSP70) | AMP ATP GTP |
| Rv0357c | Probable adenylosuccinate synthetase PurA (imp-aspartate ligase) (ADSS) (ampsase) | ADP ATP GTP |
| Rv0391 | Probable O-succinylhomoserine sulfhydrylase MetZ (OSH sulfhydrylase) | GTP |
| Rv0440 | 60 kDa chaperonin 2 GroEL2 (protein CPN60-2) (GroEL protein 2) (65 kDa antigen) (heat shock protein 65) (cell wall protein A) (antigen A) | ATP |
| Rv0467 | Isocitrate lyase Icl (isocitrase) (isocitratase) | AMP ADP GTP |
| Rv0672 | Probable acyl-CoA dehydrogenase FadE8 | ATP GTP |
| Rv0859 | Possible acyl-CoA thiolase FadA | AMP ADP ATP GTP |
| Rv1007c | Methionyl-tRNA synthetase MetS (MetRS) (methionine-tRNA ligase) | AMP ATP GTP |
| Rv1017c | Probable ribose-phosphate pyrophosphokinase PrsA (phosphoribosyl pyrophosphate synthetase) (PRPP synthetase) | AMP ADP ATP GTP |
| Rv1023 | Probable enolase Eno | ATP |
| Rv1133c | Probable 5-methyltetrahydropteroyltriglutamate–homocysteine methyltransferase MetE (methionine synthase) | ATP GTP |
| Rv1327c | Probable glucanase GlgE | GTP |
| Rv1383 | Probable carbamoyl-phosphate synthase small chain CarA (carbamoyl-phosphate synthetase glutamine chain) | ADP ATP |
| Rv1391 | Probable DNA/pantothenate metabolism flavoprotein homolog Dfp | ATP GTP |
| Rv1436 | Probable glyceraldehyde 3-phosphate dehydrogenase Gap (GAPDH) | AMP ADP ATP GTP |
| Rv1559 | Probable threonine dehydratase IlvA | ADP ATP GTP |
| Rv1688 | Possible 3-methyladenine DNA glycosylase Mpg | ADP ATP GTP |
| Rv1843c | Probable inosine-5′-monophosphate dehydrogenase GuaB1(imp dehydrogenase) (IMPDH) (IMPD) | AMP ADP ATP GMP GTP |
| Rv1908c | Catalase-peroxidase-peroxynitritase T KatG | AMP ADP GTP |
| Rv2029c | 6-phosphofructokinase PfkB (phosphohexokinase) (phosphofructokinase) | ADP GTP |
| Rv2031c | Heat shock protein HspX (alpha-crystallin homolog) (14 kDa antigen) (HSP16.3) | AMP ADP ATP GTP |
| Rv2145c | Diviva family protein Wag31 | AMP ADP ATP GTP |
| Rv2215 | DlaT | GTP |
| Rv2461c | Probable ATP-dependent CLP protease proteolytic subunit 1 ClpP1 (endopeptidase CLP) | ATP GTP |
| Rv2605c | Probable acyl-CoA thioesterase II TesB2 (TEII) | GTP |
| Rv2688c | Antibiotic-transport ATP-binding protein ABC transporter | ATP GTP |
| Rv2780 | Secreted L-alanine dehydrogenase Ald (40 kDa antigen) (TB43) | AMP ADP ATP GTP |
| Rv2783c | Bifunctional protein polyribonucleotide nucleotidyltransferase GpsI: guanosine pentaphosphate synthetase + polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) (pnpase) | AMP ADP ATP GTP |
| Rv2855 | NADPH-dependent mycothiol reductase Mtr | ATP |
| Rv2858c | Probable aldehyde dehydrogenase AldC | AMP ADP ATP GTP |
| Rv2996c | Probable D-3-phosphoglycerate dehydrogenase SerA1 (PGDH) | ADP |
| Rv3028c | Probable electron transfer flavoprotein (alpha-subunit) FixB (alpha-ETF) (electron transfer flavoprotein large subunit) (ETFLS) | GTP |
| Rv3273 | Probable transmembrane carbonic anhydrase (carbonate dehydratase) (carbonic dehydratase) | ADP ATP GTP |
| Rv3280 | Probable propionyl-CoA carboxylase beta chain 5 AccD5 (pccase) (propanoyl-CoA:carbon dioxide ligase) | AMP |
| Rv3285 | Probable bifunctional protein acetyl-/propionyl-coenzyme A carboxylase (alpha chain) AccA3: biotin carboxylase + biotin carboxyl carrier protein (BCCP) | AMP ADP ATP GTP |
| Rv3336c | Probable tryptophanyl-tRNA synthetase TrpS (tryptophan-tRNA ligase) (TRPRS) (tryptophan translase) | ATP |
| Rv3389c | Probable 3-hydroxyacyl-thioester dehydratase HtdY | ATP |
| Rv3401 | Conserved protein | AMP |
| Rv3457c | Probable DNA-directed RNA polymerase (alpha chain) RpoA (transcriptase alpha chain) (RNA polymerase alpha subunit) (DNA-directed RNA nucleotidyltransferase) | ATP GTP |
Figure 4Pair-wise alignments of 15 pairs of proteins that were successfully tested for NTP binding using DLAC technique are shown. It has to be noted that the two proteins in a pair are not closely related by their sequences, but share a significant similarity at their binding sites. In all panels, the reference NTP motif is shown as red sticks and the aligned pocket residues of M.tb protein are shown in blue sticks with the ligand shown in ball and stick representation colored by atom type. With this information, we can also get more details on the possible mode of NTP binding in these proteins along with the exact locations on the protein surface defined by a set of pocket residues.
Figure 5(A) A Venn-diagram representation showing the common number of proteins that have been identified by our study in comparison with other studies. In total, 1768 proteins were identified for NTP binding, out of which 218 are common to 317 proteins from Ansong et al. 134 out of 176 from Wolfe et al. 177 out of 245 from PATRIC DB and 87 out of 123 from TBDB. While Ansong et al. and Wolfe et al. are experimental based studies, TBDB and PATRIC databases are computational based studies. (B) Table showing an all-vs-all comparison of our predicted list of 1768 in the NTPome with the 4 different comparator groups. The number in each box indicates the number of common proteins identified between the pair.
List of proteins that belong to Structural Genomics initiatives that were identified in the NTPome. The binding site similarity scores of the individual proteins with their respective query NTP motif are also shown along with the ligand(s) matched.
| Sl. no | RvID | Protein Name | PMSmax | PMSmin | Ligands |
|---|---|---|---|---|---|
| 1 | Rv0674 | Conserved hypothetical protein | 0.41 | 0.82 | ATP, GTP |
| 2 | Rv0813c | Conserved protein | 0.49 | 0.83 | ATP |
| 3 | Rv1340 | Probable ribonuclease RphA (RNase PH) (tRNA nucleotidyltransferase) | 0.58 | 0.76 | TTP |
| 4 | Rv1626 | Probable two-component system transcriptional regulator regulatory proteins | 0.51 | 0.89 | ATP, GTP |
| 5 | Rv1825 | Conserved protein | 0.46 | 0.82 | UDP |
| 6 | Rv1873 | Conserved hypothetical protein | 0.48 | 0.78 | ATP |
| 7 | Rv2074 | Possible pyridoxamine 5′-phosphate oxidase (PNP/PMP oxidase) (pyridoxinephosphate oxidase) (PNPOX) (pyridoxine 5′-phosphate oxidase) | 0.52 | 0.79 | TTP, UDP |
| 8 | Rv2717c | Conserved protein | 0.41 | 0.79 | GTP, ATP |
20 example folds in M.tb for fold-site associations that can be implicated for NTP binding.
| Sl. No | Fold name | RvID | Protein name |
|---|---|---|---|
| 1 | 6-phosphogluconate dehydrogenase C-terminal domain-like | Rv2573 | Function unknown |
| 2 | Cytochrome P450 | Rv3059/Rv0327c | Probable cytochrome P450/ Possible cytochrome P450 135A1 Cyp135A1 |
| 3 | LolA-like prokaryotic lipoproteins and lipoprotein localization factors | Rv1270c | Lipoprotein LprA |
| 4 | Domain of alpha and beta subunits of F1 ATP synthase-like | Rv1293 | Diaminopimelate decarboxylase LysA (DAP decarboxylase) |
| 5 | Amidase signature (AS) enzymes | Rv3375 | Probable amidase AmiD (acylamidase) (acylase) |
| 6 | ClpP/crotonase | Rv2486 | Probable enoyl-CoA hydratase EchA14 (enoyl hydrase) |
| 7 | Rhodanese/Cell cycle control phosphatase | Rv2291 | Probable thiosulfate sulfurtransferase SseB |
| 8 | alpha/beta-Hydrolases | Rv1683 | Possible bifunctional enzyme; long-chain acyl-CoA synthase and lipase. |
| 9 | Chelatase-like | Rv0265c | Probable periplasmic iron-transport lipoprotein |
| 10 | Profilin-like | Rv1354c | Conserved hypothetical protein |
| 11 | Pentein, beta/alpha-propeller | Rv2323c | Function unknown |
| 12 | Nitrite and sulphite reductase 4Fe-4S domain-like | Rv2391 | Ferredoxin-dependent sulfite reductase SirA |
| 13 | Peptide deformylase | Rv0429c | Probable polypeptide deformylase Def (PDF) |
| 14 | FAH | Rv3536c | Probable hydratase |
| 15 | Thioesterase/thiol ester dehydrase-isomerase | Rv1532c | Conserved hypothetical protein |
| 16 | Zincin-like | Rv2367c | Conserved hypothetical protein |
| 17 | MFS general substrate transporter | Rv3331 | Probable sugar-transport integral membrane protein SugI |
| 18 | Rhomboid-like | Rv1337 | Probable integral membrane protein |
| 19 | MetI-like | Rv0929 | Phosphate-transport integral membrane ABC transporter PstC2 |
| 20 | Ketopantoate reductase PanE | Rv2573 | Conserved hypothetical protein |