| Literature DB >> 32042379 |
Sundeep Chaitanya Vedithi1, Carlos H M Rodrigues2,3, Stephanie Portelli2,3, Marcin J Skwark1, Madhusmita Das4, David B Ascher1,2,3, Tom L Blundell1, Sony Malhotra1.
Abstract
Rifampin resistance in leprosy may remain undetected due to the lack of rapid and effective diagnostic tools. A quick and reliable method is essential to determine the impacts of emerging detrimental mutations in the drug targets. The functional consequences of missense mutations in the β-subunit of RNA polymerase (RNAP) in Mycobacterium leprae (M. leprae) contribute to phenotypic resistance to rifampin in leprosy. Here, we report in-silico saturation mutagenesis of all residues in the β-subunit of RNAP to all other 19 amino acid types (generating 21,394 mutations for 1126 residues) and predict their impacts on overall thermodynamic stability, on interactions at subunit interfaces, and on β-subunit-RNA and rifampin affinities (only for the rifampin binding site) using state-of-the-art structure, sequence and normal mode analysis-based methods. Mutations in the conserved residues that line the active-site cleft show largely destabilizing effects, resulting in increased relative solvent accessibility and a concomitant decrease in residue-depth (the extent to which a residue is buried in the protein structure space) of the mutant residues. The mutations at residue positions S437, G459, H451, P489, K884 and H1035 are identified as extremely detrimental as they induce highly destabilizing effects on the overall protein stability, and nucleic acid and rifampin affinities. Destabilizing effects were predicted for all the clinically/experimentally identified rifampin-resistant mutations in M. leprae indicating that this model can be used as a surveillance tool to monitor emerging detrimental mutations that destabilise RNAP-rifampin interactions and confer rifampin resistance in leprosy. AUTHOREntities:
Keywords: In-silico Saturation Mutagenesis; Mutation Coolspots; Mycobacterium leprae; RNA Polymerase; Rifampin; Thermodynamic stability
Year: 2020 PMID: 32042379 PMCID: PMC7000446 DOI: 10.1016/j.csbj.2020.01.002
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1[A] Methodology and study design. [B] A lollipop plot with stability predictions for mutations reported in the literature and are known to confer rifampin resistance in Leprosy.
List of servers used in the computational analysis:
| Si No: | Name of web server | Function | Reference | Submission parameters |
|---|---|---|---|---|
| 1 | mCSM | Predict protein stability changes due to mutations. | Model PDB file, mutation and chain id. | |
| 2 | SDM | Predict protein stability changes due to mutations. | Model PDB file, mutation and chain id. | |
| 3 | mCSM-PPI | Predict stability of protein–protein interfaces due to mutations. | Model PDB file, mutation and chain id. | |
| 4 | mCSM-NA2 | Predict stability of protein-nucleic acid interactions due to mutations | Model PDB file, mutation, chain id and nucleic acid type. | |
| 5 | mCSM-lig | Stability of protein–ligand interactions due to mutations | Model PDB file, mutation, chain id, three letter code of the ligand and ligand affinity in wild type structure in nM concentration. | |
| 6 | FoldX4 | Predict protein stability changes due to mutations. | Model PDB file, list of mutations and chain ids. | |
| 7 | MAESTRO | Predict protein stability changes due to mutations. | Model PDB file, list of mutations and chain ids. | |
| 8 | CUPSAT | Predict protein stability changes due to mutations. | Model PDB file, list of mutations and chain ids. | |
| 9 | Imutant 2.0-Struc | Predict protein stability changes due to mutations. | Model PDB file, list of mutations and chain ids. | |
| 10 | Imutant 2.0 -Seq | Predict protein stability changes due to mutations using sequence information. | RNAP sequence file in fasta format, list of mutations and chain ids. | |
| 11 | PROVEAN | Predict protein stability changes due to mutations using sequence information. | RNAP sequence file in fasta format, list of mutations and chain ids. | |
| 12 | CONSURF | To calculate evolutionary conservation score of each residue in the protein. | Model PDB file | |
| 13 | ENCoM | Conformational Changes in protein due to mutations. | Model PDB file, list of mutations and chain ids. | |
| 14 | DynaMut | Conformational Changes in protein due to mutations. | Model PDB file, list of mutations and chain ids. | |
| 15 | Arpeggio | Map interatomic interactions between wildtype and mutant amino acids and the residue environment. | Model PDB file and the residue selection in standard format. | |
| 16 | Intermezzo | Map interatomic interactions between wildtype and mutant amino acids and the residue environment. | Bernardo Ochoa Montano & Blundell TL unpublished | Model PDB file and the residue selection in standard format. |
| 17 | ANDANTE | Works along with Modeller to generate mutant models from wildtype model files. | Model PDB file, mutation and chain id | |
| 18 | Fragment Hotspot Maps | Maps regions on the surface of the protein that has high propensity for small molecule binding. | Model PDB file. |
Fig. 2[A] The β-subunit of RNAP with residues where mutations were reported experimentally from patient samples in various studies (Supplementary Table 2) (highlighted in red). [B] Each residue in the β-subunit of RNAP that is colored by the conservations scores determined by CONSURF. The residues in green are variable (conservations scores greater than 1) and are usually surface exposed. The residues in red are conserved with conservation scores less than 1 and usually form the core of the protein. The rifampin binding site is highly conserved in M. leprae. [C] The maximum destabilizing effect (predicted by mCSM) on the protein stability for any mutation at each residue position, is mapped on the structure. Red are the regions that are largely destabilized by mutations while the white regions are relatively stable with mutations. [D] The converse of B where the regions, whose stability is least impacted by mutations, are coloured in blue and we called them “Mutation CoolSpots”. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 3[A] The interfacial region of the β-subunit of RNAP highlighted in Maroon. [B]. The maximum destabilizing effect a mutation can induce on the interface stability, is predicted by mCSM-PPI and mapped on the structure. Red indicates regions that are highly destabilized by mutations (-5.108 Kcal/mol) while the blue indicates stable regions. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 4[A] Change in relative solvent accessibility for maximum destabilizing mutants in the rifampin binding pocket (mCSM). [B]. Change in depth of the highly destabilizing mutant residue in the rifampin binding pocket (mCSM).
Fig. 5[A] The change in relative side chain solvent accessibility with mutations was mapped on to the structure. Blue indicates a decrease in RSA while red indicates an increase. [B] The changes in depth with highly destabilizing mutations at each residue position was mapped on the structure. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 6[A] Stability changes in β-subunit -RNA and β-subunit- rifampin [B] interactions due to mutations in the binding sites as predicted by mCSM-NA2 and mCSM-lig. The maximum destabilizing effect a mutation can cause at each residue position in the binding site is depicted on the structure.
Detrimental mutations and their corresponding stability changes that influence holoenzyme assembly, rifampin and RNA interactions.
| Method | Wild-type residue | Residue position | Average stability effect | Maximum stabilizing effect | Mutant residue | Maximum destabilizing effect | Mutant residue |
|---|---|---|---|---|---|---|---|
| mCSM-Stability (ΔΔG in kcal/mol) | S | 437 | −0.795 | −0.072 | L | −1.701 | H |
| H | 451 | −1.214 | −0.104 | Y | −1.898 | S | |
| G | 459 | −0.713 | −0.381 | V | −1.201 | W | |
| P | 489 | −1.135 | −0.507 | R | −1.771 | G | |
| K | 884 | −1.227 | −0.190 | L | −2.298 | S | |
| H | 1035 | −0.419 | 0.600 | Y | −1.421 | G | |
| mCSM-ppi (ΔΔG in kcal/mol) | S | 437 | −0.254 | 0.395 | H | −0.820 | R |
| H | 451 | −0.652 | −0.050 | S | −1.451 | M | |
| G | 459 | −0.397 | 0.237 | H | −1.042 | R | |
| P | 489 | −0.738 | −0.138 | W | −1.372 | R | |
| K | 884 | −0.105 | 0.160 | D | −0.685 | R | |
| H | 1035 | −0.754 | 0.115 | W | −1.726 | R | |
| mCSM-NA2 (ΔΔG in kcal/mol) | S | 437 | −1.538 | 4.922 | W | −3.857 | D |
| H | 451 | −1.300 | 5.147 | W | −3.632 | D | |
| G | 459 | 2.289 | 8.556 | W | −0.221 | D | |
| P | 489 | 1.926 | 8.195 | W | −0.582 | D | |
| K | 884 | 0.221 | 6.647 | W | −2.130 | D | |
| H | 1035 | 0.847 | 7.295 | W | −1.484 | D | |
| mCSM-lig (log-affinity change) | S | 437 | −0.646 | −0.484 | L | −1.062 | R |
| H | 451 | −0.510 | −0.076 | W | −0.777 | E | |
| G | 459 | −0.981 | −0.715 | A | −1.236 | R | |
| P | 489 | −0.598 | −0.254 | L | −0.917 | R | |
| K | 884 | −0.156 | −0.368 | D | −0.925 | R | |
| H | 1035 | −0.121 | 0.097 | V | −0.501 | E | |
| SDM (ΔΔG in kcal/mol) | S | 437 | 0.087 | 2.320 | V | −1.900 | P |
| H | 451 | −0.756 | 1.290 | L | −2.800 | G | |
| G | 459 | −2.842 | −1.780 | V | −3.800 | P | |
| P | 489 | −0.432 | 1.440 | Y | −1.070 | E | |
| K | 884 | 0.108 | 1.270 | V | −1.820 | P | |
| H | 1035 | −0.200 | 0.590 | V | −1.410 | P | |
| MAESTRO (ΔΔG in kcal/mol) | S | 437 | −0.21 | −0.14 | K | 0.24 | F |
| H | 451 | −0.12 | −0.05 | G | 0.22 | R | |
| G | 459 | −0.23 | −0.17 | S | 0.33 | W | |
| P | 489 | −0.26 | −0.22 | H | 0.31 | M | |
| K | 884 | −0.20 | −0.14 | G | 0.25 | M | |
| H | 1035 | −0.27 | −0.25 | P | 0.31 | Y | |
| CUPSAT (ΔΔG in kcal/mol) | S | 437 | 2.70 | 7.98 | I | −1.12 | G |
| H | 451 | 2.01 | 6.92 | W | −3.25 | K | |
| G | 459 | −2.51 | 5.00 | K | −5.53 | C | |
| P | 489 | −2.76 | −0.84 | A | −5.47 | M | |
| K | 884 | −2.99 | 3.42 | I | −8.03 | H | |
| H | 1035 | −1.07 | 2.15 | C | −3.23 | Y | |
| Imutant 2.0 Structure (Sign of prediction) | S | 437 | 4.05 | 9.00 | A | 1.00 | F |
| H | 451 | 6.00 | 8.00 | G | 3.00 | L | |
| G | 459 | 6.63 | 9.00 | N | 3.00 | I | |
| P | 489 | 7.11 | 9.00 | G | 3.00 | L | |
| K | 884 | 6.42 | 9.00 | G | 2.00 | M | |
| H | 1035 | 4.63 | 8.00 | G | 2.00 | L | |
| PROVEAN (ΔΔG in kcal/mol) | S | 437 | −4.79 | −3.00 | A | −7.00 | W |
| H | 451 | −8.66 | −5.73 | Y | −10.37 | C | |
| G | 459 | −8.10 | −6.00 | A | −10.00 | L | |
| P | 489 | −9.04 | −7.99 | A | −10.99 | F | |
| K | 884 | −5.97 | −2.91 | R | −7.75 | C | |
| H | 1035 | −8.98 | −5.79 | Y | −10.61 | C | |
| Imutant 2.0 Sequence (Sign of prediction) | S | 437 | 4.47 | 7.00 | F | 0.00 | H |
| H | 451 | 3.21 | 7.00 | P | 0.00 | F | |
| G | 459 | 3.53 | 7.00 | H | 0.00 | A | |
| P | 489 | 6.89 | 9.00 | G | 5.00 | L | |
| K | 884 | 3.53 | 8.00 | V | 0.00 | G | |
| H | 1035 | 2.95 | 6.00 | G | 0.00 | V | |
| FOldX4 (ΔΔG in kcal/mol) | S | 437 | 2.79 | −1.44 | I | 12.39 | R |
| H | 451 | 1.78 | −0.74 | L | 4.39 | W | |
| G | 459 | 9.14 | 3.96 | A | 20.76 | H | |
| P | 489 | 3.04 | 2.11 | N | 4.79 | R | |
| K | 884 | 1.06 | −2.12 | Y | 9.77 | L | |
| H | 1035 | 0.77 | −1.47 | P | 5.69 | Y | |
| ENCoM (ΔΔSvib in kcal/mol/K) | S | 437 | −0.44 | 0.48 | G | −1.50 | W |
| H | 451 | 0.34 | 0.97 | G | −0.46 | W | |
| G | 459 | −0.91 | −0.29 | A | −1.55 | W | |
| P | 489 | −0.16 | 0.14 | G | −0.82 | F | |
| K | 884 | 0.18 | 0.96 | G | −0.60 | W | |
| H | 1035 | 0.19 | 0.73 | G | −0.26 | W | |
| DynaMut (ΔΔG in kcal/mol) | S | 437 | 2.87 | 6.99 | L | −2.08 | G |
| H | 451 | −0.74 | 2.17 | Y | −3.43 | T | |
| G | 459 | 1.93 | 3.29 | N | −0.25 | S | |
| P | 489 | 0.94 | 3.26 | F | −0.72 | S | |
| K | 884 | 0.14 | 3.69 | W | −1.87 | E | |
| H | 1035 | 0.21 | 2.38 | W | −2.29 | G | |
Fig. 7[A] Interactions of S437 with the surrounding residue environment in the wildtype and of H437 in the S437H mutant [B]. [C] Interactions of G459 with the surrounding residue environment and [D] W459 in the mutant G459W. The red dotted lines represent hydrogen bonds. Orange dotted lines represent weak hydrogen bond interactions. Ring-Ring and intergroup interactions are depicted in cyan. Aromatic interactions are represented in sky-blue and carbonyl interactions in pink dotted lines. Green dotted lines represent hydrophobic interactions. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 8[A] Interactions of P489 with the surrounding residue environment in the wildtype and of G489 in the P489G mutant [B]. [C] Interactions of H451 with the surrounding residue environment and [D] S451 in the mutant H451S.
Fig. 9[A] Interactions of K884 with the surrounding residue environment in the wildtype and of S884 in the K884S mutant [B]. [C] Interactions of H1035 with the surrounding residue environment and [D] D1035 in the mutant H1035D. The blue dotted lines represent cation-π interaction. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 10[A] The maximum destabilizing effects on the protein stability, a mutation can induce at each residue position in the flexible conformations (as predicted by ENCoM [A] and DynaMut [B]), are mapped on the structure. Regions in red represent highly destabilizing while the blue regions are relatively stable with mutations. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 11Fragment hotspots were mapped on the structure which was coloured with maximum destabilizing effects of systematic mutations at each residue positions. Blue represents regions which are least impacted by any mutations. Stable and potential small molecule binding sites “A” and “B” are depicted on the structure.