| Literature DB >> 29747478 |
Dengming Ming1, Rui Chen2, He Huang3,4.
Abstract
Optimizing amino-acid mutations in enzyme design has been a very challenging task in modern bio-industrial applications. It is well known that many successful designs often hinge on extensive correlations among mutations at different sites within the enzyme, however, the underpinning mechanism for these correlations is far from clear. Here, we present a topology-based model to quantitively characterize non-additive effects between mutations. The method is based on the molecular dynamic simulations and the amino-acid network clique analysis. It examines if the two mutation sites of a double-site mutation fall into to a 3-clique structure, and associates such topological property of mutational site spatial distribution with mutation additivity features. We analyzed 13 dual mutations of T4 phage lysozyme and found that the clique-based model successfully distinguishes highly correlated or non-additive double-site mutations from those additive ones whose component mutations have less correlation. We also applied the model to protein Eglin c whose structural topology is significantly different from that of T4 phage lysozyme, and found that the model can, to some extension, still identify non-additive mutations from additive ones. Our calculations showed that mutation non-additive effects may heavily depend on a structural topology relationship between mutation sites, which can be quantitatively determined using amino-acid network k-cliques. We also showed that double-site mutation correlations can be significantly altered by exerting a third mutation, indicating that more detailed physicochemical interactions should be considered along with the network clique-based model for better understanding of this elusive mutation-correlation principle.Entities:
Keywords: amino-acid network; free energy change; k-clique community; mutation non-additive effect; protein dynamics
Mesh:
Substances:
Year: 2018 PMID: 29747478 PMCID: PMC5983764 DOI: 10.3390/ijms19051427
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Compare the free energy changes of double-site mutations with that derived by the summation of the two corresponding single-site mutations of the T4 phage lysozyme.
Figure 2The dependence of the double-site mutation effects on the distances between the two corresponding single sites of the T4 phage lysozyme.
Typical 3-cliques found in a T4 phage lysozyme amino-acid network *.
| Clique | Nodes (Residue Index) |
|---|---|
| 1 | 67, 4, 101, 70, 7, 104, 71, 11 |
| 2 | 34, 38, 42, 25 |
| 3 | 66, 46, 50, 54, 58, 27, 62 |
| 4 | 66, 70, 31 |
| 5 | 96, 99, 100, 103, 75, 78, 88, 91 |
| 6 | 91, 126, 95 |
| 7 | |
| 8 | 101, 105, 145, 149 |
| 9 | 128, 129, |
| 10 | 138, 139, 142, 146, 149, 150 |
| 11 | 160, 148, 151 |
* The bold numbers indicate non-additive double-site mutations found in reference [9].
Figure 3The typical 3-cliques of an amino-acid network of the T4 phage lysozyme. The clique nodes are represented by α-carbon as shown in a ball, and the labels correspond to the factions listed in Table 2.
P for double-site mutations derived from the different T4 phage lysozyme models. WT stands for wild type lysozyme structure (PDB code 2LZM [14]), K16E for the mutant structure (PDB code 1L42 [15]), R154E for the mutant structure (PDB code 1L47 [15]), the four structures of K16E/R154E, S117I, N132I, S117I/N132I are homology models derived by MODELLER [16] based on the structure of wild type lysozyme with the corresponding mutations of K16E/R154E, S117I, N132I, S117I/N132I.
| Double Mutations | Models | ||||||
|---|---|---|---|---|---|---|---|
| WT | K16E | R154E | K16E/R154E | S117I | N132I | S117I/N132I | |
| (116, 119) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| (117, 132) | 0.48 | 0.45 | 0.28 | 0.56 | 0.60 | 0.39 | 0.52 |
| (119, 135) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| (128, 131) | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 |
| (135, 147) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| (16, 119) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| (16, 135) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| (16, 154) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| (38, 144) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| (41, 131) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| (85, 96) | 0.07 | 0.05 | 0.05 | 0.05 | 0.12 | 0.11 | 0 |
| (89, 96) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| (98, 152) | 0.43 | 0.40 | 0.24 | 0.03 | 0.02 | 0.56 | 0.12 |
P for double-site mutations for Eglin c protein. The simulation is based on wild type Eglin c (PDB code 1EGL [17]), and the distance is measured between the two mutation sites.
| Double Mutation | Distance (Å) | ΔΔΔG (kJ/mol) | |
|---|---|---|---|
| (18, 27) | 14.2 | −0.12 ± 0.20 | 0 |
| (18, 54) | 8.2 | −0.95 ± 0.19 | 0.11 |
| (34, 58) | 17.2 | −0.38 ± 0.13 | 0 |
The free energy changes of 13 double mutations in T4 phage lysozyme by MAESTROweb.
| Mutations | ΔΔΔG (Kcal/Mol) | ΔΔΔG (Kcal/Mol) | Cα Atom Distance (Å) |
|---|---|---|---|
| S117I/N132I | −1.5 | 1.12 | 5.95 |
| K16E/K135E | −0.2 | 0.40 | 33.69 |
| K16E/R154E | −0.2 | 0.69 | 35.33 |
| A41V/V131A | −0.11 | −0.21 | 39.20 |
| N116D/R119M | −0.1 | −0.60 | 5.09 |
| D89A/R96H | −0.1 | 0.87 | 8.35 |
| R119E/K135E | −0.06 | 0.07 | 15.06 |
| K16E/R119E | −0.06 | 0.32 | 39.63 |
| E128A/V131A | −0.01 | −1.11 | 5.38 |
| K135E/K147E | 0.2 | −0.04 | 11.58 |
| K85A/R96H | 0.2 | −0.40 | 10.73 |
| S38D/N144D | 0.3 | 0.14 | 29.07 |
| A98V/T152S | 2.7 | 1.84 | 6.35 |
The free energy changes of 13 double mutations in T4 phage lysozyme by MAESTROweb.
| Double Mutation | ΔΔΔG (Kj/Mol) | Distance (Å) |
|---|---|---|
| (18, 27) | −0.42 | 14.2 |
| (18, 54) | 0.11 | 8.2 |
| (34, 58) | 1.66 | 17.2 |
The type of interaction bonds and their energetic parameters used in defining protein amino-acid interaction network.
| Bonds | Cutoff (Å) | Energy (KJ/Mol) |
|---|---|---|
| Hydrogen bond | 3.5 | 17.0/40.0/115.0 (on distance) |
| Van der Walls | 0.5 | 6.0 |
| Disulfide bond | 2.5 | 167.0 |
| electrostatics | 4.0 | 20.0 |
| π-π stacking | 6.5 | 9.4 |
| π-cation | 5.0 | 9.6 |
Note: the distance in hydrogen bonds refers to that between the hydrogen donor and acceptor atoms. The distance of van der Waals interaction is that between the surfaces of two atoms. The distance in sulfur bonds refers to that between the two sulfur atoms. The distance used in electrostatic interaction calculations are measured between mass centers of the two oppositely charged groups. The distance in π-π stacking interaction refers to those of the geometric centers of benzene rings of the aromatic residues. The distance in a π-cation interaction is measured from the mass center of the positively charged group in a residue to that of the benzene ring in another residue. The energy of the action is averaged over the various cases of the same type of interaction, which is a rough approximation of the corresponding real interaction.