| Literature DB >> 27337695 |
Frank J Poelwijk1, Vinod Krishna1, Rama Ranganathan2.
Abstract
Entities:
Mesh:
Substances:
Year: 2016 PMID: 27337695 PMCID: PMC4919011 DOI: 10.1371/journal.pcbi.1004771
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Definitions of genotype, phenotype, and effects of mutations.
Representation of (A) single mutant, (B) double mutant, and (C) triple mutant experiments. Phenotypes are denoted by y, where g is the underlying genotype. g = {g,…,g1} with g ∈{0,1}; “0” or “1” indicates the state of the mutable site (e.g., amino acid position). The effect of a single, double, and triple mutation is given by the red arrows. Pairwise (or second-order) epistasis is defined as the differential effect of a mutation depending on the background in which it occurs; for example, in (B) it is the degree to which the effect of one mutation (e.g., y10−y00) deviates in the background of the second mutation (y11−y01). Thus, the expression for second-order epistasis is (y11−y10)−(y01−y00). The third order and higher cases are considered in the main text.
Fig 2Examples of epistasis in a PDZ domain (A) and a K+ ion channel (B).
(A) PDZ domains are small, mixed αβ proteins that bind target peptide ligand (in yellow stick bonds) in a groove formed between the β2 and α2 elements (PSD95pdz3 shown, Protein Data Bank (PDB) accession 1BE9). The study discussed in the main text and in Table 1 is focused on the epistatic interactions between three amino acid positions—two in the PDZ domain (H372 and G330) and one in the ligand (T-2) (red spheres). (B) a thermodynamic cube representing the energetics of mutations at the three positions; values are equilibrium dissociation constants (K) for the target ligand (CRIPT [58]) in μM for all eight possible combination of mutations; errors represent standard deviation. (C) structure of the homotetrameric KcsA K+ ion channel (PDB accession 1K4C), showing the four positions selected for mutation in Sadovsky and Yifrach (in red spheres, shown only for one subunit for clarity) [60]. Note that the experiments were carried out in the Shaker K+ ion channel, and the positions in Shaker numbering are given in parentheses. The positions form a network that roughly links the intracellular activation gate and the selectivity filter.
Interaction terms after applying the three different transforms to the PDZ–ligand dataset with three mutable positions: three-way mutant cycle, background-averaged epistasis, and regression (to second order).
| Genotype | Free Energy | Interaction Terms | Mutant Cycle | Background-Averaged Epistasis | Regression Terms |
|---|---|---|---|---|---|
| THG | |||||
| 000 | -8.17 (0.07) | *** | -8.17 (0.07) | -7.24 (0.03) | -7.96 (0.06) |
| 001 | -7.58 (0.09) | **1 | 0.59 (0.11) | -0.51 (0.06) | 0.17 (0.10) |
| 010 | -6.13 (0.14) | *1* | 2.05 (0.15) | 0.23 (0.06) | 1.63 (0.13) |
| 011 | -6.24 (0.07) | *11 | -0.70 (0.19) | 0.13 (0.12) | 0.13 (0.12) |
| 100 | -5.96 (0.03) | 1** | 2.22 (0.07) | -0.41 (0.06) | 1.80 (0.08) |
| 101 | -7.70 (0.11) | 1*1 | -2.33 (0.16) | -1.50 (0.12) | -1.50 (0.12) |
| 110 | -7.67 (0.09) | 11* | -3.76 (0.18) | -2.92 (0.12) | -2.92 (0.12) |
| 111 | -8.45 (0.06) | 111 | 1.67 (0.25) | 1.67 (0.25) | 0 (0.00) |
1 The three mutations are T-2F in the ligand and H372A and G330T in the protein, respectively. They are designated in this column as “THG.”
2 Free energies are in kcal/mol, with standard deviation in parentheses.
3 Interacting positions are in the same order as genotypes, e.g., “*11” indicates the epistasis between amino acid positions 372 and 330 in PSD95-PDZ3.
Standard deviations in epistatic terms are given in parentheses and calculated according to , where δs designate the error vectors and ∘ stands for the element-wise product (see also S2 Text).
Interaction terms based on the standard mutant cycle formalism () and on background-averaged epistasis () for pore-opening free energies in the Shaker K+ voltage-gated channel.
As for the PDZ domain (Table 1), background averaging modulates epistasis at each level given the existence of higher-order terms. Primary data are from [60] and [61].
| Genotype | Δ | Interaction Terms | Mutant Cycle | Background-Averaged Epistasis |
|---|---|---|---|---|
| 0000 | -1.97 (0.05) | **** | -1.97 (0.05) | -8.33 (0.05) |
| 0001 | -7.05 (0.12) | ***1 | -5.08 (0.13) | -0.64 (0.10) |
| 0010 | -13.57 (0.29) | **1* | -11.60 (0.29) | -3.52 (0.10) |
| 0011 | -9.47 (0.25) | **11 | 9.18 (0.40) | 2.97 (0.20) |
| 0100 | -7.97 (0.34) | *1** | -6.00 (0.34) | -1.09 (0.10) |
| 0101 | -8.11 (0.19) | *1*1 | 4.94 (0.41) | -1.13 (0.20) |
| 0110 | -10.01 (0.33) | *11* | 9.56 (0.56) | 1.46 (0.20) |
| 0111 | -13.50 (0.32) | *111 | -12.53 (0.73) | -3.00 (0.40) |
| 1000 | -7.04 (0.21) | 1*** | -5.07 (0.22) | 1.25 (0.10) |
| 1001 | -6.58 (0.08) | 1**1 | 5.54 (0.26) | 1.02 (0.20) |
| 1010 | -8.42 (0.13) | 1*1* | 10.22 (0.38) | 3.68 (0.20) |
| 1011 | -8.20 (0.16) | 1*11 | -9.42 (0.51) | 0.12 (0.40) |
| 1100 | -5.05 (0.12) | 11** | 7.99 (0.42) | 1.58 (0.20) |
| 1101 | -8.80 (0.09) | 11*1 | -9.15 (0.49) | 0.39 (0.40) |
| 1110 | -10.07 (0.11) | 111* | -13.20 (0.63) | -3.67 (0.40) |
| 1111 | -7.52 (0.04) | 1111 | 19.07 (0.81) | 19.07 (0.81) |
1 The four mutations are T469A, A465V, E395A, and A391V (corresponding to the bits in the first column in left-to-right order).
2 Standard deviations of epistatic terms are given in parentheses and computed according to (see S2 Text).
Mean absolute values of interaction terms for the four-mutation network in the Shaker K+ channel.
This analysis recapitulates the basic finding of Sadovsky and Yifrach [60] that these positions comprise a cooperative unit, a result that is further clarified with background averaging.
| Epistatic Order | Mutant Cycle | Background-Averaged Epistasis |
|---|---|---|
| 0 | 1.97 (0.05) | 8.33 (0.05) |
| 1 | 6.94 (0.26) | 1.63 (0.10) |
| 2 | 7.91 (0.42) | 1.98 (0.20) |
| 3 | 11.08 (0.60) | 1.79 (0.40) |
| 4 | 19.07 (0.81) | 19.07 (0.81) |
1 Order over which the absolute values of epistatic terms are averaged.
2 Errors on the mean are given in parentheses.
Fig 3Examples of matrices Z introduced to calculate the partial background-averaged epistasis for n = 3.
(A) 2 for when data for mutants up to second-order is available and (B) 1 for when only first-order mutants are available. Both matrices are self-similar, which allows their generation for arbitrary order, and are related to the logic Sierpinski triangle. For example, 2 = 1−Σ, where is the anti-diagonal identity matrix and Σ is the Sierpinski matrix (i.e., multigrade AND in Boolean logic) for three inputs.