| Literature DB >> 27563054 |
Arti Tripathi1, Kritika Gupta1, Shruti Khare1, Pankaj C Jain1, Siddharth Patel1, Prasanth Kumar1, Ajai J Pulianmackal1, Nilesh Aghera1, Raghavan Varadarajan2.
Abstract
Understanding how mutations affect protein activity and organismal fitness is a major challenge. We used saturation mutagenesis combined with deep sequencing to determine mutational sensitivity scores for 1,664 single-site mutants of the 101 residue Escherichia coli cytotoxin, CcdB at seven different expression levels. Active-site residues could be distinguished from buried ones, based on their differential tolerance to aliphatic and charged amino acid substitutions. At nonactive-site positions, the average mutational tolerance correlated better with depth from the protein surface than with accessibility. Remarkably, similar results were observed for two other small proteins, PDZ domain (PSD95pdz3) and IgG-binding domain of protein G (GB1). Mutational sensitivity data obtained with CcdB were used to derive a procedure for predicting functional effects of mutations. Results compared favorably with those of two widely used computational predictors. In vitro characterization of 80 single, nonactive-site mutants of CcdB showed that activity in vivo correlates moderately with thermal stability and solubility. The inability to refold reversibly, as well as a decreased folding rate in vitro, is associated with decreased activity in vivo. Upon probing the effect of modulating expression of various proteases and chaperones on mutant phenotypes, most deleterious mutants showed an increased in vivo activity and solubility only upon over-expression of either Trigger factor or SecB ATP-independent chaperones. Collectively, these data suggest that folding kinetics rather than protein stability is the primary determinant of activity in vivo This study enhances our understanding of how mutations affect phenotype, as well as the ability to predict fitness effects of point mutations.Entities:
Keywords: deep sequencing; fitness effect prediction; mutagenesis; protein folding
Mesh:
Substances:
Year: 2016 PMID: 27563054 PMCID: PMC5062330 DOI: 10.1093/molbev/msw182
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1Mutational effects on CcdB protein activity inferred from phenotypic screening and deep sequencing. (A), (B) and (C) show the MSseq values for representative exposed-site (accessibility >5%), all active-site and buried-site residues (accessibility ≤5%), respectively. On the vertical axis, residues are grouped into (G, P), aliphatic (A–M), aromatic (F–W), polar (S–Q) and charged (D–R) amino acids. Residue numbers and substitutions are indicated on the horizontal and vertical axes, respectively. Each heatmap is colored according to the MSseq value of the mutant. Green to red color gradation represents increasing MSseq values. Zero value (light green) indicates that the corresponding mutant was not observed in the library. WT residue at each position is indicated in white. Data for only representative residue positions are shown for clarity. (D) Active-site residues (highlighted in cyan) identified from the mutational phenotypes mapped onto the crystal structure of CcdB (PDB ID 3VUB).
Mutational Tolerance at the Buried-Site Residues at Lowest and Highest Expression Levels.
| Amino acid | No. of mutants | Depth (Å) | |||
|---|---|---|---|---|---|
| V05 | 18 | 6.8 | 0 | 39 | 94 |
| F17 | 17 | 7.3 | 0.2 | 82 | 100 |
| V18 | 18 | 9.3 | 0 | 33 | 83 |
| D19 | 18 | 6.7 | 1.4 | 83 | 100 |
| V20 | 19 | 8.6 | 0 | 32 | 74 |
| Q21 | 19 | 6.5 | 1 | 63 | 100 |
| M32 | 17 | 7.8 | 0.3 | 76 | 100 |
| V33 | 19 | 6.5 | 1.4 | 68 | 95 |
| I34 | 19 | 7.9 | 0 | 37 | 79 |
| L36 | 12 | 7.2 | 0 | 0 | 67 |
| P52 | 17 | 5.4 | 3.5 | 41 | 100 |
| V54 | 15 | 5.6 | 0.4 | 73 | 100 |
| M63 | 19 | 8.1 | 0.1 | 47 | 89 |
| T65 | 9 | 7.9 | 0 | 44 | 100 |
| M68 | 12 | 6.6 | 0 | 33 | 100 |
| L83 | 19 | 5.8 | 1.5 | 53 | 100 |
| I90 | 19 | 7.4 | 0.1 | 26 | 89 |
| A93 | 14 | 6.0 | 0 | 36 | 100 |
| I94 | 18 | 7.9 | 0.6 | 33 | 83 |
| M97 | 16 | 7.5 | 0 | 56 | 94 |
| F98 | 19 | 7.7 | 0.7 | 37 | 79 |
aSide-chain accessibility.
bMutational tolerance at the lowest (MID 2) and highest (MID 8) expression levels.
cResidues within van der Waals distance of the active-site residues.
dResidues present at dimer interface.
Fig. 2Relative tolerance for substitutions at buried positions. (A) Mutational sensitivity data at all buried positions, obtained at different expression levels for CcdB was used to obtain the distribution of MSseq values for a given mutant residue. The distributions for row and column residues were compared using a Wilcoxon signed-rank test and the corresponding P values were calculated. A log10 of the P values is indicated. Gradation from red to blue indicates increasing values log10 P, i.e., decreasing destabilizing effect of the row residue w.r.t. column residue. A lower P value implies that introduction of the row residue at a buried site is typically more destabilizing than introduction of the corresponding column residue. (B and C) Similar plot, but using values derived from saturation mutagenesis of the PDZ domain (PSD95pdz3) and lnW values from saturation mutagenesis of IgG Binding domain of protein G (GB1), respectively. (D–F) Correlation of the average MSseq values, values and lnW values with side-chain depth for all nonactive-site residues of CcdB, PSD95pdz3 and GB1, respectively. Accessibility and depth values were calculated based on the crystal structure of WT homodimeric CcdB (PDB ID 3VUB), PSD95pdz3 (PDB ID 1BE9) and GB1 (PDB ID 1PGA). A residue was defined as buried if the side-chain accessibility is ≤5%.
Mutant Phenotype Prediction by MSpred, SNAP2 and SuSPect.
| Protein | Prediction method | |||||
|---|---|---|---|---|---|---|
| CcdB | MSpred | 0.69 | 0.65 | 69 | 95 | 90 |
| SNAP2 | 0.27 | 0.19 | 100 | 11 | 37 | |
| SuSPect | 0.29 | 0.14 | 100 | 8 | 30 | |
| PSD95pdz3 | MSpred | 0.57 | 0.53 | 61 | 93 | 88 |
| SNAP2 | 0.24 | 0.15 | 100 | 7 | 34 | |
| SuSPect | 0.6 | 0.61 | 87 | 87 | 87 | |
| GB1 | MSpred | 0.65 | 0.49 | 44 | 96 | 79 |
| SNAP2 | 0.27 | 0.11 | 100 | 3 | 42 | |
| SuSPect | 0.08 | −0.03 | 73 | 24 | 38 |
aModulus of the correlation coefficient.
bMathews correlation coefficient = , where TP, TN, FP, FN are True Positives, True Negatives, False Positives and False Negatives, respectively.
cSensitivity = .
dSpecificity = .
eAccuracy = .
fMutant was classified as nonneutral if MSpred > 2 and neutral if the score = 2. Mutants were classified into true positives (TP), true negatives (TN), false positives (FP) and false negatives (FN).
gMutant was classified as nonneutral if SNAP2 score > 50 and neutral if the score < −50. −50 < Score < 50: low reliability predictions and were omitted. Mutants were classified into TP, TN, FP and FN.
hMutant was classified as nonneutral if SuSPect score > 75 and neutral if the score < 25. 25 < Score < 75: low reliability predictions and were omitted. Mutants were classified into TP, TN, FP and FN.
Fig. 3Correlation between apparent in vitro Tm, in vivo solubility and activity (MSseq value) for CcdB mutants. Correlations of ΔTm [Tm (WT)−Tm (Mutant)] for 67 single-site mutants with (A) in vivo activity and (B) in vivo fraction of soluble protein, respectively. (C) Correlation of relative thermal stability (ΔTm) of mutants with ΔΔGo of unfolding estimated by GdnHCl denaturation. (D) Correlation of fraction of protein in the soluble fraction with in vivo activity of mutants.
Kinetic Parameters for In Vitro Refolding and Unfolding of Selected, Moderately Stable CcdB Mutants.
| Fraction soluble | MSseq | CcdA binding to refolded protein (TSA) | Gyrase binding to refolded protein (BLI) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| a0 | a1 | a2 | A0 | A1 | |||||||||
| L16S | 0.4 | 2 | 16.7 | 0.04 | 0.72 | 0.07 | 0.24 | 0.02 | 0.83 | 0.17 | 0.06 | ++ ++ | +++ |
| V18T | 0.7 | 2 | 9 | 0.04 | 0.7 | 0.1 | 0.26 | 0.02 | 0.8 | 0.2 | 0.16 | ++ ++ | ++ ++ |
| R31G | 0.6 | 6 | 11 | 0.05 | 0.8 | 0.2 | 0.15 | 0.02 | 0.85 | 0.15 | 0.02 | – | – |
| V54E | 0.4 | 2 | 14.5 | 0.14 | 0.17 | 0.28 | 0.68 | 0.04 | 1 | – | – | – | – |
| M63N | 0.2 | 6 | 15.2 | 0.15 | – | – | 0.85 | 0.08 | 0.84 | 0.16 | 0.07 | – | – |
| V80N | 0.8 | 6 | 17.5 | 0.8 | – | > 0.5 | 0.2 | 0.04 | 0.76 | 0.24 | 0.07 | + | + |
| WT | 1 | 2 | – | 0.84 | – | > 0.5 | 0.16 | 0.046 | 0.62 | 0.38 | 0.04 | ++ ++ | ++ ++ |
aThe mutants chosen for refolding studies have similar stability and different solubility and activity (MSseq). Four other selected mutants, could not be used for refolding studies due to very low solubility and high protein precipitation under the given reaction conditions. These had MSseq values of 2, 2, 9 and 6, respectively.
bThe traces were fit to a 5-parameter equation for exponential decay for refolding (f = y0 + a×e(−bx)+c×e(−dx)), yielding fast (k1) and slow phase rate constants (k2), with associated amplitudes a1 and a2, respectively, and to a 3-parameter exponential rise for unfolding (f = y0 + a×e(b×x)) yielding the rate constant k1 with associated amplitude change, A1. a0 and A0 are the amplitudes for the burst phase for refolding and unfolding, respectively. Errors for all the observed parameters were ≤ 10% of the measured experimental value.
cNo observable binding.
Fig. 4In vivo activity and solubility of CcdB mutants, in presence and absence of ATP-independent chaperones. (A) The activity of the selected mutants was monitored in chaperone deleted (BWΔtig and BWΔsecB) as well as in chaperone over-expression strains (BWpTig and BWpSecB) under seven different repressing or activating conditions for the expression of mutants and the condition where growth ceased was reported as the active condition. (B and C) The fraction of protein for cells grown at 37 °C and induced for CcdB with 0.2% arabinose, in both supernatant (soluble) and pellet (insoluble), with or without over-expression of chaperones, Trigger Factor and SecB, respectively, determined following SDS–PAGE and Coomasie staining using Quantity One software (Bio-Rad). S and P are supernatant and pellet, respectively. Data for representative mutants is shown. The relative estimates of protein present in the soluble fraction and inclusion bodies for all mutants are shown in table 4. The arrow indicates the band for the induced chaperone.
In Vivo Activity and Solubility of CcdB Mutants in Presence and Absence of ATP-Independent Chaperones.
| Mutant | Fraction soluble | |||||||
|---|---|---|---|---|---|---|---|---|
| BW25113 | BWΔ | BWΔ | BWpTig | BWpSecB | Tig | SecB | ||
| WT | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| L16S | 4 | 7 | 7 | 2 | 3 | 0.4 | 1.5 | 1.7 |
| G29W | 8 | 8 | 8 | 2 | 4 | 0.6 | 1.2 | 1.1 |
| M32N | 4 | 6 | 6 | 2 | 3 | 0.1 | 3 | 2 |
| V33K | 4 | 7 | 6 | 2 | 3 | 0.1 | 2 | 2 |
| P35I | 8 | 7 | 8 | 5 | 5 | 0.6 | 1.3 | 1.1 |
| L36K | 8 | 8 | 8 | 3 | 5 | 0.05 | 4 | 2 |
| L41F | 7 | 8 | 8 | 3 | 3 | 0.4 | 1.8 | 2.5 |
| D67P | 6 | 8 | 8 | 2 | 4 | 0.2 | 0.8 | 0.5 |
| S70W | 6 | 8 | 8 | 2 | 4 | 0.5 | 1 | 0.4 |
| V73F | 7 | 7 | 8 | 2 | 3 | 0.5 | 2 | 1.3 |
| V80N | 6 | 6 | 7 | 3 | 4 | 0.6 | 1.3 | 1.2 |
aRatio of the soluble fraction of the protein in the presence of over-expressed chaperone (Trigger Factor and SecB, respectively) to the soluble fraction of the protein under normal conditions.