| Literature DB >> 35070165 |
Till El Harrar1,2, Mehdi D Davari3, Karl-Erich Jaeger4,5, Ulrich Schwaneberg1,6, Holger Gohlke2,7.
Abstract
Ionic liquids (IL) and aqueous ionic liquids (aIL) are attractive (co-)solvents for green industrial processes involving biocatalysts, but often reduce enzyme activity. Experimental and computational methods are applied to predict favorable substitution sites and, most often, subsequent site-directed surface charge modifications are introduced to enhance enzyme resistance towards aIL. However, almost no studies evaluate the prediction precision with random mutagenesis or the application of simple data-driven filtering processes. Here, we systematically and rigorously evaluated the performance of 22 previously described structure-based approaches to increase enzyme resistance to aIL based on an experimental complete site-saturation mutagenesis library of Bacillus subtilis Lipase A (BsLipA) screened against four aIL. We show that, surprisingly, most of the approaches yield low gain-in-precision (GiP) values, particularly for predicting relevant positions: 14 approaches perform worse than random mutagenesis. Encouragingly, exploiting experimental information on the thermostability of BsLipA or structural weak spots of BsLipA predicted by rigidity theory yields GiP = 3.03 and 2.39 for relevant variants and GiP = 1.61 and 1.41 for relevant positions. Combining five simple-to-compute physicochemical and evolutionary properties substantially increases the precision of predicting relevant variants and positions, yielding GiP = 3.35 and 1.29. Finally, combining these properties with predictions of structural weak spots identified by rigidity theory additionally improves GiP for relevant variants up to 4-fold to ∼10 and sustains or increases GiP for relevant positions, resulting in a prediction precision of ∼90% compared to ∼9% in random mutagenesis. This combination should be applicable to other enzyme systems for guiding protein engineering approaches towards improved aIL resistance.Entities:
Keywords: Bacillus subtilis lipase A; Ionic liquids; Protein engineering; Protein stability; Site-saturation mutagenesis
Year: 2021 PMID: 35070165 PMCID: PMC8752993 DOI: 10.1016/j.csbj.2021.12.018
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Distributions of relevant variants and positions for the four aIL in the BsLipA SSM library. Data is analyzed by focussing on relevant variants (A-B) and relevant positions (C-D). (A) The average number of relevant variants per position is mapped onto the BsLipA structure with blue (red) color depicting a low (high) amount of variants per position. The catalytic site residues S77, D133, and H156 are depicted as sticks and colored in green. (B) Average number of relevant variants per position. The majority of the positions yields less than one aIL resistant variant, and few positions yield multiple (>4) aIL resistant variants. (C) Number of positions that are relevant in n = 0 to 4 aIL. Almost half of all BsLipA positions (89 positions) yield relevant variants in three or more aIL, and only ∼20% (39 positions) yield variants that are not improved in any aIL. (D) Data of (C) mapped onto the BsLipA structure with colors depicting the number of aIL (white: 0; light blue: 1; blue: 2; magenta:3; red:4). The catalytic site residues S77, D133, and H156 are depicted as sticks and colored in green. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Predictive performance of selected approaches and physicochemical and evolutionary properties to predict relevant variants and positions.[a]
[a] Substitutions to specific residues are indicated by “→” plus one-letter code; in all other cases, substitutions to all residues are performed. The results for the predicted relevant variants and positions for all evaluated approaches, properties, and the combinations of both are shown along the sequence of BsLipA (see the top for a secondary structure representation): Red bars indicate relevant positions for which relevant variants were correctly predicted. Blue bars indicate relevant positions for which no relevant variant was correctly predicted. The height of red bars represents the fraction of relevant variants among all predicted variants for the given position, thus, describing the precision of predicting relevant variants. The height of blue bars represents the fraction of (falsely) predicted relevant variants of all possible variants at this position, thus, giving an estimate of the experimental work unnecessarily spent when investigating all predicted variants. In all, high red bars and low blue bars indicate a favorable approach, and vice versa. For random mutagenesis (Rd), the graph along the BsLipA sequence represents the experimentally determined mutagenesis efficiency (i.e., the relevance) of each sequence position. Thus, blue bars represent positions not relevant in all aIL, whereas red bars represent positions relevant in at least one aIL. The height of red bars displays the average fraction of relevant variants at the respective relevant position.
[b] Numbering of evaluated approaches and properties. A = Approach, P = Properties, C = Combination of approaches and properties.
[c] Number of relevant variants vs. all considered variants.
[d] Number of relevant positions vs. all considered positions.
[e] Random mutagenesis.
[f] Averaged percentage of relevant variants compared to the whole BsLipA SSM library.
[g] Averaged percentage of relevant positions compared to the whole BsLipA SSM library.
[h] Not determined.
[i] See Section 3.4 in the Supplementary Information for an explanation of the abbreviations.
Fig. 2Overview of evaluated structure-based approaches described in the literature for improving aIL resistance. The classification of the approaches (I-VI) is described in the text. Most approaches rely on analyzing direct protein-aIL interactions (A1-A7), whereas only a few investigate subsequent effects of the aIL interactions on the protein (A8-A11).
Fig. 3Only five approaches (A1, A2, A12, A14, A15) yield a significantly improved prediction precision for relevant variants compared to random mutagenesis; only two approaches (A12, A14) yield a markedly improved prediction precision for relevant positions compared to random mutagenesis. Approaches are colored according to their classification. See Fig. 2 for the color code. GiPvar and GiPpos are shown as mean ± standard error of the mean over the four BsLipA SSM libraries. Significant differences compared to random mutagenesis (Rd) are indicated with an asterisk if p < 0.05 for each of the four BsLipA SSM libraries.