| Literature DB >> 28842596 |
Daisuke Matsui1,2, Shogo Nakano1,2,3, Mohammad Dadashipour1,2, Yasuhisa Asano4,5.
Abstract
Insolubility of proteins expressed in the Escherichia coli expression system hinders the progress of both basic and applied research. Insoluble proteins contain residues that decrease their solubility (aggregation hotspots). Mutating these hotspots to optimal amino acids is expected to improve protein solubility. To date, however, the identification of these hotspots has proven difficult. In this study, using a combination of approaches involving directed evolution and primary sequence analysis, we found two rules to help inductively identify hotspots: the α-helix rule, which focuses on the hydrophobicity of amino acids in the α-helix structure, and the hydropathy contradiction rule, which focuses on the difference in hydrophobicity relative to the corresponding amino acid in the consensus protein. By properly applying these two rules, we succeeded in improving the probability that expressed proteins would be soluble. Our methods should facilitate research on various insoluble proteins that were previously difficult to study due to their low solubility.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28842596 PMCID: PMC5573320 DOI: 10.1038/s41598-017-09749-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Helical wheel depictions for α-helix regions of four enzymes that contributed to improving the protein solubility, and saturation mutagenesis at aggregation hotspots on α-helices of ChMOX and AtADC identified by directed evolution analysis. Helical wheels depict the following α-helix regions: residues 443–460 (RVDIDTMVRGVHVALNFG) of ChMOX (A) and residues 430–446 (RESCLLYVDQLKQRCVE) of AtADC (D). Hydrophobic and hydrophilic residues are shown in white and black letters, respectively. The mutation sites for saturation mutagenesis are represented by underlined residue numbers. An asterisk (*) adjacent to the sequence number indicates the first residue of an α-helix. The enzyme activity of the saturated mutants was measured for the following four residues: V455X (B) and V444X (C) for ChMOX and K441X (E) and L435X (F) for AtADC. The residues with low and high hydropathy indices are hydrophilic and hydrophobic, respectively. The dagger (†) indicates WT ChMOX (B,C) and AtADC (E,F). Trp and Pro are not shown in these figures because the side chains of these residues exhibit hydrophobic character despite being classified as hydrophilic groups in the hydropathy index.
Figure 2Expression levels of WT and variants of ChMOX. Protein levels were determined by Western blotting. White bar, amounts of soluble enzymes; gray bar, amounts of total enzymes (soluble and insoluble); open circle, ratio (%, soluble/total protein).
Figure 3Schematic model for selection of candidate residues for aggregation hotspots. Schematic model based on the hydropathy contradiction rule and the definition of the HiSol score (upper) and the α-helix rule (lower). Hydrophobic and hydrophilic residues are shown as orange and blue filled circles, respectively.
HiSol scores and conserved residues corresponding to mutated residues in MeHNL, SuPDH, ChMOX, AtADC, and MpLUC.
| Enzyme | Target residue | How residues were identified | HiSol score (Ranking)a | Conserved residuesb | Positionc | Result |
|---|---|---|---|---|---|---|
|
| Lys148 | Hydropathy contradiction rule d | −2.748 (1st score) | Ile (64%), Val (21%), Leu (9%) | Coil | Negative |
| Gln225 | −2.232 (2nd score) | Val (73%) | Helix | Positive | ||
| Lys374 | −2.023 (3rd score) | Ile (53%) | Coil | Negative | ||
| Gln337 | −1.949 (4th score) | Ile (72%) | Helix | Positive | ||
| Gln243 | −1.807 (5th score) | Val (62%) | Coil | Negative | ||
|
| Leu435 | α-helix rulee |
| His (31%) | Helix | Positive |
| Lys441 |
| Leu (21%), Lys (11%) | Helix | False negative | ||
|
| His103 | Hydropathy contradiction ruled | −2.283 (1st score) | Leu (47%), Val (30%) | Sheet | False negative |
|
| Val444 | α-helix rulee,f |
| Glu (9%) | Helix | False negative |
| Val455 |
| Leu (42%), Ile (18%) | Helix | False negative | ||
|
| Lys117 | α-helix rulee |
| Leu (30.5%), Ala (22.5%), Tyr (13.3%) | Helix | Positive |
|
| Val174 | α-helix rulee |
| Asp (24.3%), Glu (15.8%), | Helix | Positive |
|
| Ile80 | Hydropathy contradiction rule on α-helixg | 1.919 (2nd score) | Lys (53%), Ile (17%) | Helix | Positive |
| Arg87 | −1.293 (10nd score) | Arg (41%), Lys (20%), Val (11%), Phe (11%) | Helix | Negative | ||
| Ala177 | 1.361 (7th score) | Asp (57%), Ala (20%), Glu (13%) | Helix | Positive | ||
|
| Leu114 | Hydropathy contradiction rule on α-helixg | 2.042 (1st score) | Lys (41.4%) | Helix | Positive |
| Arg95 | −1.473 (2nd score) | Ser (41.1%), Val (16.9%) | Helix | Positive | ||
| Leu46 | 1.463 (3rd score) | Lys (12.2%), Phe (10.7%) | Helix | Positive | ||
| Leu88 | 1.369 (4th score) | Leu (33.2%), Glu (19.2%), Asn (16.0%) | Helix | Negative | ||
| Phe55 | 1.319 (5th score) | His (30.6%), Phe (20.2%), Asn (14.9%) | Helix | Positive | ||
| Leu82 | 1.231 (6th score) | Leu (23.8%), Arg(19.8%), Val (10.0%) | Helix | Negative | ||
| Val97 | 1.191 (7th score) | Val (28.3%), Glu (16.2%) | Helix | Positive |
aRanking of HiSol score is represented for the predicted hotspots. Here, the score bearing negative value would be ranked after lining the scores with descending order, and vice versa for the score bearing positive value.
bRatio of conservation > 10%.
cPosition in secondary structure was predicted by PSIPRED.
d“Hydropathy contradiction rule” means that the hotspots were predicted based on the analysis of the HiSol scores.e“α-helix rule” means that the hotspots were predicted from the α-helix rule according to the hydropathy index of the residues. The HiSol scores were not used for the prediction, but are shown in italic as references. Thus, the ranking of the HiSol scores are not always high, but distributed sparse (from 3rd to even 106th).fThese are good examples showing the merit of theα-helix rule: even residues with low appearance rates and lower HiSol scores were chosen as hotspots (V444E with 10th HiSol score and only 9% appearance rate, and Val455 with 106th HiSol score in ChMOX). g“Hydropathy contradiction rule on α-helix” means that hotspot residues were first selected based on HiSol scores, and then the residues located on the α-helices were selected.
Figure 4Relative activities of the WT and variants of SuPDH (A) and MpLUC (B). Enzymatic activities of SuPDH WT and its variants (A). White bar, induction at 30 °C; gray bar, induction at 37 °C. Enzyme activities of MpLUC WT and its variants (B). Confirmation of expression levels of WT and variants of hGH in E. coli (C). Protein levels were determined by Western blotting.
Comparison of the activities of WT and variants of ChMOX, AtADC, and MpLUC.
|
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| WT | Refolded WT | V444Ha | V455Qa | WT | L435Ha | K441La | WT | I80Ka | A177Da | |
| Total activity (U/mL, or RLUb) | ND | 0.13 | 0.61 | 0.63 | ND | 0.039 | 0.043 | 1.1 × 108 | 1.2 × 109 | 2.8 × 109 |
| Soluble protein (mg/mL) | ND | 0.01 | 0.05 | 0.045 | ND | 0.021 | 0.021 | 0.001 | 0.01 | 0.02 |
| Specific activity (U/mg, or RLU/mgc) | ND | 13 | 12.2 | 14 | ND | 1.9 | 2.0 | 1.1 × 1011 | 1.2 × 1011 | 1.4 × 1011 |
aHighly soluble.
bU/mL or RLU of cell-free extract prepared from a 1-mL LB culture in triplicate; ND, not determined.
cSpecific activity (U/mg) of these enzymes compared to their purified forms.
Figure 5Schematic of inference and validation of the α-helix rule and hydropathy contradiction rule. For the α-helix rule, we made inferences based on positive variants of four enzymes obtained by random mutagenesis screening. The rule was inferred by saturating mutagenesis at these sites in ChMOX (Val455) and AtADC (Leu435). On the other hand, for the hydropathy contradiction rule, we made inferences based on sequence analysis of MeHNL. The rule was validated by identifying five candidate hotspots in SuPDH. Finally, we evaluated the effectiveness of the rules by applying the method to MpLUC and hGH.