Abstract
The ease of genetic manipulation, low cost, rapid growth and number of previous studies have made Escherichia coli one of the most widely used microorganism species for producing recombinant proteins. In this post-genomic era, challenges remain to rapidly express and purify large numbers of proteins for academic and commercial purposes in a high-throughput manner. In this review, we describe several state-of-the-art approaches that are suitable for the cloning, expression and purification, conducted in parallel, of numerous molecules, and we discuss recent progress related to soluble protein expression, mRNA folding, fusion tags, post-translational modification and production of membrane proteins. Moreover, we address the ongoing efforts to overcome various challenges faced in protein expression in E. coli, which could lead to an improvement of the current system from trial and error to a predictable and rational design.
Keywords: 5′UTR and N-terminal codons; Escherichia coli; fusion tag; high-throughput; membrane protein; recombinant protein expression
Year: 2016 PMID: 27581654 PMCID: PMC5008019 DOI: 10.1098/rsob.160196
Source DB: PubMed Journal: Open Biol ISSN: 2046-2441 Impact factor: 6.411
Figure 1. Three strategies for preparing target genes. (a) Target genes can be obtained from a cDNA library after reverse transcription. (b) PCR can be used to amplify genes from a cDNA library or genomic DNA. (c) Array-based gene synthesis through the assembly of short oligos can be used to produce customized genes.
Figure 2. Schematic diagrams and principles of the construction of recombinant expression vectors. Target genes featuring two adapters are obtained from PCR or gene synthesis. (a) Construction of expression vectors using restriction enzymes and ligases. The vector and the target genes, both harbouring restriction sites, are digested using two rare-cutting enzymes, SgfI and PmeI. The linearized expression vector and inserts are ligated using T4 ligase to create the construct. (b) Construction of expression clones using recombination-based methods. The target genes are flanked by 15–25 bp recombination sites. Recombinase-mediated recombination between the homologous sites present in the insert and vector generates the final vector. (c) Construction of expression clones using ligation-independent cloning (LIC) methods. Linearized vectors and target genes containing complementary 5′-tails are digested using enzymes possessing exonuclease activity in order to increase the proportion of recessed ends. The resulting overhangs anneal and are ligated in vivo after transformation into E. coli.
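The restriction-based route in panel (a) only works if the insert itself carries no internal SgfI or PmeI site. A minimal sketch of that pre-check follows; the function name and the example sequences are hypothetical, and the recognition sequences used (GCGATCGC for SgfI, GTTTAAAC for PmeI) are the standard ones for these enzymes rather than something stated in the caption.

```python
# Recognition sequences of the two rare-cutting enzymes named in Figure 2(a).
# Both sites are palindromic, so scanning one strand is sufficient.
RARE_CUTTER_SITES = {
    "SgfI": "GCGATCGC",
    "PmeI": "GTTTAAAC",
}


def find_internal_sites(insert_seq):
    """Return {enzyme: [0-based positions]} for every site found in the insert.

    Hypothetical helper for illustration; an empty dict means the insert can
    be cut out with SgfI/PmeI without being fragmented.
    """
    seq = insert_seq.upper()
    hits = {}
    for enzyme, site in RARE_CUTTER_SITES.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits


# Made-up example inserts (not real genes).
orf_ok = "ATGGCTAGCAAAGGAGAAGAACTTTTCACTGGAGTTGTCTAA"
orf_bad = "ATGGCTGTTTAAACGGAGAAGAACTTGCGATCGCTAA"

print(find_internal_sites(orf_ok))
print(find_internal_sites(orf_bad))
```

Inserts that return a non-empty result would need silent mutations at the reported positions, or one of the recombination-based or LIC routes in panels (b) and (c).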
Figure 3. Basic expression vectors for high-throughput expression in E. coli of (a) cytoplasmic proteins and (b) membrane proteins. The T7 promoter is used to control expression of the protein in E. coli. The high-throughput assay requires tandem affinity tags: a larger tag for initiation of protein expression, protein solubility and detection of soluble protein, and a smaller tag for purification. TEV protease can be used to remove the tags. The tags for membrane proteins are located at the C-terminus to allow protein targeting, and GFP is a favourable choice for use as an indicator of protein folding. D tag, detection tag; P tag, purification tag; S tag, solubility and translation initiation tag; TT, transcriptional terminator; 5′UTR, 5′ untranslated region.
Figure 4. Escherichia coli strains for protein expression. (a) Escherichia coli strains widely used in recombinant protein production. In the expression vector, the target gene is under control of the T7 promoter. In the E. coli genome, the gene encoding T7 RNA polymerase is under control of the lacUV5 promoter. The strain BL21(DE3) is deficient in OmpT and Lon proteases. BL21STAR(DE3) is mutated in RNase E, reducing mRNA degradation. BL21trxB promotes the formation of disulfide bonds. In BL21pLysS(DE3), T7 lysozyme is expressed, and the enzyme inactivates any T7 RNA polymerase that may be produced without induction. Rosetta strains are designed to improve the expression of proteins encoded by genes containing codons that are rarely used in E. coli. (b) Strategy for expressing a protein with post-translational modification in E. coli. Genes encoding kinases, glycosyltransferases, methylases, ligases or other modifying enzymes are coexpressed in order to produce post-translationally modified proteins. (c) Overview of E. coli strains used in membrane protein production. Walker strains (C41(DE3) and C43(DE3)) are commonly used to overcome the toxicity of membrane proteins. In Lemo21(DE3), expression can be tuned by adding different concentrations of L-rhamnose to the culture. Coexpression of membrane protein biogenesis factors may also facilitate the localization of target proteins. lysY, lysozyme; RNAP, RNA polymerase; tRS, tRNA synthetase.
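The strain features listed in the Figure 4 caption can be read as a simple lookup from an observed expression problem to a candidate strain. The sketch below encodes only what the caption states; the problem labels and the function name are hypothetical, and the mapping is a starting point for screening rather than a validated decision rule.

```python
# Problem-to-strain hints taken from the Figure 4 caption.
# The dictionary keys are hypothetical labels chosen for this sketch.
STRAIN_HINTS = {
    "mrna_degradation": "BL21STAR(DE3)",       # mutated in RNase E
    "disulfide_bonds": "BL21trxB",              # promotes disulfide formation
    "leaky_expression": "BL21pLysS(DE3)",       # T7 lysozyme inactivates T7 RNAP
    "rare_codons": "Rosetta",                   # rare-codon genes
    "toxic_membrane_protein": "C41(DE3)/C43(DE3)",  # Walker strains
    "needs_tunable_expression": "Lemo21(DE3)",  # tuned with L-rhamnose
}


def suggest_strains(problems):
    """Return candidate strains for the given problems, in input order.

    Unknown labels are ignored; with no recognised problem the general
    purpose strain BL21(DE3) is returned.
    """
    suggestions = [STRAIN_HINTS[p] for p in problems if p in STRAIN_HINTS]
    return suggestions or ["BL21(DE3)"]


print(suggest_strains([]))
print(suggest_strains(["rare_codons", "needs_tunable_expression"]))
```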
Main characteristics of commonly used fusion tags for high-throughput protein production.
| tag | length in residues (size in kDa) | matrix/elution | typical uses | comments | references |
|---|---|---|---|---|---|
| His-tag | 2–10, typically 6 (0.84) | divalent metal ion (Ni, Co, Cu, Zn)/imidazole or low pH | purification and detection | most common purification tag; denaturing purification possible; rarely affects the structure or function of fusion proteins; an anti-His antibody can be used for detection | [ |
| FLAG | 8 (1.0) | FLAG antibody/low pH, EDTA or FLAG peptide | purification and detection | small size and high solubility; the presence of an internal enterokinase cleavage site; very expensive resins with limited re-use cycles | [ |
| Strep-II | 8 (1.1) | Strep-Tactin/biotin or desthiobiotin | purification and detection | short, biologically inert and proteolytically stable; does not interfere with membrane translocation or protein folding | [ |
| Fh8 | 69 (8.0) | Ca2+-dependent hydrophobic interaction/EDTA | purification, increased solubility and expression | relatively low molecular weight; combines enhanced protein solubility with purification | [ |
| Trx | 109 (11.7) | phenylarsine oxide/thiol-containing reducing agents | purification and increased solubility | one of the best N-terminal protein fusions to promote soluble expression; purification must be conducted in the absence of thiol-containing reducing agents until the elution step; large tag or elution conditions may affect properties of the fusion protein | [ |
| SUMO | 100 (12.0) | an affinity tag must be added (typically His-tag) | increased solubility and expression | has all the advantages of Trx; SUMO protease efficiently cleaves the tag; enhances membrane protein expression | [ |
| GST | 211 (26.0) | glutathione/reduced glutathione | purification, detection and increased expression and solubility | very common purification tag; one-step purification of relatively pure protein; denaturing purification impossible | [ |
| GFP | 238 (26.9) | | detection, increased solubility and expression | native detection of protein solubility and expression without antibody, particularly for membrane proteins | [ |
| HaloTag | 312 (34.0) | Chloroalkane/HaloTag buffer and TEV protease | purification, increased solubility and expression | allow for | [ |
| MBP | 396 (42.0) | cross-linked amylose/maltose | purification, detection, increased expression and solubility | can alleviate toxicity of fusion proteins; the target protein is prone to aggregation after removing tag; the large tag size may affect fusion protein properties and cause immunogenicity | [ |
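The size column can be sanity-checked from residue masses. As a worked example for the first row, the sketch below estimates the mass of a His-tag of a given length; the average histidine residue mass (137.14 Da) and the water term (18.02 Da) are standard values assumed here, not figures taken from the table, and the function name is hypothetical.

```python
HIS_RESIDUE_DA = 137.14  # average mass of one histidine residue (assumed standard value)
WATER_DA = 18.02         # one water, accounting for the free N- and C-termini


def his_tag_mass_kda(n_his=6):
    """Approximate mass in kDa of a free poly-histidine peptide of n_his residues."""
    return (n_his * HIS_RESIDUE_DA + WATER_DA) / 1000.0


print(round(his_tag_mass_kda(6), 2))   # 0.84, matching the table entry for a 6-residue tag
print(round(his_tag_mass_kda(10), 2))  # 1.39 for the longest tag in the 2-10 range
```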
Main characteristics of commonly used expression strains for high-throughput protein production.
| strains | genotype | features | references |
|---|---|---|---|
| BL21(DE3) | F− ompT hsdSB(rB− mB−) gal dcm (DE3) | the most common protein expression strain; leaky expression can lead to uninduced expression of potentially toxic proteins | [ |
| BL21Star(DE3) | F− ompT hsdSB(rB− mB−) gal dcm | mRNA levels and RNA stability are increased in the strain; thus, protein expression may be increased | [ |
| Origami(DE3) | F− ompT hsdSB(rB− mB−) gal dcm | the | [ |
| BL21(DE3)pLysS | F− ompT hsdSB(rB− mB−) gal dcm (DE3) [pLysS Camr] | the pLysS plasmid produces T7 lysozyme to reduce basal level expression, which is suitable for expression of toxic genes | [ |
| BL21-CodonPlus(DE3)-RIPL | F− ompT hsdSB(rB− mB−) gal dcm (DE3) | the CodonPlus strains provide additional copies of rare tRNA genes; the RIPL strain carries tRNA genes for Arg (AGA and AGG), Ile (AUA), Pro (CCC) and Leu (CUA) | [ |
| Rosetta(DE3) | F− ompT hsdSB(rB− mB−) gal dcm (DE3) [pRARE Camr] | the Rosetta strains enhance the expression of proteins that contain codons rarely used in | [ |
| C41(DE3)/C43(DE3) | selected mutants from BL21(DE3) | the strains harbour mutations in | [ |
| Lemo21(DE3) | F− ompT hsdSB(rB− mB−) gal dcm (DE3) [pLemo Camr] | the strain allows for tunable expression of difficult clones; for difficult soluble proteins, tuning the expression level may also result in more soluble, properly folded protein | [ |
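Whether a rare-codon strain is worth trying depends on how often the target gene uses the codons it supplements. A minimal sketch, assuming an in-frame coding sequence and using only the five codons listed for the RIPL strain in the table above (written as DNA, so AUA becomes ATA and CUA becomes CTA); the function name and the example sequence are hypothetical.

```python
# Codons supplemented by the RIPL strain, as listed in the strain table,
# written as DNA triplets.
RIPL_RARE_CODONS = {
    "AGA": "Arg",
    "AGG": "Arg",
    "ATA": "Ile",
    "CCC": "Pro",
    "CTA": "Leu",
}


def count_rare_codons(cds):
    """Count RIPL-listed rare codons in an in-frame coding sequence.

    Accepts DNA or RNA letters; trailing bases that do not complete a
    codon are ignored.
    """
    seq = cds.upper().replace("U", "T")
    counts = {}
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if codon in RIPL_RARE_CODONS:
            counts[codon] = counts.get(codon, 0) + 1
    return counts


# Made-up sequence: ATG AGA AGG ATA CCC CTA AGA TAA
example_cds = "ATGAGAAGGATACCCCTAAGATAA"
print(count_rare_codons(example_cds))
```

Only in-frame triplets are counted, so a rare triplet that spans two codons does not inflate the result.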