| Literature DB >> 20814932 |
Dana M Francis1, Rebecca Page.
Abstract
Recombinant protein expression in Escherichia coli (E. coli) is simple, fast, inexpensive, and robust, with the expressed protein comprising up to 50 percent of the total cellular protein. However, it also has disadvantages. For example, the rapidity of bacterial protein expression often results in unfolded/misfolded proteins, especially for heterologous proteins that require longer times and/or molecular chaperones to fold correctly. In addition, the highly reductive environment of the bacterial cytosol and the inability of E. coli to perform several eukaryotic post-translational modifications results in the insoluble expression of proteins that require these modifications for folding and activity. Fortunately, multiple, novel reagents and techniques have been developed that allow for the efficient, soluble production of a diverse range of heterologous proteins in E. coli. This overview describes variables at each stage of a protein expression experiment that can influence solubility and offers a summary of strategies used to optimize soluble expression in E. coli. (c) 2010 by John Wiley & Sons, Inc.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20814932 PMCID: PMC7162232 DOI: 10.1002/0471140864.ps0524s61
Source DB: PubMed Journal: Curr Protoc Protein Sci ISSN: 1934-3655
Comparison of Recombinant Protein Expression Systems
|
|
Yeast |
Insect cells |
Mammalian cells |
Cell‐free |
Cell‐free | |
|---|---|---|---|---|---|---|
|
Average time of cell division |
30 min |
90 min |
18 hr |
24 hr |
N/A |
N/A |
|
Cost of expression |
Low |
Low |
High |
High |
High |
High |
|
Expression level |
High |
Low‐High |
Low‐High |
Low‐Moderate |
Low‐High |
Low‐High |
|
Success rate (% soluble) |
40‐60 |
50‐70 |
50‐70 |
80‐95 |
Variable |
Variable |
|
Advantages |
Simple, low cost, rapid, robust, high yield, easy labeling for structural studies |
Simple, low cost |
Post‐translational modifications |
Natural protein configuration, post‐translational modifications |
High yield, fast, flexible, disulfide‐bonded and membrane proteins, easy labeling for structural studies |
Fast, flexible, disulfide‐bonded and membrane proteins, post‐translational modifications |
|
Disadvantages |
No post‐translational modifications, insoluble protein, production of disulfide‐bonded and membrane proteins is difficult |
Less post‐translational modifications, production of membrane proteins is difficult |
Slow, higher cost, production of membrane proteins is difficult |
Slow, high cost, lower yield |
High cost, less post‐translational modifications, efficient production requires highly‐specialized setup |
High cost, lower yield than |
Spirin, 2004; Langlais et al., 2007; for other references, see text.
Figure 1Schematic overview of the topics covered in this review, highlighting the multiple parameters (listed on the right) that can greatly impact the success of soluble expression.
Figure 2Flowchart of a general expression protocol used by the authors to express a broad range of targets, from phosphatases, to neuronal scaffolding proteins, to bacterial signaling proteins. The approximate time required to complete each segment of the protocol is listed to the left of the corresponding step.
Promoter Systems used to Direct Recombinant Protein Expression in
|
|
Description |
Induction |
Advantages |
Disadvantages |
|---|---|---|---|---|
|
| ||||
|
T7 RNA polymerase |
T7 RNA polymerase gene under the control of L8‐UV5 |
IPTG |
High level of expression: accumulate up to 50% total cell protein Well characterized, used most often Titrate expression using Tuner strains |
Leaky expression: use pLysS strains for expression of proteins toxic to host |
|
|
Promoter is controlled by AraC regulator |
|
Tight regulation Titrate expression levels from low to high Low basal expression: suitable for production of proteins toxic to host |
Repressed expression state is not always zero, gene‐dependent |
|
|
−35 sequence from |
IPTG |
High level of expression: accumulate 15%‐30% of total cell protein |
Very leaky expression: not optimal for expression of proteins toxic to host Newer, more efficient systems are available |
|
|
Promoter from the major cold‐shock protein in |
Temperature downshift from 37°C (expression optimal between 10°C‐25°C) |
Efficient expression at low temperatures Can improve folding, lower inclusion body formation Advantageous for expression of aggregation‐prone and proteolytically‐sensitive proteins Induction is cost efficient |
Leaky expression: not optimal for expression of proteins toxic to host Translational efficiency slows at lower temperatures Not titratable |
|
| ||||
|
Phage promoter |
Phage promoter that is regulated by the temperature‐sensitive cI repressor |
Temperature shift from 30°C to 42°C |
Moderately high expression Induction is cost efficient |
High basal level of expression at temperatures below 30°C Induction cannot be performed at low temperatures |
|
|
Promoter for the gene of the periplasmic alkaline phosphatase |
Lower phosphate concentration in the growth medium |
Promotes secretion to the periplasm Inexpensive induction |
Phosphate limitation can have negative effects on metabolism of host cell Not titratable Limited media options |
|
|
Promoter for |
Nalidixic acid |
Tight regulation No growth media or temperature restrictions |
Not titratable |
|
|
Regulated by the tetR repressor |
Anhydrotetracycline |
Moderately high expression Low basal expression Independent of |
Not titratable |
See Elvin et al., 1990.
See Kikuchi et al., 1981; Miyake et al., 1985; Chang et al., 1986.
See Shirakawa et al., 1984; Olins and Rangwala, 1990.
See Delatorre et al., 1984, Skerra, 1994.
Characteristics of Commonly used Fusion Tags
|
Tag |
Protein |
Amino acids |
Size (kDa) |
Source organism |
Purification aid |
Affinity matrix |
Comments |
|---|---|---|---|---|---|---|---|
|
| |||||||
|
GST |
Glutathione |
243 |
28.1 |
|
Yes |
Glutathione agarose |
Forms dimer in solution |
|
MBP |
Maltose binding protein |
390 |
43.0 |
|
Yes |
Amylose resin |
Strong solubility‐enhancer |
|
DsbA |
Disulfide oxidoreductase |
228 |
25.4 |
|
No |
Aids periplasmic disulfide‐bond formation | |
|
NusA |
N‐utilizing substance A protein |
535 |
59.3 |
|
No |
Strong solubility‐enhancer | |
|
Trx |
Thioredoxin |
135 |
14.7 |
|
No |
Aids cytosolic disulfide‐bond formation | |
|
Z‐domain |
Protein A IgG ZZ repeat domain |
91 |
10.6 |
|
Yes |
Protein A‐sepharose | |
|
GB1 |
Protein G β1 domain |
85 |
9.7 |
|
Yes |
IgG‐resins |
Used often with proteins for NMR |
|
SUMO |
Small ubiquitin‐like modifier |
99 |
11.1 |
|
No | ||
|
SET |
Solubility‐enhancing tags |
<40 |
T7 phage gene 10B; synthetic |
No |
Small, highly acidic peptide tags that limit protein aggregation | ||
|
HaloTag‐7 |
Catalytically‐inactive derivative of DhaA |
296 |
34.0 |
|
Yes |
HaloLink resin |
Strong solubility‐enhancer |
|
| |||||||
|
His6 |
Hexahistadine |
6 |
0.8 |
Synthetic |
Yes |
Immobilized metal resin |
Often combined with solubility‐enhancing tags |
|
Intein |
Protein splicing element |
128‐1650 |
Variable |
Yes |
Chitin resin |
Remove from resin by induced self‐cleavage | |
See Nilsson et al., 1987; Zhao et al., 2005.
See Bao et al., 2006.
See Ohana et al., 2009.
See Chong et al., 1998; Singleton et al., 2002.
Commonly used Proteases to Remove Fusion Tags from Recombinant Proteins
|
Protease |
Description |
Cleavage site |
Protease inhibitors |
Activity |
Comments |
|---|---|---|---|---|---|
|
TEV |
Catalytic domain of the Nuclear Inclusion a (NIa) protein, a cysteine protease, found in the tobacco etch virus (TEV) |
EXXYXQ‐(G/S) Most common: ENLYFQ‐X |
PMSF, AEBSF, TLCK, pepstatin A, bestatin, E‐64, zinc (>5 mM), EDTA (1 mM), reagents that react with cysteine; various detergents |
pH: 4‐9 Temperature: 4°C‐37°C (max. at 34°C) Can accommodate most buffers |
Sequence specificity more stringent than factor Xa, thrombin, and enterokinase Can be produced in |
|
3C |
Recombinant form of the 3C protease from human rhinovirus type 14 |
EVLFQ‐GP |
PMSF, TLCK, leupeptin, zinc (100 mM), urea (1 M), guanidine (1 M) |
pH: 3‐10 Temperature: 4°C‐37°C (max. at 4°C) Test activity in buffer of choice |
High sequence specificity Can be produced in |
|
Xa |
Factor Xa is a serine protease that converts prothrombin to thrombin |
I(E/N)GR‐ Will not cleave at site followed by P or R |
PMSF, AEBSF, DFP, aprotinin, antithrombin III, antipain, α1‐antitypsin, chymostatin, hirudin, leupeptin, urea (100 mM), guanidine (10 mM), NaCl (100 mM), imidazole (100 mM) |
pH: 6.5‐9 Temperature: 4°C‐37°C Activity is highly reduced in phosphate buffers compared to Tris or HEPES, CaCl2 should be included in cleavage reaction |
Nonspecific proteolysis may occur at secondary sites Cleavage usually performed near physiological conditions |
|
Thr |
Thrombin is a serine protease that converts fibrinogen into fibrin |
LVPR‐GS |
PMSF, AEBSF, aprotinin, antithrombin III, antipain, α1‐antitypsin, chymostatin, hirudin, leupeptin, reducing agents |
pH: 5‐10 Temperature: 4°C‐37°C (max. at 37°C) Can accommodate most buffers |
Nonspecific proteolysis may occur at secondary sites Cleavage usually performed near physiological conditions |
|
EntK |
Catalytic subunit of bovine enterokinase |
DDDDK‐ |
Serine protease inhibitors, PMSF, imidazole (250 mM), NaCl (250 mM), urea (2 M), SDS |
pH: 6.0‐8.5 Temperature: 4°C‐37°C Activity is highly reduced in phosphate buffers compared to Tris or MES |
Nonspecific proteolysis may occur at secondary sites |
The “–” indicates the site of cleavage within the single letter amino acid code.
See Miyashita et al., 1992.
See Nagai and Thogersen, 1984; Jenny et al., 2003.
Figure 3Flowchart depicting the critical factors to consider, common obstacles, and potential solutions for each stage of protein expression in . The left column lists the major steps of recombinant protein expression with key variables to consider. The middle column includes common obstacles encountered at each step, while possible solutions for each obstacle are presented in the right column.