| Literature DB >> 34963459 |
Hyunjun Ko1, Minsik Kang1,2, Mi-Jin Kim1, Jiyeon Yi1, Jin Kang1,2, Jung-Hoon Bae1, Jung-Hoon Sohn3,4, Bong Hyun Sung5,6.
Abstract
BACKGROUND: Proteins with novel functions or advanced activities developed by various protein engineering techniques must have sufficient solubility to retain their bioactivity. However, inactive protein aggregates are frequently produced during heterologous protein expression in Escherichia coli. To prevent the formation of inclusion bodies, fusion tag technology has been commonly employed, owing to its good performance in soluble expression of target proteins, ease of application, and purification feasibility. Thus, researchers have continuously developed novel fusion tags to expand the expression capacity of high-value proteins in E. coli.Entities:
Keywords: Carbohydrate-binding module; Escherichia coli; Fusion tag; Levan-agarose; Soluble expression
Mesh:
Substances:
Year: 2021 PMID: 34963459 PMCID: PMC8715580 DOI: 10.1186/s12934-021-01725-w
Source DB: PubMed Journal: Microb Cell Fact ISSN: 1475-2859 Impact factor: 5.328
General fusion tags used in recombinant protein production
| Tag | Protein | Origin | Size (kDa) | Purpose | References |
|---|---|---|---|---|---|
| MBP | Maltose-binding protein | 43 | B | [ | |
| GST | Glutathione- | 27 | B | [ | |
| Trx | Thioredoxin | 12 | S | [ | |
| NusA | N-utilization substance | 54 | S | [ | |
| SUMO | Small ubiquitin-modified | 11 | S | [ | |
| 6 × His | Hexa-histidine | n/a | < 1 | A | [ |
| FLAG | FLAG-octapeptide | n/a | < 1 | A | [ |
| STREP | Streptavidin binding peptide | n/a | < 1 | A | [ |
| CBM66 | Carbohydrate-binding module 66 | 18 | B | This study |
n/a not applicable, B bifunctional, S soluble expression, A affinity purification
Fig. 1Map of the constructed protein expression plasmid harboring CBM66 fusion tag. The target protein was designed to be expressed with a CBM66 tag on the N-terminus and a His-tag on the C-terminus under the T7 promoter. The flexible linker between CBM66 and the passenger protein comprised a double repeat of four glycine and one serine [(G4S)2]. EK site indicates enterokinase recognition sequence (D4K)
Fig. 2Soluble expression with the CBM66 tag. Expression profiles of seven target proteins analyzed using a SDS-PAGE and b western blotting. M, molecular marker; S, soluble protein; I, insoluble; ( −), without CBM66 tag; ( +), with CBM66 tag. c Quantification of expression profiles. The amount of soluble and insoluble proteins are represented as black and white bars, respectively. All images are representative of experiments performed in triplicate; all data are expressed as the mean ± standard deviation
Fig. 3Expression of PETase with the CBM66, MBP, and GST fusion tags. Expression levels of PETase with different fusion tags at various induction temperatures. The amounts of soluble and insoluble proteins are represented as black and white bars, respectively. All images are representative of experiments performed in triplicate; all data are expressed as the mean ± standard deviation. The statistical significance of the data was analyzed by unpaired t-test. Values of p < 0.05 were considered to indicate statistically significant results. *p < 0.05, ns: not significant
Fig. 4Purification of intact protein from the fusion product. a SDS-PAGE analysis of purification steps. The CBM66 tag and PETase cleaved by enterokinase (lane 1); PETase collected from flow through (lane 2). The CBM66 tag eluted with 50 mM and 100 mM sodium chloride solutions (lanes 3 and 4, respectively). b Bioactivity of PETase samples during the purification steps (intact, CBM66-fused, and purified PETase). Bioactivity of two CBM66-fused model proteins (EGF and ADH1) were compared with commercial enzymes on c and d, respectively. All images are representative of experiments performed in triplicate; all data are expressed as the mean ± standard deviation
Primers used in this study
| Amplicon | Primer | Sequence (5’ to 3’) |
|---|---|---|
| CBM66 | F1 | AGAAGGAGATATACATATGGGAACGACACCT |
| R1 | GTAACGAAGGAGTCTCTCGAGCACCACCAC | |
| CBM66-La | F2 | GTAACGAAGGAGTCTGGTGGCGGAGGCAGCGGTGGAGGCGGATCC |
| R2 | ATCGTCATCGTCACCGGATCCGCCTCCACCGCTGCCTCCGCCACC | |
| CBM66-L-EKb | F3 | GGTGGAGGCGGATCCGGTGACGATGACGATAAGXXXXXXXXXXXXXXX |
| R3 | XXXXXXXXXXXXXXXTCAGTGGTGGTGGTGGTGGTGCTCGAG | |
| pMAL-L-EK | F4 | GGAGATATACATATGAAAATAAAAACAGGT |
| R4 | GCTGCCTCCGCCACCAGTCTGCGCGTCTTT | |
| pGST-L-EK | F5 | GGAGATATACATATGTCCCCTATACTAGGT |
| R5 | GCTGCCTCCGCCACCTTTTGGAGGATGGTC | |
| pLE | F6 | AGAAGGAGATATACATATGGGTGGCGGAGGC |
aL, flexible linker domain [double repeat of four glycine and one serine, (G4S)2]
bEK, enterokinase cleavage site
X, nucleotides of target genes