Literature DB >> 28330113

Protein fusion tags for efficient expression and purification of recombinant proteins in the periplasmic space of E. coli.

Ajamaluddin Malik1.   

Abstract

Disulfide bonds occurred in majority of secreted protein. Formation of correct disulfide bonds are must for achieving native conformation, solubility and activity. Production of recombinant proteins containing disulfide bond for therapeutic, diagnostic and various other purposes is a challenging task of research. Production of such proteins in the reducing cytosolic compartment of E. coli usually ends up in inclusion bodies formation. Refolding of inclusion bodies can be difficult, time and labor consuming and uneconomical. Translocation of these proteins into the oxidative periplasmic compartment provides correct environment to undergo proper disulfide bonds formation and thus achieving native conformation. However, not all proteins can be efficiently translocated to the periplasm with the help of bacterial signal peptides. Therefore, fusion to a small well-folded and stable periplasmic protein is more promising for periplasmic production of disulfide bonded proteins. In the past decades, several full-length proteins or domains were used for enhancing translocation and solubility. Here, protein fusion tags that significantly increase the yields of target proteins in the periplasmic space are reviewed.

Entities:  

Keywords:  Fusion protein; Periplasmic space; Protein folding; Solubility enhancer

Year:  2016        PMID: 28330113      PMCID: PMC4742420          DOI: 10.1007/s13205-016-0397-7

Source DB:  PubMed          Journal:  3 Biotech        ISSN: 2190-5738            Impact factor:   2.406


Introduction

Since the advent of production of recombinant proteins, application of therapeutic and diagnostic proteins as biopharmaceuticals was changed remarkably (Walsh 2014). These proteins are required in huge amount and usually can not be obtained from natural sources due to extremely low availability. Moreover, Genetically engineered proteins with special benefits (e.g. Insulin analogs) are as such molecules which can therefore only be obtained via recombinant technology (Walsh 2000, 2006; Sanchez and Demain 2012). Escherichia coli was the first and still popularly used host for the fast and economical production of recombinant proteins (Vincentelli and Romier 2013; Chance et al. 1981; Choi and Lee 2004; Rosano and Ceccarelli 2014; Lebendiker and Danieli 2014). In-depth knowledge of genetic and biochemical pathways of E. coli and availability of variety of vectors made is an attractive host for such purposes. Although significant improvements have been made at transcription, translation and translocation, still obtaining soluble and bioactive proteins is a major challenge (Pines and Inouye 1999; Baneyx 1999; Rosano and Ceccarelli 2014). Secreted proteins such as antibodies, enzymes, hormones etc. are used for therapeutic and diagnostic applications. Secreted proteins having two or more cysteines makes disulfide bonds, which is usually vital for structure formation and bioactivity (Creighton 1997b; Creighton et al. 1995; Clarke and Fersht 1993). The cytosol of E. coli is reducing which gives inclusion bodies when such proteins are expressed in the cytosol (Freedman 1989; Hwang et al. 1992; Aslund et al. 1994; Carmel-Harel and Storz 2000; Russel 1995; Messens and Collet 2006). Usually in vitro oxidative refolding is difficult, laborious, time consuming and may be uneconomical depending upon refolding yield (Lilie et al. 1998; Lange and Rudolph 2009; Yamaguchi et al. 2013; Basu et al. 2011). Translocation of these proteins into the E. coli periplasm provides favorable environment for oxidative folding due to the presence of disulfide bond folding and isomerization machinery (Gopal and Kumar 2013; Yoon et al. 2010; Choi and Lee 2004). Moreover, proteases are less abundant in periplasm and also its relatively less crowded than cytosol which reduces the chances of proteolysis and ease in the purification of recombinant proteins (Makrides 1996). To secrete proteins into periplasmic space, a translocation signal sequence must be fused at the N terminus of proteins, but only the fusion of signal sequence is not enough for efficient protein translocation (Fekkes and Driessen 1999; Muller et al. 2001). The sequences on mature protein next to the signal peptidase cut site and other parts of mature protein play an important role in the secretion (Lee et al. 1989; Malik et al. 2006). Under such condition, fusion to a full-length periplasmic protein that is well stable, soluble and properly folded is more promising (Table 1).
Table 1

Properties of periplasmic fusion proteins

Fusion proteinMW (kDa)calc. pIS–S bondSubcellular location
Ecotin165.941Periplasm
Maltose-binding protein40.75.070Periplasm
Z-domain of protein A6.65.160Secreted
ABD-domain of protein G64.460Secreted
CBD from exonuclease11.18.441Secreted
CBD from endonuclease10.96.071Secreted
Disulfide bond oxidoreductase21.15.421Periplasm
Barnase12.38.880Secreted

Size, calculated isoelectric point, number of disulfide bond and native localization were evaluated

Properties of periplasmic fusion proteins Size, calculated isoelectric point, number of disulfide bond and native localization were evaluated Over two decades of extensive in vivo and in vitro research on protein fusions constructs concluded that fusion tags usually increases the yield and solubility of their fusion partners (Costa et al. 2014; Waugh 2005). Despite all these advancement, still it is difficult to choose the best fusion system for a given protein of interest. In general, selection of fusion tag depends upon the properties of protein of interest itself such as size, stability, and hydrophobicity; the expression site; and the usage of the recombinant protein. After coupling with second protein (fusion tag) the increase in yield and solubility the target proteins varies in each fusions. The detailed mechanism by which fusion proteins improve solubility and yield is not well understood. There is two hypotheses: (a) fusion of a stable or conserved structure to an insoluble recombinant protein may serve to stabilize and promote proper folding of the recombinant protein (Butt et al. 2005) and (b) fusion tags may act as a nucleus of folding “molten globule hypothesis” (Creighton 1997a). Ideally, an effective periplasmic fusion system should have the following features: (a) efficient translocator; (b) enhance folding and solubility; (c) help in purification; (d) facilitate quantification; (e) minimize proteolysis; (f) no adverse effect on the structure and bioactivity; (g) easy and specific removal of the fusion tag; (h) useful for different classes of proteins and peptides. However, none of the fusion tag is optimal with respect to all of these parameters. Successful examples of each periplasmic fusion proteins are listed in Table 2. In the following sections, merits and demerits of available periplasmic fusion proteins are discussed.
Table 2

Examples of protein fusion tag assisted production of recombinant proteins in the periplasmic space of E. coli

Fusion proteinModel proteinReferences
EcotinPepsinogenMalik et al. (2006)
EcotinProinsulinMalik et al. (2007)
EcotinOctapeptidePaal et al. (2009)
MBPMerozoite surface protein IPlanson et al. (2003)
MBPNanobodies (singe domain antibody)Salema and Fernandez (2013)
MBPMembrane protein U24Tait and Straus (2011)
MBPSingle chain antibodyHayhurst (2000)
MBPPokeweed antiviral proteinHonjo and Watanabe (1999)
SpAInsulin like growth factor-IIHammarberg et al. (1989)
SpAAlkaline phosphataseEngel et al. (1992)
SpGInsulin like growth factor-IIHammarberg et al. (1989)
SpGOctapeptideStahl et al. (1989)
CBDPolypeptideHasenwinkle et al. (1997)
CBDLipaseHwang et al. (2004)
CBDBeta-glucosidaseOng et al. (1991)
CBDAlkaline phosphataseGreenwood et al. (1989)
DsbAEnterokinase catalytic subunitCollinsracie et al. (1995)
DsbAProinsulinWinter et al. (2000)
BarnaseCystein knot peptideSchmoldt et al. (2005)
Examples of protein fusion tag assisted production of recombinant proteins in the periplasmic space of E. coli

Ecotin

Ecotin (E. coli trypsin inhibitor) is a homodimeric protein which is naturally localized in the periplasmic space (Table 1). The properties of ecotin make it a promising periplasmic fusion tag. It is moderately small in size (16 kDa monomer), extremely stable (tolerates pH 1.0 and 100 °C for 30 min) and contains one disulfide bond in each subunit (Chung et al. 1983). Due to the presence of disulfide bonds, ecotin undergoes a pathway of oxidative folding. Naturally, ecotin is constitutively expressed (Chung et al. 1983) for the defense of E. coli against trypsin like serine proteases in the digestive tract and neutrophil elastase like serine proteases in the blood. Ecotin had no metabolic role or interaction with other proteins in E. coli (Eggers et al. 2004). The C termini of each monomer in dimeric ecotin protrude in opposite directions (Fig. 1a), which will allow folding of passenger proteins at each end without steric hindrance. Strong affinity of ecotin’s for trypsin like serine protease will facilitate ecotin fusion protein to purify via affinity chromatography. Ecotin’s binding surface has been already randomized (Stoop and Craik 2003) to reduce its affinity to zymogens of serine proteases, which would help to elute ecotin fusion proteins under softer conditions.
Fig. 1

Three-dimensional structure of periplasmic fusion proteins. a ecotin (1ECZ), b Maltose binding protein (1DMB), c Z-domain of protein A (1LP1), d ABD-domain of protein G (1EM7), e CBD of endoglucanase (1EXG), f CBD of exoglucanase (modelled 3D structure), g disulfide bond oxidoreductase (1A2J), h Barnase (1RNB)

Three-dimensional structure of periplasmic fusion proteins. a ecotin (1ECZ), b Maltose binding protein (1DMB), c Z-domain of protein A (1LP1), d ABD-domain of protein G (1EM7), e CBD of endoglucanase (1EXG), f CBD of exoglucanase (modelled 3D structure), g disulfide bond oxidoreductase (1A2J), h Barnase (1RNB) Moreover, model protein in the ecotin fusion system can be quantatively measured in a very sensitive trypsin inhibition assays (Kang et al. 2005). Even in the cytosol ecotin is stable and active; which makes it suitable candidate to be used as cytoplasmic fusion tag (Kang et al. 2005). Ecotin can also be produced in monomeric native state after removal of the last 10 residues (Pal et al. 1996) Thus, ecotin fusion protein in monomeric state is feasible. Ecotin fusion tag have been used for efficient translocation, solubility enhancement and purification of proteins and peptides (Paal et al. 2009; Malik et al. 2006, 2007).

Maltose-binding protein

Maltose-binding protein (MBP) is cysteine-less relatively large (40.6 kDa) periplasmic protein (Fig. 1b) (Duplay et al. 1984). It is known for its noteworthy solubility enhancement when it is fused at the N terminus of model proteins (Raran-Kurussi et al. 2015; Raran-Kurussi and Waugh 2012; Sachdev and Chirgwin 1998). MBP has been frequently utilized for cytosolic expression but due to its natural periplasmic localization, it is also utilized as periplasmic fusion tag for enhancing secretion, solubility as well as purification of target proteins (Salema and Fernandez 2013; Planson et al. 2003). In certain cases, it was found that MBP attains natively folded state and remains soluble while the passenger proteins could not attained properly folded state and exist as in the state of soluble aggregates (Nallamsetty et al. 2005; Nomine et al. 2001; Sachdev and Chirgwin 1999). The affinity of MBP for maltose is ~1 μM which allowed to purify MBP fusion protein through affinity chromatography (Betton and Hofnung 1996). Moreover, MBP is thermodynamically moderately stable with the T m of 62.8 °C at pH 8.3 (Novokhatny and Ingham 1997) and individual components of MBP fusions are slightly more stable than their counterparts in the fusion protein (Blondel et al. 1996).

Staphylococcal protein A

Staphylococcal protein A (SpA) is a surface protein of Gram-positive bacterium Staphylococcus aureus which has strong affinity and high specificity for constant (Fc) part of human immunoglobulins as well as large number of other animals (Eliasson et al. 1988; Cedergren et al. 1993). SpA is a highly soluble 31 kDa protein. Chemically denatured SpA renatures efficiently which assists refolding of the target protein in the SpA fusion system (Samuelsson et al. 1991). SpA is a cysteine-less protein, thus abolishing the chances of interference in disulfide bond formation with fused protein of interest (Kashimura et al. 2013; Uhlen et al. 1984). The gene of SpA is highly repetitive which consists of signal sequence followed by five small highly similar domains (E, D, A, B and C) and C terminal membrane anchoring sequence. The B-domain has been engineered to create smaller variants (7 kDa) of SpA, called as Z-domain (Nilsson et al. 1987). Depending upon localization requirements of the target protein, large number of expression plasmids with or without signal sequences for the production of single Z-domain (7 kDa) or double Z-domains (14 kDa) fusions (Fig. 1c) has been developed (Nilsson et al. 1994, 1996; Hammarberg et al. 1989; Stahl et al. 1989). The fusion protein with Z-domain was more efficiently translocated in comparison to full length SpA proteins (Nilsson et al. 1997).

Streptococcal protein G

Streptococcal protein G (SpG) present on the streptococci surface is a bifunctional receptor and capable of binding with both IgG and serum albumin from different species with different affinities (Nygren et al. 1988). The IgG and albumin binding regions are structurally separated on the SpG. The serum albumin binding region is known as ABD (albumin-binding domain), consists of three binding motifs (each ~5 kDa) (Fig. 1d). Depending upon the localization of the target proteins, ABD with or without signal sequence has been used for expression of fusion protein. Subsequently, fusion proteins were purified via HSA-affinity chromatography in one-step (Hammarberg et al. 1989; Larsson et al. 1996; Stahl et al. 1989).

Cellulose binding domain (CBD)

Nearly 111 residues from endoglucanase (Fig. 1e) and 100 residues from exoglucanase (Fig. 1f) of Cellulomonas fimi, which has high affinity for cellulose, have been used for translocation to periplasmic space and solubility enhancement of target proteins (Gilkes et al. 1988, 1992; Warren et al. 1986; Hwang et al. 2004; Hasenwinkle et al. 1997; Creagh et al. 1996; Ong et al. 1991). The purification of cellulose binding domain fusion protein was achieved via relatively inexpensive ligand matrix (cellulose) (Greenwood et al. 1989, 1992; Ong et al. 1991).

Disulfide bond oxidoreductase

Disulfide bond oxidoreductase (DsbA) is the key enzyme of periplasmic oxidoreductive system (Fig. 1g). It facilitates correct disulfide bond formation via intra- and intermolecular catalysis (Bardwell et al. 1993). In biotechnological applications, target proteins having multiple disulfide bonds (enterokinase catalytic subunit, proinsulin) were fused at the C terminus of DsbA to enhance disulfide bond formation as well as stabilize unfolded target protein via its polypeptide binding site (Collinsracie et al. 1995; Winter et al. 2000). After fusion with DsbA, these proteins were obtained in the well-folded soluble state in the periplasmic space. DsbA is a potent protein thiol oxidase. It has been observed in vitro experiments that DsbA causes non-native disulfide bond formation in proteins having multiple disulfide bonds (Hirudin, BPTI) (Wunderlich and Glockshuber 1993; Zapun and Creighton 1994). Also, in vivo co-expression of DsbA resulted in inclusion bodies formation of IGF-I (Joly et al. 1998).

Barnase

Barnase is an enzymatically inactive variant (H102A) of extracellular RNAse from Bacillus amyloliquefaciens (Fig. 1h). It is monomeric, cysteine-less protein of relatively small size. For biotechnological applications, enzymatically inactive variant of RNAse was used as a fusion protein to enhance the secretion of cysteine-knot peptides in the periplasmic space. It was found that majority of the cysteine-knot peptides were in the native state when fused with barnase (Schmoldt et al. 2005). Moreover, the Barnase fusion protein could be purified via immobilized barstar (Barnase inhibitor) in single step (Schmoldt et al. 2005).

Conclusion

Every protein is unique and due to their different applications such as academic research, diagnostic or therapeutic usage, the quantity and purity level vary. Therefore, no single fusion tag will address every requirement. Fusion tags are helpful in enhancing their solubility and stability. Protein fusion tag with μM-nM ligand affinity generally results in 90–99 % purity after affinity chromatography. Removal of protein fusion tag and producing recombinant protein with authentic N terminal adds another layer of complexity. When considering which protein fusion to use, important queries should keep in mind such as: nature of protein itself, how much protein required, application of protein, is fusion tag removal necessary or not, how much additional residues could be tolerated at N terminal? To remove most part of the fusion protein, highly specific protease cleavage site (TEV protease, thrombin, enterokinase, etc.) could be placed in the linker region between fusion tag and model protein. Also, non-specific proteases such as trypsin could be used to generate authentic N terminus as demonstrated in the case of Ecotin-proinsulin fusion protein (Malik et al. 2007). If authentic N terminus is must for the application, ubiquitin fusion technology could be used as successfully demonstrated in ecotin–ubiquitin–peptide fusion system (Paal et al. 2009).
  87 in total

1.  A fusion protein system for the recombinant production of short disulfide bond rich cystine knot peptides using barnase as a purification handle.

Authors:  Hans-Ulrich Schmoldt; Alexander Wentzel; Stefan Becker; Harald Kolmar
Journal:  Protein Expr Purif       Date:  2005-01       Impact factor: 1.650

2.  Biopharmaceutical benchmarks 2006.

Authors:  Gary Walsh
Journal:  Nat Biotechnol       Date:  2006-07       Impact factor: 54.908

Review 3.  Refolding of proteins from inclusion bodies: rational design and recipes.

Authors:  Anindya Basu; Xiang Li; Susanna Su Jan Leong
Journal:  Appl Microbiol Biotechnol       Date:  2011-08-07       Impact factor: 4.813

Review 4.  How important is the molten globule for correct protein folding?

Authors:  T E Creighton
Journal:  Trends Biochem Sci       Date:  1997-01       Impact factor: 13.807

5.  Biopharmaceutical benchmarks 2014.

Authors:  Gary Walsh
Journal:  Nat Biotechnol       Date:  2014-10       Impact factor: 54.908

6.  Competitive elution of protein A fusion proteins allows specific recovery under mild conditions.

Authors:  J Nilsson; P Nilsson; Y Williams; L Pettersson; M Uhlén; P A Nygren
Journal:  Eur J Biochem       Date:  1994-08-15

Review 7.  Strategies for achieving high-level expression of genes in Escherichia coli.

Authors:  S C Makrides
Journal:  Microbiol Rev       Date:  1996-09

8.  Precise excision of the cellulose binding domains from two Cellulomonas fimi cellulases by a homologous protease and the effect on catalysis.

Authors:  N R Gilkes; R A Warren; R C Miller; D G Kilburn
Journal:  J Biol Chem       Date:  1988-07-25       Impact factor: 5.157

9.  Fusion to an endoglucanase allows alkaline phosphatase to bind to cellulose.

Authors:  J M Greenwood; N R Gilkes; D G Kilburn; R C Miller; R A Warren
Journal:  FEBS Lett       Date:  1989-02-13       Impact factor: 4.124

10.  The ability to enhance the solubility of its fusion partners is an intrinsic property of maltose-binding protein but their folding is either spontaneous or chaperone-mediated.

Authors:  Sreejith Raran-Kurussi; David S Waugh
Journal:  PLoS One       Date:  2012-11-16       Impact factor: 3.240

View more
  8 in total

1.  Ectopic Expression of Plant RNA Chaperone Offering Multiple Stress Tolerance in E. coli.

Authors:  Bushra Jabeen; S M Saqlan Naqvi; Tariq Mahmood; Tasawar Sultana; Madiha Arif; Fariha Khan
Journal:  Mol Biotechnol       Date:  2017-03       Impact factor: 2.695

2.  Extracellular production of Ulp1403-621 in leaky E. coli and its application in antimicrobial peptide production.

Authors:  Linglong Fu; Mengning Sun; Weizhang Wen; Na Dong; Defa Li
Journal:  Appl Microbiol Biotechnol       Date:  2022-10-19       Impact factor: 5.560

Review 3.  Challenges Associated With the Formation of Recombinant Protein Inclusion Bodies in Escherichia coli and Strategies to Address Them for Industrial Applications.

Authors:  Arshpreet Bhatwa; Weijun Wang; Yousef I Hassan; Nadine Abraham; Xiu-Zhen Li; Ting Zhou
Journal:  Front Bioeng Biotechnol       Date:  2021-02-10

4.  Coexpressing the Signal Peptide of Vip3A and the Trigger Factor of Bacillus thuringiensis Enhances the Production Yield and Solubility of eGFP in Escherichia coli.

Authors:  Jianhua Gao; Chunping Ouyang; Juanli Zhao; Yan Han; Qinghua Guo; Xuan Liu; Tianjiao Zhang; Ming Duan; Xingchun Wang; Chao Xu
Journal:  Front Microbiol       Date:  2022-07-18       Impact factor: 6.064

5.  Microreactor equipped with naturally acid-resistant histidine ammonia lyase from an extremophile.

Authors:  Carina Ade; Thaís F Marcelino; Mark Dulchavsky; Kevin Wu; James C A Bardwell; Brigitte Städler
Journal:  Mater Adv       Date:  2022-03-29

6.  Predictive approaches to guide the expression of recombinant vaccine targets in Escherichia coli: a case study presentation utilising Absynth Biologics Ltd. proprietary Clostridium difficile vaccine antigens.

Authors:  Hirra Hussain; Edward A McKenzie; Andrew M Robinson; Neill A Gingles; Fiona Marston; Jim Warwicker; Alan J Dickson
Journal:  Appl Microbiol Biotechnol       Date:  2021-06-28       Impact factor: 4.813

7.  Development of a generic β-lactamase screening system for improved signal peptides for periplasmic targeting of recombinant proteins in Escherichia coli.

Authors:  Tania Selas Castiñeiras; Steven G Williams; Antony Hitchcock; Jeffrey A Cole; Daniel C Smith; Tim W Overton
Journal:  Sci Rep       Date:  2018-05-03       Impact factor: 4.379

8.  Passenger sequences can promote interlaced dimers in a common variant of the maltose-binding protein.

Authors:  Afaque A Momin; Umar F Shahul Hameed; Stefan T Arold
Journal:  Sci Rep       Date:  2019-12-31       Impact factor: 4.379

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.