| Literature DB >> 22073212 |
Jin-Ho Ahn1, Jung-Won Keum, Dong-Myung Kim.
Abstract
While access to soluble recombinant proteins is essential for a number of proteome studies, preparation of purified functional proteins is often limited by the protein solubility. In this study, potent solubility-enhancing fusion partners were screened from the repertoire of endogenous E. coli proteins. Based on the presumed correlation between the intracellular abundance and folding efficiency of proteins, PCR-amplified ORFs of a series of highly abundant E. coli proteins were fused with aggregation-prone heterologous proteins and then directly expressed for quantitative estimation of the expression efficiency of soluble translation products. Through two-step screening procedures involving the expression of 552 fusion constructs targeted against a series of cytokine proteins, we were able to discover a number of endogenous E. coli proteins that dramatically enhanced the soluble expression of the target proteins. This strategy of cell-free expression screening can be extended to quantitative, global analysis of genomic resources for various purposes.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22073212 PMCID: PMC3206877 DOI: 10.1371/journal.pone.0026875
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Three-step PCR reactions to assemble linear expression template.
(A) Schematic representation of PCR-based generation of fusion constructs. Two primary PCR products with defined overlapping ends are synthesized by the first PCR reaction. These two fragments are joined in a second PCR, overlap extension PCR and subsequent third PCR step introduces the regulatory elements necessary for transcription and translation to the fused target genes. (B) Sequence elements of final amplified expression template.
Fusion partners used in this study.
| Category | Partner protein |
| Conventional fusion partner (8) | MBP |
| Ribosomal protein (56) | 30S ribosomal subunitS1, S2, S3, S4, S5, S6, S7, S8, S9, S10, S11, S12, S13, S14, S15, S16, S17, S18, S19, S20, S21, S2250S ribosomal subunitL1, L2, L3, L4, L5, L6, L7, L9, L10, L11, L13, L14, L15, L16, L17, L18, L19, L20, L21, L22, L23, L24, L25, L27, L28, L29, L30, L31, L31B, L32, L33, L34, L35, L36 |
| Translation-related factor (4) | EF-Tu, EF-P, IF1, IF3 |
| Chaperone protein (17) | IbpA, IbpB, Skp, SlyD, DsbA, DsbB, DsbC, SecB, SecE, SecG, GrpE, FkpB, FklB, GroEL, GroES, GroEL191–345, GroEL191–376 |
| OB-fold domain (3) | LysN1–145, AspN1–102, AsnN1–99 |
*This family contains OB-fold domains that bind to nucleic acids (oligonucleotide/oligosaccharide-binding fold). The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl-tRNA synthetases [43].
Figure 2Agarose gel electrophoresis of PCR products.
Figure 3Solubility and expression efficiency of fused gene constructs.
Eighty seven fusion partner genes were combinatorially fused to three different genes (hBD2, hEGF, hEPO) using three-step PCR. All PCR products coding each fused gene were directly used as expression templates for cell-free protein synthesis where expression efficiency and solubility were measured. After 3 h for cell-free expression, the reaction samples were centrifuged at 10,000 rpm for 30 min. Both pellet and soluble fractions were analyzed by radioactivity counting. The degree of solubility and expression yield enhancement for each fusion gene is colorized with red and blue respectively.
Figure 4Combinatorial examination of fusion partners for the expression of different cytokines.
(A) Fold enhancement of soluble expression of cytokines by the examined fusion partners. (B) Fold enhancement of total expression yield by fusion with the examined fusion partners. (C) A bubble chart where the size of each bubble diameter is proportional to the fold enhancement of the expression efficiency of total protein. Detailed stacked bar graphs of individual fusion protein are shown in Figure S1.
Figure 5Statistical analysis for the relationships between solubility/yield and physicochemical properties.
(A) Relationship between expression efficiency and GC content. (B) Histograms of codon adaptation index for highly expressed genes and poorly expressed genes. (C) Sequence logos of downstream region of proteins with high expression efficiency (>70% enhancement), which was created with WebLogo software [44]. (D) Solubility distribution for quantified proteins. Histogram of solubility for the quantified proteins in Figure 3. The proteins with solubilities <30% and >70% were defined as the aggregation-prone (Agg, colored blue) and soluble (Sol, colored pink) groups, respectively. Scatter plot of solubility versus isoelectric point (E) and molecular weight (F). Histograms of the relative contents of negatively charged residues (Asp and Glu) (Left), hydrophobic residues (Val, Leu and Ile), aromatic residues (Phe, Tyr, and Trp), and positively charged residues (Lys, Arg, and His) in the Total, Agg, and Sol groups.