| Literature DB >> 18084029 |
S Care1, C Bignon, M C Pelissier, E Blanc, B Canard, B Coutard.
Abstract
Recombinant protein translation in Escherichia coli may be limited by stable (i.e. low free energy) secondary structures in the mRNA translation initiation region. To circumvent this issue, we have set-up a computer tool called 'ExEnSo' (Expression Enhancer Software) that generates a random library of 8192 sequences, calculates the free energy of secondary structures of each sequence in the -70/+96 region (base 1 is the translation initiation codon), and then selects the sequence having the highest free energy. The software uses this 'optimized' sequence to create a 5' primer that can be used in PCR experiments to amplify the coding sequence of interest prior to sub-cloning into a prokaryotic expression vector. In this article, we report how ExEnSo was set-up and the results obtained with nine coding sequences with low expression levels in E. coli. The free energy of the -70/+96 region of all these coding sequences was increased compared to the non-optimized sequences. Moreover, the protein expression of eight out of nine of these coding sequences was increased in E. coli, indicating a good correlation between in silico and in vivo results. ExEnSo is available as a free online tool.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18084029 PMCID: PMC2248745 DOI: 10.1093/nar/gkm1097
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(A) Summary of the sequence of events performed by ExEnSo. (B) Sequence alignment of the mutable region (−17/+9) of the 10 sequences optimized by ExEnSo. S, sequence of the standard forward primer. 1 to 10, coding sequences used in this study (for details, see Supplementary Figure 1). On top of the alignment are indicated the mutable bases (N1 to Y8) and the translation initiation codon (ATG). Grey boxes contain the mutated bases.
Figure 2.(A) Left, graph representing the calculated free energy of region −70/+96. The Y-axis (ΔGO − ΔGS) is the difference between the free energy of the mutated sequence (ΔGO) and the free energy of the standard sequence (ΔGS). Raw ΔGO and ΔGS are in Supplementary Figure 1. The X-axis is made of the 10 coding sequences (the numbering from 1 to 10 is that used in Supplementary Figure 1). The negative control (a protein highly expressed in E. coli although bearing a standard −70/+96 sequence) is number 9. Open triangle, mono-adenine (1A) substituted sequence; open square, penta-adenine (5A) substituted sequence; black filled square, sequence optimized by ExEnSo. Right, the different mutations of the mutable region (−17/+9) are indicated in front of the corresponding curve [the baseline (X-axis) is the standard sequence (R4 = G)]. (B) In vivo protein expression of coding sequences 1–10 (protein numbering on top of gels, western- and dot-blots is the same as in A) was performed following a fractional factorial approach made of 12 expression conditions (12), and then analyzed by dot-blot using anti-His antibodies (left panel). The results provided by the 12 conditions are individually displayed from top to bottom. The condition providing the best optimized to standard expression ratio was used for analysis by SDS-PAGE followed by Coomassie blue staining (1–4, 10, middle panel). Proteins with low expression levels were analysed by western blotting and anti-His antibodies (5–8, right panel). S, standard sequence; O, sequence optimized by ExEnSo. M1, molecular weight marker (Proteins 2 and 4: 116, 66.2, 45, 35, 25, 18.4, 14.4 kDa. Protein 10: 116, 66.2, 45, 35, 25). M2, His-tagged molecular weight marker (40, 30, 20, 15, 10 kDa).