| Literature DB >> 30484964 |
Thijs Nieuwkoop1, Nico J Claassens1,2, John van der Oost1.
Abstract
Different codon optimization algorithms are available that aim at improving protein production by optimizing translation elongation. In these algorithms, it is generally not considered how the altered protein coding sequence will affect the secondary structure of the corresponding RNA transcript, particularly not the effect on the 5'-UTR structure and related ribosome binding site availability. This is a serious drawback, because the influence of codon usage on mRNA secondary structures, especially near the start of a gene, may strongly influence translation initiation. In this study, we aim to reduce the effect of codon usage on translation initiation by applying a bicistronic design (BCD) element. Protein production of several codon-optimized gene variants is tested in parallel for a BCD and a standard monocistronic design (MCD). We demonstrate that these distinct architectures can drastically change the relative performance of different codon optimization algorithms. We conclude that a BCD is indispensable in future studies that aim to reveal the impact of codon optimization and codon usage correlations. Furthermore, irrespective of the algorithm used, using a BCD does improve protein production compared with an MCD. The overall highest expression from BCDs for both GFP and RFP is at least twofold higher than the highest levels found for the MCDs, while for codon variants having very low expression from the MCD, even 10-fold to 100-fold increases in expression were achieved by the BCD. This shows the great potential of the BCD element for recombinant protein production.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30484964 PMCID: PMC6302717 DOI: 10.1111/1751-7915.13332
Source DB: PubMed Journal: Microb Biotechnol ISSN: 1751-7915 Impact factor: 5.813
Figure 1(A) Genetic architecture of monocistronic and bicistronic design. (B,C) The effect of a bicistronic and monocistronic design on the expression of different codon‐optimized GFPuv (B) and mRFP (C) variants (RFU: relative fluorescence units). The regularly used GFPuv (GFPuv‐RU) sequence is compared to an optimized sequence (GFPuv‐Opt), Escherichia coli tRNA‐optimized sequence (GFPuv‐tRNA), E. coli tRNA‐optimized sequence with subsequent minimalized free energy (GFPuv‐tRNA‐dG), a minimal free energy transcript (GFPuv‐dG) and an E. coli harmonized sequence (GFPuv‐H). The regularly used mRFP sequence (mRFP‐RU) is compared with the E. coli optimized sequence (mRFP‐Opt), E. coli tRNA‐optimized sequence (mRFP‐tRNA), E. coli tRNA‐optimized sequence with subsequent minimalized free energy (mRFP‐tRNA‐dG) and a minimal free energy transcript (mRFP‐dG). Production is determined using flow cytometry for eight biological replicates for each variant. The error bars depict the standard deviation for the average expression of eight biological replicates. For each replicate, the expression level of 50 000 single cells is measured, averaged and normalized to a cell culture not expressing any fluorescent protein. For all cases, except mRFP‐Opt, the fluorescence of the BCD variants over the MCD variants is significantly different at a P‐value of 0.001. Similar results are obtained for fluorescence measurements obtained with a plate reader (Fig. S4). The MCD and BCD sequence can be found in Table S1.
Figure 2A. Secondary structure prediction of the mRFP‐tRNA transcript with a monocistronic design. The arrow indicates the nucleotide that was silently mutated in an attempt dissolve the structure (M83; C → A. 5′ and 3′ indicate the orientation of the RBS).
B. Secondary structure prediction of the mRFP‐tRNA transcript with a bicistronic design.
C. Secondary structure prediction of the mRFP‐tRNA M83 transcript with a monocistronic design. The RBS sites are highlighted in black, and the positional entropy for each nucleotide is indicated with a colour gradient. The free energy for each construct is calculated with a sequence window containing the 5′‐UTR and the first 36 nucleotides of the CDS.
D. Relative mRFP expression of the mRFP‐tRNA with the MCD, BCD and MCD M83 mutation (RFU: relative fluorescence units). The error bars depict the standard deviation for the average expression of eight biological replicates. For each replicate, the expression level of 50 000 single cells is measured, averaged and normalized to a cell line without mRFP. The mean differences are significantly different at a P‐value of 0.001.