| Literature DB >> 26760206 |
Reka Letso1,2, Helen Neely1,2, W Nicholson Price1,2, Grégory Boël1,2,3, Kam-Ho Wong1,2, Min Su1,2, Jon Luff1,2, Mayank Valecha1,2, John K Everett2,4, Thomas B Acton2,4, Rong Xiao2,4, Gaetano T Montelione2,4,5, Daniel P Aalberts6, John F Hunt1,2.
Abstract
Degeneracy in the genetic code, which enables a single protein to be encoded by a multitude of synonymous gene sequences, has an important role in regulating protein expression, but substantial uncertainty exists concerning the details of this phenomenon. Here we analyse the sequence features influencing protein expression levels in 6,348 experiments using bacteriophage T7 polymerase to synthesize messenger RNA in Escherichia coli. Logistic regression yields a new codon-influence metric that correlates only weakly with genomic codon-usage frequency, but strongly with global physiological protein concentrations and also mRNA concentrations and lifetimes in vivo. Overall, the codon content influences protein expression more strongly than mRNA-folding parameters, although the latter dominate in the initial ~16 codons. Genes redesigned based on our analyses are transcribed with unaltered efficiency but translated with higher efficiency in vitro. The less efficiently translated native sequences show greatly reduced mRNA levels in vivo. Our results suggest that codon content modulates a kinetic competition between protein elongation and mRNA degradation that is a central feature of the physiology and also possibly the regulation of translation in E. coli.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26760206 PMCID: PMC5054687 DOI: 10.1038/nature16509
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962