| Literature DB >> 28992099 |
Meng-Ze Du1, Wen Wei1, Lei Qin1, Shuo Liu1, An-Ying Zhang1,2, Yong Zhang1,2, Hong Zhou1,2, Feng-Biao Guo1,2,3.
Abstract
Although more and more entangled participants of translation process were realized, how they cooperate and co-determine the final translation efficiency still lacks details. Here, we reasoned that the basic translation components, tRNAs and amino acids should be consistent to maximize the efficiency and minimize the cost. We firstly revealed that 310 out of 410 investigated genomes of three domains had significant co-adaptions between the tRNA gene copy numbers and amino acid compositions, indicating that maximum efficiency constitutes ubiquitous selection pressure on protein translation. Furthermore, fast-growing and larger bacteria are found to have significantly better co-adaption and confirmed the effect of this pressure. Within organism, highly expressed proteins and those connected to acute responses have higher co-adaption intensity. Thus, the better co-adaption probably speeds up the growing of cells through accelerating the translation of special proteins. Experimentally, manipulating the tRNA gene copy number to optimize co-adaption between enhanced green fluorescent protein (EGFP) and tRNA gene set of Escherichia coli indeed lifted the translation rate (speed). Finally, as a newly confirmed translation rate regulating mechanism, the co-adaption reflecting translation rate not only deepens our understanding on translation process but also provides an easy and practicable method to improve protein translation rates and productivity.Entities:
Keywords: amino acid usage; co-adaption; minimum cost; tRNA gene copy number; translation rate
Mesh:
Substances:
Year: 2017 PMID: 28992099 PMCID: PMC5726483 DOI: 10.1093/dnares/dsx030
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Co-adaption between frequencies of tRNA gene copy numbers and amino acid usage. (A) Linear fitting results for 410 organisms. Dotted lines represent organisms with the P values of linear fitting are greater than 0.05; straight lines indicate that P values less than 0.05. (B) Corresponding Spearman rank correlation coefficients r for the linear fit of 410 genomes. (C) Linear fit of four model organisms.
Figure 2Co-adaption at the genome scale. (A) TAAIs of bacteria divided into two groups based on their growth rates. The dataset includes 53 bacteria with available information on growth rates: which were classed into two groups by the mean growth time. The fast group has higher TAAIs than the slow. The GC content of the two groups are similar (The students’ t test: P = 0.42), their genome sizes varied non-significantly (The fast generally has larger genome than the slow: P = 0.07). (B) Prokaryotic organisms’ TAAIs, associated with corresponding genome sizes. The prokaryotic organisms were divided into three groups, showing significantly different TAAIs. Here, 376 prokaryotic genomes were involved in the analysis of correlation between TAAIs and genome sizes. In addition, the GC content also affect the TAAIs along with the increase of genome sizes: GC small < GC median < GC large (P < 2.2e−16).
Figure 3Intra-genome variation and translation factors. (A) TAAI distribution in three model organisms. (B) For human proteins having similar backgrounds (GC%: 0.64–0.66; lengths: 14,300–14,700 bp), they have variable intensities of co-adaption. Amino acid compositions of the top four proteins (Black ones) linearly correlated with the tRNA gene copy numbers, and Amino acid compositions of the bottom four (Red ones) do not change with the growth of the tRNA gene copy numbers, the gene names were listed in the subfigures. (C) Analysis of six model organisms’ abundance shows that highly expressed proteins generally have higher TAAIs than proteins with lower expression levels. Proteins in the high and the low groups have the similar GC content and lengths (The students’ t test: P > 0.05). Their GC contents are: A. thaliana 0.41–0.42; C. elegans 0.34–0.35; S. cerevisiae 0.39–0.41; E. coli 0.50–0.53; B. subtilis 0.50∼0.53; H. sapiens 0.5–0.53. We try to determine the most appropriate GC contents under which more proteins can be included into the analysis. (D) The housekeeping genes and old genes have higher TAAIs than those tissue specific and young genes. These results accord with the results by Ma et al. and Yin et al. that housekeeping and old genes suffered more translation selection.
Analysis of the potentially rapidly expressed proteins according to their functions in E. coli and S. cerevisiae
| Number | Abundance | TAAI | CAI | Number | Abundance | TAAI | CAI | |
|---|---|---|---|---|---|---|---|---|
| Whole genome | 3133 | 319.2 | 0.51 | 0.35 | 6087 | 163.8 | 0.62 | 0.18 |
| Ribosome subunits | 57 | 4724.28 | 0.59 | 0.63 | 178 | 1231.65 | 0.74 | 0.48 |
| Cell division | 19 | 0.57 | 0.35 | 20 | 0.67 | |||
| Two-component system | 58 | 0.57 | 216 | 203.86 | 0.65 | 0.19 | ||
| Mismatch repair | 20 | 0.53 | 19 | 0.72 | ||||
| Sugar metabolism | 47 | 325.52 | 0.52 | 0.41 | 28 | 563.53 | 0.69 | 0.26 |
Proteins in bold face had lower corresponding values than genome’s average values.
Figure 4EGFP expression of original and optimized TAAIs in E. coli. (A) is the upregulated TAAI value resulting from adding a copy of one of the 20 standard tRNAs. The star marks tRNA-Asp, tRNA-Ile and tRNA-Tyr. (B) Confocal micrographs of the control and experimental groups present different fluorescence intensities in the EGFP channel; the corresponding merge figures of the bright-field and EGFP channel are shown in Supplementary Fig. S2. (C) Fluorescence intensities of four nascent sequences with EGFP at 513 nm from 0 to 2.5 h. All of these results showed that in the experimental group transformed with the Asp tRNA gene, there was a lift approximately ten-fold. (D) Western blot results for the nascent sequences. The following histograms show the normalized density of the corresponding lane, and the chemiluminescence intensity of the corresponding target band using stain-free technology (Bio-Rad). The corresponding electrophoretogram, shown in Supplementary Fig. S2B, reflects the loading volume of the total proteins.
Figure 5Comparison of the co-adaption (TAAI) of three domains. The medians for Archaea, Bacteria and Eukarya were 0.37, 0.61 and 0.69. Archaea had relatively low TAAIs, and Eukarya had the highest. Analysis of variance revealed that the three domains were significantly different, with a P value close to zero (P = 2.57e−06).