Literature DB >> 24561808

Deciphering the rules by which dynamics of mRNA secondary structure affect translation efficiency in Saccharomyces cerevisiae.

Yuanhui Mao¹, HuiLing Liu, Yanlin Liu, Shiheng Tao.

Abstract

Messenger RNA (mRNA) secondary structure decreases the elongation rate, as ribosomes must unwind every structure they encounter during translation. Therefore, the strength of mRNA secondary structure is assumed to be reduced in highly translated mRNAs. However, previous studies in vitro reported a positive correlation between mRNA folding strength and protein abundance. The counterintuitive finding suggests that mRNA secondary structure affects translation efficiency in an undetermined manner. Here, we analyzed the folding behavior of mRNA during translation and its effect on translation efficiency. We simulated translation process based on a novel computational model, taking into account the interactions among ribosomes, codon usage and mRNA secondary structures. We showed that mRNA secondary structure shortens ribosomal distance through the dynamics of folding strength. Notably, when adjacent ribosomes are close, mRNA secondary structures between them disappear, and codon usage determines the elongation rate. More importantly, our results showed that the combined effect of mRNA secondary structure and codon usage in highly translated mRNAs causes a short ribosomal distance in structural regions, which in turn eliminates the structures during translation, leading to a high elongation rate. Together, these findings reveal how the dynamics of mRNA secondary structure coupling with codon usage affect translation efficiency.

Entities: Chemical Disease Mutation Species

Mesh：

Substances：
Codon
RNA, Messenger

Year: 2014 PMID： 24561808 PMCID： PMC4005662 DOI： 10.1093/nar/gku159

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

Current understanding of the ribosome and the mechanism of translation has been significantly strengthened and expanded by recent research efforts (1–4). Gene translation is a highly regulated process with intricate interactions among messenger RNAs (mRNAs), ribosomes and mRNA-binding factors, leading to changes in protein abundance, thus enabling the cell to respond rapidly to a wide range of environmental conditions (5–10). Although the underlying factors have been analyzed extensively, the determinants of translation efficiency are still under debate (11–17). The central dogma of molecular biology deals with the flow of genetic information from DNA via mRNA to protein. It is well understood how an mRNA is sequentially translated into amino acid chains. However, the correlation between mRNA level and protein abundance varies across species [R2 = ∼0.1–0.7, reviewed in (11) and (12)], suggesting widespread regulations occurring at translational and post-translational levels. Translation consists of three stages, namely, initiation, elongation and termination. The initiation has been regarded as rate-limiting for translation in general (18,19). Yet this seems to be uncertain for some genes. For instance, genes with high initiation rate might be subject to non-optimal codon usage or bad codon order downstream, resulting in ribosomal traffic jam, thus significantly decreasing translation efficiency (15,20–23). Moreover, numerous studies have shown that mRNA tends to fold into local secondary structures (so-called mRNA secondary structures) (24–30). These structures are unevenly distributed within mRNA (30) and can act as roadblocks that might influence the rhythm of protein synthesis. During translation, ribosomes move along mRNA and pause at paired sites until the base pairings are broken (31,32). Although it is known that mRNA secondary structure decreases elongation rate (28,31,33,34), a surprising finding in vitro showed a strong positive correlation between the strength of mRNA secondary structure and protein abundance in yeast (17,35). The counterintuitive finding indicates an intricate interaction between translating ribosomes and mRNA secondary structures, affecting translation efficiency in an undetermined manner. Local secondary structures are distributed densely within mRNA. The distance between adjacent structures in vitro (6.8 nt on average, Figure 1A) is significantly shorter than 28 nt [the length of the footprint that a ribosome protects its mRNA from nuclease digestion (36)]. This suggests that most of the translating ribosomes are located in structural regions and thereby undergo mRNA structural dynamics (folded and unfolded) (Figure 1B). On one hand, mRNA secondary structure blocks ribosomal migration (31,32,37). On the other hand, translating ribosomes constrain mRNA folding. As a ribosome moves along a structural region, mRNA secondary structures disappear, and they may appear after the ribosome passes through the region. The re-folded structures hold until the next ribosome arrives at this region and re-unwinds it. Previous studies (21,38–41) proposed a series of stochastic models to explore the determinant of translation efficiency. mRNA secondary structure was usually integrated into their models (15,21). However, these models used fixed mRNA secondary structures estimated in vitro and did not consider the dynamics of structure during translation. To date, the mechanism by which the dynamics of mRNA secondary structure affect translation efficiency at the genome-wide scale remains unclear.

Figure 1.

A schematic illustration of the interaction between mRNA secondary structure and translating ribosomes. (A) The distance between adjacent secondary structures within mRNA. mRNA secondary structures in yeast were obtained from the work of Kertesz et al. (27). (B) Structural dynamics when a ribosome moves along mRNA. pre-mF refers to the folding of mRNA without ribosome binding (see text). With this background, we hypothesized that mRNA secondary structures exert their regulatory effect on translation efficiency through the interactions among translating ribosomes, codon usage and mRNA secondary structures. To test this hypothesis, in the current study, we performed a genome-wide analysis of the effect of mRNA secondary structure on translation efficiency based on a novel computational model, taking into account the dynamics of mRNA secondary structure during elongation. In our model, in contrast to mRNA folding strength (mF strength) during translation, the folding strength of mRNA without ribosome binding was termed previous mF strength (pre-mF strength). Using the model, we first analyzed the association between ribosomal distance and structural dynamics. Then, we revealed a general mechanism by which dynamics of mRNA secondary structure, coupling with other sequence features, likely regulate the organism’s translation efficiency.

MATERIALS AND METHODS

Data collection

Coding sequence

Coding sequences (CDSs) of Saccharomyces cerevisiae S288C were downloaded from the National Center for Biotechnology Information FTP server. Considering that prediction for secondary structures of long sequences is very resource-demanding, we excluded the CDSs longer than 2000 nt. In total, 5369 sequences were obtained.

Experimentally determined mRNA secondary structures

The data were downloaded from the study of Kertesz et al. (27), which provided parallel analysis of RNA structure (PARS) scores at all sites and PARS-assisted secondary structures of 3002 CDSs (2534 CDSs shorter than 2000 nt). PARS score measures the probability of a nucleotide to be in double-stranded conformation, which is significantly correlated with minimum folding free energy (Supplementary Figure S1).

Protein abundance

Protein abundance data were obtained from the PaxDb (42). In total, 2974 data on S. cerevisiae were used when analyzing the correlation between PARS score and protein abundance.

Transfer RNA gene copy numbers

Transfer RNA (tRNA) gene copy numbers of S. cerevisiae were downloaded from the Genomic tRNA Database (43).

Calculation of pre-mF strength and mF strength

pre-mF strength was defined as the mean predicted base pairing probability (PP) of CDS. PPs of CDS were predicted by RNAfold in Vienna RNA package (44) using default parameters. During translation, mF strength was defined as the mean PP of CDS with ribosomal constraints (see following text).

The relationship between ribosomal distance and mF strength

We investigated the variation pattern of mF strength against ribosomal distance. First, we randomly assigned ribosomes on CDS. The number of ribosomes on each CDS was obtained by dividing sequence length by the mean ribosomal distance [154 nt on average (45)]. Second, we used RNAfold with the parameter −C to predict the PPs of CDS under ribosomal constraints (46). Note that the bases near a ribosome cannot be paired with others due to the constraints of spatial structure of the ribosome. Therefore, in our analysis, the region constrained by a ribosome consists of three subregions: the region covered by the ribosome (28 nt) and two flanking regions (14 nt in all). The length of constrained region was set to 42 nt. Third, for each region between adjacent ribosomes, we calculated the mean PP difference () by Equation (1). where refers to the predicted PP at site i of the region when ribosomal constraints on CDS exist. refers to the predicted PP without constraint. is the length of the region. To exclude the effect of ribosomal positions that we randomly assigned, the aforementioned processes were repeated five times, and the mean value of given a ribosomal distance was obtained by averaging the values of with same distance. In addition, all mRNAs were divided into five groups based on their pre-mF strength from high to low (G1–G5), which was used to test whether the pattern we estimated is sequence-specific.

Calculation of tRNA adaptation index

The tRNA adaptation index (tAI) of CDS was calculated by Equation (2). where is the relative adaptiveness value of codon k, the values for all 64 codons were calculated according to the work of dos Reis et al. (47), and is the length of CDS. In addition, we refer to the of a single codon as the codon’s tAI.

Simulation of translation process by taking into account the interactions among translating ribosomes, codon usage and mRNA secondary structures

Model

Inspired by previous studies reporting a significant positive correlation between the mean PARS score of CDS and protein abundance (Supplementary Figure S2) (17,35), we decided to investigate how mRNA secondary structure exerts a positive effect on translation efficiency, as there are numerous lines of evidence in vitro revealing that mRNA secondary structure decreases elongation rate. To this end, we developed a novel computational model (Figure 2) to simulate the process of translation. Translation process is divided into three phases: initiation, elongation and termination. In our model, ribosomes arrive at the start site with initiation rate . At the last codon, ribosomes detach and release proteins with termination rate . During elongation, translation cycle consists of two steps. The first step is that cognate tRNA arrives at ribosome A site (codon i) with rate , and that ribosome unwinds base pairings located at codon i + L/2 (L is set to 42 nt) with rate simultaneously. The second step is translocation. Translocation rate is fast and codon-independent, thus a constant rate was used. Therefore, the rate that a ribosome moves from the current codon to the next codon is determined by and (see the following subsection for details). Moreover, the ribosome cannot translocate if the next codon is occupied by the former ribosome. In addition, the ribosomes can capture a cognate tRNA when they are waiting for the next codon to become vacant.

Figure 2.

The model. In our model, ribosomes arrive at translation start site with rate , and release proteins with rate . During elongation, translating ribosomes wait for their cognate tRNAs at position i and simultaneously unwind the base pairings located at position i + L/2 (L = 42 nt). (A) shows the structure that the first ribosome encounters (the first ribosome is blue). (B) shows the structure that the third ribosome encounters. mRNA secondary structure is weakened due to the constraints of ribosomes. In our model, we assume that different ribosomes might encounter the structure with different folding strength at the same site because the pattern of mRNA folding might be changed when ribosomes have bound to mRNA (see text for details). Importantly, in contrast to other models using fixed mRNA secondary structures, our model considers dynamic structures during translation, which means that different ribosomes might be subject to different folding strength at the same codon (Figure 2). This assumption is based on our finding that the folding strength of the region between adjacent ribosomes is strongly dependent on ribosomal distance (Figure 3). Moreover, we focus on the folding behavior of mRNA secondary structure during elongation and its effect on ribosomal distance. Other factors affecting ribosomal distance were controlled. Therefore, a relative rate instead of an absolute rate was used in the model. We set an arbitrary value for . To obtain the relative rate of elongation, and were multiplied by the weights and , respectively (see following text for details). Unless otherwise indicated, we used the values listed in Table 1 for all simulations.

Figure 3.

Variation of mF strength against ribosomal distance. All mRNAs were divided into five groups (G1–G5) based on their pre-mF strength from high to low. (A) There is no significant difference in (see ‘Materials and Methods’ section for the calculation of ) among the five groups when ribosomal distance is longer than 5 nt. (B) When the distance is shorter than 5 nt, there is no structure between adjacent ribosomes, is determined by pre-mF strength.

Table 1.

Parameters used in simulation

Parameter	Value	Descriptions
	8	Initiation rate (s).
	0.1	Termination rate (s).
	0.5	The weight for .
	1.0	The weight for .
	0.12/Various	Dwell time at a codon caused by codon usage (s). When investigating the function of mRNA secondary structure without considering the effect of codon usage, an equal rate for all codons is used (0.12). When investigating the combined effect of mRNA secondary structure and codon usage, the value of depends on codon’s tAI (various).
	Various	Dwell time at a codon caused by mRNA secondary structure (s). Various means that the value depends on the PP of the codon.
	42	The length of the region constrained by a ribosome (nt).
	Constant	Translocation time (s). Translocation time is equal to the running time that a ribosome moves from the current codon to the next.

Parameters required for simulation

We used the multiple threads in Perl 5.12 (Supplementary files: Simulation.pl) to simulate translation process (Figure 2 and Supplementary Figure S3). The parameters required for simulation are: Initiation rate (): Although the initiation rate for individual mRNA has been estimated by previous studies based on ribosomal density or codon usage (39,48), we set an equal rate for all mRNAs to exclude the effect of initiation rate on ribosomal density. Termination rate (): Termination process is assumed to be fast compared with other processes (e.g. elongation) (49). Therefore, the effect of termination was neglected. A constant rate (0.1 s) was used for all mRNAs. Elongation rate (): Elongation rate is determined by codon usage and mRNA secondary structure. The dwell time at a codon caused by codon usage () was calculated as follows. First, the relative adaptiveness values of 64 codons (, Supplementary Table S1) were calculated according to the work of dos Reis et al. (47). Then, of each codon was obtained by Equation (3). where is the adaptiveness value of codon k. In particular, of the codon CGA is very large (Supplementary Table S1) compared with others. Therefore, this codon was excluded when we calculated of each codon. of this codon was set to be equal to the maximum value of (1.0, Supplementary Table S1). In addition, when investigating the functions of mRNA secondary structure without considering the effect of codon usage, the values of for all codons are equal and set to 0.12 (Table 1). It is not feasible to estimate the dwell time at a codon caused by mRNA secondary structure (), we thus introduced a simplification in our analysis. Note that there are many identical mRNAs expressed at the same time in the same cell. PP at codon i (averaging PPs of the three sites of codon i) can be treated as the proportion of identical mRNAs whose sites at codon i are paired. Therefore, higher PP at codon i indicates that ribosomes unwind base pairings of this codon with a lower rate on average (averaging rates at codon i of all identical mRNAs). With this simplification, we assume that of a codon is equal to the PP of the codon. Here, we did not calculate the PP of a codon directly during translation (because this requires massive computational effort), but set it to be equal to the mean PP of the codons present in the region between adjacent ribosomes, which was estimated based on the fitting of ribosomal distance and (Figure 3). Moreover, we showed that the variation of mF strength against ribosomal distance is not position-specific (Supplementary Figure S4) and sequence-specific (Figure 3). Therefore, at different codons in different mRNAs was estimated based on the same pattern shown in Figure 3 (also listed in Supplementary Table S2). In addition, if ribosomal distance is longer than 200 nt, we assume that the two ribosomes cannot hinder the folding of the region between them. In this case, at a codon is equal to the PP of the codon without ribosomal constraint. In summary, was calculated by Equation (4). where indicates that PP was calculated based on ribosomal distance. refers to the predicted PP without ribosomal constraint. Together, the elongation rate () at a codon is determined by Equation (5). where and are the weights for and , respectively. Note that unwinding base pairings and waiting for tRNA occur at two different codons (Figure 2). The distance of the two codons was set to 21 nt (L/2) in our analysis (32). Translocation rate (): During simulation, translocation rate was set to be equal to the running time that a ribosome moves from the current codon to the next codon. The time is very short (Supplementary Figure S5) and thus treated as ‘constant’.

Estimation of parameters

For each mRNA, we simulated translation processes of 60 ribosomes. Generally, translation arrives at the steady state (the number of ribosomes on mRNA remains unchanged, Supplementary Figure S6) when the 10th ribosome detaches. We estimated parameters by averaging the values of 11 ribosomes (from the 30th to 40th ribosomes, Supplementary files: Para.pl and Pause.pl). The parameters are number of ribosomes per mRNA, distance between adjacent ribosomes, mean dwell time at each codon (averaging the values at the same position of mRNAs), the time that a ribosome completes a translation cycle (translation time), mean elapsed time after initiation (elongation time, averaging the values at the same position of mRNAs), pause sites and percentage of collision sites per mRNA and flag at each codon (averaging the values at the same position of mRNAs). Here, we used a flag to record whether mRNA secondary structure is used during elongation. If mRNA secondary structure determines the rate at a codon (, the flag at this codon is set to 1, and if the determinant is codon usage (, the flag is set to 2.

Calculation of the correlation

All correlations reported in this study are Spearman’s rank correlations.

RESULTS

Regulation on ribosomal distance through the variation of mF strength

There is a significant positive correlation between pre-mF strength and ribosomal density when controlling for sequence length [ρ(PP, ribosomal density | sequence length) = 0.10, P = 4.8 × 10−9, Supplementary Figure S7], indicating that mRNA secondary structure can shorten the distance of the ribosomes in structural regions. During translation, ∼30% of coding regions are covered by ribosomes (45,49). The pattern of mRNA folding during translation is significantly different from that without ribosome binding (50–52) (Supplementary Figure S8). To date, the way in which pre-mF strength affects ribosomal distance during translation remains elusive. To address this question, we simulated the translation processes of CDSs by setting an equal initiation rate. An equal rate was used to exclude the effect of initiation rate on ribosomal density (22), which notably influences ribosomal distance (high ribosomal density is sufficient but not necessary for short ribosomal distance). Moreover, to exclude the effect of codon usage, the rates for all 64 codons were set to be equal. The main parameters required for simulation are listed in Table 1. We calculated the number of ribosomes on mRNA and the distance between adjacent ribosomes when translation reaches the steady state. Consistent with previous studies based on the data of ribosomal profiling (15), when considering structural dynamics during translation, we also found a positive correlation between the number of ribosomes per mRNA and pre-mF strength (ρ = 0.41, P < 2.2 × 10−16, Figure 4A). More importantly, there is a strong negative correlation between pre-mF strength and ribosomal distance (ρ = −0.81, P < 2.2 × 10−16, Figure 4B). We validated the correlations by setting different weights for . Similar results were obtained (Supplementary Figure S9A and B). Moreover, the results were not changed significantly when we used different initiation rates (Supplementary Figure S10A and B).

Figure 4.

mRNA secondary structure shortens ribosomal distance. (A) Correlation between the number of ribosomes and pre-mF strength. (B) Correlation between ribosomal distance and pre-mF strength. (C) Mean dwell time at each codon. R1 and R3 indicate the mean dwell time (averaging the values of all mRNAs) of the first and third ribosomes at each codon, respectively. R3 shows translation pattern of the ribosomes when translation does not reach steady state. R_mean indicates the mean dwell time at each codon during steady state. (D) Ribosomal distance decreases over time. (E) The variation of the percentage of mRNAs with ribosomal collisions against elongation time during steady state. For details, we analyzed the variations of dwell time and ribosomal distance along mRNA. We found that the first ribosome (R1 in Figure 4C) on mRNA has longer dwell time than following ribosomes (Figure 4C), suggesting that following ribosomes are subject to a lower mF strength when the first ribosome has bound to mRNA. A lower mean mF strength (i.e. a lower value for ) between adjacent ribosomes causes a higher movement rate of the latter ribosome compared with the rate that the former ribosome passes through this region, thereby leading to a decrease in ribosomal distance, which further reduces the mean mF strength of the region between the two ribosomes (Figure 3). The process continues until the following ribosome collides with its former one. When only considering the effect of mRNA secondary structure, we found that ribosomal distance decreases over time (Figure 4D). Notably, the percentage of mRNAs with ribosomal collisions increases over time (Figure 4E). Again, we considered different weights for and different initiation rates. We found that the increase in the weight for (corresponding to increase mean mF strength) or in the initiation rate significantly decreases ribosomal distance (Supplementary Figures S9C and S10C) and increases the probability of ribosomal collisions (Supplementary Figures S9D and S10D). Taken together, our results revealed that mRNA secondary structure shortens ribosomal distance through the variation of mF strength during translation.

The effect of the location of mRNA secondary structure

Based on our model, the simulation of translation process revealed how pre-mF strength affects ribosomal distance. To investigate the effect of the location of mRNA secondary structure, we ran a simulation on an artificial sequence of 600 nt. Because the effect of codon usage was not considered, the codons of the sequence were generated randomly based on their genomic frequencies. A sliding window with a length of 100 nt and a step of 3 nt was used to specify the structural region (Figure 5A), where PPs at all sites were set to 0.5 (other PPs were also tested, Supplementary Figure S11). We applied our model to this sequence with = 1.0 (a higher weight for was used to make the pattern more obvious). The other parameters are listed in Table 1. In addition, we ran a simulation with = 0 to obtain the mean ribosomal distance when no structure exists within the sequence, which was used as control. We found that mean ribosomal distance varies significantly when changing the position of mRNA secondary structure. Longer ribosomal distance was observed when the structure is in 5′ regions. In particular, the distance is longer than the control if the structure is located at the head of the sequence, which supports the hypothesis that mRNA secondary structure at the beginning of CDS decreases the probability of ribosomal traffic jams (21). The mean ribosomal distance decreases notably when the structure moves toward the tail of the sequence (Figure 5B). Similar results were obtained when different PPs in structural region were used (Supplementary Figure S11). The results based on the artificial sequence raise the possibility that increased pre-mF strength tends to emerge at the end of sequence to meet the requirement for a high level of ribosomal density. When analyzing the association between the experimentally determined data of ribosomal density (36) and mean PARS score, we found a significant difference in pre-mF strength at the end of CDS (Wilcoxon test, P = 0.02) between the two groups with ribosomal densities at the top and bottom 30% (Figure 5C).

Figure 5.

Effect of mRNA secondary structure at different positions. (A) The sliding structural region on an artificial sequence. The paired sites are indicated by gray lattices (each lattice represents a site). The PP of paired site was set to 0.5. (B) The variation of ribosomal distance during steady state when mRNA secondary structure is located in different regions. The distance without considering the effect of secondary structure is indicated by dashes. (C) Comparison of normalized PARS scores at the end of CDSs between the two groups with ribosomal densities at the top and bottom 30%. PARS score at each site was normalized by the mean score of CDS. RD, ribosomal density.

The effect of codon usage

When only considering the effect of mRNA secondary structure, as reported earlier, we found that mRNA secondary structure can cause a high frequency of ribosomal collisions if translation time is long enough (Figure 4). The collision should be avoided because ribosomal traffic jam might lead to premature termination of translation or prolonged elongation time, increasing the translational cost. When simulating translation process using the parameters in Table 1 ( = 0.12), we found that the mean value of the flag (see ‘Materials and Methods’ section) increases obviously with the decrease in ribosomal distance (Figure 6A), suggesting that codon usage becomes the determinant of elongation rate when ribosomal distance is short (Figure 6A). This result raises the possibility that the effect of codon usage can prevent the collisions caused by mRNA secondary structures.

Figure 6.

Effect of codon usage. (A) The variation of the mean value of flag against ribosomal distance. The mean value of flag increases sharply with the decrease in ribosomal distance when the distance is <30 nt. If the value is <1.5 (below the red dashes), mRNA secondary structure determines elongation rate. If the value is >1.5, indicating that most of mRNA secondary structures disappear, codon usage determines elongation rate. (B) All mRNAs were classified into five groups (G1–G5) based on their pre-mF strength from high to low. For each group, mRNAs were classified into five subgroups (T1–T5) based on their tAI from low to high (see text). To test this possibility, we simulated translation process by considering the effect of codon usage ( = various, Table 1). Moreover, we classified all mRNAs into five groups (G1–G5) according to their pre-mF strength from high to low. In this case, mRNAs in the same group have similar pre-mF strength. For each group, mRNAs were also divided into five subgroups (T1–T5) based on their tAI from low to high (∼320 mRNAs in each subgroup). We found that the percentage of collision sites per mRNA increases with the increase in tAI (from T1 to T4 and Figure 6B), suggesting that low tAI codons can decrease the probability of the ribosomal collisions caused by mRNA secondary structures. We reasoned that when two adjacent ribosomes are close enough, mRNA secondary structures between them disappear, and codon usage determines the elongation rate (Figure 6A). Unlike mRNA secondary structure, codon usage does not significantly decrease ribosomal distance (Supplementary Figure S12). Besides, the lower the tAI, the higher the probability that codon usage determines the rate [Equation (5)]. Therefore, the probability of the collisions caused by mRNA secondary structures is lower in those mRNAs with lower tAI. Moreover, in all groups, we found that mRNAs in T5 have a lower probability of collisions compared with those in T4. A possible reason is as follows. If the tAI of an mRNA is high (T5 in Figure 6B), the dwell time caused by codon usage is low and ribosomes pass through non-structural regions rapidly on average (averaging the rates in all non-structural regions of mRNA). In this case, the effect of codon usage can partly counteract the negative effect (decreasing elongation rate) of mRNA secondary structure, thus mRNAs in T5 have a higher elongation rate compared with those in T4, leading to a lower ribosomal density and hence lower probability of ribosomal collisions (Figure 6B).

The way in which mRNA secondary structure affects translation efficiency

mRNA is a single-strand molecule with a strong potential to fold back on itself if no ribosome binds to it. Although mRNA secondary structure slows down translation elongation, a considerable number of mRNA secondary structures are kept in highly expressed mRNAs during evolution (24,27,29,30), suggesting an important role of mRNA secondary structure during translation. The results in previous subsections raise a hypothesis (Figure 7A) describing how mRNA secondary structure affects translation efficiency through structural dynamics. We propose that high pre-mF strength leads to a short ribosomal distance (Figure 4), which in turn significantly decreases the mF strength during translation. In particular, mRNA secondary structures disappear if translating ribosomes are close enough. In this case, codon usage becomes the determinant of elongation rate, and the negative effect of mRNA secondary structure (decreasing elongation rate) is negligible. In contrast, ribosomes on the mRNAs with lower pre-mF strength are more distant, and hence, a higher fraction of mRNA regions can fold into secondary structures during translation. The effect of mRNA secondary structure cannot be neglected. Therefore, higher translation efficiency is observed in the mRNAs with higher pre-mF strength compared with that in the mRNAs with lower pre-mF strength.

Figure 7.

Correlation between pre-mF strength and translation time. (A) A schematic illustration of the effect of mRNA secondary structure on translation efficiency. We propose that high pre-mF strength leads to a short ribosomal distance, which in turn eliminates the effect of mRNA secondary structure during translation (see descriptions in the ‘Results’ and ‘Discussion’ sections). (B) Mean ribosomal distance at different initiation rates. (C) Correlation between pre-mF strength and the mean value of flag (averaging the value of flag at all sites of mRNA). A positive correlation is observed when ribosomes are close (corresponding initiation rate = 2, 3, 4 or 5 s), suggesting that high pre-mF strength leads to a less use of mRNA secondary structure during elongation. (D) Correlation between pre-mF strength and translation time/sequence length. A negative correlation indicates that high pre-mF strength leads to high translation efficiency. We tested this hypothesis by running a series of simulations with = 2.0 (a higher weight for was used to decrease the probability of the collisions caused by mRNA secondary structures) and different initiation rates. was set to 0.12. We used a flag to estimate the usage of structures during elongation (see ‘Materials and Methods’ section for details). We found that there is a negative correlation between the value of flag and pre-mF strength when ribosomal distance is long (corresponding to a low initiation rate: 8, 7 or 6 s. ρ < −0.4, P < 10−15, Figure 7B and C). Surprisingly, the correlation becomes positive when ribosomes are close (corresponding to a high initiation rate: 2, 3, 4 or 5 s. ρ > 0.1, P < 10−15, Figure 7B and C), suggesting that high pre-mF strength leads to a less use of mRNA secondary structure during elongation when ribosomal distance is short. Moreover, we calculated the correlation between translation time and pre-mF strength. Consistent with our hypothesis, we found a positive correlation between translation time and pre-mF strength when initiation rate is low (Figure 7D), whereas the correlation becomes negative when initiation rate = 2 or 3 s. Overall, the results support our hypothesis that mRNAs with high pre-mF strength have high elongation rates because high pre-mF significantly shortens ribosomal distance, which in turn eliminates mRNA secondary structures during translation.

DISCUSSION

mRNA accommodates numerous regulatory signals delineated along the protein coding regions in an intricate overlapping manner (22,53). These signals, such as codon usage bias and mRNA secondary structure, are all known to modulate protein synthesis. Although the functions of the individual signal have been investigated extensively by previous studies (7,23,32,38,39,54–57), how these signals combine to affect translation efficiency remains elusive. In the current study, we used a computational model to investigate the effect of mRNA secondary structure during translation. Our model takes into consideration the interactions among translating ribosomes, codon usage and mRNA secondary structures. In particular, for the first time, our model allows us to analyze the structural dynamics during translation. The simulation of translation based on our model revealed that mRNA secondary structure shortens ribosomal distance through the variation of mF strength. Moreover, we found that low tAI codons can decrease the probability of the ribosomal collisions caused by mRNA secondary structures when ribosomal distance is short. Based on these results, we explained how structural dynamics affect translation efficiency. Why is high pre-mF strength favored? mRNA secondary structure blocks the migration of ribosomes, thereby decreasing elongation rate. For highly expressed mRNAs, mRNA secondary structures should be eliminated to meet the requirement for a high level of elongation rate. However, removing all non-functional structures by natural selection is difficult. First, because there is a tendency to increase the frequency of the codons with high translation efficiency (so-called optimal codons) in highly translated mRNAs. Increase in optimal codon usage leads to an increase in pre-mF strength (Supplementary Figure S13). Second, mRNA is a single-strand molecule with a strong tendency to fold back on itself, making it impossible to prevent the mRNA from folding into secondary structures. Because removing all non-functional structures is difficult, another strategy, increasing pre-mF strength, is used to eliminate mRNA secondary structures during translation. Together, these findings explain why highly expressed mRNAs have undergone stronger natural selection for high pre-mF strength than infrequently expressed mRNAs (35). There are many factors affecting ribosomal distance. In particular, increase in the initiation rate or the frequency of non-optimal codons greatly increases ribosomal density, thus significantly decreasing ribosomal distance. Why is high pre-mF strength required for a short ribosomal distance? The possible reasons are as follows. On one hand, the availability of free ribosomes is limited, making it difficult to significantly increase initiation rate (41). On the other hand, high initiation rate is not sufficient for short ribosomal distance. A lower elongation rate is also required [reviewed in ref (22)]. In addition, although both non-optimal codon and mRNA secondary structure can decrease ribosomal distance, non-optimal codon will decrease translation accuracy and efficiency. Importantly, non-optimal codon affects all translating ribosomes, while mRNA secondary structure usually decreases the rate of the first several ribosomes. mRNA secondary structures disappear when ribosomes are close (Supplementary Figure S14). Although our model enables us to investigate the effect of structural dynamics during elongation, there are many simplifying assumptions. For instance, to develop the translation model, we first investigated the relationship between ribosomal distance and mF strength by predicting mF strength under the constraints of ribosomes that were assigned on mRNAs randomly. Note that the predicted PPs during translation for single mRNA might be different from the PPs in vivo. To decrease the estimation error of given a specific ribosomal distance, we calculated the mean value of (see ‘Materials and Methods’ section) by averaging the values with same ribosomal distance. In addition, the length of the region constrained by a ribosome during translation is unclear. In reality, the true length might be longer than 42 nt that we used in our analysis (58). Therefore, the ribosomal distance, which efficiently eliminates mRNA secondary structures during translation, might be reached more easily. Our model also did not distinguish between functional structures and other structures. Functional structures refer to those involved in co-translational regulations, such as co-translational folding of proteins and ribosomal frameshifts. Generally, functional structures are necessary for translation, thus they will not disappear during translation. In our model, it is difficult to retain the folded conformations for functional structures due to the constraints of ribosomes, especially for those in highly translated mRNAs. Therefore, it is worth investigating how these functional structures are maintained during elongation, probably involving the interaction between codon usage bias and mRNA secondary structures (e.g. a cluster of rare codon is assigned before a functional structure).

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

Funding for open access charge: National Natural Science Fund Program [31271917]. Conflict of interest statement. None declared.

58 in total

Review 1. Ribosome structure and the mechanism of translation.

Authors: V Ramakrishnan
Journal: Cell Date: 2002-02-22 Impact factor: 41.582

2. Genome-wide analysis of mRNA translation profiles in Saccharomyces cerevisiae.

Authors: Yoav Arava; Yulei Wang; John D Storey; Chih Long Liu; Patrick O Brown; Daniel Herschlag
Journal: Proc Natl Acad Sci U S A Date: 2003-03-26 Impact factor: 11.205

3. Solving the riddle of codon usage preferences: a test for translational selection.

Authors: Mario dos Reis; Renos Savva; Lorenz Wernisch
Journal: Nucleic Acids Res Date: 2004-09-24 Impact factor: 16.971

4. Rate-limiting steps in RNA chain initiation.

Authors: W R McClure
Journal: Proc Natl Acad Sci U S A Date: 1980-10 Impact factor: 11.205

5. Determination of in vivo RNA structure in low-abundance transcripts.

Authors: Chun Kit Kwok; Yiliang Ding; Yin Tang; Sarah M Assmann; Philip C Bevilacqua
Journal: Nat Commun Date: 2013 Impact factor: 14.919

6. Co-crystal structure of a T-box riboswitch stem I domain in complex with its cognate tRNA.

Authors: Jinwei Zhang; Adrian R Ferré-D'Amaré
Journal: Nature Date: 2013-07-28 Impact factor: 49.962

7. Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure.

Authors: David H Mathews; Matthew D Disney; Jessica L Childs; Susan J Schroeder; Michael Zuker; Douglas H Turner
Journal: Proc Natl Acad Sci U S A Date: 2004-05-03 Impact factor: 11.205

8. Widespread selection for local RNA secondary structure in coding regions of bacterial genes.

Authors: Luba Katz; Christopher B Burge
Journal: Genome Res Date: 2003-09 Impact factor: 9.043

9. Genome-wide probing of RNA structure reveals active unfolding of mRNA structures in vivo.

Authors: Silvi Rouskin; Meghan Zubradt; Stefan Washietl; Manolis Kellis; Jonathan S Weissman
Journal: Nature Date: 2013-12-15 Impact factor: 49.962

Review 10. Comparing protein abundance and mRNA expression levels on a genomic scale.

Authors: Dov Greenbaum; Christopher Colangelo; Kenneth Williams; Mark Gerstein
Journal: Genome Biol Date: 2003-08-29 Impact factor: 13.583

33 in total

1. Synonymous variants that disrupt messenger RNA structure are significantly constrained in the human population.

Authors: Jeffrey B S Gaither; Grant E Lammi; James L Li; David M Gordon; Harkness C Kuck; Benjamin J Kelly; James R Fitch; Peter White
Journal: Gigascience Date: 2021-04-05 Impact factor: 6.524

Review 2. Synonymous codons, ribosome speed, and eukaryotic gene expression regulation.

Authors: Daniel Tarrant; Tobias von der Haar
Journal: Cell Mol Life Sci Date: 2014-07-20 Impact factor: 9.261

3. Design of Adjacent Transcriptional Regions to Tune Gene Expression and Facilitate Circuit Construction.

Authors: Fuqing Wu; Qi Zhang; Xiao Wang
Journal: Cell Syst Date: 2018-02-07 Impact factor: 10.304

4. RPLP1 and RPLP2 Are Essential Flavivirus Host Factors That Promote Early Viral Protein Accumulation.

Authors: Rafael K Campos; Benjamin Wong; Xuping Xie; Yi-Fan Lu; Pei-Yong Shi; Julien Pompon; Mariano A Garcia-Blanco; Shelton S Bradrick
Journal: J Virol Date: 2017-01-31 Impact factor: 5.103

5. The KLF14 Variant is Associated with Type 2 Diabetes and HbA_1C Level.

Authors: Ensieh Shahvazian; Mohammad Bagher Mahmoudi; Ehsan Farashahi Yazd; Saba Gharibi; Bahram Moghimi; Pouria HosseinNia; Masoud Mirzaei
Journal: Biochem Genet Date: 2021-01-03 Impact factor: 1.890

6. Predictive biophysical modeling and understanding of the dynamics of mRNA translation and its evolution.

Authors: Hadas Zur; Tamir Tuller
Journal: Nucleic Acids Res Date: 2016-09-02 Impact factor: 16.971

7. A transformation clustering algorithm and its application in polyribosomes structural profiling.

Authors: Wenhong Jiang; Jonathan Wagner; Wenjing Du; Juergen Plitzko; Wolfgang Baumeister; Florian Beck; Qiang Guo
Journal: Nucleic Acids Res Date: 2022-09-09 Impact factor: 19.160

Review 8. RNA Droplets.

Authors: Kevin Rhine; Velinda Vidaurre; Sua Myong
Journal: Annu Rev Biophys Date: 2020-02-10 Impact factor: 12.981

9. A Single Point Mutation, Asn₁₆→Lys, Dictates the Temperature-Sensitivity of the Reovirus tsG453 Mutant.

Authors: Kathleen K M Glover; Danica M Sutherland; Terence S Dermody; Kevin M Coombs
Journal: Viruses Date: 2021-02-12 Impact factor: 5.048

10. The RNA structurome in the asexual blood stages of malaria pathogen plasmodium falciparum.

Authors: Diana Renteria Alvarez; Alejandra Ospina; Tiffany Barwell; Bo Zheng; Abhishek Dey; Chong Li; Shrabani Basu; Xinghua Shi; Sabah Kadri; Kausik Chakrabarti
Journal: RNA Biol Date: 2021-06-23 Impact factor: 4.766