| Literature DB >> 29389943 |
Fei Qi1, Magdalena Motz2,3, Kirsten Jung2,3, Jürgen Lassak2,3, Dmitrij Frishman1,4.
Abstract
Translation of consecutive prolines causes ribosome stalling, which is alleviated but cannot be fully compensated by the elongation factor P. However, the presence of polyproline motifs in about one third of the E. coli proteins underlines their potential functional importance, which remains largely unexplored. We conducted an evolutionary analysis of polyproline motifs in the proteomes of 43 E. coli strains and found evidence of evolutionary selection against translational stalling, which is especially pronounced in proteins with high translational efficiency. Against the overall trend of polyproline motif loss in evolution, we observed their enrichment in the vicinity of translational start sites, in the inter-domain regions of multi-domain proteins, and downstream of transmembrane helices. Our analysis demonstrates that the time gain caused by ribosome pausing at polyproline motifs might be advantageous in protein regions bracketing domains and transmembrane helices. Polyproline motifs might therefore be crucial for co-translational folding and membrane insertion.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29389943 PMCID: PMC5811046 DOI: 10.1371/journal.pcbi.1005987
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Distribution and conservation of polyproline motifs.
(A) Occurrence of polyproline motifs in E. coli K-12 MG1655 is lower than the random level (fold change 0.80). The histogram shows the numbers of motifs found in 1,000 sets of random sequences, and the blue line shows the number of motifs found in real sequences. (B) Numbers of polyproline motifs negatively correlate with the strength of the ribosome stalling effect in E. coli K-12 MG1655. The differences are significant according to Mann-Whitney-Wilcoxon test. (C) Occurrence of polyproline motifs in the core proteome of E. coli K-12 MG1655 is lower than that in the accessory proteome. The differences are significant according to Mann-Whitney-Wilcoxon test. (D) In the core proteome more aligned regions have a negative PSEC (chi-squared test) while in the accessory proteome PSEC values display no strong preference.
Fig 2Correlation between translation efficiency, protein abundance and frequency of polyproline motifs.
(A) Proteins with high translation efficiency tend to have fewer polyproline motifs (Spearman's rho = -0.105, p-value = 1.13e-5). (B) High abundance proteins tend to have fewer motifs (Spearman's rho = -0.135, p-value = 1.47e-8).
Fig 3Functional role of polyproline motifs.
(A) Occurrence of polyproline motifs in the first 50 residues is higher than elsewhere in the protein sequence (Mann-Whitney-Wilcoxon test, p-value < 2.2e-16; fold change 0.94 vs 0.78). Error bars indicate the standard deviation. (B) Occurrence of polyproline motifs is associated with domain boundaries. Regions with relatively high motif occurrence are marked red. Data are smoothed over a three-residue window. Left: frequency of motifs relative to domain start (dashed line). Right: frequency of motifs relative to domain end (dashed line). The enrichment of motifs in these two regions is significant (p-values < 0.05; fold changes 1.19 and 1.23). (C) Frequency of polyproline motifs relative to the start position of TMH. TMH is marked green (assuming the typical length of 21 residues). Regions with high motif frequency are marked red. Data are smoothed over a three-residue window. (D) Schematic illustration of the site III location relative to TMH and the non-transmembrane region. In protein A site III of TMH1 locates in the TMH2 while in protein B site III of TMH1 is in the non-transmembrane region.