| Literature DB >> 18671878 |
Abstract
BACKGROUND: Eukaryotic mRNAs often contain secondary structures in their untranslated regions that are involved in expression regulation. Whether secondary structures in the protein coding regions are of functional importance remains unclear: laboratory studies suggest stable secondary structures within the protein coding sequence interfere with translation, while several bioinformatic studies indicate stable mRNA structures are more frequent than expected.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18671878 PMCID: PMC2533328 DOI: 10.1186/1471-2148-8-224
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Comparison of conserved optimal codon numbers at paired and unpaired sites.
| Method | Algorithm | ALL | GC-ending | AT-ending | GC leu & lys |
|---|---|---|---|---|---|
| RNAfold | MFE | 0.646 *** | 0.624 ** | 0.503 *** | 1.353 *** |
| Mc | 0.542 *** | 0.567 *** | 0.422 *** | 0.910 *** | |
| RNAfold | MFE | 0.667 *** | 0.608 *** | 0.542 NS | 1.137 *** |
| Mc | 0.560 *** | 0.571 *** | 0.407 ** | 1.119 *** | |
| RNAfold | MFE | 0.653 *** | 0.590 *** | 0.544 * | 1.172 *** |
| Mc | 0.572 *** | 0.623 *** | 0.411 *** | 1.131 *** | |
| RNAfold | MFE | 0.640 *** | 0.597 ** | 0.502 *** | 1.181 *** |
| Mc | 0.537 ** | 0.557 *** | 0.401 *** | 1.003 *** | |
| ALIfold | MFE | 0.638 *** | 0.577 *** | 0.465 NS | 1.499 *** |
| Mc | 0.468 ** | 0.444 *** | 0.326 *** | 0.997 *** | |
| RNAfold | MFE | 0.920 *** | 0.863 *** | 0.751 *** | 1.584 *** |
| Mc | 0.841 *** | 0.866 *** | 0.676 *** | 1.436 *** | |
| RNAfold | MFE | 0.878 *** | 0.826 *** | 0.733 *** | 1.497 *** |
| Mc | 0.819 *** | 0.822 *** | 0.659 *** | 1.460 *** | |
| RNAfold | MFE | 0.912 *** | 0.814 *** | 0.790 NS | 1.160 *** |
| Mc | 0.839 *** | 0.869 *** | 0.740 NS | 1.404 *** | |
| RNAfold | MFE | 0.904 *** | 0.855 *** | 0.742 *** | 1.590 *** |
| Mc | 0.833 *** | 0.840 *** | 0.675 *** | 1.455 *** | |
| ALIfold | MFE | 0.899 *** | 0.759 ** | 0.739 NS | 1.937 *** |
| Mc | 0.770 *** | 0.724 *** | 0.645 *** | 1.529 *** | |
I combine contingency tables for all amino acids and genes (ALL) and subsets of amino acids with GC- and AT- ending optimal codons (leu and lys are treated separately, as these two GC-ending amino acids behave very opposing, see below Table 2). Mantel Haenzsel estimators and significances are presented, WMH <1 = lower optimal codon use at paired than at unpaired sites.
* < 0.05, ** < 0.01, *** < 0.005, NS = not significant. Structure prediction is based on ALIfold and RNAfold using MFE and McCaskill's (Mc) algorithm.
Comparison of conserved optimal codon numbers at paired and unpaired sites.
| RNAfold ( | ALIfold | |||
|---|---|---|---|---|
| MFE | Mc | MFE | Mc | |
| 1.380 *** | 1.273 *** | 1.811 *** | 1.456 *** | |
| 1.772 *** | 1.654 *** | 2.019 *** | 1.654 *** | |
| 1.075 *** | 1.120 *** | 0.786 NS | 0.864 ** | |
| 0.809 NS | 0.720 NS | 0.741 NS | 0.631 NS | |
| 0.707 NS | 0.600 NS | 0.568 NS | 0.657 * | |
| 0.813 NS | 0.838 NS | 0.656 *** | 0.746 ** | |
| 0.891 * | 0.831 NS | 0.725 ** | 0.637 NS | |
| 1.318 *** | 1.254 *** | 1.555 *** | 1.049 *** | |
| 1.272 *** | 0.978 *** | 1.561 *** | 1.009 *** | |
| 0.585 NS | 0.852 *** | 0.734 *** | 0.560 *** | |
| 0.705 * | 0.794 NS | 0.806 * | 0.686 NS | |
| 0.838 ** | 0.921 *** | 0.811 *** | 0.750 *** | |
| 0.940 *** | 0.876 *** | 0.932 *** | 0.655 *** | |
| 0.614 NS | 0.668 NS | 0.583 NS | 0.591 NS | |
| 0.794 NS | 0.816 NS | 0.816 *** | 0.985 *** | |
| 0.968 ** | 0.904 *** | 1.073 *** | 0.883 *** | |
| 0.588 *** | 0.545 *** | 0.518 *** | 0.460 *** | |
| 0.894 NS | 0.779 NS | 0.901 *** | 0.752 NS | |
| 0.442 *** | 0.353 *** | 0.293 *** | 0.278 *** | |
| 0.946 *** | 0.350 *** | 0.386 *** | 0.315 *** | |
| 0.851 NS | 0.734 NS | 0.729 ** | 0.689 * | |
| 0.500 NS | 0.382 NS | 0.755 *** | 0.403 NS | |
Separately for each amino acid, I combine contingency tables of the different genes. Mantel Haenzsel estimators and significances are presented, with WMH <1 = lower optimal codon use at paired than at unpaired sites. Structure prediction is based on ALIfold and RNAfold using MFE and McCaskill's (Mc) algorithm.
* < 0.05, ** < 0.01, *** < 0.005, NS = not significant
Similarity of predicted structures for species pairs.
| Species comparison | Prediction Method | (P+U)/all | P/all | U/all |
|---|---|---|---|---|
| MFE | 27% ± 0.6 | 17% ± 0.3 | 10% ± 0.5 | |
| Mc | 27% ± 0.4 | 9% ± 0.2 | 18% ± 0.3 | |
| MFE | 63% ± 0.6 | 36% ± 0.3 | 27% ± 0.3 | |
| Mc | 64% ± 0.5 | 22% ± 0.4 | 42% ± 0.5 | |
| MFE | 59% ± 0.3 | 34% ± 0.2 | 25% ± 0.2 | |
| Mc | 61% ± 0.4 | 20% ± 0.4 | 41% ± 0.6 | |
| MFE | 42% ± 0.4 | 23% ± 0.5 | 19% ± 0.1 | |
| Mc | 59% ± 0.3 | 20% ± 0.0 | 40% ± 0.5 | |
| MFE | 25% ± 0.3 | 8% ± 0.1 | 16% ± 0.2 | |
| Mc | 25% ± 0.4 | 16% ± 0.2 | 9% ± 0.1 | |
| MFE | 61% ± 0.4 | 36% ± 0.2 | 25% ± 0.2 | |
| Mc | 63% ± 0.3 | 22% ± 0.3 | 41% ± 0.4 | |
| MFE | 58% ± 0.2 | 35% ± 0.2 | 23% ± 0.1 | |
| Mc | 60% ± 0.3 | 20% ± 0.3 | 40% ± 0.4 | |
| MFE | 57% ± 0.2 | 34% ± 0.1 | 23% ± 0.1 | |
| Mc | 58% ± 0.2 | 20% ± 0.2 | 39% ± 0.4 | |
The average percentages of sites (± variances) unambiguously predicted to be paired (P/all) and/or unpaired ((P+U)/all, U/all) for the respective species comparison using RNAfold MFE and McCaskill's (Mc) algorithm are presented.
Comparison of synonymous substitution numbers at paired and unpaired sites.
| AT | GC | All | ||||||
|---|---|---|---|---|---|---|---|---|
| 0.511 | 0.542 | 0.600 | 0.488 | 0.296 | 0.414 | 0.481 | ||
| 0.567 | 0.528 | 0.546 | 0.544** | 0.327* | 0.458** | 0.505 | ||
| 0.255 | 0.201 | 0.232 | 0.140 | 0.257 | 0.213 | 0.225 | ||
| 0.255 | 0.230 | 0.238 | 0.264 | 0.167 | 0.222 | 0.232 |
Looking at S. cerevisiae and S. paravensis , I compare numbers of each codon in S. cerevisiae being either synonymous non-conserved or conserved at paired or unpaired sites. Structure prediction is based on RNAfold upon the S. cerevisiae sequence using MFE and McCaskill's (Mc) algorithm. Mantel Haenzsel estimators and significances are presented. WMH<1 = lower numbers of synonymous substitutions at paired sites. (1) All genes, (2) Genes that are shorter than 800 bp.
* < 0.05, ** < 0.01, *** < 0.005, NS = not significant