| Literature DB >> 21481245 |
Abstract
BACKGROUND: Surprisingly, in several multi-cellular eukaryotes optimal codon use correlates negatively with gene length. This contrasts with the expectation under selection for translational accuracy. While suggested explanations focus on variation in strength and efficiency of translational selection, it has rarely been noticed that the negative correlation is reported only in organisms whose optimal codons are biased towards codons that end with G or C (-GC). This raises the question whether forces that affect base composition--such as GC-biased gene conversion--contribute to the negative correlation between optimal codon use and gene length.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21481245 PMCID: PMC3096941 DOI: 10.1186/1471-2148-11-93
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Difference of optimal codons depending on whether they end with -GC or -AT.
| Amino acid | Gene length | Expression | Recombination |
|---|---|---|---|
| +0.0229NS | +0.2546*** | Co:-0.0656** | |
| +0.0002NS | +0.4301*** | Co:+0.0293NS | |
| +0.0535* | +0.2293*** | Co:-0.0162NS | |
| +0.0603* | +0.2076*** | Co:-0.0777** | |
| +0.1396*** | +0.1663*** | Co:-0.0658** | |
| +0.0470NS | +0.4385*** | Co:+0.0241S | |
| +0.0304NS | +0.3114*** | Co:-0.0509* | |
| +0.1271*** | +0.3732*** | Co:-0.0867** | |
| Leu | -0.1031*** | +0.3690*** | Co:+0.1828*** |
| -0.1588*** | +0.2880*** | Co:+0.2101*** | |
| -0.1995*** | +0.1485*** | Co:+0.1976*** | |
| -0.1850*** | +0.2240*** | Co:+0.1718*** | |
| -0.1103*** | +0.2729*** | Co:+0.2689*** | |
| -0.2207*** | +0.3210*** | Co:+0.2248*** | |
| -0.1010*** | +0.1060*** | Co:+0.1193*** | |
| -0.2604*** | +0.4005*** | Co:+0.2935*** | |
Spearman Rank Correlations between optimal codon use, gene length, expression, and various recombination measures for amino acids and groups of amino acids with either AT- or with GC-ending optimal codons. 1554 genes for which all amino acids were present at least four times were used. Presented for grouped variables are also MR results of variables whose log-transformation did not grossly deviate from a normal distribution. *P < 0.05, **P < 0.01,***P < 0.001, NS = not significant.
Effect of base composition independent of translational selection.
| Amino acid | Gene length | Expression | Recombination |
|---|---|---|---|
| -0.1567*** | +0.1026*** | Co:+0.2280*** | |
| -0.1054*** | +0.0354NS | Co:+0.1602*** | |
| -0.0899** | +0.0837** | Co: = 0.2019*** | |
| -0.1544*** | +0.0954*** | Co:+0.2036*** | |
| -0.2153*** | +0.1304*** | Co:+0.3181*** | |
| -0.0855** | +0.1573*** | Co:+0.1592*** | |
| -0.0152NS | +0.2595*** | Co:+0.0435NS | |
| -0.0689* | +0.1643*** | Co:+0.1707*** | |
| -0.1961*** | -0.0819** | Co:+0.2238*** | |
| -0.1141*** | -0.1121*** | Co:+0.1576*** | |
| -0.0736** | -0.0469NS | Co:+0.1404*** | |
| -0.1153*** | -0.0056NS | Co:+0.1943*** | |
| -0.0599* | +0.1127*** | Co:+0.2433*** | |
| -0.2858*** | +0.0820** | Co:+0.3194*** | |
Spearman Rank Correlations between the frequency of GC-ending optimal or non-optimal codons with gene length, expression, and various recombination measures. For each amino acid, only genes were used for which at least 4 optimal or 4 non-optimal codons were present respectively. Presented for grouped variables are also MR results of variables whose log-transformation did not grossly deviate from a normal distribution. *P < 0.05, **P < 0.01,***P < 0.001, NS = not significant.
Effect of gene position.
| Position 1-300 (49-300) | |
|---|---|
| +0.2679*** (+0.2656***) | |
| +0.1244* (+0.0648NS) | |
| -0.0647NS (-0.1091NS) | |
| -0.1227* (-0.1591**) | |
Spearman Rank Analysis of frequencies of optimal and GC-ending codons with gene position for position numbers 1-300 and 49-300. *P < 0.05, **P < 0.01,***P < 0.001, NS = not significant.
Controlling the effect of gene position.
| Gene length | Expression | Recombination | |
|---|---|---|---|
| -0.0480NS | +0.3696*** | Co:-0.1064*** | |
| -0.1289*** | +0.3839*** | Co:+3284*** | |
| -0.0751** | +0.1216*** | Co:+0.2735*** | |
| -0.0481NS | +0.0410NS | Co:+0.2990*** | |
Spearman Rank Correlations between frequencies of optimal and GC-ending codons with gene length, expression, and various recombination measures after controlling for a potential effect of gene position. The same 1571 genes were used for all measures. Presented are also MR results of variables whose log-transformation did not grossly deviate from a normal distribution. *P < 0.05, **P < 0.01,***P < 0.001
Substitution patterns.
| Gene length | Expression | Recombination | |
|---|---|---|---|
| +0.1082* | -0.1922*** | Co:+0.0178NS | |
| +0.2193*** | -0.2435*** | Co:-0.1277*** | |
| +0.2048*** | -0.1615*** | Co:+0.0210NS | |
| +0.1559*** | -0.0454NS | Co:-0.0156NS | |
| -0.0752*** | +0.0441* | Co:+0.0983*** | |
| -0.0281NS | -0.1210*** | Co:+0.0264NS | |
| +0.0118NS | -0.1420*** | Co:-0.1025*** | |
| -0.0136NS | -0.0748*** | Co:+0.0345NS | |
| -0.0460* | +0.0322NS | Co:-0.0151NS | |
| -0.0329NS | +0.0131NS | CO:+0.0824** | |
Spearman Rank Correlations of directed substitutions at four-fold degenerated sites (that to not change the non-optimal status) and of the proportion of AT->GC substitutions with gene length, expression, and various recombination measures, with and without controlling for an effect of gene position. Presented are also MR results of variables whose log-transformation did not grossly deviate from a normal distribution. *P < 0.05, **P < 0.01,***P < 0.001, NS = not significant.