| Literature DB >> 23527255 |
Abstract
According to the ramp model of mRNA translation, the first 50 codons favor rare codons and have slower speed of translation. This study aims to detect translational selection on coding synonymous single nucleotide polymorphisms (sSNP) to support the ramp theory. We investigated fourfold degenerate site (FFDS) sSNPs with A ↔ G or C ↔ T substitutions in human genome for distribution bias of synonymous codons (SC), grouped by CpG or non-CpG sites. Distribution bias of sSNPs between the 3(rd) ~50(th) codons and the 51(st) ~ remainder codons at non-CpG sites were observed. In the 3(rd) ~50(th) codons, G → A sSNPs at non-CpG sites are favored than A → G sSNPs [P = 2.89 × 10(-3)], and C → T at non-CpG sites are favored than T → C sSNPs [P = 8.50 × 10(-3)]. The favored direction of SC usage change is from more frequent SCs to less frequent SCs. The distribution bias is more obvious in synonymous substitutions CG(G → A), AC(C → T), and CT(C → T). The distribution bias of sSNPs in human genome, i.e. frequent SCs to less frequent SCs is favored in the 3(rd) ~50(th) codons, indicates translational selection on sSNPs in the ramp regions of mRNA templates.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23527255 PMCID: PMC3602041 DOI: 10.1371/journal.pone.0059706
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
A↔G and C↔T fourfold degenerate site sSNPs.
| Substitution type | A→G n(%) | G→A n(%) | C→T n(%) | T→C n(%) | Total (count) |
|
| |||||
| 3rd ∼50th codons | 137(9.3%) | 453(30.7%) | 727(49.3%) | 158(10.7%) | 1475 |
| 51st ∼ Remainder codons | 1632(11.6%) | 3988(28.5%) | 6575(46.9%) | 1818(13.0%) | 14013 |
| 3rd ∼50th codons | p = 2.89×10−3 ** | p = 8.50×10−3 ** | |||
|
| |||||
| 3rd ∼50th codons | 283(15.2%) | 760(40.9%) | 642(34.6%) | 173(9.3%) | 1858 |
| Remainder codons | 3614(16.5%) | 9212(42.1%) | 7046(32.2%) | 1990(9.1%) | 21862 |
| 3rd ∼50th codons | p = 0.471 | p = 0.599 | |||
χ2 test of the difference of substitution direction between the first 50 codons and the remainder codons; * P<0.05; **P<0.01.
Significant distribution bias of sSNPs between the 3rd ∼50th codons and the 51st ∼ remainder codons was identified at non-CpG sites.
A↔G fourfold degenerate site sSNPs.
| First two codon positions | CpG site (yes or no) | 3rd ∼50th codons | 51st ∼ Remainder codons | 3rd ∼50th codons | ||
| A→G n(%) | G→A n(%) | A→G n(%) | G→A n(%) | |||
| AC | Yes | 68(1.8%) | 164(4.4%) | 1021(27.1%) | 2508(66.7%) | 0.902 |
| CC | Yes | 84(2.1%) | 226(5.7%) | 1039(26.2%) | 2621(66%) | 0.628 |
| CG | No | 14(2%) | 68(9.7%) | 172(24.5%) | 449(63.9%) | 0.040* |
| CT | No | 50(2.2%) | 171(7.4%) | 601(26%) | 1491(64.5%) | 0.055 |
| GC | Yes | 72(2%) | 246(6.8%) | 907(25.2%) | 2369(65.9%) | 0.054 |
| GG | No | 40(2.6%) | 96(6.3%) | 499(32.8%) | 887(58.3%) | 0.125 |
| GT | No | 33(2%) | 118(7.1%) | 360(21.5%) | 1161(69.4%) | 0.616 |
| TC | Yes | 59(2.3%) | 124(4.9%) | 647(25.4%) | 1714(67.4%) | 0.159 |
χ2 test of the difference of substitution direction between the first 50 codons and the remainder codons; *Uncorrected P<0.05. By Bonferroni correction for multiple comparisons, the threshold for statistical significance is P<0.00625.
The G→A bias was mainly explained by the CG(G→A) substitution at non-CpG sites.
Figure 1The distribution bias of CG(G→A) and CG(A→G) at the ramp regions.
The ratio of CG(G→A)/CG(A→G) at the ramp regions is larger than that at the reminder coding regions (P = 0.040). CG(A↔G) synonymous substitutions are all at non-CpG sites.
C↔T fourfold degenerate site sSNPs.
| First two codon positions | CpG site (yes or no) | 3rd ∼50th codons | 51st ∼ Remainder codons | 3rd ∼50th codons | ||
| C→T n(%) | T→C n(%) | C→T n(%) | T→C n(%) | |||
| AC | Yes | 79(5.5%) | 17(1.2%) | 1027(71.6%) | 311(21.7%) | 0.212 |
| AC | No | 97(7.8%) | 13(1%) | 866(69.8%) | 264(21.3%) | 0.006** |
| CC | Yes | 106(6%) | 31(1.8%) | 1249(70.8%) | 377(21.4%) | 0.882 |
| CC | No | 81(6.2%) | 26(2%) | 906(69.1%) | 298(22.7%) | 0.917 |
| CG | Yes | 42(7.1%) | 8(1.4%) | 437(73.9%) | 104(17.6%) | 0.578 |
| CG | No | 48(9.6%) | 6(1.2%) | 377(75.7%) | 67(13.5%) | 0.435 |
| CT | Yes | 69(8.3%) | 15(1.8%) | 613(73.5%) | 137(16.4%) | 0.927 |
| CT | No | 129(9.3%) | 17(1.2%) | 1006(72.5%) | 236(17%) | 0.029* |
| GC | Yes | 127(6%) | 35(1.7%) | 1514(71.9%) | 430(20.4%) | 0.879 |
| GC | No | 126(7.8%) | 27(1.7%) | 1165(72.2%) | 296(18.3%) | 0.442 |
| GG | Yes | 116(8.8%) | 25(1.9%) | 958(72.7%) | 219(16.6%) | 0.800 |
| GG | No | 103(8.8%) | 20(1.7%) | 834(71.2%) | 215(18.3%) | 0.267 |
| GT | Yes | 33(5.7%) | 13(2.2%) | 408(70.2%) | 127(21.9%) | 0.491 |
| GT | No | 40(5%) | 14(1.8%) | 593(74.7%) | 147(18.5%) | 0.285 |
| TC | Yes | 70(5.7%) | 29(2.4%) | 840(68.6%) | 285(23.3%) | 0.387 |
| TC | No | 103(8.2%) | 35(2.8%) | 828(65.7%) | 295(23.4%) | 0.819 |
χ2 test of the difference of substitution direction between the first 50 codons and the remainder codons; *Uncorrected P<0.05; **Uncorrected P<0.01. By Bonferroni correction for multiple comparisons, the threshold for statistical significance is P<0.003125.
The C→T bias was mainly explained by the AC(C→T) and CT(C→T) substitutions at non-CpG sites.
Figure 2The distribution bias of (C→T) and (T→C) at non-CpG sites of the ramp regions.
(a) The ratio of AC(C→T)/AC(T→C) at non-CpG sites of the ramp regions is larger than that at the reminder coding regions(P = 0.006). (b) The ratio of CT(C→T)/CT(T→C) at non-CpG sites of the ramp regions is larger than that at the reminder coding regions (P = 0.029).