| Literature DB >> 21801361 |
Louise J Johnson1, James A Cotton, Conrad P Lichtenstein, Greg S Elgar, Richard A Nichols, P David Polly, Steven C Le Comber.
Abstract
BACKGROUND: Efficient gene expression involves a trade-off between (i) premature termination of protein synthesis; and (ii) readthrough, where the ribosome fails to dissociate at the terminal stop. Sense codons that are similar in sequence to stop codons are more susceptible to nonsense mutation, and are also likely to be more susceptible to transcriptional or translational errors causing premature termination. We therefore expect this trade-off to be influenced by the number of stop codons in the genetic code. Although genetic codes are highly constrained, stop codon number appears to be their most volatile feature.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21801361 PMCID: PMC3161013 DOI: 10.1186/1471-2148-11-227
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Values of π[aa:stop] and π[stop:aa] as number of stop codons, S, increases
| 1 | 1/63 | 1 |
| 2 | 8/279 | 8/9 |
| 3 | 7/183 | 7/9 |
| 4 | 2/45 | 2/3 |
| 5 | 31/531 | 31/45 |
| 6 | 2/29 | 2/3 |
| 7 | 13/171 | 13/21 |
| 8 | 5/63 | 5/9 |
| 9 | 1/11 | 5/9 |
Figure 1Proportion near-stop codons in human coding sequences. The proportion of triplets within a single point mutation of a stop codon, as a function of distance from the correct stop, for 500 bp upstream and 1000 bp downstream of a) all protein-coding transcripts in the human genome; b) all coding sequences in the yeast genome. Negative values represent upstream triplets and positive values downstream triplets.
Figure 2Length of coding sequences. Average gene length (y-axis) of GenBank coding sequences from taxa within which genetic codes differ in stop codon number (x-axis). Solid lines, nuclear genes; dotted lines, mitochondrial genes. See Methods for details.
Figure 3Thresholds for changes in the number of stop codons. Contour plots showing the length of coding sequence, in triplets, at which the transition from (a) one to two stop codons; (b) two to three stop codons; (c) three to four stop codons; (d) four to five stop codons; and (e) five to six stop codons becomes possible, for values of R from 1 to 100 and values of k from 0.1 to 1 (see Equations 1 and 2). Contour lines separate lengths of coding sequence from 0-25 triplets (darkest areas) to 200-225 triplets (lightest areas), in increments of 25 triplets. Transitions to greater numbers of stop codons become increasingly difficult as the number of stop codons increases; the transition from four to five stops is favoured by selection only when the mean coding sequence length is very low (very dark shading over most of the plotted area).