| Literature DB >> 27297468 |
Yulong Wei1, Juan Wang1, Xuhua Xia2.
Abstract
Three stop codons in bacteria represent different translation termination signals, and their usage is expected to depend on their differences in translation termination efficiency, mutation bias, and relative abundance of release factors (RF1 decoding UAA and UAG, and RF2 decoding UAA and UGA). In 14 bacterial species (covering Proteobacteria, Firmicutes, Cyanobacteria, Actinobacteria and Spirochetes) with cellular RF1 and RF2 quantified, UAA is consistently over-represented in highly expressed genes (HEGs) relative to lowly expressed genes (LEGs), whereas UGA usage is the opposite even in species where RF2 is far more abundant than RF1. UGA usage relative to UAG increases significantly with PRF2 [=RF2/(RF1 + RF2)] as expected from adaptation between stop codons and their decoders. PRF2 is > 0.5 over a wide range of AT content (measured by PAT3 as the proportion of AT at third codon sites), but decreases rapidly toward zero at the high range of PAT3 This explains why bacterial lineages with high PAT3 often have UGA reassigned because of low RF2. There is no indication that UAG is a minor stop codon in bacteria as claimed in a recent publication. The claim is invalid because of the failure to apply the two key criteria in identifying a minor codon: (1) it is least preferred by HEGs (or most preferred by LEGs) and (2) it corresponds to the least abundant decoder. Our results suggest a more plausible explanation for why UAA usage increases, and UGA usage decreases, with PAT3, but UAG usage remains low over the entire PAT3 range.Entities:
Keywords: RF1; RF2; gene expression.; prfA; prfB; release factors; stop codon; translation termination
Mesh:
Substances:
Year: 2016 PMID: 27297468 PMCID: PMC4989110 DOI: 10.1093/molbev/msw107
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Bacterial Species with Both RF1 and RF2 Concentrations (in ppm, with mean values presented for multiple measurements) in PaxDB (Wang et al. 2012), Together with Stop Codon Usage in Highly Expressed and Lowly Expressed Genes (HEGs and LEGs).
| Speciesa | RF1 | RF2 | PAT3 | PRF2 | PUAA.LEG | PUAA.HEG | PUAG.LEG | PUAG.HEG | PUGA.LEG | PUGA.HEG | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1,000 | 53.1 | 453.00 | 0.4383 | 0.8951 | 0.5730 | 0.7770 | 0.1070 | 0.0320 | 0.3200 | 0.1910 | |
| 300 | 11.6 | 672.00 | 0.4979 | 0.9830 | 0.6100 | 0.7433 | 0.1300 | 0.0700 | 0.2600 | 0.1867 | |
| 800 | 200.5 | 548.50 | 0.2018 | 0.7323 | 0.1525 | 0.1688 | 0.2713 | 0.3538 | 0.5763 | 0.4775 | |
| 600 | 59.2 | 142.89 | 0.4008 | 0.7070 | 0.5717 | 0.7650 | 0.1083 | 0.0433 | 0.3200 | 0.1917 | |
| 300 | 45.5 | 98.05 | 0.7247 | 0.6833 | 0.7167 | 0.9100 | 0.1067 | 0.0467 | 0.1767 | 0.0433 | |
| 500 | 56.4 | 167.00 | 0.1262 | 0.7475 | 0.0560 | 0.2640 | 0.1280 | 0.0480 | 0.8160 | 0.6880 | |
| 300 | 157.0 | 214.00 | 0.5777 | 0.5768 | 0.6267 | 0.6600 | 0.1567 | 0.1567 | 0.2167 | 0.1833 | |
| 600 | 139.3 | 183.00 | 0.6969 | 0.5677 | 0.5683 | 0.6467 | 0.1317 | 0.0983 | 0.3000 | 0.2550 | |
| 1,000 | 35.1 | 27.00 | 0.6059 | 0.4348 | 0.4639 | 0.5908 | 0.2392 | 0.2536 | 0.2970 | 0.1556 | |
| 301 | 246.5 | 74.65 | 0.6766 | 0.2324 | 0.5748 | 0.7902 | 0.2292 | 0.1475 | 0.1960 | 0.0623 | |
| 1,000 | 216.0 | 205.00 | 0.5518 | 0.4869 | 0.5600 | 0.7300 | 0.1530 | 0.1240 | 0.2870 | 0.1460 | |
| 300 | 94.3 | 4.59 | 0.7349 | 0.0464 | 0.7367 | 0.8567 | 0.1367 | 0.0867 | 0.1267 | 0.0567 | |
| 392 | 496.0 | 47.70 | 0.7702 | 0.0877 | 0.7398 | 0.8475 | 0.1633 | 0.1025 | 0.0969 | 0.0500 | |
| 301 | 425.5 | 377.00 | 0.3096 | 0.4698 | 0.2425 | 0.3033 | 0.1362 | 0.1433 | 0.6213 | 0.5533 |
aFull species names are, in the same order, Escherichia coli, Yersinia pestis CO92, Mycobacterium tuberculosis, Salmonella enterica, Lactococcus lactis, Pseudomonas aeruginosa, Helicobacter pylori, Leptospira interrogans, Microcystis aeruginosa, Streptococcus pyogenes, Bacillus subtilis, Bacillus anthracis, Staphylococcus aureus sp. Mu50, and Acidithiobacillus ferrooxidans.
bNumber of genes in top and bottom 25% on the gene expression scale (ranked by either protein abundance values in PaxDB). If 25% includes >1,000 genes, then use 1,000.
cProportion of AT at third codon site.
dProportion of RF2, i.e., RF2/(RF1 + RF2).
eProportion of UAA stop codons in LEGs. The same format applies to the last five columns.
Fig. 1.Stop codon UAA is preferred in highly expressed genes (HEGs) relative to lowly expression genes (LEGs) in all 14 species, regardless of (a) relative abundance in RF1 and RF2, measured by PRF2 as RF2/(RF1 + RF2), or (b) proportion of AT at third codon site (PAT3).
Fig. 2.Stop codon UGA is never preferred in HEGs relative to LEGs even RF2 is far more abundant than RF1 (a), and stop codon UAG is preferred in HEGs in 3 of the 14 species (b).
Fig. 3.Relative usage of UGA and UAG, measured as P2UGA = UGA/(UGA + UAG), increases significantly with relative abundance of RF2, measured as PRF2 = RF2/(RF1 + RF2).
Fig. 4.Phylogenetic tree built with small subunit ribosomal RNA sequences (ssu rRNA), used for independent contrasts, with leaves denoted by species name and GenBank accession for genomes from which the ssu rRNA sequences are extracted. Only the first annotated ssu rRNA sequence is used.
Fig. 5.Relative abundance of RF2 decreases rapidly at high range of AT content, measured by proportion of AT at third codon site (PAT3).
Fig. 6.UAA usage increases, and UGA usage decreases, with PAT3, but UAG usage is low and changes little with PAT3. The pattern is consistent in both highly expressed genes (a) and lowly expressed genes (b).