| Literature DB >> 15728743 |
Paul M Sharp1, Elizabeth Bailes, Russell J Grocock, John F Peden, R Elizabeth Sockett.
Abstract
Among bacteria, many species have synonymous codon usage patterns that have been influenced by natural selection for those codons that are translated more accurately and/or efficiently. However, in other species selection appears to have been ineffective. Here, we introduce a population genetics-based model for quantifying the extent to which selection has been effective. The approach is applied to 80 phylogenetically diverse bacterial species for which whole genome sequences are available. The strength of selected codon usage bias, S, is found to vary substantially among species; in 30% of the genomes examined, there was no significant evidence that selection had been effective. Values of S are highly positively correlated with both the number of rRNA operons and the number of tRNA genes. These results are consistent with the hypothesis that species exposed to selection for rapid growth have more rRNA operons, more tRNA genes and more strongly selected codon usage bias. For example, Clostridium perfringens, the species with the highest value of S, can have a generation time as short as 7 min.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15728743 PMCID: PMC549432 DOI: 10.1093/nar/gki242
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The 80 bacterial genome sequences analysed
| Species code | Gene numbers | GC content | Random | Accession nos | Species | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| rRNA | tRNA | ORF | i | ii | iii | ||||||
| Gamma proteobacteria | |||||||||||
| Esccol | 7 | 86 | 4289 | 51 | 54 | 48 | 1.488 | (0.308/−0.286) | 992 | U00096 | |
| Salent | 7 | 84 | 4452 | 53 | 58 | 50 | 1.522 | (0.292/−0.254) | 993 | AE006468 | |
| Yerpes | 6 | 68 | 4008 | 48 | 48 | 43 | 1.153 | (0.258/−0.243) | 991 | AL590842 | |
| BucaAp | 1 | 30 | 564 | 26 | 12 | 12 | −0.017 | (0.179/−0.228) | 1200 | BA000003 | |
| BucaBp | 1 | 31 | 504 | 25 | 12 | 12 | −0.590 | (0.356/−0.448) | 1241 | AF492592 | |
| BucaSg | 1 | 31 | 545 | 25 | 10 | 11 | −0.069 | (0.213/−0.265) | 1223 | AE013218 | |
| Wigglo | 2 | 34 | 611 | 22 | 9 | 10 | 0.105 | (0.203/−0.247) | 1262 | BA000021 | Wigglesworthia glossinidia |
| Haeinf | 6 | 56 | 1709 | 38 | 27 | 24 | 1.492 | (0.330/−0.325) | 1001 | L42023 | Haemophilus influenzae |
| Pasmul | 6 | 56 | 2014 | 41 | 32 | 27 | 1.339 | (0.289/−0.282) | 1007 | AE004439 | Pasteurella multocida |
| Vibcho | 8 | 98 | 3828 | 47 | 47 | 37 | 1.725 | (0.294/−0.273) | 970 | AE003852* | Vibrio cholerae |
| Vibpar | 11 | 126 | 4832 | 45 | 44 | 33 | 1.886 | (0.336/−0.300) | 960 | BA000031* | Vibrio parahaemolyticus |
| Vibvul | 9 | 111 | 4537 | 47 | 47 | 34 | 1.950 | (0.296/−0.266) | 973 | AE016795* | |
| Sheone | 9 | 100 | 4630 | 46 | 45 | 37 | 1.377 | (0.313/−0.275) | 983 | AE014299 | Shewenella oneidensis |
| Pseaer | 4 | 62 | 5566 | 67 | 87 | 74 | −0.019 | (0.484/−0.507) | 940 | AE004091 | Pseudomonas aeruginosa |
| Pseput | 7 | 74 | 5350 | 62 | 77 | 64 | 0.917 | (0.360/−0.317) | 966 | AE015451 | Pseudomonas putida |
| Psesyr | 5 | 64 | 5566 | 58 | 71 | 58 | 0.701 | (0.255/−0.243) | 958 | AE016853 | Pseudomonas syringae |
| Xanaxo | 2 | 54 | 4312 | 65 | 80 | 80 | 0.636 | (0.273/−0.261) | 952 | AE008923 | Xanthomonas axonopodis |
| Xancam | 2 | 54 | 4181 | 65 | 81 | 80 | 0.607 | (0.292/−0.299) | 958 | AE008922 | Xanthomonas campestris |
| Xylfas | 2 | 49 | 2034 | 52 | 54 | 40 | −0.781 | (0.382/−0.324) | 990 | AE009442 | |
| Coxbur | 1 | 42 | 2009 | 43 | 38 | 43 | 0.175 | (0.170/−0.184) | 975 | AE016828 | Coxiella burnetii |
| Beta proteobacteria | |||||||||||
| Neimen | 4 | 58 | 2121 | 52 | 60 | 42 | −0.099 | (0.373/−0.346) | 1015 | AL157959 | |
| Niteur | 1 | 41 | 2574 | 51 | 53 | 37 | −0.884 | (0.258/−0.253) | 1006 | AL954747 | Nitrosomonas europaea |
| Ralsol | 3 | 57 | 5120 | 67 | 87 | 80 | 0.024 | (0.451/−0.371) | 992 | AL646052* | Ralstonia solanacearum |
| Alpha proteobacteria | |||||||||||
| Agrtum | 4 | 53 | 4661 | 59 | 71 | 69 | 1.048 | (0.217/−0.202) | 1033 | AE008688* | |
| Sinmel | 3 | 54 | 6205 | 63 | 79 | 77 | 0.637 | (0.236/−0.225) | 1027 | AL591688 | Sinorhizobium meliloti |
| Brumel | 3 | 54 | 3198 | 57 | 66 | 67 | 0.896 | (0.237/−0.202) | 1037 | AE008917* | Brucella melitensis |
| Meslot | 2 | 52 | 6752 | 63 | 79 | 83 | 0.757 | (0.283/−0.245) | 1029 | BA000012 | Mesorhizobium loti |
| Brajap | 1 | 50 | 8317 | 64 | 82 | 86 | 0.741 | (0.312/−0.281) | 968 | BA000040 | Bradyrhizobium japonicum |
| Caucre | 2 | 51 | 3737 | 67 | 86 | 83 | 1.152 | (0.370/−0.310) | 970 | AE005673 | Caulobacter crescentus |
| Ricpro | 1 | 33 | 834 | 29 | 16 | 14 | −0.421 | (0.225/−0.243) | 1157 | AJ235269 | Rickettsia prowazekii |
| Riccon | 1 | 33 | 1374 | 32 | 21 | 17 | −0.410 | (0.234/−0.214) | 1135 | AE006914 | Rickettsia conorii |
| Epsilon proteobacteria | |||||||||||
| Camjej | 3 | 43 | 1654 | 31 | 17 | 16 | 0.486 | (0.300/−0.375) | 1119 | AL111168 | |
| Helpyl | 2 | 36 | 1491 | 39 | 41 | 42 | 0.016 | (0.184/−0.195) | 1138 | AE001439 | |
| Firmicutes (A+T-rich gram positives) | |||||||||||
| Bacsub | 10 | 86 | 4100 | 44 | 43 | 30 | 1.360 | (0.232/−0.224) | 1059 | AL009126 | Bacillus subtilis |
| Bacant | 11 | 95 | 5311 | 35 | 23 | 24 | 2.045 | (0.338/−0.316) | 1022 | AE016879 | |
| Bachal | 8 | 78 | 4066 | 44 | 40 | 34 | 0.999 | (0.166/−0.174) | 1046 | BA000004 | Bacillus halodurans |
| Oceihe | 7 | 69 | 3496 | 36 | 23 | 22 | 1.301 | (0.180/−0.197) | 1067 | BA000028 | Oceanobacillus iheyensis |
| Lismon | 6 | 67 | 2855 | 38 | 28 | 23 | 1.198 | (0.296/−0.288) | 1072 | AL591824 | |
| Entfae | 4 | 67 | 3113 | 38 | 28 | 24 | 1.840 | (0.324/−0.287) | 1083 | AE016830 | Enterococcus faecalis |
| Lacpla | 5 | 72 | 3051 | 45 | 43 | 34 | 1.253 | (0.271/−0.268) | 1032 | AL935263 | Lactobacillus plantarum |
| Laclac | 6 | 62 | 2266 | 35 | 23 | 23 | 2.288 | (0.334/−0.321) | 1035 | AE005176 | Lactococcus lactis lactis |
| Straga | 7 | 80 | 2124 | 36 | 23 | 21 | 1.504 | (0.282/−0.252) | 1070 | AE009948 | |
| Strpyo | 6 | 60 | 1696 | 39 | 30 | 24 | 1.759 | (0.299/−0.286) | 1081 | AE004092 | |
| Strpne | 4 | 58 | 2043 | 40 | 34 | 26 | 1.720 | (0.380/−0.364) | 1074 | AE007317 | |
| Staaur | 5 | 61 | 2593 | 33 | 20 | 18 | 1.564 | (0.248/−0.267) | 1084 | BA000018 | |
| Staepi | 5 | 58 | 2419 | 32 | 19 | 16 | 1.164 | (0.254/−0.243) | 1073 | AE015929 | Staphylococcus epididermis |
| Cloace | 11 | 73 | 3672 | 31 | 18 | 14 | 0.838 | (0.283/−0.286) | 856 | AE001437 | Clostridium acetobutylicum |
| Cloper | 10 | 95 | 2660 | 29 | 14 | 18 | 2.648 | (0.434/−0.420) | 838 | BA000016 | Clostridium perfringens |
| Clotet | 6 | 54 | 2373 | 29 | 14 | 13 | 1.004 | (0.244/−0.272) | 817 | AE015927 | Clostridium tetani |
| Theten | 4 | 55 | 2588 | 38 | 32 | 35 | 0.457 | (0.265/−0.266) | 842 | AE008691 | Thermoanaerobacter tengcongensis |
| Mycgen | 1 | 35 | 480 | 32 | 22 | 26 | 0.318 | (0.269/−0.310) | 1360 | L43967 | Mycoplasma genitalium |
| Mycpne | 1 | 36 | 688 | 40 | 41 | 43 | 0.324 | (0.206/−0.217) | 1307 | U00089 | Mycoplasma pneumoniae |
| Mycgal | 2 | 31 | 726 | 31 | 22 | 21 | 0.498 | (0.285/−0.391) | 1355 | AE015450 | Mycoplasma gallisepticum |
| Mycpen | 1 | 29 | 1037 | 26 | 12 | 11 | 0.496 | (0.237/−0.253) | 1379 | BA000026 | Mycoplasma penetrans |
| Mycpul | 1 | 28 | 782 | 27 | 13 | 12 | 0.380 | (0.235/−0.267) | 1235 | AL445566 | Mycoplasma pulmonis |
| Ureure | 1 | 29 | 611 | 26 | 11 | 10 | 0.401 | (0.232/−0.262) | 1223 | AF222894 | Ureaplasma urealyticum |
| Actinobacteria (G+C-rich gram positives) | |||||||||||
| Coreff | 5 | 56 | 2950 | 63 | 79 | 76 | 1.040 | (0.495/−0.395) | 1051 | BA000035 | Corynebacterium efficiens |
| Corglu | 6 | 60 | 3099 | 54 | 58 | 65 | 2.185 | (0.467/−0.381) | 1047 | BA000036 | Corynebacterium glutamicum |
| Myclep | 1 | 45 | 2720 | 58 | 64 | 73 | 0.515 | (0.224/−0.193) | 939 | AL450380 | Mycobacterium leprae |
| Myctub | 1 | 45 | 3918 | 66 | 79 | 83 | 0.452 | (0.256/−0.242) | 937 | AL123456 | |
| Strcoe | 6 | 63 | 7825 | 72 | 93 | 92 | 0.986 | (1.049/−0.618) | 921 | AL645882 | Streptomyces coelicolor |
| Strave | 6 | 68 | 7575 | 71 | 91 | 89 | 0.686 | (0.703/−0.501) | 937 | BA000030 | Streptomyces avermitilis |
| Trowhi | 1 | 50 | 808 | 46 | 41 | 46 | 0.014 | (0.189/−0.191) | 841 | AE014184 | |
| Biflon | 4 | 56 | 1729 | 60 | 75 | 79 | 1.344 | (0.519/−0.449) | 999 | AE014295 | Bifidobacterium longum |
| Cyanobacteria | |||||||||||
| Nostoc | 4 | 67 | 5366 | 41 | 33 | 38 | 0.763 | (0.295/−0.271) | 1020 | BA000019 | |
| Theelo | 1 | 40 | 2475 | 54 | 57 | 56 | 0.178 | (0.306/−0.207) | 1018 | BA000039 | Thermosynechococcus elongatus |
| Syn680 | 2 | 41 | 3056 | 48 | 48 | 53 | 0.678 | (0.243/−0.253) | 1024 | BA000022 | |
| Spirochaetes | |||||||||||
| Borbur | 1 | 32 | 850 | 29 | 19 | 20 | −0.308 | (0.436/−0.579) | 1215 | AE000783 | Borrelia burgdorferi |
| Trepal | 2 | 45 | 1031 | 53 | 53 | 54 | −0.015 | (0.248/−0.255) | 956 | AE000520 | Treponema pallidum |
| Lepint | 1 | 37 | 4358 | 36 | 28 | 33 | 0.670 | (0.254/−0.258) | 1192 | AE010300* | |
| Chlamydiae | |||||||||||
| Chltra | 2 | 37 | 894 | 41 | 32 | 31 | 0.132 | (0.236/−0.247) | 974 | AE001273 | Chlamydia trachomatis |
| Chlmur | 1 | 37 | 904 | 40 | 31 | 30 | 0.145 | (0.244/−0.239) | 989 | AE002160 | Chlamydia muridarum |
| Chlcav | 1 | 38 | 998 | 39 | 30 | 28 | 0.113 | (0.224/−0.208) | 1028 | AE015925 | Chlamydophila caviae |
| Chlpne | 1 | 38 | 1110 | 41 | 33 | 26 | −0.065 | (0.223/−0.234) | 1027 | AE002161 | |
| Fusobacteria | |||||||||||
| Fusnuc | 5 | 47 | 2067 | 27 | 10 | 10 | 1.244 | (0.242/−0.274) | 872 | AE009951 | Fusobacterium nucleatum |
| Bacteroidetes/Chlorobi | |||||||||||
| Bacthe | 5 | 70 | 4778 | 43 | 43 | 32 | 0.237 | (0.445/−0.418) | 1198 | AE015928 | Bacteroides thetaiotamicron |
| Chltep | 2 | 50 | 2252 | 57 | 72 | 65 | 0.069 | (0.301/−0.311) | 1072 | AE006470 | Chlorobium tepidum |
| Deinococci | |||||||||||
| Deirad | 3 | 49 | 2936 | 67 | 84 | 86 | 1.491 | (0.299/−0.280) | 990 | AE000513* | Deinococcus radiodurans |
| Thermotogae | |||||||||||
| Themar | 1 | 46 | 1846 | 46 | 51 | 48 | 0.365 | (0.281/−0.276) | 954 | AE000512 | Thermotoga maritima |
| Aquificae | |||||||||||
| Aquaeo | 2 | 43 | 1522 | 43 | 47 | 48 | 0.393 | (0.260/−0.273) | 837 | AE000657 | Aquifex aeolicus |
aThe species code (as used in Figure 1).
bThe numbers of rRNA operons, tRNA genes and putative protein-coding genes (ORFs).
cThe %G+C content of (i) the genome, or at synonymously variable third positions (ii) averaged over all genes and (iii) in the highly expressed gene data set.
dThe strength of selected codon usage bias (S) calculated for the highly expressed gene data set.
eThe 95% range of values of S among 1000 sets of randomly selected genes.
fThe number of codons used to estimate S, in the highly expressed gene data set.
gThe GenBank accession number for the genome sequence; asterisk indicates species with two chromosomes; the accession numbers for the two chromosomes are consecutive, except for D.radiodurans, where the second accession no. is AE001825.
Figure 1Phylogenetic relationships of the 80 bacterial genomes analysed. Species codes are given in Table 1.
Figure 2G+C content at the three codon positions within the dnaA gene, compared with the G+C content of the genome as a whole, for 79 bacterial genomes (no dnaA homologue has been found in W.glossinidia). Positions 1, 2 and 3 are indicated by open circles, open triangles and filled circles, respectively. The third position is strongly influenced by G+C bias; the first two positions are also influenced, implying an effect on amino acid composition (68).
Figure 3Selected codon usage bias (S) and genomic G+C bias for 80 bacterial species. Genomic G+C bias is estimated by the overall GC3S. Open circles denote species where the S-value is not greater than found among randomly selected genes; filled triangles denote three Clostridium species.
Figure 4Ribosomal RNA operon copy number and tRNA gene number for 80 bacterial species. Symbols are as in Figure 3.
Figure 5Selected codon usage bias (S) and ribosomal RNA operon copy number for 80 bacterial species. Symbols are as in Figure 3.
Figure 6Selected codon usage bias (S) and tRNA gene number for 80 bacterial species. Symbols are as in Figure 3.
Figure 7Comparison of two estimates of selected codon usage bias: x-axis values are taken from this paper, y-axis values from dos Reis et al. (62). Values significantly greater than zero in the dos Reis et al. analysis are shown as circles; values significantly greater than zero in our analysis are shown as filled symbols.