| Literature DB >> 20691090 |
Jon Bohlin1, Lars Snipen, Simon P Hardy, Anja B Kristoffersen, Karin Lagesen, Torunn Dønsvik, Eystein Skjerve, David W Ussery.
Abstract
BACKGROUND: Bacterial genomes possess varying GC content (total guanines (Gs) and cytosines (Cs) per total of the four bases within the genome) but within a given genome, GC content can vary locally along the chromosome, with some regions significantly more or less GC rich than on average. We have examined how the GC content varies within microbial genomes to assess whether this property can be associated with certain biological functions related to the organism's environment and phylogeny. We utilize a new quantity GCVAR, the intra-genomic GC content variability with respect to the average GC content of the total genome. A low GCVAR indicates intra-genomic GC homogeneity and high GCVAR heterogeneity.Entities:
Mesh:
Year: 2010 PMID: 20691090 PMCID: PMC3091660 DOI: 10.1186/1471-2164-11-464
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The distributions of GC difference within genomes. The histograms show the distribution of GC difference, D, (Equation (1) in the Methods section) for eight different microbial genomes. The blue curves are empirical density estimates, while the red curves are Gaussian densities using based on the same means and standard deviations as the empirical estimates. The upper panels show the statistical distributions for four AT rich genomes, while the lower panels show the distributions for four GC rich genomes.
The coefficient estimates from the GCVAR regression model
| Phylum | Acidobacteria | 2 | 60 | 7.8 | -0.23 | 0.05 |
| Phylum | Actinobacteria | 42 | 66 | 4.8 | -0.11 | 0.003 |
| Phylum | Bacteroides | 16 | 44 | 3.6 | 0.18 | <0.001 |
| Phylum | Betaproteobacteria | 64 | 64 | 3.4 | 0.1 | 0.002 |
| Phylum | Chlamydiae | 8 | 43 | 1.9 | -0.28 | <0.001 |
| Phylum | Crenarchaeota | 16 | 48 | 2 | 0.22 | <0.001 |
| Phylum | Cyanobacteria | 17 | 48 | 4.4 | 0.3 | <0.001 |
| Phylum | Deltaproteobacteria | 18 | 58 | 4.7 | 0.15 | 0.001 |
| Phylum | Epsilonproteobacteria | 12 | 38 | 1.9 | 0.1 | 0.04 |
| Phylum | Euryarcheota | 31 | 46 | 2.4 | 0.16 | <0.001 |
| Phylum | Firmicutes | 89 | 37 | 2.6 | 0.12 | <0.001 |
| Phylum | Gammaproteobacteria | 92 | 47 | 3.7 | 0.12 | <0.001 |
| Phylum | Planctomycetes | 1 | 55 | 7.2 | -0.48 | 0.002 |
| Phylum | Spirochaetes | 11 | 37 | 1.7 | 0.14 | 0.01 |
| Oxygen | Anaerobic | - | - | - | 0.11 | <0.001 |
| GC | - | - | - | - | 0.37 | <0.001 |
The variable GC is continuous while phylum and oxygen are categorical variables. Note that for the phylum variable we have used the sum-to-zero parameterization, i.e. all estimated effects are deviations from the mean phylum effect. For the oxygen requirement variable however, we used a relative parameterization where the category "aerobic" is the reference, i.e. the estimated effect is the deviation from the aerobic effect. In addition, the number of chromosomes, average %GC, and average genomes size in mbp, are included for each phylogenetic group.
Figure 2Significant effects on GC variation. The bars indicate 95% confidence intervals for the effects of various phyla (top panel) and oxygen requirements (lower panel) based on the regression model described by Equation (3) in the Methods section. Note that the values on the horizontal axis are scaled differently in the two panels. Categories with non-overlapping intervals can be said to differ significantly at a 5% level. Only significant effects are included.
Figure 3Overall relation between GC content and . The Figure shows GCVAR on the vertical axis versus %GC on the horizontal axis. The trend line is made using standard loess smoother.