| Literature DB >> 20838590 |
Eduardo P C Rocha1, Edward J Feil.
Abstract
Entities:
Mesh:
Substances:
Year: 2010 PMID: 20838590 PMCID: PMC2936526 DOI: 10.1371/journal.pgen.1001104
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1The GC composition of genomes is strongly correlated with second codon (GC2) and 4-fold degenerate positions (GC4) [.
Second codon positions show low variability due to purifying selection on non-synonymous changes. 4-fold degenerate positions vary between 5% and 97% GC among published genomes. In the classical neutral scenario (red), 4-fold degenerate positions are nearly neutral and their composition results essentially from mutational patterns. These patterns are modified in bacteria that lose repair genes, such as mutators, which show additional AT pressure (grey area) [19]. In the selectionist view (blue), the composition of 4-fold degenerate positions results from selection for GC content, the mutational patterns are AT-rich relative to genome composition, and there are no neutral positions. Naturally, this is an idealized view of genomes that code for many additional overlapping signals that are under selection, e.g., codon usage bias, regulatory signals, etc.
Variables Historically Proposed to Explain GC Variation in Prokaryotes.
| Variable | Why? | But… |
| Background selection | GC-rich regions recombine more in | Unclear if the GC effect in recombination is general and strong enough to explain the observations. |
| Biased gene conversion | Repair resulting from conversion between mismatched sequences distorts sequence composition, increasing GC | Recombination increases the efficiency of selection, and thus also facilitates selection for GC. BGC cannot explain GC richness in nearly clonal bacteria. Observed recombination/mutation ratios do not correlate with GC content |
| DNA folding | In dsDNA, GC increases stability, whereas AT increasesflexibility | Unclear if GC-based stability is selected for in dsDNA given the observed low effect of temperature on GC content and the preference for AT-rich sequences at promoters. |
| Environment | Different environments contain bacteria differently enriched in GC | Mechanisms underlying this variable are unclear and could result from combinations of the other variables |
| Gene length | GC richness favors large genes by reducing the frequency of non-sense mutations. Gene GC content correlates with its length | Genomic GC content is at best weakly correlated with the average gene length, which does not vary widely between genomes |
| Genome length | Genome reduction is often driven by low effective population size(Ne) | Gene density being high in prokaryotes, genome length is a proxy of many variables. This renders clear biological interpretations difficult. |
| Mutation pressure | Mutations are AT rich | Does not explain the compositional gap between mutation patterns and actual composition of genomes. Does not explain the existence of GC-rich genomes. |
| Nitrogen-fixation | Selection to save nitrogen (N) use in DNA and RNA because both are N-rich molecules, A/T/U having 7 and G/C 8 N atoms. GC content is higher in N-fixers | GC content is higher in 2 genera of aerobic nitrogen fixers but lower in 2 anaerobic genera |
| Oxygen | Tightly packed GC-rich DNA might be less prone to oxidation. Synonymous Gs could have a sacrificial role in oxidizing environments. Aerobes are GC rich | It's hard to envisage selection of GC polymorphisms for future sacrificial roles. In general, G is the nucleotide most prone to oxidation. |
| Parasitism | Pathogens, plasmids, transposable elements, and bacteriophages are enriched in the costless and abundant AT | Does not explain the existence of GC-rich genomes. |
| Protein composition and folding | GC-rich codons encode amino acids biosynthetically cheaper | Selection on GC should not be driven by protein composition because purifying selection on GC content is strongest at degenerate and intergenic sites. |
| RNA folding | Practically all positions in bacterial genomes are transcribed, and GC-rich RNA structures are more stable. | Only stable RNAs, not all mRNAs, are strongly enriched in GC in thermophiles |
| Speciation & self- recognition | Different GC contents would favor speciation and recognition of self- from non-self DNA | It does not explain why there are traces of pervasive selection only for GC. |
| Temperature | GC richness increases thermostability of dsDNA, RNA structures, and codon-anticodon pairing | Association of optimal growth temperature with genomic GC is weak at best |
| UV radiation | AT-rich dinucleotides are more susceptible to form pyrimidine dimmers upon UV irradiation | No observable counter-selection of UV-susceptible dinucleotides |