| Literature DB >> 30691392 |
Sidra Aslam1, Xin-Ran Lan1, Bo-Wen Zhang1, Zheng-Lin Chen1, Li Wang1, Deng-Ke Niu2.
Abstract
BACKGROUND: Among the four bases, guanine is the most susceptible to damage from oxidative stress. Replication of DNA containing damaged guanines results in G to T mutations. Therefore, the mutations resulting from oxidative DNA damage are generally expected to predominantly consist of G to T (and C to A when the damaged guanine is not in the reference strand) and result in decreased GC content. However, the opposite pattern was reported 16 years ago in a study of prokaryotic genomes. Although that result has been widely cited and confirmed by nine later studies with similar methods, the omission of the effect of shared ancestry requires a re-examination of the reliability of the results.Entities:
Keywords: Aerobe; Anaerobe; Guanine oxidation; Nucleotide composition; Oxygen requirement; Phylogenetic generalized least squares (PGLS) regression; Phylogenetically independent; Reactive oxygen species
Mesh:
Year: 2019 PMID: 30691392 PMCID: PMC6350292 DOI: 10.1186/s12862-019-1365-8
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Pairwise comparison of GC content between aerobic and anaerobic prokaryotes. a Illustration of the difference between nonphylogenetically-controlled comparisons and phylogenetically-controlled comparison performed in this study. In a nonphylogenetically controlled comparison, the aerobes (including strain 1, species 7, species 8, and species 11) are compared to all the anaerobes (including strain 2, species 1–6, and species 9–10). However, only three changes in oxygen requirement are observed in the illustrated evolutionary tree. The differences in GC content between these three branches are likely to be associated with changes in the oxygen requirement. Therefore, only three pairs should be included in a phylogenetically controlled comparison. For branches having multiple strains/species with different evolutionary rates (e.g., species 4–8), we paired the slowly evolved aerobic strain/species with the slowly evolved anaerobic strain/species (species 6 vs species 7). In cases with two or more strains/species with identical divergence times, we preferentially selected the genomes in which more genes had been annotated. Next, the comparisons were duplicated using the dataset including the quickly evolved pairs (e.g., species 5 vs species 8 selected from species 4–8). Nearly identical results were obtained in the duplicated comparison. The results of the former are presented in Fig. 1b, c, and d and Table S1, and those of the latter are deposited as electronic supplementary material (Additional file 1: Figure S1 and Table S2). The choice of an anaerobe from species 4, 5 or 6 or an aerobe from species 7 or 8 did not alter the results. b Comparison of the GC content calculated from whole-genome sequences. c Comparison of GC content at the 4FDS of all protein-coding genes in each genome. d Comparison of GC content at the 4FDS of orthologous genes. The diagonal line represents cases in which aerobes and their paired anaerobes have the same GC content. Points above the line represent cases in which anaerobes have higher GC content than their paired aerobes, while points below the line indicate the reverse. All significance values were calculated using two-tailed Wilcoxon signed-rank tests
PGLS regression analysis of the relationship between aerobiosis and GC content
| whole-genome GC content | GC content at 4FDS | GC content at ZRS | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| No. of species | λ | Slope |
| λ | Slope |
| λ | Slope |
| |
| Aerobes + obligate aerobes (3) vs. anaerobes + obligate anaerobes (2) | 884 | 1.0 | 0.44 | 0.552 | 1.0 | −0.25 | 0.882 | 1.0 | 0.42 | 0.153 |
| Aerobe (3) vs. anaerobe (2) | 799 | 1.0 | 0.34 | 0.661 | 1.0 | −0.29 | 0.865 | 1.0 | 0.36 | 0.239 |
| Obligate aerobes (3) vs. obligate anaerobes (2) | 85 | 1.0 | 8.82 | 0.004 | 1.0 | 15.0 | 0.023 | 1.0 | 4.36 | 6 × 10−4 |
| Obligate aerobes (3) vs. anaerobes (2) | 333 | 1.0 | 5.21 | 0.001 | 1.0 | 7.87 | 0.037 | 1.0 | 4.50 | 0.005 |
| Obligate aerobes (3) vs. aerobes (2) | 546 | 1.0 | 1.47 | 0.014 | 1.0 | 3.19 | 0.014 | 1.0 | 2.13 | 0.324 |
| Aerobes (3) vs. obligate anaerobes (2) | 551 | 1.0 | 3.26 | 0.077 | 1.0 | 6.75 | 0.096 | 1.0 | −0.68 | 0.294 |
| Anaerobes (3) vs. obligate anaerobes (2) | 338 | 1.0 | 0.86 | 0.407 | 1.0 | 3.50 | 0.143 | 1.0 | −2.26 | 0.225 |
The numbers in parentheses are those assigned to each group of organisms in PGLS regression analyses. The data used in this analysis are deposited in Additional file 2. 4FDS: 4-fold degenerate sites, ZRS: zerofold redundant sites