| Literature DB >> 20333185 |
Chih-Horng Kuo1, Howard Ochman.
Abstract
Elevated levels of genetic drift are hypothesized to be a dominant factor that influences genome size evolution across all life-forms. However, increased levels of drift appear to be correlated with genome expansion in eukaryotes but with genome contraction in bacteria, suggesting that these two groups of organisms experience vastly different mutational inputs and selective constraints. To determine the contribution of small insertion and deletion events to the differences in genome organization between eukaryotes and prokaryotes, we systematically surveyed 17 taxonomic groups across the three domains of life. Based on over 5,000 indel events in noncoding regions, we found that deletional events outnumbered insertions in all groups examined. The extent of deletional bias, when measured by the total length of insertions to deletions, revealed a marked disparity between eukaryotes and prokaryotes, whereas the ratio was close to one in the three eukaryotic groups examined, deletions outweighed insertions by at least a factor of 10 in most prokaryotes. Moreover, the strength of deletional bias is associated with the proportion of coding regions in prokaryotic genomes. Considering that genetic drift is a stochastic process and does not discriminate the exact nature of mutations, the degree of bias toward deletions provides an explanation to the differential responses of eukaryotes and prokaryotes to elevated levels of drift. Furthermore, deletional bias, rather than natural selection, is the primary mechanism by which the compact gene packing within most prokaryotic genomes is maintained.Entities:
Keywords: genome evolution; genome size; indels; mutational spectra; organismal complexity
Year: 2009 PMID: 20333185 PMCID: PMC2817411 DOI: 10.1093/gbe/evp016
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Summary of Taxon Sampling
| Domain | Phylum | Genus | Sampled Genomes | NCBI Genome ID |
| A | Crenarchaeota | (( | ((246, 13935), 108) | |
| A | Euryarchaeota | (( | ((10632, 19639), 17889) | |
| B | Actinobacteria | (( | ((15642, 16725), 88) | |
| B | Chlorobi | (( | ((12606, 12609), 12607) | |
| B | Cyanobacteria | (( | ((13643, 13655), 230) | |
| B | Firmicutes | (( | ((76, 13403), 12388) | |
| B | Proteobacteria | (( | ((44, 196), 16249) | |
| B | Proteobacteria | (( | ((43, 10679), 12952) | |
| B | Proteobacteria | (( | ((272, 30313), 12475) | |
| B | Proteobacteria | (( | ((255, 16393), 23) | |
| B | Proteobacteria | (( | ((177, 192), 15768) | |
| B | Proteobacteria | (( | ((245, 312), 256) | |
| B | Proteobacteria | (( | ((225, 259), 313) | |
| B | Spirochetes | (( | ((13597, 18233), 3) | |
| E | Chordata | (( | NA | |
| E | Arthropoda | (( | NA | |
| E | Ascomycota | (( | NA |
NA, Not applicable.
A: Archaea; B: Bacteria; E: Eukaryota.
Parentheses denote the phylogenetic grouping of taxa in standard Newick tree format.
FExtent of indel bias in cellular genomes. (A) Ratios of deletion to insertion events. A ratio of less than one indicates a bias toward deletions. (B) Indel bias based on the total length of DNA gained and lost. A ratio of less than 1 indicates a bias toward DNA loss.
Summary of Indel Statistics
| Noncoding Regions Examined | Insertional Events | Deletional Events | |||||
| Domain | Genus | Number | Total Length (bp) | Number | Total Length (bp) | Number | Total Length (bp) |
| A | 14 | 9,210 | 20 | 173 | 69 | 2,015 | |
| A | 21 | 12,925 | 45 | 178 | 180 | 7,514 | |
| B | 28 | 17,017 | 21 | 121 | 136 | 6,089 | |
| B | 19 | 6,440 | 11 | 33 | 123 | 7,090 | |
| B | 14 | 4,761 | 15 | 64 | 79 | 3,334 | |
| B | 50 | 15,682 | 79 | 318 | 339 | 15,195 | |
| B | 34 | 30,775 | 123 | 1,433 | 370 | 11,582 | |
| B | 16 | 17,606 | 64 | 345 | 158 | 2,823 | |
| B | 10 | 10,915 | 31 | 471 | 35 | 507 | |
| B | 13 | 9,728 | 24 | 216 | 84 | 4,016 | |
| B | 22 | 3,401 | 9 | 35 | 121 | 7,516 | |
| B | 29 | 23,111 | 105 | 676 | 377 | 9,334 | |
| B | 12 | 4,909 | 8 | 9 | 59 | 8,223 | |
| B | 18 | 6,3532 | 17 | 59 | 199 | 9,335 | |
| E | 136 | 1,182,162 | 235 | 3,343 | 412 | 1,610 | |
| E | 170 | 235,213 | 204 | 1,372 | 385 | 2,582 | |
| E | 99 | 167,335 | 374 | 980 | 801 | 2,168 | |
A: Archaea; B: Bacteria; E: Eukaryota.
Sequence length in focal lineages before alignment.
FLength distribution of small indels across the three domains of life. (A) Archaea, (B) Bacteria, and (C) Eukaryotes.
FCorrelation between indel bias and genomic features in prokaryotes. (A) Gene density. (B) Genome size. Data points are labeled as follows: A, Sulfolobus; B, Methanococcus; C, Mycobacterium; D, Chlorobium; E, Synechococcus; F, Bacillus; G, Bartonella; H, Rickettsia; I, Wolbachia; J, Neisseria; K, Geobacter; L, Buchnera; M, Escherichia; and N, Borrelia.