| Literature DB >> 23821522 |
Yuri I Wolf1, Sagi Snir, Eugene V Koonin.
Abstract
The shape of the distribution of evolutionary distances between orthologous genes in pairs of closely related genomes is universal throughout the entire range of cellular life forms. The near invariance of this distribution across billions of years of evolution can be accounted for by the Universal Pace Maker (UPM) model of genome evolution that yields a significantly better fit to the phylogenetic data than the Molecular Clock (MC) model. Unlike the MC, the UPM model does not assume constant gene-specific evolutionary rates but rather postulates that, in each evolving lineage, the evolutionary rates of all genes change (approximately) in unison although the pacemakers of different lineages are not necessarily synchronized. Here, we dissect the nearly constant evolutionary rate distribution by comparing the genome-wide relative rates of evolution of individual genes in pairs or triplets of closely related genomes from diverse bacterial and archaeal taxa. We show that, although the gene-specific relative rate is an important feature of genome evolution that explains more than half of the variance of the evolutionary distances, the ranges of relative rate variability are extremely broad even for universal genes. Because of this high variance, the gene-specific rate is a poor predictor of the conservation rank for any gene in any particular lineage.Entities:
Keywords: evolutionary rate; molecular clock; universal genes; universal pacemaker of genome evolution
Mesh:
Year: 2013 PMID: 23821522 PMCID: PMC3730350 DOI: 10.1093/gbe/evt098
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Relative Evolutionary Rate and Conservation Rank Variation among Nearly Universal Genes of Bacteria and Archaea
| P1, T1 | P2, T2 | |
|---|---|---|
| Bacterial clades (pairs, triplets) | 48, 32 | 52, 43 |
| Archaeal clades (pairs, triplets) | 6, 3 | 5, 3 |
| Mean standard deviation factor within 100 COGs | ×1.84 | ×1.51 |
| Mean standard deviation factor within 100 COGs expected from sampling fluctuation | ×1.26 | ×1.16 |
| 38% of total | 37% of total | |
| Spearman correlation between rate variation and COG profile length | −0.54 | −0.36 |
| Spearman correlation between rate variation and relative rate | −0.44 | −0.59 |
| Mean rank interquartile distance | 0.19 | 0.16 |
| Mean interquartile distance for genomic ranks of COGs with median evolution rates | 0.26 | 0.25 |
| Mean interquartile distance factor for sister branch ratios | ×1.68 | ×1.61 |
FSchematic of the comparative analysis of sets of closely related species of archaea and bacteria.
FRelative evolution rates. (A) Distributions of the short-term (obtained from distances in the P1 and P2 data sets) and long-term (obtained under the UPM model) relative evolutionary rates. (B) Correlation between short-term (P1) and long-term (UPM) evolutionary rates. (C) Correlation between short-term (P1 and P2) evolutionary rates.
FRelative conservation ranks and relative evolutionary rates. (A) Correlation between the relative evolutionary rates and the median conservation ranks (P1). (B) Correlation between the relative evolutionary rates and the median conservation ranks (P2). (C) Correlation between the median conservation ranks (P1 and P2).
FVariation of the relative conservation ranks. (A) Set P1. (B) Set P2. The boxes show interquartile distances; the whiskers show the full range.
Examples of (Nearly) Universal Genes with Diverse Biological Functions and Varying Ranges of Conservation Ranks
| COG | Max | q3 | Med | q1 | Min | Protein Name/Function |
|---|---|---|---|---|---|---|
| COG0048 | 1.000 | 0.999 | 0.993 | 0.987 | 0.888 | Ribosomal protein S12 |
| COG0099 | 1.000 | 0.989 | 0.967 | 0.924 | 0.658 | Ribosomal protein S13 |
| COG0092 | 0.996 | 0.980 | 0.962 | 0.887 | 0.546 | Ribosomal protein S3 |
| COG0096 | 0.996 | 0.982 | 0.961 | 0.931 | 0.625 | Ribosomal protein S8 |
| COG0250 | 0.999 | 0.985 | 0.936 | 0.820 | 0.581 | Transcription antiterminator |
| COG0174 | 0.992 | 0.970 | 0.928 | 0.870 | 0.244 | Glutamine synthetase |
| COG0057 | 0.992 | 0.958 | 0.904 | 0.832 | 0.365 | Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase |
| COG0504 | 0.983 | 0.927 | 0.880 | 0.806 | 0.392 | CTP synthase (UTP-ammonia lyase) |
| COG0436 | 0.984 | 0.926 | 0.873 | 0.783 | 0.245 | Aspartate/tyrosine/aromatic aminotransferase |
| COG0469 | 0.951 | 0.912 | 0.859 | 0.698 | 0.355 | Pyruvate kinase |
| COG0548 | 0.979 | 0.919 | 0.834 | 0.634 | 0.012 | Acetylglutamate kinase |
| COG0449 | 0.973 | 0.865 | 0.814 | 0.684 | 0.325 | Glucosamine 6-phosphate synthetase; contains amidotransferase and phosphosugar isomerase domains |
| COG0152 | 1.000 | 0.900 | 0.779 | 0.647 | 0.358 | Phosphoribosylaminoimidazole-succinocarboxamide (SAICAR) synthase |
| COG0495 | 0.969 | 0.838 | 0.748 | 0.637 | 0.305 | Leucyl-tRNA synthetase |
| COG0524 | 0.981 | 0.808 | 0.748 | 0.511 | 0.043 | Sugar kinases; ribokinase family |
| COG0329 | 0.996 | 0.845 | 0.700 | 0.581 | 0.090 | Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
| COG0149 | 0.988 | 0.803 | 0.657 | 0.450 | 0.120 | Triosephosphate isomerase |
| COG0177 | 0.881 | 0.782 | 0.649 | 0.457 | 0.172 | Predicted EndoIII-related endonuclease |
| COG0084 | 0.984 | 0.682 | 0.594 | 0.434 | 0.152 | Mg-dependent DNase |
| COG0778 | 0.838 | 0.648 | 0.565 | 0.383 | 0.145 | Nitroreductase |
Note.—Highest, 3rd quartile; median, 1st quartile; and lowest relative conservation ranks are shown for selected COGs in the P1 set.
FVariation of the ratio of sister branch lengths. (A) Set T1. (B) Set T2. The boxes show interquartile distances; the whiskers show the full range.