| Literature DB >> 21241503 |
Christian M Zmasek1, Adam Godzik.
Abstract
BACKGROUND: Genome size and complexity, as measured by the number of genes or protein domains, is remarkably similar in most extant eukaryotes and generally exhibits no correlation with their morphological complexity. Underlying trends in the evolution of the functional content and capabilities of different eukaryotic genomes might be hidden by simultaneous gains and losses of genes.Entities:
Mesh:
Year: 2011 PMID: 21241503 PMCID: PMC3091302 DOI: 10.1186/gb-2011-12-1-r4
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1An overview of a current model of eukaryote evolution [30,67]. Numbers in brackets indicate the number of genomes from each branch analyzed in this work.
Figure 2Domain gains and losses during eukaryote evolution. (a) Inferred domainome sizes for ancestral genomes on the path from the LECA to mammals are shown on the left. (b) The numbers of gained protein domains per branch (edge), inferred by Dollo parsimony, are shown in green, whereas inferred losses are shown in red. (c) The numbers of distinct domains per genome in extant species are shown on the right side; for groups of species represented as triangles, these numbers are averages. Species, or groups of species, that are mostly parasitic are shown in grey. For more detailed data see Additional files 3 and 4. This figure was made using 'gathering' cutoffs provided by Pfam; for a corresponding figure using a E-value cutoff of 10-8, see Additional file 13.
Protein domain gains and loss comparison between lophotrochozoans and ecdysozoans
| Ancestor domains | Extant domains | |||||
|---|---|---|---|---|---|---|
| Gains | Losses | Present | Mean | Standard deviation | Genomes analyzed | |
| 16 | 545 | 4,215 | 3,605 | 320 | 3 | |
| Annelids | 16 | 721 | 4,039 | 3,602 | 393 | 2 |
| 24 | 736 | 4,032 | 3,202 | 190 | 12 | |
| Arthropods | 38 | 864 | 3,918 | 3,256 | 172 | 9 |
| Nematodes | 39 | 1,477 | 3,306 | 3,039 | 143 | 3 |
In this table, gains and losses are relative to the last common ancestor of deuterostomes and protostomes, the Urbilateria. For the calculation of extant domain statistics, data from parasitic species is omitted (the nematode Brugia malayi and the flatworm Schistosoma mansoni).
Functional differences in gained and lost domains
| Biological regulation | Metabolic process | |||
|---|---|---|---|---|
| Gains | Losses | Gains | Losses | |
| LECA to mammals | 12.0 | 5.2 | 8.1 | 21.6 |
| LECA to plants | 2.6 | 8.2 | 14.7 | 33.5 |
| LECA to homobasidiomycetes | 4.3 | 13.6 | 7.3 | 45.8 |
Average domain gain/loss counts per tree branch (edge) are shown.
Enriched gained and lost Gene Ontology terms along path from Unikonta to Mammalia
| Enriched gained GO terms | Enriched lost GO terms | |||
|---|---|---|---|---|
| Protein import into peroxisome matrix, docking | 9.5E-3 | |||
| 1.9E-2 | ||||
| Organelle organization* | 2.6E-2 | |||
| 1.3E-2 | Protein-heme linkage | 5.2E-3 | ||
| 1.0E-2 | ||||
| 2.2E-3 | 1.6E-5 | |||
| 9.2E-3 | 3.1E-4 | |||
| 1.2E-7 | 1.1E-4 | |||
| 2.3E-3 | ||||
| Cell-matrix adhesion | 4.0E-4 | 6.9E-3 | ||
| Apoptosis | 3.1E-4 | Protein folding | 1.7E-3 | |
| and Cnidaria) | Peptide cross-linking | 4.7E-4 | Transcription initiation | 3.8E-3 |
| Mitochondrial electron transport, NADH to ubiquinone | 8.3E-6 | 3.3E-4 | ||
| 2.3E-3 | ||||
| 2.7E-4 | 5.0E-3 | |||
| Protein transport | 8.2E-2 | 7.0E-4 | ||
| Phosphoenolpyruvate-dependent sugar phosphotransferase system | 3.2E-3 | |||
| 3.2E-3 | 2.1E-2 | |||
| 6.7E-3 | ||||
| Antigen processing and presentation | 5.5E-3 | 2.3E-3 | ||
| Protein amino acid phosphorylation | 1.8E-2 | 3.0E-3 | ||
| Immune response | 4.4E-11 | DNA topological change | 2.0E-3 | |
| 1.6E-5 | 3.1E-3 | |||
| 1.3E-2 | Valyl-tRNA aminoacylation | 4.3E-3 | ||
| Synaptic transmission | 2.0E-2 | Response to water | 8.6E-3 | |
| Immune response | 1.8E-3 | 9.2E-8 | ||
| 1.0E-3 | ||||
| Defense response | 2.0E-3 | 1.8E-3 | ||
| Hemopoiesis | 2.8E-3 | 1.1E-2 | ||
| Reciprocal meiotic recombination | 8.3E-3 |
The two terms with the lowest P-values are shown (calculated by the Ontologizer 2.0 software [63] with the Topology-Elim algorithm [64]), with the exception of the four terms marked by an asterisk, due to the relevance of these terms for this work. Prototypical regulatory terms are in bold text, prototypical metabolic terms are in italics (Additional files 5 and 6 list all gained and lost domains together with their associated GO terms and Additional file 14 summarizes the results of using different parameters in Ontologizer 2.0 software).
Enriched gained and lost Gene Ontology terms for select clades
| Enriched gained GO terms | Enriched lost GO terms | |||
|---|---|---|---|---|
| Corticata | 3.2E-7 | 7.9E-3 | ||
| Photosynthesis | 2.3E-6 | Lipid transport | 1.1E-2 | |
| Archaeplastida | Photosynthesis | 3.1E-25 | 9.1E-4 | |
| Glycyl-tRNA aminoacylation | 1.8E-2 | 4.8E-3 | ||
| Viridiplantae | Photosynthesis | 2.7E-3 | 7.1E-3 | |
| Protein import into mitochondrial outer membrane | 7.1E-3 | 1.3E-2 | ||
| Protostomia | Sensory perception of smell | 4.3E-3 | Protein secretion by the type II secretion system | 3.4E-3 |
| Cell adhesion | 4.2E-3 |
The two terms with the lowest P-values are shown (calculated by the Ontologizer 2.0 software [63] with the Topology-Elim algorithm [64]). Prototypical regulatory terms are in bold text, prototypical metabolic terms are in italics (for detailed results see Additional files 5 and 6).
Figure 3Dynamics of genomes during animal evolution. The functional contents of inferred ancestral genomes from the LECA to hominids (humans and great apes) are shown. (a) GO categories involved in various aspects of regulation. (b) GO categories involved in various aspects of metabolism (for detailed results see Additional files 5 and 6). Divergence time estimates are based on the fossil record and thus are minimum time constrains [39,68,69]. Geological periods are indicated on both panels ('Ed' stands for Ediacaran period and 'Cz' for Cenozoic era).
Figure 4Classifying eukaryotes by the functional profiles of their genomes. A two-dimensional plot of regulatory function versus catalytic activity percentages for ancestral and extant domainomes.