| Literature DB >> 16420703 |
Etienne G J Danchin1, Philippe Gouret, Pierre Pontarotti.
Abstract
BACKGROUND: Gene losses played a role which may have been as important as gene and genome duplications and rearrangements, in modelling today species' genomes from a common ancestral set of genes. The set and diversity of protein-coding genes in a species has direct output at the functional level. While gene losses have been reported in all the major lineages of the metazoan tree of life, none have proposed a focus on specific losses in the vertebrates and mammals lineages. In contrast, genes lost in protostomes (i.e. arthropods and nematodes) but still present in vertebrates have been reported and extensively detailed. This probable over-anthropocentric way of comparing genomes does not consider as an important phenomena, gene losses in species that are usually described as "higher". However reporting universally conserved genes throughout evolution that have recently been lost in vertebrates and mammals could reveal interesting features about the evolution of our genome, particularly if these losses can be related to losses of capability.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16420703 PMCID: PMC1382263 DOI: 10.1186/1471-2148-6-5
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Phylogeny of Eukaryotes with gene loss pattern. Taxa for which a species whose genome is fully sequenced and which were used to determine clusters of ancestrally present genes lost in mammals are represented in Bold. Model species used are indicated in italics. Number of ancestral Opisthokonts gene families present at each node is indicated in green, and the number of gene families that have been lost is represented in red. Due to lack of genomic data we were unable to evaluate the number of ancestral genes still present in the Lophotrochozoan ancestor. The phylogeny represented is according to the Ecdysozoa hypothesis for the position of Caenorhabditis elegans, note that under the Coelomata hypothesis the loss pattern would be unchanged.
The twelve families of universally conserved genes missing in mammals
| Gene Family Name1 | A.t2 | S.c2 | A.g2 | D.m2 | C.e2 | Loss extends to | Function | EC | KEGG pathway |
| ACH | 0 | 1 | 1 | 1 | 1 | Deutero. | Acetyl-CoA Hydrolase, Pyruvate Metabolism | 3.1.2.1 | sce00620 |
| TPS | 4 | 1 | 1 | 1 | 2 | Deutero. | Trehalose 6P biosynthesis | 2.4.1.15 | Sce00500 |
| YD56 | 4 | 3 | 2 | 1 | 1 | Chordates | Multicopper Ion transporter | 1.-.-.- | XXX |
| YMT1 | 1 | 1 | 1 | 3 | 1 | Chordates | Putative oxidroeductase and K+ ion transporter | XXX | XXX |
| PNC1 | 0 | 1 | 1 | 1 | 1 | Chordates | Nicotinate and Nicotinamide Metabolism | 3.5.1.19 | sce00760 |
| YM74 | 3 | 2 | 1 | 1 | 1 | Chordates | Transcription factor activity/DNA Binding | XXX | XXX |
| THDH1/ILV1 | 1 | 1 | 1 | 1 | 2 | Chordates | Leucine, Isoleucine, Valine Biosynthesis. Threonine Metabolism | 4.3.1.19 | sce00260 |
| GLT | 1 | 1 | 1 | 1 | 1 | Chordates | Glutamate synthesis | 1.4.1.13 | sce00251 sce00910 |
| AMT/MEP -1,2,3 | 6 | 3 | 1 | 1 | 4 | Eu-chordates | Amonium Transporter | XXX | XXX |
| YKH1 | 0 | 1 | 1 | 1 | 9 | Mammals | Metabolism, oxidoreductase | XXX | XXX |
| URH1 | 2 | 1 | 3 | 3 | 2 | Mammals | Uridine catabolism, hydrolase activity | 3.2.2.3 | XXX |
1 Gene family name is given according to the gene's name in Saccharomyces cerevisiae.
2 Number of gene copies from the considered family present in the genome, for each model species, A.t = Arabidopsis thaliana, S.c = Saccharomyces cerevisiae, A.g = Anopheles gambiae, D.m = Drosophila melanogaster, C.e = Caenorhabditis elegans.
3 KEGG's biochemical pathway's accession number for Saccharomyces cerevisiae.
Figure 2phylogenetic tree of the TPS family. The tree clearly shows that all the genes represented here are orthologous. Bootstrap values are indicated to evaluate each node's robustness, for Neighbor Joining, Maximum Parsimony and Maximum Likelihood methods respectively. The Opisthokonts group is represented and total absence of deuterostomian species is remarkable. This tree, as all the trees in the present analysis, was constructed with the automated phylogenomic annotation pipeline available in the FIGENIX platform [24]. For a matter of clarity, only one representative and one close relative of each of the 4 core metazoan species selected for constructing the COGS are represented on this tree. The whole tree, available upon request, includes several other fungal, plant and protostomian species.
Figure 3general strategy used for identifying lineage-specific gene family losses. This chart represents the general strategy we adopted to find gene-families that were specifically lost in specific lineages of the tree of life. The 4 main steps are represented in yellow boxes, main results and data in blue boxes, and decisions in red and green boxes.