| Literature DB >> 17500592 |
Abstract
Genes vary greatly in their long-term phylogenetic stability and there exists no general explanation for these differences. The cytochrome P450 (CYP450) gene superfamily is well suited to investigating this problem because it is large and well studied, and it includes both stable and unstable genes. CYP450 genes encode oxidase enzymes that function in metabolism of endogenous small molecules and in detoxification of xenobiotic compounds. Both types of enzymes have been intensively studied. My analysis of ten nearly complete vertebrate genomes indicates that each genome contains 50-80 CYP450 genes, which are about evenly divided between phylogenetically stable and unstable genes. The stable genes are characterized by few or no gene duplications or losses in species ranging from bony fish to mammals, whereas unstable genes are characterized by frequent gene duplications and losses (birth-death evolution) even among closely related species. All of the CYP450 genes that encode enzymes with known endogenous substrates are phylogenetically stable. In contrast, most of the unstable genes encode enzymes that function as xenobiotic detoxifiers. Nearly all unstable CYP450 genes in the mouse and human genomes reside in a few dense gene clusters, forming unstable gene islands that arose by recurrent local gene duplication. Evidence for positive selection in amino acid sequence is restricted to these unstable CYP450 genes, and sites of selection are associated with substrate-binding regions in the protein structure. These results can be explained by a general model in which phylogenetically stable genes have core functions in development and physiology, whereas unstable genes have accessory functions associated with unstable environmental interactions such as toxin and pathogen exposure. Unstable gene islands in vertebrates share some functional properties with bacterial genomic islands, though they arise by local gene duplication rather than horizontal gene transfer.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17500592 PMCID: PMC1866355 DOI: 10.1371/journal.pgen.0030067
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Excerpt from Tree of Vertebrate CYP450 Proteins
Shown is one segment of 61 proteins from a maximum-likelihood protein tree of all 628 CYP450 proteins from 10 vertebrate species. The complete tree is shown in Figure S1. Protein names are color coded by species and include a genus-species prefix (e.g., “hs” is H. sapiens). Human and mouse protein names end with the chromosome and position (in Mb) of the corresponding gene. Solid black squares mark human proteins with a known endogenous substrate. Solid blue circles mark human proteins with known xenobiotic substrates. Red asterisks mark genes for which positive selection was detected; tests for positive selection were performed only on primate and rodent genes. Other markings indicate enzyme activity, gene clusters in human (blue), and mouse (pink), and specific groups tested where positive selection was not found (green). Numbers on tree branches show percent bootstrap support for all branches important for interpretation. Almost half of the missing cow CYP46A1 ortholog is present in the Btau3.1 assembly based on a TBLASTN search (Chromosome 21, plus strand at ~58.4 Mb) (see Table S1). Though not apparent from this tree segment, clear expansions in the number of CYP450 genes are present on other parts of the complete tree for fish, frog, and chicken (Figure S1).
Gene Group Counts
Recent Probable Duplications