| Literature DB >> 25808902 |
Yoshiki Yasukochi1, Yoko Satta2.
Abstract
The human cytochrome P450 (CYP) 2D6 gene is a member of the CYP2D gene subfamily, along with the CYP2D7P and CYP2D8P pseudogenes. Although the CYP2D6 enzyme has been studied extensively because of its clinical importance, the evolution of the CYP2D subfamily has not yet been fully understood. Therefore, the goal of this study was to reveal the evolutionary process of the human drug metabolic system. Here, we investigate molecular evolution of the CYP2D subfamily in primates by comparing 14 CYP2D sequences from humans to New World monkey genomes. Window analysis and statistical tests revealed that entire genomic sequences of paralogous genes were extensively homogenized by gene conversion during molecular evolution of CYP2D genes in primates. A neighbor-joining tree based on genomic sequences at the nonsubstrate recognition sites showed that CYP2D6 and CYP2D8 genes were clustered together due to gene conversion. In contrast, a phylogenetic tree using amino acid sequences at substrate recognition sites did not cluster the CYP2D6 and CYP2D8 genes, suggesting that the functional constraint on substrate specificity is one of the causes for purifying selection at the substrate recognition sites. Our results suggest that the CYP2D gene subfamily in primates has evolved to maintain the regioselectivity for a substrate hydroxylation activity between individual enzymes, even though extensive gene conversion has occurred across CYP2D coding sequences.Entities:
Keywords: CYP2D6; CpG site methylation; drug metabolism; gene conversion; molecular evolution; purifying selection
Mesh:
Substances:
Year: 2015 PMID: 25808902 PMCID: PMC4419802 DOI: 10.1093/gbe/evv056
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FNJ tree of the CYP2D genes of primates, based on synonymous substitutions of the entire coding region. The modified Nei–Gojobori method was used as a distance measure with Jukes–Cantor correction. The internal node number represents bootstrap values (>50%). The diamond represents a gene divergence. The CYP2D sequences indicated in bold were obtained from the whole-genome sequence data, and nucleotide sequences of some fragments were determined in our previous study (Yasukochi and Satta 2011). The mouse Cyp2d22 was used as an outgroup sequence. Hosa, human; papa, pygmy chimpanzee; patr, chimpanzee; poab, orangutan; mamu, rhesus monkey; mafa, cynomolgus monkey; caja, white-tufted-ear marmoset; mumu, house mouse.
F(A) The possible topologies at parsimony informative sites. In analysis, α topology was expected when a gene duplication event occurred before speciation, whereas β topology was expected when gene conversion or recombination occurred after its duplication. H6, human CYP2D6 gene; H7, human CYP2D7P; C6, chimpanzee CYP2D6; C7, chimpanzee CYP2D7. (B) The alignment of nucleotide sequences based on parsimony informative sites in the CYP2D6 and CYP2D7(P) of the human and chimpanzee. The white column indicates α topology. The filled black column indicates β topology. The filled gray column indicates γ topology.
Two-Sample Runs Test for Detection of Gene Conversion
| Sister Group Status | A | B | |||
|---|---|---|---|---|---|
| (β, γ), α | 26 | 201 | ≈6.9 × 10−25 | 10 | 5 |
| ≈7.0 × 10−26 | 9 | 4 | |||
| ≈2.6 × 10−27 | 8 | 4 | |||
| ≈1.9 × 10−28 | 7 | 3 | |||
| ≈5.1 × 10−30 | 6 | 3 | |||
| ≈2.4 × 10−31 | 5 | 2 | |||
| ≈4.2 × 10−33 | 4 | 2 | |||
| ≈9.5 × 10−35 | 3 | 1 | |||
| ≈8.5 × 10−37 | 2 | 1 | |||
| ≪0.001 | ≤10 | ||||
| (α, γ), β | 202 | 25 | ≈5.9 × 10−25 | 10 | 5 |
| ≈6.3 × 10−26 | 9 | 4 | |||
| ≈2.3 × 10−27 | 8 | 4 | |||
| ≈1.7 × 10−28 | 7 | 3 | |||
| ≈4.7 × 10−30 | 6 | 3 | |||
| ≈2.3 × 10−31 | 5 | 2 | |||
| ≈4.1 × 10−33 | 4 | 2 | |||
| ≈9.5 × 10−35 | 3 | 1 | |||
| ≈8.5 × 10−37 | 2 | 1 | |||
| ≪0.001 | ≤10 |
Note.—α, a sister group state, [(H6, C6),(H7, C7))]; β, a sister group state, [(H6, H7),(C6, C7))]; γ, a sister group state, [(H6, C7),(H7, C6))]. H6, human CYP2D6; H7, human CYP2D7P; C6, chimpanzee CYP2D6; C7, chimpanzee CYP2D7. A, site defined as one that does not support a certain sister group status; B, site defined as one that supports the specified sister group status; Z, continuum of the same sister group status (run).
aZ = 2 k when the number of Z is even, and Z = 2 k + 1, when the number of Z is odd.
FNJ trees based on the genetic distances of three β regions in CYP2D6 and CYP2D7(P) genes of the human, chimpanzee, and orangutan. The genetic distances (three decimal places) were based on silent substitutions, which include synonymous substitutions in coding regions and changes in noncoding regions. H6, human CYP2D6 gene; H7, human CYP2D7P; C6, chimpanzee CYP2D6; C7, chimpanzee CYP2D7; O6, orangutan CYP2D6; O7, orangutan CYP2D7P.
FThree phylogenetic trees of primate CYP2D genes. (A) NJ tree based on amino acid sequences at SRSs. (B) NJ tree based on amino acid sequences at non-SRSs. (C) NJ tree based on nucleotide sequences in intron regions. A p-distance was used as a distance measure for amino acid sequences (A and B). A genetic distance of nucleotide sequences was calculated by using the Kimura two-parameter model (C). Diamond represents a gene divergence. Only bootstrap values over 50% are shown in the NJ tree. The mouse Cyp2d22 was used as an outgroup sequence. Hosa, human; papa, pygmy chimpanzee; patr, chimpanzee; poab, orangutan; mamu, rhesus monkey; caja; white-tufted-ear marmoset; mumu, house mouse.
FThe variability level at amino acid residues among the CYP2D6 genes in six primates by modified Wu–Kabat plot. Gray boxes indicate the SRSs identified by Gotoh (1992). The ordinate axis represents the variability of amino acid residues. The abscissa axis represents the position of the human CYP2D6 molecule.
FThe chronological order of evolutionary events in the primate CYP2D gene cluster, based on the results of this study. The number in a box represents the CYP2D isoforms. A cross mark signifies a loss of function (pseudogene). The number of the site on a box represents a putative deleterious mutation site. A filled arrow and stop mark represent frameshift and nonsense mutations, respectively. The number of the site on the bottom represents the total number of their mutations.
Estimates of ω Ratios for the Five Primate CYP2D8(P) Genes under Various Models
| Model | np | Likelihood | Model Comparison | |||
|---|---|---|---|---|---|---|
| A | All branches same | 10 | 0.34 | −3,220 | ||
| B | Hominidae branches | 13 | −3,205 | B versus A | 3.8 × 10−8 | |
| C | Hominidae branches | 12 | −3,205 | C versus B | 1.0 |
Note.—np, number of parameters.
Mean dS and dN Distances among the Functional CYP2D6/D7/D8 Genes and among CYP2D6 Genes
| No. of OTU | SRSs | Non-SRSs | Entire Coding Region | |||||
|---|---|---|---|---|---|---|---|---|
| Primate functional | 9 | 0.10 ± 0.017 | 0.79 | 0.04 ± 0.004 | 0.32 | 0.05 ± 0.005 | 0.12 ± 0.010 | 0.39 |
| Primate | 6 | 0.05 ± 0.012 | 0.48 | 0.03 ± 0.004 | 0.24 | 0.03 ± 0.004 | 0.11 ± 0.010 | 0.28 |
Note.—The dS and dN values are calculated by the modified Nei–Gojobori method with Jukes–Cantor (R = 2.25; Zhang et al. 1998).
aExcluding CYP2D7P genes of the human and orangutan and CYP2D8P genes of Hominidae.
*P < 0.05 (dN > dS; Wilcoxon signed-rank test).