| Literature DB >> 16197734 |
Emma E Thompson1, Hala Kuttab-Boulos, Matthew D Krasowski, Anna Di Rienzo.
Abstract
Members of the NR1I subfamily of nuclear receptors play a role in the transcriptional activation of genes involved in drug metabolism and transport. NR1I3, the constitutive androstane receptor (CAR), mediates the induction of several genes involved in drug response, including members of the CYP3A, CYP2B and UGT1A subfamilies. Large inter-individual variation in drug clearance has been reported for many drug metabolising enzyme genes. Sequence variation at the CAR locus could potentially contribute to variation in downstream targets, as well as to the substantial variation in expression level reported. We used a comparative genomics-based approach to select resequencing segments in 70 subjects from three populations. We identified 21 polymorphic sites, one of which results in an amino acid substitution. Our study reveals a common haplotype shared by all three populations which is remarkably similar to the ancestral sequence, confirming that CAR is under strong functional constraints. The level and pattern of sequence variation is approximately similar across populations, suggesting that interethnic differences in drug metabolism are not likely to be due to genetic variation at the CAR locus. We also identify several common non-coding variants that occur at highly conserved sites across four major branches of the mammalian phylogeny, suggesting that they may affect CAR expression and, ultimately, the activity of its downstream targets.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16197734 PMCID: PMC3525124 DOI: 10.1186/1479-7364-2-3-168
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Sequences used in Phylogenetic Analysis by Maximum Likelihood analysis.
| Gene | Domain | Species | Accessiona |
|---|---|---|---|
| CAR | Full-length | Human | NM_005122 |
| Rhesus | AY116212 | ||
| Mouse | NM_009803 | ||
| Rat | NM_022941 | ||
| Fur Seal | AB109554 | ||
| Baikal seal | AB109553 | ||
| LBD only | Dog | AAEX01049487 and AAEX01049488 | |
| DBD only | Cow | 30573, December 10, 2004 | |
| PXR | Full-length | Human | NM_003889 |
| Chimpanzee | Nov. 2003 chimpanzee Arachne assembly, NCBI Build 1 version 1, UCSC: panTro1 | ||
| Rhesus | AF454671 | ||
| Rabbit | AF188476 | ||
| Mouse | AF031814 | ||
| Rat | AF151377 | ||
| LBD only | Dog | AF454670 | |
| Pig | AF454672 | ||
| VDR | Full-length | Human | NM_000376 |
| Cotton-top tamarin | AF354232 | ||
| Mouse | NM_009504 | ||
| Rat | NM_017058 | ||
| DBD only | Chimpanzee | Nov. 2003 chimpanzee Arachne assembly, NCBI Build 1 version 1, UCSC: panTro1 | |
| TRα | Full-length | Human | NM_003250 |
| Sheep | Z68308 | ||
| Pig | AJ005797 | ||
| Rat | M18028 | ||
| Mouse | NM_178060 | ||
| DBD only | Chimpanzee | Nov. 2003 chimpanzee Arachne assembly, NCBI Build 1 version 1, UCSC: panTro1 | |
| FXR | Full-length | Human | NM_005123 |
| Chimp | Nov. 2003 chimpanzee Arachne assembly, NCBI Build 1 version 1, UCSC: panTro1 | ||
| Rabbit | AF497630 | ||
| Golder hamster | AF478441 | ||
| Mouse | NM_009108 | ||
| Rat | NM_021745 |
a All accession number refer to Genbank sequences unless otherwise specified.
Abbreviations: CAR, constitutive androstane receptor: PXR, pregnane X receptor; VDR, vitamin D receptor; TRα, thyroid receptor-α, FXR, farnesoid X receptor; DBD, DNA binding domain; LBD, ligand binding domain; NCBI, The National Center for Biotechnology Information; UCSC, The University of California Santa Cruz.
Figure 1MultiPipMaker plot illustrating sequence conservation at the . The vertical axis ranges between 50 per cent and 100 per cent sequence identity; areas within the plot shaded in pink correspond to the exons of CAR and those shaded in yellow correspond to exons of the predicted protein FLJ12770. Bars along the bottom indicate the segments included in the resequencing survey. Numbers across the bottom indicate the position relative to the reverse-complemented reference sequence (AL50974).
Summary statistics of sequence variation at the CAR locus.
| Sample | Sa | πb | θwc | TDd | θw/dive |
|---|---|---|---|---|---|
| African Americans | 16 | 0.50 (13) | 0.89 (33) | -1.37 (6) | 11.7 (57) |
| Europeans | 12 | 0.55 (41) | 0.66 (9.5) | -0.48 (23) | 8.7 (78) |
| Han | 6 | 0.42 | 0.33 | 0.72 |
a Number of segregating sites.
b Nucleotide diversity per base pair (× 10-3). Numbers in parentheses indicate the percentile rank of π relative to the distribution of π values in the Seattle SNP genes.
cWatterson's estimator of the population mutation rate parameter θ (= 4Nμ) per bp (× 10-3) [37].
d Tajima's D (TD) [38]. Numbers in parenthesis indicate the percentile rank of the TD value relative to the distribution of TD values in the Seattle SNP genes.
e Ratio of θW to the amount of sequence divergence between human and chimpanzee (per cent). Numbers in parenthesis indicate the percentile rank of the θW:div value relative to the distribution of θW:div values in the Seattle SNP genes.
Genomic features of polymorphic positions at the CAR locus.
| Positiona | Location | Nucleotide presentb | Derived allele frequency | Possible consequencesc | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Human | Chimp | Cow | Dog | Rat | Mouse | AA | Han | EU | |||
| 28187 | 5' sequence | n/a | n/a | n/a | n/a | 0.02 | 0 | 0 | Disrupts NF-1 site | ||
| 41893 | Intron 2 | G | 0 | 0 | 0.02 | ||||||
| 42202 | Intron 2 | - | - | 0.04 | 0 | 0 | Disrupts HNF-1 site | ||||
| 42218 | Intron 2 | C | C | 0.07 | 0 | 0 | Disrupts NF-1 site | ||||
| 42221 | Intron 2 | 0 | 0 | 0.02 | Disrupts NF-1 site | ||||||
| 42225 | Intron 2 | 0.07 | 0.18 | 0.21 | Disrupts NF-1 site | ||||||
| 42267 | Intron 2 | A | A | 0.04 | 0 | 0 | Disrupts C/EBP, USF sites | ||||
| 44896 | Intron 3 | 0.29 | 0.47 | 0.31 | |||||||
| 45014 | Exon 4 | 0.02 | 0 | 0 | |||||||
| 45045 | Exon 4 | 0.02 | 0 | 0 | Radical NS change; Disrupts predicted ESE | ||||||
| 45518 | Exon 5 | G | 0.16 | 0.45 | 0.42 | ||||||
| 45567 | Intron 5 | G | C | G | 0.05 | 0 | 0.02 | Disrupts HNF-1 site | |||
| 47331 | Intron 7 | - | - | - | 0.02 | 0 | 0 | ||||
| 47537 | Intron 8 | G | G | C | C | 0.05 | 0.15 | 0.21 | Disrupts NF-1 site | ||
| 47636 | Intron 8 | 0.26 | 0.09 | 0.23 | |||||||
| 48318 | Intron 8 | n/a | - | C | 0.02 | 0 | 0.02 | ||||
| 48342 | Intron 8 | n/a | C | 0.04 | 0 | 0 | |||||
| 48419 | Exon 9 | n/a | A | A | 0 | 0 | 0.02 | ||||
| 48621 | 3' sequence | C | T | C | C | 0 | 0 | 0.02 | |||
| 48692 | 3' sequence | T | G | G | 0.02 | 0 | 0.02 | ||||
| 48734 | 3' sequence | 0 | 0.02 | 0 | |||||||
a Positions are based on the reverse complement of AL50974.
b Nucleotides in bold are identical to that observed in chimpanzee. 'n/a' denotes missing data, while '-' indicates that the position is absent relative to human.
c Transcription factor binding site predictions are based on output from Match; exon splice enhancer (ESE) predictions are based on predictions from ESEfinder (see Methods).
Abbreviations: C/EBP, CCAAT enhancer binding protein; HNF, hepatocyte nuclear factor; NF, nuclear factor; USF, upsteam stumulatory factor.
McDonald-Kreitman test comparing levels of human polymorphism and divergence between human and chimpanzee at conserved versus non-conserved sites across distantly related mammals.
| Non-conserveda | Conserved | |
|---|---|---|
| Fixed | 19 | 5 |
| Polymorphicb | 11 | 4 |
| Fixed | 19 | 5 |
| Polymorphic | 1 | 4 |
a If the allele in chimpanzee (ancestral) was identical to that found in the same position in rodent, artiodactyl and carnivore, the site was considered conserved across three lineages.
b The singleton indel was not included in this analysis.
Figure 2Inferred haplotypes at . The chimpanzee sequence was used to infer the ancestral allele at each site, denoted with a grey box; derived alleles are indicated by black. The numbers on the right indicate the number of haplotypes in each population. Numbers across the top indicate the position of each polymorphic site relative to the reverse complemented reference sequence (AL50974).
Phylogenetic analysis by maximum likelihood (PAML) models from analysis of sequence data from the CAR, PXR, VDR, FXR and TRα genes (values in parentheses indicate percentage of codons falling in a particular ω category).
| PAML model | |
|---|---|
| Full sequence | M3, ω = 0.03 (51%), 0.47 (49%) |
| DBD only | M0, ω = 0.1598 |
| LBD only | M3, ω = 0.06 (68%), 0.72 (32%) |
| Full sequence | M3, ω = 0.00001 (77%), 0.333 (23%) |
| DBD only | M0, ω = 0.0085 |
| LBD only | M3, ω = 0.00001 (67%), 0.32 (33%) |
| Full sequence | M3, ω = 0.049 (79%), 0.666 (21%) |
| DBD only | M0, ω = 0.0441 |
| LBD only | M3, ω = 0.01 (51%), 0.26 (42%), |
| Full sequence | M0, ω = 0.02 |
| DBD only | M0, ω = 0.0001 |
| LBD only | M0, ω = 0.01 |
| Full sequence | M3, ω = 0.00001 (72%), 0.22 (28%) |
| DBD only | M0, ω = 0.0137 |
| LBD only | M3, ω = 0.00001 (83%), 0.295 (17%) |
Abbreviations: CAR, constitutive androstane receptor; PXR, pregnane X receptor;
VDR, vitamin D receptor; FXR, farnesoid X receptor; TRα, thyroid receptor-α; DBD,
DNA binding domain; LBD, ligand binding domain.