Literature DB >> 16709249

Linkage disequilibrium blocks, haplotype structure, and htSNPs of human CYP7A1 gene.

Kaori Nakamoto1, Shuang Wang, Robert D Jenison, Grace L Guo, Curtis D Klaassen, Yu-Jui Yvonne Wan, Xiao-bo Zhong.   

Abstract

BACKGROUND: Cholesterol 7-alpha-hydroxylase (CYP7A1) is the rate limiting enzyme for converting cholesterol into bile acids. Genetic variations in the CYP7A1 gene have been associated with metabolic disorders of cholesterol and bile acids, including hypercholesterolemia, hypertriglyceridemia, arteriosclerosis, and gallstone disease. Current genetic studies are focused mainly on analysis of a single nucleotide polymorphism (SNP) at A-278C in the promoter region of the CYP7A1 gene. Here we report a genetic approach for an extensive analysis on linkage disequilibrium (LD) blocks and haplotype structures of the entire CYP7A1 gene and its surrounding sequences in Africans, Caucasians, Asians, Mexican-Americans, and African-Americans. RESULT: The LD patterns and haplotype blocks of CYP7A1 gene were defined in Africans, Caucasians, and Asians using genotyping data downloaded from the HapMap database to select a set of haplotype-tagging SNPs (htSNP). A low cost, microarray-based platform on thin-film biosensor chips was then developed for high-throughput genotyping to study transferability of the HapMap htSNPs to Mexican-American and African-American populations. Comparative LD patterns and haplotype block structure was defined across all test populations.
CONCLUSION: A constant genetic structure in CYP7A1 gene and its surrounding sequences was found that may lead to a better design for association studies of genetic variations in CYP7A1 gene with cholesterol and bile acid metabolism.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16709249      PMCID: PMC1488870          DOI: 10.1186/1471-2156-7-29

Source DB:  PubMed          Journal:  BMC Genet        ISSN: 1471-2156            Impact factor:   2.797


Background

Cholesterol 7-alpha-hydroxylase (CYP7A1) catalyzes the first reaction in the cholesterol catabolic pathway in liver. This pathway converts cholesterol to bile acids, which is the primary mechanism for the removal of cholesterol from the body. The CYP7A1 catalytic reaction is the rate-limiting step and the major site for regulating homeostasis of cholesterol and bile acids. The gene encoding CYP7A1 was cloned by using a rat homolog probe [1] and mapped to chromosome 8q11 [2]. The CYP7A1 gene spans about 10 kb and contains 6 exons, 5 introns, one 5'-UTR, and one 3'-UTR. In its 5' flanking region, consensus recognition sequences for a number of transcription factors were identified [2]. A TATA box and a modified CAAT box were also identified in the promoter region of the CYP7A1 gene [3]. Numerous laboratories have illustrated a multiplex nuclear receptor mediated network that controls CYP7A1 gene expression and maintains cholesterol and bile acid balance [4]. Within this network, nuclear receptors of farnesoid X receptor (FXR), liver X receptor (LXR), retinoid X receptor (RXR), small heterodimer partner (SHP), and liver receptor homologue 1 (LRH1) are involved in a positive-versus-negative regulation. Using a FXR-deficient (-/-) mouse model, we have demonstrated feedback suppression on CYP7A1 gene transcription by FXR [5,6]. Genetic variations in the CYP7A1 gene associated to disorders of cholesterol and bile acid metabolism have been studied extensively in different laboratories. Most studies have focused on a single nucleotide polymorphism (SNP) in the promoter region of the CYP7A1 gene. This is an A/C transversion polymorphism at -278 from the translation initiation codon, or -204 from the transcriptional start site. This polymorphism was first reported by Wang et al. [7] to link to high plasma low-density lipoprotein cholesterol concentrations. Association of this polymorphism to plasma lipid levels, hypertriglyceridemia, hypercholesterolemia, and risk to arteriosclerosis, gallstone disease, and colorectal cancer has been studied in adults and children in Caucasian and Asian populations with conflicting results [8-19]. A CYP7A1 enzyme deficiency caused by a homozygous 1302–1303 delTT deletion mutation in CYP7A1 exon 6, leading to a frameshift (L413fsX414), has been linked to a hypercholesterolaemic phenotype [20]. The information has indicated that genetic variations in the CYP7A1 gene have high impact on human cholesterol metabolic regulation and human health; however, these studies have mainly focused on a single polymorphism or a mutation. Linkage of genes for a complex disease relies on having a priori knowledge of linkage disequilibrium (LD) blocks and haplotype structure to identify polymorphisms that are associated with the disease. Therefore, it is important to determine whether there are LD blocks existing in the CYP7A1 gene in different populations. This information can be used to identify a set of haplotype-tagging SNP (htSNP) markers that can be used in an association study. The LD blocks and haplotype structure of CYP7A1 gene can be firstly defined in three general human populations of Africans, Asians, and Caucasians using a public-available database generated by the International HapMap Project [21]. The HapMap LD patterns and haplotype structure can serve as reference to select htSNPs for an association study. LD patterns and htSNPs defined by the HapMap Project are transferable to other populations in some loci, but may vary significantly in other loci [22]. To test whether the htSNPs identified in the HapMap populations are useful for association studies in other populations, we analyzed LD patterns and haplotype structures of CYP7A1 gene in both Mexican-American and African-American populations using the selected HapMap htSNPs. Mexican-American is the fastest growing population in USA, but genetic study on this population is extremely limited. Mexican-American genetic background is a mixture of European American (50–60%) (mainly Spanish), American Indian (30–40%), and African (<5%) [23]. African-American is the major minority population in USA and has an admixture genetic background from African and European Americans [24]. Genotyping of the selected htSNPs on these two populations can provide verification of transferability of the HapMap htSNPs among populations.

Results and discussion

Linkage disequilibrium blocks and haplotype structures of CYP7A1 gene in Caucasians, Africans, and Asians

A LD block is found in the HapMap Caucasians (CEU) spanning a 14-kb region from the proximal promoter (rs3824260) to the 3'-downstream (rs10504255) of the CYP7A1 gene (Figure 1. CEU-B1). A similar LD block from rs3824260 to the 3'-downstream was also reported in a Swedish population [18]. About 4.4 kb upstream from rs3824260, there is another LD block (CEU-B2) crossing a 3-kb region at the distal promoter region. Recombination between the two blocks is 0.84. Only five haplotypes with a frequency > 2% exist in CEU-B1 (Figure 2. CEU-B1H1 to CEU-B1H5). CEU-B1H1 and CEU-B1H2 are two common haplotypes, together representing a total of 68% of the haplotype frequency in CEU-B1. CEU-B1H1 carries common alleles at all markers except rs8192879 in 3'-UTR, whereas CEU-B1H2 is composed of less common alleles at 5 out of 8 loci. In CEU-B2, there are only two types of haplotypes (CEU-B2H1 and CEU-B2H2). CEU-B2H1 carries common alleles at all SNP loci, whereas CEU-B2H2 has less common alleles. A similar LD pattern is found in the HapMap African YRI (Figure 1), but the larger LD block (YRI-B1) is slight shorter (9 kb from rs8192879 to rs3824260) than CEU-B1. The haplotype structure is also similar between YRI and CEU, however, the frequency of each haplotype is different. The most common haplotype (55%) in YRI-B1, YRI-B1H1, has identical haplotype structure with CEU-B1H2, whereas the second common haplotype in YRI-B1, YRI-B1H2 (22.5%), is the same as CEU-B1H1. YRI-B1H1 and YRIB1H2 together add up to 77.5% of the total haplotypes in YRI-B1. In YRI-B2, the dominant haplotype YRI-B2H1 has the same haplotype structure as CEU-B2H2, whereas less common haplotype YRI-B2H2 is identical to the common haplotype CEU-B2H1. A similar recombination (0.81) is also found between YRI-B1 and YRI-B2. In CHB and JPT, only one LD block is found from the distal promoter to a part of the CYP7A1 gene. Although the JPT-B1 (16 kb, from rs8192879 to rs1023649) is larger than CHB-B1 (10 kb, from rs1457043 to rs1023650), LD is weak between rs8192879 in intron 4 and rs1457043 in intron 2 in JPT. CHB and JPT share almost the same haplotype structure within the block. JPT-B1H1, JPT-B1H2 and JPT-B1H3 are the same as CHB-B1H1, CHB-B1H2 and CHB-B1H3, respectively. Only CHB-B1H4 (6%) is unique in CHB.
Figure 1

Linkage disequilibrium of the SNP markers in the CYP7A1 gene in the HapMap populations of CEU, CHB, JPT and YRI. A standard color scheme is used to display LD with bright red color for very strong LD (LOD = 2 D' = 1), white color for no LD (LOD<2, D'<1), pink red (LOD = 2 D'<1), and blue (LOD<2 D' = 1) for intermediate LD.

Figure 2

Haplotype frequencies of the HapMap selected SNPs in the CYP7A1 gene in CEU, YRI, JPT, and CHB. In each haplotype, blue bars represent allele 1, whereas red bars represent allele 2 for correlated SNPs. Black bars indicate that the SNPs are not present in this population. Numbers next to each haplotype bar are haplotype frequencies. Up-side-down red triangles indicate htSNPs in the populations. In the crossing areas, a value of multiallelic D' is shown to represent the level of recombination between the two blocks.

Linkage disequilibrium of the SNP markers in the CYP7A1 gene in the HapMap populations of CEU, CHB, JPT and YRI. A standard color scheme is used to display LD with bright red color for very strong LD (LOD = 2 D' = 1), white color for no LD (LOD<2, D'<1), pink red (LOD = 2 D'<1), and blue (LOD<2 D' = 1) for intermediate LD. Haplotype frequencies of the HapMap selected SNPs in the CYP7A1 gene in CEU, YRI, JPT, and CHB. In each haplotype, blue bars represent allele 1, whereas red bars represent allele 2 for correlated SNPs. Black bars indicate that the SNPs are not present in this population. Numbers next to each haplotype bar are haplotype frequencies. Up-side-down red triangles indicate htSNPs in the populations. In the crossing areas, a value of multiallelic D' is shown to represent the level of recombination between the two blocks. In comparison of LD and haplotype structure among the HapMap populations, strong LD is found from the distal promoter region to intron 2 of the CYP7A1 gene across the HapMap populations. Two common haplotypes with complete opposite alleles at all loci (common-versus-less common alleles) within this region count for more than 85% of total haplotype frequencies in all four HapMap populations. A diverted LD degree exists between intron 2 and the 3'-downstream region from high to low across CEU, YRI, JPT, and CHB.

Genotyping of htSNPs in Mexican-Americans and African-Americans

Because of the strong LD in the CYP7A1 genes, some markers correlate 100% with each other in a population. Only a subset of representative SNPs is necessary for defining a haplotype. These SNPs can tag either neighboring markers or a set of common haplotypes within an LD block. The htSNPs in CYP7A1 were selected using Tagger, implemented in the HaploView 3.12, which combines the simplicity of pairwise methods with the potential efficiency of multimarker approaches [25]. The CYP7A1 htSNPs are different in the various populations (see upside-down red triangles in Figure 2 for each HapMap population). Some markers tag on all three populations, but others for only one or two. It has been suggested that the populations genotyped in the HapMap project may serve as reference populations for the selection of htSNP markers in association studies [26]. Nine SNP markers and one short deletion marker were selected (see detail in Table 3), in which, eight are htSNP markers defined by the HapMap populations, including rs3808607, a functional polymorphism at A-278C in the promoter region. Two functional mutations were also included. One is a two-base deletion in exon 6 (1302 delTT) causing a frame shift and CYP7A1 enzyme deficiency [20]. The other one is a C/T SNP in exon 3, causing an amino acid change at Asn233Ser. This is the only non-synonymous SNP reported in NCBI SNP database in the CYP7A1 gene.
Table 3

Ten selected genetic variations for CYP7A1 genotyping.

SNPAllele 1/2PositionNote
rs10504255A/G3'-downstreamHapMap htSNP in CEU
rs8192879C/T3'-UTRHapMap htSNP in CEU, JPT, and YRI
1302 delTTTT/--Exon 6Frameshift, CYP7A1 deficiency, high hepatic cholesterol content
rs11786580C/TIntron 4HapMap htSNP in CEU, JPT, and YRI
rs8192874C/TExon 3Non-synonymous SNP, Asn33Ser
rs216245953A/GIntron 1HapMap htSNP in JPT
rs3808607T/GPromoterHapMap htSNP in CEU and JPT
rs7833904T/A5-upstreamHapMap htSNP in YRI
rs1125226A/C5-upstreamHapMap htSNP in JPT
rs1023652C/G5'-upstreamHapMap htSNP in CEU
To perform genotyping of the 10 markers in the Mexican-American and African-American populations, a high-throughput and inexpensive SNP genotyping platform was developed using thin-film biosensor chips. We have reported a microarray platform for genotyping both SNPs and microsattelite repeat on thin-film biosensor chips [27,28]. The thin-film biosensor chip has excellent sensitivity of detection and extremely low non-specific binding, making it an excellent platform for discrimination of polymorphisms [29]. A positive reaction (blue color signal) can be visualized over the unreacted background (gold color) by an unaided human eye, without any instrumentation. Once the chips are printed, they are robust. Several thousands of genotypes can be performed in a 96-well plate in a laboratory with a standard molecular genetics setting within a few hours. Cost for reagents and materials to genotype 10 CYP7A1 htSNPs, including genomic DNA isolation, PCR reaction, and SNP genotyping on the thin-film biosensor chips, is ~US$0.20 per SNP per sample. It is relative less expensive than other high-throughput genotype platforms, such as TaqMan or Real-time PCR. To verify genotyping specificity on the thin-film biosensor chips, a pool of the synthetic targets for allele 1 or allele 2 was applied to a chip for hybridization and ligation. After signals were developed, the result images were captured by a black-white camera on a Nucleosite™ Image Analyzer (Biostar, Inc., Louisville, Colorado). High specificity was achieved on these synthetic targets with unambiguous genotypes (see images in Figure 3B and 3C). A negative control showed the signals are target dependent (Figure 3D). As a positive control for genotyping, 12 HapMap DNA samples were purchased from Coriell Cell Repositories (Camden, NJ), which are one family trio from YRI (NA18500, NA18501 and NA18502); one family trio from CEU (NA06985, NA06991, and NA06993); three independent individuals from CHB (NA18524, NA18526, and NA18529); and three independent individuals from JPT (NA18940, NA18942, and NA18943). Genotypes of the 8 HapMap htSNPs in the 12 HapMap samples were determined on thin-film biosensor chips. A 100% concordance was obtained between the 96 genotypes generated by thin-film biosensor chips and the 96 genotypes downloaded from the HapMap database which are generated by Illumina Bead Assay.
Figure 3

Genotyping of the 10 markers of CYP7A1 htSNPs and small deletion on thin-film biosensor chips. A. A design for arraying the capture probes on a thin-film biosensor chip. A pair of capture probes for each SNP were arrayed in duplicate next to each other with allele 1 left and allele 2 right. M indicates a positive control marker with 20 dATP and 3'-biotin. B. SNP discrimination with a pool of synthetic oligonucleotide targets of allele 1. C. SNP discrimination with a pool of synthetic oligonucleotide targets of allele 2. D. Negative control with no targets. E. A representative image showing genotypes of a Mexican-American individual with homozygous allele 1 for the most markers, but homozygous allele 2 for rs8192879. F. A representative image showing genotypes of a Mexican-American individual with homozygous allele 2 for the most markers, but homozygous allele 1 for rs18192879, rs11786580, rs8192874, 1302 TT. G. A representative image showing genotypes of a Mexican-American individual with heterozygous for the most markers except rs8192874, 1302 TT.

Genotyping of the 10 markers of CYP7A1 htSNPs and small deletion on thin-film biosensor chips. A. A design for arraying the capture probes on a thin-film biosensor chip. A pair of capture probes for each SNP were arrayed in duplicate next to each other with allele 1 left and allele 2 right. M indicates a positive control marker with 20 dATP and 3'-biotin. B. SNP discrimination with a pool of synthetic oligonucleotide targets of allele 1. C. SNP discrimination with a pool of synthetic oligonucleotide targets of allele 2. D. Negative control with no targets. E. A representative image showing genotypes of a Mexican-American individual with homozygous allele 1 for the most markers, but homozygous allele 2 for rs8192879. F. A representative image showing genotypes of a Mexican-American individual with homozygous allele 2 for the most markers, but homozygous allele 1 for rs18192879, rs11786580, rs8192874, 1302 TT. G. A representative image showing genotypes of a Mexican-American individual with heterozygous for the most markers except rs8192874, 1302 TT. To define the LD pattern and haplotype structures of CYP7A1 gene in Mexican-American and African-American populations, DNA samples from 90 healthy individuals for each population were randomly selected from our DNA bank. These DNA samples were collected by other research projects on alcoholism in the Mexican-American population [30] and pharmacogenomics of CYP enzymes in both Mexican-Americans and African-Americans [31,32]. Genotypes of the 10 selected markers on the 90 Mexican-American and 90 African-American subjects were determined by using the thin-film biosensor chip platform. Representative images of the different genotypes from different individuals are shown in Figure 3E, 3F, and 3G. Genotypes of each individual on the 10 markers were saved in linkage format and uploaded to HaploView. Observed genotype frequencies, allele frequencies, expected heterozygosity, and Hardy-Weinberg p-value of the 10 markers is summarized in Table 5. No significant HW p-values (<0.0010) were found. No TT deletion mutation at 1302 and C mutation at rs8192874 were detected in these two population samples. This indicates that these mutations have very low frequencies in the general populations. The genotyping data of the 8 htSNPs were uploaded to HaploView 3.12 to define LD patterns and haplotypes structures of CYP7A1. In Mexican-Americans, three LD blocks were identified. In comparison to the CEU LD blocks, MA-B3 in the distal promoter region has the same pattern as CEU-B2, but haplotype frequencies are different. MA-B3H1 has a frequency (78%) higher than CEU-B2H1 (55%). Unlike one big block in CEU, the CYP7A1 gene is divided by two LD blocks in the Mexican-American population. MA-B2 covers from proximal promoter to intron 2, whereas MA-B1 extents from 3'-UTR to 3'-downstream. The recombination frequencies between the blocks are 80–90%. In African-Americans, two LD blocks were recognized. AA-B2 has the identical structure as YRI-B2 and frequencies of the two haplotypes (AA-B2H1 and AA-B2H1) in the African-American population are almost the same as YRI-B2H1 and YRI-B2H2. AA-B1 is shorter than YRI-B1. The HapMap htSNPs are necessary SNP markers to capture all haplotypes in the MA and AA populations.
Table 5

Allele frequencies of the selected CYP7A1 htSNPs and mutation markers A. in the Mexican-American population.

#SNP IDVariation Allele 1/2Observed genotype frequenciesObserved allele frequenciesHardy-Wenberg

Homo Allele 1HeteroHomo Allele 2Allele 1Allele 2Expect heterozygousP value
1rs10504255A/G0.710.250.030.840.160.270.79
2rs8192879C/T0.410.420.180.610.390.470.34
3rs11786580C/T0.750.2500.870.130.220.45
4rs8192874T/C1.00001.000
5rs2162459A/G0.590.310.100.750.250.380.13
6rs3808607T/G0.620.300.090.760.240.370.09
7rs7833904T/A0.640.300.070.790.210.330.37
8rs1125226A/C0.620.300.080.780.220.340.18
9rs1023652C/G0.670.260.070.810.190.320.14
101302 delTTTT/–1.00001.000

B. in the African-American population

1rs10504255A/G0.810.180.010.840.100.181.00
2rs8192879C/T0.560.360.080.610.260.380.82
3rs11786580C/T0.600.360.040.870.220.340.90
4rs8192874T/C1.00001.000
5rs2162459A/G0.250.340.410.750.580.490.01
6rs3808607T/G0.230.360.400.760.580.490.03
7rs7833904T/A0.200.500.300.790.550.501.00
8rs1125226A/C0.200.490.310.780.560.491.00
9rs1023652C/G0.330.380.290.810.490.500.04
101302 delTTTT/--1.00001.000

* Shadows indicate SNP nucleotides in P1 probe and target sequences.

In summary, the human CYP7A1 gene and its surrounding sequences have constant genetic structures across all populations. This genetic structure can be divided into three components: (1) the distal promoter region, about 7-kb upstream from the transcriptional start code, there is a 3-kb LD block highly conserved across all populations. Only two haplotypes exist in this region in the most populations, except YRI. The most common haplotype in CEU and Mexican-American becomes the second most common haplotypes in YRI, African-American, CHB, and JPT populations. (2) A relative conserved LD block is present in the proximal part of CYP7A1 gene from the proximal promoter region (about 500 bp from the transcriptional start code) to intron 2 of CYP7A1. The two most conserved haplotypes count for up to 80 to 90% of the haplotype frequencies in all populations in this region. (3) A much diverted LD pattern is observed in the lower part of the CYP7A1 gene (from intron 4 to 3'-downstream). In CEU and YRI, a complete or partial LD block is merged to the block in the proximal part of the CYP7A1 gene. In Mexican-Americans, a LD block in this region is separated from the block in the proximal part of the CYP7A1 gene. In JPT, a weak linkage makes the proximal part block extended into the 3'-UTR. In CHB and African-Americans, there is no LD existing in this area.

Conclusion

Here we demonstrate a genetic approach to analyze LD patterns and haplotype blocks in CYP7A1 gene. Various degree of LD is found across different regions in different populations. A set of htSNPs is identified that can be used in an association study to capture common haplotypes in different populations. An inexpensive genotyping platform on thin-film biosensor chips is established to genotype the htSNPs. This chip technology can be applied in any laboratory with basic molecular genetic setting. The defined haplotype block structure in CYP7A1 gene may lead to a better design for genetic association studies to correlate genetic variations in CYP7A1 gene to cholesterol and bile acid metabolism and human diseases, such as gallstone disease. Because of high polymorphism and strong LD in the promoter region of CYP7A1, it should be considered in future studies to evaluate which CYP7A1 promoter haplotypes are more efficient for transcriptional regulation by its regulatory factors, such as FXR, LXR, RXR, PXR, SHP, and LRH1.

Methods

Human subject

The DNA samples for the HapMap come from a total of 270 people: 90 individuals from the Yoruba of Ibadan, Nigeria (YRI), (30 sets of trios, each trio with samples from two parents and an adult child); 90 individuals (30 sets of trios) from U.S. residents with northern and western European ancestry collected by the Centre d'Etude du Polymorphisme Humain (CEU); 45 unrelated individuals from the Tokyo area in Japan (JPT); and 45 unrelated individuals from Beijing, China (CHB). Two American population samples of Mexican-Americans and African-Americans were also used in this study. These DNA samples were collected by other research projects on alcoholism in the Mexican-American population [30] and pharmacogenomics of CYP enzymes in African-Americans [32]. Studies on these human subjects were approved by the Human Subjects Committee of the Kansas University Medical Center. Ninety DNA samples of healthy individuals were randomly selected from each population.

Analysis of linkage disequilibrium blocks and haplotype structure

LD patterns and haplotype structures of CYP7A1 gene in the HapMap populations were analyzed by using genotype data from the HapMap database. In the HapMap Phase II database, a total of about 5.9 million SNPs (about 1 SNP every 500 bp across the genome) are typed in the four HapMap populations [21]. Genotypes of each selected SNP in the 270 HapMap population samples can be downloaded from its database [34]. Fourteen SNPs are genotyped by the HapMap project in a total of 25 kb region with 10 kb of 5'-upstream flanking, 5 kb of 3'-downstream flanking, and the CYP7A1 gene sequences. The SNP density is about 1.8 kb per SNP. The CYP7A1 A-278C (or A-204C) promoter polymorphism is included with an ID number of rs3808607. A close promoter polymorphism C-554T, which was identified together with A-278C by Wang et al. [7], is also included as rs3824260. Chromosomal positions and locations in the CYP7A1 gene regions of the 14 SNPs are listed in Table 1 with polymorphic allele 1 for common allele and allele 2 for less common allele in CEU. Genotypes of each individual samples for the 14 SNP markers were dumped from the HapMap database and saved as a HapMap formatted file that can be opened directly by HaploView 3.12 for defining LD patterns and haplotype structure [25]. Four HapMap files were separately saved for CEU, CHB, JPT, and YRI, respectively. By uploading the files into HaploView, frequencies of allele 2, frequencies of observed heterozygous genotypes, and Hardy-Weinberg p-value for each marker were summarized for each population (see Table 2). No marker has a HW p-value smaller than the cutoff value of 0.0010 in the four populations. The LD between any two markers was defined by HaploView 3.12. A standard color scheme is used to display LD in Figure 1. A LD block was created by confidence intervals [25] if 95% of the informative comparisons are in strong LD using default algorithms of 95% confidence bounds on D prime. Haplotypes structure was defined by using an accelerated EM algorithm, similar to the partition/ligation method [33]. This creates highly accurate population frequency estimates of the phased haplotypes, based on the maximum likelihood as determined from the unphased input. Haplotypes with frequency > 2% in a block in CEU, YRI, JPT, and CHB are displayed in Figure 2. Alleles with blue boxes and red boxes represent common alleles and less common alleles in CEU, defined as Allele 1 and Allele 2, respectively. In the crossing areas, a value of multiallelic D' is shown to represent the level of recombination between the two blocks.
Table 1

Chromosomal positions and gene locations of the 14 CYP7A1 SNPs

#SNP IDChromosome positionLocation in CYP7A1 geneVariation* Allele 1/2
1rs10504255594484223'-downstreamA/G
2rs10957057594503013'-downstreamC/T
3rs8192879594535373'-UTRC/T
4rs1178658059455901Intron 4C/T
5rs374780959456945Intron 4T/C
6rs145704359460400Intron 2T/C
7rs216245959461003Intron 1A/G
8rs3808607594628855'-upstreamT/G
9rs3824260594631515'-upstreamG/A
10rs7833904594676235'-upstreamT/A
11rs3903445594683035'-upstreamG/T
12rs1125226594694725'-upstreamA/C
13rs1023649594703795'-upstreamA/G
14rs1023652594705775'-upstreamC/G

* Variation allele sequences are based on the plus strand sequence.

Table 2

Genotype frequencies of the 14 CYP7A1 SNPs in the four HapMap populations.

#SNP IDVariation Allele 1/2CEUCHBJPTYRI
FA2ObHHW-pFA2ObHHW-pFA2ObHHW-pFA2ObHHW-p
1rs10504255A/G0.3170.4561.00.2000.3560.8860.1700.2500.6940.0170.0221.0
2rs10957057C/T0.1420.2221.00.1220.2440.9990.1590.3180.5860.0920.1781.0
3rs8192879C/T0.3670.4670.7540.2670.4440.6630.2610.3861.00.2250.3331.0
4rs11786580C/T0.2420.3671.00.0560.1111.00.0910.1821.00.1920.3111.0
5rs3747809A/G000.0110.0221.00.0580.1161.000
6rs1457043T/C0.3920.5220.7540.5670.5560.6160.6250.5230.7180.5750.4670.911
7rs2162459A/G0.3920.5220.7540.5670.5560.6160.6250.5230.7180.5830.4561.0
8rs3808607T/G0.3670.4891.00.4110.4670.9890.5700.5350.8360.5830.4561.0
9rs3824260G/A0.3670.4891.00.4110.4670.9890.5700.5350.8360.5500.4780.784
10rs7833904T/A0.4080.5111.00.6220.5330.6120.6630.5350.3820.5580.4440.313
11rs3903445G/T0.4080.5111.00.6220.5330.6120.6480.5230.5800.5500.4670.44
12rs1125226A/C0.3980.5171.00.6220.5330.6120.6630.5350.3820.5750.4560.343
13rs1023649A/G0.4150.5171.00.6220.5330.6120.6480.5230.5800.5750.4560.343
14rs1023652C/G0.4080.5111.00.4330.5560.6160.3140.4880.6650.1250.1890.073

FA2: frequency of allele 2; ObH: observed frequency of heterozygous genotypes; HW-p: Hardy-Weinberg p value.

Chromosomal positions and gene locations of the 14 CYP7A1 SNPs * Variation allele sequences are based on the plus strand sequence. Genotype frequencies of the 14 CYP7A1 SNPs in the four HapMap populations. FA2: frequency of allele 2; ObH: observed frequency of heterozygous genotypes; HW-p: Hardy-Weinberg p value.

Genotyping on thin-film biosensor chip

For each selected SNP, target DNA molecules from each sample were amplified by PCR. PCR primers were designed based on the following criteria to make the PCR reaction uniform: (1) product size should be 120–200 bp with about 50–100 base flanking sequences around the SNP site in both directions, and (2) annealing temperature should be about 60°C for a standard PCR reaction condition. The best primer sets were selected by DS Gene Software version 1.5 (accelrys). The primer sequences for each SNP site are listed in Table 4. The selected primer sequences were synthesized by Invitrogen (Carlsbad, California). Multiple sets of the PCR products were amplified in a single PCR reaction.
Table 4

Oligonucleotide sequences of capture probe P1, detection probe P2, synthetic targets, and PCR primers.

SNPOligonucleotide sequence
rs10504255 P1-AALD-AAAAAAAAAAAACCTGAGCACTAGCCAGCTGTGTTCTCAATTCTGTGTGTAA
rs10504255 P1-GALD-AAAAAAAAAAAACCTGAGCACTAGCCAGCTGTGTTCTCAATTCTGTGTGTAG
rs10504255 P2Phosphate-CTCTTTCCCAATATTTCAAT-biotin
rs10504255 Target for P1-AATTGAAATATTGGGAAAGAGTTACACACAGAATTGAGAACA
rs10504255 Target for P1-GATTGAAATATTGGGAAAGAGCTACACACAGAATTGAGAACA
rs10504255 ForwardTAAACCTGCCTTGTCACGC
rs10504255 ReverseAGTTGCAAACGCTGGTTG
rs8192879 P1-CALD-AAAAAAAAAAGAAAAAACAATCTGCCAATTAGAATACATATCTTTTCTTC
rs8192879 P1-TALD-AAAAAAAAAAGAAAAAACAATCTGCCAATTAGAATACATATCTTTTCTTT
rs8192879 P2Phosphate-GGAGACGGGATCTCACTAAG-biotin
rs8192879 Target for P1-CCTTAGTGAGATCCCGTCTCCGAAGAAAAGATATGTATTCTA
rs8192879 Target for P1-TCTTAGTGAGATCCCGTCTCCAAAGAAAAGATATGTATTCTA
rs8192879 ForwardACCTGTAGTCTTAGCTACTCG
rs8192879 ReverseCCTTAGGAAAAAACAATCTGCC
rs11786580 P1-CALD-AAAAAAAAAACATTTACAAGAACCTATTTTTATCCATATACTATCTGATGC
rs11786580 P1-TALD-AAAAAAAAAACATTTACAAGAACCTATTTTTATCCATATACTATCTGATGT
rs11786580 P2Phosphate-TGCGTGGCGATCACCGCTGC-biotin
rs11786580 Target for P1-CGCAGCGGTGATCGCCACGCAGCATCAGATAGTATATGGATA
rs11786580 Target for P1-TGCAGCGGTGATCGCCACGCAACATCAGATAGTATATGGATA
rs11786580 ForwardTCTAGGTGTTTAATGCAGCTTC
rs11786580 ReverseGTCCCTTTATGCCTTTACCG
rs8192874 P1-CALD-AAAAAAAAAATTCTCGTGCCTCAAGCTCTCTGCCAGTTTCTCCCGGGCAC
rs8192874 P1-TALD-AAAAAAAAAATTCTCGTGCCTCAAGCTCTCTGCCAGTTTCTCCCGGGCAT
rs8192874 P2Phosphate-TGTGCGCAGTCCTGAACATG-biotin
rs8192874 Target for P1-CCATGTTCAGGACTGCGCACAGTGCCCGGGAGAAACTGGCAG
rs8192874 Target for P1-TCATGTTCAGGACTGCGCACAATGCCCGGGAGAAACTGGCAG
rs8192874 ForwardTCAGTTCTGAGATGCTTTCCC
rs8192874 ReverseAGTCTTTCCAGCCCTGGTAG
rs2162459 P1-AALD-AAAAAAAAAATCTCTAGAGGTGGTTCACCCGTTTGCCTGTCAGACACAAA
rs2162459 P1-GALD-AAAAAAAAAATCTCTAGAGGTGGTTCACCCGTTTGCCTGTCAGACACAAG
rs2162459 P2Phosphate-TGTATGATAGACATGGATGA-biotin
rs2162459 Target for P1-ATCATCCATGTCTATCATACATTTGTGTCTGACAGGCAAACG
rs2162459 Target for P1-GTCATCCATGTCTATCATACACTTGTGTCTGACAGGCAAACG
rs2162459 ForwardAAATTGCAGAGCACAGCC
rs2162459 ReverseTCAAAGTTTGAAGTCAGTGGG
rs3808607 P1-GALD-AAAAAAAAAACAAAGCAATCAGAGACCTGCAATACTTGATAAGTTGAAGG
rs3808607 P1-TALD-AAAAAAAAAACAAAGCAATCAGAGACCTGCAATACTTGATAAGTTGAAGT
rs3808607 P2Phosphate-TCTCTCAAATATATGTTGAC-biotin
rs3808607 Target for P1-GGTCAACATATATTTGAGAGACCTTCAACTTATCAAGTATTG
rs3808607 Target for P1-TGTCAACATATATTTGAGAGAACTTCAACTTATCAAGTATTG
rs3808607 ForwardAGTCCACAGGTATCAGAAGTG
rs3808607 ReverseCCCCAGGTCCGAATGTTAAG
rs7833904 P1-AALD-AAAAAAAAAACAGGAGAAAGCGTAAGTGCCCTGACAAAGAAAAGAGGTAA
rs7833904 P1-TALD-AAAAAAAAAACAGGAGAAAGCGTAAGTGCCCTGACAAAGAAAAGAGGTAT
rs7833904 P2Phosphate-AGTTGCCAGAAAACCTGGCA-biotin
rs7833904 Target for P1-ATGCCAGGTTTTCTGGCAACTTTACCTCTTTTCTTTGTCAGG
rs7833904 Target for P1-TTGCCAGGTTTTCTGGCAACTATACCTCTTTTCTTTGTCAGG
rs7833904 ForwardAATGTGAGCAAGAAAGCCC
rs7833904 ReverseGAACTCAGTTTATTTTGCCAGG
rs1125226 P1-AALD-AAAAAAAAAACAACTGGTTCATCAGACTTGCTGCGACCAATTAACCTTGA
rs1125226 P1-CALD-AAAAAAAAAACAACTGGTTCATCAGACTTGCTGCGACCAATTAACCTTGC
rs1125226 P2Phosphate-CAGCTCAGGGAGAGAGAGAG-biotin
rs1125226 Target for P1-ACTCTCTCTCTCCCTGAGCTGTCAAGGTTAATTGGTCGCAGC
rs1125226 Target for P1-CCTCTCTCTCTCCCTGAGCTGGCAAGGTTAATTGGTCGCAGC
rs1125226 ForwardTTTCTCTCTCTCCCTCTTTCTC
rs1125226 ReverseCTGGTTCATCAGACTTGCTG
rs1023652 P1-CALD-AAAAAAAAAAAAAATTCAAGTACCATAATATTCACCCCTTTAAAGAGTAC
rs1023652 P1-GALD-AAAAAAAAAAAAAATTCAAGTACCATAATATTCACCCCTTTAAAGAGTAG
rs1023652 P2Phosphate-AATTCAAAATTTCTAGCATA-biotin
rs1023652 Target for P1-CTATGCTAGAAATTTTGAATTGTACTCTTTAAAGGGGTGAAT
rs1023652 Target for P1-GTATGCTAGAAATTTTGAATTCTACTCTTTAAAGGGGTGAAT
rs1023652 ForwardTGGAATGCCAGTTACCCCAC
rs1023652 ReverseGGTGATGGTCACACAATCTTTG
1302 delTT P1-TTALD-AAAAAAAAAATGTATTTTTATTGCAGACTTTTAAATATGATAGGTATCTT
1302 delTT P1-DELALD-AAAAAAAAAATTTGTATTTTTATTGCAGACTTTTAAATATGATAGGTATC
1302 delTT P2Phosphate-GATGAAAACGGGAAGACAAA-biotin
1302 delTT Target for P1-TTTGTCTTCCCGTTTTCATCAAGATACCTATCATATTTAAAA
1302 delTT Target for P1-DELTGTCTTCCCGTTTTCATCGATACCTATCATATTTAAAA
1302 delTT ForwardACTAGCTTAAAGGCGGTTTTC
1302 delTT ReverseCGATCCAAAGGGCATGTAG

* Shadows indicate SNP nucleotides in P1 probe and target sequences.

For each SNP, three oligonucleotide probes were synthesized. A pair of allele specific P-1 oligos, differing only in their 3'-terminal nucleotide sequence, generally has 40 nucleotides complementary to the corresponding target sequences, and an additional 10-dA residue at their 5'-ends that constitutes a "spacer". Their 5'-terminal nucleotide is modified with an aldehyde group, allowing covalent attachment to the chip surface [27]. A second oligonucleotide probe (biotin-P2) with 20 nucleotides immediately adjacent to the SNP nucleotide carries a biotin at the 3' end for detection, and a phosphate at its 5' end for ligation. To test genotyping specificity, a pair of oligonucleotide targets was also synthesized. The P1, P2 and target sequences for each SNP are listed in Table 4. The synthesized P1 oligos were dissolved to 100 μM in 0.1 M phosphate buffer, pH 7.8. A P1 working solution of 1 μM in 0.1 M phosphate buffer, pH 7.8, and 10% glycerol was prepared for each P-1 probe before spotting. Twenty nano liter of the P1 working solution was spotted on a 7 × 7 mm2 chip in an 8 per row × 6 per column format, by a BioDot PC controlled dispense arrayer AD3200. A duplicate set of P1 probes were spotted on a chip with a spotting pattern shown in Figure 3A. After the spotted chips were incubated in a humidity-controlled chamber for at least 2 hrs, the chips were washed with 0.1% SDS, water, and air dried. A standard operating procedure for genotyping SNPs on the printed biosensor chips was described previously [27]. An arrayed chip was assembled into a square well of a 96-well microtiter plate for hybridization. A ligation reaction was carried out in a microtiter plate well containing an arrayed chip. A reaction solution (100 μl) contained 100 femtomoles of each relevant PCR amplicon of the 10 CYP7A1 SNPs, 10 nM P-2 probe (one for each SNP) and 5 units of mutant Ampligase in a buffer of 20 mM Tris-HCl, pH 8.3, 25 mM KCl, 10 mM MgCl2, 0.5 mM NAD, 0.01% Triton X-100, and 5 mg/ml alkaline treated casein. The ligation reaction was incubated for 20 min at 60°C. 96 chips in a 96-well plate were processed simultaneously. After a stringent wash (3 times in 0.01 M NaOH at room temperature and 3 times in 0.1 × SSC), the chips were incubated with an antibiotin-horse radish peroxidase (HRP) conjugate (1 μg/ml in hybridization buffer) for 10 min, and the chips were rinsed with 0.1 × SSC. 100 μl of a precipitate-generating HRP substrate TMB (BioFx) was added to each chip and incubated for 5 min, rinsed in ddH20, and air-dried.

Authors' contributions

KN carried out the genotyping experiments. SW provided assistance on haplotype analysis. RDJ provide thin-film biosensor chip for genotyping and participated in the experiment design. GLG participated in the design of the study and data analysis. YYW provided Mexican-American and African-American samples for this study. CDK participated in study design and help preparation of manuscript. XBZ coordinated the experimental design and was responsible for quality control, data analysis and manuscript preparation. All authors read and approved the final manuscript. LD patterns and haplotype structures of the CYP7A1 gene in Mexican-American and African-American populations. Ten selected genetic variations for CYP7A1 genotyping. Oligonucleotide sequences of capture probe P1, detection probe P2, synthetic targets, and PCR primers. * Shadows indicate SNP nucleotides in P1 probe and target sequences. Allele frequencies of the selected CYP7A1 htSNPs and mutation markers A. in the Mexican-American population. * Shadows indicate SNP nucleotides in P1 probe and target sequences.
  33 in total

1.  Interference-based detection of nucleic acid targets on optically coated silicon.

Authors:  R Jenison; S Yang; A Haeberli; B Polisky
Journal:  Nat Biotechnol       Date:  2001-01       Impact factor: 54.908

2.  Analysis of the CYP2D6 gene polymorphism and enzyme activity in African-Americans in southern California.

Authors:  Y J Wan; R E Poland; G Han; T Konishi; Y P Zheng; N Berman; K M Lin
Journal:  Pharmacogenetics       Date:  2001-08

3.  CYP2D6 polymorphism in a Mexican American population.

Authors:  R Mendoza; Y J Wan; R E Poland; M Smith; Y Zheng; N Berman; K M Lin
Journal:  Clin Pharmacol Ther       Date:  2001-12       Impact factor: 6.875

4.  Partition-ligation-expectation-maximization algorithm for haplotype inference with single-nucleotide polymorphisms.

Authors:  Zhaohui S Qin; Tianhua Niu; Jun S Liu
Journal:  Am J Hum Genet       Date:  2002-11       Impact factor: 11.025

5.  Markers that discriminate between European and African ancestry show limited variation within Africa.

Authors:  Heather E Collins-Schramm; Rick A Kittles; Darwin J Operario; James L Weber; Lindsey A Criswell; Richard S Cooper; Michael F Seldin
Journal:  Hum Genet       Date:  2002-09-14       Impact factor: 4.132

6.  Haploview: analysis and visualization of LD and haplotype maps.

Authors:  J C Barrett; B Fry; J Maller; M J Daly
Journal:  Bioinformatics       Date:  2004-08-05       Impact factor: 6.937

7.  Linkage disequilibrium patterns and tagSNP transferability among European populations.

Authors:  Jakob C Mueller; Elin Lõhmussaar; Reedik Mägi; Maido Remm; Thomas Bettecken; Peter Lichtner; Saskia Biskup; Thomas Illig; Arne Pfeufer; Jan Luedemann; Stefan Schreiber; Peter Pramstaller; Irene Pichler; Giovanni Romeo; Anthony Gaddi; Alessandra Testa; Heinz-Erich Wichmann; Andres Metspalu; Thomas Meitinger
Journal:  Am J Hum Genet       Date:  2005-01-06       Impact factor: 11.025

8.  Genetic polymorphism in cytochrome P450 7A1 and risk of colorectal cancer: the Fukuoka Colorectal Cancer Study.

Authors:  Tomoko Hagiwara; Suminori Kono; Guang Yin; Kengo Toyomura; Jun Nagano; Tetsuya Mizoue; Ryuichi Mibu; Masao Tanaka; Yoshihiro Kakeji; Yoshihiko Maehara; Takeshi Okamura; Kouji Ikejiri; Kitaroh Futami; Youichi Yasunami; Takafumi Maekawa; Kenji Takenaka; Hitoshi Ichimiya; Nobutoshi Imaizumi
Journal:  Cancer Res       Date:  2005-04-01       Impact factor: 12.701

9.  Linkage of the cholesterol 7alpha-hydroxylase gene and low-density lipoprotein cholesterol conditional on apolipoprotein E association: the National Heart, Lung, and Blood Institute Family Heart Study.

Authors:  Jing-Ping Lin; Richard H Myers; Laura Almasy; Hilary H Coon; Donna K Arnett; Yuling Hong; Steven C Hunt
Journal:  Chin Med J (Engl)       Date:  2005-03-05       Impact factor: 2.628

10.  Human cholesterol 7alpha-hydroxylase (CYP7A1) deficiency has a hypercholesterolemic phenotype.

Authors:  Clive R Pullinger; Celeste Eng; Gerald Salen; Sarah Shefer; Ashok K Batta; Sandra K Erickson; Andrea Verhagen; Christopher R Rivera; Sean J Mulvihill; Mary J Malloy; John P Kane
Journal:  J Clin Invest       Date:  2002-07       Impact factor: 14.808

View more
  15 in total

1.  The effect of CYP7A1 polymorphisms on lipid responses to fenofibrate.

Authors:  Jian Shen; Donna K Arnett; Laurence D Parnell; Chao-Qiang Lai; Robert J Straka; Paul N Hopkins; Ping An; Mary F Feitosa; José M Ordovás
Journal:  J Cardiovasc Pharmacol       Date:  2012-03       Impact factor: 3.105

2.  Association between CYP7A1 and the risk of proximal colon cancer in Japanese.

Authors:  Hidemi Ito; Keitaro Matsuo; Satoyo Hosono; Miki Watanabe; Takakazu Kawase; Takeshi Suzuki; Takashi Hirai; Yasushi Yatabe; Hideo Tanaka; Kazuo Tajima
Journal:  Int J Mol Epidemiol Genet       Date:  2009-10-12

3.  SNP discovery and haplotype analysis in the bovine PRKAA2 gene.

Authors:  Qin Zhang; Sheng Zhao; Hong Chen; Li Zhang; Liangzhi Zhang; Fei Li; Xuemin Wang
Journal:  Mol Biol Rep       Date:  2010-09-15       Impact factor: 2.316

4.  Allelic polymorphism detected in the bovine FTO gene.

Authors:  Bao Zhang; Ya Zhang; Liangzhi Zhang; Jing Wang; Zhuanjian Li; Hong Chen
Journal:  Mol Biotechnol       Date:  2011-11       Impact factor: 2.695

5.  Interactions Between Regulatory Variants in CYP7A1 (Cholesterol 7α-Hydroxylase) Promoter and Enhancer Regions Regulate CYP7A1 Expression.

Authors:  Danxin Wang; Katherine Hartmann; Michal Seweryn; Wolfgang Sadee
Journal:  Circ Genom Precis Med       Date:  2018-10

6.  Single nucleotide polymorphisms and haplotypic diversity in the bovine PRKAB1 gene.

Authors:  Qin Zhang; Hong Chen; Sheng Zhao; Li Zhang; Liangzhi Zhang; Fei Li; Xueming Wang
Journal:  Mol Biotechnol       Date:  2009-07-04       Impact factor: 2.695

7.  Genetic Variations Associated with Anti-Tuberculosis Drug-Induced Liver Injury.

Authors:  Yifan Bao; Xiaochao Ma; Theodore P Rasmussen; Xiao-Bo Zhong
Journal:  Curr Pharmacol Rep       Date:  2018-03-15

8.  Population analysis of vitamin D receptor polymorphisms and the role of genetic ancestry in an admixed population.

Authors:  Tulio C Lins; Rodrigo G Vieira; Dario Grattapaglia; Rinaldo W Pereira
Journal:  Genet Mol Biol       Date:  2011-07-01       Impact factor: 1.771

9.  Ethnic differences in the prevalence of polymorphisms in CYP7A1, CYP7B1 AND CYP27A1 enzymes involved in cholesterol metabolism.

Authors:  Vera Dias; V Ribeiro
Journal:  J Pharm Bioallied Sci       Date:  2011-07

10.  Up to date on cholesterol 7 alpha-hydroxylase (CYP7A1) in bile acid synthesis.

Authors:  John Y L Chiang; Jessica M Ferrell
Journal:  Liver Res       Date:  2020-06-03
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.