Literature DB >> 29085579

UGT1A1 gene linkage analysis: application of polymorphic markers rs4148326/rs4124874 in the Iranian population.

Abstract

OBJECTIVES: Mutations in the UGT1A1 gene are responsible for hyperbilirubinemia syndromes including Crigler-Najjar type 1 and 2 and Gilbert syndrome. In view of the genetic heterogeneity and involvement of large numbers of the disease causing mutations, the application of polymorphic markers in the UGTA1 gene could be useful in molecular diagnosis of the disease.
MATERIALS AND METHODS: In the present study, two polymorphic markers including rs4148326 and rs4124874 in the UGT1A1 gene region were characterized. The markers were selected using bioinformatics analysis of the UGT1A1 gene region and genotyped in 212 unrelated healthy individuals and 13 family trios in the Iranian population using Tetra-Primer ARMS PCR technique. The allele frequency and population status of the alleles were estimated using GENEPOP, FBAT, PowerMarker and Arlequin software.
RESULTS: The results indicated that in the case of rs4148326 marker, allele frequency for T and C allele was 66.04% and 33.96%, respectively. For rs4124874 marker, allele frequency for G and T alleles was 39.4% and 60.6%, respectively. The values of heterozygosity index for the markers examined were 64.1 for rs4148326 and 72.1 for rs4124874, respectively. The haplotype estimation analysis of the markers resulted in three informative haplotypes with frequencies ≥0.05. Moreover, the results suggested the presence of linkage disequilibrium between two markers.
CONCLUSION: Altogether, the data suggested that rs4148326 and rs4124874 could be introduced as informative markers for molecular diagnosis of Crigler-Najjar type 1 and 2 and Gilbert syndrome in the Iranian population.

Entities: Chemical Disease Gene Mutation Species

Keywords: Genotyping; Linkage analysis; Molecular diagnostic; Polymorphic markers; UGT1A1 enzyme

Year: 2017 PMID： 29085579 PMCID： PMC5651473 DOI： 10.22038/IJBMS.2017.9109

Source DB: PubMed Journal: Iran J Basic Med Sci ISSN： 2008-3866 Impact factor: 2.699

Introduction

Uridine diphosphate glucuronosyl transferase 1A1 (UGT1A1) catalyzes the conjugation of insoluble bilirubin with glucuronic acid in hepatic cells, which is necessary for efficient biliary removal of serum bilirubin (1, 2). Deficiency of the enzyme activity results in the accumulation of unconjugated bilirubin in serum, causing hyperbilirubinemia (3, 4). Muta-tions in the UGT1A1 gene are responsible for hyper-bilirubinemia syndromes including Gilbert syndrome (GS) and Crigler–Najjar (CN) type 1 and 2 (CN-1, CN-2) (5, 6). CN-1 is a severe and lethal form of hereditary unconju-gated hyperbilirubinemia, with autosomal recessive inheritance manner. CN-2 and GS are the moderate and mild forms of hereditary unconjugated hyper-bilirubinemia, respectively (7-9). CN type 1 and 2 and GS result from the combination of several mutations of variable severity at a single locus. Therefore, the phenotype variability of these synd-romes is, at least in part, due to genetic heterogeneity. Genetic counseling as well as molecular diagnosis, including antenatal diagnosis is proposed in families at risk for CN and GS (8). To date, over 130 different mutations in the UGT1A1 gene have been recorded in the human genome mutation database (HGMD) (10). Molecular diagnosis of the syndrome(s) has been essentially based on direct mutation analysis of the UGT1A1 gene region (11). However, due to the large number of mutations associated with the disease as well as the genetic heterogeneity of the mutations, direct analysis is mostly time-consuming and costly proce-dure, especially in developing countries. Alternatively, indirect analysis of mutations using linkage analysis could be considered as a practical approach in families with an affected individual. Several polymorphic markers in the UGT1A1 gene region have been introduced. However, the application of the markers is population-dependent. Therefore, the suitability of each marker should be first determined in the population under study, before further application in linkage and carrier-ship investigations (12). To our knowledge, there is no study performed on the molecular markers linked to the UGT1A1 gene region by means of linkage study for molecular diagnosis of these syndromes. In the present study, using bioinformatics analysis of the UGT1A1 gene region, among the markers present in the gene region, two single nucleotide polymorphisms (SNP), rs4148326 and rs4124874, located at 5’ region of the gene (respectively in intron 1 and the promoter region) were selected (13-15). The genetic structure as well as haplotype frequency of the markers was investigated in the Iranian population.

Materials and Methods

Bioinformatics studies and SNP selection

An in silico analysis was performed on the poly-morphic markers in the UGT1A1 gene region, which were reported in databases dbSNP, ALFRED, UCSC Genome Browser, SNPper and openSNP. The mar-kers with high minor allele frequency (MAF) and heterozygosity including rs4124874 and rs4148326 were selected. MAF and heterozygosity rate are two important characteristics of genetic markers for their informativeness in linkage analysis (16).

DNA sample

Peripheral blood samples were collected from 225 individuals consists of 212 unrelated healthy persons and 13 family trios. The trios family was consisted of both parents and one child with healthy status, which were selected randomly. Unrelated healthy individuals and both parents in family group were sex- and age- matched (aged between 25 and 39). The children in the family group were 5-11 years old. All subjects were from the central province of Iranian population with Fars ethnicity. Genomic DNA was extracted from peripheral blood mononuclear cells by salting out procedure (17).

SNPs genotyping

The selected rs4124874 and rs4148326 markers were genotyped using tetra-primer ARMS-PCR tech-nique (18). This method combines two allele-specific inner primers and two outer primers in a single reaction and encompasses deliberate mismatches at position-2 from the 3’ end of inner primers. Genotyping by tetra-primer ARMS-PCR requires only a single PCR amplification followed by electrophoresis for the determination of genotypes (18, 19). The optimal selection of primers can be achieved in an automated way using a program, which evaluates candidate primers for a given sequence. In this study, the Primer1 software was used for tetra-primer ARMS-PCR primer design (20, 21). Primer sets are shown in Table 1.

Table 1

Primer sets for genotyping of rs4148326 and rs4124874

Primer	Sequence (5’→3’)

	rs4148326	rs4124874
IF	CAGGGTGTTCTTGCTACAAACCAAAAcAc	GCTGGCCAAGGGTAGAGTTCAaTG
OR	ATCCTCCCCACCACCATGCTTCA	TGTCCAAGCTCATTCCTCCTCTC
OF	AAGTAAGCCATTTACCAACGCTCAG	TCTTTGCTTTGATAAATTGTGGGGC
IR	AGGATGCTGGTCACCCTAGaTG	CACCATGTGGGTCATCTGTGACTTtAa

IF: Inner Forward; OR: Outer reverse; OF: Outer forward; IR: Inner reverse

Primer sets for genotyping of rs4148326 and rs4124874 IF: Inner Forward; OR: Outer reverse; OF: Outer forward; IR: Inner reverse The PCR reactions were carried out in reactions of 25 μl total volume containing 100 ng of template DNA, 15 pmol of each inner primer, 3 pmol of each outer primer, 75 mM MgCl2, 150 μM dNTP, 2.5 μl 10× buffer and 2.5 U Taq DNA polymerase. Initial denaturation was performed at 94 °C for 5 min, followed by 30 cycles including 30 sec denaturation at 94 °C; different annealing temperature depending on the marker used (64 °C for rs4148326 and 66 °C for rs4124874) for 50 sec, extension at 72 °C for 50 sec; followed by a 10 min final extension at 72 °C. The PCR products were separated on 2% agarose gel electrophoresis and analyzed.

Statistical analysis

Allele frequency as well as the observed and expect-ed heterozygosity was determined using the GENEPOP software (22). The haplotype frequency was estimated using Arlequin (23) and FBAT software (24) on the data obtained from unrelated individuals and family trios, respectively. The program Arlequin implements expectation-maximization (EM) algorithm (23). In order to provide a better description of the obtained results from the haplotype estimation, the estimation of linkage disequilibrium (LD) for 212 unrelated individuals was also performed using the PowerMarker computer program (25). LD was measured by using the standardized D’(26). D’ is the LD relative to its maximum value for a given set of allelic frequencies for the pair of sites (27). In this sense, D’ is a normalized value of LD. D’ ranges between 0 and 1. When D’=1, the LD was completed and indicated LD. When D’ was estimated zero, it indicated the presence of LD between two loci. Given that D’ alone could not provide great prediction in the likely performance for multiallelic markers; therefore, Chi-square (χ2), which is another multiallelic LD statistics, was used (28). Chi-square provides a direct assessment of the strength of the association between alleles located at the two loci. The estimation of D’ and χ2 was performed using PowerMarker computer program. The null hypothesis of random association between pairs of alleles at the two loci (D’= 0) was assessed by χ2 value. The χ2 value for two pairwise marker is compared with χ2 obtained from the chi-square chart (P≤0.05). Haplotypes construction for the markers was carried out through the PowerMarker computer program, which has been based on EM algorithm and calculates the most likely haplotypes with respect to the provided genotypic data (29).

HapMap analysis

The genotyping data were obtained from the HapMap database (30). Then the genotyping data from the Iranian population (present study) were compared with those of the other populations.

Results

In the present study, two markers in the UGT1A1 gene region including rs4148326 and rs4124874 were genotyped in the Iranian population (Figure 1, 2). The allelic frequency and the expected and observed hetero-zygosity of the markers were estimated by GENEPOP software based on the identified unrelated hetero-zygous individuals. As presented in Table 2, the MAF for rs4148326 (C allele) was 33.96, and for rs4124874 (T allele) was 39.4. Both markers showed a relatively high degree of allelic heterozygosity (64.1 for rs4148326 and 72.1 for rs4124874, respectively) in the studied population (Table 2).

Figure 1

Genotyping of rs4148326 marker in the Iranian population

PCR products of five samples were shown. The heterozygous (TC), homozygous (TT) and (CC) genotype were indicated above each lane. Each product size is also shown at the right side of picture

Figure 2

Genotyping of rs4124874 marker in Iranian population

PCR products of six samples were shown. The heterozygous (GT), homozygous (GG) and (TT) genotype were indicated above each lane. Each product size is also shown at the right side of picture

Table 2

The frequency distribution and the expected and observed heterozygosity frequency of rs4148326 and Rs4124874 markers in the Iranian population

Marker	Minnor allele frequency	Major allele frequency	Ho	He
rs4148326	C allele 33.96	T allele 66.04	64.15	44.96
rs4124874	T allele 39.4	G allele 60.6	72.1	47.86

Ho: Observed heterozygosity, He: Expected heterozygosity

Genotyping of rs4148326 marker in the Iranian population PCR products of five samples were shown. The heterozygous (TC), homozygous (TT) and (CC) genotype were indicated above each lane. Each product size is also shown at the right side of picture Genotyping of rs4124874 marker in Iranian population PCR products of six samples were shown. The heterozygous (GT), homozygous (GG) and (TT) genotype were indicated above each lane. Each product size is also shown at the right side of picture The frequency distribution and the expected and observed heterozygosity frequency of rs4148326 and Rs4124874 markers in the Iranian population Ho: Observed heterozygosity, He: Expected heterozygosity The haplotype frequency was estimated using Arlequin and FBAT software. As shown in Table 3, among the four possible haplotypes identified by Arlequin software for the markers, three haplotypes were considered as informative haplotypes with frequencies ≥5% in the Iranian population.

Table 3

Analysis of haplotypes frequency of rs4148326 andrs4124874 markers in the Iranian population using Arlequin and FBAT softwares and informative state of each haplotype

Index	Haplotype	Frequency		Informativeness
	rs4148326-rs4124874	Unrelated	Family trios	(+/-)
1	T-T	0.36024	0.311	+/+
2	C-G	0.30599	0.292	+/+
3	T-G	0.30014	0.285	+/+
4	C-T	0.03363	0.112	-/+

The symbols + and - in the table show the informative and non informative state of each haplotype in unrelated individuals and family trios, respectively

Analysis of haplotypes frequency of rs4148326 andrs4124874 markers in the Iranian population using Arlequin and FBAT softwares and informative state of each haplotype The symbols + and - in the table show the informative and non informative state of each haplotype in unrelated individuals and family trios, respectively Haplotype estimation of families was performed by FBAT software, which resulted in four informative haplotypes with frequencies ≥5% among four possi-ble haplotypes. Three haplotypes including T-T, C-G and T-G were estimated as informative haplotypes using both computer programs (Table 3). The data suggested that a combination of rs4124874 and rs4148326 could be introduced as an informative haplotype. Given the presence of relatively high proportion of informative haplotypes in the population under the study, the LD pattern for the pairing of these two markers was assessed by calculating average D’ and χ2 using PowerMarker program (25). As shown in Table 3, the results of estimation for D’ values for two possible pairing of markers were higher than zero. The calculated χ2 values were also higher than χ2 values obtained from the chi-square table (P<0.05) (Table 4) (31).

Table 4

Analysis of D’ and χ2 values for pairing of the rs4148326 and rs4124874 markers in the UGT1A1 locus in the Iranian population

Pairing of markers	D′	P value	χ²
rs4124874-rs4148326	0.75	0.000	79.87

Analysis of D’ and χ2 values for pairing of the rs4148326 and rs4124874 markers in the UGT1A1 locus in the Iranian population

Discussion

CN syndrome type 1 and 2 and GS result from mutations in UGT1A1gene (32). One of the clinical manifestations of these diseases is neonatal jaundice, which is also very common in newborn infants. Even though it is often a natural and transitional condition, some infants develop severe hyperbilirubinemia in which unconjugated bilirubin in the serum may cross the blood-brain-barrier causing encephalopathy or kernicterus (33). To date, numerous polymorphisms of UGT1A1 gene have been identified in association with hyperbilirubinemia diseases of CN-1, CN-2, and GS. Thus, the UGT1A1 gene markers may be used as bio-markers pertaining to examine individualized disease risk. Due to the greater power and more appropriate state of using haplotypes for genotype-phenotype correlations than individual SNPs, it is useful to perform haplotype analyses (34, 35). Even though each marker could be assessed separately, it is more informative to analyze them in groups. When multiple markers in a unique chromosome were used to determine their association with a disease, analyzing of haplotype could be more informative than independent analyses of individual markers (36). Linkage analysis using polymorphic markers linked to UGT1A1 gene region could facilitate carrier detection and prenatal diagnosis of CN-1, CN-2, and GS in families with an affected individual. To our knowledge, molecular diagnosis of the syndrome has been based on direct mutation analysis. In the present study, among the markers present in the UGT1 gene complex region, the intra-genic marker, rs4148326 and extra-genic marker, rs4124874 were investigated. The allele and haplotype frequency of the markers were estimated in the Iranian population. The results showed that both markers were in high allelic frequency. As shown in Table 5, the MAF of the SNP markers was compared with the data from other populations. The observed heterozygosity for the markers that was calculated based on the identified unrelated heterozygous individuals indicated a relatively high heterozygosity for these markers in the Iranian population. In view of the high level of MAF and observed heterozygosity, rs4148326 and s4124874 SNPs could be suggested as informative markers in the Iranian population when compared to other populations. Therefore, these markers could be considered as the first markers to be introduced in carrier detection and prenatal diagnosis of the CN-1, CN-2 and GS in the Iranian population.

Table 5

Comparison of minor allele frequency (MAF) of the rs4148326 and rs4124874 markers between the Iranian population and data from populations presented in UCSC genome browser database

Population	rs4148326 C allele frequency	rs4124874 T allele frequency
IRI	33.96	39.4
CEU	43.94	56.06
CHB	30.36	69.64
CHD	34.12	65.88
YRI	63.47	10.24
GIH	60.23	39.77
ASW	54.22	20.73
JPT	33.14	66.86
LWK	69.44	11.11
MEX	49.35	48.70
MKK	69.88	15.59
TSI	42.61	57.39
Average	52.11	38.78

CEU (northern and western Europe), CHB (Han Chinese in Beijing, China), CHD (Chinese Ancestry in Metropolitan Denver, CO, US), YRI (Yoruba in Ibadan, Nigeria), GIH (Gujarati Indians in Houston, TX), ASW (African Ancestry in SouthWestern United States), JPT (Japanese in Tokyo, Japan), LWK (Luhya in Webuye, Kenya), MEX (Mexican Ancestery in Los Angeles, CA, US), Masai in Kinyawa, Kenya MKK and TSI (Toscani in Italia)

Comparison of minor allele frequency (MAF) of the rs4148326 and rs4124874 markers between the Iranian population and data from populations presented in UCSC genome browser database CEU (northern and western Europe), CHB (Han Chinese in Beijing, China), CHD (Chinese Ancestry in Metropolitan Denver, CO, US), YRI (Yoruba in Ibadan, Nigeria), GIH (Gujarati Indians in Houston, TX), ASW (African Ancestry in SouthWestern United States), JPT (Japanese in Tokyo, Japan), LWK (Luhya in Webuye, Kenya), MEX (Mexican Ancestery in Los Angeles, CA, US), Masai in Kinyawa, Kenya MKK and TSI (Toscani in Italia) Our study samples consisted of unrelated healthy individuals and family trios. Combining data when data are collected through different study designs, such as family trios and unrelated samples, achieves more accuracy and is cost-effective compared with analyzing each data separately (37). Estimation of haplotype frequency of the markers in families and unrelated individuals showed three out of four haplotypes to be common in the Iranian population with frequencies ≥5% using both the FBAT and Arlequin software. These result indicated that the estimated frequencies of common haplotypes did not differ significantly using phase-known versus phase-unknown data. Further-more, the estimation of D’ and χ2 for the pair of markers showed that they were in strong LD. Therefore, a large percentage of the chromosomes in this population may display common haplotypes for these two markers.

Conclusion

Altogether, the results from the present study showed that the degree of heterozygosity for rs4148326 and s4124874 markers were high in the Iranian population. Our results also demonstrated the ethnic specificity in degree of heterozygosity and informative haplotypes of the markers. These findings suggested the presence of genetic diversity at these two markers in the UGT1A1 gene region. These markers could be considered in families with at least one affected child for carrier detection and prenatal diagnosis with a high degree of accuracy regardless of the nature of mutation. The data also suggested that the combination of rs4148326-s4124874 could be considered as an informative tool in performing carrier and prenatal diagnosis of UGT1A1 gene mutations in the Iranian families with the CN type 1 and 2 syndrome as well as GS.

31 in total

Review 1. The predictive power of haplotypes in clinical response.

Authors: R Judson; J C Stephens; A Windemuth
Journal: Pharmacogenomics Date: 2000-02 Impact factor: 2.533

Review 2. Methods of parentage analysis in natural populations.

Authors: Adam G Jones; William R Ardren
Journal: Mol Ecol Date: 2003-10 Impact factor: 6.185

3. Genotyping single nucleotide polymorphisms in barley by tetra-primer ARMS-PCR.

Authors: E Chiapparino; D Lee; P Donini
Journal: Genome Date: 2004-04 Impact factor: 2.166

4. Analysis of any point mutation in DNA. The amplification refractory mutation system (ARMS).

Authors: C R Newton; A Graham; L E Heptinstall; S J Powell; C Summers; N Kalsheker; J C Smith; A F Markham
Journal: Nucleic Acids Res Date: 1989-04-11 Impact factor: 16.971

5. An efficient procedure for genotyping single nucleotide polymorphisms.

Authors: S Ye; S Dhillon; X Ke; A R Collins; I N Day
Journal: Nucleic Acids Res Date: 2001-09-01 Impact factor: 16.971

6. Molecular diagnosis of a familial nonhemolytic hyperbilirubinemia (Gilbert's syndrome) in healthy subjects.

Authors: J Borlak; T Thum; O Landt; K Erb; R Hermann
Journal: Hepatology Date: 2000-10 Impact factor: 17.425

Review 7. Genetic lesions of bilirubin uridine-diphosphoglucuronate glucuronosyltransferase (UGT1A1) causing Crigler-Najjar and Gilbert syndromes: correlation of genotype to phenotype.

Authors: A Kadakol; S S Ghosh; B S Sappal; G Sharma; J R Chowdhury; N R Chowdhury
Journal: Hum Mutat Date: 2000-10 Impact factor: 4.878

8. Association of UGT1A1 variants and hyperbilirubinemia in breast-fed full-term Chinese infants.

Authors: Youyou Zhou; San-nan Wang; Hong Li; Weifeng Zha; Xuli Wang; Yuanyuan Liu; Jian Sun; Qianqian Peng; Shilin Li; Ying Chen; Li Jin
Journal: PLoS One Date: 2014-08-07 Impact factor: 3.240

9. A genome-wide association study of total bilirubin and cholelithiasis risk in sickle cell anemia.

Authors: Jacqueline N Milton; Paola Sebastiani; Nadia Solovieff; Stephen W Hartley; Pallav Bhatnagar; Dan E Arking; Daniel A Dworkis; James F Casella; Emily Barron-Casella; Christopher J Bean; W Craig Hooper; Michael R DeBaun; Melanie E Garrett; Karen Soldano; Marilyn J Telen; Allison Ashley-Koch; Mark T Gladwin; Clinton T Baldwin; Martin H Steinberg; Elizabeth S Klings
Journal: PLoS One Date: 2012-04-27 Impact factor: 3.240

Review 10. The Human Gene Mutation Database: building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine.

Authors: Peter D Stenson; Matthew Mort; Edward V Ball; Katy Shaw; Andrew Phillips; David N Cooper
Journal: Hum Genet Date: 2014-01 Impact factor: 4.132