| Literature DB >> 23861666 |
Rupali Chopra1, Shafat Ali, Amit K Srivastava, Shweta Aggarwal, Bhupender Kumar, Siddharth Manvati, Ponnusamy Kalaiarasan, Mamta Jena, Vijay K Garg, Sambit N Bhattacharya, Rameshwar N K Bamezai.
Abstract
Leprosy is a chronic infectious disease caused by Mycobacterium Leprae, where the host genetic background plays an important role toward the disease pathogenesis. Various studies have identified a number of human genes in association with leprosy or its clinical forms. However, non-replication of results has hinted at the heterogeneity among associations between different population groups, which could be due to differently evolved LD structures and differential frequencies of SNPs within the studied regions of the genome. A need for systematic and saturated mapping of the associated regions with the disease is warranted to unravel the observed heterogeneity in different populations. Mapping of the PARK2 and PACRG gene regulatory region with 96 SNPs, with a resolution of 1 SNP per 1 Kb for PARK2 gene regulatory region in a North Indian population, showed an involvement of 11 SNPs in determining the susceptibility towards leprosy. The association was replicated in a geographically distinct and unrelated population from Orissa in eastern India. In vitro reporter assays revealed that the two significantly associated SNPs, located 63.8 kb upstream of PARK2 gene and represented in a single BIN of 8 SNPs, influenced the gene expression. A comparison of BINs between Indian and Vietnamese populations revealed differences in the BIN structures, explaining the heterogeneity and also the reason for non-replication of the associated genomic region in different populations.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23861666 PMCID: PMC3701713 DOI: 10.1371/journal.pgen.1003578
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1A partial map of Chromosome-6q26 expanded to show the position and distribution of 11 significant SNPs (shown with rs numbers) in the regulatory region of the PARK2 and PACRG genes.
Allele and genotype frequencies for 11 significant SNPs within PARK2 and PACRG gene regulatory region in two different cohorts of patients with Leprosy.
| North India | Orissa Population | Combined | Logistic p Value | |||||||||||||
| SNPs | Risk Allele | Control Frequency | Patient Frequency | pa value | Control Frequency | Patient Frequency | pb value | Control Frequency | Patient Frequency | PB Frequency | MB Frequency | pc value | Model | Unadjusted pd value | Adjusted pe value | OR(95%CI) |
| rs10945859 | Minor-C | 0.28 | 0.33 | 0.00020 | 0.16 | 0.23 | 0.013 | 0.26 | 0.31 | 0.31 | 0.32 | 0.00012, 0.0075, 0.00058 | CT+CCvsTT | 0.00053, 0.015, 0.0022 | 0.0039, 0.06, 0.0078 | 1.32(1.12–1.54), 1.29(1.05–1.59), 1.35(1.11–1.64) |
| rs9347683 | Minor-C | 0.28 | 0.33 | 0.00024 | 0.16 | 0.23 | 0.017 | 0.26 | 0.31 | 0.31 | 0.32 | 0.00018, 0.0094, 0.00075 | CA+CCvsAA | 0.00077, 0.015, 0.0035 | 0.0056, 0.062, 0.013 | 1.31(1.12–1.53), 1.29(1.05–1.59), 1.33(1.09–1.62) |
| rs9347684 | Minor-C | 0.27 | 0.31 | 0.0015 | 0.14 | 0.22 | 0.0057 | 0.25 | 0.30 | 0.30 | 0.30 | 0.00081, 0.013, 0.0043 | CT+CCvsTT | 0.0014, 0.01, 0.011 | 0.0083, 0.041, 0.034 | 1.29(1.1–1.51), 1.31(1.06–1.62), 1.28(1.05–1.56) |
| rs9346929 | Minor-A | 0.28 | 0.33 | 0.00066 | 0.15 | 0.23 | 0.0047 | 0.27 | 0.31 | 0.31 | 0.32 | 0.00033, 0.014, 0.0012 | AG+AAvsGG | 0.00064, 0.015, 0.0029 | 0.0047, 0.058, 0.01 | 1.31(1.12–1.54), 1.29(1.05–1.6), 1.34(1.1–1.63) |
| rs4709648 | Minor-C | 0.37 | 0.42 | 0.0014 | 0.22 | 0.31 | 0.0064 | 0.35 | 0.40 | 0.40 | 0.39 | 0.00067, 0.0042, 0.0092 | CG+CCvsGG | 0.012, 0.037, 0.051 | 0.055, 0.12, 0.13 | 1.23(1.04–1.44), 1.25(1.01–1.56), 1.21(0.99–1.48) |
| rs12215676 | Minor-G | 0.36 | 0.42 | 0.00021 | 0.22 | 0.31 | 0.0079 | 0.34 | 0.40 | 0.40 | 0.39 | 0.00012, 0.0015, 0.0026 | CG+GGvsCC | 0.0022, 0.01, 0.021 | 0.013, 0.032, 0.067 | 1.28(1.09–1.51), 1.33(1.07–1.65), 1.26(1.03–1.54) |
| rs10806765 | Minor-T | 0.28 | 0.33 | 0.00068 | 0.15 | 0.23 | 0.0039 | 0.27 | 0.31 | 0.31 | 0.32 | 0.00031, 0.01, 0.0015 | TC+TTvsCC | 0.00053, 0.012, 0.0029 | 0.0039, 0.045, 0.01 | 1.32(1.12–1.55), 1.31(1.06–1.61), 1.34(1.1–1.63) |
| rs6936373 | Minor-G | 0.36 | 0.42 | 0.00040 | 0.22 | 0.31 | 0.0089 | 0.35 | 0.40 | 0.40 | 0.39 | 0.00022, 0.0019, 0.0045 | GC+GGvsCC | 0.0043, 0.021, 0.023 | 0.023, 0.067, 0.07 | 1.26(1.07–1.48), 1.28(1.03–1.59), 1.26(1.03–1.53) |
| rs1333957 | Minor-A | 0.28 | 0.33 | 0.00031 | 0.15 | 0.23 | 0.0031 | 0.26 | 0.31 | 0.31 | 0.31 | 0.00013, 0.0064, 0.00078 | AC+AAvsCC | 0.00042, 0.0092, 0.0027 | 0.0034, 0.042, 0.0095 | 1.32(1.13–1.55), 1.32(1.07–1.62), 1.34(1.1–1.63) |
| rs9365492 | Minor-C | 0.27 | 0.33 | 0.000040 | 0.15 | 0.23 | 0.0038 | 0.25 | 0.31 | 0.30 | 0.32 | 0.000015, 0.0033, 0.000081 | CT+CCvsTT | 0.000034, 0.0033, 0.0003 | 0.00036, 0.013, 0.0015 | 1.39(1.19–1.63), 1.36(1.11–1.68), 1.43(1.17–1.74) |
| rs9355403 | Minor-A | 0.28 | 0.32 | 0.00051 | 0.15 | 0.23 | 0.0026 | 0.26 | 0.31 | 0.30 | 0.31 | 0.00018, 0.013, 0.00056 | AG+AAvsGG | 0.00062, 0.021, 0.0019 | 0.0047, 0.077, 0.0072 | 1.31(1.12–1.54), 1.27(1.03–1.57), 1.36(1.12–1.65) |
p and p value for 2×2 chi test for overall allelic frequencies comparison of samples from North Indian and samples from Orissa. p value for 2×2 chi test for overall allelic frequencies comparison of combined samples, PB and MB samples. p and p values for genotypic model by logistic regression for combines samples, PB and MB samples before and after adjustment for sex as a covariate. Bonferroni correction of 32 SNPs was applied for multiples testing. Out of total 96 SNPs tested 64 were in 14 bin set (r2>0.8) [data not shown].
Figure 2The association statistics of the 11 significant SNPs in the regulatory region of the PARK2 and PACRG genes; presented as negative logarithm of the P-Value and their linkage disequilibrium (LD) plot based on pairwise LD for r2 cut off value ≥0.80.
Figure 3A schematic lay-out of the BIN structure (r2≥0.80) in the regulatory region of the PARK2 and PACRG genes in North Indian and East Indian-Orissa and Vietnamese population for 41 SNPs spanning 148 Kb region of Chromosome 6q26, where 36 SNPs are common to both Vietnamese and Indian population and 5 significant SNPs (No. 20, 22, 23, 26, 32) are exclusively studied in the Indian population.
[It may be noted that a similar BIN structure was observed in the North-Indian and East-Indian-Orissa populations]. Physical location of the studied chromosomal region is given in Mb on top. Vietnamese population information of Mira et al, 2004 and common SNPs between Indian and Vietnamese population (Alter et al, 2012) was shared by Prof. Schurr. Rest of the SNPs & BIN structure information was retrieved from Alter et al (2012). SNPs in star shape indicate the significant association (in two respective populations-Indian and Vietnamese). 11 significantly associated SNPs in studied Indian populations are distributed in two BINs (BIN 1 with 8 and BIN 2 with 3 SNPs). Distribution of significant SNPs in Vietnamese population is shown in BIN 1, BIN 2 and BIN3. SNPs, rs10945859 (No. 1) and rs9347684 (No. 9), although shared significance in both the Indian and Vietnamese population, but these showed no significant difference in expression in in vitro reporter assay for the alternative alleles. Each SNP is designated by a No. ranging from 1 to 41 according to increasing order of the chromosomal position. Filled Black Star - Significant SNPs in Indian (North and East Indian-Orissa) population, Unfilled Star - Significant SNPs in Vietnamese population, Filled Black Dot - Non-Significant SNPs in North and East Indian-Orissa population, Unfilled Dot - Non-Significant SNPs in Vietnamese population, Black Circled Dot and Black Circled Star SNPs (No. 5, 7, 8, 24, 35) studied by us earlier [26].
Haplotype structure, haplotype frequencies, significant p values and odds ratio between patients versus healthy controls of 11 significantly associated SNPs.
| Haplotype | rs10945859 | rs9347683 | rs9347684 | rs9346929 | rs4709648 | rs12215676 | rs10806765 | rs6936373 | rs1333957 | rs9365492 | rs9355403 | Hap-Score | pa-val | pool.hf | control.hf | case.hf | glm.eff | OR.lower | OR | OR.upper |
| 1 | T | A | T | G | G | C | C | C | C | T | G | −3.14 |
| 0.62 | 0.63 | 0.59 | Base | NA | 1 | NA |
| 2 | C | C | T | A | C | G | T | G | A | C | A | 0.20 | 8.40E-01 | 0.01 | 0.01 | 0.01 | Eff | 0.70 | 1.16 | 1.93 |
| 3 | T | A | T | G | C | G | C | G | C | T | G | 0.41 | 6.84E-01 | 0.08 | 0.08 | 0.09 | Eff | 0.90 | 1.12 | 1.40 |
| 4 | C | C | C | A | C | G | T | G | A | C | A | 4.71 |
| 0.25 | 0.23 | 0.29 | Eff | 1.19 | 1.36 | 1.56 |
| 5 | C | A | C | A | C | C | T | C | A | C | A | NA | NA | 0.00 | 0.00 | NA | <NA> | NA | NA | NA |
Column: Hap-Score shows haplotype score statistic; Base, part of the baseline; Frequencies and disease association of haplotype of SNP alleles was tested using haplo.cc extended application of Haplo.stasts software (v1.4.4) which combines the results of haplo.score, haplo.group and haplo.glm. Haplotype frequency was computed by maximum likelihood estimates of haplotype probabilities with progressive insertion algorithm and haplo.cc computed score statistic to test association between haplotype and traits with adjustment for non-genetic covariates (sex).
pa Indicates the haplotype comparison statistics for patients vs controls.
Haplotype structure, haplotype frequencies, significant p values and odds ratio between patients versus healthy controls of 3 SNPs representing BIN-2 of Indian population.
| Haplotype | rs4709648 | rs12215676 | rs6936373 | Hap-Score | pa-val | pool.hf | control.hf | case.hf | glm.eff | OR.lower | OR | OR.upper |
| 1 | G | C | C | −3.56 |
| 0.63 | 0.65 | 0.59 | Base | NA | 1.00 | NA |
| 2 | C | G | G | 4.48 |
| 0.36 | 0.34 | 0.40 | Eff | 1.14 | 1.29 | 1.45 |
| 3 | C | C | C | NA | NA | 0.01 | 0.01 | 0.00 | R | 0.20 | 0.39 | 0.76 |
Column: Hap-Score shows haplotype score statistic; Base, part of the baseline; Frequencies and disease association of haplotype of SNP alleles was tested using haplo.cc extended application of Haplo.stasts software (v1.4.4) which combines the results of haplo.score, haplo.group and haplo.glm. Haplotype frequency was computed by maximum likelihood estimates of haplotype probabilities with progressive insertion algorithm and haplo.cc computed score statistic to test association between haplotype and traits with adjustment for non-genetic covariates (sex).
pa Indicates the haplotype comparison statistics for patients vs controls.
Haplotype structure, haplotype frequencies, significant p values and odds ratio between patients versus healthy controls of 8 SNPs representing BIN-1 of Indian population.
| Haplotype | rs10945859 | rs9347683 | rs9347684 | rs9346929 | rs10806765 | rs1333957 | rs9365492 | rs9355403 | Hap-Score | pa-val | pool.hf | control.hf | case.hf | glm.eff | OR.lower | OR | OR.upper |
| 1 | T | A | T | G | C | C | T | G | −3.33 |
| 0.71 | 0.72 | 0.68 | Base | NA | 1.00 | NA |
| 2 | C | C | T | A | T | A | C | A | 0.24 | 8.07E-01 | 0.01 | 0.01 | 0.01 | Eff | 0.70 | 1.17 | 1.94 |
| 3 | C | C | C | A | T | A | C | A | 4.68 |
| 0.25 | 0.23 | 0.29 | Eff | 1.18 | 1.34 | 1.53 |
| 4 | C | A | C | A | T | A | C | A | NA | NA | 0.00 | 0.00 | NA | <NA> | NA | NA | NA |
Column: Hap-Score shows haplotype score statistic; Base, part of the baseline; Frequencies and disease association of haplotype of SNP alleles was tested using haplo.cc extended application of Haplo.stasts software (v1.4.4) which combines the results of haplo.score, haplo.group and haplo.glm. Haplotype frequency was computed by maximum likelihood estimates of haplotype probabilities with progressive insertion algorithm and haplo.cc computed score statistic to test association between haplotype and traits with adjustment for non-genetic covariates (sex).
pa Indicates the haplotype comparison statistics for patients vs controls.
Figure 4Luciferase expression assay of upstream SNPs of PARK2 gene (rs9365492 (T/C) and rs9355403 (G/A)): C & A respectively represent risk allele for the SNP.
Bar with standard error shows the mean expression values in three different cell lines (HepG2, MCF7 and HeLa) for different Clones in PGL3 promoter vector: Clone1, with protective allele combination - rs9365492(T)-rs9355403(G); Clone2, with risk and protective allele combination - rs9365492(C)-rs9355403(G); Clone3, with protective and risk allele combination - rs9365492(T)-rs9355403(A) and Clone4, with risk allele combination - rs9365492(C)-rs9355403(A). P-Values for comparison of mean (one way ANOVA) expression between clones with different allele combination of 2 SNPs is also shown.