| Literature DB >> 22096524 |
Erin M Hill-Burns1, Stewart A Factor, Cyrus P Zabetian, Glenys Thomson, Haydeh Payami.
Abstract
Parkinson's disease (PD) was recently found to be associated with HLA in a genome-wide association study (GWAS). Follow-up GWAS's replicated the PD-HLA association but their top hits differ. Do the different hits tag the same locus or is there more than one PD-associated variant within HLA? We show that the top GWAS hits are not correlated with each other (0.00≤r(2)≤0.15). Using our GWAS (2000 cases, 1986 controls) we conducted step-wise conditional analysis on 107 SNPs with P<10(-3) for PD-association; 103 dropped-out, four remained significant. Each SNP, when conditioned on the other three, yielded P(SNP1) = 5×10(-4), P(SNP2) = 5×10(-4), P(SNP3) = 4×10(-3) and P(SNP4) = 0.025. The four SNPs were not correlated (0.01≤r(2)≤0.20). Haplotype analysis (excluding rare SNP2) revealed increasing PD risk with increasing risk alleles from OR = 1.27, P = 5×10(-3) for one risk allele to OR = 1.65, P = 4×10(-8) for three. Using additional 843 cases and 856 controls we replicated the independent effects of SNP1 (P(conditioned-on-SNP4) = 0.04) and SNP4 (P(conditioned-on-SNP1) = 0.04); SNP2 and SNP3 could not be replicated. In pooled GWAS and replication, SNP1 had OR(conditioned-on-SNP4) = 1.23, P(conditioned-on-SNP4) = 6×10(-7); SNP4 had OR(conditioned-on-SNP1) = 1.18, P(conditioned-on-SNP1) = 3×10(-3); and the haplotype with both risk alleles had OR = 1.48, P = 2×10(-12). Genotypic OR increased with the number of risk alleles an individual possessed up to OR = 1.94, P = 2×10(-11) for individuals who were homozygous for the risk allele at both SNP1 and SNP4. SNP1 is a variant in HLA-DRA and is associated with HLA-DRA, DRB5 and DQA2 gene expression. SNP4 is correlated (r(2) = 0.95) with variants that are associated with HLA-DQA2 expression, and with the top HLA SNP from the IPDGC GWAS (r(2) = 0.60). Our findings suggest more than one PD-HLA association; either different alleles of the same gene, or separate loci.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22096524 PMCID: PMC3212531 DOI: 10.1371/journal.pone.0027109
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
HLA SNPs that have shown the most significant associations with PD in three GWAS's.
| GWAS | Ncases | Ncontrols | SNP | Gene | BP | MAFcases | MAFcontrols | OR | P |
| NGRC | 2000 | 1986 | rs3129882 |
| 32517508 | 0.46 | 0.40 | 1.31 | 3×10−8 |
| Dutch | 772 | 2024 | rs4248166 |
| 32474399 | 0.21 | 0.17 | 1.36 | 4×10−5 |
| IPDGC | 5333 | 12019 | chr6:32588205 | 3′ of | 32588205 | (0.15 unspecified) | 0.70 | 3×10−8 |
Figure 1Linkage disequilibrium (LD) among top HLA associations from three PD GWAS's.
LD was measured as D′ (panel A) and as r2 (panel B) among rs3129882 from the NGRC GWAS [1], rs4248166 from Dutch GWAS [5], and 6–32588205 from IPDGC meta analysis [6]. Numbers in cells are % D′ (panel A) and % r2 (panel B). The red cells with no number reflects D′ = 1.0.
Step-wise conditional analysis.
| SNP | BP | Minor/MajorAllele | MAFcases | MAFcontrols | HWEP | Unconditioned (GWAS results for SNPs that survived conditional analysis, see | Conditioned on SNP1 | Conditioned on SNP1 & SNP2 | Conditioned on SNP1 & SNP2 & SNP3 | |||||
| OR | P | OR | P | OR | P | OR | P | |||||||
| SNP1 | rs3129882 | 32517508 |
| 0.46 | 0.40 | 0.82 | 1.31 | 3×10−8 | ||||||
| SNP2 | rs3993757 | 31698725 |
| 0.03 | 0.02 | 1.00 | 1.79 | 7×10−4 | 1.70 | 2×10−3 | ||||
| SNP3 | rs2844505 | 31547042 |
| 0.29 | 0.25 | 0.01 | 1.23 | 9×10−5 | 1.15 | 0.011 | 1.16 | 6×10−3 | ||
| SNP4 | rs9268515 | 32487273 | C/ | 0.16 | 0.20 | 0.33 | 1.25 | 4×10−4 | 1.14 | 0.049 | 1.16 | 0.028 | 1.16 | 0.025 |
| Number of SNPs remaining significant at the end of each round | 107 with P<10−3 in GWAS ( | 16 with P<0.05 | 13 with P<0.05 | 1 with P<0.05 | ||||||||||
Step-wise conditional analysis was performed for 107 SNPs in the HLA region that achieved P<10−3 in GWAS. The full analysis is shown in . Here, we show the summary results for the four SNPs that remained significant after conditioning on the other significant SNPs. For consistency, we show all odds ratios (OR) on the positive side (i.e., testing risk allele against the alternate allele). The risk allele at each SNP is shown in bold. All association tests were adjusted for age at enrollment, sex, and PC1 and PC2 (principal components that define Jewish/non-Jewish origin and the European country of ancestry). Once the four SNPs that retain conditioned P<0.05 were identified, we re-tested association of each of the SNPs with PD conditioning on the other three. We obtained P = 5×10−4 for SNP1 conditioned on SNP2 and SNP3 and SNP4, P = 5×10−4 for SNP2 conditioned on SNP1 and SNP3 and SNP4, P = 4×10−3 for SNP3 conditioned on SNP1 and SNP2 and SNP4, and P = 0.025 for SNP4 conditioned on SNP1 and SNP2 and SNP3. BP = base pair position of the SNP on chromosome 6. Minor/major allele = the two alternative nucleotides at the SNP, the one with higher frequency denoted as major allele. MAF = minor allele frequency. HWE P = P value for the test of Hardy-Weinberg Equilibrium.
Figure 2LD among the four significant SNPs from NGRC conditional analysis and the most significant HLA SNPs from other PD GWAS's.
LD was measured as D′ (panel A) and as r2 (panel B). NGRC SNPs are numbered 1–4 and marked in red frame. The most significant HLA SNP from other GWAS's are indicated by rs number and the study name. Numbers in cells are % D′ (panel A) and % r2 (panel B). Red cell without a number reflects D′ = 1, blue cells without a number reflect D′ = 0.
Haplotype analysis.
| Haplotypes SNP1 (rs3129882) SNP3 (rs2844505) SNP4 (rs9268515) | Freq.Cases | Freq.Controls | OR (95% CI) | P |
| AAC | 0.11 | 0.15 | Reference | |
| AA | 0.33 | 0.36 | 1.27 (1.07–1.50) | 5×10−3 |
| A | 0.03 | 0.03 | 1.34 (0.93–1.91) | 0.11 |
|
| 0.02 | 0.01 | 1.75 (1.07–2.87) | 0.03 |
| A | 0.07 | 0.06 | 1.56 (1.22–1.99) | 4×10−4 |
|
| 0.006 | 0.007 | - | - |
|
| 0.26 | 0.23 | 1.57 (1.32–1.86) | 2×10−7 |
|
| 0.18 | 0.15 | 1.65 (1.38–1.97) | 4×10−8 |
Haplotypes were composed of SNP1, SNP3 and SNP4, shown in that order. These SNPs and SNP2 survived conditional analysis at P<0.05. SNP2 had low frequency and any haplotype with minor allele of SNP2 was too infrequent (<0.01) to be included in haplotype association tests. For each SNP, the allele that was associated with higher risk for PD (see ) is shown in bold. OR and P values were calculated for each haplotype relative to AAC haplotype which has the lowest risk for PD. Analyses were adjusted for age at enrollment, sex, PC1, and PC2 (principal components that define Jewish/non-Jewish origin and the European country of ancestry). AAG consists of major alleles for each SNP; GGC consists of minor alleles for each SNP. Haplotype GGC was not tested because its frequency was <0.01.
Replication of conditional analysis.
| Minor/MajorAllele | Replication 843 cases, 856 controls | Pooled NGRC & Replication 2843 cases, 2842 controls | |||||||||||||||||||
| MAFCase | MAFControl | HWEP | Unconditioned | Conditioned on SNP1 | Conditioned on SNP4 | MAFCase | MAFControl | HWEP | Unconditioned | Conditioned on SNP1 | Conditioned on SNP4 | ||||||||||
| SNP | BP | OR | P | OR | P | OR | P | OR | P | OR | P | OR | P | ||||||||
| SNP1 | rs3129882 | 32517508 |
| 0.45 | 0.41 | 0.69 | 1.20 | 0.006 | - | - | 1.15 | 0.04 | 0.46 | 0.40 | 0.98 | 1.28 | 3×10−10 | - | - | 1.23 | 6×10−7 |
| SNP3 | rs2844505 | 31547042 |
| 0.26 | 0.24 | 0.18 | 1.04 | 0.32 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| SNP4 | rs9268515 | 32487273 | C/ | 0.20 | 0.23 | 0.29 | 1.25 | 0.004 | 1.19 | 0.04 | - | - | 0.17 | 0.21 | 0.96 | 1.28 | 2×10−6 | 1.18 | 3×10−3 | - | - |
*Odds ratios (OR) were calculated for the risk allele (shown in bold).
In the replication dataset, SNP1 and SNP3 were genotyped, SNP2 was not genotyped and did not impute, SNP4 was imputed (info score = 0.95). We used imputed genotype probabilities in the R software, adjusting for age and sex for replication, and adjusting for age, sex and study for pooled data. Replication P values are one sided [28] given the directionality of the hypothesis; P values for pooled NGRC and replication are two sided.
Haplotype association test of SNP1 (rs3129882) and SNP4 (rs9268515) with PD in NGRC and Replication.
| Haplotype SNP1 (rs3129882) SNP4 (rs9268515) | NGRC | Replication | Pooled NGRC + Replication | |||||||||
| Freq incasesN = 4000 | Freq incontrolsN = 3972 | OR(95%CI) | P | Freq incasesN = 1686 | Freq incontrolsN = 1712 | OR(95%CI) | P | Freq incasesN = 5686 | Freq incontrolsN = 5684 | OR(95% CI) | P | |
| AC | 0.14 | 0.18 | reference | 0.17 | 0.22 | Reference | 0.15 | 0.19 | reference | |||
|
| 0.02 | 0.02 | 1.59(1.01–2.51) | 0.05 | 0.02 | 0.02 | 1.71(0.85–3.44) | 0.07 | 0.02 | 0.02 | 1.63(1.11–2.39) | 0.01 |
| A | 0.40 | 0.43 | 1.23(1.07–1.42) | 3×10−3 | 0.38 | 0.38 | 1.28(1.04–1.57) | 0.01 | 0.40 | 0.41 | 1.25(1.11–1.40) | 2×10−4 |
|
| 0.44 | 0.38 | 1.51(1.32–1.72) | 10−9 | 0.43 | 0.39 | 1.41(1.16–1.71) | 2×10−4 | 0.44 | 0.38 | 1.48(1.33–1.65) | 2×10−12 |
OR and P values were calculated for each haplotype relative to the low-risk haplotype (AC) for association with PD. N = number of chromosomes. The bolded alleles are associated with higher risk. Freq = haplotype frequency. P values are two sided for NGRC, one-sided for replication.
Additive effects of SNP1 and SNP4 genotypes.*
| SNP1 genotype & SNP4 genotype | N case | N control | OR (95% CI) | P |
| AA & C_ | 362 | 535 | Ref | ref |
| AA & | 439 | 469 | 1.37 (1.24–1.51) | 0.001 |
| A | 436 | 461 | 1.42 (1.28–1.56) | 4×10−4 |
| A | 959 | 865 | 1.61 (1.48–1.76) | 2×10−8 |
|
| 47 | 38 | 1.78 (1.41–2.26) | 0.01 |
|
| 531 | 414 | 1.94 (1.76–2.14) | 2×10−11 |
*NGRC and replication are pooled.
The C allele of SNP4 is rare; therefore we combined CC and CG genotypes and denoted as C_.
N = number of individuals with each genotype combination. Risk alleles are shown in bold. Tests were adjusted for age and sex.