| Literature DB >> 22737082 |
Krzysztof Kiryluk1, Yifu Li, Simone Sanna-Cherchi, Mersedeh Rohanizadegan, Hitoshi Suzuki, Frank Eitner, Holly J Snyder, Murim Choi, Ping Hou, Francesco Scolari, Claudia Izzi, Maddalena Gigante, Loreto Gesualdo, Silvana Savoldi, Antonio Amoroso, Daniele Cusi, Pasquale Zamboli, Bruce A Julian, Jan Novak, Robert J Wyatt, Krzysztof Mucha, Markus Perola, Kati Kristiansson, Alexander Viktorin, Patrik K Magnusson, Gudmar Thorleifsson, Unnur Thorsteinsdottir, Kari Stefansson, Anne Boland, Marie Metzger, Lise Thibaudin, Christoph Wanner, Kitty J Jager, Shin Goto, Dita Maixnerova, Hussein H Karnib, Judit Nagy, Ulf Panzer, Jingyuan Xie, Nan Chen, Vladimir Tesar, Ichiei Narita, Francois Berthoux, Jürgen Floege, Benedicte Stengel, Hong Zhang, Richard P Lifton, Ali G Gharavi.
Abstract
IgA nephropathy (IgAN), major cause of kidney failure worldwide, is common in Asians, moderately prevalent in Europeans, and rare in Africans. It is not known if these differences represent variation in genes, environment, or ascertainment. In a recent GWAS, we localized five IgAN susceptibility loci on Chr.6p21 (HLA-DQB1/DRB1, PSMB9/TAP1, and DPA1/DPB2 loci), Chr.1q32 (CFHR3/R1 locus), and Chr.22q12 (HORMAD2 locus). These IgAN loci are associated with risk of other immune-mediated disorders such as type I diabetes, multiple sclerosis, or inflammatory bowel disease. We tested association of these loci in eight new independent cohorts of Asian, European, and African-American ancestry (N = 4,789), followed by meta-analysis with risk-score modeling in 12 cohorts (N = 10,755) and geospatial analysis in 85 world populations. Four susceptibility loci robustly replicated and all five loci were genome-wide significant in the combined cohort (P = 5×10⁻³²-3×10⁻¹⁰), with heterogeneity detected only at the PSMB9/TAP1 locus (I² = 0.60). Conditional analyses identified two new independent risk alleles within the HLA-DQB1/DRB1 locus, defining multiple risk and protective haplotypes within this interval. We also detected a significant genetic interaction, whereby the odds ratio for the HORMAD2 protective allele was reversed in homozygotes for a CFHR3/R1 deletion (P = 2.5×10⁻⁴). A seven-SNP genetic risk score, which explained 4.7% of overall IgAN risk, increased sharply with Eastward and Northward distance from Africa (r = 0.30, P = 3×10⁻¹²⁸). This model paralleled the known East-West gradient in disease risk. Moreover, the prediction of a South-North axis was confirmed by registry data showing that the prevalence of IgAN-attributable kidney failure is increased in Northern Europe, similar to multiple sclerosis and type I diabetes. Variation at IgAN susceptibility loci correlates with differences in disease prevalence among world populations. These findings inform genetic, biological, and epidemiological investigations of IgAN and permit cross-comparison with other complex traits that share genetic risk loci and geographic patterns with IgAN.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22737082 PMCID: PMC3380840 DOI: 10.1371/journal.pgen.1002765
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Replication Study Results and Combined Meta-Analysis.
| Replication StudyN = 4,789 across 8 cohorts(2,228 cases/2,561 controls) | Replication and GWASN = 10,755 across 12 cohorts(5,372 cases/5,383 controls) | ||||||||||||||
| Fixed Effects | Random Effects | Fixed Effects | Random Effects | ||||||||||||
| Chr | Location (kb) | SNP (minor allele) | OR | P-value | OR | P-value | I2 | Q-test | OR | P-value | OR | P-value | I2 | Q-test | Annotation |
| 1 | 194,918 |
| 0.78 | 2.5×10−4 | 0.78 | 4.2×10−4 | 0% | 0.84 (NS) | 0.78 | 7.9×10−8 | 0.78 | 1.3×10−7 | 6% | 0.39 (NS) |
|
| 1 | 194,953 |
| 0.78 | 3.1×10−5 | 0.78 | 5.5×10−5 | 0% | 0.48 (NS) | 0.74 | 2.1×10−13 | 0.74 | 4.6×10−13 | 21% | 0.23 (NS) | |
| 6 | 32,768 |
| 0.75 | 3.6×10−11 | 0.75 | 7.1×10−11 | 0% | 0.67 (NS) | 0.72 | 8.5×10−30 | 0.72 | 2.8×10−29 | 0% | 0.69 (NS) |
|
| 6 | 32,778 |
| 0.86 | 1.1×10−3 | 0.86 | 1.8×10−3 | 0% | 0.71 (NS) | 0.77 | 6.6×10−16 | 0.78 | 7.3×10−16 | 29% | 0.16 (NS) | |
| 6 | 32,779 |
| 1.22 | 5.0×10−5 | 1.22 | 8.7×10−5 | 19% | 0.27 (NS) | 1.28 | 2.6×10−14 | 1.26 | 4.6×10−14 | 30% | 0.14 (NS) | |
| 6 | 32,789 |
| 0.75 | 5.3×10−9 | 0.75 | 9.5×10−9 | 0% | 0.60 (NS) | 0.67 | 5.0×10−32 | 0.67 | 3.1×10−32 | 43% | 0.05 (NS) | |
| 6 | 32,917 |
| 0.96 | 5.8×10−1 | 0.97 | 9.4×10−2 | 54% | 0.025 | 0.79 | 1.1×10−8 | 0.87 | 2.6×10−11 | 70% | 1.0×10−4
|
|
| 6 | 32,919 |
| 0.91 | 1.7×10−1 | 0.92 | 1.2×10−1 | 43% | 0.08 (NS) | 0.78 | 5.7×10−10 | 0.84 | 4.0×10−11 | 61% | 2.0×10−3
| |
| 6 | 33,194 |
| 0.87 | 3.1×10−3 | 0.87 | 5.0×10−3 | 0% | 0.96 (NS) | 0.82 | 3.0×10−10 | 0.82 | 5.9×10−10 | 0% | 0.86 (NS) |
|
| 6 | 33,205 |
| 0.89 | 1.1×10−2 | 0.89 | 1.7×10−2 | 0% | 0.75 (NS) | 0.83 | 2.5×10−9 | 0.83 | 4.6×10−9 | 0% | 0.51 (NS) | |
| 22 | 28,824 |
| 0.81 | 1.1×10−6 | 0.81 | 2.1×10−6 | 24% | 0.23 (NS) | 0.80 | 4.0×10−15 | 0.80 | 9.5×10−15 | 12% | 0.33 (NS) |
|
| 22 | 28,859 |
| 0.81 | 6.9×10−7 | 0.81 | 1.2×10−6 | 29% | 0.19 (NS) | 0.80 | 9.9×10−15 | 0.80 | 2.3×10−14 | 16% | 0.29 (NS) | |
Combined association results for 12 SNPs representing 5 independent regions that reached genome-wide significance in the original GWAS. The combined effect estimates (per allele odds ratios) in the replication cohorts were all direction-consistent with the ones in the original GWAS cohorts. Significant heterogeneity was noted only for the second HLA locus represented by rs9357155 and rs2071543.
Q-test: P-value for the Cochrane's Q statistic for heterogeneity, NS: heterogeneity test not significant,
*: heterogeneity P<0.05,
**: heterogeneity P<0.01;
I2: Heterogeneity Index (0–100%), where <25% corresponds to low, 50%–75% to medium, and >75% to high level of heterogeneity;
OR: Additive (per-allele) Odds Ratio;
Han and Eskin random effects model.
Conditional analysis of the HLA-DQB1, HLA-DQA1, HLA-DRB1 locus.
| Replication StudyN = 4,789 across 8 cohorts(2,228 cases/2,561 controls) | Replication and GWASN = 10,755 across 12 cohorts(5,372 cases/5,383 controls) | ||||||||
| UNADJUSTED | CONDITIONED | UNADJUSTED | CONDITIONED | ||||||
| OR | P-value | OR | P-value | OR | P-value | OR | P-value | CONDITIONING SNPs | |
|
| 0.75 | 4×10−11 | 0.71 | 2×10−6 | 0.72 | 9×10−30 | 0.75 | 7×10−10 | rs2856717, rs9275424, rs9275596 |
|
| 0.86 | 1×10−3 | 1.72 | 1×10−6 | 0.77 | 7×10−16 | 1.61 | 2×10−10 | rs9275224, rs9275424, rs9275596 |
|
| 1.22 | 5×10−5 | 1.06 | 3×10−1 | 1.28 | 3×10−14 | 1.11 | 7×10−3 | rs9275224, rs2856717, rs9275596 |
|
| 0.75 | 5×10−9 | 0.64 | 2×10−6 | 0.67 | 5×10−32 | 0.58 | 3×10−16 | rs9275224, rs2856717, rs9275424 |
Haplotype analysis of rs9275224, rs2856717, rs9275424, and rs9275596 at the HLA-DQB1/DRB1 locus.
| All Cohorts: N = 10,755 (5,372 cases/5,383 controls) | ||||||
| Freq. Overall | Freq. Cases | Freq. Controls | OR | 95%CI | P-global | |
|
| 0.352 | 0.365 | 0.338 | -reference- | -reference- | 3×10−43 |
|
| 0.213 | 0.180 | 0.245 | 0.69 | 0.64–0.74 | |
|
| 0.130 | 0.119 | 0.141 | 0.78 | 0.71–0.85 | |
|
| 0.050 | 0.058 | 0.043 | 1.25 | 1.10–1.42 | |
|
| 0.246 | 0.270 | 0.222 | 1.12 | 1.04–1.20 | |
The most common haplotype of 4 major alleles (GCAT) is used as a reference to derive odds ratios for all other haplotypes. Only common haplotypes (frequency >1%) are tested for association.
The best predictive model for IgAN based on all the genotyped SNPs and their pairwise interaction terms.
| Best Predictive Model | |||||
| Predictor (Reference Allele) | Coeficient (β) | OR (95%CI) | P-value | Chr. | Annotation of Genes in the Region |
|
| −0.49371 | 0.61 (0.53–0.71) | 2.2×10−11 | 1q32 |
|
|
| −0.31307 | 0.73 (0.67–0.80) | 2.5×10−11 | 6p21 |
|
|
| 0.42265 | 1.53 (1.31–1.78) | 8.2×10−8 | 6p21 |
|
|
| −0.51157 | 0.60 (0.52–0.69) | 5.9×10−13 | 6p21 |
|
|
| −0.28621 | 0.75 (0.69–0.82) | 3.8×10−10 | 6p21 |
|
|
| −0.1805 | 0.83 (0.78–0.90) | 4.8×10−7 | 6p21 |
|
|
| −0.28592 | 0.75 (0.70–0.81) | 2.3×10−15 | 22q12 |
|
|
| 0.23171 | 1.26 (1.12–1.43) | 2.2×10−4 | – |
|
This model represents the solution of a stepwise logistic regression algorithm (BIC-based stepwise model selection). The coefficients from this model are used to refine the risk score for IgAN.
Figure 1Multiplicative interaction between Chr. 22q12 (rs2412971) and Chr. 1q32 (rs6677604) loci.
The allelic effects of rs2412971-A by genotype class of rs9275596 (top signal in the HLA, no interaction) and rs6677604 (top signal in at CFHR1/R3 locus on Chr. 1q32, significant interaction). The protective effect of rs2412971-A allele is reversed in homozygotes for the rs6677604-A allele, which tags a deletion in CFHR3/R1. The allelic effects are expressed on the log-odds scale and correspond to beta coefficients of the logistic regression model. Error bars correspond to 95% confidence intervals.
The comparison of the original and the newly refined IgAN risk score.
| Original Risk Score | Newly Refined Risk Score | ||||||||
| Cohort: | N | R2
| C | OR | P-value | R2
| C | OR | P-value |
|
| 5,631 | 5.0% | 0.61 | 1.51 | 3.1×10−46 | 5.7% | 0.62 | 1.56 | 4.1×10−52 |
|
| 4,422 | 2.2% | 0.58 | 1.29 | 5.4×10−17 | 3.2% | 0.59 | 1.36 | 3.3×10−24 |
|
| 4,582 | 4.5% | 0.60 | 1.53 | 3.0×10−34 | 5.0% | 0.61 | 1.52 | 2.6×10−38 |
|
| 5,386 | 2.6% | 0.58 | 1.34 | 3.7×10−24 | 3.6% | 0.59 | 1.42 | 6.7×10−33 |
|
| 10,053 | 3.8% | 0.60 | 1.42 | 6.2×10−63 | 4.7% | 0.61 | 1.47 | 1.2×10−76 |
The expanded version of this table can be found in supplemental material (Table S7).
Number of analyzed individuals with 100% non-missing genotypes across all 7 scored loci.
*: 2: Nagelkerke R square (expressed as percentage).
**: C-statistic: area under the ROC curve.
***: Odds ratio per one standard deviation of the standardized risk score.
****: Wald's test for risk score as a quantitative predictor of disease status.
Figure 2Differences in the distribution of the 7-SNP genetic risk score by ethnicity.
Only healthy control participants of the replication studies that were fully genotyped at all 7 loci were used in this analysis. Similar to the GWAS study, the risk score distributions were significantly different by ethnicity (ANOVA p = 2.1×10−38). The corresponding differences in the distribution of risk alleles are depicted in Figure S1.
Figure 3Worldwide geospatial risk analysis.
Surface interpolation of the standardized risk score over Africa and Euroasia (main), and Americas (inset). Symbols represent the locations of sampled populations: HGDP (circles), HapMap-III (diamonds), and healthy controls from this study (triangles).
Figure 4Correlation of average country latitude with country-specific genetic risk and IgAN–attributable ESRD across the European continent.
The South to North latitude is indicated on the X-axis. The median genetic risk (x) is indicated on the right Y-axis. The following incidence and prevalence metrics (o) are indicated on the left Y-axis: (panel a) the incidence of ESRD due to IgAN per million population (correlation with latitude: r = 0.54, p = 0.05); (panel b) the prevalence of ESRD due to IgAN per million population (correlation with latitude: r = 0.47, p = 0.10); (panel c) the percent of IgAN patients among all ESRD cases (correlation with latitude: r = 0.67, p = 0.01); and (panel d) among ESRD cases due to primary glomerular disease (correlation with latitude: r = 0.71, p = 0.006). All p-values are derived based on a two-sided hypothesis test.
Figure 5High-resolution geospatial risk analysis for Italy.
A well defined region of higher genetic risk was uncovered in Northern Italy that centers on Valtrompia, Brescia, and Cremona (median standardized risk scores 0.31, 0.24 and 0.24, respectively). The healthy individuals from Valtrompia had the highest risk scores when compared to 16 other Italian populations sampled.
Figure 6Genetic risk and IgAN–attributable ESRD among major US ethnicities.
The relationship between IgAN risk scores (red line) and IgAN incidence and prevalence (bars) among US ethnicities are shown. The following metrics of IgAN occurrence are depicted: (panel a) the incidence of ESRD due to IgAN per million population by ethnicity, (panel b) the prevalence of ESRD due to IgAN per million population by ethnicity, (panel c) percent of IgAN among the total ESRD population by ethnicity; and (panel d) percent of IgAN among ESRD due to glomerular disease by ethnicity.