| Literature DB >> 30085094 |
Ken B Hanscombe1, David L Morris1, Janelle A Noble2, Alexander T Dilthey3, Philip Tombleson1, Kenneth M Kaufman4, Mary Comeau5, Carl D Langefeld5, Marta E Alarcon-Riquelme6,7, Patrick M Gaffney8, Chaim O Jacob9, Kathy L Sivils8, Betty P Tsao10, Graciela S Alarcon11, Elizabeth E Brown12, Jennifer Croker13, Jeff Edberg11, Gary Gilkeson14, Judith A James8,15, Diane L Kamen14, Jennifer A Kelly8, Joseph McCune16, Joan T Merrill17, Michelle Petri18, Rosalind Ramsey-Goldman19, John D Reveille20, Jane E Salmon21, Hal Scofield8,22, Tammy Utset23, Daniel J Wallace15, Michael H Weisman15, Robert P Kimberly11, John B Harley4, Cathryn M Lewis1,24, Lindsey A Criswell25, Timothy J Vyse1.
Abstract
Genetic variation within the major histocompatibility complex (MHC) contributes substantial risk for systemic lupus erythematosus, but high gene density, extreme polymorphism and extensive linkage disequilibrium (LD) have made fine mapping challenging. To address the problem, we compared two association techniques in two ancestrally diverse populations, African Americans (AAs) and Europeans (EURs). We observed a greater number of Human Leucocyte Antigen (HLA) alleles in AA consistent with the elevated level of recombination in this population. In EUR we observed 50 different A-C-B-DRB1-DQA-DQB multilocus haplotype sequences per hundred individuals; in the AA sample, these multilocus haplotypes were twice as common compared to Europeans. We also observed a strong narrow class II signal in AA as opposed to the long-range LD observed in EUR that includes class I alleles. We performed a Bayesian model choice of the classical HLA alleles and a frequentist analysis that combined both single nucleotide polymorphisms (SNPs) and classical HLA alleles. Both analyses converged on a similar subset of risk HLA alleles: in EUR HLA- B*08:01 + B*18:01 + (DRB1*15:01 frequentist only) + DQA*01:02 + DQB*02:01 + DRB3*02 and in AA HLA-C*17:01 + B*08:01 + DRB1*15:03 + (DQA*01:02 frequentist only) + DQA*02:01 + DQA*05:01+ DQA*05:05 + DQB*03:19 + DQB*02:02. We observed two additional independent SNP associations in both populations: EUR rs146903072 and rs501480; AA rs389883 and rs114118665. The DR2 serotype was best explained by DRB1*15:03 + DQA*01:02 in AA and by DRB1*15:01 + DQA*01:02 in EUR. The DR3 serotype was best explained by DQA*05:01 in AA and by DQB*02:01 in EUR. Despite some differences in underlying HLA allele risk models in EUR and AA, SNP signals across the extended MHC showed remarkable similarity and significant concordance in direction of effect for risk-associated variants.Entities:
Mesh:
Year: 2018 PMID: 30085094 PMCID: PMC6196648 DOI: 10.1093/hmg/ddy280
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 5.121
Figure 1Association signal across the extended MHC region in EUR and AA data. In each panel, the title contains the most significant genetic marker and its P-value. A small black dot indicates the most significant marker. (A) LD with the most significant SNP in EUR. In EUR, a high level of LD exists across the entire extended MHC. (B) LD with the most significant SNP in AA. In AA, LD with the most significant SNP is restricted to a single peak in class II. (C) Association signal in class I and class II classical HLA alleles in EUR. The classical HLA alleles reflect the signal of the greyed-out SNPs from which they were imputed. (D) Association signal in classes I and II classical HLA alleles in AA.
Figure 2Models of association across HLA alleles. (A) Bayesian model choice fit using RJMCMC. (B-i) Most associated class II haplotypes, (B-ii) models with lowest AIC and BIC comprising classes I and II alleles. (B-iii) Stepwise regression starting from the alleles in B-ii. Alleles in LD with HLA-DRB1 (DR3) are coloured in red, and alleles in LD with HLA-DRB1 (DR2) are coloured in purple.
AA association results for HLA alleles class II led stepwise regression
| Conditional results (multiple regression) | Single-marker results | |||||
|---|---|---|---|---|---|---|
| ALLELE | Odds Ratio | 95% C.I. | P | Odds Ratio | 95% C.I. | P |
| HLA-C*17:01 | 1.42 | 1.21–1.65 | 7.40E-06 | 1.25 | 1.08–1.42 | 2.64E-03 |
| HLA-B*08:01 | 1.75 | 1.41–2.17 | 1.01E-07 | 1.93 | 1.59–2.35 | 9.74E-12 |
| HLA-DRB1*15:03 | 2.03 | 1.73–2.37 | 1.47E-18 | 1.86 | 1.65–2.09 | 9.72E-26 |
| HLA-DQA*01:02 | 1.13 | 0.98–1.29 | 8.64E-02 | 1.23 | 1.11–1.36 | 5.22E-06 |
| HLA-DQA*02:01 | 0.31 | 0.19–0.49 | 1.14E-06 | 0.90 | 0.78–1.02 | 1.24E-01 |
| HLA-DQA*05:01 | 1.46 | 1.27–1.67 | 4.11E-09 | 1.38 | 1.22–1.54 | 9.52E-09 |
| HLA-DQA*05:05 | 0.17 | 0.07–0.35 | 4.74E-06 | 0.19 | 0.09–0.37 | 1.47E-06 |
| HLA-DQB*03:19 | 1.65 | 1.38–1.96 | 2.01E-09 | 1.62 | 1.38–1.89 | 9.85E-10 |
| HLA-DQB*02:02 | 6.23 | 3.53–11.0 | 1.50E-10 | 1.11 | 0.92–1.31 | 2.63E-01 |
Alleles in LD with HLA-DRB1 (DR3) are coloured in red, and alleles in LD with HLA-DRB1 (DR2) are coloured in purple. AIC = 7119, BIC = 7195.
Figure 3Class II informed conditional analyses. EUR (A) and AA (B) SNP and classical HLA allele association signals after conditioning on the best class II model (EUR: HLA-DRB1; AA: HLA-DRB1). EUR (C) and AA (D) SNP and classical HLA allele association signals after conditioning on the best class II + I model (EUR: HLA-DRB1; AA: HLA-DRB1). EUR (E) and AA (F) SNP and classical HLA allele association signals after conditioning on the best overall HLA model (EUR: HLA-DRB1; AA: HLA-DRB1).
Association results for amino acid data from stepwise regression in the AA data
| Conditional results (multiple regression) | Single-marker results | HLA alleles specific to amino acid | |||||
|---|---|---|---|---|---|---|---|
| ALLELE | OR | 95% C.I. | P | OR | 95% C.I. | P | |
| DRB1 71A | 2.22 | 1.97–2.49 | 2.82E-40 | 1.72 | 1.55–1.91 | 1.07E-24 | DRB1*15:01, :02, :03 |
| DQB -18A | 1.63 | 1.48–1.80 | 1.19E-22 | 1.20 | 1.10–1.30 | 3.34E-05 | DQB*02:01, :02, :03, 03:01, :04, :09, *03:19, *06:01 |
| B 156D | 1.57 | 1.40–1.76 | 2.20E-15 | 1.42 | 1.28–1.58 | 1.80E-10 | B*08:01, *37:01, :41, :42, :45, :82 |
| DQA -13 T | 0.11 | 0.05–0.22 | 2.26E-09 | 0.19 | 0.09–0.37 | 1.47E-06 | DQA*05:05 |
a These are the HLA alleles that are specific to the amino acid, for example, the only HLA alleles observed in our data that code for DRB-71-A are DRB1
Association results for SNPs from stepwise regression in the AA data. AIC = 7176, BIC = 7224
| Conditional results (multiple regression) | Single-marker results | ||||||
|---|---|---|---|---|---|---|---|
| SNP | Effect (Other) Allele | OR | 95% C.I. | P | OR | 95% C.I. | P |
| RS9271413 | G (A) | 2.07 | 1.84–2.33 | 4.69E-34 | 1.72 | 1.56–1.91 | 8.30E-26 |
| RS9273481 | C (G) | 1.45 | 1.32–1.60 | 3.80E-14 | 1.19 | 1.09–1.29 | 6.56E-05 |
| 6:31323500:D | D (I) | 1.48 | 1.31–1.68 | 7.79E-10 | 1.47 | 1.30–1.65 | 2.28E-10 |
| RS115549526 | T (C) | 1.60 | 1.36–1.18 | 1.33E-08 | 2.13 | 1.83–2.49 | 6.12E-22 |
Figure 4Work flow chart for analyses strategy. To determine the best model of association over HLA alleles, a Bayesian approach to model selection (left) was taken in parallel to a classical model choice (right). Following this a classical stepwise regression on SNPs was performed conditional on HLA alleles returned by the model choice.
EUR association results for HLA alleles in models of association from class II led stepwise regression
| Conditional results (multiple regression) | Single marker results | |||||
|---|---|---|---|---|---|---|
| Stepwise regression | ||||||
| ALLELE | OR | 95% C.I. | P | OR |
|
|
| HLA-B*08:01 | 1.63 | 1.45–1.83 | 1.13E-14 | 2.41 | 2.22–2.60 | 1.47E-93 |
| HLA-B*18:01 | 1.40 | 1.24–1.58 | 1.64E-07 | 1.28 | 1.14–1.44 | 3.03E-05 |
| HLA-DRB1*15:01 | 1.20 | 1.04–1.37 | 7.55E-03 | 1.32 | 1.22–1.43 | 4.53E-11 |
| HLA-DQA*01:02 | 1.27 | 1.13–1.42 | 6.74E-05 | 1.27 | 1.17–1.37 | 6.36E-11 |
| HLA-DQB*02:01 | 1.84 | 1.63–2.07 | 1.29E-24 | 2.32 | 2.14–2.50 | 4.34E-95 |
| HLA-DRB3*02 | 0.76 | 0.70–0.82 | 1.01E-10 | 0.70 | 0.65–0.76 | 5.46E-21 |
Alleles in LD with HLA-DRB1 (DR3) are coloured in red, and alleles in LD with HLA-DRB1 (DR2) are coloured in purple. AIC = 13319, BIC = 13392.
Association results for amino acid data from stepwise regression in the EUR data
| Conditional results (multiple regression) | Single-marker results | HLA alleles specific to amino acid | |||||
|---|---|---|---|---|---|---|---|
| ALLELE | OR | 95% C.I. | P | OR | 95% C.I. | P | |
| DRB1 77 N | 2.02 | 1.78–2.30 | 2.86E-27 | 2.30 | 2.13–2.49 | 1.24E-93 | DRB1*03:01, :02 |
| DQA 207 M | 1.50 | 1.39–1.62 | 9.40E-25 | 1.27 | 1.18–1.36 | 6.36E-11 | DQA*01:02 |
| B 9D | 1.66 | 1.47–1.87 | 1.90E-16 | 2.42 | 2.22–2.63 | 1.47E-93 | B*08:01, *39:12 |
| DRB1 233 T | 1.24 | 1.16–1.33 | 3.25E-09 | 0.85 | 0.81–0.90 | 7.06E-08 | DRB1*15:01, :02, *01:01, :02, :03, *04:01, :02, :03, :04, :05, :07, :08, :10, *07:01, *08:01, :03, :04, :10, 09:01, *10:01, *16:01 |
| B 30G | 1.34 | 1.18–1.52 | 4.39E-06 | 1.28 | 1.14–1.44 | 3.45E-05 | B*18:01, :03 |
These are the HLA alleles that are specific to the amino acid. See
Association results for SNPs from stepwise regression in the EUR data. AIC = 13241, BIC = 1333
| Conditional results (multiple regression) | Single-marker results | ||||||
|---|---|---|---|---|---|---|---|
| SNP | Effect (Other) Allele | OR | 95% C.I. | P | OR | 95% C.I. | P |
| RS141910407 | T (C) | 1.52 | 1.29–1.81 | 1.18E-06 | 2.71 | 2.47–2.97 | 3.93E-99 |
| RS9260 | A (G) | 1.51 | 1.40–1.63 | 6.41E-27 | 1.33 | 1.24–1.43 | 2.17E-15 |
| RS9273336 | T (C) | 1.84 | 1.62–2.08 | 1.43E-21 | 2.18 | 2.01–2.35 | 1.24E-85 |
| X6:31428746:I | I (D) | 1.35 | 1.21–1.51 | 1.29E-07 | 2.12 | 1.97–2.28 | 6.59E-87 |
| RS9270807 | G (A) | 1.25 | 1.17–1.34 | 6.00E-10 | 0.87 | 0.82–0.92 | 1.71E-06 |
| RS2293861 | C (T) | 1.27 | 1.16–1.38 | 1.03E-07 | 1.48 | 1.36–1.61 | 5.78E-20 |
| RS142903940 | G (A) | 1.19 | 1.11–1.29 | 3.79E-06 | 1.03 | 0.96–1.11 | 3.56E-01 |
| RS501480 | C (T) | 1.15 | 1.08–1.22 | 7.83E-06 | 1.22 | 1.15–1.29 | 2.26E-11 |