| Literature DB >> 30507971 |
Rebecca Darlay1, Kristin L Ayers1, George F Mells2, Lynsey S Hall1,3, Jimmy Z Liu4, Mohamed A Almarri4,5, Graeme J Alexander6, David E Jones7, Richard N Sandford2, Carl A Anderson4, Heather J Cordell1.
Abstract
Primary Biliary Cholangitis (PBC) is a chronic autoimmune liver disease characterised by progressive destruction of intrahepatic bile ducts. The strongest genetic association is with HLA-DQA1*04:01, but at least three additional independent HLA haplotypes contribute to susceptibility. We used dense single nucleotide polymorphism (SNP) data in 2861 PBC cases and 8514 controls to impute classical HLA alleles and amino acid polymorphisms using state-of-the-art methodologies. We then demonstrated through stepwise regression that association in the HLA region can be largely explained by variation at five separate amino acid positions. Three-dimensional modelling of protein structures and calculation of electrostatic potentials for the implicated HLA alleles/amino acid substitutions demonstrated a correlation between the electrostatic potential of pocket P6 in HLA-DP molecules and the HLA-DPB1 alleles/amino acid substitutions conferring PBC susceptibility/protection, highlighting potential new avenues for future functional investigation.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30507971 PMCID: PMC6292650 DOI: 10.1371/journal.pgen.1007833
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Comparison of highly significant marginal association results (P<0.000001) from 4 packages: HLA*IMP:02, HLA*IMP:03, HIBAG (calculated dosages and best guess) and SNP2HLA.
HLA*IMP:01 results from Liu et al. (2012) are shown for comparison. Haplogroups were considered separate (i.e. independent) if predictors remained significant (P< 0.0001) when the top allele from any previously-identified haplogroup was included (pairwise) in the regression model.
| Haplogroup | Gene | Allele | HLA*IMP:01 (dosages, results from Liu et al. 2012) | HLA*IMP:02 (dosages) | HLA*IMP:03 | HIBAG 1.2 (best guess, provided probability > 0.8) | HIBAG 1.2 (dosages) | SNP2HLA | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| OR | P | OR | P | OR | P | OR | P | OR | P | OR | P | |||
| 1 | HLA-DQA1 | 04:01 | 3.07 | 5.90E-45 | 3.14 | 5.94E-49 | 3.05 | 1.16E-44 | 3.04 | 2.63E-42 | 3.14 | 3.48E-45 | 3.08 | 2.64E-45 |
| HLA-DQB1 | 04:02 | 3.04 | 1.91E-42 | 3.10 | 1.26E-45 | 3.04 | 2.35E-45 | 3.07 | 1.88E-44 | 3.08 | 1.37E-47 | 3.05 | 2.00E-45 | |
| HLA-DRB1 | 08:01 | 3.18 | 1.14E-40 | 3.12 | 1.35E-45 | 3.16 | 2.18E-45 | 3.17 | 2.47E-37 | 3.26 | 1.25E-45 | 3.18 | 4.57E-45 | |
| HLA-B | 39:05 | 5.48 | 4.81E-12 | - | - | - | - | - | - | - | - | - | - | |
| HLA-B | 39:06 | - | - | 2.22 | 2.43E-09 | 2.28 | 7.72E-10 | 2.72 | 3.57E-10 | 2.78 | 1.60E-11 | - | - | |
| 2 | HLA-DQB1 | 06:02 | 0.64 | 2.32E-15 | 0.66 | 2.96E-16 | 0.66 | 8.96E-17 | 0.67 | 2.58E-15 | 0.66 | 2.14E-16 | 0.66 | 2.28E-16 |
| HLA-DRB1 | 15:01 | 0.65 | 2.78E-15 | 0.67 | 4.01E-16 | 0.67 | 3.89E-16 | 0.66 | 9.55E-15 | 0.67 | 4.57E-16 | 0.67 | 4.21E-16 | |
| HLA-DQA1 | 01:02 | 0.69 | 4.19E-15 | 0.71 | 9.52E-16 | 0.72 | 6.06E-15 | 0.71 | 6.60E-15 | 0.71 | 3.26E-15 | 0.72 | 9.60E-15 | |
| HLA-B | 07:02 | 0.73 | 4.93E-10 | 0.73 | 4.14E-11 | 0.73 | 4.83E-11 | 0.72 | 1.08E-11 | 0.73 | 3.96E-11 | 0.73 | 3.62E-11 | |
| 3 | HLA-DQB1 | 03:01 | 0.70 | 6.48E-14 | 0.72 | 7.79E-15 | 0.72 | 4.92E-15 | 0.71 | 2.63E-15 | 0.70 | 1.93E-15 | 0.71 | 3.78E-15 |
| HLA-DRB1 | 11:01 | 0.33 | 2.14E-13 | 0.35 | 6.78E-19 | 0.46 | 1.60E-15 | 0.43 | 0.001 | 0.31 | 1.15E-18 | 0.41 | 8.57E-17 | |
| HLA-DRB1 | 11:04 | 0.24 | 3.72E-09 | 0.14 | 1.38E-10 | 0.44 | 9.45E-06 | 0.44 | 0.011 | 0.14 | 5.00E-13 | 0.32 | 1.69E-07 | |
| HLA-DRB1 | 11:03 | - | - | 0.31 | 0.076 | 0.65 | 0.202 | - | - | 0.002 | 2.15E-10 | - | - | |
| HLA-DQA1 | 05:01 | 0.75 | 4.76E-12 | 0.90 | 0.013 | 0.75 | 1.01E-13 | 0.98 | 0.654 | 0.98 | 0.5895 | 0.75 | 1.38E-13 | |
| HLA-DQA1 | 05:05 | - | - | 0.37 | 4.22E-26 | - | - | 0.49 | 1.31E-25 | 0.50 | 1.73E-25 | - | - | |
| 4 | HLA-DRB1 | 04:04 | 1.57 | 1.22E-09 | 1.61 | 8.59E-11 | 1.45 | 4.29E-09 | 1.45 | 4.09E-05 | 1.64 | 4.17E-10 | 1.54 | 1.25E-10 |
| HLA-DRB1 | 04:03 | - | - | 3.89 | 1.25E-07 | 1.32 | 0.183 | 1.23 | 0.626 | 2.92 | 1.21E-05 | - | - | |
| HLA-DQB1 | 03:02 | 1.34 | 6.96E-09 | 1.34 | 7.83E-10 | 1.33 | 8.62E-10 | 1.33 | 2.11E-09 | 1.34 | 1.04E-09 | 1.28 | 1.45E-07 | |
| HLA-DQA1 | 03:01 | - | - | 1.22 | 2.29E-06 | 1.14 | 4.14E-04 | 1.36 | 1.41E-09 | 1.37 | 1.03E-09 | 1.15 | 2.20E-04 | |
| 5 | HLA-DPB1 | 03:01 | - | - | 1.69 | 1.20E-31 | 1.53 | 3.68E-24 | 1.49 | 2.04E-07 | 1.80 | 1.60E-26 | 1.69 | 1.08E-27 |
| HLA-DPB1 | 06:01 | - | - | - | - | 1.71 | 1.51E-10 | - | - | 9.14 | 3.26E-26 | 1.97 | 6.30E-09 | |
| 6 | HLA-DPB1 | 04:01 | - | - | 0.74 | 2.04E-18 | 0.78 | 3.29E-16 | 0.81 | 4.62E-09 | 0.75 | 1.29E-18 | 0.76 | 2.23E-17 |
| 7 | HLA-C | 04:01 | - | - | 1.37 | 2.76E-10 | 1.37 | 3.15E-10 | 1.37 | 2.93E-10 | 1.37 | 4.01E-10 | 1.37 | 3.98E-10 |
| 8 | HLA-DPB1 | 10:01 | - | - | 2.71 | 0.002 | 1.91 | 2.68E-11 | 2.19 | 4.11E-13 | 2.04 | 1.90E-12 | 1.93 | 3.09E-11 |
| 9 | HLA-DPB1 | 17:01 | - | - | 2.43 | 0.005 | 2.40 | 1.03E-14 | 2.84 | 1.30E-17 | 2.47 | 5.55E-15 | 2.48 | 1.66E-15 |
| 10 | HLA-DPA | 02:01 | - | - | - | - | 1.27 | 2.03E-09 | - | - | - | - | 1.27 | 2.08E-09 |
a Note that HLA-B 39:05 and 39:06 are perhaps being used interchangeably: HLA*IMP:01 identified 39:05 whereas HLA*IMP:02, HLA*IMP:03 and HIBAG identify 39:06. Neither 39:05 or 39:06 are present in the SNP2HLA analysis.
b Note that there are only two amino acids that differ between HLA-DQA1 05:01 and 05:05, one of which corresponds to our 5th most significant result (position -13 of DQA1, see Table 2). This could perhaps explain the discrepancies between the results from the different programs for these alleles. We note that when HIBAG and HLA*IMP:02 identified either 05:01 or 05:05, the other programs (HLA*IMP:01, HLA*IMP:03 and SNP2HLA) identified exclusively 05:01
c Note that HLA-DPA is only examined in HLA*IMP:03 and SNP2HLA
Amino acid residues significantly associated (P<4.87E-05) with PBC in forward stepwise regression analysis.
Shown are odds ratios (ORs) and P-values from stepwise and marginal logistic regression calculated using posterior probabilities from HIBAG, together with marginal associations from SNP2HLA. Classical alleles in bold are significantly associated with PBC in the current study, see Table 1.
| Gene | BP position | Peptide position | Residue | Stepwise associations (HIBAG 1.2 dosage) | Marginal associations (HIBAG 1.2 dosage) | Marginal associations (SNP2HLA) | Classical HLA alleles within same gene carrying first listed residue observed in our study (imputed frequency in cases; imputed frequency in controls) | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Order of entry | P to enter | OR | P | OR | P | |||||
| HLA-DPB1 | 33156444 | 11 | L/G | 1 | 6.64E-59 | 1.765 | 6.64E-59 | 1.752 | 1.09E-58 | |
| HLA-DRB1 | 32659927 | 74 | L | 2 | 1.73E-39 | 3.065 | 1.14E-46 | 2.828 | 5.65E-43 | |
| HLA-DQB1 | 32740667 | 57 | D | 3 | 6.42E-22 | 0.789 | 2.87E-14 | 0.787 | 9.69E-15 | |
| HLA-C | 31346910 | 156 | R | 4 | 2.70E-12 | 1.255 | 7.16E-12 | 1.250 | 1.20E-11 | 01:02 (0.042; 0.034) |
| HLA-DQA1 | 32713244 | -13 | A | 5 | 2.48E-12 | 1.997 | 6.02E-29 | - | - | 01:01 (0.14; 0.12) |
| HLA-B | 31432581 | 45 | T | 6 | 3.13E-06 | 1.280 | 2.35E-10 | 1.278 | 1.65E-10 | 18:01 (0.043; 0.037), 18:02 (0.000041; 0.000035), 18:03 (0.00045; 0.00078), 18:11 (0.00023; 0.00017), 18:18 (0.000029; 0.000028), 35:01 (0.062; 0.050), 35:02 (0.0028; 0.0037), 35:03 (0.017; 0.011), 35:08 (0.0030; 0.0026), 35:17 (0.00029; 0.00034), 35:41 (0.00032; 0.00024), 35:55 (0.000020; 0.000016), 37:01 (0.017; 0.015), 44:06 (0.00026; 0.00018), 51:01 (0.046; 0.036), 51:05 (0.00022; 0.00024), 51:07 (0.00053; 0.00037), 51:08 (0.0010; 0.00068), 51:09 (0.00010; 0.00011), 52:01 (0.0054; 0.0044), 53:01 (0.0027; 0.0019), 58:01 (0.0047; 0.0051) |
| HLA-DQA1 | 32718440 | 207 | V | 7 | 4.38E-06 | 1.400 | 6.35E-16 | 1.386 | 1.23E-14 | 01:01 (0.14; 0.12), 01:03 (0.047; 0.053), 01:04 (0.029; 0.023), 01:05 (0.0046; 0.0053), 02:01 (0.16; 0.15), |
| HLA-DPB1 | 33156663 or 33161618 | 84 or 215 | V or T | 8a or 8b | 2.18E-05 | 1.713 | 0.000913 | NA (OR = 0.550 for residue I) | NA (P = 4.74E-34 for residue I) | 15:01 (0.011; 0.0068) |
| HLA-B | 31432689 | 9 | H | 9 | 2.15E-05 | 1.181 | 1.07E-05 | 1.19 | 2.80E-06 | 18:01 (0.043; 0.037), 18:02 (0.000041; 0.000035), 18:03 (0.00045; 0.00078), 18:11 (0.00023; 0.00017), 18:18 (0.000029; 0.000028), 27:02 (0.0028; 0.0033), 27:03 (0.0000090; 0.000034), 27:05 (0.041; 0.040), 27:07 (0.00019; 0.00016), 27:09 (0.000045; 0.000097), 37:01 (0.017; 0.014), 40:01 (0.072; 0.056), 40:02 (0.0074; 0.010), 40:06 (0.00033; 0.00033), 40:27 (0.000030; 0.000045), 40:32 (0.00045; 0.00028), 41:01 (0.0024; 0.0021), 41:02 (0.0041; 0.0041), 42:02 (0.000077; 0.000046), 45:01 (0.0078; 0.0073), 49:01 (0.010; 0.012), 50:01 (0.014; 0.0089), 50:02 (0.00019; 0.00013), 73:01 (0.00019; 0.00013) |
Fig 1Stepwise logistic regression of amino acid residue dosages with up to five amino acid residue dosages included as covariates.
The x axis denotes amino acid position. A) no conditioning, B) conditioned on HLA-DPB11L, C) conditioned on HLA-DPB11L and HLA-DRB74L, D) conditioned on HLA-DPB11L, HLA-DRB74L and HLA-DQB57D, E) conditioned on HLA-DPB11L, HLA-DRB74L, HLA-DQB57D and HLA-C155R and F) conditioned on HLA-DPB11L, HLA-DRB74L, HLA-DQB57D, HLA-C155R and HLA-DQA-13A.
Fig 2Stepwise logistic regression of individual SNPs with up to five amino acid residue dosages included as covariates.
The x axis denotes base pair position on chromosome 6. A) All SNPs, no conditioning, B) conditioned on HLA-DPB11L, C) conditioned on HLA-DPB11L and HLA-DRB74L, D) conditioned on HLA-DPB11L, HLA-DRB74L and HLA-DQB57D, E) conditioned on HLA-DPB11L, HLA-DRB74L, HLA-DQB57D and HLA-C155R and F) conditioned on HLA-DPB11L, HLA-DRB74L, HLA-DQB57D, HLA-C155R and HLA-DQA-13A.
Fig 3Stepwise logistic regression of classical HLA alleles with up to five amino acid residue dosages included as covariates.
The ordering of the alleles within a gene along the x axis is alphabetical by allele name. A) no conditioning, B) conditioned on HLA-DPB11L, C) conditioned on HLA-DPB11L and HLA-DRB74L, D) conditioned on HLA-DPB11L, HLA-DRB74L and HLA-DQB57D, E) conditioned on HLA-DPB11L, HLA-DRB74L, HLA-DQB57D and HLA-C155R and F) conditioned on HLA-DPB11L, HLA-DRB74L, HLA-DQB57D, HLA-C155R and HLA-DQA-13A.
Fig 4Empirical distribution of deviance in 10,000 permutations of amino acid sequences across classical HLA alleles.
The vertical red lines indicate the deviances (261.89, 434.78, 527.37, 576.27, 625.34) explained by the top 1–5 amino acids in the actual data, which generate empirical P-values (0.0330, 0.0195, 0.0515, 0.0603, 0.0333) respectively.
Fig 5Structure and molecular surface electrostatic potential of pocket P6 in HLA-DP molecules.
A) The structure and electrostatic potential of HLA-DPB1*03:01. The area within the frame is depicted in expanded form in B and C, and indicates the position of pocket P6 (arrows). All structures were superimposed on HLA-DPB1*03:01 and therefore show the same view. HLA-DPB alleles associated with an increased risk of PBC (03:01, 06:01, 10:01, 17:01) are shown in panel B whereas those associated with a protective effect (04:01) are shown in panel C. Negatively charged potentials (less than 5 kT/e) are coloured red, positively charged (greater than 5 kT/e) blue, and neutral potentials (0 kT/e) are coloured white. Linear interpolation was used to produce the colour for surface potentials between these values.