| Literature DB >> 21413052 |
Johannes R Hov1, Vasilis Kosmoliaptsis, James A Traherne, Marita Olsson, Kirsten M Boberg, Annika Bergquist, Erik Schrumpf, J Andrew Bradley, Craig J Taylor, Benedicte A Lie, John Trowsdale, Tom H Karlsen.
Abstract
UNLABELLED: The strongest genetic risk factors for primary sclerosing cholangitis (PSC) are found in the human leukocyte antigen (HLA) complex at chromosome 6p21. Genes in the HLA class II region encode molecules that present antigen to T lymphocytes. Polymorphisms in these genes are associated with most autoimmune diseases, most likely because they contribute to the specificity of immune responses. The aim of this study was to analyze the structure and electrostatic properties of the peptide-binding groove of HLA-DR in relation to PSC. Thus, four-digit resolution HLA-DRB1 genotyping was performed in 356 PSC patients and 366 healthy controls. Sequence information was used to assign which amino acids were encoded at all polymorphic positions. In stepwise logistic regressions, variations at residues 37 and 86 were independently associated with PSC (P = 1.2 × 10(-32) and P = 1.8 × 10(-22) in single-residue models, respectively). Three-dimensional modeling was performed to explore the effect of these key residues on the HLA-DR molecule. This analysis indicated that residue 37 was a major determinant of the electrostatic properties of pocket P9 of the peptide-binding groove. Asparagine at residue 37, which was associated with PSC, induced a positive charge in pocket P9. Tyrosine, which protected against PSC, induced a negative charge in this pocket. Consistent with the statistical observations, variation at residue 86 also indirectly influenced the electrostatic properties of this pocket. DRB1*13:01, which was PSC-associated, had a positive P9 pocket and DRB1*13:02, protective against PSC, had a negative P9 pocket.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21413052 PMCID: PMC3128712 DOI: 10.1002/hep.24299
Source DB: PubMed Journal: Hepatology ISSN: 0270-9139 Impact factor: 17.425
Characteristics of Included Individuals
| PSC | Healthy Controls | |
|---|---|---|
| N | 356 | 366 |
| Male, n (%) | 254 (71) | 256 (70) |
| Age at diagnostic cholangiography, years, median (range) | 36 (12-75) | — |
| Concomitant inflammatory bowel disease, n (%) | 290 | — |
| Ulcerative colitis/Crohn's disease/indeterminate, % | 81/12/7 | — |
| Cholangiocarcinoma, n (%) | 50 (14) | — |
| Endpoint (tx or death), n (%) | 210 (59) | — |
| Follow-up, years, median (range) | 10 (0-34) | — |
Norway n = 230, Sweden n = 126.
Missing information about the intestine in three patients.
Association Analyses Between Amino Acid Variation in the HLA-DRβ1 Chain and PSC*
| Single-Residue LR | Two-Residue LR | Three-Residue LR | ||||||
|---|---|---|---|---|---|---|---|---|
| Basic Model: - | Basic Model: Residue 37 | Basic Model: Residue 86 | Basic Model: Residues 37+86 | |||||
| Residue | Observed Amino Acids | Rank | Rank | Rank | ||||
| 9 | Glu, Lys,Trp | 0.051 | 27 | 0.59 | 22 | 0.0086 | 20 | 0.29 |
| 10 | Glu,Gln,Tyr | 2.4 × 10−11 | 14 | 0.54 | 21 | 0.0017 | 17 | 0.48 |
| 11 | Asp,Gly,Leu,Pro,Ser,Val | 1.2 × 10−18 | 8 | 0.23 | 15 | 1.7 × 10−8 | 3 | 0.43 |
| 12 | Lys,Thr | 2.7 × 10−12 | 12 | 0.64 | 24 | 0.00046 | 14 | 0.74 |
| 13 | Phe,Gly,His,Arg,Ser,Tyr | 9.6 × 10−25 | 2 | 0.22 | 14 | 2.1 × 10−11 | 2 | 0.25 |
| 14 | Glu,Lys | 0.0014 | 20 | 0.36 | 17 | 0.74 | 28 | 0.83 |
| 16 | His,Tyr | 3.2 × 10−5 | 17 | 0.88 | 29 | 0.00053 | 15 | 0.78 |
| 25 | Gln,Arg | 0.0014 | 21 | 0.36 | 18 | 0.74 | 29 | 0.83 |
| 26 | Phe,Leu,Tyr | 2.6 × 10−19 | 6 | 0.024 | 2 | 3.9 × 10−8 | 4 | 0.23 |
| 28 | Asp,Glu,His | 2.8 × 10−5 | 16 | 0.13 | 8 | 0.0053 | 19 | 0.10 |
| 30 | Cys,Gly,His Leu,Arg,Tyr | 0.00012 | 18 | 0.12 | 7 | 2.0 × 10−5 | 11 | 0.20 |
| 31 | Phe,Ile,Val | 0.31 | 29 | 0.078 | 5 | 0.0046 | 18 | 0.12 |
| 32 | His,Tyr | 1.2 × 10−21 | 4 | 0.38 | 19 | 6.6 × 10−8 | 6 | 0.64 |
| 33 | His,Asn | 1.4 × 10−15 | 11 | 0.40 | 20 | 5.0 × 10−8 | 5 | 0.31 |
| 37 | Phe,Leu,Asn,Ser,Tyr | — | — | |||||
| 38 | Ala,Leu,Val | 0.029 | 26 | 0.17 | 11 | 0.0014 | 16 | 0.14 |
| 40 | Phe,Tyr | 0.77 | 30 | 0.17 | 12 | 0.52 | 27 | 0.14 |
| 47 | Phe,Tyr | 2.4 × 10−21 | 5 | 0.11 | 6 | 0.00024 | 13 | 0.29 |
| 57 | Ala,Asp,Ser,Val | 2.6 × 10−5 | 15 | 0.78 | 26 | 0.16 | 22 | 0.44 |
| 58 | Ala,Glu | 0.00055 | 19 | 0.85 | 28 | 0.035 | 21 | 0.84 |
| 60 | His,Ser,Tyr | 0.0046 | 24 | 0.59 | 23 | 0.28 | 24 | 0.29 |
| 67 | Phe,Ile,Leu | 0.0018 | 22 | 0.80 | 27 | 0.45 | 25 | 0.22 |
| 70 | Asp,Gln,Arg | 0.063 | 28 | 0.16 | 10 | 0.49 | 26 | 0.32 |
| 71 | Ala,Glu,Lys,Arg | 1.4 × 10−15 | 10 | 0.043 | 3 | 5.4 × 10−6 | 8 | 0.32 |
| 73 | Ala,Gly | 5.5 × 10−12 | 13 | 0.14 | 9 | 1.7 × 10−5 | 10 | 0.34 |
| 74 | Ala,Glu,Leu,Gln,Arg | 1.7 × 10−18 | 9 | 0.25 | 16 | 9.4 × 10−6 | 9 | 0.59 |
| 77 | Asn,Thr | 5.8 × 10−19 | 7 | 0.060 | 4 | 4.8 × 10−7 | 7 | 0.34 |
| 78 | Val,Tyr | 0.023 | 25 | 0.77 | 25 | 0.25 | 23 | 0.20 |
| 85 | Ala,Val | 0.0037 | 23 | 0.21 | 13 | 7.2 × 10−5 | 12 | 0.13 |
| 86 | Gly,Val | 1.8 × 10−22 | 3 | — | — | |||
Stepwise logistic regressions were performed assuming an “allele dosage” effect.
P-values of likelihood ratio tests of whether residue n improves the logistic regression model when added to a model with one other residue (37 or 86).
P-values of likelihood ratio tests of whether residue n improves the model when added to a model with both residues 37 and residue 86.
LR: likelihood ratio; Rank: residue rank according to P-value, lowest P-value is highlighted in bold.
Summary of Association Analyses Between Amino Acid Variation and PSC Assuming a “Genotype” Model*
| Single-Residue LR | Two-Residue LR | Three-Residue LR | ||||
|---|---|---|---|---|---|---|
| Basic Model: - | Basic Model: Residue 26 | Basic Model: Residue 37 | Basic Model: Residues 77 | Basic Model: Residues 86 | Basic Model: Residues 37 + 86 | |
| Residue | ||||||
| 26 | 6.2×10−20 | — | 4.2×10−7 | 0.19 | 1.3×10−8 | 4.1×10−5 |
| 37 | 6.9×10−32 | 3.3×10−19 | — | 1.5×10−19 | 4.8×10−17 | — |
| 70 | 0.013 | 0.23 | 0.023 | 0.083 | 0.055 | 0.036 |
| 71 | 1.6×10−17 | 3.8×10−14 | 0.00034 | 3.4×10−13 | 7.1×10−8 | 0.014 |
| 73 | 1.1×10−12 | 0.15 | 0.00075 | 0.17 | 8.1×10−6 | 0.0038 |
| 74 | 1.6×10−18 | 0.14 | 6.0×10−5 | 0.096 | 1.3×10−6 | 0.00093 |
| 77 | 7.3×10−21 | 0.80 | 8.5×10−8 | — | 7.8×10−9 | 3.4×10−6 |
| 86 | 2.0×10−21 | 1.3×10−9 | 1.2×10−5 | 2.2×10−9 | — | — |
Stepwise logistic regressions were performed assuming a “genotype” effect, entering all different pairs (combinations) of amino acids at a given residue. Only residues contributing significantly to the model when added to residue 37 and 86 in the three-residue regressions are shown.
P-values of likelihood ratio tests of whether residue n improves the logistic regression model when added to a model with one other residue (26, 37, 77, or 86).
P-values of likelihood ratio tests of whether residue n improves the model when added to a model with both residues 37 and residue 86.
LR: likelihood ratio.
Frequencies of Different Amino Acids at HLA-DRβ1 Residues 37 and 86
| Allele Frequency, n (%) | Carrier Frequency, n (%) | ||||
|---|---|---|---|---|---|
| Residue | Amino Acid | PSC | Healthy Controls | PSC | Healthy Controls |
| 37 | Asparagine (Asn) | 398 (56) | 199 (27) | 292 (82) | 163 (45) |
| Leucine (Leu) | 4 (1) | 16 (2) | 4 (1) | 15 (4) | |
| Phenylalanine (Phe) | 35 (5) | 67 (9) | 34 (10) | 62 (17) | |
| Serine (Ser) | 192 (27) | 208 (28) | 162 (46) | 175 (48) | |
| Tyrosine (Tyr) | 83 (12) | 242 (33) | 80 (22) | 197 (54) | |
| 86 | Glycine (Gly) | 185 (26) | 368 (50) | 164 (46) | 284 (78) |
| Valine (Val) | 527 (74) | 364 (50) | 335 (94) | 282 (77) | |
Fig. 1Structure and molecular surface electrostatic potential of pocket P9. (A) The structure and electrostatic potential of HLA-DRB1*03:01. The area within the frame is depicted in expanded form in (B,C). All structures were superimposed on HLA-DRB1*03:01 and therefore show the same view. HLA-DR carrying the risk residue Asn37 in the β chain had P9 pockets (arrows) with positive charge (B), whereas molecules expressing Tyr37 had P9 pockets (arrows) with consistently negative charge (C). Potentials less than −5 kT/e are colored red, those greater than 5 kT/e blue, and neutral potentials (0 kT/e) are colored white. Linear interpolation was used to produce the color for surface potentials between these values.
Fig. 2Structure and molecular surface electrostatic potential of pocket P1. (A) The structure and electrostatic potential of HLA-DRB1*03:01. The area within the frame is depicted in expanded form in (B). All structures were superimposed on HLA-DRB1*03:01 and therefore show the same view. Structural modeling and calculation of the electrostatic potential at the P1 pocket (arrows) of representative HLA-DR molecules showed that the Gly/Val dimorphism at position 86 had a minimal physiochemical effect (B). The majority of HLA-DR molecules examined had P1 pockets with an overall neutral charge. HLA-DRB1*03:01 and -DRB1*14:01 express Val86 whereas -DRB1*01:01 and -DRB1*04:01 express Gly86. Potentials less than −5 kT/e are colored red, those greater than 5 kT/e blue, and neutral potentials (0 kT/e) are colored white. Linear interpolation was used to produce the color for surface potentials between these values.
Fig. 3Electrostatic modification of pocket P9 by the Gly86-to-Val86 substitution. The structure and electrostatic potential of the peptide-binding groove is shown for (structures were superimposed) HLA-DRB1*13:01 (left figures) and -DRB1*13:02 (right figures); these molecules have a single amino acid sequence difference at position 86 (Val86 and Gly86, respectively). HLA-DRB1*13:02 has an electronegative pocket P9 despite the presence of Asn at position 37, suggesting a long-range effect of the Val86-to-Gly86 substitution. The molecular surface is colored according to the calculated electrostatic potential, as for Figs. 1 and 2.
Frequency of HLA-DRB1Alleles Encoding Asn37
| PSC (2n=712) | Healthy Controls (2n=732) | ||||||
|---|---|---|---|---|---|---|---|
| Allele | n | (%) | n | (%) | OR (95%CI) | Residue 86 | |
| 03:01 | 254 | (36) | 106 | (14) | 3.3 (2.5-4.2) | 1.3 × 10−20 | Val |
| 09:01 | 12 | (2) | 6 | (1) | 2.0 (0.8-5.0) | 0.14 | Gly |
| 13:01 | 117 | (17) | 47 | (6) | 2.8 (2.0-4.1) | 2.0 × 10−9 | Val |
| 13:02 | 15 | (2) | 39 | (5) | 0.4 (0.2-0.7) | 0.0013 | Gly |
| 14:02 | 0 | (0) | 1 | (0) | 0.3 (0.0-3.8) | 1.0 | Gly |
Calculated with Woolf's formula with Haldane's correction.
Not corrected. Calculated with chi-square tests or Fisher's exact test where appropriate.
Pcorrected = 0.040.
OR = odds ratio. CI = confidence interval.