Literature DB >> 22573116

Classical HLA-DRB1 and DPB1 alleles account for HLA associations with primary biliary cirrhosis.

P Invernizzi1, M Ransom, S Raychaudhuri, R Kosoy, A Lleo, R Shigeta, A Franke, F Bossa, C I Amos, P K Gregersen, K A Siminovitch, D Cusi, P I W de Bakker, M Podda, M E Gershwin, M F Seldin.   

Abstract

Susceptibility to primary biliary cirrhosis (PBC) is strongly associated with human leukocyte antigen (HLA)-region polymorphisms. To determine if associations can be explained by classical HLA determinants, we studied Italian, 676 cases and 1440 controls, genotyped with dense single-nucleotide polymorphisms (SNPs) for which classical HLA alleles and amino acids were imputed. Although previous genome-wide association studies and our results show stronger SNP associations near DQB1, we demonstrate that the HLA signals can be attributed to classical DRB1 and DPB1 genes. Strong support for the predominant role of DRB1 is provided by our conditional analyses. We also demonstrate an independent association of DPB1. Specific HLA-DRB1 genes (*08, *11 and *14) account for most of the DRB1 association signal. Consistent with previous studies, DRB1*08 (P=1.59 × 10(-11)) was the strongest predisposing allele, whereas DRB1*11 (P=1.42 × 10(-10)) was protective. Additionally, DRB1*14 and the DPB1 association (DPB1*03:01; P=9.18 × 10(-7)) were predisposing risk alleles. No signal was observed in the HLA class 1 or class 3 regions. These findings better define the association of PBC with HLA and specifically support the role of classical HLA-DRB1 and DPB1 genes and alleles in susceptibility to PBC.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22573116      PMCID: PMC3423484          DOI: 10.1038/gene.2012.17

Source DB:  PubMed          Journal:  Genes Immun        ISSN: 1466-4879            Impact factor:   2.676


Introduction

The human major histocompatibility complex, HLA, has been implicated in the etiopathogenesis of primary biliary cirrhosis (PBC), similar to many other autoimmune diseases. Genome-wide association studies (GWAS) of PBC including our own find the strongest association with single nucleotide polymorphisms (SNPs) within the HLA region[1-3]. In these studies the peak association signal is between HLA-DQA1 and HLA-DQB1. Multiple studies of PBC also show association with particular classical HLA alleles in PBC (reviewed in Invernizzi[4]). These studies have variably implicated different DRB1 alleles in European populations with most studies including all larger cohorts showing association of DRB1*08[5, 6]. Our previous studies in an Italian cohort with PBC showed the association of DRB1*08 as predisposing, and DRB1*11 and DRB1*13 as protective alleles[6]. A study using a small cohort (32 German PBC cases and 47 controls) suggest that DPB1 associations may also be present in Europeans[7]. However, a comprehensive study of HLA region associations has not been performed and like other autoimmune diseases it is unclear which determinants are actually causally related to pathogenesis. To further study HLA associations in PBC, in the current study we used the most recent advances in imputation algorithms and sequence information resources including the 1000 genome database to accurately impute missing SNPs, and importantly HLA classical alleles. Specifically, our investigation rests on recent development and resources for imputing HLA classical alleles including a reference set of European subjects[8]. For our study we used an inference set of SNP genotypes from both GWAS and a designed chip array, the Immunochip[9] that contains a set of SNPs that have been used in multiple studies of HLA[10]. We perform a series of conditioning analyses that clarify which HLA genes and alleles underlie the major component of the genetic associations of PBC.

Results

Analyses Show Strong Association of Imputed SNPs and HLA Determinants

To further define PBC - HLA region associations we analyzed association using imputed genotypes with high probabilities and information scores (see Materials and Methods). These studies utilized genotypes from both GWAS and Immunochip arrays that contained large numbers of SNPs in the MHC region (Table 1 Supporting Table 1 and see Materials and Methods). Strong association was observed with the peak association (p = 9.83 × 10−17) with rs115721871 at position 32653792 distal to DQB1 (Fig. 1A, Table 2 and Supporting Table 2). Although the strongest associations were with non-coding SNPs, multiple classical genes in HLA show strong association with PBC (Table 2). For the classical HLA genes the strongest association was with DRB*08 (p = 1.59 × 10−11). The DQB1*04:02 and DQA1*04:01 in tight LD with DRB*08 (r2 = 0.84 and 0.89, respectively) showed nearly equivalent signals (1.38 × 10−10, 1.90 × 10−10, respectively). Very strong association was also observed for DRB1*11 (p = 1.42 × 10−10) with a weaker association with the DQB1 allele (DQB1*03:01, p = 6.10×10−9) that is in LD (r2 = 0.75) with DRB1*11. Less strong associations were observed with DRB1*14, DQB1*05:03 (6.89 × 10−7 and 6.21 × 10−7, respectively), and DPB1*03:01 (p =9.18 × 10−7). DQB1*05:03 is in nearly complete LD with DRB1*14 (r2 = 0.97). DPB1*03:01 is not in LD with any of the DRB1, DQB1 or DQA1 classical alleles or AAs (r2 < 0.01). There was no association (p > 10−4) observed for classical alleles in HLA A, B, C or DPA1.
Table 1

Summary of Subject Genotyping Information

Genotypinga

GWASbICcGWAS-onlyIC-onlyBothdTotal

Italian PBC Cases45362254223399676
Italian Controls10425978433981991440
MHC region SNP genotypes

GWASICGWAS-onlyIC-onlyBothTotal

Number of SNPse1548488560439419445489
SNPs overlaping 1000 Genome referencef1435438659535468404981
SNPs overlaping HLA referencef64814441669624821610

Additional details on subject set genotyping is in Supplemental Table 1.

GWAS: Genotyped using Illumina platforms with >550K genome-wide association study (GWAS) SNPs. The samples included in previous GWAS study.

IC: Subjects genotyped using Immunochip (IC) array.

Subjects genotyped using both GWAS platform and Immunochip array

Number of SNPs within the region [chromosome 6 bps 28911802 – 33813043 (HG19 map)] used for analyses.

Number of SNPs genotyped in region that overlap the reference sets used for imputation.

Figure 1

Analysis of the HLA region association signals in PBC

In each panel, the symbols show the strength of the association signal (ordinate) for the corresponding position (Mb, HG19) on chromosome 6 (abscissa). For panel A, the p value before conditioning is shown. For panels B-F the p values are shown after conditioning on the HLA determinant(s) indicated in the panel. The blue color coded symbols denotes the strongest associated marker with p value <10−6, and the other markers are color coded to indicate marker LD with the strongest associated marker: Markers with strong LD (r2 > 0.8) (red); moderate LD (r2 > 0.5) (orange), weak LD (r2 > 0.2) (yellow) and little or no LD (open symbols) are shown. The SNPs with the strongest associations were rs115721871 at bp 32653792 (panels A and D), rs9277558 at bp 33056711 (panels B and C), and rs9268668 at bp 32413889 (panel F).

Table 2

Summary of Association Results and Conditioning Studies of HLA alleles in PBCa

Marker/AminoAcid/HLAdeterminantbbase pairposition(HG19)cAllele 1(MinorAllele)dAllele 2MAF(Case)eMAF(Control)p-valuefOddsRatiominoralleleCond. p-valueDRB1gCond. p-valueDPB1gCond. p-valueDRB1 andDPB1g
rs11572187132653792AG0.170.089.8E-172.619.4E-021.3E-191.2E-01
rs424605532592360CT0.190.102.9E-162.472.1E-021.6E-192.8E-02
rs11418393532655730AG0.590.438.1E-161.801.9E-021.2E-183.1E-02
rs11443244332652278TC0.390.534.6E-150.552.3E-021.4E-172.8E-02
rs11649371232669767TC0.570.461.1E-091.562.6E-015.2E-101.8E-01
rs11479688132664960AG0.300.411.2E-090.633.9E-011.8E-103.7E-01
rs11432727432289594AG0.240.174.1E-081.661.2E-011.9E-081.2E-01
rs11634841732389648AG0.260.374.9E-080.665.2E-016.2E-115.7E-01
HLA-DRB1*0832552064SPECOTHER0.080.031.6E-113.221.0E+007.3E-131.0E+00
HLA-DQB1*04:0232631061SPECOTHER0.070.031.4E-103.166.8E-019.0E-124.5E-01
HLA-DRB1*1132552064SPECOTHER0.150.261.4E-100.551.0E+009.4E-138.3E-01
HLA-DQA1*04:0132608306SPECOTHER0.070.021.9E-100.326.5E-019.9E-125.0E-01
HLA-DQB1*03:0132631061SPECOTHER0.210.316.1E-090.618.9E-013.8E-104.5E-01
HLA-DQB1*05:0332631061SPECOTHER0.090.056.2E-072.147.6E-018.2E-091.0E+00
HLA-DRB1*1432552064SPECOTHER0.090.056.9E-072.121.0E+008.8E-091.0E+00
HLA-DPB1*03:0133049368DETOTHER0.160.109.2E-071.784.2E-061.0E+001.0E+00
DRB1-AA7432551948OTHERA0.430.311.2E-111.705.9E-013.2E-129.9E-01
DRB1-AA7432551948LOTHER0.080.031.3E-113.232.9E-016.1E-132.0E-01
DQA1-AA6932609279TOTHER0.080.032.7E-113.216.0E-018.5E-136.4E-01
DRB1-AA5832551996EA0.150.261.5E-100.551.0E+009.4E-131.0E+00

This Table provides a summary of results before and after conditioning on particular HLA determinants and includes: 1) the top four imputed SNPs (rows 1–4); 2) top four genotyped SNPs (rows 5–8); 3) all classical HLA determinants with p-values < 10–6 (rows 9–16); and 4) the top four HLA amino acid (AA) associations (rows 17–20). P values after conditioning on particular HLA genes (see methods) are also shown. The p values were determined using SNPTEST V2 (see Patients and Methods) and used the genotype probabilities. A complete summary of strongly association markers is provided in Supplementary Table 1.

The SNP name, classical HLA determinant or AA is shown.

The position corresponding to the marker is shown (chromosomal base pair positions from HG19).

Allele 1 is the minor allele used for the analysis. For sequences designated with amino acid (AA) numbers the symbol refers to the allele symbol corresponds to standard AA nomenclature. Where the AA is compared with multiple other AAs, "Other" is used to represent all alternative AAs. The AA position includes the leader sequence (e.g. DRB1* position 104 is the same as AA position 74 without the leader sequence). Where HLA determinants are provided, the specificity is compared with not having the specificity (OTHER).

MAF: Minor allele frequencies were estimated from probabilities (SNPTEST V2).

p values before conditioning using population substructure and gender as covariates.

p value after conditioning on the allele(s) listed in the heading line.

As expected from the analysis of classical HLA alleles, PBC also showed strong association with specific AAs in these genes. Most of the HLA AA showing association signals corresponded to the key residues that distinguish the specific classical alleles which for DRB1, included lysine (L) at AA74 in DRB1 (DRB*08), glutamate (E) at AA58 (DRB*11), alanine (A) at AA57 (DRB1*14), and histidine (H) at AA60 (DRB1*14). Similar results were observed for specific DQB1, and DQA1 AAs that are in strong LD with specific DRB1 alleles and AAs (Supporting Table 2).

Conditioning Studies Using Classical HLA Genes

To examine whether these associations could be explained for by known coding differences in genes we next performed a series of conditional analyses. These studies were done by conditioning on a combination of the alleles from an HLA gene (e.g. DRB1) to control for the association that might be attributable to each gene, albeit some of the effect may not be directly attributed to that gene due to extensive LD across this region. The residual signals after controlling for the effect of various combinations of classical alleles and AA residues in these HLA genes show that both DRB1 and DQB1 could account for most of the association signal (Fig. 1 B-E, Table 2 and Supporting Table 3). In addition, the signal in the DPB1 region was only marginally decreased conditioning on DRB1, DQB1 or DQA1. Conditioning on DPB1 eliminated the signal in the DPB1 region and showed a modest increase in the signal in the across DRB1, DQA1 and DQB1. To further assess these conditional analyses, we also examined the relative difference of the conditioning by different HLA genes by examining beta estimates and their differences. The beta estimate is the measure of the increase in log-odds that can be attributed to each copy of a given minor allele. The largest effect is from the composite of DRB1 alleles as shown by the residual beta estimates (and odds ratios) after conditioning and the mean change in the beta estimates (Table 3). This is most evident examining the SNPs with the strongest signals from association (original signal < 5 × 10−8). For example, the DRB1 conditioning had a much larger mean change in beta estimate (−0.424) compared with DQB1 (−0.236) (p value < 10−10, paired T test). Additional conditional analyses using combinations with DQA1 demonstrated that DQA1 could not substitute for DRB1 or DPB1 in any of the combinations tested (data not shown).
Table 3

Comparison of Residual Association Beta Estimates and Odds Ratios after Conditioning Studies

All MarkersaAssociationbControlledDRB1cControlledDQB1ControlledDQA1ControlledDPB1ControlledDRB1 &DPB1ControlledDQB1 &DPB1ControlledDRB1*8, 11,14, & DPB1*03:01
mean B estimate0.2420.1880.2050.2110.2370.1800.1970.190
mean change in beta estimateReference−0.055−0.037−0.026−0.005−0.062−0.045−0.052
mean OR1.2741.2071.2281.2341.2681.1971.2181.209
SNPs < 5E-8dAssociationControlledDRB1ControlledDQB1ControlledDQA1ControlledDPB1ControlledDRB1 &DPB1ControlledDQB1 &DPB1ControlledDRB1*8, 11,14, &DPB1*03:01

mean B estimate0.7100.1650.2970.3400.5850.1590.2950.179
mean change in beta estimateReference−0.424−0.236−0.229−0.005−0.430−0.240−0.410
mean OR2.031.181.351.411.791.171.341.20

All markers including SNPs (MHC region), classical HLA determinants and HLA Amino Acids.

The beta estimate, change in beta estimate and odds ratios from SNPTESTV2 score test (see methods).

Results after conditioning on the indicated HLA genes or alleles.

Results using only markers with p values < 5 × 10−8 in the initial association tests.

Conditioning on Specific HLA alleles

We next examined the effect of conditioning on specific DRB1, DQA1, DQB1 and DPB1 classical genes and AAs. A clear pattern emerged showing that the association of groups of SNPs was specifically controlled by different alleles. These results are highlighted in Table 4 and in a more complete version Supporting Table 3. Individually, the specific SNPs conditioned a part of the association signal largely corresponding to those SNPs in moderate or strong LD (r2 > 0.5) with the particular classical specificity or AA. Similar effects were observed for specific alleles in one gene in strong LD with another gene. This is particularly evident for the DRB1*14 and DQB1*05:03 in which the effects of controlling for either of these alleles was virtually indistinguishable (Supporting Table 3). Other pairs (e.g. DRB1*08 and DQB1*04:02) showed small but consistent differences in which the DRB1 allele diminished more signal than when the DQB1 allele was used in the conditioning analyses (Supporting Table 3). Notably the strongest AA association was at position 74 that is in the antigen binding pocket of DRB1[11, 12].
Table 4

Conditioning on Specific HLA DRB1 and DPB1 Alleles

GroupaMarker/AminoAcid/HLAdeterminantbbase pairposition(HG19)cp-valuedCond. p-valueDRB1*08eCond. p-valueDRB1*11eCond. p-valueDRB1*14eCond. p-valueDPB1*03:01e
1rs114991247325835045.45E-132.41E-033.96E-132.15E-183.75E-14
1rs116240177325616628.20E-134.82E-031.17E-123.66E-184.29E-14
1DRB1-AA74L325519481.33E-114.72E-012.26E-113.67E-161.48E-12
1HLA-DRB1*08325520641.59E-111.00E+002.73E-114.55E-161.77E-12
1DQA1-AA69T326092792.75E-118.93E-013.55E-115.99E-162.35E-12
1HLA-DQB1*04:02326310611.38E-105.79E-011.86E-104.02E-151.63E-11
1HLA-DQA1*04:01326083061.90E-109.58E-011.73E-105.55E-152.01E-11
2rs9461776325757353.02E-122.39E-143.35E-039.62E-151.53E-15
2rs115671039326032121.98E-119.29E-134.62E-025.22E-132.91E-14
2HLA-DRB1*11325520641.42E-101.34E-129.99E-014.34E-139.25E-14
2DRB1-AA58E325519961.45E-101.34E-129.98E-014.38E-139.30E-14
2DQB1-AA71T326326465.25E-103.65E-092.62E-044.68E-101.69E-12
2HLA-DQB1*03:01326310616.10E-093.29E-109.76E-013.95E-103.25E-11
3rs380911322094518.94E-091.36E-104.52E-063.71E-045.26E-09
3rs115240978326723831.28E-081.30E-102.24E-066.75E-038.16E-10
3HLA-DQB1*05:03326310616.21E-074.79E-095.13E-056.15E-012.31E-08
3HLA-DRB1*14325520646.89E-075.29E-094.83E-059.99E-012.46E-08
3DRB1-AA57A325519996.90E-075.29E-094.82E-059.99E-012.46E-08
3DRB1-AA60H325519906.90E-075.29E-094.82E-059.99E-012.46E-08
4rs116751915330525028.11E-099.21E-093.09E-061.76E-101.69E-04
4rs114761768330504421.79E-083.09E-084.73E-064.90E-103.98E-04
4DPB1-AA76M330486626.13E-083.78E-072.11E-051.07E-088.71E-03
4DPB1-AA170I330529581.06E-072.61E-071.61E-053.21E-098.41E-03
4HLA-DPB1*03:01330493689.18E-072.05E-061.01E-063.70E-099.98E-01

The group corresponds to those markers that are successfully controlled by specific HLA alleles (p value after conditioning is >1 × 10–4. The bold highlighted p-values indicate the p-values used in the “group” assignment. For each group the following are shown in this abbreviated summary: i) the two SNPs with the strongest association in the group; ii) the two HLA amino acids (AA) with the strongest association; and iii) all classical HLA alleles with p <10–6 prior to conditioning. The p values were determined using SNPTEST V2 (see methods) and used the genotype probabilities. A more complete summary of strongly association markers is provided in Supplementary Table 2.

The SNP name, classical HLA determinant or AA is shown. For the AA the letter following the residue number provides the relevant AA.

The position corresponding to the marker is shown (chromosomal base pair positions from HG19).

p values after conditioning using population substructure and gender as covariates.

p value after conditioning (Cond) on the allele(s) listed in the heading line.

Similar to when we conditioned on genes (“all” alleles at a particular gene), the DPB1 region SNP signal was only substantially reduced when DPB1*03:01 (Table 4 and Supporting Table 3) or when specific DPB1 AA’s were used in conditioning. The strongest effects were observed for the lysine at AA position 11 and the methionine at AA position 76 that are both members of the 16 AA in the putative antigen binding pocket of this gene[11]. Thus, the vast majority of the HLA region association signal can be accounted individually by conditioning on one of four specific alleles, three in DRB1 (*08, *11, and *14), and one in DPB1 (*03:01). Combinations of these specific alleles accounted for most of the remaining signal (Supporting Table 3 and Fig. 1F) and are also reflected in the strong reduction in beta estimates (Table 3). However, there are signals from several SNPs in the DRB1-DQB1 region that are not accounted for by these conditioning studies. None of these SNPs with signals p <10−5 after conditioning on DRB*08, *11 and *14, and DPB1*03:01 were among the stronger associated SNPs prior to conditioning (all with original association p values >10−6). In particular, the strongest associated SNP after conditioning (rs9268668, p = 1.67 × 10−7), showed no signal prior to conditioning (p = 0.40). Whether these residual or new signals are also due to other specific classical HLA genes is not clear, however, conditioning on “all” DRB1 and DPB1 alleles ablated all signals with resulting p-values > 10−5 (Fig. 1F) suggesting that additional sequence differences (e.g. putative regulatory SNPs) do not have to be postulated. Most of the signal observed for specific AAs was also specifically eliminated when conditioning on the DRB1 or DPB1 classical alleles. However, there were several exceptions in which the association signal was not readily decreased by controlling for single classical HLA alleles. These AAs included DRB1-AA47F, DRB1 AA74A, DQB1-AA26G, and DQB1-AA74S. For these AAs the signal was ablated when conditioning on two DRB1 alleles (DRB1*08 and DRB1*11) (Supporting Table 3). Conversely, conditioning on these AA could not account for the association of the most of the other SNPs that were not ablated by single classical HLA alleles (Supporting Table 3). Therefore, it may be less likely that these particular AAs are critical to explaining the association patterns we observed. However, we cannot exclude a specific functional role for these AAs and it is notable that DRB1-AA47 and DRB1-AA74 are both in the antigen binding pocket of DRB1[11, 12] and that conditioning on DRB1-AA47F did ablate several of the association signals that were not controlled by individual DRB1 classical alleles.

Genotypic Associations

We also examined genotypic associations including combinations of susceptibility alleles and combinations of risk susceptibility and protective alleles. Examining individuals with combinations of risk alleles DRB1*08 combined with DPB1*03:01 or DRB1*14 combined with DPB1*03:01 we found higher odds ratios for disease association than when examining only individuals with single susceptibility alleles (Supporting Table 4). There were insufficient numbers of DRB1*08/DRB1*14 (frequency <1%) to evaluate these heterozygote genotypes. The increased risk of the combining DPB1*03:01 with DRB1*08 or DRB1*14 was observed whether DPB1*0301 was on the same or different haplotype as that of the DRB1 risk allele (Supporting Table 4). When risk alleles were combined with the DRB1*11 protective allele the odds ratios were near 1 and there was no significant association with disease. Finally, we also examined the cumulative combination of risk predisposing and protective alleles (Table 5 and Supporting Table 4). The count of predisposing alleles minus protective alleles showed a strong correspondence with the odds ratio for PBC. Individuals with an excess of one or two or more risk alleles showed an odds ratio (OR) of 3.05 and 5.25 between cases and controls, and conversely individuals with an excess of one or two or more protective alleles had OR of 0.5 or 0.38, respectively (Table 5). All results were compatible with an additive model of action between each of the alleles similar to our previous observations[6].
Table 5

Cumulative Effect of Risk and Protective Classical HLA Alleles for PBC Susceptibility

CategoryaControlsNumber (%)bCasesNumber (%)Odds Ratiop-value
RISK - PROTECTIVE =/< −2177 (12.3)34 (5.0)0.385.87E-08
RISK - PROTECTIVE =/< −1682 (48.1)185 (27.4)0.502.03E-12
RISK - PROTECTIVE = 0522 (36.3)246 (36.4)1.00NS
RISK - PROTECTIVE =/> +1226 (15.7)245 (36.2)3.056.94E-25
RISK - PRPTECTIVE =/> 232 (2.2)72 (10.7)5.251.37E-15

This table shows the results of categorizing each participant based on the sum of the each risk allele (positive number) and each protective allele (negative number). The risk alleles are DRB1*08, DRB1*14 and DPB1*03:01. The protective alleles are DRB1*11 and DRB1*13. The alleles were determined from the most probable allele after imputation and haplotype analyses (see Methods).

The number of subjects that are in each category are shown together with the % of total controls (1440) or percent of cases (676).

The two tailed Fisher exact p value from contingency table analyses comparing cases and controls in each category.

Discussion

The current study of PBC association with HLA differs from previous investigations by providing the most comprehensive analysis of the entire HLA region while correcting for multiple confounding factors. Our results are consistent with a predominant role for class II genes and we believe exclude any substantial effect from either HLA class I or class III genes (there were no residual signals for these genes with p <0.0005 after accounting for class II genes). This contrasts other autoimmune diseases in which HLA class I or class III plays a predominant role (e.g. myasthenia gravis[13]) or strong class I gene effects are observed independent of class II associations (e.g. type 1 diabetes[14] and multiple sclerosis[15, 16]). Our study strongly suggests that the major gene in HLA that underlies susceptibility to PBC is DRB1. Overall DRB1 alleles show the strongest associations and conditioning studies show that DRB1 could account for almost all (except DPB1 region) of the association signal. HLA-DQB1 shows association that is only marginally less than that observed for DRB1. However, several points suggest that these associations are secondary to the strong LD between DRB1 and DQB1: 1) the overall strength of association of particular DRB1 alleles is stronger than the corresponding DQB1 allele; 2) conditioning on DRB1 could account for all DQB1 associations; and 3) residual beta-estimates after conditioning showed a substantially stronger DRB1 than DQB1 effect. In addition, our study provides strong evidence for an independent effect of DPB1. Although previous studies have as indicated in the introduction suggested DPB1 associations in PBC, these were based on small subject sets and were difficult to evaluate. Our study demonstrates that the association of DPB1 cannot be accounted for by controlling for other HLA region genes. These results are also consistent with findings in some, but not other, autoimmune diseases in which an independent effect of DPB1 has been reported. These include juvenile idiopathic arthritis[17], type 1 diabetes[18], multiple sclerosis[16, 19] and particular autoantibodies in systemic lupus erythematosus[20]. We note that our study does not directly address whether DRB3, DRB4, DRB5 or structural variations might have additional independent associations. At present such studies are challenging due to absence of reference sets for imputation and/or difficulty in assessing these polymorphisms including whether missing genotypes (excluded SNPs with call rates <0. 95) may have excluded analysis or inclusion of SNPs within these genes in available arrays. This study has also addressed the association of specific HLA-gene alleles. Most of the HLA association with PBC can be attributed to specific associations with DRB1*08, DRB1*11, DRB1*14 and DPB1*03:01. DRB1*08 has the strongest association, followed by DRB1*11 consistent with several previous studies[6, 7]. PBC associations with DRB1*14 has not been previously demonstrated, however, this weaker effect is supported by our conditioning studies that show that this classical allele can control for a set of associated SNPs and AAs that are not strongly influenced by other classical alleles (Table 3 and Supporting Table 3). The DPB1*0301 association is consistent with a previous study of a small German cohort[7]. In a previous study we observed that DRB1*13 was a protective allele[6]. In the current study the association of DRB1*13 was weak (p = 4.9 × 10−3, OR= 0.69, 95% CL = 0.53 – 0.89) compared to the previous study (p = 3.6 × 10−6). This may be due to several factors: i) the previous study did not explicitly control for population substructure; ii) the overlap of subjects with the previous study is < 25% and the difference may reflect statistical noise; and iii) the previous study used DNA typing rather than the imputation used in the current study. It may be worth noting that DRB1*13 like DRB1*11 has an alanine at AA position 74 and thus contributes to the protective effect observed for this AA (p = 1.33 × 10−11, Supporting Table 3). Similarly, our previous study[6] showed only a marginal association of DRB1*14 (uncorrected p value = 0.004) compared with a strong association (p = 6.9 × 10−7) observed in our current study. Here, the conditioning study results including the effect of controlling SNPs with very strong associations (see group 3, Table 4) provide additional support for the role of DRB1*14. Finally, we have also considered specific HLA AAs. Most of the associated AAs are both nearly unique to the specific HLA classical alleles discussed above and also correspond to critical residues for the antigen binding pocket. Thus, associated AAs in DRB1 at AA positions 37, 47, 57, 60, 67, 70 and 74; and DPB1 at AA positions 9, 11, 76, 84, 87 are antigen pocket AAs[11, 12]. Consistent with our results strong associations have been recently observed with serine at position 57 and leucine at position 74 in a Japanese PBC cohort[21]. We also note that many of the associated DQB1 AAs are also in critical residues for antigen binding (DQB1-AA13, 26, 70, 71, 74)[11, 12]. Of the DRB1 associated only AA58 is not among the AAs in this functional class, whereas for DQB1 several are not in this functional class (DQB1-AA45, 56, 75, 167, 185). In conclusion, the most parsimonious explanation consistent with the current study is that classical HLA genes and the coding variations within these genes are responsible for the HLA associations with PBC. Although we cannot exclude the possibility that other sequence variations affecting for example gene regulation could be important, our data indicates that a limited set of classical DRB1 and DPB1 alleles are sufficient to explain HLA associations with this disease. We believe the current data provides cogent information for understanding HLA-associations in PBC. Studies in other ethnic groups both within Europe and in other continental groups will also be important in further definition of the role of particular HLA genes and alleles. Lastly, our results provide additional rational for functional studies examining specific HLA genes and their relative binding to the putative disease associated epitopes of the PDC-E2 the immunodominant autoantigen epitopes of PBC[22] [23].

Materials and Methods

Study Population and design

The Italian PBC cases were obtained through a multi-center study and met internationally accepted criteria for the diagnosis of PBC as detailed in a previous study[6]. Each of the included cases also met ancestry criteria as defined below (see Ancestry). Controls were derived from several sources and this sample set information is detailed in Supporting Table 1. After data filtering and ancestry analyses contained 676 Italian PBC cases and 1440 Italian controls. All subjects enrolled in the study provided written informed consent and the study followed ethical guidelines of the most recent revision of the Declaration of Helsinki (Edinburgh, 2000). All samples were genotyped with either Illumina (San Diego, CA 92121) genome-wide and/or Immunochip SNP platforms and the participants included the dataset from our previous GWAS as well as new samples (see Supporting Table 1). With the exception of ancestry information and assessment of relatedness the current study was restricted to genotypes in an ~4 megabase segment of human chromosome 6 (bps 28911802 – 33813043, HG19 map). This dataset comprised a minimum of 1548 and a maximum of 5489 genotyped SNPs in each individual (Supporting Table 1) and was used for the SNP and HLA imputations (see Imputation).

Data Filtering

We used stringent quality control criteria to ensure that high-quality data were included in the analyses. We excluded individuals who had >5% missing data and all individuals with cryptic relatedness and duplicate samples based on identity-by-descent status for genome-wide SNPs (PI^ > 0.15) using PLINK[24]. We included only SNPs with <5% missing data, Hardy-Weinberg (H-W) equilibrium p values >10−4 in controls and >10−5 in combined cases and controls (to exclude most genotyping errors) applying these procedures in a stepwise approach separately for each dataset. For each of the separately derived control genotyping sets (Supporting Table 1), SNPs were excluded if they failed the above criteria within the individual control set or in combination with any of the other control groups, or in the complete data set. The H-W criteria were applied after exclusion of non-European individuals (see Ancestry). Finally, SNPs were excluded if allele frequency differed by > 10% between different control subject groups.

Ancestry

European ancestry was determined using 883 genome-wide SNPs with minimal or no linkage disequilibrium (LD) (r2 < 0.1). SNPs analyzed using the STRUCTURE v2.1 program[25] and subjects of known European, Amerindian, East Asian and West African origin as previously described[26]. We used STRUCTURE to exclude non-European and admixed study participants since this method allows exclusion/inclusion criteria to be set using reference populations. Subjects with >15% non-European ancestry were excluded from further analysis. Italian ancestry was defined using principal components analyses (PCA). For subjects with GWAS data, we used the same methods and criteria applied in a previous study with largely the same dataset. Briefly, PCA was performed using the EIGENSOFT statistical package[27] utilizing 34 thousand SNPs distributed throughout the genome (r2 < 0.1) that we have previously used to define population genetic substructure[2]. These analyses used an independent set of Italian subjects for establishing membership [+/− 2 standard deviations (sd) in first 4 principal components (PCs)]. In the current study, a substantial portion of the samples did not have GWAS data (Supporting Table 1). For these samples we used a set of 12,579 SNPs from the Immunochip for which our empiric analyses demonstrated the ability of this set to discern Italian ancestry and exclude both other European ethnicities including Sardinian Italians (Supporting Fig. 1). Using the GWAS defined individuals, the Immunochip only genotyped samples were included using 2 sd in the first 4 PCs. In addition to the subject selection, we used the eigenvalues from the first four PCs (only the first four PCs were significant based on Tracy-Widom statistics) as covariates in our association analyses.

Imputation

We imputed SNPs, HLA classical alleles, and HLA gene amino acids (AAs) using phased reference genotypes from both the 1000 genome sequencing project (interim release June 2011) (http://www.1000genomes.org/) and an HLA defined reference set[8]. For the 1000 genome imputation we used IMPUTE version 2[28] under default parameters. The reference haplotypes for this imputation were from 1094 subjects including 381 European subjects and 98 Tuscan Italians. The number of genotyped (inference) SNPs that overlapped with the 1000 genome reference set ranged from 1435 SNPs (samples typed by GWAS), 4386 (samples typed by Immunochip) to 4981 SNPs (samples typed by GWAS plus Immunochip) (Supporting Table 1). For subsequent data analyses we utilized only imputed genotypes with maximum posterior probability scores of > 0.90. Using this parameter our empiric testing (leave one-out analyses) indicated that the maximum error rate for genotype assignment was < 0.05 and the mean error rate was < 0.01. To impute classical HLA alleles and corresponding amino acids determinants we utilized a reference separate dataset of collected by the Type 1 Diabetes Genetics Consortium (T1DGC). This reference data contains genotype data for 2,537 SNPs, selected to tag the entire MHC, and classical types for HLA-A, B, C, DRB1, DQA1, DQB1, DPA1 and DPB1 at 4-digit resolution in 2767 unrelated individuals of European descent[29]. The Beagle software package[30] was used for this imputation under default parameters. The number of inference SNPs that overlapped with this reference dataset ranged from 648 (samples typed by GWAS), 1444 SNPs (samples typed by Immunochip) to 1610 SNPs (GWAS plus Immunochip) (Supporting Table 1). Similar to the imputation using 1000 Genome data, only SNPs with posterior probabilities of >0.90 were included in our final analyses. For imputed SNPs that overlapped between the two imputation sets and algorithms used (Impute V2.0 and Beagle) there was a nearly complete concordance of the association testing results indicating similar performance of these algorithms for this dataset. After imputation and selecting only those markers meeting posterior probability criterion this region contained a total of 49,885 markers including the genotyped SNPs that were included in association test analyses.

Association and Conditional Association Tests of Imputed SNPs and HLA determinants

SNPTEST V2.0[28] (web) was used for the primary association analyses for the imputed genotypes. This software uses the genotype probabilities for the imputed SNPs or determinants and accounts for genotype uncertainty. The first four PC eigenvalue scores were used as continuous variables in the association test together with the gender covariate. Analyses were performed using the SNPTEST v2 Score test algorithm that enabled both the inclusion of the covariates and conditioning tests and all of our reported results used an additive model. To minimize potential spurious results we limited our main and conditioning analyses to markers with information scores (Inf) > 0.85. This parameter is a measure of the observed statistical information for the estimate of SNP allele frequency (for additional information see https://mathgen.stats.ox.ac.uk/genetics_software/snptest/snptest.v2.pdf). Conditioning on multiple markers either separately or together was performed using an additive model. For the HLA region, over 150 conditional analyses were performed using the SNPs and HLA determinants including all HLA determinants with p values < 10−6. Nominal p values after correction for covariates and conditioning are provided throughout the manuscript. The p-values <10−6 would remain significant after conservative (Bonferroni) correction for the number of markers (<50,000) tested after imputation.
  30 in total

1.  Evolution of HLA class II molecules: Allelic and amino acid site variability across populations.

Authors:  H Salamon; W Klitz; S Easteal; X Gao; H A Erlich; M Fernandez-Viña; E A Trachtenberg; S K McWeeney; M P Nelson; G Thomson
Journal:  Genetics       Date:  1999-05       Impact factor: 4.562

2.  Human leukocyte antigen class II molecules confer both susceptibility and progression in Japanese patients with primary biliary cirrhosis.

Authors:  Takeji Umemura; Satoru Joshita; Tetsuya Ichijo; Kaname Yoshizawa; Yoshihiko Katsuyama; Eiji Tanaka; Masao Ota
Journal:  Hepatology       Date:  2011-12-19       Impact factor: 17.425

3.  HLA class II alleles, genotypes, haplotypes, and amino acids in primary biliary cirrhosis: a large-scale study.

Authors:  Peter T Donaldson; Anna Baragiotta; Michael A Heneghan; Annarosa Floreani; Carla Venturi; James A Underhill; David E J Jones; Oliver F W James; Margaret F Bassendine
Journal:  Hepatology       Date:  2006-09       Impact factor: 17.425

4.  Pleiotropic effects of the 8.1 HLA haplotype in patients with autoimmune myasthenia gravis and thymus hyperplasia.

Authors:  Claire Vandiedonck; Geneviève Beaurain; Matthieu Giraud; Catherine Hue-Beauvais; Bruno Eymard; Christine Tranchant; Philippe Gajdos; Jean Dausset; Henri-Jean Garchon
Journal:  Proc Natl Acad Sci U S A       Date:  2004-10-15       Impact factor: 11.205

5.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

6.  HLA-DRB1, TAP2/TAP1, and HLA-DPB1 haplotypes in Finnish juvenile idiopathic arthritis: more complexity within the MHC.

Authors:  J A Runstadler; H Säilä; A Savolainen; M Leirisalo-Repo; K Aho; E Tuomilehto-Wolf; J Tuomilehto; M F Seldin
Journal:  Genes Immun       Date:  2004-11       Impact factor: 2.676

7.  HLA-DPB1 alleles association of anticardiolipin and anti-beta2GPI antibodies in a large series of European patients with systemic lupus erythematosus.

Authors:  G D Sebastiani; M Galeazzi; A Tincani; R Scorza; A Mathieu; G Passiu; G Morozzi; J C Piette; R Cervera; F Houssiau; J Smolen; A Fernandez Nebro; E De Ramon; A Jedryka Goral; C Papasteriades; G B Ferrara; C Carcassi; F Bellisai; R Marcolongo
Journal:  Lupus       Date:  2003       Impact factor: 2.911

8.  Association of primary biliary cirrhosis with the allele HLA-DPB1*0301 in a German population.

Authors:  J G Mella; E Roschmann; K P Maier; B A Volk
Journal:  Hepatology       Date:  1995-02       Impact factor: 17.425

9.  Identification and specificity of a cDNA encoding the 70 kd mitochondrial antigen recognized in primary biliary cirrhosis.

Authors:  M E Gershwin; I R Mackay; A Sturgess; R L Coppel
Journal:  J Immunol       Date:  1987-05-15       Impact factor: 5.422

10.  A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHC.

Authors:  Paul I W de Bakker; Gil McVean; Pardis C Sabeti; Marcos M Miretti; Todd Green; Jonathan Marchini; Xiayi Ke; Alienke J Monsuur; Pamela Whittaker; Marcos Delgado; Jonathan Morrison; Angela Richardson; Emily C Walsh; Xiaojiang Gao; Luana Galver; John Hart; David A Hafler; Margaret Pericak-Vance; John A Todd; Mark J Daly; John Trowsdale; Cisca Wijmenga; Tim J Vyse; Stephan Beck; Sarah Shaw Murray; Mary Carrington; Simon Gregory; Panos Deloukas; John D Rioux
Journal:  Nat Genet       Date:  2006-09-24       Impact factor: 38.330

View more
  31 in total

Review 1.  Primary biliary cirrhosis: From bench to bedside.

Authors:  Elias Kouroumalis; George Notas
Journal:  World J Gastrointest Pharmacol Ther       Date:  2015-08-06

2.  HLA-DRB1 the notorious gene in the mosaic of autoimmunity.

Authors:  María-Teresa Arango; Carlo Perricone; Shaye Kivity; Enrica Cipriano; Fulvia Ceccarelli; Guido Valesini; Yehuda Shoenfeld
Journal:  Immunol Res       Date:  2017-02       Impact factor: 2.829

Review 3.  Primary biliary cholangitis: a comprehensive overview.

Authors:  Ana Lleo; Simona Marzorati; Juan-Manuel Anaya; M Eric Gershwin
Journal:  Hepatol Int       Date:  2017-11-21       Impact factor: 6.047

4.  Autoimmune hepatitis and eosinophilic granulomatosis with polyangiitis: a rare association.

Authors:  Saroj Lohani; Salik Nazir; Niranjan Tachamo; Pavani Pagolu
Journal:  BMJ Case Rep       Date:  2017-01-20

Review 5.  Animal models of primary biliary cirrhosis.

Authors:  Tomohiro Katsumi; Kyoko Tomita; Patrick S C Leung; Guo-Xiang Yang; M Eric Gershwin; Yoshiyuki Ueno
Journal:  Clin Rev Allergy Immunol       Date:  2015-06       Impact factor: 8.667

Review 6.  The genetics of complex cholestatic disorders.

Authors:  Gideon M Hirschfield; Roger W Chapman; Tom H Karlsen; Frank Lammert; Konstantinos N Lazaridis; Andrew L Mason
Journal:  Gastroenterology       Date:  2013-04-10       Impact factor: 22.682

Review 7.  The genetics of human autoimmune disease: A perspective on progress in the field and future directions.

Authors:  Michael F Seldin
Journal:  J Autoimmun       Date:  2015-09-04       Impact factor: 7.094

8.  Phenome-wide scanning identifies multiple diseases and disease severity phenotypes associated with HLA variants.

Authors:  Jason H Karnes; Lisa Bastarache; Christian M Shaffer; Silvana Gaudieri; Yaomin Xu; Andrew M Glazer; Jonathan D Mosley; Shilin Zhao; Soumya Raychaudhuri; Simon Mallal; Zhan Ye; John G Mayer; Murray H Brilliant; Scott J Hebbring; Dan M Roden; Elizabeth J Phillips; Joshua C Denny
Journal:  Sci Transl Med       Date:  2017-05-10       Impact factor: 17.956

9.  Acute autoimmune-like hepatitis with atypical anti-mitochondrial antibody after mRNA COVID-19 vaccination: A novel clinical entity?

Authors:  Michele Ghielmetti; Helen Dorothea Schaufelberger; Giorgina Mieli-Vergani; Andreas Cerny; Eric Dayer; Diego Vergani; Benedetta Terziroli Beretta-Piccoli
Journal:  J Autoimmun       Date:  2021-07-15       Impact factor: 7.094

10.  Rheumatoid arthritis and primary biliary cirrhosis: cause, consequence, or coincidence?

Authors:  Daniel S Smyk; Dimitrios P Bogdanos; Maria G Mytilinaiou; Andrew K Burroughs; Eirini I Rigopoulou
Journal:  Arthritis       Date:  2012-10-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.