Literature DB >> 24278156

Diversity of extended HLA-DRB1 haplotypes in the Finnish population.

Annika Wennerström1, Efthymia Vlachopoulou, L Elisa Lahtela, Riitta Paakkanen, Katja T Eronen, Mikko Seppänen, Marja-Liisa Lokki.   

Abstract

The Major Histocompatibility Complex (MHC, 6p21) codes for traditional HLA and other host response related genes. The polymorphic HLA-DRB1 gene in MHC Class II has been associated with several complex diseases. In this study we focus on MHC haplotype structures in the Finnish population. We explore the variability of extended HLA-DRB1 haplotypes in relation to the other traditional HLA genes and a selected group of MHC class III genes. A total of 150 healthy Finnish individuals were included in the study. Subjects were genotyped for HLA alleles (HLA-A, -B, -DRB1, -DQB1, and -DPB1). The polymorphism of TNF, LTA, C4, BTNL2 and HLA-DRA genes was studied with 74 SNPs (single nucleotide polymorphism). The C4A and C4B gene copy numbers and a 2-bp silencing insertion at exon 29 in C4A gene were analysed with quantitative genomic realtime-PCR. The allele frequencies for each locus were calculated and haplotypes were constructed using both the traditional HLA alleles and SNP blocks. The most frequent Finnish A∼B∼DR -haplotype, uncommon in elsewhere in Europe, was A*03∼B*35∼DRB1*01∶01. The second most common haplotype was a common European ancestral haplotype AH 8.1 (A*01∼B*08∼DRB1*03∶01). Extended haplotypes containing HLA-B, TNF block, C4 and HLA-DPB1 strongly increased the number of HLA-DRB1 haplotypes showing variability in the extended HLA-DRB1 haplotype structures. On the contrary, BTNL2 block and HLA-DQB1 were more conserved showing linkage with the HLA-DRB1 alleles. We show that the use of HLA-DRB1 haplotypes rather than single HLA-DRB1 alleles is advantageous when studying the polymorphisms and LD patters of the MHC region. For disease association studies the HLA-DRB1 haplotypes with various MHC markers allows us to cluster haplotypes with functionally important gene variants such as C4 deficiency and cytokines TNF and LTA, and provides hypotheses for further assessment. Our study corroborates the importance of studying population-specific MHC haplotypes.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24278156      PMCID: PMC3836878          DOI: 10.1371/journal.pone.0079690

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The Major Histocompatibility Complex located on chromosome 6p21 has a complex allelic structure with extended linkage disequilibrium (LD) and polymorphism. The traditional human leukocyte antigen (HLA) genes encode the cell-surface antigen-presenting proteins, the HLA molecules, and fall into two major MHC classes; Class I (HLA-A, -C, and -B) and Class II (HLA-DRB1, -DQB1, and –DPB1). Many non-HLA genes related to immune responses e.g. tumor necrosis factor (TNF), lymphotoxin-alpha (LTA), complement C4 genes (C4A and C4B) and butyrophilin-like protein 2 (BTNL2), are located in the MHC class III region that resides between the MHC Class I and II regions [1]–[3]. The location of recombination hotspots and the length of LD blocks (genomic fragments inherited together) are population specific [1], [4], [5]. Interestingly, it has been shown that the Finnish population has distinctive population substructure compared with other Europeans [6]–[8]. The HLA genes play a critical role in hematopoietic stem cell transplantation, and HLA mismatching has been associated with graft failure and graft-versus-host disease [9], [10]. In addition to the traditional HLA genes, it has been suggested that the HLA-SNP haplotypes influence on the outcome of the transplantation [11]. The association between MHC and diseases has been known for decades. The polymorphic HLA-DRB1 has been associated with several complex and infectious diseases, such as acute coronary syndrome [12], HSV2 related meningitis [13], type I diabetes, celiac disease and multiple sclerosis (reviewed in [14]). Recently, genome wide association analyses (GWA) have increased the number of MHC gene associations with several drug-induced hypersensitivity reactions, autoimmune, infectious and inflammatory disorders [15]–[20] (GWAS Integrator, http://genome.ucsc.edu). However, the predisposing HLA-alleles are common in the healthy population. Other genetic and environmental triggers are required for disease susceptibility [15], [21]. Thus, the complexity of MHC makes the variant(s) responsible for the causal effect difficult to pinpoint. As the traditional HLA typing is considered rather expensive and laborious and the analysis of MHC regions is complex, many studies do not probe MHC associations more closely. Thus the functionality of the variants remains uncertain although identification of a specific risk HLA allele could help to understand a disease and its causality further. At present, for transplantation and many diagnostics proposes, traditional HLA typing still remains as the most used method [14]. De Bakker [22] presented a tag-SNP based HLA-typing method as an alternative solution for traditional HLA typing. It has promoted the screening of certain disease related HLA markers [21], [23]. However, not only HLA allele and haplotype frequencies [8], [24] but also the SNP content of haplotypes differ in ethnically diverse populations complicating the imputation process. It is clear that more information on the precise content of HLA haplotypes is needed for transplantation, disease association, anthropological, and epidemiological studies [8], [25]. In this study we focused on studying HLA-DRB1 haplotype structures in a Finnish population. The allelic diversities of other traditional HLA-genes and selected group of MHC class III genes were included and extended haplotypes inferred. We aimed to interpret the haplotype diversities in relation to HLA-DRB1 locus because of its higher amount of polymorphism when compared to the other MHC Class II genes (IMGT database, [26]) and its well-known associations with several complex diseases. The selected MHC genes TNF, LTA, C4, BTNL2, and HLA-DRA all have been associated with several autoimmune or infectious diseases, as well. We hypothesized that the use of longer MHC blocks, rather than single alleles, could be advantageous when studying the polymorphisms in the MHC region.

Materials and Methods

Study Population

The study consists of one hundred and fifty (150) healthy Finnish individuals who were randomly selected. Description of the sample set and the DNA extraction has previously been published by Seppänen et al. [17]. Briefly, samples from 49 males and 101 females with mean age of 33.7 years (range 18–60), were used. The local ethical committee (The Ethics Committee: Department of Medicine, Hospital District of Helsinki and Uusimaa) approved the study protocol (Dnro 6/E5/2001,25.1.2001). All the contributors provided the written informed consent.

Genotyping of HLA Genes

The HLA genotyping and/or analysis was carried out in an EFI (European Federation for Immunogenetics) accredited HLA Laboratory. The genotyping of HLA-A, -B and –DPB1 genes were performed using sequence specific primers (SSP: Olerup SSP AB, Stockholm, Sweden). The HLA-DRB1 alleles were detected using sequence based typing (SBT; InvitrogenTM, Life Technologies, Carlsbad, CA, USA). HLA-DQB1 alleles were detected with a panel of lanthanide-labeled oligonucleotide probes [27]. The reactions were performed according to manufacturers’ instructions giving at least four-digit resolution (for example, HLA-DRB1*01∶01) for HLA-DRB1 and –DPB1, and at least two-digit resolution (for example, HLA-A*02) for HLA-A, -B and -DQB1. For resolving ambiguities, high resolution SSPs or SBT HARPs (heterozygous ambiguity resolving primers) were used. The HLA alleles were assessed using HLA nomenclature release 3.5.0 (IMGT/HLA database) and carefully interpreted by two persons.

SNP Selection and Genotyping

We aimed to genotyped LTA, TNF, BTNL2 and HLA-DRA genes with SNPs. All the selected genes were covered entirely including 5′ and 3′- flanking regions. SNPs were chosen from the HapMap database [28] and/or from the public dbSNP database (http://www.ncbi.nlm.nih.gov/projects/SNP). Information about the validation status, tagging quality, minor allele frequency (>0.01) and gene structure were used for selecting the SNPs. Altogether seventy-four SNPs were chosen from the two gene regions, TNFLTA, (here referred as TNF block) and BTNL2HLA-DRA (here referred as BTNL2 block). SNP genotyping was performed using the Sequenom MassArray iPLEX system (Sequenom, San Diego, CA, USA). In the iPLEX assay, the SNP alleles are separated based on the differences of the single base extension (SBE) products. Manufacture’s instructions were used to design the assays (AssayDesign software) and to perform the multiplex PCR and the iPLEX reaction using 9–10 ng of DNA as a template. The complement C4A and C4B gene copy numbers and a 2-bp silencing insertion at exon 29 (CT) in C4A gene were analysed by quantitative genomic realtime-PCR Rotor-Gene 6000 (Corbett Research, Sydney, Australia) according to Paakkanen et al. [29]. The C4 allotypes were determined by immunofixation [17]. One subject was excluded from the analysis.

Statistical Analysis

Allele, phenotype and haplotype frequencies were calculated by direct counting. To detect significant departure from Hardy-Weinberg equilibrium (p<0.001), Haploview (SNPs) [30] or ARLEQUIN 3.11 (HLA genes) [31], were used. SNP haplotypes were constructed using Haploview [30]. Multi-locus haplotype frequencies and recombination rates were estimated from allele data using the Bayesian method with PHASE v. 2.1.1 [32]. The haplotypes were constructed using all the selected markers simultaneously. Taken into account the small sample size and to exclude unreliable haplotypes, only haplotypes greater than 1% (observed more than 3 times) were used in the analysis [24], [33]–[35]. The LD measures (D′ and r2) were determined using either the Haploview software (biallelic markers) [30] or the ARLEQUIN 3.11 software (multiallelic markers) [31]. At this step, only one SNP from a LD block was chosen for further analysis. It has been shown that for multiallelic loci, D′ estimates the strength of LD (>0.80 strong LD, −0.5 moderate LD, −0 weak LD) better than r2 [36]. The R-package ‘ape’ was used to perform a Neighbor-joining algorithm according to the method of Saitou and Nei [37]. The HapMap data was used for illustrating the recombination hotspots in the MHC region [28]. The proxy SNPs (r2>0.9) for genotyped SNPs were detected using software SNAP [38].

Results

Genotyping of HLA-alleles

The HLA alleles and their frequencies are given in parallel with the phenotype frequency in Table 1. HLA allele distributions followed the Hardy–Weinberg equation. A total of 91 HLA alleles (15, 21, 26, 11 and 18 alleles in HLA-A, -B, -DRB1, -DQB1 and -DPB1, respectively) were noted. In the Finnish population, two HLA-A alleles accounted for >66% and five HLA-DPB1 alleles for >91% of the variation at the loci. Allele frequencies of other HLA loci were more equally distributed. These observations are consistent with the previous Finnish studies [39], [40].
Table 1

The Finnish HLA allele and phenotype frequencies.

HLA-AAllelePhenotypeHLA-DRB1AllelePhenotypeHLA-DQB1AllelePhenotypeHLA-DPB1AllelePhenotype
AllelenfnfAllelenfnfAllelenfnfAllelenfnf
*02 1260.4201010.673 *15∶01 440.147420.280 *06∶02 440.147420.280 *04∶01 1030.343840.553
*03 710.237610.407 *01∶01 440.147380.253 *05∶01 480.160400.267 *04∶02 610.203550.367
*01 230.077230.153 *08∶01 370.123330.220 *02# 410.137390.260 *02∶01 480.160430.287
*24 200.067200.133 *03∶01 290.097280.187 *03∶01 450.150390.260 *03∶01 430.143390.260
*68 140.047140.093 *13∶01 270.090260.173 *04# 380.127340.227 *01∶01 170.057170.113
*11 120.040120.080 *04∶01 210.070210.140 *03∶02 270.090270.180 *05∶01 100.033100.067
*32 100.033100.067 *07∶01 160.053150.100 *06∶03 270.090260.173 *19∶01 30.01030.020
*31 70.02370.047 *13∶02 130.043120.080 *03∶03 130.043120.080 *13∶01 20.00720.013
*29 50.01750.033 *11∶01 120.040110.073 *06∶04 130.043120.080 *14∶01 20.00720.013
*26 40.01340.027 *12∶01 110.037110.073 *05∶02 20.00720.013 *17∶01 20.00720.013
*30 30.01030.020 *09∶01 100.03390.060 *05∶03 20.00720.013 *20∶01 20.00720.013
*66 20.00720.013 *04∶04 60.02060.040 11 alleles *06∶01 10.00310.007
*23 10.00310.007 *04∶08 60.02060.040 *104∶01 10.00310.007
*25 10.00310.007 *04∶03 40.01340.027 *15∶01 10.00310.007
*33 10.00310.007 *14∶01:01G 30.01030.020 *16∶01 10.00310.007
15 alleles *04∶02 20.00720.013 *24∶01 10.00310.007
*08∶03 20.00720.013 *81∶01 10.00310.007
HLA-B Allele Phenotype *10∶01 20.00720.013 *85∶01 10.00310.007
Allele n f n f *11∶04 20.00720.013 18 alleles
*15 430.143410.273 *14∶02 20.00720.013
*07 400.133400.267 *16∶01 20.00720.013
*35 380.127350.233 *01∶03 10.00310.007
*08 300.100290.193 *04∶07 10.00310.007
*27 270.090260.173 *08∶02 10.00310.007
*44 260.087260.173 *08∶04 10.00310.007
*40 260.087240.160 *11∶03 10.00310.007
*51 210.070200.133 26 alleles
*39 110.037110.073
*13 70.02370.047
*56 60.02060.040
*18 50.01750.033
*47 40.01340.027
*57 40.01340.027
*41 30.01030.020
*38 20.00720.013
*53 20.00720.013
*55 20.00720.013
*14 10.00310.007
*45 10.00310.007
*50 10.00310.007
21 alleles

n = number of alleles/phenotypes.

Number of subjects = 150; Number of alleles = 300.

The phenotype or carrier positivity contains one or two copies of the variant.

HLA-DRB1*14∶01:01G stands for DRB1*14∶01:01 and DRB1*14∶54.

# 2-digit resolution genotyping for HLA-DQB1*02 and *04.

n = number of alleles/phenotypes. Number of subjects = 150; Number of alleles = 300. The phenotype or carrier positivity contains one or two copies of the variant. HLA-DRB1*14∶01:01G stands for DRB1*14∶01:01 and DRB1*14∶54. # 2-digit resolution genotyping for HLA-DQB1*02 and *04.

HLA Haplotypes

Three-locus haplotypes between HLA-DRB1 and MHC Class I (HLA-A and –B; Table 2) and MHC Class II (HLA-DQB1 and -DPB1; Table 3) were constructed. Haplotypes having frequencies higher than 1.0% are presented in Table 2 and 3 and compared with different populations that had reported the A∼B∼DRB1 and DQB1∼-DPB1DRB1 haplotypes in The Allele Frequency Net Database [41]–[46]. The three most common A∼B∼DRB1 haplotypes in the Finnish population were A*03∼B*35∼DRB1*01∶01 (7.1%), A*01∼B*08∼DRB1*03∶01 (4.0%) and A*02∼B*07∼DRB1*15∶01 (3.5%). The second most common HLA-DRB1*01∶01 haplotype had HLA-B*07 instead of HLA-B*35. The two-locus haplotypes (DRB1DPB1, DQB1DPB1 and B∼DRB1) are presented in Table S1.The linkage between HLA-DRB1 and HLA-DQB1 was stronger than between HLA-DRB1 and HLA-DPB1 (Table S2).
Table 2

The Finnish A∼B∼DRB1 haplotype frequencies (>1%) and comparison with other populations [41]–[46].

Haplotype A∼B∼DRB1Finland (n = 150)Sami (n = 130) [44] Russia (n = 207) [46] Ireland (n = 1000) [41][43]
HLA-AHLA-BHLA-DRB1ffff
*01*08*03∶010.0400.0390.0500.09
*02*07*15∶010.0350.0380.03
*02*27*08∶010.0270.014
*02*15*13∶010.0230.0310.012
*02*15*04∶010.0190.023
*02*15*08∶010.019
*02*13*07∶010.0160.021
*02*15*15∶010.013
*02*44*04∶010.0120.0190.04
*02*40*13∶020.011
*03*35*01∶010.0710.0310.033
*03*08*03∶010.021
*03*15*08∶010.0170.015
*03*07*01∶010.0140.019
*03*07*13∶010.012
*03*07*15∶010.0110.0420.0470.05
*11*07*15∶010.015

f = frequency.

Table 3

The Finnish DRB1∼DQB1∼DPB1 haplotype frequencies (>1%) and comparison with other populations [41]–[46].

Haplotype DRB1∼DQB1∼DPB1 Finland (n = 150)Ireland (n = 250) [41][43] Greece (n = 246) [45]
HLA-DRB1HLA-DQB1HLA-DPB1fff
*01∶01*05∶01*04∶020.0590.024
*01∶01*05∶01*04∶010.0430.0280.035
*01∶01*05∶01*02∶010.0400.011
*03∶01*02*01∶010.0490.0210.014
*03∶01*02*04∶010.0200.0980.015
*04∶01*03∶02*04∶010.0260.013
*04∶01*03∶02*04∶020.016
*04∶04*03∶02*02∶010.013
*04∶08*03∶01*02∶010.016
*07∶01*02*04∶010.0230.0170.014
*08∶01*04*03∶010.071
*08∶01*04*04∶010.033
*08∶01*04*04∶020.014
*09∶01*03∶03*04∶020.022
*11∶01*03∶01*04∶020.013
*11∶01*03∶01*04∶010.0120.036
*11∶01*03∶01*02∶010.0120.023
*12∶01*03∶01*04∶010.025
*13∶01*06∶03*04∶010.0270.0140.016
*13∶01*06∶03*02∶010.0230.023
*13∶01*06∶03*05∶010.012
*13∶02*06∶04*03∶010.030
*13∶02*06∶04*04∶020.013
*15∶01*06∶02*04∶010.0900.1490.015
*15∶01*06∶02*04∶020.020
*15∶01*06∶02*05∶010.014

f = frequency.

Total number of haplotypes in the population sample (Haplotypes/2n) shown in frequencies (f).

f = frequency. f = frequency. Total number of haplotypes in the population sample (Haplotypes/2n) shown in frequencies (f).

SNP Analysis

Altogether 74 SNPs were successfully genotyped. The average success rate was 99% and no discrepancies were observed. Fourteen SNPs were excluded due to minor allele frequency (<0.01) or HWE (<0.001) and five SNPs were excluded as they were in total LD (r2 = 1) with another SNP. There can be found several proxy SNPs (SNPs in strong LD) for our SNPs (examples shown in Table S3) [38]. Many of the proxy SNPs have been previously associated with diseases (http://www.snp-nexus.org/). A summary of the accepted SNPs (n = 55) is given in Table S4. The allele frequencies of our genotyped SNPs did not differ significantly from the HapMap (CEU population, European decent) [28], except the twelve SNPs in BTNL2 or HLA-DRA genes (Table S4). The LD structure of the SNPs (TNF or BTNL2 block) is presented in the Figure S1. As expected, due to the SNP selection criteria the pairwise LD between SNPs was always r2<1. Previously using HapMap data [28], high recombination rates have been observed between HLA-A and –B loci, upstream and downstream of TNF, in the BTNL2 region and between HLA-DQB1 and –DPB1 loci (Figure 1A). Here, the recombination rate estimation was performed using the HLA alleles (HLA-A, -B, -DRB1, -DQB1, -DPB1), C4 gene copy numbers and SNPs (n = 55). Thirty-one SNPs were common in both HapMap and in our study. The location of the highest recombination rate in the Finnish sample was observed in the BTNL2 promoter region corresponding to HapMap data (Figures 1 B and C).
Figure 1

Recombination rates in MHC region.

1A: The recombination rates for (HapMap [28]) in MHC region. 1B: Recombination rates in HapMap [28] for the 31 SNPs that are common in the current study and HapMap. 1C: Recombination rates for the 31 SNPs in the study (that are common in the study and HapMap [28])). The highest recombination rate was observed in the BTNL2 promoter region.

Recombination rates in MHC region.

1A: The recombination rates for (HapMap [28]) in MHC region. 1B: Recombination rates in HapMap [28] for the 31 SNPs that are common in the current study and HapMap. 1C: Recombination rates for the 31 SNPs in the study (that are common in the study and HapMap [28])). The highest recombination rate was observed in the BTNL2 promoter region.

Haplotypes of TNF, BTNL2 and C4

SNP haplotypes were constructed. The rare haplotypes (observed less than 3 times i.e. frequency <1%) were excluded at this point. Nine 13-SNP haplotypes of TNF block (Table 4) and twelve 42-SNP haplotype of BTNL2 block (Table 6) were observed with frequencies ranging from 3% to 21% and 1.3% to 15%, respectively. In both regions, there seemed to be four to five common haplotypes of almost equal frequency in addition to the less common haplotypes. C4A and C4B gene copy numbers were constructed into haplotypes and the results are shown in Table 8. As opposed to TNF and BTNL2 haplotypes, there was one haplotype, which covered more than half of the observed haplotypes (C4_1 haplotype, 59%). The frequency of the next common haplotype was less than 20%, showing a sharp decrease.
Table 4

TNF blocks (TNF_1– TNF_9) and their frequencies (>1.0%).HLA-DRB1 Haplotypes with TNF, BTNL2 and C4 Blocks.

SNP TNF_1 TNF_2 TNF_3 TNF_4 TNF_5 TNF_6 TNF_7 TNF_8 TNF_9
rs2009658CCCCGCCGC
rs2239704ACCCCAACC
rs2229094TTTCCTTCT
rs2229092AAAAAAACA
rs1041981CAACCCCCA
rs1799964TTTTCTTCT
rs1799724CCCCCCTCC
rs1800629GGAGGGGGG
rs361525GGGGGGGGG
rs3093664AAAAAAAAG
rs769178GGGGGGTGG
rs3093553TTTTTTTTT
rs2256965AGGAGGAGG
F0.2100.1670.1530.1300.0900.0800.0570.0470.0300.96#
Table 6

BTNL2 blocks (BTNL2_1– BTNL2_12) and their frequencies (>1.0%).

SNP BTNL2_1 BTNL2_2 BTNL2_3 BTNL2_4 BTNL2_5 BTNL2_6 BTNL2_7 BTNL2_8 BTNL2_9 BTNL2_10 BTNL2_11 BTNL2_12
rs28362678TCCCCTCCCCCC
rs2076530CTCTTCTTTTTT
rs9268480CCTCCCCCCCCC
rs2076529CTCTTCTTTTTT
rs3793127CCTCCCCCCCCC
rs28362683GGGAGAGGGGGG
rs3763311CCTCCCCCCCCC
rs3763312GGAGGGGGGGGG
rs3763313CAACAAAAAAAC
rs3763317TCTTCCCCCCCT
rs5007259CTCCTTTTTTTC
rs17208888GGGGGGGAGGGG
rs9405098GGGGGGGGGGGG
rs9268528AAGAAAAGAGGA
rs9268541TTTTTTTTTCCT
rs2395166CCTTTTTTTTTC
rs3135365TGTTTTTTTTTT
rs3135363TTTTCCCCCTTT
rs3135351GGGGTTTGTGGG
rs3135344GGAAAAAAAAAG
rs3129843AAAAGAAAAAAA
rs3135341TGTTTTTTTTTG
rs2027856CCCTCCCCCCCC
rs3129871CACACACCCAAA
rs9405035GGGGGGGGGGGG
rs9268644AACCAAACAACC
rs3129877AGGGAAAGAGGG
rs3135392TGGTTTTGTTTT
rs3129882AGAGAAAGAGGG
rs8084CACAAAAAAAAA
rs2239804AAGAAAAGAAAA
rs11544315GGGGGGGGGGGA
rs3177928AGGGGGGGGGGA
rs3135388CTCCCCCCCCCC
rs2213585TCTCCCCTCCCT
rs6937545CACCAAACACCC
rs9268833CCTCCCCCCCCC
rs6919855TCTTTTTCTTTT
rs7766843CCCTTTTCTTTC
rs2395185GGTGGGGGGGGT
rs9268979TTCCCCCTCCCC
rs7748472AAAAAAGAAAAA
f0.1500.1430.1370.1030.0970.0530.0400.0400.0200.0200.0170.013
Table 8

C4 blocks (C4_1– C4_6) and their frequencies (>1.0%).

C4 gene C4_1 C4_2 C4_3 C4_4 C4_5 C4_6
C4A121Q0InsCT2
C4B1Q0Q0111
f0.590.170.090.080.030.03
f = frequency. Rare haplotypes were excluded from the analysis. #Rare haplotypes were excluded. f = frequency. The significant pair-wise LD [(D′>0.80, r2>0.35, P value (P) <0.05)] between HLA-DRB1 alleles and other markers is presented in Table 10. The HLA-DRB1 allele distribution (%) presented in Table S2 shows how HLA-DRB1 alleles are clustered with MHC class I, II and III alleles and MHC blocks.
Table 10

HLA-DRB1 alleles in strong LD# with TNF, C4 and BTNL2 blocks and HLA-DQB1 and -DPB1 alleles. LD is measured with D′/r2.

HLA-DRB1*01∶01HLA-DRB1*03∶01HLA-DRB1*04∶01HLA-DRB1*08∶01HLA-DRB1*09∶01HLA-DRB1*11∶01HLA-DRB1*12∶01HLA-DRB1*13∶01HLA-DRB1*13∶02HLA-DRB1*15∶01
TNF_3 0.80/0.37
TNF_9 0.88/0.53
C4_4 0.73/0.46
C4_5 0.90/0.61
BTNL2-1 1.0/0.97
BTNL2-2 1.0/0.97
BTNL2-3 0.95/0.43
BTNL2-4 1.0/0.82
BTNL2-5 1.0/1.0
BTNL2-6 1.0/0.57
BTNL2-7 1.0/0.92
BTNL2-8 0.91/0.84
BTNL2-10 1.0/0.54
BTNL2-11 1.0/0.45
DQB1*02 1.0/0.68
DQB1*03∶03 1.0/0.76
DQB1*04 1.0/0.97
DQB1*05 1.0/0.90
DQB1*06∶02 1.0/1.0
DQB1*06∶03 1.0/1.0
DQB1*06∶04 1.0/1.0
DPB1*01∶01 0.87/0.43

Here the strong LD (D′>0.80) presented if r2>0.35 and P value <0.05.

Rare haplotypes were excluded. Q0 = null allele. InsCT = insertion in C4A gene. f = frequency. Here the strong LD (D′>0.80) presented if r2>0.35 and P value <0.05. The linkage between HLA-DRB1 and TNF block (Table 5) indicates that a given HLA-DRB1 allele was combined with different TNF blocks. Two cases, HLA-DRB1*03∶01 and TNF_3 and HLA-DRB1*13∶02 and TNF_9 showed strong LD (D′ = 0.80; D′ = 0.88, respectively, Table 10).
Table 5

Two-locus haplotypes of HLA-DRB1 with TNF block (>1.0%)#.

DRB1 alleleTNF blockf
*01∶01TNF_40.103
*03∶01TNF_30.076
*04∶01TNF_20.022
*04∶01TNF_50.027
*04∶08TNF_10.016
*07∶01TNF_10.040
*08∶01TNF_10.027
*08∶01TNF_20.048
*08∶01TNF_60.030
*09∶01TNF_60.025
*11∶01TNF_30.014
*13∶01TNF_10.017
*13∶01TNF_20.035
*13∶02TNF_90.026
*15∶01TNF_10.075
*15∶01TNF_40.016
*15∶01TNF_70.014

f = frequency.

Rare haplotypes were excluded from the analysis.

Contrary to TNF block, the linkage between HLA-DRB1 and BTNL2 block (Table 7) showed that mostly HLA-DRB1 alleles (*01∶01, *03∶01, *04∶01, *08∶01, *11∶01, *13∶02 and *15∶01) were strongly linked with BTNL2 blocks (Table 10, Table S2). However, there were some exceptions such as HLA-DRB1 *12∶01 and *13∶01 (Table S2).
Table 7

Two-locus haplotypes of HLA-DRB1 with BTNL2 block (>1.0%)#.

HLA-DRB1BTNL2 blockf
*01∶01BTNL2_10.147
*03∶01BTNL2_50.097
*04∶01BTNL2_30.064
*04∶03BTNL2_30.013
*04∶08BTNL2_30.020
*07∶01BTNL2_30.023
*07∶01BTNL2_120.013
*08∶01BTNL2_40.103
*11∶01BTNL2_80.037
*12∶01BTNL2_100.020
*12∶01BTNL2_110.016
*13∶01BTNL2_60.053
*13∶02BTNL2_70.040
*15∶01BTNL2_20.143

#Rare haplotypes were excluded.

f = frequency.

The DRB1∼C4 haplotypes behaved like the DRB1TNF haplotypes showing association with different HLA-DRB1 alleles (Table 9). C4B null allele (haplotypes C4_2 and C4_3) were found with DRB1*01∶01, *04∶01, *08∶01 and *13∶01. HLA-DRB1*03∶01 had C4AQ0 (C4_4; D′ = 0.73; Table 10), but also a haplotype without a C4A null allele (C4_1) was observed. HLA-DRB1*13∶02 had typically an insertion in the C4A gene (C4_5; D´ = 0.90 Table 10).
Table 9

Two-locus haplotypes of HLA-DRB1 with C4 block (>1.0%)#.

HLA-DRB1C4 blockf
*01∶0120.083
*01∶0110.032
*01∶0130.029
*03∶0140.068
*03∶0110.022
*04∶0110.038
*04∶0120.026
*04∶0410.019
*04∶0810.019
*07∶0110.046
*08∶0110.082
*08∶0130.041
*09∶0110.029
*11∶0110.036
*12∶0110.024
*13∶0110.045
*13∶0120.032
*13∶0250.030
*15∶0110.120
*15∶0160.016

Rare haplotypes were excluded.

Q0 = null allele.

InsCT = insertion in C4A gene.

f = frequency.

To summarize the LD and haplotype analysis and to the polymorphism of the MHC region and its relation to HLA-DRB1, an additional four-locus haplotype (HLA-DRB1, TNF block, BTNL2 block and C4; Table 11) and a six-locus haplotype (HLA-DRB1, HLA-B, TNF block, BTNL2 block, C4A and C4B allotypes; Table S5) were performed. The results showed that the extended HLA-DRB1 haplotypes were broken down when HLA-B, C4 allotypes and TNF block were taken into account.
Table 11

The extended HLA-DRB1 haplotypes with TNF, C4 and BTNL2 blocks (>1%).

HLA-DRB1TNF blockC4 blockBTNL2 blockf
*01∶01TNF_42BTNL2_10.059
*01∶01TNF_43BTNL2_10.026
*01∶01TNF_41BTNL2_10.017
*01∶01TNF_82BTNL2_10.012
*03∶01TNF_34BTNL2_50.065
*03∶01TNF_31BTNL2_50.011
*04∶01TNF_51BTNL2_30.024
*04∶01TNF_22BTNL2_30.022
*04∶08TNF_11BTNL2_30.013
*07∶01TNF_11BTNL2_30.023
*08∶01TNF_61BTNL2_40.028
*08∶01TNF_21BTNL2_40.022
*08∶01TNF_23BTNL2_40.022
*08∶01TNF_11BTNL2_40.016
*11∶01TNF_51BTNL2_80.011
*13∶01TNF_22BTNL2_60.014
*13∶01TNF_11BTNL2_60.012
*13∶02TNF_95BTNL2_70.027
*15∶01TNF_11BTNL2_20.059
*15∶01TNF_71BTNL2_20.016
*15∶01TNF_51BTNL2_20.012

f = haplotype frequency.

f = haplotype frequency. To further analyze the structure of TNF and BTNL2 blocks we created phylogenetic trees of the genetic distance using sequence similarities (Figure S2). Three branches of TNF were observed, from which two branches, TNF_3 and _9 formed haplotypes with HLA-DRB1*03∶01 and HLA-DRB1*13∶02, respectively, which we previously showed with high LD. The third branch divided into three groups, first with TNF_2, second with TNF_4, _5 and _8 and third with TNF_1, _6, and _7. C4BQ0 related TNF haplotypes, TNF_2 and TNF_4, belonged to the same main branch. The phylogenetic tree of BTNL2 blocks showed that BTNL2_5 and _7 (in linkage with HLA-DRB1*03∶01 and *13∶02, respectively) and BTNL2_6 (in linkage with HLA-*13∶01) had different BTNL2 block structure than the rest. HLA-DRB1*12∶01 alleles had either BTNL2 blocks BTNL2_10 or _11. HLA-DRB1*11∶01 and *15∶01 had structurally similar BTNL2 blocks, BTNL2_8 and _2, respectively. HLA-DRB1*01∶01, HLA-DRB1*04 alleles and HLA-DRB1*07∶01 and *08∶01 formed a wide branch with similar BTNL2 blocks _3, _4, and _8, _10 and _11 (in linkage with HLA-DRB1*01∶01, *04∶01-08, *07∶01, *08∶01, *11∶01, *12∶01, *15∶01) and BTNL2_9 (in linkage with HLA-DRB1*13∶01).

Discussion

To our knowledge, no extensive study exist that combines information from HLA-A,-B,-DRB1,-DQB1 and –DPB1 alleles with TNF, BTNL2 and complement C4 blocks. In this study, we addressed (i) the diversity of extended HLA-DRB1 haplotypes covering the MHC class I, II and III regions, (ii) the shared MHC or SNP markers in extended HLA-DRB1 haplotypes, and (iii) the challenge in detecting causal variants in the HLA data. The most common Finnish A∼B∼DR –haplotype was A*03∼B*35∼DRB1*01∶01. Also other HLA-DRB1*01∶01 haplotypes with variable levels of LD were observed. According to The Allele Frequency Net Database [41], only in a few populations e.g. the Swedish Sami [44] and Russia [46] the A*03∼B*35∼DRB1*01∶01 was found with the frequency >2%. In Finland, the second most common haplotype was A*01∼B*08∼DRB1*03∶01 in high LD. This haplotype is not so common in Finns as in other Europeans and has been previously referred as the ancestral haplotype AH 8.1 or autoimmune haplotype [47], [48]. Another conserved haplotype with strong LD, but rare in the Finnish population, was HLA-DRB1*13∶02 reaching from TNF to HLA-DQB1. The enrichment or loss of certain HLA haplotypes (Table 2) reflects the characteristics of the Finnish population structure, which has evolved through multiple genetic bottlenecks [6], [7]. Detailed information of population substructures was presented by the 16th International HLA and Immunogenic Workshop IHIW project (“Analysis of HLA Population Data” [8]. Most importantly, the multidimensional scaling (MDS) of HLA-DRB1 revealed that the Finns and the Sami are closer to the North-East Asians than to other European populations [8]. In general, the HLA variation in Europe follows the North to Southeast axis corresponding to the previous principal component analysis (PCA) based results that utilized genome-wide SNP data [7]. Interestingly in Finland, the prevalence of certain HLA alleles have shown regional differences [40] e.g. HLA-B*35 being highest in the Eastern parts of Finland. Also GWASs have shown similar trends [7]. The HLA haplotype deviations in relation to the Finnish population substructure warrant replication studies. The population stratification is important e.g. for control selection [34]. We found that the majority of HLA-DRB1 alleles were inherited as extended blocks from BTNL2 to HLA-DQB1. In spite of the observed recombination rate in the BTNL2 promoter region, most DRB1BTNL2 blocks appeared to be conserved. Including HLA-B, TNF, C4 and HLA-DPB1 the number of extended HLA-DRB1 haplotypes strongly increased. The positions of the MHC recombination sites vary between populations [49] explaining partly the non-replication of disease associations between populations (e.g. [19], [50], [51]). Our multi-locus haplotype analysis shows that the extended HLA-DRB1 haplotypes can be grouped according to functional similarities. Especially interesting for disease association studies are the HLA-DRB1 haplotypes not common in the general population. Furthermore, TNF and LTA genes are cytokines involved in the activation of inflammatory processes; hence the HLA-DRB1 haplotypes with gene expression related SNPs (rs2239704, rs1041981 and rs1800629) [2], [52] are plausible candidates for inflammatory diseases. For example, a rare HLA-DRB1*15∶01∼TNF_4 haplotype has a different nucleotide in rs2239704 compared with more frequent HLA-DRB1*15∶01∼TNF haplotypes. Interestingly, the same TNF_4 is also found with HLA-DRB1*01∶01 shown to be associated with inflammatory reactions [53]. The extended HLA-DRB1 haplotypes of HLA-DRB1*03∶01, *08∶01, *11∶01, *12∶01, *13∶02 and *15∶01 can be grouped according to exonic missense SNP (rs2076530) causing truncated protein of the T-cell inhibitor BTNL2 [19]. Of the complement C4 proteins, the C4A null alleles were primarily found with two conserved haplotypes, HLA-DRB1*03∶01 (AH 8.1) and HLA-DRB1*13∶02. C4B null alleles were characteristically inherited with the most common Finnish haplotype A*03∼B*35∼DRB1*01∶01 or with HLA-DRB1*04∶01, *08∶01 and *13∶01. C4 null alleles have shown to be related to many diseases [18]. Taken together, the extended HLA-DRB1 haplotype analysis can reveal predisposing/protective associations between markers and disease loci not detectable with a single MHC allele [34]. Overall, analysis of immunogenomic data is challenging. Resolving HLA allele ambiguity, phasing and LD calculation warrant particular expertise, and the traditional software tools (e.g. Haploview and PLINK) are not suitable for multiple loci polymorphic data like HLA [34], [54]. Haplotypes rather than single markers were used to decrease phasing errors [55]. Due to the strong LD, multiple SNPs may have corresponding statistical proof of association making the search for possible causal variants exceptionally difficult. To clarify the complexity, the known tag-SNP for HLA-DRB1*15∶01 (rs3135388) (see Figure S3), was shown to have at least 20 proxy SNPs (r2>0.9) with variable function and in different gene regions [14], [22], [38]. The SNPs’ allele frequency might be also population specific (rs2213585; Table S4), and thus the tag-SNPing (i.e. imputation) should not be used unless the ethnic background is known [56]. Here in this material, except the tag-SNP for HLA-DRB1*15∶01 (rs3135388) [14], [22], we did not detect any single tag-SNP for a specific HLA-DRB1 allele. Indeed, large population specific cohorts and dense SNP genotyping is needed for detecting HLA tag-SNPs. We acknowledge that the multiple ambiguous alleles, limited sample size, and rare HLA alleles can influence the haplotype phasing and LD leading to false positive results [34]. In case of small sample size, the study of the rare MHC haplotypes is challenging. Thus, we presented only frequent MHC haplotypes (>1%) and interpreted the LD between markers carefully [34], [35], [54], [55], [57]. The HLA allele distributions were consistent with the previously published Finnish registry studies [39], [40] suggesting that the Finnish HLA profile can be estimated with a sample set containing 150 individuals. One of the limitations of this study was the lack of genome-wide SNP data. Hence, we were not able to use HLA*IMP [58] for allele imputation or analyse the HLA tagging SNPs in the Finnish populations [22]. Because of the differences between populations, population specific validation is highly recommended before using either the HLA*IMP or the HLA tagging SNPs [7], [14], [23]. Taken together, we stress the importance of understanding the population specific MHC haplotypes and the analysis of immunogenetic data. The study of extended HLA-DRB1 haplotypes indicates the functionality of the implicated genes and provides hypotheses for further assessment of HLA-DRB1. The results presented here assist for disease association studies focusing in chronic inflammatory, autoimmune and infectious diseases. The LD (r and blocks using SNPs. (TIF) Click here for additional data file. Phylogenetic trees based on the genetic distance of and blocks with bootstrap values. (TIF) Click here for additional data file. A known tag-SNP for is in strong LD with many other SNPs in the MHC Class III region. The known proxies are taken from the HapMap [28] and using the software SNAP [38]. (TIF) Click here for additional data file. The two-locus haplotypes with frequency >1%. (DOC) Click here for additional data file. The observed alleles (%) with and alleles and and blocks. (DOC) Click here for additional data file. In the database (HapMap or 1000Genomes) there can be found several proxy SNPs (r (DOC) Click here for additional data file. A summary of the accepted SNPs (n = 55). The Finnish allele frequencies were compared with HapMap project (CEU) data [28]. (DOC) Click here for additional data file. The haplotypes with alleles, TNF and BTNL2 blocks and C4 allotypes (>1%). (DOC) Click here for additional data file.
  58 in total

1.  The D' measure of overall gametic disequilibrium between pairs of multiallelic loci.

Authors:  C Zapata
Journal:  Evolution       Date:  2000-10       Impact factor: 3.694

2.  Recombination hotspots rather than population history dominate linkage disequilibrium in the MHC class II region.

Authors:  Liisa Kauppi; Antti Sajantila; Alec J Jeffreys
Journal:  Hum Mol Genet       Date:  2003-01-01       Impact factor: 6.150

3.  SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap.

Authors:  Andrew D Johnson; Robert E Handsaker; Sara L Pulit; Marcia M Nizzari; Christopher J O'Donnell; Paul I W de Bakker
Journal:  Bioinformatics       Date:  2008-10-30       Impact factor: 6.937

Review 4.  Interrogating the major histocompatibility complex with high-throughput genomics.

Authors:  Paul I W de Bakker; Soumya Raychaudhuri
Journal:  Hum Mol Genet       Date:  2012-09-12       Impact factor: 6.150

5.  Sarcoidosis is associated with a truncating splice site mutation in BTNL2.

Authors:  Ruta Valentonyte; Jochen Hampe; Klaus Huse; Philip Rosenstiel; Mario Albrecht; Annette Stenzel; Marion Nagy; Karoline I Gaede; Andre Franke; Robert Haesler; Andreas Koch; Thomas Lengauer; Dirk Seegert; Norbert Reiling; Stefan Ehlers; Eberhard Schwinger; Matthias Platzer; Michael Krawczak; Joachim Müller-Quernheim; Manfred Schürmann; Stefan Schreiber
Journal:  Nat Genet       Date:  2005-02-27       Impact factor: 38.330

6.  HLA class II associated risk and protection against multiple sclerosis-a Finnish family study.

Authors:  Mikko Laaksonen; Tomi Pastinen; Minna Sjöroos; Satu Kuokkanen; Juhani Ruutiainen; Marja Liisa Sumelahti; Helena Reijonen; Reijo Salonen; Juhani Wikström; Martin Panelius; Jukka Partanen; Pentti J Tienari; Jorma Ilonen
Journal:  J Neuroimmunol       Date:  2002-01       Impact factor: 3.478

7.  Analysis of BTNL2 genetic polymorphisms in British and Dutch patients with sarcoidosis.

Authors:  P Spagnolo; H Sato; J C Grutters; E A Renzoni; S E Marshall; H J T Ruven; A U Wells; A Tzouvelekis; C H M van Moorsel; J M M van den Bosch; R M du Bois; K I Welsh
Journal:  Tissue Antigens       Date:  2007-09

8.  High-resolution donor-recipient HLA matching contributes to the success of unrelated donor marrow transplantation.

Authors:  Stephanie J Lee; John Klein; Michael Haagenson; Lee Ann Baxter-Lowe; Dennis L Confer; Mary Eapen; Marcelo Fernandez-Vina; Neal Flomenberg; Mary Horowitz; Carolyn K Hurley; Harriet Noreen; Machteld Oudshoorn; Effie Petersdorf; Michelle Setterholm; Stephen Spellman; Daniel Weisdorf; Thomas M Williams; Claudio Anasetti
Journal:  Blood       Date:  2007-09-04       Impact factor: 22.113

9.  Analysis of the adequate size of a cord blood bank and comparison of HLA haplotype distributions between four populations.

Authors:  Katri Haimila; Antti Penttilä; Anne Arvola; Marja-Kaisa Auvinen; Matti Korhonen
Journal:  Hum Immunol       Date:  2012-11-05       Impact factor: 2.850

10.  Allele frequency net: a database and online repository for immune gene frequencies in worldwide populations.

Authors:  Faviel F Gonzalez-Galarza; Stephen Christmas; Derek Middleton; Andrew R Jones
Journal:  Nucleic Acids Res       Date:  2010-11-09       Impact factor: 16.971

View more
  4 in total

1.  HLA-DPB1 and HLA class I confer risk of and protection from narcolepsy.

Authors:  Hanna M Ollila; Jean-Marie Ravel; Fang Han; Juliette Faraco; Ling Lin; Xiuwen Zheng; Giuseppe Plazzi; Yves Dauvilliers; Fabio Pizza; Seung-Chul Hong; Poul Jennum; Stine Knudsen; Birgitte R Kornum; Xiao Song Dong; Han Yan; Heeseung Hong; Cristin Coquillard; Joshua Mahlios; Otto Jolanki; Mali Einen; Isabelle Arnulf; Sophie Lavault; Birgit Högl; Birgit Frauscher; Catherine Crowe; Markku Partinen; Yu Shu Huang; Patrice Bourgin; Outi Vaarala; Alex Désautels; Jacques Montplaisir; Steven J Mack; Michael Mindrinos; Marcelo Fernandez-Vina; Emmanuel Mignot
Journal:  Am J Hum Genet       Date:  2015-01-08       Impact factor: 11.025

2.  Highly conserved extended haplotypes of the major histocompatibility complex and their relationship to multiple sclerosis susceptibility.

Authors:  Douglas S Goodin; Pouya Khankhanian; Pierre-Antoine Gourraud; Nicolas Vince
Journal:  PLoS One       Date:  2018-02-13       Impact factor: 3.240

3.  Complement activation and regulation in preeclamptic placenta.

Authors:  Anna Inkeri Lokki; Jenni Heikkinen-Eloranta; Hanna Jarva; Terhi Saisto; Marja-Liisa Lokki; Hannele Laivuori; Seppo Meri
Journal:  Front Immunol       Date:  2014-07-09       Impact factor: 7.561

Review 4.  The Immunogenetic Conundrum of Preeclampsia.

Authors:  A Inkeri Lokki; Jenni K Heikkinen-Eloranta; Hannele Laivuori
Journal:  Front Immunol       Date:  2018-11-13       Impact factor: 7.561

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.