Literature DB >> 35504942

Characterizing the diversity of MHC conserved extended haplotypes using families from the United Arab Emirates.

Halima Alnaqbi1,2, Guan K Tay1,3,4, Sarah El Hajj Chehadeh1,5, Habiba Alsafar6,7,8.   

Abstract

Aside from its anthropological relevance, the characterization of the allele frequencies of genes in the human Major Histocompatibility Complex (MHC) and the combination of these alleles that make up MHC conserved extended haplotypes (CEHs) is necessary for histocompatibility matching in transplantation as well as mapping disease association loci. The structure and content of the MHC region in Middle Eastern populations remain poorly characterized, posing challenges when establishing disease association studies in ethnic groups that inhabit the region and reducing the capacity to translate genetic research into clinical practice. This study was conceived to address a gap of knowledge, aiming to characterize CEHs in the United Arab Emirates (UAE) population through segregation analysis of high-resolution, pedigree-phased, MHC haplotypes derived from 41 families. Twenty per cent (20.5%) of the total haplotype pool derived from this study cohort were identified as putative CEHs in the UAE population. These consisted of CEHs that have been previously detected in other ethnic groups, including the South Asian CEH 8.2 [HLA- C*07:02-B*08:01-DRB1*03:01-DQA1*05:01-DQB1*02:01 (H.F. 0.094)] and the common East Asian CEH 58.1 [HLA- C*03:02-B*58:01-DRB1*03:01- DQA1*05:01-DQB1*02:01 (H.F. 0.024)]. Additionally, three novel CEHs were identified in the current cohort, including HLA- C*15:02-B*40:06-DRB1*16:02-DQB1*05:02 (H.F. 0.035), HLA- C*16:02-B*51:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 (H.F. 0.029), and HLA- C*03:02-B*58:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 (H.F. 0.024). Overall, the results indicate a substantial gene flow with neighbouring ethnic groups in the contemporary UAE population including South Asian, East Asian, African, and European populations. Importantly, alleles and haplotypes that have been previously associated with autoimmune diseases (e.g., Type 1 Diabetes) were also present. In this regard, this study emphasizes that an appreciation for ethnic differences can provide insights into subpopulation-specific disease-related polymorphisms, which has remained a difficult endeavour.
© 2022. The Author(s).

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35504942      PMCID: PMC9065074          DOI: 10.1038/s41598-022-11256-y

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.996


Introduction

Interest in the genes of the Major Histocompatibility Complex (MHC), and in particular the Human Leukocyte Antigens (HLA), on the short arm of chromosome 6 is primarily due to their involvement in determining the histocompatibility between organs or cells for transplantation purposes. There are over 300 genes in this short 3 to 5 megabase region, many of which are highly polymorphic, and many belong to multigene families (HLA, C4, TAP, Cyp21, LMP, and others)[1-3]. The HLA class I (HLA-A, HLA-C, HLA-B) and HLA class II (HLA-DR, HLA-DQ, HLA-DP) genes located within this region encode important proteins for cell surface antigen presentation and are key components of the immune system, hence their involvement processes that might lead to autoimmune diseases[2,4-6]. The high degree of molecular polymorphism observed in the classical HLA class I and class II genes reflect their direct involvement as antigen-presenting molecules against the variety of pathogens encountered throughout human evolution[7,8]. As a result of DNA insertion, deletion, and gene duplication events, distances between MHC loci, and hence the size of MHC haplotypes, may vary between different individuals[2]. While allelic and haplotype frequencies are relatively stable within an ethnic group and often definable by the Hardy–Weinberg equilibrium, these frequencies may vary substantially across populations[7]. Arguably, the first description of conserved DNA blocks within the MHC was the combination of complement genes Bf-C2-C4A-C4B, by Alper et al. in 1983[9]. Upon coining the term MHC ‘complotype’ for the sub-region of the entire region between HLA-B (the centromeric end of the class I region) and HLA-DR (the telomeric end of the class II region); the group has continued to describe long-range or extended conservation within MHC haplotypes[10-12]. Subsequently, several groups have supported this hypothesis by reporting that unrelated individuals from well-defined human populations share short blocks of conserved DNA sequence having precise HLA allele combinations of two or more neighbouring loci within the MHC region[13-16]. On the other hand, far longer fragments of common conserved MHC DNA sequences occur in people from the same population or ancestry[8,15]. Those long fragments consist of combinations of four or more HLA loci and were termed ‘Conserved Extended Haplotypes (CEHs)’ by Alper et al.[12,17], supratypes[18] and ‘Ancestral Haplotypes (AHs)’ by the Dawkins group in Australia[3,19]. Both CEH and AH are commonly used. For consistency, the term ‘CEH’ will be used throughout this report to refer to conserved, long stretches of DNA that span more than 2.7 megabases (Mb) and extend from HLA-C to HLA-DQB1[8]. The extent of the CEHs has since been expanded to include the region telomeric of HLA-A, at least as far as the microsatellite marker D6S105[20]. A haplotype must have a minimum required frequency of 0.005 in a certain population to be considered a common CEH. Nonetheless, the minimum CEH frequency cutoff should also be dependent on the sample size such that a study with a small sample size requires the use of a higher frequency than a study with larger sample size. According to Dawkins and Lloyd [21], MHC CEHs have been carried by different ancestral groups which have migrated out of Africa[21]. As a result of ethnic admixture, new MHC haplotypes have emerged and gradually become fixed in human populations and been perpetuated[14,19]. This has, in part, given rise to the unique population-specific frequencies which are now observed. Some CEHs are ethnic-specific and may have arisen from specific combinations of connected blocks or the ancestral sequences[21]. Subsequently, the HLA markers included inside a specific block would predictably be similar, or nearly identical, among unrelated people[8]. In this context, MHC CEHs have been used to characterize human diversity, and ethnic origin, or to identify and localize disease susceptibility genes, especially those related to autoimmune diseases[4-6,18] and for transplant matching[22,23]. The characterization of the genetic architecture of any population is useful prior to conducting genetic association studies[4]. MHC disease association studies have been dominated by analyses based on populations of European ancestries. However, this is gradually changing, allowing researchers to fill the knowledge gaps in disease risk predictions in some ethnic groups[24]. Nevertheless, despite the efforts of the Haplotype Map (HapMap) project and other international consortia[25], the genome structure, including that of the MHC, of populations from the Middle East remains poorly characterized, calling for the need to encourage disease association studies in the region as highlighted in our recent review[26]. The distribution of ancestry category of Genome-Wide Association Studies (GWAS) retrieved in 2019, showed that studies on Greater Middle Eastern/ Native American/ Oceanian altogether represent only 1.24% of the total studies[24,27]. The knowledge gap in the Arabian genome influences the ability of healthcare in the region to translate research outcomes from genetic studies into clinical practice, especially for critical clinical assays such as histocompatibility matching. In this regard, more effort should be put into studying the MHC region of the Arabian population, particularly for individuals of Arabian ancestry, to offer better healthcare and benefit from the new paradigm of healthcare referred to as personalized or precision medicine. Genotypically MHC-identical individuals can be found among siblings from a nuclear family[28], and haplotypes are definable by segregation studies of the MHC genes carried in families[4]. Henceforth, family segregation analyses remain the gold standard for defining the structure of MHC CEHs and are preferred over population data which is less reliable due to the reliance on bioinformatics algorithms that infer linkage between loci. There have been notable efforts in characterizing the MHC region and HLA genes in populations of the Arabian Peninsula from Bahrain[29], Kuwait[30], and Saudi Arabia[31,32] to overcome the knowledge gap. By combining high-resolution typing by next-generation sequencing (NGS) with haplotype segregation analysis of family pedigrees, this report adds to these efforts by presenting data based on a powerful strategy. Although often grouped for their shared language, history, and culture, the populations of the Arabian Peninsula represent a genetically diverse group. The United Arab Emirates (UAE) is situated in the southeast of the Arabian Peninsula, an ethnically diverse region that has emerged as a result of social and cultural influences arising from important bidirectional human migration events between the African, European, and Asian continents. The original people of the Arabian Peninsula lived a nomadic lifestyle, migrating around the peninsula in search of suitable waterholes, creating settlements that served as a hub for commerce and cultural exchange. The subsequent establishment of trade routes[33] has enhanced bidirectional gene flow into and out of the area[34], resulting in the present diversity of contemporary Arabia. This study characterizes and identifies conserved HLA CEHs of the UAE populations using high-resolution HLA pedigree-phased haplotypes. With the UAE recently establishing its national organ registry program, this study provides insights on the MHC of the UAE population, which is important for matching recipients to appropriate donors. In time, our understanding of the involvement of specific alleles of relevant MHC genes in autoimmune disease is expected to be revealed.

Methods

Recruitment

Families were approached and briefed on the study and invited to participate. The cohort also included a subset of five families that have been previously published by Tay, et al. [35]. Those families included healthy parents and at least one child with Type 1 Diabetes. Specifically, only the phased haplotypes of the healthy parents were retained for the current study. Families were randomly recruited from different parts of the UAE including northern, western, eastern, and south-eastern regions. All the participants recruited for the study were UAE nationals. Nonetheless, no sub-ethnic or country of ancestral origin information was collected from the recruited participants.

Ethics declarations

All participants who chose to participate in the study completed a consent form and a questionnaire approved by Mafraq Hospital’s Institutional Review Board (IRB) committee (MAF-REC 07/2016 04) and Dubai Health Authority (DSREC-07/2020_39). Informed written consent was obtained from all the participants, and they authorized the storage of their DNA samples. Written informed consent was obtained from the parent of participants under the age of 18 years at the time of sample collection. All methods were carried out in accordance with relevant guidelines and regulations approved jointly by the IRB committee at Mafraq Hospital (MAF-REC 07/2016 04) and Dubai Health Authority (DSREC-07/2020_39).

Sample collection and DNA extraction

In total, 235 saliva samples were collected from 41 UAE families, including one 3-generations family (family ID: HF8), using the Oragene-DNA collection kit (Genotek, Ottawa, Canada) according to the guidelines provided by the manufacturer. Genomic DNA (gDNA) was extracted from buccal cells using prepIT L2P reagents supplied with the Oragene-DNA kit (DNA Genotek, Canada), as per the manufacturer’s instructions. The quality of the gDNA was verified by OD260/OD280 > 1.8 measurements performed on Nanodrop One UV–Vis Spectrophotometer (Thermo Fisher Scientific, Waltham, USA) and by agarose gel. The concentration of each gDNA sample was measured using the dsDNA broad range fluorescence-based quantitation method (Denovix, Wilmington, USA).

High-resolution HLA typing by NGS

High-resolution HLA typing was conducted using the Holotype HLA 96/11 library kit (Omixon, Budapest, Hungary) according to the manufacturer’s protocol. The Holotype HLA 96/11 kit uses long-range PCR amplification in the gDNA sample preparation step to provide comprehensive gene coverage for up to 11 HLA loci (HLA-A, HLA-B, HLA-C, HLA-DRB1/3/4/5, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1). The library preparation step includes enzymatic fragmentation, end-repair, and ligation with indexed adaptors for each individual sample. The libraries are then combined into a single pooled library and size-selected using AMPure XP beads (Beckman Coulter, Massachusetts, USA). The concentration of the final library is determined using KAPPA library quantification ROX low kit on the Viaa7 Real-time PCR instrument (Applied Biosystems, Foster City, USA) (Kappa Biosystems, Wilmington, USA). The final library is then loaded onto the Illumina Miseq platform (Illumina, San Diego, USA). For analyses of results, FASTQ sequencing files are imported into Omixon’s HLA Twin Software v4.2.0 (Budapest, Hungary) where sequences are aligned to the most updated version of the International ImMunoGeneTics/ HLA (IMGT/HLA) database (www.ebi.ac.uk/imgt/hla/) using two independent computational algorithms for high confidence allele calling.

Segregation analysis

Segregation analysis by pedigree was independently conducted by the co-authors, and all haplotypes assigned by these individuals were concordant. Each family had identical 8-locus haplotypes (HLA-A-C-B-DRB1-DQA1-DQB1-DPA1-DPB1) by descent. When a parent’s genotype is missing, data of at least two non-HLA identical children were required for the family to be included in the study.

HLA nomenclature

This report follows the latest HLA nomenclature system for reporting and naming HLA alleles and haplotypes [36]. The asterisk "*" denotes molecular typing. The digits before the first colon (field 1) indicate the allele group or type. The subtype is indicated by the next set of digits (field 2), while synonymous variants are indicated by the third set of digits (field 3).

Population genetic analysis

The samples were genotyped at up to the 4th field of resolution. However, statistical population genetic analysis was limited to the 2nd field of resolution to allow for comparisons with previously published reports in other populations. Allele frequencies (A.F.), the degree of heterozygosity, and Guo and Thompson Hardy Weinberg equilibrium (HWE) at a locus-by-locus level were computed using Python for Population Genomics (PyPop v.0.7.0)[37]. The genetic diversity at the allelic level for the UAE cohort was calculated using polymorphism information content (PIC) and power of discrimination (PD) implemented in the FORSTAT tool[38]. Slatkin’s implementation of the Ewens-Watterson (EW) homozygosity test of neutrality, implemented in PyPop, was performed to examine the effect of natural selection on HLA loci. The test calculated the normalized deviation of homozygosity (Fnd) which is defined as the difference between observed and expected homozygosity divided by the square root of the expected homozygosity’s variance. Haplotypes HLA- A-C-B-DRB1-DQA1-DQB1, HLA-C-B, HLA-DRB1-DQA1-DQB1 and HLA-DPA1-DPB1 were observed and manually counted by the co-authors using MS Excel.

MHC conserved extended haplotypes (CEHs)

Putative CEHs (extending from HLA-C to HLA-DQB1) were identified through a previously described and established approach[3,8,13,15,19]. A haplotype frequency cut-off of 0.005 is usually used to distinguish a common CEH in a certain population, considering the high level of polymorphism within the MHC[8]. Nonetheless, due to the sample size, a cutoff of 0.02 is used in this study to distinguish CEHs in the current cohort. First, the complete dataset of 170 phased extended 8-locus HLA haplotypes (HLA- A-C-B-DRB1-DQA1-DQB1-DPA1-DPB1) obtained from the segregation analysis were sorted based on HLA-B, HLA-DRB1, and HLA-DQB1 loci respectively using Microsoft Excel. Next, 5-locus haplotypes (HLA- C-B-DRB1-DQA1-DQB1) Haplotypes that were observed at least 5 times were extracted for further analysis of CEH. Novel CEH were named according to a previously described system by Degli-Esposti, et al. [19], in which the CEH is identified by its HLA-B allele type, followed by a sequential number indicating its order of discovery (e.g., 18.1, 18.2, 18.3).

Analysis of genetic relationships with other populations

A Principal Component Analysis (PCA) plot and a phylogenetic tree were generated for 50 populations including the cohort studies herein, with high-resolution genotypes of HLA-A, HLA-B, and HLA-DRB1. Those loci were chosen as they exhibit the greatest level of heterogeneity, effectively representing world populations while simultaneously expanding the number of datasets available for the analysis. The world populations datasets were obtained from the Allele Frequency Net Database (AFND)[39]. The populations were selected from different world regions including the Middle East, Central and South Asia, Sub-Saharan Africa, North Africa, Oceania, South America, East Asia, and Europe. The world populations datasets were chosen only if they satisfy the gold and silver quality standard based on AFND criteria[39]. The PCA was conducted using IBM SPSS Statistics 19 software (IBM Corporation, Armonk, NY, USA). The phylogenetic tree was constructed using the neighbour-joining (NJ) clustering method implemented in POPTREEW. The distance was set to Nei's genetic distance (DA), and the Bootstrap to 1,000 replications.

Results

HLA allele and MHC haplotype frequencies: genetic similarity with other populations

The current cohort included 40 two-generation and one three-generation families from the UAE (see Table S1). In total, 170 phased HLA- A-C-B-DRB1-DQA1-DQB1-DPA1-DPB1 haplotypes were described by segregation analysis. Ten haplotypes were obtained from the three-generation family (referred to as HF8); 4 from the grandparents, and 6 from 3 individuals who married into the family. Ambiguities and allelic dropout in parental genotypes were resolved by inference from offspring. Only one and three genotypes were missing from HLA-DQA1 and HLA-DQB1 respectively, due to sequencing error. HLA class I and class II allele count, and frequencies are listed in Tables 1 and 2. Cumulatively, 31 different alleles were observed in HLA-A, 29 in HLA-C, 41 in HLA-B, 30 in HLA-DRB1, 13 in HLA-DQA1, and 15 in HLA-DQB1. The most frequent alleles were HLA-A*02:01 (A.F. 0.15), HLA-C*04:01 (A.F. 0.19), HLA-B*51:01 (A.F. 0.12), HLA-DRB1*03:01 (A.F. 0.29), HLA-DQA1*05:01 (A.F. 0.28), HLA-DQB1*02:01 (A.F. 0.29), HLA-DPA1*01:03 (A.F. 0.67), and HLA-DPB1*04:01 (A.F. 0.31).
Table 1

HLA class I (HLA-A, HLA-C, HLA-B) allelic count and frequencies observed in the UAE cohort. A.F.: allele frequency.

HLA-ACountA.F.HLA-CCountA.F.HLA-BCountA.F.
02:01250.1504:01320.1951:01200.12
11:01180.1107:02210.1208:01180.11
24:02140.0806:02200.1240:06130.08
32:01130.0815:02180.1150:01110.06
03:01120.0707:01100.0618:01100.06
68:01120.0716:02100.0658:0190.05
01:01110.0603:0290.0535:0180.05
30:02100.0612:0360.0435:0380.05
26:0190.0508:0250.0353:0160.04
30:0170.0412:0250.0335:0850.03
33:0370.0417:0150.0352:0150.03
23:0160.0415:0540.0207:0240.02
03:0240.0201:0220.0114:0240.02
29:0230.0202:0220.0141:0140.02
31:0120.0103:0420.0142:0140.02
68:0220.0107:0420.0157:0140.02
01:0310.0114:0220.0138:0130.02
02:0210.0115:0420.0145:0130.02
02:0310.0115:1320.0107:0520.01
02:0910.0116:0120.0113:0120.01
02:1110.0102:1010.0115:1020.01
26:1710.0102:1610.0127:0320.01
29:0110.0103:0310.0135:0220.01
29:1010.0104:0310.0139:0120.01
30:0410.0107:1810.0144:0220.01
33:0110.0108:0110.0158:0220.01
34:0210.0112:1910.0113:0210.01
36:0110.0116:0410.0114:0110.01
66:0210.0118:0110.0115:0210.01
69:0110.0115:0310.01
74:0110.0115:1710.01
15:2210.01
15:6710.01
37:0110.01
40:1610.01
44:0310.01
47:0310.01
51:0810.01
55:0110.01
73:0110.01
81:0110.01
Table 2

HLA class II (HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1) allelic count and frequencies observed in the UAE cohort. A.F.: allele frequency.

HLA-DRB1CountA.F.HLA-DQA1CountA.F.HLA-DQB1CountA.F.HLA-DPA1CountA.F.HLA-DPB1CountA.F.
03:01490.2905:01480.2802:01490.2901:031140.6704:01520.31
16:02160.0901:02420.2505:02350.2102:01420.2502:01340.20
16:01150.0905:05160.0903:02180.1102:0260.0414:01170.10
11:01120.0703:03120.0705:01140.0802:0730.0204:02140.08
04:0590.0501:01110.0703:01130.0803:0120.0101:0170.04
07:0190.0503:01110.0702:02100.0601:0410.0103:0170.04
01:0160.0401:0390.0506:0180.0501:1410.0113:0160.04
04:0260.0402:0170.0404:0260.0402:0910.0117:0160.04
01:0260.0301:0540.0206:0260.04104:0160.04
11:0450.0301:0430.0205:0330.0210:0140.02
15:0150.0304:0130.0203:1910.0118:0140.02
15:0240.0203:0220.0103:2710.0109:0130.02
15:0340.0205:0910.0103:3510.01105:0120.01
10:0130.0206:0310.01107:0120.01
03:0220.0106:0410.0115:0110.01
04:0320.0126:0110.01
04:0620.0139:0110.01
09:0120.0145:0110.01
13:0120.0191:0110.01
03:0710.01124:0110.01
04:0110.01
04:0410.01
08:0410.01
11:0210.01
13:0210.01
13:0310.01
14:0410.01
14:1510.01
14:2110.01
15:0610.01
HLA class I (HLA-A, HLA-C, HLA-B) allelic count and frequencies observed in the UAE cohort. A.F.: allele frequency. HLA class II (HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1) allelic count and frequencies observed in the UAE cohort. A.F.: allele frequency. Overall, no deviation from HWE was observed except for HLA-DQB1 (Table S2). The PIC and PD for HLA-A, HLA-C, HLA-B, HLA-DRB1, HLA-DQA1, and HLA-DQB1 were calculated to measure the extent of genetic diversity within the cohort (Table S2). The HLA class I loci were relatively more diverse compared to the HLA class II loci with HLA-B being the most polymorphic locus at a PIC of 0.94 and HLA-DQA1 being the least polymorphic locus with a PIC of 0.82. A PD value greater than 0.80 is indicative of a high degree of polymorphism[40]. The results of the EW homozygosity test of neutrality are summarized in Table S3. A large negative Fnd value suggests that the observed homozygosity is skewed toward balancing selection, while a strong positive value implies directional selection. From the results, only the HLA-DRB1 locus showed a slight directional selection. The two loci HLA-DPA1 and HLA-DPB1 were excluded from the HWE, PIC, PD, and EW homozygosity analyses. From HLA class I, the most frequent HLA-C-B two-locus haplotype was HLA- C*07:02-B*08:01 (H.F. 0.094) (Table 3). From HLA class II, HLA-DRB1*03:01-DQA1*05:01-DQB1*02:01 (H.F. 0.253), and HLA-DPA1*01:03-DPB1*04:01 (H.F. 0.276) were the most frequent HLA-DRB1-DQA1-DQB1 (Table 4) and HLA-DPA1-DPB1 haplotypes (Table 5), respectively. Please refer to supplementary Tables S4–S6 for the complete list of the HLA-C-B, HLA-DRB1-DQA1-DQB1 and HLA-DPA1-DPB1 frequencies.
Table 3

Five most frequent HLA-C-B two-locus haplotype counts observed in the UAE cohort. H.F.: haplotypes frequency.

HaplotypesCount (n = 170)H.F.
HLA- C*07:02-B*08:01160.094
HLA- C*15:02-B*40:06120.071
HLA- C*06:02-B*50:01100.059
HLA- C*03:02-B*58:0180.047
HLA- C*04:01-B*35:0380.047
Table 4

Five most frequent HLA-DRB1-DQA1-DQB1 three-locus haplotype counts observed in the UAE cohort. H.F.: haplotypes frequency.

HaplotypeCount (N = 170)H.F.
HLA- DRB1*03:01-DQA1*05:01-DQB1*02:01430.253
HLA- DRB1*16:01-DQA1*01:02-DQB1*05:02150.088
HLA- DRB1*16:02-DQA1*01:02-DQB1*05:02140.082
HLA- DRB1*11:02-DQA1*05:05-DQB1*03:0170.041
HLA- DRB1*04:05-DQA1*03:03-DQB1*03:0260.035
Table 5

Six most frequent HLA-DPA1-DPB1 two-locus haplotype counts observed in the UAE cohort. H.F.: haplotypes frequency.

HaplotypeCount (N = 170)H.F.
HLA- DPA1*01:03-DPB1*04:01470.276
HLA- DPA1*01:03-DPB1*02:01320.188
HLA- DPA1*02:01-DPB1*14:01150.088
HLA- DPA1*01:03-DPB1*04:02140.082
HLA- DPA1*01:03-DPB1*104:0160.035
HLA- DPA1*02:01-DPB1*17:0160.035
Five most frequent HLA-C-B two-locus haplotype counts observed in the UAE cohort. H.F.: haplotypes frequency. Five most frequent HLA-DRB1-DQA1-DQB1 three-locus haplotype counts observed in the UAE cohort. H.F.: haplotypes frequency. Six most frequent HLA-DPA1-DPB1 two-locus haplotype counts observed in the UAE cohort. H.F.: haplotypes frequency. The PCA plot shown in Fig. 1 shows that the UAE clusters with the Omani population (abbreviated as ‘Oma’) and the Baloch subpopulation of Iran (abbreviated as ‘IrB’), and then South American and European populations (with some proximity to East Asian populations). Similarly, the phylogenetic tree in Fig. 2 shows that the UAE population is genetically close to the Baloch subpopulation of Iran. Description and reference for each population dataset used in the PCA and phylogenetic tree are listed in Table S7.
Figure 1

Principal Component Analysis (PCA) for 50 populations (including the UAE cohort reported herein) from different world regions calculated using HLA-A, -B, and -DRB1 loci. The first component is explained by 58.0% of the variance, while the second component is described by 81.5% of the total variance. The Sub-Sharan Africa populations are denoted in yellow triangles, while European populations are represented by light blue dots; the Middle Easter populations are presented in red dots; the Oceania populations are in purple squares; the South Asian populations are indicated by orange dots; black dots were assigned to East Asian populations and green dots to South American groups. For the complete PCA plot and description of datasets used and their abbreviations, refer to Table S7.

Figure 2

A zoom in view of the neighbor-joining phylogenetic tree showing relatedness between the UAE population and other populations calculated using HLA-A, -B and -DRB1 loci. For the complete phylogenetic tree and description of datasets used and their abbreviations, refer to Table S7.

Principal Component Analysis (PCA) for 50 populations (including the UAE cohort reported herein) from different world regions calculated using HLA-A, -B, and -DRB1 loci. The first component is explained by 58.0% of the variance, while the second component is described by 81.5% of the total variance. The Sub-Sharan Africa populations are denoted in yellow triangles, while European populations are represented by light blue dots; the Middle Easter populations are presented in red dots; the Oceania populations are in purple squares; the South Asian populations are indicated by orange dots; black dots were assigned to East Asian populations and green dots to South American groups. For the complete PCA plot and description of datasets used and their abbreviations, refer to Table S7. A zoom in view of the neighbor-joining phylogenetic tree showing relatedness between the UAE population and other populations calculated using HLA-A, -B and -DRB1 loci. For the complete phylogenetic tree and description of datasets used and their abbreviations, refer to Table S7.

Identification of HLA conserved extended haplotypes

The complete list of the phase-segregated 5-locus MHC haplotypes (HLA-C-B-DRB1-DQA1-DQB1) observed in the current UAE cohort is presented in Table S8. To allow for a more rigorous identification of MHC CEHs in the UAE population, only CEHs with H.F. > 0.02, are described and discussed hereafter (See Table 6). Those include HLA- C*07:02-B*08:01-DRB1*03:01-DQA1*05:01-DQB1*02:01 (H.F. 0.094), HLA- C*15:02-B*40:06-DRB1*16:02-DQA1*01:02-DQB1*05:02 (H.F. 0.035), HLA- C*16:02-B*51:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 (H.F. 0.029), HLA- C*03:02-B*58:01-DRB1*03:01-DQA1*05:01-DQB1*02:01 (H.F. 0.024), and HLA- C*03:02-B*58:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 (H.F. 0.024).
Table 6

MHC conserved extended haplotypes (CEH) derived from 41 UAE families with H.F. > 0.02. H.F.: haplotypes frequency.

Conserved Extended Haplotypes (CEH)n (N = 170)H.F.
HLA- C*07:02B*08:01DRB1*03:01DQA1*05:01DQB1*02:01160.094
HLA- C*15:02B*40:06DRB1*16:02DQA1*01:02DQB1*05:0260.035
HLA- C*16:02B*51:01DRB1*16:01DQA1*01:02DQB1*05:0250.029
HLA- C*03:02B*58:01DRB1*03:01DQA1*05:01DQB1*02:0140.024
HLA- C*03:02B*58:01DRB1*16:01DQA1*01:02DQB1*05:0240.024
MHC conserved extended haplotypes (CEH) derived from 41 UAE families with H.F. > 0.02. H.F.: haplotypes frequency. When combined, these five CEHs represent 20.6% (35 out of 170) of the haplotype pool in the current UAE cohort. Subsequently, these CEH were analyzed to infer their most probable ancestry (MPA) based on previously published frequencies in African, Asian, and Caucasian populations[41]. MPA is based on evaluating the existence of distinctive ethnic/region-specific CEH in the relevant continental such that CEHs that are generally present in high frequency (e.g., H.F. > 0.10) in a particular non-recently admixed human continental group were regarded to be indicative of that regional origin. Table S9 provides the names for the CEHs observed in the study. HLA-B*08:01 (A.F. 0.11) was the most common allele inherited as part of the haplotype block HLA- C*07:02-B*08:01-DRB1*03:01-DQA1*05:01-DQB1*02:01 (H.F. 0.094) (Table 7). The most common HLA-A alleles linked to the HLA- C*07:02-B*08:01-DRB1*03:01-DQB1*02:01 CEH in the UAE cohort were HLA-A*26:01 (31.3%), HLA-A*68:01 (25.0%), HLA-A*24:02 (18.8%), HLA-68:01 (6.3%), HLA-A*02:01 (6.3%), HLA-A*03:02 (6.3%) and HLA-A*11:01 (6.3%), See Table 7. This CEH was frequently associated with HLA- DPA1*01:03-DPB1*02:01 (18.8%) and HLA- DPA1*01:03-DPB1*04:02 (25.0%) haplotype blocks.
Table 7

MHC haplotypes of the UAE families selected on the basis of HLA-B*08:01. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. CEH: conserved extended haplotypes; MPA: Most probable Ancestry; SA: South Asian; C: Caucasian.

FamilyHaplotypeHLA-AHLA-CHLA-BHLA-DRB1HLA-DQA1HLA-DQB1HLA-DPA1HLA-DPB1CEHMPA
DF24a26:01:01:0107:02:01:0108:01:0103:01:0105:01:01:0302:01:0102:01:0104:01:018.2SA[46,47,54]
DF22b26:01:01:0107:02:01:0108:01:0103:01:0105:01:01:0302:01:0102:01:0117:018.2SA[46,47,54]
DF13a26:01:01:0107:02:01:0108:01:01:0103:01:01:0305:01:01:0302:01:01:0101:03:01:0102:01:02:018.2SA[46,47,54]
HF20b26:01:01:0107:02:01:0108:01:01:0103:01:01:0305:01:01:0302:01:01:0102:01:01:0214:01:01:018.2SA[46,47,54]
DF31a26:01:01:0107:02:01:0108:01:01:0103:01:01:0305:01:01:0302:01:0101:03:01:0303:01:01:018.2SA[46,47,54]
DF24d68:01:01:0207:02:01:0108:01:0103:01:0105:01:01:0302:01:0101:03:01:0404:01:01:018.2SA[46,47,54]
HF19a68:01:01:0207:02:01:0108:01:01:0103:01:01:0105:01:01:0302:01:01:0101:03:01:0102:01:02:058.2SA[46,47,54]
HF7b68:01:01:0207:02:01:0108:01:01:0103:01:01:0305:01:01:0302:01:0101:03:01:0404:01:01:068.2SA[46,47,54]
HF19a68:01:01:0207:02:01:0108:01:01:0103:01:01:0105:01:01:0302:01:01:0101:03:01:0102:01:02:058.2SA[46,47,54]
DF23d68:02:01:0107:02:01:0108:01:0103:01:0105:01:01:0302:01:0101:03:01:0504:02:01:028.2SA[46,47,54]
HF9a24:02:01:0407:02:01:0108:01:01:0103:01:01:0305:01:01:0302:01:0101:03:01:0504:02:01:028.2SA[46,47,54]
HF9b24:02:01:0407:02:01:0108:01:01:0103:01:01:0305:01:01:0302:01:0101:03:01:0504:02:01:028.2SA[46,47,54]
HF9c24:02:01:0507:02:01:0108:01:01:0103:01:01:0305:01:01:0302:01:0102:07:01:0104:01:01:018.2SA[46,47,54]
HF28c03:02:01:0107:02:01:0108:01:01:0103:01:01:0105:01:01:0302:01:01:0101:03:01:0303:01:01:018.2SA[46,47,54]
HF7d02:01:01:0107:02:01:0108:01:01:0103:01:01:0305:01:01:0302:01:0102:01:01:06107:018.2SA[46,47,54]
HF29a11:01:01:0107:02:01:0108:01:01:0103:01:01:0305:01:01:0302:01:01:0201:03:01:0104:02:01:018.2SA[46,47,54]
HF11d29:02:01:0107:01:01:0108:01:01:0103:01:01:0105:01:01:0202:01:01:0102:01:01:0214:01:01:018.1C[8,15,19]
HF18b30:02:01:0307:01:01:0108:01:01:0103:01:01:0201:03:01:0106:01:01:0101:03:01:0204:01:01:04
MHC haplotypes of the UAE families selected on the basis of HLA-B*08:01. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. CEH: conserved extended haplotypes; MPA: Most probable Ancestry; SA: South Asian; C: Caucasian. Allele HLA-B*40:06 (A.F. 0.08) was frequently inherited as part of the HLA- C*15:02-B*40:06-DRB1*16:02-DQA1*01:02-DQB1*05:02 CEH (H.F. 0.035). Eighty-Three per cent (83.3%) of this CEH extended to include HLA-A*11:01 (See Table 8). This CEH was associated with HLA- DPA1*01:03-DPB1*02:01 (33.3%), HLA- DPA1*01:03-DPB1*04:02 (16.7%), HLA- DPA1*01:03-DPB1*04:02 (16.7%), HLA- DPA1*01:03-DPB1*18:01 (16.7%), HLA- DPA1*01:03-DPB1*04:01 (16.7%), and HLA- DPA1*02:01-DPB1*14:01 (16.7%).
Table 8

MHC haplotypes of UAE families marked by HLA-B*40:06. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. ‡Proposed.

FamilyHaplotypeHLA-AHLA-CHLA-BHLA-DRB1HLA-DQA1HLA-DQB1HLA-DPA1HLA-DPB1CEH
HF6a11:01:01:0115:02:01:0140:06:01:0203:01:01:0305:01:01:0302:01:01:0101:03:01:0404:01:01:04
HF38d11:01:01:0115:02:01:0140:06:01:0210:01:01:0301:05:01:0105:01:01:0501:03:01:0102:01:02:28
HF16b11:01:01:0115:02:01:0140:06:04:0114:04:01:0201:04:01:0205:03:01:0101:03:01:0102:01:02:01
HF30b11:01:01:0115:02:01:0140:06:01:0216:01:0101:02:02:0105:02:01:0102:01:01:0214:01:01:01
HF36b01:01:01:0115:02:01:0140:06:01:0216:02:01:0201:02:02:0105:02:0101:03:01:0218:01:01:0160.4
DF2d11:01:01:0115:02:01:0140:06:01:0216:02:01:0201:02:0205:02:0101:03:01:0404:01:0160.4
HF24a11:01:01:0115:02:01:0140:06:01:0216:02:01:0201:02:02:0105:02:01:0101:03:01:0102:01:02:0160.4
HF8i11:01:01:0115:02:01:0140:06:01:0216:02:01:0201:02:02:0105:02:01:0101:03:01:0104:02:01:0260.4
HF29b11:01:01:0115:02:01:0140:06:01:0216:02:01:0201:02:02:0105:02:01:0101:03:01:0502:01:02:0360.4
HF25d11:01:01:0115:02:01:0140:06:01:0216:02:01:0201:02:02:0105:02:01:0102:01:01:0214:01:01:0160.4
MHC haplotypes of UAE families marked by HLA-B*40:06. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. ‡Proposed. HLA-B*51:01 (A.F. 0.12) allele was the most frequent HLA-B allele in the current cohort, and it was frequently observed as part of the HLA- C*16:02-B*51:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 CEH (H.F. 0.029) (See Table 9). This CEH was either associated with HLA-A*32:01 (60%) or HLA-A*02:01 (40%) and extended to include HLA-DPA1*01:03-DPB1*02:01.
Table 9

MHC haplotypes of UAE families selected on the basis of HLA-B*51:01. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. ‡Proposed.

FamilyHaplotypeHLA-AHLA-CHLA-BHLA-DRB1HLA-DQA1HLA-DQB1HLA-DPA1HLA-DPB1CEH
HF39b02:01:01:0116:02:01:0151:01:01:0116:01:0101:02:02:0105:02:01:0101:03:01:0102:01:02:0551.2
HF22a02:01:01:0116:02:01:0151:01:01:0116:01:0101:02:02:0105:02:01:0101:03:01:0102:01:02:0551.2
HF8a32:01:01:0116:02:01:0151:01:01:0116:01:0101:02:02:0105:02:01:0101:03:01:0102:01:02:0151.2
HF8f32:01:01:0116:02:01:0151:01:01:0116:01:0101:02:02:0105:02:01:0101:03:01:0102:01:02:0151.2
HF8j32:01:01:0116:02:01:0151:01:01:0116:01:0101:02:02:0105:02:01:0101:03:01:0502:01:02:0151.2
MHC haplotypes of UAE families selected on the basis of HLA-B*51:01. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. ‡Proposed. The HLA-B*58:01 allele (A.F. 0.05) was associated with two different CEHs including the East Asian CEH 58.1 (HLA- C*03:01-B*58:01-DRB1*03:01-DQA1*05:01-DQB1*02:01)[15], and HLA- C*03:02-B*58:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 (Table 10). Both haplotypes were associated with the same class I haplotype block (HLA- A*33:03-C*03:02-B*58:01). Fifty per cent of the 58.1 CEHs were associated with HLA- DPA1*02:02-DPB1*13:01, while HLA- DPA1*01:03-DPB1*04:01 haplotype was associated with 50% of HLA- C*03:02-B*58:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 CEH observed.
Table 10

MHC haplotypes of UAE families marked by HLA-B*58:01. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. CEH: conserved extended haplotypes; MPA: Most probable Ancestry; EA: East Asia; ‡Proposed.

FamilyHaplotypeHLA-AHLA-CHLA-BHLA-DRB1HLA-DQA1HLA-DQB1HLA-DPA1HLA-DPB1CEHMPA
HF26c02:01:01:0103:02:02:0158:01:01:0103:01:01:0105:01:01:0302:01:01:0102:01:01:0214:01:01:0158.1EA[5]
HF10d23:01:01:0103:02:02:0158:01:01:0103:01:01:0105:01:01:0302:01:01:0102:01:01:0317:01:01:0158.1EA[5]
HF17b33:03:01:0103:02:02:0158:01:01:0103:01:01:0105:01:01:0302:01:01:0101:03:01:0204:01:01:0458.1EA[5]
HF7a33:03:01:0103:02:02:0158:01:01:0103:01:01:0305:01:01:0302:01:0101:03:01:0204:01:01:0158.1EA[5]
DF14d33:03:01:0103:02:02:0158:01:01:0116:01:0101:02:0205:02:0102:02:0213:01:0158.2
HF39d33:03:01:0103:02:02:0158:01:01:0116:01:0101:02:02:0105:02:01:0101:03:01:0404:01:01:0458.2
HF1a33:03:01:0103:02:02:0158:01:01:0116:01:0101:02:02:0105:02:01:0102:01:01:0214:01:01:0158.2
DF14d33:03:0103:02:02:0158:01:01:0116:01:0101:02:0205:02:0102:02:0213:01:0158.2
MHC haplotypes of UAE families marked by HLA-B*58:01. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. CEH: conserved extended haplotypes; MPA: Most probable Ancestry; EA: East Asia; ‡Proposed.

Discussion

The first whole genomes analysis of two UAE nationals[42,43] has provided insights into the genomic structure and the putative genetic origins of its population. Following that, a comprehensive, large-scale stratification study of the UAE population concluded that genetic admixture throughout the Arabian Peninsula's eastern shore and south-eastern tip happened gradually and without clear social stratification boundaries[43]. This, and another mitogenome study[44], have shown that there was no apparent association between birthplace and ancestral background, indicating that the contemporary UAE population developed over generations prior to the establishment of the current political borders with a significant genetic influence from the Middle East, Central/South Asia, and Sub-Sahara[43]. Conserved extended haplotypes (CEHs) of the MHC, and their fragments, have been shown to be useful as markers for disease association, immune response, and anthropology. This study describes the diversity of MHC CEHs derived from 41 UAE families. As in the previously cited publications, the data presented herein suggest evidence of gene flow from neighbouring ethnic groups in the contemporary UAE population. Overall, the most prevalent HLA class I allele lineages reported [e.g., HLA-A*02 (A.F. 15.30%), HLA-A*11 (A.F. 10.60%), HLA-C*04 (A.F. 19.40%), HLA-C*06 (A.F. 11.80%), HLA-C*07 (A.F. 20.10%), HLA-B*08 (A.F. 10.60%), HLA-B*50 (A.F. 6.50%) and HLA-B*51 (A.F. 11.80%)] are consistent with previous reports on the UAE population using PCR-SSP methods[45]. The current study detected 5 putative CEHs in the current UAE population, three of which were identified as novel CEHs. Overall, the aggregate percentage of those 5 putative CEHs was 20.6%. As noted earlier, HLA-B is the most polymorphic HLA locus. Thus, individual CEHs will be discussed hereafter based on the relevance of the HLA-B allele each CEH contains. The examination of the MHC CEHs in the current cohort has revealed that HLA-B*08:01 (A.F. 0.11), the second most frequent HLA-B allele, commonly marked the HLA- C*07:02-B*08:01-DRB1*03:01-DQA1*05:01-DQB1*02:01 CEH, which extended to include HLA-A*26:01 (31.3%), HLA-68:01 (25.0%), and HLA-A*24:02 (18.8%). This CEH, previously assigned as 8.2 by Witt, et al. [46] in Northern Indians, differs from the Caucasian 8.1 at the HLA-C locus, in the complement region, and by several repeat units at most microsatellite loci. Hence, it has been suggested that the two CEHs are not derived from one another[46,47]. This CEH was also found to be commonly associated with HLA-A*26:01in Asian Indians[47]. The 8.2 CEH was also observed at 2.68% in Kuwaiti unrelated subjects[13], and 3.00% in unrelated Saudi Arabian bone marrow donors[32]. Of the total number of HLA- C*07:02-B*08:01-DRB1*03:01-DQA1*05:01-DQB1*02:01 CEHs observed, 25% were extended to HLA-A*68:01. The association of the 8.2 CEH with the HLA-A*68:01 allele has not been identified in South Indians. Nonetheless, the HLA-A*68:01 allele has been found to be highly prevalent in Native Americans[48] and Africans[49], whereas it is found to be at low levels in Southeast Asia[50]. A genome-wide study of populations of the Arabian Peninsula demonstrated a Sub-Saharan African input of only 4.0% by 1,754 Common Era (CE) in a cohort from the UAE[51]. Therefore, it can be argued that HLA-A*68:01 was introduced to the UAE from a Sub-Saharan founder, considering that both West and East African populations were transported to the Middle East, Arabia, and the Indian Ocean during the 15th to 19th centuries during a time when the slave trade was common[52]. HLA-A*68:01 is of particular interest due to several unusual features, such as its weak binding affinity to CD8 and its ability to bind unusual long peptides because of peptide bending in the binding groove[53]. Overall, 88.9% of the HLA-B*08:01 alleles observed were part of CEHs identified in South Asians[46,47,54]. On the other hand, however, one family (Family IDs: HF11) carried the Caucasian 8.1 CEH, implying a possible Caucasian origin (Table 7). According to the IMGT/HLA database HLA-B*40 is one of the most polymorphic lineages of HLA antigens[55]. However, only two HLA-B*40 subtypes were identified in this study, specifically HLA-B*40:06 (7.60%) and HLA-B*40:16 (0.60%). The second most prevalent haplotype in the current cohort CEH HLA- C*15:02-B*40:06-DRB1*16:02-DQA1*01:02-DQB1*05:02 extended to include HLA-A*11:01. Interestingly, unlike the other CEH in this study, class I fragments of this haplotype (HLA- A*11:01-C*15:02-B*40:06) were also observed (Table 8). CEHs were initially described using serological methods, where the HLA-B*40:01 allele is recognized by B60 antigen serotype[13]. Subsequently, CEHs that included HLA-B*40:01 such as HLA- C*03:04-B*40:01-DRB1*04:04-DQA1*03:01-DQB1*03:02, HLA- C*03:04-B*40:01-DRB1*08:01-DQA1*04:01-DQB1*04:02, and HLA- C*03:04-B*40:01-DRB1*13:02-DQA1*01:02-DQB1*06:04 were named 60.1, 60.2, and 60.3, respectively. In this regard, we suggest referring to the current CEH (HLA- C*15:02-B*40:06-DRB1*16:02-DQA1*01:02-DQB1*05:02) as 60.4. This CEH also existed, at a frequency as low as 0.92% and 1.10%, in a cohort from Kuwait[30] and the Balouch group in Iran[56], respectively. Although HLA-B*51:01 (A.F. 0.118) is the most frequent HLA-B allele in the current cohort, it was only observed in a single CEH, HLA- C*16:02-B*51:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 (unlike HLA-B*08:01 or even HLA-B*58:01). This CEH also extended to include HLA-A*32:01 and HLA-DPA1*01:03:01-DPB1*02:01. We suggest referring to this CEH as 51.2 since CEH 51.1 has been previously reported[15,57]. The HLA-B*51 allele is considered the risk factor for Behçet’s disease, a disease that has a strong geographical prevalence distribution along the ancient Silk Road which ran from the Mediterranean to Northern China[58]. Therefore, the prevalence of Behçet’s is highest among populations of Japan, China, Korea, Turkey, Iran, Tunisia, and other Middle Eastern countries[59], whereas it is low in Africa, Oceania, and South America, where the frequency of the HLA-B*51 allele is low[60,61]. HLA-B*58:01 (A.F. 0.05) was associated with two different CEH, the East Asian 58.1 CEH[5,15] (HLA- C*03:02-B*58:01-DRB1*03:01-DQA1*05:01-DQB1*02:01), and HLA- C*03:02-B*58:01-DRB1*16:01-DQA1*01:02-DQB1*05:02, which both extended to include HLA-A*33:03. We suggest that the latter be referred to as 58.2. Both CEH 58.1 and 58.2 only differed in their HLA- DRB1-DQA1-DQB1 haplotype. The 58.1 CEH was associated with HLA- DRB1*03:01-DQA1*05:02-DQB1*02:01, similar to 8.2 CEH, whereas 58.2 shared the same HLA- DRB1*16:01-DQA1*01:02-DQB1*05:02 with CEH 51.2. The East Asian 58.1 and its recombinants were also observed at high frequency in people from the Arabian Peninsula [31,32,39], as well as South Asia[46], but not in Caucasians[8], indicating a possible genetic link with populations from East Asia. This can be supported by historical documents which indicate that bidirectional trade movements from Central and South Asia through the Arabian Gulf into the Arabian peninsula's south-eastern region, which currently includes the UAE, were feasible and did occur[62]. Furthermore, as evident by autosomal Short Tandem Repeats (STRs) genotyping, this cultural diffusion from Arabia has shaped worldwide Muslim populations in Asia including the Thai-Malay[63] and Chinese Muslim populations[64]. Furthermore, analysis by autosomal STRs[65], mitochondrial DNA[66], and Y-chromosomes[67] have revealed that historically attested movements into the Indian subcontinent have accounted for a cultural diffusion as well as a minor but detectable gene flow from West Asia and Arabia. Natural selection[3,8,17] is often considered an important component in the evolution of the MHC and the production of CEHs. However, evident by the information presented here and other reports[42-44], it seems that the MHC genomic landscape of the contemporary UAE nationals must have also been shaped by both transcontinental migration between Africa, Asia, and Europe, which involved a diverse array of ethnic groupings[34,51], and the nomadic lifestyles of some Arabian communities, notably the Bedouins. HLA allele frequency as genetic estimators were shown to have the ability to mimic the results obtained with genome-wide data for PCA[6]. In the current study, high resolution and quality HLA allele frequency data from Middle Eastern populations were scarce, which may have resulted in an imbalance of the clustering pattern in the PCA plot (Fig. 1) and the phylogenetic tree (Fig. 2). The analysis of the genetic relationship between the current UAE dataset with world populations using PCA and the phylogenetic tree seem to provide significantly different qualitative findings from one another. Additionally, the identified CEH and their ethnic identities observed in the current cohort do not seem to correlate with the results of the PCA plot or the phylogenetic tree. We argue that the direction of the gene flow at the CEH level (whether it is from East to West or vice versa) requires additional evaluations of the whole Asian continent from the Arabian Peninsula to north-eastern Siberia, and from the northern Urals to Southeast Asia. High-resolution HLA typing and haplotyping are critical in hematopoietic stem cell transplantation for both unrelated and related donors, particularly in reducing post-transplantation adverse outcomes[68,69]. It is noted that a single high-resolution HLA mismatch may have the same negative effect on outcomes as a low-resolution one[70,71]. As a result, high-resolution HLA typing to lower the probability of missing a clinically important mismatch has been proposed[68]. To this end, data presented herein provide a framework for donor selection during organ and bone marrow transplantation, as well as the identification of permitted mismatches disease risk markers. Previously, results generated from this laboratory on UAE families with Type 1 Diabetes identified two CEHs (namely 8.2 and 50.2) that have been previously associated with the disease in a neighbouring Indian population[54]. Likewise, several alleles and CEHs associated with autoimmunity and related conditions in other genetically related populations have been identified with high frequency in the current cohort. In this context, further research could be directed into comparing the influence of established HLA autoimmune diseases associations in Arabs using pedigree-based analysis. For example, all the Indian 8.2 CEHs identified herein were intact and therefore present a good model for recombination and disease association mapping. Further investigation can be carried out in a larger sample size in addition to genotyping different marker catalogues including non-HLA genes (e.g. MICA, MICB, TNF, C2, Bf, C4, among others), microsatellite markers, and polymorphic Alu insertions (POALINs)[72-74] across the MHC of the UAE populations to ascertain the degree of similarities to other haplotypes of the same CEH blocks, measure the sizes of DNA blocks that may be fixed, and map the recombination hotspots.

Conclusion

Despite being based on a limited number of haplotypes, this preliminary report identified conserved extended HLA haplotypes in UAE populations and presented evidence of the presence of shared CEHs between the UAE Arab population and other neighboring populations. To the best of our knowledge, this is the first attempt to identify CEH in Arabs using high-resolution HLA pedigree-phased haplotypes. Supplementary Tables. Supplementary Figures.
  71 in total

1.  Extended HLA haplotypes in a Carib Amerindian population: the Yucpa of the Perija Range.

Authors:  Z Layrisse; Y Guedez; E Domínguez; N Paz; S Montagnani; M Matos; F Herrera; V Ogando; O Balbas; A Rodríguez-Larralde
Journal:  Hum Immunol       Date:  2001-09       Impact factor: 2.850

2.  An update to HLA nomenclature, 2010.

Authors:  S G E Marsh; E D Albert; W F Bodmer; R E Bontrop; B Dupont; H A Erlich; M Fernández-Viña; D E Geraghty; R Holdsworth; C K Hurley; M Lau; K W Lee; B Mach; M Maiers; W R Mayr; C R Müller; P Parham; E W Petersdorf; T Sasazuki; J L Strominger; A Svejgaard; P I Terasaki; J M Tiercy; J Trowsdale
Journal:  Bone Marrow Transplant       Date:  2010-03-29       Impact factor: 5.483

3.  The major histocompatability complex (MHC) contains conserved polymorphic genomic sequences that are shuffled by recombination to form ethnic-specific haplotypes.

Authors:  S Gaudieri; C Leelayuwat; G K Tay; D C Townend; R L Dawkins
Journal:  J Mol Evol       Date:  1997-07       Impact factor: 2.395

4.  Pedigree-Defined Haplotypes and Their Applications to Genetic Studies.

Authors:  Chester A Alper; Charles E Larsen
Journal:  Methods Mol Biol       Date:  2017

5.  Disease associations with complotypes, supratypes and haplotypes.

Authors:  R L Dawkins; F T Christiansen; P H Kay; M Garlepp; J McCluskey; P N Hollingsworth; P J Zilko
Journal:  Immunol Rev       Date:  1983       Impact factor: 12.988

6.  Extended major histocompatibility complex haplotypes in man: role of alleles analogous to murine t mutants.

Authors:  C A Alper; Z L Awdeh; D D Raum; E J Yunis
Journal:  Clin Immunol Immunopathol       Date:  1982-08

7.  Differentiation between African populations is evidenced by the diversity of alleles and haplotypes of HLA class I loci.

Authors:  K Cao; A M Moormann; K E Lyke; C Masaberg; O P Sumba; O K Doumbo; D Koech; A Lancaster; M Nelson; D Meyer; R Single; R J Hartzman; C V Plowe; J Kazura; D L Mann; M B Sztein; G Thomson; M A Fernández-Viña
Journal:  Tissue Antigens       Date:  2004-04

Review 8.  Conserved, extended MHC haplotypes.

Authors:  C A Alper; Z Awdeh; E J Yunis
Journal:  Exp Clin Immunogenet       Date:  1992

Review 9.  The HLA genomic loci map: expression, interaction, diversity and disease.

Authors:  Takashi Shiina; Kazuyoshi Hosomichi; Hidetoshi Inoko; Jerzy K Kulski
Journal:  J Hum Genet       Date:  2009-01-09       Impact factor: 3.172

Review 10.  Major Histocompatibility Complex and Hematopoietic Stem Cell Transplantation: Beyond the Classical HLA Polymorphism.

Authors:  Alice Bertaina; Marco Andreani
Journal:  Int J Mol Sci       Date:  2018-02-22       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.