| Literature DB >> 35504942 |
Halima Alnaqbi1,2, Guan K Tay1,3,4, Sarah El Hajj Chehadeh1,5, Habiba Alsafar6,7,8.
Abstract
Aside from its anthropological relevance, the characterization of the allele frequencies of genes in the human Major Histocompatibility Complex (MHC) and the combination of these alleles that make up MHC conserved extended haplotypes (CEHs) is necessary for histocompatibility matching in transplantation as well as mapping disease association loci. The structure and content of the MHC region in Middle Eastern populations remain poorly characterized, posing challenges when establishing disease association studies in ethnic groups that inhabit the region and reducing the capacity to translate genetic research into clinical practice. This study was conceived to address a gap of knowledge, aiming to characterize CEHs in the United Arab Emirates (UAE) population through segregation analysis of high-resolution, pedigree-phased, MHC haplotypes derived from 41 families. Twenty per cent (20.5%) of the total haplotype pool derived from this study cohort were identified as putative CEHs in the UAE population. These consisted of CEHs that have been previously detected in other ethnic groups, including the South Asian CEH 8.2 [HLA- C*07:02-B*08:01-DRB1*03:01-DQA1*05:01-DQB1*02:01 (H.F. 0.094)] and the common East Asian CEH 58.1 [HLA- C*03:02-B*58:01-DRB1*03:01- DQA1*05:01-DQB1*02:01 (H.F. 0.024)]. Additionally, three novel CEHs were identified in the current cohort, including HLA- C*15:02-B*40:06-DRB1*16:02-DQB1*05:02 (H.F. 0.035), HLA- C*16:02-B*51:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 (H.F. 0.029), and HLA- C*03:02-B*58:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 (H.F. 0.024). Overall, the results indicate a substantial gene flow with neighbouring ethnic groups in the contemporary UAE population including South Asian, East Asian, African, and European populations. Importantly, alleles and haplotypes that have been previously associated with autoimmune diseases (e.g., Type 1 Diabetes) were also present. In this regard, this study emphasizes that an appreciation for ethnic differences can provide insights into subpopulation-specific disease-related polymorphisms, which has remained a difficult endeavour.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35504942 PMCID: PMC9065074 DOI: 10.1038/s41598-022-11256-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
HLA class I (HLA-A, HLA-C, HLA-B) allelic count and frequencies observed in the UAE cohort. A.F.: allele frequency.
| 02:01 | 25 | 0.15 | 04:01 | 32 | 0.19 | 51:01 | 20 | 0.12 |
| 11:01 | 18 | 0.11 | 07:02 | 21 | 0.12 | 08:01 | 18 | 0.11 |
| 24:02 | 14 | 0.08 | 06:02 | 20 | 0.12 | 40:06 | 13 | 0.08 |
| 32:01 | 13 | 0.08 | 15:02 | 18 | 0.11 | 50:01 | 11 | 0.06 |
| 03:01 | 12 | 0.07 | 07:01 | 10 | 0.06 | 18:01 | 10 | 0.06 |
| 68:01 | 12 | 0.07 | 16:02 | 10 | 0.06 | 58:01 | 9 | 0.05 |
| 01:01 | 11 | 0.06 | 03:02 | 9 | 0.05 | 35:01 | 8 | 0.05 |
| 30:02 | 10 | 0.06 | 12:03 | 6 | 0.04 | 35:03 | 8 | 0.05 |
| 26:01 | 9 | 0.05 | 08:02 | 5 | 0.03 | 53:01 | 6 | 0.04 |
| 30:01 | 7 | 0.04 | 12:02 | 5 | 0.03 | 35:08 | 5 | 0.03 |
| 33:03 | 7 | 0.04 | 17:01 | 5 | 0.03 | 52:01 | 5 | 0.03 |
| 23:01 | 6 | 0.04 | 15:05 | 4 | 0.02 | 07:02 | 4 | 0.02 |
| 03:02 | 4 | 0.02 | 01:02 | 2 | 0.01 | 14:02 | 4 | 0.02 |
| 29:02 | 3 | 0.02 | 02:02 | 2 | 0.01 | 41:01 | 4 | 0.02 |
| 31:01 | 2 | 0.01 | 03:04 | 2 | 0.01 | 42:01 | 4 | 0.02 |
| 68:02 | 2 | 0.01 | 07:04 | 2 | 0.01 | 57:01 | 4 | 0.02 |
| 01:03 | 1 | 0.01 | 14:02 | 2 | 0.01 | 38:01 | 3 | 0.02 |
| 02:02 | 1 | 0.01 | 15:04 | 2 | 0.01 | 45:01 | 3 | 0.02 |
| 02:03 | 1 | 0.01 | 15:13 | 2 | 0.01 | 07:05 | 2 | 0.01 |
| 02:09 | 1 | 0.01 | 16:01 | 2 | 0.01 | 13:01 | 2 | 0.01 |
| 02:11 | 1 | 0.01 | 02:10 | 1 | 0.01 | 15:10 | 2 | 0.01 |
| 26:17 | 1 | 0.01 | 02:16 | 1 | 0.01 | 27:03 | 2 | 0.01 |
| 29:01 | 1 | 0.01 | 03:03 | 1 | 0.01 | 35:02 | 2 | 0.01 |
| 29:10 | 1 | 0.01 | 04:03 | 1 | 0.01 | 39:01 | 2 | 0.01 |
| 30:04 | 1 | 0.01 | 07:18 | 1 | 0.01 | 44:02 | 2 | 0.01 |
| 33:01 | 1 | 0.01 | 08:01 | 1 | 0.01 | 58:02 | 2 | 0.01 |
| 34:02 | 1 | 0.01 | 12:19 | 1 | 0.01 | 13:02 | 1 | 0.01 |
| 36:01 | 1 | 0.01 | 16:04 | 1 | 0.01 | 14:01 | 1 | 0.01 |
| 66:02 | 1 | 0.01 | 18:01 | 1 | 0.01 | 15:02 | 1 | 0.01 |
| 69:01 | 1 | 0.01 | 15:03 | 1 | 0.01 | |||
| 74:01 | 1 | 0.01 | 15:17 | 1 | 0.01 | |||
| 15:22 | 1 | 0.01 | ||||||
| 15:67 | 1 | 0.01 | ||||||
| 37:01 | 1 | 0.01 | ||||||
| 40:16 | 1 | 0.01 | ||||||
| 44:03 | 1 | 0.01 | ||||||
| 47:03 | 1 | 0.01 | ||||||
| 51:08 | 1 | 0.01 | ||||||
| 55:01 | 1 | 0.01 | ||||||
| 73:01 | 1 | 0.01 | ||||||
| 81:01 | 1 | 0.01 |
HLA class II (HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1) allelic count and frequencies observed in the UAE cohort. A.F.: allele frequency.
| 03:01 | 49 | 0.29 | 05:01 | 48 | 0.28 | 02:01 | 49 | 0.29 | 01:03 | 114 | 0.67 | 04:01 | 52 | 0.31 |
| 16:02 | 16 | 0.09 | 01:02 | 42 | 0.25 | 05:02 | 35 | 0.21 | 02:01 | 42 | 0.25 | 02:01 | 34 | 0.20 |
| 16:01 | 15 | 0.09 | 05:05 | 16 | 0.09 | 03:02 | 18 | 0.11 | 02:02 | 6 | 0.04 | 14:01 | 17 | 0.10 |
| 11:01 | 12 | 0.07 | 03:03 | 12 | 0.07 | 05:01 | 14 | 0.08 | 02:07 | 3 | 0.02 | 04:02 | 14 | 0.08 |
| 04:05 | 9 | 0.05 | 01:01 | 11 | 0.07 | 03:01 | 13 | 0.08 | 03:01 | 2 | 0.01 | 01:01 | 7 | 0.04 |
| 07:01 | 9 | 0.05 | 03:01 | 11 | 0.07 | 02:02 | 10 | 0.06 | 01:04 | 1 | 0.01 | 03:01 | 7 | 0.04 |
| 01:01 | 6 | 0.04 | 01:03 | 9 | 0.05 | 06:01 | 8 | 0.05 | 01:14 | 1 | 0.01 | 13:01 | 6 | 0.04 |
| 04:02 | 6 | 0.04 | 02:01 | 7 | 0.04 | 04:02 | 6 | 0.04 | 02:09 | 1 | 0.01 | 17:01 | 6 | 0.04 |
| 01:02 | 6 | 0.03 | 01:05 | 4 | 0.02 | 06:02 | 6 | 0.04 | 104:01 | 6 | 0.04 | |||
| 11:04 | 5 | 0.03 | 01:04 | 3 | 0.02 | 05:03 | 3 | 0.02 | 10:01 | 4 | 0.02 | |||
| 15:01 | 5 | 0.03 | 04:01 | 3 | 0.02 | 03:19 | 1 | 0.01 | 18:01 | 4 | 0.02 | |||
| 15:02 | 4 | 0.02 | 03:02 | 2 | 0.01 | 03:27 | 1 | 0.01 | 09:01 | 3 | 0.02 | |||
| 15:03 | 4 | 0.02 | 05:09 | 1 | 0.01 | 03:35 | 1 | 0.01 | 105:01 | 2 | 0.01 | |||
| 10:01 | 3 | 0.02 | 06:03 | 1 | 0.01 | 107:01 | 2 | 0.01 | ||||||
| 03:02 | 2 | 0.01 | 06:04 | 1 | 0.01 | 15:01 | 1 | 0.01 | ||||||
| 04:03 | 2 | 0.01 | 26:01 | 1 | 0.01 | |||||||||
| 04:06 | 2 | 0.01 | 39:01 | 1 | 0.01 | |||||||||
| 09:01 | 2 | 0.01 | 45:01 | 1 | 0.01 | |||||||||
| 13:01 | 2 | 0.01 | 91:01 | 1 | 0.01 | |||||||||
| 03:07 | 1 | 0.01 | 124:01 | 1 | 0.01 | |||||||||
| 04:01 | 1 | 0.01 | ||||||||||||
| 04:04 | 1 | 0.01 | ||||||||||||
| 08:04 | 1 | 0.01 | ||||||||||||
| 11:02 | 1 | 0.01 | ||||||||||||
| 13:02 | 1 | 0.01 | ||||||||||||
| 13:03 | 1 | 0.01 | ||||||||||||
| 14:04 | 1 | 0.01 | ||||||||||||
| 14:15 | 1 | 0.01 | ||||||||||||
| 14:21 | 1 | 0.01 | ||||||||||||
| 15:06 | 1 | 0.01 |
Five most frequent HLA-C-B two-locus haplotype counts observed in the UAE cohort. H.F.: haplotypes frequency.
| Haplotypes | Count (n = 170) | H.F. | |||
|---|---|---|---|---|---|
| 16 | 0.094 | ||||
| 12 | 0.071 | ||||
| 10 | 0.059 | ||||
| 8 | 0.047 | ||||
| 8 | 0.047 | ||||
Five most frequent HLA-DRB1-DQA1-DQB1 three-locus haplotype counts observed in the UAE cohort. H.F.: haplotypes frequency.
| Haplotype | Count (N = 170) | H.F. | |||||
|---|---|---|---|---|---|---|---|
| 43 | 0.253 | ||||||
| 15 | 0.088 | ||||||
| 14 | 0.082 | ||||||
| 7 | 0.041 | ||||||
| 6 | 0.035 | ||||||
Six most frequent HLA-DPA1-DPB1 two-locus haplotype counts observed in the UAE cohort. H.F.: haplotypes frequency.
| Haplotype | Count (N = 170) | H.F. | |||
|---|---|---|---|---|---|
| 47 | 0.276 | ||||
| 32 | 0.188 | ||||
| 15 | 0.088 | ||||
| 14 | 0.082 | ||||
| 6 | 0.035 | ||||
| 6 | 0.035 | ||||
Figure 1Principal Component Analysis (PCA) for 50 populations (including the UAE cohort reported herein) from different world regions calculated using HLA-A, -B, and -DRB1 loci. The first component is explained by 58.0% of the variance, while the second component is described by 81.5% of the total variance. The Sub-Sharan Africa populations are denoted in yellow triangles, while European populations are represented by light blue dots; the Middle Easter populations are presented in red dots; the Oceania populations are in purple squares; the South Asian populations are indicated by orange dots; black dots were assigned to East Asian populations and green dots to South American groups. For the complete PCA plot and description of datasets used and their abbreviations, refer to Table S7.
Figure 2A zoom in view of the neighbor-joining phylogenetic tree showing relatedness between the UAE population and other populations calculated using HLA-A, -B and -DRB1 loci. For the complete phylogenetic tree and description of datasets used and their abbreviations, refer to Table S7.
MHC conserved extended haplotypes (CEH) derived from 41 UAE families with H.F. > 0.02. H.F.: haplotypes frequency.
| Conserved Extended Haplotypes (CEH) | n (N = 170) | H.F. | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| – | – | – | – | 16 | 0.094 | |||||
| – | – | – | – | 6 | 0.035 | |||||
| – | – | – | – | 5 | 0.029 | |||||
| – | – | – | – | 4 | 0.024 | |||||
| – | – | – | – | 4 | 0.024 | |||||
MHC haplotypes of the UAE families selected on the basis of HLA-B*08:01. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. CEH: conserved extended haplotypes; MPA: Most probable Ancestry; SA: South Asian; C: Caucasian.
| Family | Haplotype | CEH | MPA | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| DF24 | a | 26:01:01:01 | 02:01:01 | 04:01:01 | 8.2 | SA[ | |||||
| DF22 | b | 26:01:01:01 | 02:01:01 | 17:01 | 8.2 | SA[ | |||||
| DF13 | a | 26:01:01:01 | 01:03:01:01 | 02:01:02:01 | 8.2 | SA[ | |||||
| HF20 | b | 26:01:01:01 | 02:01:01:02 | 14:01:01:01 | 8.2 | SA[ | |||||
| DF31 | a | 26:01:01:01 | 01:03:01:03 | 03:01:01:01 | 8.2 | SA[ | |||||
| DF24 | d | 68:01:01:02 | 01:03:01:04 | 04:01:01:01 | 8.2 | SA[ | |||||
| HF19 | a | 68:01:01:02 | 01:03:01:01 | 02:01:02:05 | 8.2 | SA[ | |||||
| HF7 | b | 68:01:01:02 | 01:03:01:04 | 04:01:01:06 | 8.2 | SA[ | |||||
| HF19 | a | 68:01:01:02 | 01:03:01:01 | 02:01:02:05 | 8.2 | SA[ | |||||
| DF23 | d | 68:02:01:01 | 01:03:01:05 | 04:02:01:02 | 8.2 | SA[ | |||||
| HF9 | a | 24:02:01:04 | 01:03:01:05 | 04:02:01:02 | 8.2 | SA[ | |||||
| HF9 | b | 24:02:01:04 | 01:03:01:05 | 04:02:01:02 | 8.2 | SA[ | |||||
| HF9 | c | 24:02:01:05 | 02:07:01:01 | 04:01:01:01 | 8.2 | SA[ | |||||
| HF28 | c | 03:02:01:01 | 01:03:01:03 | 03:01:01:01 | 8.2 | SA[ | |||||
| HF7 | d | 02:01:01:01 | 02:01:01:06 | 107:01 | 8.2 | SA[ | |||||
| HF29 | a | 11:01:01:01 | 01:03:01:01 | 04:02:01:01 | 8.2 | SA[ | |||||
| HF11 | d | 29:02:01:01 | 02:01:01:02 | 14:01:01:01 | 8.1 | C[ | |||||
| HF18 | b | 30:02:01:03 | 07:01:01:01 | 08:01:01:01 | 03:01:01:02 | 01:03:01:01 | 06:01:01:01 | 01:03:01:02 | 04:01:01:04 |
MHC haplotypes of UAE families marked by HLA-B*40:06. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. ‡Proposed.
| Family | Haplotype | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| HF6 | a | 11:01:01:01 | 15:02:01:01 | 40:06:01:02 | 03:01:01:03 | 05:01:01:03 | 02:01:01:01 | 01:03:01:04 | 04:01:01:04 | |
| HF38 | d | 11:01:01:01 | 15:02:01:01 | 40:06:01:02 | 10:01:01:03 | 01:05:01:01 | 05:01:01:05 | 01:03:01:01 | 02:01:02:28 | |
| HF16 | b | 11:01:01:01 | 15:02:01:01 | 40:06:04:01 | 14:04:01:02 | 01:04:01:02 | 05:03:01:01 | 01:03:01:01 | 02:01:02:01 | |
| HF30 | b | 11:01:01:01 | 15:02:01:01 | 40:06:01:02 | 16:01:01 | 01:02:02:01 | 05:02:01:01 | 02:01:01:02 | 14:01:01:01 | |
| HF36 | b | 01:01:01:01 | 01:03:01:02 | 18:01:01:01 | 60.4‡ | |||||
| DF2 | d | 11:01:01:01 | 01:03:01:04 | 04:01:01 | 60.4‡ | |||||
| HF24 | a | 11:01:01:01 | 01:03:01:01 | 02:01:02:01 | 60.4‡ | |||||
| HF8 | i | 11:01:01:01 | 01:03:01:01 | 04:02:01:02 | 60.4‡ | |||||
| HF29 | b | 11:01:01:01 | 01:03:01:05 | 02:01:02:03 | 60.4‡ | |||||
| HF25 | d | 11:01:01:01 | 02:01:01:02 | 14:01:01:01 | 60.4‡ |
MHC haplotypes of UAE families selected on the basis of HLA-B*51:01. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. ‡Proposed.
| Family | Haplotype | CEH | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| HF39 | b | 02:01:01:01 | 01:03:01:01 | 02:01:02:05 | 51.2‡ | |||||
| HF22 | a | 02:01:01:01 | 01:03:01:01 | 02:01:02:05 | 51.2‡ | |||||
| HF8 | a | 32:01:01:01 | 01:03:01:01 | 02:01:02:01 | 51.2‡ | |||||
| HF8 | f | 32:01:01:01 | 01:03:01:01 | 02:01:02:01 | 51.2‡ | |||||
| HF8 | j | 32:01:01:01 | 01:03:01:05 | 02:01:02:01 | 51.2‡ |
MHC haplotypes of UAE families marked by HLA-B*58:01. The group of alleles that make up the designated CEH in column “CEH” are denoted in bold. CEH: conserved extended haplotypes; MPA: Most probable Ancestry; EA: East Asia; ‡Proposed.
| Family | Haplotype | CEH | MPA | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| HF26 | c | 02:01:01:01 | 02:01:01:02 | 14:01:01:01 | 58.1 | EA[ | |||||
| HF10 | d | 23:01:01:01 | 02:01:01:03 | 17:01:01:01 | 58.1 | EA[ | |||||
| HF17 | b | 33:03:01:01 | 01:03:01:02 | 04:01:01:04 | 58.1 | EA[ | |||||
| HF7 | a | 33:03:01:01 | 01:03:01:02 | 04:01:01:01 | 58.1 | EA[ | |||||
| DF14 | d | 33:03:01:01 | 02:02:02 | 13:01:01 | 58.2‡ | ||||||
| HF39 | d | 33:03:01:01 | 01:03:01:04 | 04:01:01:04 | 58.2‡ | ||||||
| HF1 | a | 33:03:01:01 | 02:01:01:02 | 14:01:01:01 | 58.2‡ | ||||||
| DF14 | d | 33:03:01 | 02:02:02 | 13:01:01 | 58.2‡ |