| Literature DB >> 31970929 |
Carolyn K Hurley1, Jane Kempenich2, Kim Wadsworth2, Jürgen Sauter3, Jan A Hofmann3, Daniel Schefzyk3, Alexander H Schmidt3, Pablo Galarza4, Maria B R Cardozo4, Malgorzata Dudkiewicz5, Lucie Houdova6, Pavel Jindra7, Betina S Sorensen8, Latha Jagannathan9,10, Ankit Mathur10, Tiina Linjama11, Tigran Torosian12, Rafi Freudenberger13, Anastasios Manolis14, John Mavrommatis14, Nezih Cereb15, Sigal Manor16, Nira Shriki16, Nicoletta Sacchi17, Reem Ameen18, Raewyn Fisher19, Heather Dunckley20, Irene Andersen21, Ahmed Alaskar22, Mohsen Alzahrani22, Ali Hajeer22, Dunia Jawdat22, Grazia Nicoloso23, Pawinee Kupatawintu24, Louise Cho25, Ashminder Kaur25, Mats Bengtsson26, Jason Dehn2.
Abstract
A catalog of common, intermediate and well-documented (CIWD) HLA-A, -B, -C, -DRB1, -DRB3, -DRB4, -DRB5, -DQB1 and -DPB1 alleles has been compiled from over 8 million individuals using data from 20 unrelated hematopoietic stem cell volunteer donor registries. Individuals are divided into seven geographic/ancestral/ethnic groups and data are summarized for each group and for the total population. P (two-field) and G group assignments are divided into one of four frequency categories: common (≥1 in 10 000), intermediate (≥1 in 100 000), well-documented (≥5 occurrences) or not-CIWD. Overall 26% of alleles in IPD-IMGT/HLA version 3.31.0 at P group resolution fall into the three CIWD categories. The two-field catalog includes 18% (n = 545) common, 17% (n = 513) intermediate, and 65% (n = 1997) well-documented alleles. Full-field allele frequency data are provided but are limited in value by the variations in resolution used by the registries. A recommended CIWD list is based on the most frequent category in the total or any of the seven geographic/ancestral/ethnic groups. Data are also provided so users can compile a catalog specific to the population groups that they serve. Comparisons are made to three previous CWD reports representing more limited population groups. This catalog, CIWD version 3.0.0, is a step closer to the collection of global HLA frequencies and to a clearer view of HLA diversity in the human population as a whole.Entities:
Keywords: HLA; alleles; ethnic groups; gene frequency
Mesh:
Substances:
Year: 2020 PMID: 31970929 PMCID: PMC7317522 DOI: 10.1111/tan.13811
Source DB: PubMed Journal: HLA ISSN: 2059-2302 Impact factor: 4.513
Participating registries and number of volunteer donors with HLA assignments contributed
| Registry | Number of volunteer donors | World Health Organization region |
|---|---|---|
| Argentine HSC Donors Registry | 129 879 | Americas |
| Bone Marrow Donor Programme Singapore | 32 875 | Western Pacific |
| Central Unrelated Potential Bone Marrow Donor and Cord Blood Registry Poltransplant | 6918 | European |
| Czech National Marrow Donors Registry | 36 289 | European |
| Danish Stem Cell Donors—West | 21 234 | European |
| DKMS (Germany, Polska, UK, India, USA) | 5 316 717 | Multi‐region |
| Finnish Stem Cell Registry | 20 427 | European |
| Gift of Life Marrow Registry | 125 489 | Americas |
| Hellenic Transplant Organization (HTO) | 121 088 | European |
| India—DATRI Blood Stem Cell Donor Registry | 347 989 | South East Asia |
| Israel‐Ezer Mizion BMDR | 138 175 | European |
| Italian Bone Marrow Donor Registry | 30 117 | European |
| Saudi Stem Cell Donor Registry | 34 729 | Eastern Mediterranean |
| Kuwait National Stem Cell Registry | 673 | Eastern Mediterranean |
| New Zealand Bone Marrow Donor Registry | 237 | Western Pacific |
| NMDP/Be The Match‐USA and Mexico | 1 567 473 | Americas |
| Norwegian Bone Marrow Donor Registry | 7202 | European |
| Swiss Blood Stem Cells | 84 789 | European |
| Thai National Stem Cell Donor Registry | 5000 | South East Asia |
| Tobias Registry of Swedish Bone Marrow Donors | 50 502 | European |
|
|
|
Number of donors varied by locus. The largest number is listed.
Regions and included countries are defined at http://apps.who.int/medicinedocs/en/d/Jwhozip16e/2.html#Jwhozip16e.2
Self‐reported ancestrya of individuals included in the dataset and the number of P group assignments represented by each population group
| Number of HLA assignments (P group, two‐field) | |||||||
|---|---|---|---|---|---|---|---|
| Ancestry | HLA‐A | HLA‐B | HLA‐C | HLA‐DRB1 | HLA‐DRB3/4/5 | HLA‐DQB1 | HLA‐DPB1 |
| African/African American (AFA) | 362 966 | 370 020 | 347 465 | 373 023 | 72 964 | 361 723 | 333 978 |
| Asian/Pacific Islands (API) | 1 263 239 | 1 274 300 | 1 195 602 | 1 305 847 | 117 098 | 1 299 499 | 1 080 334 |
| European/European descent (EURO) | 11 273 788 | 11 260 688 | 10 488 431 | 11 637 283 | 635 606 | 11 507 902 | 10 655 587 |
| Middle East/North coast of Africa (MENA) | 389 811 | 391 642 | 379 666 | 396 117 | 15 867 | 400 453 | 296 490 |
| South or Central America/Hispanic/Latino (HIS) | 640 949 | 655 091 | 606 689 | 668 011 | 193 504 | 654 807 | 580 236 |
| Native American populations (NAM) | 62 326 | 63 323 | 57 232 | 65 025 | 16 966 | 64 730 | 59 017 |
| Unknown/Not asked/Multiple ancestries/Other (UNK) | 1 229 127 | 1 248 774 | 1 149 774 | 1 286 614 | 128 841 | 1 195 557 | 966 013 |
|
|
|
|
|
|
|
|
|
Ancestry designations were determined by each contributing registry.
Odd number is because of removal of alleles having incorrect format/assignment not based on HLA nomenclature.
Self‐reported ancestrya of individuals included in the dataset and the number of totalb assignments represented by each population group
| Number of HLA assignments (total) | |||||||
|---|---|---|---|---|---|---|---|
| Ancestry | HLA‐A | HLA‐B | HLA‐C | HLA‐DRB1 | HLA‐DRB3/4/5 | HLA‐DQB1 | HLA‐DPB1 |
| African/African American (AFA) | 388 476 | 388 579 | 389 619 | 389 241 | 198 134 | 378 275 | 336 535 |
| Asian/Pacific Islands (API) | 1 291 125 | 1 298 351 | 1 255 403 | 1 318 229 | 297 151 | 1 314 598 | 1 082 177 |
| European/European descent (EURO) | 11 929 417 | 11 941 489 | 11 827 887 | 11 938 778 | 1 329 462 | 11 735 570 | 10 680 854 |
| Middle East/North coast of Africa (MENA) | 402 447 | 402 160 | 403 229 | 403 373 | 25 832 | 402 840 | 296 914 |
| South or Central America/Hispanic/Latino (HIS) | 700 632 | 700 912 | 690 043 | 700 830 | 400 002 | 678 680 | 581 973 |
| Native American populations (NAM) | 66 971 | 66 967 | 67 072 | 66 984 | 39 907 | 66 309 | 59 113 |
| Unknown/Not asked/Multiple ancestries/Other (UNK) | 1 320 493 | 1 320 714 | 1 302 662 | 1 321 251 | 226 038 | 1 227 926 | 969 187 |
|
|
|
|
|
|
|
|
|
Ancestry designations were determined by each contributing registry.
Total assignments include HLA assignments at all levels of resolution as submitted by the registries.
Odd number is because of removal of alleles having incorrect format/assignment not based on HLA nomenclature.
The representation of total P group two‐field HLA assignments in registry databases
| Locus | Total P group alleles in IPD‐IMGT/HLA 3.31.0 | Alleles observed in registry dataset | Percentage of total alleles observed | Alleles in CIWD categories | Percentage of alleles assigned as CIWD |
|---|---|---|---|---|---|
| HLA‐A | 2827 | 1521 | 53.8 | 673 | 23.8 |
| HLA‐B | 3537 | 1853 | 52.4 | 864 | 24.4 |
| HLA‐C | 2494 | 1366 | 54.8 | 602 | 24.1 |
| HLA‐DRB1 | 1559 | 817 | 52.4 | 422 | 27.1 |
| HLA‐DRB3 | 125 | 66 | 52.8 | 32 | 25.6 |
| HLA‐DRB4 | 63 | 17 | 27.0 | 10 | 15.9 |
| HLA‐DRB5 | 50 | 31 | 62.0 | 15 | 30.0 |
| HLA‐DQB1 | 669 | 384 | 57.4 | 179 | 26.8 |
| HLA‐DPB1 | 628 | 430 | 68.5 | 258 | 41.1 |
Abbreviations: C, common; I, intermediate; WD, well‐documented.
Figure 1Distribution of HLA alleles into three frequency categories (common, intermediate, well‐documented) at P group (two‐field) resolution
Figure 2Distribution of HLA alleles into three frequency categories at G group resolution
Overall comparison of HLA P two‐field assignments among CIWD catalogsa , b
| 2.0.0 CWD | EFI CWD | China CWD | |||||||
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
|
| Common n = 63 | WD n = 169 | Not‐CWD | Common n = 35 | WD n = 180 | Not‐CWD n = 513 | Common n = 26 | WD n = 112 | Not‐CWD n = 590 |
| Common total n = 105 | 60 | 43 | 2 | 34 | 36 | 35 | 26 | 28 | 51 |
| Intermediate n = 108 | 3 | 64 | 41 | 1 | 31 | 76 | 0 | 18 | 90 |
| Well‐documented n = 460 | 0 | 59 | 401 | 0 | 106 | 354 | 0 | 21 | 439 |
| Not‐CIWD | 0 | 3 | 52 | 0 | 7 | 48 | 0 | 45 | 10 |
|
| |||||||||
|
| Common n = 121 | WD n = 230 | Not‐CWD n = 578 | Common n = 62 | WD n = 257 | Not‐CWD n = 610 | Common n = 54 | WD n = 143 | Not‐CWD n = 732 |
| Common total n = 195 | 119 | 74 | 2 | 61 | 60 | 74 | 54 | 60 | 81 |
| Intermediate n = 128 | 2 | 78 | 48 | 1 | 39 | 88 | 0 | 14 | 114 |
| Well‐documented n = 541 | 0 | 71 | 470 | 0 | 143 | 398 | 0 | 23 | 518 |
| Not‐CIWD n = 65 | 0 | 7 | 58 | 0 | 15 | 50 | 0 | 46 | 19 |
|
| |||||||||
|
| Common n = 36 | WD n = 87 | Not‐CWD n = 508 | Common n = 29 | WD n = 158 | Not‐CWD n = 444 | Common n = 24 | WD n = 82 | Not‐CWD n = 525 |
| Common total n = 72 | 36 | 34 | 2 | 26 | 16 | 30 | 24 | 23 | 25 |
| Intermediate n = 99 | 0 | 41 | 58 | 3 | 28 | 68 | 0 | 18 | 81 |
| Well‐documented n = 431 | 0 | 11 | 420 | 0 | 108 | 323 | 0 | 18 | 413 |
| Not‐CIWD n = 29 | 0 | 1 | 28 | 0 | 6 | 23 | 0 | 23 | 6 |
|
| |||||||||
|
| Common n = 72 | WD n = 138 | Not‐CWD n = 247 | Common n = 42 | WD n = 86 | Not‐CWD n = 329 | Common n = 34 | WD n = 93 | Not‐CWD n = 330 |
| Common total n = 86 | 62 | 24 | 0 | 42 | 18 | 26 | 34 | 25 | 27 |
| Intermediate n = 98 | 8 | 68 | 22 | 0 | 25 | 73 | 0 | 29 | 69 |
| Well‐documented n = 238 | 2 | 40 | 196 | 0 | 37 | 201 | 0 | 16 | 222 |
| Not‐CIWD n = 35 | 0 | 6 | 29 | 0 | 6 | 29 | 0 | 23 | 12 |
|
| |||||||||
|
| Common n = 14 | WD n = 9 | Not‐CWD n = 35 | Common | WD | Not‐CWD | Common | WD | Not‐CWD |
| Common total n = ND | Data insufficient | Not reported | Not reported | ||||||
| Intermediate n = ND | Data insufficient | ||||||||
| Well‐documented n = 57 | 14 | 8 | 35 | ||||||
| Not‐CIWD n = 1 | 0 | 1 | 0 | ||||||
|
| |||||||||
|
| Common n = 19 | WD n = 4 | Not‐CWD n = 165 | Common n = 20 | WD n = 42 | Not‐CWD n = 126 | Common n = 15 | WD n = 20 | Not‐CWD n = 153 |
| Common total n = 29 | 19 | 3 | 7 | 17 | 5 | 7 | 15 | 6 | 8 |
| Intermediate n = 31 | 0 | 0 | 31 | 2 | 8 | 21 | 0 | 4 | 27 |
| Well‐documented n = 119 | 0 | 1 | 118 | 1 | 26 | 92 | 0 | 4 | 115 |
| Not‐CIWD n = 9 | 0 | 0 | 9 | 0 | 3 | 6 | 0 | 6 | 3 |
|
| |||||||||
|
| Common n = 38 | WD n = 11 | Not‐CWD n = 210 | Common n = 27 | WD n = 45 | Not‐CWD n = 187 | Common | WD | Not‐CWD |
| Common total n = 58 | 38 | 5 | 15 | 26 | 15 | 17 | Not reported | ||
| Intermediate n = 49 | 0 | 5 | 44 | 1 | 16 | 32 | |||
| Well‐documented n = 151 | 0 | 1 | 150 | 0 | 13 | 138 | |||
| Not‐CIWD n = 1 | 0 | 0 | 1 | 0 | 1 | 0 | |||
Abbreviations: C, common; I, intermediate; WD, well‐documented; ND, not determined.
Based on the highest frequency for any population. Supporting Information Table S3a lists the alleles and their frequency categories: HLA‐A Supporting Information Table S3a, HLA‐B 3b, HLA‐C 3c, HLA‐DRB1 3d, HLA‐DRB3/4/5 3e, HLA‐DQB1 3f, HLA‐DPB1 3g.
Alleles observed at least five times in any population in the current dataset are included in this table. Also included are alleles that were observed in previous datasets but observed in this study less than five times. Note that this two‐field designation is the equivalent of a “P” assignment so that alleles within a P group that encode polypeptides that differ outside of the antigen recognition domain are not summed separately. Any nonexpressed alleles that are excluded from P groups are listed separately.
Some alleles listed in earlier CWD versions are not included because they are within a P group or were condensed into a two‐field assignment.
Not‐CWD or not‐CIWD based only on alleles categorized as CIWD observed in any catalog.
Figure 3Distribution of nonexpressed HLA alleles into three frequency categories and 1‐4 occurrences at P group resolution