Literature DB >> 31207125

High-resolution analysis of the HLA-A, -B, -C and -DRB1 alleles and national and regional haplotype frequencies based on 120 926 volunteers from the Italian Bone Marrow Donor Registry.

Nicoletta Sacchi1, Mauro Castagnetta1, Valeria Miotti2, Lucia Garbarino1, Annamaria Gallina1.   

Abstract

HLA genes are highly polymorphic and structurally complex. They are located in the major histocompatibility complex (MHC) on chromosome 6, and the frequency of alleles and haplotypes varies widely among human populations. In this paper, we calculated the allele and haplotype frequencies using the HLA data of more than 120 000 Italian unrelated bone marrow donors enrolled in the national registry (IBMDR) and typed them with a high-resolution (HR) method for the HLA-A, -B, -C and -DRB1 alleles. The allele frequency data were obtained by manual counting; haplotype frequencies were calculated using the expectation maximisation (EM) algorithm. The total numbers of observed alleles were 226 for HLA-A, 343 for HLA-B, 201 for HLA-C and 210 for HLA-DRB1, which account for 5.4%, 6.7%, 5.2% and 8.5%, respectively, of each locus allele (IPD-IMGT/HLA Database Release 3.32, April 2018). The three most frequent Italian haplotypes were HLA-A*01:01~B*08:01~C*07:01~DRB1*03:01 (2.5%), A*02:01~B*18:01~C*07:01~DRB1*11:04 (1.1%) and A*30:01~B*13:02~C*06:02~DRB1*07:01 (1.1%). Moreover, for a relevant subset of the examined population (>100 000 individuals), the birthplace was available, and thus, we grouped the frequency data based on the corresponding Italian geographic areas, describing the HLA specificity of the Italian regional populations. The haplotype frequencies were also compared between national and regional data, and we observed remarkable differences in the regional haplotype frequencies, particularly in Sardinia. This study represents a valid tool to identify a more efficient haematopoietic stem cell unrelated donor recruitment and selection strategy, as well as for population genetic and HLA-disease association fields.
© 2019 The Authors. HLA: Immune Response Genetics published by John Wiley & Sons Ltd.

Entities:  

Keywords:  frequency; haplotype; regions; unrelated donors

Mesh:

Substances:

Year:  2019        PMID: 31207125      PMCID: PMC6771744          DOI: 10.1111/tan.13613

Source DB:  PubMed          Journal:  HLA        ISSN: 2059-2302            Impact factor:   4.513


INTRODUCTION

Allogenic haematopoietic stem cell transplantation (HSCT) is a widely used and effective therapy for haematopoietic malignant diseases and numerous other disorders. HR donor‐recipient matching for the human leukocyte antigen (HLA) genes improves patient survival as the number of mismatches correlates with the risk of rejection and/or graft vs host disease (GVHD).1, 2, 3, 4, 5 When a related HLA‐identical donor is not available, an alternative donor of haematopoietic stem cells must be found. Although allo‐HSCT from other sources, such as cord blood and related HLA‐mismatched or haploidentical donors, can be an option,6, 7, 8 an unrelated donor who matches with the corresponding patient at least for HLA‐A, ‐B, ‐C and ‐DRB1 at HR (HLA 8/8 matched) is still one of the best options.9 Consequently, there is the need to develop and maintain a useful and efficient unrelated donor registry. This can be done by increasing donor numbers,10 by recruiting young males,11 or by increasing the possibility of identifying a fully matched donor using available HLA allele and haplotype frequencies12, 13, 14 to establish an efficient recruitment and selection strategy. In fact, the knowledge of population‐specific HR HLA haplotype frequency distributions facilitates individual donor searches and provides the theoretical background for estimating the chance for a patient to find fully matched donors in the registry. Unfortunately, due to the high level of HLA polymorphism in the Italian population,15, 16 50% of Italian patients who need HSCT are currently unable to find an HLA 8/8 matched donor.17 Consequently, the characterisation of HR HLA haplotype frequency distribution in the Italian population is helpful for applying a recruitment strategy that increases the diversity of the donor pool. The Italian national Bone Marrow Donor Registry (IBMDR) was started in 1989 with the objective of providing HLA‐matched unrelated donors for Italian patients and worldwide patients at large. IBMDR manages the unrelated haematopoietic stem cell donor recruitment, maintenance and search countrywide. This registry is composed of a network of 17 active regional donor registries and 75 donor centres representing the Italian geographic regions. At the end of December 2017, IBMDR listed 392 873 available donors, and approximately 25 000 new donors are registered every year. Since 2015, IBMDR has performed HLA ‐A, ‐B, ‐C and ‐DRB1 typing at a HR level (two fields) in all newly recruited volunteers. Additional HLA HR‐typing data are obtained from patient‐directed typing or due to prospective typing of partially typed young donors. Due to these strategies, the IBMDR has accumulated a large data set of HR HLA‐typed individuals (more than 131 000 at the end of 2017). Over the last 10 years, several papers have been published on HLA allele and haplotype distributions in the Italian population.16, 18, 19 These studies analysed unrelated donor HLA data and demonstrated that the genetic structure of the Italian population could be represented by volunteer haematopoietic stem cell donors enrolled in the IBMDR. However, in those studies, the haplotype frequencies were estimated at the HLA low resolution level or were calculated from limited samples of two field‐typed donors. In this study, a sample of 120 926 randomly HR‐typed donors was extracted from the IBMDR database, and the frequencies of the HLA‐A, ‐B, ‐C and ‐DRB1 alleles and haplotypes were estimated. The distribution of HLA allele and haplotype frequencies were reported nationally for Italy and then grouped into regional data to assess regional differences in the frequency data. We did not estimate the frequencies of the DQB and DPB alleles even if they can be a matching criterion in the selection of an unrelated haematopoietic stem cell donor because these two loci are not routinely typed at recruitment. In Italy, the DQB and DPB typing is only performed if there is a request for a specific patient, and this fact could create bias in the frequency analysis of these two loci. Furthermore, we investigated the prevalence of observed alleles by applying the criteria described by Mack et al.20 to identify common (C) and well‐documented (WD) alleles in our database and then compared our findings with those of the CWD European catalogue from EFI.21

MATERIALS AND METHODS

Sample population

Our analysis consisted of 120 926 individuals (dataset 1) who were included in the Italian Registry IBMDR at the end of December 2017 and typed for HLA‐A, ‐B, ‐C and ‐DRB1 at the HR level. To avoid bias, we included only the donors fully typed at the recruitment step in this study. All donors included in the analysis fulfilled the eligibility criteria and were registered in the IBMDR donor centres located in all 20 Italian regions. At the time of the analysis, all subjects were from 18 to 55 years old, and 77.8% were younger than 35 years. The sample was composed of 45.6% males and 54.4% females (Table 1). Upon recruitment, volunteers were asked to sign an informed consent form and to provide personal information. For a sample of 104 135 donors (data set 2), the birthplace and ethnic origin were available. In our study, the HLA data of these individuals were classified and grouped according to the Italian geographical region where they were born.
Table 1

Characteristics of the data sets

Sample sizeGenderAge
Males (%)18‐2526‐3536‐4546‐55Median (25th‐75th)
Data set 1120 92645.644 466 (36.8%)49 582 (41.0%)23 760 (19.6%)3118 (2.6%)28 (23‐35)
Data set 2104 13545.937 459 (36.0%)43 254 (41.5%)20 761 (19.9%)2661 (2.6%)28 (24‐35)
Data set 355 53846.818 496 (33.3%)22 441 (40.4%)12 167 (21.9%)2434 (4.4%)29 (24‐36)
Characteristics of the data sets A sample of 55 538 individuals (data set 3) with no group “P” or group “G” alleles at any HLA‐A, ‐B, ‐C and ‐DRB1 locus was used to calculate the number of observed alleles. This sample was also used to establish the catalogue of Italian national CWD alleles by applying the criteria described by Mack et al..20 To put our findings into the context of relevant existing CWD catalogues, if necessary, allele designations were collapsed to two fields of resolution. The reference datasets we used are the CWD 2.0.0 catalogue (ASHI CWD),20 based on observations from several worldwide populations, and the EFI CWD catalogue,21 based on individuals of European origin.

HLA typing and processing of HLA data

All donors included in this study were typed for HLA‐A, B, C and ‐DRB1 loci at a HR level by ASHI or EFI‐accredited tissue typing laboratories using HR molecular biology techniques (SBT, SSO, SSP and more recently NGS). The alleles are generally assigned in two field forms, and ambiguities within the relevant exons were resolved with different approaches. Nevertheless, in the analysed population samples, some alleles were not distinguished by typing systems, and genomic regions were not defined outside the antigen recognition domain (ARD). Therefore, some HLA alleles with nucleotide sequences that encode the same protein sequence for the ARD (exons 2 and 3 for HLA class I and exon 2 for HLA class II alleles) were designated group “P” alleles; on the other hand, some alleles belonging to the same P group were unambiguously assigned (eg, DRB1*14:01P and DRB1*14:54). The same protocol was performed for some G group alleles (eg, C*04:01:01G and C*04:01:01:01). To avoid bias, the same approach of Schmidt et al.12 was applied: alleles with synonymous mutations inside or outside the relevant exons were merged to the corresponding two field alleles, and alleles that differed by nonsynonymous mutations outside the relevant exons were merged. The resulting alleles were characterised through the letter “g” (for “group”) that was appended to the first possible allele.22, 23

Statistical analysis

The HLA‐A, ‐B, ‐C and ‐DRB1 allele frequencies were obtained by the direct counting method. The Arlequin software package (version 3.5.2.2, Excoffier & Lischer, 201024) was used to estimate maximum likelihood haplotype frequencies based on the expectation‐maximisation (EM) algorithm and to assess the Hardy‐Weinberg equilibrium (HWE) for each of the four loci. A total of 50 starting points and ≤1000 interactions were chosen as input parameters for the EM algorithm, and the threshold for stopping the algorithm was ε = 10−7. We did not assess HWE in data set 1 because the national sample has a very large size, and this contributed to the determination of a very small P‐value and a small HWE deviation. A previous report showed25 that deviations from HWE did not affect the accuracy of HF estimations with the EM algorithm in these cases; therefore, we decided to test HWE only at the regional level. The input parameters for the Markov chain Monte Carlo test by Guo and Thompson26 were 106 Markov chain steps and 105 dememorization steps. Values of P < 0.01 were regarded as significant.

Evaluation of genetic variation in the Italian regions

Italy is subdivided into 20 geographical regions from North to South: Lombardia, Piemonte, Valle d'Aosta, Liguria, Veneto, Friuli Venezia Giulia, Province TN e BZ and Emilia Romagna (Northern Italy), Toscana, Marche, Umbria, Lazio, Abruzzo, Molise (Central Italy), Campania, Puglia, Basilicata, Calabria, Sicilia (Southern Italy), Sardinia. In this study, we estimated the HLA Italian regional frequencies based on the donors' birthplace. In particular, we extracted the birthplace from the Italian health insurance card (which contains the birthplace code) of the donors who self‐reported Italian origin in the consent form at recruitment. This sample of 104 135 donors (Dataset 2) was grouped into the 20 Italian regional populations with the goal of reflecting the genetic diversity of the populations in this country. To estimate if the regional sample sizes (RSS) obtained were representative of the specific regional population, we applied the same approach as in other studies,18 which define the RSS as acceptable when the ratio between sampled individuals/resident population is >5/10000. Table S1 reports the numbers of individuals of data set 2 grouped per region and this ratio index calculated on the resident population having the same age as our donors (18‐55 years). Like any other approach used to classify populations in geographic subgroups, our method is affected by a possible selection bias since it is not able to take into account possible gene flow due to population migrations, which, in Italy, usually occur from the South to the North. Nevertheless, by comparing the obtained Italian HLA regional frequencies with those from previous studies based on living district information,18, 19 we found comparable results. For this reason, we suppose that these subsamples are sufficiently large to be representative of the Italian regional populations independently of demographic flows.

RESULTS

HLA allele frequencies

Allele frequencies were calculated from the genotypes of 120 926 individuals processed as previously described and are reported in Table 2 (first 20 frequency ranked alleles) and in Table S2. Cumulative allele frequencies for the four analysed HLA genes are displayed in Figure 1. The greatest allelic diversity in our data set occurs for HLA‐B, and the lowest diversity occurs for HLA‐C. For each locus, the three most frequent alleles we found were A*02:01g (22.8%), A*24:02g (12.3%) and A*01:01g (11.5%); B*51:01g (9.8%), B*18:01g (9.5%) and B*35.01g (8.0%); C*04:01g (17.5%), C*07:01g (17.1%) and C*06:02g (9.9%); DRB1*07:01g (12.5%), DRB1*11:01g (11.6%) and DRB1*11:04g (10.1%), as reported in Table 2. Hardy‐Weinberg equilibrium (HWE) was tested at the regional level, and no significant difference from the expected equilibrium proportions was detected.
Table 2

The first 20 frequency‐ranked HLA‐A, ‐B, ‐C and ‐DRB1 alleles

RankHLA‐AHLA‐BHLA‐CHLA‐DRB1
AlleleFreq. (%)AlleleFreq. (%)AlleleFreq. (%)AlleleFreq. (%)
102:01g22.82051:01g9.79804:01g17.52607:01g12.525
224:02g12.28618:01g9.52307:01g17.10811:01g11.632
301:01g11.52835:01g8.00406:02g9.92111:04g10.067
403:01g10.62808:01g5.76012:03g9.02803:01g9.472
511:01g5.97307:02g5.23907:02g6.58201:01g5.987
632:01g5.21744:02g4.40205:01g5.77315:01g5.874
726:01g4.58544:03g3.82802:02g4.60014:01g5.389
868:01g2.93949:01g3.54815:02g3.84513:01g5.135
930:01g2.69738:01g3.45103:03g3.60416:01g4.957
1023:01g2.66113:02g3.41908:02g3.41313:02g4.937
1129:02g2.55635:03g3.37201:02g3.38101:02g2.447
1231:01g2.33615:01g3.11616:01g2.53308:01g2.017
1330:02g1.87314:02g2.93114:02g2.31510:01g1.765
1402:05g1.83935:02g2.74903:04g1.93204:03g1.642
1533:01g1.74557:01g2.55912:02g1.63704:01g1.636
1625:01g1.64558:01g1.97617:01g1.48811:03g1.572
1768:02g0.80250:01g1.95407:04g1.25315:02g1.396
1803:02g0.79255:01g1.93616:02g1.11404:02g1.381
1929:01g0.75839:01g1.92515:05g0.91504:05g1.379
2033:03g0.62352:01g1.62803:02g0.68912:01g1.335
Figure 1

Cumulative HLA‐A, ‐B, ‐C, ‐DRB1 allele frequencies

The first 20 frequency‐ranked HLA‐A, ‐B, ‐C and ‐DRB1 alleles Cumulative HLA‐A, ‐B, ‐C, ‐DRB1 allele frequencies The number of observed alleles (Table S3) was then counted in the sample of 55 538 individuals (data set 3) with no “P” group or “G” group alleles at any locus, and they included 226 HLA‐A, 343 HLA‐B, 201 HLA‐C and 210 HLADRB1 alleles, which accounted for 5.4%, 6.7%, 5.2% and 8.5%, respectively, of the alleles known at each locus (IPD‐IMGT/HLA Database Release 3.32, April 2018; http://www.ebi.ac.uk/ipd/imgt/hla/stats.html).

Common and well‐documented alleles

A total of 692 two field HLA‐A, ‐B, ‐C and ‐DRB1 alleles were observed in the Italian population. According to the ASHI criteria,20 we identified 318 (46%) common (C) and well‐documented (WD) alleles (Table 3). Among these, 30, 50, 24 and 35 of the HLA‐A, ‐B, ‐C and ‐DRB1 alleles, respectively, were common with frequencies greater than 0.001, and 38, 72, 33 and 36, respectively were well‐documented alleles observed in at least five independent unrelated individuals or three times in a specific haplotype in unrelated individuals. Almost all individuals in our sample carry at least one CWD allele on each investigated HLA locus.
Table 3

Number of common (C) and well‐documented (WD) alleles (CWD) in 55 538 Italian individuals

LocusCWDno CWDCWDTotal number of HLA alleles% CWD
A3038846815244.7
B507212612224849.2
C2433775713442.5
DRB13536877115844.9
Total13917937431869246.0
Number of common (C) and well‐documented (WD) alleles (CWD) in 55 538 Italian individuals According to these results, the “no CWD” alleles in the Italian population represent 54% of the total. This high percentage is mainly due to the high number of alleles that appeared just once in data set 3 (from 52.4% at locus A to 68.8% at locus C). The CWD Italian allele catalogue was also compared with the most recent EFI CWD21 catalogue (Table 4).
Table 4

Comparison between the numbers of Italian common (C) and well‐documented (WD) alleles (CWD), ASHI criteria vs. EFI catalogue

EFI CWD catalogue
Italian CWD according to ASHICWDNo CWD
OverallC139 132 70
WD17935 75 69
No CWD3745125 244
Total692172207313
Locus AC30 28 20
WD385 17 16
No CWD84234 48
Total152355364
Locus BC50 46 40
WD7215 32 25
No CWD126142 83
Total2486278108
Locus CC24 23 10
WD337 13 13
No CWD77225 50
Total134323963
Locus DRB1C35 35 00
WD368 13 15
No CWD87024 63
Total158433778
Comparison between the numbers of Italian common (C) and well‐documented (WD) alleles (CWD), ASHI criteria vs. EFI catalogue We found that 65% of the Italian catalogue is shared with the EFI catalogue, and notably, all our common alleles are inside the EFI CWD group. In this comparison, we also observed 130 “no CWD” Italian alleles that move to the C and, more frequently, to the WD allele category according to the EFI catalogue. This result is not surprising because the EFI CWD catalogue has more WD alleles, reflecting its larger samples. Vice versa, 69 Italian WD alleles (10%) are not represented in the EFI CWD catalogue, and the prevalence of “no CWD” alleles remained 45.2%, in confirmation of the previous results.

Haplotype frequencies

The frequencies of HLA‐A‐B‐C‐DRB1 haplotypes were estimated from the sample of 120 926 IBMDR donors with the EM algorithm using the Arlequin programme. The algorithm calculated the frequencies of the 25 057 most frequent haplotypes according to a cumulative frequency of 99.99%. Table S4 lists the haplotypes with a frequency >0.0005. The most frequent haplotypes (>1%) are HLA‐A*01:01~B*08:01~C*07:01~DRB1*03:01 (2.5%); HLA‐A*02:01~B*18:01~C*07:01~DRB1*11:04 (1.1%) and HLA‐A*30:01~B*13:02~C*06:02~DRB1*07:01 (1.1%). The list of the 50 most common haplotypes found in Italy is shown in Table 5. The frequencies of the 10 most common haplotypes sum up to 11%.
Table 5

The 50 most common haplotypes observed in the Italian population

RankHLA haplotypeFrequency
ABCDRB1
101:01g08:01g07:01g03:01g0.025357
202:01g18:01g07:01g11:04g0.011435
330:01g13:02g06:02g07:01g0.010879
429:02g44:03g16:01g07:01g0.010829
503:01g07:02g07:02g15:01g0.010167
633:01g14:02g08:02g01:02g0.009459
724:02g35:02g04:01g11:04g0.009276
830:02g18:01g05:01g03:01g0.008925
903:01g35:01g04:01g01:01g0.007673
1001:01g57:01g06:02g07:01g0.006078
1111:01g35:01g04:01g01:01g0.005041
1223:01g44:03g04:01g07:01g0.004771
1302:01g13:02g06:02g07:01g0.004577
1402:01g35:01g04:01g14:01g0.004511
1502:01g07:02g07:02g15:01g0.004326
1611:01g35:01g04:01g14:01g0.004191
1724:02g18:01g12:03g11:04g0.004070
1802:01g18:01g05:01g03:01g0.003987
1902:05g50:01g06:02g07:01g0.003841
2002:01g08:01g07:01g03:01g0.003554
2124:02g18:01g07:01g11:04g0.003441
2223:01g49:01g07:01g11:01g0.003120
2326:01g38:01g12:03g13:01g0.003038
2402:05g58:01g07:01g16:01g0.002856
2524:02g07:02g07:02g15:01g0.002715
2602:01g51:01g15:02g11:01g0.002711
2702:01g57:01g06:02g07:01g0.002604
2801:01g52:01g12:02g15:02g0.002543
2901:01g15:17g07:01g13:02g0.002512
3002:01g44:02g05:01g04:01g0.002458
3102:01g44:02g05:01g11:01g0.002458
3225:01g18:01g12:03g15:01g0.002451
3301:01g35:02g04:01g11:04g0.002418
3402:01g51:01g01:02g11:01g0.002297
3511:01g52:01g12:02g15:02g0.002279
3602:01g18:01g07:01g11:01g0.002149
3702:01g44:02g05:01g13:01g0.002074
3824:02g13:02g06:02g07:01g0.002074
3901:01g37:01g06:02g10:01g0.002046
4002:01g39:01g12:03g16:01g0.001893
4124:02g08:01g07:01g03:01g0.001840
4211:01g35:01g04:01g11:01g0.001839
4302:01g38:01g12:03g13:01g0.001800
4468:01g44:02g07:04g11:01g0.001793
4503:01g07:02g07:02g11:01g0.001788
4602:01g51:01g14:02g11:01g0.001776
4702:01g51:01g02:02g11:01g0.001755
4802:01g18:01g12:03g11:04g0.001706
4901:01g08:01g07:01g11:01g0.001701
5002:01g35:01g04:01g01:01g0.001643
The 50 most common haplotypes observed in the Italian population

Regional distribution of alleles

Haplotype frequencies were compared between national and regional data. The 10 most frequent haplotypes for each Italian region were tabulated and compared with the corresponding national data to assess regional differences in the frequency data (Table S5). The sum of squares (SS) was then calculated to evaluate which regions deviate from the national values (Figure 2), and we found no correlation between the SS value and the regional sample size (P = 0.359).
Figure 2

Sum of squares (SS) frequency differences between each region and the national data

Sum of squares (SS) frequency differences between each region and the national data Our results showed that the regions with haplotype frequencies significantly deviating from the national frequencies are Sardinia (SS = 12 746), Molise (SS = 1509) and Valle D'Aosta (SS = 940). The graphical representation of this comparison is shown in Figure 3 for the regions with SS > 500.
Figure 3

Comparison between national and regional haplotype frequencies (per thousand)

Comparison between national and regional haplotype frequencies (per thousand)

DISCUSSION

In this study, we presented the HR allele and haplotype frequencies of the Italian population based on a data set of 120 926 unrelated stem cell donors of the Italian Bone Marrow Donor Registry. Compared to previous studies,16, 18, 19 our data included the largest number of HLA‐A, ‐B, ‐C and ‐DRB1 alleles and haplotypes ever analysed in Italy. The ranking and HR haplotype frequencies indicate that differences between all studies are generally small, but our data have significantly expanded the numbers of alleles observed at the HLA‐A, ‐B, ‐C and ‐DRB1 loci. We used the ASHI criteria20 to identify CWD alleles in the Italian population, and we compared the results with the classification of CWD alleles published in Reference.21 The percentage of “no CWD” HLA alleles (~50%) vs “common” and “well documented” alleles in the Italian population can be suggestive of some evolutionary pressure toward allele diversification or be the result of several migratory waves, and it could in part justify the high polymorphism of our population with the consequent major difficulty of Italian patients finding an 8/8 matched unrelated donor. Previous studies15, 16, 18 have already described the genetic heterogeneity of the Italian population as a result of the history of ancient peoples who settled in the country. Genetic drift, the cause of differentiation among the different ethnic groups and migrations among them over the centuries, probably produced the observed genetic pattern in modern Italy. In addition, other factors contributed to the variable genetic composition of Italy, such as the effect of selection following exposure to pathogens that may have favoured or disfavoured carriers of specific HLA genes. The haplotype A*01:01~B*08:01~C*07:01~DRB1*03:01 is the most frequent nationwide and is the most common in 12 out of 20 regions; nevertheless, as described in the results, some HLA four‐locus haplotypes exhibit specific regional characteristics. In Valle d'Aosta, the abovementioned national most frequent haplotype goes down to the 4th ranking position, while in the same region, the most frequent haplotype is A*02:01~B*35:01~C*04:01~DRB1*14:01, which interestingly is only the 50th at the national level; furthermore, the third most frequent haplotype in the Valle d'Aosta A*02:01~B*44:02~C*07:04~DRB1*11:01 is only at the 137th position nationwide. In Molise, among the first 10 haplotypes is a relatively rare haplotype, A*02:01~B*14:02~C*08:02~DRB1*11:01, which in the national list is ranked 2027th (0.000076%). Sardinia calls for a peculiar consideration: ranking the Sardinian regional data from the most frequent to the less frequent haplotype, the data clearly show that 8 out of 10 Sardinian more frequent haplotypes are virtually absent in the national ranking (frequency <0.01). This phenomenon is associated with the insular nature of Sardinia, a typical cause of geographic barriers to gene flow and local selective pressure. The most represented haplotype in Sardinia is A*30:02~B*18:01~C*05:01~DRB1*03.01, which is significantly associated with celiac disease27 and multiple sclerosis.28 Ordering the sum of the SS data for each region, from the most different to the most similar to the national distribution, Sardinia is the first region, followed by Molise, Valle D'Aosta, Basilicata, Calabria and so on. Apparently, the Italian southern regions are more “different” than the central and northern regions, confirming the south‐north genetic gradient observed in previous studies.18, 19 This phenomenon can hardly be associated with a single cause; although the Sardinian population suffers from past regional isolation, in other southern regions, there might be other causes explaining this phenomenon. These data showed that the Italian donor population can be split into at least four areas, with Sardinia representing an area per se and the remaining regions being broadly grouped in north/centre/south, as previously stated.18 Because donor‐recipient matching in haematopoietic stem cell transplantation should, at least, consider the HLA‐A, ‐B, ‐C and ‐DRB1 genes at HR,2 these data are highly relevant for both individual donor searches and strategic donor registry planning. The allele and haplotype frequencies obtained in this study are useful for the following purposes: (a) to determine which alleles are suitable to be defined by HR techniques because of the higher heterogeneity; (b) to assign the most likely types in the donor population, recruited in the past and still typed with low‐ and medium‐resolution methods; (c) to categorise a patient's haplotype at the beginning of the unrelated donor search as either common or uncommon in the national donor population, giving an important predictive estimation of the probability of finding a matched unrelated donor. Moreover, the determination of HR HLA diversity of donors in IBMDR, particularly at the regional level, is a valid tool to develop better recruitment and unrelated donor search strategies to satisfy requests for haematopoietic stem cell donation and will consequently be relevant for resource allocation. In addition, these data will contribute to HLA and disease association studies as well as forensic and anthropology fields.

CONFLICT OF INTEREST

The authors have declared no conflicting interests. Table S1 Dataset 2, regional sample size (RSS) and its index (sample size/resident population) calculated on the eligible resident population Click here for additional data file. Table S2 HLA‐A, ‐B, ‐C, ‐DRB1 allele frequencies calculated from 120 926 individuals Click here for additional data file. Table S3 Number of HLA alleles observed among 55 538 Italian individuals Click here for additional data file. Table S4 List of the haplotypes with frequency >0.0005 observed in the Italian population Click here for additional data file. Table S5 Comparison of haplotype frequencies (per thousand) between national and regional data Click here for additional data file.
  27 in total

1.  Accuracy of haplotype frequency estimation for biallelic loci, via the expectation-maximization algorithm for unphased diploid genotype data.

Authors:  D Fallin; N J Schork
Journal:  Am J Hum Genet       Date:  2000-08-22       Impact factor: 11.025

2.  Performing the exact test of Hardy-Weinberg proportion for multiple alleles.

Authors:  S W Guo; E A Thompson
Journal:  Biometrics       Date:  1992-06       Impact factor: 2.571

3.  Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows.

Authors:  Laurent Excoffier; Heidi E L Lischer
Journal:  Mol Ecol Resour       Date:  2010-03-01       Impact factor: 7.090

4.  Common and well-documented HLA alleles over all of Europe and within European sub-regions: A catalogue from the European Federation for Immunogenetics.

Authors:  A Sanchez-Mazas; J M Nunes; D Middleton; J Sauter; S Buhler; A McCabe; J Hofmann; D M Baier; A H Schmidt; G Nicoloso; M Andreani; Z Grubic; J-M Tiercy; K Fleischhauer
Journal:  HLA       Date:  2017-02       Impact factor: 4.513

5.  Association between the ancestral haplotype HLA A30B18DR3 and multiple sclerosis in central Sardinia.

Authors:  P P Bitti; B S Murgia; A Ticca; R Ferrai; L Musu; M L Piras; E Puledda; S Campo; S Durando; C Montomoli; D G Clayton; A P Mander; L Bernardinelli
Journal:  Genet Epidemiol       Date:  2001-02       Impact factor: 2.135

6.  Current Use and Trends in Hematopoietic Cell Transplantation in the United States.

Authors:  Anita D'Souza; Stephanie Lee; Xiaochun Zhu; Marcelo Pasquini
Journal:  Biol Blood Marrow Transplant       Date:  2017-06-09       Impact factor: 5.742

7.  High-resolution donor-recipient HLA matching contributes to the success of unrelated donor marrow transplantation.

Authors:  Stephanie J Lee; John Klein; Michael Haagenson; Lee Ann Baxter-Lowe; Dennis L Confer; Mary Eapen; Marcelo Fernandez-Vina; Neal Flomenberg; Mary Horowitz; Carolyn K Hurley; Harriet Noreen; Machteld Oudshoorn; Effie Petersdorf; Michelle Setterholm; Stephen Spellman; Daniel Weisdorf; Thomas M Williams; Claudio Anasetti
Journal:  Blood       Date:  2007-09-04       Impact factor: 22.113

8.  HLA polymorphisms in Italian bone marrow donors: a regional analysis.

Authors:  S Rendine; I Borelli; M Barbanti; N Sacchi; S Roggero; E S Curtoni
Journal:  Tissue Antigens       Date:  1998-08

9.  Common and well-documented HLA alleles: 2012 update to the CWD catalogue.

Authors:  S J Mack; P Cano; J A Hollenbach; J He; C K Hurley; D Middleton; M E Moraes; S E Pereira; J H Kempenich; E F Reed; M Setterholm; A G Smith; M G Tilanus; M Torres; M D Varney; C E M Voorter; G F Fischer; K Fleischhauer; D Goodridge; W Klitz; A-M Little; M Maiers; S G E Marsh; C R Müller; H Noreen; E H Rozemuller; A Sanchez-Mazas; D Senitzer; E Trachtenberg; Marcelo Fernandez-Vina
Journal:  Tissue Antigens       Date:  2013-04

10.  A comparison between allogeneic stem cell transplantation from unmanipulated haploidentical and unrelated donors in acute leukemia.

Authors:  Simona Piemontese; F Ciceri; M Labopin; W Arcese; S Kyrcz-Krzemien; S Santarone; H Huang; D Beelen; N C Gorin; C Craddock; Z Gulbas; A Bacigalupo; M Mohty; A Nagler
Journal:  J Hematol Oncol       Date:  2017-01-19       Impact factor: 17.388

View more
  6 in total

1.  High-resolution analysis of the HLA-A, -B, -C and -DRB1 alleles and national and regional haplotype frequencies based on 120 926 volunteers from the Italian Bone Marrow Donor Registry.

Authors:  Nicoletta Sacchi; Mauro Castagnetta; Valeria Miotti; Lucia Garbarino; Annamaria Gallina
Journal:  HLA       Date:  2019-07-03       Impact factor: 4.513

2.  HLA-B*44 and C*01 Prevalence Correlates with Covid19 Spreading across Italy.

Authors:  Pierpaolo Correale; Luciano Mutti; Francesca Pentimalli; Giovanni Baglio; Rita Emilena Saladino; Pierpaolo Sileri; Antonio Giordano
Journal:  Int J Mol Sci       Date:  2020-07-23       Impact factor: 5.923

Review 3.  Approaching Genetics Through the MHC Lens: Tools and Methods for HLA Research.

Authors:  Venceslas Douillard; Erick C Castelli; Steven J Mack; Jill A Hollenbach; Pierre-Antoine Gourraud; Nicolas Vince; Sophie Limou
Journal:  Front Genet       Date:  2021-12-02       Impact factor: 4.599

4.  Correlation of the two most frequent HLA haplotypes in the Italian population to the differential regional incidence of Covid-19.

Authors:  Simona Pisanti; Joris Deelen; Anna Maria Gallina; Mariella Caputo; Marianna Citro; Mario Abate; Nicoletta Sacchi; Carmine Vecchione; Rosanna Martinelli
Journal:  J Transl Med       Date:  2020-09-15       Impact factor: 5.531

5.  HLA allele frequencies and susceptibility to COVID-19 in a group of 99 Italian patients.

Authors:  Antonio Novelli; Marco Andreani; Michela Biancolella; Laura Liberatoscioli; Chiara Passarelli; Vito Luigi Colona; Paola Rogliani; Francesca Leonardis; Andrea Campana; Rita Carsetti; Massimo Andreoni; Sergio Bernardini; Giuseppe Novelli; Franco Locatelli
Journal:  HLA       Date:  2020-09-03       Impact factor: 8.762

6.  Tumor genotype, location, and malignant potential shape the immunogenicity of primary untreated gastrointestinal stromal tumors.

Authors:  Daniela Gasparotto; Marta Sbaraglia; Sabrina Rossi; Davide Baldazzi; Monica Brenca; Alessia Mondello; Federica Nardi; Dominga Racanelli; Matilde Cacciatore; Angelo Paolo Dei Tos; Roberta Maestro
Journal:  JCI Insight       Date:  2020-11-19
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.