| Literature DB >> 27219013 |
W Bochtler1, L Gragert2, Z I Patel3, J Robinson3,4, D Steiner5, J A Hofmann6, J Pingel6, A Baouz7, A Melis8, J Schneider2, H-P Eberhard1, M Oudshoorn8, S G E Marsh3,4, M Maiers2, C R Müller1.
Abstract
The accuracy of human leukocyte antigen (HLA)-matching algorithms is a prerequisite for the correct and efficient identification of optimal unrelated donors for patients requiring hematopoietic stem cell transplantation. The goal of this World Marrow Donor Association study was to validate established matching algorithms from different international donor registries by challenging them with simulated input data and subsequently comparing the output. This experiment addressed three specific aspects of HLA matching using different data sets for tasks of increasing complexity. The first two tasks targeted the traditional matching approach identifying discrepancies between patient and donor HLA genotypes by counting antigen and allele differences. Contemporary matching procedures predicting the probability for HLA identity using haplotype frequencies were addressed by the third task. In each task, the identified disparities between the results of the participating computer programs were analyzed, classified and quantified. This study led to a deep understanding of the algorithms participating and finally produced virtually identical results. The unresolved discrepancies total to less than 1%, 4% and 2% for the three tasks and are mostly because of individual decisions in the design of the programs. Based on these findings, reference results for the three input data sets were compiled that can be used to validate future matching algorithms and thus improve the quality of the global donor search process.Entities:
Keywords: donor registry; hematopoietic stem cell transplantation; human leukocyte antigen; immunogenetics; matching algorithm; patient-donor matching
Mesh:
Substances:
Year: 2016 PMID: 27219013 PMCID: PMC5089599 DOI: 10.1111/tan.12817
Source DB: PubMed Journal: HLA ISSN: 2059-2302 Impact factor: 4.513
Figure 1Result file formats for matching validation task (MVT) 1−3 (A to C). A single line had to be reported for each of the 107 possible patient−donor pairs using semicolon as field separator.
Observed disparities between the seven results submitted for MVT 1a
| A tot | B tot | DRB1 tot | |
|---|---|---|---|
| Unique HLA typing pairs | 123,708 | 580,659 | 229,504 |
| Discrepant cases between all result files | 857 | 2450 | 1951 |
| Discrepant cases between all result files (%) | 0.69 | 0.42 | 0.85 |
The number of unique locus‐wise typing pairs, the number of discrepant cases and the percentage of discrepant cases are shown for the total number of differences (tot) for the loci HLA‐A, ‐B and ‐DRB1.
HLA, human leukocyte antigen; MVT, matching validation task.
Observed disparities between the six results submitted for MVT 2a
| A tot | A ag | B tot | B ag | DRB1 tot | DRB1 ag | |
|---|---|---|---|---|---|---|
| Unique HLA typing pairs | 124,950 | 124,950 | 592,800 | 592,800 | 232,311 | 232,311 |
| Discrepant cases between all result files | 661 | 4774 | 2751 | 14,358 | 1018 | 5573 |
| Discrepant cases between all result files (%) | 0.53 | 3.82 | 0.46 | 2.42 | 0.44 | 2.40 |
The number of unique locus‐wise typing pairs, the number of discrepant cases and the percentage of discrepant cases are shown for the total number of differences (tot) and the number of antigen differences (ag) for the loci HLA‐A, ‐B and ‐DRB1.
HLA, human leukocyte antigen; MVT, matching validation task.
Number and percentage (in brackets) of discrepant patient−donor pairs (1 × 107 data items) for any two participants concerning the locus‐specific match grade characters (A/P/M) for all five HLA loci for MVT 3
| #1 | #2 | #3 | #4 | #5 | |
|---|---|---|---|---|---|
| #1 | × | 9,356,595 (93.6) | 3681 (<0.1) | 9,356,505 (93.6) | 4 (<0.1) |
| #2 | × | 9,355,983 (93.6) | 52 (<0.1) | 9,356,505 (93.6) | |
| #3 | × | 9,355,983 (93.6) | 3677 (<0.1) | ||
| #4 | × | 9,356,505 (93.6) | |||
| #5 | × |
HLA, human leukocyte antigen; MVT, matching validation task.
Number of discrepant patient−donor pairs (1 × 107 data items) for any two participants concerning the 9/10 and 10/10 predictions for MVT 3a
| #1 | #2 | #3 | #4 | #5 | |
|---|---|---|---|---|---|
| #1 | × | 454 1,2 | 94 2 | 454 1,2 | 2111 1,2,3 |
| #2 | × | 361 1 | 0 | 1783 3 | |
| #3 | × | 361 1 | 2018 1,3 | ||
| #4 | × | 1783 3 | |||
| #5 | × |
MVT, matching validation task.
All percentages are below 0.1 and not shown. Superscripts indicate reasons for discrepancies as: 1, discrepant mismatch counting between a homozygous patient and a donor with a null allele or vice versa cases; 2, discrepant mismatch counting when patient and donor are homozygous; 3, trimming of the set of possible diplotypes.
Number and percentage (in brackets) of discrepant patient−donor pairs (1 × 107 data items) for any two participants concerning the 2/2 locus‐specific predictions for all five HLA loci for MVT 3a
| #1 | #2 | #3 | #4 | #5 | |
|---|---|---|---|---|---|
| #1 | × |
729 1,2,5
| n/a 6 |
678 1,2,5
|
186,919 2,3,5
|
| #2 | × | n/a 6 |
225 4,5
|
186,453 3,5
| |
| #3 | × | n/a 6 | n/a 6 | ||
| #4 | × |
186,430 3,5
| |||
| #5 | × |
HLA, human leukocyte antigen; MVT, matching validation task; n/a, not applicable.
Superscripts indicate reasons for discrepancies as: 1, discrepant mismatch counting between a homozygous patient and a donor with a null allele or vice versa cases; 2, discrepant mismatch counting when patient and donor are homozygous; 3, trimming of the set of possible diplotypes; 4, treatment of discontinued multiple allele codes; 5, numerical artifacts due to floating point arithmetic in combination with rounding; 6, conditional locus‐specific probabilities.
Figure 2Distribution of the discrepancies for the 9/10 and 10/10 predictions of algorithm #1 compared to algorithm #2 in matching validation task (MVT) 3. The comparison encompasses 2 × 107 data items.
Figure 3Distribution of the locus‐specific discrepancies for the human leukocyte antigen (HLA)‐C predictions of algorithm #1 compared to algorithm #2 in matching validation task (MVT) 3. The comparison encompasses 1 × 107 data items. For HLA‐C, a substantial number of donors were not typed, complicating the matching computations and therefore leading to higher and more instructive deviations.