Christian Schäfer1, Alexander H Schmidt1, Jürgen Sauter2. 1. DKMS gemeinnützige GmbH, Kressbach 1, 72072, Tübingen, Germany. 2. DKMS gemeinnützige GmbH, Kressbach 1, 72072, Tübingen, Germany. sauter@dkms.de.
Abstract
BACKGROUND: Knowledge of HLA haplotypes is helpful in many settings as disease association studies, population genetics, or hematopoietic stem cell transplantation. Regarding the recruitment of unrelated hematopoietic stem cell donors, HLA haplotype frequencies of specific populations are used to optimize both donor searches for individual patients and strategic donor registry planning. However, the estimation of haplotype frequencies from HLA genotyping data is challenged by the large amount of genotype data, the complex HLA nomenclature, and the heterogeneous and ambiguous nature of typing records. RESULTS: To meet these challenges, we have developed the open-source software Hapl-o-Mat. It estimates haplotype frequencies from population data including an arbitrary number of loci using an expectation-maximization algorithm. Its key features are the processing of different HLA typing resolutions within a given population sample and the handling of ambiguities recorded via multiple allele codes or genotype list strings. Implemented in C++, Hapl-o-Mat facilitates efficient haplotype frequency estimation from large amounts of genotype data. We demonstrate its accuracy and performance on the basis of artificial and real genotype data. CONCLUSIONS: Hapl-o-Mat is a versatile and efficient software for HLA haplotype frequency estimation. Its capability of processing various forms of HLA genotype data allows for a straightforward haplotype frequency estimation from typing records usually found in stem cell donor registries.
BACKGROUND: Knowledge of HLA haplotypes is helpful in many settings as disease association studies, population genetics, or hematopoietic stem cell transplantation. Regarding the recruitment of unrelated hematopoietic stem cell donors, HLA haplotype frequencies of specific populations are used to optimize both donor searches for individual patients and strategic donor registry planning. However, the estimation of haplotype frequencies from HLA genotyping data is challenged by the large amount of genotype data, the complex HLA nomenclature, and the heterogeneous and ambiguous nature of typing records. RESULTS: To meet these challenges, we have developed the open-source software Hapl-o-Mat. It estimates haplotype frequencies from population data including an arbitrary number of loci using an expectation-maximization algorithm. Its key features are the processing of different HLA typing resolutions within a given population sample and the handling of ambiguities recorded via multiple allele codes or genotype list strings. Implemented in C++, Hapl-o-Mat facilitates efficient haplotype frequency estimation from large amounts of genotype data. We demonstrate its accuracy and performance on the basis of artificial and real genotype data. CONCLUSIONS: Hapl-o-Mat is a versatile and efficient software for HLA haplotype frequency estimation. Its capability of processing various forms of HLA genotype data allows for a straightforward haplotype frequency estimation from typing records usually found in stem cell donor registries.
Authors: Stephanie J Lee; John Klein; Michael Haagenson; Lee Ann Baxter-Lowe; Dennis L Confer; Mary Eapen; Marcelo Fernandez-Vina; Neal Flomenberg; Mary Horowitz; Carolyn K Hurley; Harriet Noreen; Machteld Oudshoorn; Effie Petersdorf; Michelle Setterholm; Stephen Spellman; Daniel Weisdorf; Thomas M Williams; Claudio Anasetti Journal: Blood Date: 2007-09-04 Impact factor: 22.113
Authors: James Robinson; Jason A Halliwell; James D Hayhurst; Paul Flicek; Peter Parham; Steven G E Marsh Journal: Nucleic Acids Res Date: 2014-11-20 Impact factor: 16.971
Authors: N Ikeda; H Kojima; M Nishikawa; K Hayashi; T Futagami; T Tsujino; Y Kusunoki; N Fujii; S Suegami; Y Miyazaki; D Middleton; H Tanaka; H Saji Journal: Tissue Antigens Date: 2015-02-27
Authors: W Bochtler; L Gragert; Z I Patel; J Robinson; D Steiner; J A Hofmann; J Pingel; A Baouz; A Melis; J Schneider; H-P Eberhard; M Oudshoorn; S G E Marsh; M Maiers; C R Müller Journal: HLA Date: 2016-05-24 Impact factor: 4.513
Authors: Gonzalo Montero-Martín; Kalyan C Mallempati; Sridevi Gangavarapu; Francisco Sánchez-Gordo; Maria J Herrero-Mata; Antonio Balas; Jose L Vicario; Florentino Sánchez-García; Maria F González-Escribano; Manuel Muro; Maria R Moya-Quiles; Rafael González-Fernández; Javier G Ocejo-Vinyals; Luis Marín; Lisa E Creary; Kazutoyo Osoegawa; Tamara Vayntrub; Jose L Caro-Oleas; Carlos Vilches; Dolores Planelles; Marcelo A Fernández-Viña Journal: Hum Immunol Date: 2019-02-11 Impact factor: 2.850
Authors: Susanne Seitz; Vinzenz Lange; Paul J Norman; Jürgen Sauter; Alexander H Schmidt Journal: Int J Immunogenet Date: 2021-09-23 Impact factor: 2.385
Authors: B A González-Quezada; L E Creary; A J Munguia-Saldaña; H Flores-Aguilar; M A Fernández-Viña; C Gorodezky Journal: Hum Immunol Date: 2019-01-29 Impact factor: 2.850
Authors: Aviva Geretz; Lauryn Cofer; Philip K Ehrenberg; Jeffrey R Currier; In-Kyu Yoon; Maria T P Alera; Richard Jarman; Alan L Rothman; Rasmi Thomas Journal: Hum Immunol Date: 2020-07-09 Impact factor: 2.850
Authors: Mqondisi Tshabalala; Charlotte Ingram; Terry Schlaphoff; Veronica Borrill; Alan Christoffels; Michael S Pepper Journal: J Immunol Res Date: 2018-04-23 Impact factor: 4.818
Authors: Alexander H Schmidt; Jürgen Sauter; Daniel M Baier; Jessica Daiss; Andreas Keller; Anja Klussmeier; Thilo Mengling; Gabi Rall; Tobias Riethmüller; Gerhard Schöfl; Ute V Solloch; Tigran Torosian; David Means; Helen Kelly; Latha Jagannathan; Patrick Paul; Anette S Giani; Sabine Hildebrand; Stephan Schumacher; Jan Markert; Monika Füssel; Jan A Hofmann; Thomas Schäfer; Julia Pingel; Vinzenz Lange; Johannes Schetelig Journal: Int J Immunogenet Date: 2020-01-06 Impact factor: 1.466