| Literature DB >> 33991581 |
Matthias Haimel1, Julia Pazmandi1, Raúl Jiménez Heredia1, Jasmin Dmytrus1, Sevgi Köstel Bal1, Samaneh Zoghi1, Paul van Daele2, Tracy A Briggs3, Carine Wouters4, Brigitte Bader-Meunier5, Florence A Aeschlimann5, Roberta Caorsi6, Despina Eleftheriou7, Esther Hoppenreijs8, Elisabeth Salzer9, Shahrzad Bakhtiar10, Beata Derfalvi11, Francesco Saettini12, Maaike A A Kusters7, Reem Elfeky7, Johannes Trück13, Jacques G Rivière14, Mirjam van der Burg15, Marco Gattorno6, Markus G Seidel16, Siobhan Burns17, Klaus Warnatz18, Fabian Hauck19, Paul Brogan7, Kimberly C Gilmour20, Catharina Schuetz21, Anna Simon22, Christoph Bock23, Sophie Hambleton24, Esther de Vries25, Peter N Robinson26, Marielle van Gijn27, Kaan Boztug28.
Abstract
BACKGROUND: Accurate, detailed, and standardized phenotypic descriptions are essential to support diagnostic interpretation of genetic variants and to discover new diseases. The Human Phenotype Ontology (HPO), extensively used in rare disease research, provides a rich collection of vocabulary with standardized phenotypic descriptions in a hierarchical structure. However, to date, the use of HPO has not yet been widely implemented in the field of inborn errors of immunity (IEIs), mainly due to a lack of comprehensive IEI-related terms.Entities:
Keywords: HPO; diagnostic support; disease classification; genetic analysis; immunodeficiencies; inborn errors of immunity; ontology; patient matching; phenotype; rare diseases
Mesh:
Year: 2021 PMID: 33991581 PMCID: PMC9346194 DOI: 10.1016/j.jaci.2021.04.033
Source DB: PubMed Journal: J Allergy Clin Immunol ISSN: 0091-6749 Impact factor: 14.290
FIG 1.Pipeline for standardized reannotation of IEI diseases. First, scientific publications were collected by experts for each disease within the subgroups. Second, HPO terms were extracted from the provided publications for each disease using machine learning and summarized into Excel documents. Third, a 2-tier expert review evaluated the text-mined terms, suggested additional terms if required, and the responsible working group agreed on the final HPO annotations for each disease. Fourth, data were collated, and the agreed terms were submitted to HPO.
FIG 2.Revision and expansion of the HPO tree. A, Schematic representation of the restructuring of the HPO tree. Main branches of the HPO tree where restructuring was performed are marked with light green. B, “Abnormality of temperature,” “Abnormality of immunoglobulin level,” and “Unusual infections” as examples of revised branches of the HPO tree. New additions and suggestions are marked with green, and repositioned terms are marked with yellow.
FIG 3.Results of disease reannotation. A, HPO annotation availability in the subset of 72 diseases. B, Distribution of number of available HPO terms per disease. C, Pipeline for the reannotation process. D, Distribution of the number of articles used per disease for the reannotation pipeline. E, Number of mined terms per disease. Each dot represents a disease. F, All mined vs all accepted terms. G, Number of available terms per disease before and after reannotation. Each dot represents a disease. H, Mean information content available per disease before and after reannotation. I, The aggregate mean annotation per disease after reannotation. J, All text-mined terms from PAD publications. K, Frequency distribution of different PAD terms according to the experts.
FIG 4.Patient-disease matching. A, Schematic overview of the different steps of patient-to-disease matching. First, the phenotypes were identified in a patient’s clinical history. Second, these phenotypes were translated to HPO terms. Finally, patient phenotype to disease matching was measured by Lin similarity. B, Matching patient 1 to a diagnosis. C, Similarity of patients in patient cohort to genetic diagnosis before and after reannotation. D, The rank of correct clinical diagnosis more often is in the top 10 of matched diseases after reannotation. E, Improvement of ranks of clinical diagnosis before and after reannotation. Significance was assessed by Student t test.
FIG 5.Phenotypic similarity of diseases before and after reannotation. Diseases are annotated with the IUIS disease group (inner circle), subgroup (outer circle), and OMIM identifier. A, Clustering of diseases based on phenotypic similarity before reannotation. B, Clustering of diseases based on phenotypic similarity after reannotation. OMIM, Online Mendelian Inheritance in Men; SCID, severe combined immunodeficiency.