Literature DB >> 33991581

Curation and expansion of Human Phenotype Ontology for defined groups of inborn errors of immunity.

Matthias Haimel¹, Julia Pazmandi¹, Raúl Jiménez Heredia¹, Jasmin Dmytrus¹, Sevgi Köstel Bal¹, Samaneh Zoghi¹, Paul van Daele², Tracy A Briggs³, Carine Wouters⁴, Brigitte Bader-Meunier⁵, Florence A Aeschlimann⁵, Roberta Caorsi⁶, Despina Eleftheriou⁷, Esther Hoppenreijs⁸, Elisabeth Salzer⁹, Shahrzad Bakhtiar¹⁰, Beata Derfalvi¹¹, Francesco Saettini¹², Maaike A A Kusters⁷, Reem Elfeky⁷, Johannes Trück¹³, Jacques G Rivière¹⁴, Mirjam van der Burg¹⁵, Marco Gattorno⁶, Markus G Seidel¹⁶, Siobhan Burns¹⁷, Klaus Warnatz¹⁸, Fabian Hauck¹⁹, Paul Brogan⁷, Kimberly C Gilmour²⁰, Catharina Schuetz²¹, Anna Simon²², Christoph Bock²³, Sophie Hambleton²⁴, Esther de Vries²⁵, Peter N Robinson²⁶, Marielle van Gijn²⁷, Kaan Boztug²⁸.

Abstract

BACKGROUND: Accurate, detailed, and standardized phenotypic descriptions are essential to support diagnostic interpretation of genetic variants and to discover new diseases. The Human Phenotype Ontology (HPO), extensively used in rare disease research, provides a rich collection of vocabulary with standardized phenotypic descriptions in a hierarchical structure. However, to date, the use of HPO has not yet been widely implemented in the field of inborn errors of immunity (IEIs), mainly due to a lack of comprehensive IEI-related terms.
OBJECTIVES: We sought to systematically review available terms in HPO for the depiction of IEIs, to expand HPO, yielding more comprehensive sets of terms, and to reannotate IEIs with HPO terms to provide accurate, standardized phenotypic descriptions.
METHODS: We initiated a collaboration involving expert clinicians, geneticists, researchers working on IEIs, and bioinformaticians. Multiple branches of the HPO tree were restructured and extended on the basis of expert review. Our ontology-guided machine learning coupled with a 2-tier expert review was applied to reannotate defined subgroups of IEIs.
RESULTS: We revised and expanded 4 main branches of the HPO tree. Here, we reannotated 73 diseases from 4 International Union of Immunological Societies-defined IEI disease subgroups with HPO terms. We achieved a 4.7-fold increase in the number of phenotypic terms per disease. Given the new HPO annotations, we demonstrated improved ability to computationally match selected IEI cases to their known diagnosis, and improved phenotype-driven disease classification.
CONCLUSIONS: Our targeted expansion and reannotation presents enhanced precision of disease annotation, will enable superior HPO-based IEI characterization, and hence benefit both IEI diagnostic and research activities.

Entities: Chemical

Keywords: HPO; diagnostic support; disease classification; genetic analysis; immunodeficiencies; inborn errors of immunity; ontology; patient matching; phenotype; rare diseases

Mesh：

Year: 2021 PMID： 33991581 PMCID： PMC9346194 DOI： 10.1016/j.jaci.2021.04.033

Source DB: PubMed Journal: J Allergy Clin Immunol ISSN： 0091-6749 Impact factor: 14.290

Rare and undiagnosed diseases pose challenges for affected patients, clinicians, and researchers working to improve diagnostic and therapeutic approaches. Because of the rarity, clinicians often see only a few patients with specific rare phenotypes throughout their careers, leading to considerable diagnostic delay.[1] Genetic research on rare diseases often relies on single pedigrees or a few patients, leaving many patients undiagnosed.[1] Compiling a cohort of patients—so-called patient matching—is often crucial to gain insight into the phenotypic spectrum, natural/clinical history of the disease, and adequate monitoring and treatment strategies. The rare disease community has recognized these challenges and established tools enabling efficient data sharing across institutions and borders, including genetic data exchange through the Matchmaker Exchange platform[2] to solve undiagnosed exomes and genomes.[3] These platforms, however, are highly dependent on accurately phenotyped and categorized patients and standardized disease classifications. To date, several nomenclatures and reference systems for diseases have been developed.[4,5] In parallel, ontologies were established to provide a more systematic, hierarchical classification of diseases.[6,7] However, these nomenclatures group patients by disease label and do not describe the underlying phenotypic features. Consequently, clinical features, laboratory measurements, and anatomical and functional phenotypes of patients are often described with variable quality and specificity, which hampers patient matching, diagnostic efficiency, genetic variant prioritization in diagnostic pipelines, and global data exchange. Given these challenges and the need for accurate, standardized phenotyping, the Human Phenotype Ontology (HPO) system was conceptualized and published with initial terminology in 2008.[8,9] To date, HPO provides the most comprehensive deep phenotyping resource for rare diseases for clinicians, researchers, bioinformaticians, and electronic health record systems in the world. HPO is used in many projects including the 100,000 Genomes Project, the NIH Undiagnosed Disease Program and Network, the Undiagnosed Diseases Network International, RD-CONNECT, and SOLVE-RD.[1,10-13] HPO is a community-based tool and is increasingly adapted as the standard to describe phenotypic abnormalities for everyday use.[14] Each term in HPO describes a distinct phenotypic feature (eg, lymphadenopathy, HP:0002716), and the HPO tree structure allows similarity measures between patient phenotypes. HPO contains more than 200,000 phenotypic annotations for hereditary diseases, of which 2,120 are considered rare diseases. Inborn errors of immunity (IEIs) form a subgroup of these rare diseases. Clinical experts in IEI agree that a major barrier to the adoption of HPO terminology is that it has not been used widely for IEIs. This is partly due to the lack of disease-specific HPO terms to describe patients with IEI.[15] Adequate depiction of the complex clinical and immunologic phenotypes of IEI disease entities with HPO terms would allow discrimination between heterogeneous groups of IEIs. Illustrating the lack of terms, in 2017, HPO contained more than 11,000 terms, of which 5,000 terms have been applied to the musculoskeletal system, with only 1,000 terms related to IEIs.[9,15] In addition, the phenotypic annotation of IEIs often includes results of specific immunologic assays, which pose a challenge to accurately reflect in HPO terms.[15] Because of the lack of specific HPO terms depicting results of laboratory assays, often a nonspecific broader term is used for the annotation of IEIs. Therefore, HPOs are currently not specific enough to be used for genetic analysis and diagnostic aid for IEIs. In a study addressing the clinical efficacy of genetic testing in IEI, bioinformatics tools using existing HPO terms missed the disease-causing gene in 37% of the patients with known monogenic disorders.[16] In this study, we set out to improve HPO terminology for IEIs by applying established bioinformatic methodologies coupled with expert review. The aims of this project were therefore to (1) systematically review existing HPO terms for IEIs, (2) revise ontology structures, (3) add missing terms, as well as (4) reannotate existing IEIs with HPO terms, to collectively enable systematic use of HPO by the IEI community.

METHODS

Spearheaded by the European Reference Network on Rare Primary Immunodeficiency, Autoinflammatory and Autoimmune diseases (ERN-RITA) and the European Society for Immunodeficiencies (ESID), we set up working groups comprising members of the participating immunodeficiency societies to revise and expand HPO terms for IEIs. Three workshops, numerous teleconferences, and joint task forces took place over the span of 2 years, with more than 30 participants including expert clinicians, geneticists, researchers working on IEIs, and bioinformaticians. All participating clinicians and geneticists identified through the ERN-RITA, ESID, and the International Society of Systemic Autoinflammatory Diseases are established experts in their fields from different European countries and North America. Additional scientific support provided the indispensable bioinformatics expertise.

Establishment of working structure

A remote working structure (detailed in this article’s Methods section in the Online Repository at www.jacionline.org) was launched to address gaps in the HPO tree and in the annotation of IEI diseases.

Expansion and restructuring of disease-related branches of the HPO tree

Disease-specific HPO restructuring was discussed within 4 working groups. Each group focused on a different HPO branch; the suggested changes were agreed on among all participants. Differences between centers and countries in the use of terms and definitions were highlighted during the face-to-face workshops. The results were summarized electronically in Excel documents or pictures and flipchart drawings by the main coordinators before being submitted to HPO. The full list of restructured tree elements and new submitted HPO terms is detailed in the Document S1 in this article’s Online Repository at www.jacionline.org. In addition, missing terms describing pulmonary and gastrointestinal complications of primary antibody deficiency (PAD) were discussed during teleconferences and thereafter submitted to update the HPO ontology. A list of HPO resources can be found in this article’s Methods section in the Online Repository.

Standardized reannotation of rare, genetically diagnosed diseases

A 4-step process was developed for a standardized reannotation effort across working groups and to consistently annotate IEIs (spanning more than 300 different diseases in Online Mendelian Inheritance in Man) with HPO terms (see Fig 1). Because IEIs represent a large and heterogeneous group of rare diseases, we here decided to selectively focus on defined subgroups of IEI to test the feasibility and usefulness of such an endeavor. First, publications were collected by experts for each disease within the subgroups (minimum of 2 articles per disease), representing key phenotypic presentation(s) of the specific disease. In the second step, HPO terms were extracted from the provided publications for each disease using machine learning[17] (explained in detail in this article’s Methods section in the Online Repository) and summarized into Excel documents. Third, a 2-tier expert review evaluated the text-mined terms, suggested additional terms if required, and the responsible working group agreed (defined as at least 80% agreement among group experts) on the final HPO annotations for each disease. Fourth, the validated terms were submitted to HPO. Document S2 in this article’s Methods section in the Online Repository contains the reannotated diseases, and the list of reannotated terms for each disease is available in Document S3 in this article’s Methods section in the Online Repository.

FIG 1.

Pipeline for standardized reannotation of IEI diseases. First, scientific publications were collected by experts for each disease within the subgroups. Second, HPO terms were extracted from the provided publications for each disease using machine learning and summarized into Excel documents. Third, a 2-tier expert review evaluated the text-mined terms, suggested additional terms if required, and the responsible working group agreed on the final HPO annotations for each disease. Fourth, data were collated, and the agreed terms were submitted to HPO.

Standardized reannotation of genetically undiagnosed diseases

The methods above were specifically designed for application in (very) rare diseases, where the number of patients and therefore the described phenotypic spectrum and clinical presentation is sparse. In case of diseases and disease groups where an adequate amount of patient and phenotype data were available, in addition to a True/False annotation, the frequency of each phenotypic item was assessed. The frequencies correspond to the following representation in patients: common = Frequent (79%-30%); sometimes = Occasional (29%-5%); rare = 5 Very rare (<4%-1%).

Patient cohort

We randomly selected 30 patients who harbored a genetic diagnosis in one of the reannotated diseases from a large pediatric referral center research database. Clinical summaries of these patients before genetic diagnosis were retrieved by an expert clinician. The clinical summaries were parsed and HPO terms were extracted using machine learning as described in this article’s Methods section in the Online Repository.

HPO information content measures, and disease patient similarity measures

Information content of all HPO terms was assessed with the R package ontologyIndex v2.5.[18] The phenotypic similarity of diseases and patients before and after reannotation was compared using the R package ontologySimilarity v2.3.[18] The Euclidean distances between the diseases were computed on the basis of similarity measures, clustered with hierarchical clustering and visualized with ggtree using the R packages ggtree[19] and ape v5.2.[20] A detailed description including the data processing pipeline and tools is available in this article’s Methods section in the Online Repository.

RESULTS

Systematic evaluation and expansion of the HPO structure and terms relevant to IEIs

Our approach has resulted in the restructuring of 4 main branches of the HPO tree, namely (1) abnormality of the immune system (HP:0002715), (2) abnormality of metabolism/homeostasis (HP:0001939), (3) abnormality of the integument (HP:0001574), and (4) abnormality of the cardiovascular system (see Fig 2, A, and Document S1 in this article’s Online Repository at www.jacionline.org). Together, this revision prompted the replacement/restructuring of 67 terms, and the addition of 57 new terms to the HPO tree, among them “recurrent fever,” “unusual infections,” “IgG levels in blood” (see Fig 2, B, and comprehensive list in Documents S1 and S2 in this article’s Online Repository at www.jacionline.org).

FIG 2.

Revision and expansion of the HPO tree. A, Schematic representation of the restructuring of the HPO tree. Main branches of the HPO tree where restructuring was performed are marked with light green. B, “Abnormality of temperature,” “Abnormality of immunoglobulin level,” and “Unusual infections” as examples of revised branches of the HPO tree. New additions and suggestions are marked with green, and repositioned terms are marked with yellow.

Directed expansion of PAD terms

Overall, the PAD working group focused on replacing broad and nonspecific terms with terms that describe phenotypes in more detail and accuracy (eg, “partially absent total IgG/IgA/IgM in blood” and “(near) absent total IgG/IgA/IgM in blood” instead of “hypogammaglobulinemia”) (Fig 2, B). In addition, we proposed that the full detailed spectrum of specific antibody as well as IgG-subclass deficiencies be described by separate HPO terms. For example, we described individual terms related to “decreased specific antibody response to vaccination in blood” divided according to the response to different types of vaccination (protein, protein-conjugated polysaccharide, and unconjugated polysaccharide).

Standardized reannotation of rare, genetically diagnosed IEIs

We started by a systematic review of 4 disease categories of the International Union of Immunological Societies (IUIS) classification of IEIs, as proof of concept: diseases affecting cellular and humoral immunity (IUIS Table 1), diseases of immune dysregulation (IUIS Table 4), autoinflammatory disorders (IUIS Table 7), and genetically undiagnosed predominantly antibody deficiencies (IUIS Table 3), detailed in Table E1 and Document S3 in this article’s Online Repository at www.jacionline.org. As a first step, we assessed the already available HPO annotation for each disease in the v2019-06-03 HPO release (see this article’s Methods section in the Online Repository). We found that 15% of diseases considered (11 of 73 diseases in total) did not have any associated HPO terms (Fig 3, A). Overall, we found that on average 13.3 phenotype terms were available per disease (Fig 3, B), later referred to as “existing terms.”

FIG 3.

Results of disease reannotation. A, HPO annotation availability in the subset of 72 diseases. B, Distribution of number of available HPO terms per disease. C, Pipeline for the reannotation process. D, Distribution of the number of articles used per disease for the reannotation pipeline. E, Number of mined terms per disease. Each dot represents a disease. F, All mined vs all accepted terms. G, Number of available terms per disease before and after reannotation. Each dot represents a disease. H, Mean information content available per disease before and after reannotation. I, The aggregate mean annotation per disease after reannotation. J, All text-mined terms from PAD publications. K, Frequency distribution of different PAD terms according to the experts.

The text-mining and evaluation process was separated into 4 steps shown in Fig 3, C. We have first focused on the reannotation of 72 genetically diagnosed IEIs, and genetically undiagnosed PADs. For genetically diagnosed IEIs, text mining was based on 162 expert-curated articles, on average 2.57 articles per disease (Fig 3, D). This resulted in 4,517 extracted phenotype terms, 66.42 terms per disease (Fig 3, E). Of these terms, 3,242—or 71% per disease (47.67 of 66.42)—were accepted as correctly attributed terms by the expert reviewers (Fig 3, F). Expert suggestions added up to 529 additional HPO terms, in addition to the existing and text-mined terms. After reannotation, a mean of 63.1 terms were available for each disease, resulting in a 4.7-fold gain in the number of available annotations (Fig 3, G). The mean information content as measured by the overall frequency of terms in each disease’s annotations has increased from 6.17 to 8.3 (Fig 3, H) after reannotation. The new annotation of diseases consisted mainly of text-mined terms (70.6%) (Fig 3, I), followed by already existing terms (9.3%) and additional suggestions by experts (9.3%; adding a further 5.2 additional terms per disease) (see Document S3 in this article’s Online Repository).

Standardized reannotation of genetically undiagnosed PADs

PADs form a heterogeneous group, and most PADs do not (as yet) have a genetic diagnosis. We collected articles describing the heterogeneous PADs related to common variable immunodeficiency disorders, agammaglobulinemia, selective IgM deficiency, selective IgA deficiency, IgG-subclass deficiency, specific antibody deficiency, and unclassified antibody deficiency subgroups. In total, 541 terms were text mined from these articles, many of these in more than 1 PAD subgroup, and 245 of these terms (45.2%) were annotated as correctly associated to the respective PAD subgroup by the expert reviewers (Fig 3, J). Of these 245 terms, the experts annotated 16.3% as commonly found in PADs, 48.97% as sometimes associated (albeit less commonly), and 34.7% as rarely associated with PAD (Fig 3, K).

Patient-disease matching

We set out to showcase the efficacy of our reannotation effort by highlighting the potential diagnostic impact of optimized disease annotation. To do this, we have selected 30 clinical cases from a large immunology referral center research database (see Online Repository Document S3). HPO terms were matched to patient phenotypes by experts from the clinical synopsis, and the phenotypic similarity to all HPO-annotated diseases was calculated on the basis of these selected patient HPO terms (Fig 4, A), as illustrated by a concrete clinical example of a patient with tumor necrosis factor receptor–associated periodic syndrome (Fig 4, B). Overall, we show a significant improvement by 47% in the specificity of patient phenotype matching to correct diagnosis (from 0.49 to 0.72; P = 1.8 × 10−07; Fig 4, C), and a significantly better ranking of the correct clinical diagnosis across all possible diseases after reannotation: in most cases, the correct diagnosis was in the top 10 of matched diseases (Fig 4, D) after reannotation, and the rank of the correct diagnosis for individual patients was highly significantly improved, from a mean of 285 to 19 (14.9-fold improvement; P = 9.1 × 10−07; Fig 4, E).

FIG 4.

Patient-disease matching. A, Schematic overview of the different steps of patient-to-disease matching. First, the phenotypes were identified in a patient’s clinical history. Second, these phenotypes were translated to HPO terms. Finally, patient phenotype to disease matching was measured by Lin similarity. B, Matching patient 1 to a diagnosis. C, Similarity of patients in patient cohort to genetic diagnosis before and after reannotation. D, The rank of correct clinical diagnosis more often is in the top 10 of matched diseases after reannotation. E, Improvement of ranks of clinical diagnosis before and after reannotation. Significance was assessed by Student t test.

Phenotype-driven disease classification

We tested the efficacy of our approach in selecting biologically and clinically meaningful phenotypes by assessing the HPO-based phenotypic similarity of diseases before and after reannotation. In particular, we assessed whether the similarity was greater within or between IUIS clinically defined groups. We found that the phenotype-driven disease classification after reannotation has resulted in a clustering more in concordance with the IUIS-based clinical classification (see Fig 5, A and B).

FIG 5.

Phenotypic similarity of diseases before and after reannotation. Diseases are annotated with the IUIS disease group (inner circle), subgroup (outer circle), and OMIM identifier. A, Clustering of diseases based on phenotypic similarity before reannotation. B, Clustering of diseases based on phenotypic similarity after reannotation. OMIM, Online Mendelian Inheritance in Men; SCID, severe combined immunodeficiency.

DISCUSSION

Unified data standards, consistent classification, and robustly verified clinical data are vital pillars supporting diagnostic pipelines and data-driven research. Although databases and vocabularies that aim to provide accurate phenotypic descriptions exist,[5-9] there are still major gaps in the depiction of IEIs in these data sets. Here, we used a cross-community collaboration to review, expand, and improve the depiction of IEIs in HPO, and reannotate IEIs with HPO terms. We reviewed 4 separate branches of the HPO tree and submitted 57 new and expanded HPO terms, most of which are now included in the official HPO data set. We introduced a semi-automated reannotation pipeline, which combines ontology-guided machine learning and a 2-tier expert review to reannotate 4 main categories of IEIs. The basis of the ontology-guided machine learning was the expert-curated list of articles (162 in total), which was submitted to the PanelApp[21] to serve as a public resource. The text-mined phenotypes were subjected to expert review to confer face validity or refute the putative new HPO terms. IEIs and their current HPO terms covered by the working groups were scrutinized in-depth, resulting in high-quality annotations. Overall, we have achieved a 4.7-fold gain in the number of HPO terms annotating each disease. These annotations included unspecific (frequently annotated) as well as specific (less frequently annotated) HPO terms holding less and more information content, respectively. Combined, the mean information content increased from 6.17 to 8.3. Each reannotated disease showed an increase in information content and a quantitative gain in the number of available HPO terms. Through patient-disease matching and disease-similarity examples, we illustrated that these gains and increases translated to significant qualitative improvement in patient-disease matching in an independent cohort of patients with IEI (Fig 4), and phenotype-driven classification of IEIs that more closely resembles clinical consensus (Fig 5). Although neither of these measures are systematic assessments of global patient-disease matching and disease-similarity comparisons, they highlight that there is considerable benefit by the revision of specific subclasses of diseases. Once a near-complete HPO phenotype reannotation of almost all IEIs is available, it will be intriguing to assess how well patients with genetic diagnoses match reannotated Online Mendelian Inheritance in Man (OMIM) diseases in a clinical setting, how patient matching to genetic diagnosis is transformed, and whether these changes ultimately lead to an earlier diagnosis. Finally, once a detailed and accurate phenotypic description is available for all IEIs, identification phenotype-driven patient subgroups will be common practice, and a more objective entirely phenotype-driven classification and ontology of IEIs can become a reality. Accurate phenotypic description of patients holds promise for diagnostic utility and for the discovery of novel diseases. Phenotype-driven genetic diagnostic tools now exist, but their full clinical potential is hampered by the lack of complete phenotypic descriptions for most types of IEIs. Phenotips[22] is a free and open source software for collecting and analyzing phenotypic information of patients with genetic disorders that is widely used in the rare disease community. Tools such as Exomiser use HPO terms to annotate and to prioritize potentially casual variants.[23] New integrative “omics” approaches and the analysis of large-scale data with artificial intelligence will allow us to go from a one-size-fits-all to a more personalized medicine, including in IEIs. We see the potential to integrate the richer phenotyping of previously undiagnosed groups of patients with IEI with available sequencing data to accelerate disease gene discovery and at the same time increase the diagnostic rate in new patients.[24] Novel disease-gene or phenotype associations depend on sufficient numbers of cases as well as a control cohort of comparable quality. Cross-institute and cross-country collaborations for cohorts of undiagnosed, but well-phenotyped patients could shed light on novel disease-causing genes of the immune system. Trusted and accepted data and information sharing platforms are already being developed[13,22] to provide robust and sufficiently granular HPO terms as a standardized way of phenotyping patients. Electronic health records[25] could facilitate the transfer of HPO terms by integrating with available sharing platforms. Capturing HPO annotations of novel rare diseases or cases is an ongoing challenge for a complete disease representation. Thus, it is important that alongside of updating the official IUIS classification, HPO descriptions of disorders are curated once every several years. We suggest a community effort for such regular reviews of HPO regarding IEIs, such as a team of experts, part of big international groups of clinicians such as ESID or ERN-RITA, the Clinical Immunology Society (CIS), or other similar organizations. Publication standards that require the submission of HPO annotations upfront would greatly improve this process. Once phenotyped patients are available, robust and global approaches are accessible[2] to find phenotypically similar cases. These comparisons are performed by advanced machine learning algorithms. However, machine learning can also be a very powerful tool to automate the identification of relevant phenotype information in publications or clinical notes. We applied an ontology-guided machine learning tool to support the annotation of diseases and explored the full spectrum of terms—from very relevant to not relevant at all. The same process can be applied to unstructured clinical notes to accelerate in-depth annotation of patients. For patients with electronic health records,[25] abnormal clinical values can automatically be translated into HPO codes[26] for a more precise diagnostic application and integrated with sharing platforms as mentioned before. The foundation of these comparisons is an ontology with a comprehensive set of terms, which is widely used. Because there is currently no criterion standard on how to perform an expert-based review of ontologies, guidance on annotating diseases with HPO phenotypes can vary between diseases, disease classes, and centers. IEIs are rare diseases, and often there are only a few patients described (sometimes only 1 kindred in case of ultra-rare diseases). Therefore, the depth of currently available published phenotypes is at times limited. The low number of patients and insufficient depth of available phenotypes bring up a question as to which diseases to include in phenotyping exercises of this nature. On the one hand, focusing on IEIs that are commonly accepted, with multiple patients diagnosed and well described by multiple researchers, can increase the depth of phenotyping. However, this approach excludes at least 10% of IEIs (the ultra-rare diseases). On the other hand, an all-inclusive approach including every disease systematically means that we rely on sparsely phenotyped patients and perhaps insufficient data for ultrarare disorders. A warning of accuracy by indicating the frequency of each phenotype for diseases could soon be possible, with the addition of phenotype frequency to the HPO data set, an expansion that is currently work in progress. This implies the need for a responsive system, capable of assimilating new phenotypic information as the pool of confidently diagnosed patients increases. Our ongoing approach aims to address these gaps for IEIs and to provide an ontology that is practical, useful, and as complete as possible. However, the existence of a well-built ontology and the awareness of clinicians and researchers itself does not guarantee a shift in the community to fully adapt a standardized phenotyping approach. Our approach raised awareness regarding the concept and importance of HPO among the IEI community. Moreover, the process made the participating clinicians aware of the available terms and highlighted where these were lacking. Moving forward, it is very important that official entities adopt HPO terms as the unified means of patient phenotyping. We hypothesize that as soon as the widely used registries such as the Undiagnosed Disease Network[11] or the IUIS[27] use HPO to refer to phenotypic annotation, this will propel the IEI field toward adopting HPO as the main nomenclature for phenotyping patients with IEI. One promising move in this direction is the recent expansion of the ESID registry working definitions for the clinical diagnosis of IEIs,[28] which derives HPO terms from OrphaNet using the ORDO Ontological Module (HOOM) platform,[29] prompted by our HPO initiative.

Conclusions

Our work reviewed and expanded the phenotypic depiction of multiple subclasses of IEIs, and to our knowledge, this initiative is the first endeavor of its kind with the aim of standardizing IEI phenotypes. Our semiautomated annotation-based approach is scalable to include all IEIs as illustrated herein. We propose our reannotation approach as a blueprint for systematic HPO (re) annotation for additional immunologic and nonimmunologic diseases.

29 in total

1. The Undiagnosed Diseases Network: Accelerating Discovery about Health and Disease.

Authors: Rachel B Ramoni; John J Mulvihill; David R Adams; Patrick Allard; Euan A Ashley; Jonathan A Bernstein; William A Gahl; Rizwan Hamid; Joseph Loscalzo; Alexa T McCray; Vandana Shashi; Cynthia J Tifft; Anastasia L Wise
Journal: Am J Hum Genet Date: 2017-02-02 Impact factor: 11.025

2. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R.

Authors: Emmanuel Paradis; Klaus Schliep
Journal: Bioinformatics Date: 2019-02-01 Impact factor: 6.937

3. The Matchmaker Exchange: a platform for rare disease gene discovery.

Authors: Anthony A Philippakis; Danielle R Azzariti; Sergi Beltran; Anthony J Brookes; Catherine A Brownstein; Michael Brudno; Han G Brunner; Orion J Buske; Knox Carey; Cassie Doll; Sergiu Dumitriu; Stephanie O M Dyke; Johan T den Dunnen; Helen V Firth; Richard A Gibbs; Marta Girdea; Michael Gonzalez; Melissa A Haendel; Ada Hamosh; Ingrid A Holm; Lijia Huang; Matthew E Hurles; Ben Hutton; Joel B Krier; Andriy Misyura; Christopher J Mungall; Justin Paschall; Benedict Paten; Peter N Robinson; François Schiettecatte; Nara L Sobreira; Ganesh J Swaminathan; Peter E Taschner; Sharon F Terry; Nicole L Washington; Stephan Züchner; Kym M Boycott; Heidi L Rehm
Journal: Hum Mutat Date: 2015-10 Impact factor: 4.878

4. A new face and new challenges for Online Mendelian Inheritance in Man (OMIM®).

Authors: Joanna Amberger; Carol Bocchini; Ada Hamosh
Journal: Hum Mutat Date: 2011-04-05 Impact factor: 4.878

5. Undiagnosed Diseases Network International (UDNI): White paper for global actions to meet patient needs.

Authors: Domenica Taruscio; Stephen C Groft; Helene Cederroth; Béla Melegh; Paul Lasko; Kenjiro Kosaki; Gareth Baynam; Alexa McCray; William A Gahl
Journal: Mol Genet Metab Date: 2015-11-05 Impact factor: 4.797

6. GeneMatcher: a matching tool for connecting investigators with an interest in the same gene.

Authors: Nara Sobreira; François Schiettecatte; David Valle; Ada Hamosh
Journal: Hum Mutat Date: 2015-08-13 Impact factor: 4.878

7. Clinical efficacy of a next-generation sequencing gene panel for primary immunodeficiency diagnostics.

Authors: W Rae; D Ward; C Mattocks; R J Pengelly; E Eren; S V Patel; S N Faust; D Hunt; A P Williams
Journal: Clin Genet Date: 2018-02-02 Impact factor: 4.438

8. PhenoTips: patient phenotyping software for clinical and research use.

Authors: Marta Girdea; Sergiu Dumitriu; Marc Fiume; Sarah Bowdin; Kym M Boycott; Sébastien Chénier; David Chitayat; Hanna Faghfoury; M Stephen Meyn; Peter N Ray; Joyce So; Dimitri J Stavropoulos; Michael Brudno
Journal: Hum Mutat Date: 2013-05-24 Impact factor: 4.878

9. Human phenotype ontology annotation and cluster analysis to unravel genetic defects in 707 cases with unexplained bleeding and platelet disorders.

Authors: Sarah K Westbury; Ernest Turro; Daniel Greene; Claire Lentaigne; Anne M Kelly; Tadbir K Bariana; Ilenia Simeoni; Xavier Pillois; Antony Attwood; Steve Austin; Sjoert Bg Jansen; Tamam Bakchoul; Abi Crisp-Hihn; Wendy N Erber; Rémi Favier; Nicola Foad; Michael Gattens; Jennifer D Jolley; Ri Liesner; Stuart Meacham; Carolyn M Millar; Alan T Nurden; Kathelijne Peerlinck; David J Perry; Pawan Poudel; Sol Schulman; Harald Schulze; Jonathan C Stephens; Bruce Furie; Peter N Robinson; Chris van Geet; Augusto Rendon; Keith Gomez; Michael A Laffan; Michele P Lambert; Paquita Nurden; Willem H Ouwehand; Sylvia Richardson; Andrew D Mumford; Kathleen Freson
Journal: Genome Med Date: 2015-04-09 Impact factor: 11.117

10. PanelApp crowdsources expert knowledge to establish consensus diagnostic gene panels.

Authors: Antonio Rueda Martin; Eleanor Williams; Rebecca E Foulger; Sarah Leigh; Louise C Daugherty; Olivia Niblock; Ivone U S Leong; Katherine R Smith; Oleg Gerasimenko; Eik Haraldsdottir; Ellen Thomas; Richard H Scott; Emma Baple; Arianna Tucci; Helen Brittain; Anna de Burca; Kristina Ibañez; Dalia Kasperaviciute; Damian Smedley; Mark Caulfield; Augusto Rendon; Ellen M McDonagh
Journal: Nat Genet Date: 2019-11 Impact factor: 38.330

4 in total

1. A rapid turnaround gene panel for severe autoinflammation: Genetic results within 48 hours.

Authors: Dara McCreary; Ebun Omoyinmi; Ying Hong; Barbara Jensen; Alice Burleigh; Fiona Price-Kuehne; Kimberly Gilmour; Despina Eleftheriou; Paul Brogan
Journal: Front Immunol Date: 2022-09-20 Impact factor: 8.786

2. Implementation of Early Next-Generation Sequencing for Inborn Errors of Immunity: A Prospective Observational Cohort Study of Diagnostic Yield and Clinical Implications in Dutch Genome Diagnostic Centers.

Authors: Kim Elsink; Manon M H Huibers; Iris H I M Hollink; Annet Simons; Evelien Zonneveld-Huijssoon; Lars T van der Veken; Helen L Leavis; Stefanie S V Henriet; Marcel van Deuren; Frank L van de Veerdonk; Judith Potjewijd; Dagmar Berghuis; Virgil A S H Dalm; Clementien L Vermont; Annick A J M van de Ven; Annechien J A Lambeck; Kristin M Abbott; P Martin van Hagen; Godelieve J de Bree; Taco W Kuijpers; Geert W J Frederix; Mariëlle E van Gijn; Joris M van Montfrans
Journal: Front Immunol Date: 2021-12-21 Impact factor: 7.561

3. Phenomic Analysis of Chronic Granulomatous Disease Reveals More Severe Integumentary Infections in X-Linked Compared With Autosomal Recessive Chronic Granulomatous Disease.

Authors: Timothy Lok-Hin Chiu; Daniel Leung; Koon-Wing Chan; Hok Man Yeung; Chung-Yin Wong; Huawei Mao; Jianxin He; Pandiarajan Vignesh; Weiling Liang; Woei Kang Liew; Li-Ping Jiang; Tong-Xin Chen; Xiang-Yuan Chen; Yin-Bo Tao; Yong-Bin Xu; Hsin-Hui Yu; Alta Terblanche; David Christopher Lung; Cheng-Rong Li; Jing Chen; Man Tian; Brian Eley; Xingtian Yang; Jing Yang; Wen Chin Chiang; Bee Wah Lee; Deepti Suri; Amit Rawat; Anju Gupta; Surjit Singh; Wilfred Hing Sang Wong; Gilbert T Chua; Jaime Sou Da Rosa Duque; Kai-Ning Cheong; Patrick Chun-Yin Chong; Marco Hok-Kung Ho; Tsz-Leung Lee; Wanling Yang; Pamela P Lee; Yu Lung Lau
Journal: Front Immunol Date: 2022-01-24 Impact factor: 7.561

4. The Immune Deficiency and Dysregulation Activity (IDDA2.1 'Kaleidoscope') Score and Other Clinical Measures in Inborn Errors of Immunity.

Authors: Markus G Seidel; Victoria K Tesch; Linlin Yang; Fabian Hauck; Anna Lena Horn; Maria Anna Smolle; Franz Quehenberger; Martin Benesch
Journal: J Clin Immunol Date: 2021-11-19 Impact factor: 8.542

4 in total