| Literature DB >> 24217912 |
Sebastian Köhler1, Sandra C Doelken, Christopher J Mungall, Sebastian Bauer, Helen V Firth, Isabelle Bailleul-Forestier, Graeme C M Black, Danielle L Brown, Michael Brudno, Jennifer Campbell, David R FitzPatrick, Janan T Eppig, Andrew P Jackson, Kathleen Freson, Marta Girdea, Ingo Helbig, Jane A Hurst, Johanna Jähn, Laird G Jackson, Anne M Kelly, David H Ledbetter, Sahar Mansour, Christa L Martin, Celia Moss, Andrew Mumford, Willem H Ouwehand, Soo-Mi Park, Erin Rooney Riggs, Richard H Scott, Sanjay Sisodiya, Steven Van Vooren, Ronald J Wapner, Andrew O M Wilkie, Caroline F Wright, Anneke T Vulto-van Silfhout, Nicole de Leeuw, Bert B A de Vries, Nicole L Washingthon, Cynthia L Smith, Monte Westerfield, Paul Schofield, Barbara J Ruef, Georgios V Gkoutos, Melissa Haendel, Damian Smedley, Suzanna E Lewis, Peter N Robinson.
Abstract
The Human Phenotype Ontology (HPO) project, available at http://www.human-phenotype-ontology.org, provides a structured, comprehensive and well-defined set of 10,088 classes (terms) describing human phenotypic abnormalities and 13,326 subclass relations between the HPO classes. In addition we have developed logical definitions for 46% of all HPO classes using terms from ontologies for anatomy, cell types, function, embryology, pathology and other domains. This allows interoperability with several resources, especially those containing phenotype information on model organisms such as mouse and zebrafish. Here we describe the updated HPO database, which provides annotations of 7,278 human hereditary syndromes listed in OMIM, Orphanet and DECIPHER to classes of the HPO. Various meta-attributes such as frequency, references and negations are associated with each annotation. Several large-scale projects worldwide utilize the HPO for describing phenotype information in their datasets. We have therefore generated equivalence mappings to other phenotype vocabularies such as LDDB, Orphanet, MedDRA, UMLS and phenoDB, allowing integration of existing datasets and interoperability with multiple biomedical resources. We have created various ways to access the HPO database content using flat files, a MySQL database, and Web-based tools. All data and documentation on the HPO project can be found online.Entities:
Mesh:
Year: 2013 PMID: 24217912 PMCID: PMC3965098 DOI: 10.1093/nar/gkt1026
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The HPO aims to act as a central resource to connect several genomics datasets with the diseasome. Thus, the HPO can act as a scaffold for enabling the interoperability between molecular biology and human disease. For example, phenotypic abnormalities in genetically modified model organisms can be mapped to human disease phenotypes (2).
Different types of phenotypic abnormalities covered by the HPO
| ‘Class’ of phenotype | HPO example |
|---|---|
| Morphological abnormality | Arachnodactyly (HP:0001166) |
| Abnormal process (organ) | Epistaxis (HP:0000421) |
| Abnormal process (cellular) | Abnormality of Krebs cycle metabolism (HP:0000816) |
| Abnormal laboratory finding | Glycosuria (HP:0003076) |
| Electrophysiological abnormality | Hypsarrhythmia (HP:0002521) |
| Abnormality by medical imaging | Choroid plexus cyst (HP:0002190) |
| Behavioral abnormality | Self-mutilation (HP:0000742) |
Figure 2.Statistics of the data from the HPO project from January 2009 to August 2013. Ontology statistics shows quantities related to the file hp.obo. The annotation statistics clearly demonstrates the inclusion of Orphanet data in October 2012.
Meta-information for HPO phenotype annotations
| Meta-attribute | Possible values (explanation in brackets) |
|---|---|
| Qualifier/Modifier | not, mild (±2–3 SD from mean), moderate (±3–4 SD from mean), severe (±4–5 SD from mean), profound (±5SD and greater from mean), secondary, chronic, (non)progressive, episodic, recurrent, bilateral, unilateral, distal, proximal, refractory and generalized |
| Evidence Code | ITM (inferred by text mining), IEA (inferred from electronical annotation), PCS (published clinical study), ICE (individual clinical experience), TAS (traceable author statement) |
| Onset modifier | Any term from HPO-subontology |
| Frequency modifier | percentage value (e.g. 25%), n of m (e.g. 3/10 patients), very rare, rare, occasional, frequent, typical, variable, common, hallmark and obligate |
The meaning/definition of the values is shown in brackets. (SD = standard deviation).
Definitions of age-of-onset terms in the HPO
| Onset of manifestations | Definition |
|---|---|
| Less than 1 year | |
| Embryonal | <8 weeks’ gestation |
| Fetal | 8 weeks’ gestation–birth |
| Neonatal | Birth–28 days |
| Infantile | 28 days–1 year |
| More than 1 year | |
| Childhood | 1–5 years |
| Juvenile | 5–15 years |
| Adults | |
| Young adult | <40 years |
| Mid adult | 40–60 years |
| Old age | >60 years |
Content of and access to the stable releases of the data provided by the HPO project
| Release category | URL of latest stable release for job (relative to | File(s) at URL | File description |
|---|---|---|---|
| HPO releases | hpo/lastStableBuild/ | hp.obo, hp.owl | HPO in OBO/OWL format as generated by Oort. |
| human-phenotype-ontology_xp.obo | Logical definitions of HPO terms. | ||
| onet_hpo.tsv, LDDB2HPO-v2.csv, medraMapping.tsv | Mappings to other phenotype vocabularies, e.g. Orphanet, LDDB, MedDRA. | ||
| Disease annotations | hpo.annotations/lastStableBuild/ | negative_phenotype_annotation.tab | Disease-HPO term associations asserted not to be associated with the corresponding disease. |
| phenotype_annotation_hpoteam.tab | Manual and semi-automatic annotations of syndromes from OMIM and DECIPHER. | ||
| phenotype_annotation.tab | Manual and semi-automatic annotations of OMIM and DECIPHER augmented with annotations to Orphanet syndromes. | ||
| Other data | hpo.annotations.monthly/lastStableBuild/ | < | Mapping of human genes to phenotypic features (via disease-to-gene relationship). (< |
| < | Mapping of phenotypic features to human genes (via disease-to-gene relationship). (< | ||
| MYHPO_MM_YYYY.sql | MySQL dump of the HPO database, where MM and YYYY denote the month and year of release. | ||
| hpo.diseasesimilarity/lastStableBuild/ | matrices.tar.gz | Precomputed disease–disease similarity matrix for all diseases with annotations to HPO’s phenotypic abnormality subontology. Symmetric and asymmetric semantic similarity score. | |
| hpo.ontology.uberpheno/lastStableBuild/ | crossSpeciesPheno.obo | Cross-species phenotype ontology (human, mouse, zebrafish). | |
| HSgenes_crossSpecies PhenoAnnotation.txt | Annotation of all human genes to terms in crossSpeciesPheno.obo (uses orthology to human genes obtained from MGI and ZFIN). See ( |
Tools and applications using HPO
| Tool | Reference/URL |
|---|---|
| Phenomizer | ( |
| BOQA | ( |
| Exomiser | |
| Cartagenia | |
| ECARUCA | ( |
| DECIPHER | ( |
| PhenoTips | ( |
| PhenoDigm | ( |
| MouseFinder | ( |
| Monarch | |
| PhenomeNet | ( |
| Uberpheno | ( |