| Literature DB >> 30407550 |
Lynn M Schriml1, Elvira Mitraka2, James Munro1, Becky Tauber1, Mike Schor1, Lance Nickle1, Victor Felix1, Linda Jeng3, Cynthia Bearer3, Richard Lichenstein3, Katharine Bisordi3, Nicole Campion3, Brooke Hyman3, David Kurland4, Connor Patrick Oates5, Siobhan Kibbey3, Poorna Sreekumar3, Chris Le3, Michelle Giglio1, Carol Greene3.
Abstract
The Human Disease Ontology (DO) (http://www.disease-ontology.org), database has undergone significant expansion in the past three years. The DO disease classification includes specific formal semantic rules to express meaningful disease models and has expanded from a single asserted classification to include multiple-inferred mechanistic disease classifications, thus providing novel perspectives on related diseases. Expansion of disease terms, alternative anatomy, cell type and genetic disease classifications and workflow automation highlight the updates for the DO since 2015. The enhanced breadth and depth of the DO's knowledgebase has expanded the DO's utility for exploring the multi-etiology of human disease, thus improving the capture and communication of health-related data across biomedical databases, bioinformatics tools, genomic and cancer resources and demonstrated by a 6.6× growth in DO's user community since 2015. The DO's continual integration of human disease knowledge, evidenced by the more than 200 SVN/GitHub releases/revisions, since previously reported in our DO 2015 NAR paper, includes the addition of 2650 new disease terms, a 30% increase of textual definitions, and an expanding suite of disease classification hierarchies constructed through defined logical axioms.Entities:
Year: 2019 PMID: 30407550 PMCID: PMC6323977 DOI: 10.1093/nar/gky1032
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Advanced Boolean Searches. AND/OR searches of any of the DO datatypes (Name, Synonym, Definition, SubSet, DOID, Alternate ID, Xrefs) enable complex data queries of the DO Knowledgebase. For example, a search of Xrefs: OMIM, Name: Parkinson, Subset: DO_rare_slim – identifies all DO disease terms that include a cross-reference (Xref) to OMIM, where the disease name includes ‘Parkinson’ and where the disease is included in the DO_rare_slim – rare disease category. This query in DO, returns seven disease terms.
DO file names and content, class counts
| File name (.owl and .obo) | Content | Hierarchy | Classes | Total # classes |
|---|---|---|---|---|
| doid | OBO Foundry format | asserted is_a with, Equivalent To Axioms, SubClass Of Statements | imports, DOIDs | 17,579 |
| doid-non-classified | DO file format | asserted is_a | DOIDs: non-obsolete (9,069) | 11,463 |
| doid-merged | MGI custom file | asserted is_a, inferred parents | imports, DOIDs, omim_susceptibility | 15,183 |
Note: The file, doid-non-classified, is an equivalent file (in content and structure) to HumanDO.obo. The DO project has continued to produce the HumanDO files, as this was the original naming convention used by the project for ∼10 years and was included in several publications. Total class counts include non-obsolete and obsolete DOIDs (DO identifiers). OBO Foundry (http://www.obofoundry.org) (22). For DO’s GitHub release # 45, there are 9,069 DOIDs, also retrievable from the DO’s website.
Figure 2.An example DO SPARQL: doid-report.rq.
Figure 3.Example areas of term expansion: (A) hematopoietic system diseases and (B) inherited metabolic disorders. Blue arrow indicates new DO terms.
Figure 4.Skin disease logical axioms, define inferred disease parents. Integration of a SubClass Of logical axiom for ‘ichthyosis vulgaris’ and ‘autosomal dominant cutis laxa’ [BOLD] (‘has material basis in’ some ‘autosomal dominant inheritance’) and an ‘Equivalent To logical axiom for ‘autosomal dominant disease’ (disease and (‘has material basis in’ some ‘autosomal dominant inheritance’)), where ‘autosomal dominant inheritance is from the Genotype Ontology (http://www.obofoundry.org/ontology/geno.html) creates ‘inferred’ child to parent DO relationships, thus both skin diseases are defined as inferred child terms of DO’s ‘autosomal dominant disease’.