| Literature DB >> 22080554 |
Lynn Marie Schriml1, Cesar Arze, Suvarna Nadendla, Yu-Wei Wayne Chang, Mark Mazaitis, Victor Felix, Gang Feng, Warren Alden Kibbe.
Abstract
The Disease Ontology (DO) database (http://disease-ontology.org) represents a comprehensive knowledge base of 8043 inherited, developmental and acquired human diseases (DO version 3, revision 2510). The DO web browser has been designed for speed, efficiency and robustness through the use of a graph database. Full-text contextual searching functionality using Lucene allows the querying of name, synonym, definition, DOID and cross-reference (xrefs) with complex Boolean search strings. The DO semantically integrates disease and medical vocabularies through extensive cross mapping and integration of MeSH, ICD, NCI's thesaurus, SNOMED CT and OMIM disease-specific terms and identifiers. The DO is utilized for disease annotation by major biomedical databases (e.g. Array Express, NIF, IEDB), as a standard representation of human disease in biomedical ontologies (e.g. IDO, Cell line ontology, NIFSTD ontology, Experimental Factor Ontology, Influenza Ontology), and as an ontological cross mappings resource between DO, MeSH and OMIM (e.g. GeneWiki). The DO project (http://diseaseontology.sf.net) has been incorporated into open source tools (e.g. Gene Answers, FunDO) to connect gene and disease biomedical data through the lens of human disease. The next iteration of the DO web browser will integrate DO's extended relations and logical definition representation along with these biomedical resource cross-mappings.Entities:
Mesh:
Year: 2011 PMID: 22080554 PMCID: PMC3245088 DOI: 10.1093/nar/gkr972
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.DO web interface with search, navigation and display functions. The disease tree view displays the DO's hierarchical structure and the placement of top level parent nodes expandable to view subgraphs. The fungal infectious disease subgraph with its direct child terms are shown. Term Metadata is displayed for selected terms from the tree view.
DO UMLS CUI ID mappings
| Vocabulary | Vocabulary IDs | DO IDs | ||
|---|---|---|---|---|
| May 2010 | August 2011 | May 2010 | August 2011 | |
| OMIM | 2304 | 1389 | 1594 | 2330 |
| SNOMED CT | 20 985 | 14 313 | 8054 | 5155 |
| NCI thesaurus | 7249 | 4761 | 7067 | 4858 |
| MeSH | 3932 | 3032 | 3921 | 3116 |
| ICD9CM | 6403 | 2971 | 5757 | 3325 |
A total of 7845 of 8588 DOID's (91%) in DO version 3, revision 2490 were mapped to the 2011 UMLS CUIs in August 2011. The number of unique vocabulary IDs mapped is given in the center column and the number of DO terms mapping to other terminologies through the CUI mapping file is presented on the right. Note that a single DO term may have multiple matches in a given terminology. The decrease in SNOMED CT mappings is a reflection of the increased use of logical definitions in DO.
Figure 2.Term visualization in the DO web-browser. The ‘Visualize’ button on the Metadata page opens a graphical view of DO. Clicking on this button will open a new tab that will display the target node of the visualization (e.g. basidiobolomycosis) [red box], parent node [green box] and sibling leaf nodes [gray box]. Nodes with five or more children are represented by a gold circle containing the number of children. Clicking on a node in the graph will expand the view.