| Literature DB >> 25348409 |
Warren A Kibbe1, Cesar Arze2, Victor Felix2, Elvira Mitraka2, Evan Bolton3, Gang Fu3, Christopher J Mungall4, Janos X Binder5, James Malone6, Drashtti Vasant6, Helen Parkinson6, Lynn M Schriml7.
Abstract
The current version of the Human Disease Ontology (DO) (http://www.disease-ontology.org) database expands the utility of the ontology for the examination and comparison of genetic variation, phenotype, protein, drug and epitope data through the lens of human disease. DO is a biomedical resource of standardized common and rare disease concepts with stable identifiers organized by disease etiology. The content of DO has had 192 revisions since 2012, including the addition of 760 terms. Thirty-two percent of all terms now include definitions. DO has expanded the number and diversity of research communities and community members by 50+ during the past two years. These community members actively submit term requests, coordinate biomedical resource disease representation and provide expert curation guidance. Since the DO 2012 NAR paper, there have been hundreds of term requests and a steady increase in the number of DO listserv members, twitter followers and DO website usage. DO is moving to a multi-editor model utilizing Protégé to curate DO in web ontology language. This will enable closer collaboration with the Human Phenotype Ontology, EBI's Ontology Working Group, Mouse Genome Informatics and the Monarch Initiative among others, and enhance DO's current asserted view and multiple inferred views through reasoning.Entities:
Mesh:
Year: 2014 PMID: 25348409 PMCID: PMC4383880 DOI: 10.1093/nar/gku1011
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The DO website. The query for all DO terms included in the DO Cancer slim is displayed.
The seven disease vocabularies being cross-compared.
| Vocabulary | Data source | Size | Access date |
|---|---|---|---|
| DO | Subclasses of root disease concept () | 8,757 | 26 March 2014 |
| NDF-RT | Subclasses of root disease concept (N0000000004) | 4,700 | 26 March 2014 |
| NCI | Subclasses of root disease concept (C2991) | 4,252 | 11 August 2014 |
| ORDO | Subclasses of root disease concept (Orphanet_C001) | 4,799 | 27 March 2014 |
| KEGG | All of KEGG MEDICUS disease concepts | 1,359 | 30 March 2014 |
| CTD | All of MEDIC disease concepts | 11,898 | 08 April 2014 |
| HPO-D | All of hereditary syndromes with phenotypic information | 7,520 | 07 August 2014 |
The number of unique cross-references in each disease vocabulary.
| MeSH | OMIM | SNOMED-CT | UMLS direct | UMLS indirecta | UMLS combined | Total | |
|---|---|---|---|---|---|---|---|
| DO | 2,908 | 1,864 | 13,276 | 12,170 | 8,354 | 13,004 | 35,895 |
| ORDO | 1,705 | 5,201 | 1,943 | 7,997 | 2,285 | 8,071 | 16,920 |
| CTD | 11,332 | 3,977 | 0 | 0 | 6,650 | 6,650 | 21,959 |
| NDF-RT | 4,661 | 0 | 0 | 4,699 | 7,922 | 8,245 | 12,906 |
| KEGG | 1,295 | 2,690 | 0 | 0 | 4,482 | 4,482 | 8,467 |
| HPO-D | 0 | 6,528 | 0 | 0 | 8,000 | 8,000 | 14,528 |
| NCI | 0 | 0 | 0 | 3,910 | 0 | 3,910 | 3,910 |
| Totalb | 11,467 | 7,389 | 14,362 | 13,486 | 23,251 | 23,251 | 56,469 |
aInferred UMLS cross-references were obtained by UMLS mapping to MeSH, OMIM and SNOMED-CT concepts.
bTotal number of unique cross-references.
The overlap between disease vocabularies.
| DO | ORDO | CTD | NDF-RT | KEGG | HPO-D | NCI | |
|---|---|---|---|---|---|---|---|
| DO | 8,757a | 1,492 | 3,432 | 3,200 | 1,102 | 1,278 | 2,092 |
| ORDO | 1,952 | 4,799a | 3,503 | 1,465 | 2,383 | 4,170 | 993 |
| CTD | 6,413 | 6,418 | 11,898a | 6,701 | 6,607 | 2,883 | 2,345 |
| NDF-RT | 2,878 | 1,091 | 4,660 | 4,700a | 775 | 876 | 1,686 |
| KEGG | 854 | 1,105 | 1,303 | 718 | 1,359a | 1,102 | 522 |
| HPO-D | 2,291 | 5,426 | 3,818 | 1,169 | 3,102 | 7,520a | 818 |
| NCI | 3,692 | 888 | 1,004 | 1,865 | 625 | 694 | 4,252a |
The number of identical terms between any two disease vocabularies indicates the degree of overlap between them. The off-diagonal numbers in the overlap matrix can be interpreted as the number of terms from the row disease resource covered by the column disease resource. For example, DO has common cross-references with 1492 of the 4799 ORDO terms, while ORDO has common cross-references with 1952 of the 8757 DO terms. The diagonal numbers in the overlap matrix indicate the total number of terms in each disease resource.
aThe total number of disease terms in each disease vocabulary.
The percentage overlap between disease vocabularies.
Google analytics reporting for the DO website.
| Time frame | Sessions | Users | Returning visitors | New visitors |
|---|---|---|---|---|
| 8/31/2013–9/1/2014 | 11,517 (total) | 6,206 | 47% | 53% |
| December 2011 | 539 (per month) | 446 | 18% | 82% |
| June 2014 | 1,385 (per month) | 681 | 58% | 42% |
The increased number of visits (sessions), number of users and percent of returning visitors to the DO website over the past year and monthly averages between December 2011 and June 2014.
The number of DO terms mapped to biomedical resources.
| Biomedical resource | Data types | DO disease mappings |
|---|---|---|
| Reactome | Disease to pathway | 770+ |
| Neurocarta | Disease to gene associations | 1,920 |
| FlyBase | Human disease models - Drosophila alleles | 2,289 |
| PRO | Protein-disease | 50 |
| EFO | EFO-DO | 137 |
| NeuroDevNet | Intellectual disabilities | 71 |
| Inherited metabolic disorders | 80 |