| Literature DB >> 30097014 |
Ying Shen1, Daoyuan Chen1, Buzhou Tang2, Min Yang3, Kai Lei4.
Abstract
BACKGROUND: Entropy has become increasingly popular in computer science and information theory because it can be used to measure the predictability and redundancy of knowledge bases, especially ontologies. However, current entropy applications that evaluate ontologies consider only single-point connectivity rather than path connectivity, and they assign equal weights to each entity and path.Entities:
Keywords: Big data and semantics; Entropy-based metric; Knowledge representation; Ontology evaluation; Ontology modeling
Mesh:
Year: 2018 PMID: 30097014 PMCID: PMC6086046 DOI: 10.1186/s13326-018-0188-7
Source DB: PubMed Journal: J Biomed Semantics
Fig. 1IDDAP ontology—Amoxicillin. The green nodes represent contraindications between Amoxicillin and other drugs (relation_ contraindication_drug). The red nodes identify bacteria that can be treated by Amoxicillin (relation_antibiotics_bacteria). The pink nodes indicate the relationships between infection sites and bacteria (relation_infs_bacteria). The gray nodes specify the relationships between diseases and bacteria (relation_disease_bacteria). Finally, the yellow nodes show the relationships between diseases and complications (relation_disease_ complication)
Summary statistics of ontologies
| IDO | IDODEN | DO | IDDAP | |
|---|---|---|---|---|
| Triples with annotation/data | 3901 | 23,657 | 129,670 |
|
| Triples without annotation/data | 960 | 5845 | 10,060 |
|
| Class/Instance/Entity | 507 | 5007 | 11,088 |
|
| Subclassof | 582 | 5834 | 10,008 |
|
| Equivalent classes | 81 | 0 |
|
|
| Disjoint classes | 17 | 11 |
|
|
| Object property |
| 25 | 20 | 8 |
| Annotation property |
|
| 33 | 1 |
Performance comparison result on IDO, IDODEN, DO and IDDAP
| Calmet [ | Doran [ | Gurupur [ | EAPB | |
|---|---|---|---|---|
| Entropy evaluation - Pearson coefficient | ||||
| IDO | 0.1791 | 0.2352 | 0.2488 |
|
| IDODEN | 0.2229 | 0.2494 | 0.2862 |
|
| DO | 0.3387 | 0.3620 | 0.3962 |
|
| IDDAP | 0.3729 | 0.3936 | 0.4215 |
|
| Entropy evaluation - Spearman rank correlation | ||||
| IDO | 0.1727 | 0.2121 | 0.2131 |
|
| IDODEN | 0.2254 | 0.2367 | 0.2581 |
|
| DO | 0.3173 | 0.3616 | 0.3844 |
|
| IDDAP | 0.3238 | 0.3857 | 0.4023 |
|
Fig. 2Granularity observation in the IDO, IDODEN and DO ontologies. From left to right: the granularity decreases due to the increase of sparseness of relationship types in ontology schema
Fig. 3Ontology visualization for the node representations of IDO, IDODEN, DO and IDDAP. From left to right: the redundancy increases due to the structure sparseness, relation incompleteness, and unclear node definition
Fig. 4An example of the structural and textual information of IDODEN ontology. The higher the EAPB calculated information gain is, the more information the attention model highlights
Entropy evaluation result on IDO, IDODEN, DO and IDDAP (with ablation study)
| IDO | IDODEN | DO | IDDAP | |
|---|---|---|---|---|
| EAPB entropy evaluation | 5.0554 | 4.4507 | 8.2320 | 10.4234 |
| w/o text | 5.7469 | 8.4408 | 9.0656 | 14.5873 |
Fig. 5An illustration of text-based embedding