| Literature DB >> 29316968 |
Daniela Oliveira1,2, Catia Pesquita3.
Abstract
BACKGROUND: Ontologies are commonly used to annotate and help process life sciences data. Although their original goal is to facilitate integration and interoperability among heterogeneous data sources, when these sources are annotated with distinct ontologies, bridging this gap can be challenging. In the last decade, ontology matching systems have been evolving and are now capable of producing high-quality mappings for life sciences ontologies, usually limited to the equivalence between two ontologies. However, life sciences research is becoming increasingly transdisciplinary and integrative, fostering the need to develop matching strategies that are able to handle multiple ontologies and more complex relations between their concepts.Entities:
Keywords: Algorithms; Biomedical ontologies; Ontology alignment
Mesh:
Year: 2018 PMID: 29316968 PMCID: PMC5761129 DOI: 10.1186/s13326-017-0171-8
Source DB: PubMed Journal: J Biomed Semantics
Biomedical ontologies downloaded from the OBO Foundry in May 2015 (http://obo.sourceforge.net)
| Ontology | Acronym | Classes | Names | Reference |
|---|---|---|---|---|
| Cell type | CL | 4775 | 4375 | [ |
| Foundational model of anatomy | FMA | 78977 | 126190 | [ |
| Gene ontology (biological process domain) | GO | 43048 | 276577 | [ |
| Human phenotype | HP | 28621 | 18431 | [ |
| Mammalian phenotype | MP | 28643 | 29592 | [ |
| Neuro behaviour ontology | NBO | 116710 | 1168 | [ |
| Phenotypic quality | PATO | 2497 | 3378 | [ |
| Uber anatomy ontology | UBERON | 18322 | 50713 | [ |
| WBP | 2290 | 2739 | [ |
Examples of the patterns found in a manual analysis of binary alignments
| Pattern | Source URI and label | Target URI and label |
|---|---|---|
| Addition | WBP:0001911 axon regeneration | GO:0031103 axon regeneration |
| Variation | MP:0002269 musc | GO:0014889 musc |
| Combination | MP:0013527 | CL:2000084 conjunctiva goblet cell |
| Full match | MP:0002119 dipsosis | NBO:0000541 dipsosis |
| Synonym | HP:0010108 aplasia of the | FMA:25047 big toe |
| None | MP:0002229 neurodegeneration | GO:0070657 neuromast regeneration |
Distributions of mappings fitting lexical patterns 1 or 2
| Matcher | Ontology | Addition | Variation | Size |
|---|---|---|---|---|
| String Matcher | MP-CL | 26 | 7 | 34 |
| MP-GO | 287 | 210 | 501 | |
| MP-NBO | 354 | 205 | 594 | |
| MP-UBERON | 58 | 11 | 71 | |
| WBP-GO | 182 | 137 | 322 | |
| HP-FMA | 272 | 23 | 304 | |
| MP-PATO | 18 | 1 | 29 | |
| WBP-PATO | 28 | 2 | 41 | |
| HP-PATO | 12 | 1 | 25 | |
| Word Matcher | MP-CL | 4 | 1 | 5 |
| MP-GO | 32 | 25 | 65 | |
| MP-NBO | 118 | 44 | 219 | |
| MP-UBERON | 42 | 5 | 50 | |
| WBP-GO | 183 | 33 | 219 | |
| HP-FMA | 158 | 44 | 252 | |
| MP-PATO | 33 | 21 | 59 | |
| WBP-PATO | 19 | 1 | 25 | |
| HP-PATO | 6 | 0 | 12 | |
| Reference | MP-CL | 439 | 12 | 474 |
| MP-GO | 805 | 83 | 944 | |
| MP-NBO | 177 | 24 | 219 | |
| MP-UBERON | 1693 | 126 | 1999 | |
| WBP-GO | 256 | 39 | 325 | |
| HP-FMA | 1691 | 66 | 1893 | |
| MP-PATO | 3096 | 35 | 3636 | |
| HP-PATO | 1710 | 8 | 1893 | |
| WBP-PATO | 302 | 4 | 325 | |
| Total | 12001 | 1168 | 14535 |
Fig. 1First-pass recall selection for the first threshold. The left axis shows the values for recall and the right axis shows the normalised averages for the runtime and number of mappings across all sets of ontologies
Fig. 2Longest match precision selection for the second threshold. Longest match precision selection for the second threshold. The left axis indicates the values for precision and recall and the right axis shows the normalised averages across all ontology sets for the runtime
Evaluation results from the comparison with the automatically generated reference alignments with the Top-one Ranked Selector. The “Ref.” column indicates the number of mappings present in the the reference alignments
| Ontology sets | Precision | Recall | F-measure | Ref. |
|---|---|---|---|---|
| MP-CL-PATO | 24.5% | 24.3% | 24.4% | 474 |
| MP-GO-PATO | 62.9% | 60.7% | 61.8% | 944 |
| MP-NBO-PATO | 50.0% | 39.7% | 44.3% | 219 |
| MP-UBERON-PATO | 55.2% | 46.8% | 50.7% | 1999 |
| WBP-GO-PATO | 11.7% | 10.2% | 10.9% | 325 |
| HP-FMA-PATO | 27.3% | 20.3% | 23.3% | 1893 |
Evaluation results from the comparison with the automatically generated reference alignments with the top-two ranked selector
| Ontology sets | Precision | Recall | F-measure | Ref. |
|---|---|---|---|---|
| MP-CL-PATO | 34.9% | 53.0% | 42.0% | 474 |
| MP-GO-PATO | 41.5% | 61.5% | 49.6% | 944 |
| MP-NBO-PATO | 42.7% | 41.1% | 41.9% | 219 |
| MP-UBERON-PATO | 52.8% | 51.4% | 52.1% | 1999 |
| WBP-GO-PATO | 11.6% | 13.5% | 12.5% | 325 |
| HP-FMA-PATO | 24.0% | 22.9% | 23.4% | 1893 |
Comparison of the compound alignments and the compound references
| Ontologies | Correct | Missing | Conflict |
|---|---|---|---|
| MP-CL-PATO | 132 | 158 | 158 |
| MP-GO-PATO | 556 | 204 | 79 |
| MP-NBO-PATO | 84 | 35 | 50 |
| MP-UBERON-PATO | 831 | 390 | 192 |
| WBP-GO-PATO | 31 | 105 | 140 |
| HP-FMA-PATO | 482 | 611 | 196 |
Manual evaluation of mapping subsets
| Ontologies | Conflicts | Missing | |||||
|---|---|---|---|---|---|---|---|
| Reference | Alignment | Both | Agreement | Correct | Incorrect | Agreement | |
| MP-CL-PATO | 0.0% | 3% | 97% | 87% | 62% | 38% | 88% |
| MP-GO-PATO | 60% | 30% | 10% | 41% | 60% | 40% | 76% |
| MP-NBO-PATO | 47% | 40% | 13% | 100% | 78% | 22% | 72% |
| MP-UBERON-PATO | 20% | 40% | 40% | 67% | 84% | 16% | 84% |
| WBP-GO-PATO | 7% | 20% | 73% | 80% | 74% | 26% | 64% |
| HP-FMA-PATO | 3% | 44% | 53% | 75% | 92% | 8% | 100% |
Candidate logical definitions
| Ontology | New mappings | Conflicts | OBO classes |
|---|---|---|---|
| MP | 335 | 442 | 7694 |
| WBP | 72 | 140 | 957 |
| HP | 498 | 169 | 14059 |
Evaluation of the plant based alignments
| T1 | T2 | CO-PO-PATO | TO-PO-PATO | ||||
|---|---|---|---|---|---|---|---|
| Found | Correct | Time | Found | Correct | Time | ||
| 0.1 | 0.9 | 14 | 93% | 20s | 259 | 96% | 149s |
| 0.1 | 0.7 | 45 | 36% | 20s | 487 | 55% | 169s |
| 0.3 | 0.85 | 4 | 100% | 6s | 152 | 95% | 15s |
| 0.5 | 0.9 | 0 | 0% | 5s | 25 | 92% | 7s |