| Literature DB >> 23303300 |
Matthew A Bertone1, István Mikó, Matthew J Yoder, Katja C Seltmann, James P Balhoff, Andrew R Deans.
Abstract
Matching is an important step for increasing interoperability between heterogeneous ontologies. Here, we present alignments we produced as domain experts, using a manual mapping process, between the Hymenoptera Anatomy Ontology and other existing arthropod anatomy ontologies (representing spiders, ticks, mosquitoes and Drosophila melanogaster). The resulting alignments contain from 43 to 368 mappings (correspondences), all derived from domain-expert input. Despite the many pairwise correspondences, only 11 correspondences were found in common between all ontologies, suggesting either major intrinsic differences between each ontology or gaps in representing each group's anatomy. Furthermore, we compare our findings with putative correspondences from Bioportal (derived from LOOM software) and summarize the results in a total evidence alignment. We briefly discuss characteristics of the ontologies and issues with the matching process.Entities:
Mesh:
Year: 2013 PMID: 23303300 PMCID: PMC3540420 DOI: 10.1093/database/bas057
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
General statistics of the ontologies examined in this article
| Subject (Ontology | # Valid classes | # Obsolete classes | Proportion of classes with definitions (%) (total with definitions) | # Species covered currently (potential coverage) | Version date |
|---|---|---|---|---|---|
| Hymenoptera (HAO) | 1786 | 64 | 100 (1786) | ∼150 000 (∼1 million) | 24:01:2011 09:40 |
| Spiders (SPD) | 552 | 25 | 73 (404) | ∼40 000 (∼150 000) | 17:03:2010 06:57 |
| Ticks (TADS) | 628 | 0 | 99 (627) | ∼900 | 18:11:2007 11:42 |
| Mosquitoes (TGMA) | 1861 | 0 | 100 (1861) | ∼3500 (∼4500) | 04:02:2009 10:45 |
| 6884 | 162 | 47 (3239)b | 1 | 24:11:2010 15:26 |
aFull ontology names from the OBO Foundry [http://www.obofoundry.org/; (1)] are as follows: HAO, Hymenoptera Anatomy Ontology; SPD, Spider Ontology; TADS, Tick gross anatomy; TGMA, Mosquito gross anatomy; FBbt, Drosophila gross anatomy. b275 of these definitions are represented only by ‘.’; the percentage of worded definitions is 43% (2964).
Figure 1Cytoscape visualization showing the HAO full ontology network (black circle nodes and gray lines) with correspondences mapped from the other arthropod anatomy ontologies: SPD (purple diamonds); TADS (green squares); TGMA (red octagons); FBbt (blue triangles). Box A represents an area of general agreement between the ontologies, showing multiple correspondences from each ontology (largely consisting of CARO and many general body classes); further magnification (A2, represented by dashed box in A) reveals nodes with many correspondences from different ontologies (arrow). Box B represents an area with fewer correspondences, mainly from FBbt with some TADS (largely consisting of specific muscle classes not present in TGMA and SPD); further magnification (B2, represented by dashed box in B) reveals instances where one class from FBbt is aligned to multiple HAO classes (arrow; many to one relationship).
Summary of correspondences found during the manual alignment process between source arthropod ontologies and the target ontology, the HAO
| Source ontology | # Correspondences | Correspondences as % of valid classes (source/target) | # Correspondences from CARO aligned | # Correspondence direct superclass | # Correspondence direct superclass |
|---|---|---|---|---|---|
| Spiders (SPD) | 43 | 7.8/2.4 | 4 (9.3) | 14/10 | 5/13 |
| Ticks (TADS) | 82 | 13.1/4.6 | 45 (54.9) | 49/14 | 12/19 |
| Mosquitoes (TGMA) | 307 | 16.5/17.2 | 30 (9.8) | 79/84 | 85/252 |
| 368 | 5.4/20.6 | 18 (4.9) | 97/327 | 35/118 |
aNumber of HAO Common Anatomy Reference Ontology (19) classes aligned (some putative CARO classes were not cited as belonging to CARO in all source ontologies). bNumber of direct is_a or part_of superclasses that are (yes) or are not (no) additionally represented as matched correspondences (e.g. if A is_a C and B is_a D, then ‘yes’ if the correspondences A to B and C to D are present; if A and B correspond, but C and D do not, then ‘no’).
Figure 2Comparison of the number of correspondences found through manual alignment alone, LOOM-based algorithm alignment alone (available from Bioportal) and using both methods. Only source-target alignments with results from both methods (SPD-HAO, TGMA-HAO and FBbt-HAO) are shown. Correspondences found by LOOM alone are further characterized as valid (overlooked during the manual alignment), mismatched (invalid correspondences) or other (errors; see text).
List of relations used in each ontology
| Hymenoptera | Spiders | Ticks | Mosquitoes | |
|---|---|---|---|---|
| (HAO) | (SPD) | (TADS) | (TGMA) | (FBbt) |
Bold relations denote those shared by all. aDoes not include is_a, disjoint_from and others that are implicit in the OBO format (http://www.geneontology.org/GO.format.obo-1_4.shtml).