| Literature DB >> 20519284 |
Gilles Parmentier1, Frederic B Bastian, Marc Robinson-Rechavi.
Abstract
MOTIVATION: The anatomy of model species is described in ontologies, which are used to standardize the annotations of experimental data, such as gene expression patterns. To compare such data between species, we need to establish relations between ontologies describing different species.Entities:
Mesh:
Year: 2010 PMID: 20519284 PMCID: PMC2894521 DOI: 10.1093/bioinformatics/btq283
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Homolonto pairwise alignment architecture. O1 and O2 are ontologies to align. P and P′ are lists of propositions. H is a list of validated homologies (invalidation information). A is the final alignment, generated when the user chooses to stop iterations. User input appears twice: to propose original pairings, and to validate propositions.
Summary of the alignments discussed
| Zebrafish | Xenopus | Human | Mouse | |
|---|---|---|---|---|
| Ontology | ZFA | XAO | EHDAA | EMAPA |
| Number of terms | 1974 | 569 | 2327 | 3525 |
| with synonyms | 1080 | 122 | 0 | 0 |
| with definitions | 772 | 186 | 0 | 0 |
| Number of validations | 189 | 1959 | ||
| Number of invalidations | 543 | 1003 | ||
| Number of unique terms aligned | 183 | 182 | 1541 | 1754 |
aReferences for the ontologies aligned are: ZFA (Sprague et al., 2006) (version of 24:10:2007); XAO (Bowes et al., 2008) (version of 07:11:2007); EHDAA (Aitken, 2005; Hunter et al., 2003) (version of 08:04:2005); EMAPA (Aitken, 2005; Baldock et al., 2003) (version of 08:04:2005).
bIncluding ‘partial’ validations.
Examples of false positives and false negatives
| Term 1 | Term 2 | Homolonto result | Frequency of shared words |
|---|---|---|---|
| XAO:0000399 tendon fibroblast | ZFA:0009296 perijunctional fibroblast | False positive | 3 |
| EMAPA:16370 cardiovascular system (part_of extraembryonic component) | EHDAA:394 cardiovascular system (part_of organ system part_of embryo) | False positive | 3 |
| EMAPA:16754 central nervous system (part_of tail) | EHDAA:828 central nervous system (part_of nervous system) | False positive | 3 |
| XAO:0000385 pronephric sinus (part_of pronephric kidney) | ZFA:0001557 pronephric glomerulus (part_of pronephros) | False positive | 36 |
| XAO:0000119 lung (part_of respiratory system) | ZFA:0000354 gill (part_of respiratory system) | False positive | – |
| XAO:0000355 fourth aortic arch | ZFA:0005008 aortic arch 4 | False negative | 43 |
| EMAPA:17340 right ventricle (part_of ventricle) | EHDAA:1916 right part (part_of ventricle) | False negative | 67 |
| EMAPA:17853 naso-lacrimal duct (part_of nose) | EHDAA:7837 nasolacrimal duct (part_of nasolacrimal groove) | False negative | 75 |
| XAO:0000050 mesoderm (part_of embryo) | ZFA:0000041 mesoderm (part_of primary germ layer) | False negative | 183 |
aSum of frequencies in the two ontologies being compared.
bProposition with a high score between non-homologous structures.
cProposition with a low score between homologous structures.
dNo proposition reported between homologous structures.