| Literature DB >> 25937880 |
Nina Laurenne1, Jouni Tuominen1, Hannu Saarenmaa2, Eero Hyvönen1.
Abstract
BACKGROUND: The scientific names of plants and animals play a major role in Life Sciences as information is indexed, integrated, and searched using scientific names. The main problem with names is their ambiguous nature, because more than one name may point to the same taxon and multiple taxa may share the same name. In addition, scientific names change over time, which makes them open to various interpretations. Applying machine-understandable semantics to these names enables efficient processing of biological content in information systems. The first step is to use unique persistent identifiers instead of name strings when referring to taxa. The most commonly used identifiers are Life Science Identifiers (LSID), which are traditionally used in relational databases, and more recently HTTP URIs, which are applied on the Semantic Web by Linked Data applications.Entities:
Keywords: HTTP URI; LSID; Linked data; Ontology; Scientific name; Semantic web; Species checklist; Taxonomic concept
Year: 2014 PMID: 25937880 PMCID: PMC4417522 DOI: 10.1186/2041-1480-5-40
Source DB: PubMed Journal: J Biomed Semantics
Figure 1The structure of an LSID. An LSID of a cerambycid beetle species obtained from the Catalogue of Life database.
The main differences between LSIDs and HTTP URIs
| Life science | HTTP URIs | |
|---|---|---|
| identifiers | ||
| Standardised by | Object Management | Internet Engineering |
| Group [ | Task Force [ | |
| Reuse existing | Defines a new | Uses an |
| URI schemes | URN subscheme | existing scheme |
| Data retrieval/ | Specific resolving | Uses existing |
| dereferenceability | service needed | web technology |
| (DNS, web servers) | ||
| Structure of identifier | Strict | Flexible |
| Linked Data compatibility | No | Yes |
Figure 2A simplified structure of the relational taxonomic database. The boxes illustrate the tables of the database, and the lines present the relations between them. LSIDs are given to taxonomic concepts and scientific names (illustrated with a darker colour). Taxonomic concepts are linked to each other using the relations described in Table 3 and each concept is linked to a scientific name. External LSIDs and common names are connected to the concepts. An author reference, validity, and a taxonomic rank are assigned to the scientific names.
The relations used for mapping underlying taxonomic concepts
| Relation between taxa | Intensive | Ostensive | Notation | Properties |
|---|---|---|---|---|
| Congruent | Share the same characters | Share the same species |
| Symmetric, transitive |
| Part of | All characters of a taxon are included in another taxon | All species are included in another taxon |
| Non-symmetric, transitive |
| Overlap | At least one character is shared between taxa, but not all of them | At least one species is shared between taxa, but not all of them |
| Symmetric, non-transitive |
The division into intensional and ostensive relations [35] is only available in TaxMeOn (not in the Taxonomic Database).
Figure 3The core classes of the taxonomic meta-ontology. The classes are illustrated with ellipses (colours are to improve the readability of the figure). The arrows indicate relations (properties) between the classes. The subclass relations are indicated with lighter-coloured arrows and a few examples of the subclasses. To demonstrate how the TaxMeOn model is applied, an example taxon depicted using dotted lines is illustrated. The example taxon is an instance of the class TaxonInChecklist and of a specific taxonomic rank. The properties associated with the example taxon are marked with dotted-line arrows. The properties with literal values are not shown in the figure.
Figure 4Core taxonomic information represented according to the taxonomic meta-ontology. Ferus is described by Mulsant in 1839 and it belongs to the genus Arhopalus.
The core properties of the Taxonomic Meta-Ontology and their explanations
| Property | Explanation |
|---|---|
|
| |
| occursInChecklist | Reference to a species checklist |
| auctorumYear | The year of original publication |
| completeAuctorumString | Author name(s) expressed according |
| to the established practices of taxonomy | |
|
| |
| hasNonvalidName | Expresses synonyms, homonyms |
| and orthographic variants of a valid | |
| scientific name | |
| hasVernacularName | The common name equivalents for |
| the scientific names | |
| hasNomenclaturalCode | Specifies the set of rules that are applied |
| (ICN [ | |
| hasVernacularNameStatus | Expresses whether a common name is |
| accepted or an alternative one | |
| rdf:type | Expresses the hierarchical level in |
| a classification. The ranks are obtained | |
| from TDWG Taxon Rank LSID Ontology [ | |
| Every taxon is an instance of a specific | |
| taxonomic rank and the class | |
|
|
See other properties in the Results section, in subsection Taxonomic Meta-Ontology.
A comparison of the features of the taxonomic database and the taxonomic meta-ontology
| Taxonomic | TaxMeOn | |
|---|---|---|
| database | ||
|
| ||
| Structure easily | No | Yes |
| extensible | ||
| Global linkability | No | Yes |
| to other contents | ||
| Public interfaces | Simple search API, | HTTP and SOAP APIs, |
| LSID resolver | Linked Data, | |
| SPARQL endpoint | ||
| Need of a resolver | Yes | No |
| Content editing | Web interface | SAHA [ |
|
| ||
| Granularity of | Low | High |
| taxonomic information | ||
| Linking additional | No | Yes |
| scientific publications | ||
| Treatment of botanical | Identical | Not identical |
| and zoological names | ||
| Semantics applied to | No | Yes |
| author names | ||
| Tracking temporal | Publication year | Versioning of checklists |
| changes | of a checklist | (static) and duplication |
| of taxa (dynamic) |
Figure 5Taxonomic changes in relation to time presented in RDF. (A) The change is modelled as an instance. (B) The relation is modelled as an instance. The instances are depicted as lighter-coloured ellipses and literals as rectangles.