| Literature DB >> 23046625 |
Frank Loebe1, Frank Stumpf, Robert Hoehndorf, Heinrich Herre.
Abstract
BACKGROUND: Phenotype ontologies are used in species-specific databases for the annotation of mutagenesis experiments and to characterize human diseases. The Entity-Quality (EQ) formalism is a means to describe complex phenotypes based on one or more affected entities and a quality. EQ-based definitions have been developed for many phenotype ontologies, including the Human and Mammalian Phenotype ontologies.Entities:
Year: 2012 PMID: 23046625 PMCID: PMC3448528 DOI: 10.1186/2041-1480-3-S2-S5
Source DB: PubMed Journal: J Biomed Semantics
Figure 1EQ model. In the schematic part, E stands for entity, Q for quality, and R for relational quality. Schema a) accounts for a simple (unary or non-relational) quality, while b) refers to relational qualities. The examples c) and d) correspond to those from the text. Example e) in parallel with d) is forestalling the problem of inter-modeler consistency from the Methods section. 'conc.' in d) and e) abbreviates concentration of.
Figure 2Roles-as-properties: Ontological roles encoded as OWL properties. PB stands for phenotype bearer in the schematic part.
Figure 3Roles-as-classes: Ontological roles modeled as classes in OWL.
Figure 4Relator-based-quality: Relators characterized by qualities.
Summary of the main features of the discussed approaches.
| Feature | EQ | RP | RC | RQ | |
|---|---|---|---|---|---|
| A | role information | no | yes | yes | yes |
| B | unlimited arity of relations | no (yes) | yes | yes | yes |
| C | variable arity of relations | no | yes | yes | yes |
| D | straight-forward database support | yes | partially | partially | partially |
| E | max. nr. of relevant vocabulary | 2/0 | 0/ | 2/ | |
| F | add. characterization of relations | no | no | no | yes |
Abbreviations of the approaches: EQ: entity-quality, RP: roles-as-properties, RC: roles-as-classes, RQ: relator-based-quality. The entry of '(yes)' in line B, column EQ, reflects the discussed extensibility of EQ. The numbers in line E count vocabulary elements (OWL properties or classes) introduced in each approach. The first number is the number of fixed elements (applicable to all relational qualities), in case of EQ this is two for inheresIn and towards. The second number is the maximal number of elements required per n-ary relational quality, e.g. n OWL properties encoding n roles and 1 OWL class encoding the relation itself for RP. The variable X in column RQ stands for the respective number of the RP or RC columns, depending on the relation model that is combined with RQ.
Sample phenotype table for annotation databases.
| Phenotype Table | |||||
|---|---|---|---|---|---|
| 0 | 0 | red | eye | ||
| 1 | 1 | concentration of | concentrator | spleen | abnormal |
| 2 | 1 | concentration of | concentrated | iron | abnormal |
| 3 | 2 | increased concentration | concentrator | spleen | |
| 4 | 2 | increased concentration | concentrated | iron | |
Tables 2-4 form an integrated set of database tables, as an example of one implementation option for annotation databases. The present table stores pure phenotype descriptions. It supports non-relational and relational qualities, in the latter case of arbitrary arity and role type. For this purpose, role types do not correspond to columns (attributes) of the table, but they are stored in rows. Rows of a single phenotype description have the same phenotype identifier assigned to them. The sample entries repeat examples of the main text (with natural language terms for readability), but are arbitrary otherwise.
Abbreviations: r_id: row identifier, p_id: phenotype identifier, N/A: not available. Empty positions are to be read as null values.
Sample entity table for annotation databases.
| Entity Table | ||
|---|---|---|
| . . . | . . . | |
| 0 | . . . | . . . |
| 1 | . . . | . . . |
| 2 | . . . | . . . |
| 3 | . . . | . . . |
The database table for the description of entities to be annotated with phenotypic information can adhere to an arbitrary schema in the context of Tables 2-4. Only entity identifiers must be presumed for Table 4. This entity table is merely shown for completeness.
Abbreviation: e_id: entity identifier.
Sample annotation table for annotation databases.
| Annotation Table (for phenotypic annotations) | |||
|---|---|---|---|
| 0 | 0 | ||
| 0 | 1 | 5 | mg/kg |
| 1 | 2 | ||
| 2 | 2 | ||
Assuming phenotypes and entities according to Tables 2 and 3, the actual annotations of entities with phenotypes are kept separately in this table, since this is a many-to-many-relationship. The two additional columns shall indicate a simple option to capture quantitative data. Notably, the qualifier column of Table 2 is placed therein for coherence with current EQ descriptions. That column may alternatively be part of the present table.
Abbreviations: e_id: entity identifier, p_id: phenotype identifier. Empty positions are to be read as null values.