| Literature DB >> 25774204 |
Anika Oellrich1, Ramona L Walls2, Ethalinda Ks Cannon3, Steven B Cannon4,5, Laurel Cooper6, Jack Gardiner7, Georgios V Gkoutos8, Lisa Harper4, Mingze He7, Robert Hoehndorf9, Pankaj Jaiswal6, Scott R Kalberer4, John P Lloyd10, David Meinke11, Naama Menda12, Laura Moore6, Rex T Nelson4, Anuradha Pujar12, Carolyn J Lawrence5,7, Eva Huala13.
Abstract
BACKGROUND: Plant phenotype datasets include many different types of data, formats, and terms from specialized vocabularies. Because these datasets were designed for different audiences, they frequently contain language and details tailored to investigators with different research objectives and backgrounds. Although phenotype comparisons across datasets have long been possible on a small scale, comprehensive queries and analyses that span a broad set of reference species, research disciplines, and knowledge domains continue to be severely limited by the absence of a common semantic framework.Entities:
Year: 2015 PMID: 25774204 PMCID: PMC4359497 DOI: 10.1186/s13007-015-0053-y
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Figure 1The method applied to annotate mutant phenotypes from textual descriptions. Textual descriptions from the literature or databases (A), based on observations of mutant plants, are first broken down into atomized statements corresponding to phenes (B) that are then represented with EQ statements (C).
Description of applied ontologies
|
|
|
|
|---|---|---|
| Plant Ontology (PO) [ | Plant anatomy and morphology and development stages |
|
| Gene Ontology (GO) [ | Biological processes, cellular components and molecular functions |
|
| Chemical Entities of Biological Interest ontology (ChEBI) [ | Molecular entities focused on ‘small’ chemical compounds. |
|
| Phenotypic Qualities Ontology (PATO) [ | Phenotypic qualities |
|
| Plant Experimental Conditions Ontology (EO) | Treatments, growing conditions, and/or study types |
|
| NCBI taxonomy (NCBITAXON) | A curated classification and nomenclature for all of the organisms in the public sequence databases. |
|
| Relation Ontology (RO) [ | Core upper-level relations and biology-specific relations |
|
Species-independent ontologies used to form EQ statements. All ontologies were downloaded on 15 March 2014.
The number of EQ statements, genes, genotypes, and phenotypes they were associated with, for six plant species
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
| 5172 | 1260 | 2393 | 2393* | 1385 |
|
| 373 | 180 | 114 | 169 | 117 |
|
| 340 | 271 | 92 | 95 | 86 |
|
| 269 | 174 | 72 | 128 | 90 |
|
| 149 | 99 | 40 | 45 | 40 |
|
| 61 | 39 | 30 | 30* | 24 |
| Total | 6364 | 2023 | 2741 | 2866 | 1742 |
The number of EQ statements, genes, genotypes, and phenotypes they were associated with, for each species.
*#Genotypes equals # genes because no information on alleles was available for these species.
Figure 2Semantic similarity score distributions for inter- and intraspecific pairwise phenotype similarity. When binning all semantic similarity scores across all species, 44% of semantic similarity scores indicate a relatively low phenotypic overlap between genes (semantic similarity range 0–0.1) while 13% show highly similar phenotypes (similarity score range 0.9-1) (A). Distributions of intraspecific scores (pairwise scores where both genotypes belong to the same species) were similar to the overall distribution of scores (B-H).
Figure 3Average semanitic similarity scores for previously derived groupings of Arabidopsis genotypes. The average pairwise semantic similarity for subsets previously identified by [36] ranged from ~0.1 to ~0.9. Subsets are shown grouped by the classes and groups to which they belong.
Figure 4This figures illustrates the usage of Plant PhenomeNET for the maize gene . After searching for the gene (A), search results are returned (B) and assigned and inferred phenes are shown (C), as well as semantically similar phenotypes from other genes (D). See text for more details.