| Literature DB >> 26713234 |
Anne E Thessen1, Daniel E Bunker2, Pier Luigi Buttigieg3, Laurel D Cooper4, Wasila M Dahdul5, Sami Domisch6, Nico M Franz7, Pankaj Jaiswal4, Carolyn J Lawrence-Dill8, Peter E Midford9, Christopher J Mungall10, Martín J Ramírez11, Chelsea D Specht12, Lars Vogt13, Rutger Aldo Vos14, Ramona L Walls15, Jeffrey W White16, Guanyang Zhang7, Andrew R Deans17, Eva Huala18, Suzanna E Lewis10, Paula M Mabee5.
Abstract
Understanding the interplay between environmental conditions and phenotypes is a fundamental goal of biology. Unfortunately, data that include observations on phenotype and environment are highly heterogeneous and thus difficult to find and integrate. One approach that is likely to improve the status quo involves the use of ontologies to standardize and link data about phenotypes and environments. Specifying and linking data through ontologies will allow researchers to increase the scope and flexibility of large-scale analyses aided by modern computing methods. Investments in this area would advance diverse fields such as ecology, phylogenetics, and conservation biology. While several biological ontologies are well-developed, using them to link phenotypes and environments is rare because of gaps in ontological coverage and limits to interoperability among ontologies and disciplines. In this manuscript, we present (1) use cases from diverse disciplines to illustrate questions that could be answered more efficiently using a robust linkage between phenotypes and environments, (2) two proof-of-concept analyses that show the value of linking phenotypes to environments in fishes and amphibians, and (3) two proposed example data models for linking phenotypes and environments using the extensible observation ontology (OBOE) and the Biological Collections Ontology (BCO); these provide a starting point for the development of a data model linking phenotypes and environments.Entities:
Keywords: Biodiversity; Data integration; Environment; Ontology; Phenotype; Semantic web
Year: 2015 PMID: 26713234 PMCID: PMC4690371 DOI: 10.7717/peerj.1470
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Ontology and the Heterogeneity Challenge.
This diagram demonstrates how ontologies can solve the challenge of heterogeneous terminology. In this example, the database contains four different natural language descriptions about dinoflagellate chloroplasts harvested from text. A user needs to query the database for instances of dinoflagellates with yellow chromatophores. Without an ontology to provide information to the query-engine about synonymy (“chromatophore” = “chloroplast”) and term relationships (“brownish-yellow”, “golden”, and “yellow gold” are subtypes of “yellow”), a query for “yellow chromatophore” will only yield one of the four results the user needs and would find using an ontology. Without an ontology to link closely related concepts with a common parent, and reconcile heterogeneous terms, a user would have to perform many more queries to get a desired result, which may not be tractable in a large dataset.
List of resources (vocabularies and ontologies) relevant to annotating phenotypes and environments.
| Name | Abbreviation | URL | Reference |
|---|---|---|---|
| AGROVOC | 1 | ||
| Behavioral Ontology | NBO | 2 | |
| Biological Collections Ontology | BCO | 3 |
|
| Biological Spatial Ontology | BSPO | 4 |
|
| Chemical Entities of Biological Interest | ChEBI | 5 |
|
| CMECS Habitat Classification | 6 | ||
| Crop Ontology | CO | 7 |
|
| Eagle-i Resource Ontology | ERO | 8 | |
| EcoLexicon | 9 | ||
| Ecological Classifications NatureServe | 10 | ||
| Environment Ontology | ENVO | 11 |
|
| EUNIS Habitat Classification | 12 | ||
| Experimental Factor Ontology | EFO | 13 |
|
| Exposure ontology | EXO | 14 | |
| Fission Yeast Phenotype Ontology | FYPO |
| |
| Flora Phenotype Ontology | FLOPO | 15 |
|
| Floristic Regions of the World | 16 |
| |
| Fungal gross anatomy | FAO | 17 | |
| Gazetteer | GAZ | 18 | |
| Gene Ontology | GO | 19 |
|
| GeoNames | 20 | ||
| Getty Thesaurus of Geographic Names | 21 | ||
| Global Administrative Areas | GADM | 22 | |
| Human Phenotype Ontology | HP |
| |
| Information Artifact Ontology | IAO | 23 |
|
| International Consortium for Agricultural Systems Applications standards | ICASA | 24 |
|
| IUCN Habitats Classification Scheme | 25 | ||
| Mammalian phenotype | MP | 26 |
|
| Mapping European Seabed Habitats | MESH | 27 | |
| NASA GCMD keyword list for locations | 28 | ||
| Observation Ontology | OBOE | 29 |
|
| Ontology of Biological Attributes | OBA | 30 | |
| Ontology of Biomedical Investigation | OBI | 31 |
|
| Ontology of Microbial Phenotypes | OMP | 32 |
|
| Phenotype Quality Ontology | PATO | 33 |
|
| Plant Environment Ontology | EO | 34 | |
| Plant Ontology | PO | 35 |
|
| Plant Trait Ontology | TO | 36 |
|
| Population and Community Ontology | PCO | 37 | |
| Relation Ontology | RO | 38 | |
| Semantic Web for Earth and Environmental Terminology | SWEET | 39 |
|
| Sequence Ontology | SO | 40 |
|
| Terminology of Grazing Lands and Grazing Animals |
| ||
| Uber Anatomy Ontology | UBERON | 41 | |
| Worm Phenotype | WBPhenotype | 42 |
|
| WWF Ecozones | 43 |
Notes.
1, http://aims.fao.org/agrovoc#.VG4QG_nF_ng; 2, https://code.google.com/p/behavior-ontology/; 3, https://github.com/tucotuco/bco; 4, https://code.google.com/p/biological-spatial-ontology/; 5, https://www.ebi.ac.uk/chebi/; 6, https://marinemetadata.org/references/cmecshabitat; 7, http://pantheon.generationcp.org/index.php?option=com_content&task=section&id=7&Itemid=35; 8, https://www.eagle-i.net/; 9, http://ecolexicon.ugr.es/en/index.htm; 10, http://explorer.natureserve.org/classeco.htm; 11, http://www.environmentontology.org; 12, https://marinemetadata.org/references/eunishabitat; 13, http://www.ebi.ac.uk/efo/; 14, http://www.obofoundry.org/cgi-bin/detail.cgi?id=exo; 15, http://www.pombase.org/; 16, http://wiki.pro-ibiosphere.eu/wiki/Traits_Task_Group; 17, http://www.yeastgenome.org/fungi/fungal_anatomy_ontology/; 18, http://bioportal.bioontology.org/ontologies/GAZ; 19, http://geneontology.org/; 20, http://www.geonames.org/; 21, http://www.getty.edu/research/tools/vocabularies/tgn/index.html; 22, http://www.gadm.org/; 23, http://www.human-phenotype-ontology.org/; 24, https://code.google.com/p/information-artifact-ontology/; 25, http://www.iucnredlist.org/technical-documents/classification-schemes/habitats-classification-scheme-ver3; 26, http://www.informatics.jax.org/searches/MP_form.shtml; 27, http://www.emodnet-seabedhabitats.eu/; 28, https://marinemetadata.org/references/cfregions; 29, https://semtools.ecoinformatics.org/oboe; 30, http://wiki.geneontology.org/index.php/Extensions/x-attribute; 31, http://obi-ontology.org/page/Main_Page; 32, http://microbialphenotypes.org/wiki/index.php/Main_Page; 33, http://obofoundry.org/wiki/index.php/PATO:Main_Page; 34, http://planteome.org/amigo/cgi-bin/crop_amigo/browse.cgi?; 35, http://www.plantontology.org/; 36, http://planteome.org/amigo/cgi-bin/crop_amigo/browse.cgi?; 37, https://github.com/PopulationAndCommunityOntology/pco; 38, https://github.com/oborel/obo-relations; 39, https://sweet.jpl.nasa.gov/; 40, http://www.sequenceontology.org/; 41, http://uberon.github.io/; 42 http://www.wormbase.org/; 43, http://wwf.panda.org/about_our_earth/ecoregions/ecoregion_list/.
Figure 2Manual workflow conceptual diagram.
This diagram shows the manual workflow to link phenotype and environment data sets using current tools and services.
Figure 3Using the OBOE ontology to link phenotype and environment.
This demonstrates linking phenotype and environment using instances of the OBOE classes Entity, Observation, and Measurement. (A) Links between Entity, Observation, and Measurement OBOE classes. (B) Example measurements of phenotypes and environments using instances of the OBOE classes. Numbered measurement instances are consistent across A and B. This representation is simplified with regards to the taxonomic entities in play (Baskauf & Webb, 2015).
Figure 4Using the Biological Collections Ontology (BCO) to link phenotype and environment.
This demonstrates linking phenotype and environment using classes and relations from the Biological Collections Ontology (BCO). (A) A simple version of the classes and relations used to describe observations in the BCO, with classes imported from OBI (Ontology for Biomedical Investigations), IAO (Information Artifact Ontology), and BFO (Basic Formal Ontology). (B) Links among organism, phenotype, and environment, using the BCO model, using the same data as in Fig. 5. Light grey boxes represent either literal values (e.g., Hyla plicata), or instances of classes from external ontologies (ENVO, Environment Ontology; UBERON, Uber Anatomy Ontology; PATO, Phenotype Quality Ontology). Properties with a dwc prefix are imported directly from Darwin Core.
Figure 5Map of miniaturized fishes and their non-miniaturized sister taxa.
This map shows locations of fish species exhibiting the miniaturized phenotype (red circles) and their non-miniature sister taxa (blue squares). The georeferenced occurrence data were gathered from GBIF.
Results of a logistic regression comparing temperature and precipitation in the habitats of fishes with and without the miniaturization phenotype.
| Variable | Coefficient | Std. error | Confidence interval 2.5% | Confidence interval 97.5% | ||
|---|---|---|---|---|---|---|
| Annual mean temperature (°C) | 1.6 × 10−1 | 5.3 × 10−2 | 3.033 | 0.00242 | 6.6 × 10−2 | 2.8 × 10−1 |
| Annual mean precipitation (mm) | 3.4 × 10−9 | 1.6 × 10−9 | 2.139 | 0.03243 | 7.6 × 10−10 | 7.1 × 10−9 |
Some of the URIs used to describe amphibian breeding and development in TraitBank.
| Term | URI |
|---|---|
| Breeding habitat |
|
| Development mode |
|
| Terrestrial habitat |
|
| Intermittent pond |
|
| Permanent pond |
|
| Freshwater stream |
|
| Direct development |
|
| Larval development |
|
| Paedomorphic |
|
Breeding habitat and developmental mode for 282 species of amphibians.
| Larval | Direct |
| Test statistic |
| |
|---|---|---|---|---|---|
| Freshwater stream | 30 | 0 | 3 | 270.9 | 0.352 |
| Intermittent pond | 28 | 0 | |||
| Permanent pond | 59 | 2 | |||
| Terrestrial | 2 | 166 |