| Literature DB >> 33205138 |
Elizabeth Arnaud1, Marie-Angélique Laporte1, Soonho Kim2, Céline Aubert3, Sabina Leonelli4, Berta Miro5, Laurel Cooper6, Pankaj Jaiswal6, Gideon Kruseman7, Rosemary Shrestha8, Pier Luigi Buttigieg9, Christopher J Mungall10, Julian Pietragalla11, Afolabi Agbona12, Jacqueline Muliro13, Jeffrey Detras14, Vilma Hualla15, Abhishek Rathore16, Roma Rani Das16, Ibnou Dieng17, Guillaume Bauchet18, Naama Menda18, Cyril Pommier19, Felix Shaw20, David Lyon18, Leroy Mwanzia21, Henry Juarez15, Enrico Bonaiuti22, Brian Chiputwa23, Olatunbosun Obileye24, Sandrine Auzoux25,26, Esther Dzalé Yeumo27, Lukas A Mueller18, Kevin Silverstein28, Alexandra Lafargue29, Erick Antezana30,31, Medha Devare3, Brian King32.
Abstract
Heterogeneous and multidisciplinary data generated by research on sustainable global agriculture and agrifood systems requires quality data labeling or annotation in order to be interoperable. As recommended by the FAIR principles, data, labels, and metadata must use controlled vocabularies and ontologies that are popular in the knowledge domain and commonly used by the community. Despite the existence of robust ontologies in the Life Sciences, there is currently no comprehensive full set of ontologies recommended for data annotation across agricultural research disciplines. In this paper, we discuss the added value of the Ontologies Community of Practice (CoP) of the CGIAR Platform for Big Data in Agriculture for harnessing relevant expertise in ontology development and identifying innovative solutions that support quality data annotation. The Ontologies CoP stimulates knowledge sharing among stakeholders, such as researchers, data managers, domain experts, experts in ontology design, and platform development teams.Entities:
Keywords: Big Data; Community of Practice; FAIR data; agriculture; agrifood systems; data annotation; data labeling; knowledge representation; ontologies; semantics for agriculture
Year: 2020 PMID: 33205138 PMCID: PMC7660444 DOI: 10.1016/j.patter.2020.100105
Source DB: PubMed Journal: Patterns (N Y) ISSN: 2666-3899
Criteria Established by CoP Experts to Characterize the Quality of Ontologies for Data Annotation
| Criteria Classified by the Expert Panel | |
|---|---|
| 1 | Adhere to the OBO Foundry guidelines |
| 2 | Represent a unique non-overlapping knowledge domain (also known as orthogonality) |
| 3 | Willingness to express and integrate multiple, evidence-based classification systems in the chosen domain |
| 4 | Logically structured with a well-defined scope |
| 5 | May contain relationships and dependencies to other reference ontologies |
| 6 | Represent accurate science supported by evidence |
| 7 | Open source and Creative Commons CC-BY or CC-0 license ( |
| 8 | Must be widely used in annotation and data capture |
| 9 | Support both inter- and intra-specific needs with species agnostic (core) and specific (extensions) resources that work together |
| 10 | Sustainable funding sources |
| 11 | Human resources to manage (i.e., curators, editors, and developers) |
| 12 | Established ontology management system, including roles and responsibility |
| 13 | Must be designed to answer both the computing and community needs |
| 14 | Must explicitly identify the communities of reference |
| 15 | Centralized maintenance of the validated content, and distributed contribution and access |
| 16 | Ontology quality assurance by experts in the field of knowledge |
| 17 | Reducing reliance on internal processes and data stewardship networks |
Widely Used Ontologies in Agricultural Science
| Ontology | Domain and URL |
|---|---|
| Agronomy Ontology | Agronomic practices, agronomic techniques, and agronomic variables used in agronomic experiments |
| Crop Ontology | Species-specific phenotypic plant traits |
| Environment Ontology | Environmental features and habitats |
| Evidence & Conclusion Ontology | Evidence of scientific events |
| Gene Ontology | Molecular functions, biological processes, cellular components |
| NCBI Taxon Ontology | Organismal taxonomy of National Center for Biotechnology Information |
| Plant Ontology | Plant anatomy, morphology, and growth and development |
| Plant Experimental Conditions Ontology | Treatments and growth conditions used in plant science experiments |
| Plant Trait Ontology | Phenotypic traits in plants |
| Sequence Ontology | Features and attributes of biological sequence |
| Units of Measurement Ontology | Units of measurement |
Adapted from Refs.,
Figure 1Use of the CoP's Products and Tools for Data Annotation
Result of an Ontological Term Selection to Annotate Datasets about Submergence Tolerance of Rice Varieties for the Flood-Prone Lowlands in Nigeria
| Dataset Terms | Selected Ontology Terms | Definition | Source Ontologies | URI for Data Annotation | |
|---|---|---|---|---|---|
| Crop | Rice | (Rice), species, monocots | NCBI taxonomy | ||
| Genotype | Germplasm with the submergence tolerance “Sub1” gene | Response to flooding | Any process that results in a change in state or activity of a cell or an organism (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a stimulus indicating flooding, short-term immersion in water | Gene Ontology | |
| Phenotype | Submergence tolerance | Rice submergence tolerance trait | The ability of plants to survive a period of submergence | Crop Ontology (CO) | |
| Field practices | Manual weeding | Hand picking weeding process | A mechanical weeding process in which unwanted organisms are removed by hands | Agronomy Ontology | |
| Herbicide treatment | Chemical weeding process | A weeding process in which chemical is used to manage unwanted weeds | Agronomy Ontology | ||
| Weeding application date | Term not found | ||||
| Farming system | Rain-fed rice production system | Rain-fed farming | Arable cultivation relying solely on rainfall | AGROVOC | |
| Abiotic stress | Flood-prone region exposure | Flood-prone region exposure | A treatment in terms of a plant's exposure to the regional conditions found in the vicinity of the water bodies, such as sea, river, lake. Growth conditions may include aerobic to anaerobic soil, salinity or toxicity in tidal areas. Treatment may include standing or flash flooding | Plant Experimental Conditions Ontology (PECO) | |
| Geography | Bangladesh | Bangladesh | Gazetteer | ||
| Agro-ecosystem | Lowland region | Lowland | None | AGROVOC | |
| Socio-economy | Farmers' income | Household income | A demographic parameter indicating the amount of earnings made by a family | NCI thesaurus in Socio-economic ontology | |
| Fertilizer costs | Term not found | ||||
Annotation performed by Dr. Berta Miro, IRRI with the support of the CoP ontology experts.
CO term is mapped to a TO term so annotations using one or another are valid. CO will provide the format the variables measuring in the field the effect of the flood on the rice varieties.
PECO term is mapped to ENVO term “Floods (EO:0007172)” that has the definition: an unusual accumulation of water above the ground caused by high tide, heavy rain, melting snow, or rapid runoff from paved areas.
| Ontology | URL |
|---|---|
| Agronomy Ontology | |
| Crop Ontology | |
| Environment Ontology | |
| Plant Ontology | |
| Plant Experimental Conditions Ontology | |
| Plant Trait Ontology | |
| Plant Stress Ontology | |
| Planteome | |
| SEOnt |
| AgroFIMS | |
| AgroPortal | |
| BrAPI | |
| Crop Ontology website | |
| GARDIAN | |
| COPO | |
| MIAPPE | |
| Ontology Lookup Service |