| Literature DB >> 31913851 |
Mehmood Ghaffar1, Danuta Schüler1, Patrick König1, Daniel Arend1, Astrid Junker1, Uwe Scholz1, Matthias Lange1.
Abstract
Genetic variance within the genotype of population and its mapping to phenotype variance in a systematic and high throughput manner is of interest for biodiversity and breeding research. Beside the established and efficient high throughput genotype technologies, phenotype capabilities got increased focus in the last decade. This results in an increasing amount of phenotype data from well scaling, automated sensor platform. Thus, data stewardship is a central component to make experimental data from multiple domains interoperable and re-usable. To ensure a standard and comprehensive sharing of scientific and experimental data among domain experts, FAIR data principles are utilized for machine read-ability and scale-ability. In this context, BrAPI consortium, provides a comprehensive and commonly agreed FAIRed guidelines to offer a BrAPI layered scientific data in a RESTful manner. This paper presents the concepts, best practices and implementations to meet these challenges. As one of the worlds leading plant research institutes it is of vital interest for the IPK-Gatersleben to transform legacy data infrastructures into a bio-digital resource center for plant genetics resources (PGR). This paper also demonstrates the benefits of integrated database back-ends, established data stewardship processes, and FAIR data exposition in a machine-readable, highly scalable programmatic interfaces.Entities:
Keywords: FAIR data; genotyping; lab information management; phenotyping; plant digital resources
Year: 2020 PMID: 31913851 PMCID: PMC7074143 DOI: 10.1515/jib-2019-0060
Source DB: PubMed Journal: J Integr Bioinform ISSN: 1613-4516
Figure 1:Data processing in LIMS throughout from data-entry to data-dissemination.
Figure 2:Generic ER schema of the LIMS database backend.
Example of ambiguous attribute values in column “Condition”.
| Parameter | Condition | Methods | Result |
|---|---|---|---|
| Inflorescence Slices | First Blossom | Measurement | 60 |
| Inflorescence Slices | 50% of Plants | Measurement | 62 |
| Leaf width | Minimum | Measurement | −2 |
| Leaf width | Maximum | Measurement | −2 |
| Leaf width | Minimum | Measurement | −2 |
| Leaf width | Maximum | Measurement | −2 |
| Flowering Season | Minimum | Measurement | 86 |
| Flowering Season | Maximum | Measurement | 86 |
Non-homogeneous and incomplete material naming.
| Material | |||
|---|---|---|---|
| TrivialName | Species | Genus | SubTaxa |
| Morex_32 | vulgare | Hordeum | |
| Morex_42 | vulgare | Hordeum | |
| Morex_52 | vulgare | Hordeum | |
| Morex_11 | vulgare | Hordeum | |
| Morex | vulgare | Hordeum | |
| A thaliana Sap | |||
Examples for the evolution of API for FAIR-IPK implemented endpoints.
| endpoint | V1.2 | V1.3 |
|---|---|---|
| GET /calls | datatypes | dataTypes |
| GET /breedingmethods | name | breedingMethodName |
| GET /germplasm-search | germplasm-search | germplasm |
| genus | germplasmGenus | |
| species | germplasmSpecies | |
| GET /germplasm/{germplasmDbId}/pedigree | parent1DbId | parent1Id |
| parent2DbId | parent2Id |
The BrAPI specification has a release cycle of 6 month. Here we show the changed attributes from version 1.2 to version 1.3 only. For instance, germplasm-search is replaced with germplasm, without affecting the data. There are also changes for endpoints itself, which is not documented in this example. As illustrated here, it can be assumed that there is a notifiable effort for BrAPI-endpoint provider to provide up-to-date endpoint implementations.
Figure 3:Web interface for accessing digital PGR of IPK-Gatersleben.
Figure 4:Mapping of LIMS resources to BrAPI resources.
Figure 5:Example JSON output for the BrAPI call GET /germplasm-search – the first element comprise meta data, like status, linked data files and pagination information.
The result part comprises the array of query results for that particular page.
Figure 6:Flow of RESTful call in MVC-framework.
Figure 7:A Python Code Snippet to access RESTful endpoint Germplasm-search.
Figure 8:Accessing Germplasm resource REST endpoint using Linux cURL command line tool.
Figure 9:Performance test on large data-sets such as germplasm and observation units.