| Literature DB >> 27386258 |
Serge Casaregola1, Alexander Vasilenko2, Paolo Romano3, Vincent Robert4, Svetlana Ozerskaya2, Anna Kopf5, Frank O Glöckner5, David Smith6.
Abstract
Culture collections contain indispensable information about the microorganisms preserved in their repositories, such as taxonomical descriptions, origins, physiological and biochemical characteristics, bibliographic references, etc. However, information currently accessible in databases rarely adheres to common standard protocols. The resultant heterogeneity between culture collections, in terms of both content and format, notably hampers microorganism-based research and development (R&D). The optimized exploitation of these resources thus requires standardized, and simplified, access to the associated information. To this end, and in the interest of supporting R&D in the fields of agriculture, health and biotechnology, a pan-European distributed research infrastructure, MIRRI, including over 40 public culture collections and research institutes from 19 European countries, was established. A prime objective of MIRRI is to unite and provide universal access to the fragmented, and untapped, resources, information and expertise available in European public collections of microorganisms; a key component of which is to develop a dynamic Information System. For the first time, both culture collection curators as well as their users have been consulted and their feedback, concerning the needs and requirements for collection databases and data accessibility, utilised. Users primarily noted that databases were not interoperable, thus rendering a global search of multiple databases impossible. Unreliable or out-of-date and, in particular, non-homogenous, taxonomic information was also considered to be a major obstacle to searching microbial data efficiently. Moreover, complex searches are rarely possible in online databases thus limiting the extent of search queries. Curators also consider that overall harmonization-including Standard Operating Procedures, data structure, and software tools-is necessary to facilitate their work and to make high-quality data easily accessible to their users. Clearly, the needs of culture collection curators coincide with those of users on the crucial point of database interoperability. In this regard, and in order to design an appropriate Information System, important aspects on which the culture collection community should focus include: the interoperability of data sets with the ontologies to be used; setting best practice in data management, and the definition of an appropriate data standard.Entities:
Keywords: Culture collection; Information Technology; Microbial domain Biological Resource Centre; Microorganism
Year: 2016 PMID: 27386258 PMCID: PMC4912550 DOI: 10.1186/s40064-016-2450-8
Source DB: PubMed Journal: Springerplus ISSN: 2193-1801
List of acronyms
| BRC: Biological Resource Centre |
| CC: Culture Collection |
| CABRI: Common Access to Biological Resources and Information |
| ECCO: European Culture Collections’ Organization |
| ELIXIR: European Life Science Infrastructure for Biological Information |
| EMbaRC: European Consortium of Microbial Resource Centres |
| EnvO: Environmental Ontology |
| ESFRI: European Strategy Forum on Research Infrastructures |
| GBIF: Global Biodiversity Information Facility |
| GBRCN: Global Biological Resource Centre Network |
| GCM: Global Catalogue of Microorganisms |
| IT: Information Technology |
| MEO: Metagenome and Microbes Environmental Ontology |
| MINE: Microbial Information Network Europe |
| MIRRI: Microbial Resource Research Infrastructure |
| NCBO: National Center for Biomedical Ontology |
| OBO: Open Biological and Biomedical Ontologies Foundry |
| OECD: Organization for Economic Co-operation and Development |
| R&D: Research and Development |
| SOP: Standard Operating Procedures |
| TDWG: Taxonomic Databases Working Group |
| WDCM: World Data Centre for Microorganisms |
| WFCC: World Federation of Culture Collections |
Fig. 1The “Extended Query Form” interface of CABRI. The SRS standard interface allows multiple query conditions to be set combined with boolean operators for each indexed data field. The query form for CABRI bacteria catalogues is shown. A possible response to a query would then return, e.g., all strains of the genus Phyllobacterium (field Name), isolated from Ardisia crispa (field Isolated_from) in Germany (field Geographic_origin)
Fig. 2The “Advanced search” interface of StrainInfo. The query form for all the data in StrainInfo is shown. A possible response to a query would then return, e.g., all strains of the genus Debaryomyces (field Taxonomic name), isolated from dairy products and their environment (field full text)
Definitions of “strain” in BioPortal ontologies referring to the microbiological concept
| Ontology | Code | Definition |
|---|---|---|
| Experimental Factor Ontology (EFO) | EFO_0005135 | A population of organisms that is genetically different from others of the same species and possessing a set of defined characteristics |
| Semanticscience Integrated Ontology (SIO) | SIO_010055 | A strain is a genetic variant or kind of microorganism |
| Microbial Culture Collection Vocabulary (MCVV) | MCCV_000002 | Straina |
aMCVV is a very limited ontology, including 16 concepts only
Fig. 3The “Advanced search” interface of WDCM with the request Isolation Source. The result of the request for Isolation Source for the entire content of the GCM (http://gcm.wfcc.info/strains.jsp) 02/02/2015
Fig. 4Schematic representation of the main tasks of the construction of the MIRRI-IS. For simplicity’s sake, the hypothesis of an inter-operable interface linking all individual CC databases was chosen (see Proposed solutions for increased interoperability between the existing databases section)