Literature DB >> 26508761

openBIS ELN-LIMS: an open-source database for academic laboratories.

Caterina Barillari1, Diana S M Ottoz2, Juan Mariano Fuentes-Serna1, Chandrasekhar Ramakrishnan1, Bernd Rinn1, Fabian Rudolf3.   

Abstract

UNLABELLED: The open-source platform openBIS (open Biology Information System) offers an Electronic Laboratory Notebook and a Laboratory Information Management System (ELN-LIMS) solution suitable for the academic life science laboratories. openBIS ELN-LIMS allows researchers to efficiently document their work, to describe materials and methods and to collect raw and analyzed data. The system comes with a user-friendly web interface where data can be added, edited, browsed and searched.
AVAILABILITY AND IMPLEMENTATION: The openBIS software, a user guide and a demo instance are available at https://openbis-eln-lims.ethz.ch. The demo instance contains some data from our laboratory as an example to demonstrate the possibilities of the ELN-LIMS (Ottoz et al., 2014). For rapid local testing, a VirtualBox image of the ELN-LIMS is also available.
© The Author 2015. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2015        PMID: 26508761      PMCID: PMC4743625          DOI: 10.1093/bioinformatics/btv606

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Scientific recording is an essential part of research, as progress in science depends among other factors on existing knowledge and reproducibility. Recording is the process by which the scientific question, the choice of an experimental procedure, materials and methods, data analysis and interpretation of the results are gathered. Traditionally, all these details were kept in paper notebooks. In today’s academic life science laboratories, most raw data, analyzed data and scripts are stored electronically, while the experimental details are often recorded in paper notebooks. This makes linking and retrieving experimental details and results problematic. Moreover, the large amount of data produced by today’s readout machines adds a level of complexity as the devices the data are stored on might soon be outdated. Databases represent a solution to store all the necessary information together (Bird ) because (i) they ensure long-lasting data storage by being independent of the operating environment; (ii) they are installed on a dedicated server, enabling long-term data storage and regular backups; (iii) they allow storage of large amounts of data; (iv) they are easily accessible via a local area network or the Internet; (v) custom applications can be easily added. Several commercial database solutions for ELNs are available, but they are not commonly adopted by academia. An internal evaluation among the research groups of the departments of Biosystems Science and Engineering and Biology of ETH Zurich found that the options affordable to academic labs had either unsuitable user interfaces (UI) or lack of features. The combination of an easy-to-use UI, flexible adaptation to a labs needs, support of an audit trail, physical support of the data server, support for managing large amounts of data and for integrating new measurement devices was only available at very high costs. On the other hand, the available open-source platforms lack many of the above features and, as they often originate from a single person within a research group, they also lack maintenance and support during the required data management life cycle. Here, we describe the open-source database openBIS (open Biology Information System) Electronic Laboratory Notebook and a Laboratory Information Management System (ELN-LIMS), a software specifically developed with academic life science labs in mind. The structure of the database ensures that the results are represented in a logical and meaningful fashion and it guarantees that the users can find, understand and reuse the data. openBIS ELN-LIMS consists of two interconnected parts: the LIMS, where the information about materials and methods is stored and kept up to date and the ELN, where data obtained from experiments are uploaded and annotated. The system is based on the openBIS platform (Bauch ), which, since 2007, is actively developed and maintained by a dedicated team of software engineers at the ETH and is productively used in several academic institutions and a few private companies.

2 Results

2.1 System’s overview

Our goal was to implement a database to support the research activity of a typical biology laboratory. We conceived it as a tool to assist everyday activities: storage of materials, methods set-up, design and description of experiments, data labeling, storage and organization of results. To this purpose, we based our system on the platform openBIS (Bauch ), as it has the necessary features described above, is field-tested since about 7 years and known to work well also with hundreds of terabytes of data. The openBIS core has a predefined hierarchical structure (Fig. 1). The top level, called Data Space, controls access and rights and contains one or more Projects, which in turn contain one or more Experiments. Samples are the basic unit of openBIS and are organized in the Experiments. We adapted this hierarchical structure to describe the organization and the objects of a typical life science laboratory. We defined the type and the number of Experiments and Samples and assigned customized properties to describe metadata. External files, in any kind of format, are uploaded as Datasets to each Sample. Each Sample can have any number of Datasets. We defined different types of Datasets and defined appropriate sets of properties (or annotations).
Fig. 1.

Hierarchical structure of openBIS. The levels are indicated with different grey shades

Hierarchical structure of openBIS. The levels are indicated with different grey shades We used parent–child relationships linking Samples or Datasets to describe the logical or physical connections existing between the corresponding objects in the real world. We introduced the possibility to describe the nature of these relationships by annotating the links using texts or keywords. To ensure that the creation, modification or deletion of any electronic records is traceable, computer-generated, time-stamped audit trails are used to record the date and time of the user’s intervention. In addition, Datasets are immutable, and as such all modifications made to the original data must be uploaded in separate Datasets using version numbers. Last, access to the system is restricted to authorized users, who can have different roles (Bauch ). All these features are in accordance to the FDA regulations for electronic records (FDA, 2003).

2.2 LIMS

The LIMS consists of two Data Spaces, the Materials and the Methods, accessible and editable by all group members (Fig. 2 , left side).
Fig. 2.

Overview of a typical openBIS ELN-LIMS platform. The LIMS consists of the Materials and Methods Data Spaces, where the Samples are organized in Projects and Experiments. The ELN can have several Data Spaces assigned to each researcher in the laboratory. Researchers can organize their own Data Space by creating Projects, Experiments and Samples. Each Sample has a unique identifier. For simplicity, Datasets are not illustrated here

Overview of a typical openBIS ELN-LIMS platform. The LIMS consists of the Materials and Methods Data Spaces, where the Samples are organized in Projects and Experiments. The ELN can have several Data Spaces assigned to each researcher in the laboratory. Researchers can organize their own Data Space by creating Projects, Experiments and Samples. Each Sample has a unique identifier. For simplicity, Datasets are not illustrated here The organization of the materials of each laboratory can be mapped to the structure of openBIS. Materials include reagents, plasmids, cell lines, etc. Each of them is a Sample type described by properties. For example, we have two plasmid collections: yeast and mammalian. In the Materials Data Space, we create a Project called Plasmids containing two Experiments: Yeast plasmids and Mammalian plasmids. We upload Samples called Plasmids, having properties like plasmid name, backbone, marker, etc. We upload text files containing the DNA sequences to the corresponding plasmid entry as Datasets and integrate Plasmapper to visualize them as a plasmid map (Dong ). A method consists of a series of steps to accomplish a task, using several materials. Each method is described as a Sample type with specific properties. For example, we create a Sample type called western blotting with properties like western blotting name, membrane, etc. We link western blotting Samples to material Samples, such as Buffers, Chemicals and Antibodies and annotate the links with the quantities needed for the method (Fig. 3 ). For instance, we assign anti-β-Actin and Nitrocellulose to properties western blotting name and membrane in western blotting 1. Then, we add 1 μg/ml of Antibody A and 1% of Chemical B in 1 × Buffer C. In the database, Antibody A, Chemical B and Buffer C are added as parents of western blotting 1 and are annotated with 1 μg/ml, 1% and 1×, respectively. Methods belonging to the same category are grouped in one Experiment. For example, we sort the western blotting and the PCR Samples into the western blotting Experiment and the PCR Experiment.
Fig. 3.

Samples can be linked in parent–child relationships to illustrate the logical relationships existing between objects contained in the database. A link annotation specifies the parent–child relationship between Samples. In the LIMS section, annotations are mainly quantities, while, in the ELN section, annotations are mainly descriptions or comments

Samples can be linked in parent–child relationships to illustrate the logical relationships existing between objects contained in the database. A link annotation specifies the parent–child relationship between Samples. In the LIMS section, annotations are mainly quantities, while, in the ELN section, annotations are mainly descriptions or comments

2.3 ELN

In the ELN, experimental data are stored and described in a scientifically meaningful manner, in terms of goal, method and interpretation of the results. Each user has a personal Data Space where he/she can create and edit Projects, Experiments and Samples (Fig. 2, right side). An Experiment is a specific scientific question and contains Samples that are individual attempts to answer it. For example, to monitor the expression of gene x, we performed real-time quantitative PCR (RT-qPCR) and western blotting. In the database, we create an Experiment called Gene × expression and two Samples describing the results obtained by RT-qPCR and western blotting. All materials and methods used for each Sample are conveniently linked and can be annotated with specific details (e.g. deviation from the protocol). This ensures a complete description of the experiments and easy back-tracking (Fig. 3). The results of the RT-qPCR and the western blot can then be uploaded as Datasets to the corresponding Samples. Importantly, the information about configurations or set-up of the acquisition machines are stored in the Datasets as their settings are specific to this result. In our example, the raw data, the analyzed data and the analysis scripts are organized as separate Datasets.

2.4 Interaction with the system

An essential aspect of a database is its ease-of-use. We implemented an intuitive and easy user interface (ELN-UI) for the structure described above. This interface can be used on desktops and other devices, such as tablets. The ELN-UI main menu accesses the ELN, the LIMS, the Utilities (containing a storage manager), a generic search function and a BLAST search function (Altschul ). Samples of the same type are listed in tables whose columns can be filtered with keywords. Single Samples can be selected to be visualized or printed separately. Connections between Samples can be inspected and browsed via a tree-like visualization. We enter new Samples by filling in a form in the ELN-UI or via batch upload of text files. Data are uploaded as Datasets, either via the ELN-UI or via “dropbox scripts”. These are written in Java or Jython and automatically register data and metadata, which can be directly imported from a data acquisition machine, like a microscope. Export of metadata and data is done with an export function from the ELN-UI or using dedicated scripts. To set up the system, an administrator has to operate the previously described standard openBIS interface (Bauch ). This interface is needed for configuration of the underlying database, as Data Spaces, Experiment, Sample and Dataset types and properties are created and maintained from there. However, normal users of the ELN-LIMS do not need to familiarize themselves with this interface as the ELN-UI possesses all necessary possibilities for everyday lab work and data retrieval.

3 Conclusion

We described an efficient ELN-LIMS based on the open source platform openBIS. This platform is pre-set with sensible defaults and ready for use. It can be customized and extended, as shown by the integration of PlasMapper and BLAST. Further extensions could be the integration of a function generating reports or the seamless integration of previously described openBIS extensions (Bauch ). Initial use of a database requires a time investment to organize the existing data, a learning phase and some security precautions. Nevertheless, we find that openBIS ELN-LIMS is of great help and worth the effort. Having all data stored in a single place and easily accessible sped up work and facilitated data retrieval in our laboratory.
  5 in total

1.  PlasMapper: a web server for drawing and auto-annotating plasmid maps.

Authors:  Xiaoli Dong; Paul Stothard; Ian J Forsythe; David S Wishart
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  Laboratory notebooks in the digital era: the role of ELNs in record keeping for chemistry and other sciences.

Authors:  Colin L Bird; Cerys Willoughby; Jeremy G Frey
Journal:  Chem Soc Rev       Date:  2013-10-21       Impact factor: 54.564

4.  Inducible, tightly regulated and growth condition-independent transcription factor in Saccharomyces cerevisiae.

Authors:  Diana S M Ottoz; Fabian Rudolf; Joerg Stelling
Journal:  Nucleic Acids Res       Date:  2014-07-17       Impact factor: 16.971

5.  openBIS: a flexible framework for managing and analyzing complex data in biology research.

Authors:  Angela Bauch; Izabela Adamczyk; Piotr Buczek; Franz-Josef Elmer; Kaloyan Enimanev; Pawel Glyzewski; Manuel Kohler; Tomasz Pylak; Andreas Quandt; Chandrasekhar Ramakrishnan; Christian Beisel; Lars Malmström; Ruedi Aebersold; Bernd Rinn
Journal:  BMC Bioinformatics       Date:  2011-12-08       Impact factor: 3.307

  5 in total
  7 in total

1.  Proteomics INTegrator (PINT): An Online Tool To Store, Query, and Visualize Large Proteomics Experiment Results.

Authors:  Salvador Martínez-Bartolomé; Tom Casimir Bamberger; Mathieu Lavallée-Adam; Daniel B McClatchy; John R Yates
Journal:  J Proteome Res       Date:  2019-07-01       Impact factor: 4.466

2.  Neurophysiological analytics for all! Free open-source software tools for documenting, analyzing, visualizing, and sharing using electronic notebooks.

Authors:  David M Rosenberg; Charles C Horn
Journal:  J Neurophysiol       Date:  2016-04-20       Impact factor: 2.714

Review 3.  Making the collective knowledge of chemistry open and machine actionable.

Authors:  Kevin Maik Jablonka; Luc Patiny; Berend Smit
Journal:  Nat Chem       Date:  2022-04-04       Impact factor: 24.427

4.  Chemotion ELN: an Open Source electronic lab notebook for chemists in academia.

Authors:  Pierre Tremouilhac; An Nguyen; Yu-Chieh Huang; Serhii Kotov; Dominic Sebastian Lütjohann; Florian Hübsch; Nicole Jung; Stefan Bräse
Journal:  J Cheminform       Date:  2017-09-25       Impact factor: 5.514

5.  A guide to open science practices for animal research.

Authors:  Kai Diederich; Kathrin Schmitt; Philipp Schwedhelm; Bettina Bert; Céline Heinl
Journal:  PLoS Biol       Date:  2022-09-15       Impact factor: 9.593

6.  Implementation and use of cloud-based electronic lab notebook in a bioprocess engineering teaching laboratory.

Authors:  Erin M Riley; Holly Z Hattaway; P Arthur Felse
Journal:  J Biol Eng       Date:  2017-11-24       Impact factor: 4.355

7.  SwissPKcdw - A clinical data warehouse for the optimization of pediatric dosing regimens.

Authors:  Roland Goers; Diana Coman Schmid; Vera F Jäggi; Paolo Paioni; Michal J Okoniewski; Althea Parker; Beat Bangerter; Sofia Georgakopoulou; Thierry Sengstag; Julia Bielicki; Romy Tilen; Swen Vermeul; Stefanie D Krämer; Christoph Berger; Bernd Rinn; Henriette E Meyer Zu Schwabedissen
Journal:  CPT Pharmacometrics Syst Pharmacol       Date:  2021-11-02
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.