Literature DB >> 29092073

Reproducible Bioconductor workflows using browser-based interactive notebooks and containers.

Reem Almugbel1, Ling-Hong Hung1, Jiaming Hu1, Abeer Almutairy1, Nicole Ortogero2, Yashaswi Tamta1, Ka Yee Yeung1.   

Abstract

Objective: Bioinformatics publications typically include complex software workflows that are difficult to describe in a manuscript. We describe and demonstrate the use of interactive software notebooks to document and distribute bioinformatics research. We provide a user-friendly tool, BiocImageBuilder, that allows users to easily distribute their bioinformatics protocols through interactive notebooks uploaded to either a GitHub repository or a private server. Materials and methods: We present four different interactive Jupyter notebooks using R and Bioconductor workflows to infer differential gene expression, analyze cross-platform datasets, process RNA-seq data and KinomeScan data. These interactive notebooks are available on GitHub. The analytical results can be viewed in a browser. Most importantly, the software contents can be executed and modified. This is accomplished using Binder, which runs the notebook inside software containers, thus avoiding the need to install any software and ensuring reproducibility. All the notebooks were produced using custom files generated by BiocImageBuilder.
Results: BiocImageBuilder facilitates the publication of workflows with a point-and-click user interface. We demonstrate that interactive notebooks can be used to disseminate a wide range of bioinformatics analyses. The use of software containers to mirror the original software environment ensures reproducibility of results. Parameters and code can be dynamically modified, allowing for robust verification of published results and encouraging rapid adoption of new methods.
Conclusion: Given the increasing complexity of bioinformatics workflows, we anticipate that these interactive software notebooks will become as necessary for documenting software methods as traditional laboratory notebooks have been for documenting bench protocols, and as ubiquitous.
© The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Keywords:  automated; bioconductor workflows; containers; data science; reproducibility

Mesh:

Year:  2018        PMID: 29092073      PMCID: PMC6381817          DOI: 10.1093/jamia/ocx120

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  10 in total

1.  Biomedical informatics and data science: evolving fields with significant overlap.

Authors:  Patricia Flatley Brennan; Michael F Chiang; Lucila Ohno-Machado
Journal:  J Am Med Inform Assoc       Date:  2018-01-01       Impact factor: 4.497

2.  Building Containerized Workflows Using the BioDepot-Workflow-Builder.

Authors:  Ling-Hong Hung; Jiaming Hu; Trevor Meiss; Alyssa Ingersoll; Wes Lloyd; Daniel Kristiyanto; Yuguang Xiong; Eric Sobie; Ka Yee Yeung
Journal:  Cell Syst       Date:  2019-09-11       Impact factor: 10.304

3.  Does health informatics have a replication crisis?

Authors:  Enrico Coiera; Elske Ammenwerth; Andrew Georgiou; Farah Magrabi
Journal:  J Am Med Inform Assoc       Date:  2018-08-01       Impact factor: 4.497

4.  NanoDJ: a Dockerized Jupyter notebook for interactive Oxford Nanopore MinION sequence manipulation and genome assembly.

Authors:  Héctor Rodríguez-Pérez; Tamara Hernández-Beeftink; José M Lorenzo-Salazar; José L Roda-García; Carlos J Pérez-González; Marcos Colebrook; Carlos Flores
Journal:  BMC Bioinformatics       Date:  2019-05-09       Impact factor: 3.169

5.  Implementing the FAIR Data Principles in precision oncology: review of supporting initiatives.

Authors:  Charles Vesteghem; Rasmus Froberg Brøndum; Mads Sønderkær; Mia Sommer; Alexander Schmitz; Julie Støve Bødker; Karen Dybkær; Tarec Christoffer El-Galaly; Martin Bøgsted
Journal:  Brief Bioinform       Date:  2020-05-21       Impact factor: 11.622

6.  Short-Chain Fatty Acid-Producing Gut Microbiota Is Decreased in Parkinson's Disease but Not in Rapid-Eye-Movement Sleep Behavior Disorder.

Authors:  Hiroshi Nishiwaki; Tomonari Hamaguchi; Mikako Ito; Tomohiro Ishida; Tetsuya Maeda; Kenichi Kashihara; Yoshio Tsuboi; Jun Ueyama; Teppei Shimamura; Hiroshi Mori; Ken Kurokawa; Masahisa Katsuno; Masaaki Hirayama; Kinji Ohno
Journal:  mSystems       Date:  2020-12-08       Impact factor: 6.496

7.  ML-MEDIC: A Preliminary Study of an Interactive Visual Analysis Tool Facilitating Clinical Applications of Machine Learning for Precision Medicine.

Authors:  Laura Stevens; David Kao; Jennifer Hall; Carsten Görg; Kaitlyn Abdo; Erik Linstead
Journal:  Appl Sci (Basel)       Date:  2020-05-09       Impact factor: 2.679

Review 8.  Streamlining statistical reproducibility: NHLBI ORCHID clinical trial results reproduction.

Authors:  Arnaud Serret-Larmande; Jonathan R Kaltman; Paul Avillach
Journal:  JAMIA Open       Date:  2022-01-14

9.  Reproducible bioinformatics project: a community for reproducible bioinformatics analysis pipelines.

Authors:  Neha Kulkarni; Luca Alessandrì; Riccardo Panero; Maddalena Arigoni; Martina Olivero; Giulio Ferrero; Francesca Cordero; Marco Beccuti; Raffaele A Calogero
Journal:  BMC Bioinformatics       Date:  2018-10-15       Impact factor: 3.169

10.  Vertical and horizontal integration of multi-omics data with miodin.

Authors:  Benjamin Ulfenborg
Journal:  BMC Bioinformatics       Date:  2019-12-10       Impact factor: 3.169

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.