| Literature DB >> 29953862 |
Björn Grüning1, John Chilton2, Johannes Köster3, Ryan Dale4, Nicola Soranzo5, Marius van den Beek6, Jeremy Goecks7, Rolf Backofen8, Anton Nekrutenko9, James Taylor10.
Abstract
Many areas of research suffer from poor reproducibility, particularly in computationally intensive domains where results rely on a series of complex methodological decisions that are not well captured by traditional publication approaches. Various guidelines have emerged for achieving reproducibility, but implementation of these practices remains difficult due to the challenge of assembling software tools plus associated libraries, connecting tools together into pipelines, and specifying parameters. Here, we discuss a suite of cutting-edge technologies that make computational reproducibility not just possible, but practical in both time and effort. This suite combines three well-tested components-a system for building highly portable packages of bioinformatics software, containerization and virtualization technologies for isolating reusable execution environments for these packages, and workflow systems that automatically orchestrate the composition of these packages for entire pipelines-to achieve an unprecedented level of computational reproducibility. We also provide a practical implementation and five recommendations to help set a typical researcher on the path to performing data analyses reproducibly.Entities:
Mesh:
Year: 2018 PMID: 29953862 PMCID: PMC6263957 DOI: 10.1016/j.cels.2018.03.014
Source DB: PubMed Journal: Cell Syst ISSN: 2405-4712 Impact factor: 10.304