Literature DB >> 31882996

Bioinformatics pipeline using JUDI: Just Do It!

Soumitra Pal1, Teresa M Przytycka1.   

Abstract

SUMMARY: Large-scale data analysis in bioinformatics requires pipelined execution of multiple software. Generally each stage in a pipeline takes considerable computing resources and several workflow management systems (WMS), e.g. Snakemake, Nextflow, Common Workflow Language, Galaxy, etc. have been developed to ensure optimum execution of the stages across two invocations of the pipeline. However, when the pipeline needs to be executed with different settings of parameters, e.g. thresholds, underlying algorithms, etc. these WMS require significant scripting to ensure an optimal execution. We developed JUDI on top of DoIt, a Python based WMS, to systematically handle parameter settings based on the principles of database management systems. Using a novel modular approach that encapsulates a parameter database in each task and file associated with a pipeline stage, JUDI simplifies plug-and-play of the pipeline stages. For a typical pipeline with n parameters, JUDI reduces the number of lines of scripting required by a factor of O(n). With properly designed parameter databases, JUDI not only enables reproducing research under published values of parameters but also facilitates exploring newer results under novel parameter settings.
AVAILABILITY AND IMPLEMENTATION: https://github.com/ncbi/JUDI. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2019. This work is written by US Government employees and is in the public domain in the US.

Mesh:

Year:  2020        PMID: 31882996     DOI: 10.1093/bioinformatics/btz956

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  1 in total

1.  Sustainable data analysis with Snakemake.

Authors:  Felix Mölder; Kim Philipp Jablonski; Brice Letcher; Michael B Hall; Christopher H Tomkins-Tinch; Vanessa Sochat; Jan Forster; Soohyun Lee; Sven O Twardziok; Alexander Kanitz; Andreas Wilm; Manuel Holtgrewe; Sven Rahmann; Sven Nahnsen; Johannes Köster
Journal:  F1000Res       Date:  2021-01-18
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.