Literature DB >> 31077294

Tibanna: software for scalable execution of portable pipelines on the cloud.

Soohyun Lee1, Jeremy Johnson1, Carl Vitzthum1, Koray Kırlı1, Burak H Alver1, Peter J Park1.   

Abstract

SUMMARY: We introduce Tibanna, an open-source software tool for automated execution of bioinformatics pipelines on Amazon Web Services (AWS). Tibanna accepts reproducible and portable pipeline standards including Common Workflow Language (CWL), Workflow Description Language (WDL) and Docker. It adopts a strategy of isolation and optimization of individual executions, combined with a serverless scheduling approach. Pipelines are executed and monitored using local commands or the Python Application Programming Interface (API) and cloud configuration is automatically handled. Tibanna is well suited for projects with a range of computational requirements, including those with large and widely fluctuating loads. Notably, it has been used to process terabytes of data for the 4D Nucleome (4DN) Network.
AVAILABILITY AND IMPLEMENTATION: Source code is available on GitHub at https://github.com/4dn-dcic/tibanna. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2019        PMID: 31077294      PMCID: PMC6931271          DOI: 10.1093/bioinformatics/btz379

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  8 in total

1.  Galaxy: a platform for interactive large-scale genome analysis.

Authors:  Belinda Giardine; Cathy Riemer; Ross C Hardison; Richard Burhans; Laura Elnitski; Prachi Shah; Yi Zhang; Daniel Blankenberg; Istvan Albert; James Taylor; Webb Miller; W James Kent; Anton Nekrutenko
Journal:  Genome Res       Date:  2005-09-16       Impact factor: 9.043

2.  Nextflow enables reproducible computational workflows.

Authors:  Paolo Di Tommaso; Maria Chatzou; Evan W Floden; Pablo Prieto Barja; Emilio Palumbo; Cedric Notredame
Journal:  Nat Biotechnol       Date:  2017-04-11       Impact factor: 54.908

3.  Snakemake--a scalable bioinformatics workflow engine.

Authors:  Johannes Köster; Sven Rahmann
Journal:  Bioinformatics       Date:  2012-08-20       Impact factor: 6.937

4.  RABIX: AN OPEN-SOURCE WORKFLOW EXECUTOR SUPPORTING RECOMPUTABILITY AND INTEROPERABILITY OF WORKFLOW DESCRIPTIONS.

Authors:  Gaurav Kaushik; Sinisa Ivkovic; Janko Simonovic; Nebojsa Tijanic; Brandi Davis-Dusenbery; Deniz Kural
Journal:  Pac Symp Biocomput       Date:  2017

5.  Toil enables reproducible, open source, big biomedical data analyses.

Authors:  John Vivian; Arjun Arkal Rao; Frank Austin Nothaft; Christopher Ketchum; Joel Armstrong; Adam Novak; Jacob Pfeil; Jake Narkizian; Alden D Deran; Audrey Musselman-Brown; Hannes Schmidt; Peter Amstutz; Brian Craft; Mary Goldman; Kate Rosenbloom; Melissa Cline; Brian O'Connor; Megan Hanna; Chet Birger; W James Kent; David A Patterson; Anthony D Joseph; Jingchun Zhu; Sasha Zaranek; Gad Getz; David Haussler; Benedict Paten
Journal:  Nat Biotechnol       Date:  2017-04-11       Impact factor: 54.908

6.  The 4D nucleome project.

Authors:  Job Dekker; Andrew S Belmont; Mitchell Guttman; Victor O Leshyk; John T Lis; Stavros Lomvardas; Leonid A Mirny; Clodagh C O'Shea; Peter J Park; Bing Ren; Joan C Ritland Politz; Jay Shendure; Sheng Zhong
Journal:  Nature       Date:  2017-09-13       Impact factor: 49.962

7.  Singularity: Scientific containers for mobility of compute.

Authors:  Gregory M Kurtzer; Vanessa Sochat; Michael W Bauer
Journal:  PLoS One       Date:  2017-05-11       Impact factor: 3.240

8.  CWL-Airflow: a lightweight pipeline manager supporting Common Workflow Language.

Authors:  Michael Kotliar; Andrey V Kartashov; Artem Barski
Journal:  Gigascience       Date:  2019-07-01       Impact factor: 6.524

  8 in total
  3 in total

1.  CloudASM: an ultra-efficient cloud-based pipeline for mapping allele-specific DNA methylation.

Authors:  Emmanuel L P Dumont; Benjamin Tycko; Catherine Do
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

2.  The 4D Nucleome Data Portal as a resource for searching and visualizing curated nucleomics data.

Authors:  Sarah B Reiff; Andrew J Schroeder; Koray Kırlı; Andrea Cosolo; Clara Bakker; Soohyun Lee; Alexander D Veit; Alexander K Balashov; Carl Vitzthum; William Ronchetti; Kent M Pitman; Jeremy Johnson; Shannon R Ehmsen; Peter Kerpedjiev; Nezar Abdennur; Maxim Imakaev; Serkan Utku Öztürk; Uğur Çamoğlu; Leonid A Mirny; Nils Gehlenborg; Burak H Alver; Peter J Park
Journal:  Nat Commun       Date:  2022-05-02       Impact factor: 17.694

3.  Sustainable data analysis with Snakemake.

Authors:  Felix Mölder; Kim Philipp Jablonski; Brice Letcher; Michael B Hall; Christopher H Tomkins-Tinch; Vanessa Sochat; Jan Forster; Soohyun Lee; Sven O Twardziok; Alexander Kanitz; Andreas Wilm; Manuel Holtgrewe; Sven Rahmann; Sven Nahnsen; Johannes Köster
Journal:  F1000Res       Date:  2021-01-18
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.