Burkhard Linke¹, Robert Giegerich, Alexander Goesmann. ¹Bioinformatics Resource Facility, Center for Biotechnology and Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany. blinke@ceBiTec.Uni-Bielefeld.De
Abstract
MOTIVATION: The rapidly increasing amounts of data available from new high-throughput methods have made data processing without automated pipelines infeasible. As several publications have pointed out, integrating data and analytic resources into workflow systems solves this problem and simplifies data analysis. Various applications for defining and running workflows in the field of bioinformatics have been proposed and published, e.g. Galaxy, Mobyle, Taverna, Pegasus or Kepler. One of the main aims of such workflow systems is to enable scientists to focus on analysing their datasets instead of taking care of data management, job management or monitoring the execution of computational tasks. The currently available workflow systems achieve this goal, but differ fundamentally in how they execute workflows. RESULTS: We have developed the Conveyor software library, a multitiered generic workflow engine for composition, execution and monitoring of complex workflows. It features an open, extensible system architecture and concurrent program execution to exploit the resources available on modern multicore CPU hardware. It offers the ability to build complex workflows with branches, loops and other control structures. Two example use cases illustrate the application of the versatile Conveyor engine to common bioinformatics problems. AVAILABILITY: The Conveyor application, including client and server, is available at http://conveyor.cebitec.uni-bielefeld.de.