| Literature DB >> 20738864 |
Jeremy Goecks1, Anton Nekrutenko, James Taylor.
Abstract
Increased reliance on computational approaches in the life sciences has revealed grave concerns about how accessible and reproducible computation-reliant results truly are. Galaxy http://usegalaxy.org, an open web-based platform for genomic research, addresses these problems. Galaxy automatically tracks and manages data provenance and provides support for capturing the context and intent of computational methods. Galaxy Pages are interactive, web-based documents that provide users with a medium to communicate a complete computational analysis.Entities:
Mesh:
Year: 2010 PMID: 20738864 PMCID: PMC2945788 DOI: 10.1186/gb-2010-11-8-r86
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Galaxy analysis workspace. The Galaxy analysis workspace is where users perform genomic analyses. The workspace has four areas: the navigation bar, tool panel (left column), detail panel (middle column), and history panel (right column). The navigation bar provides links to Galaxy's major components, including the analysis workspace, workflows, data libraries, and user repositories (histories, workflows, Pages). The tool panel lists the analysis tools and data sources available to the user. The detail panel displays interfaces for tools selected by the user. The history panel shows data and the results of analyses performed by the user, as well as automatically tracked metadata and user-generated annotations. Every action by the user generates a new history item, which can then be used in subsequent analyses, downloaded, or visualized. Galaxy's history panel helps to facilitate reproducibility by showing provenance of data and by enabling users to extract a workflow from a history, rerun analysis steps, visualize output datasets, tag datasets for searching and grouping, and annotate steps with information about their purpose or importance. Here, step 12 is being rerun.
Figure 2Galaxy workflow editor. Galaxy's workflow editor provides a graphical user interface for creating and modifying workflows. The editor has four areas: navigation bar, tool bar (left column), editor panel (middle column), and details panel. A user adds tools from the tool panel to the editor panel and configures each step in the workflow using the details panel. The details panel also enables a user to add tags to a workflow and annotate a workflow and workflow steps. Workflows are run in Galaxy's analysis workspace; like all tools executed in Galaxy, Galaxy automatically generates history items and provenance information for each tool executed via a workflow.
Figure 3Galaxy public repositories and published items. (a) Galaxy's public repository for Pages; there are also public repositories for histories and workflows. Repositories can be searched by name, annotation, owner, and community tags. (b) A published Galaxy workflow. Each shared or published item is displayed in a webpage with its metadata (for example, execution details, user annotations), a link for copying the item into a user's workspace, and links for viewing related items.
Figure 4Galaxy Pages. Galaxy Page that is an online, interactive supplement for a metagenomic study performed in Galaxy [21]. The Page communicates all facets of the experiment via increasing levels of detail, starting with supplementary text, two embedded histories, and an embedded workflow. Readers can open the embedded items and view details for each step, including provenance information, parameter settings, and annotations. For history steps, readers can view corresponding datasets (red arrow). Readers can also copy histories (green arrow) or the workflow (blue arrow) into their analysis workspace and both reproduce and extend the experiment's analyses without leaving Galaxy or their web browser.
Comparing Galaxy to other genomic workbenches
| Galaxy functionality | Description | GenePattern comparison | Mobyle comparison |
|---|---|---|---|
| Unified, web-based tool interface | All tool interface share same style and use web components; tool interfaces are generated from tool configuration file | Same functions as Galaxy | Same functions as Galaxy |
| Simple tool integration | Tool developers can integrate tools by writing a tool configuration file and including tool file in Galaxy configuration file | Similar but not as flexible tool configuration file; easy installation of selected tools via a web-based interface | Remote services can be added using a server configuration file |
| Integrated datasources | Transparent access to established data warehouses | No similar functions | No similar functions |
| Automatic metadata | Provenance, inputs, parameters, and outputs for each tool used; analysis steps grouped into histories | Same functions as Galaxy | Same functions as Galaxy |
| User tags | Can apply short tags to histories, datasets, workflows, and pages; tags are searchable and facilitate reuse | No similar functions | No similar functions |
| User annotations | Can add descriptions or notes to histories, datasets, workflows, workflow steps, and pages to aid in understanding analyses | Cannot annotate a history but can annotate a workflow (pipeline) with an external document | No similar functions |
| Creating and running workflows | Can create, either by example or from scratch, a workflow that can be repeatedly used to perform a multi-step analysis | Same functions as Galaxy, although editor is form-based rather than graphical | In development |
| Workflow metadata | Automatic documentation is generated when a workflow is run; users can also tag and annotate workflows and workflow steps | Same functions as Galaxy for generating automatic metadata; cannot annotate workflow steps | In development |
| Sharing model | Datasets, histories, workflows, and Pages can be shared at progressive levels and published to Galaxy's public repositories; datasets have more advanced sharing options, including groups | Can share analyses and workflows with individuals or groups | No similar functions |
| Item reuse, display framework and public repositories | Shared or published items displayed as webpages and can be imported and used immediately; public repositories can be searched; archives of analyses and workflows for sharing between servers are under development | Can create an archive of an analysis or workflow and share that with others; author information is included in archive | Can create an archive of an analysis and share that with others |
| Pages with embedded items | Can create custom webpages with embedded Galaxy items; each page can document a complete experiment, providing all details and supporting reuse of experiment's outputs | Microsoft Word plugin enables users to embed analyses and workflows in Word documents | No similar functions |
| Coupling between analysis workspace and publication workspace | Can import and immediately start using any shared, published, or embedded item without leaving web browser or Galaxy | Can run embedded analyses and save results in Microsoft Word documents | No similar functions |
A summary of Galaxy's functionality and how Galaxy's functionality compares to the functionality of two other genomic workbenches, GenePattern and Mobyle. Galaxy's novel functionality includes (but is not limited to) integrated datasources, user annotations, a graphical workflow editor, Pages with embedded items, and coupling the workspaces for analysis and publication using an open, web-based model.