| Literature DB >> 21156038 |
Angela C M Luyf1, Barbera D C van Schaik, Michel de Vries, Frank Baas, Antoine H C van Kampen, Silvia D Olabarriaga.
Abstract
BACKGROUND: Bioinformatics is confronted with a new data explosion due to the availability of high throughput DNA sequencers. Data storage and analysis becomes a problem on local servers, and therefore it is needed to switch to other IT infrastructures. Grid and workflow technology can help to handle the data more efficiently, as well as facilitate collaborations. However, interfaces to grids are often unfriendly to novice users.Entities:
Mesh:
Year: 2010 PMID: 21156038 PMCID: PMC3018473 DOI: 10.1186/1471-2105-11-598
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1People, resources and data flow in the platforms for DNA sequencing. Blue arrows: data flow on local infrastructure. Data is transferred from the SeqLab to the BioLab server and results are downloaded from the BioLab server. Red dotted arrows: The VBrowser is used for data transfer between servers and grid resources and for running workflows on the grid. Red dashed arrows: Data can be transferred directly from the SeqLab server to the grid.
Figure 2Screenshot of the VBrowser GUI. Screenshot of the VBrowser GUI showing the resource (left side) and the workflow (right side) windows. The resource window illustrates storage locations at the grid, remote servers and local computer (green ellipses). The color of the workflow components indicate the status of the jobs; purple components are completed, red are failed and in green still running.
Figure 3Data management of sequencing analysis. (*) indicates users action (1) create directory structure, (2) copy analysis data, (3) select workflow, (4) run workflow, (5) GASW description, (6a) submit job, (6b) monitor jobs, (7) start job, (8) download data/analysis tools, (9) upload results, (10) copy results.