| Literature DB >> 20630057 |
David B Burdick1, Chris C Cavnor, Jeremy Handcock, Sarah Killcoyne, Jake Lin, Bruz Marzolf, Stephen A Ramsey, Hector Rovira, Ryan Bressler, Ilya Shmulevich, John Boyle.
Abstract
BACKGROUND: High throughput sequencing has become an increasingly important tool for biological research. However, the existing software systems for managing and processing these data have not provided the flexible infrastructure that research requires.Entities:
Mesh:
Year: 2010 PMID: 20630057 PMCID: PMC2916924 DOI: 10.1186/1471-2105-11-377
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Architecture of the default SeqAdapt system. The default SeqAdapt system consist of a number of services and analysis programs that can be replaced to allow for a high level of customization. Default sample tracking, data management, and analysis components are available. These components can be replaced with other systems to suit individual needs. All interfaces between the components are standardized using REST to enable interoperability. The default system uses SLIMseq for initially entering of sample/experiment annotation information, and publishes this information using JSON (over REST). An Addama service accesses the SLIMseq web service and stores the information in a JCR repository. When runs have been completed SLIMseq is updated and this information is pulled into Addama. To run an analysis the Addama Robot system is used, this system allows for any command line utility to be prepared, triggered and monitored. When an analysis is complete the results are automatically pulled into an Addama repository. Customized applications can also be written against the Addama web services.
Figure 2Integration of analysis tools. An automated system is used to integrate new analysis tools with Addama. The robot system is responsible for delivering the correct inputs to an analysis script, monitoring the process while it runs, and then for publishing the results of the analysis back. Web applications can then be built on top of standard Addama APIs for providing customizable input information for specific scripts, and then for visualizing the results of the analyses. The loose coupling of each of the web applications from the underlying analysis script makes the system robust to change and (relatively) easy to maintain - especially when the scripts are under constant revision.
Figure 3Step 1: Sample Entry. Sample information is entered into the custom ordering system SLIMseq.
Figure 4Step 2: Browsing. A repository stores all the annotations and sample data files so they can be searched and browsed.
Figure 5Step 3: Analysis. Data stored within the repository can be analyzed with any integrated analysis. In this case MACS can be triggered and the results automatically stored back in Addama.
Figure 6Step 4: Browsing. The results of an analysis are stored in Addama and can be integrated with other data files and viewed in applications such as IGV.