| Literature DB >> 22779037 |
Terry Camerlengo1, Hatice Gulcin Ozer, Raghuram Onti-Srinivasan, Pearlly Yan, Tim Huang, Jeffrey Parvin, Kun Huang.
Abstract
Next Generation Sequencing is highly resource intensive. NGS Tasks related to data processing, management and analysis require high-end computing servers or even clusters. Additionally, processing NGS experiments requires suitable storage space and significant manual interaction. At The Ohio State University's Biomedical Informatics Shared Resource, we designed and implemented a scalable architecture to address the challenges associated with the resource intensive nature of NGS secondary analysis built around Illumina Genome Analyzer II sequencers and Illumina's Gerald data processing pipeline. The software infrastructure includes a distributed computing platform consisting of a LIMS called QUEST (http://bisr.osumc.edu), an Automation Server, a computer cluster for processing NGS pipelines, and a network attached storage device expandable up to 40TB. The system has been architected to scale to multiple sequencers without requiring additional computing or labor resources. This platform provides demonstrates how to manage and automate NGS experiments in an institutional or core facility setting.Entities:
Year: 2012 PMID: 22779037 PMCID: PMC3392054
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 2:The NGS data processing and automation pipeline.
Figure 3Execution of the Configuration file.
Figure 1
Figure 4Main page for listing the studies.
Figure 5NGS/GAII run history and comments/analyzing instructions.
Figure 6Flow cell properties header.
Figure 7An example of lane properties panel for a flowcell (showing first 4 lanes).
Figure 8Configuration Manager.