| Literature DB >> 32224841 |
Gunhwan Ko1, Pan-Gyu Kim1, Youngbum Cho2, Seongmun Jeong2, Jae-Yoon Kim2, Kyoung Hyoun Kim2, Ho-Yeon Lee2, Jiyeon Han3, Namhee Yu3, Seokjin Ham4, Insoon Jang4, Byunghee Kang4, Sunguk Shin5, Lian Kim6, Seung-Won Lee7, Dougu Nam8, Jihyun F Kim5,9, Namshin Kim2, Seon-Young Kim10, Sanghyuk Lee3, Tae-Young Roh4,11, Byungwook Lee1.
Abstract
The explosive growth of next-generation sequencing data has resulted in ultra-large-scale datasets and ensuing computational problems. In Korea, the amount of genomic data has been increasing rapidly in the recent years. Leveraging these big data requires researchers to use large-scale computational resources and analysis pipelines. A promising solution for addressing this computational challenge is cloud computing, where CPUs, memory, storage, and programs are accessible in the form of virtual machines. Here, we present a cloud computing-based system, Bio-Express, that provides user-friendly, cost-effective analysis of massive genomic datasets. Bio-Express is loaded with predefined multi-omics data analysis pipelines, which are divided into genome, transcriptome, epigenome, and metagenome pipelines. Users can employ predefined pipelines or create a new pipeline for analyzing their own omics data. We also developed several web-based services for facilitating downstream analysis of genome data. Bio-Express web service is freely available at https://www.bioexpress.re.kr/.Entities:
Keywords: analysis pipeline; cloud computing; genomic data; web server; workflow system
Year: 2020 PMID: 32224841 PMCID: PMC7120352 DOI: 10.5808/GI.2020.18.1.e8
Source DB: PubMed Journal: Genomics Inform ISSN: 1598-866X
Fig. 1.The interface of the Bio-Express workspace. The Bio-Express workflow editor has eight panels: the user’s projects (A), the file explorer (B), the canvas (C), the analysis programs of the current pipeline (D), the program parameter settings (E), the pipeline list (F), the program list (G), and the job execution history (H).
Fig. 2.Screenshot of the RNA-sequencing (RNA-Seq) schematic diagram and its pipeline. The RNA-Seq pipeline was implemented on the canvas.
Fig. 3.Workflow for the histone modification analysis pipeline. ChIP-Seq, chromatin immunoprecipitation sequencing.
Fig. 4.Simplified workflow diagram of the metagenomics pipelines.
Fig. 5.Screenshot of Bio-Express results. Users can view files in various formats, including text, HTML, and PNG on the web.
Web servers for gene-set, pathway, and pharmacogenomic data analysis
| Tool | Main function | Web address | |
|---|---|---|---|
| ADGO2 | Gene-set analysis of microarray data | ||
| ExPathNet | Gene-set analysis with network-weighted clustering | ||
| GSA-SNP2 | Gene-set analysis of GWAS summary data | ||
| Barcas | Software for analyzing barcode-seq data |
GWAS, genome-wide association studies.