| Literature DB >> 28108448 |
Arnald Alonso1,2, Brittany N Lasseigne1, Kelly Williams1, Josh Nielsen1, Ryne C Ramaker1,3, Andrew A Hardigan1,3, Bobbi Johnston1, Brian S Roberts1, Sara J Cooper1, Sara Marsal2, Richard M Myers1.
Abstract
SUMMARY: The wide range of RNA-seq applications and their high-computational needs require the development of pipelines orchestrating the entire workflow and optimizing usage of available computational resources. We present aRNApipe, a project-oriented pipeline for processing of RNA-seq data in high-performance cluster environments. aRNApipe is highly modular and can be easily migrated to any high-performance computing (HPC) environment. The current applications included in aRNApipe combine the essential RNA-seq primary analyses, including quality control metrics, transcript alignment, count generation, transcript fusion identification, alternative splicing and sequence variant calling. aRNApipe is project-oriented and dynamic so users can easily update analyses to include or exclude samples or enable additional processing modules. Workflow parameters are easily set using a single configuration file that provides centralized tracking of all analytical processes. Finally, aRNApipe incorporates interactive web reports for sample tracking and a tool for managing the genome assemblies available to perform an analysis. AVAILABILITY AND DOCUMENTATION: https://github.com/HudsonAlpha/aRNAPipe ; DOI: 10.5281/zenodo.202950. CONTACT: rmyers@hudsonalpha.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28108448 PMCID: PMC5447234 DOI: 10.1093/bioinformatics/btx023
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.aRNApipe workflow for primary analysis of RNA-seq data