Literature DB >> 30859176

Holistic optimization of an RNA-seq workflow for multi-threaded environments.

Ling-Hong Hung1, Wes Lloyd1, Radhika Agumbe Sridhar1, Saranya Devi Athmalingam Ravishankar1, Yuguang Xiong2, Eric Sobie2, Ka Yee Yeung1.   

Abstract

SUMMARY: For many next generation-sequencing pipelines, the most computationally intensive step is the alignment of reads to a reference sequence. As a result, alignment software such as the Burrows-Wheeler Aligner is optimized for speed and is often executed in parallel on the cloud. However, there are other less demanding steps that can also be optimized to significantly increase the speed especially when using many threads. We demonstrate this using a unique molecular identifier RNA-sequencing pipeline consisting of 3 steps: split, align, and merge. Optimization of all three steps yields a 40% increase in speed when executed using a single thread. However, when executed using 16 threads, we observe a 4-fold improvement over the original parallel implementation and more than an 8-fold improvement over the original single-threaded implementation. In contrast, optimizing only the alignment step results in just a 13% improvement over the original parallel workflow using 16 threads.
AVAILABILITY AND IMPLEMENTATION: Code (M.I.T. license), supporting scripts and Dockerfiles are available at https://github.com/BioDepot/LINCS_RNAseq_cpp and Docker images at https://hub.docker.com/r/biodepot/rnaseq-umi-cpp/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2019        PMID: 30859176      PMCID: PMC6792127          DOI: 10.1093/bioinformatics/btz169

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  Quantitative single-cell RNA-seq with unique molecular identifiers.

Authors:  Saiful Islam; Amit Zeisel; Simon Joost; Gioele La Manno; Pawel Zajac; Maria Kasper; Peter Lönnerberg; Sten Linnarsson
Journal:  Nat Methods       Date:  2013-12-22       Impact factor: 28.547

2.  STAR: ultrafast universal RNA-seq aligner.

Authors:  Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal:  Bioinformatics       Date:  2012-10-25       Impact factor: 6.937

3.  Near-optimal probabilistic RNA-seq quantification.

Authors:  Nicolas L Bray; Harold Pimentel; Páll Melsted; Lior Pachter
Journal:  Nat Biotechnol       Date:  2016-04-04       Impact factor: 54.908

4.  NCBI GEO: archive for functional genomics data sets--update.

Authors:  Tanya Barrett; Stephen E Wilhite; Pierre Ledoux; Carlos Evangelista; Irene F Kim; Maxim Tomashevsky; Kimberly A Marshall; Katherine H Phillippy; Patti M Sherman; Michelle Holko; Andrey Yefanov; Hyeseung Lee; Naigong Zhang; Cynthia L Robertson; Nadezhda Serova; Sean Davis; Alexandra Soboleva
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

5.  Salmon provides fast and bias-aware quantification of transcript expression.

Authors:  Rob Patro; Geet Duggal; Michael I Love; Rafael A Irizarry; Carl Kingsford
Journal:  Nat Methods       Date:  2017-03-06       Impact factor: 28.547

6.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

7.  A Comparison of mRNA Sequencing with Random Primed and 3'-Directed Libraries.

Authors:  Yuguang Xiong; Magali Soumillon; Jie Wu; Jens Hansen; Bin Hu; Johan G C van Hasselt; Gomathi Jayaraman; Ryan Lim; Mehdi Bouhaddou; Loren Ornelas; Jim Bochicchio; Lindsay Lenaeus; Jennifer Stocksdale; Jaehee Shim; Emilda Gomez; Dhruv Sareen; Clive Svendsen; Leslie M Thompson; Milind Mahajan; Ravi Iyengar; Eric A Sobie; Evren U Azeloglu; Marc R Birtwistle
Journal:  Sci Rep       Date:  2017-11-07       Impact factor: 4.379

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.