Literature DB >> 23786767

Harnessing virtual machines to simplify next-generation DNA sequencing analysis.

Julie Nocq1, Magalie Celton, Patrick Gendron, Sebastien Lemieux, Brian T Wilhelm.   

Abstract

MOTIVATION: The growth of next-generation sequencing (NGS) has not only dramatically accelerated the pace of research in the field of genomics, but it has also opened the door to personalized medicine and diagnostics. The resulting flood of data has led to the rapid development of large numbers of bioinformatic tools for data analysis, creating a challenging situation for researchers when choosing and configuring a variety of software for their analysis, and for other researchers trying to replicate their analysis. As NGS technology continues to expand from the research environment into clinical laboratories, the challenges associated with data analysis have the potential to slow the adoption of this technology.
RESULTS: Here we discuss the potential of virtual machines (VMs) to be used as a method for sharing entire installations of NGS software (bioinformatic 'pipelines'). VMs are created by programs designed to allow multiple operating systems to co-exist on a single physical machine, and they can be made following the object-oriented paradigm of encapsulating data and methods together. This allows NGS data to be distributed within a VM, along with the pre-configured software for its analysis. Although VMs have historically suffered from poor performance relative to native operating systems, we present benchmarking results demonstrating that this reduced performance can now be minimized. We further discuss the many potential benefits of VMs as a solution for NGS analysis and describe several published examples. Lastly, we consider the benefits of VMs in facilitating the introduction of NGS technology into the clinical environment. CONTACT: brian.wilhelm@umontreal.ca.

Mesh:

Year:  2013        PMID: 23786767     DOI: 10.1093/bioinformatics/btt352

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  15 in total

1.  Omics Pipe: a community-based framework for reproducible multi-omics data analysis.

Authors:  Kathleen M Fisch; Tobias Meißner; Louis Gioia; Jean-Christophe Ducom; Tristan M Carland; Salvatore Loguercio; Andrew I Su
Journal:  Bioinformatics       Date:  2015-01-30       Impact factor: 6.937

2.  xGDBvm: A Web GUI-Driven Workflow for Annotating Eukaryotic Genomes in the Cloud.

Authors:  Jon Duvick; Daniel S Standage; Nirav Merchant; Volker P Brendel
Journal:  Plant Cell       Date:  2016-03-28       Impact factor: 11.277

3.  Bioinformatics pipelines for targeted resequencing and whole-exome sequencing of human and mouse genomes: a virtual appliance approach for instant deployment.

Authors:  Jason Li; Maria A Doyle; Isaam Saeed; Stephen Q Wong; Victoria Mar; David L Goode; Franco Caramia; Ken Doig; Georgina L Ryland; Ella R Thompson; Sally M Hunter; Saman K Halgamuge; Jason Ellul; Alexander Dobrovic; Ian G Campbell; Anthony T Papenfuss; Grant A McArthur; Richard W Tothill
Journal:  PLoS One       Date:  2014-04-21       Impact factor: 3.240

4.  SeqBench: integrated solution for the management and analysis of exome sequencing data.

Authors:  Andreas Dander; Stephan Pabinger; Michael Sperk; Maria Fischer; Gernot Stocker; Zlatko Trajanoski
Journal:  BMC Res Notes       Date:  2014-01-20

5.  Automated Selection of Hotspots (ASH): enhanced automated segmentation and adaptive step finding for Ki67 hotspot detection in adrenal cortical cancer.

Authors:  Hao Lu; Thomas G Papathomas; David van Zessen; Ivo Palli; Ronald R de Krijger; Peter J van der Spek; Winand N M Dinjens; Andrew P Stubbs
Journal:  Diagn Pathol       Date:  2014-11-25       Impact factor: 2.644

6.  BioImg.org: A Catalog of Virtual Machine Images for the Life Sciences.

Authors:  Martin Dahlö; Frédéric Haziza; Aleksi Kallio; Eija Korpelainen; Erik Bongcam-Rudloff; Ola Spjuth
Journal:  Bioinform Biol Insights       Date:  2015-09-10

7.  SeqMule: automated pipeline for analysis of human exome/genome sequencing data.

Authors:  Yunfei Guo; Xiaolei Ding; Yufeng Shen; Gholson J Lyon; Kai Wang
Journal:  Sci Rep       Date:  2015-09-18       Impact factor: 4.379

Review 8.  Stem cell systems informatics for advanced clinical biodiagnostics: tracing molecular signatures from bench to bedside.

Authors:  Randolph S Faustino; D Kent Arrell; Clifford D L Folmes; Andre Terzic; Carmen Perez-Terzic
Journal:  Croat Med J       Date:  2013-08       Impact factor: 1.351

9.  MEMOSys 2.0: an update of the bioinformatics database for genome-scale models and genomic data.

Authors:  Stephan Pabinger; Rene Snajder; Timo Hardiman; Michaela Willi; Andreas Dander; Zlatko Trajanoski
Journal:  Database (Oxford)       Date:  2014-02-14       Impact factor: 3.451

Review 10.  Validation of high throughput sequencing and microbial forensics applications.

Authors:  Bruce Budowle; Nancy D Connell; Anna Bielecka-Oder; Rita R Colwell; Cindi R Corbett; Jacqueline Fletcher; Mats Forsman; Dana R Kadavy; Alemka Markotic; Stephen A Morse; Randall S Murch; Antti Sajantila; Sarah E Schmedes; Krista L Ternus; Stephen D Turner; Samuel Minot
Journal:  Investig Genet       Date:  2014-07-30
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.