Literature DB >> 32913637

QUARTIC: QUick pArallel algoRithms for high-Throughput sequencIng data proCessing.

Frédéric Jarlier1,2,3,4, Nicolas Joly5, Nicolas Fedy1,2,3,4,6, Thomas Magalhaes1,2,3,4,6, Leonor Sirotti1,2,3,4,6, Paul Paganiban1,2,3,4,6, Firmin Martin1,2,3,4,6, Michael McManus7, Philippe Hupé1,2,3,4,8.   

Abstract

Life science has entered the so-called 'big data era' where biologists, clinicians and bioinformaticians are overwhelmed with high-throughput sequencing data. While they offer new insights to decipher the genome structure they also raise major challenges to use them for daily clinical practice care and diagnosis purposes as they are bigger and bigger. Therefore, we implemented a software to reduce the time to delivery for the alignment and the sorting of high-throughput sequencing data.  Our solution is implemented using Message Passing Interface and is intended for high-performance computing architecture. The software scales linearly with respect to the size of the data and ensures a total reproducibility with the traditional tools. For example, a 300X whole genome can be aligned and sorted within less than 9 hours with 128 cores. The software offers significant speed-up using multi-cores and multi-nodes parallelization. Copyright:
© 2020 Jarlier F et al.

Entities:  

Keywords:  Alignment; High-Performance Computing; High-Throughput Sequencing; MPI; Sorting

Year:  2020        PMID: 32913637      PMCID: PMC7429925          DOI: 10.12688/f1000research.22954.2

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


  11 in total

1.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

2.  Sambamba: fast processing of NGS alignment formats.

Authors:  Artem Tarasov; Albert J Vilella; Edwin Cuppen; Isaac J Nijman; Pjotr Prins
Journal:  Bioinformatics       Date:  2015-02-19       Impact factor: 6.937

3.  Integrating Genomics into Healthcare: A Global Responsibility.

Authors:  Zornitza Stark; Lena Dolman; Teri A Manolio; Brad Ozenberger; Sue L Hill; Mark J Caulfied; Yves Levy; David Glazer; Julia Wilson; Mark Lawler; Tiffany Boughtwood; Jeffrey Braithwaite; Peter Goodhand; Ewan Birney; Kathryn N North
Journal:  Am J Hum Genet       Date:  2019-01-03       Impact factor: 11.025

4.  Supercomputing for the parallelization of whole genome analysis.

Authors:  Megan J Puckelwartz; Lorenzo L Pesce; Viswateja Nelakuditi; Lisa Dellefave-Castillo; Jessica R Golbus; Sharlene M Day; Thomas P Cappola; Gerald W Dorn; Ian T Foster; Elizabeth M McNally
Journal:  Bioinformatics       Date:  2014-02-12       Impact factor: 6.937

5.  Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls.

Authors:  Justin M Zook; Brad Chapman; Jason Wang; David Mittelman; Oliver Hofmann; Winston Hide; Marc Salit
Journal:  Nat Biotechnol       Date:  2014-02-16       Impact factor: 54.908

6.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

7.  Review of applications of high-throughput sequencing in personalized medicine: barriers and facilitators of future progress in research and clinical application.

Authors:  Gaye Lightbody; Valeriia Haberland; Fiona Browne; Laura Taggart; Huiru Zheng; Eileen Parkes; Jaine K Blayney
Journal:  Brief Bioinform       Date:  2019-09-27       Impact factor: 11.622

8.  Halvade: scalable sequence analysis with MapReduce.

Authors:  Dries Decap; Joke Reumers; Charlotte Herzeel; Pascal Costanza; Jan Fostier
Journal:  Bioinformatics       Date:  2015-03-26       Impact factor: 6.937

9.  Leveraging the power of high performance computing for next generation sequencing data analysis: tricks and twists from a high throughput exome workflow.

Authors:  Amit Kawalia; Susanne Motameny; Stephan Wonczak; Holger Thiele; Lech Nieroda; Kamel Jabbari; Stefan Borowski; Vishal Sinha; Wilfried Gunia; Ulrich Lang; Viktor Achter; Peter Nürnberg
Journal:  PLoS One       Date:  2015-05-05       Impact factor: 3.240

10.  Fast and accurate long-read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2010-01-15       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.