Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Fast-SG: an alignment-free algorithm for hybrid assembly.

Literature DB >> 29741627

Fast-SG: an alignment-free algorithm for hybrid assembly.

Alex Di Genova^1,2,3,4,5, Gonzalo A Ruz^1,6, Marie-France Sagot^3,4, Alejandro Maass^2,5,7.

Abstract

Background: Long-read sequencing technologies are the ultimate solution for genome repeats, allowing near reference-level reconstructions of large genomes. However, long-read de novo assembly pipelines are computationally intense and require a considerable amount of coverage, thereby hindering their broad application to the assembly of large genomes. Alternatively, hybrid assembly methods that combine short- and long-read sequencing technologies can reduce the time and cost required to produce de novo assemblies of large genomes.
Results: Here, we propose a new method, called Fast-SG, that uses a new ultrafast alignment-free algorithm specifically designed for constructing a scaffolding graph using light-weight data structures. Fast-SG can construct the graph from either short or long reads. This allows the reuse of efficient algorithms designed for short-read data and permits the definition of novel modular hybrid assembly pipelines. Using comprehensive standard datasets and benchmarks, we show how Fast-SG outperforms the state-of-the-art short-read aligners when building the scaffoldinggraph and can be used to extract linking information from either raw or error-corrected long reads. We also show how a hybrid assembly approach using Fast-SG with shallow long-read coverage (5X) and moderate computational resources can produce long-range and accurate reconstructions of the genomes of Arabidopsis thaliana (Ler-0) and human (NA12878). Conclusions: Fast-SG opens a door to achieve accurate hybrid long-range reconstructions of large genomes with low effort, high portability, and low cost.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2018 PMID： 29741627 PMCID： PMC6007556 DOI： 10.1093/gigascience/giy048

Source DB: PubMed Journal: Gigascience ISSN： 2047-217X Impact factor: 6.524

41 in total

1. SSAHA: a fast search method for large DNA databases.

Authors: Z Ning; A J Cox; J C Mullikin
Journal: Genome Res Date: 2001-10 Impact factor: 9.043

2. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors: Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal: J Comput Biol Date: 2012-04-16 Impact factor: 1.479

3. The MaSuRCA genome assembler.

Authors: Aleksey V Zimin; Guillaume Marçais; Daniela Puiu; Michael Roberts; Steven L Salzberg; James A Yorke
Journal: Bioinformatics Date: 2013-08-29 Impact factor: 6.937

4. Fast gapped-read alignment with Bowtie 2.

Authors: Ben Langmead; Steven L Salzberg
Journal: Nat Methods Date: 2012-03-04 Impact factor: 28.547

Review 5. Repetitive DNA and next-generation sequencing: computational challenges and solutions.

Authors: Todd J Treangen; Steven L Salzberg
Journal: Nat Rev Genet Date: 2011-11-29 Impact factor: 53.242

6. Fast-SG: an alignment-free algorithm for hybrid assembly.

Authors: Alex Di Genova; Gonzalo A Ruz; Marie-France Sagot; Alejandro Maass
Journal: Gigascience Date: 2018-05-01 Impact factor: 6.524

7. Paired-end sequencing of Fosmid libraries by Illumina.

Authors: Louise J S Williams; Diana G Tabbaa; Na Li; Aaron M Berlin; Terrance P Shea; Iain Maccallum; Michael S Lawrence; Yotam Drier; Gad Getz; Sarah K Young; David B Jaffe; Chad Nusbaum; Andreas Gnirke
Journal: Genome Res Date: 2012-07-16 Impact factor: 9.043

8. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.

Authors: Valerie A Schneider; Tina Graves-Lindsay; Kerstin Howe; Nathan Bouk; Hsiu-Chuan Chen; Paul A Kitts; Terence D Murphy; Kim D Pruitt; Françoise Thibaud-Nissen; Derek Albracht; Robert S Fulton; Milinn Kremitzki; Vincent Magrini; Chris Markovic; Sean McGrath; Karyn Meltz Steinberg; Kate Auger; William Chow; Joanna Collins; Glenn Harden; Timothy Hubbard; Sarah Pelan; Jared T Simpson; Glen Threadgold; James Torrance; Jonathan M Wood; Laura Clarke; Sergey Koren; Matthew Boitano; Paul Peluso; Heng Li; Chen-Shan Chin; Adam M Phillippy; Richard Durbin; Richard K Wilson; Paul Flicek; Evan E Eichler; Deanna M Church
Journal: Genome Res Date: 2017-04-10 Impact factor: 9.043

9. Edlib: a C/C ++ library for fast, exact sequence alignment using edit distance.

Authors: Martin Šošic; Mile Šikic
Journal: Bioinformatics Date: 2017-05-01 Impact factor: 6.937

10. A comprehensive evaluation of assembly scaffolding tools.

Authors: Martin Hunt; Chris Newbold; Matthew Berriman; Thomas D Otto
Journal: Genome Biol Date: 2014-03-03 Impact factor: 13.583

4 in total

1. Fast-SG: an alignment-free algorithm for hybrid assembly.

Authors: Alex Di Genova; Gonzalo A Ruz; Marie-France Sagot; Alejandro Maass
Journal: Gigascience Date: 2018-05-01 Impact factor: 6.524

2. High-quality carnivoran genomes from roadkill samples enable comparative species delineation in aardwolf and bat-eared fox.

Authors: Rémi Allio; Marie-Ka Tilak; Celine Scornavacca; Nico L Avenant; Andrew C Kitchener; Erwan Corre; Benoit Nabholz; Frédéric Delsuc
Journal: Elife Date: 2021-02-18 Impact factor: 8.140

3. LRScaf: improving draft genomes using long noisy reads.

Authors: Mao Qin; Shigang Wu; Alun Li; Fengli Zhao; Hu Feng; Lulu Ding; Jue Ruan
Journal: BMC Genomics Date: 2019-12-09 Impact factor: 3.969

4. Identification of a dual orange/far-red and blue light photoreceptor from an oceanic green picoplankton.

Authors: Yuko Makita; Shigekatsu Suzuki; Keiji Fushimi; Setsuko Shimada; Aya Suehisa; Manami Hirata; Tomoko Kuriyama; Yukio Kurihara; Hidefumi Hamasaki; Emiko Okubo-Kurihara; Kazutoshi Yoshitake; Tsuyoshi Watanabe; Masaaki Sakuta; Takashi Gojobori; Tomoko Sakami; Rei Narikawa; Haruyo Yamaguchi; Masanobu Kawachi; Minami Matsui
Journal: Nat Commun Date: 2021-06-16 Impact factor: 14.919

4 in total