Literature DB >> 29617724

Nanopore sequencing technology and tools for genome assembly: computational analysis of the current state, bottlenecks and future directions.

Damla Senol Cali1, Jeremie S Kim1,2, Saugata Ghose1, Can Alkan3, Onur Mutlu1,2.   

Abstract

Nanopore sequencing technology has the potential to render other sequencing technologies obsolete with its ability to generate long reads and provide portability. However, high error rates of the technology pose a challenge while generating accurate genome assemblies. The tools used for nanopore sequence analysis are of critical importance, as they should overcome the high error rates of the technology. Our goal in this work is to comprehensively analyze current publicly available tools for nanopore sequence analysis to understand their advantages, disadvantages and performance bottlenecks. It is important to understand where the current tools do not perform well to develop better tools. To this end, we (1) analyze the multiple steps and the associated tools in the genome assembly pipeline using nanopore sequence data, and (2) provide guidelines for determining the appropriate tools for each step. Based on our analyses, we make four key observations: (1) the choice of the tool for basecalling plays a critical role in overcoming the high error rates of nanopore sequencing technology. (2) Read-to-read overlap finding tools, GraphMap and Minimap, perform similarly in terms of accuracy. However, Minimap has a lower memory usage, and it is faster than GraphMap. (3) There is a trade-off between accuracy and performance when deciding on the appropriate tool for the assembly step. The fast but less accurate assembler Miniasm can be used for quick initial assembly, and further polishing can be applied on top of it to increase the accuracy, which leads to faster overall assembly. (4) The state-of-the-art polishing tool, Racon, generates high-quality consensus sequences while providing a significant speedup over another polishing tool, Nanopolish. We analyze various combinations of different tools and expose the trade-offs between accuracy, performance, memory usage and scalability. We conclude that our observations can guide researchers and practitioners in making conscious and effective choices for each step of the genome assembly pipeline using nanopore sequence data. Also, with the help of bottlenecks we have found, developers can improve the current tools or build new ones that are both accurate and fast, to overcome the high error rates of the nanopore sequencing technology.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  assembly; genome analysis; genome sequencing; mapping; nanopore sequencing

Mesh:

Year:  2019        PMID: 29617724      PMCID: PMC6781587          DOI: 10.1093/bib/bby017

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  55 in total

1.  Genome assembly reborn: recent computational challenges.

Authors:  Mihai Pop
Journal:  Brief Bioinform       Date:  2009-05-29       Impact factor: 11.622

2.  RazerS--fast read mapping with sensitivity control.

Authors:  David Weese; Anne-Katrin Emde; Tobias Rausch; Andreas Döring; Knut Reinert
Journal:  Genome Res       Date:  2009-07-10       Impact factor: 9.043

3.  On genomic repeats and reproducibility.

Authors:  Can Firtina; Can Alkan
Journal:  Bioinformatics       Date:  2016-03-11       Impact factor: 6.937

4.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

5.  mrsFAST: a cache-oblivious algorithm for short-read mapping.

Authors:  Faraz Hach; Fereydoun Hormozdiari; Can Alkan; Farhad Hormozdiari; Inanc Birol; Evan E Eichler; S Cenk Sahinalp
Journal:  Nat Methods       Date:  2010-08       Impact factor: 28.547

6.  GateKeeper: a new hardware architecture for accelerating pre-alignment in DNA short read mapping.

Authors:  Mohammed Alser; Hasan Hassan; Hongyi Xin; Oguz Ergin; Onur Mutlu; Can Alkan
Journal:  Bioinformatics       Date:  2017-11-01       Impact factor: 6.937

7.  A reference bacterial genome dataset generated on the MinION™ portable single-molecule nanopore sequencer.

Authors:  Joshua Quick; Aaron R Quinlan; Nicholas J Loman
Journal:  Gigascience       Date:  2014-10-20       Impact factor: 6.524

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  Fast and accurate long-read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2010-01-15       Impact factor: 6.937

10.  Reducing assembly complexity of microbial genomes with single-molecule sequencing.

Authors:  Sergey Koren; Gregory P Harhay; Timothy P L Smith; James L Bono; Dayna M Harhay; Scott D Mcvey; Diana Radune; Nicholas H Bergman; Adam M Phillippy
Journal:  Genome Biol       Date:  2013       Impact factor: 13.583

View more
  49 in total

1.  Modeling multi-species RNA modification through multi-task curriculum learning.

Authors:  Yuanpeng Xiong; Xuan He; Dan Zhao; Tingzhong Tian; Lixiang Hong; Tao Jiang; Jianyang Zeng
Journal:  Nucleic Acids Res       Date:  2021-04-19       Impact factor: 16.971

2.  Potential m6A and m5C Methylations within the Genome of A Chinese African Swine Fever Virus Strain.

Authors:  Lijia Jia; Jianjun Chen; Haizhou Liu; Wenhui Fan; Depeng Wang; Jing Li; Di Liu
Journal:  Virol Sin       Date:  2020-04-08       Impact factor: 4.327

3.  Shouji: a fast and efficient pre-alignment filter for sequence alignment.

Authors:  Mohammed Alser; Hasan Hassan; Akash Kumar; Onur Mutlu; Can Alkan
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

4.  Orrella daihaiensis sp. nov., a bacterium isolated from Daihai Lake in Inner Mongolia.

Authors:  Kai Jiang; Bo Yuan; ChunLing Cao; ChenYing Zhang; Yang Liu; XiaoHu Hai; RuoXuan Li; KangYuan Qian; HongZhen Yang
Journal:  Arch Microbiol       Date:  2022-06-25       Impact factor: 2.552

5.  Mongoliitalea daihaiensis sp. nov., isolated from Daihai Lake in Inner Mongolia.

Authors:  Kai Jiang; Bo Yuan; Chun Ling Cao; Chen Ying Zhang; Ruo Xuan Li; Yan An
Journal:  Arch Microbiol       Date:  2021-12-28       Impact factor: 2.552

Review 6.  Nanopore sequencing technology, bioinformatics and applications.

Authors:  Yunhao Wang; Yue Zhao; Audrey Bollas; Yuru Wang; Kin Fai Au
Journal:  Nat Biotechnol       Date:  2021-11-08       Impact factor: 54.908

7.  Nano2NGS-Muta: a framework for converting nanopore sequencing data to NGS-liked sequencing data for hotspot mutation detection.

Authors:  Jidong Lang; Jiguo Sun; Zhi Yang; Lei He; Yu He; Yanmei Chen; Lei Huang; Ping Li; Jialin Li; Liu Qin
Journal:  NAR Genom Bioinform       Date:  2022-04-21

8.  Molecular characterization of a new highly divergent Mobala related arenavirus isolated from Praomys sp. rodents.

Authors:  Emmanuel Nakouné; Nicolas Berthet; Huguette Simo Tchetgna; Stephane Descorps-Declère; Benjamin Selekon; Aurelia Kwasiborski; Mathias Vandenbogaert; Jean-Claude Manuguerra; Antoine Gessain; Valérie Caro
Journal:  Sci Rep       Date:  2021-05-13       Impact factor: 4.379

9.  A high-quality genome assembly of Morinda officinalis, a famous native southern herb in the Lingnan region of southern China.

Authors:  Jihua Wang; Shiqiang Xu; Yu Mei; Shike Cai; Yan Gu; Minyang Sun; Zhan Liang; Yong Xiao; Muqing Zhang; Shaohai Yang
Journal:  Hortic Res       Date:  2021-06-01       Impact factor: 6.793

10.  Design and MinION testing of a nanopore targeted gene sequencing panel for chronic lymphocytic leukemia.

Authors:  Paola Orsini; Crescenzio F Minervini; Cosimo Cumbo; Luisa Anelli; Antonella Zagaria; Angela Minervini; Nicoletta Coccaro; Giuseppina Tota; Paola Casieri; Luciana Impera; Elisa Parciante; Claudia Brunetti; Annamaria Giordano; Giorgina Specchia; Francesco Albano
Journal:  Sci Rep       Date:  2018-08-07       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.