Literature DB >> 27742661

The present and future of de novo whole-genome assembly.

Jang-Il Sohn, Jin-Wu Nam.   

Abstract

As the advent of next-generation sequencing (NGS) technology, various de novo assembly algorithms based on the de Bruijn graph have been developed to construct chromosome-level sequences. However, numerous technical or computational challenges in de novo assembly still remain, although many bright ideas and heuristics have been suggested to tackle the challenges in both experimental and computational settings. In this review, we categorize de novo assemblers on the basis of the type of de Bruijn graphs (Hamiltonian and Eulerian) and discuss the challenges of de novo assembly for short NGS reads regarding computational complexity and assembly ambiguity. Then, we discuss how the limitations of the short reads can be overcome by using a single-molecule sequencing platform that generates long reads of up to several kilobases. In fact, the long read assembly has caused a paradigm shift in whole-genome assembly in terms of algorithms and supporting steps. We also summarize (i) hybrid assemblies using both short and long reads and (ii) overlap-based assemblies for long reads and discuss their challenges and future prospects. This review provides guidelines to determine the optimal approach for a given input data type, computational budget or genome.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  de Bruijn graph; de novo assembly algorithms; next-generation sequencing; single-molecule sequencing

Mesh:

Year:  2018        PMID: 27742661     DOI: 10.1093/bib/bbw096

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  49 in total

1.  An efficient classification algorithm for NGS data based on text similarity.

Authors:  Xiangyu Liao; Xingyu Liao; Wufei Zhu; Lu Fang; Xing Chen
Journal:  Genet Res (Camb)       Date:  2018-09-17       Impact factor: 1.588

2.  Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.

Authors:  Nathan D Olson; Todd J Treangen; Christopher M Hill; Victoria Cepeda-Espinoza; Jay Ghurye; Sergey Koren; Mihai Pop
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

3.  Multiplexed Non-barcoded Long-Read Sequencing and Assembling Genomes of Bacillus Strains in Error-Free Simulations.

Authors:  Jiating Qian; Qiao Meng; Yifan Feng; Xuanxuan Mao; Yayue Ling; Jie Li
Journal:  Curr Microbiol       Date:  2019-11-13       Impact factor: 2.188

Review 4.  Comparative immunogenomics of molluscs.

Authors:  Jonathan H Schultz; Coen M Adema
Journal:  Dev Comp Immunol       Date:  2017-03-18       Impact factor: 3.636

5.  msRepDB: a comprehensive repetitive sequence database of over 80 000 species.

Authors:  Xingyu Liao; Kang Hu; Adil Salhi; You Zou; Jianxin Wang; Xin Gao
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

6.  High-Level Antibiotic Tolerance of a Clinically Isolated Enterococcus faecalis Strain.

Authors:  Huan Gu; Sweta Roy; Xiaohui Zheng; Tian Gao; Huilin Ma; Zafer Soultan; Christopher Fortner; Shikha Nangia; Dacheng Ren
Journal:  Appl Environ Microbiol       Date:  2020-12-17       Impact factor: 4.792

7.  Whole genome and transcriptome maps of the entirely black native Korean chicken breed Yeonsan Ogye.

Authors:  Jang-Il Sohn; Kyoungwoo Nam; Hyosun Hong; Jun-Mo Kim; Dajeong Lim; Kyung-Tai Lee; Yoon Jung Do; Chang Yeon Cho; Namshin Kim; Han-Ha Chai; Jin-Wu Nam
Journal:  Gigascience       Date:  2018-07-01       Impact factor: 6.524

8.  A CAZyme-Rich Genome of a Taxonomically Novel Rhodophyte-Associated Carrageenolytic Marine Bacterium.

Authors:  Delbert Almerick T Boncan; Anne Marjorie E David; Arturo O Lluisma
Journal:  Mar Biotechnol (NY)       Date:  2018-06-23       Impact factor: 3.619

Review 9.  Interferon-stimulated genes: new platforms and computational approaches.

Authors:  Richard Green; Reneé C Ireton; Michael Gale
Journal:  Mamm Genome       Date:  2018-07-07       Impact factor: 3.224

10.  The genome of the Pyrenean desman and the effects of bottlenecks and inbreeding on the genomic landscape of an endangered species.

Authors:  Lídia Escoda; Jose Castresana
Journal:  Evol Appl       Date:  2021-05-29       Impact factor: 5.183

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.