Literature DB >> 30594583

Hybrid assembly of ultra-long Nanopore reads augmented with 10x-Genomics contigs: Demonstrated with a human genome.

Zhanshan Sam Ma1, Lianwei Li2, Chengxi Ye3, Minsheng Peng4, Ya-Ping Zhang5.   

Abstract

The 3rd generation of sequencing (3GS) technologies generate ultra-long reads (up to 1 Mb), which makes it possible to eliminate gaps and effectively resolve repeats in genome assembly. However, the 3GS technologies suffer from the high base-level error rates (15%-40%) and high sequencing costs. To address these issues, the hybrid assembly strategy, which utilizes both 3GS reads and inexpensive NGS (next generation sequencing) short reads, was invented. Here, we use 10×-Genomics® technology, which integrates a novel bar-coding strategy with Illumina® NGS with an advantage of revealing long-range sequence information, to replace common NGS short reads for hybrid assembly of long erroneous 3GS reads. We demonstrate the feasibility of integrating the 3GS with 10×-Genomics technologies for a new strategy of hybrid de novo genome assembly by utilizing DBG2OLC and Sparc software packages, previously developed by the authors for regular hybrid assembly. Using a human genome as an example, we show that with only 7× coverage of ultra-long Nanopore® reads, augmented with 10× reads, our approach achieved nearly the same level of quality, compared with non-hybrid assembly with 35× coverage of Nanopore reads. Compared with the assembly with 10×-Genomics reads alone, our assembly is gapless with slightly high cost. These results suggest that our new hybrid assembly with ultra-long 3GS reads augmented with 10×-Genomics reads offers a low-cost (less than ¼ the cost of the non-hybrid assembly) and computationally light-weighted (only took 109 calendar hours with peak memory-usage = 61GB on a dual-CPU office workstation) solution for extending the wide applications of the 3GS technologies.
Copyright © 2018. Published by Elsevier Inc.

Entities:  

Keywords:  10× Genomics; 3GS (3rd generation sequencing); DBG2OLC; Human genome; Hybrid assembly; Nanopore; Sparc

Mesh:

Year:  2018        PMID: 30594583     DOI: 10.1016/j.ygeno.2018.12.013

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  11 in total

1.  Assessment of human diploid genome assembly with 10x Linked-Reads data.

Authors:  Lu Zhang; Xin Zhou; Ziming Weng; Arend Sidow
Journal:  Gigascience       Date:  2019-11-01       Impact factor: 6.524

2.  Genome reconstruction and haplotype phasing using chromosome conformation capture methodologies.

Authors:  Zhichao Xu; Jesse R Dixon
Journal:  Brief Funct Genomics       Date:  2020-03-23       Impact factor: 4.241

3.  Nanopore Guided Assembly of Segmental Duplications near Telomeres.

Authors:  Eleni Adam; Tunazzina Islam; Desh Ranjan; Harold Riethman
Journal:  Proc IEEE Int Symp Bioinformatics Bioeng       Date:  2019-12-26

4.  Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes.

Authors:  Kishwar Shafin; Trevor Pesout; Ryan Lorig-Roach; Marina Haukness; Hugh E Olsen; Colleen Bosworth; Joel Armstrong; Kristof Tigyi; Nicholas Maurer; Sergey Koren; Fritz J Sedlazeck; Tobias Marschall; Simon Mayes; Vania Costa; Justin M Zook; Kelvin J Liu; Duncan Kilburn; Melanie Sorensen; Katy M Munson; Mitchell R Vollger; Jean Monlong; Erik Garrison; Evan E Eichler; Sofie Salama; David Haussler; Richard E Green; Mark Akeson; Adam Phillippy; Karen H Miga; Paolo Carnevali; Miten Jain; Benedict Paten
Journal:  Nat Biotechnol       Date:  2020-05-04       Impact factor: 54.908

5.  De Novo Assembly of a High-Quality Reference Genome for the Horned Lark (Eremophila alpestris).

Authors:  Nicholas A Mason; Paulo Pulgarin; Carlos Daniel Cadena; Irby J Lovette
Journal:  G3 (Bethesda)       Date:  2020-02-06       Impact factor: 3.154

Review 6.  Opportunities and challenges in long-read sequencing data analysis.

Authors:  Shanika L Amarasinghe; Shian Su; Xueyi Dong; Luke Zappia; Matthew E Ritchie; Quentin Gouil
Journal:  Genome Biol       Date:  2020-02-07       Impact factor: 13.583

Review 7.  Structural variant detection in cancer genomes: computational challenges and perspectives for precision oncology.

Authors:  Ianthe A E M van Belzen; Alexander Schönhuth; Patrick Kemmeren; Jayne Y Hehir-Kwa
Journal:  NPJ Precis Oncol       Date:  2021-03-02

8.  Accurate assembly of the olive baboon (Papio anubis) genome using long-read and Hi-C data.

Authors:  Sanjit Singh Batra; Michal Levy-Sakin; Jacqueline Robinson; Joseph Guillory; Steffen Durinck; Tauras P Vilgalys; Pui-Yan Kwok; Laura A Cox; Somasekar Seshagiri; Yun S Song; Jeffrey D Wall
Journal:  Gigascience       Date:  2020-12-07       Impact factor: 6.524

9.  Highly multiplexed, fast and accurate nanopore sequencing for verification of synthetic DNA constructs and sequence libraries.

Authors:  Andrew Currin; Neil Swainston; Mark S Dunstan; Adrian J Jervis; Paul Mulherin; Christopher J Robinson; Sandra Taylor; Pablo Carbonell; Katherine A Hollywood; Cunyu Yan; Eriko Takano; Nigel S Scrutton; Rainer Breitling
Journal:  Synth Biol (Oxf)       Date:  2019-10-29

Review 10.  Technical and Methodological Aspects of Cell-Free Nucleic Acids Analyzes.

Authors:  Zuzana Pös; Ondrej Pös; Jakub Styk; Angelika Mocova; Lucia Strieskova; Jaroslav Budis; Ludevit Kadasi; Jan Radvanszky; Tomas Szemes
Journal:  Int J Mol Sci       Date:  2020-11-16       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.