Literature DB >> 30576505

LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly.

Gui-Cai Xu1,2, Tian-Jun Xu2, Rui Zhu1,3, Yan Zhang1, Shang-Qi Li1, Hong-Wei Wang1, Jiong-Tang Li1.   

Abstract

Background: Completing a genome is an important goal of genome assembly. However, many assemblies, including reference assemblies, are unfinished and have a number of gaps. Long reads obtained from third-generation sequencing (TGS) platforms can help close these gaps and improve assembly contiguity. However, current gap-closure approaches using long reads require extensive runtime and high memory usage. Thus, a fast and memory-efficient approach using long reads is needed to obtain complete genomes. Findings: We developed LR_Gapcloser to rapidly and efficiently close the gaps in genome assembly. This tool utilizes long reads generated from TGS sequencing platforms. Tested on de novo assembled gaps, repeat-derived gaps, and real gaps, LR_Gapcloser closed a higher number of gaps faster and with a lower error rate and a much lower memory usage than two existing, state-of-the art tools. This tool utilized raw reads to fill more gaps than when using error-corrected reads. It is applicable to gaps in the assemblies by different approaches and from large and complex genomes. After performing gap-closure using this tool, the contig N50 size of the human CHM1 genome was improved from 143 kb to 19 Mb, a 132-fold increase. We also closed the gaps in the Triticum urartu genome, a large genome rich in repeats; the contig N50 size was increased by 40%. Further, we evaluated the contiguity and correctness of six hybrid assembly strategies by combining the optimal TGS-based and next-generation sequencing-based assemblers with LR_Gapcloser. A proposed and optimal hybrid strategy generated a new human CHM1 genome assembly with marked contiguity. The contig N50 value was greater than 28 Mb, which is larger than previous non-reference assemblies of the diploid human genome. Conclusions: LR_Gapcloser is a fast and efficient tool that can be used to close gaps and improve the contiguity of genome assemblies. A proposed hybrid assembly including this tool promises reference-grade assemblies. The software is available at http://www.fishbrowser.org/software/LR_Gapcloser/.

Entities:  

Mesh:

Year:  2019        PMID: 30576505      PMCID: PMC6324547          DOI: 10.1093/gigascience/giy157

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  44 in total

1.  Graph accordance of next-generation sequence assemblies.

Authors:  Guohui Yao; Liang Ye; Hongyu Gao; Patrick Minx; Wesley C Warren; George M Weinstock
Journal:  Bioinformatics       Date:  2011-10-23       Impact factor: 6.937

2.  Continuous base identification for single-molecule nanopore DNA sequencing.

Authors:  James Clarke; Hai-Chen Wu; Lakmal Jayasinghe; Alpesh Patel; Stuart Reid; Hagan Bayley
Journal:  Nat Nanotechnol       Date:  2009-02-22       Impact factor: 39.213

3.  MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads.

Authors:  Chuan-Le Xiao; Ying Chen; Shang-Qian Xie; Kai-Ning Chen; Yan Wang; Yue Han; Feng Luo; Zhi Xie
Journal:  Nat Methods       Date:  2017-09-18       Impact factor: 28.547

4.  Versatile and open software for comparing large genomes.

Authors:  Stefan Kurtz; Adam Phillippy; Arthur L Delcher; Michael Smoot; Martin Shumway; Corina Antonescu; Steven L Salzberg
Journal:  Genome Biol       Date:  2004-01-30       Impact factor: 13.583

5.  GMcloser: closing gaps in assemblies accurately with a likelihood-based selection of contig or long-read alignments.

Authors:  Shunichi Kosugi; Hideki Hirakawa; Satoshi Tabata
Journal:  Bioinformatics       Date:  2015-08-10       Impact factor: 6.937

6.  Resolving the complexity of the human genome using single-molecule sequencing.

Authors:  Mark J P Chaisson; John Huddleston; Megan Y Dennis; Peter H Sudmant; Maika Malig; Fereydoun Hormozdiari; Francesca Antonacci; Urvashi Surti; Richard Sandstrom; Matthew Boitano; Jane M Landolin; John A Stamatoyannopoulos; Michael W Hunkapiller; Jonas Korlach; Evan E Eichler
Journal:  Nature       Date:  2014-11-10       Impact factor: 49.962

7.  The diploid genome sequence of an individual human.

Authors:  Samuel Levy; Granger Sutton; Pauline C Ng; Lars Feuk; Aaron L Halpern; Brian P Walenz; Nelson Axelrod; Jiaqi Huang; Ewen F Kirkness; Gennady Denisov; Yuan Lin; Jeffrey R MacDonald; Andy Wing Chun Pang; Mary Shago; Timothy B Stockwell; Alexia Tsiamouri; Vineet Bafna; Vikas Bansal; Saul A Kravitz; Dana A Busam; Karen Y Beeson; Tina C McIntosh; Karin A Remington; Josep F Abril; John Gill; Jon Borman; Yu-Hui Rogers; Marvin E Frazier; Stephen W Scherer; Robert L Strausberg; J Craig Venter
Journal:  PLoS Biol       Date:  2007-09-04       Impact factor: 8.029

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  Fast and accurate long-read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2010-01-15       Impact factor: 6.937

10.  Genomic characterization of large heterochromatic gaps in the human genome assembly.

Authors:  Nicolas Altemose; Karen H Miga; Mauro Maggioni; Huntington F Willard
Journal:  PLoS Comput Biol       Date:  2014-05-15       Impact factor: 4.475

View more
  51 in total

1.  LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly.

Authors:  Gui-Cai Xu; Tian-Jun Xu; Rui Zhu; Yan Zhang; Shang-Qi Li; Hong-Wei Wang; Jiong-Tang Li
Journal:  Gigascience       Date:  2019-01-01       Impact factor: 6.524

2.  CSA: A high-throughput chromosome-scale assembly pipeline for vertebrate genomes.

Authors:  Heiner Kuhl; Ling Li; Sven Wuertz; Matthias Stöck; Xu-Fang Liang; Christophe Klopp
Journal:  Gigascience       Date:  2020-05-01       Impact factor: 6.524

3.  Enabling large-scale genome editing at repetitive elements by reducing DNA nicking.

Authors:  Cory J Smith; Oscar Castanon; Khaled Said; Verena Volf; Parastoo Khoshakhlagh; Amanda Hornick; Raphael Ferreira; Chun-Ting Wu; Marc Güell; Shilpa Garg; Alex H M Ng; Hannu Myllykallio; George M Church
Journal:  Nucleic Acids Res       Date:  2020-05-21       Impact factor: 16.971

4.  TGS-GapCloser: A fast and accurate gap closer for large genomes with low coverage of error-prone long reads.

Authors:  Mengyang Xu; Lidong Guo; Shengqiang Gu; Ou Wang; Rui Zhang; Brock A Peters; Guangyi Fan; Xin Liu; Xun Xu; Li Deng; Yongwei Zhang
Journal:  Gigascience       Date:  2020-09-01       Impact factor: 6.524

5.  Improved contiguity of the threespine stickleback genome using long-read sequencing.

Authors:  Shivangi Nath; Daniel E Shaw; Michael A White
Journal:  G3 (Bethesda)       Date:  2021-02-09       Impact factor: 3.154

6.  Chromosome Fusion Affects Genetic Diversity and Evolutionary Turnover of Functional Loci but Consistently Depends on Chromosome Size.

Authors:  Francesco Cicconardi; James J Lewis; Simon H Martin; Robert D Reed; Charles G Danko; Stephen H Montgomery
Journal:  Mol Biol Evol       Date:  2021-09-27       Impact factor: 16.240

7.  The sockeye salmon genome, transcriptome, and analyses identifying population defining regions of the genome.

Authors:  Kris A Christensen; Eric B Rondeau; David R Minkley; Dionne Sakhrani; Carlo A Biagi; Anne-Marie Flores; Ruth E Withler; Scott A Pavey; Terry D Beacham; Theresa Godin; Eric B Taylor; Michael A Russello; Robert H Devlin; Ben F Koop
Journal:  PLoS One       Date:  2020-10-29       Impact factor: 3.240

8.  Dissecting the chromosome-level genome of the Asian Clam (Corbicula fluminea).

Authors:  Tongqing Zhang; Jiawen Yin; Shengkai Tang; Daming Li; Xiankun Gu; Shengyu Zhang; Weiguo Suo; Xiaowei Liu; Yanshan Liu; Qicheng Jiang; Muzi Zhao; Yue Yin; Jianlin Pan
Journal:  Sci Rep       Date:  2021-07-22       Impact factor: 4.379

9.  The Assembled and Annotated Genome of the Fairy-Ring Fungus Marasmius oreades.

Authors:  Markus Hiltunen; Sandra Lorena Ament-Velásquez; Hanna Johannesson
Journal:  Genome Biol Evol       Date:  2021-07-06       Impact factor: 3.416

10.  Identification of a dual orange/far-red and blue light photoreceptor from an oceanic green picoplankton.

Authors:  Yuko Makita; Shigekatsu Suzuki; Keiji Fushimi; Setsuko Shimada; Aya Suehisa; Manami Hirata; Tomoko Kuriyama; Yukio Kurihara; Hidefumi Hamasaki; Emiko Okubo-Kurihara; Kazutoshi Yoshitake; Tsuyoshi Watanabe; Masaaki Sakuta; Takashi Gojobori; Tomoko Sakami; Rei Narikawa; Haruyo Yamaguchi; Masanobu Kawachi; Minami Matsui
Journal:  Nat Commun       Date:  2021-06-16       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.