Literature DB >> 26056623

NxRepair: error correction in de novo sequence assembly using Nextera mate pairs.

Rebecca R Murphy1, Jared O'Connell2, Anthony J Cox2, Ole Schulz-Trieglaff2.   

Abstract

Scaffolding errors and incorrect repeat disambiguation during de novo assembly can result in large scale misassemblies in draft genomes. Nextera mate pair sequencing data provide additional information to resolve assembly ambiguities during scaffolding. Here, we introduce NxRepair, an open source toolkit for error correction in de novo assemblies that uses Nextera mate pair libraries to identify and correct large-scale errors. We show that NxRepair can identify and correct large scaffolding errors, without use of a reference sequence, resulting in quantitative improvements in the assembly quality. NxRepair can be downloaded from GitHub or PyPI, the Python Package Index; a tutorial and user documentation are also available.

Entities:  

Keywords:  Assembly quality; Automated error detection; De novo assembly; Error correction; Genome assembly; Insert size; Mate pair; Misassembly; Misassembly detection; Scaffolding

Year:  2015        PMID: 26056623      PMCID: PMC4458127          DOI: 10.7717/peerj.996

Source DB:  PubMed          Journal:  PeerJ        ISSN: 2167-8359            Impact factor:   2.984


  13 in total

1.  GAGE: A critical evaluation of genome assemblies and assembly algorithms.

Authors:  Steven L Salzberg; Adam M Phillippy; Aleksey Zimin; Daniela Puiu; Tanja Magoc; Sergey Koren; Todd J Treangen; Michael C Schatz; Arthur L Delcher; Michael Roberts; Guillaume Marçais; Mihai Pop; James A Yorke
Journal:  Genome Res       Date:  2012-01-06       Impact factor: 9.043

2.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

Review 3.  Next-generation DNA sequencing of paired-end tags (PET) for transcriptome and genome analyses.

Authors:  Melissa J Fullwood; Chia-Lin Wei; Edison T Liu; Yijun Ruan
Journal:  Genome Res       Date:  2009-04       Impact factor: 9.043

4.  A5-miseq: an updated pipeline to assemble microbial genomes from Illumina MiSeq data.

Authors:  David Coil; Guillaume Jospin; Aaron E Darling
Journal:  Bioinformatics       Date:  2014-10-22       Impact factor: 6.937

5.  ALE: a generic assembly likelihood evaluation framework for assessing the accuracy of genome and metagenome assemblies.

Authors:  Scott C Clark; Rob Egan; Peter I Frazier; Zhong Wang
Journal:  Bioinformatics       Date:  2013-01-09       Impact factor: 6.937

6.  QUAST: quality assessment tool for genome assemblies.

Authors:  Alexey Gurevich; Vladislav Saveliev; Nikolay Vyahhi; Glenn Tesler
Journal:  Bioinformatics       Date:  2013-02-19       Impact factor: 6.937

7.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

8.  Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species.

Authors:  Keith R Bradnam; Joseph N Fass; Anton Alexandrov; Paul Baranay; Michael Bechner; Inanç Birol; Sébastien Boisvert; Jarrod A Chapman; Guillaume Chapuis; Rayan Chikhi; Hamidreza Chitsaz; Wen-Chi Chou; Jacques Corbeil; Cristian Del Fabbro; T Roderick Docking; Richard Durbin; Dent Earl; Scott Emrich; Pavel Fedotov; Nuno A Fonseca; Ganeshkumar Ganapathy; Richard A Gibbs; Sante Gnerre; Elénie Godzaridis; Steve Goldstein; Matthias Haimel; Giles Hall; David Haussler; Joseph B Hiatt; Isaac Y Ho; Jason Howard; Martin Hunt; Shaun D Jackman; David B Jaffe; Erich D Jarvis; Huaiyang Jiang; Sergey Kazakov; Paul J Kersey; Jacob O Kitzman; James R Knight; Sergey Koren; Tak-Wah Lam; Dominique Lavenier; François Laviolette; Yingrui Li; Zhenyu Li; Binghang Liu; Yue Liu; Ruibang Luo; Iain Maccallum; Matthew D Macmanes; Nicolas Maillet; Sergey Melnikov; Delphine Naquin; Zemin Ning; Thomas D Otto; Benedict Paten; Octávio S Paulo; Adam M Phillippy; Francisco Pina-Martins; Michael Place; Dariusz Przybylski; Xiang Qin; Carson Qu; Filipe J Ribeiro; Stephen Richards; Daniel S Rokhsar; J Graham Ruby; Simone Scalabrin; Michael C Schatz; David C Schwartz; Alexey Sergushichev; Ted Sharpe; Timothy I Shaw; Jay Shendure; Yujian Shi; Jared T Simpson; Henry Song; Fedor Tsarev; Francesco Vezzi; Riccardo Vicedomini; Bruno M Vieira; Jun Wang; Kim C Worley; Shuangye Yin; Siu-Ming Yiu; Jianying Yuan; Guojie Zhang; Hao Zhang; Shiguo Zhou; Ian F Korf
Journal:  Gigascience       Date:  2013-07-22       Impact factor: 6.524

9.  How to apply de Bruijn graphs to genome assembly.

Authors:  Phillip E C Compeau; Pavel A Pevzner; Glenn Tesler
Journal:  Nat Biotechnol       Date:  2011-11-08       Impact factor: 54.908

10.  REAPR: a universal tool for genome assembly evaluation.

Authors:  Martin Hunt; Taisei Kikuchi; Mandy Sanders; Chris Newbold; Matthew Berriman; Thomas D Otto
Journal:  Genome Biol       Date:  2013-05-27       Impact factor: 13.583

View more
  5 in total

1.  Complete genome sequence of the abscisic acid-utilizing strain Novosphingobium sp. P6W.

Authors:  Natalia E Gogoleva; Yevgeny A Nikolaichik; Timur T Ismailov; Vladimir Y Gorshkov; Vera I Safronova; Andrey A Belimov; Yuri Gogolev
Journal:  3 Biotech       Date:  2019-02-19       Impact factor: 2.406

Review 2.  Next Generation Sequencing of Actinobacteria for the Discovery of Novel Natural Products.

Authors:  Juan Pablo Gomez-Escribano; Silke Alt; Mervyn J Bibb
Journal:  Mar Drugs       Date:  2016-04-13       Impact factor: 5.118

Review 3.  Modern technologies and algorithms for scaffolding assembled genomes.

Authors:  Jay Ghurye; Mihai Pop
Journal:  PLoS Comput Biol       Date:  2019-06-05       Impact factor: 4.475

4.  Draft Genome Sequence of Parageobacillus thermoglucosidasius Strain TG4, a Hydrogenogenic Carboxydotrophic Bacterium Isolated from a Marine Sediment.

Authors:  Masao Inoue; Ayumi Tanimura; Yusuke Ogami; Taiki Hino; Suguru Okunishi; Hiroto Maeda; Takashi Yoshida; Yoshihiko Sako
Journal:  Microbiol Resour Announc       Date:  2019-01-31

5.  Tigmint: correcting assembly errors using linked reads from large molecules.

Authors:  Shaun D Jackman; Lauren Coombe; Justin Chu; Rene L Warren; Benjamin P Vandervalk; Sarah Yeo; Zhuyi Xue; Hamid Mohamadi; Joerg Bohlmann; Steven J M Jones; Inanc Birol
Journal:  BMC Bioinformatics       Date:  2018-10-26       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.