Literature DB >> 25890305

ScaffMatch: scaffolding algorithm based on maximum weight matching.

Igor Mandric1, Alex Zelikovsky1.   

Abstract

MOTIVATION: Next-generation high-throughput sequencing has become a state-of-the-art technique in genome assembly. Scaffolding is one of the main stages of the assembly pipeline. During this stage, contigs assembled from the paired-end reads are merged into bigger chains called scaffolds. Because of a high level of statistical noise, chimeric reads, and genome repeats the problem of scaffolding is a challenging task. Current scaffolding software packages widely vary in their quality and are highly dependent on the read data quality and genome complexity. There are no clear winners and multiple opportunities for further improvements of the tools still exist.
RESULTS: This article presents an efficient scaffolding algorithm ScaffMatch that is able to handle reads with both short (<600 bp) and long (>35 000 bp) insert sizes producing high-quality scaffolds. We evaluate our scaffolding tool with the F score and other metrics (N50, corrected N50) on eight datasets comparing it with the most available packages. Our experiments show that ScaffMatch is the tool of preference for the most datasets.
AVAILABILITY AND IMPLEMENTATION: The source code is available at http://alan.cs.gsu.edu/NGS/?q=content/scaffmatch. CONTACT: mandric@cs.gsu.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2015        PMID: 25890305     DOI: 10.1093/bioinformatics/btv211

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

1.  The combination of direct and paired link graphs can boost repetitive genome assembly.

Authors:  Wenyu Shi; Peifeng Ji; Fangqing Zhao
Journal:  Nucleic Acids Res       Date:  2017-04-07       Impact factor: 16.971

2.  Repeat-aware evaluation of scaffolding tools.

Authors:  Igor Mandric; Sergey Knyazev; Alex Zelikovsky
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

3.  Fast-SG: an alignment-free algorithm for hybrid assembly.

Authors:  Alex Di Genova; Gonzalo A Ruz; Marie-France Sagot; Alejandro Maass
Journal:  Gigascience       Date:  2018-05-01       Impact factor: 6.524

4.  Deciphering the evolutionary signatures of pinnipeds using novel genome sequences: The first genomes of Phoca largha, Callorhinus ursinus, and Eumetopias jubatus.

Authors:  Jung Youn Park; Kwondo Kim; Hawsun Sohn; Hyun Woo Kim; Yong-Rock An; Jung-Ha Kang; Eun-Mi Kim; Woori Kwak; Chul Lee; DongAhn Yoo; Jaehoon Jung; Samsun Sung; Joon Yoon; Heebal Kim
Journal:  Sci Rep       Date:  2018-11-15       Impact factor: 4.379

5.  Multi-CSAR: a multiple reference-based contig scaffolder using algebraic rearrangements.

Authors:  Kun-Tze Chen; Hsin-Ting Shen; Chin Lung Lu
Journal:  BMC Syst Biol       Date:  2018-12-31

6.  Single molecule sequencing-guided scaffolding and correction of draft assemblies.

Authors:  Shenglong Zhu; Danny Z Chen; Scott J Emrich
Journal:  BMC Genomics       Date:  2017-12-06       Impact factor: 3.969

7.  CAMSA: a tool for comparative analysis and merging of scaffold assemblies.

Authors:  Sergey S Aganezov; Max A Alekseyev
Journal:  BMC Bioinformatics       Date:  2017-12-06       Impact factor: 3.169

8.  Phylogenetic signal from rearrangements in 18 Anopheles species by joint scaffolding extant and ancestral genomes.

Authors:  Yoann Anselmetti; Wandrille Duchemin; Eric Tannier; Cedric Chauve; Sèverine Bérard
Journal:  BMC Genomics       Date:  2018-05-09       Impact factor: 3.969

9.  The First Highly Contiguous Genome Assembly of Pikeperch (Sander lucioperca), an Emerging Aquaculture Species in Europe.

Authors:  Julien Alban Nguinkal; Ronald Marco Brunner; Marieke Verleih; Alexander Rebl; Lidia de Los Ríos-Pérez; Nadine Schäfer; Frieder Hadlich; Marcus Stüeken; Dörte Wittenburg; Tom Goldammer
Journal:  Genes (Basel)       Date:  2019-09-13       Impact factor: 4.096

10.  SLR: a scaffolding algorithm based on long reads and contig classification.

Authors:  Junwei Luo; Mengna Lyu; Ranran Chen; Xiaohong Zhang; Huimin Luo; Chaokun Yan
Journal:  BMC Bioinformatics       Date:  2019-10-30       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.