
Gap Filling as Exact Path Length Problem.

Leena Salmela, Kristoffer Sahlin, Veli Mäkinen, Alexandru I Tomescu.

Abstract

One of the last steps in a genome assembly project is filling the gaps between consecutive contigs in the scaffolds. This problem can be naturally stated as finding an s-t path in a directed graph whose sum of arc costs belongs to a given range (the estimate on the gap length). Here s and t are any two contigs flanking a gap. This problem is known to be NP-hard in general. Here we derive a dynamic programming solution that is simpler than those already known and is pseudo-polynomial in the maximum value of the input range. We implemented various practical optimizations for it, and compared our exact gap-filling solution experimentally to popular gap-filling tools. Summing over all the bacterial assemblies considered in our experiments, we can in total fill 76% more gaps than the best previous tool, and the gaps filled by our method span 136% more sequence. Furthermore, the error level of the newly introduced sequence is comparable to that of the previous tools. The experiments also show that our exact approach does not easily scale to larger genomes, where the problem is in general difficult for all tools.
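The abstract does not state the recurrence, so the following is only a minimal sketch of a textbook dynamic program for this kind of problem, not the authors' implementation. It assumes positive integer arc costs and allows walks in which vertices may repeat. The function name `find_path_in_length_range` and the `(u, v, cost)` arc format are hypothetical. For each exact length up to the upper bound `hi`, it records which vertices are reachable from s by a walk of that total cost, which takes time roughly proportional to `hi` times the number of arcs, i.e. pseudo-polynomial in the maximum value of the range.

```python
def find_path_in_length_range(arcs, s, t, lo, hi):
    """Return (length, path) for an s-t walk with lo <= length <= hi, or None.

    arcs: iterable of (u, v, cost) triples; cost must be a positive integer
    (an assumption of this sketch, needed so lengths can be processed in order).
    """
    out = {}
    for u, v, cost in arcs:
        if cost <= 0:
            raise ValueError("arc costs must be positive integers")
        out.setdefault(u, []).append((v, cost))

    # pred[l][v] holds one arc (u, cost) ending a walk from s to v of total
    # cost exactly l; the start state (length 0 at s) has no predecessor.
    pred = [dict() for _ in range(hi + 1)]
    pred[0][s] = None
    for length in range(hi + 1):
        # All costs are positive, so pred[length] is complete at this point.
        for u in list(pred[length]):
            for v, cost in out.get(u, ()):
                nxt = length + cost
                if nxt <= hi and v not in pred[nxt]:
                    pred[nxt][v] = (u, cost)

    # Report the shortest admissible length and backtrack one witness walk.
    for length in range(max(lo, 0), hi + 1):
        if t in pred[length]:
            path, v, l = [t], t, length
            while pred[l][v] is not None:
                u, cost = pred[l][v]
                path.append(u)
                v, l = u, l - cost
            path.reverse()
            return length, path
    return None


# Toy graph: s->a->t costs 5, s->a->b->t costs 4, s->t costs 10.
toy_arcs = [("s", "a", 2), ("a", "t", 3), ("s", "t", 10),
            ("a", "b", 1), ("b", "t", 1)]
print(find_path_in_length_range(toy_arcs, "s", "t", 4, 4))
print(find_path_in_length_range(toy_arcs, "s", "t", 6, 9))
```

In a gap-filling setting, s and t would stand for the contigs flanking a gap and `[lo, hi]` for the estimated gap length; how arc costs are derived from the reads is not specified in the abstract and is left open here.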

Keywords:  de novo assembly; dynamic programming; gap filling; graph algorithms


Year:  2016        PMID: 26959081     DOI: 10.1089/cmb.2015.0197

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  5 in total

1.  Cost-effective high-throughput single-haplotype iterative mapping and sequencing for complex genomic structures.

Authors:  Daniel W Bellott; Ting-Jan Cho; Jennifer F Hughes; Helen Skaletsky; David C Page
Journal:  Nat Protoc       Date:  2018-03-22       Impact factor: 13.491

2.  Variant genotyping with gap filling.

Authors:  Riku Walve; Leena Salmela; Veli Mäkinen
Journal:  PLoS One       Date:  2017-09-08       Impact factor: 3.240

3.  Comparative scaffolding and gap filling of ancient bacterial genomes applied to two ancient Yersinia pestis genomes.

Authors:  Nina Luhmann; Daniel Doerr; Cedric Chauve
Journal:  Microb Genom       Date:  2017-07-08

4.  gapFinisher: A reliable gap filling pipeline for SSPACE-LongRead scaffolder output.

Authors:  Juhana I Kammonen; Olli-Pekka Smolander; Lars Paulin; Pedro A B Pereira; Pia Laine; Patrik Koskinen; Jukka Jernvall; Petri Auvinen
Journal:  PLoS One       Date:  2019-09-09       Impact factor: 3.240

5.  Validation of Variant Assembly Using HAPHPIPE with Next-Generation Sequence Data from Viruses.

Authors:  Keylie M Gibson; Margaret C Steiner; Uzma Rentia; Matthew L Bendall; Marcos Pérez-Losada; Keith A Crandall
Journal:  Viruses       Date:  2020-07-14       Impact factor: 5.048

