Literature DB >> 23057825

SLIQ: simple linear inequalities for efficient contig scaffolding.

Rajat S Roy1, Kevin C Chen, Anirvan M Sengupta, Alexander Schliep.   

Abstract

Scaffolding is an important subproblem in de novo genome assembly, in which mate pair data are used to construct a linear sequence of contigs separated by gaps. Here we present SLIQ, a set of simple linear inequalities derived from the geometry of contigs on the line that can be used to predict the relative positions and orientations of contigs from individual mate pair reads and thus produce a contig digraph. The SLIQ inequalities can also filter out unreliable mate pairs and can be used as a preprocessing step for any scaffolding algorithm. We tested the SLIQ inequalities on five real data sets ranging in complexity from simple bacterial genomes to complex mammalian genomes and compared the results to the majority voting procedure used by many other scaffolding algorithms. SLIQ predicted the relative positions and orientations of the contigs with high accuracy in all cases and gave more accurate position predictions than majority voting for complex genomes, in particular the human genome. Finally, we present a simple scaffolding algorithm that produces linear scaffolds given a contig digraph. We show that our algorithm is very efficient compared to other scaffolding algorithms while maintaining high accuracy in predicting both contig positions and orientations for real data sets.

Entities:  

Mesh:

Year:  2012        PMID: 23057825     DOI: 10.1089/cmb.2011.0263

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  3 in total

1.  BESST--efficient scaffolding of large fragmented assemblies.

Authors:  Kristoffer Sahlin; Francesco Vezzi; Björn Nystedt; Joakim Lundeberg; Lars Arvestad
Journal:  BMC Bioinformatics       Date:  2014-08-15       Impact factor: 3.169

2.  ILP-based maximum likelihood genome scaffolding.

Authors:  James Lindsay; Hamed Salooti; Ion Măndoiu; Alex Zelikovsky
Journal:  BMC Bioinformatics       Date:  2014-09-10       Impact factor: 3.169

3.  Approaches for in silico finishing of microbial genome sequences.

Authors:  Frederico Schmitt Kremer; Alan John Alexander McBride; Luciano da Silva Pinto
Journal:  Genet Mol Biol       Date:  2017 Jul-Sep 01       Impact factor: 1.771

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.