Literature DB >> 27362987

Pluribus-Exploring the Limits of Error Correction Using a Suffix Tree.

Daniel Savel, Thomas LaFramboise, Ananth Grama, Mehmet Koyuturk.   

Abstract

Next generation sequencing technologies enable efficient and cost-effective genome sequencing. However, sequencing errors increase the complexity of the de novo assembly process, and reduce the quality of the assembled sequences. Many error correction techniques utilizing substring frequencies have been developed to mitigate this effect. In this paper, we present a novel and effective method called Pluribus, for correcting sequencing errors using a generalized suffix trie. Pluribus utilizes multiple manifestations of an error in the trie to accurately identify errors and suggest corrections. We show that Pluribus produces the least number of false positives across a diverse set of real sequencing datasets when compared to other methods. Furthermore, Pluribus can be used in conjunction with other contemporary error correction methods to achieve higher levels of accuracy than either tool alone. These increases in error correction accuracy are also realized in the quality of the contigs that are generated during assembly. We explore, in-depth, the behavior of Pluribus , to explain the observed improvement in accuracy and assembly performance. Pluribus is freely available at http://compbio. CASE: edu/pluribus/.

Entities:  

Mesh:

Year:  2016        PMID: 27362987      PMCID: PMC5754272          DOI: 10.1109/TCBB.2016.2586060

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  17 in total

1.  Reptile: representative tiling for short read error correction.

Authors:  Xiao Yang; Karin S Dorman; Srinivas Aluru
Journal:  Bioinformatics       Date:  2010-08-16       Impact factor: 6.937

2.  Correction of sequencing errors in a mixed set of reads.

Authors:  Leena Salmela
Journal:  Bioinformatics       Date:  2010-04-08       Impact factor: 6.937

3.  HiTEC: accurate error correction in high-throughput sequencing data.

Authors:  Lucian Ilie; Farideh Fazayeli; Silvana Ilie
Journal:  Bioinformatics       Date:  2010-11-26       Impact factor: 6.937

4.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

5.  RACER: Rapid and accurate correction of errors in reads.

Authors:  Lucian Ilie; Michael Molnar
Journal:  Bioinformatics       Date:  2013-07-12       Impact factor: 6.937

6.  SHREC: a short-read error correction method.

Authors:  Jan Schröder; Heiko Schröder; Simon J Puglisi; Ranjan Sinha; Bertil Schmidt
Journal:  Bioinformatics       Date:  2009-06-19       Impact factor: 6.937

7.  Correcting errors in short reads by multiple alignments.

Authors:  Leena Salmela; Jan Schröder
Journal:  Bioinformatics       Date:  2011-04-05       Impact factor: 6.937

8.  Molecular cloning and complete nucleotide sequence of the attenuated rabies virus SAD B19.

Authors:  K K Conzelmann; J H Cox; L G Schneider; H J Thiel
Journal:  Virology       Date:  1990-04       Impact factor: 3.616

9.  Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species.

Authors:  Keith R Bradnam; Joseph N Fass; Anton Alexandrov; Paul Baranay; Michael Bechner; Inanç Birol; Sébastien Boisvert; Jarrod A Chapman; Guillaume Chapuis; Rayan Chikhi; Hamidreza Chitsaz; Wen-Chi Chou; Jacques Corbeil; Cristian Del Fabbro; T Roderick Docking; Richard Durbin; Dent Earl; Scott Emrich; Pavel Fedotov; Nuno A Fonseca; Ganeshkumar Ganapathy; Richard A Gibbs; Sante Gnerre; Elénie Godzaridis; Steve Goldstein; Matthias Haimel; Giles Hall; David Haussler; Joseph B Hiatt; Isaac Y Ho; Jason Howard; Martin Hunt; Shaun D Jackman; David B Jaffe; Erich D Jarvis; Huaiyang Jiang; Sergey Kazakov; Paul J Kersey; Jacob O Kitzman; James R Knight; Sergey Koren; Tak-Wah Lam; Dominique Lavenier; François Laviolette; Yingrui Li; Zhenyu Li; Binghang Liu; Yue Liu; Ruibang Luo; Iain Maccallum; Matthew D Macmanes; Nicolas Maillet; Sergey Melnikov; Delphine Naquin; Zemin Ning; Thomas D Otto; Benedict Paten; Octávio S Paulo; Adam M Phillippy; Francisco Pina-Martins; Michael Place; Dariusz Przybylski; Xiang Qin; Carson Qu; Filipe J Ribeiro; Stephen Richards; Daniel S Rokhsar; J Graham Ruby; Simone Scalabrin; Michael C Schatz; David C Schwartz; Alexey Sergushichev; Ted Sharpe; Timothy I Shaw; Jay Shendure; Yujian Shi; Jared T Simpson; Henry Song; Fedor Tsarev; Francesco Vezzi; Riccardo Vicedomini; Bruno M Vieira; Jun Wang; Kim C Worley; Shuangye Yin; Siu-Ming Yiu; Jianying Yuan; Guojie Zhang; Hao Zhang; Shiguo Zhou; Ian F Korf
Journal:  Gigascience       Date:  2013-07-22       Impact factor: 6.524

10.  Quake: quality-aware detection and correction of sequencing errors.

Authors:  David R Kelley; Michael C Schatz; Steven L Salzberg
Journal:  Genome Biol       Date:  2010-11-29       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.