Literature DB >> 19542152

SHREC: a short-read error correction method.

Jan Schröder1, Heiko Schröder, Simon J Puglisi, Ranjan Sinha, Bertil Schmidt.   

Abstract

MOTIVATION: Second-generation sequencing technologies produce a massive amount of short reads in a single experiment. However, sequencing errors can cause major problems when using this approach for de novo sequencing applications. Moreover, existing error correction methods have been designed and optimized for shotgun sequencing. Therefore, there is an urgent need for the design of fast and accurate computational methods and tools for error correction of large amounts of short read data.
RESULTS: We present SHREC, a new algorithm for correcting errors in short-read data that uses a generalized suffix trie on the read data as the underlying data structure. Our results show that the method can identify erroneous reads with sensitivity and specificity of over 99% and 96% for simulated data with error rates of up to 3% as well as for real data. Furthermore, it achieves an error correction accuracy of over 80% for simulated data and over 88% for real data. These results are clearly superior to previously published approaches. SHREC is available as an efficient open-source Java implementation that allows processing of 10 million of short reads on a standard workstation.

Mesh:

Substances:

Year:  2009        PMID: 19542152     DOI: 10.1093/bioinformatics/btp379

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  47 in total

1.  Fulcrum: condensing redundant reads from high-throughput sequencing studies.

Authors:  Matthew S Burriesci; Erik M Lehnert; John R Pringle
Journal:  Bioinformatics       Date:  2012-03-13       Impact factor: 6.937

Review 2.  Three-stage quality control strategies for DNA re-sequencing data.

Authors:  Yan Guo; Fei Ye; Quanghu Sheng; Travis Clark; David C Samuels
Journal:  Brief Bioinform       Date:  2013-09-24       Impact factor: 11.622

3.  ECHO: a reference-free short-read error correction algorithm.

Authors:  Wei-Chun Kao; Andrew H Chan; Yun S Song
Journal:  Genome Res       Date:  2011-04-11       Impact factor: 9.043

Review 4.  From next-generation resequencing reads to a high-quality variant data set.

Authors:  S P Pfeifer
Journal:  Heredity (Edinb)       Date:  2016-10-19       Impact factor: 3.821

5.  Reference-free validation of short read data.

Authors:  Jan Schröder; James Bailey; Thomas Conway; Justin Zobel
Journal:  PLoS One       Date:  2010-09-22       Impact factor: 3.240

6.  Pluribus-Exploring the Limits of Error Correction Using a Suffix Tree.

Authors:  Daniel Savel; Thomas LaFramboise; Ananth Grama; Mehmet Koyuturk
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2016-06-29       Impact factor: 3.710

7.  Slim-filter: an interactive Windows-based application for illumina genome analyzer data assessment and manipulation.

Authors:  Georgiy Golovko; Kamil Khanipov; Mark Rojas; Antonio Martinez-Alcántara; Jesse J Howard; Efren Ballesteros; Sharu Gupta; William Widger; Yuriy Fofanov
Journal:  BMC Bioinformatics       Date:  2012-07-16       Impact factor: 3.169

Review 8.  Prospects and limitations of full-text index structures in genome analysis.

Authors:  Michaël Vyverman; Bernard De Baets; Veerle Fack; Peter Dawyndt
Journal:  Nucleic Acids Res       Date:  2012-05-13       Impact factor: 16.971

9.  Benchmarking short sequence mapping tools.

Authors:  Ayat Hatem; Doruk Bozdağ; Amanda E Toland; Ümit V Çatalyürek
Journal:  BMC Bioinformatics       Date:  2013-06-07       Impact factor: 3.169

10.  Probabilistic error correction for RNA sequencing.

Authors:  Hai-Son Le; Marcel H Schulz; Brenna M McCauley; Veronica F Hinman; Ziv Bar-Joseph
Journal:  Nucleic Acids Res       Date:  2013-04-04       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.