Literature DB >> 24451628

BLESS: bloom filter-based error correction solution for high-throughput sequencing reads.

Yun Heo1, Xiao-Long Wu, Deming Chen, Jian Ma, Wen-Mei Hwu.   

Abstract

MOTIVATION: Rapid advances in next-generation sequencing (NGS) technology have led to exponential increase in the amount of genomic information. However, NGS reads contain far more errors than data from traditional sequencing methods, and downstream genomic analysis results can be improved by correcting the errors. Unfortunately, all the previous error correction methods required a large amount of memory, making it unsuitable to process reads from large genomes with commodity computers.
RESULTS: We present a novel algorithm that produces accurate correction results with much less memory compared with previous solutions. The algorithm, named BLoom-filter-based Error correction Solution for high-throughput Sequencing reads (BLESS), uses a single minimum-sized Bloom filter, and is also able to tolerate a higher false-positive rate, thus allowing us to correct errors with a 40× memory usage reduction on average compared with previous methods. Meanwhile, BLESS can extend reads like DNA assemblers to correct errors at the end of reads. Evaluations using real and simulated reads showed that BLESS could generate more accurate results than existing solutions. After errors were corrected using BLESS, 69% of initially unaligned reads could be aligned correctly. Additionally, de novo assembly results became 50% longer with 66% fewer assembly errors.
AVAILABILITY AND IMPLEMENTATION: Freely available at http://sourceforge.net/p/bless-ec CONTACT: dchen@illinois.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Year:  2014        PMID: 24451628      PMCID: PMC6365934          DOI: 10.1093/bioinformatics/btu030

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  34 in total

1.  BFC: correcting Illumina sequencing errors.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2015-05-06       Impact factor: 6.937

Review 2.  From next-generation resequencing reads to a high-quality variant data set.

Authors:  S P Pfeifer
Journal:  Heredity (Edinb)       Date:  2016-10-19       Impact factor: 3.821

3.  BLESS 2: accurate, memory-efficient and fast error correction method.

Authors:  Yun Heo; Anand Ramachandran; Wen-Mei Hwu; Jian Ma; Deming Chen
Journal:  Bioinformatics       Date:  2016-03-24       Impact factor: 6.937

4.  Improving Bloom Filter Performance on Sequence Data Using k-mer Bloom Filters.

Authors:  David Pellow; Darya Filippova; Carl Kingsford
Journal:  J Comput Biol       Date:  2016-11-09       Impact factor: 1.479

5.  Extensive gene tree discordance and hemiplasy shaped the genomes of North American columnar cacti.

Authors:  Dario Copetti; Alberto Búrquez; Enriquena Bustamante; Joseph L M Charboneau; Kevin L Childs; Luis E Eguiarte; Seunghee Lee; Tiffany L Liu; Michelle M McMahon; Noah K Whiteman; Rod A Wing; Martin F Wojciechowski; Michael J Sanderson
Journal:  Proc Natl Acad Sci U S A       Date:  2017-10-23       Impact factor: 11.205

6.  Aberration-corrected ultrafine analysis of miRNA reads at single-base resolution: a k-mer lattice approach.

Authors:  Xuan Zhang; Pengyao Ping; Gyorgy Hutvagner; Michael Blumenstein; Jinyan Li
Journal:  Nucleic Acids Res       Date:  2021-10-11       Impact factor: 16.971

7.  Indexing Arbitrary-Length k-Mers in Sequencing Reads.

Authors:  Tomasz Kowalski; Szymon Grabowski; Sebastian Deorowicz
Journal:  PLoS One       Date:  2015-07-16       Impact factor: 3.240

8.  VariantMetaCaller: automated fusion of variant calling pipelines for quantitative, precision-based filtering.

Authors:  András Gézsi; Bence Bolgár; Péter Marx; Peter Sarkozy; Csaba Szalai; Péter Antal
Journal:  BMC Genomics       Date:  2015-10-28       Impact factor: 3.969

9.  Pollux: platform independent error correction of single and mixed genomes.

Authors:  Eric Marinier; Daniel G Brown; Brendan J McConkey
Journal:  BMC Bioinformatics       Date:  2015-01-16       Impact factor: 3.169

Review 10.  Molecular Microbial Community Analysis as an Analysis Tool for Optimal Biogas Production.

Authors:  Seyedbehnam Hashemi; Sayed Ebrahim Hashemi; Kristian M Lien; Jacob J Lamb
Journal:  Microorganisms       Date:  2021-05-28
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.