Literature DB >> 21385040

naiveBayesCall: an efficient model-based base-calling algorithm for high-throughput sequencing.

Wei-Chun Kao1, Yun S Song.   

Abstract

Immense amounts of raw instrument data (i.e., images of fluorescence) are currently being generated using ultra high-throughput sequencing platforms. An important computational challenge associated with this rapid advancement is to develop efficient algorithms that can extract accurate sequence information from raw data. To address this challenge, we recently introduced a novel model-based base-calling algorithm that is fully parametric and has several advantages over previously proposed methods. Our original algorithm, called BayesCall, significantly reduced the error rate, particularly in the later cycles of a sequencing run, and also produced useful base-specific quality scores with a high discrimination ability. Unfortunately, however, BayesCall is too computationally expensive to be of broad practical use. In this article, we build on our previous model-based approach to devise an efficient base-calling algorithm that is orders of magnitude faster than BayesCall, while still maintaining a comparably high level of accuracy. Our new algorithm is called naive-BayesCall, and it utilizes approximation and optimization methods to achieve scalability. We describe the performance of naiveBayesCall and demonstrate how improved base-calling accuracy may facilitate de novo assembly and SNP detection when the sequence coverage depth is low to moderate.

Mesh:

Substances:

Year:  2011        PMID: 21385040     DOI: 10.1089/cmb.2010.0247

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  7 in total

Review 1.  A beginners guide to SNP calling from high-throughput DNA-sequencing data.

Authors:  André Altmann; Peter Weber; Daniel Bader; Michael Preuss; Elisabeth B Binder; Bertram Müller-Myhsok
Journal:  Hum Genet       Date:  2012-08-11       Impact factor: 4.132

2.  Single Nucleotide Polymorphism (SNP) Detection and Genotype Calling from Massively Parallel Sequencing (MPS) Data.

Authors:  Yun Li; Wei Chen; Eric Yi Liu; Yi-Hui Zhou
Journal:  Stat Biosci       Date:  2013-05

3.  ParticleCall: a particle filter for base calling in next-generation sequencing systems.

Authors:  Xiaohu Shen; Haris Vikalo
Journal:  BMC Bioinformatics       Date:  2012-07-09       Impact factor: 3.169

4.  BlindCall: ultra-fast base-calling of high-throughput sequencing data by blind deconvolution.

Authors:  Chengxi Ye; Chiaowen Hsiao; Héctor Corrada Bravo
Journal:  Bioinformatics       Date:  2014-01-09       Impact factor: 6.937

5.  Improvement in detection of minor alleles in next generation sequencing by base quality recalibration.

Authors:  Shengyu Ni; Mark Stoneking
Journal:  BMC Genomics       Date:  2016-02-27       Impact factor: 3.969

6.  Pan-cancer analysis of systematic batch effects on somatic sequence variations.

Authors:  Ji-Hye Choi; Seong-Eui Hong; Hyun Goo Woo
Journal:  BMC Bioinformatics       Date:  2017-04-11       Impact factor: 3.169

7.  High-resolution microbial community reconstruction by integrating short reads from multiple 16S rRNA regions.

Authors:  Amnon Amir; Amit Zeisel; Or Zuk; Michael Elgart; Shay Stern; Ohad Shamir; Peter J Turnbaugh; Yoav Soen; Noam Shental
Journal:  Nucleic Acids Res       Date:  2013-11-07       Impact factor: 16.971

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.