Literature DB >> 26982880

Computational Performance Assessment of k-mer Counting Algorithms.

Nelson Pérez1, Miguel Gutierrez1, Nelson Vera1.   

Abstract

This article is about the assessment of several tools for k-mer counting, with the purpose to create a reference framework for bioinformatics researchers to identify computational requirements, parallelizing, advantages, disadvantages, and bottlenecks of each of the algorithms proposed in the tools. The k-mer counters evaluated in this article were BFCounter, DSK, Jellyfish, KAnalyze, KHMer, KMC2, MSPKmerCounter, Tallymer, and Turtle. Measured parameters were the following: RAM occupied space, processing time, parallelization, and read and write disk access. A dataset consisting of 36,504,800 reads was used corresponding to the 14th human chromosome. The assessment was performed for two k-mer lengths: 31 and 55. Obtained results were the following: pure Bloom filter-based tools and disk-partitioning techniques showed a lesser RAM use. The tools that took less execution time were the ones that used disk-partitioning techniques. The techniques that made the major parallelization were the ones that used disk partitioning, hash tables with lock-free approach, or multiple hash tables.

Entities:  

Keywords:  computational performance assessment; data structures; k-mer counters

Mesh:

Year:  2016        PMID: 26982880     DOI: 10.1089/cmb.2015.0199

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  6 in total

1.  deBWT: parallel construction of Burrows-Wheeler Transform for large collection of genomes with de Bruijn-branch encoding.

Authors:  Bo Liu; Dixian Zhu; Yadong Wang
Journal:  Bioinformatics       Date:  2016-06-15       Impact factor: 6.937

2.  De Novo Assembly of Complete Chloroplast Genomes from Non-model Species Based on a K-mer Frequency-Based Selection of Chloroplast Reads from Total DNA Sequences.

Authors:  Shairul Izan; Danny Esselink; Richard G F Visser; Marinus J M Smulders; Theo Borm
Journal:  Front Plant Sci       Date:  2017-08-02       Impact factor: 5.753

3.  Gerbil: a fast and memory-efficient k-mer counter with GPU-support.

Authors:  Marius Erbert; Steffen Rechner; Matthias Müller-Hannemann
Journal:  Algorithms Mol Biol       Date:  2017-03-31       Impact factor: 1.405

4.  Estimating the k-mer Coverage Frequencies in Genomic Datasets: A Comparative Assessment of the State-of-the-art.

Authors:  Swati C Manekar; Shailesh R Sathe
Journal:  Curr Genomics       Date:  2019-01       Impact factor: 2.236

5.  Chromosome-scale assembly and high-density genetic map of the yellow drum, Nibea albiflora.

Authors:  Dongdong Xu; Wanchang Zhang; Ruiyi Chen; Hongbin Song; Lu Tian; Peng Tan; Ligai Wang; Qihui Zhu; Bin Wu; Bao Lou; Jiumeng Min; Juhong Zhou
Journal:  Sci Data       Date:  2021-10-15       Impact factor: 6.444

6.  A benchmark study of k-mer counting methods for high-throughput sequencing.

Authors:  Swati C Manekar; Shailesh R Sathe
Journal:  Gigascience       Date:  2018-12-01       Impact factor: 6.524

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.