Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Computational Performance Assessment of k-mer Counting Algorithms.

Literature DB >> 26982880

Computational Performance Assessment of k-mer Counting Algorithms.

Nelson Pérez¹, Miguel Gutierrez¹, Nelson Vera¹.

Abstract

This article is about the assessment of several tools for k-mer counting, with the purpose to create a reference framework for bioinformatics researchers to identify computational requirements, parallelizing, advantages, disadvantages, and bottlenecks of each of the algorithms proposed in the tools. The k-mer counters evaluated in this article were BFCounter, DSK, Jellyfish, KAnalyze, KHMer, KMC2, MSPKmerCounter, Tallymer, and Turtle. Measured parameters were the following: RAM occupied space, processing time, parallelization, and read and write disk access. A dataset consisting of 36,504,800 reads was used corresponding to the 14th human chromosome. The assessment was performed for two k-mer lengths: 31 and 55. Obtained results were the following: pure Bloom filter-based tools and disk-partitioning techniques showed a lesser RAM use. The tools that took less execution time were the ones that used disk-partitioning techniques. The techniques that made the major parallelization were the ones that used disk partitioning, hash tables with lock-free approach, or multiple hash tables.

Entities: Species

Keywords: computational performance assessment; data structures; k-mer counters

Mesh：

Year: 2016 PMID： 26982880 DOI： 10.1089/cmb.2015.0199

Source DB: PubMed Journal: J Comput Biol ISSN： 1066-5277 Impact factor: 1.479

Keyword Cloud
Cited

6 in total

1. deBWT: parallel construction of Burrows-Wheeler Transform for large collection of genomes with de Bruijn-branch encoding.

Authors: Bo Liu; Dixian Zhu; Yadong Wang
Journal: Bioinformatics Date: 2016-06-15 Impact factor: 6.937

2. De Novo Assembly of Complete Chloroplast Genomes from Non-model Species Based on a K-mer Frequency-Based Selection of Chloroplast Reads from Total DNA Sequences.

Authors: Shairul Izan; Danny Esselink; Richard G F Visser; Marinus J M Smulders; Theo Borm
Journal: Front Plant Sci Date: 2017-08-02 Impact factor: 5.753

Computational Performance Assessment of k-mer Counting Algorithms.

1. deBWT: parallel construction of Burrows-Wheeler Transform for large collection of genomes with de Bruijn-branch encoding.

2. De Novo Assembly of Complete Chloroplast Genomes from Non-model Species Based on a K-mer Frequency-Based Selection of Chloroplast Reads from Total DNA Sequences.

3. Gerbil: a fast and memory-efficient k-mer counter with GPU-support.

4. Estimating the k-mer Coverage Frequencies in Genomic Datasets: A Comparative Assessment of the State-of-the-art.

5. Chromosome-scale assembly and high-density genetic map of the yellow drum, Nibea albiflora.

6. A benchmark study of k-mer counting methods for high-throughput sequencing.