| Literature DB >> 27536334 |
Toshiki Takeuchi1, Atsuo Yamada1, Takashi Aoki1, Kunihiro Nishimura1.
Abstract
BACKGROUND: Next-generation sequencing can determine DNA bases and the results of sequence alignments are generally stored in files in the Sequence Alignment/Map (SAM) format and the compressed binary version (BAM) of it. SAMtools is a typical tool for dealing with files in the SAM/BAM format. SAMtools has various functions, including detection of variants, visualization of alignments, indexing, extraction of parts of the data and loci, and conversion of file formats. It is written in C and can execute fast. However, SAMtools requires an additional implementation to be used in parallel with, for example, OpenMP (Open Multi-Processing) libraries. For the accumulation of next-generation sequencing data, a simple parallelization program, which can support cloud and PC cluster environments, is required.Entities:
Keywords: BAM; Clojure; DNA; Next-generation sequencing; Parallel processing; SAM
Year: 2016 PMID: 27536334 PMCID: PMC4987990 DOI: 10.1186/s13029-016-0058-6
Source DB: PubMed Journal: Source Code Biol Med ISSN: 1751-0473
Fig. 1Execution time of indexing. The green dashed line indicates SAMtools and the red dashed line indicates Picard under single thread conditions because they cannot be run using multithreaded processing. The error bar shows the standard deviation of the result
Fig. 2Execution time of pileup. The green dashed line indicates SAMtools under a single thread condition because it cannot be run using multithreaded processing. The error bar shows the standard deviation of the result
Measurement of LOC
| Language | Files | Blank | Comment | Code |
|---|---|---|---|---|
| cljam (Clojure) | 46 | 466 | 165 | 3264 |
| SAMtools (C/C++) | 53 | 1606 | 2403 | 11619 |
| Picard (Java) | 290 | 6409 | 11835 | 28322 |