| Literature DB >> 35255960 |
Nelina Angelova1, Theodoros Danis1,2, Jacques Lagnel3, Costas S Tsigenopoulos1, Tereza Manousaki4.
Abstract
OBJECTIVE: The rapid progress in sequencing technology and related bioinformatics tools aims at disentangling diversity and conservation issues through genome analyses. The foremost challenges of the field involve coping with questions emerging from the swift development and application of new algorithms, as well as the establishment of standardized analysis approaches that promote transparency and transferability in research.Entities:
Keywords: Assembly; Container; Genome; Pipeline; de-novo
Mesh:
Year: 2022 PMID: 35255960 PMCID: PMC8900408 DOI: 10.1186/s13104-022-05978-5
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Fig. 1The workflow of SnakeCube and its sub-containers. Each box represents the different images available. A and B represent the quality checking steps for short or/and long reads. B and C serve users with only long reads. D combines them all, forming SnakeCube
SnakeCube’s performance
| Dataset | Raw-data size | Genome size estimation | Busco C, % | Published Busco, C% | Serial run time | Parallel execution time | Time Saved, % |
|---|---|---|---|---|---|---|---|
| 8.3 Gb MinION, 18.8 Gb Illumina paired-end | 219 Mb | 98.2 | 98.3 | 39.04 h | 35h 26m | 10 | |
| 9.68 GB MinION, 57,3 Gb Illumina paired-end | 373 Mb | 96.7 | 96.2 | 31.13 h | 27h 29m | 13 | |
116.5 Gb MinION, 54.6 Illumina paired-end | 1.23 Gb | 94.7 | 97.7 | 216.73 h | 172h 47m | 20.5 |
Fig. 2The benchmarking and optimization of SnakeCube based on the L. sceleratus dataset. a Reports of average memory and load monitoring records of three serial runs, with each point representing a rule of the container. b The rules were further independently monitored for their time-scaling efficiency when run multiple times with an increasing thread allowance. The memory properties are reported as in megabytes and only the highest value at any point is recorded. Time is measured in seconds. Rules are presented in the down-right side with their order of appearance