| Literature DB >> 35347321 |
Marek Kokot1, Adam Gudyś1, Heng Li2,3, Sebastian Deorowicz4.
Abstract
The cost of maintaining exabytes of data produced by sequencing experiments every year has become a major issue in today's genomic research. In spite of the increasing popularity of third-generation sequencing, the existing algorithms for compressing long reads exhibit a minor advantage over the general-purpose gzip. We present CoLoRd, an algorithm able to reduce the size of third-generation sequencing data by an order of magnitude without affecting the accuracy of downstream analyses.Entities:
Mesh:
Year: 2022 PMID: 35347321 PMCID: PMC9337911 DOI: 10.1038/s41592-022-01432-3
Source DB: PubMed Journal: Nat Methods ISSN: 1548-7091 Impact factor: 47.990