Literature DB >> 24867943

CWig: compressed representation of Wiggle/BedGraph format.

Do Huy Hoang1, Wing-Kin Sung2.   

Abstract

MOTIVATION: BigWig, a format to represent read density data, is one of the most popular data types. They can represent the peak intensity in ChIP-seq, the transcript expression in RNA-seq, the copy number variation in whole genome sequencing, etc. UCSC Encode project uses the bigWig format heavily for storage and visualization. Of 5.2 TB Encode hg19 database, 1.6 TB (31% of the total space) is used to store bigWig files. BigWig format not only saves a lot of space but also supports fast queries that are crucial for interactive analysis and browsing. In our benchmark, bigWig often has similar size to the gzipped raw data, while is still able to support ∼ 5000 random queries per second.
RESULTS: Although bigWig is good enough at the moment, both storage space and query time are expected to become limited when sequencing gets cheaper. This article describes a new method to store density data named CWig. The format uses on average one-third of the size of existing bigWig files and improves random query speed up to 100 times.
AVAILABILITY AND IMPLEMENTATION: http://genome.ddns.comp.nus.edu.sg/∼cwig.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2014        PMID: 24867943     DOI: 10.1093/bioinformatics/btu330

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  2 in total

1.  smallWig: parallel compression of RNA-seq WIG files.

Authors:  Zhiying Wang; Tsachy Weissman; Olgica Milenkovic
Journal:  Bioinformatics       Date:  2015-09-30       Impact factor: 6.937

2.  ChIPWig: a random access-enabling lossless and lossy compression method for ChIP-seq data.

Authors:  Vida Ravanmehr; Minji Kim; Zhiying Wang; Olgica Milenkovic
Journal:  Bioinformatics       Date:  2018-03-15       Impact factor: 6.937

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.