
Cooler: scalable storage for Hi-C data and other genomically labeled arrays.

Nezar Abdennur, Leonid A Mirny.

Abstract

MOTIVATION: Most existing coverage-based (epi)genomic datasets are one-dimensional, but newer technologies probing interactions (physical, genetic, etc.) produce quantitative maps with two-dimensional genomic coordinate systems. Storage and computational costs mount sharply with data resolution when such maps are stored in dense form. Hence, there is a pressing need to develop data storage strategies that handle the full range of useful resolutions in multidimensional genomic datasets by taking advantage of their sparse nature, while supporting efficient compression and providing fast random access to facilitate development of scalable algorithms for data analysis.
RESULTS: We developed a file format called cooler, based on a sparse data model, that can support genomically labeled matrices at any resolution. It has the flexibility to accommodate various descriptions of the data axes (genomic coordinates, tracks and bin annotations), resolutions, data density patterns and metadata. Cooler is based on HDF5 and is supported by a Python library and command line suite to create, read, inspect and manipulate cooler data collections. The format has been adopted as a standard by the NIH 4D Nucleome Consortium.
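The sparse data model mentioned in the RESULTS can be illustrated with a minimal Python sketch. It is not the cooler library's API and does not use HDF5: the function names (`bin_offsets`, `build_pixels`, `fetch`), the toy chromosome sizes and the contact list are all hypothetical. Storing only one triangle of the symmetric matrix is likewise an assumption of this sketch. The idea shown is that only non-zero bin pairs are kept, so storage grows with the number of observed contacts rather than with the square of the number of bins.

```python
from collections import defaultdict


def bin_offsets(chromsizes, binsize):
    """Map each chromosome to the ID of its first bin; also return the bin total."""
    offsets, total = {}, 0
    for chrom, length in chromsizes.items():
        offsets[chrom] = total
        total += -(-length // binsize)  # ceiling division
    return offsets, total


def bin_id(offsets, binsize, chrom, pos):
    """Translate a genomic position into a genome-wide bin ID."""
    return offsets[chrom] + pos // binsize


def build_pixels(contacts, offsets, binsize):
    """Aggregate contacts into sparse pixels {(bin1, bin2): count}, bin1 <= bin2."""
    pixels = defaultdict(int)
    for (chrom1, pos1), (chrom2, pos2) in contacts:
        i = bin_id(offsets, binsize, chrom1, pos1)
        j = bin_id(offsets, binsize, chrom2, pos2)
        if i > j:
            i, j = j, i  # keep a single triangle of the symmetric matrix
        pixels[(i, j)] += 1
    return dict(pixels)


def fetch(pixels, lo, hi):
    """Rebuild the dense symmetric submatrix for bins lo .. hi-1."""
    n = hi - lo
    mat = [[0] * n for _ in range(n)]
    for (i, j), count in pixels.items():
        if lo <= i < hi and lo <= j < hi:
            mat[i - lo][j - lo] = count
            mat[j - lo][i - lo] = count
    return mat


# Toy genome (hypothetical sizes) binned at 250 bp: chr1 -> bins 0-3, chr2 -> bins 4-5.
chromsizes = {"chr1": 1000, "chr2": 500}
binsize = 250
offsets, n_bins = bin_offsets(chromsizes, binsize)

# Each contact is a pair of (chromosome, position) anchors.
contacts = [
    (("chr1", 10), ("chr1", 300)),
    (("chr1", 320), ("chr1", 20)),
    (("chr1", 900), ("chr2", 100)),
    (("chr2", 400), ("chr2", 450)),
]
pixels = build_pixels(contacts, offsets, binsize)
chr1_dense = fetch(pixels, 0, 4)  # dense 4x4 block for chr1 only
```

In this toy case the full matrix would hold 36 cells, while the sparse table holds three records. A dense block is materialised only for the region that is queried, which is the kind of random access the abstract describes.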
AVAILABILITY AND IMPLEMENTATION: Cooler is cross-platform, BSD-licensed and can be installed from the Python Package Index or the Bioconda repository. The source code is maintained on GitHub at https://github.com/mirnylab/cooler.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Year:  2020        PMID: 31290943      PMCID: PMC8205516          DOI: 10.1093/bioinformatics/btz540

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937

