| Literature DB >> 35413804 |
Kristyna Kupkova1,2, Jose Verdezoto Mosquera1,2, Jason P Smith1,2, Michał Stolarczyk1, Tessa L Danehy1, John T Lawson1,3, Bingjie Xue1,3, John T Stubbs1,2, Nathan LeRoy1,3, Nathan C Sheffield4,5,6,7.
Abstract
BACKGROUND: Epigenome analysis relies on defined sets of genomic regions output by widely used assays such as ChIP-seq and ATAC-seq. Statistical analysis and visualization of genomic region sets is essential to answer biological questions in gene regulation. As the epigenomics community continues generating data, there will be an increasing need for software tools that can efficiently deal with more abundant and larger genomic region sets. Here, we introduce GenomicDistributions, an R package for fast and easy summarization and visualization of genomic region data.Entities:
Keywords: Bioconductor; Data visualization; Genomic regions; R package; Region set summary
Mesh:
Year: 2022 PMID: 35413804 PMCID: PMC9003978 DOI: 10.1186/s12864-022-08467-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Overview and advantages of GenomicDistributions. A List of key design principles and advantages offered by GenomicDistributions. B GenomicDistributions functions are designed to process one or multiple genomic region sets at once. Plotting is separated from summary statistics calculation. Grey indicates that users may develop their own plots of summary statistics generated by calc functions, and edit ggplot objects from plot functions
Fig. 2Example plots produced by GenomicDistributions. A Signal summary used for cell-type specificity of chromatin accessibility plot. Bars represent median values across regions. B Distribution of regions over chromosomes. C Distances between neighbor regions. D GC content in query regions. The dashed line indicates medians. E Partition distribution plot. F Distances to TSS