| Literature DB >> 32321568 |
Lorinc S Pongor1, Jacob M Gross2, Roberto Vera Alvarez3, Junko Murai2, Sang-Min Jang2, Hongliang Zhang2, Christophe Redon2, Haiqing Fu2, Shar-Yin Huang2, Bhushan Thakur2, Adrian Baris2, Leonardo Marino-Ramirez3, David Landsman3, Mirit I Aladjem4, Yves Pommier5.
Abstract
BACKGROUND: Next-generation sequencing allows genome-wide analysis of changes in chromatin states and gene expression. Data analysis of these increasingly used methods either requires multiple analysis steps, or extensive computational time. We sought to develop a tool for rapid quantification of sequencing peaks from diverse experimental sources and an efficient method to produce coverage tracks for accurate visualization that can be intuitively displayed and interpreted by experimentalists with minimal bioinformatics background. We demonstrate its strength and usability by integrating data from several types of sequencing approaches.Entities:
Keywords: ATAC-seq; ChIP-seq; Expression; Histone modifications; NS-seq; RNA-seq; Replication origins; Replication timing; SLFN11
Mesh:
Substances:
Year: 2020 PMID: 32321568 PMCID: PMC7175505 DOI: 10.1186/s13072-020-00343-x
Source DB: PubMed Journal: Epigenetics Chromatin ISSN: 1756-8935 Impact factor: 4.954
Fig. 1Application and benchmarking of BAMscale on different sequencing datasets. a Scaled coverage track generation and peak quantification of ChIP-seq and ATAC-seq data. Local differential H3K27ac signal at the HOXB7 locus in MV4-11 (wild type) and PKC412-resistant (R) (drug-resistant) cells, and global H3K27ac increase. b Creating exon–intron boundary aware stranded and un-stranded RNA-seq coverage tracks. c OK-seq coverage track creation using BAMscale outputs scaled strand-specific coverage tracks, and the replication-fork directionality tracks. d Analysis of replication timing data. e Mapping DNA-breaks from END-seq, creating strand-specific or unstranded coverage tracks
Capabilities of BAMScale and other publicly available tools
| Tool | ||||||
|---|---|---|---|---|---|---|
| Creating coverage tracks | ||||||
| ChIP-seq/ATAC-seq | X | X | X | * | X | |
| ChIP-seq/ATAC-seq (normalized) | X | X | X | |||
| Log2 coverage (replication timing) | X | X | ||||
| OK-seq (RFD calculation) | X | |||||
| RNA-seq | X | * | * | * | X | |
| RNA-seq (splice-aware) | X | |||||
| Stranded coverage | X | X | ||||
| Quantifying peaks | ||||||
| Raw read counts | x | x | ||||
| Normalized read counts | x | |||||
* Scaling factor cannot be specified
** BAM file has to be pre-filtered for alignment quality
Fig. 2Benchmarking BAMscale on ATAC-seq data. a Performance comparison of peak quantification and b correlation of raw read counts in ~ 33 k peaks between BAMscale and bedtools. c Coverage tracks generation benchmarks using IGVtools (unscaled output), MACS2 (pileup: unscaled, callpeak: scaled with peak calling), deeptools and BAMscale (scaled output). d ATAC-seq signal change induced from camptothecin (CPT) is observed in wild-type CEM-CCRF cells (SLFN11 positive), and not in the SLFN11 isogenic knockout. e Colocalization of opening ATAC-seq peaks using GIGGLE and Cistrome. f Examples of chromatin accessibility in the TOP1 and CTCF genes
Fig. 3Re-analysis of RNA-seq data. a Overview of BAMscale RNA-seq mode. Coverage resolution is switched to single-base resolution in bins where adjacent base coverages exceed 4 reads. b Comparison of coverage tracks between IGVTools, Deeptools and BAMscale. c Performance comparison of three tools. d Differentially expressed genes between Top1mt wild-type and knock-out mice. e Comparison of unscaled and scaled coverages
Fig. 4Comparison of replication timing and DNA-breaks. a Visualization of DNA-replication timing, synchronized replication origins (OK-seq) and DNA-breaks mapped with END-seq on chromosome 7. b Mean replication timing distributions of genomic regions enriched with END-seq signal compared to random peaks. c Elevated END-seq regions overlap early replicating regions
Fig. 5Comparison of different replication sequencing methods. a Replication timing ratios where higher ratios correspond to earlier replicating regions (i), replication timing profile (ii), active/repressed chromatin regions (iii), strong OK-seq strand switch coordinates (iv), OK-seq replication-fork directionality ratios (v) and NS-seq (replication origin) tracks (vi) for K562 cell line. b OK-seq strand switches in the four segments of replication. c NS-seq peak abundances in the four replication timing phases. d Comparison of analysis time between deeptools bamCompare and BAMscale for replication timing data