| Literature DB >> 35325188 |
Zhaonan Zou1,2,3, Tazro Ohta4, Fumihito Miura5, Shinya Oki1,6.
Abstract
ChIP-Atlas (https://chip-atlas.org) is a web service providing both GUI- and API-based data-mining tools to reveal the architecture of the transcription regulatory landscape. ChIP-Atlas is powered by comprehensively integrating all data sets from high-throughput ChIP-seq and DNase-seq, a method for profiling chromatin regions accessible to DNase. In this update, we further collected all the ATAC-seq and whole-genome bisulfite-seq data for six model organisms (human, mouse, rat, fruit fly, nematode, and budding yeast) with the latest genome assemblies. These together with ChIP-seq data can be visualized with the Peak Browser tool and a genome browser to explore the epigenomic landscape of a query genomic locus, such as its chromatin accessibility, DNA methylation status, and protein-genome interactions. This epigenomic landscape can also be characterized for multiple genes and genomic loci by querying with the Enrichment Analysis tool, which, for example, revealed that inflammatory bowel disease-associated SNPs are the most significantly hypo-methylated in neutrophils. Therefore, ChIP-Atlas provides a panoramic view of the whole epigenomic landscape. All datasets are free to download via either a simple button on the web page or an API.Entities:
Year: 2022 PMID: 35325188 PMCID: PMC9252733 DOI: 10.1093/nar/gkac199
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1.Overview of the ChIP-Atlas data set and computational processing. (A) Numbers of ChIP-seq, DNase-seq, ATAC-seq and WGBS experiments recorded in ChIP-Atlas (as of September 2021). Colors indicate different experiment types. Color legend is shown in the figure. Hm, Homo sapiens; Mm, Mus musculus; Rn, Rattus norvegicus; Dm, Drosophila melanogaster; Ce, Caenorhabditis elegans; Sc, Saccharomyces cerevisiae. (B) Cumulative number of SRX-based experiments recorded in ChIP-Atlas. Colors of the dots indicate different experiment types. Color legend is the same as that in (A). The gray, blue, and orange arrowheads indicate the timing of the release of ChIP-Atlas in 2015, the publication of the first ChIP-Atlas paper in 2018, and the inclusion of ATAC-seq and WGBS in 2021, respectively. ChIP-seq and DNase-seq data published before 2015, and ATAC-seq or WGBS data published before January 2021 are shown in gray. (C) Numbers of experiments according to cell type classes for human, mouse, and fruit fly data. PSC, pluripotent stem cell; CDV, cardiovascular; DGT, digestive tract; EMF, embryonic fibroblast. (D) Overview of data processing. Raw sequence data are downloaded from NCBI SRA, aligned to a reference genome, and subjected to peak calling, all of which can be monitored with the IGV genome browser. MR, methylated region.
Data statistics of ChIP-Atlas
| Experiment type | ChIP-seq | ATAC-seq | DNase-seq | Bisulfite-seq | All | |
|---|---|---|---|---|---|---|
| Year | 2021 | 2018 | 2021 | 2021 | 2021 | 2021 |
| Number of experiments | 182 891 | 74 076 | 66 104 | 5346 | 51 074 | 305 415 |
| Number of ChIP antigens | 2 477 | 1 979 | NA | NA | NA | 2 477 |
| Number of cell types | 2 486 | 1 615 | 772 | 406 | 553 | 2 918 |
| Number of peaks | 1 329 049 688 | 717 206 934 | 362 032 813 | 128 597 931 | 3 798 954 139 | 5 618 634 571 |
Comparison of ChIP-Atlas with other similar web services
| ChIP-Atlas | Cistrom DB | ReMap | GTRD | MethBank | |
|---|---|---|---|---|---|
| Data source | NCBI SRA | GEO, ENCODE, and Roadmap Epigenetics | GEO, ENCODE, and ENA | GEO, SRA, ENCODE, and modENCODE | NCBI SRA and Genome Sequence Archive |
| Experiments | ChIP-seq, | ChIP-seq, ATAC-seq, and DNase-seq | ChIP-seq, ChIP-exo, and DAP-seq | ChIP-seq, ChIP-exo, ChIP-nexus, MNase-seq, DNase-seq, FAIRE-seq, ATAC-seq, and RNA-seq | Bisulfite-seq |
| Filtering of data for quality control | No | Yes | Yes | No | Yes |
| Number of experiments | 76 217 → | 56 442 | 19 983 | 36 540 | 673 |
| Organism | Hs, Mm, Rn, Dm, Ce, and Sc | Hs and Mm | Hs, Mm, Dm and At | Hs, Mm, At, Ce, Dr, Dm, Rn, Sc and Sp | Hs, Dr, Mm, Os, Gm, Me, Pv and Sl |
| Genome assembly | hg19, | hg38 and mm10 | hg38, mm10, dm6, and tair10 (can be lifted to hg19/mm9 with liftover) | hg38, mm10, tair10, ce11, danRer11, dm6, rn6, sacCer3 and spo2 | hg38, danRer7, mm10, IRGSP-1.0, Gmax_275_v2.0, Mesculenta_305_v6, Pvulgaris_218_v1 and GCF_000188115.3_SL2.53 |
| Alignment tool | Bowtie2/ | BWA | Bowtie2 | Bowtie2 | WSBA |
| Peak caller | MACS2/ | MACS2 | MACS2 | MACS2, MACS, GEM, PICS, SISSRs | NA |
| Display format for each experiment | Alignment and peaks | Alignment and peaks | Peaks | Peaks | Alignment |
| Browsing assembled peaks | Possible | None | Possible | Possible | NA |
| Genome browser | IGV and UCSC | WashU and UCSC | UCSC, ENSEMBL and IGV | Self-developed, UCSC and ENSEMBL | JBrowse |
| Integrative analysis tools | Search tool for target genes and colocalizing factors of given TF, and enrichment analysis tool for given genes and genomic coordinates | Search tool for target genes of single experiment, TFs binding to single given genomic locus or query gene | Enrichment analysis tool for given genomic coordinates relative to random background | Search tool for target genes of given TF | Predictor of DNA methylation, age of human blood and enrichment analysis tool for identification of differentially methylated promoters |
Hm, Homo sapiens; Mm, Mus musculus; Rn, Rattus norvegicus; Dm, Drosophila melanogaster; Ce, Caenorhabditis elegans; Sc, Saccharomyces cerevisiae; At, Arabidopsis thaliana; Dr, Danio rerio; Sp, Schizosaccharomyces pombe; Os, Oryza sativa; Gm, Glycine max; Me, Manihot esculenta; Pv, Phaseolus vulgaris; Sl, Solanum lycopersicum.
Figure 2.Examples of use of the integrative analysis tools in ChIP-Atlas. (A) Parameters when users perform a query to show chromatin accessibility, methylation status, and TF binding around mouse Tcf19–Cchcr1 locus in spermatogonia using the Peak Browser tool. (B) Peak-call data for TF ChIP-seq, ATAC-seq and WGBS SRXs around the mouse Tcf19–Cchcr1 locus are shown in the IGV genome browser. The highlighted region indicates an accessible chromatin and hypo-methylated region. Bars represent the peak regions, with the curated names of the ChIP antigens (only for ChIP-seq peaks) and cell types being shown below the bars. The color of the bars in the ‘ChIP (TFs)’ and ‘ATAC’ panel indicates the score calculated with the peak-caller MACS2 (−log10[Q-value]). Black, pink, and beige bars in the ‘WGBS’ panel indicate hyper-, hypo- and partially methylated regions, respectively. (C) Parameters when users perform a query to evaluate TF binding, chromatin accessibility, and methylation levels of multiple IBD-associated SNPs in all cell types. (D) The resultant HTML of enrichment analysis using WGBS data (upper). Volcano plots for visualizing the results (bottom). X-axis, log2[fold enrichment]; Y-axis, –log10[P-value]. Each dot represents an individual SRX. SRXs are categorized by cell type classes and indicated by different colors. Red, ‘Blood’ class. The whole color legend is summarized in Supplementary Table S2.