| Literature DB >> 36012247 |
Victor G Levitsky1,2, Alexey M Mukhin1, Dmitry Yu Oshchepkov1, Elena V Zemlyanskaya1,2, Sergey A Lashin1,2.
Abstract
(1) Background: The widespread application of ChIP-seq technology requires annotation of cis-regulatory modules through the search of co-occurred motifs. (2)Entities:
Keywords: chromatin immunoprecipitation with massively parallel sequencing; co-binding of transcription factors; composite elements; motifs conservation; overlap of motifs; transcription factor binding sites; transcription factors binding sites prediction
Mesh:
Substances:
Year: 2022 PMID: 36012247 PMCID: PMC9408884 DOI: 10.3390/ijms23168981
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Comparison of various tools for prediction of CEs in ChIP-seq data.
| Tool Name | A Single Dataset of Peaks Is Sufficient | Overlapped Motifs Are Allowed | URL | Reference |
|---|---|---|---|---|
| SpaMo | Yes | No | [ | |
| TACO | No | Yes | [ | |
| MCOT, | Yes | Yes | [ |
Figure 1Basic Web-MCOT pages: (A) homepage, (B) application, and (C) example.
Figure 2Basic output table of Web-MCOT for the example FoxA2 ChIP-seq dataset [18]. The first column lists the names of the second motifs in CEs. The common header ‘CE significance, −Log10(P-value)’ joins columns showing the significances of CEs without respect to motif conservation for five computation flows. The column ‘Similarity, −Log10(P-value)’ shows the significance of similarity of motifs. The common header ‘Asymmetry, −Log10(P-value)’ joins columns with the significances of asymmetry within CEs for five computation flows. The columns ‘CE histogram’ and ‘CE logo’ contain icons for histograms of CE abundance as a function of mutual orientation and location of the motifs, and for logos for the most common CE structural type with an overlap of motifs. The last five columns with the common header ‘Asymmetry heatmap, per mille’ respect five flows and show the links to heamaps of CEs abundance as a function of the ratios of conservation between Anchor and Partner motifs.
Figure 3Distribution of structural variants of predicted CEs with various orientations, overlaps and spacers of the Anchor FoxA2 and Partner HNF1B motifs for the FoxA2 dataset [18]. Colors denote different mutual orientations. The letters in labels near axis X from left to right mean full (‘F’) or partial overlaps (‘P’), and spacer length (‘S’). The numbers preceding these letters denote the distance between nearest borders of two motifs, the length of overlap and the length of spacer, respectively. Axis Y shows the percentage of peaks containing CEs variants specific in mutual orientation and location. FoxA2 and HNF1B motifs were derived with the Homer tool [6] and the mouse Hocomoco library [14], respectively.
Figure 4Graphical description of the most common CEs with an overlap of the Anchor FoxA2 and Partner HNF1B motifs for the FoxA2 dataset [18]. (A) Logo of the most common CE structural type. Black/grey arrows show the location and orientation of Anchor/Partner motifs. (B) Heatmap visualization of relationship of motifs conservation in CEs. Axes X/Y show ranges of conservation level of Partner/Anchor motifs. The color implies the per-mille measure for difference between observed and expected abundances of CEs with specific conservation of motifs (see Section 4.3).
Figure 5Web-MCOT workflow details. (A) Input data comprise a dataset of ChIP-seq peaks; a nucleotide frequency matrix for an Anchor motif; and a nucleotide frequency matrix for a Partner motif or designation of a public library of partner motifs. (B–D) show CEs classification according to orientations, locations (overlaps and spacers), and ratios of motifs conservation. (E) 2 × 2 contingency tables for computation of significances of enrichment for three CE types: without taking into account relationships of motifs conservation, and with an Anchor/Partner motif possessing higher conservation than Partner/Anchor motif (notations CE+/CE− respect sequences with/without CEs). (F) 2 × 2 contingency table for computation of significances of asymmetry within CEs.