| Literature DB >> 35463472 |
Satyanarayan Rao1,2, Srinivas Ramachandran1,2.
Abstract
Here, we present a pipeline to map states of protein-binding DNA in vivo. Our pipeline infers as well as quantifies cooperative binding. Using dual-enzyme single-molecule footprinting (dSMF) data, we show how our workflow identifies binding states at an enhancer in Drosophila S2 cells. Data from cells lacking endogenous DNA methylation are a prerequisite for this pipeline. For complete details on the use and execution of this protocol, please refer to Rao et al. (2021) and Krebs et al. (2017).Entities:
Keywords: Bioinformatics; Genomics; Molecular Biology; Sequence analysis
Mesh:
Substances:
Year: 2022 PMID: 35463472 PMCID: PMC9026571 DOI: 10.1016/j.xpro.2022.101299
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
Figure 1An example of three binding states observed at Peak 229 (position 0 is at chr2L:480305)
(Left) Each line in the heatmap is a DNA molecule (the grey fill). A red dot represents a methylated cytosine and a dark gray dot represents an unmethylated cytosine. (Right) Blue lines are footprint calls on DNA molecules. These two heatmaps are directly taken from the output of the pipeline and custom labeled. Number of DNA molecules representing protein-DNA binding states is denoted on Y-axis. D: Naked DNA; T: TF-bound; N: Nucleosome-bound. For optimal visualization, in each state, DNA molecules are sorted by their length and coordinates relative to position 0.
Figure 2An example of cobinding states observed at enhancer Peak 110 (position 0 is at chr2L: 19155173)
(Left) Each line in the heatmap is a DNA molecule (the grey fill). A red dot represents a methylated cytosine and a dark gray dot represents an unmethylated cytosine. (Right) Blue lines are footprint calls on DNA molecules. These two heatmaps are directly taken from the output of the pipeline and custom labeled. DNA molecules spanning the two MNase peaks (shown in grey box at the top) are included in the plot. The MNase peaks are separated by 78 bp. Eight of nine possible states are found at this locus. The number of DNA molecules mapped to each protein-DNA binding state is denoted on Y-axis. D: Naked DNA; T: TF-bound; N: Nucleosome-bound. T-T, for example represents both sites bound by TFs simultaneously. For optimal visualization, in each state, DNA molecules are sorted by their length and coordinates relative to position 0.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| dSMF bisulfite sequencing data ( | ( | |
| STARR-seq summits from | ( | |
| Snakemake | ( | |
| Gnuplot | N/A | |
| Trim Galore | N/A | |
| Bowtie2 | ( | |
| Bismark | ( | |
| Bamtools | ( | |
| Samtools | ( | |
| Deeptools | ( | |
| Anaconda | N/A | |
| Bioconda | ( | |
| Operating System | Linux/Unix | N/A |