| Literature DB >> 26229744 |
Adam Scheidegger1, Adam Burkholder2, Ata Abbas1, Kris Zarns3, Ann Samarakkody1, Sergei Nechaev1.
Abstract
While a role of promoter-proximal RNA Polymerase II (Pol II) pausing in regulation of eukaryotic gene expression is implied, the mechanisms and dynamics of this process are poorly understood. We performed genome-wide analysis of short capped RNAs (scRNAs) and Pol II chromatin immunoprecipitation sequencing (ChIP-seq) in human breast cancer MCF-7 cells to better understand Pol II pausing (Samarakkody, A., Abbas, A., Scheidegger, A., Warns, J., Nnoli, O., Jokinen, B., Zarns, K., Kubat, B., Dhasarathy, A. and Nechaev, S. (2015) RNA polymerase II pausing can be retained or acquired during activation of genes involved in the epithelial to mesenchymal transition. Nucleic Acids Res43, 3938-3949). The data are available at the NCBI Gene Expression Omnibus under accession number GSE67041. For both ChIP and scRNA samples, we used paired end sequencing on the Illumina MiSeq instrument. For ChIP-seq, the use of paired end sequencing allowed us to avoid ambiguities in center-read definition. For scRNA seq, this allowed us to identify both the 5'-end and the 3'-end in the same run that represent, respectively, the transcription start sites and the locations of Pol II pausing. The sharpening of Pol II ChIP-seqmetagene profiles when aligned against 5'-ends of scRNAs indicates that these RNAs can be used to define the start sites for the majority of mRNA transcription events.Entities:
Keywords: Paired end ChIP-sequencing; Pol II pausing; Short capped RNA
Year: 2015 PMID: 26229744 PMCID: PMC4516138 DOI: 10.1016/j.gdata.2015.06.021
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Fig. 1Preparation of ChIP-seq and scRNA-seq libraries. A. Left. Sonicated DNA prepared from chromatin before ChIP. The black square shows the size range of DNA fragments that was extracted from the gel. Right. Example of a ChIP-sequencing library ready for cluster generation. Both images show ethidium bromide stained agarose-TAE gel. B. A scRNA library from MCF-7 cells resolved on a 6% TBE polyacrylamide gel. 25-bp ladder (Life Technologies) was used as a size marker. The band running at 125 bp size corresponds to the linker dimer with no insert and was avoided. The area marked by the black square was extracted from the gel for sequencing.
Fig. 2Quality control of Pol II ChIP-seq by qPCR. A. Quantitative PCR on DNA extracted from precipitated chromatin before preparation of ChIP-sequencing libraries. Left panel shows positive (ACTB and GAPDH) and negative (NC1 and NC2) ChIP control primer pairs suitable for Pol II analysis (Active Motif). The right panel shows human Snail ChIP primers specific for the upstream, promoter, and downstream regions [1]. Y-axes show ChIP signal as percent of input DNA. B. Quantitative PCR of libraries prepared from ChIP material in A. Because calculating percent input in libraries is impossible, qPCR signal was normalized against Snail promoter primer pair, which was taken as 1. Numbers next to the arrows indicate fold-difference between indicated values (average of 2 independent biological replicates).
Fig. 3Centering of Pol II ChIP-sequencing signal around transcriptional start sites. The plot shows metagene analysis as in [1], except that metagene plots for ChIP-seq signal are overlaid. Plots are centered around RefSeq annotated transcription start sites (TSSs) (Green) and scRNA-defined TSSs (Blue). Defining TSSs based on scRNAs sharpens the Pol II signal. We suggest that the previously observed double peaks of Pol II at the promoters [10] are reproducible, but likely arise from sonication-based ChIP-sequencing [1], and are distinct from the broad peak corresponding to divergent transcription (marked with “D”).
Alignment statistics for all samples included in the analysis.
| Sample | Aligned reads | Multi-mapped reads (suppressed) | |
|---|---|---|---|
| ChIP 1 | 13,446,295 (85.90%) | 540,939 (3.46%) | |
| ChIP 2 | 14,635,302 (86.36%) | 561,951 (3.32%) | |
| scRNA 1 | 1,837,112 (38.15%) | 624,503 (12.97%) | |
| scRNA 2 | 4,980,050 (48.97%) | 2,667,232 (26.23%) | |
| scRNA 3 | 1,610,543 (50.54%) | 1,120,507 (35.16%) |
| Organism/cell line/tissue | |
| Sex | |
| Sequencer or array type | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Consent | |
| Sample source location | |