| Literature DB >> 26484232 |
N A Tchurikov1, O V Kretova1, D M Fedoseeva1, V R Chechetkin1, M A Gorbacheva1, A V Snezhkina1, I R Alembekov1, G I Kravatskaya1, Y V Kravatsky1.
Abstract
Hot spots of DNA double-strand breaks (DSBs) are associated with coordinated expression of genes in chromosomal domains (Tchurikov et al., 2011 [1]; 2013). These 50-150-kb DNA domains (denoted "forum domains") can be visualized by separation of undigested chromosomal DNA in pulsed-field agarose gels (Tchurikov et al., 1988; 1992) and used for genome-wide mapping of the DSBs that produce them. Recently, we described nine hot spots of DSBs in human rDNA genes and observed that, in rDNA units, the hot spots coincide with CTCF binding sites and H3K4me3 marks (Tchurikov et al., 2014), suggesting a role for DSBs in active transcription. Here we have used Illumina sequencing to map DSBs in chromosomes of human HEK293T cells, and describe in detail the experimental design and bioinformatics analysis of the data deposited in the Gene Expression Omnibus with accession number GSE53811 and associated with the study published in DNA Research (Kravatsky et al., 2015). Our data indicate that H3K4me3 marks often coincide with hot spots of DSBs in HEK293T cells and that the mapping of these hot spots is important for cancer genomic studies.Entities:
Keywords: Bioinformatics; Double-strand breaks; Fragile sites; H3K4me3 marks; HEK293T
Year: 2015 PMID: 26484232 PMCID: PMC4583641 DOI: 10.1016/j.gdata.2015.05.018
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Fig. 1Schematic representation of the procedures used for isolation of DNA samples inside 0.5% low-melt agarose (A) and the major steps of the RAFT procedure (B).
Fig. 2Bioinformatics pipelines.
Correlation of the data on mapping of DSBs and H3K4me3 marks in HEK293T cells.
z and zp are calculated by Genome Track Analyzer [6] and characterize the positional and ordering correlations between DSBs and H3K4me3 peak summits. The 1% significance thresholds for | z | and | zp | in the case of random correlations correspond to 2.58, while 5% significance thresholds correspond to 1.96. The negative values of zp indicate that H3K4me3 mark peak summits precede DSBs for some chromosomes (2, 3, 18, 19). The corresponding p-values were calculated using Gaussian statistics. All data have number of pairs of the nearest neighbors (NN) exceeding 50 to ensure the applicability of Gaussian statistics.
Fig. 3The mapping of hot spots of DSBs inside TMPRSS2 and ERG genes. The mapping results using the BAM file are shown using IGB Browser on Human Feb. 2009 (GRCh37/hg19) Assembly. The values at genes indicate exons numbers. The red bars indicate the regions that are involved very often in fusion variant possessing exon 1 from TMPRSS2 and exons 4–11 from ERG in prostate cancer [16].
| Specifications | |
|---|---|
| Organism/cell line/tissue | |
| Sex | Female |
| Sequencer or array type | Illumina Genome Analyzer IIx, Illumina MiSeq |
| Data format | Raw and processed. |
| Experimental factors | HEK293T cells were seeded in 10-cm culture plates 1–2 days before experiments in DMEM containing 10% FBS, and were used at approximately 60–80% confluency. |
| Experimental features | DNA domains, migrating in 0.8% agarose mini-gels from the DNA-agarose plugs, were electroeluted. Biotinylated oligonucleotides were ligated to DNA sequences at DSB sites. |
| Consent | Level of consent allowed for reuse if applicable (typically for human samples). |
| Sample source location | Moscow 119334, Russia |