| Literature DB >> 28616452 |
Minou Bina1, Phillip Wyss1, Xiaohui C Song2.
Abstract
While MLL1 activates gene expression in most tissues, ZFP57 represses transcription. MLL1 selectively interacts with a group of nonmethylated DNA sequences known as the MLL1 morphemes. ZFP57 associates with a methylated hexamer (ZFBS), dispersed in the genomic DNA segments known as Imprinted Control Regions (ICRs) and germline Differentially Methylated Regions (gDMRs), to maintain allele-specific gene repression. We have identified a set of composite DNA elements (ZFBS-Morph overlaps) that provides the sequence context of ZFBS in the canonical ICRs/gDMRs. This report provides tables listing the nucleotide sequences of the MLL1 morphemes and ZFBS-Morph overlaps. The report also offers links to the data repository at Purdue University, for downloading the positions of the MLL1 morphemes, the ZFP57 binding site, and the ZFBS-Morph overlaps in the mouse genome.Entities:
Keywords: CpG-rich motifs; Gene regulation; Genomic imprinting; KMT2A; MLL1 morphemes; Mouse genome; ZFP57 binding site
Year: 2017 PMID: 28616452 PMCID: PMC5458072 DOI: 10.1016/j.dib.2017.05.050
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
MLL1 morphemes.
| CGACG CGTCG |
| CGCCG CGGCG |
| CGCGCG |
| CGTGCG CGCACG |
| CGCCCG CGGGCG |
| CGGACG CGTCCG |
| CGTACG |
Fig. 1A cluster of ZFBS-Morph overlaps localizes the possible peak position of the Zac1 gDMR. Box 1 marks the position of CpG72, a conserved CGI that is methylated in oocyte DNA [11]. CpG72 includes a cluster of 5 ZFBS-Morph overlaps, marked by Box 2. As expected, a cluster of ZFBS also is present in CpG72 (Box 3). Random occurrences of ZFBS are marked by Box 4. A cluster of ZFBS also maps to a region that is not part of the gDMR (Box 5). That region includes a single, isolated, ZFBS-Morph overlap. The CGI that is not imprinted (CpG36) does not contain ZFBS-Morph overlaps. The track labeled MLL1 sites shows the position of the MLL1 morphemes in the displayed chromosomal location (chr10:12,749,001–12,879,000). In this relatively long genomic DNA segment (130,000 bps), closely-spaced MLL1 morphemes appear as thick vertical bars, isolated occurrences as thin vertical lines. Clustering of the MLL1 morphemes in CGIs is more apparent in shorter DNA segments; for examples see Refs. [3], [6].
ZFBS-Morph overlaps.
| TGCCGCG | CGCGGCA |
| TGCCGCCG | CGGCGGCA |
| TGCCGCGCG | CGCGCGGCA |
| TGCCGCCCG | CGGGCGGCA |
| TGCCGCACG | CGTGCGGCA |
Closely-spaced ZFBS-Morph overlaps in the canonical ICRs in the mouse genome. Identical genes that are displayed in 2 rows contain closely-spaced ZFBS-Morph overlaps at two different genomic positions.
| chr1:63,246,711-63,246,910 | TGCCGCCG, CGCGGCA | |
| chr2:157,385,801-157,387,500 | TGCCGCG, CGGGCGGCA, TGCCGCG | |
| chr2:152,512,591-152,512,650 | TGCCGCG, TGCCGCGCG | |
| chr2:174,121,336-174,121,660 | TGCCGCG, CGCGGCA, TGCCGCG, CGCGCGGCA | |
| chr2:174,124,701-174,125,300 | CGCGGCA, TGCCGCCCG, TGCCGCCCG, TGCCGCCG | |
| chr2:174,152,536-174,154,195 | TGCCGCCG, CGGCGGCA, TGCCGCCG, TGCCGCCCG | |
| chr2:174,155,591-174,156,025 | CGGCGGCA, TGCCGCG | |
| chr6:4,697,131-4,698,550 | TGCCGCG, TGCCGCG | |
| chr6:30,687,491-30,688,825 | TGCCGCG, CGCGGCA, TGCCGCG, CGGGCGGCA, TGCCGCG, TGCCGCG | |
| chr6:58,856,861-58,857,170 | CGCGGCA, CGCGGCA | |
| chr7:67,148,966-67,149,720 | CGCGGCA, CGCGGCA | |
| chr7:6,681,601-6,683,200 | CGTGCGGCA, CGCGGCA, TGCCGCG, CGCGGCA | |
| chr7:135,831,441-135,832,095 | CGCGGCA, TGCCGCG, CGCGGCA, TGCCGCG | |
| chr7:149,765,896-149,766,315 | CGCGGCA, TGCCGCG, CGCGGCA | |
| chr7:149,767,676-149,767,975 | TGCCGCCG, CGTGCGGCA, CGCGGCA | |
| chr7:150,481,306-150,481,730 | CGCGGCA, TGCCGCG | |
| chr8:125,388,921-125,389,390 | TGCCGCG, TGCCGCG | |
| chr9:89,774,326-89,775,050 | TGCCGCG, TGCCGCG | |
| chr10:12,810,341-12,811,120 | CGCGGCA, CGCGGCA, TGCCGCG, TGCCGCG, TGCCGCG | |
| chr11:11,925,501-11,926,400 | CGCGGCA, CGCGGCA | |
| chr12:110,764,761-110,766,795 | CGCGGCA, CGCGGCA, TGCCGCG, TGCCGCG, TGCCGCG | |
| chr15:72,640,121-72,641,650 | CGCGGCA, CGCGGCA | |
| chr17:12,934,306-12,935,515 | CGCGGCA, TGCCGCG, CGCGGCA, CGCGGCA, CGCGGCA, TGCCGCG, TGCCGCG |
| Genomics | |
| Gene regulation | |
| Tables and text files (in bed format, for display at the UCSC genome browser) | |
| Analyzing the mouse chromosomes using Perl Scripts | |
| Tables and text files | |
| None | |
| Two links to files deposited at the Purdue University Research Repository: | |
| 1) Bina, M., Wyss, P.J., Wang, D., Song, X.C. (2014). Localization of MLL1 morphemes in mouse mm9 genomic DNA. Purdue University Research Repository. doi:10.4231/R7KW5CXF | |
| 2) Bina, M., Wyss, P.J., Wang, D., Song, X.C. (2014). Localization of MLL1 morphemes in mouse mm9 genomic DNA. Purdue University Research Repository. doi:10.4231/R7KW5CXF | |