| Literature DB >> 23193294 |
Jesse D Ziebarth1, Anindya Bhattacharya, Yan Cui.
Abstract
CTCF is a highly conserved transcriptional regulator protein that performs diverse functions such as regulating gene expression and organizing the 3D structure of the genome. Here, we describe recent updates to a database of CTCF-binding sites, CTCFBSDB (http://insulatordb.uthsc.edu/), which now contains almost 15 million CTCF-binding sequences in 10 species. Since the original publication of the database, studies of the 3D structure of the genome, such as those provided by Hi-C experiments, have suggested that CTCF plays an important role in mediating intra- and inter-chromosomal interactions. To reflect this important progress, we have integrated CTCF-binding sites with genomic topological domains defined using Hi-C data. Additionally, the updated database includes new features enabled by new CTCF-binding site data, including binding site occupancy and the ability to visualize overlapping CTCF-binding sites determined in separate experiments.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23193294 PMCID: PMC3531215 DOI: 10.1093/nar/gks1165
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Screenshot of an example webpage for a CTCF-binding sequence (ENCODE_OC_hg18_MCF-7_744758) in CTCFBSDB 2.0. The database provides a description of the binding site, where the binding sequence is located within topological domains, and a Genome Browser viewer showing the genomic context of the binding site. Users also have the option to display the expression of genes flanking the binding site and CTCF-binding sequences that overlap the sequence. This CTCF-binding sequence, which was identified in MCF-7 cells, overlaps binding sequences that were identified in four other cell types.
Figure 2.Gene expression profiles for genes flanking a CTCF-binding site (ENCODE_OC_hg18_MCF-7_744758). CTCFBSDB provides images comparing expression profiles identified using both RNA-Seq (top) and microarrays (bottom) for genes flanking the CTCF-binding site.
Description of fields used to annotated CTCF-binding sites
| Field name | Description |
|---|---|
| ID | Unique database identifier for the binding sequence |
| Species and build | The species and genomic build in which the binding site was determined |
| Location | Genomic location of the binding sequence |
| ENCODE | Whether or not the site was determined in an ENCODE dataset |
| Source | PubmedID or ENCODE accession number containing the binding site |
| Cell and experiment type | Experimental conditions in which the site was identified |
| Occupancy | Numerical value of the occupancy of the binding site reported in the original source |
| Occupancy% | Percentile of Occupancy within sites of the source dataset |
| M1M2 Class | Binding site motif class |
| ENCODE Peak location | Location of the ChIP-Seq peak of the binding site for ENCODE datasets |