| Literature DB >> 30371817 |
Yong Jiang1, Fengcui Qian1, Xuefeng Bai1, Yuejuan Liu1, Qiuyu Wang1, Bo Ai1, Xiaole Han1, Shanshan Shi1, Jian Zhang1, Xuecang Li1, Zhidong Tang1, Qi Pan1, Yuezhu Wang1, Fan Wang1, Chunquan Li1.
Abstract
Super-enhancers are important for controlling and defining the expression of cell-specific genes. With research on human disease and biological processes, human H3K27ac ChIP-seq datasets are accumulating rapidly, creating the urgent need to collect and process these data comprehensively and efficiently. More importantly, many studies showed that super-enhancer-associated single nucleotide polymorphisms (SNPs) and transcription factors (TFs) strongly influence human disease and biological processes. Here, we developed a comprehensive human super-enhancer database (SEdb, http://www.licpathway.net/sedb) that aimed to provide a large number of available resources on human super-enhancers. The database was annotated with potential functions of super-enhancers in the gene regulation. The current version of SEdb documented a total of 331 601 super-enhancers from 542 samples. Especially, unlike existing super-enhancer databases, we manually curated and classified 410 available H3K27ac samples from >2000 ChIP-seq samples from NCBI GEO/SRA. Furthermore, SEdb provides detailed genetic and epigenetic annotation information on super-enhancers. Information includes common SNPs, motif changes, expression quantitative trait locus (eQTL), risk SNPs, transcription factor binding sites (TFBSs), CRISPR/Cas9 target sites and Dnase I hypersensitivity sites (DHSs) for in-depth analyses of super-enhancers. SEdb will help elucidate super-enhancer-related functions and find potential biological effects.Entities:
Mesh:
Year: 2019 PMID: 30371817 PMCID: PMC6323980 DOI: 10.1093/nar/gky1025
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Database content and construction. SEdb-calculated super-enhancers based on H3K27ac ChIP-seq data. Genetic and epigenetic annotations were collected or calculated including common SNPs, eQTLs, risk SNPs, TFBSs, CRISPR/Cas9 target sites, DHSs, enhancers, motif changes and LD SNPs. Users query super-enhancers using three types: tissue-category-based query, gene-based query and sample-based advanced query. SEdb includes analytical tools and personalized genome browser to discover potential biological effects of super-enhancers. DHS: Dnase I hypersensitivity site, TFBS: transcription factor binding site, eQTL: expression quantitative trait locus.
Figure 2.The main functions and usage of SEdb. (A) Top navigation bar help users use functions of this database. (B) Table of search results including super-enhancer ID coded by SEdb (SE ID), Chr, Start, End, Size, Rank, Element, Common SNP, eQTL, Risk SNP, TFBS, CRISPR/Cas9 target site and Visualization (genome browser). (C) Overview of super-enhancer. (D) Detailed interactive table of annotation information. (E) Genes potentially associated with super-enhancers are provided through six identification strategies. Network diagram of their relationships. (F) Interactive table of super-enhancer elements, related annotation information and analysis tools. (G) Other super-enhancers identified in samples overlapping the super-enhancer. (H) Analysis of super-enhancers related to a query gene via relationships between super-enhancers and associated genes under different strategies. (I) Analysis of a common SNP, super-enhancers it appears in, and annotation of the common SNP. (J) Browse the sample details. (K) Visualization of genome browser for genetic and epigenetic information. (L) Data download. (M) Quantitative statistics of data sources in SEdb. (N) This is an overlap analysis tool that uses the super-enhancers provided in SEdb to annotate the user-submitted regions.
Comparison of SEdb with other databases that are based on human super-enhancer-related data and functions (20 June 2018)
| Function type | Data type/Specific function | SEdb | dbSUPER | SEA v2.0 |
|---|---|---|---|---|
| Interaction table /annotation | Number of human samples | 542 | 102 | 189 |
| Number of human super-enhancers | 331 601 | 69 205 | 164 398 | |
| Strategies of super-enhancer associated genesa | 6 | 3 | 3 | |
| Common SNP | ✓ | |||
| Motif changed | ✓ | |||
| eQTL | ✓ | |||
| Risk SNP | ✓ | |||
| TFBS | ✓ | ✓ | ||
| CRISPR/Cas9 target site | ✓ | ✓ | ||
| DHS | ✓ | |||
| Enhancerb | ✓ | |||
| LD SNP | ✓ | |||
| Genome browser | Super-enhancers | ✓ | ✓ | |
| Super-enhancer elements | ✓ | |||
| Genome segments | ✓ | |||
| SNP | ✓ | |||
| Common SNP | ✓ | ✓ | ||
| Risk SNP | ✓ | ✓ | ||
| TFBS conserved | ✓ | |||
| TFBS by ChIP-seq | ✓ | ✓ | ||
| CRISPR/Cas9 target site | ✓ | ✓ | ||
| DHS | ✓ | |||
| Enhancer | ✓ | |||
| Conservative score | ✓ | ✓ | ||
| Analysis functions | Gene-SE analysis | ✓ | ||
| SNP-SE analysis | ✓ | |||
| Overlap analysis | ✓ | ✓ | ✓ | |
| Region analysisc | ✓ | ✓ | ✓ | |
| Data browse | Simple browsed | ✓ | ✓ | ✓ |
| Browse based on samples classificatione | ✓ | |||
| Alphanumerically sortable table | ✓ | |||
| CGI tool | Genome location overlapf | ✓ | ||
| Other functions | Super-enhancer element annotation | ✓ | ||
| Overlap with other super-enhancers in different samples | ✓ |
aSuper-enhancer-associated genes obtained by different strategies or algorithms.
bChromHMM method or CAGE to predict enhancers.
cExternal link to GREAT and Galaxy.
dSimple browser function of super-enhancer samples.
eClassification of samples including Data sources, Biosample type, Tissue type and Biosample name.
fQuickly generate super-enhancers that overlap with the user-submitted genome location.