| Literature DB >> 29077884 |
Zhen Wang1, Quanwei Zhang1, Wen Zhang1, Jhih-Rong Lin1, Ying Cai1, Joydeep Mitra1, Zhengdong D Zhang1.
Abstract
Enhancers, as specialized genomic cis-regulatory elements, activate transcription of their target genes and play an important role in pathogenesis of many human complex diseases. Despite recent systematic identification of them in the human genome, currently there is an urgent need for comprehensive annotation databases of human enhancers with a focus on their disease connections. In response, we built the Human Enhancer Disease Database (HEDD) to facilitate studies of enhancers and their potential roles in human complex diseases. HEDD currently provides comprehensive genomic information for ∼2.8 million human enhancers identified by ENCODE, FANTOM5 and RoadMap with disease association scores based on enhancer-gene and gene-disease connections. It also provides Web-based analytical tools to visualize enhancer networks and score enhancers given a set of selected genes in a specific gene network. HEDD is freely accessible at http://zdzlab.einstein.yu.edu/1/hedd.php. Published by Oxford University Press on behalf of Nucleic Acids Research 2017.Entities:
Mesh:
Year: 2018 PMID: 29077884 PMCID: PMC5753236 DOI: 10.1093/nar/gkx988
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Database content and construction. HEDD collected the enhancers (from three major epigenome study projects), enhancer target gene and gene disease set to quantify the connections between enhancer and diseases. Besides the disease information, it also stores the genetic and epigenetic information (e.g. DHS, TFBS, conservation score) related to enhancers. Users can query with multiple options (e.g. genome locations, disease name, gene name) to acquire enhancers and further view all the detail information such as, associated disease, cell type/tissue, overlapped GWAS SNPs, CADD score and neighboring enhancer for a specific enhancer. It enables users to do scoring analysis: for a gene set of interest, users can score enhancers in a gene network for their ‘relatedness’ to the gene set. All results of query and analysis can be downloaded for further analysis. DHS: DNase I hypersensitive sites, GWAS: genome-wide association studies, TF: transcription factor, TFBS: transcription factor biding sites, CADD: combined annotation dependent depletion.
Summary of data sources (as of April 2017)
| Source | Cell-types/tissue | Number of records | Total | |
|---|---|---|---|---|
| Enhancer | ENCODE | 6 | 399 124 | 2 793 316 |
| FANTOM5 | — | 65 359 | ||
| RoadMap | 111 | 2 328 833 | ||
| Disease | DISEASES | — | 44 581 | 523 109 |
| MalaCards | — | 49 492 | ||
| DisGeNET | — | 429 036 | ||
| GWAS | GWAS Catalog | — | 35 329 | 349 566 |
| GWASdb v2 | — | 314 237 | ||
| SNV | CADD | — | ∼8.6 billion | ∼8.6 billion |
| Genome segmentation state | UCSC | 6 | 11 062 356 | 11 062 356 |
| DHS | UCSC | 120 | 10 040 306 | 10 040 306 |
| TFBS activity | Ensembl | 68 | 22 801 | 22 801 |
| TFBS | UCSC | 91 | 1611 | 1611 |
| Histone modification | UCSC | 19 | 411 | 411 |
| Repeats | UCSC | — | — | 1 533 636 |
| Conservation | UCSC | — | — | |
| Target gene connection | ENCODE | — | 13 812 | |
| FANTOM5 | 66 943 | |||
| GTEx | 26 393 329 | |||
| Network | Nodes | Edges | ||
| HINT | 11 984 | 53 405 | ||
| HPRD | 9 460 | 36 985 | ||
| HIPPIE | 16 567 | 276 051 | ||
| PIPs | 5 445 | 37 343 | ||
| CCSB | 4 230 | 13 427 | ||
| IID | 18 080 | 915 091 | ||
| UniHI | 17 685 | 364 777 |
Note: 1. The number of markers.
Figure 2.Interactive searching and browsing activity of HEDD. (A) Input parameters for query. (B) The result table, including enhancer IDs, genomic coordinates, target genes, cell types/tissues, conservation scores and sources. (C) Details of a selected enhancer from the result table, including its associated diseases, functional annotation from GWAS and CADD, overlapped genomic elements such as DHS, TFBSs, histone modification and repeats, TFBSs activity levels, genome segmentation states, the comparison among cell/tissues types, regulatory network, and neighboring enhancers.