HEDD: Human Enhancer Disease Database.

Zhen Wang¹, Quanwei Zhang¹, Wen Zhang¹, Jhih-Rong Lin¹, Ying Cai¹, Joydeep Mitra¹, Zhengdong D Zhang¹.

Abstract

Enhancers, as specialized genomic cis-regulatory elements, activate transcription of their target genes and play an important role in pathogenesis of many human complex diseases. Despite recent systematic identification of them in the human genome, currently there is an urgent need for comprehensive annotation databases of human enhancers with a focus on their disease connections. In response, we built the Human Enhancer Disease Database (HEDD) to facilitate studies of enhancers and their potential roles in human complex diseases. HEDD currently provides comprehensive genomic information for ∼2.8 million human enhancers identified by ENCODE, FANTOM5 and RoadMap with disease association scores based on enhancer-gene and gene-disease connections. It also provides Web-based analytical tools to visualize enhancer networks and score enhancers given a set of selected genes in a specific gene network. HEDD is freely accessible at http://zdzlab.einstein.yu.edu/1/hedd.php. Published by Oxford University Press on behalf of Nucleic Acids Research 2017.

Year: 2018 PMID: 29077884 PMCID: PMC5753236 DOI: 10.1093/nar/gkx988

Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971

INTRODUCTION

Enhancers are specialized genomic cis-regulatory elements, capable of activating transcription of their target genes at great distances, and play a central role in regulating a wide range of important biological functions and processes, such as embryogenesis, development, and homeostasis, whose impairment could result in diseases (1). Numerous studies have shown that genetic variants associated with human complex diseases are significantly enriched in transcription-factor-occupied regions and DNase I hypersensitive sites, most of which overlap with enhancer regions (2–5). For example, SNPs associated with Type 2 diabetes are highly enriched in the pancreatic islet clustered enhancers, and ∼88% of the SNPs within the known prostate cancer loci are located in the putative enhancer regions identified in human prostatic carcinoma cells (6). Indeed, enhancer-related dysregulation of gene expression has been recognized as one of the main drivers in the pathogenesis of many diseases (7,8). For example, in certain cases of cancer, the MYC oncogene is commonly translocated close to the enhancer regions (9,10). Thanks to the recent rapid development of sequencing technology, genome annotation consortia such as ENCODE (11), FANTOM (12,13), and the NIH Epigenome Roadmap (14) have generated a massive amount of sequencing data of different types, which makes it possible to identify enhancers on a genome-wide scale. Although several databases have been set up for enhancers (e.g. DENDB (15) and EnhancerAtlas (16)) and super-enhancers (e.g. SEA (17) and dbSUPER (18)) in the human genome, they provide only limited basic information about enhancers, such as their coordinates, cell or tissue types, and nearby genes. As enhancers are highly relevant to human diseases, information about their disease connections can help us to better understand their potential roles in the biological processes of human diseases.
To facilitate studies of enhancers and their roles in the molecular mechanisms of human complex diseases, we developed the Human Enhancer Disease Database (HEDD), the first integrated and interactive online knowledge base of enhancers and their disease associations. Compared with earlier released enhancer-related databases, HEDD contains the most up-to-date and complete set of enhancers in the human genome. It not only provides comprehensive genomic annotation on every enhancer in the database but also makes connections between enhancers and diseases and scores them using a newly developed scoring method. Moreover, HEDD offers Web-based analytical tools to visualize enhancer networks and score enhancers given a set of selected genes in a specific gene network. Overall, as a comprehensive enhancer resource with a focus on diseases, HEDD provides a convenient platform to search, browse, and download data related to enhancers and enhancer-disease associations and facilitates studies of enhancers and their roles in human complex diseases.

MATERIALS AND METHODS

System design and implementation

The current version of HEDD has been developed using MySQL 5.7.17 (http://www.mysql.com) and runs on a Linux-based Apache Web server. PHP 5.4.16 (http://www.php.net/) is used for server-side scripting. We designed and built the interactive interface using Bootstrap 3 (http://getbootstrap.com/), a widely used HTML, CSS and JS framework. We recommend using a modern Web browser such as Firefox (preferred), Google Chrome, or Safari for the best display.

Data sources

We integrated different sources of enhancers, human diseases, and functional genomic annotation to construct a central repository of human enhancers and their disease associations (Figure 1 and Table 1).

Figure 1.

Database content and construction. HEDD collects enhancers (from three major epigenome study projects), enhancer target genes, and gene-disease sets to quantify the connections between enhancers and diseases. Besides the disease information, it also stores the genetic and epigenetic information (e.g. DHS, TFBS, conservation score) related to enhancers. Users can query with multiple options (e.g. genome location, disease name, gene name) to retrieve enhancers and then view detailed information for a specific enhancer, such as associated diseases, cell types/tissues, overlapped GWAS SNPs, CADD scores, and neighboring enhancers. It also enables scoring analysis: for a gene set of interest, users can score enhancers in a gene network for their ‘relatedness’ to the gene set. All results of query and analysis can be downloaded for further analysis. DHS: DNase I hypersensitive sites, GWAS: genome-wide association studies, TF: transcription factor, TFBS: transcription factor binding sites, CADD: combined annotation dependent depletion.

Table 1.

Summary of data sources (as of April 2017)

	Source	Cell-types/tissue	Number of records	Total
Enhancer	ENCODE	6	399 124	2 793 316
	FANTOM5	—	65 359
	RoadMap	111	2 328 833
Disease	DISEASES	—	44 581	523 109
	MalaCards	—	49 492
	DisGeNET	—	429 036
GWAS	GWAS Catalog	—	35 329	349 566
	GWASdb v2	—	314 237
SNV	CADD	—	∼8.6 billion	∼8.6 billion
Genome segmentation state	UCSC	6	11 062 356	11 062 356
DHS	UCSC	120	10 040 306	10 040 306
TFBS activity	Ensembl	68	22 801	22 801
TFBS	UCSC	91	161¹	161¹
Histone modification	UCSC	19	41¹	41¹
Repeats	UCSC	—	—	1 533 636
Conservation	UCSC	—	—
Target gene connection	ENCODE	—	13 812
	FANTOM5		66 943
	GTEx		26 393 329
Network		Nodes	Edges
	HINT	11 984	53 405
	HPRD	9 460	36 985
	HIPPIE	16 567	276 051
	PIPs	5 445	37 343
	CCSB	4 230	13 427
	IID	18 080	915 091
	UniHI	17 685	364 777

Note: 1. The number of markers.

Enhancers

The current release of HEDD makes available a total of 2 793 316 putative enhancers collected from three major genome/epigenome annotation projects: 399 124 from ENCODE (11) predicted jointly by two segmentation methods—ChromHMM and Segway (19), 65 359 from FANTOM5 (12,13) predicted by the cap analysis of gene expression (CAGE) (20) and 2 328 833 from RoadMap (14) predicted by ChromHMM (21).

Functional genomic annotation of enhancers

The genomic regions of enhancers usually have several prominent features, including DNase I hypersensitivity, transcription factor binding sites (TFBS), and enriched histone acetylation (22–24). To build an enhancer knowledge base, we collected six types of functional genomic annotation: (i) DNase I hypersensitive sites (DHS) (25,26), (ii) transcription factor binding sites (26,27), (iii) histone modification marks (26), (iv) repeats (28), (v) genome segmentation states (11), and (vi) evolutionary conservation (28). See Table 1 for a summary of these data sets.

Enhancer target genes

To study the biological function and the disease association of enhancers, it is critical to annotate their target genes. We collected enhancer target genes from three sources: (i) the genome-wide map of distal DHS-to-promoter connectivity data from ENCODE (11), (ii) intra-chromosomal enhancer-promoter expression correlation data from FANTOM (12) and (iii) eQTL and target gene data from GTEx (29) for enhancers identified by the RoadMap Project.

Human diseases

We collected human gene-disease association data from MalaCards (30), DISEASES (31) and DisGeNET (32). These databases provide both gene-disease pairs and the scores representing the strength of association between them. In addition, we annotated enhancers with disease/trait-associated or deleterious genetic variants from GWAS Catalog, GWASdb (33,34) and CADD (35), which provides scores of functional deleteriousness for both single nucleotide variants and insertion/deletion variants in the human genome.

Gene networks

As part of HEDD, we provide seven gene networks: HINT (High-quality Interactomes) (36), HPRD (Human Protein Reference Database) (37), HIPPIE (Human Integrated Protein Protein Interaction Reference) (38), PIPs (protein–protein interactions) (39), CCSB (Center for Cancer Systems Biology) (40), IID (Integrated Interactions Database) (41), and UniHI (Unified Human Interactome) (42). These networks are used to score enhancers in the context of a biological network.

Evolutionary conservation of genomic sequences

We quantified the evolutionary conservation of enhancers among placental mammals using the sequence conservation scores for positions in the human genome from the UCSC genome browser (28). HEDD provides summary statistics (the mean and the median) of conservation scores for enhancers.

Score the enhancer-disease connection

To study enhancer-related disease mechanisms, the first major challenge is to confidently link enhancers to diseases. We used disease-associated genes as the intermediaries to build such connections: if an enhancer targets a disease-associated gene, then this enhancer is functionally connected to the disease. As connections between enhancers and genes and between genes and diseases are both scored and their scores have the same directionality (the higher the score, the stronger the connection), we were able to quantify the connections between enhancers and diseases. To do this, we first calculated the percentile for every score in a set of scores from a particular source as the probability of connection between an enhancer and a gene (pEG) or between a gene and a disease (pGD). We then computed the probability of connection between an enhancer and a disease (pED) by multiplying pEG and pGD, the two probabilities of their respective connections with an intermediate gene: pED = pEG × pGD. In this way, we connect enhancers and diseases through genes that are connected to both of them, and also quantify the strength of their connections.
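The scoring scheme described above can be sketched in a few lines of Python. This is a minimal illustration rather than the HEDD implementation: the function names, the input tuples, and the exact percentile definition (the fraction of scores from the same source that are less than or equal to a given score) are assumptions made for the example. The text does not say how several intermediate genes for the same enhancer-disease pair are combined, so the sketch keeps one pED value per intermediate gene.

```python
from bisect import bisect_right


def percentile_ranks(scores):
    """Convert raw scores from one source to percentiles in (0, 1].

    Assumption: the percentile of a score is the fraction of scores in the
    same source that are less than or equal to it.
    """
    ordered = sorted(scores)
    n = len(ordered)
    return [bisect_right(ordered, s) / n for s in scores]


def enhancer_disease_scores(eg_links, gd_links):
    """Compute pED = pEG * pGD for every enhancer-gene-disease chain.

    eg_links: list of (enhancer, gene, raw_score) from one source.
    gd_links: list of (gene, disease, raw_score) from one source.
    Returns a dict {(enhancer, gene, disease): pED}.
    """
    p_eg = percentile_ranks([score for _, _, score in eg_links])
    p_gd = percentile_ranks([score for _, _, score in gd_links])

    # Index gene-disease percentiles by the intermediate gene.
    diseases_by_gene = {}
    for (gene, disease, _), p in zip(gd_links, p_gd):
        diseases_by_gene.setdefault(gene, []).append((disease, p))

    result = {}
    for (enhancer, gene, _), peg in zip(eg_links, p_eg):
        for disease, pgd in diseases_by_gene.get(gene, []):
            result[(enhancer, gene, disease)] = peg * pgd
    return result


if __name__ == "__main__":
    # Toy records with made-up identifiers and scores.
    eg = [("enh1", "geneA", 0.2), ("enh2", "geneA", 0.8), ("enh3", "geneB", 0.5)]
    gd = [("geneA", "disease1", 0.9), ("geneB", "disease2", 0.3)]
    for key, value in sorted(enhancer_disease_scores(eg, gd).items()):
        print(key, round(value, 3))
```

Because both percentiles lie in (0, 1], the product also lies in (0, 1] and keeps the same directionality as the input scores.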

Score enhancers based on a gene set

Given a gene set of interest (e.g. differentially expressed genes or disease/trait-related genes), an online software tool in HEDD can score enhancers based on their connections to genes and the centrality of genes in a gene network using the highly successful Google PageRank algorithm (43) that we implemented before (44). Briefly, for a set of genes, HEDD first scores the relatedness of each gene in a gene network to the gene set based on how all genes are wired in the network to the gene set. These scores are then transferred from genes to their enhancers. The score indicating the strength of connection between an enhancer and the gene set is defined by the mean or the sum of scores from all the target genes of this enhancer.
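The two steps above (scoring genes in a network, then transferring the scores to enhancers) could look roughly like the following sketch. It assumes a personalized PageRank with restarts on the input gene set over an undirected network; the text only names the PageRank algorithm, so the restart scheme, the damping factor of 0.85, the fixed iteration count, and all function and variable names are illustrative assumptions.

```python
def personalized_pagerank(edges, seed_genes, damping=0.85, iterations=100):
    """Score every gene in an undirected network for relatedness to a gene set.

    edges: iterable of (gene_a, gene_b) pairs.
    seed_genes: the gene set of interest; seeds absent from the network are ignored.
    Returns a dict {gene: score} whose values sum to 1.
    """
    neighbors = {}
    for a, b in edges:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)

    seeds = {g for g in seed_genes if g in neighbors}
    if not seeds:
        raise ValueError("no seed gene is present in the network")

    # Restart distribution: uniform over the seed genes found in the network.
    restart = {g: (1.0 / len(seeds) if g in seeds else 0.0) for g in neighbors}
    rank = dict(restart)

    for _ in range(iterations):
        new_rank = {g: (1.0 - damping) * restart[g] for g in neighbors}
        for gene, nbrs in neighbors.items():
            share = damping * rank[gene] / len(nbrs)
            for nb in nbrs:
                new_rank[nb] += share
        rank = new_rank
    return rank


def score_enhancers(gene_scores, enhancer_targets):
    """Transfer gene scores to enhancers as both the sum and the mean.

    enhancer_targets: dict {enhancer: list of target genes}.
    Enhancers with no target gene in the network are omitted.
    """
    scores = {}
    for enhancer, targets in enhancer_targets.items():
        values = [gene_scores[g] for g in targets if g in gene_scores]
        if values:
            total = sum(values)
            scores[enhancer] = {"sum": total, "mean": total / len(values)}
    return scores
```

Reporting both the mean and the sum mirrors the description above; the sum favors enhancers with many scored target genes, while the mean does not.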

Gene set-disease correlation analysis

Given a functionally coherent set of genes (e.g. from a differential gene expression analysis), HEDD can suggest their related diseases as a form of functional annotation based on the correlation between two sets of enhancer scores: scores of connections between enhancers and a disease and scores of enhancers based on the gene set in the gene network. With the first set of enhancer scores pre-computed for a number of diseases (and stored in HEDD), this correlation analysis uses enhancers as intermediates to link and score the connections between a gene set and diseases. The disease with the highest correlation coefficient could shed light on the molecular function and biological process of the given gene set.
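As a rough sketch of this correlation analysis, the code below correlates gene-set-based enhancer scores with pre-computed enhancer-disease scores and ranks diseases by the absolute coefficient. The text does not name the correlation measure, so the Pearson coefficient is an assumption, as are the restriction to enhancers scored in both sets, the minimum of three shared enhancers, and all function and variable names.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # correlation is undefined for a constant vector
    return cov / sqrt(var_x * var_y)


def rank_diseases(gene_set_scores, disease_scores, top=20, min_shared=3):
    """Rank diseases by absolute correlation with gene-set-based enhancer scores.

    gene_set_scores: dict {enhancer: score from the gene-set analysis}.
    disease_scores: dict {disease: {enhancer: enhancer-disease score}}.
    Returns up to `top` (disease, coefficient) pairs, strongest first.
    """
    results = []
    for disease, scores in disease_scores.items():
        # Assumption: only enhancers scored in both sets are compared.
        shared = sorted(set(gene_set_scores) & set(scores))
        if len(shared) < min_shared:
            continue
        r = pearson([gene_set_scores[e] for e in shared],
                    [scores[e] for e in shared])
        results.append((disease, r))
    results.sort(key=lambda item: abs(item[1]), reverse=True)
    return results[:top]
```

Sorting by the absolute value keeps strongly negative correlations in the ranking as well.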

DATABASE USE AND ACCESS

Database search and browsing

HEDD can be interactively searched and browsed in various ways (Figure 2). Users can search for enhancers by genomic region, disease, or target gene, or by conditions such as source, cell type/tissue, overlapped functional element (DHS, TFBS, histone mark, and repeat), transcription factor, and histone modification marker (Figure 2A). The search results, including genomic coordinates, target genes, cell types/tissues, conservation scores, and sources, are organized and returned in a tabular format (Figure 2B). Following links embedded in the enhancer IDs, users can examine on a new webpage the detailed annotation of every enhancer in the table (Figure 2), including information about its related diseases (Figure 2C2 and 2C7) with corresponding scores (and their quantiles), functional annotation of its variants from GWAS and CADD (Figure 2C8–9), cell/tissue types, enhancer network with its target genes, TFBS and neighboring enhancers, genome segmentation states, TFBS activity, and overlapped genomic elements (e.g. DHS, TFBS, histone modification and repeats; Figure 2C3–6 and 2C10–14). All search results can be downloaded (Figure 2C1) for further analysis.

Figure 2.

Interactive searching and browsing of HEDD. (A) Input parameters for a query. (B) The result table, including enhancer IDs, genomic coordinates, target genes, cell types/tissues, conservation scores and sources. (C) Details of a selected enhancer from the result table, including its associated diseases, functional annotation from GWAS and CADD, overlapped genomic elements such as DHS, TFBSs, histone modification and repeats, TFBS activity levels, genome segmentation states, the comparison among cell/tissue types, regulatory network, and neighboring enhancers.

Online analysis tools

On the ‘Analyze’ webpage, for a gene set of interest, users can score enhancers in a gene network for their ‘relatedness’ to the gene set (Supplementary Figure S1). Users can either select one of the seven gene networks currently available or upload a custom network (Supplementary Figure S1A). The scores (both mean and sum) are returned in descending order in a table (Supplementary Figure S1B2). Input genes that are present or absent in the selected/uploaded gene network are also reported (Supplementary Figure S1B1). Users can use these scores in a subsequent correlation analysis (Supplementary Figure S1C), in which the diseases with the highest absolute correlation coefficients (top 20) are shown in bar charts (Supplementary Figure S1D). We used a list of 242 schizophrenia genes to benchmark the running time for all networks that we make available online. It usually takes several minutes to score enhancers, depending on the selected gene network, and the correlation analysis takes about half an hour.

APPLICATION

Analysis of enhancer distribution and disease association in 9p21 locus

Chromosome 9p21 locus is a 13.3-Mb gene-poor genomic region, which contains many genetic variants associated with multiple human complex diseases, including coronary artery disease (CAD), glaucoma, diabetes, and several cancers. Most of the risk variants in this region are non-coding, suggesting that they influence gene expression and may act in cis (45). We analyzed the genomic distribution of enhancers in 9p21 and found eight regions with high densities of enhancers (Supplementary Figure S3A, Supplementary Table S1). The genomic distribution of enhancers with disease association is mostly consistent with the overall distribution of enhancers. Using enhancer-disease associations with the top 10% highest scores, we screened for diseases associated with each enhancer cluster and found the majority of these clusters are highly associated with cancer (Supplementary Figure S3B, Supplementary Table S1). Enhancers overlapping or near these risk variants could be the genomic functional elements underlying their disease association. Indeed, we identified several blocks of 9p21 region in which enhancers are scored high for corresponding variant-associated diseases (Supplementary Figure S2A). Block B contains two genomic regions enriched with risk SNPs of glaucoma and CAD (Supplementary Figure S2B). We found that enhancers in high linkage disequilibrium with those risk SNPs—rs523096 (46), rs4977756 (47,48), rs1333037 (49), rs1063192 (50), rs7865618 (51), rs2157719 (52) and rs7866783 (53) for glaucoma; rs1537370 (54), rs1333049 (55,56), rs10738607 (57), rs4977574 (58) and rs2891168 (59) for CAD—have the highest scores with these two diseases in this block. In block C, all the enhancers have the highest score with the small cell lung cancer (SCLC) among other diseases. 
Near one of these enhancers is a SNP, rs4246856, associated with D-dimer level (Supplementary Figure S2C), which has been shown to provide useful information for predicting the prognosis of patients with SCLC (60). Block D contains enhancers in TEK, a gene that also contains a SNP (rs2273720) associated with endothelial growth factor levels that are correlated with the formation of blood vessels. Interestingly, enhancers in this block have the highest scores with ‘venous malformations, multiple cutaneous and mucosal’, a disease related to blood vessels (Supplementary Figure S2D). Blocks E and F contain genes with multiple risk SNPs associated with obesity (Supplementary Figure S2E) and amyotrophic lateral sclerosis (Supplementary Figure S2F), respectively. Enhancers of these genes show higher scores for those two diseases than for other diseases. In block G, an enhancer strongly associated with congenital disorder of glycosylation was found near rs10971170 (61), a risk SNP related to IgG glycosylation (Supplementary Figure S2G), and could be the regulatory element underlying the genetic signal of the risk SNP.

Identification of potential regulatory causal variants for human complex diseases

HEDD can be used to identify potential regulatory causal variants in enhancers in post-GWAS analyses of human complex diseases. We analyzed glaucoma GWAS results as an example of this usage. We first collected 51 glaucoma-associated SNPs from the GWAS Catalog (33). To use linkage disequilibrium (LD) as a mapping tool to find potential causal variants in enhancers, we searched their vicinity (within 25 kb upstream and downstream) and found, near 36 of them, 2871 enhancers containing 42 923 variants in total (based on NCBI dbSNP as of April 2017). Of these, 907 enhancer variants have alleles with high CADD scores (≥ 20) and thus were considered candidate causal variants. For 129 of them, there is genotype information (from 2504 individuals) from the 1000 Genomes Project. We calculated the LD between each of the 36 GWAS SNPs and each of the 129 candidate causal variants in its neighborhood using the 1000 Genomes Project (Phase 3) genotype data. We identified one potential regulatory causal variant (rs8940). This enhancer variant is in relatively high linkage disequilibrium (r² = 0.565) with the glaucoma-associated SNP rs4236601. Interestingly, a previous study has reported that SNP rs8940 was associated with glaucoma (62).
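The candidate filtering and LD steps of this workflow can be illustrated with a small Python sketch. It is not the authors' pipeline: the function names and input layouts are hypothetical, the 25-kb window is applied directly to variant positions for simplicity (the text applies it to enhancers), and r² is computed with the standard haplotype-based formula r² = D² / (pA(1 − pA) pB(1 − pB)), where D = pAB − pA·pB, assuming phased biallelic haplotypes coded as 0/1.

```python
def candidate_variants(gwas_pos, variants, window=25_000, cadd_cutoff=20.0):
    """Select candidate causal variants near one GWAS SNP.

    gwas_pos: position of the GWAS SNP on its chromosome.
    variants: list of (variant_id, position, cadd_score) on the same chromosome.
    Keeps variants within `window` bp of the SNP with CADD >= `cadd_cutoff`.
    """
    return [
        variant_id
        for variant_id, position, cadd in variants
        if abs(position - gwas_pos) <= window and cadd >= cadd_cutoff
    ]


def ld_r2(hap_a, hap_b):
    """Haplotype-based r^2 between two biallelic variants.

    hap_a, hap_b: equal-length lists of 0/1 alleles, one entry per phased
    haplotype (two entries per individual).
    """
    n = len(hap_a)
    if n == 0 or n != len(hap_b):
        raise ValueError("haplotype vectors must be non-empty and equal in length")
    p_a = sum(hap_a) / n
    p_b = sum(hap_b) / n
    p_ab = sum(1 for a, b in zip(hap_a, hap_b) if a == 1 and b == 1) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return 0.0  # at least one variant is monomorphic in this sample
    d = p_ab - p_a * p_b
    return d * d / denom


if __name__ == "__main__":
    # Toy haplotypes, not real 1000 Genomes data.
    print(ld_r2([0, 0, 1, 1], [0, 0, 1, 1]))  # perfectly correlated alleles
    print(ld_r2([0, 1, 0, 1], [0, 0, 1, 1]))  # statistically independent alleles
```

In practice the haplotype vectors would be read from phased genotype files, and dedicated tools are normally used for LD calculation at scale.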

Building disease gene regulatory networks

Human complex diseases are results of gene dysregulation. It is thus particularly important to elucidate the regulatory relationships among genes related to a complex disease. Using enhancer-disease association data in HEDD, we can build a gene regulatory network for a complex disease based on enhancers associated with it and their connections with transcription factors and target genes. Using acute erythroblastic leukemia (AEL) as a disease example, we found 651 enhancers with high AEL-association scores (≥ 0.6). These enhancers are connected to 37 genes, including seven transcription factor genes. We built the gene regulatory network for AEL based on these 37 genes (five genes without interactions with other genes were removed, Supplementary Figure S3C). Transcription factor genes (GATA1, TAL1, SPI1, STAT1, and STAT3) display high network centrality, implying their important roles in the regulatory mechanism related to the disease pathology. Few of the interactions in the network have been reported previously; the exceptions are TAL1 targeting GATA1 (63,64) and GYPA (65). Therefore, gene regulatory networks built with HEDD enhancer data can be used to generate hypotheses about regulatory mechanisms of disease pathology.
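One way such a network could be assembled from enhancer-level records is sketched below. The data layout (dictionaries keyed by enhancer ID), the function names, and the use of degree as the centrality measure are assumptions for illustration; the text does not specify which centrality measure was used.

```python
from collections import defaultdict


def build_regulatory_network(enhancer_scores, enhancer_tfs, enhancer_targets,
                             cutoff=0.6):
    """Build directed TF -> target-gene edges through disease-associated enhancers.

    enhancer_scores: dict {enhancer: enhancer-disease association score}.
    enhancer_tfs: dict {enhancer: transcription factor genes binding it}.
    enhancer_targets: dict {enhancer: target genes of the enhancer}.
    Only enhancers scoring at or above `cutoff` contribute edges, so genes
    without any interaction never enter the network.
    """
    edges = set()
    for enhancer, score in enhancer_scores.items():
        if score < cutoff:
            continue
        for tf in enhancer_tfs.get(enhancer, ()):
            for target in enhancer_targets.get(enhancer, ()):
                if tf != target:
                    edges.add((tf, target))
    return edges


def degree_centrality(edges):
    """Count the edges touching each gene (in-degree plus out-degree)."""
    degree = defaultdict(int)
    for tf, target in edges:
        degree[tf] += 1
        degree[target] += 1
    return dict(degree)
```

Genes can then be ranked by degree to highlight candidate hub regulators for follow-up.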

CONCLUSION AND FUTURE DEVELOPMENT

We have built an integrated database for human enhancers and their disease associations. Our goal is to provide a comprehensive data resource and a set of interactive analysis tools to facilitate genomic research of enhancers and their roles in human complex diseases. We will continue to update the database with the latest data sets when they become available. In the future, we will add more genetic and epigenetic information about enhancers, such as topologically associating domains and the retargeting of enhancers in different cell types/tissues or cancers. We believe that our enhancer database will be of particular interest to researchers working on the gene regulatory networks of human diseases.

61 in total

1. SubNet: a Java application for subnetwork extraction.

Authors: Christophe Lemetre; Quanwei Zhang; Zhengdong D Zhang
Journal: Bioinformatics Date: 2013-08-13 Impact factor: 6.937

2. Super-enhancers in the control of cell identity and disease.

Authors: Denes Hnisz; Brian J Abraham; Tong Ihn Lee; Ashley Lau; Violaine Saint-André; Alla A Sigova; Heather A Hoke; Richard A Young
Journal: Cell Date: 2013-10-10 Impact factor: 41.582

3. Chromosome 9p21 SNPs Associated with Multiple Disease Phenotypes Correlate with ANRIL Expression.

Authors: Michael S Cunnington; Mauro Santibanez Koref; Bongani M Mayosi; John Burn; Bernard Keavney
Journal: PLoS Genet Date: 2010-04-08 Impact factor: 5.917

4. Different sequence requirements for expression in erythroid and megakaryocytic cells within a regulatory element upstream of the GATA-1 gene.

Authors: P Vyas; M A McDevitt; A B Cantor; S G Katz; Y Fujiwara; S H Orkin
Journal: Development Date: 1999-06 Impact factor: 6.868

5. A promoter-level mammalian expression atlas.

Authors: Alistair R R Forrest; Hideya Kawaji; Michael Rehli; J Kenneth Baillie; Michiel J L de Hoon; Vanja Haberle; Timo Lassmann; Ivan V Kulakovskiy; Marina Lizio; Masayoshi Itoh; Robin Andersson; Christopher J Mungall; Terrence F Meehan; Sebastian Schmeier; Nicolas Bertin; Mette Jørgensen; Emmanuel Dimont; Erik Arner; Christian Schmidl; Ulf Schaefer; Yulia A Medvedeva; Charles Plessy; Morana Vitezic; Jessica Severin; Colin A Semple; Yuri Ishizu; Robert S Young; Margherita Francescatto; Intikhab Alam; Davide Albanese; Gabriel M Altschuler; Takahiro Arakawa; John A C Archer; Peter Arner; Magda Babina; Sarah Rennie; Piotr J Balwierz; Anthony G Beckhouse; Swati Pradhan-Bhatt; Judith A Blake; Antje Blumenthal; Beatrice Bodega; Alessandro Bonetti; James Briggs; Frank Brombacher; A Maxwell Burroughs; Andrea Califano; Carlo V Cannistraci; Daniel Carbajo; Yun Chen; Marco Chierici; Yari Ciani; Hans C Clevers; Emiliano Dalla; Carrie A Davis; Michael Detmar; Alexander D Diehl; Taeko Dohi; Finn Drabløs; Albert S B Edge; Matthias Edinger; Karl Ekwall; Mitsuhiro Endoh; Hideki Enomoto; Michela Fagiolini; Lynsey Fairbairn; Hai Fang; Mary C Farach-Carson; Geoffrey J Faulkner; Alexander V Favorov; Malcolm E Fisher; Martin C Frith; Rie Fujita; Shiro Fukuda; Cesare Furlanello; Masaaki Furino; Jun-ichi Furusawa; Teunis B Geijtenbeek; Andrew P Gibson; Thomas Gingeras; Daniel Goldowitz; Julian Gough; Sven Guhl; Reto Guler; Stefano Gustincich; Thomas J Ha; Masahide Hamaguchi; Mitsuko Hara; Matthias Harbers; Jayson Harshbarger; Akira Hasegawa; Yuki Hasegawa; Takehiro Hashimoto; Meenhard Herlyn; Kelly J Hitchens; Shannan J Ho Sui; Oliver M Hofmann; Ilka Hoof; Furni Hori; Lukasz Huminiecki; Kei Iida; Tomokatsu Ikawa; Boris R Jankovic; Hui Jia; Anagha Joshi; Giuseppe Jurman; Bogumil Kaczkowski; Chieko Kai; Kaoru Kaida; Ai Kaiho; Kazuhiro Kajiyama; Mutsumi Kanamori-Katayama; Artem S Kasianov; Takeya Kasukawa; Shintaro Katayama; Sachi Kato; Shuji Kawaguchi; Hiroshi Kawamoto; Yuki I Kawamura; 
Tsugumi Kawashima; Judith S Kempfle; Tony J Kenna; Juha Kere; Levon M Khachigian; Toshio Kitamura; S Peter Klinken; Alan J Knox; Miki Kojima; Soichi Kojima; Naoto Kondo; Haruhiko Koseki; Shigeo Koyasu; Sarah Krampitz; Atsutaka Kubosaki; Andrew T Kwon; Jeroen F J Laros; Weonju Lee; Andreas Lennartsson; Kang Li; Berit Lilje; Leonard Lipovich; Alan Mackay-Sim; Ri-ichiroh Manabe; Jessica C Mar; Benoit Marchand; Anthony Mathelier; Niklas Mejhert; Alison Meynert; Yosuke Mizuno; David A de Lima Morais; Hiromasa Morikawa; Mitsuru Morimoto; Kazuyo Moro; Efthymios Motakis; Hozumi Motohashi; Christine L Mummery; Mitsuyoshi Murata; Sayaka Nagao-Sato; Yutaka Nakachi; Fumio Nakahara; Toshiyuki Nakamura; Yukio Nakamura; Kenichi Nakazato; Erik van Nimwegen; Noriko Ninomiya; Hiromi Nishiyori; Shohei Noma; Shohei Noma; Tadasuke Noazaki; Soichi Ogishima; Naganari Ohkura; Hiroko Ohimiya; Hiroshi Ohno; Mitsuhiro Ohshima; Mariko Okada-Hatakeyama; Yasushi Okazaki; Valerio Orlando; Dmitry A Ovchinnikov; Arnab Pain; Robert Passier; Margaret Patrikakis; Helena Persson; Silvano Piazza; James G D Prendergast; Owen J L Rackham; Jordan A Ramilowski; Mamoon Rashid; Timothy Ravasi; Patrizia Rizzu; Marco Roncador; Sugata Roy; Morten B Rye; Eri Saijyo; Antti Sajantila; Akiko Saka; Shimon Sakaguchi; Mizuho Sakai; Hiroki Sato; Suzana Savvi; Alka Saxena; Claudio Schneider; Erik A Schultes; Gundula G Schulze-Tanzil; Anita Schwegmann; Thierry Sengstag; Guojun Sheng; Hisashi Shimoji; Yishai Shimoni; Jay W Shin; Christophe Simon; Daisuke Sugiyama; Takaai Sugiyama; Masanori Suzuki; Naoko Suzuki; Rolf K Swoboda; Peter A C 't Hoen; Michihira Tagami; Naoko Takahashi; Jun Takai; Hiroshi Tanaka; Hideki Tatsukawa; Zuotian Tatum; Mark Thompson; Hiroo Toyodo; Tetsuro Toyoda; Elvind Valen; Marc van de Wetering; Linda M van den Berg; Roberto Verado; Dipti Vijayan; Ilya E Vorontsov; Wyeth W Wasserman; Shoko Watanabe; Christine A Wells; Louise N Winteringham; Ernst Wolvetang; Emily J Wood; Yoko Yamaguchi; Masayuki 
Yamamoto; Misako Yoneda; Yohei Yonekura; Shigehiro Yoshida; Susan E Zabierowski; Peter G Zhang; Xiaobei Zhao; Silvia Zucchelli; Kim M Summers; Harukazu Suzuki; Carsten O Daub; Jun Kawai; Peter Heutink; Winston Hide; Tom C Freeman; Boris Lenhard; Vladimir B Bajic; Martin S Taylor; Vsevolod J Makeev; Albin Sandelin; David A Hume; Piero Carninci; Yoshihide Hayashizaki
Journal: Nature Date: 2014-03-27 Impact factor: 49.962

6. A proteome-scale map of the human interactome network.

Authors: Thomas Rolland; Murat Taşan; Benoit Charloteaux; Samuel J Pevzner; Quan Zhong; Nidhi Sahni; Song Yi; Irma Lemmens; Celia Fontanillo; Roberto Mosca; Atanas Kamburov; Susan D Ghiassian; Xinping Yang; Lila Ghamsari; Dawit Balcha; Bridget E Begg; Pascal Braun; Marc Brehme; Martin P Broly; Anne-Ruxandra Carvunis; Dan Convery-Zupan; Roser Corominas; Jasmin Coulombe-Huntington; Elizabeth Dann; Matija Dreze; Amélie Dricot; Changyu Fan; Eric Franzosa; Fana Gebreab; Bryan J Gutierrez; Madeleine F Hardy; Mike Jin; Shuli Kang; Ruth Kiros; Guan Ning Lin; Katja Luck; Andrew MacWilliams; Jörg Menche; Ryan R Murray; Alexandre Palagi; Matthew M Poulin; Xavier Rambout; John Rasla; Patrick Reichert; Viviana Romero; Elien Ruyssinck; Julie M Sahalie; Annemarie Scholz; Akash A Shah; Amitabh Sharma; Yun Shen; Kerstin Spirohn; Stanley Tam; Alexander O Tejeda; Shelly A Trigg; Jean-Claude Twizere; Kerwin Vega; Jennifer Walsh; Michael E Cusick; Yu Xia; Albert-László Barabási; Lilia M Iakoucheva; Patrick Aloy; Javier De Las Rivas; Jan Tavernier; Michael A Calderwood; David E Hill; Tong Hao; Frederick P Roth; Marc Vidal
Journal: Cell Date: 2014-11-20 Impact factor: 41.582

7. DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes.

Authors: Janet Piñero; Núria Queralt-Rosinach; Àlex Bravo; Jordi Deu-Pons; Anna Bauer-Mehren; Martin Baron; Ferran Sanz; Laura I Furlong
Journal: Database (Oxford) Date: 2015-04-15 Impact factor: 3.451

8. Integrative annotation of chromatin elements from ENCODE data.

Authors: Michael M Hoffman; Jason Ernst; Steven P Wilder; Anshul Kundaje; Robert S Harris; Max Libbrecht; Belinda Giardine; Paul M Ellenbogen; Jeffrey A Bilmes; Ewan Birney; Ross C Hardison; Ian Dunham; Manolis Kellis; William Stafford Noble
Journal: Nucleic Acids Res Date: 2012-12-05 Impact factor: 16.971

9. Integrative analysis of 111 reference human epigenomes.

Authors: Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal: Nature Date: 2015-02-19 Impact factor: 69.504

10. UniHI 7: an enhanced database for retrieval and interactive analysis of human molecular interaction networks.

Authors: Ravi Kiran Reddy Kalathur; José Pedro Pinto; Miguel A Hernández-Prieto; Rui S R Machado; Dulce Almeida; Gautam Chaurasia; Matthias E Futschik
Journal: Nucleic Acids Res Date: 2013-11-08 Impact factor: 16.971
