| Literature DB >> 21736759 |
Geir K Sandve1, Sveinung Gundersen, Halfdan Rydbeck, Ingrid K Glad, Lars Holden, Marit Holden, Knut Liestøl, Trevor Clancy, Finn Drabløs, Egil Ferkingstad, Morten Johansen, Vegard Nygaard, Eivind Tøstesen, Arnoldo Frigessi, Eivind Hovig.
Abstract
BACKGROUND: Transcription factors in disease-relevant pathways represent potential drug targets, by impacting a distinct set of pathways that may be modulated through gene regulation. The influence of transcription factors is typically studied on a per disease basis, and no current resources provide a global overview of the relations between transcription factors and disease. Furthermore, existing pipelines for related large-scale analysis are tailored for particular sources of input data, and there is a need for generic methodology for integrating complementary sources of genomic information.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21736759 PMCID: PMC3160420 DOI: 10.1186/1471-2164-12-353
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Schematic model of regulome construction. Two input sources are selected, e.g a set of TF tracks and a set of disease tracks. For each combination, the pairwise relation model, in this case the number or genes containing TF binding locations, is evaluated and subsequently differentiated against the full matrix of counts. The main output is an interactive heat map of over-/under-representation that for each entry also includes detailed information and links to follow-up analysis. The regulome construction is performed by a web-based system, the Genomic HyperBrowser [33], that allows input data, a pairwise relation model and a measure of deviation to be selected.
Figure 2A screenshot of the differential disease regulome, using Google Maps API . Detailed information about each disease-TF combination is available. The selected cell contains information about the overrepresentation of HIF-1alpha in the regulation of the genes associated to Barret Esophagus, a relation previously reported [35].
Figure 3A small part of the differential disease regulome, showing a cluster of TFs associated with a range of lipid metabolism disorders in addition to gallstone, including dendrograms of the hierarchical clustering of the TFs and diseases. The disease category "Diseases in Twins" seems to have joined the cluster because of twin studies on lipid metabolism and gallstone. The color of each square indicates the difference between the observed and expected number of gene regions with TF bindings, as denoted by z-scores calculated under the specific null hypothesis. Black denotes no difference, blue to cyan (lowest) denotes under-representation, while red to yellow (highest) represent over-representation of TF binding. Small circles denote significant under-or over-representation, as calculated by the appropriate hypothesis test (see Additional file 1). Different parts of the regulome have been joined together in the figure and dendrograms have been shortened for illustrative purposes.
Figure 4Venn-diagram of the number of contributing genes that are unique for different combinations of the four immune-related clusters analyzed, one pair of which were based on NF-κB/Rel-related TF motifs (I) and the other pair on IRF/Stat1/Cutl1-related TF motifs (II). Note that each cluster pair is comprised of one cluster found in the disease regulome and one found in the TF vs Gene Ontology regulome. Only the 100 genes with highest hit rate were considered (including all genes with the exact same hit rate as the 100th gene). The gene symbols for a select set of combinations are presented in Table 1. The figure was created with the online tool VENNY, by Oliveros, J.C. http://bioinfogp.cnb.csic.es/tools/venny/index.html.
Lists of unique genes contributing to the immune clusters
| All | Both NF-κB/Rel | Both IRF/Stat1/Cutl1 | Both disease | Both GO | NF-κB/Rel | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| HLA-A | 22.1% | TNF | 99.5% | TLR4 | 12.1% | HLA-B | 30.1% | CSF1 | 19.3% | ICAM1 | 74.7% |
| NOD2 | 20.1% | LTA | 58.3% | IL6 | 9.5% | CARD15 | 22.3% | TNFSF13 | 13.3% | IRF6 | 44.0% |
| STAT1 | 12.2% | TGFB1 | 36.9% | CTLA4 | 9.4% | VDR | 17.5% | FLT3LG | 8.0% | CD69 | 28.9% |
| CD40 | 11.6% | CD4 | 24.8% | CCR5 | 8.5% | VEGF | 4.1% | RELB | 6.9% | REL | 27.7% |
| TAP1 | 10.7% | AKT1 | 12.1% | TLR3 | 7.7% | NFKBIZ | 3.3% | CD27 | 6.0% | TNFRSF4 | 21.1% |
| NFKBIA | 10.7% | LTB | 11.6% | HLA-DRB1 | 7.2% | COL6A1 | 2.4% | PER1 | 5.9% | RELA | 18.7% |
| IRF1 | 10.1% | IL1RN | 10.2% | IL2 | 5.9% | PSORS1 | 2.1% | PTPN6 | 5.6% | EDC4 | 16.9% |
| CXCL10 | 10.0% | IL2RA | 10.2% | IFNB1 | 4.2% | CYBA | 1.7% | LCK | 5.3% | CD58 | 14.5% |
| IRF5 | 7.3% | TNFRSF1B | 5.3% | CCL2 | 4.1% | DDAH2 | 1.5% | MYD88 | 5.0% | TNFRSF18 | 14.1% |
| PSMB8 | 6.7% | CD86 | 5.3% | TNFSF13B | 3.5% | CYP27B1 | 1.4% | PTMA | 4.8% | CD83 | 14.1% |
| PSMB9 | 6.2% | ITGAM | 5.1% | MX1 | 3.3% | PLAU | 1.4% | IL27 | 4.7% | TNFRSF9 | 12.9% |
| FAS | 5.5% | TRADD | 5.0% | STAT5A | 3.0% | RUNX1 | 1.3% | B2M | 4.3% | IL17C | 12.0% |
| IL7R | 2.6% | MIF | 3.9% | NOD1 | 3.0% | RUNX3 | 1.2% | IRF2 | 3.1% | CD5 | 10.8% |
| IFIH1 | 1.9% | NFKBIB | 3.4% | TLR1 | 2.9% | PAX2 | 1.1% | STAT3 | 3.0% | CD70 | 9.6% |
| CXCL5 | 3.2% | HLA-DMA | 2.3% | CCND1 | 1.0% | TAPBP | 2.9% | DPP4 | 9.6% | ||
| PTGS1 | 3.0% | CASP1 | 2.1% | RXRB | 1.0% | BIRC3 | 2.5% | NFATC2 | 9.0% | ||
| SOCS1 | 1.7% | HIF1A | 0.8% | TRAF2 | 9.0% | ||||||
| TRIM21 | 1.3% | CREB1 | 8.0% | ||||||||
| CCL21 | 1.1% | ... | |||||||||
Lists of genes that are unique for selected combinations of the four immune-related clusters analyzed (see Figure 4). For each gene, the hit rate (proportion of cluster where the gene is relevant to disease regulation) is presented. The NF-κB/Rel cluster in the Gene Ontology regulome is included because of the large hit rates (only the top of the list is shown). Note that TNF is relevant for regulation for nearly all disease-TF pairs in both NF-κB/Rel-related clusters. The full gene listing is included in Additional file 4.