| Literature DB >> 29363429 |
Abstract
BACKGROUND: Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes.Entities:
Keywords: ChIP-seq; Ctcf; Enhancer; Functional enrichment; Gene expression; Open chromatin regions; Transcription factors; Transcriptional target genes
Mesh:
Substances:
Year: 2018 PMID: 29363429 PMCID: PMC5780744 DOI: 10.1186/s12864-017-4339-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Analyses of functional enrichments of putative transcriptional target genes. Transcriptional target genes were predicted using open chromatin regions (DNase-DGF) and known transcription factor binding sequences. Functional enrichments of target genes were analyzed using 10 annotation databases, and were changed based on the criteria of promoter and extended regions for enhancer-promoter association (EPA). To compare with the tendency of the normalized numbers of functional enrichments, the median expression levels of target genes were examined using promoter and EPA
Number of functional enrichments and unique functional enrichments of putative transcriptional target genes. (see the colored table in Additional file 1: Figure S3)
| Number of functional enrichments of putative transcriptional target genes | ||||||||||
| KEGG | TF Targets | CTD Ontology | GO Slim | GO | Pathway Commons | BioMarkers | MicroRNA | Domains | WikiPathways | |
| Monocyte | 349 | 107 | 209 | 114 | 2902 | 1005 | 42 | 451 | 1202 | 242 |
| CD4+ T cell | 317 | 135 | 278 | 77 | 4077 | 1806 | 47 | 754 | 1401 | 405 |
| CD20+ B cell | 323 | 103 | 170 | 88 | 2778 | 821 | 39 | 948 | 950 | 288 |
| Number of unique functional enrichments of gene expression information alone and putative transcriptional target genes | ||||||||||
| Gene expression information alone, independent of transcriptional target genes. | ||||||||||
| Monocyte | 43 | 0 | 35 | 11 | 237 | 101 | 7 | 314 | 404 | 58 |
| CD4+ T cell | 47 | 0 | 19 | 9 | 301 | 165 | 9 | 136 | 397 | 81 |
| CD20+ B cell | 42 | 0 | 27 | 12 | 239 | 247 | 6 | 370 | 409 | 65 |
| Putative transcriptional target genes. | ||||||||||
| Monocyte | 95 | 16 | 127 | 12 | 1271 | 242 | 17 | 97 | 303 | 105 |
| CD4+ T cell | 105 | 26 | 146 | 23 | 1654 | 415 | 24 | 224 | 585 | 133 |
| CD20+ B cell | 93 | 23 | 96 | 23 | 1192 | 329 | 16 | 231 | 397 | 106 |
Fig. 2Effect of randomly selected genes on functional enrichments. a Effect of randomly selected genes on functional enrichments using DNase-DGF data. Transcriptional target genes were predicted using DNase-DGF data in human monocytes, CD4+ T, CD20+ B, other four somatic and two stem cells (H1-hESC and iPSC) (see also Additional file 1: Figure S1). To test whether slight changes of transcriptional target genes were reflected in the normalized number of their functional enrichments, the ratio of randomly selected genes in the target genes of each TF was changed between 5% and 60%. In the left part of the graphs, randomly selected genes were replaced with the target genes where the total number of target genes was unchanged. In the right part of the graphs, randomly selected genes were added to the target genes where the total number of target genes was increased. The result of Gene Ontology annotation was shown. The results of Pathway Commons and KEGG were shown in Additional file 1: Figure S1. Native target genes showed the most functional enrichments in most cell types. b Effect of randomly selected genes on functional enrichments using ChIP-seq data. Transcriptional target genes were predicted using ChIP-seq data of 19 TF in H1-hESC. The results of nine functional annotation databases were shown and the result of target genes of microRNAs was shown in Additional file 1: Figure S2. Native target genes showed the most functional enrichments using most annotation databases except for low frequent functional annotations. Putative transcriptional target genes tend to include similar function of genes
Functional enrichments of putative transcriptional target genes using Pathway Commons
| Monocytes – Pathway Commons | No. of TFs |
| Proteoglycan syndecan-mediated signaling events | 18 |
| Regulation of CDC42 activity | 15 |
| LKB1 signaling events | 14 |
| Glypican pathway | 13 |
| Interferon Signaling | 13 |
| Sphingosine 1-phosphate (S1P) pathway | 13 |
| IL5-mediated signaling events | 12 |
| Syndecan-1-mediated signaling events | 12 |
| IL3-mediated signaling events | 11 |
| Mitotic Prophase | 11 |
| Golgi Cisternae Pericentriolar Stack Reorganization | 11 |
| Interferon alpha/beta signaling | 11 |
| IFN-gamma pathway | 10 |
| Signaling events mediated by Hepatocyte Growth Factor Receptor (c-Met) | 10 |
| Recruitment of mitotic centrosome proteins and complexes | 10 |
| CD4+ T cell – Pathway Commons | No. of TFs |
| TCR signaling in naive CD8+ T cells | 36 |
| IL12-mediated signaling events | 24 |
| Downstream signaling in naive CD8+ T cells | 21 |
| TCR signaling in naive CD4+ T cells | 21 |
| IL12 signaling mediated by STAT4 | 20 |
| Validated transcriptional targets of AP1 family members Fra1 and Fra2 | 19 |
| CXCR4-mediated signaling events | 17 |
| ATF-2 transcription factor network | 15 |
| Thrombin/protease-activated receptor (PAR) pathway | 14 |
| TCR signaling | 14 |
| PAR1-mediated thrombin signaling events | 14 |
| Downstream TCR signaling | 14 |
| Internalization of ErbB1 | 13 |
| Urokinase-type plasminogen activator (uPA) and uPAR-mediated signaling | 13 |
| ErbB receptor signaling network | 13 |
| CD20+ B cell – Pathway Commons | No. of TFs |
| Interferon alpha/beta signaling | 12 |
| Alpha6Beta4Integrin | 11 |
| Validated targets of C-MYC transcriptional activation | 11 |
| IL8- and CXCR2-mediated signaling events | 10 |
| Antigen processing-Cross presentation | 9 |
| BCR signaling pathway | 9 |
| IL6-mediated signaling events | 9 |
| Cell junction organization | 9 |
| ER-Phagosome pathway | 8 |
| Regulation of CDC42 activity | 8 |
| CXCR4-mediated signaling events | 8 |
| CDC42 signaling events | 8 |
| Syndecan-4-mediated signaling events | 8 |
| Noncanonical Wnt signaling pathway | 7 |
| Class I MHC mediated antigen processing & presentation | 7 |
Fig. 3Criteria of promoter and extended regions for enhancer-promoter association and features of chromatin interactions. a Computationally-defined regulatory domains [11]. The transcription start site (TSS) of each gene is indicated as an arrow. The corresponding regulatory domain for each gene is shown in a matching color as a bracketed line. The basal plus extension association rule assigns a basal regulatory domain to each gene regardless of genes nearby (thick line, Association rule 1 and 2) (see Methods). The domain is then extended to the basal regulatory domain of the nearest upstream and downstream genes. The two nearest genes association rule extends the regulatory domain to the TSS of the nearest upstream and downstream genes (Association rule 3). The single nearest gene association rule extends the regulatory domain to the midpoint between this gene’s TSS and the nearest gene’s TSS both upstream and downstream (Association rule 4). b Forward–reverse orientation of CTCF-binding sites are frequently found in chromatin interactions. CTCF can block the interaction between enhancers and promoters limiting the activity of enhancers to certain functional domains. Figures adapted from [28, 69–71] with permission
Normalized number of functional enrichments of putative transcriptional target genes using promoter and extended regions for enhancer-promoter association (see the colored table in Additional file 1: Figure S4)
| Monocytes | KEGG | CTD Ontology | GO Slim | GO | Pathway Commons | Domains | Wiki Pathways |
|---|---|---|---|---|---|---|---|
| Promoter | 8.46 | 5.07 |
| 70.34 | 24.36 | 29.13 | 5.87 |
| Association rule 1 | 10.44 | 6.48 | 2.01 | 133.54 | 44.75 | 42.19 | 11.10 |
| Association rule 2 | 9.03 | 6.29 | 1.45 | 125.13 | 38.69 | 40.17 | 8.30 |
| Association rule 3 | 8.06 | 5.29 | 1.37 | 106.60 | 24.25 | 38.96 | 7.62 |
| Association rule 4 |
|
| 2.46 |
|
|
|
|
| CD4+ T cell | |||||||
| Promoter | 5.30 | 4.64 | 1.29 | 68.11 | 30.17 | 23.41 | 6.77 |
| Association rule 1 | 13.60 | 7.07 | 2.74 | 142.40 | 84.51 | 43.78 | 13.65 |
| Association rule 2 | 13.57 | 6.69 | 3.05 | 141.15 | 86.36 | 46.33 | 12.02 |
| Association rule 3 | 12.40 | 5.89 | 2.50 | 115.76 | 68.84 | 41.85 | 10.00 |
| Association rule 4 |
|
|
|
|
|
|
|
| CD20+ B cell | |||||||
| Promoter | 7.02 | 3.70 | 1.91 | 60.39 | 17.85 | 20.65 | 6.26 |
| Association rule 1 | 8.88 | 6.21 | 2.59 | 104.55 | 59.32 | 34.34 | 8.29 |
| Association rule 2 | 8.60 | 5.32 | 1.55 | 105.34 | 57.31 | 38.05 | 9.95 |
| Association rule 3 | 9.01 | 5.28 | 1.42 | 88.85 | 26.49 | 35.17 | 8.26 |
| Association rule 4 |
|
|
|
|
|
|
|
| HUVEC | |||||||
| Promoter | 7.50 | 5.74 | 1.34 | 79.71 | 22.21 | 22.40 | 4.59 |
| Association rule 1 | 11.62 | 8.95 | 3.74 | 160.52 | 37.11 | 65.15 | 8.88 |
| Association rule 2 | 11.87 | 8.77 | 4.76 | 176.26 | 36.86 | 74.71 | 8.66 |
| Association rule | 9.84 | 7.70 | 4.56 | 154.06 | 34.29 | 73.98 | 7.74 |
| Association rule 4 |
|
|
|
|
|
|
|
| IMR90 | |||||||
| Promoter | 6.63 | 5.21 | 2.43 | 68.31 | 11.64 | 25.50 | 5.47 |
| Association rule 1 | 9.63 | 6.78 | 2.85 | 122.38 | 24.89 | 65.39 | 8.03 |
| Association rule 2 | 9.32 | 6.56 | 1.43 | 115.31 | 16.36 | 60.36 | 7.47 |
| Association rule 3 | 9.59 | 5.63 | 1.55 | 99.10 | 15.39 | 54.82 | 5.82 |
| Association rule 4 |
|
|
|
|
|
|
|
| MCF-7 | |||||||
| Promoter | 6.97 | 5.43 | 1.69 | 77.88 | 24.71 | 25.99 | 8.77 |
| Association rule 1 | 9.45 | 6.26 | 2.03 | 121.84 | 22.38 | 55.19 | 9.86 |
| Association rule 2 | 9.44 | 6.18 | 0.89 | 111.77 | 21.59 | 53.90 | 7.97 |
| Association rule 3 | 9.83 | 5.13 | 1.12 | 100.35 | 18.96 | 55.30 | 7.37 |
| Association rule 4 |
|
|
|
|
|
|
|
| HMEC | |||||||
| Promoter | 8.76 | 5.03 | 1.53 | 65.80 | 14.07 | 26.52 | 6.70 |
| Association rule 1 | 10.88 | 7.04 | 1.96 | 138.27 | 18.28 | 72.55 | 8.08 |
| Association rule 2 | 9.22 | 6.31 | 1.63 | 141.57 | 18.84 | 73.09 | 7.99 |
| Association rule 3 | 9.30 | 6.50 | 1.68 | 123.46 | 17.27 | 64.67 | 7.75 |
| Association rule 4 |
|
|
|
|
|
|
|
| H1-hESC | |||||||
| Promoter | 8.45 | 3.47 | 2.49 | 73.33 | 14.12 | 24.35 | 4.84 |
| Association rule 1 | 9.54 | 4.82 | 1.97 | 113.37 | 21.86 | 37.66 | 6.30 |
| Association rule 2 |
|
| 1.69 | 123.45 | 21.27 | 38.26 | 5.35 |
| Association rule 3 | 8.00 | 4.84 | 1.43 | 101.12 | 18.00 | 33.59 | 5.62 |
| Association rule 4 | 10.56 | 5.37 |
|
|
|
|
|
| iPSC | |||||||
| Promoter | 8.64 | 2.05 | 1.58 | 67.32 | 22.75 | 12.36 | 6.42 |
| Association rule 1 | 10.25 | 5.64 | 1.66 | 107.50 | 18.35 | 26.77 | 8.59 |
| Association rule 2 |
| 6.20 | 1.01 | 101.76 | 17.00 | 28.13 | 8.16 |
| Association rule 3 | 11.69 | 6.89 | 1.01 | 90.85 | 16.60 | 24.89 | 7.76 |
| Association rule 4 | 11.39 |
|
|
|
|
|
|
The bold numbers are the highest numbers in each cell type and functional annotation database
Differences in functional enrichments between EPA and promoters using Pathway Commons
| Cell type | Term of functional annotation | No. of TF EPA (Association rule 4) | No. of TF Promoter | Experimental support | |
|---|---|---|---|---|---|
| Monocytes | IFN-gamma pathway | 51 | 10 | 6.89 × 10−4 | [ |
| IL3-mediated signaling events | 46 | 11 | 4.69 × 10−3 | [ | |
| GMCSF-mediated signaling events | 43 | 10 | 5.25 × 10−3 | [ | |
| PDGF receptor signaling network | 43 | 8 | 1.34 × 10−2 | [ | |
| VEGF and VEGFR signaling network | 44 | 9 | 2.10 × 10−3 | [ | |
| CD4+ T cell | Integrin family cell surface interactions | 112 | 13 | 2.59 × 10−12 | [ |
| IFN-gamma pathway | 107 | 12 | 5.26 × 10−12 | [ | |
| LKB1 signaling events | 107 | 11 | 1.96 × 10−12 | [ | |
| TCR signaling in naive CD4+ T cells | 103 | 21 | 3.98 × 10−8 | [ | |
| IL3-mediated signaling events | 100 | 8 | 9.54 × 10−13 | [ | |
| CD20+ B cell | Integrin family cell surface interactions | 69 | 2 | 5.17 × 10−11 | [ |
| IL5-mediated signaling events | 64 | 0 | 2.21 × 10−11 | [ | |
| Insulin Pathway | 63 | 0 | 3.15 × 10−11 | [ | |
| mTOR signaling pathway | 63 | 0 | 3.15 × 10−11 | [ | |
| Sphingosine 1-phosphate (S1P) pathway | 63 | 0 | 3.15 × 10−11 | [ |
Immune cell-related functional annotations were enriched more in ‘promoter and extended regions for enhancer-promoter association (EPA)’ than promoters. Five annotations are shown in each cell type. These functions are confirmed to be related to the cellular functions by reference journal papers. Chi-square tests were conducted using the total number of putative transcriptional target genes (Additional file 1: Figure S14)
Normalized number of functional enrichments of putative transcriptional target genes using CTCF binding sites (see the colored table in Additional file 1: Figure S5)
| Monocytes | KEGG | CTD Ontology | GO Slim | GO | Pathway Commons | Domains | Wiki Pathways |
|---|---|---|---|---|---|---|---|
| Association rule 4 | 11.47 | 8.22 | 2.46 | 164.18 | 71.42 | 47.85 | 12.78 |
| CTCF (FR + RF + FF + RR) | 13.19 | 10.39 | 2.74 | 134.26 | 34.37 | 44.46 | 12.96 |
| CTCF (FR) |
|
|
|
|
|
|
|
| CD4+ T cell | |||||||
| Association rule 4 | 16.40 | 7.86 | 4.03 | 177.55 | 108.08 | 53.86 | 16.72 |
| CTCF (FR + RF + FF + RR) | 26.33 | 8.73 | 5.11 | 206.05 | 130.71 | 57.53 | 23.56 |
| CTCF (FR) |
|
|
|
|
|
|
|
| CD20+ B cell | |||||||
| Association rule 4 | 9.95 | 6.62 | 3.07 | 134.46 | 90.99 | 41.30 | 10.67 |
| CTCF (FR + RF + FF + RR) | 8.78 | 4.22 | 2.78 | 94.13 | 27.70 | 28.55 | 7.08 |
| CTCF (FR) |
|
|
|
|
|
|
|
|
| |||||||
| Association rule 4 | 13.46 | 8.99 | 5.08 | 191.73 | 59.10 | 82.65 | 10.13 |
| CTCF (FR + RF + FF + RR) | 19.35 | 10.59 | 3.85 | 155.11 | 28.36 | 57.87 | 13.58 |
| CTCF (FR) |
|
|
|
|
|
|
|
| IMR90 | |||||||
| Association rule 4 | 10.55 | 7.68 | 3.76 | 142.10 | 30.05 | 74.86 | 8.78 |
| CTCF (FR + RF + FF + RR) | 13.88 | 17.55 | 9.22 | 252.24 | 39.92 | 123.18 | 22.45 |
| CTCF (FR) |
|
|
|
|
|
|
|
| MCF-7 | |||||||
| Association rule 4 | 10.80 | 7.38 | 2.77 | 149.64 | 31.20 | 62.40 | 11.07 |
| CTCF (FR + RF + FF + RR) | 11.39 | 8.30 | 4.92 | 234.48 | 30.40 | 126.70 | 13.99 |
| CTCF (FR) |
|
|
|
|
|
|
|
| HMEC | |||||||
| Association rule 4 | 11.78 | 7.53 | 3.48 | 166.71 | 23.02 | 90.11 | 10.10 |
| CTCF (FR + RF + FF + RR) | 19.68 | 13.54 | 4.29 | 232.47 | 27.97 | 106.24 | 9.84 |
| CTCF (FR) |
|
|
|
|
|
|
|
| H1-hESC | |||||||
| Association rule 4 | 10.56 | 5.37 | 3.27 | 141.38 | 35.12 | 46.16 | 6.73 |
| CTCF (FR + RF + FF + RR) | 12.95 | 5.37 | 2.45 | 131.46 | 29.57 | 44.15 | 11.08 |
| CTCF (FR) |
|
|
|
|
|
|
|
| iPSC | |||||||
| Association rule 4 | 11.39 |
| 2.53 | 136.46 | 23.60 | 31.49 | 9.27 |
| CTCF (FR + RF + FF + RR) | 17.17 | 2.33 | 1.69 | 120.87 | 25.60 | 21.06 | 6.42 |
| CTCF (FR) |
| 6.09 |
|
|
|
|
|
The bold numbers are the highest numbers in each cell type and functional annotation database
Differences in functional enrichments between EPA shortened at FR CTCF and EPA without CTCF using Pathway Commons
| Cell type | Term of functional annotation | No. of TF EPA (Association rule 4) shortened at FR CTCF | No. of TF EPA (Association rule 4) without CTCF | Experimental support | |
|---|---|---|---|---|---|
| Monocytes | CXCR4-mediated signaling events | 33 | 14 | 7.81 × 10−12 | [ |
| GPVI-mediated activation cascade | 26 | 3 | 8.97 × 10−15 | [ | |
| Signaling events mediated by TCPTP | 23 | 10 | 1.46 × 10−8 | [ | |
| p38 MAPK signaling pathway | 16 | 8 | 6.97 × 10−6 | [ | |
| IL4-mediated signaling events | 11 | 3 | 8.80 × 10−6 | [ | |
| CD4+ T cell | CXCR4-mediated signaling events | 181 | 82 | < 2.20 × 10−16 | [ |
| Fc-epsilon receptor I signaling in mast cells | 85 | 25 | < 2.20 × 10−16 | [ | |
| JNK signaling in the CD4+ TCR pathway | 65 | 18 | < 2.20 × 10−16 | [ | |
| Reelin signaling pathway | 62 | 26 | < 2.20 × 10−16 | [ | |
| Semaphorin interactions | 51 | 14 | < 2.20 × 10−16 | [ | |
| CD20+ B cell | Syndecan-4-mediated signaling events | 85 | 40 | < 2.20 × 10−16 | [ |
| CXCR4-mediated signaling events | 77 | 15 | < 2.20 × 10−16 | [ | |
| EPO signaling pathway | 46 | 19 | 4.50 × 10−15 | [ | |
| Stabilization and expansion of the E-cadherin adherens junction | 32 | 16 | 1.06 × 10−9 | [ | |
| Interleukin-1 signaling | 13 | 1 | 3.98 × 10−8 | [ |
Immune cell-related functional annotations were enriched more in EPA shortened at genomic locations of forward–reverse orientation of CTCF-binding sites than EPA without CTCF. Five annotations are shown in each cell type. These functions are confirmed to be related to the cellular functions by reference journal papers. Chi-square tests were conducted using the total number of putative transcriptional target genes (Additional file 1: Figure S18)
Fig. 4Comparison of the median expression levels of transcriptional target genes. The median expression levels of the target genes of the same transcription factor binding sequences were compared between promoters and three types of promoter and extended regions for enhancer-promoter association (EPA). Red and blue dots show statistically significant difference of the distribution of expression levels of target genes between promoters and EPA. Red dots show the median expression level of target genes was higher in EPA than promoters, and blue dots show the median expression level of target genes was lower in EPA than promoters. The median expression levels of putative transcriptional target genes were significantly lower using TF bound in enhancers than using TF bound in promoters in stem cells such as H1-hESC and iPSC, and higher by TF bound in enhancers in immune cells. These results implied that enhancers significantly affect the expression of target genes