| Literature DB >> 28720872 |
Nikolai Hecker1,2,3,4, Stefan E Seemann1,2, Asli Silahtaroglu1,5, Walter L Ruzzo6,7,8, Jan Gorodkin9,10.
Abstract
Anatomical subdivisions of the human brain can be associated with different neuronal functions. This functional diversification is reflected by differences in gene expression. By analyzing post-mortem gene expression data from the Allen Brain Atlas, we investigated the impact of transcription factors (TF) and RNA secondary structures on the regulation of gene expression in the human brain. First, we modeled the expression of a gene as a linear combination of the expression of TFs. We devised an approach to select robust TF-gene interactions and to determine localized contributions to gene expression of TFs. Among the TFs with the most localized contributions, we identified EZH2 in the cerebellum, NR3C1 in the cerebral cortex and SRF in the basal forebrain. Our results suggest that EZH2 is involved in regulating ZIC2 and SHANK1 which have been linked to neurological diseases such as autism spectrum disorder. Second, we associated enriched regulatory elements inside differentially expressed mRNAs with RNA secondary structure motifs. We found a group of purine-uracil repeat RNA secondary structure motifs plus other motifs in neuron related genes such as ACSL4 and ERLIN2.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28720872 PMCID: PMC5516038 DOI: 10.1038/s41598-017-06200-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Transcription factor contributions to gene expression. The color gradient indicates the contribution to gene expression and is capped between 5% and 95% quantiles for all transcription factors (rows) in all depicted region (columns). Cx refers cerebral cortex, CxN to cerebral nuclei, TH to thalamus, Hy to hypothalamus, Cb to cerebellum and MTg to mid-brain tegmentum. Depicted are the contributions of the 25 transcription factors with the strongest contribution in any of the regions at depth 3 from one instance out of a five-times repeated analysis. Arrows point to transcription factors with localized contributions (bold font). Italic font indicates transcription factor with comparably strong contributions to gene expression in all regions and all instances of the analysis including different region hierarchy depths.
Transcription factor contributions to gene expression and the number of regulated genes.
| TF |
| Tel | MET | Cx | Cb | BF | T |
|
|---|---|---|---|---|---|---|---|---|
| MAX | 2 | 149.86 | 48.49 | 110.93 | 61.87 | 31.22 | Tel | 180 |
| TAF1 | 5 | 64.96 | 30.47 | 45.91 | 42.36 | 10.57 | Tel | 101 |
| YY1 | 9 | 59.05 | 23.72 | 59.57 | 19.39 | 13.65 | Tel | 90 |
| POLR2A | 11 | 88.05 | 23.94 | 76.97 | 52.09 | 12.82 | Tel | 252 |
| CEBPB | 12 | 30.99 | 86.87 | 25.25 | 36.43 | 12.23 | MET | 64 |
| JUND | 14 | 35.75 | 17.39 | 41.50 | 21.93 | 13.47 | Tel | 34 |
| NFYB | 14 | 30.53 | 12.19 | 24.15 | 21.35 | 16.73 | Tel | 52 |
| REST | 14 | 38.72 | 47.50 | 21.51 | 25.24 | 9.5 | MET | 39 |
| GABPA | 15 | 49.29 | 26.25 | 41.12 | 52.55 | 18.06 | Tel | 101 |
| TCF12 | 17 | 24.99 | 20.16 | 19.65 | 18.06 | 3.33 | Tel | 25 |
| SP1 | 18 | 27.89 | 11.52 | 17.05 | 15.68 | 6.77 | Tel | 54 |
| EZH2 | 81 | 3.42 | 19.02 | 2.83 | 19.62 | 1.47 | MET | 11 |
| NR3C1 | 40 | 32.39 | 26.20 | 33.00 | 22.60 | 3.55 | Cx | 20 |
| SRF | 95 | 2.24 | 0.88 | 0.94 | 1.19 | 10.57 | BF | 5 |
Transcription factors (TF) with the strongest contribution to gene expression (ranked ≥20 in all regions) are given in the upper part and TFs with localized contributions are shown in the lower part of the table. is the maximum (worst) rank in any region for the TF. Tel (telencephalon), MET (metencephalon), Cx (cerebral cortex), Cb (cerebellum) and BF (basal forebrain) show the contribution to gene expression of the TF in the specified region. These values are averages over five runs of the analysis initialized with different seed for the random number generator. T indicates the region with the highest contribution to gene expression at depth 2 for brain-wide active TFs and a selected region with localized contributions for locally active TFS. refers to the number of TF-gene interactions that were consistently predicted in these five runs for the correspoding region.
Figure 2Enriched structured regulatory elements in differentially expressed genes. (A) The number of differentially expressed genes between any two sub-regions of the indicated region. (B) Enriched regulatory elements for the set of differentially expressed genes. RE refers to regulatory elements. If a RE overlaps with a RNA secondary structure predictions it is referred to as structured RE.
Figure 3Overlap between predicted instances of reoccurring RNA structure motifs. The overlap between the predicted instances is depicted for 15 CRS motifs. The 15 CRSs are well conserved and predicted inside enriched structured regulatory elements for at least three different differentially expressed genes. The overlap between instances a CRS motif A and instances of a CRS motif B is computed by . The heatmap shows a single-linkage hierarchical clustering between the CRS motifs.
Figure 4Examples of reoccurring RNA structure motifs. Each panel shows the consensus structure based on the predicted instances of the corresponding conserved RNA structure (CRS) motif and instances of the CRS in brain related genes. The sequence identity is computed over the predicted instances of the motif and given by a white to blue color gradient. RNA secondary structure diagrams were drawn with RNAfdl[104].