| Literature DB >> 29228394 |
Marc P Forrest1, Matthew J Hill1, David H Kavanagh1, Katherine E Tansey2, Adrian J Waite1, Derek J Blake1.
Abstract
Background: Common genetic variants in and around the gene encoding transcription factor 4 (TCF4) are associated with an increased risk of schizophrenia. Conversely, rare damaging TCF4 mutations cause Pitt-Hopkins syndrome and have also been found in individuals with intellectual disability (ID) and autism spectrum disorder (ASD).Entities:
Mesh:
Substances:
Year: 2018 PMID: 29228394 PMCID: PMC6101561 DOI: 10.1093/schbul/sbx164
Source DB: PubMed Journal: Schizophr Bull ISSN: 0586-7614 Impact factor: 9.306
Fig. 1.Chromatin immunoprecipitation and next generation sequencing (ChIP-seq) analysis of genomic transcription factor 4 (TCF4) binding sites. Genomic distribution of TCF4 ChIP-seq peaks centered around the transcriptional start site (A). Location of TCF4 binding sites relative to genomic features (B). TCF4 consensus binding sites predicted from the motif discovery program in HOMER (C). TCF4 binding sites are enriched at canonical E-boxes (Atoh1), while de novo motif discovery identified a nonpalindromic E-box. Venn diagram showing the overlap between TCF4 targets and differentially expressed genes in TCF4-depleted cells (D). Statistical tests were performed against a background set of 16318 expressed genes with TORAY 3D-Gene and HOMER annotations. TCF4 bound regions at PAPPA2 displayed using IGV (E). IGV plots for PAPPA2 (intron) showing tiled, unique sequence reads after PCR de-duplication. For comparative purposes, the control pre-immune IgG is also shown. The locations of canonical E-boxes within the locus are also shown.
Fig. 2.Transcription factor 4 (TCF4) binding sites are enriched at active enhancers. Venn diagram showing the intersection of TCF4 bound regions of genome with those marked with the H3K27ac in SH-SY5Ycells (A). Top ranked GO terms for TCF4 bound genes and genes marked by H3K27ac in SH-SY5Y cells (B). UCSC genome browser screenshot of the CHRNA5/A3/B4 locus on human chromosome 15 showing ChIP-seq peaks for H3K27ac and TCF4 (C). The 2 major TCF4 peaks (box) are found at an active enhancer (marked by extensive H3K27ac) upstream of CHRNB4. Enlarged view of panel C showing that TCF4 bound regions are located within histone-depleted regions (troughs) of the enhancer (D). Enlarged view of panel C showing the TCF4 peak maps to a pair of E-boxes (highlighted) with the sequence 5′-CAGCTG (E).
Gene Set Enrichment for TCF4 Targets in Cortical Expression Modules, FMRP Targets, and LoF Intolerant Genes
| Module | Ontologya |
|
| OR | 95% CI |
|---|---|---|---|---|---|
| 17 | Synaptic transmission | 8.34 × 10−8 | 1.50 × 10−6 | 1.55 | 1.32–1.81 |
| 18 | Defense response | 2.15 × 10−7 | 3.87 × 10−6 | 1.54 | 1.32–1.81 |
| 13 | Synaptic transmission | 3.77 × 10−3 | NS | 1.29 | 1.09–1.53 |
| 8 | Negative regulation of neuron differentiation | 1.26 × 10−1 | NS | 1.29 | 0.94–1.76 |
| 1 | None | 7.82 × 10−3 | NS | 1.29 | 1.07–1.54 |
| 11 | Cell cycle | 3.43 × 10−1 | NS | 1.11 | 0.89–1.37 |
| 15 | Response to virus | 4.94 × 10−1 | NS | 1.10 | 0.84–1.43 |
| 5 | None | 3.35 × 10−1 | NS | 1.09 | 0.91–1.30 |
| 16 | Cation-transporting ATPase activity | 6.48 × 10−1 | NS | 1.06 | 0.85–1.32 |
| 2 | Zinc ion binding | 9.28 × 10−1 | NS | 0.99 | 0.82–1.18 |
| None | None | 2.66 × 10−1 | NS | 0.96 | 0.89–1.03 |
| 3 | Nucleic acid metabolic processes | 1.42 × 10−1 | NS | 0.88 | 0.74–1.04 |
| 9 | Regulation of cellular amino acid metabolic process | 4.46 × 10−2 | NS | 0.69 | 0.48–0.99 |
| 10 | None | 3.40 × 10−1 | NS | 0.62 | 0.26–1.49 |
| 6 | None | 8.76 × 10−5 | 1.58 × 10−3 | 0.54 | 0.39–0.75 |
| 12 | None | 7.28 × 10−6 | 1.31 × 10−4 | 0.52 | 0.39–0.71 |
| 14 | Translational elongation | 4.15 × 10−9 | 7.47 × 10−8 | 0.48 | 0.37–0.63 |
| 4 | None | 4.78 × 10−7 | 8.61 × 10−6 | 0.45 | 0.32–0.63 |
| Gene set | |||||
| FMRP | Synaptic transmission | <2.2 × 10−16 | 1.87 | 1.62–2.14 | |
| LoF | Regulation of transcription | <2.2 × 10−16 | 1.42 | 1.31–1.53 | |
Note: Gene sets were collated from Parikshak et al (weighted gene co-expression network, human neocortical development), Darnell et al (FMRP bound genes) and Lek et al (LoF intolerant genes).[37,38,46] Initial P values were generated using Fisher’s exact test and were corrected (P corr) for multiple testing (Bonferroni) where appropriate. Only P values <.05 were considered statistically significant. OR, odds ratio; CI, confidence interval; NS, not significant.
aPrinicipal ontology for each module or gene set.
Gene Set Enrichment for TCF4 Bound Genes and Neuropsychiatric Disease Risk Genes
| Gene Set | Gene Count |
|
| OR | 95% CI |
|---|---|---|---|---|---|
| SCZ_all | 862 | 3.28 × 10−8 | 5.25 × 10−7 | 1.55 | 1.33–1.81 |
| SCZ_LoF | 98 | 1.74 × 10−3 | 2.79 × 10−2 | 1.99 | 1.31–3.03 |
| SCZ_NS | 668 | 7.90 × 10−7 | 1.26 × 10−5 | 1.56 | 1.31–1.85 |
| SCZ_silent | 207 | 1.02 × 10−2 | NS | 1.5 | 1.11–2.03 |
| ASD_all | 951 | 1.54 × 10−5 | 2.46 × 10−4 | 1.39 | 1.20–1.62 |
| ASD_LoF | 124 | 3.54 × 10−2 | NS | 1.53 | 1.04–2.27 |
| ASD_NS | 724 | 9.84 × 10−6 | 1.57 × 10−4 | 1.47 | 1.25–1.74 |
| ASD_silent | 248 | 2.40 × 10−1 | NS | 1.20 | 0.89–1.60 |
| ID_all | 154 | 4.76 × 10−4 | 7.61 × 10−3 | 1.86 | 1.33–2.61 |
| ID_LoF | 30 | 2.60 × 10−1 | NS | 1.6 | 0.73–3.51 |
| ID_NS | 129 | 1.28 × 10−4 | 2.04 × 10−3 | 2.09 | 1.45–2.99 |
| ID_silent | 25 | 1.00 | NS | 0.94 | 0.35–2.49 |
| Control/sibling_all | 557 | 8.39 × 10−3 | NS | 1.3 | 1.07–1.58 |
| Control/sibling_LoF | 49 | 3.80 × 10−1 | NS | 1.35 | 0.72–2.55 |
| Control/sibling_NS | 417 | 5.98 × 10−2 | NS | 1.25 | 1.00–1.56 |
| Control/sibling_silent | 151 | 7.15 × 10−2 | NS | 1.4 | 0.98–2.00 |
Note: Statistical enrichment for TCF4 bound genes in gene sets derived from exome sequencing studies in schizophrenia, ID and ASD using gene sets were collated from Fromer et al.36 Initial P values were generated using Fisher’s exact test and were corrected (P corr.) for multiple testing (Bonferroni). Only P values <.05 were considered statistically significant. OR, odds ratio; CI, confidence interval; NS, not significant.
Fig. 3.Networks of gene ontology (GO) terms derived from transcription factor 4 target genes. Nodes (circles) GO terms with the size proportional to the total number of member. The networks were produced using Cytoscape with connecting lines representing the degree of overlap between nodes. Clustered terms were manually cropped for visualization. Clusters correspond to the three most significant terms; cation channel activity and function (A), neurogenesis, development, and morphology (B), and signaling (C).