| Literature DB >> 24324330 |
Jun-Ichi Satoh1, Natsuki Kawana, Yoji Yamamoto.
Abstract
The neuron-restrictive silencer factor (NRSF) is a zinc finger transcription factor that represses neuronal gene transcription in non-neuronal cells by binding to the consensus repressor element-1 (RE1) located in regulatory regions of target genes. NRSF silences the expression of a wide range of target genes involved in neuron-specific functions. Previous studies showed that aberrant regulation of NRSF plays a key role in the pathological process of human neurodegenerative diseases. However, a comprehensive set of NRSF target genes relevant to human neuronal functions has not yet been characterized. We performed genome-wide data mining from chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) datasets of NRSF binding sites in human embryonic stem cells (ESC) and the corresponding ESC-derived neurons, retrieved from the database of the ENCODE/HAIB project. Using bioinformatics tools such as Avadis NGS and MACS, we identified 2,172 NRSF target genes in ESC and 308 genes in ESC-derived neurons based on stringent criteria. Only 40 NRSF target genes overlapped between both data sets. According to motif analysis, binding regions showed an enrichment of the consensus RE1 sites in ESC, whereas they were mainly located in poorly defined non-RE1 sites in ESC-derived neurons. Molecular pathways of NRSF target genes were linked with various neuronal functions in ESC, such as neuroactive ligand-receptor interaction, CREB signaling, and axonal guidance signaling, while they were not directed to neuron-specific functions in ESC-derived neurons. Remarkable differences in ChIP-Seq-based NRSF target genes and pathways between ESC and ESC-derived neurons suggested that NRSF-mediated silencing of target genes is highly effective in human ESC but not in ESC-derived neurons.Entities:
Keywords: ChIP-seq; ESC; GenomeJack; Huntington’s disease; NRSF; REST; data mining; human neurons
Year: 2013 PMID: 24324330 PMCID: PMC3855043 DOI: 10.4137/BBI.S13279
Source DB: PubMed Journal: Bioinform Biol Insights ISSN: 1177-9322
Top 20 Most Enriched ChIP-Seq-based NRSF target genes in human ESC.
| CHROMOSOME | START BASE POSITION | END BASE POSITION | FE | FDR | ENTREZ GENE ID | GENE SYMBOL | GENE NAME |
|---|---|---|---|---|---|---|---|
| chr14 | 24981675 | 24982925 | 170.00177 | 0 | 1215 | CMA1 | chymase 1, mast cell |
| chr6 | 152167485 | 152168597 | 170.00177 | 0 | 2099 | ESR1 | estrogen receptor 1 |
| chr2 | 166094555 | 166096160 | 170.00177 | 0 | 6326 | SCN2A | sodium channel, voltage-gated, type II, alpha subunit |
| chr14 | 79627693 | 79628461 | 170.00177 | 0 | 9369 | NRXN3 | neurexin 3 |
| chr9 | 19781808 | 19782570 | 170.00177 | 0 | 25769 | SLC24A2 | solute carrier family 24 (sodium/ potassium/calcium exchanger), member 2 |
| chr10 | 60936329 | 60937937 | 170.00177 | 0 | 84457 | PHYHIPL | phytanoyl-CoA 2-hydroxylase interacting protein-like |
| chr2 | 163694658 | 163698116 | 170.00177 | 0 | 90134 | KCNH7 | potassium voltage-gated channel, subfamily H (eag-related), member 7 |
| chr19 | 57148425 | 57151677 | 170.00177 | 0 | 147670 | LOC147670 | uncharacterized LOC147670 |
| chr3 | 68355362 | 68356382 | 170.00177 | 0 | 407738 | FAM19A1 | family with sequence similarity 19 (chemokine (C-C motif)-like), member A1 |
| chr6 | 167227855 | 167229954 | 169.15176 | 0 | 6196 | RPS6KA2 | ribosomal protein S6 kinase, 90kDa, polypeptide 2 |
| chr6 | 108508703 | 108509966 | 169.15176 | 0 | 7101 | NR2E1 | nuclear receptor subfamily 2, group E, member 1 |
| chr6 | 32072697 | 32073801 | 169.15176 | 0 | 7148 | TNXB | tenascin XB |
| chr19 | 10203673 | 10204816 | 169.15176 | 0 | 55337 | C19orf66 | chromosome 19 open reading frame 66 |
| chr6 | 119411155 | 119412478 | 169.15176 | 0 | 79632 | FAM184A | family with sequence similarity 184, member A |
| chrX | 153140408 | 153141899 | 167.45175 | 0 | 3897 | L1CAM | L1 cell adhesion molecule |
| chr19 | 4171564 | 4174043 | 167.45175 | 0 | 84699 | CREB3L3 | cAMP responsive element binding protein 3-like 3 |
| chr14 | 58331291 | 58333220 | 167.45175 | 0 | 341880 | SLC35F4 | solute carrier family 35, member F4 |
| chr19 | 51983546 | 51984430 | 167.45175 | 0 | 729767 | CEACAM18 | carcinoembryonic antigen-related cell adhesion molecule 18 |
| chr8 | 69084217 | 69085604 | 167.36694 | 0 | 80243 | PREX2 | phosphatidylinositol-3,4, 5-trisphosphate-dependent Rac exchange factor 2 |
| chr4 | 147564574 | 147565930 | 166.8817 | 0 | 5458 | POU4F2 | POU class 4 homeobox 2 |
From ChIP-Seq datasets, we identified 2,172 NRSF target genes in human ESC based on stringent criteria that satisfy both false discovery rate (FDR) ≤ 0.01 and fold enrichment (FE) ≥ 20. Top 20 genes based on FE are listed with the chromosome, the start base position, the end base position, FE, FDR, Entrez Gene ID, Gene Symbol, and Gene Name.
Top 20 Most Enriched ChIP-Seq-based NRSF target genes in human ESC-derived neurons.
| CHROMOSOME | START BASE POSITION | END BASE POSITION | FE | FDR | ENTREZ GENE ID | GENE SYMBOL | GENE NAME |
|---|---|---|---|---|---|---|---|
| chrX | 54833719 | 54835596 | 59.98119 | 0 | 10916 | MAGED2 | melanoma antigen family D, 2 |
| chr22 | 20066889 | 20069179 | 58.29808 | 0 | 54487 | DGCR8 | DiGeorge syndrome critical region gene 8 |
| chr10 | 70090608 | 70093965 | 54.11434 | 0 | 3189 | HNRNPH3 | heterogeneous nuclear ribonucleoprotein H3 (2H9) |
| chr10 | 70090608 | 70093965 | 54.11434 | 0 | 64081 | PBLD | phenazine biosynthesis-like protein domain containing |
| chr4 | 1403924 | 1404873 | 51.01328 | 0.007168459 | 54729 | NKX1-1 | NK1 homeobox 1 |
| chr11 | 66187302 | 66189955 | 48.899265 | 0 | 266743 | NPAS4 | neuronal PAS domain protein 4 |
| chr5 | 176264343 | 176265229 | 48.634613 | 0.003686636 | 90249 | UNC5A | unc-5 homolog A (C. elegans) |
| chr19 | 23868322 | 23870535 | 48.258587 | 0 | 171392 | ZNF675 | zinc finger protein 675 |
| chr5 | 167860686 | 167861828 | 46.708134 | 0 | 23286 | WWC1 | WW and C2 domain containing 1 |
| chrX | 102469463 | 102471360 | 46.172604 | 0.002554278 | 56271 | BEX4 | brain expressed, X-linked 4 |
| chr4 | 26320438 | 26324056 | 46.108536 | 0 | 3516 | RBPJ | recombination signal binding protein for immunoglobulin kappa J region |
| chr20 | 33102940 | 33105165 | 44.41789 | 0 | 83658 | DYNLRB1 | dynein, light chain, roadblock-type 1 |
| chr1 | 230939858 | 230940883 | 42.362473 | 0.004624277 | 10753 | CAPN9 | calpain 9 |
| chr9 | 131936079 | 131941155 | 40.959568 | 0 | 389792 | IER5L | immediate early response 5-like |
| chr2 | 70120481 | 70121863 | 39.22451 | 0.004094166 | 11017 | SNRNP27 | small nuclear ribonucleoprotein 27kDa (U4/U6.U5) |
| chr11 | 65263253 | 65278757 | 39.090984 | 0 | 100996467 | LOC100996467 | metastasis-associated lung adenocarcinoma transcript 1-like |
| chr2 | 169765195 | 169767746 | 38.41547 | 0 | 57818 | G6PC2 | glucose-6-phosphatase, catalytic, 2 |
| chr3 | 195497495 | 195498727 | 38.410645 | 0.002728513 | 4585 | MUC4 | mucin 4, cell surface associated |
| chr2 | 134279356 | 134280482 | 38.321316 | 0 | 344148 | NCKAP5 | NCK-associated protein 5 |
| chr20 | 44561770 | 44564554 | 37.885845 | 0 | 63935 | PCIF1 | PDX1 C-terminal inhibiting factor 1 |
From ChIP-Seq datasets, we identified 308 NRSF target genes in human ESC-derived neurons based on stringent criteria that satisfy both false discovery rate (FDR) ≤ 0.01 and fold enrichment (FE) ≥ 20. Top 20 genesbased on FE are listed with the chromosome, the start base position, the end base position, FE, FDR, Entrez Gene ID, Gene Symbol, and Gene Name.
Figure 1The genomic location of ChIP-Seq-based NRSF target gene EP300 in ESC. An example of NRSF target gene in ESC named histone acetyltransferase p300 (EP300) is shown on GenomeJack, where a MACS peak numbered 26542 is located in the exon of EP300. The panels (A, B) represent (A) the overview and (B) the close-up view. The canonical REST motif is highlighted by orange rectangle.
Figure 2The genomic location of ChIP-Seq-based NRSF target gene SLC7 A14 in ESC-derived neurons. An example of NRSF target gene in ESC-derived neurons named solute carrier family 7, member 14 (SLC7A14) is shown on GenomeJack, where a MACS peak numbered 31638 is located in the first intron of SLC7A14. The panels (A, B) represent (A) the overview and (B) the close-up view. The canonical REST motif is highlighted by orange rectangle.
Figure 3Identification of NRSF binding consensus motif sequence. The consensus motif sequence was identified by importing a 400 bp-length sequence surrounding the summit of MACS peaks of top 100 binding sites based on FE into MEME-ChIP. The panels (A–D) represents (A) ESC, motif 1, E-value = 5.1 × 10−254, (B) ESC, motif 2, E-value = 2.9 × 10−11, (C) ESC-derived neurons, motif 1, E-value = 6.7 × 10−17, and (D) ESC-derived neurons, motif 2, E-value = 1.7 × 10−4.
Figure 4KEGG pathway of ChIP-Seq-based NSRF target genes in ESC. Entrez Gene IDs of 2,172 ChIP-Seq-based NRSF target genes in ESC were imported into the Functional Annotation tool of DAVID. It extracted statistically significant KEGG pathways relevant to the set of imported genes. KEGG pathway termed “Neuroactive ligand-receptor interaction” (hsa04080; P = 0.0343) is shown. NRSF target genes are highlighted by orange.
Top 5 IPA canonical pathways of NRSF target genes in human ESC and ESC-derived neurons.
| RANK | CATEGORY | FOCUSED MOLECULES | RATIO | |
|---|---|---|---|---|
| 1 | Corticotropin releasing hormone signaling | CAMK4, BDNF, MAPK11, | 32/138 (0.232) | 7.05E-08 |
| 2 | Role of NFAT in cardiac hypertrophy | CAMK4, CAMK1D, PIK3R5, PLCH2, MAPK11, | 43/209 (0.206) | 1.48E-07 |
| 3 | CREB signaling in neurons | GRIN2A, CAMK4, GRID2, GNA11, PIK3R5, POLR2B, GNA14, PLCH2, | 41/206 (0.199) | 3.64E-07 |
| 4 | Axonal guidance signaling | DPYSL2, PAPPA2, KLC1, RAC2, MYL10, PLCH2, NTN1, SEMA6D, ECE2, ABLIM3, UNC5D, ADAM23, ABLIM2, PLXNB3, PRKD3, RTN4R, PAPPA, PRKCQ, PTCH1, VEGFC, L1CAM, SRGAP3, ADAM12, MYL12B, PLCG2, PRKCD, GNAO1, PAK7, OPN1SW, NRP1, FYN, RGS3, BDNF, GNA11, PIK3R5, EGF, GNA14, ABLIM1, SEMA6C, EPHA8, WNT7A, GLI3, GLIS1, SDC2, EFNB1, NTRK1, SMO, PRKAR1B, ARPC3, PIK3R2, RASSF5, ROBO2, SHANK2, GNB1L, MAP2K1, ACE, UNC5C, SEMA3E, PLXNA1, GNAS, PLXNC1, ARHGEF12, NRP2, GRB2, WNT2B, PIK3C2G, GNAQ, EPHA3, GNAI3, EPHA10, PLCB4, TUBB6, NTRK3, ADAM10, PRKAG2, NFATC2, SEMA3C, BMP6 | 78/471 (0.166) | 4.24E-07 |
| 5 | Neuropathic pain signaling in dorsal horn neurons | GRIN2A, CAMK4, CAMK1D, BDNF, PIK3R5, PLCH2, GPR37, GRIN2C, PRKAR1B, PIK3R2, PRKD3, CAMK2B, GRIN2B, SRC, GRIN1, KCNN2, PRKCQ, GRM8, GRM1, PIK3C2G, ITPR1, GRIN3A, GRM7, PLCB4, PRKCD, PLCG2, PRKAG2 | 27/108 (0.25) | 2.02E-06 |
| 1 | EIF2 signaling | RPL24, RPS28, RPS2, | 15/200 (0.075) | 1.98E-07 |
| 2 | Regulation of eIF4 and p70S6K signaling | RPS29, RPS2, RPS28, RPS15, EIF3A, PPP2R3B, PPP2R5C, | 10/175 (0.057) | 1.10E-04 |
| 3 | mTOR signaling | RPS29, TSC1, RPS2, RPS28, RPS15, EIF3A, PPP2R3B, PPP2R5C, | 11/211 (0.052) | 2.07E-04 |
| 4 | ERK5 signaling | FOS, YWHAG, YWHAE, YWHAZ, ATF4, | 6/66 (0.091) | 4.74E-04 |
| 5 | ERK/MAPK signaling | FOS, YWHAG, | 10/208 (0.048) | 7.46E-04 |
By importing Entrez Gene IDs of ChIP-Seq-based NRSF target genes into the Core Analysis tool of IPA, we identified canonical pathways with significant relevance to the imported genes. Top 5 pathways based on P-value are listed with the category, focused molecules, ratio, and P-value of Fisher’s exact test. The NRSF target genes shared between ESC and ESC-derived neurons are underlined.
Figure 5IPA pathways of ChIP-Seq-based NSRF target genes in ESC. Entrez Gene IDs of 2,172 ChIP-Seq-based NRSF target genes in ESC were imported into the Core Analysis tool of IPA. It extracted statistically significant canonical pathways relevant to the set of imported genes. The third rank pathway termed “CREB signaling in neurons” (P = 3.64 × 10−7) in Table 3 is shown. NRSF target genes are highlighted in red.