Literature DB >> 24324330

ChIP-Seq Data Mining: Remarkable Differences in NRSF/REST Target Genes between Human ESC and ESC-Derived Neurons.

Jun-Ichi Satoh1, Natsuki Kawana, Yoji Yamamoto.   

Abstract

The neuron-restrictive silencer factor (NRSF) is a zinc finger transcription factor that represses neuronal gene transcription in non-neuronal cells by binding to the consensus repressor element-1 (RE1) located in regulatory regions of target genes. NRSF silences the expression of a wide range of target genes involved in neuron-specific functions. Previous studies showed that aberrant regulation of NRSF plays a key role in the pathological process of human neurodegenerative diseases. However, a comprehensive set of NRSF target genes relevant to human neuronal functions has not yet been characterized. We performed genome-wide data mining from chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) datasets of NRSF binding sites in human embryonic stem cells (ESC) and the corresponding ESC-derived neurons, retrieved from the database of the ENCODE/HAIB project. Using bioinformatics tools such as Avadis NGS and MACS, we identified 2,172 NRSF target genes in ESC and 308 genes in ESC-derived neurons based on stringent criteria. Only 40 NRSF target genes overlapped between both data sets. According to motif analysis, binding regions showed an enrichment of the consensus RE1 sites in ESC, whereas they were mainly located in poorly defined non-RE1 sites in ESC-derived neurons. Molecular pathways of NRSF target genes were linked with various neuronal functions in ESC, such as neuroactive ligand-receptor interaction, CREB signaling, and axonal guidance signaling, while they were not directed to neuron-specific functions in ESC-derived neurons. Remarkable differences in ChIP-Seq-based NRSF target genes and pathways between ESC and ESC-derived neurons suggested that NRSF-mediated silencing of target genes is highly effective in human ESC but not in ESC-derived neurons.

Entities:  

Keywords:  ChIP-seq; ESC; GenomeJack; Huntington’s disease; NRSF; REST; data mining; human neurons

Year:  2013        PMID: 24324330      PMCID: PMC3855043          DOI: 10.4137/BBI.S13279

Source DB:  PubMed          Journal:  Bioinform Biol Insights        ISSN: 1177-9322


Introduction

The neuron-restrictive silencer factor (NRSF), also known as repressor element-1 (RE1) silencing transcription factor (REST), is a zinc finger transcription factor that represses neuronal gene transcription in non-neuronal cells by binding to the consensus RE1 site, also known as neuron-restrictive silencer element (NRSE), located in regulatory regions of target genes.1 NRSF silences the expression of a wide range of target genes involved in neuron-specific functions encoding ion channels, synaptic vesicle proteins, neurotransmitter receptors, neurotransmitter-synthesizing enzymes, neuropeptides, cell adhesion molecules, and cytoskeletal components.2 When NRSF represses the transcription of target genes, it recruits the repressor complex, composed of the REST corepressor COREST, transcription regulator SIN3A, methyl CpG binding protein MECP2, histone deacetylases, histone modifying enzymes, and the components of the SWI-SNF chromatin remodeling complexes.3 NRSF acts as a master negative regulator of neurogenesis in the central nervous system (CNS) during development. It is expressed at variable levels in most nonneuronal cell types, while the expression levels are reduced in neuronal progenitor cells and more greatly suppressed in differentiated neurons in adult rodent brains.4 NRSF plays a pivotal role in the control of proper timing of neuronal differentiation in neural progenitor cells.5 Sustained expression of NRSF inhibits radial migration of neural progenitor cells by targeting doublecortin in the developing rat brain.6 Overexpression of the NRSF competitor REST-VP16 induces neuronal differentiation by direct activation of NRSF target genes in mouse neural stem cells.7 Because most previous studies on NRSF employed rodent models,3–7 the functional roles of NRSF remain largely unknown in the human CNS during development and maturation, except for a previous study showing that blocking NRSF function induces apoptosis in human medulloblastoma cells, although the target genes of NRSF were not characterized.8 Increasing evidence indicates that aberrant regulation of NRSF is involved in the pathological process of human neurodegenerative diseases, such as Huntington’s disease (HD) and Alzheimer’s disease (AD). Wild-type huntingtin (HTT), the causative gene of HD, binds to and sequesters NRSF protein in the cytoplasm, thereby inhibiting the transcriptional repression of BDNF, a cardinal target of NRSF.9 In contrast, mutant HTT protein with a polyglutamine repeat expansion causes nuclear accumulation of NRSF and persistently represses the expression of neuronal NRSF target genes in HD brains.9 Furthermore, the binding ability of NRSF to NRSE is enhanced in lymphocytes of HD patients.10 NRSF protein levels are elevated in AD brains correlated with reduced expression of the choline acetyltransferase gene.11 These observations suggest that to clarify the role of NRSF in the pathogenesis of human neurodegenerative diseases, it is important to characterize the comprehensive list of NRSF target genes relevant to human neuronal functions and dysfunctions. Recently, the next-generation sequencing (NGS) technology had a great impact on the field of genome research. Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) serves as a highly efficient NGS method for genome-wide profiling of DNA-binding proteins, histone modifications, and nucleosomes, with high resolution and great coverage of the genome.12 Increasing amounts of NGS data have been deposited in public databases, such as the Sequence Read Archive (SRA) (http://www.ncbi.nlm.nih.gov/sra) and the European Nucleotide Archive (ENA) (http://www.ebi.ac.uk/ena/home). Since these databases contain extremely high-throughput experimental data, it is often difficult to extract meaningful biological implications. Recent advances in systems biology enable illustration of a cell-wide map of complex molecular interactions by using the literature-based knowledgebase of molecular pathways.13 The logically arranged molecular networks construct the entire system characterized by robustness, which maintains the proper function of the system in the face of genetic and environmental perturbations. Therefore, the integration of extremely high dimensional NGS data with underlying molecular pathways and networks by using bioinformatics tools of knowledgebase offers a rational approach for characterizing the pathway- and network-based molecular mechanisms of gene regulation on the whole genome scale.13 To identify genome-wide NRSF target genes relevant to human neuronal functions, we compared ChIP-Seq-based NRSF binding sites between human embryonic stem cells (ESC) and the corresponding ESC-derived neurons and studied the difference in molecular pathways and networks of NRSF target genes. We found remarkable differences in NRSF target genes and pathways between ESC and ESC-derived neurons, suggesting that NRSF-mediated silencing of target genes is highly effective in human ESC but not in ESC-derived neurons.

Methods

ChIP-Seq dataset of NRSF target genes

To identify a comprehensive set of NRSF target genes in human ESC and ESC-derived neurons, we investigated FASTQ-formatted files of ChIP-Seq datasets retrieved from the DDBJ Sequence Read Archive (DRA) (http://trace.ddbj.nig.ac.jp/DRASearch) under the accession number of SRX100410, SRX190260, SRX100409, and SRX190329. The datasets were originally prepared for the Encyclopedia of DNA Elements (ENCODE) project by Dr Richard Myers, HudsonAlpha Institute for Biotechnology (HAIB). In their experiments, human H1 ESC cells established from inner cell mass, presenting with a normal karyotype, were processed to induce neuronal differentiation. Both ESC and ESC-derived neurons were processed for ChIP with mouse monoclonal anti-NRSF antibody (Protein Expression Center, Caltech). Next, NGS libraries constructed from size-selected ChIP DNA fragments were processed for deep sequencing at a read length composed of 30–36 bp for ESC or 40–50 bp for neurons on Genome Analyzer (Illumina, San Diego, CA, USA). In both cell types, the corresponding libraries termed RevXlinkChromatin made from sonicated chromatin, whose formaldehyde cross-links are reversed without immunoenrichment, were utilized for input control. First, we evaluated the quality of short reads by searching them on the FastQC program. Next, we removed the reads of insufficient quality by filtering them out by the FASTX-toolkit. After cleaning the data, we mapped them on the human genome reference sequence version hg19 using a mapping tool of Avadis NGS (Agilent Technologies, Santa Clara, CA, USA) or using the Bowtie program. We then identified significant peaks by using the Model-based Analysis of ChIP-Seq (MACS) program in the stringent condition that satisfies the false discovery rate (FDR) ≤ 0.01 and fold enrichment (FE) ≥ 20 in order to reduce the detection of false-positive binding sites if possible as described previously.14 We characterized the genomic location of binding peaks using a peak-finding tool of Avadis NGS that classifies the locations into the upstream region, 5′ untranslated region (5′UTR), exon, intron, and 3′UTR. To identify the exact location of binding sites, we imported the Bowtie and MACS-processed data into the GenomeJack genome viewer (Mitsubishi Space Software) as described previously.14 The genes corresponding to MACS peaks were determined by a neighboring gene analysis tool of Avadis NGS within a distance of 5000 bp from peaks to genes. The consensus sequence motifs were identified by importing a 400 bp-length sequence surrounding the summit of MACS peaks derived from top 100 genes based on FE into the MEME-ChIP program. The utility of QC tools, Bowtie, MACS, and MEME-ChIP is described elsewhere.15–18 The information on websites and analysis conditions of bioinformatics tools is summarized in Supplementary Table 1.

Molecular pathway analysis

To identify biologically relevant molecular pathways and networks, we imported Entrez Gene IDs of NRSF target genes into the Functional Annotation tool of Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.7.19 DAVID extracts gene ontology (GO) terms enriched in the set of imported genes and identifies relevant pathways constructed by Kyoto Encyclopedia of Genes and Genomes (KEGG) or the Protein Analysis Through Evolutionary Relationships (PANTHER) classification system. The results are followed by statistical evaluation with the modified Fisher’s exact test corrected by multiple comparison tests. In DAVID analysis data, we considered P-value < 0.05 by Bonferroni correction as significant. KEGG is a publicly accessible knowledgebase that contains manually curated 272,733 pathways that cover a wide range of metabolic, genetic, environmental, and cellular processes, as well as human diseases. PANTHER is a publicly accessible knowledgebase that includes the information on 176 signaling and metabolic pathways manually curated by expert biologists. We also imported Entrez Gene IDs of NRSF target genes into the Core Analysis tool of Ingenuity Pathways Analysis (IPA) (Ingenuity Systems). IPA is a commercial knowledgebase that contains approximately 3,000,000 biological and chemical interactions with definite scientific evidence. By uploading the list of Gene IDs, the network-generation algorithm identifies focused genes integrated in global molecular pathways and networks. IPA calculates the score P-value that reflects the statistical significance of association between the genes and the pathways and networks by Fisher’s exact test. For IPA network and pathway analysis data, we considered P-value < 0.05 by Fisher’s exact test as significant.

Results

Identification of ChIP-Seq-based NRSF target genes in human ESC and ESC-derived neurons

After mapping short reads on hg19 and detecting peaks by MACS on Avadis NGS, we initially identified 36,096 ChIP-Seq peaks of NRSF target genes in ESC and 45,108 peaks in ESC-derived neurons. After performing the neighboring gene search within a distance of 5,000 bp from peaks to genes, these were reduced to 12,757 genes in ESC and 15,802 genes in ESC-derived neurons, respectively. These results indicate that approximately 65% of ChIP-Seq peaks were located in intergenic regions far from the nearest genes. Upon application of stringent criteria that satisfied both FDR ≤ 0.01 and fold enrichment (FE) ≥ 20, the results were further reduced to 2,697 genes in ESC and 388 genes in ESC-derived neurons, respectively. After omitting non-protein coding genes, we extracted 2,172 protein-coding genes in ESC (FE = 59.4 ± 38.8) and 308 genes in ESC-derived neurons (FE = 26.0 ± 6.8). The top 20 genes based on FE are listed in Table 1 for ESC and Table 2 for ESC-derived neurons. The complete lists are shown in Supplementary Table 2 for ESC and Supplementary Table 3 for ESC-derived neurons.
Table 1

Top 20 Most Enriched ChIP-Seq-based NRSF target genes in human ESC.

CHROMOSOMESTART BASE POSITIONEND BASE POSITIONFEFDRENTREZ GENE IDGENE SYMBOLGENE NAME
chr142498167524982925170.0017701215CMA1chymase 1, mast cell
chr6152167485152168597170.0017702099ESR1estrogen receptor 1
chr2166094555166096160170.0017706326SCN2Asodium channel, voltage-gated, type II, alpha subunit
chr147962769379628461170.0017709369NRXN3neurexin 3
chr91978180819782570170.00177025769SLC24A2solute carrier family 24 (sodium/ potassium/calcium exchanger), member 2
chr106093632960937937170.00177084457PHYHIPLphytanoyl-CoA 2-hydroxylase interacting protein-like
chr2163694658163698116170.00177090134KCNH7potassium voltage-gated channel, subfamily H (eag-related), member 7
chr195714842557151677170.001770147670LOC147670uncharacterized LOC147670
chr36835536268356382170.001770407738FAM19A1family with sequence similarity 19 (chemokine (C-C motif)-like), member A1
chr6167227855167229954169.1517606196RPS6KA2ribosomal protein S6 kinase, 90kDa, polypeptide 2
chr6108508703108509966169.1517607101NR2E1nuclear receptor subfamily 2, group E, member 1
chr63207269732073801169.1517607148TNXBtenascin XB
chr191020367310204816169.15176055337C19orf66chromosome 19 open reading frame 66
chr6119411155119412478169.15176079632FAM184Afamily with sequence similarity 184, member A
chrX153140408153141899167.4517503897L1CAML1 cell adhesion molecule
chr1941715644174043167.45175084699CREB3L3cAMP responsive element binding protein 3-like 3
chr145833129158333220167.451750341880SLC35F4solute carrier family 35, member F4
chr195198354651984430167.451750729767CEACAM18carcinoembryonic antigen-related cell adhesion molecule 18
chr86908421769085604167.36694080243PREX2phosphatidylinositol-3,4, 5-trisphosphate-dependent Rac exchange factor 2
chr4147564574147565930166.881705458POU4F2POU class 4 homeobox 2

From ChIP-Seq datasets, we identified 2,172 NRSF target genes in human ESC based on stringent criteria that satisfy both false discovery rate (FDR) ≤ 0.01 and fold enrichment (FE) ≥ 20. Top 20 genes based on FE are listed with the chromosome, the start base position, the end base position, FE, FDR, Entrez Gene ID, Gene Symbol, and Gene Name.

Table 2

Top 20 Most Enriched ChIP-Seq-based NRSF target genes in human ESC-derived neurons.

CHROMOSOMESTART BASE POSITIONEND BASE POSITIONFEFDRENTREZ GENE IDGENE SYMBOLGENE NAME
chrX548337195483559659.98119010916MAGED2melanoma antigen family D, 2
chr22200668892006917958.29808054487DGCR8DiGeorge syndrome critical region gene 8
chr10700906087009396554.1143403189HNRNPH3heterogeneous nuclear ribonucleoprotein H3 (2H9)
chr10700906087009396554.11434064081PBLDphenazine biosynthesis-like protein domain containing
chr41403924140487351.013280.00716845954729NKX1-1NK1 homeobox 1
chr11661873026618995548.8992650266743NPAS4neuronal PAS domain protein 4
chr517626434317626522948.6346130.00368663690249UNC5Aunc-5 homolog A (C. elegans)
chr19238683222387053548.2585870171392ZNF675zinc finger protein 675
chr516786068616786182846.708134023286WWC1WW and C2 domain containing 1
chrX10246946310247136046.1726040.00255427856271BEX4brain expressed, X-linked 4
chr4263204382632405646.10853603516RBPJrecombination signal binding protein for immunoglobulin kappa J region
chr20331029403310516544.41789083658DYNLRB1dynein, light chain, roadblock-type 1
chr123093985823094088342.3624730.00462427710753CAPN9calpain 9
chr913193607913194115540.9595680389792IER5Limmediate early response 5-like
chr2701204817012186339.224510.00409416611017SNRNP27small nuclear ribonucleoprotein 27kDa (U4/U6.U5)
chr11652632536527875739.0909840100996467LOC100996467metastasis-associated lung adenocarcinoma transcript 1-like
chr216976519516976774638.41547057818G6PC2glucose-6-phosphatase, catalytic, 2
chr319549749519549872738.4106450.0027285134585MUC4mucin 4, cell surface associated
chr213427935613428048238.3213160344148NCKAP5NCK-associated protein 5
chr20445617704456455437.885845063935PCIF1PDX1 C-terminal inhibiting factor 1

From ChIP-Seq datasets, we identified 308 NRSF target genes in human ESC-derived neurons based on stringent criteria that satisfy both false discovery rate (FDR) ≤ 0.01 and fold enrichment (FE) ≥ 20. Top 20 genesbased on FE are listed with the chromosome, the start base position, the end base position, FE, FDR, Entrez Gene ID, Gene Symbol, and Gene Name.

Importantly, when compared with 985 ChIP-Seq-based NRSF target genes in Jurkat human T cells identified in a previous study,20 405 genes (41.1%) of Jurkat overlapped with NRSF target genes of human ESC, supporting the validity of our analysis (Supplementary Table 4). In contrast, only 40 genes overlapped between human ESC and ESC-derived neurons. These included ST8SIA3, NPAS4, EP300, NKX1–1, ARID2, SLCA7 A14, MEX3B, GPR26, ZSWIM5, SEZ6 L2, ASPHD1, DDX39B, CELSR3, GAD2, SPARTA16, MPHOSPH10, MCEE, RAB28, FAM117B, KCNMA1, MRP63, SKA3, TMEM187, LATS1, PGK1, RPS17, LRP6, H3F3B, EIF4 A2, MGEA5, RPL10, KIF21 A, FUS, AZIN1, WHSC2, BTG1, CCDC88 A, WWC1, TMED2, and GLCE (underlined in Supplementary Table 2 and Supplementary Table 3). The network analysis of 40 genes by IPA indicated that the set of 14 genes (35%), such as ARID2, MEX3B, SEZ6 L2, GAD2, RAB28, SKA3, RPS17, RPL10, KIF21A, FUS, AZIN1, WHSC2 (NEFLA), CCDC88A, and TMED2, belong to the molecules interacting with UBC (ubiquitin C) (Supplementary Fig. 1). Among 2,172 genes detected in ESC, the peaks were located in the upstream region (8.6%), 5′UTR (12.8%), exon (21.9%), intron (53.4%), and 3′UTR (3.3%). Among 308 genes detected in ESC-derived neurons, the peaks were located in the upstream region (1.9%), 5′UTR (30.9%), exon (27.4%), intron (36.2%), and 3′UTR (3.7%). Thus, the intron serves as a major NRSF-binding site in both ESC and ESC-derived neurons. The more precise location was determined by importing processed data into the GenomeJack genome viewer (Fig. 1 for ESC and Fig. 2 for ESC-derived neurons).
Figure 1

The genomic location of ChIP-Seq-based NRSF target gene EP300 in ESC. An example of NRSF target gene in ESC named histone acetyltransferase p300 (EP300) is shown on GenomeJack, where a MACS peak numbered 26542 is located in the exon of EP300. The panels (A, B) represent (A) the overview and (B) the close-up view. The canonical REST motif is highlighted by orange rectangle.

Figure 2

The genomic location of ChIP-Seq-based NRSF target gene SLC7 A14 in ESC-derived neurons. An example of NRSF target gene in ESC-derived neurons named solute carrier family 7, member 14 (SLC7A14) is shown on GenomeJack, where a MACS peak numbered 31638 is located in the first intron of SLC7A14. The panels (A, B) represent (A) the overview and (B) the close-up view. The canonical REST motif is highlighted by orange rectangle.

According to motif analysis with MEME-ChIP of top 100 genes based on FE, the most significant consensus sequences in ESC were characterized by a well-defined REST motif (RE1) composed of 5′AG[CG]ACCA[TC] GGACAG3′ (80 sites; E-value = 5.1 × 10−254) and a left half-site of RE1 composed of 5′[TG]TTCAGCAC[CT]3′ (21 sites; E-Value = 2.9 × 10−11) (Fig. 3A, B). In contrast, the most significant consensus sequences in ESC-derived neurons represented a poorly defined non-RE1 motif composed of 5′[GA][AGC][GA][AG][ATG][GA]G[CA]GG[C/G][AG] G[CGT][GA][GA][CAG][AG][GC][TA][GA][GA][GA] [AG][CAG][GA][GC]A3′ (63 sites; E-Value = 6.7 × 10−17) and an RE1-like motif composed of 5′[CG][CT][TG] TC[AT][GT]CACCG[CT][GA]GACAGC[GA]C[CT] [ACT][TG]3′ (11 sites; E-Value = 1.7 × 10−4) (Fig. 3C, D). These results suggest that NRSF binds chiefly to the RE1 consensus sites in ESC by direct binding, whereas NRSF often binds to the promiscuous non-RE1 sites in ESC-derived neurons possibly by indirect binding.
Figure 3

Identification of NRSF binding consensus motif sequence. The consensus motif sequence was identified by importing a 400 bp-length sequence surrounding the summit of MACS peaks of top 100 binding sites based on FE into MEME-ChIP. The panels (A–D) represents (A) ESC, motif 1, E-value = 5.1 × 10−254, (B) ESC, motif 2, E-value = 2.9 × 10−11, (C) ESC-derived neurons, motif 1, E-value = 6.7 × 10−17, and (D) ESC-derived neurons, motif 2, E-value = 1.7 × 10−4.

Molecular pathways of ChIP-Seq-based NRSF target genes in human ESC and ESC-derived neurons

Next, we studied molecular pathways and networks of ChIP-Seq-based NRSF target genes by using bioinformatics tools of the comprehensive knowledgebase. First, by importing the sets of Entrez Gene IDs of NRSF target genes, DAVID identified statistically significant functionally associated GO terms and related KEGG and PANTHER pathways. The top 3 most significant GO terms, regardless of their categories in ESC, included “neuron projection” (GO: 0043005; P = 1.21 × 10−8), “calcium ion binding” (GO: 0005509; P = 1.21 × 10−7), and “hemophilic cell adhesion” (GO: 0007156; P = 3.72 × 10−6). In ESC, only one KEGG pathway, termed “Neuroactive ligand-receptor interaction” (hsa04080; P = 0.0343; Fig. 4), and only one PANTHER pathway, termed “Wnt signaling pathway” (P00057; P = 0.0399), were considered to be significantly relevant to NRSF target genes. In contrast, the top 3 most significant GO terms, regardless of their categories in ESC-derived neurons, included “nuclear lumen” (GO: 0031981; P = 8.72 × 10−8), “intracellular non-membrane-bounded organelle” (GO: 0043232; P = 5.41 × 10−7), and “non-membrane- bounded organelle” (GO: 0043228; P = 5.41 × 10−7). In ESC-derived neurons, only one KEGG pathway termed “Ribosome” was significant (hsa03010; P = 3.53 × 10−5) and none of the PANTHER pathways reached the statistical significance after Bonferroni correction.
Figure 4

KEGG pathway of ChIP-Seq-based NSRF target genes in ESC. Entrez Gene IDs of 2,172 ChIP-Seq-based NRSF target genes in ESC were imported into the Functional Annotation tool of DAVID. It extracted statistically significant KEGG pathways relevant to the set of imported genes. KEGG pathway termed “Neuroactive ligand-receptor interaction” (hsa04080; P = 0.0343) is shown. NRSF target genes are highlighted by orange.

Finally, we imported the list of Entrez Gene IDs of NRSF target genes into IPA. It extracted the network defined as “Cell-To-Cell Signaling and Interaction, Cell Signaling, Nervous System Development and Function” (P = 1.00 × 10−39) and “Endocrine System Disorders, Gastrointestinal Disease, Hereditary Disorder” (P = 1.00 × 10−39) in ESC, and “Cell-mediated Immune Response, Cellular Movement, Developmental Disorder” (P = 1.00 × 10−71) in ESC-derived neurons as the most significant functionally associated molecular network. IPA also identified canonical pathways relevant to the set of imported genes (Table 3). The top 5 most significant pathways in ESC included “Corticotropin releasing hormone signaling” (P = 7.05E-08), “Role of NFAT in cardiac hypertrophy” (P = 1.48 × 10−7), “CREB signaling in neurons” (P = 3.64 × 10−7; Fig. 5), “Axonal guidance signaling” (P = 4.24 × 10−7), and “Neuropathic pain signaling in dorsal horn neurons” (P = 2.02 × 10−6). In contrast, the top 5 most significant pathways in ESC-derived neurons were composed of “EIF2 signaling” (P = 1.98 × 10−8), “Regulation of eIF4 and p70S6K signaling” (P = 1.10 × 10−4), “mTOR signaling” (P = 2.07 × 10−4), “ERK5 signaling” (P = 4.74 × 10−4), and “ERK/MAPK signaling” (P = 7.46 × 10−4). When the top 5 IPA pathways were compared between both cell types, a very small number of NRSF target genes, such as EP300, EIF4 A2, RPS17, RPL10, and H3FB, overlapped between ESC and ESC-derived neurons (underlined in Table 3). These results suggest that the signaling pathways composed of NRSF target genes differ between cell types, and the pathways pivotal for maintenance of neuronal function are enriched in ESC, whereas the pathways in ESC-derived neurons involve more general functions, such as protein translation control and cell survival.
Table 3

Top 5 IPA canonical pathways of NRSF target genes in human ESC and ESC-derived neurons.

RANKCATEGORYFOCUSED MOLECULESRATIOP-VALUE
Human ESC
1Corticotropin releasing hormone signalingCAMK4, BDNF, MAPK11, EP300, GLI3, UCN, PRKAR1B, SMO, NOS2, PRKD3, MAP2K1, GNAS, PRKCQ, PTCH1, UCN3, GNAQ, CRH, ITPR1, MEF2B, CALM1, GNAI3, ADCY9, PLCG2, PRKCD, GNAO1, ADCY1, NR4A1, GUCY1A2, PRKAG2, MEF2C, ADCY7, OPN1SW32/138 (0.232)7.05E-08
2Role of NFAT in cardiac hypertrophyCAMK4, CAMK1D, PIK3R5, PLCH2, MAPK11, EP300, HDAC6, TGFB1, PRKAR1B, PIK3R2, GNB1L, MAP2K1, PRKD3, CAMK2B, SRC, HDAC9, HDAC4, PRKCQ, GNAS, HDAC8, GRB2, GNAQ, PIK3C2G, SLC8A3, SLC8A2, ITPR1, MEF2B, GNAI3, CALM1, NKX2-5, ADCY9, PLCB4, HDAC3, PRKCD, PLCG2, ADCY1, MAPK10, PRKAG2, MEF2C, SLC8A1, RCAN2, ADCY7, OPN1SW43/209 (0.206)1.48E-07
3CREB signaling in neuronsGRIN2A, CAMK4, GRID2, GNA11, PIK3R5, POLR2B, GNA14, PLCH2, EP300, GRIN2C, PRKAR1B, PIK3R2, GNB1L, MAP2K1, PRKD3, CAMK2B, POLR2J2/POLR2J3, GRIN1, GRIN2B, GNAS, PRKCQ, GRM8, GRB2, GRM1, GRIK3, PIK3C2G, GNAQ, ITPR1, GRM7, GNAI3, CALM1, ADCY9, PLCB4, PRKCD, PLCG2, GNAO1, ADCY1, PRKAG2, RPS6KA1, ADCY7, OPN1SW41/206 (0.199)3.64E-07
4Axonal guidance signalingDPYSL2, PAPPA2, KLC1, RAC2, MYL10, PLCH2, NTN1, SEMA6D, ECE2, ABLIM3, UNC5D, ADAM23, ABLIM2, PLXNB3, PRKD3, RTN4R, PAPPA, PRKCQ, PTCH1, VEGFC, L1CAM, SRGAP3, ADAM12, MYL12B, PLCG2, PRKCD, GNAO1, PAK7, OPN1SW, NRP1, FYN, RGS3, BDNF, GNA11, PIK3R5, EGF, GNA14, ABLIM1, SEMA6C, EPHA8, WNT7A, GLI3, GLIS1, SDC2, EFNB1, NTRK1, SMO, PRKAR1B, ARPC3, PIK3R2, RASSF5, ROBO2, SHANK2, GNB1L, MAP2K1, ACE, UNC5C, SEMA3E, PLXNA1, GNAS, PLXNC1, ARHGEF12, NRP2, GRB2, WNT2B, PIK3C2G, GNAQ, EPHA3, GNAI3, EPHA10, PLCB4, TUBB6, NTRK3, ADAM10, PRKAG2, NFATC2, SEMA3C, BMP678/471 (0.166)4.24E-07
5Neuropathic pain signaling in dorsal horn neuronsGRIN2A, CAMK4, CAMK1D, BDNF, PIK3R5, PLCH2, GPR37, GRIN2C, PRKAR1B, PIK3R2, PRKD3, CAMK2B, GRIN2B, SRC, GRIN1, KCNN2, PRKCQ, GRM8, GRM1, PIK3C2G, ITPR1, GRIN3A, GRM7, PLCB4, PRKCD, PLCG2, PRKAG227/108 (0.25)2.02E-06
ESC-derive human neurons
1EIF2 signalingRPL24, RPS28, RPS2, EIF4A2, RPS17, RPS29, RPL15, RPL5, RPL7A, EIF5, RPL10, EIF3A, RPS15, RPS5, RPL1315/200 (0.075)1.98E-07
2Regulation of eIF4 and p70S6K signalingRPS29, RPS2, RPS28, RPS15, EIF3A, PPP2R3B, PPP2R5C, EIF4A2, RPS17, RPS510/175 (0.057)1.10E-04
3mTOR signalingRPS29, TSC1, RPS2, RPS28, RPS15, EIF3A, PPP2R3B, PPP2R5C, EIF4A2, RPS17, RPS511/211 (0.052)2.07E-04
4ERK5 signalingFOS, YWHAG, YWHAE, YWHAZ, ATF4, EP3006/66 (0.091)4.74E-04
5ERK/MAPK signalingFOS, YWHAG, H3F3B, DUSP1, CRKL, YWHAZ, ATF4, PPP2R3B, PPP2R5C, EP30010/208 (0.048)7.46E-04

By importing Entrez Gene IDs of ChIP-Seq-based NRSF target genes into the Core Analysis tool of IPA, we identified canonical pathways with significant relevance to the imported genes. Top 5 pathways based on P-value are listed with the category, focused molecules, ratio, and P-value of Fisher’s exact test. The NRSF target genes shared between ESC and ESC-derived neurons are underlined.

Figure 5

IPA pathways of ChIP-Seq-based NSRF target genes in ESC. Entrez Gene IDs of 2,172 ChIP-Seq-based NRSF target genes in ESC were imported into the Core Analysis tool of IPA. It extracted statistically significant canonical pathways relevant to the set of imported genes. The third rank pathway termed “CREB signaling in neurons” (P = 3.64 × 10−7) in Table 3 is shown. NRSF target genes are highlighted in red.

Discussion

NRSF acts as a transcriptional repressor that restricts neuronal gene expression to differentiated neurons by silencing expression of target genes in nonneuronal cells.1,2 Most previous studies have focused on a role of NRSF in rodent cells.3–7 We characterized a comprehensive set of NRSF target genes relevant to human neuronal functions by analyzing ChIP-Seq datasets of NRSF binding sites in human ESC and the corresponding ESC-derived neurons, retrieved from the public database. We identified totally 2,172 NRSF target genes in ESC and 308 genes in ESC-derived neurons based on stringent criteria, with a small overlapping set of 40 genes, in which ubiquitin C-interacting proteins are accumulated. Importantly, the protein level of NRSF is regulated by ubiquitin- mediated proteolysis in a β-TRCP-dependent manner during neuronal differentiation.21 These observations suggest that NRSF, along with a subset of NRSF targets, is processed for degradation by the ubiquitin-proteasome system (UPS), leading to acceleration of neuronal differentiation. Furthermore, the list of NRSF target genes in human ESC included many known targets identified in various cell types, such as NRXN3, L1CAM, LHX3, CY11B2, SYP, NEUROD1, FGF14, SCN3B, ST8SIA3, RASL10B, WDR25, SYN1, HCN1, DCX, CRH, KIRREL3, PTPRT, HES1, ONE-CUT1, CACNG2, NTRK3, CALB1, BDNF, NGDN, GRIN2 A, LHFPL5, RXFP3, HNF4 A, STMN2, GPR123, ACTA1, GLRA2, NPAS4, and VIP.2,20,22 A previous ChIP-Seq study identified 985 NRSF target genes in Jurkat human T cells.20 Surprisingly, they are highly enriched in the GO terms related to synaptic transmission and nervous system development despite lymphoid cell types.20 We analyzed the molecular pathway of 405 overlapping genes between human ESC and Jurkat T cells by using IPA. The program extracted “Corticotropin Releasing Hormone Signaling” (P = 2.79 × 10−6), “Glutamate Receptor Signaling” (P = 2.18 × 10−5), “Calcium Signaling” (P = 2.84 × 10−4), “Synaptic Long Term Depression” (P = 5.25 × 10−4), and “Circadian Rhythm Signaling” (P = 5.41 × 10−4) as the top 5 significant pathways. These observations suggested that the pathways involved in neuronal functions are enriched in NRSF target genes, and they are shared substantially between ESC and T lymphocytes. We found that NRSF binding sites are located in various genomic regions, among which the intron region serves as the most major site in both ESC and ESC-derived neurons except for intergenic regions, which was consistent with previous observations.23 Importantly, NRSF effectively inhibits transcription of target genes, regardless of the location or the orientation of its binding sites in 5′UTRs, introns, exons, upstream of the transcription start, and downstream of the coding region distant from the enhancer and the promoter.24 Furthermore, only a subset of NRSE sequences are constitutively occupied by NRSF in non-neuronal cells, and the occupancy depends largely on endogenous levels of NRSF.22 The canonical NRSE site reflects a 21-bp consensus sequence composed of 5′TTCAGCACCACGGA-CAGCGCC3 ′, which is divided into the left half-site and the right half-site separated by a spacer of two nucleotides. 23 However, NRSF binding sites with highly degenerate sequences are often found throughout the genome.25 Through motif analysis, we found that most NRSF binding sites in ESC are present in the most significant RE1 sites (E-value = 5.1 × 10−254), while they are located mostly in non-RE1 sites and the minority is located in less significant RE1 sites (E-value = 1.7 × 10−4) in ESC-derived neurons. The possibility exists that the lower expression levels of NRSF in ESC-derived neurons may make it difficult to detect specific ChIP-Seq peaks. However, a highly sensitive monoclonal anti-NRSF antibody was utilized to detect a trace amount of NRSF protein for both ESC and ESC-derived neurons. Therefore, our observations suggest that NRSF binds chiefly to RE1 sites in ESC by direct binding, whereas it often binds to promiscuous non-RE1 sites in ESC-derived neurons by indirect binding via an interaction with other DNA binding proteins, as described peviously.23 Notably, noncanonical binding sites are often highly polymorphic, where the left and right half-site motifs are separated by variable-length spacer sequences.25,26 Furthermore, numerous ChIP-Seq-based NRSF binding sites contain only partial motifs composed of a single half-site.23 NRSF is composed of a DNA binding domain located within the cluster of eight zinc fingers, and two repressor domains at the N- and C-termini, which are capable of interacting with corepressor complex components.3 Importantly, as neuronal progenitor cells differentiate into neurons, NRSF and its corepressor complex gradually dissociate from RE1 sites, triggering activation of neuronal gene expression.27 These observations may explain why only a limited subset of NRSF target genes overlapped between human ESC and ESC-derived neurons. By molecular pathway analysis, we identified the pathways termed “Neuroactive ligand-receptor interaction”, “Wnt signaling pathway”, “CREB signaling in neurons”, and “Axonal guidance signaling”, all of which are closely related to neuronal functions, as the pathways relevant to NRSF target genes in ESC. Among these, downregulation of CREB signaling may represent a highly important function of NRSF in non-neuronal cells. CREB is promptly activated in response to a wide range of extracellular stimuli, such as growth factors, peptide hormones, and neuronal activity, which activates protein kinase A (PKA), mitogen-activated protein kinases (MAPKs), and Ca2+/calmodulin-dependent protein kinases (CAMKs).28 Phosphorylated CREB upregulates a panel of genes essential for neuronal differentiation and plasticity, including BDNF, TrkB, somatostatin, and c-fos.28 NRSF inhibits the CREB signaling on activation of cocaine and amphetamine regulated transcript (CART) in SK-N-SH neuroblastoma cells, being in line with our observations.29 Furthermore, NRSF expression is positively regulated by CREB in small-cell lung cancer (SCLC) cell lines, serving as an autoregulatory loop in the NRSF-CREB signaling pathway.30 Importantly, NRSF is upregulated in Alzheimer’s disease (AD) brains,11 and CREB signaling is deregulated in the hippocampus of AD brains.31 With respect to the “Axonal guidance signaling” pathway of NRSF target genes, overexpression of NRSF in differentiated neurons causes axonal guidance errors in the developing check spinal cord.32 In contrast to the molecular pathways in ESC, we identified the pathways linked to regulation of protein synthesis termed “Ribosome”, “EIF2 signaling”, “Regulation of eIF4 and p70S6K signaling”, and “mTOR signaling” as the major pathways relevant to NRSF target genes in ESC-derived neurons. They converge on the PI3K/Akt/mTOR signaling pathway pivotal for cellular proliferation and survival in all cell types.33 These observations suggest that NRSF target genes play a key role in maintenance of neuronal functions, which are suppressed in ESC, whereas they are not restricted to neuron-specific functions in ESC-derived neurons. Notably, NRSF negatively regulates WNT signaling pathway, providing a mechanism responsible for maintenance of pluripotency in ESC.34 Increasing evidence indicates that aberrant regulation of NRSF plays a pivotal role in the pathogenesis of cancer, cardiac hypertrophy, ischemia, epilepsy, and neurodegeneration. 9–11,35–37 NRSF acts as an essential regulator of maintenance of normal cardiac structure, since transgenic mice expressing a dominant-negative mutant NRSF in hearts exhibited dilated cardiomyopathy.38 Through IPA analysis, we found the pathway termed “Role of NFAT in cardiac hypertrophy” as the second-ranked significant pathway for NRSF target genes in ESC (Table 3). In Huntington’s disease (HD) brains, the inability of mutant huntingtin (HTT) to retain NRSF in the neuronal cytoplasm inhibits the expression of NRSF target genes essential for neuronal functions.9 We identified both NRSF itself and HTT as potential NRSF target genes in human ESC, because they have degenerate RE1 motifs of NRSF binding sites (Supplementary Fig 2 for NRSF/REST and Supplementary Fig 3 for HTT). These observations suggested the existence of a complex autoregulatory loop in the network of the HTT-NRSF axis.

Conclusions

Using genome-wide data mining from ChIP-Seq datasets, we identified 2,172 NRSF target genes in human ESC and 308 NRSF target genes in ESC-derived neurons. The binding regions were accumulated in the RE1 sites in human ESC, whereas they were located mainly in non-RE1 sites in ESC-derived neurons. The molecular pathways of NRSF target genes were relevant to the maintenance of neuronal functions in ESC, whereas they were not limited to neuron-specific functions in ESC-derived neurons. Remarkable differences in ChIP-Seq-based NRSF target genes and pathways between ESC and ESC-derived neurons suggested that NRSF-mediated silencing of target genes is highly effective in human ESC but not in ESC-derived neurons. The molecular network of NRSF target genes shared between human ESC and ESC-derived neurons. Entrez Gene IDs of 40 NRSF target genes overlapping between ESC and ESC-derived neurons were imported into the Core Analysis tool of IPA. It extracted the functional network defined as “Cellular Assembly and Organization, Cellular Function and Maintenance, Developmental Disorder” (P = 1.00 × 10−35), composed of 14 molecules interacting with UBC (ubiquitin C), as the most significant network. The genomic location of ChIP-Seq- based NRSF target gene REST in ESC. An example of NRSF target gene named RE1-silencing transcription factor (REST) is shown on GenomeJack, where a MACS peak numbered 30234 is located in the first intron of REST transcript variant NM_005612. The panels (a, b) represent (a) the overview and (b) the close-up view. The degenerate REST motif is highlighted in an orange rectangle. The genomic location of ChIP-Seq-based NRSF target gene HTT in ESC. An example of NRSF target gene named huntingtin (HTT) is shown on GenomeJack, where a MACS peak numbered 29571 is located in the 3′UTR of HTT. The panels (a, b) represent (a) the overview and (b) the close-up view. The degenerate REST motif is highlighted in an orange rectangle.
  37 in total

Review 1.  Function and regulation of CREB family transcription factors in the nervous system.

Authors:  Bonnie E Lonze; David D Ginty
Journal:  Neuron       Date:  2002-08-15       Impact factor: 17.173

2.  Biological activity and modular structure of RE-1-silencing transcription factor (REST), a repressor of neuronal genes.

Authors:  G Thiel; M Lietz; M Cramer
Journal:  J Biol Chem       Date:  1998-10-09       Impact factor: 5.157

3.  REST regulates the pool size of the different neural lineages by restricting the generation of neurons and oligodendrocytes from neural stem/progenitor cells.

Authors:  Matthew V Covey; Jeffrey W Streb; Roman Spektor; Nurit Ballas
Journal:  Development       Date:  2012-07-12       Impact factor: 6.868

4.  The neuronal repressor REST/NRSF is an essential regulator in medulloblastoma cells.

Authors:  P Lawinger; R Venugopal; Z S Guo; A Immaneni; D Sengupta; W Lu; L Rastelli; A Marin Dias Carneiro; V Levin; G N Fuller; Y Echelard; S Majumder
Journal:  Nat Med       Date:  2000-07       Impact factor: 53.440

5.  Regulation of the NRSF/REST gene by methylation and CREB affects the cellular phenotype of small-cell lung cancer.

Authors:  A Kreisler; P L Strissel; R Strick; S B Neumann; U Schumacher; C-M Becker
Journal:  Oncogene       Date:  2010-08-09       Impact factor: 9.867

6.  The neuron-restrictive silencer factor (NRSF): a coordinate repressor of multiple neuron-specific genes.

Authors:  C J Schoenherr; D J Anderson
Journal:  Science       Date:  1995-03-03       Impact factor: 47.728

7.  SCFbeta-TRCP controls oncogenic transformation and neural differentiation through REST degradation.

Authors:  Thomas F Westbrook; Guang Hu; Xiaolu L Ang; Peter Mulligan; Natalya N Pavlova; Anthony Liang; Yumei Leng; Rene Maehr; Yang Shi; J Wade Harper; Stephen J Elledge
Journal:  Nature       Date:  2008-03-20       Impact factor: 49.962

8.  Genome-wide mapping of in vivo protein-DNA interactions.

Authors:  David S Johnson; Ali Mortazavi; Richard M Myers; Barbara Wold
Journal:  Science       Date:  2007-05-31       Impact factor: 47.728

9.  A genome-wide screen for genetic variants that modify the recruitment of REST to its target genes.

Authors:  Rory Johnson; Nadine Richter; Gireesh K Bogu; Akshay Bhinge; Siaw Wei Teng; Siew Hua Choo; Lise O Andrieux; Cinzia de Benedictis; Ralf Jauch; Lawrence W Stanton
Journal:  PLoS Genet       Date:  2012-04-05       Impact factor: 5.917

10.  REST regulates distinct transcriptional networks in embryonic and neural stem cells.

Authors:  Rory Johnson; Christina Hui-leng Teh; Galih Kunarso; Kee Yew Wong; Gopalan Srinivasan; Megan L Cooper; Manuela Volta; Sarah Su-ling Chan; Leonard Lipovich; Steven M Pollard; R Krishna Murthy Karuturi; Chia-lin Wei; Noel J Buckley; Lawrence W Stanton
Journal:  PLoS Biol       Date:  2008-10-28       Impact factor: 8.029

View more
  12 in total

1.  Perturbations of Neuron-Restrictive Silencing Factor Modulate Corticotropin-Releasing Hormone Gene Expression in the Human Cell Line BeWo.

Authors:  Vasileios Kreouzis; Guo-Lin Chen; Gregory M Miller
Journal:  Mol Neuropsychiatry       Date:  2018-09-19

2.  Intragenic CpG islands play important roles in bivalent chromatin assembly of developmental genes.

Authors:  Sun-Min Lee; Jungwoo Lee; Kyung-Min Noh; Won-Young Choi; Sejin Jeon; Goo Taeg Oh; Jeongsil Kim-Ha; Yoonhee Jin; Seung-Woo Cho; Young-Joon Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2017-02-21       Impact factor: 11.205

Review 3.  Selective repression of gene expression in neuropathic pain by the neuron-restrictive silencing factor/repressor element-1 silencing transcription (NRSF/REST).

Authors:  Dianna E Willis; Meng Wang; Elizabeth Brown; Lilah Fones; John W Cave
Journal:  Neurosci Lett       Date:  2015-12-08       Impact factor: 3.046

4.  Identification of REST targets in the Xenopus tropicalis genome.

Authors:  Banu Saritas-Yildirim; Christopher P Childers; Christine G Elsik; Elena M Silva
Journal:  BMC Genomics       Date:  2015-05-14       Impact factor: 3.969

5.  REST/NRSF Knockdown Alters Survival, Lineage Differentiation and Signaling in Human Embryonic Stem Cells.

Authors:  Kaushali Thakore-Shah; Tasneem Koleilat; Majib Jan; Alan John; April D Pyle
Journal:  PLoS One       Date:  2015-12-21       Impact factor: 3.240

6.  REST is a hypoxia-responsive transcriptional repressor.

Authors:  Miguel A S Cavadas; Marion Mesnieres; Bianca Crifo; Mario C Manresa; Andrew C Selfridge; Ciara E Keogh; Zsolt Fabian; Carsten C Scholz; Karen A Nolan; Liliane M A Rocha; Murtaza M Tambuwala; Stuart Brown; Anita Wdowicz; Danielle Corbett; Keith J Murphy; Catherine Godson; Eoin P Cummins; Cormac T Taylor; Alex Cheong
Journal:  Sci Rep       Date:  2016-08-17       Impact factor: 4.379

Review 7.  More than a Corepressor: The Role of CoREST Proteins in Neurodevelopment.

Authors:  Simon Maksour; Lezanne Ooi; Mirella Dottori
Journal:  eNeuro       Date:  2020-03-10

8.  Dedifferentiation and neuronal repression define familial Alzheimer's disease.

Authors:  Andrew B Caldwell; Qing Liu; Gary P Schroth; Douglas R Galasko; Shauna H Yuan; Steven L Wagner; Shankar Subramaniam
Journal:  Sci Adv       Date:  2020-11-13       Impact factor: 14.136

9.  Molecular mechanisms regulating the defects in fragile X syndrome neurons derived from human pluripotent stem cells.

Authors:  Tomer Halevy; Christian Czech; Nissim Benvenisty
Journal:  Stem Cell Reports       Date:  2014-12-04       Impact factor: 7.765

10.  REST Regulates Non-Cell-Autonomous Neuronal Differentiation and Maturation of Neural Progenitor Cells via Secretogranin II.

Authors:  Hyung Joon Kim; Ahmet M Denli; Rebecca Wright; Tithi D Baul; Gregory D Clemenson; Ari S Morcos; Chunmei Zhao; Simon T Schafer; Fred H Gage; Mohamedi N Kagalwala
Journal:  J Neurosci       Date:  2015-11-04       Impact factor: 6.167

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.