| Literature DB >> 32553182 |
Daria Bunina1, Nade Abazova2, Nichole Diaz3, Kyung-Min Noh4, Jeroen Krijgsveld5, Judith B Zaugg6.
Abstract
Cellular differentiation requires dramatic changes in chromatin organization, transcriptional regulation, and protein production. To understand the regulatory connections between these processes, we generated proteomic, transcriptomic, and chromatin accessibility data during differentiation of mouse embryonic stem cells (ESCs) into postmitotic neurons and found extensive associations between different molecular layers within and across differentiation time points. We observed that SOX2, as a regulator of pluripotency and neuronal genes, redistributes from pluripotency enhancers to neuronal promoters during differentiation, likely driven by changes in its protein interaction network. We identified ATRX as a major SOX2 partner in neurons, whose co-localization correlated with an increase in active enhancer marks and increased expression of nearby genes, which we experimentally confirmed for three loci. Collectively, our data provide key insights into the regulatory transformation of SOX2 during neuronal differentiation, and we highlight the significance of multi-omic approaches in understanding gene regulation in complex systems.Entities:
Keywords: Atrx; Sox2; data integration; gene regulation; multi-omics; neuronal differentiation; transcription factors
Mesh:
Substances:
Year: 2020 PMID: 32553182 PMCID: PMC7322528 DOI: 10.1016/j.cels.2020.05.003
Source DB: PubMed Journal: Cell Syst ISSN: 2405-4712 Impact factor: 10.304
Figure 1Changes in Proteome, Transcriptome, and Chromatin during Neuronal Differentiation
(A) Scheme of neuronal differentiation protocol and experimental set-up (LiF, leukemia inhibitory factor; RA, retinoic acid).
(B) Overview of ATAC-seq, RNA-seq, and proteomics data. All except distal ATAC-peaks are aligned by genes. Distal ATAC-peaks are only partially shown due to the high number. n, number of ATAC-seq peaks (or genes if no ATAC-peak is present).
(C) Variance explained by latend factors (LFs) identified with MOFA is shown for each dataset.
(D) Top: scatterplots of samples projected to LF1 versus LF2 (left), and LF1 versus LF3 (right) are shown. Bottom: a subset of the most enriched GO terms per LF and data type are shown as bar graphs.
(E) RNA, protein, and chromatin accessibility data are shown as a heatmap for genes grouped by k-means clustering of log2 foldchanges of RNA and protein. Accessibility is shown separately for promoters (1.5 kb from TSS; “Promoter ATAC signal”) and gene body (“Intragenic ATAC signal”). Numbers indicate unique gene IDs (NGene) and intragenic ATAC-seq peaks (NATAC). Top enriched motifs in gene promoters are shown in each cluster (right; full list in Figure S1F). (B and E): genes with multiple promoters and or gene body peaks are shown multiple times.
See also Figure S1.
Figure 2Associations Between Differential Features Across Molecular Layers and Time Points
(A) Numbers of differential RNAs, proteins, and ATAC-seq peaks between adjacent time points (day 0 versus 4, 8 versus 4, and 12 versus 8; FDR < 5%.) are shown.
(B and C) Associations between differential features across time points are shown as log2 ORs according to the schematic in (B) (top). (B): differential RNA and differential proteins, (C): differential promoter ATAC-seq peaks and differential RNA. Adj.p < 0.05 are bold; Fishers’ test; arrows indicate up- or down- regulation at each time point; enrichments contradicting the central dogma of molecular biology (ATAC→RNA, RNA→protein) were not considered (gray boxes).
(D) GO terms enriched among genes downregulated at day 12, whose promoters were decompacted already at day 4 (related to the C). Expressed genes with promoter ATAC-seq peak were used as background.
(E) Promoter ATAC-seq signal (top) and RNA expression (bottom) dynamics are shown for the genes defined in (D).
See also Figure S2.
Figure 3Transcription Factor Binding Sites Undergo Major Changes in Chromatin Accessibility during Differentiation
(A) K-means clustering of RNA expression and activity of TFs at days 4, 8, and 12 versus day 0 (log2 fold changes) are represented as a heatmap (left) and lineplot (right). Protein abundance (log2) is shown for comparison (gray, not detected). Thick lines represent LOESS (local polynomial regression) fit.
(B) ATAC signal footprints of the combined OCT4-SOX2 and SOX2-only motif (± 100 bp) at day 0.
(C) Protein expression levels (log2) of SOX2 and OCT4 during differentiation.
See also Figure S3 and Table S3.
Figure 4Redistribution of SOX2 Binding Sites during Neuronal Differentiation
(A) Regulatory regions occupied by SOX2 (n = 14.362) are clustered into regulatory groups based on k-means clustering of SOX2 binding, ATAC-seq, and ChIP-seq of different histone marks in ESCs and neurons. H3K4me3-marked groups are labeled as promotors (P).
(B) Overlap of SOX2 peaks identified in ESCs and neurons displayed as Venn diagram (left). Fold changes of SOX2 binding versus signal intensity are visualized as MA plot. Pink represents differentially bound peaks (FDR < 5%).
(C) Top enriched motifs (HOMER tools) are shown for the differential SOX2-peaks in ESCs and neurons.
(D) Fractions of SOX2 peaks overlapping known cell-type-specific enhancers are shown (mouse ESCs and E14.5 brain data from Shen et al., 2012).
(E) The fractions of ESCs or neuronal SOX2 peaks in different genomic regions are shown.
See also Figure S4 and Table S4.
Figure 5Rearrangement of Chromatin-Bound SOX2 Protein Interaction Network during Neuronal Differentiation
(A) Protein enrichment ranking of all SOX2-associated proteins pulled down by ChIP-SICAP. x axis: protein rank, corresponding to the total number of proteins. y axis: iBAQ intensities, log10.
(B) Number and overlap between the SOX2-associated proteins in ESCs and neurons.
(C) Percentage of different functional groups of SOX2-associated proteins in ESCs and neurons.
(D) Selected stem cell, neuronal, and epigenetic factors present in ESCs and neurons alone or in both.
(E) Relative SOX2-association rate of overlapping proteins between neurons and ESCs, log2. For details on (C–E) see STAR Methods and Table S5.
(F) Scatter plot of the SOX2 association scores from (E) and corresponding protein log2 fold changes in neurons versus ESCs. SOX2 interactors with publicly available ChIP-seq data are marked in red.
Figure 6Genomic Co-occupancy of SOX2 and Its Identified Interactors
(A) Overlap of SOX2 interactors in ESCs and neurons (based on our SOX2 ChIP-SICAP and ChIP-MS data) with publicly available ChIP-seq data (from ChIP-Atlas) are shown as Venn diagram.
(B) SOX2 interactors identified by SICAP or ChIP-MS in ESCs and neurons are shown (left) along with their RNA and protein expression in neural differentiation (middle; log2 fold change to day 0) and the fraction of SOX2 peaks that overlap with their binding sites (right). Gray boxes indicate no ChIP-seq data in the corresponding cell type in the ChIP-Atlas database. Stars mark known SOX2 interactors (STRING database (Szklarczyk et al., 2019), experimentally confirmed interactors).
Figure 7SOX2-ATRX Co-binding Coincides with Increased Enhancer Activity and Is Required for Proper Expression Some Neuronal Genes
(A) Intersections of neuronal SOX2 peaks with binding sites of its neuronal interactors are shown as an upset plot. Venn diagram (top left) shows the intersection between the top interactors KDM1A and ATRX binding sites.
(B and C) H3K27ac and H3K4me1 occupancy in ESC and brain (B, left and right) and RNA expression in neurons (normalized counts; C) are shown for genomic regions bound by ATRX alone, SOX2 alone, or co-bound by SOX2-ATRX.
(D) SOX2 binding strength in neurons for peaks bound by SOX2-only or co-bound by SOX2-ATRX.
(E) Expression (qRT-PCR) of 5 neuronal genes on day 12 is shown for 5 lines, in which a SOX2-ATRX co-bound enhancer in the respective genes was removed (blue) normalized to CRISPR control cell lines (red). Each box summarizes at least 2 biological and 3 technical replicates.
(F) RNA expression of Slc1a3 gene (qRT-PCR) is shown for lines in which the indicated regions are removed by CRISPR.
(G) Proposed model of SOX2 interaction dynamics during the transition from ESCs to neurons and its possible role in regulating the expression of neuronal genes. All panels: p values are obtained with the Student’s t-test (except for C - Wilcoxon rank-sum test).
See also Figure S5 and Table S6.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Rabbit polyclonal anti-Sox2 for immunofluorescence | Merck-Millipore | Cat#AB5603; RRID: |
| Goat polyclonal anti-Sox2 for ChIP-seq/ChIP-SICAP | R&D Systems | Cat#AF2018; RRID: |
| Mouse monoclonal anti-ßTubulin III | Abcam | Cat#AB78078; RRID: |
| Goat polyclonal anti-mouse Alexa Fluor 594 | Thermo Fisher | Cat#A11005; RRID: |
| Goat polyclonal anti-rabbit Alexa Fluor 488 | Thermo Fisher | Cat#A11008; RRID: |
| Nextera DNA Library Prep Kit | Illumina | Cat# FC-121-1030 |
| NEBNext High-Fidelity 2× PCR Master Mix | New England Biolabs | Cat# M0541 |
| NEBNext Ultra II RNA Library Prep Kit for Illumina | New England Biolabs | Cat#E7770L |
| NEBNext® Ultra II DNA Library Prep Kit for Illumina | New England Biolabs | Cat#E7645L |
| Raw and processed ATAC-seq data | This study | ArrayExpress: |
| Raw and processed RNA-seq data | This study | ArrayExpress: |
| Raw proteomics data | This study | Proteomexchange: PXD016080 |
| Raw and processed Sox2 ChIP-seq data | This study | ArrayExpress: |
| Mouse cortex, H3K27ac ChIPseq | ENCODE Project | |
| Mouse forebrain E16.5, H3K9me3 ChIPseq | ENCODE Project | GSE82631; https://www.encodeproject.org/experiments/ENCSR352NVU/ |
| Mouse forebrain E16.5, H3K9ac ChIPseq | ENCODE Project | GSE82353; |
| Mouse forebrain E16.5, H3K27me3 ChIPseq | ENCODE Project | GSE82859; |
| Mouse forebrain E16.5, H3K4me3 ChIPseq | ENCODE Project | GSE82453; |
| Mouse forebrain E16.5, H3K4me1 ChIPseq | ENCODE Project | GSE82464; |
| Mouse forebrain E16.5, H3K36me3 ChIPseq | ENCODE Project | GSE82630; |
| Gencode mouse genome annotation version M7 | ENCODE | |
| Uniprot database release 2015 | Uniprot | |
| ChIP-Atlas database of public ChIP-seq data | ChIP-Atlas | |
| Jaspar CORE release 2018 | ||
| Mouse embryonic stem cells 129XC57BL/6J generated from male 129-B13 agouti mice | Laboratory of Kyung-Min Noh | ( |
| See | N/A | |
| N/A | N/A | Addgene plasmid #48138 |
| Proteome Discoverer 1.4 | Thermo Fisher | |
| Mascot | MatrixScience | |
| TopHat2 | ||
| Bowtie2 2.3.2 | ||
| Trimmomatic 0.32 | ||
| Snakemake 5.0 | ||
| Deeptools 2.5.0 | ||
| Macs2 2.1.1 | ( | |
| FastQC 0.11.5 | ( | |
| Picard tools 2.9.0 | Broad Institute | |
| Samtools 1.3.1 | ||
| Homer 4.9.1 | ( | |
| R 3.5.1 | ||
| Bioconductor | ||
| MOFA 1.2 | ( | |
| DiffBind Bioconductor package 2.10 | ( | |
| DESeq2 Bioconductor package 1.20 | ( | |
| GenomicAlignments Bioconductor package 1.16 | ( | |
| clusterProfiler 3.10 | ( | |
| diffTF | ||