| Literature DB >> 32770939 |
Elizabeth A Fogarty1, Jacob O Kitzman2,3, Anthony Antonellis4,5,6.
Abstract
BACKGROUND: Multicellular organisms adopt various strategies to tailor gene expression to cellular contexts including the employment of multiple promoters (and the associated transcription start sites (TSSs)) at a single locus that encodes distinct gene isoforms. Schwann cells-the myelinating cells of the peripheral nervous system (PNS)-exhibit a specialized gene expression profile directed by the transcription factor SOX10, which is essential for PNS myelination. SOX10 regulates promoter elements associated with unique TSSs and gene isoforms at several target loci, implicating SOX10-mediated, isoform-specific gene expression in Schwann cell function. Here, we report on genome-wide efforts to identify SOX10-regulated promoters and TSSs in Schwann cells to prioritize genes and isoforms for further study.Entities:
Keywords: Isoform; Myelination; Promoter; SOX10; Schwann cell
Mesh:
Substances:
Year: 2020 PMID: 32770939 PMCID: PMC7430845 DOI: 10.1186/s12864-020-06963-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1SOX10 binds to promoter elements in Schwann cells in vivo. a Tn5Prime-defined transcription start sites (TSSs) in sciatic nerve mapping within 1 kilobase of SOX10 and/or H3K4me3 ChIP-Seq peaks. b Expression levels of TSS classes (as in panel A). X-axis: log10 transformed average expression level (reads per million, RPM). Y-axis: the cumulative fraction of TSSs. Asterisk indicates p < 0.00001. c Aggregate SOX10 ChIP-Seq data in the 2-kilobase region surrounding TSS classes (as in panel A). X-axis: genomic distance from the TSS (base pairs, bp). Y-axis: average SOX10 ChIP-Seq signal (RPM, reads per million). Asterisk indicates p < 0.01
Gene mapping rates of Tn5Prime-defined transcription start sites classified by associations with H3K4me3 and SOX10 ChIP-Seq peaks
| Group | Number of TSSs | Number Within 1 kb of Gene on Same Strand | Percent Within 1 kb of Gene on Same Strand | Number of Unique Genes |
|---|---|---|---|---|
| H3K4me3 ChIP and SOX10 ChIP | 4993 | 4698 | 94.09 | 2993 |
| H3K4me3 ChIP; No SOX10 ChIP | 7455 | 6918 | 92.80 | 4741 |
| SOX10 ChIP; No H3K4me3 ChIP | 431 | 392 | 90.95 | 225 |
| No H3K4me3 ChIP and No SOX10 ChIP | 26,827 | 24,867 | 92.69 | 7088 |
| Total | 39,706 | 36,875 | 92.87 | 10,000 |
Fig. 2Assessment of SOX10-associated transcription start sites in differentiating primary Schwann cells. a Expression profiles of TSSs associated with SOX10-bound promoters in sciatic nerve, measured in control- and cAMP-treated primary Schwann cells. Upregulated by cAMP (yellow), downregulated by cAMP (orange), unchanged (green), or not expressed in either condition (black). b In vivo expression levels of TSS classes (as in panel a). X-axis: log10 transformed average expression level (reads per million, RPM). Y-axis: the cumulative fraction of TSSs. Asterisk indicates p < 0.03. c Per-sample expression levels for TSSs associated with enriched gene ontology terms among TSSs that are upregulated (left) and downregulated (right) upon cAMP treatment. Expression values are scaled per TSS (Z-score)
Top 30 enriched gene ontology terms for transcription start sites that are upregulated upon primary Schwann cell differentiation, ranked by fold enrichment. Beneath each term is a list of the genes that contributed to the term’s enrichment
| Gene Ontology Term | Fold Enrichment | FDR-Corrected P-Value |
|---|---|---|
| Isopentenyl diphosphate biosynthetic process (GO:0009240) | 36.73 | 1.56E-02 |
| Isopentenyl diphosphate metabolic process (GO:0046490) | 36.73 | 1.55E-02 |
| Ribosomal small subunit export from nucleus (GO:0000056) | 26.23 | 2.99E-02 |
| Cholesterol biosynthetic process (GO:0006695) | 17.49 | 1.55E-08 |
| Secondary alcohol biosynthetic process (GO:1902653) | 17.08 | 1.87E-08 |
| SRP-dependent cotranslational protein targeting to membrane (GO:0006614) | 17.04 | 2.45E-19 |
| Coenzyme A metabolic process (GO:0015936) | 16.32 | 1.52E-02 |
| Cotranslational protein targeting to membrane (GO:0006613) | 16.2 | 5.64E-19 |
| Ribosomal small subunit assembly (GO:0000028) | 15.3 | 3.80E-03 |
| Sterol biosynthetic process (GO:0016126) | 15.3 | 4.86E-08 |
| Protein targeting to ER (GO:0045047) | 15.02 | 2.04E-18 |
| Viral transcription (GO:0019083) | 14.9 | 5.06E-19 |
| Establishment of protein localization to endoplasmic reticulum (GO:0072599) | 14.5 | 3.04E-18 |
| Regulation of cholesterol biosynthetic process (GO:0045540) | 14.24 | 2.22E-06 |
| Regulation of sterol biosynthetic process (GO:0106118) | 14.24 | 2.19E-06 |
| Viral gene expression (GO:0019080) | 14.24 | 2.34E-19 |
| Nuclear-transcribed mRNA catabolic process, nonsense-mediated decay (GO:0000184) | 13.77 | 7.89E-18 |
| Isoprenoid biosynthetic process (GO:0008299) | 13.12 | 1.47E-03 |
| Translational initiation (GO:0006413) | 13 | 2.20E-19 |
| Protein localization to endoplasmic reticulum (GO:0070972) | 12.96 | 1.83E-18 |
| Myelin assembly (GO:0032288) | 12.89 | 2.98E-02 |
| Nucleotide-excision repair, DNA duplex unwinding (GO:0000717) | 11.13 | 4.41E-02 |
| Regulation of cholesterol metabolic process (GO:0090181) | 10.93 | 1.50E-05 |
| Protein targeting to membrane (GO:0006612) | 10.65 | 4.14E-16 |
| Glucose 6-phosphate metabolic process (GO:0051156) | 10.65 | 4.94E-02 |
| Myelination (GO:0042552) | 10.59 | 4.51E-10 |
| Ensheathment of neurons (GO:0007272) | 10.39 | 5.84E-10 |
| Axon ensheathment (GO:0008366) | 10.39 | 5.70E-10 |
| Ribosome assembly (GO:0042255) | 10.05 | 8.16E-06 |
| Cytoplasmic translation (GO:0002181) | 9.62 | 1.17E-05 |
Top 30 enriched gene ontology terms for transcription start sites that are downregulated upon primary Schwann cell differentiation, ranked by fold enrichment. Beneath each term is a list of the genes that contributed to the term’s enrichment
| Gene Ontology Term | Fold Enrichment | FDR-Corrected P-Value |
|---|---|---|
| Vesicle transport along actin filament (GO:0030050) | 33.96 | 2.44E-02 |
| Wasl, Fnbp1l, Actn4 | ||
| Regulation of protein folding (GO:1903332) | 22.64 | 8.45E-03 |
| Activation of MAPKKK activity (GO:0000185) | 20.58 | 1.08E-02 |
| Response to laminar fluid shear stress (GO:0034616) | 18.86 | 2.87E-03 |
| Establishment of protein localization to mitochondrial membrane (GO:0090151) | 13.32 | 3.33E-02 |
| Membrane protein intracellular domain proteolysis (GO:0031293) | 12.58 | 3.83E-02 |
| Membrane protein proteolysis (GO:0033619) | 10.71 | 1.80E-03 |
| Response to fluid shear stress (GO:0034405) | 9.99 | 7.03E-03 |
| Multivesicular body sorting pathway (GO:0071985) | 8.84 | 3.32E-02 |
| Positive regulation of transcription from RNA polymerase II promoter in response to stress (GO:0036003) | 8.84 | 3.30E-02 |
| Interleukin-12-mediated signaling pathway (GO:0035722) | 8.61 | 4.59E-03 |
| Cortical actin cytoskeleton organization (GO:0030866) | 8.49 | 1.36E-02 |
| Cellular response to interleukin-12 (GO:0071349) | 8.25 | 5.46E-03 |
| Response to interleukin-12 (GO:0070671) | 8.08 | 6.06E-03 |
| ER-nucleus signaling pathway (GO:0006984) | 7.86 | 4.74E-02 |
| Cortical cytoskeleton organization (GO:0030865) | 7.55 | 2.25E-02 |
| Response to epidermal growth factor (GO:0070849) | 6.93 | 3.03E-02 |
| Response to unfolded protein (GO:0006986) | 6.41 | 2.32E-06 |
| Endoplasmic reticulum unfolded protein response (GO:0030968) | 6.23 | 8.34E-04 |
| Negative regulation of intracellular transport (GO:0032387) | 6.19 | 2.13E-02 |
| IRE1-mediated unfolded protein response (GO:0036498) | 6.17 | 4.57E-02 |
| Cellular response to unfolded protein (GO:0034620) | 6.03 | 2.09E-04 |
| Intracellular steroid hormone receptor signaling pathway (GO:0030518) | 5.74 | 2.91E-02 |
| Response to topologically incorrect protein (GO:0035966) | 5.6 | 9.46E-06 |
| Positive regulation of axonogenesis (GO:0050772) | 5.33 | 2.01E-02 |
| Regulation of transcription from RNA polymerase II promoter in response to stress (GO:0043618) | 5.23 | 2.69E-03 |
| Cellular response to topologically incorrect protein (GO:0035967) | 5.14 | 8.06E-04 |
| Cell redox homeostasis (GO:0045454) | 5.14 | 4.57E-02 |
| Endosome organization (GO:0007032) | 5.14 | 4.55E-02 |
| Rho protein signal transduction (GO:0007266) | 5.08 | 4.76E-02 |
Fig. 3Assessment of SOX10-associated transcription start sites in ΔSOX10 S16 cells. a Expression profiles of TSSs associated with SOX10-bound promoters in sciatic nerve, measured in ΔSOX10 S16 cells compared to unmodified S16s. Downregulated by loss of SOX10 (pink), upregulated by loss of SOX10 (purple), unchanged (blue), or not expressed in either condition (black). b In vivo expression levels of TSS classes (as in panel A). X-axis: log10 transformed average expression level (reads per million, RPM). Y-axis: the cumulative fraction of TSSs. Asterisk indicates p < 0.05. c Per-sample expression levels for TSSs associated with enriched gene ontology terms among TSSs that are downregulated upon loss of SOX10. Expression values are scaled per TSS (Z-score)
Top 15 enriched gene ontology terms for transcription start sites that are downregulated upon loss of SOX10 in S16 cells, ranked by fold enrichment. Beneath each term is a list of the genes that contributed to the term’s enrichment
| Gene Ontology Term | Fold Enrichment | FDR-Corrected P-Value |
|---|---|---|
| Regulation of myelination (GO:0031641) | 17.54 | 1.41E-02 |
| Myelination (GO:0042552) | 15.35 | 1.86E-07 |
| Ensheathment of neurons (GO:0007272) | 15.06 | 1.16E-07 |
| Axon ensheathment (GO:0008366) | 15.06 | 7.73E-08 |
| Substantia nigra development (GO:0021762) | 13.35 | 3.88E-02 |
| Gliogenesis (GO:0042063) | 7.53 | 1.51E-04 |
| Glial cell differentiation (GO:0010001) | 7.44 | 2.38E-03 |
| Regulation of supramolecular fiber organization (GO:1902903) | 4.84 | 2.53E-03 |
| Extracellular structure organization (GO:0043062) | 3.87 | 4.98E-02 |
| Biological adhesion (GO:0022610) | 3.19 | 1.60E-03 |
| Cell adhesion (GO:0007155) | 3.07 | 2.72E-03 |
| Neuron development (GO:0048666) | 2.9 | 2.82E-02 |
| Regulation of cellular component movement (GO:0051270) | 2.79 | 1.48E-02 |
| Cytoskeleton organization (GO:0007010) | 2.65 | 1.81E-02 |
| Plasma membrane bounded cell projection organization (GO:0120036) | 2.5 | 4.96E-02 |
Fig. 4SOX10-dependent transcription start sites are associated with high-affinity SOX10 binding at conserved motifs and restricted expression profiles. a Aggregate SOX10 ChIP-Seq data in the 2-kilobase region surrounding TSSs that were downregulated, upregulated, or unchanged in ΔSOX10 S16 cells as in Fig. 3a. X-axis: genomic distance from the TSS (base pairs, bp). Y-axis: average SOX10 ChIP-Seq signal (RPM, reads per million). Asterisk indicates p < 0.001. b and c Monomeric (b) and dimeric (c) SOX10 binding motifs underlying SOX10 ChIP-Seq peaks associated with each class of TSS. Y-axis: distribution of number of motifs per peak. d and e Conservation scores of monomeric (d) and dimeric (e) SOX10 binding motifs as defined in panels b and c. Y-axis: distribution of conservation scores. b-e Whiskers extend to the 5th and 95th percentile of the data. Asterisks indicate p < 0.01. f CpG islands surrounding the downregulated, upregulated, and unchanged TSSs from the ΔSOX10 S16 model. X-axis: distance from the TSS (bp, base pairs). Y-axis: fraction of regions harboring a CpG island, calculated in 100 bp bins. g Tau scores for TSSs that were downregulated, upregulated, and unchanged in the ΔSOX10 S16 model, measured across 11 mouse tissues. Y-axis: distribution of Tau scores. Whiskers extend to the 5th and 95th percentile of data. Asterisk, p < 0.001
Fig. 5SOX10 regulates the expression of a novel transcription start site at Arpc1a. a The genomic region surrounding exons 7 and 8 of the rat Arpc1a locus. Y-axes for H3K4me3 and SOX10 ChIP-Seq data: fold enrichment of sequencing reads above chromatin input. Y-axes for Tn5Prime data from rat sciatic nerve, CPT-cAMP- (cAMP) and vehicle-treated (Control) primary Schwann cells, and unmodified and ΔSOX10 S16 cells: number of transcript 5’ends mapped per base, in reads per million. b The 904-base pair ARPC1A Prom 2 is shown along with the position of the two SOX10 dimeric consensus sequences (red bars and red text). The seven species utilized for comparative sequence analysis are shown on the left. cARPC1A Prom 2 (with or without the dimeric SOX10 sequences, as indicated) was tested in luciferase reporter assays in cultured Schwann (S16) cells. Y-axis: fold induction of luciferase activity; error bars indicate standard deviations. Asterisk indicates p < 0.05. d Annotated ARPC1A protein isoforms 1 and 2 contain WD40 repeats (WD, blue) and isoform-specific N-terminal sequences (purple). A cartoon of the predicted protein product of the SOX10-regulated Arpc1a transcript (isoform 3) is presented at the bottom of the panel. Predicted molecular weights based on amino acid sequences are shown in kilodaltons (kDa) on the right
Fig. 6SOX10-dependent expression of Chn2 transcripts originating at exon 1D. a The genomic region surrounding exon 1B at the rat Chn2 locus. Y-axes for H3K4me3 and SOX10 ChIP-Seq data: fold enrichment of sequencing reads above chromatin input. Y-axes for Tn5Prime data from rat sciatic nerve, CPT-cAMP- (cAMP) and vehicle-treated (Control) primary Schwann cells, and unmodified and ΔSOX10 S16 cells: number of transcript 5’ends mapped per base, in reads per million. b The 844-base pair CHN2 Prom 4 is shown along with the position of the SOX10 dimeric consensus sequence (red bars and red text). The seven species utilized for comparative sequence analysis are shown on the left. cCHN2 Prom 2 (with or without the dimeric SOX10 sequence, as indicated) was tested in luciferase reporter assays in cultured Schwann (S16) cells. Y-axis: fold induction of luciferase activity; error bars indicate standard deviations. Asterisk indicates p < 1 × 10− 6. d β-chimaerin isoforms 1, 2, and 3. β2-and β3-chimaerins contain Src-homology 2 (SH2, magenta), diacylglycerol binding (C1, blue), and Rac-GTPase activating (Rac-GAP, orange) domains, and are distinguished by isoform-specific N-terminal sequences (red and green). β1-chimaerin includes an isoform-specific N-terminal sequence (purple). Predicted molecular weights based on amino acid sequences are shown in kilodaltons (kDa) on the right
Fig. 7SOX10 regulates the expression of a novel Ddr1 transcription start site in Schwann cells. a The genomic locus at rat Ddr1 intron 1. Y-axes for H3K4me3 and SOX10 ChIP-Seq data: fold enrichment of sequencing reads above chromatin input. Y-axes for Tn5Prime data from rat sciatic nerve, CPT-cAMP- (cAMP) and vehicle-treated (Control) primary Schwann cells, and unmodified and ΔSOX10 S16 cells: number of transcript 5’ends mapped per base, in reads per million. b The 712-base pair DDR1 Prom 5 is shown along with the position of the SOX10 dimeric consensus sequence (red bars and red text). The seven species utilized for comparative sequence analysis are shown on the left. cDDR1 Prom 5 (with or without the dimeric SOX10 sequence, as indicated) was tested in luciferase reporter assays in cultured Schwann (S16) cells. Y-axis: fold induction of luciferase activity; error bars indicate standard deviations. Asterisk indicates p < 1 × 10− 4. d DDR1 isoforms a, b, and c contain discoidin (DS, green), discoidin-like (DS-like, magenta), transmembrane (TM, blue) and kinase (KD, orange) domains, and are distinguished by insertions in the intracellular sequences (red and light blue). Predicted molecular weights based on amino acid sequences are shown in kilodaltons (kDa) on the right. e DDR1 protein expression in sciatic nerve, primary Schwann cells, and S16 cells. Actin was used as a protein loading control. Numbered dashes to the left of each blot indicate the position of protein size markers in kilodaltons (kDa). f The intensity of the lower DDR1 band relative to Actin signal in control and cAMP-treated primary Schwann cells. The average across three independent samples is indicated by the bar height. Error bars indicate standard deviations
Fig. 8SOX10-dependent expression of a Gas7 transcription start site at exon 1B. a The genomic region surrounding GAS7 exon 1B at the rat Gas7 locus. Y-axes for H3K4me3 and SOX10 ChIP-Seq data: fold enrichment of sequencing reads above chromatin input. Y-axes for Tn5Prime data from rat sciatic nerve, CPT-cAMP- (cAMP) and vehicle-treated (Control) primary Schwann cells, and unmodified and ΔSOX10 S16 cells: number of transcript 5’ends mapped per base, in reads per million. b The 835-base pair GAS7 Prom 2 is shown along with the position of the SOX10 dimeric consensus sequences (red bars and red text). The seven species utilized for comparative sequence analysis are shown on the left. cGAS7 Prom 2 (with or without the dimeric SOX10 sequences, as indicated) was tested in luciferase reporter assays in cultured Schwann (S16) cells. Y-axis: fold induction of luciferase activity; error bars indicate standard deviations. Asterisk indicates p < 0.005. d GAS7 isoforms a, b, c, and d. Isoform Gas7-c contains Src homology 3 (SH3, green), WW (magenta), and Fes/Cip4 homology (FCH, blue) domains. Gas7-a contains an isoform-specific N-terminal sequence (gold). Predicted molecular weights based on amino acid sequences are shown in kilodaltons (kDa) on the right. e GAS7 protein expression in sciatic nerve, primary Schwann cells, and S16 cells. AARS was used as a protein loading control. Numbered dashes to the left of each blot indicate the position of protein size markers in kilodaltons (kDa). f The intensity of the lower GAS7 band relative to AARS signal in control and cAMP-treated primary Schwann cells. The average across three independent samples is indicated by the bar height. Error bars indicate standard deviation