| Literature DB >> 29968757 |
Satoshi Okawa1, Carmen Saltó2, Srikanth Ravichandran1, Shanzheng Yang2, Enrique M Toledo2,3, Ernest Arenas2, Antonio Del Sol4,5.
Abstract
Single-cell RNA sequencing allows defining molecularly distinct cell subpopulations. However, the identification of specific sets of transcription factors (TFs) that define the identity of these subpopulations remains a challenge. Here we propose that subpopulation identity emerges from the synergistic activity of multiple TFs. Based on this concept, we develop a computational platform (TransSyn) for identifying synergistic transcriptional cores that determine cell subpopulation identities. TransSyn leverages single-cell RNA-seq data, and performs a dynamic search for an optimal synergistic transcriptional core using an information theoretic measure of synergy. A large-scale TransSyn analysis identifies transcriptional cores for 186 subpopulations, and predicts identity conversion TFs between 3786 pairs of cell subpopulations. Finally, TransSyn predictions enable experimental conversion of human hindbrain neuroepithelial cells into medial floor plate midbrain progenitors, capable of rapidly differentiating into dopaminergic neurons. Thus, TransSyn can facilitate designing strategies for conversion of cell subpopulation identities with potential applications in regenerative medicine.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29968757 PMCID: PMC6030214 DOI: 10.1038/s41467-018-05016-8
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Principle of transcriptional synergy and method overview. a The method requires single-cell RNA-seq data classified into distinct subpopulations as input and identifies most synergistic transcriptional cores for each subpopulation. b Comparison of pair-wise MI between individual TF pairs with joint MI between two TFs together and a third one. For a combination of TFs to be synergistic, the sum of pair-wise MIs has to be less than the joint MI (i.e., negative MMI). Any permutation of same set of TFs results in the same MMI value. c Dynamic search for identifying the most synergistic transcriptional cores, in which the “seed” 3-TF combinations are progressively expanded by an addition of another TF one by one. The search is terminated when there is no more increase in synergy when adding a new TF to the current best combination and the current combination is considered the most synergistic transcriptional core
Most synergistic transcriptional cores predicted by TransSyn and top 10 JSD TFs in example subpopulations, where known identity TFs are in bold
| Data set | Cell subpopulation | Synergistic transcriptional core | Top 10 JSD TFs (incl. ties) |
|---|---|---|---|
| Treutlein et al. 2014 | Lung surfactant-secreting cuboidal alveolar type 2 cell | Atf4, Fos, | Gfi1, Hes7, Insm1, Mesp2, Nr2e1, Phox2a, Sp5, Tox2, Zbtb12, Zfp251, Zfp398, Zkscan16 |
| Grün et al. 2015 | Intestinal organoid enterocyte precursor | Alx4, Ar, Ebf2, Esx1, Foxp2, Gm14393, Lhx2, Pou3f2, Rarb, Snai2, Sox8, Tead4, Tlx1, Zfhx4, Zfp52, Zfp532 | |
| Grün et al. 2015 | Intesitinal organoid enteroendocrine cell | Evx1, Hsf5, Ikzf2, Irx3, Lef1, Obox3, Peg3, Sall2, Sp8, Zfp14, Zfp867 | |
| Chu et al. 2016 | H9 ESC |
| DMRTB1, EGR4, HES3, INSM1, NKX2-6, OLIG3, PAX9, PITX3, SIX6, TFAP2B |
| Scialdone et al.[ | Embryonic blood progenitor | Id3, Hes1, | Dbx2, Emx2, Gfi1, |
| Scialdone et al.[ | Embryonic visceral endoderm | Peg3, Ybx1, | Creb3l3, En1, Foxa1, |
| Gokce et al. 2016 | Striatal neuron | Ar, Atoh7, | |
| Gokce et al. 2016 | Striatal microglia | Arid3c, Atoh1, Batf3, Ebf3, Hoxd10, Mlxipl, Pax7, Pitx2, Sox15, Zfp69 | |
| Gokce et al. 2016 | Striatal vascular endothelial cell | Csde1, Hmgb1, Nfia, | Alx1, Erg, Foxc2, Foxl2, Nfatc4, Tbx1, Tbx2, Tbx4, Tcf21, Vsx1 |
| Joost et al. 2016 | Upper hair follicle I | Jun, | Prdm14, Tbx19, Alx1, Insm2, Gm9376, Pou4f3, Prrx2, Rex2, Obox6, Sox3, Rnf138rt1 |
| Segerstolpe et al. 2016 | Pancreatic alpha cell | EVX1, FOXD4L3, GBX2, HOXC11, IFNB1, MEF2B, MYF6, POU3F3, SP9, SSX1 | |
| Segerstolpe et al. 2016 | Pancreatic beta cell | ENO1, | BHLHE23, DBX1, FERD3L, FOXR2, HOXB8, MYF5, OTP, SOHLH2, SOX3, TFAP2B |
| LaManno et al. 2016 | Fetal dopaminergic neuron type 2 | ALX4, ASCL2, ENF, FEZF1, FOXH1, NKX2-1, PRDM12, RAX2, TBX15, TBX22 | |
| LaManno et al. 2016 | Fetal progenitor medial floorplate | FOXB1, FOXD4L1, GFI1, HNF1A, MESP2, NR0B1, NR5A2, TBX5, ZNF99 |
Unique top 10 hub TFs in GRNs for the example subpopulations in Table 1. Known identity TFs are in bold.
| Data set | Cell subpopulation | Unique top 10 hub TFs |
|---|---|---|
| Treutlein et al. 2014 | Lung surfactant-secreting cuboidal alveolar type 2 cell | 2610008e11rik, Bcl11b, Crem, E2f3, Elf2, Foxq1, Gfi1, Hsf1, Ikzf2, Ikzf4 |
| Grün et al. 2015 | Intestinal organoid enterocyte precursor | Arx, Esrrg, Foxd2, Hoxa1, Hoxa4, Neurod2, Sox7, St18, Tox3, Zfp532 |
| Grün et al. 2015 | Intesitinal organoid enteroendocrine cell | 2700081O15rik, 5730507C01rik, Arntl, Atf6b, Dnajc2, Ehf, Erf, Etv5, Fiz1 |
| Chu et al. 2016 | H9 ESC | AEBP2, ARID1A, HIF1A, MIER1, TCF4, TSC22D2, ZNF146, ZNF286A, ZNF441, ZNF814 |
| Scialdone et al.[ | Embryonic blood progenitor | Dnajc2, E2f3, |
| Scialdone et al.[ | Embryonic visceral endoderm | Dmrta2, Elf4, Fosl1, Foxl1, Foxp1, Glis3, Klf14, Klf4, Smad9, Vdr |
| Gokce et al. 2016 | Striatal neuron | Csde1, Dbp, Erf, Gatad1, Hmgb1, Hsf2, Jund, Mier3, Thrb, Zfhx3 |
| Gokce et al. 2016 | Striatal microglia | Bhlha15, Lef1, Prox2, Tbx3, Zbtb17, Zfp113, Zfp184, Zfp82 |
| Gokce et al. 2016 | Striatal vascular endothelial cell | Arx, Sox21, Sp4, Tfap4, Tox3, Tshz2, Vsx1, Zfp433, Zfp579, Zfp709 |
| Joost et al. 2016 | Upper hair follicle I | Creb3l3, Lhx9, Rhox3f, Tal1, Zscan20 |
| Segerstolpe et al. 2016 | Pancreatic alpha cell | CARM1, DEAF1, JUNB |
| Segerstolpe et al. 2016 | Pancreatic beta cell | LDB1, |
| LaManno et al. 2016 | Fetal dopaminergic neuron type 2 | AFF2, DACH2, FOXJ2, MAF, MEF2D, ZBTB48, ZNF354B, ZNF555, ZNF771 |
| LaManno et al. 2016 | Fetal progenitor medial floorplate | AHR, ARID3A, BARHL2, CEBPD, FOXF2, PRDM13, SCRT1, ZNF497, ZNF557 |
Fig. 2Conversion of basal hNES cells into medial floor plate midbrain progenitors (hProgFPM) by treatment of proliferating hNES for 2 days with the smoothen agonist (SAG, 500 nM) and Dickoppf1 (Dkk1, 150 ng/ml). a Transcription factors forming the synergistic transcriptional core of hNES cells and required for their conversion to midbrain progenitors. b Schematic representation of the treatment followed to convert proliferating hNES into HProgFPM, which included SAG, smoothen agonist (500 nM) and Dkk1, Dickoppf1 (150 ng/ml). c–f RT-qPCR analysis at day 2 showing the expression of TFs that define cell populations in the midbrain and hindbrain regions: EN1 (c) and PAX2 (d); the ventral midbrain: OTX2:GBX2 ratio (e) and FOXA2 (f), as well as more lateral compartments: PAX6 (g) and IRX3 (h). i Comparison of OTX2+cells in control and SAB+Dkk1-converted NES cells. Scale 50 μm. Box plots (c–h): Center line, median; hinges, 25% and 75% quartiles; whiskers, 1.5 interquartile range. Statistics: t-test; *p ≤ 0.05; *p ≤ 0.01. N = 3
Fig. 3Conversion of hNES cells into hPRogFPM and their differentiation into midbrain dopaminergic neurons. a Schematic representation of the conversion and differentiation protocols. b, c RT-qPCR analysis at day 8, showing the expression of midbrain-hindbrain TFs, such as OTX2, GBX2, LMX1A, and FOXA2 (b), as well as the dopaminergic neuron markers, NR4A2, TH, SLC18A22, and SLC6A3 (c). d Immunocytochemistry analysis of the presence of OTX2 and TH in control unconverted NES cultures, compared with NES cells converted with SAF+Dkk1 and differentiated until day 8. e, f Percentage of OTX2+ and TH+ cells in the conditions in d. P = 0.02673 (e), P = 0.03233 (f), n = 3. g–i Expression of the key midbrain TFs, LMX1A, NR4A2, and PBX1, in TH+cells derived from SAI2-NES cells after conversion and differentiation. j, k TH+ cells express the mature neuronal marker, MAP2 (j), and some acquire mature neuronal morphologies, with long processes and varicosities at day 8 (k). Scale 50μm. Box plots (b, c, e, f): Center line, median; hinges, 25% and 75% quartiles; whiskers, 1.5 interquartile range. Statistics: (b, c) ANOVA, followed by pair-wise t-test with Bonferroni correction for multiple testing. (e, f). Two sample t-test; *P ≤ 0,05; **P ≤ 0,01; ***P ≤ 0,001. N = 3 (GBX2, FOXA2, TH, SLC6A3), n = 4 (LMX1A, OTX2, NR4A2, SLC18A2)