Literature DB >> 33125830

TransSynW: A single-cell RNA-sequencing based web application to guide cell conversion experiments.

Mariana Messias Ribeiro1, Satoshi Okawa1,2, Antonio Del Sol1,3,4.   

Abstract

Generation of desired cell types by cell conversion remains a challenge. In particular, derivation of novel cell subtypes identified by single-cell technologies will open up new strategies for cell therapies. The recent increase in the generation of single-cell RNA-sequencing (scRNA-seq) data and the concomitant increase in the interest expressed by researchers in generating a wide range of functional cells prompted us to develop a computational tool for tackling this challenge. Here we introduce a web application, TransSynW, which uses scRNA-seq data for predicting cell conversion transcription factors (TFs) for user-specified cell populations. TransSynW prioritizes pioneer factors among predicted conversion TFs to facilitate chromatin opening often required for cell conversion. In addition, it predicts marker genes for assessing the performance of cell conversion experiments. Furthermore, TransSynW does not require users' knowledge of computer programming and computational resources. We applied TransSynW to different levels of cell conversion specificity, which recapitulated known conversion TFs at each level. We foresee that TransSynW will be a valuable tool for guiding experimentalists to design novel protocols for cell conversion in stem cell research and regenerative medicine.
© 2020 The Authors. STEM CELLS TRANSLATIONAL MEDICINE published by Wiley Periodicals LLC on behalf of AlphaMed Press.

Entities:  

Keywords:  cellular therapy; clinical translation; differentiation; direct cell conversion; genomics; reprogramming; synergy; transcription factors

Mesh:

Substances:

Year:  2020        PMID: 33125830      PMCID: PMC7848352          DOI: 10.1002/sctm.20-0227

Source DB:  PubMed          Journal:  Stem Cells Transl Med        ISSN: 2157-6564            Impact factor:   6.940


The study proposes a computational web application, TransSynW. To the best of the author's knowledge, it is the only computational tool that can identify cell conversion transcription factors (TFs) for any cell population in single‐cell RNA‐sequencing data. TransSynW does not require prior biological information, computer programming, and users computational resources. In addition, TransSynW prioritizes pioneer factors among predicted conversion TFs to facilitate chromatin opening often required for cell conversion. Furthermore, TransSynW predicts marker genes for assessing the performance of cell conversion experiments. Thus, TransSynW will be a staple tool for guiding experimentalists to design novel protocols for cell conversion in stem cell research and regenerative medicine.

INTRODUCTION

Cell conversion is fundamental to many biological processes. Control of cell conversion has significant relevance in stem cell research. For example, generation of functionally specific cells by cell conversion is of clinical interest for cell replacement therapies. However, several roadblocks need to be overcome for achieving optimal cell conversion, such as the accurate characterization of cell populations and the identification of cell conversion factors. Single‐cell RNA‐sequencing (scRNA‐seq) technologies have made it possible to address these challenges. Due to the greater amount of scRNA‐seq data generated across the world, experimental researchers are increasingly expressing their interest in deriving novel functional cell types. Here, we present TransSynW, a scRNA‐seq based web application for identifying cell conversion transcription factors (TFs) applicable in stem cell and clinical research (Figure 1A). It prioritizes pioneer factors (PFs) in the prediction of conversion TFs. Evidence suggests that PFs have a key role in chromatin opening, a process often required for cell conversion. Indeed, including PFs on cell conversion protocols has been shown to improve their outcome. Furthermore, it predicts marker genes for each target cell type, enabling researchers to assess the fidelity of experimentally converted cells. In addition, it is user‐friendly, and it does not require users' computer programming or computational resources. We also created a comprehensive video tutorial for guiding users through the web interface.
FIGURE 1

A, Application of TransSynW to stem cell research and regenerative medicine. B, Schematic overview of TransSynW algorithm (see also Methods). First, transcription factors (TFs) most specifically expressed in the selected target cell population (specific TFs) and nonspecifically expressed pioneer factors (PFs) are computed. The most synergistic combination of specific TFs and nonspecific PFs is then identified. The predicted set of TFs are ranked by expression fold change between target and starting cell populations. In parallel, top 10 candidate marker genes for target cell population are computed by JSD

A, Application of TransSynW to stem cell research and regenerative medicine. B, Schematic overview of TransSynW algorithm (see also Methods). First, transcription factors (TFs) most specifically expressed in the selected target cell population (specific TFs) and nonspecifically expressed pioneer factors (PFs) are computed. The most synergistic combination of specific TFs and nonspecific PFs is then identified. The predicted set of TFs are ranked by expression fold change between target and starting cell populations. In parallel, top 10 candidate marker genes for target cell population are computed by JSD The application of TransSynW to various cell systems well‐recapitulated known cell conversion TFs and made novel predictions, including the phenotypic conversion between cells in organoids and their in vivo counterparts. Moreover, predicted marker genes were consistent with experimentally known ones. These results highlight the applicability of TransSynW to a wide range of cell conversion experiments.

RESULTS

Method overview

The TransSynW algorithm first identifies specifically and nonspecifically expressed TFs, and selects the combination that exhibits the highest synergistic interactions among them (see Methods) (Figure 1B). Notably, here we considered for the nonspecific part only PFs that have previously been reported to be involved in cell conversion protocols (Table S1). Predicted conversion TFs are then ranked by the expression fold change between the target and starting cell populations and users can prioritize the TFs for experimental follow‐ups based on this ranking. We compiled the scRNA‐seq data of starting cell types frequently used in cell conversion experiments from various scRNA‐seq platforms (Table S2). For optimal results, users are recommended to use starting and target cell type data obtained from the same scRNA‐seq platform or, if not available, from the closest sequencing platform. In general, it is recommended to select at least one PF and one specific TF from the predicted conversion TFs. It may be advisable to select more factors if the phenotypic difference between the starting and target cell types is large. Finally, TransSynW also predicts potential marker genes of the target cell populations. This feature enables researchers to select markers for assessing the performance of their cell conversion experiments.

Application to various cell conversions

To demonstrate the applicability of TransSynW, we applied it to different cell systems, which encompassed conversions into broad cell types, subtypes, and phenotypic states (Tables 1 and S3). For example, in the first category, FOXA2, FOXA3, and HNF4A were predicted for the hepatocyte, which, together with HNF1A predicted in the specific part, are known for hepatocyte conversion. The predicted TFs for the beta cells included NKX6‐1, MAFA, PDX1, and NEUROD1, which have been shown to induce beta cell conversion. , , Moreover, in both cases the predicted marker genes recapitulated commonly used ones (Tables 2 and S4). Indeed, many predicted conversion TFs are known to regulate each other and the predicted marker genes (Figure 2A,B), supporting the biological relevance of synergistic interactions captured by TransSynW.
TABLE 1

Predicted specific transcription factors (TFs) and nonspecific PFs

Cell typeSpecific TFsNonspecific PFsAnnotation in dataData source (PubMed ID)
(1) Conversion into broad cell type
Myoblast MYF5, MYOD1, PAX7, GLIS3, PAX3 CEBPB, IRF8, PBX11,3,4,5,730283141
Keratinocyte TRP63, GATA3, NFIB KLF4, GRHL2, CEBPA0‐1630283141
Cardiomyocyte NKX2‐5, TBX5, PROX1, ZFP579, NR0B2 GATA4, MEIS1, PBX19,1430283141
HepatocyteNR1I2, ZFP750, ZFHX4, HNF1A, ZBTB48 HNF4A, FOXA3, FOXA2 4,5,10,11,12,1530283141
HSC HLF, HOXA9, GATA2, TAL1, MYCN CEBPB, CEBPA, PBX1 0,4,830283141
NeuronEOMES, NEUROD6, EGR4, RARB, DLX6 FOXG1, NEUROD1, PBX19,10,1230283141
Oligodendrocyte/OPC NKX6‐2, OLIG1, SOX10, OLIG2, NFE2L3SOX20,6,1130283141
MacrophageRUNX3, BATF3, BATF, NFE2, E2F1 SPI1, CEBPA, ARID3ADifferent tissues30283141
Beta cell NKX6‐1, PDX1, MAFA, OVOL2, MNX1 NEUROD1, ISL1, FOXA2 0,8,9,11,1730283141
NSCZFP275, ASCL1, TCF3 FOXG1, SOX2, PBX1All young NSCs30827680
(2) Conversion into subtype
Dopaminergic neuronNPAS4, MYT1L, EBF3, POU6F1, BNC2 FOXA2, ASCL1, GATA3 hDA27716510
Medial floorplate progenitor LMX1A, SP2, NR2F6, LMX1B, HMGA2 FOXA2, ASCL1, SOX2hProgFPM27716510
GABAergic neuroblastGATA3, SOX14, MYT1L, BNC2, ZBTB38 ASCL1, SOX2, PBX1hNbGaba27716510
Oculomotor neuron PHOX2B, PHOX2A, ISL1, RXRG, NR2F2FOXA2, ASCL1, “PBX1hOMTN27716510
Serotonin neuron FEV, GATA3, SOX1, DPF1, LMX1B GATA2, PBX1hSert27716510
CD4+ central memory T cellRBSN, RFX3, NR4A1, KLF9, ID3GATA3, CEBPBTCM29352091
CD8+ memory T cell EOMES, BACH2, KLF7, MYC, ID3 CEBPB, GATA3 4,6,11,1331754020
Memory B cellKLF13, LMO4, PCBD1, KLF10, ZBTB38IRF8, SPI1, CEBPBMemory B cell31968262
(3) Phenotype conversion
Primed mESC 1 LIN28A, MYC, ID1, FOXP1, ID3 POU5F1, ESRRB, KLF4 FBSLIF25471879
Naive mESC 1ZFHX2, MEIS2, ZIC2 POU5F1, ESRRB, KLF4 2iLIF
Primed mESC 2 LIN28A, FOXP1, SOX4 SOX2, POU5F1, KLF4 mES_lif26431182
Naive mESC 2SPIC, MITF, MEIS2 ESRRB, KLF4, POU5F1 mES_2i
Active NSCCENPS, EGR1, INSM1, MXD3, E2F1 ASCL1, SOX2, PBX1All young aNSCs30827680
Quiescent NSCDBP, EPAS1, ID2 FOXG1, PBX1, ASCL1 All young qNSCs
Fetal hepatocyteZGPAT, KLF11, ZBTB20 GATA4, HNF4A, CEBPA Fetal hepatocyte30500538
Organoid hepatocyteHES6, LEF1, THAP8, SOX9, HTT FOXA2, HNF4A, MEIS1Fetal hepatocyte organoid
Adult hepatocyte 1KLF9, CEBPD, KLF6 FOXA2, HNF4A, CEBPBHepatocyte31292543
Adult hepatocyte 2SCAND1, NR3C1, EDF1 HNF4A, FOXA2, PBX1Hepatocyte30348985
Adult excitatory neuronMLXIPL, PEG3, HLF, BHLHE40, KLF9 FOXG1, CEBPB, PBX1adult_Ex31619793
Organoid excitatory neuron NEUROG2, SOX11, SOX4, CSRP2, CARHSP1 FOXG1, PBX1hOrga_EN
Adult inhibitory neuronPEG3, MLXIPL, HLF, PPARGC1A, KLF9 FOXG1, SOX2, PBX1adult_In31619793
Organoid inhibitory neuronSIX3, PAX6, ID4, KLF10, MEIS2 ASCL1, SOX2, SOX9hOrga_IN

Note: Experimentally validated conversion TFs are marked in bold. TFs are ordered from left to right by fold change to MEF/HFF. Cluster IDs annotated to same cell types in PanglaoDB were merged prior to analysis. Macrophage data from different tissues (heart, kidney, lung, muscle, brain, pancreas, skin spleen, trachea) were merged. See Table S3 for literature evidence for predicted conversion TFs.

TABLE 2

Predicted marker genes with documented evidence

Cell typePredicted marker gene with evidenceReference (PubMed ID or website)
(1) Conversion into broad cell type
MyoblastCALCR, FGFR4, DES, ANKRD1, FITM112223412, 26440893, 26492245, 24644428, 8120103
KeratinocyteKRT522028850
CardiomyocyteNPPA, MYH627123009, https://www.rndsystems.com/cn/research‐area/cardiac‐stem‐cell‐markers
HepatocyteSRD5A2, FGF2125974403, 28515909
HSCESAM, LHCGR, SLC22A3, TIE1, ANGPT1, RBP1

https://www.rndsystems.com/cn/research‐area/hematopoietic‐stem‐cell‐markers

27365425, 27225119

NeuronHTR2C, NTNG1, HS6ST330078709
Oligodendrocyte/OPCMAG, CLDN11, PLEKHH1, ASPA, TRF29024657
MacrophageFOLR2, F13A1, LYZ2, PF4, MGL2, MMP13, CLEC10A28576768, 29622724, 25477711,
Beta cellINS1, INS2, G6PC222745242, 15133852, 25322827
NSCNUDC, TUBA1B, TUBA1A21771589, 29057214, 29281841
(2) Conversion into subtype
Dopaminergic neuronALDH1A1, TH30096314, http://www.abcam.com/neuroscience/neural‐markers‐guide
Medial floorplate progenitorWNT1, MDK31080111, 24125182, 11750071
GABAergic neuroblastGAD2 http://www.abcam.com/neuroscience/neural‐markers‐guide
Oculomotor neuronPRPH, FGF10, SLIT3, EYA124549637, 9221911, 20215354, 31080111
Serotonin neuronTPH2, SLC6A4 http://www.abcam.com/neuroscience/neural‐markers‐guide
CD8+ memory T cellSELL, CXCR5, DRC129236683, 18000950, 30243945
Memory B cellTNFRSF13B, CD27Company ebioscience, miltenyibiotec
(3) Phenotype conversion
Primed mESC 1BMP426860365
Active NSCCENPF29727663
Quiescent NSCGJA129727663
Fetal hepatocyteFGB, CYP2E128166538, 29622030
Adult hepatocyte 1CYP3A426838674
Adult hepatocyte 2APOA128166538
Adult excitatory neuronCCK12815247
Adult inhibitory neuronCCK, PVALB, CRH12815247, 2196836, 2843570

Note: See Table S4 for full list of predicted marker genes.

FIGURE 2

Transcriptional regulatory interactions among predicted conversion transcription factors (TFs) and marker genes for, A, hepatocyte and B, beta cell. Interaction data were retrieved from MetaCore from Clarivate Analytic in May/2020. C, Experimental strategy to improve cell conversion protocols for GABAergic neurons (Gaba) and medial floorplate progenitor (ProgFPM) based on TransSynW predicted core TFs. Dashed outlines represent nonvalidated TFs in the literature. D, Processing time vs number of cells in input scRNA‐seq file (n = 3). Target population size was fixed to 8% of total size. E, Processing time for Rds files vs number of cells in target population (n = 3). Input population size was fixed to 10 000

Predicted specific transcription factors (TFs) and nonspecific PFs Note: Experimentally validated conversion TFs are marked in bold. TFs are ordered from left to right by fold change to MEF/HFF. Cluster IDs annotated to same cell types in PanglaoDB were merged prior to analysis. Macrophage data from different tissues (heart, kidney, lung, muscle, brain, pancreas, skin spleen, trachea) were merged. See Table S3 for literature evidence for predicted conversion TFs. Predicted marker genes with documented evidence https://www.rndsystems.com/cn/research‐area/hematopoietic‐stem‐cell‐markers 27365425, 27225119 Note: See Table S4 for full list of predicted marker genes. Transcriptional regulatory interactions among predicted conversion transcription factors (TFs) and marker genes for, A, hepatocyte and B, beta cell. Interaction data were retrieved from MetaCore from Clarivate Analytic in May/2020. C, Experimental strategy to improve cell conversion protocols for GABAergic neurons (Gaba) and medial floorplate progenitor (ProgFPM) based on TransSynW predicted core TFs. Dashed outlines represent nonvalidated TFs in the literature. D, Processing time vs number of cells in input scRNA‐seq file (n = 3). Target population size was fixed to 8% of total size. E, Processing time for Rds files vs number of cells in target population (n = 3). Input population size was fixed to 10 000 Next, we analyzed different subtypes of neurons, as they are one of the most well studied subtypes. Among the predicted TFs for dopaminergic (DA) neurons, MYT1L, ASCL1, FOXA2, and GATA3 have been shown to generate DA neurons. , , The predicted TFs for the medial floorplate progenitor, LMX1A and FOXA2, are consistent with the previous attempt to derive this cell subtype. ASCL1 is sufficient to convert fibroblasts into GABAergic neurons. Consistently, the predicted TFs for GABAergic neuroblasts contained ASCL1 and no other TFs known to generate other neuronal subtypes. The predicted TFs for oculomotor neuron included ISL1, PHOX2A, and PHOX2B which have been reported to generate motor neurons via a synergistic interaction. , FEV, GATA2, and LMX1B were predicted for serotonergic neurons, which are among the TFs used for deriving this cell subtype. We considered memory T and B cells as subtypes of their naive counterparts. Although a defined set of TFs for generating T cells has not been reported, the nonspecific PFs for both CD4+ and CD8+ T cells contained GATA3 and CEBPB, suggesting that these factors are primary candidates for experimental validation. Indeed, GATA3 is implicated in CD8+ memory T cell conversion. Among the specific TFs, ID3, MYC, BACH2, and EOMES are reported to initiate CD8+ memory T cell conversion. , , The known marker genes, such as SELL and CXCR5, were also identified. Finally, the nonspecific PFs for the memory B cells included IRF8 and SPI1, which together are implicated in the generation of B cell memory. Another type of cell conversion is phenotypes of a same cell type. The predicted nonspecific PFs for the two mouse embryonic stem cells (mESC) datasets are known to induce pluripotency. , , The specific conversion TFs predicted for both primed mESC populations were LIN28A and FOXP1. LIN28A is known to induce the transition from naive to primed mESCs. FOXP1 is implicated in maintaining pluripotency under non‐2i conditions. Whether FOXP1 induces a transition from a naive state to a primed state calls for further investigations. MEIS2 was predicted for both naive mESC populations. Little is known about its role in mESC regulation and hence it constitutes a novel candidate gene. The nonspecific conversion PFs for both active (aNSCs) and quiescent (qNSCs) consisted of known NSC‐conversion TFs (eg, ASCL1, SOX2, FOXG1). The specific TFs for aNSCs contained EGR1 known to activate EGFR and accelerate proliferation of NSCs, and E2F1, which is a cell cycle regulator linked to EGFR signaling in NSCs. The conversion TFs for qNSCs included ID2, a BMP effector that has been inferred to regulate qNSCs. Furthermore, CENPF and GJA1 are implicated as markers for late‐aNSCs and qNSCs, respectively. Next, the scRNA‐seq data of organoid and in vivo heptocytes , , were analyzed. The nonspecific PFs included general hepatocyte conversion TFs (eg, HNF4A, FOXA2, GATA3). Among the specific TFs for the in vivo hepatocytes were ZBTB20, KLF6, KLF9, CEBPD, and NR3C1. ZBTB20, KLF9 are important for hepatocyte proliferation, whereas KLF6, CEBPD, KLF9, and NR3C1 regulate hepatic glucose and lipid metabolism, , , suggesting that the derivation of in vivo hepatocytes might require sustained cell proliferation and proper metabolization of glucose and lipids. Known hepatocyte marker genes, such as FGB, CYP2E1, CYP3A4, APOA1, were predicted only for the in vivo hepatocytes but none for the in vitro ones. Finally, TransSynW was applied to in vivo and organoid excitatory and inhibitory neurons. TFs predicted only for the in vivo excitatory and inhibitory neurons contained many common TFs (PEG3, KLF9, HLF, and MLXIPL), suggesting a common maturation mechanism. KLF9 is known to be necessary for late‐phase maturation of neurons. BHLHE40, which was only predicted for the in vivo excitatory neurons, is implicated in the regulation of neuronal excitability. Moreover, a few known markers (CCK, PVALB, CRH) for excitatory/inhibitory neurons were predicted only for the adult samples. It would be of interest to experimentally test if predicted conversion TFs could indeed convert organoid cells into functional ones. Taken together, we demonstrated that TransSynW can be effectively applied for identifying conversion TFs for a wide range of cell types. An example experimental strategy for using TransSynW predicted conversion TFs is shown in Figure 2C.

Processing speed

The processing speed of TransSynW was assessed using text file, Rds file and a sparse matrix saved as Rds file (sparse‐Rds). The time required for the upload of the data was not considered for this analysis. Thus, depending on the users internet connection speed, the overall processing time may vary to a certain degree. Rds files were the most efficient in processing 10 000 cells (6 minutes) (Figure 2D). In addition, up to 40 000 cells were successfully processed with Rds files, whereas only 25 000 cells in the other formats. This is in accordance with the respective file sizes (Table S5). If users wish to use datasets larger than 40 000 cells, we recommend to down‐sample them. Next, we benchmarked the execution time against the target cell population size in 10 000 cells. The processing time peaked at 11 minutes for 3500 cells (Figure 2E). Afterwards, it started decreasing due to the reduced size of the background populations. Our general recommendation to users is to use Rds files for datasets with more than 10 000 cells.

DISCUSSION

We have introduced a scRNA‐seq based web application, TransSynW, for unbiased identification of cell conversion TFs, following the increasing interest from experimental researchers in generating novel functional cell types identified by scRNA‐seq. TransSynW does not require prior biological knowledge, computer programming and computational resources. Moreover, TransSynW identifies potential marker genes for target cell types, which researchers can use for assessing the performance of conversion experiments. Furthermore, prioritization of PFs well recapitulated known conversion TFs in various systems, and predicted novel ones. We foresee that TransSynW will be a valuable tool for the experimental community, particularly for the generation of novel cell populations for stem cell research and regenerative medicine purposes.

MATERIALS AND METHODS

Implementation

TransSynW is written in HTML, JavaScript (frontend), PHP and Bash (backend), and runs on a virtual server hosted by Luxembourg Centre of Systems Biomedicine (LCSB, University of Luxembourg). The frontend allows users to upload all required data, which are then parsed to the backend as different variables. In the backend bash script, the variables are parsed to the TransSynW main R script as different arguments. The output files are compressed into a .zip folder and sent to the user‐specified E‐mail address.

Identification of conversion TFs

The main algorithm is based on the notion that conversion TFs consist of a combination of TFs that are specifically expressed in a target population and TFs that are more broadly expressed in the background population, and that these TFs synergistically interact with each other. The algorithm follows four major steps. Step 1: Identification of candidate TFs. TransSynW first normalizes the data by the total RNA counts. Then TFs whose expression value is 0 across all cells in the target cell population are discarded. Next, it selects top 300 lowest CV (coefficient of variation) TFs as potential candidate TFs, since using more than this number of TFs often resulted in an out‐of‐memory error during the subsequent computation and conversion TFs usually exhibit low expression variation. Step 2: Identification of most specifically expressed TFs. The set of TFs that are specifically expressed in the target population is determined by Jensen‐Shannon Divergence (JSD). JSD is computed for each TF in each cell and the summed JSD value for each TF over all cells is calculated. The top 10 lowest summed‐JSD TFs are selected as the most specifically expressed TFs. Step 3: Identification of most synergistic set of specifically expressed TFs. Next, TransSynW identifies the most synergistic subset of TFs among the most specifically expressed TFs by computing MMI. where , is a subset of , ∣ denotes the cardinality of , and is Shannon's entropies. Negative MMI values imply a synergistic interaction among the TFs. TransSynW first computes MMI of all sets of three TFs among the most specifically expressed TFs. Then a new TF is added to this set and MMI is computed again. If MMI is synergistic, then the next TF is added to the previous set, and so on. This iteration continues until either MMI no longer shows synergy, or when the maximum core size is reached. Here, the maximum core size was set to five. Step 4: Addition of PFs. The specific TF set from step 3 is extended with the nonspecific part, consisting solely of PFs. Every subset of three PFs is added to the specific part. MMI is computed for each set of all TFs and the most synergistic combination is selected as the final conversion TF set. The final conversion TFs are ranked by the expression fold change calculated between the target cell population and starting cell population.

Identification of marker genes

The marker gene set (Table S6) was collected from the following sources; extracellular proteins and membrane receptors, cytoskeletal genes (http://www.informatics.jax.org/), metabolic genes (https://www.vmh.life/#human/all) and CD markers for immune cells (www.abcam.com/CDmarkers). These genes are relatively easily accessible for experimental validation. TransSynW identifies the top 10 candidate marker genes among this compiled set by computing JSD. Literature evidence for predicted markers were collected either manually or from CellMarker (http://biocc.hrbmu.edu.cn/CellMarker/).

PF set

Information on PFs that have previously been reported to be involved in cell conversion protocols was manually collected from literature. The list is available in Table S1.

scRNA‐seq data of starting cell populations

scRNA‐seq data of starting cell types were collected from Cell Ranger, GEO and Array Express databases, log 2 transformed and mean gene expression was calculated and compiled in TransSynW (Table S2).

scRNA‐seq dataset of target cell populations

scRNA‐seq data used in this study were obtained from the following sources. , , , , , , , , , , , For References 43, 48, the reprocessed data were retrieved from PangloaDB, as the cell annotation was more accurate than the original one.

CONFLICT OF INTEREST

The authors declared no potential conflicts of interest.

AUTHOR CONTRIBUTIONS

M.M.R., S.O.: collection and/or assembly of data, data analysis and interpretation, manuscript writing; A.d.S.: conception and design, manuscript writing, final approval of manuscript. Table S1 List of literature evidence for PFs known to be involved in cell conversions. Click here for additional data file. Table S2 List of collected scRNA‐seq data and their platforms for starting cell types. These datasets were built in the TransSynW web application. Click here for additional data file. Table S3 List of literature evidence for predicted cell conversion TFs being used for cell conversion experiments. Click here for additional data file. Table S4 Top 10 predicted marker genes. Predicted genes supported by literature evidence are shown in Table 2. Click here for additional data file. Table S5 scRNA‐seq data file sizes in megabytes used for assessing the processing speed. Click here for additional data file. Table S6 List of potential candidate marker genes. Genes belonging to either extracellular proteins, membrane receptors, cytoskeletal proteins, metabolic genes, or CD markers for immune cells were considered. See Methods for details. Click here for additional data file. Data S1 Supplementary notes—User guide. Click here for additional data file.
  49 in total

1.  Krüppel-like factor KLF9 regulates PPARγ transactivation at the middle stage of adipogenesis.

Authors:  H Pei; Y Yao; Y Yang; K Liao; J-R Wu
Journal:  Cell Death Differ       Date:  2010-08-20       Impact factor: 15.828

2.  Long-Term Expansion of Functional Mouse and Human Hepatocytes as 3D Organoids.

Authors:  Huili Hu; Helmuth Gehart; Benedetta Artegiani; Carmen LÖpez-Iglesias; Florijn Dekkers; Onur Basak; Johan van Es; Susana M Chuva de Sousa Lopes; Harry Begthel; Jeroen Korving; Maaike van den Born; Chenhui Zou; Corrine Quirk; Luis Chiriboga; Charles M Rice; Stephanie Ma; Anne Rios; Peter J Peters; Ype P de Jong; Hans Clevers
Journal:  Cell       Date:  2018-11-29       Impact factor: 41.582

3.  Quiescence Modulates Stem Cell Maintenance and Regenerative Capacity in the Aging Brain.

Authors:  Georgios Kalamakis; Daniel Brüne; Srikanth Ravichandran; Jan Bolz; Wenqiang Fan; Frederik Ziebell; Thomas Stiehl; Francisco Catalá-Martinez; Janina Kupke; Sheng Zhao; Enric Llorens-Bobadilla; Katharina Bauer; Stefanie Limpert; Birgit Berger; Urs Christen; Peter Schmezer; Jan Philipp Mallm; Benedikt Berninger; Simon Anders; Antonio Del Sol; Anna Marciniak-Czochra; Ana Martin-Villalba
Journal:  Cell       Date:  2019-02-28       Impact factor: 41.582

4.  PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data.

Authors:  Oscar Franzén; Li-Ming Gan; Johan L M Björkegren
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

5.  Krüppel-like factor 9 is necessary for late-phase neuronal maturation in the developing dentate gyrus and during adult hippocampal neurogenesis.

Authors:  Kimberly N Scobie; Benjamin J Hall; Scott A Wilke; Kristen C Klemenhagen; Yoshiaki Fujii-Kuriyama; Anirvan Ghosh; René Hen; Amar Sahay
Journal:  J Neurosci       Date:  2009-08-05       Impact factor: 6.167

6.  Organoid single-cell genomic atlas uncovers human-specific features of brain development.

Authors:  Sabina Kanton; Michael James Boyle; Zhisong He; Malgorzata Santel; Anne Weigert; Fátima Sanchís-Calleja; Patricia Guijarro; Leila Sidow; Jonas Simon Fleck; Dingding Han; Zhengzong Qian; Michael Heide; Wieland B Huttner; Philipp Khaitovich; Svante Pääbo; Barbara Treutlein; J Gray Camp
Journal:  Nature       Date:  2019-10-16       Impact factor: 49.962

7.  Repression of the DNA-binding inhibitor Id3 by Blimp-1 limits the formation of memory CD8+ T cells.

Authors:  Yun Ji; Zoltan Pos; Mahadev Rao; Christopher A Klebanoff; Zhiya Yu; Madhusudhanan Sukumar; Robert N Reger; Douglas C Palmer; Zachary A Borman; Pawel Muranski; Ena Wang; David S Schrump; Francesco M Marincola; Nicholas P Restifo; Luca Gattinoni
Journal:  Nat Immunol       Date:  2011-11-06       Impact factor: 25.606

8.  Deconstructing transcriptional heterogeneity in pluripotent stem cells.

Authors:  Roshan M Kumar; Patrick Cahan; Alex K Shalek; Rahul Satija; AJay DaleyKeyser; Hu Li; Jin Zhang; Keith Pardee; David Gennert; John J Trombetta; Thomas C Ferrante; Aviv Regev; George Q Daley; James J Collins
Journal:  Nature       Date:  2014-12-04       Impact factor: 49.962

9.  Murine single-cell RNA-seq reveals cell-identity- and tissue-specific trajectories of aging.

Authors:  Lolita Penland; Nimrod D Rubinstein; Jacob C Kimmel; David G Hendrickson; David R Kelley; Adam Z Rosenthal
Journal:  Genome Res       Date:  2019-11-21       Impact factor: 9.043

10.  Transcription factor-induced lineage programming of noradrenaline and motor neurons from embryonic stem cells.

Authors:  Jamie Mong; Lia Panman; Zhanna Alekseenko; Nigel Kee; Lawrence W Stanton; Johan Ericson; Thomas Perlmann
Journal:  Stem Cells       Date:  2014-03       Impact factor: 6.277

View more
  2 in total

1.  Probabilistic boolean networks predict transcription factor targets to induce transdifferentiation.

Authors:  Bahar Tercan; Boris Aguilar; Sui Huang; Edward R Dougherty; Ilya Shmulevich
Journal:  iScience       Date:  2022-08-17

2.  TransSynW: A single-cell RNA-sequencing based web application to guide cell conversion experiments.

Authors:  Mariana Messias Ribeiro; Satoshi Okawa; Antonio Del Sol
Journal:  Stem Cells Transl Med       Date:  2020-10-08       Impact factor: 6.940

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.