| Literature DB >> 33941680 |
Abhijeet P Deshmukh1, Suhas V Vasaikar1, Katarzyna Tomczak2, Shubham Tripathi3, Petra den Hollander1, Emre Arslan2, Priyanka Chakraborty4, Rama Soundararajan1, Mohit Kumar Jolly4, Kunal Rai5, Herbert Levine6,7, Sendurai A Mani8.
Abstract
The epithelial-to-mesenchymal transition (EMT) plays a critical role during normal development and in cancer progression. EMT is induced by various signaling pathways, including TGF-β, BMP, Wnt-β-catenin, NOTCH, Shh, and receptor tyrosine kinases. In this study, we performed single-cell RNA sequencing on MCF10A cells undergoing EMT by TGF-β1 stimulation. Our comprehensive analysis revealed that cells progress through EMT at different paces. Using pseudotime clustering reconstruction of gene-expression profiles during EMT, we found sequential and parallel activation of EMT signaling pathways. We also observed various transitional cellular states during EMT. We identified regulatory signaling nodes that drive EMT with the expression of important microRNAs and transcription factors. Using a random circuit perturbation methodology, we demonstrate that the NOTCH signaling pathway acts as a key driver of TGF-β-induced EMT. Furthermore, we demonstrate that the gene signatures of pseudotime clusters corresponding to the intermediate hybrid EMT state are associated with poor patient outcome. Overall, this study provides insight into context-specific drivers of cancer progression and highlights the complexities of the EMT process.Entities:
Keywords: EMT; NOTCH; RACIPE; scRNA-seq; signaling cascade
Mesh:
Substances:
Year: 2021 PMID: 33941680 PMCID: PMC8126782 DOI: 10.1073/pnas.2102050118
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.During TGF-β1–induced EMT multiple signaling pathways are activated simultaneously. (A) Schematic representation of model used to EMT in MCF10A cells. (B) Flow cytometry analyses of MCF10A cells treated over 8 d with TGF-β. (C) scRNA-seq schematic. (D) Force-directed layout embedding (FLE) of the trajectory of TGF-β1–induced EMT in MCF10A. (E) FLE trajectories of epithelial markers (CDH1, EpCAM, and S100A9), mesenchymal markers (CDH2, FN1, and FAP), and EMT transcription factors (TGFB1, SNAI2, and S100A6) across the time course of TGF-β treatment. (F) Rank correlation of Hallmark signaling pathways. The x axis shows the TGF-β1 time course treatment of MCF10A cells and y axis shows the NES. (G) Violin plot of the hallmark EMT and TGF-β signaling pathways NES for each time point during the TGF-β1 treatment. (H) Kolmogorov–Smirnov EMT scores during TGF-β1 time course treatment of MCF10A cells. ANOVA (***P < 0.001).
Fig. 2.Reconstruction of EMT using pseudotime clustering reveals key regulators of epithelial-mesenchymal states. (A) t-SNE of cells treated with TGF-β1 (Left) and pseudotime (Right) clusters in principal component analysis space grouped using modularity optimization technique. (B) The fraction of cells in each cluster and time point of origin shown using a Sankey network. The red dots indicate that at least 10% of cells in a cluster mapped to the corresponding time point. (C) Hierarchical network of cells in each cluster with different TGF-β1 treatment time points and ranked in pseudotime. (D) Cluster-specific markers identified using Wilcoxon test (>1.4-fold change, adjusted P < 0.05). Among identified significantly up-regulated genes in each cluster the transcription factors critical for stemness are highlighted. (E) The inferred enrichment for miRNAs based on the miRNA target expression. Before enrichment the average gene expression for each gene in a cluster was calculated. The average expression was scaled across the clusters and used for enrichment analysis. (F) Inferred miRNA enrichment score and relative fraction of target in each cluster. (G) Inferred EMT regulatory miRNA network and putative regulators and model depicting miR217- and miR200a/200b/200c-3p-dependent EMT regulatory checkpoint.
Fig. 3.scRNA-seq analysis identifies parallel and sequential signaling pathways involved in TGF-β1–driven EMT. (A) Heatmap of enrichment for indicated EMT-associated pathways in each cluster. (B) NES of significantly altered signaling pathways across pseudotime clusters. The red lines are a fit determined by LOESS. (C) Heatmap showing the distributions of mRNA expression of genes that were induced during TGF-β1–induced EMT and maintained increased expression during EMT progression. Genes associated with each EMT-associated pathway and their fraction in each cluster compared to maximum average expression is shown in alignment with EMT-associated pathway activation.
Fig. 4.Systematic inhibition of EMT signaling reveals key drivers of EMT. (A) Network representation of the transition matrix calculated using multiple regression with lasso regularization mapping the population state at pseudotime t to pseudotime point t + 1. (B) The RACIPE method identifies the steady-state behaviors a network can exhibit by simulating network behavior for an ensemble of parameters. For the constructed network, but not for a set of randomized controls, the variance is dominated by the first principal component (PC1, Left) and there exists a large percentage of parameter values for which the network exhibits more than one stable phenotypic state (Right). (C) Activity of each node was suppressed one-by-one and the change in the distribution of the first principal component was plotted. (D) Effect of suppressing the activity of a given pathway node (shown along the rows) on the activity of the other key network pathways (shown along the columns).
Fig. 5.Gene signatures of pseudotime clusters closer to the mesenchymal state are associated with worse survival. (A) Schematic of identification of differentially expressed (DE) genes from each cluster and signature enrichment in cancer types based on normalized RNA expression data from TCGA. Cancer cohorts (∼11,000 patients) analyzed were adrenocortical carcinoma (ACC), bladder urothelial carcinoma (BLCA), breast invasive carcinoma (BRCA), cervical and endocervical cancers (CESC), cholangiocarcinoma (CHOL), colon cancer (COAD), lymphoid neoplasm diffuse large B cell lymphoma (DLBC), esophageal carcinoma (ESCA), glioblastoma multiforme (GBM), head and neck squamous cell carcinoma (HNSC), kidney chromophobe (KICH), kidney renal clear cell carcinoma (KIRC), kidney renal papillary cell carcinoma (KIRP), acute myeloid leukemia (LAML), brain lower grade glioma (LGG), liver hepatocellular carcinoma (LIHC), lung adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC), mesothelioma (MESO), ovarian serous cystadenocarcinoma (OV), pancreatic adenocarcinoma (PAAD), pheochromocytoma and paraganglioma (PCPG), prostate adenocarcinoma (PRAD), rectal adenocarcinoma (READ), sarcoma (SARC), skin cutaneous melanoma (SKCM), stomach adenocarcinoma (STAD), testicular germ cell tumors (TGCT), thyroid carcinoma (THCA), thymoma (THYM), uterine corpus endometrial carcinoma (UCEC), uterine carcinosarcoma (UCS), and uveal melanoma (UVM). (B) Hazard ratios for PFI for TCGA breast cancer cohort (n = 1,092) for patients with each cluster-specific signature. The dashed line indicates hazard ratio 1. (C) Hazard ratios for patients with indicated cancers for each cluster-specific signature. (D) Univariate cox analysis-based hazard ratio observed using PFI data for each cluster in each cancer cohort. (E) Log fold-changes for the cluster C0, C10, and C13 signature genes.