| Literature DB >> 33228590 |
Zizhen Zhang1,2, Sheng Zheng1,2, Yifeng Lin1,2, Jiawei Sun1,2, Ning Ding1,2, Jingyu Chen1,2, Jing Zhong1,2, Liuhong Shi3,4, Meng Xue5,6.
Abstract
BACKGROUND: The epithelial-mesenchymal transition (EMT) plays a pivotal role in various physiological processes, such as embryonic development, tissue morphogenesis, and wound healing. EMT also plays an important role in cancer invasion, metastasis, and chemoresistance. Additionally, EMT is partially responsible for chemoresistance in colorectal cancer (CRC). The aim of this research is to develop an EMT-based prognostic signature in CRC.Entities:
Keywords: Colorectal cancer; Epithelial-mesenchymal transition; GEO; Signature; TCGA
Mesh:
Year: 2020 PMID: 33228590 PMCID: PMC7686680 DOI: 10.1186/s12885-020-07615-5
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1Differentially expressed ERGs between colorectal cancer (CRC) and normal colorectal tissues. a Heatmap for differentially expressed ERGs. b Volcano plot of differentially expressed mRNAs between CRC and normal tissues. The heatmap was generated using version 3.6.1 of R software
Fig. 2GO, KEGG analysis, and protein-protein interaction (PPI) network of ERGs. a GO cluster. The inner dendrogram indicates the hierarchical clustering of the gene expression profiles; the outer circle represents the log2FC of each ERG, with the color corresponding to the gene level; and the outermost circle represents the GO BP terms assigned to the gene. b The 10 most significantly enriched CC and MF terms. c Volcano plot of EMT gene-associated pathways. d PPI network of all differentially expressed ERGs
Fig. 3Establishment of prognostic gene signature by LASSO regression analysis. a Solid vertical lines represent partial likelihood deviance ± standard error (SE). The dotted vertical lines are drawn at the optimal values by minimum criteria and 1-SE criteria. Herein, a value λ = 0.023 with log (λ) = − 3.78 was chosen by 10-fold cross-validation via minimum criteria. b Selection of the optimal parameter (lambda) in the LASSO model for training cohort. c Gene-gene interaction network among selected genes by LASSO regression analysis in the GeneMANIA dataset. d Spearman’s correlation analysis of the selected genes by LASSO regression analysis
Fig. 4Forrest plot of the univariate and multivariate Cox regression analysis. a, c Univariate Cox proportion hazard regression for OS of CRC in training and validation cohorts. b, d Multivariable Cox proportion hazard regression for OS of CRC in training and validation cohorts
Fig. 5Development of risk score based on the 11-EMT-related gene signature of patients with CRC in TCGA and GEO. a, b The hierarchical clustering analysis of eleven genes with the increase of the risk score. c, d Kaplan–Meier analysis of the prognostic model in TCGA or GEO datasets. e, f Time-dependent ROC analysis showing the optimal AUC of the gene signature in the two cohorts
Fig. 6Ten representative enriched KEGG pathways by GESA. Each group contains five KEGG pathways. GESA, gene set enrichment analysis; KEGG, Kyoto Encyclopedia of Genes and Genomes
Fig. 7Construction of a nomogram based on the 11-EMT-related gene signature. a A nomogram based on the signature and clinical information. b Time-dependent receiver operating characteristic (ROC) curve for predicting overall survival (OS) of the nomogram. c, d Calibration plot evaluating the predictive accuracy of the nomogram [at 3-year survival (c) at 5-year survival (d)]. (e, f) Decision curve analysis evaluating the clinical utility of the nomogram [at 3-year survival (e) at 5-year survival (f)]