Zhou Jiawei1, Mu Min2, Xing Yingru3, Zhang Xin1, Li Danting1, Liu Yafeng1, Xie Jun3, Hu Wangfa3, Zhang Lijun1, Wu Jing1,2, Hu Dong1,2.
Abstract
BACKGROUND: The development of human tumors is associated with the abnormal expression of various functional genes, and a massive tumor-based database needs to be deeply mined. Based on a multigene prediction model, access to urgent prognosis of patients has become possible.
Keywords: bioinformatics; data mining; lung adenocarcinoma; predictor; prognosis
Year: 2020 PMID: 33195408 PMCID: PMC7653064 DOI: 10.3389/fmolb.2020.561456
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
FIGURE 1. Identification of differentially expressed genes (DEGs) in lung adenocarcinoma. (A) Venn diagram of DEGs in GSE10072, GSE43458, and GSE32863. The DEGs with P-value <0.05 and a fold change >1 were selected. (B) Expression of 244 DEGs in tumor tissues.
FIGURE 2. Screening of genes related to the prognosis of lung adenocarcinoma. (A) Venn diagram of prognosis-related genes. (B) The expression heat maps of six selected prognostic genes are shown, with red indicating high expression and green indicating low expression.
FIGURE 3. Univariate and multivariate prognostic analyses of genes related to the prognosis of lung adenocarcinoma. (A) Dendrogram of six genes by univariate prognostic analysis. (B) The calculation formula of the comprehensive risk score of the six genes. (C) Results of the risk score by multivariate prognostic analysis.
FIGURE 4. Evaluation of the predictive effect of the comprehensive risk score of the six genes. (A–F) The receiver operating characteristic (ROC) curve of the individual risk scores of the six genes. (G–H) Kaplan–Meier (KM) survival curves of lung adenocarcinoma (LUAD) patients with low or high individual risk score of the six genes. (N) KM survival curve of LUAD patients with low or high comprehensive risk score of the six genes. (M) The ROC curve of the comprehensive risk score of the six genes. (O) Risk score of patients, with red indicating high risk and green indicating low risk. (P) Expression heat map of the six genes in the high-risk group and the low-risk group (green for low expression and red for high expression). (Q) Distribution of the survival status of patients in the high-risk group and the low-risk group (red for death and green for survival).
FIGURE 5. Evaluation of the six-gene model as an independent predictor. Individual and comprehensive risk scores were involved in a multivariate analysis with patient characteristics. (A) Univariate analysis. (B–H) Multivariate analysis. (I–O) Receiver operating characteristic curve.
FIGURE 6. Correlation analysis of the individual expression and the comprehensive risk values of six genes with clinical features. (A) Results of the correlation analysis of the expression alone of the six genes and of the comprehensive risk value of the six genes with clinical features. Data are shown as R value (p value). (B–J) Box plot of the expression and the comprehensive risk score of the six genes of patients grouped with clinical characteristics.
FIGURE 7. The establishment and the evaluation of the prognostic model combining the comprehensive risk score of six genes with clinical features. (A) Nomogram of the prognostic model. (B,C) Calibration tested the accuracy of the constructed model to predict the 3- and 5-year survival status, respectively. (D,E) Receiver operating characteristic tested the accuracy of the constructed model to predict the 3- and 5-year survival status, respectively.
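The DEG selection described in the Figure 1 caption (P-value <0.05 and fold change >1 in each of GSE10072, GSE43458, and GSE32863, followed by the Venn overlap) can be sketched as below. This is a minimal illustration, not the authors' pipeline. It assumes that "fold change >1" means an absolute log2 fold change above 1, which the record does not state. The function names, gene names, and all numeric values are hypothetical placeholders, not results from the study.

```python
from typing import Dict, Set, Tuple

# Hypothetical per-dataset statistics: gene -> (log2 fold change, P-value).
GeneStats = Dict[str, Tuple[float, float]]


def select_degs(stats: GeneStats,
                p_cutoff: float = 0.05,
                lfc_cutoff: float = 1.0) -> Set[str]:
    """Keep genes with P < p_cutoff and |log2FC| > lfc_cutoff.

    Assumption: the caption's "fold change >1" is read as |log2FC| > 1.
    """
    return {
        gene
        for gene, (log2_fc, p_value) in stats.items()
        if p_value < p_cutoff and abs(log2_fc) > lfc_cutoff
    }


def shared_degs(datasets: Dict[str, GeneStats]) -> Set[str]:
    """Return the genes selected in every dataset (centre of the Venn diagram)."""
    selected = [select_degs(stats) for stats in datasets.values()]
    return set.intersection(*selected) if selected else set()


# Toy input with made-up values; only the dataset accessions come from the caption.
toy = {
    "GSE10072": {"GENE_A": (2.1, 0.001), "GENE_B": (-1.6, 0.01), "GENE_C": (0.4, 0.2)},
    "GSE43458": {"GENE_A": (1.8, 0.003), "GENE_B": (-1.2, 0.04), "GENE_C": (1.5, 0.01)},
    "GSE32863": {"GENE_A": (1.3, 0.02), "GENE_B": (-0.7, 0.03), "GENE_C": (1.1, 0.04)},
}

common = shared_degs(toy)
print(sorted(common))
```

In this toy input only GENE_A passes both cutoffs in all three datasets, so it is the only gene in the overlap. A real analysis would normally use multiple-testing-adjusted P-values, which the caption does not specify.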