| Literature DB >> 34249701 |
Yang Cheng1, Kezuo Hou2,3,4, Yizhe Wang1, Yang Chen1, Xueying Zheng2,3,4, Jianfei Qi5, Bowen Yang2,3,4, Shiying Tang2,3,4, Xu Han2,3,4, Dongyao Shi1, Ximing Wang1, Yunpeng Liu2,3,4, Xuejun Hu1, Xiaofang Che2,3,4.
Abstract
BACKGROUND: Lung adenocarcinoma (LUAD) is the most common pathological type of lung cancer, with high incidence and mortality. To improve the curative effect and prolong the survival of patients, it is necessary to find new biomarkers to accurately predict the prognosis of patients and explore new strategy to treat high-risk LUAD.Entities:
Keywords: drug repositioning; gliclazide; lung adenocarcinoma; prognosis; signature
Year: 2021 PMID: 34249701 PMCID: PMC8264429 DOI: 10.3389/fonc.2021.665276
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Study flow diagram. OS, overall survival; DEGs, differentially expressed genes.
Characteristics of the public microarray datasets used in this study.
| Study | Species/array platform | Samples | Type | Function |
|---|---|---|---|---|
| TCGA-LUAD | Human Illumina HiSeq 2000 | 332 Cancer | LUAD | Training set |
| GSE50081 | (GPL570) | 128 Cancer | LUAD | Validation set |
| GSE18842 | (GPL570) | 45 Normal and 46 Cancer | NSCLC | For DEGs |
| GSE19188 | (GPL570) | 65 Normal and 91 Cancer | NSCLC | For DEGs |
| GSE40791 | (GPL570) | 100 Normal and 94 Cancer | NSCLC | For DEGs |
Figure 2Construction and assessment of the Five-Gene Prognostic Signature. (A) Venn diagram depicting potential prognostic gene of the inter section between DEGs and Survival-Related Genes; (B) A coefficient profile plot was generated against the log (lambda) sequence. Selection of the optimal parameter (lambda) in the LASSO model. (C) LASSO coefficient profiles of the nine candidates in TCGA training set. (D) Patients’ survival status distribution by the risk score; patient survival status distribution of the low-risk group and the high-risk group; (E) Kaplan–Meier curves for the low- and high-RS groups; (F, G) the receiver operating characteristic (ROC) curve validation of prognostic value by the risk score of 1 and 3 years.
Multivariate Cox regression analysis of the 5-gene signature.
| ID | Coef | HR | HR.95L | HR.95H | P-value |
|---|---|---|---|---|---|
| KIF20A | 0.301291283 | 1.351602983 | 1.103810884 | 1.65502139 | 0.003547525 |
| LIFR | −0.182264632 | 0.833380772 | 0.672709557 | 1.032427002 | 0.095329705 |
| RGS13 | −1.100168875 | 0.332814875 | 0.117059758 | 0.946232446 | 0.039052759 |
| KLF4 | 0.240230298 | 1.27154195 | 1.070939257 | 1.509720483 | 0.006100447 |
| KRT6A | 0.085854478 | 1.089647749 | 1.012552387 | 1.172613123 | 0.0218395 |
HR, hazard ratio (HR >1, risk factor; HR <1, Protective factors); 95% CI, 95%confidence interval.
Figure 4Prognostic analysis based on of the 5 gene-RS model in GSE50081 validation cohort. (A) Patients’ survival status distribution by the risk score; patient survival status distribution of the low-RS group and the high-RS group; (B) Kaplan–Meier curves for the low- and high-RS groups; (C, D) the receiver operating characteristic (ROC) curve validation of prognostic value by the risk score of 1 and 3 years; (E) The heatmap showed the expression levels of the 5-genes and the distribution of clinicopathological features in the low- and high-RS groups; (F) The univariate and multivariate Cox analysis for the independent 5-gene signature.
Figure 3The relationship between the 5-gene based RS and clinical information. (A) The heatmap showed the expression levels of the 5-genes in the low- and high-RS groups; The distribution of clinicopathological features was compared in low- and high-RS groups. (B) The univariate and multivariate Cox analysis for the independent 5-gene signature; (C) a nomogram was used to predict the overall survival at 1 year and 3 years with RS and clinical information.
Figure 5GSEA analysis of the differentially expressed genes between high- and low-RS groups. (A) Cell cycle; (B) spliceosome; (C) mismatch repair; (D) DNA replication.
Eight candidate drugs of connectivity map analysis.
| CMAP name | Mean | P | Percent non-null |
|---|---|---|---|
| medrysone | −0.68 | 0.00064 | 100 |
| 0175029-0000 | −0.649 | 0.00073 | 100 |
| ginkgolide A | −0.73 | 0.00105 | 100 |
| repaglinide | −0.722 | 0.00275 | 100 |
| trioxysalen | −0.663 | 0.00336 | 100 |
| gliclazide | −0.66 | 0.00432 | 100 |
| 0173570-0000 | −0.734 | 0.00465 | 100 |
| eucatropine | −0.621 | 0.00528 | 100 |
| 0297417-0002B | −0.658 | 0.00863 | 100 |
Figure 6The screening process and proliferation experiment verification of gliclazide. (A) The methodology used in the compilation of survival-differentially expressed genes and the CMAP algorithm. (B–E) A549 and H1299 cells were treated with gliclazide for 24, 48 and 72 h and the cell viability was measured by MTT Assay (B, C), or colony-forming assay (D, E). One-way ANOVA was used for statistical analyses. Data are plotted as mean ± SD. P values are labeled in the figures. *P < 0.05; **P < 0.01; ***P < 0.001.
Figure 7Molecular docking of gliclazide targets. The results of CCNB2 homology modeling. Ramachandran plot analysis showed that existence of 98.4% of all residues in the allowed regions for CCNB2 (A), the structure of CCNB2 was basically the same as that of template protein, and the identity of their amino acid sequence was 64.5%, highlighting the accuracy of the predicted structures (B). (C–F) Molecular docking analyses for Gliclazide with target proteins.
Binding energy for targets with their drugs.
| Protein | Docking score (kcal/mol) |
|---|---|
| Gliclazide | |
| CCNB1 | −8.9 |
| CCNB2 | −8.6 |
| CDK1 | −8.3 |
| AURKA | −6.9 |
Figure 8Experimental verification of Gliclazide on cell cycle and apoptosis. (A) Effect of the gliclazide on cell cycle distribution of A549 and H1299 cells exposed to gliclazide for 48 h. Histograms of cellular DNA content obtained by flow cytometry have been represented. (B, C) Protein expression of CyclinD1, cyclin B1, AURKA and PARP were quantified via western blotting, with GAPDH as the loading control. One-way ANOVA was used for statistical analyses. Data are plotted as mean ± SD. P values are labeled in the figures. *P < 0.05; **P < 0.01; ***P < 0.001.