| Literature DB >> 30410595 |
Yaoyi Xiong1, Lushun Yuan1, Liang Chen1, Yuan Zhu2,3,4, Shanshan Zhang2,3,4, Xuefeng Liu5, Yu Xiao1,2,3,4, Xinghuan Wang1.
Abstract
Although it is well known that smoking is one of pathogenesis of clear cell renal cell carcinoma (ccRCC), the underlying molecular mechanism is still unclear. In our study, the microarray dataset GSE46699 is analyzed by weighted gene co-expression network analysis (WGCNA). Then we identify 15 co-expressed gene modules in which the lightcyan module (R2 = 0.30) is the most significant. Combined with the protein-protein interaction (PPI) network and WGCNA, two hub genes are identified. Meanwhile, linear regression analyses indicate that TOP2A has a higher connection with smoking in ccRCC, survival analysis proved that overexpression of TOP2A in ccRCC could lead to shorter survival time. Furthermore, bioinformatical analyses based on GSE46699 and GSE2109 as well as qRT-PCR experiment show similar results that TOP2A is significantly up-regulated in smoking ccRCC compared to non-smoking ccRCC samples. In addition, Functional analysis, pathway enrichment analysis and gene set enrichment analysis (GSEA) indicate that high expression of TOP2A is related to cell cycle and p53 signaling pathway in ccRCC samples. Moreover, in vitro experiments revealed that TOP2A induced cell cycle arrest at G2 phase and proliferation inhibition via p53 phosphorylation. Taken together, by using WGCNA, we have identified a novel biomarker named TOP2A, which could affect the development of smoking-related ccRCC by regulating cell cycle and p53 signaling pathway.Entities:
Keywords: TOP2A; biomarker; clear cell renal cell carcinoma (ccRCC); smoking; weighted gene co-expression network analysis (WGCNA)
Year: 2018 PMID: 30410595 PMCID: PMC6218786 DOI: 10.7150/jca.25900
Source DB: PubMed Journal: J Cancer ISSN: 1837-9664 Impact factor: 4.207
Figure 1Description of the study. (A) Flow chart of this study. (B) Sample dendrogram and cigarette smoking heatmap (GSE46699).
The description of GSE46699, GSE2109 and GSE10072.
| GEO Dataset | Platform | Tumor type | Total samples | Non-smoking samples | Smoking samples | Normal samples | Tumor samples |
|---|---|---|---|---|---|---|---|
| GSE46699 | Affymetrix Human Genome U133 Plus 2.0 Array | ccRCC | 130 | 84 | 46 | 63 | 67 |
| GSE2109 | Affymetrix Human Genome U133 Plus 2.0 Array | ccRCC | 283 | 140 | 143 | - | 283 |
| GSE10072 | Affymetrix Human Genome U133A Array | LUAD | 107 | 31 | 76 | 49 | 58 |
Figure 2Determination of soft-thresholding power and analysis of hub genes as well as protein-protein interaction network (PPI). (A) Analysis of the scale-free fit index for various soft-thresholding powers (β). (B) Analysis of the mean connectivity for various soft-thresholding powers. (C) Histogram of connectivity distribution when β = 11. (D) Checking the scale-free topology when β = 11. (E) Scatter plot of module eigengenes in lightcyan module. (F) PPI network of genes in the lightcyan module.
Figure 3Identify modules associated with the smoking of ccRCC. (A) Dendrogram of all differentially expressed genes clustered. (B) Distribution of average gene significance in the modules associated with smoking of ccRCC. (C) The correlation between module eigengenes and the smoking group.
Figure 4Validate the relationship between TOP2A is significantly overexpressed in the ccRCC according to GEPIA database. (B) Linear regression analysis between the expressions of TOP2A and smoking group. (C) The expression of TOP2A is correlated with smoking of ccRCC in GSE46699 (N represents non-smoking patients, S represents smoking patients). (D) TOP2A expression is correlated with smoking of ccRCC in GSE2109. (E) TOP2A expression is positive correlation with smoking status in lung adenocarcinoma in GSE10072 (NNS: Normal never smoked, NFS: Normal former smoker, NCS: Normal current smoker, TNS: Tumor never smoked, TFS: Tumor former smoker, TCS: Tumor current smoker). (F) qRT-PCR indicates the expression of TOP2A is up-regulated in smoking ccRCC tissues. (G-H) Kaplan-Meier survival curve downloaded from GEPIA database demonstrate that up-regulation of TOP2A have a significantly shorter overall survival time and disease-free survival time. (I) Biological processes and (J) KEGG pathway enrichment analysis of TOP2A.
The P value of the GSE10072 in Figure 4E.
| Smoking status | Mean±SD | Smoking status | Mean±SD | ||
|---|---|---|---|---|---|
| NNS | 5.805±0.469 | VS | NFS | 6.088±0.555 | 0.116475331 |
| NNS | 5.805±0.469 | VS | NCS | 6.251±0.631 | 0.00052866 |
| NFS | 6.088±0.555 | VS | NCS | 6.251±0.631 | 0.034655055 |
| NNS | 5.805±0.469 | VS | NFS+NCS | 6.642±0.598 | 0.002130744 |
| TNS | 7.658±1.028 | VS | TFS | 8.451±0.913 | 0.076959943 |
| TNS | 7.658±1.028 | VS | TCS | 9.123±0.851 | 0.001676115 |
| TFS | 8.451±0.913 | VS | TCS | 9.123±0.851 | 0.148011761 |
| TNS | 7.658±1.028 | VS | TNS+TCS | 8.835±0.939 | 0.003132606 |
| NNS+NFS+ NCS | 6.054±0.585 | VS | TNS+TFS+ TCS | 8.510±1.098 | 1.95169E-25 |
Figure 5Gene set enrichment analysis (GSEA). “Viral myocarditis”, “Antigen processing and presentation”, “Leishmania infection”, “Cell cycle”, “Autoimmune Thyoid disease”, “P53 signaling pathway” were enriched in ccRCC samples with TOP2A highly expressed.
Figure 6Downregulation of The expression of TOP2A in transcriptional level (upper picture) and translation level (lower picture) by treating with siTOP2A-1, siTOP2A-2 and siTOP2A-3 in 786-O cell line. The proliferative capacity was measured by (B) MTT assay and (C) colony formation assays in 786-O cell line. (D) Flow cytometry analysis of TOP2A deficiency group and NC group in 786-O cell line. (E) Cell cycle related proteins such as Cyclin A1/2 and CDK1/2 were significantly decreased in the 786-O cell line in TOP2A knockdown group. (F) TOP2A deficiency strongly induced phosphorylated p53 in 786-O cell line.