| Literature DB >> 33968720 |
Weicheng Duan1, Kang Wang1, Yijie Duan1, Xiuyi Chen2, Xufeng Chu1, Ping Hu2, Bo Xiong1.
Abstract
Malignant pleural mesothelioma (MPM) is a highly aggressive cancer with short survival time. Unbalanced competing endogenous RNAs (ceRNAs) have been shown to participate in the tumor pathogenesis and served as biomarkers for the clinical prognosis. However, the comprehensive analyses of the ceRNA network in the prognosis of MPM are still rarely reported. In this study, we obtained the transcriptome data of the MPM and the normal samples from TCGA, EGA, and GEO databases and identified the differentially expressed (DE) mRNAs, lncRNAs, and miRNAs. The functions of the prognostic genes and the overlapped DEmRNAs were further annotated by the multiple enrichment analyses. Then, the targeting relationships among lncRNA-miRNA and miRNA-mRNA were predicted and calculated, and a prognostic ceRNA regulatory network was established. We included the prognostic 73 mRNAs and 13 miRNAs and 26 lncRNAs into the ceRNA network. Moreover, 33 mRNAs, three miRNAs, and seven lncRNAs were finally associated with prognosis, and a model including seven mRNAs, two lincRNAs, and some clinical factors was finally established and validated by two independent cohorts, where CDK6 and SGMS1-AS1 were significant to be independent prognostic factors. In addition, the identified co-expressed modules associated with the prognosis were overrepresented in the ceRNA network. Multiple enrichment analyses showed the important roles of the extracellular matrix components and cell division dysfunction in the invasion of MPM potentially. In summary, the prognostic ceRNA network of MPM was established and analyzed for the first time and these findings shed light on the function of ceRNAs and revealed the potential prognostic and therapeutic biomarkers of MPM.Entities:
Keywords: biomarker; ceRNA; lncRNA; mesothelioma; microenvironment; overall survival
Year: 2021 PMID: 33968720 PMCID: PMC8104912 DOI: 10.3389/fonc.2021.615234
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
The characteristics of the included MPM patients.
| Sample type | MESO | 80 | 30 | 211 | 128 |
| Age | Median | 64.5 | 62.5 | 65.4 | 64.15 |
| Range [years] | 28–81 | 28–81 | 18.8–86 | 27.3–86 | |
| Sex | Male | 65 | 27 | 176 | 108 |
| Female | 15 | 3 | 35 | 20 | |
| Vital status | Alive | 8 | 0 | 48 | 0 |
| Dead | 72 | 30 | 163 | 128 | |
| Survival time | Median | 15.06 | 8.32 | 12.6 | 7.8 |
| Range [months] | 0.65–91.73 | 0.65–91.73 | 0.24–132.6 | 0.24–132.6 | |
Figure 1Differential expression gene analyses. (A,B) DEG heatmap of the RNA-Seq and the miRNA-Seq. (C–E) Screening of the DEmRNA, DElncRNA, and DEmiRNA. Black points represent the insignificant genes, red points represent the upregulated genes, and green points represent the downregulated genes. FC, fold change. Threshold criteria: FC > 2 and FDR < 0.05.
Figure 2GO, KEGG pathway, and GSEA enrichment analyses. (A–C) Top 10 enriched terms in GO-BP, GO-CC, and GO-MF. (D) The significant terms in the KEGG pathway. (E,F) TOP 5 enriched terms of GSEA in the c5.all.v6.2.entrez.gmt gene sets and the c2.all.v6.2.entrez.gmt gene sets. P-value adjusted by FDR < 0.05.
Figure 3Compared the DEmRNAs obtained from the RNA-seq data and the microarray data. (A) The overlapped upregulated genes. (B) The overlapped downregulated genes. (C) The enrichment analyses of the overlapped upregulated genes. (D) The enrichment analyses of the overlapped downregulated genes.
Figure 4The construction of the ceRNA network among the 26 lncRNAs, 13 miRNAs, and 73 mRNAs. (A) The ceRNA network among lncRNA–miRNA–mRNA. Ellipse represents mRNA. Rectangle represents lncRNA. Diamond represents miRNA. Red represents upregulation, and blue represents downregulation. (B–D) The Pearson correlation analyses among the miRNA–mRNA, 26 lncRNAs, and 73 mRNAs.
The concrete interactions among the lncRNAs, miRNAs, and mRNAs in the ceRNA network.
| hsa-miR-1 | LINC-PINT, HOTAIR, AL035661.1, SMIM25, AC087741.1 | NOTCH2, FN1, TM4SF1, SMIM14, SLC7A11, SH3PXD2B, CDK6, SNAI2, ADAM12, WEE1, BDNF, CORO1C, GPR137C, NOTCH3, AXL |
| hsa-miR-129 | AC087741.1, XIST | CDK6 |
| hsa-miR-141 | AL359852.1, LINC01176, RUNDC3A-AS1, XIST | TGFB2, CDC25A, FOXL2, EPHA7, PAPPA, ADRB1, KIAA1549L, FRMD6, IGF1R, PHLPP2, LHX1 |
| hsa-miR-196b | LOXL1-AS1, XIST | HAND1, EPHA7, HOXA9, CPEB3, PPFIBP1, HMGA2, IGF2BP1, SALL3 |
| hsa-miR-200c | MSC-AS1, SGMS1-AS1, LINC02568, LINC02128, AC022150.4, XIST | FN1, DCBLD2, DNAJB9, SH3PXD2A, NCAM1, AVPR1A, HNF1B, PMAIP1, DNMT3B |
| hsa-miR-302a | TENM3-AS1, LINC00689, MSC-AS1, LINC00484, SGMS1-AS1, AC091057.1, AC022150.4, XIST | PSD3, ATAD2, CKS2, WEE1, GDF11, KPNA2 |
| hsa-miR-302b | TENM3-AS1, LINC00689, MSC-AS1, LINC00484, SGMS1-AS1, AC091057.1, AC022150.4, XIST | SLC26A9, SATB2, SLC7A11, CDK6, PSD3, ATAD2, C9orf40, TMEFF1, WEE1, GDF11, KPNA2 |
| hsa-miR-372 | TENM3-AS1, LINC00689, MSC-AS1, LINC00484, SGMS1-AS1, AC091057.1, AC022150.4, XIST | PKP1, ENAH, SORBS2, DCDC2, CDK6, PSD3, MCM4, ATAD2, TRIB1, WEE1, GDF11, WDR76, PHLPP2, PRR11, KPNA2, CHAF1B |
| hsa-miR-483 | AL365361.1, LINC01176, LINC00689, AC022150.4, XIST | ALCAM, CACNG8 |
| hsa-miR-503 | AC022167.2, AC106886.2 | ANLN, RECK, WEE1, CHEK1, CDCA4, CHAC1, KIF23, CBX2 |
| hsa-miR-552 | TENM3-AS1, LINC01176, AC091057.1, AC022150.4, XIST, LOXL1-AS1 | IFITM1, RACGAP1, NPTXR |
| hsa-miR-888 | LINC02015, SGMS1-AS1, AC090181.2, RUNDC3A-AS1, AP000526.1, XIST | PMAIP1 |
| hsa-miR-3129 | MIR200CHG, AL035661.1, XIST | BCAT1, TSPAN3 |
Statistical analyses associated with the OS of the mRNAs and lncRNAs in the ceRNA network.
| KPNA | 1.02 | 1.01–1.02 | <0.0001 | 3.53e-10 |
| PRR11 | 1.25 | 1.16–1.34 | <0.0001 | 4.82e-08 |
| 2.14 | 1.68–2.73 | <0.0001 | 9.28e-08 | |
| 1.19 | 1.12–1.23 | <0.0001 | 1.92e-07 | |
| ANLN | 1.06 | 1.04–1.09 | <0.0001 | 6.64e-07 |
| 1075 | 1.46–2.1 | <0.0001 | 1.20e-06 | |
| RACGAP1 | 1.21 | 1.14–1.29 | <0.0001 | 4.29e-06 |
| CHEK1 | 1.20 | 1.1–1.31 | 0.00001 | 4.30e-06 |
| SATB2 | 1.58 | 1.19–2.11 | <0.0019 | 4.35e-06 |
| DNMT3B | 3.01 | 1.85–4.49 | <0.0001 | 5.10e-06 |
| MCM | 1.11 | 1.06–1.16 | <0.0001 | 8.06e-06 |
| 10.4 | 1.03–1.05 | <0.0001 | 2.21e-05 | |
| AXL | 1.01 | 1.01–1.02 | <0.0001 | 2.48e-05 |
| CDCA4 | 1.14 | 1.08–1.2 | <0.0001 | 3.35e-05 |
| TMEFF1 | 129.56 | 4.61–3643.52 | 0.043 | 4.54e-05 |
| 1.17 | 1.07–1.28 | 0.0006 | 5.15e-05 | |
| 1.03 | 1.02–1.04 | <0.0001 | 7.01e-05 | |
| 1.13 | 1.08–1.18 | <0.0001 | 9.47e-05 | |
| C9orf40 | 1.78 | 1.46–2.16 | <0.0001 | 0.000147 |
| ENAH | 1.11 | 1.06–1.16 | <0.0001 | 0.000354 |
| GDF11 | 1.64 | 1.26–2.12 | <0.0001 | 0.000413 |
| WDR76 | 1.45 | 1.24–1.71 | <0.0001 | 0.001005 |
| ATAD2 | 1.26 | 1.14–1.39 | <0.0001 | 0.00202 |
| CPEB3 | 0.50 | 0.34–0.74 | <0.0001 | 0.002249 |
| NPTXR | 1.25 | 1.12–1.39 | <0.0001 | 0.002846 |
| PAPPA | 1.19 | 1.08–1.3 | <0.0002 | 0.004774 |
| SNAI2 | 1.07 | 1.04–1.1 | <0.0001 | 0.006123 |
| DCDC2 | 1.12 | 1.05–1.2 | 0.0014 | 0.006189 |
| DNAJB9 | 0.98 | 0.96–0.99 | 0.0027 | 0.006755 |
| AC026470.1 | 0.75 | 0.69–0.88 | 0.0001 | 0.008526 |
| DCBLD2 | 1.03 | 1.01–1.05 | 0.0007 | 0.01441 |
| FRMD6 | 1.10 | 1.05–1.15 | 0.0001 | 0.035087 |
| SMIM14 | 0.93 | 0.9–0.97 | 0.0007 | 0.041198 |
| hsa-miR-302a | 1.02 | 1–1.04 | 0.0118 | 0.000125 |
| hsa-miR-302b | 1.01 | 1–1.02 | 0.0109 | 0.000134 |
| hsa-miR-503 | 1.01 | 1–1.01 | 0.0062 | 0.007793 |
| AC091057.1 | 3.92 | 2.21–6.97 | <0.0001 | 0.000565 |
| AC022150.4 | 3.99 | 2.1–7.6 | <0.0001 | 0.000763 |
| 0.03 | 0–0.19 | 2.00E-04 | 3.56E-05 | |
| 0.4 | 0.23–0.71 | 0.0017 | 0.030472 | |
| AC087741.1 | 0.58 | 0.41–0.83 | 0.0025 | 0.036343 |
| MSC.AS1 | 1.26 | 1.08–1.48 | 0.0031 | 0.00131 |
| RUNDC3A.AS1 | 10.26 | 1.98–53.1 | 0.0055 | 0.010718 |
Bold: the genes in the survival model.
Figure 5Construction, estimation, and validation of the risk model. (A) Least absolute shrinkage and selection operator (LASSO) coefficient profiles of the 40 genes. The value was chosen by 10-fold cross-validation. (B) The numbers above the graph represent the number of genes involved in the LASSO model. (C,D) Survival curve and ROC curve of the risk model according to the dataset of EGA. (E) Forest plot showing associations between the selected nine genes, the reported clinical factors, and the overall survival in the model. (F–H) Validation of the risk model based on the survival curve, the ROC curve, and the multivariate Cox analysis according to the EGA dataset. The significances of the survival curves were calculated by log-rank test. Also, the AUC values of ROC curves > 70% were regarded as efficient models.
Statistical analyses between the clinical traits and the risk model.
| <60 | 12 | 11 | 0.8049 |
| ≥60 | 28 | 29 | |
| Male | 32 | 33 | 0.7745 |
| Female | 8 | 7 | |
| STAGE I | 3 | 6 | 0.4638 |
| STAGE II+STAGE III+STAGE IV | 37 | 34 | |
| Biphasic mesothelioma | 5 | 17 | |
| Diffuse malignant mesothelioma (NOS) | 2 | 3 | |
| Epithelioid mesothelioma | 33 | 20 |
Bold: the types of clinical traits.
Italic and bold: significant value.
Figure 6Validation of DEGs in the risk model based on the EGA MPM dataset. The statistical analyses were performed by Students' t-test with or without Welch correction according to the TPM of genes.
Figure 7Kaplan–Meier survival analyses. (A-I) Validation of of the nine genes in the model based on the EGA MPM dataset. The statistical significances were determined by the log-rank test.
Figure 8WGCNA analyses of the DEmRNAs. (A) Identification of the co-expression modules. (B) Association between the modules and the traits. (C) Pearson correlations between the genes in the turquoise module and the risk score. (D) Pearson correlations between the genes in the blue module and the risk model. (E) Protein–protein interaction network analysis of the overlapped mRNAs in the ceRNA network and the turquoise module. The size and color of the round represent the number of links of the protein (gene).
Figure 9Gene-enrichment analyses of genes in the blue module and the turquoise module. (A) Top 10 GO-BP terms in the blue module. (B,C) Significant GO-MF and KEGG pathway terms in the blue module. (D,E) Top 10 GO-BP and GO-MF terms in the turquoise module. (F) Significant KEGG terms in the turquoise module. P-value was adjusted by the FDR method and p < 0.05 was significant.
Figure 10Twenty-two types of immune-cell infiltration analyses. (A) The overall changes of 22 types of immune cells in the MPM tissues of the 80 patients. (B–J) The significant differences of the immune cells between the high- and low-score groups based on the Student's t-test.