| Literature DB >> 32117736 |
Haojie Yang1, Hong-Cheng Lin2, Hua Liu1, Dan Gan1, Wei Jin1, Can Cui1, Yixin Yan3, Yiming Qian3, Changpeng Han1, Zhenyi Wang1.
Abstract
Colon adenocarcinoma (COAD) is a common type of colon cancer, and post-operative recurrence and metastasis may occur in COAD patients. This study is designed to build a risk score system for COAD patients. The Cancer Genome Atlas (TCGA) dataset of COAD (the training set) was downloaded, and GSE17538 and GSE39582 (the validation sets) from Gene Expression Omnibus database were obtained. The differentially expressed RNAs (DERs) were analyzed by limma package. Using survival package, the independent prognosis-associated long non-coding RNAs (lncRNAs) were selected for constructing risk score system. After the independent clinical prognostic factors were screened out using survival package, a nomogram survival model was constructed using rms package. Furthermore, competitive endogenous RNA (ceRNA) regulatory network and enrichment analyses separately were performed using Cytoscape software and DAVID tool. Totally 404 DERs between recurrence and non-recurrence groups were identified. Based on the six independent prognosis-associated lncRNAs (including H19, KCNJ2-AS1, LINC00899, LINC01503, PRKAG2-AS1, and SRRM2-AS1), the risk score system was constructed. After the independent clinical prognostic factors (Pathologic M, pathologic T, and RS model status) were identified, the nomogram survival model was built. In the ceRNA regulatory network, there were three lncRNAs, four miRNAs, and 77 mRNAs. Additionally, PPAR signaling pathway and hedgehog signaling pathway were enriched for the mRNAs in the ceRNA regulatory network. The risk score system and the nomogram survival model might be used for predicting COAD recurrence. Besides, PPAR signaling pathway and hedgehog signaling pathway might affect the recurrence of COAD patients.Entities:
Keywords: colon adenocarcinoma; competitive endogenous RNA; differential expression analysis; nomogram survival model; risk score system
Year: 2020 PMID: 32117736 PMCID: PMC7015976 DOI: 10.3389/fonc.2020.00081
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1The screening results of differentially expressed RNAs (DERs). (A) The volcano plot (left; the horizontal dashed line represents false discovery rate (FDR) < 0.05, and the vertical dashed lines represent |log2 fold change (FC)| > 0.263; the blue dots indicate DERs; FC: fold change) and the histogram showing the proportional distribution of different kinds of DERs (right; pink and blue separately represent up-regulation and down-regulation; lncRNA: long non-coding RNA, miRNA: microRNA); (B) The hierarchical clustering heatmap (in sample strip, blue and pink separately represent recurrence samples and non-recurrence samples).
Univariable Cox regression analysis identified 21 DE-lncRNAs related to recurrence prognosis.
| 0.834 | 2.3 | 0.209 | 4 | 6.40E-05 | |
| 1.34 | 3.82 | 0.404 | 3.32 | 9.20E-04 | |
| 0.178 | 1.19 | 0.0547 | 3.26 | 1.10E-03 | |
| 0.974 | 2.65 | 0.309 | 3.15 | 1.60E-03 | |
| 3.37 | 29.2 | 1.13 | 2.98 | 2.90E-03 | |
| 0.55 | 1.73 | 0.187 | 2.94 | 3.30E-03 | |
| 0.905 | 2.47 | 0.317 | 2.86 | 4.20E-03 | |
| −0.521 | 0.594 | 0.184 | −2.82 | 4.70E-03 | |
| 0.309 | 1.36 | 0.11 | 2.81 | 5.00E-03 | |
| 2.18 | 8.87 | 0.843 | 2.59 | 9.60E-03 | |
| 0.338 | 1.4 | 0.136 | 2.49 | 1.30E-02 | |
| 0.338 | 1.4 | 0.138 | 2.44 | 1.50E-02 | |
| 0.506 | 1.66 | 0.218 | 2.32 | 2.00E-02 | |
| 0.716 | 2.05 | 0.308 | 2.32 | 2.00E-02 | |
| 0.927 | 2.53 | 0.405 | 2.29 | 2.20E-02 | |
| −0.49 | 0.612 | 0.222 | −2.21 | 2.70E-02 | |
| −2.06 | 0.127 | 0.948 | −2.18 | 2.90E-02 | |
| −0.366 | 0.693 | 0.181 | −2.03 | 4.20E-02 | |
| 0.327 | 1.39 | 0.161 | 2.03 | 4.20E-02 | |
| −0.874 | 0.417 | 0.435 | −2.01 | 4.40E-02 | |
| 0.669 | 1.95 | 0.334 | 2 | 4.50E-02 |
The information of the six independent prognosis-associated long non-coding RNAs (lncRNAs).
| 0.1647 | 2.23E-02 | 1.179 | 1.024–1.358 | |
| −1.1438 | 1.60E-02 | 0.319 | 0.126–0.808 | |
| 1.4922 | 1.65E-03 | 4.447 | 1.755–6.265 | |
| 0.8525 | 1.22E-03 | 2.346 | 1.399–3.932 | |
| −0.8100 | 6.73E-04 | 0.445 | 0.279–0.710 | |
| 3.8523 | 1.35E-02 | 7.100 | 2.213–10.431 |
CI, confidence interval.
Figure 2The Kaplan-Meier (KM) curves and receiver operating characteristic (ROC) curves showing the correlations between the grouping based on the risk score system and the actual recurrence prognosis. (A) The KM curves for The Cancer Genome Atlas (TCGA) dataset; (B) The KM curves for the validation set GSE17538; (C) The KM curves for the validation set GSE39582; (D) The ROC curves for the TCGA dataset and the validation sets; In KM curves, blue and red curves separately represent low risk group and high risk group. In ROC curves, red, black, and blue curves represent the TCGA dataset, the validation set GSE17538, and the validation set GSE39582, respectively. HR, hazard ratio; AUC, area under the receiver operating characteristic curve.
The selection of independent clinical prognostic factors.
| Age (years, mean ± SD) | 65.73 ± 12.71 | 0.994 [0.975–1.014] | 5.549E-01 | – | – |
| Gender (Male/Female) | 169/141 | 1.802 [0.885–2.993] | 2.103E-01 | – | – |
| Pathologic M (M0/M1/–) | 226/43/41 | 3.550 [1.969–6.400] | 7.051E-06 | 3.450 [1.107–10.76] | 3.280E-02 |
| Pathologic N (N0/N1/N2) | 180/77/53 | 1.948 [1.449–2.619] | 4.638E-06 | 1.317 [0.749–2.314] | 3.378E-01 |
| Pathologic T (T1/T2/T3/T4) | 8/55/212/35 | 2.461 [1.507–4.019] | 5.105E-04 | 1.775 [1.167–3.259] | 4.640E-02 |
| Pathologic stage (I/II/III/IV/–) | 51/118/88/43/10 | 1.761 [1.329–2.334] | 6.207E-05 | 0.717 [0.334–1.542] | 3.947E-01 |
| Lymphatic invasion (Yes/No/–) | 109/175/26 | 2.401 [0.441–3.997] | 5.247E-02 | – | – |
| Colon polyps history (Yes/No/–) | 78/176/56 | 0.836 [0.443–1.578] | 5.801E-01 | – | – |
| RS model status (High/Low) | 155/155 | 4.396 [2.467–7.832] | 5.164E-08 | 3.793 [2.009–7.161] | 3.940E-05 |
| Recurrence (Yes/No) | 66/244 | – | – | – | – |
| Recurrence free survival time (months, mean ± SD) | 29.66 ± 25.46 | – | – | – | – |
TCGA, The Cancer Genome Atlas; HR, Hazard Ratio; CI, confidence interval.
Figure 3The Kaplan-Meier (KM) curves showed the associations between pathologic M/pathologic T and recurrence prognosis. (A) The KM curves for pathologic M (blue and red curves separately represent the samples in pathologic M0 stage and pathologic M1 stage); (B) The KM curves for pathologic T (black, red, blue, and purples curves represent the samples in pathologic T1 stage, pathologic T2 stage, pathologic T3 stage, and pathologic T4 stage, respectively). HR, hazard ratio.
Figure 4Nomogram survival model. (A) The nomogram survival model. (B) The graph showing the consistency of the nomogram-predicted survival probability and the actual survival probability (the horizontal and vertical axes represent the predicted survival probability and the actual survival probability, respectively; black and red segments separately represent the 5-years survival rate and the 3-years survival rate in the group with the highest consistency).
Figure 5The long non-coding RNA (lncRNA)-microRNA (miRNA) regulatory network. Squares and triangles represent lncRNAs and miRNAs, respectively. Red and green separately represent up-regulation and down-regulation. FC, fold change.
Figure 6The microRNA (miRNA)-mRNA regulatory network. Circles and triangles represent mRNAs and miRNAs, respectively. Red and green separately represent up-regulation and down-regulation. FC, fold change.
Figure 7The competing endogenous RNA (ceRNA) regulatory network. Squares, circles, and triangles represent long non-coding RNAs (lncRNAs), mRNAs, and microRNAs (miRNAs), respectively. Red and green separately represent up-regulation and down-regulation. Red and black edges represent lncRNA-miRNA and miRNA-mRNA pairs, respectively. FC, fold change.
The enrichment results for the mRNAs in the competitive endogenous RNA (ceRNA) regulatory network.
| GO_Biology process | GO:0045664~regulation of neuron differentiation | 5 | 1.68E-03 | |
| GO:0050767~regulation of neurogenesis | 5 | 3.74E-03 | ||
| GO:0015698~inorganic anion transport | 4 | 5.45E-03 | ||
| GO:0051960~regulation of nervous system development | 5 | 6.26E-03 | ||
| GO:0060284~regulation of cell development | 5 | 7.86E-03 | ||
| GO:0006820~anion transport | 4 | 1.75E-02 | ||
| GO:0007015~actin filament organization | 3 | 3.12E-02 | ||
| GO:0045017~glycerolipid biosynthetic process | 3 | 3.79E-02 | ||
| GO:0007409~axonogenesis | 4 | 3.79E-02 | ||
| GO:0048878~chemical homeostasis | 6 | 4.61E-02 | ||
| GO:0048667~cell morphogenesis involved in neuron differentiation | 4 | 4.62E-02 | ||
| GO:0048812~neuron projection morphogenesis | 4 | 4.84E-02 | ||
| GO:0030308~negative regulation of cell growth | 3 | 4.87E-02 | ||
| KEGG pathway | hsa03320:PPAR signaling pathway | 3 | 3.82E-03 | |
| hsa04080:Neuroactive ligand-receptor interaction | 4 | 1.06E-02 | ||
| hsa04672:Intestinal immune network for IgA production | 2 | 2.00E-02 | ||
| hsa04340:Hedgehog signaling pathway | 2 | 2.25E-02 | ||
| hsa04360:Axon guidance | 2 | 4.47E-02 |
GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes (KEGG).