| Literature DB >> 30747226 |
Yan Liu1, Dongyan Zhu2, Hongjian Xing3, Yi Hou4, Yan Sun1.
Abstract
Pancreatic adenocarcinoma (PAC) is the most common type of pancreatic cancer, which commonly has an unfavorable prognosis. The present study aimed to develop a novel prognostic prediction strategy for PAC patients. mRNA sequencing data of PAC (the training dataset) were extracted from The Cancer Genome Atlas database, and the validation datasets (GSE62452 and GSE79668) were acquired from the Gene Expression Omnibus database. The differentially expressed genes (DEGs) between good and poor prognosis groups were analyzed by limma package, and then prognosis‑associated genes were screened using Cox regression analysis. Subsequently, the risk score system was constructed and confirmed using Kaplan‑Meier (KM) survival analysis. After the survival associated‑clinical factors were screened using Cox regression analysis, they were performed with stratified analysis. Using DAVID tool, the DEGs correlated with risk scores were conducted with enrichment analysis. The results revealed that there were a total of 242 DEGs between the poor and good prognosis groups. Afterwards, a risk score system was constructed based on 6 prognosis‑associated genes (CXCL11, FSTL4, SEZ6L, SPRR1B, SSTR2 and TINAG), which was confirmed in both the training and validation datasets. Cox regression analysis showed that risk score, targeted molecular therapy, and new tumor (the new tumor event days after the initial treatment according to the TCGA database) were significantly related to clinical prognosis. Under the same clinical condition, 6 clinical factors (age, history of chronic pancreatitis, alcohol consumption, radiation therapy, targeted molecular therapy and new tumor (event days) had significant associations with clinical prognosis. Under the same risk condition, only targeted molecular therapy was significantly correlated with clinical prognosis. In conclusion, the 6‑gene risk score system may be a promising strategy for predicting the outcome of PAC patients.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30747226 PMCID: PMC6365694 DOI: 10.3892/or.2019.6979
Source DB: PubMed Journal: Oncol Rep ISSN: 1021-335X Impact factor: 3.906
Clinical information of The Cancer Genome Atlas (TCGA) dataset and the validation datasets (GSE79668 and GSE62452).
| Clinical factors | TCGA (n=178) | GSE79668 (n=51) | GSE62452 (n=69) |
|---|---|---|---|
| Age, years (mean ± SD) | 64.69±11.09 | 64.04±11.57 | – |
| Sex (male/female/-) | 91/74/13 | 32/19 | – |
| Chronic pancreatitis history (yes/no/-) | 13/117/48 | – | – |
| Diabetes history (yes/no/-) | 36/99/43 | 22/29 | – |
| Alcohol (yes/no/-) | 97/57/24 | – | – |
| Tobacco (never/reform/current/-) | 59/56/19/44 | – | – |
| New tumor (yes/no/-) | 55/101/22 | – | – |
| Pathologic_M (M0/M1/-) | 75/3/100 | 48/1/2 | – |
| Pathologic_N (N0/N1/-) | 44/117/17 | 14/37 | – |
| Pathologic_T (T1/T2/T3/T4/-) | 8/19/134/3/14 | 3/12/31/5 | – |
| Pathologic_stage (I/II/III/IV/-) | 20/137/4/3/ | – | 4/46/13/6 |
| Radiation therapy (yes/no/-) | 38/107/33 | – | |
| Targeted molecular therapy (yes/no/-) | 102/48/28 | – | |
| Deceased (death/alive/-) | 83/8213 | 45/6 | 49/16/4 |
| Overall survival months (mean ± SD) | 17.11±15.35 | 26.78±26.12 | 20.21±16.69 |
-, Indicates information unavailable. SD, standard deviation.
The 6 prognosis-associated genes to establish the risk score system.
| Genes | coef | HR | P-value |
|---|---|---|---|
| 0.451453 | 0.6367 | 0.0031 | |
| 0.54981 | 0.5771 | 0.0025 | |
| −1.18976 | 3.2863 | <0.0001 | |
| 0.37643 | 0.6863 | 0.0004 | |
| 1.17541 | 0.3087 | 0.0035 | |
| 0.26515 | 0.7671 | 0.0163 |
HR, hazard ratio.
Figure 1.Overall survival of pancreatic adenocarcinoma (PAC) patients in low- and high-risk groups in The Cancer Genome Atlas (TCGA) dataset (A), GSE62452 (B), and GSE79668 (C). Red and black separately represent high- and low-risk groups.
Figure 2.Expression distributions of the 6 prognosis-associated genes in the high- and low-risk groups of The Cancer Genome Atlas (TCGA) dataset (A), GSE62452 (B) and GSE79668 (C). *0.01≤P<0.05; **0.005≤P<0.01; ***P<0.005.
Cox regression analysis for selecting the clinical factors significantly related to prognosis.
| Clinical characteristics | Univariable Cox P-value | Multivariable Cox P-value |
|---|---|---|
| Age in years (above/below median) | 0.0781 | |
| Sex (male/female) | 0.5630 | |
| Pathologic_M (M0/M1) | 0.3020 | |
| Alcohol (yes/no) | 0.8190 | |
| Tobacco (never/reform/current) | 0.1490 | |
| Chronic pancreatitis history (yes/no) | 0.6990 | |
| Diabetes history (yes/no) | 0.6830 | |
| Pathologic_N (N0/N1) | 0.0128 | 0.2683 |
| Pathologic_T (T1/T2/T3/T4) | 0.0338 | 0.2258 |
| Radiation therapy (yes/no) | 0.0371 | 0.5174 |
| New tumor (yes/no) | ||
| Targeted molecular therapy (yes/no) | ||
| Risk score |
P-values in bold print indicate significant correlations.
Results of differential expression analysis after dividing the samples into groups according to different clinical factors.
| Clinical factors | Downregulated genes | Upregulated genes |
|---|---|---|
| Age in years (above vs. below median) | ||
| Sex (male vs. female) | ||
| Chronic pancreatitis history | ||
| (yes vs. no) | ||
| Diabetes history (yes vs. no) | ||
| Alcohol (yes vs. no) | ||
| Tobacco (never vs. reform) | – | – |
| Tobacco (never vs. current) | ||
| Pathologic_M (M0 vs. M1) | – | – |
| Pathologic_T (T1+ T2 vs. T3+T4) | ||
| Pathologic_N (N0 vs. N1) | ||
| Radiation therapy (yes vs. no) | ||
| Targeted molecular therapy (yes vs. no) | ||
| New tumor (yes vs. no) |
Tobacco: current, subjects who smoke at least once a month; reform, those who have tried smoking but have quit; never, those who have never tried tobacco). Pathologic_M: M0, no distant metastasis; M1, distant metastasis. Pathologic_T: T1, unilateral tumor 80 cm2 or less in area; T2, unilateral tumor more than 80 cm2 in area; T3, unilateral tumor rupture before treatment; T4, bilateral tumors. Pathologic_N: N0, no regional lymph node metastasis; N1, regional lymph node metastasis. New tumor, tumor metastasis or spread to other parts of the body.
Results of the stratified analysis under the same clinical condition.
| Clinical factors | P-value |
|---|---|
| Age (≥65 years, n=75) | 0.0526 |
| Age (<65 years, n=76) | 0.0011 |
| Sex (male, n=85) | 0.2520 |
| Sex (female, n=64) | 0.7210 |
| Chronic pancreatitis history (yes, n=13) | 0.6360 |
| Chronic pancreatitis history (no, n=109) | <0.0001 |
| Diabetes history (yes, n=32) | 0.1200 |
| Diabetes history (no, n=94) | 0.0936 |
| Alcohol (yes, n=90) | 0.0018 |
| Alcohol (no, n=51) | 0.0071 |
| Tobacco (never, n=54) | 0.3370 |
| Tobacco (reform, n=53) | 0.0502 |
| Tobacco (current, n=17) | 0.1180 |
| Pathologic_M (M0, N=68) | 0.6310 |
| Pathologic_M (M1, n=2) | – |
| Pathologic_N (N0, n=40) | 0.1930 |
| Pathologic_N (N1, n=105) | 0.0537 |
| Pathologic_T (T1+T2, n=24) | 0.1540 |
| Pathologic_T (T3+T4, n=124) | 0.0567 |
| Radiation therapy (yes, n=37) | 0.0215 |
| Radiation therapy (no, n=102) | 0.0021 |
| Targeted molecular therapy (yes, n=98) | 0.0357 |
| Targeted molecular therapy (no, n=45) | 0.0019 |
| New tumor (yes, n=54) | 0.0357 |
| New tumor (no, n=87) | 0.0193 |
Figure 3.The Kaplan-Meier (KM) survival curves for the 6 clinical factors (age, alcohol use, new tumor, targeted molecular therapy, chronic pancreatitis history and radiation therapy) in high- and low-risk groups under the same clinical condition. (A) Survival curves for patients below the age of 65 (left), patients above the age of 65 (middle), and patients below or above the age of 65 years (right). (B) The survival curves for no-alcohol group (left), alcohol group (middle), and no-alcohol or alcohol groups (right). (C) Survival curves for no new tumor group (left), new tumor group (middle), and no new tumor or new tumor groups (right). (D) Survival curves for no targeted therapy group (left), targeted therapy group (middle), and no targeted therapy or targeted therapy groups (right). (E) Survival curves for no chronic pancreatitis group (left), chronic pancreatitis group (middle), and no chronic pancreatitis or chronic pancreatitis groups (right). (F) Survival curves for no radiation therapy group (left), radiation therapy group (middle), and no radiation therapy or radiation therapy groups (right). Red and black separately represent high- and low-risk groups.
Results of the stratified analysis under the same risk condition.
| Clinical factors | High risk | Low risk |
|---|---|---|
| Age in years (above vs. below median) | 0.888 | 0.056 |
| Sex (male vs. female) | 0.622 | 0.939 |
| Pathologic_M (M0/M1) | 0.869 | 0.368 |
| Pathologic_N (N0 vs. N1) | 0.332 | 0.906 |
| Pathologic_T (T1 vs. T2 vs. T3) | 0.308 | 0.098 |
| Chronic pancreatitis history (yes vs. no) | 0.267 | 0.917 |
| Diabetes history (yes vs. no) | 0.643 | 0.997 |
| Alcohol (yes vs. no) | 0.803 | 0.977 |
| Tobacco (never vs. reform vs. current) | 0.210 | 0.534 |
| Radiation therapy (yes vs. no) | 0.668 | 0.173 |
| Targeted molecular therapy (yes vs. no) | 0.002 | 0.154 |
| New tumor (yes vs. no) | 0.389 | 0.997 |
Figure 4.Kaplan-Meier (KM) survival curves for targeted molecular therapy in the high- and low-risk groups under the same risk condition. (A) Survival curve for no targeted therapy group. (B) Survival curve for targeted therapy group. (C) Survival curve for no targeted therapy or targeted therapy groups. Red and black separately represent high- and low-risk groups.
Figure 5.Risk scores and survival time of the patients, as well as the expression heatmaps of the 6 prognosis-associated genes separately in The Cancer Genome Atlas (TCGA) dataset (A), GSE79668 (B) and GSE62452 (C).
Figure 6.Clustering heatmap for the top 20 differentially expressed genes (DEGs) positively or negatively related to risk scores (A), and the significantly enriched biological processes (B) and pathways (C) for the risk score-associated DEGs.