| Literature DB >> 29340021 |
Guodong Shi1,2, Jingjing Zhang1,2, Zipeng Lu1,2, Dongfang Liu1,2, Yang Wu1,2, Pengfei Wu1,2, Jie Yin1,2, Hao Yuan1,2, Qicong Zhu1,2, Lei Chen1,2, Yue Fu1,2, Yunpeng Peng1,2, Yan Wang3, Kuirong Jiang1,2, Yi Miao1,2.
Abstract
Pancreatic ductal adenocarcinoma (PDAC) death rate and recurrence rate have remained obstinately high. Current methods can not satisfy the need of predicting cancer relapse accurately. Utilizing expression profiles of 10 GEO datasets (N = 774), we identified 154 differentially expressed genes (DEGs) between PDAC and normal pancreas tissue or paracancerous tissue. Next we built a 16-mRNA-based signature by means of the LASSO COX regression model. We also validated the prognostic value of the signature. Patients were classified into high-risk and low-risk group according to the signature risk score; 1 year RFS was 45% (95% CI: 31.6%-63.9%) for high-risk group in contrast to 92.5% (95% CI: 86.3%-99.1%) for low-risk group. Moreover, it could predict RFS well in cases with the receipt of different treatment modalities. The 16-mRNA-based signature was an independent and powerful prognostic biomarker for RFS for PDAC patients (HR = 7.74, 95% CI: 3.25-18.45, p < 0.0001).Entities:
Keywords: bioinformatics; biomarker; pancreatic ductal adenocarcinoma; predictive signature; relapse-free survival
Year: 2017 PMID: 29340021 PMCID: PMC5762289 DOI: 10.18632/oncotarget.22861
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Datasets enrolled in the study
| GEO ID or data source | T | P | N | Platform |
|---|---|---|---|---|
| GSE62165 (Janky | 118 | - | 13 | Affymetrix U219 |
| GSE91035 (Schmittgen | 27 | 15 | 8 | Agilent LincRNA G3 |
| GSE32676 (Donahue | 25 | - | 7 | Affymetrix U133 Plus 2 |
| GSE71989 (Schmittgen | 13 | - | 8 | Affymetrix U133 Plus 2 |
| GSE62452 (Yang | 69 | 61 | - | Affymetrix Gene 1.0 ST |
| GSE28735 (Zhang | 45 | 45 | - | Affymetrix Gene 1.0 ST |
| GSE15471 (Badea | 36 | 36 | - | Affymetrix U133 Plus 2 |
| GSE16515 (Pei | 36 | 16 | - | Affymetrix U133 Plus 2 |
| GSE71729 (Moffitt | 145 | 46 | - | Agilent G4112F |
| GSE58561 (Wennerström | 3 | 2 | - | Agilent G3 |
| TCGA-PAAD (The Cancer Genome Atlas Research Network, 2014) | 138 | - | - | Illumina HiSeq V2 |
N: normal pancreas tissue samples; P: paracancerous tissue samples; PAAD: patients with pancreatic adenocarcinoma; T: tumor samples.
Figure 1Vocano plots of DEGs in selected datasets
(A–K) GSE91035 consists of gene expression profiles of not only TvsN but also TvsP. X-axis: log[2](FC); Y-axis: -log[10](FDR) for each gene. Genes with FDR < 0.05 and absolute FC > 2 are considered as DEGs in each series. Blue: down-regulated genes; Green: non-differential genes; Red: up-regulated genes.
Figure 2DEGs between PDAC and normal pancreas tissue or paracancerous tissue
(A–D) Overlap analysis in 11 different datasets. Every ellipse corresponds to a dataset. The numbers of DEGs in each overlapped area are marked in relevant position. DEGs within 2 dataset or more were regarded as credible in each venn diagram. (E) Top 10 terms in each of three categories of GO enrichment analysis. Red: biological processes (BP); Green: cellular component (CC); Blue: molecular functions (MF). (F) KEGG pathway analysis of DEGs. Seven signal pathways are significantly enriched.
Figure 3The 16-mRNA-based signature for predicting RFS
(A) Visualization of the coefficient profiles of fitting LASSO COX model. Each curve represents a variable. It shows the path of its coefficient against the L1-norm of the whole coefficient vector at as λ varies. A vertical line is drawn at the value chosen by 10-fold cross-validation. The above axis: the number of nonzero coefficients at as λ varies. It represents the degrees of freedom for model. X-axis: L1 Norm, the summation of absolute nonzero coefficients at as λ varies. Y-axis: the values of nonzero coefficients at as λ varies. (B) Time-dependent ROC curve in 138 PDAC patients. (C) The Kaplan-Meier survival curve for patients in different risk groups. (D) Risk score analysis for patients. Top: the distribution of RFS status. Bottom: risk score of 16-mRNA-based signature. (E) Comparison of prognostic accuracy between the signature and clinical characteristics. (F) Comparison of prognostic accuracy between the signature and single mRNAs.
Result of COX univariable analysis and multivariable analysis of RFS
| Characteristics | Univariable HR (95% CI) | Multivariable HR (95% CI) | ||
|---|---|---|---|---|
| Gender (Female vs. male) | 1.2 (0.64–2.26) | 0.5615 | ||
| Age (≥65 years vs. <65) | 1.01 (0.54–1.92) | 0.9652 | ||
| History of diabetes (YES vs. No) | 0.79 (0.36–1.75) | 0.5578 | ||
| History of chronic pancreatitis (YES vs. No) | 0.72 (0.22–2.39) | 0.5943 | ||
| Family history of cancer (YES vs. No) | 0.73 (0.34–1.56) | 0.4186 | ||
| Tobacco smoking history (YES vs. No) | 1.51 (0.73–3.09) | 0.2625 | ||
| Alcohol history documented (YES vs. No) | 1.39 (0.7–2.79) | 0.3469 | ||
| Location (tail + body vs. head) | 0.47 (0.18–1.23) | 0.1161 | 0.35 (0.1–1.23) | 0.1021 |
| Histologic grade (G3 + G4 vs. G1 + G2) | 2.54 (1.34–4.84) | 2.09 (0.87–4.98) | 0.0973 | |
| Pathological stage (III + IV vs. II vs I) | 1.55 (0.73–3.31) | 0.257 | ||
| N stage (N1 vs. N0) | 3.14 (1.3–7.56) | |||
| T stage (T3 + T4 vs. T1 + T2) | 2.77 (0.96–7.94) | |||
| Maximum tumor dimension (≥3.5 mm vs. <3.5 mm) | 1.66 (0.86–3.2) | 0.1304 | 1.57 (0.71–3.51) | 0.2662 |
| Residual tumor (R1 + R2 vs. R0) | 1.56 (0.77–3.14) | 0.2095 | ||
| Radiation therapy (YES vs. No) | 1.2 (0.61–2.35) | 0.5895 | ||
| Targeted molecular therapy (YES vs. No) | 1.14 (0.54–2.41) | 0.7388 | ||
| Primary therapy outcome success (SD + PD vs. CR + PR) | 2.42 (1.24–4.7) | 1.31 (0.61–2.83) | 0.4885 | |
| AJCC8th T stage (T3 + T4 vs. T1 + T2) | 1.37 (0.73–2.58) | 0.3273 | ||
| AJCC8th N stage (N2 vs. N1 vs. N0) | 1.59 (1.06–2.39) | 1.87 (0.74–4.73) | 0.1828 | |
| AJCC8th Pathological Stage (III + IV vs. II vs I) | 1.49 (0.95–2.32) | 0.0784 | 0.92 (0.34–2.49) | 0.8699 |
| 16-mRNA-based signature (High vs. Low) | 12.79 (5.5–29.74) | 7.74 (3.25–18.45) |
AJCC: American Joint Committee on Cancer; CI: Confidence interval; CR: Complete remission /response; HR: Hazard ratio; PD: Progressive disease; PR: Partial remission/response; SD: Stable disease.
Figure 4(A–D) The Kaplan-Meier survival curves for PDAC patients in different subgroups stratified by radiation and targeted molecular therapy. Information of partial patients on radiation or TMT was incomplete and deleted. TMT: targeted molecular therapy.