| Literature DB >> 30700038 |
Nguyen Phuoc Long1, Kyung Hee Jung2, Nguyen Hoang Anh3, Hong Hua Yan4, Tran Diem Nghi5, Seongoh Park6, Sang Jun Yoon7, Jung Eun Min8, Hyung Min Kim9, Joo Han Lim10, Joon Mee Kim11, Johan Lim12, Sanghyuk Lee13, Soon-Sun Hong14, Sung Won Kwon15.
Abstract
Substantial alterations at the multi-omics level of pancreatic cancer (PC) impede the possibility to diagnose and treat patients in early stages. Herein, we conducted an integrative omics-based translational analysis, utilizing next-generation sequencing, transcriptome meta-analysis, and immunohistochemistry, combined with statistical learning, to validate multiplex biomarker candidates for the diagnosis, prognosis, and management of PC. Experiment-based validation was conducted and supportive evidence for the essentiality of the candidates in PC were found at gene expression or protein level by practical biochemical methods. Remarkably, the random forests (RF) model exhibited an excellent diagnostic performance and LAMC2, ANXA2, ADAM9, and APLP2 greatly influenced its decisions. An explanation approach for the RF model was successfully constructed. Moreover, protein expression of LAMC2, ANXA2, ADAM9, and APLP2 was found correlated and significantly higher in PC patients in independent cohorts. Survival analysis revealed that patients with high expression of ADAM9 (Hazard ratio (HR)OS = 2.2, p-value < 0.001), ANXA2 (HROS = 2.1, p-value < 0.001), and LAMC2 (HRDFS = 1.8, p-value = 0.012) exhibited poorer survival rates. In conclusion, we successfully explore hidden biological insights from large-scale omics data and suggest that LAMC2, ANXA2, ADAM9, and APLP2 are robust biomarkers for early diagnosis, prognosis, and management for PC.Entities:
Keywords: diagnostic biomarker; machine learning; meta-analysis; next-generation sequencing; pancreatic ductal adenocarcinoma; prognostic biomarker; systems biology; transcriptomics
Year: 2019 PMID: 30700038 PMCID: PMC6407035 DOI: 10.3390/cancers11020155
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Screening process of innovative biomarker candidates for pancreatic cancer diagnosis and treatment. (a) Combined effect size distributions in transcriptome meta-analysis. (b) Selection of membrane protein-coding genes. (c) Selection of secretory protein-coding genes. The left circle of the Venn diagram (orange) represents the candidates from meta-analysis and the right circle of the Venn diagram (blue) represents the candidates curated from [17].
Literature- and experiment-based validation of the 23 candidates.
| Gene Symbol | Entrez ID | RNA Alteration | Protein Alteration | NGS Results | Meta-Analysis | Neoplasm # vs. Normalcy | PDAC vs. Neoplasm # | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Log FC | FDR | cES | ||||||||
|
| 10551 | ↑RT-PCR 1,2, ↑SAGE 1,2, ↑DM 1 | ↑IHC 1, ↑LFQ-MS 1, ↑WB 2 | 3.67 | 1.55 × 10−15 | 7.37 × 10−13 | 1.93 | 0 | ↑ | X |
|
| 2597 | ↑DM 1, ↑SAGE 1, NB 2 | ↑ICAT 1, ↑WB 1,2, ↓SILAC-TMS 2 | 1.48 | 6.08 × 10−4 | 1.27 × 10−2 | 1.68 | 2.02 × 10−13 | X | X |
|
| 3918 | ↑DM 1,2 | ↑IHC 1 | 3.91 | 1.15 × 10−7 | 1.17 × 10−5 | 2.37 | 0 | ↑ | X |
|
| 4320 | ↑DM 1, ↑ISH 1, ↑SAGE 1,2, ↑NB 1,2 | ↑IHC 1, ↑WB 2 | NA | NA | NA | 1.62 | 7.90 × 10−14 | X | X |
|
| 8407 | NA | ↑SILAC-TMS 1, ↑ICAT 1 | NA | NA | NA | 1.73 | 1.64 × 10−5 | ↑ | X |
|
| 8754 | ↑DM 1,2, ↑RT-PCR 1,2 | ↑IHC 1 | NA | NA | NA | 1.71 | 1.41 × 10−5 | X | X |
|
| 302 | ↑DM 1 | ↑IHC 1, ↑ICAT 1, ↑WB 1,2, ↑MS 1, ↑TDE 1, ↑LFQ-MS 1 | 1.16 | 2.26 × 10−3 | 3.35 × 10−2 | 1.63 | 8.33 × 10−9 | X | X |
|
| 334 | ↑DM 1,2 | ↑SILAC-TMS 1,2, ↑SILAC 2 | NA | NA | NA | 1.53 | 1.11 × 10−5 | X | X |
|
| 1001 | ↑DM 1,2, ↑RT-PCR 1,2 | ↑IHC 1 | 2.72 | 8.70 × 10−6 | 4.50 × 10−4 | 2.02 | 0 | X | ↑ |
|
| 10232 | ↓DM 1, ↑DM 1,2, ↑RT-PCR 1,2, ↑SAGE 1,2, ↑ISH 1 | ↑IHC 1 | 5.21 | 1.89 × 10−15 | 8.77 × 10−13 | 2.03 | 4.88 × 10−4 | ↑ | X |
|
| 5268 | ↑DM 1,2, ↑NB 2 | ↑IHC 1,2, ↑SILAC-TMS 1 | 8.35 | 6.54 × 10−30 | 2.96 × 10−26 | 1.96 | 0 | ↑ | X |
|
| 3732 | ↑NB 1, ↑ISH 1, ↑DM 1 | ↑IHC 1 | 1.39 | 4.55 × 10−4 | 1.03 × 10−2 | 1.62 | 7.90 × 10−14 | X | X |
|
| 51208 | ↑DM 1 | ↑IHC 1 | 2.57 | 9.43 × 10−5 | 2.97 × 10−3 | 1.64 | 7.60 × 10−9 | ↑ | X |
|
| 1969 | ↑DM 1,2 | ↑SILAC-TMS 1, ↑WB 2 | 1.69 | 4.95 × 10−5 | 1.78 × 10−3 | 1.56 | 2.99 × 10−13 | X | ↑ |
|
| 7430 | NA | ↑IHC 1, SILAC-TMS 1 | NA | NA | NA | 1.68 | 0 | ↑ | X |
|
| 5349 | ↑DM 1,2, ↑SAGE 1, ↑NB 1, ↑ISH 1 | NA | 4.71 | 4.03 × 10−21 | 5.21 × 10−18 | 1.91 | 0 | ↑ | X |
|
| 9052 | ↑DM 1,2, ↑SAGE 1,2 | ↑IHC 1 | 3.98 | 2.08 × 10−8 | 2.61 × 10−6 | 1.91 | 1.25 × 10−8 | ↑ | ↑ |
|
| 3673 | ↑DM 1,2 | ↑IHC 1, ↑WB 1 | 2.94 | 1.20 × 10−6 | 8.72 × 10−5 | 2.04 | 2.02 × 10−13 | ↑ | X |
|
| 3694 | ↑DM 1 | ↑IHC 1 | 2.56 | 4.55 × 10−6 | 2.71 × 10−4 | 1.79 | 2.78 × 10−8 | X | X |
|
| 4233 | ↑DM 1, ↑RT-PCR 1,2 | NA | NA | NA | NA | 1.90 | 1.97 × 10−7 | X | X |
|
| 4486 | ↑DM 1,2, ↑RT-PCR 2 | ↑IHC 1, ↑WB 2 | 2.84 | 9.03 × 10−7 | 6.81 × 10−5 | 2.06 | 0 | ↑ | X |
|
| 1728 | ↑DM 1,2 | ↑IHC 1, ↑WB 2 | 3.71 | 1.88 × 10−9 | 3.04 × 10−7 | 2.15 | 1.63 × 10−5 | ↑ | X |
|
| 6513 | ↑DM 1,2 | ↑IHC 1 | 3.28 | 6.34 × 10−10 | 1.16 × 10−7 | 2.03 | 0 | ↑ | ↑ |
# Data obtained from the dataset GSE19650. 1 Evidence in PDAC tissue. 2 Evidence in cancer cell lines. Abbreviations: cES: combined effect size; DM: DNA microarray; FDR: false discovery rate; ICAT: isotope-coded affinity tag; IHC: immunohistochemistry; IPMN: intraductal papillary mucinous neoplasia; ISH: in situ hybridization; LFQ-MS: label-free quantitation mass spectrometry; NB: northern blot; NGS: next-generation sequencing; PDAC: pancreatic ductal adenocarcinoma; RT-PCR: reverse transcription-polymerase chain reaction; SAGE: serial analysis of gene expression; SILAC-TMS: stable isotope labeling by/with amino acids in cell culture-tandem mass spectrometry; TDE: two dimensional electrophoresis; WB: western blot; NA: Not applicable/available.
Figure 2Kaplan–Meier (KM) plots of the overall survival of four promising prognostic candidates. (a) KM plot of ADAM9. (b) KM plot of ANXA2. (c) KM plot of ITGA2. (d) KM plot of MET.
Combined effect sizes, Cox regression analysis, and alterations at the mRNA and protein level of the 23 candidates.
| Gene Symbol | Entrez ID | Cox Regression | |
|---|---|---|---|
| Cox Coefficient | FDR | ||
|
| 10551 | 0.05 | 0.76 |
|
| 2597 | 0.21 | 0.19 |
|
| 3918 | 0.42 | 0.02 |
|
| 4320 | 0.17 | 0.28 |
|
| 8407 | 0.19 | 0.22 |
|
| 8754 | 0.41 | 0.02 |
|
| 302 | 0.32 | 0.04 |
|
| 334 | 0.23 | 0.14 |
|
| 1001 | 0.37 | 0.03 |
|
| 10232 | 0.26 | 0.10 |
|
| 5268 | 0.38 | 0.02 |
|
| 3732 | 0.11 | 0.51 |
|
| 51208 | 0.09 | 0.63 |
|
| 1969 | 0.33 | 0.05 |
|
| 7430 | 0.28 | 0.08 |
|
| 5349 | 0.17 | 0.30 |
|
| 9052 | 0.37 | 0.03 |
|
| 3673 | 0.41 | 0.02 |
|
| 3694 | 0.49 | 0.01 |
|
| 4233 | 0.66 | 0.00 |
|
| 4486 | 0.22 | 0.17 |
|
| 1728 | 0.14 | 0.43 |
|
| 6513 | 0.27 | 0.08 |
Figure 3Data exploration and diagnostic performance of 11 biomarker candidates in the Random Forests model. (a) The gene expression of CDH3 is higher in PDAC than in normal controls (n = 96), ****: p < 0.0001. (b) The gene expression of LAMC2 is higher in PDAC than in normal controls (n = 96). (c) Principal component analysis of PDAC versus normal controls. (d) Heatmap analysis of 11 biomarker candidates. (e) ROC curve of the random forests model in the test set (n = 28). (f) The importance scores of 11 biomarker candidates in the random forests model.
Figure 4Immunohistochemical analysis for determination of LAMC2, ADAM9, and ANXA2 (Annexin A2) gene expression. (a) Scoring system for the three proteins in pancreatic tissues. (b) IHC scores of LAMC2, ADAM9, and ANXA2 in pancreatic tumors and normal controls (n = 86). ***: p < 0.001; ****: p < 0.0001. (c) Pairwise correlation of LAMC2, ADAM9, and ANXA2 expression. (d) Pairwise correlation of LAMC2, ADAM9, and ANXA9 at gene expression level (log2 of transcripts per million) in TGCA PC, TCGA normal pancreas, and Genotype-Tissue Expression Project (GTEx) derived normal pancreas.
Drug–gene and miRNA–gene interactions of 18 membrane proteins.
| Gene Symbol | miRNA–Gene Interaction | Druggable Genome | Drug–Gene Interaction 3 | |||
|---|---|---|---|---|---|---|
| miRNA 1 That Targets the Gene | Validation Methods | Expression Profile in PC (Correlation Coefficient 2) | ||||
| Strong Evidence | Less Strong Evidence | |||||
|
| hsa-miR-126-3p | RA, WB, qPCR | MA | 0.315 | Yes | Ilomastat |
| hsa-miR-33a-5p | RA, WB, qPCR | 0.373 | ||||
| hsa-miR-125a-5p | qPCR | 0.320 | ||||
|
| hsa-miR-155-5p | RA, WB, qPCR | MA | −0.599 | Yes | NA |
| hsa-miR-206 | RA, WB, qPCR | −0.312 | ||||
|
| NA | Yes | NA | |||
|
| NA | Yes | NA | |||
|
| hsa-miR-21-5p | RA | 0.715 | Yes | Amatuximab | |
|
| hsa-miR-21-5p | RA, WB, qPCR | MA, NGS | 0.709 | Yes | NA |
| hsa-miR-103a-3p | RA, WB, qPCR | MA, NGS | 0.501 | |||
|
| NA | Yes | NA | |||
|
| NA | Yes | Claudiximab | |||
|
| NA | Yes | Dasatinib, Dorsomorphin, Regorafenib, Vandetanib | |||
|
| hsa-miR-183-5p | RA, WB, qPCR | 0.424 | NA | NA | |
| hsa-miR-204-5p | RA, WB, qPCR | MA | −0.542 | |||
| hsa-miR-205-5p | RA | MA, NGS | 0.358 | |||
|
| NA | No | NA | |||
|
| hsa-miR-103a-3p | RA, WB, qPCR | NGS | 0.514 | Yes | NA |
|
| hsa-miR-16-5p | qPCR | NGS, pSILAC | 0.509 | Yes | Abciximab, CHEMBL36326, Eptifibatide, Tirofiban, Vatelizumab |
|
| NA | Yes | DI17E6, Intetumumab, STX-100 | |||
|
| hsa-miR-34c-5p | RA, WB, qPCR | MA, NGS | 0.337 | Yes | ARRY-300, ABT-700, AMG-337, AMG-208, Amuvatinib, Altiratinib, Amoxicillin, Alectinib, BMS-698769, BMS-777607, BMS-794833, BMS-817378, BPI-9016, Crizotinib, Clofibrate, Cabozantinib, Cabozantinib S-Malate, Capmatinib, Crenolanib, Emibetuzumab, EMD-1204831, Foretinib, Golvatinib, JNJ-38877605, LY-2875358, MK-8033, MGCD-265, Merestinib, MK-2461, Onartuzumab, PF-04217903, PHA-665752, Pyrazinamide, Rilonacept, SGX-523, Savolitinib, Tepotinib, Tivantinib, Tanespimycin, SAR-125844, TAS-115 |
| hsa-miR-199a-3p | RA, WB, qPCR | MA | 0.365 | |||
| hsa-miR-34a-5p | RA, WB, qPCR | 0.478 | ||||
| hsa-miR-23b-3p | RA, WB, qPCR | 0.447 | ||||
| hsa-miR-27a-3p | RA, WB, qPCR | 0.444 | ||||
| hsa-miR-27b-3p | RA, WB, qPCR | 0.458 | ||||
| hsa-miR-31-5p | RA, qPCR | 0.594 | ||||
| hsa-miR-34a-3p | WB | 0.380 | ||||
|
| NA | Yes | BMS-777607, Foretinib, MGCD-265, MK-2461, MK-8033 | |||
|
| NA | Yes | Apaziquone, Dicumarol, Vatiquinone | |||
|
| hsa-miR-22-3p | RA, WB, qPCR | NGSs | 0.331 | Yes | NA |
1 Only miRNAs that have strong evidence were included, 2 p-value of the correlation coefficient was less than 0.05. 3 Only defined interactions were included. Abbreviation: MA: microarray; NGS: next-generation sequencing; pSILAC: pulsed stable isotope labeling by amino acids in cell culture; qPCR: quantitative polymerase chain reaction; RA: reporter assay; WB: western blot.