| Literature DB >> 31151932 |
Qiujin Shen1, Karol Polom2, Coralie Williams3, Felipe Marques Souza de Oliveira4, Mariana Guergova-Kuras5, Frederique Lisacek6, Niclas G Karlsson7, Franco Roviello8, Masood Kamali-Moghaddam9.
Abstract
BACKGROUND: Gastric cancer (GC) is the third leading cause of cancer death. Early detection is a key factor to reduce its mortality.Entities:
Keywords: Biomarker; Diagnosis; Gastric cancer; PEA; Proteomics
Mesh:
Substances:
Year: 2019 PMID: 31151932 PMCID: PMC6606959 DOI: 10.1016/j.ebiom.2019.05.044
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Fig. 1A schematic diagram overview of the study. GC, gastric cancer. PEA, proximity extension assay. LOD, limit of detection.
Demographic and pathologic characteristics of 50 control subjects and 100 patients with gastric cancer.
| Variables | No. | Total No. | |
|---|---|---|---|
| Control group | |||
| Age (year) | Mean (range) | 63·5 (25–91) | 50 |
| Gender | F | 38 | 50 |
| M | 12 | ||
| CEA (ng/ml) | Mean (SD) | 1·9 (1·1) | 50 |
| CA19–9 (ng/ml) | Mean (SD) | 20·4 (32·8) | 36 |
| Missing | 14 | ||
| CA72–4 (ng/ml) | Mean (SD) | 2·2 (1·6) | 17 |
| Missing | 33 | ||
| GC group | |||
| Age (year) | Mean (range) | 70·6 (30–92) | 100 |
| Gender | F | 41 | 100 |
| M | 59 | ||
| CEA (ng/ml) | Mean (SD)_Pre | 40·1 (96·1) | 97 |
| Mean (SD)_Post | 13·7 (259·7) | ||
| missing | 3 | ||
| CA19–9 (ng/ml) | Mean (SD)_Pre | 261.6 (1024.4) | 97 |
| Mean (SD)_Post | 93.1 (490.1) | ||
| missing | 3 | ||
| CA72–4 (ng/ml) | Mean (SD)_Pre | 34.3 (143.9) | 90 |
| Mean (SD)_Post | 25.2 (116.5) | ||
| missing | 10 | ||
| Blood type ABO | A | 42 | 94 |
| B | 11 | ||
| O | 38 | ||
| AB | 3 | ||
| Missing | 6 | ||
| Blood type Rh | Rh- | 10 | 94 |
| Rh+ | 84 | ||
| missing | 6 | ||
| Stomach cancer history | No | 84 | 99 |
| Yes | 15 | ||
| missing | 1 | ||
| MSI status | Stable | 63 | 98 |
| High | 35 | ||
| missing | 2 | ||
| Tumour diameter | <50 mm | 30 | 98 |
| ≥50 mm | 68 | ||
| missing | 2 | ||
| TNM stage | I | 8 | 99 |
| II | 20 | ||
| III | 57 | ||
| IV | 14 | ||
| missing | 1 | ||
| Tumour relapse | No | 29 | 65 |
| Yes | 36 | ||
| missing | 35 | ||
MSI: microsatellite instability; EBV: epstein-barr virus.
Fig. 2Multiplex PEA results of proteins measured in GC tissues and serum specimen. (A) Venn diagram showing the number of proteins detectable or undetectable in serum and in tissue specimen. (B) Comparison of subcellular location between proteins detectable in serum but not in tissue and proteins detectable in tissue but not in serum by the FunRich software. The 13 proteins detectable in GC tissue but not in serum are AGR3, ARTN, CAMKK1, IL1A, IL20RA, IL22RA1, IL24, IL33, JUN, LIF, NCLN, NRTN, PAK4. The 14 proteins detectable in GC serum but not in tissue are CRX, DKKL1, FAM19A5, FCRLB, FGF23, IL10, IL10RA, IL2, LYPD1, OPTC, SEZ6L, SLITRK2, TCL1B, WNT9A. (C) Principal component analysis (PCA) plot illustrating the sample distribution of 100 gastric tumour tissues (T, blue) and matched adjacent noncancerous tissues (N, red), based on 245 proteins levels. Each dot represents an individual sample. (D) Volcano plot showing the 245 proteins levels in GC tissues compared to matched non-tumour tissues. The dashed line represents the cutoff line with indicated significance criteria. Points having absolute log fold-change ≥2 and FDR adjusted p-value <0·05 are shown in red, with absolute log fold-change <2 and p-value ≥0·05 are in gray, and the rest are in black. (E) PCA plot illustrating the distribution of 50 serum samples from controls (Ctrl, red), 100 GC preoperative serum samples (Pre, green) and matched 100 postoperative samples (Post, blue), based on 316 proteins levels. (F—H) Volcano plots showing the 316 protein levels in preoperative GC serum samples versus controls (F), between preoperative and postoperative ones (G), and between postoperative samples and controls (H). Points having absolute log fold-change ≥0·5 and FDR adjusted p-value <0·05 are shown in red, with absolute log fold-change <0·5 and p-value <0·05 are in black, with absolute log fold-change <0·5 and p-value ≥0·05 are in gray, and the rest are in orange. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Clinical significance of proteins expressed in gastric cancer tumour tissue.
| Variables | Protein | Padj | |
|---|---|---|---|
| Age (mean, range, year) | 70.6 (30–92) | MOG | 0·0002 |
| GFRA2 | 0·0006 | ||
| BOC | 0·02733 | ||
| Gender | |||
| Female | 41 | FASLG | 0·0388 |
| Male | 59 | ||
| MSI status | |||
| Stable | 64 | XPNPEP2 | 0·0002 |
| High | 35 | TGFR2 | 0·0002 |
| BCR | 0·0008 | ||
| Flt3L | 0·002 | ||
| IFN gamma | 0·0039 | ||
| SIGLEC6 | 0·0063 | ||
| CPE | 0·0170 | ||
| GPNMB | 0·0207 | ||
| TNFRSF19 | 0·0266 | ||
| TWEAK | 0·0266 | ||
| CBL | 0·0294 | ||
| BOC | 0·0403 | ||
| Borrman classification | |||
| I | 7 | OMG | 6·11E-17 |
| II | 14 | MOG | 1·59E-05 |
| III | 60 | PROK1 | 0·0113 |
| IV | 18 | ||
| Ming classification | |||
| EXP + MIX (expanding+mixed) | 37 | ZBTB16 | 0·0002 |
| INF (infiltrative) | 40 | NT3 | 0·0008 |
| TWEAK | 0·0072 | ||
| WHO classification | |||
| Poor/Undifferentiated | 38 | OMG | 1·87E-05 |
| Tubular | 34 | MOG | 0·0002 |
| Signet-ring cells & mucinous | 22 | GSAP | 0·0103 |
| Papillary | 2 | IL33 | 0·0415 |
| PROK1 | 0·0453 |
Fig. 3Diagnostic capacity for gastric cancer of the identified 19 serum protein signature by elastic-net logistic regression. (A) Diagnostic performances of different protein combinations. Proteins are sorted according to the absolute coefficient from the largest to the smallest. “ROC test P" is the p-value of the comparison of ROC curves generated from successive protein combinations with one more protein added at a time. Coef., coefficient. ROC, receiver operator characteristics. *, p < 0·05. (B) Overlaid ROC curves of each of the combinations from two to 19 serum proteins. Comb, combination. (C) Protein-protein interactions among the 19 proteins assessed with the STRING database.
Serum protein biomarkers identified by elastic-net logistic regression (ENLR) for gastric cancer diagnosis.
| Identified proteins | Short name | Ctrl (Mean ± SD) | Pre (Mean ± SD) | Pre vs. Ctrl | FDR Padj | Cut-off | Sen (%) | Spe (%) | AUC (95%CI) | T vs. N | FDR Padj |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Matrix metalloproteinase-1 | MMP1 | 1·56 ± 1·04 | 2·46 ± 1·04 | I | 0·0009 | 2·05 | 66 | 76 | 0·74(0·65–0·82) | I | 2·57E-21 |
| Interleukin-7 | IL7 | 3·3 ± 0·54 | 3·75 ± 0·58 | I | 0·0014 | 3·41 | 73 | 64 | 0·72(0·64–0·81) | – | – |
| Carbonic anhydrase 9 | CA9 | 3·26 ± 1·36 | 4·02 ± 1·21 | I | 0·0014 | 3·40 | 71 | 68 | 0·71(0·62–0·8) | I | 2·82E-05 |
| CUB domain-containing protein 1 | CDCP1 | 3·69 ± 0·63 | 4·19 ± 0·66 | I | 0·0018 | 3·34 | 93 | 40 | 0·71(0·62–0·8) | I | 1·95E-05 |
| Zinc finger and BTB domain-containing protein 17 | ZBTB17 | 1·34 ± 0·69 | 1·88 ± 0·85 | I | 0·0027 | 1·69 | 61 | 76 | 0·7(0·61–0·79) | I | 4·84E-13 |
| dimethylarginine dimethylaminohydrolase 1 | DDAH1 | 1·36 ± 0·51 | 1·82 ± 0·76 | I | 0·0041 | 1·29 | 77 | 54 | 0·69(0·6–0·79) | – | – |
| Friend leukemia integration 1 transcription factor | FLI1 | 0·11 ± 0·33 | 0·42 ± 0·61 | I | 0·0052 | 0·19 | 66 | 66 | 0·69(0·6–0·78) | D | 3·01E-08 |
| Mesothelin | MSLN | 2·58 ± 0·83 | 3·11 ± 0·74 | I | 0·0053 | 2·31 | 87 | 48 | 0·69(0·59–0·78) | I | 7·34E-17 |
| Carcinoembryonic antigen-related cell adhesion molecule 5 | CEACAM5 | 2·42 ± 0·79 | 3·43 ± 1·73 | I | 0·0053 | 2·86 | 54 | 76 | 0·68(0·6–0·77) | I | 0·0229 |
| Kazal-type serine protease inhibitor domain-containing protein 1 | KAZALD1 | 2·78 ± 0·7 | 3·17 ± 0·68 | I | 0·0079 | 3·08 | 56 | 80 | 0·68(0·58–0·77) | D | 1·47E-07 |
| C-C motif chemokine 20 | CCL20 | 6·09 ± 1·3 | 6·78 ± 1·37 | I | 0·0184 | 6·60 | 52 | 76 | 0·66(0·57–0·75) | I | 0·0137 |
| Stem cell factor/KIT ligand | SCF | 7·85 ± 0·81 | 8·08 ± 0·63 | – | – | 7·71 | 83 | 34 | 0·58(0·49–0·68) | I | 1·20E-07 |
| Peptidyl-prolyl cis-trans isomerase A | PPIA | 3·05 ± 1·01 | 2·92 ± 1·05 | – | – | 3·05 | 52 | 56 | 0·48(0·38–0·57) | – | – |
| Transforming growth factor alpha | TGF alpha | 2·43 ± 0·91 | 2·45 ± 1·02 | – | – | 2·48 | 62 | 52 | 0·5(0·4–0·6) | D | 3·57E-05 |
| Heme oxygenase 1 | HMOX1 | 9·91 ± 1·07 | 9·57 ± 0·81 | – | – | 9·96 | 74 | 62 | 0·63(0·52–0·74) | – | – |
| Matrix metalloproteinase-10 | MMP10 | 6·81 ± 0·56 | 6·71 ± 0·61 | – | – | 6·94 | 71 | 40 | 0·54(0·44–0·64) | I | 3·98E-11 |
| Amyloid beta A4 precursor protein-binding family B member 1-interacting protein | APBB1IP | 1·61 ± 0·69 | 1·98 ± 0·98 | – | – | 2·31 | 31 | 88 | 0·6(0·5–0·69) | I | 0·0082 |
| Insulin-like growth factor I | IGF1 | 1·43 ± 1·13 | 1·56 ± 0·88 | – | – | 2·24 | 78 | 34 | 0·49(0·38–0·59) | – | – |
| A disintegrin and metalloproteinase with thrombospondin motifs 15 | ADAMTS15 | 1·78 ± 0·83 | 1·46 ± 0·62 | – | – | 1·94 | 80 | 48 | 0·62(0·52–0·72) | D | 5·86E-13 |
| 19-protein signature | 93 | 100 | 0·99(0·98–1) | ||||||||
| Carcinoembryonic antigen | CEAa | 2·24 ± 0·83 | 43·89 ± 272·52 | I | 2·65 | 52 | 82 | 0·67(0·59–0·76) | |||
| Carbohydrate antigen 19–9 | CA19–9b | 11·31 ± 10·39 | 280·05 ± 1092·91 | I | 37·5 | 42 | 88 | 0·63(0·54–0·73) | |||
| Carbohydrate antigen 72–4 | CA72–4c | 2·22 ± 1·63 | 35·03 ± 145·51 | I | 2·6 | 55 | 88 | 0·72(0·61–0·83) |
I: increase. D: decrease. FDR Padj: P values were tested by non-parametric Mann-Whitney-Wilcoxon and adjusted multiple tests with false discovery rate. CI: confidence interval. Coef.: coefficient calculated by ENLR. T: tumour tissue. N: adjacent normal tissue. –: Proteins not significantly altered.
Cutoff was defined by Yoden's index by maximizing values of sensitivity+specificity-1.
a, b, c,: clinically measured biomarkers. b: 36 controls vs. 97 GC. c: 17 controls vs.90 GC.
Fig. 4Protein levels in serum samples from GC patients at TNM I-II stage or with high microsatellite instability (MSI) status. (A) Volcano plot showing the comparison of protein levels between patients at TNM I-II stage and controls. (B) ROC curves for the 19-serum protein signature overlaid with each individual protein showing the diagnostic capacity of GC at TNM I-II stage versus controls. (C) Volcano plot showing the comparison of protein levels between patients with MSI-H and controls. (D) ROC curves for the 19-serum protein signature overlaid with each individual protein showing the diagnostic capacity of GC with MSI-H status versus controls. Points in plots A and C having absolute log fold-change ≥0·5 and false discovery rate adjusted p-value <0·05 are shown in red, with absolute log fold-change <0·5 and p-value <0·05 are in black, with absolute log fold-change <0·5 and p-value ≥0·05 are in gray, and the rest are in orange. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)