| Literature DB >> 27217769 |
Weihua Tong1, Fei Ye2, Liang He1, Lifeng Cui1, Miao Cui2, Yuan Hu2, Wei Li1, Jing Jiang3, David Y Zhang2, Jian Suo1.
Abstract
PURPOSE: Currently, serum biomarkers that are sufficiently sensitive and specific for early detection and risk classification of gastric adenocarcinomas are not known. In this study, ten serum markers were assessed using the Luminex system and enzyme-linked immunosorbent assay for the diagnosis of gastric cancer and analysis of the relation between prognosis and metastases. PATIENTS AND METHODS: A training set consisting of 228 gastric adenocarcinoma and 190 control samples was examined. A Luminex multiplex panel with nine biomarkers, consisting of three proteins discovered through our previous studies and six proteins previously reported to be cancer-associated, was constructed. One additional biomarker was detected using a commercial kit containing EDTA. Logistic regression, random forest (RF), and support vector machine (SVM) were used to identify the panel of discriminatory biomarkers in the training set. After selecting five proteins as candidate biomarkers, multivariate classification analyses were used to identify algorithms for diagnostic biomarker combinations. These algorithms were independently validated using a set of 57 gastric adenocarcinoma and 48 control samples.Entities:
Keywords: Luminex; cancer diagnosis; cancer screening; gastric adenocarcinoma
Year: 2016 PMID: 27217769 PMCID: PMC4853138 DOI: 10.2147/OTT.S86139
Source DB: PubMed Journal: Onco Targets Ther ISSN: 1178-6930 Impact factor: 4.147
Demographic data and pathological characteristics of patients with gastric cancer and healthy controls
| Gastric adenocarcinoma, n=285 | Control, n=238 | |
|---|---|---|
| Age, years | 59.89±11.499 | 59.14±9.33 |
| Sex | ||
| Males | 204 | 157 |
| Females | 81 | 81 |
| Stage | ||
| I | 66 (23.1%) | |
| II | 93 (32.6%) | |
| III | 126 (44.2%) | |
| Alcohol and smoking history | ||
| Alcohol | 86 −199 | |
| Smoking | 130 −155 | |
| Early stage | 48 | |
| Advanced stage | 237 | |
| Differentiation type | ||
| High | 7 | |
| Medium | 120 | |
| Low | 158 | |
| Tumor | ||
| T1 | 48 | |
| T2 | 55 | |
| T3 | 3 | |
| T4 | 179 | |
| Lymph node | ||
| N0 | 120 | |
| N1 | 40 | |
| N2 | 47 | |
| N3 | 78 | |
| Tumor ≤2 cm | 52 | |
| Tumor >2 cm | 233 | |
| Tumor position | ||
| Antrum | 188 | |
| Body | 65 | |
| Cardia | 32 | |
Patients’ demographic data and clinical profiles in training and validation groups
| Training group
| Validation group
| |||
|---|---|---|---|---|
| Gastric carcinoma | Control | Gastric carcinoma | Control | |
| Age, years | 59.82±11. 32 | 59.15±9.27 | 60.14±10.73 | 59.11±7.93 |
| Sex | ||||
| Male | 164 | 120 | 40 | 36 |
| Female | 64 | 70 | 17 | 12 |
| Stage | ||||
| I | 47 | 19 | ||
| II | 78 | 15 | ||
| III | 103 | 23 | ||
List of markers
| Biomarker | Antigen | Capture antibody | Detection antibody |
|---|---|---|---|
| Growth/angiogenic factor | |||
| VEGF | R&D System | R&D System | R&D System |
| EGFR | R&D System | R&D System | R&D System |
| Coagulation/thrombosis | |||
| Hemoglobin | Sigma-Aldrich | Pierce | Pierce |
| Adhesion | |||
| ADAM8 | R&D System | R&D System | R&D System |
| Beta-catenin | R&D System | R&D System | R&D System |
| sICAM-1 | R&D System | R&D System | R&D System |
| Inflammation | |||
| COX2 | R&D System | R&D System | R&D System |
| Pepsinogen | |||
| Pepsinogen-I | R&D System | Abcam | Abcam |
| Pepsinogen-II | Biohit | NOVUS | NOVUS |
Notes: R&D System: R&D System Inc., Minneapolis, MN, USA; Sigma-Aldrich: Sigma-Aldrich Co., St Louis, MO, USA; Pierce:; Biohit: Biohit, Helsinki, Finland; Abcam: Abcam, MA, USA; NOVUS: Novus Biologicals, Littleton, CO, USA.
Abbreviations: ICAM, intercellular cell adhesion molecule; VEGF, vascular endothelial growth factor; EGFR, epidermal growth factor receptor; ADAM8, A Disintegrin And Metalloproteinase domain-containing protein 8.
The description of variables in gastric cancer patients and controls
| Variable | – | Control (N=238) | Gastric cancer (N=285) | Statistic | |
|---|---|---|---|---|---|
| IgG to | Mean ± SD | 35.14±37.98 | 66.96±42.19 | <0.0001 | |
| M (Q1–Q3) | 16.67 (6.31–60.64) | 66.44 (27.94–107.41) | |||
| Min–Max | 0.46–142.71 | 0.60–155.67 | |||
| PGI | Mean ± SD | 101.15±36.49 | 4,740.89±22,768.80 | <0.0001 | |
| M (Q1–Q3) | 94.05 (77.37–120.19) | 116.66 (74.28–211.51) | |||
| Min–Max | 13.54–286.47 | 0.24–197,160.00 | |||
| PGII | Mean ± SD | 13.96±9.98 | 3,422.14±10,146.89 | <0.0001 | |
| M (Q1–Q3) | 10.91 (7.38–17.38) | 21.66 (12.13–38.19) | |||
| Min–Max | 1.60–66.94 | 0.14–90,759.61 | |||
| PGI/PGII | Mean ± SD | 9.13±4.07 | 5.71±4.07 | <0.0001 | |
| M (Q1–Q3) | 8.56 (6.26–10.93) | 5.23 (3.03–7.61) | |||
| Min–Max | 2.22–36.76 | 0.04–24.49 | |||
| Beta-catenin | Mean ± SD | 409.20±473.89 | 767.73±854.38 | <0.0001 | |
| M (Q1–Q3) | 269.98 (135.91–460.89) | 483.61 (263.08–883.18) | |||
| Min–Max | 1.00–3,210.81 | 1.00–5,456.21 | |||
| COX2 | Mean ± SD | 190.31±210.51 | 308.00±320.50 | <0.0001 | |
| M (Q1–Q3) | 146.60 (38.79–260.89) | 221.51 (97.94–422.12) | |||
| Min–Max | 0.34–1,325.78 | 1.00–2,481.92 | |||
| ADAM8 | Mean ± SD | 500.78±492.02 | 1,146.98±1,161.13 | <0.0001 | |
| M (Q1–Q3) | 337.12 (199.81–627.00) | 803.82 (466.89–1,467.16) | |||
| Min–Max | 2.65–3,168.90 | 1.00–9,645.67 | |||
| VEGF | Mean ± SD | 242.79±277.96 | 548.92±490.40 | <0.0001 | |
| M (Q1–Q3) | 164.49 (77.28–290.32) | 416.47 (236.06–707.36) | |||
| Min–Max | 1.00–1,962.88 | 1.00–3,653.43 | |||
| EGFR | Mean ± SD | 9,308.89±4,302.37 | 9,837.98±5,024.02 | 0.1951 | |
| M (Q1–Q3) | 8,211.83 (6,565.46–10,832.58) | 9,029.69 (6,786.17–11,484.39) | |||
| Min–Max | 3,897.56–41,378.60 | 1.00–47,739.98 | |||
| ICAM | Mean ± SD | 174,870.10±86,585.45 | 165,658.82±95,597.97 | 0.2527 | |
| M (Q1–Q3) | 151,346.57 (119,159.57–213,194.11) | 144,412.52 (107,495.51–200,256.92) | |||
| Min–Max | 56,918.07–627,365.84 | 1.00–1,101,730.81 | |||
| HB | Mean ± SD | 369,038.51±470,630.63 | 939,135.01±3,663,960.29 | 0.0157 | |
| M (Q1–Q3) | 235,096.52 (135,491.09–417,276.58) | 268,581.00 (144,175.20–724,694.68) | |||
| Min–Max | 31,780.94–5,466,994.12 | 1.00–50,057,376.68 |
Abbreviations: HB, hemoglobin; M (Q1–Q3), median and interquartile range; Min–Max, minimum to maximum; PGI, serum pepsinogen I; PGII, serum pepsinogen II; SD, standard deviation; ICAM, intercellular cell adhesion molecule; VEGF, vascular endothelial growth factor; EGFR, epidermal growth factor receptor; ADAM8, A Disintegrin And Metalloproteinase domain-containing protein 8.
The results of correlation analysis for three variables
| Beta-catenin | COX2 | VEGF | |
|---|---|---|---|
| Beta-catenin | 1.0000 | ||
| COX2 | 0.8370 | 1.0000 | |
| VEGF | 0.7893 | 0.7291 | 1.0000 |
Note:
The correlation is statistically significant.
Abbreviation: VEGF, vascular endothelial growth factor.
Figure 1Random forest cross-validation for feature selection.
Abbreviation: CV, cross-validation.
Biomarker selection in the training group
| Marker | Avg.imp |
|---|---|
| ADAM8 | 42.6065 |
| PGI | 38.7232 |
| PGII | 37.7563 |
| VEGF | 35.1600 |
| IgG to | 34.7993 |
| EGFR | 10.8508 |
| Beta-catenin | 10.3391 |
| HB | 8.4268 |
| ICAM | 6.8035 |
| COX2 | 2.1472 |
Abbreviations: HB, hemoglobin; ICAM, intercellular cell adhesion molecule; VEGF, vascular endothelial growth factor; EGFR, epidermal growth factor receptor; ADAM8, A Disintegrin And Metalloproteinase domain-containing protein 8; PGI, serum pepsinogen I; PGII, serum pepsinogen II.
Figure 2Importance scores of biomarkers in random forest.
Abbreviations: ICAM, intercellular cell adhesion molecule; VEGF, vascular endothelial growth factor; EGFR, epidermal growth factor receptor; ADAM8, A Disintegrin And Metalloproteinase domain-containing protein 8; HB, hemoglobin; IgG, immunoglobulin G; PGII, serum pepsinogen II; PGI, serum pepsinogen I.
Diagnostic performance of classification algorithms with five-biomarker panel
| Training group
| Test group
| |||||
|---|---|---|---|---|---|---|
| Accuracy | Sensitivity | Specificity | Accuracy | Sensitivity | Specificity | |
| RF | 0.790 | 0.842 | 0.729 | 0.825 | 0.860 | 0.784 |
| SVM | 0.838 | 0.877 | 0.792 | 0.861 | 0.886 | 0.832 |
| Logistic | 0.762 | 0.807 | 0.708 | 0.787 | 0.829 | 0.737 |
Abbreviations: Logistic, logistic regression; RF, random forest; SVM, support vector machine.
Figure 3The ROC of five biomarkers.
Notes: (A) RF. (B) SVM. (C) Logistic regression.
Abbreviations: AUC, area under the curve; CI, confidence interval; ROC, receiver operating characteristic; RF, random forest; SVM, support vector machine.
Cox model coefficients and 95% confidence interval
| OR | 2.5% OR | 97.5% OR | |
|---|---|---|---|
| ADAM8 | 1.118 | 0.754 | 1.591 |
| PGI | 0.935 | 0.640 | 1.444 |
| PGII | 1.035 | 0.733 | 1.392 |
| VEGF | 1.077 | 0.795 | 1.541 |
| IgG to | 1.170 | 0.937 | 1.544 |
| N1 | 0.726 | 0.000 | 2.086 |
| N2 | 3.817 | 1.841 | 10.506 |
| N3 | 5.998 | 3.110 | 15.813 |
| T1 | 1.365 | 0.180 | 38,189,896.926 |
| T2 | 15.098 | 0.000 | 789,835,105.698 |
| T3 | 5.680 | 1.800 | 141,247,321.593 |
| T4 | 11.395 | 2.777 | 391,254,179.091 |
Abbreviations: OR, odds ratio; PGI, serum pepsinogen I; PGII, serum pepsinogen II; VEGF, vascular endothelial growth factor; ADAM8, A Disintegrin And Metalloproteinase domain-containing protein 8.