| Literature DB >> 25992884 |
Sonia Blanco-Prieto1, Lorena Vázquez-Iglesias1, Mar Rodríguez-Girondo2, Leticia Barcia-Castro1, Alberto Fernández-Villar3, María Isabel Botana-Rial3, Francisco Javier Rodríguez-Berrocal1, María Páez de la Cadena1.
Abstract
Lung cancer is the most lethal neoplasia, and an early diagnosis is the best way for improving survival. Symptomatic patients attending Pulmonary Services could be diagnosed with lung cancer earlier if high-risk individuals are promptly separated from healthy individuals and patients with benign respiratory pathologies. We searched for a convenient non-invasive serum test to define which patients should have more immediate clinical tests. Six cancer-associated molecules (HB-EGF, EGF, EGFR, sCD26, VEGF, and Calprotectin) were investigated in this study. Markers were measured in serum by specific ELISAs, in an unselected population that included 72 lung cancer patients of different histological types and 56 control subjects (healthy individuals and patients with benign pulmonary pathologies). Boosted regression and random forests analysis were conducted for the selection of the best candidate biomarkers. A remarkable discriminatory capacity was observed for EGF, sCD26, and especially for Calprotectin, these three molecules constituting a marker panel boasting a sensitivity of 83% and specificity of 87%, resulting in an associated misclassification rate of 15%. Finally, an algorithm derived by logistic regression and a nomogram allowed generating classification scores in terms of the risk of a patient of suffering lung cancer. In conclusion, we propose a non-invasive test to identify patients at high-risk for lung cancer from a non-selected population attending a Pulmonary Service. The efficacy of this three-marker panel must be tested in a larger population for lung cancer.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25992884 PMCID: PMC4436352 DOI: 10.1371/journal.pone.0127318
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Distribution of markers in serum of Lung Cancer patients and Controls, and efficacy in classifying Lung Cancer.
| Marker | Control/Case | Median | Range |
| AUC (95% CI) | |
|---|---|---|---|---|---|---|
| HB-EGF (pg/mL) |
|
|
|
|
| |
| Healthy | 174.50 | 65.00–1627.00 | ||||
| Benign | 218.50 | 32.00–4661.00 | ||||
|
|
|
| ||||
| EGF (pg/mL) |
|
|
|
|
| |
| Healthy | 247.61 | 98.01–659.01 | ||||
| Benign | 404.76 | 102.04–1160.42 | ||||
|
|
|
| ||||
| sEGFR (ng/mL) |
|
|
|
|
| |
| Healthy | 30.42 | 20.82–47.70 | ||||
| Benign | 37.82 | 25.13–49.57 | ||||
|
|
|
| ||||
| sCD26 (ng/mL) |
|
|
|
|
| |
| Healthy | 504.50 | 331.00–816.00 | ||||
| Benign | 405.50 | 122.00–998.00 | ||||
|
|
|
| ||||
| VEGF (pg/mL) |
|
|
|
|
| |
| Healthy | 542.70 | 39.73–1178.40 | ||||
| Benign | 541.29 | 124.94–2631.56 | ||||
|
|
|
| ||||
| CAL (ng/mL) |
|
|
|
|
| |
| Healthy | 116.55 | 39.16–274.70 | ||||
| Benign | 141.93 | 33.13–421.23 | ||||
|
|
|
| ||||
a Mann-Whitney U (two-sided test).
b AUC based on out-of-bag predictions (models are fitted in the training sets with the 75% of the cases and are subsequently used for predicting the group membership of the 25% test cases). Average values and percentile 95% CI over 1000 repetitions are provided.
Distribution of markers in early and advanced NSCLC versus Controls.
| Marker | Control/ Case | Median | Range |
| |
|---|---|---|---|---|---|
| HB-EGF (pg/mL) |
|
|
| ||
|
| |||||
| NSCLC I+II | 159.00 | 56.00–435.00 | 0.630 | ||
| NSCLC III+IV | 190.00 | 24.00–1823.00 | 0.698 | ||
| EGF (pg/mL) |
|
|
| ||
|
| |||||
| NSCLC I+II | 792.82 | 388.23–1176.89 | <0.001 | ||
| NSCLC III+IV | 477.74 | 144.23–1158.32 | 0.012 | ||
| sEGFR (ng/mL) |
|
|
| ||
|
| |||||
| NSCLC I+II | 38.28 | 29.17–46.44 | 0.375 | ||
| NSCLC III+IV | 37.14 | 21.90–46.28 | 0.375 | ||
| sCD26 (ng/mL) |
|
|
| ||
|
| |||||
| NSCLC I+II | 396.00 | 206.00–640.00 | 0.116 | ||
| NSCLC III+IV | 356.00 | 136.00–945.00 | <0.001 | ||
| VEGF (pg/mL) |
|
|
| ||
|
| |||||
| NSCLC I+II | 469.86 | 227.00–1353.08 | 0.990 | ||
| NSCLC III+IV | 663.85 | 81.54–1856.40 | 0.028 | ||
| CAL (ng/mL) |
|
|
| ||
|
| |||||
| NSCLC I+II | 196.16 | 120.73–426.99 | 0.002 | ||
| NSCLC III+IV | 247.06 | 107.71–482.89 | <0.001 | ||
a Mann-Whitney U (two-sided test) for comparison among the control group and early and advanced NSCLC stages. P-values were corrected by the Holm method to prevent for inflation of the type I error due to multiple testing (corrections based on multiplicity given by controls subtypes and LC staging).
Fig 1Box-plots of the 6 biomarkers.
Box-plots of the levels of the six biomarkers candidates in the sera subgroups of controls and lung cancer patients. Horizontal lines represent median values.
Relationship between marker levels and demographic parameters.
.
| Parameter | Marker | ||||||
|---|---|---|---|---|---|---|---|
| HB-EGF (pg/mL) | EGF (pg/mL) | sEGFR (ng/mL) | sCD26 (ng/mL) | VEGF (pg/mL) | CAL (ng/mL) | ||
| Gender | |||||||
| Male (n = 91) | 191.00(24.00–1823.00) | 528.51(55.97–1176.89) | 35.71(20.82–97.40) | 375.00(136.00–1192.00) | 594.40(39.73–1856.40) | 191.48(39.16–438.32) | |
| Female (n = 37) | 188.00(44.00–4661.00) | 334.55(116.45–868.94) | 34.20(22.05–45.77) | 440.00(122.00–945.00) | 549.87(81.54–2631.54) | 185.67(33.13–482.89) | |
|
| 0.902 | 0.004 | 0.526 | 0.014 | 0.106 | 0.271 | |
| Age | |||||||
| <65 yr (n = 67) | 191.00(24.00–2067.00) | 473.71(78.37–1176.89) | 38.32(20.82–49.57) | 463.00(159.00–998.00) | 565.82(39.73–1711.25) | 185.67(38.67–438.32) | |
| >65 yr (n = 60) | 189.50(32.00–4661.00) | 426.97(55.97–1142.11) | 34.55(23.32–97.40) | 348.50(122.00–1192.00) | 587.58(81.54–2631.56) | 192.81(33.13–482.89) | |
|
| 0.714 | 0.443 | 0.086 | 0.001 | 0.372 | 0.487 | |
| Smoking | |||||||
| Yes (n = 85) | 191.00(24.00–1823.00) | 579.85(55.97–1176.89) | 36.77(21.90–97.40) | 377.00(136.00–1192.00) | 594.40(39.73–1856.40) | 207.88(38.67–438.32) | |
| No (n = 19) | 188.00(32.00–4661.00) | 344.68(98.01–1160.42) | 34.20(20.82–47.67) | 425.00(122.00–998.00) | 616.56(81.54–2631.56) | 181.44(33.13–482.89) | |
|
| 0.680 | 0.011 | 0.635 | 0.207 | 0.665 | 0.498 | |
a Median and range values provided.
b Mann-Whitney U (two-sided test)
Ranking of each marker according to its importance for predicting Lung Cancer.
| Marker | Boosted Regression | Random Forests | ||||
|---|---|---|---|---|---|---|
| Rank | Median Importance | Range | Rank | Median Importance | Range | |
| CAL | 1 | 69 | 56–82 | 1 | 10 | 9–10 |
| sCD26 | 2 | 16 | 6–31 | 2 | 7 | 6–8 |
| EGF | 3 | 13 | 5–24 | 3 | 6 | 6–7 |
| sEGFR | 4 | 2 | 0–7 | 4 | 5 | 4–5 |
| VEGF | 5 | 0 | 0–4 | 6 | 4 | 4–4 |
| HB-EGF | 6 | 0 | 0–1 | 5 | 5 | 4–5 |
a Importance measures report method-specific estimates of the individual contribution to prediction of each evaluated biomarker. Medians values and range over 1000 runs are provided.
Performance of each model possibility.
| Markers included in the model | Performance Indexes | |||
|---|---|---|---|---|
| AIC | BIC | MSE | AUC | |
| CAL | 135.39 | 146.77 | 0.176 (0.115–0.252) | 0.816 (0.671–0.949) |
| CD26 | 155.27 | 166.65 | 0.214 (0.158–0.286) | 0.732 (0.567–0.886) |
| EGF | 150.25 | 161.63 | 0.205 (0.152–0.277) | 0.759 (0.606–0.893) |
| CAL+ sCD26 | 133.43 | 147.65 | 0.173 (0.108–0.249) | 0.823 (0.680–0.958) |
| CAL+ EGF | 133.77 | 147.99 | 0.173 (0.110–0.252) | 0.824 (0.677–0.954) |
| CD26+EGF | 146.77 | 160.99 | 0.196 (0.138–0.274) | 0.778 (0.626–0.917) |
| CAL+ sCD26+EGF | 132.64 | 149.70 | 0.169 (0.103–0.247) | 0.828 (0.683–0.962) |
a Area under the ROC curve (AUC) and mean squared error (MSE) based on out-of-bag (OOB) predictions (models are fitted in the training sets with the 75% of the cases and are subsequently used for predicting the group membership of the 25% test cases). Mean AUC and MSE and 95% bootstrap confidence intervals are provided.
Diagnostic performance of the panel composed off CAL+sCD26+EGF+gender+age.
| Cut-off point | Sensitivity (%) | Specificity (%) |
|---|---|---|
|
|
|
|
| 0.410 | 90 | 62 |
| 0.107 | 95 | 18 |
| 0.076 | 98 | 17 |
| 0.772 | 57 | 90 |
| 0.839 | 39 | 95 |
| 0.927 | 13 | 98 |
a Dichotomization of the classification score at this cut-off resulted in the best terms of sensitivity and specificity.
Fig 2Nomogram for prediction of the classification score p for lung cancer.
Multivariable logistic regression model-based nomogram to define lung cancer score p based on Calprotectin, sCD26 and EGF concentration (log transformed), gender and age.