| Literature DB >> 28117344 |
Sonia Blanco-Prieto1, Loretta De Chiara1, Mar Rodríguez-Girondo2,3, Lorena Vázquez-Iglesias1, Francisco Javier Rodríguez-Berrocal1, Alberto Fernández-Villar4, María Isabel Botana-Rial4, María Páez de la Cadena1.
Abstract
While evidence for lung cancer screening implementation in Europe is awaited, Rapid Diagnostic Units have been established in many hospitals to accelerate the early diagnosis of lung cancer. We seek to develop an algorithm to detect lung cancer in a symptomatic population attending such unit, based on a sensitive serum marker panel. Serum concentrations of Epidermal Growth Factor, sCD26, Calprotectin, Matrix Metalloproteinases -1, -7, -9, CEA and CYFRA 21.1 were determined in 140 patients with respiratory symptoms (lung cancer and controls with/without benign pathology). Logistic Lasso regression was performed to derive a lung cancer prediction model, and the resulting algorithm was tested in a validation set. A classification rule based on EGF, sCD26, Calprotectin and CEA was established, able to reasonably discriminate lung cancer with 97% sensitivity and 43% specificity in the training set, and 91.7% sensitivity and 45.4% specificity in the validation set. Overall, the panel identified with high sensitivity stage I non-small cell lung cancer (94.7%) and 100% small-cell lung cancers. Our study provides a sensitive 4-marker classification algorithm for lung cancer detection to aid in the management of suspicious lung cancer patients in the context of Rapid Diagnostic Units.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28117344 PMCID: PMC5259733 DOI: 10.1038/srep41151
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Markers Selected for the Development of a Diagnostic Panel for Lung Cancer.
| Marker | Function in Cancer | Usefulness in LC diagnosis |
|---|---|---|
| Epidermal Growth Factor (EGF) | Binding of EGF to receptor promotes tumour growth and progression | Suitable discrimination of LC/NSCLC from healthy and benign lung pathologies |
| sCD26 | Immune regulation and co-stimulatory activities | Suitable diagnostic potential for LC vs healthy |
| Calprotectin (CAL) | Antimicrobial and pro-inflammatory functions, tumour development | Promising marker in LC derived by pleural effusion |
| Matrix Metalloproteinases (MMP-1, −7, −9) | Extracellular matrix degradation leading to cancer invasion and metastasis, processing of growth factors, regulation of apoptosis and angiogenesis, tumour-associated inflammation and immune escape | MMP-1: Poor diagnostic capacity for LC vs healthy controls in plasma |
| MMP-7: Moderate diagnostic potential for NSCLC vs healthy and benign lung disease | ||
| MMP-9: Good diagnostic potential for LC vs control and benign lung affections | ||
| Carcinoembryonic Antigen (CEA) | Belongs to immunoglobulin superfamily, acting in cell adhesion and innate immunity | Moderate diagnostic potential for LC vs non-malignant pathologies |
| CYFRA 21.1 | Fragment of Cytokeratin 19, constituent of cytoskeleton and expressed in epithelial differentiation | Moderate diagnostic potential for LC vs non-malignant pathologies |
Abbreviations: LC = Lung Cancer, NSCLC = Non Small Cell Lung Cancer, MPE = Malignant Pleural Effusion, PMPE = paramalignant pleural effusion.
Serum Markers in Lung Cancer and Controls in the Training Set.
| Marker | Control/Casea | Median | Range | Adjusted effect (95% CI)c | AUC (95% CI) | |
|---|---|---|---|---|---|---|
| (pg/mL) | Healthy | 247.60 | 40.13–972.90 | |||
| Benign | 419.67 | 43.25–1187.06 | ||||
| 0.238 (0.140–0,336)* | 0.698 (0.615–0.773) | |||||
| (ng/mL) | Healthy | 509.00 | 237.00–886.00 | |||
| Benign | 431.50 | 122.00–1092.00 | ||||
| −0.097 (−0.148–−0.047)* | 0.711 (0.629–0.785) | |||||
| (ng/mL) | Healthy | 105.47 | 7.56–362.29 | |||
| Benign | 158.13 | 33.13–421.23 | ||||
| 0.263 (0.190–0.336)* | 0.759 (0.679-0.827) | |||||
| (pg/mL) | Healthy | 4664.94 | 1186.61–23635.33 | |||
| Benign | 6420.51 | 1207.70–23960.37 | ||||
| 0.089 (−0.008–0.187) | 0.597 (0.511-0.679) | |||||
| (pg/mL) | Healthy | 21237.20 | 10383.69–60968.91 | |||
| Benign | 22286.74 | 5026.14–79977.27 | ||||
| 0.020 (−0.040–0.081) | 0.624 (0.539-0.705) | |||||
| (ng/mL) | Healthy | 177.19 | 52.79–743.25 | |||
| Benign | 181.74 | 57.41–3611.59 | ||||
| 0.270 (0.167–0.373)* | 0.729 (0.648–0.801) | |||||
| (pg/mL) | Healthy | 929.50 | 171.03–4070.71 | |||
| Benign | 559.51 | 170.84–2828.15 | ||||
| 0.494 (0.333–0.656)* | 0.744 (0.663–0.814) | |||||
| (pg/mL) | Healthy | 0.00 | 0.00–16592.63 | |||
| Benign | 1052.47 | 0.00–19314.33 | ||||
| 1.080 (0.587–1.573)* | 0.734 (0.653–0.805) |
Abbreviations: LC = Lung Cancer.
aSample size in training set: Control n = 72 (Healthy n = 36, Benign n = 36), LC n = 68.
bMann-Whitney U test for comparison between the cancer and control group corrected by Benjamini-Hochberg method to control familywise error under multiple comparisons.
cAdjusted effects and 95% confidence intervals of the case/control status on each of the log-transformed markers considered as outcome in lineal regression model adjusted for gender, age and smoking. *P-value < 0.001.
Distribution of Serum Markers in Lung Cancer by Histology and Stage and Comparison with Controls in the Training Set.
| Marker | Control/Casea | Median | Range | Adjusted effect (95% CI)c | |
|---|---|---|---|---|---|
| (pg/mL) | |||||
| Early (I + II) | 785.28 | 388.23–1159.73 | 0.002 | 0.409 (0.236–0.581)* | |
| Late (III + IV) | 528.51 | 144.23–1176.15 | 0.002 | 0.231 (0.109–0.354)* | |
| Limited | 55.97 | 40.75–201.63 | — | ||
| Extended | 440.64 | 264.30–1716.30 | — | ||
| (ng/mL) | |||||
| Early (I + II) | 432.00 | 206.00–640.00 | 0.235 | -0.032 (−0.114–0.049) | |
| Late (III + IV) | 341.00 | 136.00–945.00 | 0.002 | −0.116 (−0.179– −0.054)* | |
| Limited | 370.00 | 339.00–1192.00 | — | ||
| Extended | 288.00 | 208.00–541.00 | — | ||
| (ng/mL) | |||||
| Early (I + II) | 193.72 | 120.73–340.67 | 0.017 | 0.188 (0.033–0.343)* | |
| Late (III + IV) | 238.59 | 87.19–438.32 | 0.002 | 0.275 (0.172–0.378)* | |
| Limited | 97.32 | 91.06–422.20 | — | ||
| Extended | 279.34 | 48.33–355.19 | — | ||
| (pg/mL) | |||||
| Early (I + II) | 7800.42 | 1784.41–30180.33 | 0.235 | 0.087 (-0.106-0.279) | |
| Late (III + IV) | 7132.52 | 1450.34–41668.33 | 0.131 | 0.066 (−0.065–0.198) | |
| Limited | 7135.23 | 4890.35–18900.42 | — | ||
| Extended | 8114.52 | 2220.44–14246.29 | — | ||
| (pg/mL) | |||||
| Early (I + II) | 24615.56 | 8551.00–51000.43 | 0.726 | −0.031 (−0.136–0.074) | |
| Late (III + IV) | 26680.26 | 5383.18–79809.13 | 0.023 | 0.039 (−0.041–0.118) | |
| Limited | 26874.44 | 26710.65–40252.39 | — | ||
| Extended | 30110.14 | 8231.19–43429.13 | — | ||
| (ng/mL) | |||||
| Early (I + II) | 379.94 | 174.70–1688.00 | 0.002 | 0.285 (0.101–0.470)* | |
| Late (III + IV) | 299.65 | 21.06–1914.00 | 0.002 | 0.244 (0.106–0.382)* | |
| Limited | 115.93 | 65.58–253.19 | — | ||
| Extended | 298.21 | 76.78–786.34 | — | ||
| (pg/mL) | |||||
| Early (I + II) | 1093.08 | 353.26–21684.85 | 0.140 | 0.154 (−0.050–0.358) | |
| Late (III + IV) | 2750.49 | 141.16–136039.19 | 0.002 | 0.567 (0.348–0.787)* | |
| Limited | 2092.47 | 1147.58–82300.26 | — | ||
| Extended | 3884.15 | 1667.40–29844.72 | — | ||
| (pg/mL) | |||||
| Early (I + II) | 1181.22 | 0.00–7309.16 | 0.104 | 0.503 (−0.382–1.387) | |
| Late (III + IV) | 4791.34 | 0.00–173410.17 | 0.002 | 1.105 (0.459–1.752)* | |
| Limited | 1754.30 | 219.64–6610.22 | — | ||
| Extended | 5965.40 | 0.00-12228.10 | — |
Abbreviations: NSCLC = Non-Small Cell Lung Cancer, SCLC = Small Cell Lung Cancer.
aSample size in training set: Control n = 72, NSCLC n = 59 (Early stage n = 16, Late stage n = 43), SCLC n = 9 (Limited stage n = 3, Extended stage n = 6).
bMann-Whitney U test for comparison between the control groups and lung cancer stratified by histology and stage corrected by Benjamini-Hochberg method to control familywise error under multiple comparisons.
cAdjusted effects and 95% confidence intervals of histology and stage on each of the log-transformed markers considered as outcome in lineal regression model adjusted for gender, age and smoking. *P-value statistically significant.
Performance of the Four-Marker Panel and EGF, sCD26, CAL and CEA in the Diagnosis of Lung Cancer.
| Training Set | Cut-off | Sn (%) | Sp (%) | PPVa (%) | NPVa (%) | AUC (95% CI)b |
|---|---|---|---|---|---|---|
| Multivariate Algorithm: EGF, sCD26, CAL, CEA | >0.266 | 97 | 43 | 57.6 | 94.7 | 0.873 (0.811–0.925) |
| EGF | >178.48 pg/mL | 95 | 22.2 | 49.4 | 84.8 | 0.698 (0.615–0.773) |
| sCD26 | ≤637.2 ng/mL | 95 | 13.9 | 46.8 | 77.7 | 0.711(0.629–0.785) |
| CAL | >96.37 ng/mL | 95 | 30.6 | 52.2 | 88.4 | 0.759 (0.679–0.827) |
| CEA | >258.2 pg/mL | 95 | 11.1 | 46 | 73.6 | 0.744 (0.663–0.814) |
| Clinical Modelc | >0.237 | 95 | 26.4 | 50.8 | 86.9 | 0.717 (0.637-0.799) |
| Multivariate Algorithm: EGF, sCD26, CAL, CEA | >0.266 | 91.7 | 45.4 | 57.3 | 87.3 | 0.837 (0.718-0.936) |
| Clinical Modelc | >0.237 | 91.7 | 27.3 | 50.2 | 80.5 | 0.659 (0.488-0.816) |
Abbreviations: Sn = Sensitivity, Sp = Specificity, PPV = Predictive Positive Value, NPV = Negative Predictive Value.
aPositive and negative predictive values were estimated assuming a prevalence of lung cancer of 44.4% (QDU of the Pneumology Service of Hospital Álvaro Cunqueiro EOXI Vigo).
bAUC and 95% CI evaluated in the training test is not protected against overfitting.
cClinical model includes gender, age and smoking.
Figure 1ROC Curve Analysis for Lung Cancer Prediction in the Training Set.
ROC curves are shown for each individual marker included in the classification algorithm, together with the clinical model and the 4-marker panel derived from logistic Lasso regression. Training set included 68 lung cancer cases and 72 controls (36 healthy and 36 benign respiratory pathologies).
Classification Accuracy of the Multivariate Algorithm for Subgroups of Patients in the Combined Set (Training and Validation Set).
| Cases correctly classifieda/Total cases | % Sn at 43% Sp | ||
|---|---|---|---|
| 88/92 | 95.6 | ||
| NSCLC | 76/80 | 95 | |
| I | 18/19 | 94.7 | |
| II | 3/3 | 100 | |
| III | 20/21 | 95.2 | |
| IV | 35/37 | 94.6 | |
| ADC | 41/44 | 93.2 | |
| SqCC | 20/20 | 100 | |
| LCC | 13/13 | 100 | |
| BAC | 1/2 | 50 | |
| ND | 1/1 | 100 | |
| SCLC | 12/12 | 100 | |
| 41/94 | 43.6 | ||
| Healthy | 18/44 | 40.9 | |
| Benign | 23/50 | 46 | |
| RI | 19/41 | 46.3 | |
| ILD | 4/9 | 44.4 | |
| 7/7 | 100 | ||
| NSCLC | 6/6 | 100 | |
| I | 1/1 | 100 | |
| II | 2/2 | 100 | |
| III | 1/1 | 100 | |
| IV | 2/2 | 100 | |
| SCLC | Extended | 1/1 | 100 |
| 1/3 | 33.3 | ||
| Healthy | 0/1 | 0 | |
| Benign | 1/2 | 50 | |
Abbreviations: Sn = Sensitivity, Sp = Specificity, NSCLC = Non Small Cell Lung Cancer, ADC = Adenocarcinoma, SqCC = Squamous Cell Carcinoma, LCC = Large Cell Carcinoma, BAC = Bronchioloalveolar Carcinoma, ND = Not Differentiated Carcinoma, SCLC = Small Cell Lung Cancer, RI = Respiratory Infection, ILD = Interstitial Lung Disease.
aCut-off p = 0.266 for a sensitivity of 97% and specificity of 43% in the training set.