| Literature DB >> 20859284 |
E C Farlow1, M S Vercillo, J S Coon, S Basu, A W Kim, L P Faber, W H Warren, P Bonomi, M J Liptay, J A Borgia.
Abstract
BACKGROUND: In this study, we appraised a wide assortment of biomarkers previously shown to have diagnostic or prognostic value for non-small cell lung cancer (NSCLC) with the intent of establishing a multi-analyte serum test capable of identifying patients with lung cancer.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20859284 PMCID: PMC2967050 DOI: 10.1038/sj.bjc.6605865
Source DB: PubMed Journal: Br J Cancer ISSN: 0007-0920 Impact factor: 7.640
Characteristics of patient populations
|
|
| |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
|
| ||||||
| Range | 40–83 | 47–80 | 46–84 | 40–80 | 49–92 | 46–84 |
| Median | 67.5 | 62 | 69 | 61 | 70 | 68 |
|
| ||||||
| Male | 3 | 9 | 38 | 6 | 21 | 17 |
| Female | 9 | 22 | 52 | 9 | 19 | 16 |
|
| ||||||
| Ia | 16 | 11 | ||||
| Ib | 37 | 14 | ||||
| IIa | 2 | 2 | ||||
| IIb | 11 | 5 | ||||
| IIIa | 19 | 1 | ||||
| IIIb | 5 | — | ||||
|
| ||||||
| Adenocarcinoma | 57 | 18 | ||||
| Squamous | 30 | 10 | ||||
| Adenosquamous | 1 | 2 | ||||
| NSCLC – other | 2 | 3 | ||||
|
| ||||||
| No lung pathology | 31 | |||||
| Granuloma | 5 | 6 | ||||
| COPD | 1 | 34 | ||||
| Asthma | 6 | |||||
| Sarcoidosis | 1 | |||||
| Pneumonitis | 1 | 1 | ||||
| Pneumonia | 2 | 1 | ||||
| Benign cyst | 1 | |||||
| Hamartoma | 2 | |||||
| Chronic inflammation | 2 | |||||
| Lymphoid infiltrate | 1 | |||||
| Thymoma | 1 | |||||
| Lipoma | 1 | 1 | ||||
Discovery group refers to the initial group of patients on which 47 biomarkers were tested and multi-analyte panel was created, groups are as follows: I=resected non-neoplastic disease; II=rheumatology controls; III=NSCLC patients. Validation group refers to second cohort on whom our six-multi-analyte panel was tested, groups are as follows: I=resected non-neoplastic disease; II=COPD/asthma patients; III=NSCLC patients.
Pathologic stage.
Biomarkers selected for multivariate analysis based on statistical relevance
|
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
| CYFRA 21-1 | 371.9 | 227–783 | 277.54 | 122.3–617.2 | 630.89 | 169.5–5351 | 674.56 | 293.4–3663 | 0.873 | <0.001 |
| TNF- | 8.92 | 3.7–27 | 4.06 | 1.37–11.94 | 14.35 | 1.37–140 | 13.85 | 2.9–41.93 | 0.862 | <0.001 |
| MCP-1 | 0.47 | 0.28–1.07 | 0.77 | 0.2–2.34 | 0.40 | 0.18–2.06 | 0.4 | 0.12–5.7 | 0.753 | <0.001 |
| IL-1ra | 184.2 | 42–1168 | 81.59 | 2.41–606.4 | 326.93 | 2.41–2144 | 326.93 | 1.12–3258 | 0.719 | <0.001 |
| MMP-2 | 6792 | 4440–10013 | 7757 | 5029–12836 | 6195 | 3293–12693 | 5516 | 897–10094 | 0.705 | <0.001 |
| IL-6 | 37.59 | 1.17–1495 | 11.39 | 1.17–1520 | 61.46 | 1.17–5862 | 54.99 | 3.44–906.5 | 0.702 | <0.001 |
| EOTAXIN | 0.135 | 0.07–0.23 | 0.16 | 0.06–0.26 | 0.10 | 0.04–0.39 | 0.11 | 0.04–0.56 | 0.698 | <0.001 |
| CA-125 | 1.13 | 0.43–11.47 | 0.57 | 0.12–4.92 | 1.38 | 0.12–143.4 | 1.8 | 0.1–24.6 | 0.698 | <0.001 |
| sE-selectin | 1643 | 702–3962 | 1635 | 862–4079 | 1198 | 417–2603 | 1283 | 509–2547 | 0.690 | <0.001 |
| sP-selectin | 2779 | 1265–8764 | 3168 | 1722–5158 | 2726 | 926–5181 | 1892 | 926–11835 | 0.677 | <0.001 |
| MIP-1 | 0.13 | 0.1–0.15 | 0.14 | 0.12–0.21 | 0.13 | 0.1–0.7 | 0.12 | 0.05–0.81 | 0.669 | 0.00117 |
| IL-10 | 21.32 | 3.04–92.08 | 3.04 | 3.04–576.6 | 16.83 | 3.04–1361 | 42.5 | 3.04–428 | 0.667 | 0.00162 |
| CRP | 2.659 | 0.945–4.336 | 4.89 | 0.945–23.7 | 2.69 | 0.95–10.2 | 3.05 | 0.015–9.92 | 0.662 | 0.00245 |
| IL-2R | 54.4 | 18.7–119.7 | 29.815 | 5.33–224.62 | 57.44 | 5.33–3359 | 46.98 | 7.03–192.8 | 0.652 | 0.00462 |
Pathologic stage.
Lymph node-positive disease.
Values expressed as pg ml–1.
Mann–Whitney U (two-sided test).
Descriptive statistical parameters and individual test performance characteristics measured from our training cohort for each biomarker within our statistical thresholds.
Abbreviations: NSCLC=non-small cell lung cancer; AUC=area under the curve; TNF=tumour necrosis factor; MCP=monocyte chemotactic protein; IL=interleukin; MMP=matrix metalloproteinases; CA=cancer antigen; MIP=macrophage inflammatory protein; CRP=C-reactive protein; CYFRA=CYFRA 21-1, cytokeratin 19 fragment.
Variable selection of biomarkers using Random Forests
|
| |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| 1 | 15 | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | 0.336 |
| 2 | 12 | X | X | X | X | X | X | X | X | X | X | X | X | 0.308 | |||
| 3 | 10 | X | X | X | X | X | X | X | X | X | X | 0.308 | |||||
| 4 | 8 | X | X | X | X | X | X | X | X | 0.317 | |||||||
| 5 | 6 | X | X | X | X | X | X | 0.308 | |||||||||
| 6 | 5 | X | X | X | X | X | 0.289 | ||||||||||
| 7 | 4 | X | X | X | X | 0.345 | |||||||||||
Abbreviations: OOB=out-of-bag (misclassification error); TNF=tumour necrosis factor; IL=interleukin; MMP=matrix metalloproteinases; MIP=macrophage inflammatory protein; CRP=C-reactive protein; SCF=stem cell factor; IFN=interferon; G-CSF=granulocyte colony-stimulating factor; M-CSF=monocyte colony-stimulating factor; sICAM-1=soluble intracellular adhesion molecule 1.
Figure 1Box plots for the 6 biomarkers identified by the Random Forest algorithm. Box plots for the six selected biomarkers selected by the Random Forest analysis on the discovery cohort. Abscissa labels: 0=surgically resected, non-neoplastic nodules, 1=‘normal’ controls, 2=Stage IA NSCLC, 3=Stage IB NSCLC and 4=Stages II and III (node positive) NSCLC. Notes: disease staging is based on pathologic stage; extreme values are not shown in the plots. Significance (Mann–Whitney Rank sum test) is shown with bars above boxes with a=P< 0.001; b=P<0.01 and c=P<0.05.
Figure 2Classification and Regression Tree for Final Panel. Classification and Regression Tree for predicting whether a patient is positive for NSCLC. Briefly, the algorithm represents a series of binary ‘if-then’ decision rules that are used to split the data into separate branches of the tree. Each node of the tree displays the analyte being considered and the threshold concentrations used to partition the patient groups. Additional classifications continue along each arm of the split in which it is indicated whether the measured value is either less than or equal to or exceeding the indicated threshold cutoff value. The number of classifications (observations) are listed at each terminal node, with each final arm labelled (0=NSCLC negative; 1=NSCLC positive). Abbreviations: obs.=observations; TNF-α=tumour necrosing factor-α; MCP-1=monocyte chemotactic protein-1; MMP-2=matrix metalloproteinase-2 and IL-1ra=interleukin-1 receptor antagonist.
Figure 3The ROC curve for the six-analyte serum test. ROC curve for the optimised six-analyte CART algorithm using the original training cohort of patients. Area under the curve=0.979; sensitivity=99% specificity=95%.