| Literature DB >> 28769740 |
Lisa J Croner1, Roslyn Dillon1, Athit Kao1, Stefanie N Kairs1, Ryan Benz1, Ib J Christensen2, Hans J Nielsen2, John E Blume1, Bruce Wilcox1.
Abstract
BACKGROUND: The aim was to improve upon an existing blood-based colorectal cancer (CRC) test directed to high-risk symptomatic patients, by developing a new CRC classifier to be used with a new test embodiment. The new test uses a robust assay format-electrochemiluminescence immunoassays-to quantify protein concentrations. The aim was achieved by building and validating a CRC classifier using concentration measures from a large sample set representing a true intent-to-test (ITT) symptomatic population.Entities:
Keywords: Blood tests; Clinical markers; Colorectal neoplasms; Proteomics; Tumor biomarkers
Year: 2017 PMID: 28769740 PMCID: PMC5526294 DOI: 10.1186/s12014-017-9163-z
Source DB: PubMed Journal: Clin Proteomics ISSN: 1542-6416 Impact factor: 3.988
Study subject characteristics for Endoscopy II overall, and for the current study subsets
| Endo II | CRC ITT discovery set | CRC ITT validation set | |||
|---|---|---|---|---|---|
| All | Control | Disease | Control | Disease | |
| Total | 4698 | 2759 | 340 | 1189 | 147 |
|
| |||||
| 31 | 605 | 340 | 53 | 135 | 26 |
| 32 | 299 | 171 | 13 | 81 | 9 |
| 33 | 966 | 579 | 71 | 249 | 21 |
| 34 | 300 | 190 | 18 | 65 | 12 |
| 35 | 957 | 564 | 72 | 240 | 34 |
| 36 | 858 | 510 | 48 | 248 | 17 |
| 37 | 713 | 405 | 65 | 171 | 28 |
|
| |||||
| Mean | 63.5 | 62.7 | 69.7 | 62.9 | 70.1 |
| Standard deviation | 12.6 | 12.6 | 10.6 | 12.7 | 10.7 |
| Median | 64.3 | 63.6 | 69.5 | 63.3 | 71.5 |
| Minimum | 18.1 | 20.1 | 37.5 | 18.1 | 23.6 |
| Maximum | 96.0 | 96.0 | 94.8 | 93.3 | 89.1 |
|
| |||||
| Female | 2455 (52.3) | 1473 (53.4) | 144 (42.4) | 650 (54.7) | 55 (37.4) |
| Male | 2243 (47.7) | 1286 (46.6) | 196 (57.6) | 539 (45.3) | 92 (62.6) |
|
| |||||
| Mean | 25.6 | 25.6 | 25.5 | 25.5 | 26.2 |
| Standard deviation | 4.6 | 4.7 | 4.7 | 4.5 | 3.9 |
| Median | 25.1 | 25.1 | 24.8 | 25.0 | 25.7 |
| Minimum | 11.7 | 13.0 | 15.8 | 11.7 | 16.9 |
| Maximum | 50.5 | 50.2 | 47.1 | 50.5 | 39.1 |
|
| |||||
| I | 101 (19.8) | 0 (0) | 74 (21.8) | 0 (0) | 25 (17.0) |
| II | 163 (31.9) | 0 (0) | 105 (31.0) | 0 (0) | 50 (34.0) |
| III | 139 (27.2) | 0 (0) | 87 (25.7) | 0 (0) | 45 (30.6) |
| IV | 108 (21.1) | 0 (0) | 73 (21.5) | 0 (0) | 27 (18.4) |
|
| |||||
| Colon cancer | 319 (6.8) | 0 (0) | 211 (62.1) | 0 (0) | 92 (62.6) |
| Rectal cancer | 193 (4.1) | 0 (0) | 129 (37.9) | 0 (0) | 55 (37.4) |
| Adenoma colon | 515 (11.0) | 340 (12.3) | 0 (0) | 148 (12.4) | 0 (0) |
| Adenoma rectum | 174 (3.7) | 117 (4.2) | 0 (0) | 51 (4.3) | 0 (0) |
| No comorbidity-no finding | 1164 (24.8) | 763 (27.7) | 0 (0) | 334 (28.1) | 0 (0) |
| Comorbidity-no finding | 814 (17.3) | 534 (19.4) | 0 (0) | 229 (19.3) | 0 (0) |
| Other cancer | 177 (3.8) | 119 (4.3) | 0 (0) | 50 (4.2) | 0 (0) |
| Other finding | 1342 (28.6) | 886 (32.1) | 0 (0) | 377 (31.7) | 0 (0) |
Protein targets across the five multiplexed panels, showing allowable ranges of analytical parameters
| Panel | Protein | Abbreviation | Dilution factor | Duplicate CV max (%) | Hill slope min | Hill slope max | LLoQ (pg/mL) | ULoQ (pg/mL) |
|---|---|---|---|---|---|---|---|---|
| 1 | Alpha-1-acid glycoprotein | A1AG | 300,000 | 20 | 0.9 | 1.1 | 12.21 | 50,000 |
| 1 | Alpha-1 antitrypsin | A1AT | 300,000 | 20 | 0.9 | 1.1 | 73.24 | 300,000 |
| 1 | Apolipoprotein A-I | APOA1 | 300,000 | 20 | 0.9 | 1.1 | 244.14 | 1,000,000 |
| 1 | Complement 3 | CO3 | 300,000 | 20 | 0.9 | 1.1 | 610.35 | 2,500,000 |
| 1 | Haptoglobin | HPT | 300,000 | 20 | 0.9 | 1.1 | 488.28 | 2,000,000 |
| 2 | Alpha-antichymotrypsin | AACT | 5000 | 20 | 0.6 | 0.8 | 1220.7 | 5,000,000 |
| 2 | Carbonic anhydrase 1 | CAH1 | 5000 | 20 | 0.9 | 1.1 | 2.44 | 10,000 |
| 2 | Clusterin | CLUS | 5000 | 20 | 0.8 | 1.2 | 244.14 | 1,000,000 |
| 2 | Complement 9 | CO9 | 5000 | 20 | 0.9 | 1.2 | 24.41 | 100,000 |
| 2 | C-reactive protein | CRP | 5000 | 20 | 0.9 | 1.1 | 12.21 | 50,000 |
| 2 | Dipeptidyl peptidase IV | DPPIV | 5000 | 20 | 0.9 | 1.1 | 2.44 | 10,000 |
| 2 | Serum amyloid A | SAA | 5000 | 20 | 0.9 | 1.1 | 12.21 | 50,000 |
| 2 | Transferrin receptor protein | TFRC | 5000 | 20 | 0.9 | 1.1 | 0.49 | 2000 |
| 3 | Protein S100-A8/-A9 | CALP | 100 | 20 | 1.1 | 1.6 | 48.83 | 200,000 |
| 3 | Cathepsin D | CATD | 100 | 20 | 0.9 | 1.1 | 19.53 | 80,000 |
| 3 | Growth differentiation factor 15 | GDF15 | 100 | 20 | 0.9 | 1.1 | 0.12 | 500 |
| 3 | Gelsolin | GELS | 100 | 20 | 0.85 | 1.1 | 488.28 | 2,000,000 |
| 3 | Prolyl endopeptidase FAP | SEPR | 100 | 20 | 0.85 | 1.15 | 2.44 | 10,000 |
| 3 | Tissue metalloproteinase inhibitor 1 | TIMP1 | 100 | 20 | 1.3 | 1.5 | 12.21 | 50,000 |
| 4 | Annexin A1 | ANXA1 | 4 | 20 | 0.9 | 1.1 | 12.21 | 50,000 |
| 4 | Carcinoembryonic antigen-related cell adhesion molecule 5 | CEA | 4 | 20 | 0.9 | 1.1 | 24.41 | 100,000 |
| 4 | Glycine-tRNA ligase | GARS | 4 | 20 | 0.9 | 1.1 | 122.07 | 500,000 |
| 4 | Macrophage migration inhibitory factor | MIF | 4 | 20 | 0.9 | 1.1 | 14.65 | 60,000 |
| 4 | Trefoil factor 3 | TFF3 | 4 | 20 | 0.9 | 1.1 | 0.49 | 2000 |
| 5 | Pyruvate kinase isozyme M2 | PKM2 | 4 | 20 | 0.9 | 1.2 | 7812.5 | 2,000,000 |
| 5 | Peroxiredoxin-1 | PRDX1 | 4 | 20 | 0.9 | 1.1 | 12.21 | 50,000 |
| 5 | P-selectin glycoprotein ligand 1 | PSGL | 4 | 20 | 0.9 | 1.1 | 12.21 | 50,000 |
Observed lower and upper limits of quantitation (LLoQ and ULoQ) are listed as concentrations of diluted samples
Fig. 1Validation ROC (black curve). The Validation AUC was 0.86 with a 95% confidence interval of 0.82–0.90 (shaded region). The selected sensitivity/specificity point was 0.80/0.83 (black dot). ROCs for each of the single model predictors are also shown (gray curves, univariate AUCs listed in the figure)
Confusion matrix from the validation set
| Model call | Total | ||
|---|---|---|---|
| Non-CRC | CRC | ||
|
| |||
| Non-CRC | 758 | 159 | 917 |
| CRC | 22 | 87 | 109 |
| Total | 780 | 246 | 1026 |
The final model’s performance parameters
| Sensitivity | 0.80 |
| Specificity | 0.83 |
| Positive predictive value (PPV) | 36.5% |
| Negative predictive value (NPV) | 97.1% |
The final model’s CRC sensitivities for early and late stage cancer
| CRC stage | Incorrect call | Correct call | Sensitivity |
|---|---|---|---|
| I–II | 12 | 36 | 0.75 |
| III–IV | 10 | 51 | 0.84 |
| Total | 22 | 87 | 0.80 |
| Fisher’s test | 0.338 |
Comparison of univariate CRC AUCs for the three proteins appearing in both the ELISA panel [14] and in the new electrochemiluminescense panel
| Protein | AUC, ELISA measures | AUC, electrochemiluminescence assay measures |
|---|---|---|
| CEA | 0.702 | 0.725 |
| CO9 | 0.706 | 0.742 |
| MIF | 0.558 | 0.493 |
AUCs were calculated for the CRC versus non-CRC discrimination across each study’s full Discovery set (Indeterminate samples were not removed)
Fig. 2Scatterplot of univariate AUCs for discriminating CRC from non-CRC in the full Discovery set of the current study versus the ELISA study [14]. Black dots show the eight proteins used in the current study’s final model; circles identify the proteins used in both the current and the ELISA panel. The identity line is dashed
Fig. 3Heatmap illustrating the relative CRC signal found using different combinations of feature selection methods (x-axis) and classifier algorithms (y-axis) across 56,841 classifier builds using ten predictors. The gray scale represents the median of Discovery set AUCs (each calculated as the median of the Discovery hold-out test sets, see “Methods” section) found across 147–10,848 builds (with varying algorithm parameters)