| Literature DB >> 32085733 |
Cherylle Goebel1, Christopher L Louden2, Robert Mckenna3, Osita Onugha3, Andrew Wachtel4, Thomas Long5.
Abstract
BACKGROUND: In a previous study (Goebel et. al, Cancer Genomics Proteomics 16:229-244, 2019), we identified 33 biomarkers for an early stage (I-II) Non-Small Cell Lung Cancer (NSCLC) test with 90% accuracy, 80.3% sensitivity, and 95.4% specificity. For the current study, we used a narrowed ensemble of 21 biomarkers while retaining similar accuracy in detecting early stage lung cancer.Entities:
Keywords: Biomarkers; Detection; Diagnostic tests; Early stage lung cancer; Immunoassay; Non-small cell lung cancer; Proteomics
Year: 2020 PMID: 32085733 PMCID: PMC7035746 DOI: 10.1186/s12885-020-6625-x
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Sample Criteria
| Cohort | Inclusion | Exclusion |
|---|---|---|
| All cohort | M/F, 18 y/o or older, sample collected in the USA | Pregnant, incarcerated, lack of capacity to consent, samples collected outside of sthe United States |
| Asthma | Smoker or non-smoker | Any cancer diagnosis |
| Non-Smoker | Healthy | Smokers, any cancer diagnosis |
| NSCLC, (Stage I-II) | Stage I-II; smoker or non-smoker | Stage III-IV lung cancer |
| Smoker | 10 pack years | Any cancer diagnosis |
| Other Cancers | Breast, colon-rectal, pancreatic, and prostate cancer, all stages; smoker or non-smoker |
The non-smoker and NSCLC served as negative and positive control for lung cancer, respectively. Asthma sufferer and COPDs were included to test whether the diagnostic test can differentiate lung cancer from those who may have other respiratory diseases which share similar symptoms. The smokers consisted of high-risk population for LC who were not diagnosed with any cancer. Other cancers (i.e., breast, prostate, pancreatic, and colon-rectal) were included to ensure that the diagnostic test was specific to NSCLC
Sample Distribution
| African-American | Caucasian | Hispanic | Total | |
|---|---|---|---|---|
| Training Set | ||||
| | ||||
| Asthma | 0 | 10 | 1 | 11 |
| Breast Cancer | 0 | 1 | 0 | 1 |
| Colon-Rectal Cancer | 0 | 2 | 0 | 2 |
| Non-Smoker | 15 | 17 | 10 | 42 |
| NSCLC, (Stage I-II) | 10 | 22 | 10 | 42 |
| Pancreatic Cancer | 0 | 2 | 0 | 2 |
| Smoker | 12 | 15 | 6 | 33 |
| | ||||
| Asthma | 0 | 4 | 0 | 4 |
| Non-Smoker | 18 | 14 | 9 | 41 |
| NSCLC, (Stage I-II) | 9 | 17 | 10 | 37 |
| Pancreatic Cancer | 0 | 2 | 0 | 2 |
| Smoker | 22 | 16 | 4 | 42 |
| | ||||
| Validation Set | ||||
| | ||||
| Asthma | 0 | 8 | 0 | 8 |
| Breast Cancer | 5 | 35 | 0 | 40 |
| Colon-Rectal Cancer | 0 | 3 | 0 | 3 |
| Non-Smoker | 9 | 12 | 9 | 30 |
| NSCLC, Stage I | 6 | 17 | 4 | 27 |
| Pancreatic Cancer | 0 | 2 | 0 | 2 |
| Smoker | 9 | 11 | 5 | 25 |
| | ||||
| Asthma | 0 | 3 | 0 | 3 |
| Colon-Rectal Cancer | 0 | 2 | 0 | 2 |
| Non-Smoker | 7 | 11 | 9 | 27 |
| NSCLC, Stage I | 5 | 18 | 5 | 28 |
| Pancreatic Cancer | 0 | 1 | 0 | 1 |
| Prostate | 3 | 6 | 0 | 9 |
| Smoker | 10 | 10 | 3 | 23 |
| | ||||
All samples were collected in the United States and proportionately distributed between genders. The age range was between 21 and 82 years old with an average age of 56
Results of Algorithm Models. Results of Optimized Algorithm Models (Training Set)
| Biomarker | Algorithm 33 | Algorithm 21 | LCDT1 Algorithm |
|---|---|---|---|
| SE (95% CI) | 92.8% (87.9, 96.1%) | 97.4% (92.0, 99.5%) | 92.4% (89.2, 94.3%) |
| SP (95% CI) | 97.2% (95.5, 98.8%) | 98.3% (95.4, 99.5%) | 96.9% (95.2, 98.0%) |
The LCDT1 algorithm was developed with slight modifications using a smaller subset of biomarkers from the 21. This information is proprietary and a patent application was filed
Median Biomarker Concentrations and P-Value Using the Training Set
| Biomarker | NSCLC Median [Q1-Q3], pg/mL | Asthma, Smokers, Non-Smokers Median [Q1-Q3], pg/mL | |
|---|---|---|---|
| CA125 | 26.4 [13.7–41.7] | 13.6 [6.9–36.7] | 0.073 |
| CEA | 2884.4 [1815.9–5573.8] | 2115.3 [1194.8–3242.2] | 0.003 |
| CXCL9-MIG | 4378.2 [2604.0–6844.5] | 908.0 [539.0–1965.8] | < 0.001 |
| CYFRA-21-1 | 5354.8 [3429.5–8090.4] | 5088.3 [2939.9–9770.1] | 0.026 |
| GRO | 2890.0- [2076.9–4178.0] | 874.4 [507.5–1790.2] | < 0.001 |
| HGF | 869.4 [643.9–1647.1] | 476.3 [271.9–1177.1] | 0.006 |
| IL-10 | 22.8 [11.7–38.2] | 23.8 [14.0–45.8] | 0.525 |
| IL-12p70 | 21.1 [15.5–27.0] | 19.9 [16.5–127.4] | 0.082 |
| IL-16 | 693.6 [345.1–1458.7] | 717.5 [298.8–1469.4] | 0.902 |
| IL-2 | 11.5 [10.9–16.6] | 33.8 [19.7–52.1] | 0.005 |
| IL-4 | 41.7 [25.3–51.6] | 33.3 [22.5–50.3] | 0.902 |
| IL-5 | 17.9 [15.7–23.5] | 28.7 [12.8–46.8] | 0.188 |
| IL-7 | 10.6 [10.6–10.6] | 34.9 [18.9–61.1] | NA |
| IL-8 | 126.3 [44.5–323.8] | 23.6 [15.9–42.3] | < 0.001 |
| IL-9 | 11.9 [11.0–20.7] | 22.3 [15.2–42.6] | 0.016 |
| Leptin | 30,408.0 [16,682.6–45,886.3] | 22,190.7 [8684.4–54,863.7] | 0.224 |
| LIF | 45.5 [30.5–79.3] | 39.6 [27.1–82.3] | 0.511 |
| MCP-1 | 530.2 [391.363–721.512] | 372.8 [279.7–462.0] | < 0.001 |
| MIF | 865.6 [453.6–1501.3] | 395.0 [196.7–1274.8] | 0.752 |
| MMP-7 | 1978.3 [1184.2–3190.33] | 3585.2 [2671.9–5080.6] | < 0.001 |
| MMP-9 | 93,587.2 [62,827.2–124,300.6] | 12,593.8 [8856.8–19,799.6] | < 0.001 |
| MPO | 353,987.8 [246,376.2–616,739.2] | 117,658.8 [69,768.5–212,726.3] | < 0.001 |
| NSE | 7273.5 [3852.3–10,487.8] | 6576.1 [3806.6–46,981.4] | < 0.001 |
| PDGF AB/BB | 25,169.6 [21,611.8–30,055.0] | 41,800.6 [26,115.3–53,016.0] | < 0.001 |
| RANTES | 105,356.2 [79,497.9–155,040.2] | 38,458.4 [23,423.8–112,641.5] | 0.003 |
| Resistin | 35,145.6 [25,185.8–53,466.7] | 12,966.2 [9521.2–17,533.1] | < 0.001 |
| SAA | 6.55e7 [2.52e7–1.2e8] | 8.5e6 [4.175e6–1.9825e7] | < 0.001 |
| sCD40L | 381.8 [155.9–752.5] | 219.9 [110.285–628.7] | 0.018 |
| sEGFR | 654.9 [544.0–1175.5] | 936.5 [543.2–1943.2] | < 0.001 |
| sFasL | 229.8 [78.2–498.2] | 263.7 [135.9–573.4] | 0.03 |
| sICAM-1 | 150,304.6 [123,699.9–187,843.8] | 145,329.4 [117,164.7–182,796.7] | 0.519 |
| sTNFRII | 15,477.5 [11,712.9–20,103.6] | 5818.1 [4574.8–7295.3] | < 0.001 |
| TNFRI | 2514.8 [1748.7–3743.5] | 619.5 [413.9–860.2] | < 0.001 |
Table was generated using R Version 3.4.4
Fig. 1Validation Test Result
Blind Test Performance for the 33, 21, and LCDT1 Algorithm (Validation Set)
| Statistics | Algorithm 33 | Algorithm 21 | LCDT1 Algorithm |
|---|---|---|---|
| Accuracy | 95.6% (92.4, 97.7%) | 94.3% (90.7, 96.8%) | 95.6% (92.4, 97.7%) |
| Sensitivity | 89.1% (78.9, 95.3%) | 89.1% (78.9, 95.3%) | 89.1% (78.9, 95.3%) |
| Specificity | 97.7% (94.6, 99.2%) | 96.0% (92.2–98.2%) | 97.7% (94.6, 99.2%) |
| Positive Predictive Value (PPV) | 92.5% (83.0, 97.4%) | 87.5% (77.0, 94.2%) | 92.5% (83.0, 97.4%) |
| Negative Predictive Value (NPV) | 96.6% (93.1, 98.6%) | 96.5% (93.0, 98.5%) | 96.6% (93.1, 98.6%) |
| NSCLC Prevalence | 24.1% | 24.1% | 24.1% |
| True Positive (TP) | 49 | 44 | 49 |
| True Negative (TN) | 169 | 166 | 169 |
| False Positive (FP) | 4 | 6 | 4 |
| False Negative (FN) | 6 | 7 | 6 |
All entries show the statistical (95% CI). *Other cancer types were included in the analysis. Each subject consisted of two replicates (N = 2) or two data points processed by the algorithm. If one data point was positive, then the subject was considered positive for LC. Table was generated using R Version 3.4.4
Fig. 2ROC/AUC Curves. a. ROC/AUC Curves with other cancers types included. b. ROC/AUC Curves with only NSCLC cancers. Figures were generated using R version 3.4.4