| Literature DB >> 32575471 |
Victoria El-Khoury1, Anna Schritz2, Sang-Yoon Kim3, Antoine Lesur3, Katriina Sertamo1, François Bernardin3, Konstantinos Petritis4, Patrick Pirrotte4, Cheryl Selinsky4, Jeffrey R Whiteaker5, Haizhen Zhang5, Jacob J Kennedy5, Chenwei Lin5, Lik Wee Lee5, Ping Yan5, Nhan L Tran6, Landon J Inge7, Khaled Chalabi8, Georges Decker9, Rolf Bjerkvig1,10, Amanda G Paulovich5, Guy Berchem1,11, Yeoun Jin Kim1.
Abstract
Lung cancer is the deadliest cancer worldwide, mainly due to its advanced stage at the time of diagnosis. A non-invasive method for its early detection remains mandatory to improve patients' survival. Plasma levels of 351 proteins were quantified by Liquid Chromatography-Parallel Reaction Monitoring (LC-PRM)-based mass spectrometry in 128 lung cancer patients and 93 healthy donors. Bootstrap sampling and least absolute shrinkage and selection operator (LASSO) penalization were used to find the best protein combination for outcome prediction. The PanelomiX platform was used to select the optimal biomarker thresholds. The panel was validated in 48 patients and 49 healthy volunteers. A 6-protein panel clearly distinguished lung cancer from healthy individuals. The panel displayed excellent performance: area under the receiver operating characteristic curve (AUC) = 0.999, positive predictive value (PPV) = 0.992, negative predictive value (NPV) = 0.989, specificity = 0.989 and sensitivity = 0.992. The panel detected lung cancer independently of the disease stage. The 6-protein panel and other sub-combinations displayed excellent results in the validation dataset. In conclusion, we identified a blood-based 6-protein panel as a diagnostic tool in lung cancer. Used as a routine test for high- and average-risk individuals, it may complement currently adopted techniques in lung cancer screening.Entities:
Keywords: lung cancer; molecular diagnostics; parallel reaction monitoring; plasma biomarker; targeted proteomics
Year: 2020 PMID: 32575471 PMCID: PMC7352295 DOI: 10.3390/cancers12061629
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Pathway enrichment analysis of the differentially expressed proteins in plasma from lung cancer patients and healthy donors. The enrichment analysis was done using Pathway Commons, Kyoto encyclopedia of genes and genomes (KEGG) and gene ontology (GO) databases. The top 10 significantly enriched pathways are shown. The analysis was done based on the concentrations of the 229 differentially expressed proteins in plasma from lung cancer patients (n = 128) and healthy volunteers (n = 93).
Figure 2Plasma levels of the 6 protein biomarkers identified as a lung cancer diagnostic panel. Scatter plots of (a) filamin-A (FLNA), (b) tubulin alpha-4A chain (TUBA4A), (c) glutathione S-transferase omega-1 (GSTO1), (d) peroxiredoxin-6 (PRDX6), (e) rho GDP-dissociation inhibitor 2 (ARHGDIB) and (f) cadherin-13 (CDH13) concentrations obtained from lung cancer patients (n = 128) and healthy volunteers (n = 93) using the LC-PRM assay targeting proteotypic peptides. Data points and their median are shown. **** Adjusted p < 0.0001 using the non-parametric Kruskal–Wallis test.
Performance of the logistic regression models in tumor prediction.
| Model | AIC | AUC | PPV | NPV | Specificity | Sensitivity |
|---|---|---|---|---|---|---|
| 6-protein combination | 30.876 | 0.999 | 0.992 | 0.989 | 0.989 | 0.992 |
| 3-protein combination | 31.402 | 0.999 | 0.984 | 0.968 | 0.978 | 0.977 |
| FLNA | 65.647 | 0.990 | 0.967 | 0.908 | 0.957 | 0.930 |
| TUBA4A | 41.556 | 0.997 | 0.984 | 0.948 | 0.978 | 0.961 |
| GSTO1 | 45.427 | 0.996 | 0.976 | 0.947 | 0.968 | 0.961 |
| PRDX6 | 51.763 | 0.993 | 0.976 | 0.957 | 0.968 | 0.969 |
| ARHGDIB | 54.303 | 0.981 | 0.992 | 0.929 | 0.989 | 0.945 |
| CDH13 | 219.090 | 0.845 | 0.791 | 0.747 | 0.699 | 0.828 |
| TFPI | 204.860 | 0.851 | 0.836 | 0.737 | 0.785 | 0.797 |
| Xpresys® XL panel | 45.592 | 0.996 | 0.969 | 0.957 | 0.957 | 0.969 |
| ALDOA | 43.946 | 0.994 | 0.969 | 0.947 | 0.957 | 0.961 |
| COL18A1 | 250.790 | 0.767 | 0.752 | 0.630 | 0.677 | 0.711 |
| FTL | 297.720 | 0.554 | 0.579 | NaN | 0.000 | 1.000 |
| LGALS3BP | 295.220 | 0.601 | 0.601 | 0.500 | 0.258 | 0.813 |
| THBS1 | 161.780 | 0.924 | 0.871 | 0.794 | 0.828 | 0.844 |
FLNA = Filamin-A; TUBA4A = Tubulin alpha-4A chain; GSTO1 = Glutathione S-transferase omega-1; PRDX6 = Peroxiredoxin-6; ARHGDIB = Rho GDP-dissociation inhibitor 2; CDH13 = Cadherin-13; TFPI = Tissue factor pathway inhibitor; ALDOA = Fructose-bisphosphate aldolase A; COL18A1 = Collagen alpha-1(XVIII) chain; FTL = Ferritin light chain; LGALS3BP = Galectin-3-binding protein; THBS1 = Thrombospondin-1; AIC = Akaike Information Criterion; AUC = Area under the receiver operating characteristic curve; PPV = Positive predictive value; NPV = Negative predictive value; NaN = Not a number (cannot be calculated since no patient was classified as not having a cancer).
Number of clinically annotated and predicted healthy and lung cancer patients, including their stages, as obtained using the 6-protein classifier.
| Cancer stages | Clinically Annotated Stages | ||||||
|---|---|---|---|---|---|---|---|
| No cancer | Stage NA * | Stage I | Stage II | Stage III | Stage IV | ||
|
| 92 | 1 | 1 | 0 | 1 | 0 | |
|
|
| 0 | 2 | 0 | 1 | 0 | 0 |
|
| 0 | 2 | 9 | 1 | 2 | 6 | |
|
| 0 | 0 | 0 | 0 | 0 | 1 | |
|
| 0 | 0 | 0 | 1 | 0 | 0 | |
|
| 1 | 6 | 13 | 8 | 16 | 57 | |
|
| 93 | 11 | 23 | 11 | 19 | 64 | |
* NA = not available.
Threshold values and positivity of the biomarkers when optimizing the global accuracy (TA) or the sensitivity or specificity (TS) of the panel, as defined by PanelomiX platform.
| Protein Biomarker | TA | TS |
|---|---|---|
| FLNA | >0.48091298 | >0.48091298 |
| TUBA4A | >1.6875327 | >0.18983749 |
| GSTO1 | >5.363042 | >5.363042 |
| PRDX6 | >5.9975386 | >4.038682 |
| ARHGDIB | >0.5091874 | >0.5091874 |
| CDH13 | <69.826614 | <148.1571 |
Performance of the classification models on the validation dataset.
| Performance metrics | 6-Protein Panel | Xpresys® XL Panel | ||
|---|---|---|---|---|
| TA Thresholds | TS Thresholds | Logistic Regression | Logistic Regression | |
| NPV (95% CI) | 0.840 (0.709–0.928) | 0.849 (0.724–0.933) | 0.935 (0.821–0.986) | 0.930 (0.809–0.985) |
| PPV (95% CI) | 0.851 (0.717–0.938) | 0.909 (0.783–0.975) | 0.882 (0.761–0.956) | 0.833 (0.707–0.921) |
| Sensitivity (95% CI) | 0.833 (0.698–0.925) | 0.833 (0.698–0.925) | 0.938 (0.828–0.987) | 0.938 (0.828–0.987) |
| Specificity (95% CI) | 0.857 (0.728–0.941) | 0.918 (0.804–0.977) | 0.878 (0.752–0.954) | 0.816 (0.680–0.912) |
| AUC (95% CI) | 0.845 (0.773–0.918) | 0.876 (0.810–0.942) | 0.908 (0.850–0.965) | 0.877 (0.812–0.942) |