| Literature DB >> 35053597 |
Arsela Prelaj1,2, Mattia Boeri3, Alessandro Robuschi2, Roberto Ferrara1, Claudia Proto1, Giuseppe Lo Russo1, Giulia Galli1, Alessandro De Toma1, Marta Brambilla1, Mario Occhipinti1, Sara Manglaviti1, Teresa Beninato1, Achille Bottiglieri1, Giacomo Massa1, Emma Zattarin1, Rosaria Gallucci1, Edoardo Gregorio Galli1, Monica Ganzinelli1, Gabriella Sozzi3, Filippo G M de Braud1, Marina Chiara Garassino1, Marcello Restelli2, Alessandra Laura Giulia Pedrocchi2, Francesco Trovo'2.
Abstract
(1) Background: In advanced non-small cell lung cancer (aNSCLC), programmed death ligand 1 (PD-L1) remains the only biomarker for candidate patients to immunotherapy (IO). This study aimed at using artificial intelligence (AI) and machine learning (ML) tools to improve response and efficacy predictions in aNSCLC patients treated with IO. (2)Entities:
Keywords: artificial intelligence; biomarker; immunotherapy; machine learning; non-small cell lung cancer
Year: 2022 PMID: 35053597 PMCID: PMC8773718 DOI: 10.3390/cancers14020435
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Development of the plasma MSC test using 24 miRNA and 4 risk groups. MSC: microRNA signature classifier; N.A.: not analyzable.
Figure 2Process and methods used in this study.
Features selected based on literature review and clinician experience, and keeping only one of the variables in a pair showing linear correlation >0.8.
| Feature Classes | Features |
|---|---|
| Clinical features | Age, sex, smoker/non-smoker, packs per year, ECOG |
| Laboratory exams | NLR * NLR4, LDH |
| Tumor features | PD-L1, histology (adenocarcinoma, squamous, other) |
| Radiological | Metastatic sites (liver, brain, bone) |
| Treatment features | IO line (first or further line) |
| Omic features | MSC test |
* NLR was used both as a continuous variable or binary variable with cut-off 4.
Features selected for the different models and corresponding performances.
| ML Model | Selected Features | AIC | ACC | F1 | AUC |
|---|---|---|---|---|---|
| LR | ECOG, IOLine, NLR, MSC, PD-L1 | 132.5 | 0.756 | 0.722 | 0.83 (0.76–0.88) |
| FFNN | NLR, IOLine, MSC, LDH, ECOG, PackYear | 137.2 | 0.732 | 0.686 | 0.80 (0.73–0.86) |
| K-NN | NLR, IOLine, ECOG, MSC, NLR4 | 137.4 | 0.726 | 0.667 | 0.81 (0.74–0.87) |
| SVM | ECOG, IOLine, NLR, MSC, PD-L1 | 134.5 | 0.738 | 0.703 | 0.83 (0.75–0.88) |
| RF | NLR, IOLine, ECOG, Age, MSC | 135.5 | 0.701 | 0.657 | 0.82 (0.73–0.87) |
Figure 3Confusion matrix for the analyzed ML models for responders (R) and non-responders (NR) for ML model algorithms (a) NN, (b) LR, (c) K-NN, (d) SVM and (e) RF.
Figure 4ROC curves (true positive rate (TPR) vs. false positive rate (FPR)) for the analyzed ML models. The performance of PD-L1 is represented as a red circle. As suggested by the AUC confidence intervals, there is no method that outperforms the others significantly.
Performances of the LR method when some of the features are removed from the initial pool of available ones.
| Initial Feature Set | Selected Features | ACC | F1 | AUC |
|---|---|---|---|---|
| All | ECOG, IOLine, NLR, MSC, PD-L1 | 0.756 | 0.722 | 0.83 (0.76–0.88) |
| No PD-L1 | ECOG, IOLine, NLR, MSC | 0.726 | 0.696 | 0.82 (0.75–0.88) |
| NO MSC | ECOG, IOLine, NLR, PD-L1, Age | 0.750 | 0.709 | 0.81 (0.74–0.87 |
| NO PD-L1 and MSC | ECOG, IOLine, NLR, Age | 0.707 | 0.662 | 0.80 (0.73–0.86) |
| NO ECOG | IOLine, NLR, MSC, PD-L1 | 0.726 | 0.690 | 0.80 (0.73–0.87) |
Figure A1ROC curves for the LR method starting from different sets of features. The performance of PD-L1 is represented as a red circle. As suggested by the AUC confidence intervals, there is no method that outperforms the others significantly.
Figure 5Kaplan–Meier according to PFS (a) and OS (b) curves divided by R (red curves) and NR (blue curves) groups.
Figure A2The Kaplan–Meier curves according to line of therapy in responder (R) and non-responder (NR) groups are reported. PFS for first (A) and second or further line (B) and OS for first (C) and second or further line IO (D).
Features selected for the different models and corresponding performances for the task of predicting the long- survival patients.
| ML Model | Selected Features | AIC | ACC | F1 | AUC |
|---|---|---|---|---|---|
| LR | ECOG, Histology, NLR, IOLine | 58.1 | 0.855 | 0.917 | 0.89 (0.80–0.94) |
| FFNN | Histology, NLR, PD-L1, NLR4 | 61.4 | 0.839 | 0.908 | 0.87 (0.78–0.92) |
| K-NN | NLR, PD-L1, Histology | 60.6 | 0.847 | 0.916 | 0.88 (0.80–0.93) |
| SVM | Age, Histology, MSC, ECOG, PD-L1, NLR | 63.2 | 0.847 | 0.913 | 0.90 (0.83–0.94) |
| RF | NLR, PD-L1 | 63.8 | 0.847 | 0.917 | 0.83 (0.74–0.89) |
Figure A3ROC curves for the LR method predicting the patients’ survival (longer or shorter than 24 months). The performance of PD-L1 is represented as a red circle.