| Literature DB >> 32125097 |
Ailing Liu1, Zhiheng Wang2, Yachao Yang3, Jingtao Wang2, Xiaoyu Dai2, Lijie Wang2, Yuan Lu2, Fuzhong Xue2,4.
Abstract
BACKGROUND: Lung cancer is the most commonly diagnosed cancer worldwide. Its survival rate can be significantly improved by early screening. Biomarkers based on radiomics features have been found to provide important physiological information on tumors and considered as having the potential to be used in the early screening of lung cancer. In this study, we aim to establish a radiomics model and develop a tool to improve the discrimination between benign and malignant pulmonary nodules.Entities:
Keywords: computed tomography; early screening; lung cancer; nomogram; pulmonary nodule; radiomics
Mesh:
Year: 2020 PMID: 32125097 PMCID: PMC7163925 DOI: 10.1002/cac2.12002
Source DB: PubMed Journal: Cancer Commun (Lond) ISSN: 2523-3548
Clinical characteristics of the 875 patients with pulmonary nodules in the training cohort and validation cohort
| Training cohort ( | Validation cohort ( | ||||||
|---|---|---|---|---|---|---|---|
| Characteristic | Benign | Malignant |
| Benign | Malignant |
| |
| Age (years) | < 0.001 | 0.546 | |||||
| < 55 | 59 (52.2) | 137 (27.5) | 22 (39.3) | 70 (33.8) | |||
| ≥ 55 | 54 (47.8) | 362 (72.5) | 34 (60.7) | 137 (66.2) | |||
| Gender | 0.113 | 0.537 | |||||
| Female | 60 (53.1) | 308 (61.7) | 32 (57.1) | 130 (62.8) | |||
| Male | 53 (46.9) | 191 (38.3) | 24 (42.9) | 77 (37.2) | |||
| Location of pulmonary nodule | 0.030 | 0.506 | |||||
| In lung | Right upper | 32 (28.3) | 190 (38.1) | 12 (21.4) | 65 (31.4) | ||
| Right middle | 10 (8.8) | 37 (7.4) | 5 (8.9) | 22 (10.6) | |||
| Right lower | 30 (26.5) | 85 (17.0) | 10 (17.9) | 35 (16.9) | |||
| Left upper | 18 (15.9) | 112 (22.4) | 15 (26.8) | 50 (24.2) | |||
| Left lower | 23 (20.4) | 75 (15.0) | 14 (25.0) | 35 (16.9) | |||
Note: P values were obtained from the univariate association analyses between the pulmonary nodules and each clinical characteristic.
Figure 1Radiomics feature selection using the LASSO logistic regression model. A. The curve of the coefficient path of 1288 radiomics features in the training cohort. The dashed vertical line was drawn using the value of the selected in the 10‐fold cross‐validation in B, and there are 20 features with nonzero coefficients. B. The adjustment penalty parameter λ was selected in the LASSO model by 10‐fold cross‐validation based on the error within one standard error range of the minimum. The AUC values from the LASSO regression cross‐validation process were plotted as a function of . The y‐axis represents the AUC value. The lower x‐axis represents. The number above the x‐axis represents the average of the predictors. The red dot indicates the AUC value of each model with a given λ, and the vertical bar of the red dot indicates the upper and lower limits of the deviation. The vertical black line defines the best value for, where the model can provide the best result for the data. The best λ value was 0.024, , and a total of 20 variables were selected. C‐D. The waterfall plot of the training cohort (C) and validation cohort (D) was used to visualize the distribution of the radiomics score and the benign and malignant state of the pulmonary nodules of individual patients. The best cutoff value was 0.981. Abbreviations: LASSO, least absolute shrinkage and selection operator; AUC, area under the curve
Figure 2Validation of the radiomics score. The comparison of the ROC curve of the logistic regression model with the clinical variables (red line) and the ROC curve of the logistic model with clinical variables along with the radiomics score (green line). Abbreviations: ROC, receiver operating characteristic; AUC, area under the curve
Univariate and multivariate logistic regression analysis of the predicted factors for pulmonary nodules of patients in the training cohort
| Univariate analysis | Multivariate analysis | ||||
|---|---|---|---|---|---|
| Characteristic | OR (95% CI) |
| OR (95% CI) |
| |
| Radiomics score | 5.89 (4.12‐8.41) | < 0.001 | 5.77 (4.03‐8.27) | < 0.001 | |
| Age (years) | < 55 | 1.00 | 1.00 | ||
| ≥ 55 | 1.05 (1.02‐1.07) | < 0.001 | 1.04 (1.01‐1.06) | 0.002 | |
| Gender | Male | 1.00 | |||
| Female | 0.70 (0.47‐1.06) | 0.092 | – | – | |
| Location of pulmonary nodule in lung | Right upper | 1.00 | |||
| Right middle | 0.62 (0.28‐1.38) | 0.242 | – | – | |
| Right lower | 0.48 (0.27‐0.84) | 0.010 | 0.58 (0.30‐1.12) | 0.105 | |
| Left upper | 1.05 (0.56‐1.95) | 0.883 | – | – | |
| Left lower | 0.55 (0.30‐1.00) | 0.050 | – | – | |
Note: ‐, not available. The variables were eligible for inclusion in the multivariate analysis if P < 0.05 in univariate analysis, so the OR and P values were not available. Abbreviations: OR, odd ratio; CI, confidence interval.
Figure 3The ROC curve of the logistic regression model. A‐B shows the ROC curves of the multivariate logistic regression model with radiomics score and age in the training (A) and validation (B) cohorts. C shows the test between the two ROC curves using the DeLong method
Figure 4Radiomics nomogram and calibration curves. A. Radiomics nomogram for predicting benign and malignant pulmonary nodules. B‐C. Calibration curves of the radiomics nomogram in the training (B) and validation (C) cohorts. The calibration curve describes the calibration of the nomogram based on the agreement between the prediction of the benign and malignant pulmonary nodules and the observed actual benign and malignant results. The black line represents the perfect prediction, and the blue line represents the prediction performance of the radiomics nomogram