Xiaoling Ma1, Wenzhi Lv2, Cong Wang3, Dehao Tu4, Jinhan Qiao1, Chanyuan Fan1, Jiandong Niu5, Wen Zhou1, Qiuyu Liu1, Liming Xia1. 1. Department of Radiology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China. 2. Department of Artificial Intelligence, Julei Technology Company, Wuhan, China. 3. Jiangsu Key Laboratory of Medical Optics, Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, China. 4. Department of Thoracic surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China. 5. Department of Radiology, The General Hospital of Ningxia Medical University, Yinchuan, China.
Abstract
BACKGROUND: We aimed to construct a clinical-radiomics nomogram to predict disease-free survival (DFS) and the added survival benefit of adjuvant chemotherapy (ACT) for node-negative, early-stage (I-II) lung adenocarcinoma (ADC). METHODS: In this retrospective study including 310 patients from two independent cohorts, the CT-derived radiomics features were selected by least absolute shrinkage and selection operator Cox regression to generate a radiomics signature associated with DFS. The radiomics signature was incorporated to construct a clinical-radiomics nomogram along with the independent clinical risk predictors. The model performance was evaluated with reference to discrimination quantified by Harrell concordance index (C-index), integrated discrimination improvement (IDI) and net reclassification index (NRI), calibration and clinical utility. The risk score (RS) for clinical-radiomics nomogram was calculated. The association between ACT and survival benefit was assessed in high and low RS subgroup. RESULTS: The clinical-radiomics nomogram achieved the highest C-index of 0.822 [95% confidence interval (CI): 0.769, 0.876] in training cohort and 0.802 (95% CI: 0.716, 0.888) in validation cohort. The incorporation of radiomics signature into clinical-radiomics nomogram showed an incremental benefit over clinical nomogram according to the improved NRI and IDI. The calibration curves and decision curve analysis further verified the clinical utility of clinical-radiomics nomogram. Further, patients with high RS based on clinical-radiomics nomogram were more prone to benefit from ACT. CONCLUSIONS: The clinical-radiomics nomogram approach can feasibly conduct risk prediction and have potential to identify the beneficiaries of ACT among patients with node-negative, early-stage ADC, which might serve as a helpful tool in informing therapeutic decision-making. 2022 Journal of Thoracic Disease. All rights reserved.
BACKGROUND: We aimed to construct a clinical-radiomics nomogram to predict disease-free survival (DFS) and the added survival benefit of adjuvant chemotherapy (ACT) for node-negative, early-stage (I-II) lung adenocarcinoma (ADC). METHODS: In this retrospective study including 310 patients from two independent cohorts, the CT-derived radiomics features were selected by least absolute shrinkage and selection operator Cox regression to generate a radiomics signature associated with DFS. The radiomics signature was incorporated to construct a clinical-radiomics nomogram along with the independent clinical risk predictors. The model performance was evaluated with reference to discrimination quantified by Harrell concordance index (C-index), integrated discrimination improvement (IDI) and net reclassification index (NRI), calibration and clinical utility. The risk score (RS) for clinical-radiomics nomogram was calculated. The association between ACT and survival benefit was assessed in high and low RS subgroup. RESULTS: The clinical-radiomics nomogram achieved the highest C-index of 0.822 [95% confidence interval (CI): 0.769, 0.876] in training cohort and 0.802 (95% CI: 0.716, 0.888) in validation cohort. The incorporation of radiomics signature into clinical-radiomics nomogram showed an incremental benefit over clinical nomogram according to the improved NRI and IDI. The calibration curves and decision curve analysis further verified the clinical utility of clinical-radiomics nomogram. Further, patients with high RS based on clinical-radiomics nomogram were more prone to benefit from ACT. CONCLUSIONS: The clinical-radiomics nomogram approach can feasibly conduct risk prediction and have potential to identify the beneficiaries of ACT among patients with node-negative, early-stage ADC, which might serve as a helpful tool in informing therapeutic decision-making. 2022 Journal of Thoracic Disease. All rights reserved.
Lung cancer remains the leading lethal malignancy, responsible for 18.4% of all cancer-related deaths worldwide in 2018 (1). Non-small cell lung cancer (NSCLC) is in the majority of all lung cancer cases, accounting for approximately 85% (2). Surgical resection is the prime therapeutic modality for early-stage (I–II) NSCLC (3). Despite generally prescribed with curing excision, early-stage NSCLC remains at a substantial risk of recurrence. Even in stage I NSCLC, more than 30% of patients will suffer a relapse after surgery within 5 years (4).The tumor-node-metastasis (TNM) classification is regarded as the dominant reference criteria for prognosis evaluation in NSCLC. Given the heterogeneous outcomes of NSCLC patients even in identical pathological stage, it is insufficient to estimate prognosis and direct treatment merely depending on TNM classification system. Adjuvant chemotherapy (ACT) is an additional therapeutic option for lung cancer. Several large-sample, randomized controlled, clinical trials have reported the survival benefit of platinum-based ACT following surgery in patients with NSCLC (5-8). ACT is generally recommended for resected stage II–III patients (9), but there is still an unsolved debate on the indication of ACT in node-negative, early-stage NSCLC, especially in stage I cases. A comprehensive analysis based on data from The National Cancer Database, observed a survival benefit associated with ACT in patients with tumors 3 cm or larger and high-risk clinicopathological factors (10). Krivitsky et al. constructed a risk stratification model for negative-node NSCLC and assigned 9% of stage I patients at high-risk who could be potential candidates for ACT (11). More interestingly, series studies demonstrated the improved survival associated with ACT even in stage IA patients with high-risk histopathological factors (12,13).These discrepant propositions indicate the compelling necessity to refine risk stratification, intended to recognize patients at a substantial risk of recurrence as target candidates for more strict management and individualized therapy. Several efforts have been put into developing prognostic biomarkers, including genetic phenotypes (4,14), cytokines (15) and serological indicators (16), but none of these established biomarkers can be clinically practical prior to laboratory testing standardization and trial validation (17). “Radiomics” is an emerging translational field of research widely applied in various aspects of medicine domain. Quantitative radiomics features excavated from multimodality medical images can noninvasively embody abundant information underlying pathophysiology and habitat microenvironment within tumors, which are hard to be appreciated by human naked eyes (18,19). Lung adenocarcinoma (ADC) accounting for approximately 60% of NSCLC has distinct characteristics in histological architecture and molecular expression from other subtypes of lung cancer (20). This study sought to develop and validate a prognostic model based on the radiomics signature and clinical risk predictors to predict survival and conduct risk stratification in node-negative, early-stage ADC. It is hypothesized that this prognostic model could effectively identify potential beneficiaries of ACT among node-negative, early-stage ADC patients.We present the following article in accordance with the TRIPOD reporting checklist (available at https://jtd.amegroups.com/article/view/10.21037/jtd-21-1520/rc).
Methods
Patients
The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The Institutional Ethics Committee of Tongji Hospital approved this retrospective study (No. TJ-IRB-20180615) and the requirement for informed consent was waived. We retrospectively reviewed 4,124 patients receiving tumor radical resection and systematic lymph nodes dissection in Tongji hospital affiliated to Huazhong University of Science and Technology from May 2012 to September 2016. Totally, 186 eligible patients from 2,561 patients undergoing preoperative contrast-enhanced CT examination on the multi-slice spiral CT scanner of GE Discovery CT 750 HD (CT scanner 1) were included and constituted the training cohort. We enrolled 124 eligible patients from 1,563 patients receiving preoperative contrast-enhanced CT examination on the multi-slice spiral CT scanner of TOSHIBA Aquilion One (CT scanner 2) to set up the validation cohort. The inclusion criteria consisted of: (I) pathologically confirmed early-stage (I–II) ADC with negative margin and without regional lymph node and distant metastasis (N0M0R0); (II) no presurgical radiotherapy/chemotherapy and postsurgical radiotherapy/targeted therapy; (III) within interval time of 2 weeks between presurgical contrast-enhanced CT examination and surgical resection. Tumor stage was determined according to the eighth edition of TNM classification for lung cancer. We excluded the patients: (I) with ADC in situ, minimally invasive ADC, the rare variants of ADC including mucinous, cribriform, colloidal, fetal and intestinal ADC; (II) with two or more metachronous and synchronous primary tumors or locoregional invasion; (III) incomplete pathological reports; (IV) presurgical thin-section CT images were unavailable or CT images were highly distorted by motion artifacts; (V) lack of follow-up data; (VI) subjected to other cancers.showed the recruitment pathway. The baseline characteristics including gender, age, smoking history, pack-year, histological grade, preoperative serum carcinoembryonic antigen (CEA) status, Ki-67 labeling index (LI), tumor size, pathological TNM stage, epidermal growth factor receptor (EGFR) mutation status, follow-up time and therapy were recorded. Histological grade is the prognostic classification of ADC predominant growth patterns, which separates invasive ADC into three prognostic grades: low grade (lepidic predominant ADC), intermediate grade (acinar or papillary predominant ADC) and high grade (solid or micropapillary predominant ADC) (21).
Figure 1
The recruitment pathway of this study. CT, computer tomography; ADC, adenocarcinoma; ACT, adjuvant chemotherapy.
The recruitment pathway of this study. CT, computer tomography; ADC, adenocarcinoma; ACT, adjuvant chemotherapy.
Follow-up
Patients were followed up from the date of surgery every 6 months within the first 2 years, and then every 12 months. The follow-up time was shortest for 36 months, longest for 87 months. The follow-up evaluation included physical condition, thoracic CT examination (brain CT scan and abdominal ultrasound if required), serum tumor markers and postsurgical therapy. The endpoint, disease-free survival (DFS) was defined as the time from the date of surgery to the date of event or being censored. Tumor locoregional relapse, distant metastasis or death was regarded as the endpoint event. Patients last known to be alive free from recurrence were censored.
Imaging acquisition and radiomics analysis
All subjects underwent preoperative contrast-enhanced CT examination by the multi-slice spiral CT scanners of GE Discovery CT 750 HD and TOSHIBA Aquilion One. The scanning parameters across different scanners or vendors always keep consistent as revealed in Appendix 1.An open-source software, ITK-SNAP (version 3.8.0) was utilized to perform images segmentation. Two thoracic radiologists (M.X.L. and Z.W., with 8 and 3 years of experience) manually delineated the boundaries of tumors as region of interests on axial CT images in lung window (width, 1,500 HU; level, −500 HU) slice-by-slice independently without the knowledge of the baseline characteristics. Each region of interest was drawn close to but not beyond the boundaries of tumors as far as possible, avoiding bronchus, vessels and necrosis in tumors. The radiomics feature extraction from the arterial phase (30 s after contrast agent injection) images was conducted with a free open-source package of PyRadiomic (available at https://pyradiomics.readthedocs.io/en/latest/index.html) implemented in platform of Python (version 3.6.2), which is a popular programming language for scientific computation based on robust ecosystem and could be available on any system (22).
Features selection and radiomics signature development
The interclass correlation coefficients (ICCs) were respectively calculated to assess the interobserver agreement. To ensure the stability of features, radiomics features with ICC <0.8 were eliminated from the radiomics dataset. Then we calculated the Spearman correlation coefficient of paired features and excluded the redundant features with Spearman correlation coefficient >0.7. We conducted data normalization with the approach of Z-score to achieve a standard distribution of feature value, the rationale of which is transforming different magnitude data into unified measurement for comparison. The least absolute shrinkage and selection operator (LASSO) Cox regression algorithm with 10-fold cross validation was utilized to select the radiomics features with non-zero coefficients. Then we developed a radiomics signature denoted as radiomics score (Rad-score) by a linear combination of the selected features weighted by corresponding LASSO coefficients. The Rad-score was binarized at cut-off point to stratify patients into high Rad-score and low Rad-score subgroups in training cohort. The X-tile software (version 3.6.1, Yale University, USA) was used to determine the cut-off point, where survival between two divided subgroups were best separated with the maximum chi-square value in log-rank test. Subsequently, the survival difference between high Rad-score and low Rad-score subgroup was assessed in validation cohort using Kaplan-Meier curves with log-rank tests. The prognostic performance of radiomics signature was analyzed in both training cohort and validation cohort.
The clinical nomogram and clinical-radiomics nomogram construction
Univariate and multivariate Cox regression analysis were used to identify independent clinical risk predictors associated with DFS. Then the clinical risk predictors were incorporated to construct a clinical nomogram based on multivariate Cox proportional hazards analysis with regression coefficients. Similarly, a clinical-radiomics nomogram was constructed integrating clinical risk predictors along with radiomics signature.The Harrell concordance index (C-index) was calculated to quantify the discriminative ability of prediction models. To quantify the incremental value of radiomics signature in predictive accuracy, we also calculated the integrated discrimination improvement (IDI) and net reclassification index (NRI). The two assessment indexes, as an effective supplement to area under the receiver operating characteristic curve/C-index, were used to estimate performance improvement for models when introducing a new variable (23). The calibration curves for nomograms were depicted to assess the agreement between predicted probabilities with nomograms and actual observed probabilities. The decision curve analysis was conducted to exhibit the clinical utility of prediction models by assessing the overall benefits across a range of reasonable diagnostic thresholds in comprehensive consideration of insufficient treatment (false negative) and excessive treatment (false positive) (24).
Clinical utility of nomogram in estimating survival benefit of ACT
To estimate survival benefit of ACT, risk score (RS) for clinical-radiomics nomogram was calculated with a linear polynomial of corresponding risk predictors weighted by respective Cox regression coefficients. Patients were categorized into high RS and low RS subgroup by the optimal threshold determined by the X-tile. Then DFS of all patients between patients receiving surgery with ACT and without ACT in low RS or high RS subgroup was analyzed using Kaplan-Meier curves and Cox proportional hazards regression analysis.
Statistical analysis
The statistical analysis was performed in R software (version 3.5.3) and SPSS statistics (IBM, ver. 26.0). The normality of data distribution was assessed with the Shapiro-Wilk test. Continuous variables (tumor size, follow-up time and radiomics signature) were presented as medians (inter-quartiles). Categorical variables (gender, age, smoking history, pack-year, histological grade, CEA status, Ki-67 LI, pathological TNM stage, EGFR mutation status and therapy) were presented as numbers (percentages). The continuous variables between two groups (training cohort and validation cohort; high RS subgroup and low RS subgroup; patients receiving surgery alone and surgery plus ACT) were compared using the Mann-Whitney U test. For categorical variables, comparisons of two groups were performed using the Chi-square test or Fisher’s exact test. The DFS rates of patients receiving surgery alone and surgery plus ACT were compared with the Chi-square test or Fisher’s exact test. The variables significantly associated with DFS in univariate Cox regression were then analyzed in multivariate Cox regression with a method of forward stepwise to assess whether variables were independently associated with DFS. The two-sided P value less than 0.05 indicated a significant difference.
Results
Baseline characteristics
All baseline characteristics between training cohort (n=186) and validation cohort (n=124) were similarly distributed as revealed in . Of the 310 eligible patients, 156 (50.3%) were male patients and the median age was 59 years. Totally, 67 (21.6%) patients were subjected to a recurrence or death: locoregional relapse and lung metastasis occurred in seven and thirty-one patients, respectively; nine, seven, three and two patients experienced brain, bone, liver, and pleural metastases, respectively; four patients experienced cervical or abdominal lymph node metastases; concurrent lung and brain metastases occurred in three patients; one patient died from primary hepatic carcinoma. The range of DFS of all patients was 4 to 84 months and the median (inter-quartile) DFS was 45.5 (36.0–54.0) months. The DFS of patients in training cohort [median (inter-quartile): 43.5 (36,54)] and validation cohort [48 (36,54.75)] were balanced (P=0.834). In total, 226 (72.9%) patients underwent surgery alone and 84 (27.1%) patients received postsurgical ACT.
Table 1
The baseline characteristics of patients with node-negative, early-stage lung adenocarcinoma in training cohort and validation cohort
Baseline characteristics
Training cohort (n=186)
Validation cohort (n=124)
P value
Gender
0.100
Male
86 (46.2%)
70 (56.5%)
Female
100 (53.8%)
54 (43.5%)
Age (year)
0.699
<65
153 (82.3%)
99 (79.8%)
≥65
33 (17.7%)
25 (20.2%)
Smoking history
0.179
Yes
70 (37.6%)
57 (46.0%)
No
116 (62.4%)
67 (54.0%)
Pack-year
0.176
≤3
123 (66.1%)
69 (55.6%)
3–40
34 (18.3%)
30 (24.2%)
>40
29 (15.6%)
25 (20.2%)
Histological grade
0.976
Low (LPA)
44 (23.7%)
28 (22.6%)
Intermediate (APA + PPA)
117 (62.9%)
79 (63.7%)
High (SPA + MPA)
25 (13.4%)
17 (13.7%)
CEA status (μg/L)
0.911
<5
145 (78.0%)
96 (77.4%)
≥5
41 (22.0%)
28 (22.6%)
Ki-67 LI
0.621
<10%
73 (39.2%)
43 (34.7%)
10–29%
74 (39.8%)
56 (45.2%)
≥30%
39 (21.00%)
25 (20.1%)
Tumor size (cm)‡
2.5 (1.9, 3.4)
2.8 (2.1, 3.5)
0.064
Pathological TNM stage
0.577
IA (≤3 cm)
120 (64.5%)
73 (58.9%)
IB (3–4 cm)
42 (22.6%)
28 (22.6%)
IIA (4–5 cm)
14 (7.5%)
14 (11.3%)
IIB (5–7 cm)
10 (5.4%)
9 (7.2%)
EGFR mutation status
0.708
EGFR mutation
56 (30.1%)
38 (30.6%)
EGFR wild type
22 (11.8%)
11 (8.9%)
Unknown
108 (58.1%)
75 (60.5%)
Follow-up time‡
48.0 (39.0–55.0)
48.0 (42.0, 56.0)
0.227
Therapy
0.880
Surgery alone
138 (74.2%)
88 (71.0%)
Surgery plus ACT
48 (25.8%)
36 (29.0%)
Unless otherwise stated, data were presented as numbers (percentages) and compared using the Chi-square test. ‡, data were presented as medians (inter-quartiles) and compared using the Mann-Whitney U test. Histological grade, Histological grade is the prognostic classification of lung adenocarcinoma predominant growth patterns, which separates invasive lung adenocarcinoma into three prognostic grades: low grade, intermediate grade and high grade; LPA, lepidic predominant adenocarcinoma; APA, acinar predominant adenocarcinoma; PPA, papillary predominant adenocarcinoma; SPA, solid predominant adenocarcinoma; MPA, micropapillary predominant adenocarcinoma; CEA, carcinoembryonic antigen; LI, labeling index; ACT, adjuvant chemotherapy; EGFR, epidermal growth factor receptor; TNM, tumor-node-metastasis.
Unless otherwise stated, data were presented as numbers (percentages) and compared using the Chi-square test. ‡, data were presented as medians (inter-quartiles) and compared using the Mann-Whitney U test. Histological grade, Histological grade is the prognostic classification of lung adenocarcinoma predominant growth patterns, which separates invasive lung adenocarcinoma into three prognostic grades: low grade, intermediate grade and high grade; LPA, lepidic predominant adenocarcinoma; APA, acinar predominant adenocarcinoma; PPA, papillary predominant adenocarcinoma; SPA, solid predominant adenocarcinoma; MPA, micropapillary predominant adenocarcinoma; CEA, carcinoembryonic antigen; LI, labeling index; ACT, adjuvant chemotherapy; EGFR, epidermal growth factor receptor; TNM, tumor-node-metastasis.
Radiomics signature construction and validation
A total of 851 radiomics features were extracted from the delineated three-dimensional volume of interests (Table S1). After excluding the radiomics features with ICC <0.80, 704 features were retained. The redundant features with Spearman correlation coefficient >0.7 were then removed and the remaining 42 features were reserved to be normalized by Z-score and selected by LASSO Cox regression algorithm. Eventually, thirteen nonzero coefficient features were used to construct radiomics signature () and their regression coefficients were depicted in . The detailed formula for calculating the Rad-score was showed in Appendix 2. The optimal threshold of Rad-score generated by the X-tile was 0.169, which classified patients into high Rad-score and low Rad-score subgroup. Patients with high Rad-score (>0.169) had poorer survival than those with low Rad-score (≤0.169). Kaplan-Meier curves for two subgroups were significantly separate in training cohort (HR: 13.45; 95% CI: 5.25, 34.42; P<0.001) and a significant survival difference was also observed in validation cohort [HR: 6.51; 95% CI: 2.64, 16.08; P<0.001; (A1,A2)]. When all patients were stratified by gender, age level, smoking history, CEA status, Ki-67 LI and tumor size (4 cm), the survival differences between two subgroups were also significant (all P<0.05; ). The radiomics signature achieved a C-index of 0.779 (95% CI: 0.719, 0.840) in training cohort and 0.801 (95% CI: 0.713, 0.890) in validation cohort.
Figure 2
CT-derived radiomics feature selection using the LASSO Cox regression algorithm. (A) The optimal tuning parameter (λ) was selected from LASSO Cox regression analysis with 10-fold cross validation. The curve of partial likelihood deviation versus log (λ) was plotted. The optimal λ value was 0.026 at the log (λ) value of −3.657 where the left dotted vertical lines intersected x-axis; (B) the optimal radiomics features set with non-zero coefficients were obtained at the log (λ) values of −3.657 where the left solid vertical lines intersected x-axis; (C) the respective LASSO coefficients for the thirteen radiomics features with non-zero coefficients. CT, computer tomography; LASSO, least absolute shrinkage and selection operator.
Figure 3
The Kaplan-Meier curves for high Rad-score subgroup versus low Rad-score subgroup. The Kaplan-Meier curves for high Rad-score subgroup versus low Rad-score subgroup based on radiomics signature in training cohort (A1) and validation cohort (A2); the Kaplan-Meier curves in all patients stratified by gender, age level, smoking history, CEA status, Ki-67 LI, and tumor size (4 cm) (B1-G2). CEA, carcinoembryonic antigen; LI, labeling index.
CT-derived radiomics feature selection using the LASSO Cox regression algorithm. (A) The optimal tuning parameter (λ) was selected from LASSO Cox regression analysis with 10-fold cross validation. The curve of partial likelihood deviation versus log (λ) was plotted. The optimal λ value was 0.026 at the log (λ) value of −3.657 where the left dotted vertical lines intersected x-axis; (B) the optimal radiomics features set with non-zero coefficients were obtained at the log (λ) values of −3.657 where the left solid vertical lines intersected x-axis; (C) the respective LASSO coefficients for the thirteen radiomics features with non-zero coefficients. CT, computer tomography; LASSO, least absolute shrinkage and selection operator.The Kaplan-Meier curves for high Rad-score subgroup versus low Rad-score subgroup. The Kaplan-Meier curves for high Rad-score subgroup versus low Rad-score subgroup based on radiomics signature in training cohort (A1) and validation cohort (A2); the Kaplan-Meier curves in all patients stratified by gender, age level, smoking history, CEA status, Ki-67 LI, and tumor size (4 cm) (B1-G2). CEA, carcinoembryonic antigen; LI, labeling index.
Construction and performance of clinical nomogram and clinical-radiomics nomogram
To avoid multicollinearity in regression equation, we excluded pathological TNM stage from regression analysis due to a strong correlation with tumor size. EGFR mutation status did not be involved in analysis given the excessive missing records in gene mutation detection. Finally, histological grade (high grade: HR: 6.420; 95% CI: 2.575, 16.006; P<0.001; intermediate grade: HR: 2.113; 95% CI: 0.887, 5.003; P=0.091; low grade: reference) and tumor size (HR: 1.687; 95% CI: 1.395, 2.040; P<0.001) as the independent clinical risk predictors in multivariate Cox regression analysis were incorporated to construct the clinical nomogram. Histological grade (high grade: HR: 3.343; 95% CI: 1.300, 8.595; P=0.012; intermediate grade: HR: 1.554; 95% CI: 0.643, 3.756; P=0.328; low grade: reference), tumor size (HR: 1.295; 95% CI: 1.038, 1.615; P=0.022) and radiomics signature (HR: 3.207; 95% CI: 2.105, 4.884; P<0.001) were included to construct the clinical-radiomics nomogram (, ). In training cohort, the clinical-radiomics nomogram (C-index: 0.822; 95% CI: 0.769, 0.876) showed an improvement in predictive performance over clinical nomogram (C-index: 0.792; 95% CI: 0.733, 0.850) but it did not reach statistical significance (P=0.458). The consistent results were observed in validation cohort (C-index for clinical-radiomics nomogram: 0.802; 95% CI: 0.716, 0.888; C-index for clinical nomogram: 0.792; 95% CI: 0.733, 0.850; P=0.654). However, with the incorporation of radiomics signature into clinical-radiomics nomogram, the predictive accuracy significantly improved compared to clinical nomogram in terms of IDI (0.082; 95% CI: 0.002, 0.178; P=0.040) and NRI (0.261; 95% CI: 0.002, 0.485; P=0.040) in training cohort. The incremental benefit with the integration of radiomics signature was also significant in validation cohort (IDI =0.114; 95% CI: 0.026, 0.209; P=0.007) though NRI was not but approximately significant (0.395; 95% CI: −0.008, 0.613, P=0.053). Furthermore, the calibration curve showed that the probabilities of 3-year DFS estimated with clinical-radiomics nomogram were more agreement with actual observed probabilities (). Decision curve analysis showed that clinical-radiomics nomogram had a higher net benefit than radiomics signature or clinical nomogram across the vast majority of the threshold probability of 0–0.6 ().
Table 2
Univariate and multivariate analysis of clinical characteristics and radiomics signature in patients with node-negative, early-stage lung adenocarcinoma
The constructed nomograms with their corresponding calibration curves. The clinical nomogram (A) and clinical-radiomics nomogram (B) for predicting the probabilities of 1-, 2-, and 3-year DFS. The calibration curves for clinical nomogram (C) and clinical-radiomics nomogram (D) depict the agreement between the predicted probabilities of 3-year DFS with the nomogram and the actual observed probabilities of 3-year DFS. The x-axis indicates the estimated probabilities of 3-year DFS with the nomogram and the y-axis indicates the actual observed probabilities of 3-year DFS. The diagonal dotted line plots the nomogram predicted results fitting perfectly the actual observed results. The solid line plots the performance of the constructed nomogram. The solid line more closely fitting the diagonal dotted line represents a better predictive performance. APA, acinar predominant adenocarcinoma; PPA, papillary predominant adenocarcinoma; SPA, solid predominant adenocarcinoma; MPA, micropapillary predominant adenocarcinoma; LPA, lepidic predominant adenocarcinoma; DFS, disease free survival.
Figure 5
Decision curve analysis of different models. The x-axis represents the threshold probability. The y-axis represents the net benefits in comprehensive consideration of insufficient treatment (false negative) and excessive treatment (false positive) across the threshold probability of 0–0.6. Decision curve analysis reveals that clinical-radiomics nomogram had a higher net benefit than the radiomics signature or clinical nomogram in clinical practice.
CEA, carcinoembryonic antigen; HR, hazard ratio; CI, confidence interval; LI, labeling index; ACT, adjuvant chemotherapy.The constructed nomograms with their corresponding calibration curves. The clinical nomogram (A) and clinical-radiomics nomogram (B) for predicting the probabilities of 1-, 2-, and 3-year DFS. The calibration curves for clinical nomogram (C) and clinical-radiomics nomogram (D) depict the agreement between the predicted probabilities of 3-year DFS with the nomogram and the actual observed probabilities of 3-year DFS. The x-axis indicates the estimated probabilities of 3-year DFS with the nomogram and the y-axis indicates the actual observed probabilities of 3-year DFS. The diagonal dotted line plots the nomogram predicted results fitting perfectly the actual observed results. The solid line plots the performance of the constructed nomogram. The solid line more closely fitting the diagonal dotted line represents a better predictive performance. APA, acinar predominant adenocarcinoma; PPA, papillary predominant adenocarcinoma; SPA, solid predominant adenocarcinoma; MPA, micropapillary predominant adenocarcinoma; LPA, lepidic predominant adenocarcinoma; DFS, disease free survival.Decision curve analysis of different models. The x-axis represents the threshold probability. The y-axis represents the net benefits in comprehensive consideration of insufficient treatment (false negative) and excessive treatment (false positive) across the threshold probability of 0–0.6. Decision curve analysis reveals that clinical-radiomics nomogram had a higher net benefit than the radiomics signature or clinical nomogram in clinical practice.
Predictive ability of clinical-radiomics nomogram in ACT benefit
The RS for clinical-radiomics nomogram was calculated for each patient with the following formula: RS = 0.0 × 0 (if histological grade: low) + 0.441 × 1 (if histological grade: intermediate) + 1.207 × 1 (if histological grade: high) + 0.258 × tumor size + 1.165 × radiomics signature. We stratified patients into high RS and low RS subgroup by the optimal threshold of 1.570. The survival differences between high RS subgroup (≥1.570) and low RS subgroup (<1.570) was significant both in training cohort (HR: 15.45; 95% CI: 6.03, 39.57; P<0.001) and validation cohort (HR: 5.49; 95% CI: 2.33, 12.94; P<0.001; ). indicated that male smokers with larger pack-year, CEA ≥5 µg/L, higher histological grade and Ki-67 LI, larger tumor size, EGFR wild type and receiving postsurgical ACT were prone to harbor high RS. Furthermore, patients with EGFR mutation harbored significantly lower RS than those with EGFR wild type (P=0.013). One-year, two-year and three-year DFS rates of patients in low RS subgroup were significantly higher than those in high RS subgroup (all P<0.001; ).
Figure 6
Kaplan-Meier curves in clinical-radiomics nomogram. DFS of patients with high RS versus patients with low RS based on clinical-radiomics nomogram in training cohort (A) and validation cohort (B); DFS of all patients treated with surgery plus ACT versus treated with surgery alone in low RS subgroup (C) and high RS subgroup (D). DFS, disease free survival; RS, risk score; ACT, adjuvant chemotherapy.
Table 3
The distribution of clinical characteristics and radiomics signature in high RS and low RS subgroups based on clinical-radiomics nomogram
Characteristics
High RS subgroup (n=119)
Low RS subgroup (n=191)
P value
Gender
0.007
Male
72 (60.5%)
84 (44.0%)
Female
47 (39.5%)
107 (56.0%)
Age (year)
0.944
<65
96 (80.7%)
156 (81.7%)
≥65
23 (19.3%)
35 (18.3%)
Smoking status
0.003
Yes
62 (52.1%)
65 (34.0%)
No
57 (47.9%)
126 (66.0%)
Pack-year
<0.001
≤3
59 (49.6%)
133 (69.6%)
3–40
26 (21.8%)
38 (19.9%)
>40
34 (28.6%)
20 (10.5%)
Histological grade
<0.001
Low
6 (5.0%)
66 (34.6%)
Intermediate
73 (61.4%)
123 (64.4%)
High
40 (33.6%)
2 (1.0%)
CEA status (μg/L)
<0.001
<5
73 (61.3%)
168 (88.0%)
≥5
46 (38.7%)
23 (12.0%)
Ki-67 LI
<0.001
<10%
21 (17.7%)
95 (49.7%)
10–29%
50 (42.0%)
80 (41.9%)
≥30%
48 (40.3%)
16 (8.4%)
Tumor size (cm)‡
3.50 (2.90, 4.35)
2.20 (1.65, 2.70)
<0.001
Pathological TNM stage
<0.001
IA (≤3 cm)
34 (28.6%)
159 (83.2%)
IB (3–4 cm)
42 (35.3%)
28 (14.7%)
IIA (4–5 cm)
24 (20.1%)
4 (2.1%)
IIB (5–7 cm)
19 (16.0%)
0 (/)
EGFR mutation status
0.017
EGFR mutation
36 (30.3%)
58 (30.4%)
EGFR wild type
20 (16.8%)
13 (6.8%)
Unknown
63 (52.9%)
120 (62.8%)
Radiomics signature‡
0.589 (0.349, 0.927)
−0.271 (−0.751, −0.033)
<0.001
Therapy
0.010
Surgery alone
77 (64.7%)
149 (78.0%)
Surgery plus ACT
42 (35.3%)
42 (22.0%)
Unless otherwise stated, data were presented as numbers (percentages) and compared using the Chi-square test or Fisher’s exact test. ‡, data were presented as medians (inter-quartiles) and compared using the Mann-Whitney U test. RS, risk score; CEA, carcinoembryonic antigen; LI, labeling index; TNM, tumor-node-metastasis; EGFR, epidermal growth factor receptor; ACT, adjuvant chemotherapy.
Table 4
The disease-free survival rates at 1-, 2-, and 3-year of patients treated with surgery versus surgery plus ACT in high/low RS subgroup according to clinical-radiomics nomogram
DFS rate, n/N (%)
High RS subgroup
Low RS subgroup
P value*
Total (N=119)
Surgery (N=77)
Surgery plus ACT (n=42)
P value†
Total (N=191)
Surgery (N=149)
Surgery plus ACT (N=42)
P value‡
At 1-year
107/119 (89.9%)
68/77 (88.3%)
39/42 (92.9%)
0.639
190/191 (99.5%)
148/149 (99.3%)
42/42 (100%)
1.000
<0.001
At 2-year
90/119 (75.6%)
55/77 (71.4%)
35/42 (83.3%)
0.148
185/191 (96.9%)
144/149 (96.6%)
41/42 (97.6%)
1.000
<0.001
At 3-year
73/119 (61.3%)
41/77 (53.2%)
32/42 (76.2%)
0.014
182/191 (95.3%)
141/149 (94.6%)
41/42 (97.6%)
0.693
<0.001
All comparisons were performed using the Chi-square test or Fisher’s exact test. †, the comparison of DFS rates between patients treated with surgery and surgery plus ACT in high RS subgroup; ‡, the comparison of DFS rates between patients treated with surgery and surgery plus ACT in low RS subgroup; *, the comparison of DFS rates between patients in high RS subgroup and low RS subgroup. ACT, adjuvant chemotherapy; RS, risk score; DFS, disease free survival.
Kaplan-Meier curves in clinical-radiomics nomogram. DFS of patients with high RS versus patients with low RS based on clinical-radiomics nomogram in training cohort (A) and validation cohort (B); DFS of all patients treated with surgery plus ACT versus treated with surgery alone in low RS subgroup (C) and high RS subgroup (D). DFS, disease free survival; RS, risk score; ACT, adjuvant chemotherapy.Unless otherwise stated, data were presented as numbers (percentages) and compared using the Chi-square test or Fisher’s exact test. ‡, data were presented as medians (inter-quartiles) and compared using the Mann-Whitney U test. RS, risk score; CEA, carcinoembryonic antigen; LI, labeling index; TNM, tumor-node-metastasis; EGFR, epidermal growth factor receptor; ACT, adjuvant chemotherapy.All comparisons were performed using the Chi-square test or Fisher’s exact test. †, the comparison of DFS rates between patients treated with surgery and surgery plus ACT in high RS subgroup; ‡, the comparison of DFS rates between patients treated with surgery and surgery plus ACT in low RS subgroup; *, the comparison of DFS rates between patients in high RS subgroup and low RS subgroup. ACT, adjuvant chemotherapy; RS, risk score; DFS, disease free survival.In this study, patients receiving postsurgical ACT were prone to harbor higher Ki-67 LI and radiomics signature, larger tumor size, and be in high RS subgroup (Table S2). Interestingly, ACT was significantly associated with survival benefit in high RS subgroup (HR: 0.489; 95% CI: 0.262, 0.911; P=0.024), whereas ACT was not beneficial to patients in low RS subgroup (HR: 0.324; 95% CI: 0.042, 2.510; P=0.281; ). Furthermore, 3-year DFS rates for patients treated with surgery alone (53.2%) and surgery plus ACT (76.2%) were incomparable in high RS subgroup (P=0.014), but no significant beneficial effect of ACT on DFS rates was observed in low RS subgroup (all P>0.05; ). These findings suggested that the clinical-radiomics nomogram may confer a role in identifying patients who were likely to acquire benefit from ACT.
Discussion
In this study, we developed and validated a radiomics signature reliably prognostic of DFS for node-negative, early-stage ADC. This noninvasive radiomics signature, readily obtained from routine presurgical thoracic-CT images, was independently associated with DFS, even after adjustment for clinical characteristics. Moreover, the radiomics signature exhibited an incremental value with respect to clinical risk predictors of tumor size and histological grade in individualized risk prediction. The integrated clinical-radiomics nomogram not only exhibited a desirable performance in risk stratification but harbored the potential to identify potential candidates for ACT in patients with node-negative, early-stage ADC.The thirteen-feature based radiomics signature yielded a C-index of 0.801 in validation cohort, which exhibited a better prognostic accuracy than a previous study by Choe et al. (C-index: 0.734) in predicting DFS for ADC (25). An explanation may be that radiomics signature in this study was derived from feature sets including wavelet-transformed features, which offered comprehensive spatial and frequency distribution of voxel-grey levels by decomposing images into eight volumetric images with a high-pass or low-pass filter (26,27). The benefit of wavelet-transformed features in enhancing model performance was also demonstrated by previous studies (27,28). Furthermore, the incorporation of radiomics signature into clinical-radiomics nomogram showed a substantial improvement compared to clinical nomogram in terms of NRI and IDI. The clinical-radiomics nomogram exhibited a better calibration as well as a superior clinical utility from decision curve analysis, which further lent support to the incremental value of radiomics signature over clinical risk predictors for predicting prognosis in node-negative, early-stage ADC.In the developed radiomics signature, the best performing feature “first order_90th Percentile”, which probably indicates intratumor solid component, consistently revealed the prognostic potential for DFS in a prior study (25). The other predominant features such as glcm_Imc2, and glszm_ZoneEntropy, glcm_Correlation, ngtdm_Contrast, and glcm_ ClusterShade were derived from original and multiscale wavelets families, respectively. These texture features could quantify intratumor heterogeneity and reflect tumor phenotype characteristics at radiological level (29,30), which have been used as prognosticators for ADC (31-33). Radiogenomics provided insight into the underlying biological mechanism of radiomics approach and revealed the association of genetic phenotype and biological behavior with radiomics features. Series studies reported that prognostic radiomics features were closely associated with cell atypia, angiogenesis or metabolic pathways to facilitate tumor growth (34,35). Given that, it is understandable that the synthetic radiomics signature could take on the prognostic potential.For node-negative NSCLC, tumor size was the predominant determinant of survival as well as the reference criterion for directing ACT decision-making (10). However, many studies reported controversial results regarding the selection of candidates for ACT (10,12,36,37). Strauss et al. initially reported that ACT improved DFS and overall survival in patients with node-negative tumors 4 cm or greater in a randomized clinical trial (36), whereas Morgensztern et al. suggested a survival benefit associated with ACT in all node-negative tumors 3–4 cm (37). In a retrospective study, ACT exhibited a survival advantage in high-risk (micropapillary predominant) patients with node-negative small-size (≤3 cm) lung ADC (12). Contrary to this, a more recent large-sample study failed to observe a beneficial effect of ACT in patients with 3 cm or smaller irrespective of the presence of clinicopathological risk factors (10). Because of these controversial findings and the non-trivial recurrence for patients with early-stage NSCLC, it is imperative to exploit effective methods to refine the indication for ACT. Currently, several studies have applied the radiomics approach to predict therapeutic response in various types of cancer (38-40). Jiang et al. established and validated a CT-derived radiomics signature to evaluate the immune microenvironment and predict immunotherapeutic effect in gastric cancer (38). Montemezzi et al. retrospectively analyzed the magnetic resonance images of 60 patients with breast cancer and found that radiomics features substantially improved performance of model in predicting complete response to neoadjuvant chemotherapy (39). In this study, we stratified patients into different subgroups based on RS for clinical-radiomics model. We found that male smokers with larger pack-year, CEA ≥5 µg/L, higher histological grade and Ki-67 LI, larger tumor size and EGFR wild type were likely to harbor high RS. The patients with high RS significantly acquired benefit from ACT whereas no distinct survival benefit from ACT was observed in patients with low RS, suggesting that the clinical-radiomics model may confer a role in identifying potential beneficiaries of ACT among node-negative, early-stage ADC. On account of the retrospective nature of data acquisition, the time to initiate ACT following surgery and treatment regimens for individuals were hard to be uniform. Therefore, prospective, randomly-controlled trials are warranted to confirm the clinical utility of clinical-radiomics model in predicting benefit of ACT. This exploratory study might not direct therapeutic decision-making instantly, but the developed model could provoke thoughts on risk stratification by integrating routine clinicopathological data and readily-accessed imaging predictors, not only assisting in guiding appropriate patients in benefiting from ACT but in avoiding toxic effects and side reaction of ACT in potential no-responders.There are several limitations to the present study. First, the reproducibility of radiomics features is an ongoing issue that we need to address. Multi-vendor CT scanners and different acquisition parameters in external datasets were required to verify the reproducibility of the developed radiomics signature. Second, considering this radiomics feature extraction from contrast-enhanced CT images, the variation of radiomics signature in non-enhanced CT images should be tested in further work. Third, the analysis on the development of radiomics signature prognostic of overall survival should be supplemented in future research. Last, gene mutation status was excluded from cox regression analysis due to the excessive missing records in gene mutation detection. The prognostic role of genetic mutation and the association of radiomics signature with genetic mutation as well as other molecular profiles should be explored in future work.In conclusion, radiomics signature offered a complementary role to clinical risk predictors of tumor size and histological grade in predicting prognosis in node-negative, early-stage ADC. The clinical-radiomics nomogram has the potential to conduct risk stratification and identify the potential beneficiaries of ACT among patients with node-negative, early-stage ADC. On the premise of validation by large sample-size external cohorts, the approach of radiomics signature in combination with clinical risk predictors is expected to be a helpful tool for clinicians to make up individualized therapeutic schedule.The article’s supplementary files as
Authors: Timothy Winton; Robert Livingston; David Johnson; James Rigas; Michael Johnston; Charles Butts; Yvon Cormier; Glenwood Goss; Richard Inculet; Eric Vallieres; Willard Fry; Drew Bethune; Joseph Ayoub; Keyue Ding; Lesley Seymour; Barbara Graham; Ming-Sound Tsao; David Gandara; Kenneth Kesler; Todd Demmy; Frances Shepherd Journal: N Engl J Med Date: 2005-06-23 Impact factor: 91.245
Authors: Gavitt A Woodard; Sue X Wang; Johannes R Kratz; Clara T Zoon-Besselink; Chun-Yuan Chiang; Matthew A Gubens; Thierry M Jahan; Collin M Blakely; Kirk D Jones; Michael J Mann; David M Jablons Journal: Clin Lung Cancer Date: 2017-05-31 Impact factor: 4.785
Authors: Femke C R Staal; Denise J van der Reijd; Marjaneh Taghavi; Doenja M J Lambregts; Regina G H Beets-Tan; Monique Maas Journal: Clin Colorectal Cancer Date: 2020-11-07 Impact factor: 4.481