| Literature DB >> 34820324 |
Yinghao Meng1,2, Hao Zhang1, Qi Li1, Fang Liu1, Xu Fang1, Jing Li1, Jieyu Yu1, Xiaochen Feng1, Mengmeng Zhu1, Na Li1, Guodong Jing1, Li Wang1, Chao Ma1, Jianping Lu1, Yun Bian1, Chengwei Shao1.
Abstract
PURPOSE: To develop and validate a machine learning classifier based on multidetector computed tomography (MDCT), for the preoperative prediction of tumor-stroma ratio (TSR) expression in patients with pancreatic ductal adenocarcinoma (PDAC).Entities:
Keywords: carcinoma; multidetector computed tomography; pancreatic neoplasm; prognosis; radiomics; tumor-stroma ratio
Year: 2021 PMID: 34820324 PMCID: PMC8606777 DOI: 10.3389/fonc.2021.707288
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Flow chart illustrating the patient selection process.
Figure 2Radiomics workflow.
Figure 3The Kaplan-Meier curve and log-rank test. Patients in the tumor-stroma ratio (TSR)-low group had significantly longer survival than those in the TSR-high group.
Baseline characteristics of patients with pancreatic cancer.
| Characteristics | Training set | Validation set | ||||
|---|---|---|---|---|---|---|
| TSR-low (n = 66) | TSR-high (n = 101) |
| TSR-low (n = 25) | TSR-high (n = 35) |
| |
|
| ||||||
| Sex, n (%) | 0.82 | 0.53 | ||||
| Male | 42 (63.64) | 66 (65.35) | 19 (76.00) | 24 (68.57) | ||
| Female | 24 (36.36) | 35 (34.65) | 6 (24.00) | 11 (31.43) | ||
| Age, years (meanSD) | 60.89 ± 8.91 | 60.84 ± 9.79 | 0.97 | 59.76 ± 9.77 | 60.71 ± 8.47 | 0.69 |
| BMI, kg/m2 (meanSD) | 22.68 ± 2.99 | 29.31 ± 65.35 | 0.41 | 23.58 ± 3.13 | 22.65 ± 2.73 | 0.23 |
| Operation, n (%) | 0.18 | 0.36 | ||||
| Pancreaticoduodenectomy | 35 (53.03) | 64 (63.37) | 15 (60.00) | 25 (71.43) | ||
| Distal pancreatectomy | 31 (46.97) | 37 (36.63) | 10 (40.00) | 10 (28.57) | ||
|
| ||||||
| T stage, n (%) | 0.01 | 0.03 | ||||
| T1 | 3 (4.55) | 4 (3.96) | 1 (4.00) | 1 (2.86) | ||
| T2 | 6 (9.09) | 27 (26.73) | 4 (16.00) | 16 (45.71) | ||
| T3-4 | 57 (86.36) | 70 (69.31) | 20 (80.00) | 18 (51.43) | ||
| N stage, n (%) | 0.35 | 0.43 | ||||
| N0 | 35 (53.03) | 42 (41.58) | 11 (44.00) | 12 (34.29) | ||
| N1 | 25 (37.88) | 47 (46.53) | 10 (40.00) | 20 (57.14) | ||
| N2 | 6 (9.09) | 12 (11.88) | 4 (16.00) | 3 (8.57) | ||
| Grade of differentiation, n (%) | 0.19 | 0.46 | ||||
| Well-moderately | 50 (75.76) | 67 (66.34) | 18 (72.00) | 22 (62.86) | ||
| Poorly-undifferentiated | 16 (24.24) | 34 (33.66) | 7 (28.00) | 13 (37.14) | ||
| Duodenum Invasion, n (%) | 0.93 | 0.75 | ||||
| Negative | 46 (69.70) | 71 (70.30) | 16 (64.00) | 21 (60.00) | ||
| Positive | 20 (30.30) | 30 (29.70) | 9 (36.00) | 14 (40.00) | ||
| Bile Invasion, n (%) | 0.03 | 0.06 | ||||
| Negative | 50 (75.76) | 69 (68.32) | 20 (80.00) | 20 (57.14) | ||
| Positive | 16 (24.24) | 32 (31.68) | 5 (20.00) | 15 (42.86) | ||
| LVSI n (%) | 0.19 | 1.00 | ||||
| Negative | 47 (71.21) | 62 (61.39) | 15 (60.00) | 21 (60.00) | ||
| Positive | 19 (28.79) | 39 (38.61) | 10 (40.00) | 14 (40.00) | ||
| Perineural invasion, n (%) | 0.53 | 0.51 | ||||
| Negative | 3 (4.55) | 7 (6.93) | 0 (0.00) | 2 (5.71) | ||
| Positive | 63 (95.45) | 94 (93.07) | 25 (100.00) | 33 (94.29) | ||
|
| ||||||
| CT-reported tumor size, cm (meanSD) | 3.26 ± 1.36 | 3.13 ± 1.52 | 0.56 | 3.19 ± 1.93 | 2.92 ± 0.99 | 0.48 |
| Location, n (%) | 0.18 | 0.36 | ||||
| Head | 35 (53.03) | 64 (63.37) | 15 (60.00) | 25 (71.43) | ||
| Body and tail | 31 (46.97) | 37 (36.63) | 10 (40.00) | 10 (28.57) | ||
| Pancreatitis, n (%) | 0.11 | 0.39 | ||||
| No | 62 (93.94) | 87 (86.14) | 24 (96.00) | 30 (85.71) | ||
| Yes | 4 (6.06) | 14 (13.86) | 1 (4.00) | 5 (14.29) | ||
| PD cutoff and dilation, n (%) | 0.96 | 0.84 | ||||
| No | 14 (21.21) | 21 (20.79) | 7 (28.00) | 9 (25.71) | ||
| Yes | 52 (78.79) | 80 (79.21) | 18 (72.00) | 26 (74.29) | ||
| CBD cutoff and dilation, n (%) | 0.43 | 0.53 | ||||
| No | 47 (71.21) | 66 (65.35) | 17 (68.00) | 21 (60.00) | ||
| Yes | 19 (28.79) | 35 (34.65) | 8 (32.00) | 14 (40.00) | ||
| Parenchymal atrophy, n (%) | 0.18 | 0.71 | ||||
| No | 41 (62.12) | 52 (51.49) | 16 (64.00) | 24 (68.57) | ||
| Yes | 25 (37.88) | 49 (48.51) | 9 (36.00) | 11 (31.43) | ||
| Contour abnormality, n (%) | 0.98 | 0.12 | ||||
| No | 23 (34.85) | 35 (34.65) | 12 (48.00) | 10 (28.57) | ||
| Yes | 43 (65.15) | 66 (65.35) | 13 (52.00) | 25 (71.43) | ||
| Cyst, n (%) | 0.34 | 1.00 | ||||
| No | 57 (86.36) | 92 (91.09) | 22 (88.00) | 31 (88.57) | ||
| Yes | 9 (13.64) | 9 (8.91) | 3 (12.00) | 4 (11.43) | ||
| Vascular invasion, n (%) | 0.21 | 0.36 | ||||
| No | 15 (22.73) | 32 (31.68) | 13 (52.00) | 14 (40.00) | ||
| Yes | 51 (77.27) | 69 (68.32) | 12 (48.00) | 21 (60.00) | ||
BMI, body mass index; LVSI, lymphovascular space invasion; PD , pancreatic duct; CBD, common bile duct.
Figure 4Radiomic feature selection by a parametric method, the least absolute shrinkage and selection operator (LASSO). (A) Selection of the tuning parameter (λ) in the LASSO model via 10-fold cross-validation based on minimum criteria. Binomial deviances from the LASSO regression cross-validation procedure are plotted as a function of log(λ). The y-axis indicates binomial deviances, whereas the lower x-axis indicates the log(λ). Numbers along the upper x-axis represent the average number of predictors. Red dots indicate the average deviance values for each model with a given λ. The vertical bars through the red dots depict the upper and lower values of the deviances. The vertical black lines define the optimal values of λ, where the model provides its best fit to the data. An optimal λ value of 0.036 with a log(λ) of -3.315 is selected. (B) LASSO coefficient profiles of the 25 texture features. The dotted vertical line is plotted at the value selected using 10-fold cross-validation in (A) The 12 resulting features with nonzero coefficients are indicated on the plot. (C) The error-bar chart of the 12 radiomics features and radiomics score.
The radiomics features selected by Lasso Regression.
| Phase | Prediction model | |
|---|---|---|
| Intercept | 0.421 | |
| β | Radiomics name | |
|
| ||
| -0.051 | exponential_firstorder_Median | |
| -0.053 | square_firstorder_InterquartileRange | |
| -0.001 | square_glrlm_LongRunEmphasis | |
| -0.078 | square_glrlm_LongRunHighGrayLevelEmphasis | |
| 0.252 | wavelet-LHL_glszm_SizeZoneNonUniformityNormalized | |
| 0.138 | wavelet-LHH_firstorder_Median | |
| 0.086 | wavelet-HHH_firstorder_Skewness | |
|
| ||
| 0.140 | exponential_firstorder_Median | |
| 0.176 | exponential_glrlm_ShortRunEmphasis | |
| -0.133 | wavelet-LLH_glszm_SmallAreaEmphasis | |
| -0.133 | wavelet-HHH-glszm_SizeZoneNonUniformityNormalized | |
| -0.067 | wavelet-LLL_glszm_ZoneVariance | |
Radiomics score =0.421 - 0.051 × exponential_firstorder_Median (arterial phase).
- 0.053 ×square_firstorder_InterquartileRange (arterial phase).
- 0.001 ×square_glrlm_LongRunEmphasis (arterial phase).
- 0.078 ×square_glrlm_LongRunHighGrayLevelEmphasis (arterial phase).
+ 0.252 ×wavelet-LHL_glszm_SizeZoneNonUniformityNormalized (arterial phase).
+ 0.138 ×wavelet-LHH_firstorder_Median (arterial phase).
+ 0.086 ×wavelet-HHH_firstorder_Skewness (arterial phase).
- 0.140 ×exponential_firstorder_Median (portal venous phase).
+ 0.176 ×exponential_glrlm_ShortRunEmphasis (portal venous phase).
- 0.133 ×wavelet-LLH_glszm_SmallAreaEmphasis (portal venous phase).
- 0.133 ×wavelet-HHH-glszm_SizeZoneNonUniformityNormalized (portal venous phase).
- 0.067 ×wavelet-LLL_glszm_ZoneVariance (portal venous phase).
Figure 5A comparison between patients with low and high tumor–stroma ratio (TSR). (A, B) Patient 1: A 69-year-old man with PDAC in the TSR-low group. (A) Low TSR expression (×10). (B) The axial portal-phase computed tomography (CT) image shows an infiltrative, low-attenuation mass (arrows) located at the pancreatic body and tail. (C, D) Patient 2: A case of a 42-year-old woman with PDAC in the TSR-high group. (C) High TSR expression (×10). (D) The axial portal-phase CT image shows an infiltrative, low-attenuation mass (arrows) located at the pancreatic body and tail. (E) The comparison of the 13 radiomics features between patient 1 and patient 2.
Figure 6The classification and survival prediction of the extreme gradient boosting (XGBoost) classifier. (A) Mosaic plot of the training set. (B) Mosaic plot of the validation set. (C) The survival prediction of the XGBoost classifier shows significantly longer survival for patients in the tumor–stroma ratio (TSR)-low group than those in the TSR-high group in the training set. (D) The survival prediction of the XGBoost classifier reveals significantly longer survival for patients in the tumor–stroma ratio (TSR)-low group than those in the TSR-high group in the validation set.
Figure 7The performance of the extreme gradient boosting (XGBoost) classifier. (A) Receiver operating characteristic curves of the XGBoost classifier. (B) Calibration curves of the XGBoost classifier. (C, D) Decision curve analysis for the XGBoost classifier. The red line represents the training set. The blue line represents the validation set. The gray line represents the hypothesis that all patients had high tumor–stroma ratio (TSR). The black line represents the hypothesis that all patients had low TSR. (C) The decision curves in the validation set show that the radiomics score offered greater benefit than the treat-all-patients as low TSR scheme or the treat-none as high TSR scheme in the training set with a threshold probability >0.06. (D) The prediction model offered greater benefit than the treat-all-patients as high TSR expression scheme or the treat-none as low TSR expression scheme in the validation set with a threshold probability between 0.29 and 0.63.