| Literature DB >> 34164467 |
Chenyu Song1, Mingyu Wang2, Yanji Luo1, Jie Chen3, Zhenpeng Peng1, Yangdi Wang1, Hongyuan Zhang2, Zi-Ping Li1, Jingxian Shen4, Bingsheng Huang2, Shi-Ting Feng1.
Abstract
BACKGROUND: To establish and validate a prediction model for pancreatic neuroendocrine neoplasms (pNENs) recurrence after radical surgery with preoperative computed tomography (CT) images.Entities:
Keywords: Pancreatic neuroendocrine neoplasms (pNENs); deep learning radiomics (DLR); survival analysis
Year: 2021 PMID: 34164467 PMCID: PMC8184461 DOI: 10.21037/atm-21-25
Source DB: PubMed Journal: Ann Transl Med ISSN: 2305-5839
Figure 1Flow chart of the study design. Computed tomography (CT) images were obtained in the unenhanced, arterial, and venous phases. Data from Hospital I were used to establish the prediction models (radiologist assessment, radiomics, and deep learning radiomics). Then, the external group from Hospital II was used to validate the prediction models. After the optimal prediction model had been selected, clinical indicators were added to observe changes in the predictive performance of this optimal model. In addition, an optimum model-based risk stratification model was established to explore its survival predictive potential.
Figure 2Data filtering procedure. pNEN, pancreatic neuroendocrine neoplasm; MEN, multiple endocrine neoplasia; CT, computed tomography.
Accuracy, sensitivity, specificity, and AUC values of the radiomics models for recurrence prediction (56 patients from Hospital I and 18 patients from Hospital II)
| Models | Hospital I (internal data set) | Hospital II (independent data set) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| ACC | SEN | SPC | AUC | P | ACC | SEN | SPC | AUC | P | ||
| Radiomics-A | 0.75 | 0.70 | 0.76 | 0.74 | 0.020 | 0.44 | 0.11 | 0.78 | 0.56 | 0.691 | |
| Radiomics-V | 0.71 | 0.80 | 0.70 | 0.68 | 0.083 | 0.44 | 0.33 | 0.56 | 0.52 | 0.965 | |
| Radiomics-A&V | 0.71 | 0.80 | 0.70 | 0.70 | 0.044 | 0.56 | 0.22 | 0.89 | 0.56 | 0.691 | |
The threshold of the predictive probability used to calculate ACC, SEN, and SPC was the highest Youden index of the cross-validation ROC curves for the internal data set. A P value indicates the significance level of the comparison between an AUC with that of a random case (AUC =0.5). AUC, area under the curve; ACC, accuracy; SEN, sensitivity; SPC, specificity; A, arterial; V, venous; A&V, arterial & venous.
Accuracy, sensitivity, specificity, and AUC values of the DLR models for recurrence prediction (56 patients from Hospital I and 18 patients from Hospital II)
| Models | Hospital I (internal data set) | Hospital II (independent data set) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| ACC | SEN | SPC | AUC | P | ACC | SEN | SPC | AUC | P | ||
| DLR-A | 0.71 | 0.90 | 0.67 | 0.80 | 0.003 | 0.61 | 0.55 | 0.66 | 0.77 | 0.058 | |
| DLR-V | 0.73 | 0.60 | 0.76 | 0.58 | 0.429 | 0.44 | 0.22 | 0.67 | 0.48 | 0.895 | |
| DLR-A&V | 0.71 | 0.80 | 0.70 | 0.72 | 0.034 | 0.61 | 0.44 | 0.78 | 0.64 | 0.310 | |
The threshold of the predictive probability used to calculate ACC, SEN, and SPC was the highest Youden index of the cross-validation ROC curves for the internal data set. A P value indicates the significance level of the comparison between an AUC with that of a random case (AUC =0.5). AUC, area under the curve; DLR, deep learning radiomics; ACC, accuracy; SEN, sensitivity; SPC, specificity; A, arterial; V, venous; A&V, arterial & venous.
Performance comparison between the optimal radiomics model (radiomics-A), the optimal DLR model (DLR-A), and the model based on CT findings (56 patients from Hospital I)
| Model | ACC | SEN | SPC | AUC | P |
|---|---|---|---|---|---|
| Radiomics-A | 0.75 | 0.70 | 0.76 | 0.74 | 0.020 |
| DLR-A | 0.71 | 0.90 | 0.67 | 0.80 | 0.003 |
| CT findings | 0.63 | 0.50 | 0.65 | 0.53 | 0.748 |
A P value indicates the significance level of the comparison between an AUC with that of a random case (AUC =0.5). DLR, deep learning radiomics; A, arterial; CT, computed tomography; ACC, accuracy; SEN, sensitivity; SPC, specificity; AUC, area under the curve.
Accuracy, sensitivity, specificity, and AUC values of the DLR-A recurrence prediction model with added clinical information (56 patients from Hospital I)
| Model | ACC | SEN | SPC | AUC | Pa | Pb | ||
|---|---|---|---|---|---|---|---|---|
| DLR-A + s | DLR-A + sa | DLR-A + sag | ||||||
| DLR-A | 0.71 | 0.90 | 0.67 | 0.80 | 0.003 | 0.413 | 0.822 | 0.680 |
| DLR-A + s | 0.71 | 0.80 | 0.70 | 0.75 | 0.015 | – | 0.459 | 0.108 |
| DLR-A + sa | 0.76 | 0.90 | 0.73 | 0.79 | 0.004 | – | – | 0.483 |
| DLR-A + sag | 0.80 | 0.80 | 0.80 | 0.83 | 0.001 | – | – | – |
a, a P value indicates the significance level of the comparison between an AUC with that of a random case (AUC =0.5). b, a P value indicates the significance level of comparison between every two AUCs. AUC, area under the curve; DLR, deep learning radiomics; A, arterial; ACC, accuracy; SEN, sensitivity; SPC, specificity; + s, symptom added; + sa, symptom and age added; + sag, symptom, age, and gender added.
Figure 3The receiver operating characteristic (ROC) of deep learning radiomics (DLR), radiomics and CT findings models in Hospital I. (A) The receiver operating characteristic (ROC) curves of the optimal radiomics (R) model (R-A), the optimal deep learning radiomics model (DLR-A), and the model based on CT findings. (B) The ROC curves of the DLR-A model with added clinical information. AUC, area under the curve.
Figure 4Survival analysis using the high- and low-risk groups according to the DLR-A model. The Kaplan-Meier analysis shows a statistically significant difference (P=0.003; log-rank test) between these groups regarding recurrence-free survival. DLR, deep learning radiomics; A, arterial.