| Literature DB >> 34745008 |
Jingjing Li1,2, Xinxin Wu2, Ning Mao3,4,5, Guibin Zheng6, Haicheng Zhang4, Yakui Mou2,5, Chuanliang Jia2,4,5, Jia Mi7, Xicheng Song2,4,5.
Abstract
Objectives: This study aimed to develop a computed tomography (CT)-based radiomics model to predict central lymph node metastases (CLNM) preoperatively in patients with papillary thyroid carcinoma (PTC).Entities:
Keywords: central lymph node metastases; computer aided diagnosis; machine learning; papillary thyroid carcinoma; radiomics
Mesh:
Year: 2021 PMID: 34745008 PMCID: PMC8567994 DOI: 10.3389/fendo.2021.741698
Source DB: PubMed Journal: Front Endocrinol (Lausanne) ISSN: 1664-2392 Impact factor: 5.555
Figure 1Flowchart of the process of CT image segmentation and VOI construction. An example of primary tumor segmentation and VOI construction for patients with central lymph node metastasis. VOI segmentation is performed on unenhanced and contrast-enhanced computed tomography images (A). Features are extracted from the VOI of primary tumor (B), including tumor shape, intensity, and texture. The VOI of primary tumor can be shown in a 3D status (C). VOI, volume of interest.
Figure 2The flow chart of patients Recruitment process. Hospital 1 stands for patients from Yantai Yuhuangding Hospital, Hospital 2 stands for patients from the Affiliated Hospital of Binzhou Medical University.
Clinical characteristics of patients in the training and test sets.
| Characteristics | Training Set (N = 423) | Internal test set (N = 182) | External test set (N = 73) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| CLNM (+) (N = 218) | CLNM (-) (N = 205) | p-value | CLNM (+) (N = 94) | CLNM (-) (N = 88) | p-value | CLNM (+) (N = 46) | CLNM (-) (N = 27) | p-value | |
| Age, mean ± SD, years | 44± 12 | 44 ± 12 | 44± 12 | 44 ± 12 | 44 ± 12 | 44 ± 12 | |||
| < 45, No. (%) | 131 (60.09) | 84 (40.98) | <0.001 | 52 (55.32) | 41 (46.59) | 0.239 | 21 (45.65) | 12 (44.44) | 0.92 |
| ≥ 45, No. (%) | 87(39.91) | 121 (59.02) | 42 (44.68) | 47 (53.41) | 25 (54.35) | 15 (55.56) | |||
| Sex, No. (%) | |||||||||
| Male | 70 (32.11) | 45 (21.95) | <0.001 | 24 (25.53) | 19 (21.59) | 0.532 | 18 (39.13) | 2 (28.57) | 0.793 |
| Female | 148 (67.89) | 160 (78.05) | 70 (74.47) | 69 (78.41) | 28 (60.87) | 25 (71.43) | |||
| CT-maximum diameter*, No. (%) | |||||||||
| <1cm | 88 (40.37) | 120 (58.54) | <0.001 | 35 (37.23) | 55 (62.50) | <0.05 | 19 (41.30) | 12 (44.44) | 0.793 |
| ≧1cm | 130 (59.63) | 85(41.46) | 59 (62.77) | 33 (37.50) | 27 (58.70) | 5 (55.56) | |||
| CT-reported LN status, No. (%) | |||||||||
| LN-negative | 114 (52.29) | 147 (71.71) | <0.001 | 45 (47.87) | 68 (77.27) | <0.05 | 10 (21.74) | 4 (14.81) | 0.468 |
| LN-suspicious | 39 (17.89) | 45 (21.95) | 23 (24.47) | 12 (13.64) | 0 (0.00) | 0 (0.00) | |||
| LN-positive | 65 (29.82) | 13 (6.34) | 26 (27.66) | 8 (9.09) | 36 (78.26) | 23 (85.19) | |||
Data are n (%) unless otherwise indicated. No significant divergences were found between CLNM (+) and CLNM (-) in terms of age, sex, CT-maximum diameter, CT-reported LN status in the training set (p < 0.05). CT-longest diameter and CT-reported LN status were significantly different between CLNM (+) and CLNM (-) in the internal test set (p < 0.05), but not in the external test set.
CT, computed tomography; LN, lymph node; CLNM, central lymph node metastasis.
*Largest diameter of the target primary lesion.
Radiomics features selected in ANOVA and LASSO regression analysis.
| Radiomics features | Coefficient |
|---|---|
| lbp-2D_firstorder_Range_ scan | 0.007418999 |
| lbp-2D_firstorder_10Percentile_scan | 0.004015608 |
| Wavelet-HHL_glrlm_RunLengthNonUniformityNormalized_scan | -0.07654446 |
| wavelet-HLH_glszm_GrayLevelNonUniformityNormalized_scan | -0.01873799 |
| wavelet-HHL_firstorder_10Percentile_scan | 0.015804638 |
| wavelet-HLH_glszm_SmallAreaHighGrayLevelEmphasis_scan | -0.02602846 |
| wavelet-HHH_glszm_HighGrayLevelZoneEmphasis_scan | 0.001441019 |
| wavelet-HHL_firstorder_Mean_scan | 0.003108156 |
| original_shape_LeastAxisLength_AP | 0.006855814 |
| original_shape_Elongation_AP | 0.00434467 |
| original_gldm_LargeDependenceEmphasis_AP | 0.017969827 |
| original_glrlm_RunLengthNonUniformityNormalized_AP | -0.0047654 |
| logarithm_glrlm_RunLengthNonUniformityNormalized_AP | -0.00065297 |
Fourteen radiomics features with non-zero coefficients in one-way analysis of variance (ANOVA) and the least absolute shrinkage and selection operator (LASSO) logistic regression model were selected.
Figure 3Computed tomography (CT) image features selection using one-way analysis of variance (ANOVA) and the least absolute shrinkage and selection operator (LASSO) logistic regression model in the training set. (A) The five-fold cross-validation and the minimal criteria process was used to generate the optimal penalization coefficient lambda (λ) in the LASSO model. The vertical line defines the optimal values of λ, where the model provides its best fit to the data. The optimal λ value of 0.165 with -log (λ) =1.5 was selected. (B) LASSO coefficient profiles of the radiomics features. The vertical line was drawn at the value selected using five-fold cross-validation, where optimal λ resulted in 14 nonzero coefficients.
Figure 4Receiver operating characteristic (ROC) curves of the radiomics model constructed by six algorithms in the training set (A) and internal test set (B).
Figure 5Receiver operating characteristic (ROC) curves of the combined model constructed by six algorithms in the training set (A) and internal test set (B).
The performance of combined models constructed by six algorithms in the internal test set.
| AUC | 95%CI | SEN | 95%CI | SPE | 95%CI | ACC | 95%CI | |
|---|---|---|---|---|---|---|---|---|
| KNN+ Clinical | 0.605 | 0.507-0.611 | 0.777 | 0.677-0.853 | 0.341 | 0.245-0.451 | 0.566 | 0.491-0.639 |
| LR+ Clinical | 0.706 | 0.610-0.725 | 0.723 | 0.620-0.808 | 0.614 | 0.503-0.714 | 0.670 | 0.570-0.738 |
| DT+ Clinical | 0.662 | 0.538-0.655 | 0.511 | 0.406-0.614 | 0.670 | 0.561-0.765 | 0.588 | 0.513-0.660 |
| Lin-SVM+ Clinical | 0.709 | 0.634-0.786 | 0.702 | 0.600-0.790 | 0.636 | 0.526-0.734 | 0.670 | 0.600-0.738 |
| Gaus-SVM+ Clinical | 0.644 | 0.520-0.630 | 0.766 | 0.665-0.845 | 0.386 | 0.286-0.497 | 0.582 | 0.507-0.645 |
| Ploy-SVM+ Clinical | 0.663 | 0.533-0.647 | 0.691 | 0.587-0.780 | 0.489 | 0.381-0.597 | 0.593 | 0.518-0.666 |
| Clinical | 0.660 | 0.570-0.786 | 0.574 | 0.468-0.674 | 0.636 | 0.526-0.734 | 0.604 | 0.530-0.676 |
KNN, K-nearest neighbor; LR, logistic regression; DT, decision tree; Lin-SVM, linear-support vector machine; Gaus-SVM, Gaussian- support vector machine; Ploy-SVM, polynomial-SVM.
Figure 6Receiver operating characteristic (ROC) curves of the combined model (blue lines), Rad-score model (orange lines) and clinical model (green lines) in the training set (A), internal test set (B) set, and external test set (C), respectively. Calibration curves (D) of the combined model in the training and test sets, respectively. The diagonal dotted line represents an ideal prediction, while the red line represents the performance of the training set, the blue line and black line represents the performance of the internal test set and external test set, respectively, Closer fit to the diagonal dotted line indicates a better prediction.
Figure 7Decision curve analysis (DCA) of training set (A), internal test set (B), and external test set (C). The y-axis measures the net benefit. The x-axis shows the corresponding risk threshold. The grey line represents the assumption that all lesions were central lymph node metastases. The black straight line represents the assumption that all nodules were non-CLNM. If the threshold probability was more than 30%, using the combined model (black line) to predict CLNM added more benefit than the Rad-score model (red line) and clinical model (blue line) in the internal test set; If the threshold probability was more than 10%, using the combined model (black line) to predict CLNM added more benefit than the Rad-score model (red line) and clinical model (blue line) in the external test set.