| Literature DB >> 34367940 |
Xuejiao Han1, Jing Yang2,3, Jingwen Luo1, Pengan Chen4, Zilong Zhang4, Aqu Alu1, Yinan Xiao4, Xuelei Ma1.
Abstract
OBJECTIVES: The purpose of this study aimed at investigating the reliability of radiomics features extracted from contrast-enhanced CT in differentiating pancreatic cystadenomas from pancreatic neuroendocrine tumors (PNETs) using machine-learning methods.Entities:
Keywords: CT; differentiation; machine learning; pNETs; pancreatic cystadenomas; pancreatic neuroendocrine tumors; radiomics
Year: 2021 PMID: 34367940 PMCID: PMC8339967 DOI: 10.3389/fonc.2021.606677
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Flowchart shows selection of study population and exclusion criteria.
Figure 2Examples of lesion segmentation and contouring on contrast-enhanced CT images. Portal vein phase CT images of a patient with histopathologically proved pancreatic cystadenomas (A, B) and pancreatic neuroendocrine tumors (C, D).
Figure 3Flowchart depicts the workflow of the whole study.
Characteristics of patients and lesions.
| Characteristics | Pancreatic cystadenomas | Pancreatic neuroendocrine tumors |
|---|---|---|
|
| 66 | 54 |
|
| 50.26 (24-77) | 50.48 (29-73) |
|
| ||
| Male | 15 (22.7%) | 32 (59.3%) |
| Female | 51 (77.3%) | 22 (40.7%) |
|
| ||
| Head | 27 (40.9%) | 23 (42.6%) |
| Body or tail | 39 (59.1%) | 31 (57.4%) |
|
| 4.1 (0.8-12) | 4.19 (1-12) |
Figure 4The result of Pearson correlation coefficients test between radiomics features.
Selected features of five selection methods.
| DC | RF | LASSO | Xgboost | GBDT |
|---|---|---|---|---|
| meanValue | meanValue | maxValue | meanValue | HISTO_Kurtosis |
| HISTO_Kurtosis | HISTO_Kurtosis | HISTO_Skewness | HISTO_Kurtosis | GLRLM_SRHGE |
| GLZLM_HGZE | PARAMS_YSpatialResampling | SHAPE_Sphericity | GLRLM_HGRE | GLRLM_LRHGE |
| GLZLM_SZHGE | GLRLM_HGRE | GLRLM_SRLGE | GLRLM_LRHGE | GLZLM_HGZE |
| GLRLM_HGRE | GLRLM_SRHGE | GLRLM_GLNU | GLZLM_SZHGE | |
| GLRLM_SRHGE | GLRLM_LRHGE | GLRLM_RLNU | ||
| GLZLM_SZE | GLZLM_SZE | GLRLM_RP | ||
| GLZLM_HGZE | GLZLM_SZE | |||
| GLZLM_SZHGE | GLZLM_LGZE | |||
| GLZLM_SZLGE | ||||
| GLZLM_LZLGE | ||||
| GLZLM_GLNU | ||||
| GLZLM_ZLNU | ||||
| GLZLM_ZP |
DC, distance correlation; RF, random forest; LASSO, least absolute shrinkage and selection operator; Xgboost, eXtreme gradient boosting; GBDT, gradient boosting decision tree; GLZLM, gray-level zone length matrix; HGZE, High Grey Level Zone Emphasis; GLRLM, Gray Level Run Length Matrix; SZHGE, Short Zone High Grey Level Emphasis; HGRE, High Gray Level Run Emphasis; SRHGE, Short-Run High Grey Level Emphasis; SZE, Short Zone Emphasis; LRHGE, Long-Run High Grey Level Emphasis; SRLGE, Short-Run Low Grey Level Emphasis; GLNU, Grey Level Non-Uniformity; RLNU, Run Length Non-Uniformity; RP, Run Percentage; LGZE, Low Gray Level Zone Emphasis; SZLGE, Short Zone Low Grey Level Emphasis; LZLGE, Long Zone Low Grey Level Emphasis; ZP, Zone Percentage.
Results of discriminative models in distinguishing pancreatic cystadenomas and pancreatic neuroendocrine tumors in the testing group.
| DC | RF | LASSO | Xgboost | GBDT | |||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC | Accuracy | Sensitivity | Specificity | F1-score | AUC | Accuracy | Sensitivity | Specificity | F1-score | AUC | Accuracy | Sensitivity | Specificity | F1-score | AUC | Accuracy | Sensitivity | Specificity | F1-score | AUC | Accuracy | Sensitivity | Specificity | F1-score | |
|
| 0.907 | 0.858 | 0.797 | 0.912 | 0.832 | 0.915 | 0.917 | 0.847 | 0.971 | 0.890 | 0.901 | 0.867 | 0.763 | 0.955 | 0.832 | 0.947 | 0.917 | 0.817 | 1.000 | 0.894 | 0.918 | 0.875 | 0.810 | 0.929 | 0.850 |
|
| 0.853 | 0.700 | 0.633 | 0.757 | 0.653 | 0.972 | 0.725 | 0.430 | 0.971 | 0.542 | 0.777 | 0.642 | 0.267 | 0.957 | 0.365 | 0.946 | 0.833 | 0.650 | 0.986 | 0.764 | 0.966 | 0.908 | 0.817 | 0.986 | 0.886 |
|
| 0.997 | 0.983 | 0.980 | 0.986 | 0.980 | 0.994 | 0.975 | 0.960 | 0.986 | 0.969 | 0.975 | 0.950 | 0.947 | 0.952 | 0.948 | 0.997 | 0.992 | 0.980 | 1.000 | 0.989 | 0.989 | 0.992 | 0.980 | 1.000 | 0.989 |
|
| 0.990 | 0.967 | 0.960 | 0.967 | 0.961 | 0.990 | 0.967 | 0.960 | 0.967 | 0.961 | 0.976 | 0.975 | 1.000 | 0.952 | 0.977 | 0.990 | 0.975 | 0.940 | 1.000 | 0.964 | 0.990 | 0.975 | 0.943 | 1.000 | 0.966 |
|
| 0.959 | 0.925 | 0.870 | 0.971 | 0.908 | 0.983 | 0.967 | 0.967 | 0.971 | 0.964 | 0.760 | 0.675 | 0.647 | 0.700 | 0.634 | 0.932 | 0.925 | 0.890 | 0.957 | 0.912 | 0.969 | 0.975 | 0.940 | 1.000 | 0.967 |
|
| 0.973 | 0.942 | 0.893 | 0.986 | 0.928 | 0.973 | 0.775 | 0.523 | 0.986 | 0.654 | 0.986 | 0.975 | 0.963 | 0.986 | 0.971 | 0.926 | 0.633 | 0.223 | 0.971 | 0.292 | 0.926 | 0.608 | 0.167 | 0.971 | 0.209 |
|
| 0.862 | 0.692 | 0.613 | 0.757 | 0.633 | 0.946 | 0.708 | 0.393 | 0.971 | 0.492 | 0.743 | 0.675 | 0.360 | 0.940 | 0.458 | 0.935 | 0.708 | 0.377 | 0.986 | 0.502 | 0.909 | 0.625 | 0.170 | 1.000 | 0.276 |
|
| 0.989 | 0.975 | 0.980 | 0.969 | 0.972 | 0.979 | 0.983 | 0.980 | 0.986 | 0.980 | 0.976 | 0.975 | 1.000 | 0.952 | 0.977 | 0.993 | 0.983 | 0.980 | 0.986 | 0.980 | 0.983 | 0.983 | 0.980 | 0.983 | 0.981 |
|
| 0.975 | 0.975 | 0.980 | 0.969 | 0.972 | 0.983 | 0.983 | 0.980 | 0.986 | 0.980 | 0.976 | 0.975 | 1.000 | 0.952 | 0.977 | 0.990 | 0.992 | 0.980 | 1.000 | 0.989 | 0.982 | 0.983 | 0.980 | 0.983 | 0.981 |
DC, distance correlation; RF, random forest; LASSO, least absolute shrinkage and selection operator; Xgboost, eXtreme gradient boosting; GBDT, gradient boosting decision tree; LDA, linear discriminant analysis; SVM, support vector machine; KNN, k-nearest neighbor; LR, logistic regression; DT, decision tree; AUC, area under curve.
Figure 5The results of AUC (A), sensitivity (B) and specificity (C) in the testing group.
Figure 6The ROC curves of 10 fold for DC+RF (A) and Xgboost+RF (B) in the testing group.