| Literature DB >> 34499018 |
Hanyue Xu1,2, Xiuhe Zou3, Yunuo Zhao1,2, Tao Zhang1,2, Youyin Tang4, Aiping Zheng1,2, Xianghong Zhou1,2, Xuelei Ma1.
Abstract
Purpose: This study aimed to explore the ability of texture parameters combining with machine learning methods in distinguishing intrahepatic cholangiocarcinoma (ICCA) and hepatic lymphoma (HL). Method: A total of 28 patients with HL and 101 patients with ICCA were included. A total of 45 texture features were extracted by the software LifeX from contrast-enhanced computer tomography (CECT) images and 38 of them were eligible. A total of 5 feature selection methods and 9 feature classification methods were used to build the best diagnostic models, combining with the 10-fold cross-validation to assess the accuracy of these models. The discriminative ability of each model was evaluated by receiver operating characteristic analysis. Result: A total of 45 predictive models were built by the cross combination of each selection and classification method to differentiate ICCA from HL. According to the results of test group, most of the models performed well with a large area under the curve (AUC) (>0.85) and high accuracy (>0.85). Random Forest (RF)_Linear Discriminant Analysis (LDA) (AUC = 0.997, accuracy = 0.969) was the best model among all the 45 models.Entities:
Keywords: contrast-enhanced computer tomography; hepatic lymphoma; intrahepatic cholangiocarcinoma; machine learning; texture
Mesh:
Substances:
Year: 2021 PMID: 34499018 PMCID: PMC8435928 DOI: 10.1177/15330338211039125
Source DB: PubMed Journal: Technol Cancer Res Treat ISSN: 1533-0338
Figure 1.CECT of patients with ICCA and HL. (a and b) The CECT images of 1 ICCA patient. This patient presented with intermittent dull pain in the right upper abdomen with postprandial pain that had been evident for 8 years and worsened for 20 days. No nausea, vomiting, or yellowish skin staining was found. An irregular and mixed low-density mass was seen in the left internal lobe of the liver, with a blurred boundary and a size of about 8.1 × 3.9 cm on CECT. (c and d) The CECT images of 1 HL patient. This patient was admitted to the hospital for 3 months with intermittent fever, night sweats, and pain in the right chest, accompanied by decreased appetite and no yellowing of the skin. On CECT, a soft tissue mass with slightly lower density was seen in the lower segment of the right lobe of liver, about 8.8 × 8.7 cm, with an ill-defined boundary and uneven moderate enhancement. The boundary between the lesion and the right anterior and posterior portal vein branches was not clear. ROIs were all drawn along the liver lesion slice by slice, and all areas of calcification and necrosis were excluded.
Figure 2.The flowchart of this study.
Clinical Parameters of ICCA and HL.
| ICCA
( | HL ( | ||
|---|---|---|---|
| Age | 58.2 ± 10.8 | 53.2 ± 17.9 | |
| Sex (M: F) | 55: 46 | 17: 11 | |
| Size | |||
| <5 cm | 24 | NA | |
| 5 to 10 cm | 71 | ||
| >10 cm | 6 | ||
| Differentiated degree | |||
| Poorly differentiated | 26 | NA | |
| Medium to low differentiated | 54 | ||
| Medium differentiated | 19 | ||
| Medium to high differentiated | 2 | ||
| Stage | PHL
( | SHL
( | |
| II | NA | 1 | 1 |
| III | 1 | ||
| IV | 25 | ||
Abbreviations: ICCA, intrahepatic cholangiocarcinoma; HL, hepatic lymphoma; M: F male: female; NA, not appropriate; PHL, primary hepatic lymphoma; SHL, secondary hepatic lymphoma.
The Differentiational Ability of all Models Based on 5 Feature Selection Methods and 9 Feature Classification Methods.
| Models | Test group | |||
|---|---|---|---|---|
| Sensitivity | Specificity | Accuracy | AUC | |
| OD_LDA | 0.990 | 0.833 | 0.954 | 0.953 |
| DC_LDA | 0.990 | 0.867 | 0.962 | 0.997 |
| RF_LDA | 0.990 | 0.900 | 0.969 | 0.997 |
| LASSO_LDA | 0.950 | 0.500 | 0.853 | 0.788 |
| Xgboost_LDA | 0.980 | 0.933 | 0.969 | 0.993 |
| GBDT_LDA | 0.990 | 0.867 | 0.962 | 0.980 |
| OD_SVM | 0.990 | 0.833 | 0.923 | 0.987 |
| DC_SVM | 0.990 | 0.867 | 0.962 | 0.993 |
| RF_SVM | 1.000 | 0.900 | 0.954 | 0.987 |
| LASSO_SVM | 1.000 | 0.500 | 0.792 | 0.915 |
| Xgboost_SVM | 0.970 | 0.933 | 0.938 | 0.990 |
| GBDT_SVM | 1.000 | 0.867 | 0.962 | 0.983 |
| OD_RF | 0.970 | 0.717 | 0.915 | 0.983 |
| DC_RF | 0.990 | 0.833 | 0.954 | 0.990 |
| RF_RF | 0.980 | 0.733 | 0.923 | 0.990 |
| LASSO_RF | 0.980 | 0.783 | 0.938 | 0.958 |
| Xgboost_RF | 0.990 | 0.833 | 0.954 | 0.995 |
| GBDT_RF | 0.990 | 0.783 | 0.946 | 0.997 |
| OD_Adaboost | 0.990 | 0.767 | 0.938 | 0.987 |
| DC_Adaboost | 0.961 | 0.750 | 0.915 | 0.962 |
| RF_Adaboost | 0.980 | 0.783 | 0.938 | 0.983 |
| LASSO_Adaboost | 0.960 | 0.750 | 0.915 | 0.933 |
| Xgboost_Adaboost | 0.990 | 0.800 | 0.946 | 0.990 |
| GBDT_Adaboost | 0.980 | 0.900 | 0.962 | 0.993 |
| OD_KNN | 0.960 | 0.633 | 0.884 | 0.955 |
| DC_KNN | 0.960 | 0.800 | 0.923 | 0.970 |
| RF_KNN | 0.980 | 0.767 | 0.931 | 0.970 |
| LASSO_KNN | 0.970 | 0.450 | 0.861 | 0.912 |
| Xgboost_KNN | 0.950 | 0.783 | 0.915 | 0.962 |
| GBDT_KNN | 0.980 | 0.800 | 0.938 | 0.977 |
| OD_GaussianNB | 0.980 | 0.767 | 0.931 | 0.923 |
| DC_GaussianNB | 0.970 | 0.867 | 0.946 | 0.953 |
| RF_GaussianNB | 0.910 | 0.867 | 0.900 | 0.935 |
| LASSO_GaussianNB | 0.950 | 0.467 | 0.846 | 0.907 |
| Xgboost_GaussianNB | 0.950 | 0.867 | 0.931 | 0.950 |
| GBDT_GaussianNB | 0.940 | 0.900 | 0.931 | 0.935 |
| OD_LR | 0.990 | 0.583 | 0.900 | 0.953 |
| DC_LR | 1.000 | 0.633 | 0.915 | 0.957 |
| RF_LR | 1.000 | 0.667 | 0.923 | 0.967 |
| LASSO_LR | 1.000 | 0.000 | 0.783 | 0.909 |
| Xgboost_LR | 1.000 | 0.400 | 0.869 | 0.952 |
| GBDT_LR | 1.000 | 0.483 | 0.885 | 0.990 |
| OD_GBDT | 0.960 | 0.783 | 0.923 | 0.898 |
| DC_GBDT | 0.980 | 0.850 | 0.954 | 0.932 |
| RF_GBDT | 0.960 | 0.783 | 0.923 | 0.917 |
| LASSO_GBDT | 0.980 | 0.817 | 0.946 | 0.941 |
| Xgboost_GBDT | 0.970 | 0.850 | 0.946 | 0.932 |
| GBDT_GBDT | 0.980 | 0.783 | 0.938 | 0.937 |
| OD_DT | 0.970 | 0.783 | 0.931 | 0.877 |
| DC_DT | 0.980 | 0.750 | 0.931 | 0.865 |
| RF_DT | 0.970 | 0.750 | 0.923 | 0.860 |
| LASSO_DT | 0.960 | 0.750 | 0.915 | 0.855 |
| Xgboost_DT | 0.980 | 0.750 | 0.931 | 0.865 |
| GBDT_DT | 0.990 | 0.750 | 0.938 | 0.870 |
Abbreviations: Adaboost, Adaptiveboosting; AUC, area under the curve; DC, distance correlation; DT, decision tree; GaussianNB, Gaussian Naïve Bayes; GBDT, gradient boosted decision tree; ICCA, intrahepatic cholangiocarcinoma; KNN, k-nearest neighbor; LASSO, least absolute shrinkage and selection operator; LDA, linear discriminant analysis; LR, logistic regression; RF, random forest; SVM, support vector machine; OD, original data; XGBoost, eXtreme gradient boosting.
Figure 3.The heatmap of AUC and accuracy of 45 models in the test group.