| Literature DB >> 35204507 |
Ayten Kayi Cangir1,2, Kaan Orhan2,3,4, Yusuf Kahya1, Ayse Uğurum Yücemen1, İslam Aktürk1, Hilal Ozakinci5, Aysegul Gursoy Coruh6, Serpil Dizbay Sak5.
Abstract
Radiomics is a new image processing technology developed in recent years. In this study, CT radiomic features are evaluated to differentiate pulmonary hamartomas (PHs) from pulmonary carcinoid tumors (PCTs). A total of 138 patients (78 PCTs and 60 PHs) were evaluated. The Radcloud platform (Huiying Medical Technology Co., Ltd., Beijing, China) was used for managing the data, clinical data, and subsequent radiomics analysis. Two hand-crafted radiomics models are prepared in this study: the first model includes the data regarding all of the patients to differentiate between the groups; the second model includes 78 PCTs and 38 PHs without signs of fat tissue. The separation of the training and validation datasets was performed randomly using an (8:2) ratio and 620 random seeds. The results revealed that the MLP method (RF) was best for PH (AUC = 0.999) and PCT (AUC = 0.999) for the first model (AUC = 0.836), and PC (AUC = 0.836) in the test set for the second model. Radiomics tumor features derived from CT images are useful to differentiate the carcinoid tumors from hamartomas with high accuracy. Radiomics features may be used to differentiate PHs from PCTs with high levels of accuracy, even without the presence of fat on the CT. Advances in knowledge: CT-based radiomic holds great promise for a more accurate preoperative diagnosis of solitary pulmonary nodules (SPNs).Entities:
Keywords: carcinoid; machine learning; pulmonary hamartomas; radiomics
Year: 2022 PMID: 35204507 PMCID: PMC8871366 DOI: 10.3390/diagnostics12020416
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 13D segmentation, feature selection, and radiomics analysis of workflow used for PCTs and PHs.
Figure 2The use of K best on the feature selection to further select radiomics features, which results in 11 features.
Figure 3Lasso algorithm on feature selection. (a) Lasso path; (b) MSE path; and (c) coefficients in Lass model. Using the Lasso model, eight optimal features that correspond to the optimal alpha value were selected.
Clinical characteristics of the patients.
| Characteristics | Pulmonary Carcinoid Tumor, | Pulmonary Hamartoma, | |
|---|---|---|---|
| Male | 35 | 38 | |
| Female | 43 | 22 | 0.03 |
| Age, median (range) (years) | 52 | 55 | 0.06 |
| Tumor site, n | |||
| Right lung | 44 | 38 | 0.412 |
| Left lung | 34 | 22 | |
| Tumor diameter, mean (range) (mm) on CT | 26.2 | 19.8 | 0.58 |
Figure 4The figure shows the results of the 1st modeling ROC curve analysis for the training and test data for differentiating between PHs and PCTs. Note that green—PHs and red—PCTs. (a) ROC curve of the training set and (b) ROC curve test set.
ROC outcomes with six machine learning classifiers for the test set using the first model.
| Classifiers | Category | AUC | 95% CI | Sensitivity | Specificity |
|---|---|---|---|---|---|
| KNN | PHs | 0.898 | 0.82–0.98 | 0.69 | 0.9 |
| PCTs | 0.898 | 0.82–0.98 | 0.90 | 0.69 | |
| SVM | PHs | 0.849 | 0.76–0.94 | 0.76 | 0.77 |
| PCTs | 0.849 | 0.76–0.94 | 0.77 | 0.76 | |
| XGBoost | PHs | 0.996 | 0.95–1.00 | 0.95 | 0.93 |
| PCTs | 0.996 | 0.95–1.00 | 0.93 | 0.95 | |
| RF | PHs | 0.999 | 0.98–1.00 | 0.97 | 1 |
| PCTs | 0.999 | 0.98–1.00 | 1.00 | 0.97 | |
| LR | PHs | 0.809 | 0.71–0.90 | 0.73 | 0.73 |
| PCTs | 0.809 | 0.71–0.90 | 0.73 | 0.73 | |
| DT | PHs | 0.806 | 0.73–0.90 | 0.75 | 0.75 |
| PCTs | 0.806 | 0.73–0.90 | 0.75 | 0.75 |
Results of the four indicators of precision, recall, F1 score, and support based on the test data using the first model.
| Indicators | KNN | SVM | XGBoost | RF | LR | DT | |
|---|---|---|---|---|---|---|---|
| PHs | Precision | 0.93 | 0.87 | 0.97 | 1.00 | 0.85 | 0.75 |
| Recall | 0.69 | 0.76 | 0.95 | 0.97 | 0.73 | 075 | |
| F1 score | 0.80 | 0.81 | 0.96 | 0.98 | 0.78 | 0.75 | |
| Support | 62.00 | 62.00 | 62.00 | 62.00 | 62.00 | 62.00 | |
| PCTs | Precision | 0.59 | 0.61 | 0.90 | 0.94 | 0.56 | 0.56 |
| Recall | 0.90 | 0.77 | 0.93 | 1.00 | 0.73 | 0.77 | |
| F1 score | 0.71 | 0.68 | 0.92 | 0.97 | 0.64 | 0.76 | |
| Support | 30.00 | 30.00 | 30.00 | 30.00 | 30.00 | 30.00 |
Figure 5The figure shows the results of the 2nd modeling ROC curve analysis results for the training and test data for differentiating between PHs and PCTs. Note that green—PHs and red—PCTs. (a) ROC curve of the training set and (b) ROC curve of the test set.
ROC outcomes for six machine learning classifiers of the test set and the second model.
| Classifiers | Category | AUC | 95% CI | Sensitivity | Specificity |
|---|---|---|---|---|---|
| KNN | PHs | 0.613 | 0.39–0.84 | 0.62 | 0.56 |
| PCTs | 0.613 | 0.39–0.84 | 0.56 | 0.63 | |
| SVM | PHs | 0.676 | 0.45–0.90 | 0.50 | 0.69 |
| PCTs | 0.676 | 0.45–0.90 | 0.69 | 0.5 | |
| XGBoost | PHs | 0.82 | 0.66–0.98 | 0.88 | 0.81 |
| PCTs | 0.82 | 0.66–0.98 | 0.81 | 0.88 | |
| RF | PHs | 0.836 | 0.66–1.00 | 0.88 | 0.69 |
| PCTs | 0.836 | 0.66–1.00 | 0.69 | 0.88 | |
| LR | PHs | 0.723 | 0.55–0.90 | 0.88 | 0.63 |
| PCTs | 0.723 | 0.55–0.90 | 0.62 | 0.88 | |
| DT | PHs | 0.563 | 0.35 0.78 | 0.38 | 0.75 |
| PCTs | 0.563 | 0.35–0.78 | 0.75 | 0.38 |
Outcomes for four indicators, including precision, recall, F1 score, and support for the test set using the second model.
| Indicators | KNN | SVM | XGBoost | RF | LR | DT | |
|---|---|---|---|---|---|---|---|
| PHs | Precision | 0.57 | 0.68 | 0.93 | 0.93 | 0.50 | 1.00 |
| Recall | 0.90 | 0.79 | 0.97 | 0.97 | 0.66 | 1.00 | |
| F1 score | 0.69 | 0.73 | 0.95 | 0.95 | 0.57 | 1.00 | |
| Support | 29 | 29 | 29 | 29 | 29 | 29 | |
| PCTs | Precision | 0.93 | 0.90 | 0.98 | 0.98 | 0.81 | 1.00 |
| Recall | 0.68 | 0.83 | 0.97 | 0.97 | 0.70 | 1.00 | |
| F1 score | 0.79 | 0.86 | 0.98 | 0.98 | 0.75 | 1.00 | |
| Support | 63 | 63 | 63 | 63 | 63 | 63 |
Details of the confusion matrix for PHs and PCTs using the highest learning MLP classifier (RF) for all patients in the first and second models.
| Types of Pathology | RF (1st Modeling) | RF (2nd Modeling) | ||||
|---|---|---|---|---|---|---|
| True | False | Accuracy (%) | True | False | Accuracy (%) | |
| PHs | 12 | 0 | 100 | 12 | 4 | 75 |
| PCTs | 61 | 1 | 98.38 | 62 | 4 | 91.1 |
| Accuracy (%) | 99% | 83% | ||||