| Literature DB >> 35574352 |
Ruxian Tian1, Yumei Li1, Chuanliang Jia1, Yakui Mou1, Haicheng Zhang2, Xinxin Wu1, Jingjing Li1, Guohua Yu3, Ning Mao2, Xicheng Song1,4.
Abstract
Objective: We aim to establish and validate computed tomography (CT)-based radiomics model for predicting TP53 status in patients with laryngeal squamous cell carcinoma (LSCC).Entities:
Keywords: TP53; computed tomography; laryngeal squamous cell carcinoma; machine learning; radiomics
Year: 2022 PMID: 35574352 PMCID: PMC9095903 DOI: 10.3389/fonc.2022.823428
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Figure 1Radiomics workflow and study flowchart. LSCC, laryngeal squamous cell carcinoma; ANOVA, analysis of variance; LASSO, least absolute shrinkage and selection operator; KNN, K-Nearest Neighbor; SVM, support vector machine.
Figure 2An example of manual segmentation in laryngeal squamous cell carcinoma (LSCC). (A) Localized space-occupying lesion of LSCC was observed on plain CT image; (B) Manual segmentation on the same axial slice was depicted with red label.
Clinical characteristics of patients in training set 1 and testing set 1 in the wild-type group and mutated group.
| Characteristics | Training set 1 | Testing set 1 | ||||
|---|---|---|---|---|---|---|
| Wild-type group | Mutated group | Wild-type group | Mutated group | |||
| Age (Mean ± SD, years) | 61.78 ± 3.02 | 62.93 ± 5.07 | 0.348 | 62.24 ± 3.43 | 60.28 ± 4.73 | 0.176 |
| Gender, | 0.494 | 0.467 | ||||
| Male | 33 (94.29) | 31 (100) | 16 (100) | 13 (92.86) | ||
| female | 2 (5.71) | 0 (0) | 0 (0) | 1 (7.14) | ||
| Tumor location, | 0.563 | 0.972 | ||||
| Supraglottis | 5 (14.29) | 6 (19.35) | 4 (25.00) | 3 (21.43) | ||
| Glottis | 29 (82.86) | 25 (80.65) | 11 (68.75) | 10 (71.43) | ||
| Subglottis | 1 (2.85) | 0 (0) | 1 (6.25) | 1 (7.14) | ||
| T stage, | 0.132 | 0.296 | ||||
| T1 | 6 (17.14) | 7 (22.58) | 5 (31.25) | 1 (7.14) | ||
| T2 | 16 (45.71) | 6 (19.35) | 5 (20.00) | 5 (35.71) | ||
| T3 | 8 (22.86) | 13 (41.94) | 4 (25.00) | 7 (50.00) | ||
| T4 | 5 (14.29) | 5 (16.13) | 2 (12.50) | 1 (7.14) | ||
| N stage, | 0.094 | 0.151 | ||||
| N0 | 28 (80.00) | 19 (61.29) | 13(81.25) | 8 (57.14) | ||
| N1, N2 | 7 (20.00) | 12 (38.71) | 3(18.75) | 6 (42.86) | ||
| TNM stage, | 0.099 | 0.424 | ||||
| I | 6 (17.14) | 7 (22.58) | 5 (31.25) | 1 (7.14) | ||
| II | 14 (40.00) | 4 (12.90) | 4 (25.00) | 4 (28.57) | ||
| III | 8 (22.86) | 12 (38.71) | 4 (25.00) | 5 (35.71) | ||
| IV | 7 (20.00) | 8 (25.81) | 3 (18.75) | 4 (28.57) | ||
| Histologic grade, | 0.064 | 0.503 | ||||
| Poor | 5 (14.29) | 9 (29.03) | 3 (18.8) | 3 (21.43) | ||
| Moderate | 20 (57.14) | 9 (29.03) | 10 (62.5) | 6 (42.86) | ||
| Well | 10 (28.57) | 13 (41.94) | 3 (18.8) | 5 (35.71) | ||
| Smoking, | 1.000 | 0.586 | ||||
| Yes | 30 (85.71) | 27 (87.10) | 15 (93.75) | 12 (85.71) | ||
| No | 5(14.29) | 4 (12.90) | 1 (6.25) | 2 (14.29) | ||
| Drinking, | 0.792 | 0.675 | ||||
| Yes | 23 (65.71) | 22 (70.97) | 13 (81.25) | 10 (71.4) | ||
| No | 12 (34.29) | 9 (29.03) | 3 (18.75) | 4 (28.6) | ||
| Family history of cancer, | 0.265 | 0.157 | ||||
| Yes | 6 (17.14) | 2 (6.45) | 1 (6.25) | 4 (28.57) | ||
| No | 29 (82.86) | 29 (93.55) | 15 (93.75) | 10 (71.43) | ||
Figure 3Twenty-two radiomics features were selected using the least absolute shrinkage and selection operator algorithm (LASSO). (A) The LASSO coefficient profiles of the 117 radiomic features. Each colored line represents a coefficient corresponding to each feature. A vertical line is drawn at the value where the optimal alpha results in 22 nonzero coefficients. (B) Mean square error path using five-fold cross-testing.
Figure 4Receiver operating characteristic (ROC) curves of radiomics models based on different machine learning methods in training set 1 and testing set 1. (A–E) ROC curves for the model based on K-Nearest Neighbor, logistic regressive, linear-support vector machine (SVM), Gaussian-SVM, and polynomial-SVM, respectively.
Performance of the different radiomics models in the training set 1.
| AUC (95%CI) | Specificity (95%CI) | Sensitivity (95%CI) | Accuracy (95%CI) | |
|---|---|---|---|---|
| KNN | 0.799 (0.625-0.909) | 0.657 (0.477-0.803) | 0.829 (0.657-0.928) | 0.743 (0.570-0.879) |
| Logistics Regression | 0.855 (0.735-0.956) | 0.971 (0.834-0.999) | 0.800 (0.625-0.909) | 0.857 (0.675-0.930) |
| Linear-SVM | 0.831 (0.712-0.930) | 0.971 (0.834-0.999) | 0.714 (0.535-0.848) | 0.843 (0.647-0.942) |
| Gaussian-SVM | 0.888 (0.750-0.963) | 0.971 (0.834-0.999) | 0.793 (0.597-0.913) | 0.814 (0.634-0.912) |
| Polynomial-SVM | 0.782 (0.612-0.892) | 0.829 (0.657-0.928) | 0.657 (0.477-0.803) | 0.743 (0.570-0.879) |
CI, confidence interval; AUC, area under ROC curve; KNN, K-Nearest Neighbor; SVM, support vector machine.
Performance of the different radiomics models in the testing set 1.
| AUC (95%CI) | Specificity (95%CI) | Sensitivity (95%CI) | Accuracy (95%CI) | |
|---|---|---|---|---|
| KNN | 0.683 (0.472-0.768) | 0.333 (0.113-0.646) | 1.000 (0.781-1.000) | 0.600 (0.406-0.773) |
| Logistics Regression | 0.741 (0.569-0.856) | 0.750 (0.474-0.917) | 0.714 (0.420-0.904) | 0.733 (0.541-0.877) |
| Linear-SVM | 0.797 (0.632-0.957) | 0.750 (0.500-0.938) | 0.786 (0.571-1.000) | 0.667 (0.472-0.827) |
| Gaussian-SVM | 0.607 (0.467-0.720) | 0.286 (0.096-0.580) | 0.875 (0.604-0.978) | 0.600 (0.406-0.773) |
| Polynomial-SVM | 0.683 (0.506-0.826) | 0.500 (0.255-0.745) | 0.929 (0.642-0.996) | 0.700 (0.504-0.848) |
CI, confidence interval; AUC, area under ROC curve; KNN, K-Nearest Neighbor; SVM, support vector machine.
Figure 5The rad-scores of each patient in the training set 1. Red represents the TP53 wild type and blue represents the mutated TP53.
Figure 6Decision curve analysis (DCA) for the radiomics models. (A) DCA for the five radiomics models in training set 1. (B) DCA for the five radiomics models in testing set 1. (C) DCA for the radiomics model based on liner-SVM in training set 2 and testing set 2. The gray line is the decision curve for treat all patients assuming TP53 status as mutation. The black line is the decision curve for treat none patients which assumes TP53 status as wild-type.