| Literature DB >> 35417660 |
Yafeng Liu1, Jiawei Zhou1, Jing Wu1,2, Wenyang Wang1, Xueqin Wang1, Jianqiang Guo1, Qingsen Wang1, Xin Zhang1, Danting Li1, Jun Xie3, Xuansheng Ding1,4,5, Yingru Xing1,6, Dong Hu1,2,3.
Abstract
OBJECTIVE: To develop and validate a generalized prediction model that can classify epidermal growth factor receptor (EGFR) mutation status in non-small cell lung cancer patients.Entities:
Keywords: computed tomography; epidermal growth factor receptor; machine learning; non–small cell lung cancer; radiomics
Mesh:
Substances:
Year: 2022 PMID: 35417660 PMCID: PMC9016531 DOI: 10.1177/10732748221092926
Source DB: PubMed Journal: Cancer Control ISSN: 1073-2748 Impact factor: 2.339
Figure 1.The overall framework of data analysis and model integration.
Patients in the Training and Validation Cohorts.
| Training | Validation | |||||
|---|---|---|---|---|---|---|
| Characteristic | n = 169
| n = 86
| n = 41
| total = 296 | n = 50
| |
| Age (y, mean ± SD) | 67.65 ± 10.33 | 64.65 ± 11.65 | 67.93 ± 14.89 | 66.82 ± 11.49 | 66.56 ± 9.44 | .675 |
| .029 | ||||||
| Wild type | 126 (74.56%) | 34 (39.53%) | 19 (46.34%) | 179 (60.47%) | 22 (44.00%) | |
| Mutant | 43 (25.44%) | 52 (60.47%) | 22 (53.66%) | 117 (39.53%) | 28 (56.00%) | |
| Sex | .007 | |||||
| Female | 62 (36.69%) | 31 (36.05%) | 19 (46.34%) | 112 (37.84%) | 29 (58.00%) | |
| Male | 107 (63.31%) | 55 (63.95%) | 22 (53.66%) | 184 (62.16%) | 21 (42.00%) | |
| Smoking status | .002 | |||||
| Never smoker | 40 (23.67%) | 33 (38.37%) | 21 (51.22%) | 94 (31.76%) | 27 (54.00%) | |
| Smoker | 129 (76.33%) | 53 (61.63%) | 20 (48.78%) | 202 (68.24%) | 23 (46.00%) | |
| TYPE | <.001 | |||||
| Luad | 149 (88.17%) | 70 (81.40%) | 34 (82.93%) | 253 (85.47%) | 32 (64.00%) | |
| Lusc | 17 (10.06%) | 15 (17.44%) | 7 (17.07%) | 39 (13.18%) | 18 (36.00%) | |
| Other | 3 (1.78%) | 1 (1.16%) | 0 (.00%) | 4 (1.35%) | 0 (.00%) | |
Note: Luad, Lung adenocarcinoma.
Lusc, lung squad cell carcinoma.
Other, 3 Large cell carcinoma and 1 pulmonary sarcomatoid carcinoma.
aThe Cancer Imaging Archive.
bCancer Hospital of Anhui University of Science and Technology.
cHuainan Chaoyang Hospital of Anhui University of Science and Technology.
dEastern Hospital of Anhui University of Science and Technology.
Figure 2.Selection of radiomics features. (A): ICC histogram of radiomics features; (B/C): LASSO method for screening of radiomics features.
Texture Features Selection for Radiomics Models.
| Parameters | Parameter category | Importance |
|---|---|---|
| Mean absolute deviation | Intensity histogram | −.065652535 |
| 60 Percentile area | Intensity histogram | −.027231004 |
| Convex | Shape | .737731264 |
| Correlation | Gray level cooccurence matrix 3 | −.364074713 |
| Dissimilarity | Gray level cooccurence matrix 3 | .338776007 |
| 5-1 Homogeneity 2 | Gray level cooccurence matrix 3 | −.176750873 |
| 10-4 Homogeneity 2 | Gray level cooccurence matrix 3 | −.048826405 |
| -333-7 Information measure corr 1 | Gray level cooccurence matrix 3 | −.053598886 |
| 8-1 Information measure corr 1 | Gray level cooccurence matrix 3 | −.217087549 |
| 9-7 Information measure corr 1 | Gray level cooccurence matrix 3 | .095533464 |
| 12-4 Inverse diff norm | Gray level cooccurence matrix 3 | −.013040947 |
| 6-4 Inverse variance | Gray level cooccurence matrix 3 | .322764285 |
| 8-4 Inverse variance | Gray level cooccurence matrix 3 | −.092512731 |
| 8-1 Max Probability | Gray level cooccurence matrix 3 | −.024734566 |
| 12-7 Max Probability | Gray level cooccurence matrix 3 | −.102971831 |
| −333 Run length nonuniformity | Gray level Run length matrix 25 | −.002072575 |
Figure 3.Building and performance of four machine learning classifier models. Receiver operating characteristic curves (3A), Calibration curves (3B), and Decision curves (3C) of different classifiers and models generated from the development cohorts; Receiver operating characteristic curves (3D), Calibration curves (3E), and Decision curves (3F) of different classifiers and models generated from the validation cohorts.
Performance of the Radiomics Signature.
| AUC | Accuracy | Sensitivity | Specificity | C-index | Brier score | |
|---|---|---|---|---|---|---|
| Training | ||||||
| LR | .723 | .69 | .53 | .794 | .725 | .203 |
| DT | .842 | .794 | .701 | .855 | .837 | .153 |
| RF | .995 | .99 | .992 | .989 | .998 | .007 |
| SVM | .883 | .845 | .838 | .85 | .905 | .119 |
| Validation | ||||||
| LR | .658 | .68 | .75 | .591 | .664 | .235 |
| DT | .567 | .46 | .358 | .591 | .605 | .244 |
| RF | .88 | .96 | .965 | .955 | .883 | .137 |
| SVM | .765 | .84 | .715 | .99 | .773 | .162 |
Delong Test of Machine Learning Classifier Model.
| Training- AUC | Validation- AUC | |
|---|---|---|
| RF | RF | >.05 |
| RF | LR | <.001 |
| RF | DT | <.001 |
| RF | SVM | <.001 |