| Literature DB >> 28404977 |
Li Yang1, Qiao Zhang2, Li Bai2, Ting-Yuan Li1, Chuang He1, Qian-Li Ma2, Liang-Shan Li1, Xue-Quan Huang1, Gui-Sheng Qian2.
Abstract
There are no large samples or exact prediction models for assessing the cancer risk factors of solitary pulmonary nodules (SPNs) in the Chinese population. We retrospectively analyzed the clinical and imaging data of patients with SPNs who underwent computer tomography guided needle biopsy in our hospital from Jan 1st of 2011 to March 30th of 2016. These patients were divided into a development data set and a validation data set. These groups included 1078 and 344 patients, respectively. A prediction model was developed from the development data set and was validated with the validation data set using logistic regression. The predictors of cancer in our model included female gender, age, pack-years of smoking, a previous history of malignancy, nodule size, lobulated and spiculated edges, lobulation alone and spiculation alone. The Area Under the Curves, sensitivity and specificity of our model in the development and validation data sets were significantly higher than those of the Mayo model and VA model (p < 0.001). We established the largest sampling risk prediction model of SPNs in a Chinese cohort. This model is particularly applicable to SPNs > 8 mm in size. SPNs in female patients, as well as SPNs featuring a combination of lobulated and spiculated edges or lobulated edges alone, should be evaluated carefully due to the probability that they are malignant.Entities:
Keywords: malignancy; model; pulmonary nodule; risk factor; solitary
Mesh:
Year: 2017 PMID: 28404977 PMCID: PMC5438732 DOI: 10.18632/oncotarget.16426
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Clinical characteristics of the SPNs in development and validation data sets
| Characteristics | Development data set, | Validation data set, | p | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Total | Malignant | Benign | Non-diagnostic | Total | Malignant | Benign | Non-diagnostic | |||
| Participants | 1078 | 721 | 182 | 175 | 344 | 236 | 46 | 62 | ||
| Males | 647 | 439 | 98 | 110 | 196 | 132 | 23 | 41 | 0.317 | |
| Females | 431 | 282 | 84 | 65 | 148 | 104 | 23 | 21 | ||
| Age (years) | 55.41 ± 11.94 | 58.22 ± 10.83 | 49.01 ± 11.88 | 50.53 ± 12.27 | 55.34 ± 11.24 | 57.48 ± 10.22 | 47.17 ± 12.02 | 53.27 ± 11.35 | 0.105 | |
| Smoking status | ||||||||||
| Censored data | 249 | 163 | 39 | 47 | 78 | 42 | 15 | 21 | 0.871 | |
| Non-smoker | 415 | 269 | 84 | 62 | 154 | 111 | 22 | 21 | 0.000 | |
| Former or | 414 | 289 | 59 | 66 | 112 | 83 | 9 | 20 | 0.051 | |
| Pack-year | 38.95 ± 39.71 | 41.03 ± 36.58 | 36.09 ± 63.51 | 32.39 ± 20.50 | 35.81 ± 20.63 | 34.95 ± 20.55 | 36.78 ± 26.54 | 38.95 ± 18.76 | 0.081 | |
| < 30 (n,%) | 139 | 80 | 30 | 29 | 33 | 26 | 3 | 4 | 0.411 | |
| ≥ 30 (n,%) | 275 | 209 | 29 | 37 | 79 | 57 | 6 | 16 | ||
| Previous medical history | ||||||||||
| Censored data | 217 | 139 | 34 | 44 | 72 | 41 | 12 | 19 | 0.748 | |
| No disease | 479 | 317 | 89 | 73 | 191 | 144 | 21 | 26 | 0.000 | |
| Extra-thoracic | 176 | 121 | 32 | 23 | 46 | 27 | 9 | 10 | 0.189 | |
| Lung disease | 152 | 99 | 20 | 33 | 19 | 14 | 4 | 1 | 0.000 | |
| Malignancy | 54 | 45 | 7 | 2 | 16 | 10 | 0 | 6 | 0.789 | |
CT characteristics of the SPNs in the development and validation data sets
| CT Characteristics | Development data set, | Validation data set, | p | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Total | Malignant | Benign | Non-diagnostic | Total | Malignant | Benign | Non-diagnostic | ||
| Size (mm) | 18.43 ± 5.03 | 19.41 ± 4.89 | 16.47 ± 4.54 | 16.44 ± 4.91 | 18.16 ± 5.05 | 19.28 ± 4.66 | 16.71 ± 4.84 | 14.97 ± 5.09 | 0.388 |
| 4-8 mm | 17 | 5 | 4 | 8 | 8 | 2 | 0 | 6 | 0.358 |
| 8-10 mm | 55 | 24 | 15 | 16 | 16 | 7 | 4 | 5 | 0.738 |
| 10-20 mm | 632 | 388 | 129 | 115 | 190 | 118 | 31 | 41 | 0.267 |
| 20-30 mm | 374 | 304 | 34 | 36 | 130 | 109 | 11 | 10 | 0.296 |
| Edge | |||||||||
| Spiculated | 244 | 153 | 47 | 44 | 88 | 62 | 11 | 15 | 0.088 |
| Lobulation | 363 | 298 | 33 | 32 | 70 | 60 | 3 | 7 | 0.000 |
| Spiculation | 234 | 161 | 34 | 39 | 73 | 46 | 11 | 16 | 0.849 |
| Lobulation and | 83 | 77 | 4 | 2 | 35 | 34 | 1 | 0 | 0.147 |
| Irregular edge | 78 | 21 | 22 | 35 | 46 | 26 | 8 | 12 | 0.000 |
| Smooth edge | 76 | 11 | 42 | 23 | 32 | 8 | 15 | 9 | 0.170 |
| Density | |||||||||
| Solid | 728 | 515 | 121 | 92 | 190 | 135 | 25 | 30 | 0.000 |
| Purely ground-glass | 3 | 3 | 0 | 0 | 6 | 4 | 0 | 2 | 0.003 |
| Partly solid | 170 | 117 | 18 | 35 | 105 | 18 | 8 | 19 | 0.000 |
| Thin cavitation | 5 | 3 | 1 | 1 | 0 | 0 | 0 | 0 | 0.206 |
| Thickened cavitation | 116 | 75 | 20 | 21 | 26 | 17 | 5 | 4 | 0.085 |
| Necrosis | 14 | 6 | 2 | 6 | 5 | 1 | 4 | 0 | 0.828 |
| Calcification | 42 | 2 | 20 | 20 | 12 | 1 | 4 | 7 | 0.730 |
| Location | |||||||||
| Upper lobe | 596 | 409 | 90 | 97 | 202 | 150 | 22 | 30 | 0.264 |
| Middle lobe | 114 | 85 | 18 | 11 | 32 | 21 | 5 | 6 | |
| Lower lobe | 368 | 227 | 74 | 67 | 110 | 65 | 19 | 26 | |
Figure 1Histopathology of the development and validation data
*The development data set comprised 24 non-small cell lung cancers, 6 neuroendocrine carcinomas, 4 lymphoma-like epithelial carcinomas, 4 complex carcinomas, 8 carcinoids, 2 sarcoma carcinomas, and 1 lymphoma. The validation data set comprised 19 NSCLCs, 1 neuroendocrine carcinoma, and 1 sarcoma carcinoma. †The development data set comprised 16 sclerosing angiomas, 11 hamartomas, 8 cartilaginous tumors, 2 vascular smooth muscle tumors, 6 inflammatory pseudotumors, and 2 spindle cell tumors. The validation data set comprised 4 sclerosing angiomas, 3 hamartomas, 3 inflammatory pseudotumors, 3 spindle cell tumors, 2 bronchoceles and 2 chondrophymas. §In the development data set, 29 nodules became markedly smaller within a short time, and 20 cases were stable after more than two years. The validation data set comprised 2 suppurative inflammation lesions and 1 eosinophilic inflammatory lesion. # Conditions that did not have a final diagnosis, including chronic inflammation, fibroplasia and gland hyperplasia.
Figure 2ROC curves of our model, the Mayo model and the VA model in the development data set
Figure 3ROC curves of our model, the Mayo model and the VA model in the validation data set
Figure 4SPN edge and density characteristics
(A) Spiculate protuberance, (B) Lobulation, (C) Spiculation, (D) Lobulation combined with spiculation, (E) Irregular edges, (F) Smooth edges, (G) Solid density, (H) Purely ground-glass, (I) Partly solid, (J) Thin cavitation, (K) Thickened cavitation, (L) Necrosis, (M) Calcification.
| Model | AUC | Standard Error | 95%CI | Sensitivity Y | Specificity Y | |
|---|---|---|---|---|---|---|
| Lower | Upper | |||||
| Ours | 0.807 | 0.015 | 0.778 | 0.834 | 85.71% | 60.36% |
| Mayo | 0.566 | 0.022 | 0.53 | 0.6 | 71.99% | 41.91% |
| VA | 0.636 | 0.02 | 0.601 | 0.669 | 66.11% | 53.01% |
Comparison of ROC parameters in the development data set
| VA-Mayo | Ours-VA | Ours-Mayo | ||||
|---|---|---|---|---|---|---|
| Difference | 0.07 | 0.171 | 0.242 | |||
| Standard error | 0.027 | 0.021 | 0.023 | |||
| 0.008 | < 0.001 | < 0.001 | ||||
| Model | AUC | Standard Error | 95%CI | Sensitivity Y | Specificity Y | |
|---|---|---|---|---|---|---|
| Lower | Upper | |||||
| Ours | 0.784 | 0.027 | 0.731 | 0.831 | 70.10% | 78.57% |
| Mayo | 0.649 | 0.037 | 0.59 | 0.706 | 82.63% | 53.57% |
| VA | 0.599 | 0.036 | 0.539 | 0.657 | 63.40% | 57.14% |
Comparison of ROC parameters in the validation data set
| VA-Mayo | Ours-VA | Ours-Mayo | ||||
|---|---|---|---|---|---|---|
| Difference | 0.045 | 0.189 | 0.144 | |||
| Standard error | 0.046 | 0.037 | 0.042 | |||
| 0.32 | < 0.001 | 0.001 | ||||