| Literature DB >> 33973725 |
Si-Qi Liu1,2, Xiao-Bin Ma1, Wan-Mei Song1,2, Yi-Fan Li1, Ning Li3, Li-Na Wang4, Jin-Yue Liu5, Ning-Ning Tao6,7, Shi-Jin Li1,2, Ting-Ting Xu1, Qian-Yun Zhang1,2, Qi-Qi An1,2, Bin Liang1, Huai-Chen Li1,8.
Abstract
BACKGROUND: Considering the high morbidity and mortality of lung cancer and the high incidence of pulmonary nodules, clearly distinguishing benign from malignant lung nodules at an early stage is of great significance. However, determining the kind of lung nodule which is more prone to lung cancer remains a problem worldwide.Entities:
Keywords: decision tree; logistics regression; lung cancer; pulmonary nodules
Mesh:
Year: 2021 PMID: 33973725 PMCID: PMC8201526 DOI: 10.1111/1759-7714.13991
Source DB: PubMed Journal: Thorac Cancer ISSN: 1759-7706 Impact factor: 3.500
FIGURE 1Pathological results of lung nodules patients that were underwent VATS lobectomy from January 2013 to November 2018 in Shandong Provincial Hospital
Clinical and nodule characteristics of 458 patients with pulmonary nodules diagnosed with computer tomography (CT) who underwent video‐assisted thoracoscopy (VATS) lobectomy in Shandong Province Hospital, China
| Variable | Benign nodule | Lung cancer | Total | Univariable analysis | Multivariable analysis | ||
|---|---|---|---|---|---|---|---|
| OR (95%CI) |
| OR (95% CI) |
| ||||
| Sex | |||||||
| Female | 41 (40.20%) | 182 (51.12%) | 223 (48.69%) | Reference | Reference | ||
| Male | 61 (59.80%) | 174 (48.88%) | 235 (51.31%) | 0.64 (0.41–1.01) | 0.053 | ||
| Age (years) | |||||||
| Average | 52.35 ± 10.86 | 58.91 ± 9.69 | 57.45 ± 10.33 | ||||
| ≤45 | 25 (24.51%) | 35 (9.83%) | 60 (13.10%) | Reference | Reference | Reference | Reference |
| 45–60 | 53 (51.96%) | 153 (42.98%) | 206 (44.98%) | 2.06 (1.31–3.76) | 0.018 | 1.80 (0.85–3.82) | 0.125 |
| >60 | 24 (23.53%) | 168 (47.19%) | 192 (41.92%) | 5.00 (2.56–9.75) | <0.001 | 3.30 (1.46–7.48) | 0.004 |
| Profession | |||||||
| Others | 19 (18.63%) | 49 (13.76%) | 68 (14.85%) | Reference | Reference | ||
| Workmen | 11 (10.78%) | 28 (7.87%) | 39 (8.52%) | 0.99 (0.41–2.37) | 0.997 | ||
| Laborers | 38 (37.25%) | 125 (35.11%) | 163 (35.59%) | 1.28 (0.67–2.42) | 0.458 | ||
| Office clerk | 34 (33.33%) | 154 (43.26%) | 188 (41.05%) | 1.76 (0.92–3.35) | 0.088 | ||
| Smoking | |||||||
| No | 64 (62.75%) | 241 (67.70%) | 305 (66.59%) | Reference | Reference | ||
| Yes | 38 (37.25%) | 115 (32.30%) | 153 (33.41%) | 0.80 (0.51–1.27) | 0.350 | ||
| Drinking | |||||||
| No | 78 (76.47%) | 282 (79.21%) | 360 (78.60%) | Reference | Reference | ||
| Yes | 24 (23.53%) | 74 (20.79%) | 98 (21.40%) | 0.85 (0.51–1.44) | 0.552 | ||
| Family history of cancer | |||||||
| No | 78 (76.47%) | 282 (79.21%) | 370 (80.79%) | Reference | Reference | ||
| Yes | 24 (23.53%) | 74 (20.79%) | 88 (19.21%) | 0.71 (0.42–1.21) | 0.211 | ||
| Nodule size‐cm | |||||||
| Average | 1.59 ± 0.64 | 1.84 ± 0.63 | 1.78 ± 0.64 | ||||
| <0.6 | 5/8 (62.50%) | 3/8 (37.50%) | 8/458 (1.75%) | Reference | Reference | Reference | Reference |
| 0.6–1.0 | 19/60 (31.67%) | 41/60 (68.33%) | 60/458 (13.10%) | 3.60 (0.78–16.63) | 0.101 | 2.26 (0.38–13.35) | 0.370 |
| 1.0–2.0 | 56/231 (24.24%) | 175/231 (75.76%) | 231/458 (50.44%) | 5.21 (1.21–22.49) | 0.027 | 3.42 (0.62–19.00) | 0.160 |
| 2.0–3.0 | 22/159 (13.84%) | 137/159 (86.16%) | 159/458 (34.72%) | 10.38 (2.32–46.54) | 0.002 | 6.23 (1.07–36.29) | 0.042 |
| Nodule location | |||||||
| Right upper lobe | 25/129 (19.38%) | 104/129 (80.62%) | 129/458 (28.17%) | Reference | Reference | ||
| Right middle lobe | 8/38 (21.05%) | 30/38 (78.95%) | 38/458 (8.30%) | 0.90 (0.37–2.20) | 0.820 | ||
| Right lower lobe | 18/90 (20.00%) | 72/90 (80.00%) | 90/458 (19.65%) | 0.96 (0.49–1.89) | 0.910 | ||
| Left upper lobe | 25/107 (23.36%) | 82/107 (76.64%) | 107/458 (23.36%) | 0.79 (0.42–1.47) | 0.456 | ||
| Left lower lobe | 26/94 (27.66%) | 68/94 (72.34%) | 94/458 (20.52%) | 0.63 (0.34–1.18) | 0.148 | ||
| Lobulation | |||||||
| No | 27/45 (60.00%) | 18/45 (40.00%) | 45/458 (9.83%) | Reference | Reference | Reference | Reference |
| Yes | 75/413 (18.16%) | 338/413 (81.84%) | 413/458 (90.17%) | 6.76 (3.54–12.91) | <0.001 | 3.34 (1.38–8.12) | 0.008 |
| Spiculation | |||||||
| No | 79/246 (32.11%) | 167/246 (67.89%) | 246/458 (53.71%) | Reference | Reference | Reference | Reference |
| Yes | 23/212 (10.85%) | 189/212 (89.15%) | 212/458 (46.29%) | 3.89 (2.34–6.47) | <0.001 | 3.15 (1.71–5.82) | <0.001 |
| Boundary | |||||||
| Distinct | 98/417 (23.50%) | 319/417 (76.50%) | 417/458 (91.05%) | Reference | Reference | ||
| Indistinct | 4/41 (9.76%) | 37/41 (90.24%) | 41/458 (8.95%) | 2.84 (0.99–8.17) | 0.053 | ||
| Satellite nodule | |||||||
| No | 97/452 (21.46%) | 355/452 (78.54%) | 452/458 (98.69%) | Reference | Reference | Reference | Reference |
| Yes | 5/6 (83.33%) | 1/6 (16.67%) | 6/458 (1.31%) | 0.06 (0.01–0.47) | 0.008 | 0.05 (0.01–0.44) | 0.007 |
| Vessel convergence sign | |||||||
| No | 78/232 (33.62%) | 154/232 (66.38%) | 232/458 (50.66%) | Reference | Reference | Reference | Reference |
| Yes | 24/226 (10.62%) | 202/226 (89.38%) | 226/458(49.34%) | 4.26 (2.58–7.05) | <0.001 | 3.09 (1.64–5.80) | <0.001 |
| Pleural indentation | |||||||
| No | 60/185 (32.43%) | 125/185 (67.57%) | 185/458 (40.39%) | Reference | Reference | Reference | Reference |
| Yes | 42/273 (15.38%) | 231/273 (84.62%) | 273/458 (59.61%) | 2.64 (1.68–4.14) | <0.001 | 1.10 (0.60–2.03) | <0.001 |
| Nodule type | |||||||
| Solid | 86/274 (31.39%) | 188/274 (68.61%) | 274/458 (59.83%) | Reference | Reference | Reference | Reference |
| GGNs | 3/45 (6.67%) | 42/45 (93.33%) | 45/458 (9.83%) | 6.40 (1.93–21.34) | 0.002 | 28.25 (6.85–116.40) | <0.001 |
| Part‐solid | 13/139 (9.35%) | 126/139 (90.65%) | 139/458(30.35%) | 4.43 (2.37–8.29) | <0.001 | 6.57(3.10–13.94) | <0.001 |
Note: (1) The people with smoking included former and current smokers; (2) Group 45–60 covers 60 years; (3) Group 0.6–1.0 covers the maximum diameter is 0.6 and 1.0 cm; Group 1.0–2.0 covers the maximum diameter is 2.0 cm; Group 2.0–3.0 covers the maximum diameter is 3.0 cm.
p < 0.05.
p < 0.01.
p < 0.001.
Effect evaluation of machine learning models
| Models | Sensitivity | Specificity | Precision | F‐measure | AUC |
|---|---|---|---|---|---|
| NaiveBayes‐1 | 0.775 (0.762–0.786) | 0.688 (0.667–0.695) | 0.725 (0.697–0.750) | 0.720 (0.707–0.730) | 0.646 (0.624–0.663) |
| SMO‐1 | ‐ | ‐ | ‐ | ‐ | 0.500 |
| J48‐1 | 0.754 (0.707–0.771) | 0.745 (0.692–0.766) | 0.661 (0.624–0.700) | 0.685 (0.675–0.708) | 0.561 (0.545–0.587) |
| RandomForest‐1 | 0.702 (0.681–0.716) | 0.682 (0.655–0.728) | 0.662 (0.634–0.677) | 0.679 (0.655–0.693) | 0.554 (0.539–0.563) |
| NaiveBayes‐2 | 0.832 (0.825–0.841) | 0.379 (0.353–0.398) | 0.823 (0.817–0.834) | 0.826 (0.820–0.836) | 0.821 (0.815–0.827) |
| SMO‐2 | 0.847 (0.832–0.858) | 0.397 (0.363–0.440) | 0.837 (0.819–0.850) | 0.836 (0.818–0.849) | 0.725 (0.696–0.746) |
| J48‐2 | 0.846 (0.825–0.869) | 0.372 (0.336–0.413) | 0.837 (0.816–0.863) | 0.838 (0.818–0.861) | 0.714 (0.668–0.747) |
| RandomForest‐2 | 0.844 (0.834–0.852) | 0.444 (0.420–0.467) | 0.835 (0.822–0.846) | 0.828 (0.817–0.834) | 0.816 (0.810–0.827) |
| NaiveBayes‐3 | 0.831 (0.825–0.836) | 0.382 (0.362–0.399) | 0.822 (0.817–0.827) | 0.825 (0.820–0.830) | 0.818 (0.814–0.820) |
| SMO‐3 | 0.835 (0.817–0.843) | 0.496 (0.472–0.549) | 0.828 (0.801–0.839) | 0.812 (0.787–0.821) | 0.670 (0.634–0.684) |
| J48‐3 | 0.834 (0.810–0.845) | 0.462 (0.409–0.497) | 0.822 (0.793–0.836) | 0.817 (0.796–0.830) | 0.741 (0.706–0.763) |
| Random Forest‐3 | 0.822 (0.808–0.834) | 0.419 (0.390–0.451) | 0.810 (0.799–0.824) | 0.813 (0.803–0.827) | 0.768 (0.758–0.790) |
FIGURE 2Proposed decision tree for prevalent nodules in a patient with pulmonary nodules
Thirteen if‐then rules extracted from the decision tree in Figure 2
| Rule 1: IF without lobulation, without spiculation, solid, THEN nodule is benign (22/25 or 80%) |
| Rule 2: IF without lobulation, without spiculation, part‐solid, size ≤ 1.4 cm, THEN nodule is benign (4/5 or 80%) |
| Rule 3: IF without lobulation, without spiculation, part‐solid, size > 1.4 cm, THEN nodule is malignant (4 or 100%) |
| Rule 4: IF without lobulation, without spiculation, GGNs, THEN nodule is benign (6/7 or 86%) |
| Rule 5: IF without lobulation, with spiculation, THEN nodule is malignant (4 or 100%) |
| Rule 6: IF with lobulation, without vessel convergence sign, solid, without spiculation, age ≤ 55, THEN nodule is benign (19/22 or 86%) |
| Rule 7: IF with lobulation, without vessel convergence sign, solid, without spiculation, age >55, size ≤ 1.1, THEN nodule is benign (3/4 or 75%) |
| Rule 8: IF with lobulation, without vessel convergence sign, solid, without spiculation, age >55, size > 1.1, THEN nodule is malignant (25/39 or 64%) |
| Rule 9: IF with lobulation, without vessel convergence sign, solid, with spiculation, THEN nodule is malignant (43/53 or 81%) |
| Rule 10: IF with lobulation, without vessel convergence sign, part‐solid, THEN nodule is malignant (41/46 or 89%) |
| Rule 11: IF with lobulation, without vessel convergence sign, GGNs, THEN nodule is malignant (27/29 or 93%) |
| Rule 12: IF with lobulation, with vessel convergence sign, without satellite nodule, THEN nodule is malignant (197/216 or 91%) |
| Rule 13: IF with lobulation, with vessel convergence sign, with satellite nodule, THEN nodule is benign (3/4 or 75%) |