| Literature DB >> 34887374 |
Hong Lin Wu1, Gao Wu Yan2, Li Cheng Lei3, Yong Du4, Xiang Ke Niu1, Tao Peng1.
Abstract
BACKGROUND Computed tomography (CT)-guided percutaneous transthoracic needle biopsy (PTNB) is an effective means for diagnosing various thoracic diseases. Pneumothorax is the most common complication, and when it becomes life-threatening, urgent medical intervention is required. The purpose of this study was to develop and validate a model that can be used to predict postoperative pneumothorax following CT-guided PTNB. MATERIAL AND METHODS We enrolled 245 patients who completed CT-guided PTNB to develop the model. A random forest (RF) model was built using 15 risk factors (15-RFs). The 7 most critical risk factors (7-RFs) were extracted by feature selection and used to build a new model. The independent external validation data contained 97 patients. Logistic regression (LR), support vector machine (SVM), and decision tree (DT) models were also developed using both 15-RFs and 7-RFs, and their performance was compared with the RF models. RESULTS The length of the aerated lung traversed was identified as the most important risk factor for developing pneumothorax, followed by angle of pleural puncture, lesion depth, lesion size, age, procedure time, and sex. The RF model demonstrated better performance in the development and validation datasets when compared with the LR, SVM, and DT based on 15-RFs and 7-RFs. According to DeLong's test for difference in ROC curves, the RF models based on the 15-RFs and 7-RFs achieved similar classification performance (P>0.05). CONCLUSIONS This study demonstrated the feasibility of using the 7-RFs RF model for predicting postoperative pneumothorax before patients undergo CT-guided PTNB.Entities:
Mesh:
Year: 2021 PMID: 34887374 PMCID: PMC8669970 DOI: 10.12659/MSM.932137
Source DB: PubMed Journal: Med Sci Monit ISSN: 1234-1010
Figure 1Flow diagram for the development and validation data.
Characteristics of development data.
| Risk factors | N=245 | P value | |
|---|---|---|---|
| Pneumothorax (n=108) | Non-pneumothorax (n=137) | ||
| Patients’ characteristics | |||
| Age (years), Mean±SD | 63.4±9.4 | 60.2±10.1 | 0.005 |
| Gender (number), Male/Female | 87 (80.6%)/21 (19.4%) | 87 (63.5%)/50 (36.5%) | 0.003 |
| Lesion characteristics | |||
| Lesion size (mm), Mean±SD | 28.5±14.6 | 27.1±12.1 | 0.798 |
| Lesion depth (mm), Mean±SD | 12.3±10.0 | 10.4±11.3 | 0.034 |
| Lesion location (number), upper lobe (left lung)/lower lobe (left lung)/upper lobe (right lung)/middle lobe (right lung)/lower lobe (right lung) | 19 (17.6%)/21 (19.4%)/ 34 (31.5%)/8 (7.4%)/26 (24.1%) | 29 (21.2%)/24 (17.5%)/ 40 (29.2%)/10 (7.3%)/34 (24.8%) | 0.959 |
| Cavity inside the lesion (number), yes | 12 (11.1%) | 27 (19.7%) | 0.068 |
| Emphysema/bullae (number), yes | 29 (26.9%) | 32 (23.4%) | 0.530 |
| Pneumonitis (number), yes | 66 (61.1%) | 71 (51.8%) | 0.146 |
| Procedure characteristics | |||
| Length of the aerated lung traversed (mm), Mean±SD | 20.2±12.7 | 17.6±14.4 | 0.081 |
| Angle of pleural puncture (°), Mean±SD | 61.1±18.6 | 59.3±18.4 | 0.405 |
| Procedure time (min.), Mean±SD | 10.2±4.8 | 9.2±4.0 | 0.590 |
| Biopsy position (number), supine/prone | 41 (38.0%)/67 (62.0%) | 66 (48.2%)/71 (51.8%) | 0.110 |
| Crossing of pleural indentation (number), yes | 26 (24.1%) | 41 (29.9%) | 0.308 |
| Crossing the interlobar fissure (number), yes | 5 (4.6%) | 4 (2.9%) | 0.513 |
| Number of biopsy (number), 1/2/3 | 100 (92.6%)/7 (6.5%)/1 (0.9%) | 132 (96.4%)/5 (3.6%)/0 (0%) | 0.298 |
Characteristics of validation data.
| Risk factors | N=97 | P value | |
|---|---|---|---|
| Pneumothorax (n=54) | Non-pneumothorax (n=43) | ||
| Patients’ characteristics | |||
| Age (years), Mean±SD | 63.3±11.1 | 62.1±13.1 | 0.654 |
| Gender(number), Male/Female | 31 (57.4%)/23 (42.6%) | 31 (72.1%)/12 (27.9%) | 0.135 |
| Lesion characteristics | |||
| Lesion size (mm), Mean±SD | 24.4±14.3 | 34.2±20.9 | 0.026 |
| Lesion depth (mm), Mean±SD | 6.7±10.5 | 2.6±4.6 | 0.037 |
| Lesion location (number), upper lobe (left lung)/lower lobe (left lung)/upper lobe (right lung)/middle lobe (right lung)/lower lobe (right lung) | 8 (14.8%)/9 (16.7%)/ 9 (16.7%)/8 (14.8%)/20 (37.0%) | 10 (23.3%)/11 (25.6%)/ 12 (27.9%)/3 (7.0%)/7 (16.3%) | 0.083 |
| Cavity inside the lesion (number), yes | 12 (22.2%) | 2 (4.7%) | 0.014 |
| Emphysema/bullae (number), yes | 7 (13.0%) | 9 (20.9%) | 0.294 |
| Pneumonitis (number), yes | 36 (66.7%) | 28 (65.1%) | 0.873 |
| Procedure characteristics | |||
| Length of the aerated lung traversed (mm), Mean±SD | 14.2±16.3 | 9.9±12.4 | 0.186 |
| Angle of pleural puncture (°), Mean±SD | 62.0±16.3 | 62.1±17.8 | 1.000 |
| Procedure time (min.), Mean±SD | 8.3±4.9 | 8.0±4.0 | 0.997 |
| Biopsy position (number), supine/prone | 20 (37.0%)/34 (63.0%) | 16 (37.2%)/27 (62.8%) | 0.986 |
| Crossing of pleural indentation (number), yes | 11 (20.4%) | 7 (16.3%) | 0.607 |
| Crossing the interlobar fissure (number), yes | 8 (14.8%) | 2 (4.7%) | 0.177 |
| Number of biopsy (number), 1/2/3 | 44 (81.5%)/9 (16.7%)/1 (1.9%) | 40 (93.0%)/3 (7.0%)/0 (0%) | 0.217 |
Figure 2Measurement methods. (A) Axial chest CT image in a 61-year-old male patient demonstrates a solid nodule in the right upper lobe. The lesion size is measured along the maximum short-axis diameter (white dotted line). The lesion depth is measured from the costal pleura to the nearest edge of the lesion (yellow double arrows). The length of the aerated lung traversed by the needle is measured from the pleural puncture point to the edge of lesion along the needle path (red double arrows). The angle of pleural puncture is the smallest angle between a line along the needle route and a tangential line to the pleura (white arc). (B) Axial chest CT image in a 78-year-old male patient demonstrates a cavitary lesion in the right upper lobe. The biopsy needle passed through the pleural indentation (white arrow). (C) Axial chest CT image demonstrates the pneumothorax measurement. The longest distance between parietal pleura and visceral pleura (white line).
Figure 3Feature selection. (A) Ranking of important risk factors for the prediction of pneumothorax. (B) Predictive performance (AUCs) of the RF models at each number of risk factors in the development data and the external validation date.
Prediction performance of models based on 15 risk factors.
| Models | Development set | Verification set | ||||||
|---|---|---|---|---|---|---|---|---|
| LR | SVM | DT | RF | LR | SVM | DT | RF | |
| Sensitivity | 0.701 | 0.781 | 0.759 | 0.825 | 0.581 | 0.791 | 0.884 | 0.930 |
| Specificity | 0.500 | 0.574 | 0.676 | 0.806 | 0.667 | 0.611 | 0.556 | 0.759 |
| Accuracy | 0.612 | 0.690 | 0.722 | 0.816 | 0.629 | 0.691 | 0.701 | 0.835 |
| PPV | 0.640 | 0.699 | 0.748 | 0.843 | 0.581 | 0.618 | 0.613 | 0.755 |
| NPV | 0.568 | 0.674 | 0.689 | 0.784 | 0.667 | 0.786 | 0.857 | 0.932 |
PPV – positive predictive value; NPV – negative predictive value; LR – logistic regression; SVM – support vector machine; DT – decision tree; RF – random forest.
Prediction performance of models on 7 risk factors.
| Models | Development set | Verification set | ||||||
|---|---|---|---|---|---|---|---|---|
| LR | SVM | DT | RF | LR | SVM | DT | RF | |
| Sensitivity | 0.657 | 0.657 | 0.788 | 0.847 | 0.767 | 0.791 | 0.907 | 0.953 |
| Specificity | 0.537 | 0.648 | 0.639 | 0.824 | 0.315 | 0.315 | 0.519 | 0.685 |
| Accuracy | 0.604 | 0.653 | 0.722 | 0.837 | 0.515 | 0.526 | 0.691 | 0.804 |
| PPV | 0.643 | 0.703 | 0.735 | 0.859 | 0.471 | 0.479 | 0.600 | 0.707 |
| NPV | 0.552 | 0.598 | 0.704 | 0.809 | 0.630 | 0.654 | 0.875 | 0.949 |
PPV – positive predictive value; NPV – negative predictive value; LR – logistic regression; SVM – support vector machine; DT – decision tree; RF – random forest.
Figure 4Receiver operating characteristic curves (ROCs) and the areas under the curves (AUCs) of models in (A) the development data based on 15 risk factors versus 7 critical risk factors, (B) the validation data based on 15 risk factors versus 7 critical risk factors.
Prediction performance of models on 15 vs 7 risk factors.
| Models (AUC) | Development set | Delong test | Verification set | Delong test | ||
|---|---|---|---|---|---|---|
| 15 risk factors | 7 risk factors | 15 risk factors | 7 risk factors | |||
| LR | 0.667 | 0.636 | 0.272 | 0.672 | 0.599 | 0.148 |
| SVM | 0.789 | 0.698 | 0.011 | 0.813 | 0.658 | 0.012 |
| DT | 0.792 | 0.789 | 0.815 | 0.825 | 0.814 | 0.632 |
| RF | 0.910 | 0.921 | 0.480 | 0.921 | 0.914 | 0.849 |
RF vs LR/SVM/DT (DeLong test, P<0.05).
LR – logistic regression; SVM – support vector machine; DT – decision tree; RF – random forest; AUC – area under the curve.