| Literature DB >> 36186967 |
Tao Yu1, Runnan Shen2, Guochang You2, Lin Lv2, Shimao Kang2, Xiaoyan Wang2, Jiatang Xu3, Dongxi Zhu2, Zuqi Xia2, Junmeng Zheng2,3, Kai Huang2,3.
Abstract
Background: Prevention is highly involved in reducing the incidence of post-thrombotic syndrome (PTS). We aimed to develop accurate models with machine learning (ML) algorithms to predict whether PTS would occur within 24 months. Materials and methods: The clinical data used for model building were obtained from the Acute Venous Thrombosis: Thrombus Removal with Adjunctive Catheter-Directed Thrombolysis study and the external validation cohort was acquired from the Sun Yat-sen Memorial Hospital in China. The main outcome was defined as the occurrence of PTS events (Villalta score ≥5). Twenty-three clinical variables were included, and four ML algorithms were applied to build the models. For discrimination and calibration, F scores were used to evaluate the prediction ability of the models. The external validation cohort was divided into ten groups based on the risk estimate deciles to identify the hazard threshold.Entities:
Keywords: deep vein thrombosis; endovascular; machine learning; post-thrombotic syndrome; prognosis
Year: 2022 PMID: 36186967 PMCID: PMC9523080 DOI: 10.3389/fcvm.2022.990788
Source DB: PubMed Journal: Front Cardiovasc Med ISSN: 2297-055X
FIGURE 1Model development and evaluation pipeline. ATTRACT, Acute Venous Thrombosis: Thrombus Removal with Adjunctive Catheter-Directed Thrombolysis; BIOLINCC, Biologic Specimen and Data Repository Information Coordinating Center; PTS, post-thrombotic syndrome; XGBoost, eXtreme gradient boosting; GBDT, gradient boosting decision tree; RF, random forest; LR, logistic regression; DVT, deep vein thrombosis.
Baseline characteristics and outcome of derivation cohort and validation cohort.
| Characteristics and outcome | Derivation cohort ( | Validation cohort ( | |
| Treatment | 0.011 | ||
| Using PCDT with anticoagulation | 279 (50.3%) | 43 (36.8%) | |
| Using anticoagulation only | 276 (49.7%) | 74 (63.2%) | |
| DVT type | 0.055 | ||
| Extend to Iliac vein | 313 (56.4%) | 54 (46.2%) | |
| Isolated femoropopliteal | 242 (43.6%) | 63 (53.8%) | |
| Age | 54.00 [44.00, 62.00] | 59.00 [48.00, 67.00] | 0.002 |
| Gender | 0.004 | ||
| Male | 349 (62.9%) | 56 (47.9%) | |
| Female | 206 (37.1%) | 61 (52.1%) | |
|
| |||
| Hypertension | 242 (43.6%) | 15 (12.8%) | <0.001 |
| Diabetes mellitus | 91 (16.4%) | 8 (6.8%) | 0.012 |
| High cholesterol | 176 (31.7%) | 8 (6.8%) | <0.001 |
| Asthma | 57 (10.3%) | 3 (2.6%) | 0.013 |
| COPD | 22 (4.0%) | 2 (1.7%) | 0.357 |
| MI | 25 (4.5%) | 4 (3.4%) | 0.783 |
| CHF | 26 (4.7%) | 1 (0.9%) | 0.097 |
| Height | 175.00 [165.10, 182.88] | 164.00 [156.00, 170.00] | <0.001 |
| Weight | 93.00 [80.95, 112.14] | 62.50 [57.00, 71.50] | <0.001 |
| BMI | 30.84 [26.98, 36.17] | 23.87 [21.31, 26.20] | <0.001 |
|
| 0.029 | ||
| Right | 209 (37.7%) | 31 (26.5%) | |
| Left | 346 (62.3%) | 86 (73.5%) | |
| Previous VTE | 130 (23.4%) | 23 (19.7%) | 0.446 |
|
| |||
| Major surgery | 48 (8.6%) | 25 (21.4%) | <0.001 |
| Hospitalization | 55 (9.9%) | 14 (12.0%) | 0.618 |
| Plaster cast immob | 15 (2.7%) | 3 (2.6%) | 1 |
| Childbirth | 7 (1.3%) | 5 (4.3%) | 0.064 |
| Inpatient qualify DVT | 92 (16.6%) | 17 (14.5%) | 0.683 |
| Taken aspirin | 119 (21.4%) | 12 (10.3%) | 0.008 |
| SOX-PTS score | 2.00 [1.00, 3.00] | 1.00 [0.00, 1.00] | <0.001 |
| PTS in 24 Months | 327 (58.9%) | 38 (32.5%) | <0.001 |
PCDT, pharmacomechanical catheter-directed thrombolysis; DVT, deep vein thrombosis; COPD, chronic obstructive pulmonary disease; MI, myocardial infarction; CHF, congestive heart failure; BMI, body mass index; VTE, venous thromboembolism; PTS, post-thrombotic syndrome.
All values included in the machine learning model had missing value rate lower than 1%.
FIGURE 2Radar plot for the seven important predictors of post-thrombotic syndrome in 24 months. Higher value means more importance of the features determined by different ML algorithms. PTS, post-thrombotic syndrome; XGBoost, eXtreme gradient boosting; VTE, venous thromboembolism; BMI, body mass index; COPD, chronic obstructive pulmonary disease; DVT, deep vein thrombosis; ML, machine learning.
FIGURE 3Receiver operating characteristic curves for post-thrombotic syndrome at 2-year follow-up. ROC, receiver operating characteristic curve; PTS, post-thrombotic syndrome; AUC, area under the curve; XGboost, eXtreme gradient boosting.
Net reclassification improvement and integrated discrimination improvement results of machine learning models compared with the SOX-PTS score.
| Different methods compared with SOX-PTS | Derivation cohort | Validation cohort | ||
| NRI (95% CI/ | IDI (95% CI/ | NRI (95% CI/ | IDI (95% CI/ | |
| XGBoost | 0.621 (0.461–0.782/<0.001) | 0.098 (0.074–0.121/<0.001) | 0.351 (0.095–0.607/0.007) | 0.176 (0.091– 0.260/<0.001) |
| LR | 0.642 (0.484–0.801/<0.001) | 0.082 (0.062–0.103/<0.001) | 0.518 (0.264–0.772/<0.001) | 0.239 (0.154–0.324/<0.001) |
| RF | 0.664 (0.507–0.820/<0.001) | 0.124 (0.099–0.149/<0.001) | 0.350 (0.077–0.622/0.012) | 0.078 (−0.001–0.157/0.054) |
| GBDT | 0.672 (0.514–0.830/<0.001) | 0.102 (0.078–0.125/<0.001) | 0.404 (0.141–0.668/0.003) | 0.144 (0.062–0.227/<0.001) |
XGBoost, extreme gradient boosting; LR, logistic regression; RF, random forest; GBDT, gradient boosting decision tree; NRI, net reclassification improvement; IDI, integrated discrimination improvement.
FIGURE 4Risk of post-thrombotic syndrome within 24 months according to deciles of event probability based on four machine learning models in the validation cohort. PTS, post-thrombotic syndrome; ML, machine learning; XGBoost, eXtreme gradient boosting.
Outcome in each risk groups defined by prediction probability of the XGBoost model in external validation cohort.
| Outcome | Low-risk group ( | Intermediate-risk group ( | High-risk group ( | |
|
| ||||
| Baseline Villalta score | 1.00 [1.00, 3.00] | 2.00 [1.00, 3.50] | 9.00 [3.00, 13.00] | <0.001 |
| 6 Month Villalta score | 0.00 [0.00, 2.00] | 1.00 [0.00, 2.25] | 5.00 [4.00, 10.75] | <0.001 |
| 12 Month Villalta score | 0.00 [0.00, 1.00] | 0.00 [0.00, 1.50] | 4.00 [1.00, 6.00] | 0.002 |
| 18 Month Villalta score | 1.00 [0.00, 1.00] | 1.00 [0.00, 2.75] | 3.00 [2.00, 6.50] | 0.001 |
| 24 Month Villalta score | 1.00 [0.00, 2.00] | 1.00 [0.00, 3.00] | 3.00 [0.00, 5.25] | 0.073 |
| 6 Month VCSS score | 0.00 [0.00, 1.00] | 1.00 [0.00, 1.25] | 3.00 [2.00, 5.00] | <0.001 |
| 12 Month VCSS score | 0.00 [0.00, 1.00] | 0.00 [0.00, 1.00] | 2.00 [1.00, 3.50] | 0.002 |
| 18 Month VCSS score | 1.00 [0.00, 1.00] | 1.00 [0.00, 1.25] | 2.00 [1.00, 3.00] | 0.004 |
| 24 Month VCSS score | 1.00 [0.00, 1.00] | 1.00 [0.00, 2.00] | 2.00 [0.00, 3.25] | 0.057 |
| Baseline SOX-PTS score | 0.00 [0.00, 1.00] | 0.00 [0.00, 1.00] | 1.00 [1.00, 2.00] | <0.001 |
|
| ||||
| PTS in 24 months | 9 (14.8%) | 12 (34.3%) | 17 (81.0%) | <0.001 |
| Moderate to severe PTS in 24 months | 0 (0.0%) | 2 (5.7%) | 6 (28.6%) | <0.001 |
| Severe PTS in 24 months | 0 (0.0%) | 0 (0.0%) | 4 (19.0%) | <0.001 |
Low-risk group was defined as patients whose XGBoost prediction ability is lower than 30%, Intermediate-risk group was defined as patients whose XGBoost prediction ability is between 30 and 40%, High-risk group was defined as patients whose XGBoost prediction ability is higher than 40%.
VCSS, venous clinical severity scores; PTS, post-thrombotic syndrome.