| Literature DB >> 35645754 |
Sai Chen1, Le-Ping Liu1, Yong-Jun Wang2, Xiong-Hui Zhou1, Hang Dong1, Zi-Wei Chen3, Jiang Wu4, Rong Gui1, Qin-Yu Zhao5.
Abstract
Background: Liver transplantation surgery is often accompanied by massive blood loss and massive transfusion (MT), while MT can cause many serious complications related to high mortality. Therefore, there is an urgent need for a model that can predict the demand for MT to reduce the waste of blood resources and improve the prognosis of patients. Objective: To develop a model for predicting intraoperative massive blood transfusion in liver transplantation surgery based on machine learning algorithms.Entities:
Keywords: liver transplantation; machine learning; massive blood transfusion; prediction model; red cell transfusion
Year: 2022 PMID: 35645754 PMCID: PMC9140217 DOI: 10.3389/fninf.2022.893452
Source DB: PubMed Journal: Front Neuroinform ISSN: 1662-5196 Impact factor: 3.739
Figure 1The flowchart of our study.
Clinical characteristics.
| All Patients ( | Non-MT group ( | MT group ( | |||
|---|---|---|---|---|---|
| Age, mean (SD), year | 46.15 (11.77) | 45.77 (12.01) | 48.96 (9.36) | <0.001 | |
| Sex, M, n (%) | 210 (17.60) | 188 (17.84) | 22 (15.83) | 0.641 | |
| Diagnosis, n (%) | Cirrhosis | 150 (17.46) | 140 (18.49) | 10 (9.80) | 0.019 |
| Liver malignant tumor | 154 (17.93) | 138 (18.23) | 16 (15.69) | ||
| Liver failure | 83 (9.66) | 79 (10.44) | 4 (3.92) | ||
| Alcoholic hepatitis | 42 (4.89) | 33 (4.36) | 9 (8.82) | ||
| Viral hepatitis | 255 (29.69) | 218 (28.80) | 37 (36.27) | ||
| Cholestatic liver disease | 24 (2.79) | 21 (2.77) | 3 (2.94) | ||
| Others | 151 (17.58) | 128 (16.91) | 23 (22.55) | ||
| Portal hypertension, n (%) | 335 (28.08) | 280 (26.57) | 55 (39.57) | 0.002 | |
| Hepatic encephalopathy, n (%) | 136 (11.40) | 117 (11.10) | 19 (13.67) | 0.609 | |
| Ascites, n (%) | 385 (32.27) | 321 (30.46) | 64 (46.04) | <0.001 | |
| Weight, mean (SD), kg | 64.13 (13.24) | 64.38 (13.49) | 62.31 (11.10) | 0.121 | |
| ALB, mean (SD), g/L | 34.77 (6.17) | 34.96 (6.04) | 33.50 (6.86) | 0.045 | |
| ALT, median [Q1, Q3], U/L | 53.85 [26.98, 154.93] | 53.85 [27.00, 150.25] | 51.85 [26.53, 253.50] | 0.729 | |
| APTT, mean (SD), s | 51.21 (20.13) | 50.24 (18.85) | 57.25 (26.09) | 0.010 | |
| AST, median [Q1, Q3], U/L | 72.00 [39.00, 197.35] | 72.00 [38.40, 183.85] | 73.75 [41.22, 283.38] | 0.236 | |
| CR, median [Q1, Q3], μmol/L | 66.90 [55.80, 88.00] | 66.00 [55.38, 85.35] | 71.00 [58.10, 114.95] | 0.010 | |
| DBIL, median [Q1, Q3], μmol/L | 69.45 [15.97, 231.20] | 65.80 [15.30, 237.10] | 85.10 [21.95, 200.45] | 0.498 | |
| GLO, mean (SD), g/L | 26.98 (8.77) | 27.06 (8.59) | 26.48 (9.87) | 0.586 | |
| HB, mean (SD), g/L | 102.47 (25.23) | 104.47 (25.20) | 89.58 (21.39) | <0.001 | |
| HCT, mean (SD), % | 30.47 (7.34) | 31.14 (7.31) | 26.67 (6.36) | <0.001 | |
| INR, median [Q1, Q3], U/L | 1.63 [1.29, 2.30] | 1.63 [1.28, 2.30] | 1.58 [1.36, 2.27] | 0.700 | |
| PLT, median [Q1, Q3], *109/L | 69.00 [42.00, 104.00] | 71.00 [43.00, 105.00] | 63.00 [40.00, 97.00] | 0.191 | |
| PT, median [Q1, Q3], s | 18.95 [15.20, 25.23] | 19.00 [15.20, 25.20] | 18.10 [15.40, 25.35] | 0.881 | |
| TBIL, median [Q1, Q3], μmol/L | 107.55 [33.82, 380.83] | 104.00 [32.30, 384.10] | 140.70 [48.05, 336.50] | 0.414 | |
| FIB, mean (SD), g/L | 4.70 (13.77) | 3.12 (4.90) | 12.10 (30.19) | 0.053 | |
| TP, median [Q1, Q3], g/L | 61.50 [55.00, 68.40] | 62.05 [55.30, 68.70] | 59.45 [53.27, 65.53] | 0.016 | |
| TT, median [Q1, Q3], s | 19.50 [17.40, 22.20] | 19.45 [17.23, 22.10] | 19.70 [17.88, 22.92] | 0.162 | |
| UA, median [Q1, Q3], μmol/L | 224.45 [134.40, 332.05] | 223.00 [135.05, 330.08] | 243.65 [133.20, 357.65] | 0.288 | |
| UREA, median [Q1, Q3], mmol/L | 5.45 [3.87, 8.09] | 5.39 [3.82, 7.62] | 6.58 [4.07, 10.81] | 0.003 | |
| WBC, median [Q1, Q3], *109/L | 5.22 [3.43, 8.09] | 5.28 [3.38, 8.22] | 4.95 [3.50, 7.31] | 0.318 |
Definition of abbreviations: SD, Standard Deviation; ALT, Alanine Aminotransferase; APTT, Activated Partial Thromboplastin Time; AST, Aspartate Aminotransferase; DBIL, Direct Bilirubin; INR, International Standard Ratio; PT, Prothrombin Time; TBIL, Total Bilirubin; TP, Total Protein; TT, Thrombin Time; UA, Uric Acid; WBC, White Blood Cell.
Figure 2Variable distribution. This figure described the distribution of key variables between groups in the training set. Orange represents MT group, blue represents non-MT group, *p < 0.05, **p < 0.01.
Figure 3Heatmap. The value in the grid corresponding to the abscissa and ordinate is the correlation value of the two indicators. Corresponding colors and values indicate the degree of relevance.
Prediction model performance.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| CatBoost | 0.81 (0.75–0.87) | 68 (63–73) | 0.55 | 89 (79–98) | 66 (60–70) | 0.41 (0.32–0.49) |
| LightGBM | 0.75 (0.68–0.82) | 70 (65–75) | 0.45 | 76 (62–88) | 69 (65–74) | 0.38 (0.29–0.48) |
| XGBoost | 0.75 (0.67–0.81) | 67 (63–72) | 0.46 | 80 (67–90) | 66 (60–71) | 0.37 (0.28–0.45) |
| KNN | 0.74 (0.66–0.81) | 70 (66–75) | 0.43 | 73 (59–86) | 70 (65–75) | 0.38 (0.28–0.47) |
| Naïve Bayes | 0.73 (0.64–0.80) | 69 (64–74) | 0.40 | 71 (57–83) | 69 (63–74) | 0.36 (0.26–0.45) |
| RF | 0.72 (0.63–0.80) | 74 (69–78) | 0.43 | 68 (53–82) | 75 (70–80) | 0.39 (0.29–0.49) |
| AdaBoost | 0.72 (0.65–0.80) | 62 (58–67) | 0.38 | 78 (65–89) | 60 (55–66) | 0.34 (0.25–0.41) |
| LR | 0.72 (0.65–0.78) | 51 (46–57) | 0.41 | 96 (88–100) | 45 (40–51) | 0.32 (0.25–0.40) |
| GBDT | 0.70 (0.63–0.77) | 56 (52–62) | 0.35 | 82 (70–93) | 53 (48–59) | 0.32 (0.24–0.39) |
| MLP | 0.69 (0.61–0.76) | 56 (50–61) | 0.34 | 82 (69–93) | 52 (46–57) | 0.31 (0.23–0.38) |
| SVM | 0.66 (0.57–0.74) | 55 (50–61) | 0.28 | 75 (62–88) | 53 (47–58) | 0.29 (0.21–0.37) |
Definition of abbreviations: CatBoost, Categorical Boosting; LightGBM, Light Gradient Boosting Machine; XGBOOST, Extremely Gradient Boosting; KNN, K-Nearest Neighbor; RF, Random Forest; AdaBoost, Adaptive boosting; LR, Logistic Regression; GBDT, Gradient Boosting Decision Tree; MLP, Multi-Layer Perceptron; SVM, Support Vector Machine.
Figure 4Performance of models and key features. (A) Receiver operating characteristic curves for the machine learning models and logistic regression. (B) Relative importance of variables included in CatBoost model. (C) Relative importance of variables included in LightGBM model. (D) Relative importance of variables included in XGBoost model.
Prospective verification.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| CatBoost | 0.75 (0.60–0.88) | 63 (50–78) | 0.41 | 100 (100–100) | 41 (24–60) | 0.67 (0.49–0.80) |
| LightGBM | 0.72 (0.54–0.87) | 78 (65–89) | 0.51 | 65 (39–88) | 87 (73–97) | 0.69 (0.44–0.86) |
| XGBoost | 0.72 (0.54–0.87) | 72 (57–85) | 0.45 | 78 (56–95) | 69 (50–86) | 0.67 (0.47–0.82) |
| LR | 0.61 (0.42–0.77) | 59 (43–72) | 0.30 | 89 (71–100) | 42 (23–59) | 0.61 (0.43–0.75) |
Figure 5SHAP analysis of the CatBoost model on the validation set. This figure described data from the validation set. Each point represents a sample, and a wide area means a large number of samples are gathered. The color on the right indicates the value of the feature, red indicates that the feature value is high, and blue indicates that the feature value is low.
Figure 6Example of tool usage. Entering the specific input value of each patient to obtain the specific output value. Showing the contribution of each indicator to the prediction result.