| Literature DB >> 34969378 |
Siyu Zeng1, Lele Li2, Yanjie Hu3, Li Luo1, Yuanchen Fang4.
Abstract
BACKGROUND: For liver cancer patients, the occurrence of postoperative complications increases the difficulty of perioperative nursing, prolongs the hospitalization time of patients, and leads to large increases in hospitalization costs. The ability to identify influencing factors and to predict the risk of complications in patients with liver cancer after surgery could assist doctors to make better clinical decisions.Entities:
Keywords: Cancer of the liver; Complication; Machine learning; Risk prediction
Mesh:
Year: 2021 PMID: 34969378 PMCID: PMC8719378 DOI: 10.1186/s12911-021-01731-3
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Attributes of the dataset
| NO | Attributes | Type | Description |
|---|---|---|---|
| 1 | L_complication | Discrete | Occurrence of complications 0: No 1:Yes |
| 2 | Resection range | Discrete | Hepatic segments included in liver cancer surgery 1:hepatic segment 1 2: hepatic segment 2 3: hepatic segment 3 4: hepatic segment 4 |
| 3 | Type of incision | Discrete | Surgical incision shape 1: Surgical incision along the costal margin 2: Surgical incision along the midline of abdomen |
| 4 | Length of incision | Continuous | The length of the incision during the surgery |
| 5 | BMI | Continuous | Body Mass Index: the ratio of weight in kg to height in meters squared, a standard measure |
| 6 | CA | Continuous | Abdominal cavity depth: The distance from umbilicus to vertebral lip |
| 7 | Diameter of tumor | Continuous | The diameter of the biggest tumor |
| 8 | Number of tumors | Discrete | Count of tumors in the liver |
| 9 | Duration | Continuous | Time from the skin incision to definitive abdominal closure |
| 10 | Bleeding | Continuous | Volume of bleeding (ml): The amount of bleeding during the surgery |
| 11 | Transfusion | Discrete | Whether transfusion was needed 0: No 1: Yes |
| 12 | Plasma infusion | Continuous | Volume of plasma infusion (ml) input during surgery |
| 13 | Erythrocyte suspension | Continuous | Volume of erythrocyte suspension (ml) during operation |
Performance of the different algorithms
| Type of ML | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|
| Logistic Regression (LR) | 0.83 [0.77,0.89] | 0.72 | 0.60 | 0.82 [0.66,0.98] |
| T-wise Regression (TR) | 0.79 [0.73,0.84] | 0.70 | 0.59 | 0.79 [0.66,0.92] |
| Decision Tree: C5.0 (C5.0) | 0.92 [0.83,1] | 0.87 | 0.94 | 0.91 [0.77,1] |
| Decision Tree: CART (CART) | 0.87 [0.80,0.94] | 0.69 | 0.91 | 0.82 [0.70,0.94] |
| Support Vector Machine (SVM) | 0.81 [0.75,0.87] | 0.50 | 0.94 | 0.72 [0.59,0.85] |
| Random Forest (RF) | 0.77 [0.71,0.84] | 0.56 | 0.86 | 0.7 [0.60,0.81] |
Patient characteristics (n = 175)
| Risk factor | Mean | SD | Min | P25 | P50 | P75 | Max |
|---|---|---|---|---|---|---|---|
| BMI (kg/m2) | 22.52 | 2.96 | 16.41 | 20.25 | 22.49 | 24.27 | 32.85 |
| Length of incision (cm) | 23.30 | 4.61 | 13 | 19 | 24 | 27 | 35 |
| CA | 0.34 | 0.04 | 0.2 | 0.31 | 0.34 | 0.37 | 0.46 |
| Diameter of tumor (cm) | 5.69 | 3.39 | 1.0 | 3.0 | 5.0 | 6.9 | 18.0 |
| Duration (min) | 201.04 | 58.89 | 70 | 160 | 195 | 235 | 375 |
| Bleeding (ml) | 380 | 307.82 | 50 | 200 | 300 | 500 | 1600 |
Baseline patient characteristics in the complications group and non-complications group (N = 175)
| Variables | Variable category (assignment) | Sample size (N %) | Complications group (N=58) | Non-complications group (N=116) | Kruskal-Wallis chi-squared | P-Value |
|---|---|---|---|---|---|---|
| Age | 18–45:1 | 50 (28.57%) | 14 (24.14%) | 36 (30.77%) | 0.83 | 0.3621 |
| 45–65:2 | 125 (71.42%) | 44 (75. | 81 (69.23%) | |||
| Gender | Male (1) | 144 (82.29%) | 48 (82.76%) | 96 (82.05%) | 0.01 | 0.9084 |
| Female (2) | 31 (19.61%) | 10 (17.24%) | 21(17.95%) | |||
| BMI (kg/m2) | Continuous variable ( | 22.52 | 21.93 | 22.81 | 3.16 | 0.0750 |
| Tumor diameter | Continuous variable ( | 5.69 | 6.58 | 5.26 | 70.94 | 0.1577 |
| Tumor number | 1 (1) | 145 (82.86%) | 46 (79.31%) | 99 (84.61%) | 5.19 | 0.3935 |
| 2 (2) | 16 (9.14%) | 4 (6.90%) | 12 (10.23%) | |||
| 3 (3) | 7 (4.00%) | 4 (6.90%) | 3 (2.56%) | |||
| 4 (4) | 4 (2.29%) | 2 (3.45%) | 2 (1.70%) | |||
| 5 (5) | 2 (1.14%) | 1 (1.72%) | 1 (0.85%) | |||
| 6 (6) | 1 (0.57%) | 1 (1.72%) | 0 (0%) | |||
| CA | Continuous variable ( | 0.34 | 0.33 | 0.5661 | ||
| Incision type | Kocher’s incision (1) | 129 (73.71%) | 51 (87.93%) | 78 (66.67%) | 9.00 | 0.0027 |
| Abdominal incision (2) | 46 (26.28%) | 7 (12.07%) | 39 (33.33%) | |||
| Incision length | Continuous variable ( | 23.31 | 24.78 | 22.58 | 37.70 | 0.0494 |
| Duration | Continuous variable ( | 201.04 | 234.1 | 184.64 | 64.60 | 0.0401 |
| Incision range | One liver segment (1) | 37 (21.14%) | 5 (8.62%) | 32 (27.35%) | 16.29 | 0.0009 |
| Two liver segments (2) | 55 (31.42%) | 16 (27.59%) | 39 (33.33%) | |||
| Three liver segments (3) | 44 (25.14%) | 15 (25.86%) | 29 (24.78%) | |||
| Four liver segments (4) | 39 (22.28%) | 22 (37.93%) | 17 (14.53%) | |||
| Bleeding | Continuous variable ( | 380 | 516.72 | 312.22 | 34.27 | 0.0243 |
| Transfusion | Yes (1) | 10(5.71%) | 6(3.43%) | 4(2.28%) | 3.43 | 0.0630 |
| No (2) | 165(94.28%) | 52(29.71) | 113(64.67%) | |||
| Erythrocyte suspension | Continuous variable ( | 0.12+0.54 | 0.28 | 0.042 | 5.05 | 0.0240 |
| Complications | No (0) | 117 (66.48%) | ||||
| Yes (1) | 58 (33.52%) | |||||
Fig. 1a The distribution of length of incision. b The distribution of BMI. c The distribution of CA. d The distribution of duration. e The distribution of duration. f The distribution of Volume of bleeding during operation. g The distribution of Volume of plasma infusion during operation. h The distribution of patient Volume of erythrocyte suspension during operation
Fig. 2ROC analysis of the different ML algorithms. The AUC for each algorithm is indicated on the diagram. The C5.0 decision tree algorithm is had the best AUC
Comparison of different models
| No | Authors | Techniques | Diagnosis | Accuracy |
|---|---|---|---|---|
| 1 | Our best method | C5.0 decision tree | Liver cancer | 0.9245 |
| 2 | Ming et al. (2019) [ | ML-adaptive | Breast cancer | 0.9017 |
| 3 | Bronsert et al. (2019) [ | Binomial generalized linear model | Various diagnosis from electronic health record | 0.88 |
| 4 | Feng et al. (2019) [ | Logistic regression and Twenty-two machine learning (ML) models | Traumatic brain injuries | 0.88 |
| 5 | Abd El-Salam et al. (2019) [ | Bayesian Nets | Liver cirrhosis | 0.689 |
Fig. 3Decision curve analysis of different models for the prediction of postoperative complication risk in liver resection patients
Top five risk factors in descending order of importance for the five ML algorithms
| Type of ML | First factor | Second factor | Third factor | Fourth factor | Fifth factor |
|---|---|---|---|---|---|
| Decision Tree: C5.0 | Duration | BMI | CA | Plasma infusion | Length of incision |
| Decision Tree: CART | Duration | Diameter of tumor | BMI | CA | Length of incision |
| Support Vector Machine (SVM) | Duration | BMI | CA | Erythrocyte suspension | Length of incision |
| Random Forest (RF) | Duration | BMI | CA | Diameter of tumor | bleeding |
Probability of belonging to each level of the dependent variable
| Risk factor | Estimate | Std. Error | z value | Pr ( >|z|) |
|---|---|---|---|---|
| Length of incision | 0.105457 | 0.047442 | 2.223 | 0.026227* |
| BMI | − 0.242907 | 0.070834 | − 3.429 | 0.000605** |
| Duration | 0.017495 | 0.003912 | 4.472 | 7.73e−06** |
*P < 0.05; **P < 0.01