| Literature DB >> 32690077 |
Yueyang Zhao1, Li Fang1, Lei Cui2, Song Bai3.
Abstract
BACKGROUND: Surgical resection of pheochromocytoma may lead to high risk factors for intraoperative hemodynamic instability (IHD), which can be life-threatening. This study aimed to investigate the risk factors that could predict IHD during pheochromocytoma surgery by data mining.Entities:
Keywords: Data mining; Decision trees; Logistic regression; Naive Bayes; Pheochromocytoma; Random forest; Relief-F
Mesh:
Year: 2020 PMID: 32690077 PMCID: PMC7370474 DOI: 10.1186/s12911-020-01180-4
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
The population characteristics of patients
| Without IHD | With IHD | ||
|---|---|---|---|
| Mean age (years) | 51.9 ± 12.3 | 54.0 ± 13.8 | 0.233 |
| Sex (male/female) | 110 (52.6) / 99 (47.4) | 31 (41.9) / 43 (58.1) | 0.112 |
| BMI (kg/m2) | 24.1 ± 3.5 | 21.9 ± 2.7 | < 0.001 |
| ASA score 1/2/3 | 52(24.9)/136(65.1)/21(10.0) | 14(18.9)/53(71.6)/7(9.5) | 0.548 |
| Diabetes mellitus | 61 (29.2) | 23 (31.1) | 0.759 |
| Coronary heart disease | 66 (31.6) | 37 (50.0) | 0.005 |
| Hypertension Normal/Intermittent/Continuous | 82(39.2)/47(22.5)/80(38.3) | 30(40.5)/18(24.3)/26(35.1) | 0.883 |
| Arrhythmia | 12 (5.7) | 4 (5.4) | 0.914b |
| Tumor side (left/right) | 103 (49.3) / 106 (50.7) | 39 (52.7) / 35 (47.3) | 0.613 |
| Radiographic tumor size (cm) | 5.2 ± 2.5 | 6.5 ± 3.1 | < 0.001 |
| Tumor necrosis | 69 (33.0) | 33 (44.6) | 0.075 |
| Tumor enhanced CT difference (Hu) | 43.2 ± 20.6 | 45.6 ± 20.2 | 0.435 |
| Use of α adrenoreceptor antagonists | 115 (55.0) | 42 (56.8) | 0.797 |
| Use of crystal/colloid fluid | 118 (56.5) | 29 (39.2) | 0.011 |
| Use of blood transfusion | 54 (25.8) | 15 (20.3) | 0.338 |
| 24-h urine metanephrines/ normal upper limit | 1.4 (0.9–2.2) | 1.47 (0.9–2.7) | 0.153a |
| Laparoscopic vs. Open | 109 (52.2) / 100 (47.8) | 42 (56.8) / 32 (43.2) | 0.495 |
Continuous variables with normal distribution are reported as the mean ± standard deviation (SD), while non-normal continuous variables as the median (interquartile range) and categorical variables as numbers (percentages). Student’s t-test was used to compare the mean values of two continuous normally distributed variables and the Mann–Whitney U-test was used to determine mean values of two continuous non-normally distributed variables. The chi-squared or Fisher’s exact test was used for categorical variables
a Mann–Whitney U-test
b Fisher’s exact test
BMI body mass index; ASA American Society of Anesthesiologists; CT computed tomography; IHD intraoperative hemodynamic instability
Fig. 1Flow chart for predicting IHD during pheochromocytoma surgery
Indicator weights obtained by Relief-F
| Attributes | Weights |
|---|---|
| ctvalue | −2.8871 |
| prevma | −3.3417 |
| arrhythmia | −4.4444 |
| age | −4.6933 |
| bmi | −6.7295 |
| size | −9.2895 |
| asa | −13.3596 |
| preblood | −19.3992 |
| hypertension | −22.7188 |
| dm | −23.1212 |
Accuracy and AUC values of all models
| hold out 80/20 | hold out 70/30 | hold out 60/40 | CV 5 fold | CV 10 fold | CV 15 fold | bootstrap 50 | bootstrap 100 | bootstrap 200 | ||
|---|---|---|---|---|---|---|---|---|---|---|
| Numerical IHD Dataset | ||||||||||
| Logistic regression | Accuracy | 0.7018 | 0.7529 | 0.7368 | 0.7386 | 0.7278 | 0.7207 | 0.6223 | 0.7486 | 0.7470 |
| AUC | 0.5374 | 0.6392 | 0.6105 | 0.6096 | 0.5951 | 0.5597 | 0.5388 | 0.6343 | 0.6102 | |
| Naive Bayes | Accuracy | 0.7544 | 0.7412 | 0.7719 | 0.7245 | 0.7319 | 0.7312 | 0.6524 | 0.7705 | 0.7590 |
| AUC | 0.7392 | 0.7791 | 0.7999 | 0.6591 | 0.6740 | 0.7041 | 0.4966 | 0.6786 | 0.7719 | |
| CART | Accuracy | 0.7719 | 0.7059 | 0.6930 | 0.7031 | 0.7036 | 0.7060 | 0.6567 | 0.7377 | 0.7349 |
| AUC | 0.6827 | 0.3797 | 0.6999 | 0.6787 | 0.6097 | 0.6587 | 0.4495 | 0.6862 | 0.6694 | |
| C4.5 | Accuracy | 0.7193 | 0.7294 | 0.7544 | 0.7563 | 0.7284 | 0.7528 | 0.7167 | 0.7268 | 0.6747 |
| AUC | 0.6520 | 0.6792 | 0.7569 | 0.6727 | 0.6991 | 0.7480 | 0.4422 | 0.6151 | 0.6233 | |
| C5.0 | Accuracy | 0.6667 | 0.6824 | 0.7544 | 0.7246 | 0.7318 | 0.7493 | 0.6652 | 0.7268 | 0.6747 |
| AUC | 0.6478 | 0.7132 | 0.7716 | 0.6514 | 0.7150 | 0.7146 | 0.4847 | 0.3861 | 0.6498 | |
| C5.0 boosted | Accuracy | 0.7018 | 0.7882 | 0.7544 | 0.7706 | 0.7499 | 0.7596 | 0.6695 | 0.7268 | 0.7590 |
| AUC | 0.6420 | 0.7710 | 0.7686 | 0.7415 | 0.7130 | 0.7510 | 0.6600 | 0.6988 | 0.7849 | |
| Random Forest | Accuracy | 0.7544 | 0.7765 | 0.8023 | 0.8025 | 0.8123 | 0.7639 | 0.8033 | 0.7952 | |
| AUC | 0.8181 | 0.8524 | 0.7943 | 0.8268 | 0.8274 | 0.6923 | 0.8538 | 0.8533 | ||
| Categrical IHD dataset | ||||||||||
| Logistic regression | Accuracy | 0.6842 | 0.7411 | 0.7544 | 0.7563 | 0.7493 | 0.7483 | 0.5794 | 0.7377 | 0.7349 |
| AUC | 0.5257 | 0.6448 | 0.6220 | 0.6255 | 0.6442 | 0.6322 | 0.5535 | 0.6191 | 0.6179 | |
| Naive Bayes | Accuracy | 0.7544 | 0.7412 | 0.7632 | 0.7245 | 0.7319 | 0.7312 | 0.6481 | 0.7650 | 0.7590 |
| AUC | 0.7359 | 0.7812 | 0. 7986 | 0.6565 | 0.6745 | 0.7012 | 0.4976 | 0.6580 | 0.7711 | |
| CART | Accuracy | 0.7368 | 0.7059 | 0.6930 | 0.7031 | 0.7108 | 0.7097 | 0.6567 | 0.7377 | 0.7349 |
| AUC | 0.6653 | 0.3797 | 0. 6999 | 0.6787 | 0.5971 | 0.6575 | 0.4495 | 0.6862 | 0.6694 | |
| C4.5 | Accuracy | 0.7193 | 0.7412 | 0.7632 | 0.7456 | 0.7461 | 0.7774 | 0.7554 | 0.6831 | 0.7108 |
| AUC | 0.4427 | 0.7037 | 0. 7580 | 0.6784 | 0.6818 | 0.7365 | 0.4641 | 0.5457 | 0.6575 | |
| C5.0 | Accuracy | 0.7193 | 0.6706 | 0.7544 | 0.7316 | 0.7459 | 0.7528 | 0.6395 | 0.7268 | 0.6747 |
| AUC | 0.6171 | 0.6939 | 0. 7716 | 0.6775 | 0.6983 | 0.6994 | 0.6209 | 0.3861 | 0.6701 | |
| C5.0 boosted | Accuracy | 0.7544 | 0.7529 | 0.7719 | 0.7598 | 0.7562 | 0.7943 | 0.6395 | 0.7541 | 0.7470 |
| AUC | 0.7575 | 0.7283 | 0. 7084 | 0.7318 | 0.7335 | 0.7947 | 0.6209 | 0.7169 | 0.7596 | |
| Random Forest | Accuracy | 0.7719 | 0.7765 | 0.8093 | 0.8130 | 0.8123 | 0.7811 | 0.8197 | 0.7952 | |
| AUC | 0.8198 | 0.8597 | 0.7782 | 0.8194 | 0.8179 | 0.7064 | 0.8542 | 0.8322 | ||
The confusion matrix of the random forest model
| Actual classes | |||
|---|---|---|---|
| Positive | Negative | ||
| Predicted classes | Positive | 85 (True Positive, TP) | 2 (False Positive, FP) |
| Negative | 15 (False Negative, FN) | 12 (True Negative, TN) | |
Fig. 2Comparison of multiple evaluation indicators
Fig. 3Receiver-Operating Characteristic curve for prediction of hemodynamics instability
Fig. 4Visualization of important attribute scores
Important attribute scores according to the improved random forest model
| Attributes | Importance Scores of random forest | Attributes | Importance Scores of random forest | Attributes | Relief-F |
|---|---|---|---|---|---|
| Mean Decrease Accuracy | Mean Decrease Gini | Weight of attribute | |||
| bmi | 19.3095 | bmi | 16.688 | ctvalue | −2.8871 |
| size | 9.6143 | size | 12.2722 | prevma | −3.3417 |
| asa | 5.7061 | prevma | 9.5934 | arrhythmia | −4.4444 |
| hypertension | 4.6416 | ctvalue | 9.3884 | age | −4.6933 |
| ctvalue | 4.6293 | age | 8.0945 | bmi | −6.7295 |
| prevma | 1.3311 | hypertension | 2.9614 | size | −9.2895 |
| preblood | 0.8616 | asa | 2.3838 | asa | −13.3596 |
| arrhythmia | 0.2419 | preblood | 1.5554 | preblood | −19.3992 |
| dm | 0.0276 | dm | 1.2103 | hypertension | −22.7188 |
| age | −0.4422 | arrhythmia | 0.3524 | dm | −23.1212 |