| Literature DB >> 35755847 |
Minyue Yin1, Rufa Zhang2, Zhirun Zhou3, Lu Liu1, Jingwen Gao1, Wei Xu1, Chenyan Yu1, Jiaxi Lin1, Xiaolin Liu1, Chunfang Xu1, Jinzhou Zhu1.
Abstract
Background: Machine learning (ML) algorithms are widely applied in building models of medicine due to their powerful studying and generalizing ability. This study aims to explore different ML models for early identification of severe acute pancreatitis (SAP) among patients hospitalized for acute pancreatitis.Entities:
Keywords: artificial intelligence; automated machine learning; logistic regression analysis; predictive models; severe acute pancreatitis
Mesh:
Year: 2022 PMID: 35755847 PMCID: PMC9226483 DOI: 10.3389/fcimb.2022.886935
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 6.073
Figure 1The flow chart of this study.
Demographic and clinical characteristics of patients in training, validation and test groups.
| Variables | The developing dataset ( | The test dataset ( | |||||
|---|---|---|---|---|---|---|---|
| Group | Non-SAP ( | SAP ( |
| Non-SAP ( | SAP ( |
| |
| Sex (%) | Male | 518 (58.3) | 76 (61.3) | 0.597 | 133 (66.5) | 9 (75.0) | 0.770 |
| Female | 370 (41.7) | 48 (38.7) | 67 (33.5) | 3 (25.0) | |||
| Age (year) (median [IQR]) | 52.00 [38.00, 65.00] | 45.50 [35.00, 61.75] | 0.141 | 47.00 [35.75, 65.25] | 44.00 [33.75, 58.75] | 0.810 | |
| Etiology (%) | Biliary | 402 (45.3) | 42 (33.9) | <0.001 | 88 (44.0) | 5 (41.7) | 0.461 |
| Hyperlipidemia | 158 (17.8) | 44 (35.5) | 37 (18.5) | 4 (33.3) | |||
| Alcoholic | 48 (5.4) | 5 (4.0) | 21 (10.5) | 0 (0.0) | |||
| Others | 280 (31.5) | 33 (26.6) | 54 (27.0) | 3 (25.0) | |||
| Smoke (%) | No | 767 (86.4) | 108 (87.1) | 0.936 | 161 (80.5) | 9 (75.0) | 0.927 |
| Yes | 121 (13.6) | 16 (12.9) | 39 (19.5) | 3 (25.0) | |||
| Hypertension (%) | No | 592 (66.7) | 76 (61.3) | 0.279 | 145 (72.5) | 9 (75.0) | 1.000 |
| Yes | 296 (33.3) | 48 (38.7) | 55 (27.5) | 3 (25.0) | |||
| Diabetes (%) | No | 773 (87.0) | 102 (82.3) | 0.187 | 170 (85.0) | 8 (66.7) | 0.202 |
| Yes | 115 (13.0) | 22 (17.7) | 30 (15.0) | 4 (33.3) | |||
| MAP (mean (SD)) | 97.12 (11.95) | 98.85 (15.57) | 0.147 | 94.95 (12.59) | 94.00 (14.20) | 0.801 | |
| PLT (*109/L) (mean (SD)) | 199.27 (66.43) | 212.48 (79.94) | 0.040 | 215.68 (66.31) | 225.83 (84.44) | 0.612 | |
| WBC (*109/L) (median [IQR]) | 12.00 [9.16, 15.30] | 16.07 [11.72, 20.64] | <0.001 | 11.90 [9.07, 14.83] | 12.05 [10.50, 19.28] | 0.216 | |
|
| 10.35 (4.61) | 14.45 (6.12) | <0.001 | 10.29 (4.68) | 13.27 (6.26) | 0.037 | |
|
| 1.20 [0.80, 1.80] | 0.91 [0.66, 1.50] | <0.001 | 1.30 [0.80, 1.80] | 1.00 [0.75, 1.15] | 0.174 | |
| NLR (median [IQR]) | 7.75 [4.32, 13.46] | 13.71 [8.96, 23.31] | <0.001 | 6.87 [4.58, 13.60] | 11.75 [7.25, 19.68] | 0.060 | |
| HCT (L/L) (mean (SD)) | 0.47 (1.52) | 0.83 (4.48) | 0.097 | 0.86 (4.24) | 0.46 (0.06) | 0.741 | |
| RDW (%) (mean (SD)) | 13.00 (1.01) | 13.27 (1.58) | 0.009 | 12.92 (1.08) | 12.76 (0.51) | 0.602 | |
| Lr (%) (median [IQR]) | 10.85 [6.60, 17.30] | 6.55 [3.98, 9.60] | <0.001 | 11.55 [6.47, 16.60] | 7.85 [4.90, 11.43] | 0.067 | |
| PCT (%) (mean (SD)) | 0.21 (0.11) | 0.22 (0.08) | 0.291 | 0.22 (0.06) | 0.22 (0.07) | 0.695 | |
| Cr (µmol/L) (median [IQR]) | 63.40 [53.90, 75.23] | 61.50 [49.85, 85.85] | 0.982 | 64.50 [54.00, 75.00] | 66.50 [57.25, 106.75] | 0.271 | |
| TB (µmol/L) (median [IQR]) | 21.00 [14.88, 32.23] | 19.60 [12.83, 30.40] | 0.165 | 17.15 [12.33, 26.15] | 19.70 [14.20, 24.53] | 0.666 | |
| DB (µmol/L) (median [IQR]) | 7.20 [4.50, 13.40] | 7.35 [4.07, 12.95] | 0.613 | 7.90 [5.47, 13.12] | 10.20 [7.30, 16.95] | 0.226 | |
| DTR (median [IQR]) | 0.36 [0.28, 0.48] | 0.41 [0.30, 0.52] | 0.042 | 0.47 [0.39, 0.60] | 0.58 [0.48, 0.67] | 0.058 | |
| Urea (mmol/L) (median [IQR]) | 4.90 [3.80, 6.20] | 5.70 [4.27, 8.53] | <0.001 | 4.20 [3.38, 5.90] | 6.35 [3.80, 13.40] | 0.036 | |
| LDH (U/L) (median [IQR]) | 217.10 [178.00, 289.65] | 341.00 [244.70, 498.88] | <0.001 | 199.00 [165.00, 250.25] | 376.50 [211.75, 575.50] | 0.001 | |
| Ca2+ (mmol/L) (mean (SD)) | 2.18 (0.19) | 2.03 (0.28) | <0.001 | 2.10 (0.15) | 1.73 (0.46) | <0.001 | |
| TG (mmol/L) (median [IQR]) | 1.42 [0.88, 3.30] | 2.41 [1.27, 9.31] | <0.001 | 1.12 [0.66, 2.90] | 3.42 [0.86, 9.17] | 0.154 | |
| GLU (mmol/L) (median [IQR]) | 7.03 [5.84, 9.16] | 8.18 [6.62, 11.79] | <0.001 | 6.98 [5.48, 9.27] | 13.08 [9.50, 13.88] | 0.002 | |
| TyG (median [IQR]) | 8.98 [8.41, 9.92] | 9.79 [8.95, 11.11] | <0.001 | 8.79 [8.11, 9.75] | 10.06 [9.05, 11.46] | 0.031 | |
| ALT (U/L) (median [IQR]) | 39.10 [18.90, 141.00] | 23.80 [14.20, 53.48] | <0.001 | 40.00 [17.75, 137.25] | 24.00 [13.00, 55.75] | 0.225 | |
| AST (U/L) (median [IQR]) | 30.00 [18.60, 86.25] | 27.10 [18.95, 52.05] | 0.129 | 28.00 [17.00, 68.75] | 30.50 [20.00, 53.75] | 0.919 | |
| GGT (U/L) (median [IQR]) | 86.40 [35.92, 267.00] | 64.70 [27.75, 195.28] | 0.09 | 102.50 [40.00, 266.00] | 106.00 [61.75, 157.75] | 0.959 | |
| ALP (U/L) (median [IQR]) | 90.00 [67.57, 131.85] | 74.80 [55.80, 101.75] | <0.001 | 86.00 [67.00, 133.25] | 75.50 [64.00, 84.50] | 0.095 | |
| ALB (g/L) (mean (SD)) | 37.19 (4.99) | 33.45 (6.47) | <0.001 | 38.17 (5.01) | 33.46 (5.55) | 0.002 | |
| K+ (mmol/L) (mean (SD)) | 4.01 (0.45) | 3.99 (0.63) | 0.256 | 4.12 (0.53) | 4.32 (0.49) | 0.205 | |
| AGR (median [IQR]) | 1.40 [1.20, 1.60] | 1.20 [1.03, 1.40] | <0.001 | 1.50 [1.36, 1.69] | 1.49 [1.20, 1.56] | 0.266 | |
| PT (s) (mean (SD)) | 13.38 (2.19) | 14.82 (3.05) | <0.001 | 13.95 (1.18) | 14.80 (1.48) | 0.018 | |
| INR (mean (SD)) | 1.10 (0.18) | 1.22 (0.29) | <0.001 | 1.08 (0.11) | 1.17 (0.12) | 0.011 | |
| APTT (s) (mean (SD)) | 32.58 (6.82) | 38.34 (18.43) | <0.001 | 37.74 (6.00) | 40.48 (11.85) | 0.154 | |
| CRP (median [IQR]) | 26.05 [3.49, 111.51] | 149.02 [15.67, 265.12] | <0.001 | 76.95 [25.85, 143.78] | 244.75 [103.62, 303.18] | 0.003 | |
| CAR (median [IQR]) | 0.63 [0.09, 3.19] | 4.09 [0.34, 8.99] | <0.001 | 2.04 [0.60, 4.06] | 8.58 [3.07, 9.75] | 0.001 | |
| RCR (median [IQR]) | 5.94 [5.53, 6.36] | 6.57 [5.89, 7.24] | <0.001 | 6.06 [5.76, 6.54] | 6.78 [6.01, 9.58] | 0.019 | |
| SIRS (%) | No | 640 (72.1) | 30 (24.2) | <0.001 | 163 (81.5) | 2 (16.7) | <0.001 |
| Yes | 248 (27.9) | 94 (75.8) | 37 (18.5) | 10 (83.3) | |||
| PE (%) | No | 609 (68.6) | 15 (12.1) | <0.001 | 138 (69.0) | 2 (16.7) | 0.001 |
| Yes | 279 (31.4) | 109 (87.9) | 62 (31.0) | 10 (83.3) | |||
| MCTSI (median [IQR]) | 2.00 [2.00, 4.00] | 4.00 [4.00, 4.00] | <0.001 | 2.00 [2.00, 4.00] | 5.00 [4.00, 6.00] | <0.001 | |
| RANSON (median [IQR]) | 1.00 [0.00, 2.00] | 2.00 [1.00, 2.00] | <0.001 | 1.00 [0.00, 1.00] | 2.00 [1.00, 2.25] | 0.001 | |
| BISAP (median [IQR]) | 1.00 [0.00, 2.00] | 2.00 [2.00, 3.00] | <0.001 | 1.00 [0.00, 1.00] | 2.00 [1.25, 3.00] | 0.002 | |
| SABP (median [IQR]) | 3.08 [-2.94, 10.83] | 9.10 [1.43, 22.56] | <0.001 | 2.02 [-2.82, 7.33] | 16.89 [-2.09, 28.61] | 0.073 | |
MAP, mean artery pressure; N, neutrophil; L, lymphocyte; NLR, neutrophil/lymphocyte; Lr, percentage of lymphocytes; Cr, creatinine; TB, total bilirubin; DB, direct bilirubin; DTR, direct bilirubin/total bilirubin; TG, total triglycerides; GLU, glucose; TyG, TG/GLU; AGR, albumin/globulin; CAR, CRP/albumin; RCR, RDW/Ca2+; ALB, albumin; SABP, acute biliary pancreatitis (0.55 + SIRS * 1.02 − 0.63 * ALB + 1.76 * BUN/0.356 + 1.66 * PE); PE, pleural effusion.
Figure 2Nomogram of the LASSO model for the early prediction of severe acute pancreatitis.
Comparison of LR and AutoML models for early prediction of SAP in the test cohort.
| AUC | Sensitivity | Specificity | Accuracy | PPV | NPV | LR+ | LR− | |
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
|
| 0.945 | 0.583 | 0.975 | 0.953 | 0.583 | 0.975 | 23.333 | 0.427 |
|
| 0.898 | 0.583 | 0.980 | 0.958 | 0.636 | 0.975 | 29.167 | 0.425 |
|
| 0.871 | 0.417 | 0.950 | 0.920 | 0.333 | 0.964 | 8.333 | 0.614 |
|
| 0.868 | 0.500 | 0.925 | 0.901 | 0.286 | 0.969 | 6.667 | 0.541 |
|
| 0.860 | 0.500 | 0.920 | 0.896 | 0.273 | 0.968 | 6.250 | 0.543 |
|
| ||||||||
|
| 0.898 | 0.500 | 0.965 | 0.943 | 0.417 | 0.975 | 3.821 | 0.109 |
|
| ||||||||
|
| 0.764 | 0.667 | 0.800 | 0.896 | 0.188 | 0.954 | 3.335 | 0.416 |
|
| 0.869 | 1.000 | 0.588 | 0.611 | 0.128 | 1.000 | 2.427 | 0 |
|
| 0.787 | 0.700 | 0.796 | 0.854 | 0.5 | 0.884 | 3.431 | 0.377 |
|
| 0.673 | 0.600 | 0.871 | 0.752 | 0.333 | 0.825 | 4.651 | 0.459 |
LR, logistic regression; AutoML, automated machine learning; SAP, severe acute pancreatitis; PPV, positive predictive value; NPV, negative predictive value; LR+, positive likelihood ration; LR−, negative likelihood ratio.
Figure 3Variable importance of the GBM model in the training set, showing that albumin was the most important feature, followed by PE, SIRS, TGs, LDH, etc. ALB, albumin; PE, pleural effusion; SIRS, systemic inflammatory response syndrome; TG, triglyceride; LDH, lactic dehydrogenase; RCR, ratio of RDW to Ca2+; Ca, Ca2+; n, neutrophil count; TyG, ratio of triglyceride to glucose; PT, prothrombin time.
Figure 4SHAP of the GBM model in the training set. The closer the values of the variables were to 1, the more likely patients were to progress to severity acute pancreatitis. SHAP, SHapley additive explanation; PE, pleural effusion; ALB, albumin; SIRS, systemic inflammatory response syndrome; LDH, lactic dehydrogenase; TG, triglyceride; PT, prothrombin time; n, neutrophil count; AGR, ratio of albumin to globulin; ALT, glutamic-pyruvic transaminase; Ca, Ca2+.
Figure 5LIME of the GBM model in the test set. LIME, Local Interpretable Model Agnostic Explanation.