| Literature DB >> 35552440 |
Péter Hegyi1,2,3, Andrea Szentesi4,5, Szabolcs Kiss6,1,7, József Pintér8, Roland Molontay8,9, Marcell Nagy8, Nelli Farkas1,10, Zoltán Sipos1, Péter Fehérvári1,11, László Pecze1, Mária Földi6,1,7, Áron Vincze12, Tamás Takács13, László Czakó13, Ferenc Izbéki14, Adrienn Halász6,14, Eszter Boros14, József Hamvas15, Márta Varga16, Artautas Mickevicius17, Nándor Faluhelyi18, Orsolya Farkas18, Szilárd Váncsa1,2, Rita Nagy1,7,2, Stefania Bunduc2,19, Péter Jenő Hegyi2,3, Katalin Márta2,3, Katalin Borka2,20, Attila Doros2,21, Nóra Hosszúfalusi2,22, László Zubek2,23, Bálint Erőss2,3, Zsolt Molnár2,23,24, Andrea Párniczky1,7.
Abstract
Pancreatic necrosis is a consistent prognostic factor in acute pancreatitis (AP). However, the clinical scores currently in use are either too complicated or require data that are unavailable on admission or lack sufficient predictive value. We therefore aimed to develop a tool to aid in necrosis prediction. The XGBoost machine learning algorithm processed data from 2387 patients with AP. The confidence of the model was estimated by a bootstrapping method and interpreted via the 10th and the 90th percentiles of the prediction scores. Shapley Additive exPlanations (SHAP) values were calculated to quantify the contribution of each variable provided. Finally, the model was implemented as an online application using the Streamlit Python-based framework. The XGBoost classifier provided an AUC value of 0.757. Glucose, C-reactive protein, alkaline phosphatase, gender and total white blood cell count have the most impact on prediction based on the SHAP values. The relationship between the size of the training dataset and model performance shows that prediction performance can be improved. This study combines necrosis prediction and artificial intelligence. The predictive potential of this model is comparable to the current clinical scoring systems and has several advantages over them.Entities:
Mesh:
Year: 2022 PMID: 35552440 PMCID: PMC9098474 DOI: 10.1038/s41598-022-11517-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Flowchart representing the process of developing the model.
Characteristics of the analyzed study population.
| Variable | Value (n = 2387) |
|---|---|
| Age in years, median (IQR) | 57 (44–69) |
| Male, n (%) | 1357 (56.85%) |
| BMI, median (IQR) | 27.14 (23.88–31.25) |
| Biliary | 955 (40.01%) |
| Alcoholic | 484 (20.28%) |
| Hypertriglyceridaemia | 81 (3.39%) |
| Biliary and alcoholic | 39 (1.63%) |
| Biliary and hypertriglyceridaemia | 13 (0.54%) |
| Alcoholic and hypertriglyceridaemia | 58 (2.43%) |
| Post-ERCP | 67 (2.81%) |
| Idiopathic | 432 (18.10%) |
| Other | 258 (10.81%) |
| Mild, n (%) | 1714 (71.81%) |
| Moderate, n (%) | 551 (23.08%) |
| Severe, n (%) | 122 (5.11%) |
| Mortality, n (%) | 66 (2.76%) |
| Length of stay in days, median (IQR) | 8 (6–12) |
| Patients with local complication, n (%) | 623 (26.19%) |
| APFC, n (%) | 510 (21.37%) |
| Pseudocyst, n (%) | 179 (7.50%) |
| Acute necrotic collection, n (%) | 233 (9.76%) |
| Patients with systemic complication, n (%) | 202 (8.46%) |
| Respiratory failure, n (%) | 136 (5.70%) |
| Heart failure, n (%) | 52 (2.18%) |
| Renal failure, n (%) | 83 (3.48%) |
| New-onset diabetes, n (%) | 75 (3.14%) |
APFC acute peripancreatic fluid collection, BMI body mass index, ERCP endoscopic retrograde cholangiopancreatography, IQR interquartile range.
Figure 2Association between necrosis development and other complications in acute pancreatitis.
Figure 3Receiver operating characteristic (ROC) curve for the XGBoost model.
Figure 4The relationship between the size of the data set and the model performance. The blue dot represents the area under the ROC curve value and the vertical lines show the corresponding confidence intervals.
Figure 5The predicted necrosis probabilities with the corresponding 50% (between the 25th and 50th percentiles) and 80% confidence (between the 10th and 90th percentiles).
Figure 6(A) The features with the highest impact on model output based on the SHAP values. The higher the predictor is on the list, the bigger the impact on model output. Each patient is represented by a dot. The x-axis represents the extent of the impact on prediction. The color of the dot shows the feature value (e.g. the red color implies higher values). (B) An example of prediction and its textual interpretation. The lower picture highlights the effect of individual predictors and the final necrosis probability provided by the model.
Figure 7The models build on the k predictors with the highest SHAP value.