| Literature DB >> 35954279 |
Anna Lemańska-Perek1, Dorota Krzyżanowska-Gołąb1, Katarzyna Kobylińska2, Przemysław Biecek2,3, Tomasz Skalec4, Maciej Tyszko4, Waldemar Gozdzik4, Barbara Adamik4.
Abstract
Fibronectin (FN) plays an essential role in the host's response to infection. In previous studies, a significant decrease in the FN level was observed in sepsis; however, it has not been clearly elucidated how this parameter affects the patient's survival. To better understand the relationship between FN and survival, we utilized innovative approaches from the field of explainable machine learning, including local explanations (Break Down, Shapley Additive Values, Ceteris Paribus), to understand the contribution of FN to predicting individual patient survival. The methodology provides new opportunities to personalize informative predictions for patients. The results showed that the most important indicators for predicting survival in sepsis were INR, FN, age, and the APACHE II score. ROC curve analysis showed that the model's successful classification rate was 0.92, its sensitivity was 0.92, its positive predictive value was 0.76, and its accuracy was 0.79. To illustrate these possibilities, we have developed and shared a web-based risk calculator for exploring individual patient risk. The web application can be continuously updated with new data in order to further improve the model.Entities:
Keywords: artificial intelligence models; biomarkers; fibronectin; sepsis; survival prediction
Mesh:
Substances:
Year: 2022 PMID: 35954279 PMCID: PMC9368279 DOI: 10.3390/cells11152433
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 7.666
Figure 1A workflow diagram. (A) This part presents how the best model was chosen. (B) The workflow of building and explaining the final model. Results are available at: https://stats4med.shinyapps.io/xai2shiny/ (accessed on 8 June 2022).
Patient characteristics at baseline.
| Parameter | All | Nonsurvivors | Survivors |
|
|---|---|---|---|---|
| Age (years) | 68.0 | 71.0 | 64.0 | 0.001 |
| (60.0–77.0) | (65.0–79.0) | (56.0–74.0) | ||
| Female/male ( | 58/64 | 27/27 | 31/37 | 0.627 |
| APACHE II score | 24.0 | 28.0 | 20.0 | <0.001 |
| (points) | (17.0–29.0) | (22.0–33.0) | (15.0–26.0) | |
| SOFA score | 10.0 | 11.5 | 9.0 | <0.001 |
| (points) | (8.0–13.0) | (10.0–15.0) | (7.0–11.0) | |
| Procalcitonin | 8.38 | 14.57 | 4.47 | <0.001 |
| (ng/mL) | (1.76–30.4) | (3.90–34.20) | (0.80–15.47) | |
| C-reactive protein | 192.3 | 197.3 | 186.7 | 0.726 |
| (mg/L) | (112.6–302.5) | (123.0–307.9) | (100.4–302.5) | |
| INR | 1.34 | 1.49 | 1.20 | <0.001 |
| (1.16–1.60) | (1.32–1.80) | (1.12–1.43) | ||
| Platelet count | 182.5 | 138.5 | 209.5 | 0.001 |
| (103/μL) | (124.0–310.0) | (74.0–243.0) | (155.0–335.0) | |
| D-dimers | 5.68 | 5.70 | 5.54 | 0.294 |
| (μg/mL) | (3.64–12.59) | (3.97–15.59) | (3.37–11.47) | |
| ICU stay | 9.5 | 5.5 | 12.5 | <0.001 |
| (days) | (4.0–18.0) | (3.0–12.0) | (6.5–29.5) | |
| Leukocytes | 15.5 | 15.9 | 15.0 | 0.660 |
| (103/μL) | (11.0–22.5) | (9.7–22.5) | (11.2–22.9) |
APACHE II, Acute Physiology and Chronic Health Evaluation II; SOFA, Sequential Organ Failure Assessment; INR, international normalized ratio. Values are presented as median and quartiles; the p-value represents the difference between Nonsurvivors and Survivors.
Figure 2A comparison of the area-under-the-curve values of the logistic regression, random forest, and gradient boosting models. The mean test AUC was the highest for the random forest model, whereas the lowest mean test AUC was for the boosting model. The blue dot represents the mean and the bold midline represents the median of the AUC results, whereas the upper and lower limits of the boxes correspond to the third and first quartiles. Black dots represent outliers in the data. Each boxplot presents the results of a 5-fold cross-validation procedure repeated five times for a specific model.
Figure 3The ROC curve of the machine learning model for predicting the survival of sepsis. The following input variables were included in the model: pFN, the INR value, the APACHE II score, age, the SOFA score, the platelet count, the procalcitonin level, and the d-dimers level.
Figure 4The Feature Importance plot showing the most significant variables for the model. The length of the bar indicates the loss in the AUC when a given variable was altered. The bigger the loss, the more important the variable is.
Figure 5The survival prediction model for a selected patient. (A) The impact of the values of particular features on the survival prediction based on the Break Down method: an example based on data collected on admission to the ICU. (B) The impact of the values of particular features on the survival prediction based on SHAP values: an example based on data collected on admission to the ICU (C). The calculation of survival when one feature changed: an example based on data collected on admission to the ICU.