Zhanxiao Liu, Ya Yang, Huanhuan Song, Ji Luo.
Abstract
Background: Accurate and prompt clinical assessment of the severity and prognosis of patients with acute pancreatitis (AP) is critical, particularly during hospitalization. The growing volume of free-text notes in electronic health records, such as nursing notes, gives natural language processing algorithms an opportunity to mine this unstructured data to detect and predict adverse outcomes. However, the predictive value of nursing notes for AP prognosis is unclear. In this study, a predictive model for in-hospital mortality in AP was developed using sentiment scores measured in nursing notes.
Keywords: Acute pancreatitis; Medical Information Mart for Intensive Care III (MIMIC-III); in-hospital mortality; predictive model; sentiment
Year: 2022 PMID: 35845515 PMCID: PMC9279801 DOI: 10.21037/atm-22-1613
Source DB: PubMed Journal: Ann Transl Med ISSN: 2305-5839
Baseline information of patients with acute pancreatitis, n (%) / M (Q1, Q3) / mean ± SD
| Variables | Total (n=631) | Survivors (n=543) | Non-survivors (n=88) | χ2/Z/t | P |
|---|---|---|---|---|---|
| Gender | | | | 1.596 | 0.206 |
| Female | 283 (44.85) | 249 (45.86) | 34 (38.64) | | |
| Male | 348 (55.15) | 294 (54.14) | 54 (61.36) | | |
| Age, years | 60.38 (47.08, 72.54) | 59.40 (46.26, 71.61) | 66.45 (52.19, 81.58) | 3.339 | <0.001 |
| Marital status | | | | 1.166 | 0.761 |
| Married | 312 (49.45) | 265 (48.80) | 47 (53.41) | | |
| Separated/divorced | 54 (8.56) | 47 (8.66) | 7 (7.95) | | |
| Single | 186 (29.48) | 164 (30.20) | 22 (25.00) | | |
| Widowed | 79 (12.52) | 67 (12.34) | 12 (13.64) | | |
| Ethnicity | | | | – | 0.965 |
| White | 516 (81.77) | 441 (81.22) | 75 (85.23) | | |
| Asian | 15 (2.38) | 14 (2.58) | 1 (1.14) | | |
| Black | 62 (9.83) | 54 (9.94) | 8 (9.09) | | |
| Hispanic | 21 (3.33) | 19 (3.50) | 2 (2.27) | | |
| Others | 17 (2.69) | 15 (2.76) | 2 (2.27) | | |
| First care unit | | | | 6.293 | 0.178 |
| CCU | 43 (6.81) | 34 (6.26) | 9 (10.23) | | |
| CSRU | 32 (5.07) | 30 (5.52) | 2 (2.27) | | |
| MICU | 367 (58.16) | 310 (57.09) | 57 (64.77) | | |
| SICU | 130 (20.60) | 117 (21.55) | 13 (14.77) | | |
| TSICU | 59 (9.35) | 52 (9.58) | 7 (7.95) | | |
| Last care unit | | | | 3.567 | 0.468 |
| CCU | 31 (4.91) | 24 (4.42) | 7 (7.95) | | |
| CSRU | 32 (5.07) | 29 (5.34) | 3 (3.41) | | |
| MICU | 358 (56.74) | 305 (56.17) | 53 (60.23) | | |
| SICU | 150 (23.77) | 131 (24.13) | 19 (21.59) | | |
| TSICU | 60 (9.51) | 54 (9.94) | 6 (6.82) | | |
| ICU LOS, days | 4.27 (2.02, 11.92) | 3.87 (1.89, 10.72) | 9.19 (4.65, 17.85) | 5.239 | <0.001 |
| RR, insp/min | 20.80±6.53 | 20.53±6.56 | 22.49±6.06 | −2.63 | 0.009 |
| Temperature, ℃ | 36.94±1.09 | 37.00±1.06 | 36.62±1.17 | 3.04 | 0.002 |
| Heart rate, bpm | 98.37±21.89 | 98.31±22.03 | 98.77±21.14 | −0.19 | 0.853 |
| SBP, mmHg | 128.96±27.46 | 129.88±27.71 | 123.27±25.33 | 2.10 | 0.036 |
| DBP, mmHg | 67.21±17.41 | 67.92±17.43 | 62.82±16.74 | 2.56 | 0.011 |
| MAP, mmHg | 85.24±17.90 | 85.93±17.99 | 81.01±16.81 | 2.40 | 0.017 |
| SpO2, % | 96.41±4.18 | 96.55±3.75 | 95.53±6.15 | 1.51 | 0.134 |
| White blood cells, ×10⁹/L | 12.70 (8.80, 17.60) | 12.60 (8.80, 17.60) | 13.35 (8.90, 17.60) | 0.428 | 0.668 |
| Red blood cells, ×10¹²/L | 3.93±0.86 | 3.97±0.85 | 3.69±0.84 | 2.95 | 0.003 |
| Sodium, mEq/L | 138.05±5.78 | 138.15±5.72 | 137.42±6.14 | 1.10 | 0.273 |
| Potassium, mEq/L | 4.19±0.89 | 4.18±0.88 | 4.28±0.96 | −1.02 | 0.307 |
| Phosphate, mg/dL | 3.30 (2.50, 4.30) | 3.20 (2.40, 4.20) | 3.80 (2.85, 5.40) | 3.421 | <0.001 |
| Calcium, mg/dL | 8.31±1.37 | 8.35±1.39 | 8.11±1.23 | 1.49 | 0.136 |
| Platelets, ×10⁹/L | 224.00 (159.00, 302.00) | 226.00 (165.00, 308.00) | 207.50 (118.00, 286.50) | −2.391 | 0.017 |
| pH | 7.35±0.11 | 7.36±0.11 | 7.34±0.15 | 1.13 | 0.263 |
| Lactate, mmol/L | 1.80 (1.30, 2.90) | 1.80 (1.30, 2.80) | 1.90 (1.30, 3.65) | 1.449 | 0.147 |
| INR | 1.20 (1.10, 1.50) | 1.20 (1.10, 1.50) | 1.30 (1.10, 1.80) | 2.694 | 0.007 |
| MCV, fL | 91.35±8.05 | 91.02±7.62 | 93.41±10.12 | −2.12 | 0.036 |
| Magnesium, mg/dL | 1.90±0.44 | 1.89±0.44 | 1.95±0.44 | −1.18 | 0.238 |
| Glucose, mg/dL | 129.00 (103.00, 173.00) | 129.00 (103.00, 168.00) | 129.50 (107.00, 187.50) | 1.157 | 0.247 |
| Creatinine, mg/dL | 1.10 (0.80, 1.90) | 1.10 (0.80, 1.80) | 1.30 (0.80, 2.30) | 1.998 | 0.046 |
| BUN, mg/dL | 22.00 (14.00, 40.00) | 21.00 (13.00, 36.00) | 31.00 (17.50, 51.50) | 3.949 | <0.001 |
| Bicarbonate, mEq/L | 22.31±5.86 | 22.34±5.83 | 22.09±6.06 | 0.37 | 0.711 |
| Neutrophil, % | 77.55±15.26 | 77.63±14.65 | 77.11±18.66 | 0.25 | 0.803 |
| Lymphocytes, % | 8.70 (5.00, 14.90) | 9.00 (5.20, 15.00) | 7.00 (4.00, 11.00) | −2.900 | 0.004 |
| Albumin, g/dL | 3.15±0.71 | 3.20±0.71 | 2.85±0.67 | 4.45 | <0.001 |
| TBIL, mg/dL | 0.90 (0.50, 2.20) | 0.80 (0.50, 2.00) | 1.25 (0.60, 4.70) | 3.251 | 0.001 |
| Hematocrit, % | 35.68±7.08 | 35.94±7.02 | 34.12±7.24 | 2.25 | 0.025 |
| PO2, mmHg | 97.00 (71.00, 158.00) | 99.80 (71.00, 159.80) | 92.50 (73.50, 138.80) | −1.057 | 0.291 |
| Hemoglobin, g/dL | 12.02±2.48 | 12.13±2.46 | 11.32±2.49 | 2.89 | 0.004 |
| MCHC, % | 33.71±1.63 | 33.80±1.60 | 33.13±1.67 | 3.60 | <0.001 |
| ALP, IU/L | 105.00 (70.00, 175.00) | 104.00 (69.00, 171.00) | 119.50 (75.50, 225.00) | 2.133 | 0.033 |
| PCO2, mmHg | 38.80 (33.00, 46.00) | 39.00 (33.20, 46.00) | 37.00 (31.50, 48.00) | 0.918 | 0.359 |
| RDW, % | 14.92±2.02 | 14.76±1.87 | 15.89±2.57 | −3.96 | <0.001 |
| ALT, IU/L | 42.00 (23.00, 129.00) | 42.00 (21.00, 129.00) | 45.00 (25.50, 145.00) | 1.275 | 0.202 |
| AST, IU/L | 57.00 (28.00, 138.00) | 56.00 (28.00, 138.00) | 73.50 (32.00, 162.00) | 1.548 | 0.122 |
| Amylase, IU/L | 180.00 (74.00, 583.00) | 185.00 (77.00, 590.00) | 161.50 (69.00, 532.00) | −0.859 | 0.391 |
| Lipase, IU/L | 188.00 (53.00, 945.00) | 193.00 (58.00, 1,027.00) | 169.00 (37.50, 740.50) | −1.638 | 0.101 |
| COPD | 61 (9.67) | 56 (10.31) | 5 (5.68) | 1.860 | 0.173 |
| Lung cancer | 7 (1.11) | 3 (0.55) | 4 (4.55) | – | 0.009 |
| Atrial fibrillation | 143 (22.66) | 114 (20.99) | 29 (32.95) | 6.180 | 0.013 |
| Liver cirrhosis | 46 (7.29) | 34 (6.26) | 12 (13.64) | 6.094 | 0.014 |
| Congestive heart failure | 188 (29.79) | 155 (28.55) | 33 (37.50) | 2.903 | 0.088 |
| Heart disease | 28 (4.44) | 26 (4.79) | 2 (2.27) | – | 0.407 |
| Diabetes mellitus | 167 (26.47) | 137 (25.23) | 30 (34.09) | 3.055 | 0.080 |
| Respiratory failure | 246 (38.99) | 193 (35.54) | 53 (60.23) | 19.398 | <0.001 |
| Hyperlipidemia | 159 (25.20) | 153 (28.18) | 6 (6.82) | 18.328 | <0.001 |
| Renal failure | 295 (46.75) | 233 (42.91) | 62 (70.45) | 23.080 | <0.001 |
| Malignant cancer | 92 (14.58) | 76 (14.00) | 16 (18.18) | 1.065 | 0.302 |
| SAPS-II | 36.00 (27.00, 45.00) | 34.00 (25.00, 44.00) | 47.00 (37.50, 62.50) | 7.554 | <0.001 |
| SOFA score | 5.00 (3.00, 8.00) | 5.00 (3.00, 8.00) | 8.00 (5.00, 10.50) | 5.621 | <0.001 |
| LOS, day | 13.72 (7.27, 24.01) | 12.92 (7.15, 23.37) | 15.56 (7.95, 30.78) | 1.577 | 0.115 |
| Subjective mean | 5.16±1.28 | 5.13±1.26 | 5.34±1.39 | −1.44 | 0.150 |
| Subjective minimum | 2.40 (1.00, 3.13) | 2.50 (1.00, 3.18) | 2.01 (0.00, 2.83) | −2.392 | 0.017 |
| Polarity mean | 0.58 (0.36, 0.89) | 0.62 (0.37, 0.95) | 0.47 (0.18, 0.64) | −4.320 | <0.001 |
| Polarity minimum | −0.42 (−1.06, 0.03) | −0.38 (−1.00, 0.05) | −0.75 (−1.41, −0.24) | −3.797 | <0.001 |
“–” represents Fisher’s exact test. ICU, intensive care unit; CCU, coronary care unit; CSRU, cardiac surgery recovery unit; MICU/SICU/TSICU, medical/surgical/trauma or surgical intensive care unit; LOS, length of stay; RR, respiratory rate; SBP, systolic blood pressure; DBP, diastolic blood pressure; MAP, mean arterial pressure; SpO2, oxygen saturation; INR, international normalized ratio; MCV, mean corpuscular volume; BUN, blood urea nitrogen; TBIL, total bilirubin; PO2, oxygen partial pressure; MCHC, mean corpuscular hemoglobin concentration; ALP, alkaline phosphatase; PCO2, partial pressure of carbon dioxide; RDW, red cell distribution width; ALT, alanine aminotransferase; AST, aspartate aminotransferase; COPD, chronic obstructive pulmonary disease; SAPS-II, Simplified Acute Physiology Score-II; SOFA, sequential organ failure assessment.
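The χ2 statistics for the categorical rows of the baseline table can be checked from the reported counts. The following is a minimal sketch, assuming a Pearson chi-square test without continuity correction (the table does not say which variant was used); the function name `chi_square_2x2` is illustrative and not taken from the study.

```python
import math

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    observed = ((a, b), (c, d))
    stat = 0.0
    for i in range(2):
        for j in range(2):
            # Expected count under independence of row and column variables
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed[i][j] - expected) ** 2 / expected
    # With 1 degree of freedom, the upper-tail probability is erfc(sqrt(stat / 2))
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value

# Gender rows of the baseline table: (survivors, non-survivors) for female, then male
stat, p = chi_square_2x2(249, 34, 294, 54)
print(f"chi2 = {stat:.3f}, P = {p:.3f}")
```

Under that assumption the result agrees, to rounding, with the 1.596 and 0.206 reported for gender. Rows marked “–” used Fisher’s exact test and are not covered by this sketch.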
Association between sentiment scores and in-hospital mortality in acute pancreatitis
| Variables | Model 1: OR (95% CI) | Model 1: P | Model 2: OR (95% CI) | Model 2: P | Model 3: OR (95% CI) | Model 3: P |
|---|---|---|---|---|---|---|
| Subjective mean | 1.129 (0.953–1.326) | 0.150 | 0.998 (0.815–1.206) | 0.982 | 0.974 (0.785–1.195) | 0.807 |
| Subjective minimum | 0.819 (0.694–0.966) | 0.018 | 0.932 (0.772–1.129) | 0.471 | 0.967 (0.785–1.195) | 0.757 |
| Polarity mean | 0.353 (0.210–0.577) | <0.001 | 0.391 (0.226–0.656) | <0.001 | 0.448 (0.233–0.833) | 0.014 |
| Polarity minimum | 0.756 (0.651–0.888) | <0.001 | 0.867 (0.715–1.055) | 0.148 | 0.896 (0.723–1.117) | 0.319 |
Model 1, the original model; Model 2, the model after adjusting for the variables age, gender, ethnicity, and ICU length of stay; Model 3, the model after adjusting for the variables in model 2 and the remaining variables (respiratory rate, total bilirubin, hemoglobin, lung cancer, respiratory failure, hyperlipidemia, and renal failure) screened by stepwise regression. OR, odds ratio; CI, confidence interval.
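An odds ratio from a logistic model is the exponential of the fitted coefficient, so each OR and its interval can be mapped back to the log-odds scale. The sketch below assumes a Wald-type interval, exp(β ± 1.96·SE); the source does not state how the intervals were computed, so the recovered standard error is approximate. The helper names are hypothetical.

```python
import math

def log_odds_from_or(or_value, ci_low, ci_high, z=1.96):
    """Return (beta, se) implied by an odds ratio and an assumed Wald-type 95% CI."""
    beta = math.log(or_value)
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z)
    return beta, se

def odds_ratio_with_ci(beta, se, z=1.96):
    """Return (OR, lower, upper) from a coefficient and its standard error."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)

# Polarity mean, Model 3, as reported in the table: OR 0.448 (0.233-0.833)
beta, se = log_odds_from_or(0.448, 0.233, 0.833)
or_value, low, high = odds_ratio_with_ci(beta, se)

# Odds multiplier for a 0.1-point increase in polarity mean (illustrative step size)
or_per_tenth = math.exp(0.1 * beta)

print(f"beta = {beta:.3f}, approximate SE = {se:.3f}")
print(f"round trip: OR = {or_value:.3f} ({low:.3f}-{high:.3f})")
print(f"OR per 0.1-point increase = {or_per_tenth:.3f}")
```

The round-trip interval is close to, but not identical with, the reported one because the reported bounds are not exactly symmetric around ln(OR) after rounding. An OR below 1 for polarity mean means that higher (more positive) polarity is associated with lower odds of in-hospital death.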
Effect of sentiment scores on in-hospital mortality in acute pancreatitis before data imputation
| Variables | Model 1: OR (95% CI) | Model 1: P | Model 2: OR (95% CI) | Model 2: P | Model 3: OR (95% CI) | Model 3: P |
|---|---|---|---|---|---|---|
| Subjective mean | 1.129 (0.953–1.326) | 0.150 | 0.997 (0.788–1.234) | 0.979 | 0.971 (0.757–1.223) | 0.810 |
| Subjective minimum | 0.819 (0.694–0.966) | 0.018 | 0.906 (0.727–1.132) | 0.383 | 0.956 (0.752–1.217) | 0.712 |
| Polarity mean | 0.353 (0.210–0.577) | <0.001 | 0.409 (0.234–0.713) | 0.002 | 0.481 (0.231–0.971) | 0.046 |
| Polarity minimum | 0.756 (0.651–0.888) | <0.001 | 0.871 (0.714–1.063) | 0.173 | 0.936 (0.719–1.246) | 0.637 |
Model 1, the original model; Model 2, the model after adjusting for the variables age, gender, ethnicity, and ICU length of stay; Model 3, the model after adjusting for the variables in model 2 and the remaining variables (respiratory rate, total bilirubin, hemoglobin, lung cancer, respiratory failure, hyperlipidemia, and renal failure) screened by stepwise regression. OR, odds ratio; CI, confidence interval.
Comparison of the baseline characteristics between the training and testing groups, n (%) / M (Q1, Q3) / mean ± SD
| Variables | Training (n=410) | Testing (n=221) | χ2/Z/t | P |
|---|---|---|---|---|
| Gender | | | 0.035 | 0.851 |
| Female | 185 (45.12) | 98 (44.34) | | |
| Male | 225 (54.88) | 123 (55.66) | | |
| Marital status | | | 2.992 | 0.393 |
| Married | 193 (47.07) | 119 (53.85) | | |
| Separated/divorced | 36 (8.78) | 18 (8.14) | | |
| Single | 129 (31.46) | 57 (25.79) | | |
| Widowed | 52 (12.68) | 27 (12.22) | | |
| Ethnicity | | | 1.501 | 0.827 |
| White | 334 (81.46) | 182 (82.35) | | |
| Asian | 8 (1.95) | 7 (3.17) | | |
| Black | 42 (10.24) | 20 (9.05) | | |
| Hispanic | 15 (3.66) | 6 (2.71) | | |
| Others | 11 (2.68) | 6 (2.71) | | |
| ICU LOS, days | 4.49 (2.06, 12.70) | 4.04 (1.89, 11.55) | −0.755 | 0.450 |
| Age, years | 59.81 (46.93, 71.61) | 61.20 (47.41, 74.30) | 1.135 | 0.256 |
| RR, insp/min | 20.94±6.77 | 20.54±6.06 | −0.72 | 0.471 |
| TBIL, mg/dL | 0.80 (0.50, 2.10) | 0.90 (0.50, 2.24) | 0.873 | 0.383 |
| Hemoglobin, g/dL | 12.11±2.49 | 11.85±2.46 | −1.25 | 0.212 |
| Lung cancer | | | – | 0.431 |
| No | 404 (98.54) | 220 (99.55) | | |
| Yes | 6 (1.46) | 1 (0.45) | | |
| Respiratory failure | | | 3.645 | 0.056 |
| No | 239 (58.29) | 146 (66.06) | | |
| Yes | 171 (41.71) | 75 (33.94) | | |
| Hyperlipidemia | | | 1.652 | 0.199 |
| No | 300 (73.17) | 172 (77.83) | | |
| Yes | 110 (26.83) | 49 (22.17) | | |
| Renal failure | | | 0.792 | 0.374 |
| No | 213 (51.95) | 123 (55.66) | | |
| Yes | 197 (48.05) | 98 (44.34) | | |
| SAPS-II | 36.00 (27.00, 46.00) | 36.00 (26.00, 44.00) | −0.271 | 0.786 |
| SOFA score | 5.00 (3.00, 9.00) | 6.00 (3.00, 8.00) | 0.073 | 0.942 |
| Subjective mean | 5.19±1.31 | 5.10±1.23 | −0.85 | 0.395 |
| Subjective minimum | 2.34 (0.67, 3.09) | 2.50 (1.20, 3.21) | 1.631 | 0.103 |
| Polarity mean | 0.57 (0.36, 0.86) | 0.62 (0.35, 0.98) | 1.253 | 0.210 |
| Polarity minimum | −0.43 (−1.06, 0.00) | −0.41 (−1.06, 0.05) | 0.648 | 0.517 |
| In-hospital mortality | | | 0.081 | 0.776 |
| Survivors | 354 (86.34) | 189 (85.52) | | |
| Non-survivors | 56 (13.66) | 32 (14.48) | | |
“–” represents Fisher’s exact test. ICU, intensive care unit; LOS, length of stay; RR, respiratory rate; TBIL, total bilirubin; SAPS-II, Simplified Acute Physiology Score-II; SOFA, sequential organ failure assessment.
Predictive performance of the models for in-hospital mortality
| Variables | Selected model | Selected model without sentiments | SOFA score | SAPS-II |
|---|---|---|---|---|
| Cut-off value | 0.153 | 0.150 | 0.126 | 0.160 |
| Training group | | | | |
| Sensitivity (95% CI) | 0.786 (0.678–0.893) | 0.696 (0.576–0.817) | 0.696 (0.576–0.817) | 0.607 (0.479–0.735)* |
| Specificity (95% CI) | 0.760 (0.715–0.804) | 0.732 (0.685–0.778) | 0.540 (0.488–0.591)* | 0.749 (0.703–0.794) |
| PPV (95% CI) | 0.341 (0.259–0.423) | 0.291 (0.214–0.368) | 0.193 (0.139–0.248)* | 0.276 (0.197–0.355) |
| NPV (95% CI) | 0.957 (0.934–0.981) | 0.938 (0.910–0.967) | 0.918 (0.881–0.956) | 0.923 (0.893–0.954) |
| AUC (95% CI) | 0.840 (0.838–0.842) | 0.759 (0.692–0.826)* | 0.661 (0.659–0.663)* | 0.725 (0.723–0.728)* |
| Accuracy (95% CI) | 0.763 (0.722–0.805) | 0.727 (0.684–0.770) | 0.561 (0.513–0.609)* | 0.729 (0.686–0.772) |
| Testing group | | | | |
| Sensitivity (95% CI) | 0.656 (0.492–0.821) | 0.719 (0.563–0.875) | 0.812 (0.677–0.948) | 0.594 (0.424–0.764) |
| Specificity (95% CI) | 0.815 (0.759–0.870) | 0.720 (0.656–0.784) | 0.545 (0.474–0.616)* | 0.831 (0.777–0.884) |
| PPV (95% CI) | 0.375 (0.248–0.502) | 0.303 (0.199–0.406) | 0.232 (0.154–0.310) | 0.373 (0.240–0.505) |
| NPV (95% CI) | 0.933 (0.895–0.971) | 0.938 (0.899–0.977) | 0.945 (0.902–0.988) | 0.924 (0.884–0.963) |
| AUC (95% CI) | 0.812 (0.809–0.815) | 0.793 (0.708–0.879)* | 0.732 (0.729–0.735)* | 0.792 (0.790–0.795)* |
| Accuracy (95% CI) | 0.792 (0.738–0.845) | 0.719 (0.660–0.779) | 0.584 (0.519–0.649)* | 0.796 (0.743–0.849) |
“*” indicates P<0.05 in comparison with the selected model. CI, confidence interval; PPV, positive predictive value; NPV, negative predictive value; AUC, area under the curve; SAPS-II, Simplified Acute Physiology Score-II; SOFA, sequential organ failure assessment.
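The sensitivity, specificity, PPV, NPV, and accuracy in the table all derive from one confusion matrix at the chosen cut-off. The sketch below is illustrative only: the counts (44 true positives, 12 false negatives, 269 true negatives, 85 false positives) are not reported in the source but are back-calculated here from the training-group rates and the 56 non-survivors and 354 survivors in that group. The normal-approximation (Wald) interval is likewise an assumption.

```python
import math

def proportion_with_wald_ci(successes, total, z=1.96):
    """Return (estimate, lower, upper) using the normal-approximation interval."""
    p = successes / total
    half_width = z * math.sqrt(p * (1 - p) / total)
    return p, p - half_width, p + half_width

def diagnostic_metrics(tp, fn, tn, fp):
    """Compute standard diagnostic metrics from confusion-matrix counts."""
    return {
        "sensitivity": proportion_with_wald_ci(tp, tp + fn),
        "specificity": proportion_with_wald_ci(tn, tn + fp),
        "ppv": proportion_with_wald_ci(tp, tp + fp),
        "npv": proportion_with_wald_ci(tn, tn + fn),
        "accuracy": proportion_with_wald_ci(tp + tn, tp + fn + tn + fp),
    }

# Assumed counts for the selected model in the training group (inferred, not reported)
metrics = diagnostic_metrics(tp=44, fn=12, tn=269, fp=85)
for name, (estimate, low, high) in metrics.items():
    print(f"{name}: {estimate:.3f} ({low:.3f}-{high:.3f})")
```

With these inferred counts the sketch gives a sensitivity of 0.786 (0.678–0.893) and a specificity of 0.760 (0.715–0.804), in line with the training-group row for the selected model. The low PPV alongside a high NPV reflects the roughly 14% mortality rate in the cohort.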
Figure 1 The receiver operating characteristic curves of different models in the prediction of in-hospital mortality. (A) Training group; (B) testing group. AUC, area under the curve; SOFA, sequential organ failure assessment; SAPS-II, Simplified Acute Physiology Score-II.
Figure 2 Comparison of the clinical value among different models. SOFA, sequential organ failure assessment; SAPS-II, Simplified Acute Physiology Score-II.
Figure 3 A nomogram to predict the risk of in-hospital mortality in acute pancreatitis. *, P<0.05; **, P<0.01; ***, P<0.001.
Figure 4 A schematic nomogram for predicting the risk of in-hospital mortality in a specific patient with acute pancreatitis. *, P<0.05; **, P<0.01; ***, P<0.001.