| Literature DB >> 35536634 |
Pei-Fu Chen1,2, Lichin Chen3, Yow-Kuan Lin1,4, Guo-Hung Li1, Feipei Lai1,5,6, Cheng-Wei Lu2,7, Chi-Yu Yang8,9, Kuan-Chih Chen1,10, Tzu-Yu Lin2,7.
Abstract
BACKGROUND: Machine learning (ML) achieves better predictions of postoperative mortality than previous prediction tools. Free-text descriptions of the preoperative diagnosis and the planned procedure are available preoperatively. Because reading these descriptions helps anesthesiologists evaluate the risk of the surgery, we hypothesized that deep learning (DL) models with unstructured text could improve postoperative mortality prediction. However, it is challenging to extract meaningful concept embeddings from this unstructured clinical text.Entities:
Keywords: anesthesia; anesthesiologist; bidirectional encoder representations from transformers; deep learning model; deep neural network; electronic health record; machine learning; natural language processing; neural network; postoperative mortality prediction; prediction model; preoperative medicine; unstructured text
Year: 2022 PMID: 35536634 PMCID: PMC9131148 DOI: 10.2196/38241
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Flow diagram. ASAPS: American Society of Anesthesiologist Physical Status.
Characteristics of the cohort. Categorical variables are represented as frequency (%). Continuous variables are represented as the median (25th, 75th percentile). The testing cohort was split by time between the training and validation cohorts, and those cases arising from the training and validation cohorts were removed to prevent data leakage (n=5890, 4.9%).
| Feature | Training cohort (N=79,324) | Validation cohort (N=19,832) | Testing cohort (N=16,267) | Overall cohort (N=121,313) | |||||
| Age (years), median (25th, 75th percentile) | 54 (40, 66) | 54 (40, 66) | 53 (39, 65) | 55 (41, 66) | |||||
| Male sex, n (%) | 40,444 (51.0) | 9922 (50.0) | 8101 (49.8) | 61,485 (50.7) | |||||
| Height (cm), median (25th, 75th percentile) | 162 (157, 168) | 162 (156, 168) | 162 (157, 169) | 162 (157, 168) | |||||
| Weight (kg), median (25th, 75th percentile) | 64 (56, 74) | 64 (56, 74) | 65 (56, 75) | 64 (56, 74) | |||||
| BMI, median (25th, 75th percentile) | 24 (22, 27) | 24 (22, 27) | 24 (22, 27) | 24 (22, 27) | |||||
|
| |||||||||
|
| 1 | 2925 (3.7) | 739 (3.7) | 660 (4.1) | 4404 (3.6) | ||||
|
| 2 | 54,056 (68.15) | 13,549 (68.3) | 11,508 (70.7) | 82,588 (68.1) | ||||
|
| 3 | 20,842 (26.3) | 5155 (26.0) | ,654 (22.5) | 31,878 (26.3) | ||||
|
| 4 | 1345 (1.70) | 355 (1.8) | 397 (2.4) | 2204 (1.8) | ||||
|
| 5 | 156 (0.2) | 34 (0.2) | 48 (0.3) | 239 (0.2) | ||||
| ASAb emergency, n (%) | 6379 (8.0) | 1615 (8.1) | 1678 (10.3) | 9942 (8.2) | |||||
|
| |||||||||
|
| General | 69,898 (88.3) | 17,497 (88.4) | 14,486 (89.2) | 107,176 (88.5) | ||||
|
| Neuraxial | 9297 (11.7) | 2303 (11.6) | 1748 (10.8) | 13,929 (11.5) | ||||
|
| |||||||||
|
| Elective | 62,226 (78.5) | 15,455 (77.9) | 12,000 (73.8) | 94,816 (78.2) | ||||
|
| Urgent | 13,800 (17.4) | 3,567 (18.0) | 3,356 (20.6) | 21,342 (17.6) | ||||
|
| Emergency | 2849 (3.6) | 708 (3.6) | 801 (4.9) | 4484 (3.7) | ||||
|
| Immediate | 449 (0.57) | 102 (0.51) | 110 (0.7) | 671 (0.6) | ||||
|
| |||||||||
|
| Ward | 47,187 (59.5) | 11,788 (59.4) | 9824 (60.4) | 72,045 (59.4) | ||||
|
| Outpatient | 18,386 (23.2) | 4463 (22.5) | 2995 (18.4) | 27,830 (22.9) | ||||
|
| Emergency department | 10,083 (12.7) | 2592 (13.1) | 2283 (14.0) | 15,247 (12.6) | ||||
|
| Intensive care unit | 3668 (4.6) | 989 (5.0) | 1165 (7.2) | 6191 (5.1) | ||||
|
| |||||||||
|
| Urology | 14,760 (18.6) | 3630 (18.3) | 2665 (16.4) | 22,471 (18.5) | ||||
|
| General | 11,416 (14.4) | 2926 (14.8) | 2457 (15.1) | 17,608 (14.5) | ||||
|
| Orthopedics | 10,976 (13.8) | 2748 (13.9) | 2338 (14.4) | 16,772 (13.8) | ||||
|
| Gynecologyc | 10,206 (12.9) | 2,578 (13.0) | 2,302 (14.2) | 15,679 (12.9) | ||||
|
| Cardiovascular | 8692 (11.0) | 2086 (10.5) | 1491 (9.2) | 13,049 (10.8) | ||||
|
| Otolaryngology | 6193 (7.8) | 1505 (7.6) | 1223 (7.5) | 9427 (7.8) | ||||
|
| Plastic surgery | 5116 (6.5) | 1294 (6.5) | 1077 (6.6) | 7821 (6.4) | ||||
|
| Neurosurgery | 3233 (4.1) | 833 (4.2) | 727 (4.5) | 4955 (4.1) | ||||
|
| Traumatology | 2808 (3.5) | 740 (3.7) | 722 (4.4) | 4357 (3.6) | ||||
|
| Thoracic surgery | 2006 (2.5) | 514 (2.6) | 430 (2.6) | 3104 (2.6) | ||||
|
| Colorectal surgery | 1679 (2.1) | 423 (2.1) | 331 (2.0) | 2574 (2.1) | ||||
|
| Others | 2239 (2.8) | 555 (2.8) | 504 (3.1) | 3496 (2.9) | ||||
|
| |||||||||
|
| Diabetes mellitus | 15,906 (20.1) | 3863 (19.5) | 2812 (17.3) | 24,314 (20.0) | ||||
|
| Hyperlipidemia | 8704 (11.0) | 2119 (10.7) | 1740 (10.7) | 13,678 (11.3) | ||||
|
| Hypertension | 28,462 (35.9) | 7055 (35.6) | 4999 (30.7) | 43,391 (35.8) | ||||
|
| Prior cerebrovascular accident | 4355 (5.5) | 1028 (5.2) | 717 (4.4) | 6564 (5.4) | ||||
|
| Cardiac disease | 13,215 (16.7) | 3254 (16.4) | 2227 (13.7) | 20,156 (16.6) | ||||
|
| Chronic obstructive pulmonary disease | 1549 (2.0) | 380 (1.9) | 286 (1.8) | 2428 (2.0) | ||||
|
| Asthma | 3024 (3.8) | 762 (3.8) | 592 (3.6) | 4626 (3.8) | ||||
|
| Hepatic disease | 9118 (11.5) | 2299 (11.6) | 1664 (10.2) | 13,887 (11.4) | ||||
|
| Renal disease | 12,471 (15.7) | 3095 (15.6) | 1466 (9.0) | 18,874 (15.6) | ||||
|
| Bleeding disorder | 11,243 (14.2) | 2684 (13.5) | 2122 (13.0) | 17,543 (14.5) | ||||
|
| Prior major operations | 54,356 (68.5) | 13,592 (68.5) | 10,040 (61.7) | 83,490 (68.8) | ||||
|
| Smoking | 20,235 (25.5) | 5098 (25.7) | 3719 (22.9) | 30,433 (25.1) | ||||
|
| Drug allergy | 11,662 (14.7) | 2959 (14.9) | 2190 (13.5) | 18,092 (14.9) | ||||
|
| Consciousness | 69,858 (88.1) | 17,461 (88.0) | 15,107 (92.9) | 107,906 (88.9) | ||||
| 30-day mortality, n (%) | 997 (1.3) | 249 (1.3) | 215 (1.3) | 1562 (1.3) | |||||
aASAPS: American Society of Anesthesiologist Physical Status.
bASA: American Society of Anesthesiologists.
cThe gynecology department consists of gynecology and obstetrics.
Feature groups included in the models.
| Feature type | Feature classesa | |
|
| ||
|
| Continuous | Age, height, weight, BMI |
|
| Categorical | Sex (2), ASAPSb (5), ASAc emergency (2), department (22), preoperative location (4), anesthesia type (4) |
|
| ||
|
| Categorical | Emergency level (4) |
|
| Free text | Preoperative diagnosis, proposed procedure |
|
| ||
|
| Categorical | Diabetes mellitus (2), hyperlipidemia (2), hypertension (2), cerebrovascular accident (2), cardiac disease (2), chronic obstructive pulmonary disease (2), asthma (2), hepatic disease (2), renal disease (2), bleeding disorder (2), major operations (2), smoking (2), drug allergy (2) |
|
| ||
|
| Continuous | Hemoglobin, platelet, international normalized ratio, prothrombin time, activated partial thromboplastin time, creatinine, aspartate transaminase, alanine transaminase, blood sugar, serum sodium, serum potassium |
|
| ||
|
| Continuous | Body temperature, oxygen saturation, heart rate, respiratory rate, systolic and diastolic blood pressure |
|
| Categorical | Consciousness status (2) |
aThe number of classes is shown in parentheses.
bASAPS: American Society of Anesthesiologist Physical Status.
cASA: American Society of Anesthesiologists.
Figure 2Architectures of models. BERT: bidirectional encoder representations from transformers: DNN: deep neural network; FC: fully connected; ML: machine learning; ReLU: rectified linear unit; XGBoost: extreme gradient boosting.
Prediction performances of MLa models and ASAPSb on the testing cohort with 95% CIs.
| Model | AUROCc (95% CI) | AUPRCd (95% CI) | Accuracye (95% CI) | Sensitivitye (95% CI) | Specificitye (95% CI) | Precisiona (95% CI) | F1 scoree (95% CI) |
| BERTf-DNNg | 0.964 (0.961-0.967) | 0.336 (0.276-0.402) | 0.955 (0.952-0.958) | 0.749 (0.689-0.805) | 0.958 (0.955-0.961) | 0.193 (0.166-0.219) | 0.307 (0.269-0.342) |
| DNN | 0.959 (0.956-0.962) | 0.319 (0.260-0.384) | 0.913 (0.909-0.917) | 0.885 (0.841-0.926) | 0.913 (0.909-0.918) | 0.120 (0.104-0.136) | 0.212 (0.187-0.236) |
| Random forest | 0.961 (0.958-0.964) | 0.296 (0.239-0.360) | 0.986 (0.984-0.988) | 0.167 (0.122-0.222) | 0.997 (0.996-0.998) | 0.445 (0.341-0.557) | 0.242 (0.182-0.314) |
| XGBoosth | 0.950 (0.946-0.953) | 0.281 (0.225-0.345) | 0.986 (0.984-0.987) | 0.195 (0.144-0.249) | 0.996 (0.995-0.997) | 0.409 (0.312-0.500) | 0.263 (0.201-0.326) |
| Logistic regression | 0.952 (0.949-0.955) | 0.276 (0.220-0.339) | 0.904 (0.900-0.909) | 0.833 (0.780-0.882) | 0.905 (0.901-0.910) | 0.105 (0.091-0.119) | 0.187 (0.164-0.210) |
| ASAPS | 0.892 (0.887-0.896) | 0.149 (0.107-0.203) | 0.970 (0.968-0.973) | 0.409 (0.342-0.478) | 0.978 (0.975-0.980) | 0.197 (0.160-0.235) | 0.266 (0.220-0.310) |
aML: machine learning.
bASAPS: American Society of Anesthesiologist Physical Status.
cAUROC: area under the receiver operating characteristic.
dAUPRC: area under the precision-recall curve.
eThese metrics were calculated without adjusting the threshold (using 0.5 as the cut-off).
fBERT: bidirectional encoder representations from transformers.
gDNN: deep neural network.
hXGBoost: extreme gradient boosting.
Figure 3Comparison of discrimination of different models. (A) AUROC. (B) AUPRC. ASAPS: American Society of Anesthesiologist Physical Status; AUPRC: area under the precision-recall curve; AUROC: area under the receiver operating characteristic curve; BERT: bidirectional encoder representations from transformers; DNN: deep neural network; XGBoost: extreme gradient boosting.
Statistical significances of AUROCsa of different models. Values are P values. We applied a nonparametric approach proposed by DeLong et al [27] to calculate the SE of the area and the P value.
|
| BERTb-DNNc | DNN | Random forest | XGBoostd | Logistic regression |
| ASAPSe | <0.0001f | <0.0001f | <0.0001f | <0.0001f | <0.0001f |
| Logistic regression | 0.0005f | 0.0711 | 0.0351f | 0.6451 | N/Ag |
| XGBoost | 0.0025f | 0.0939 | 0.0262f | N/A | N/A |
| Random forest | 0.3816 | 0.5972 | N/A | N/A | N/A |
| DNN | 0.0944 | N/A | N/A | N/A | N/A |
aAUROC: area under the receiver operating characteristic.
bBERT: bidirectional encoder representations from transformers.
cDNN: deep neural network.
dXGBoost: extreme gradient boosting.
eASAPS: American Society of Anesthesiologist Physical Status.
fThe difference in areas achieved statistical significance (P<.05).
gN/A: not applicable.
Statistical significances of AUPRCsa of different models. Values are differences in areas with 95% CIs calculated by bootstrapping 1000 times [28]. If the 95% CI for the difference in areas does not include 0, it can be concluded that these 2 areas are significantly different (P<.05).
|
| BERTb-DNNc, difference in areas (95% CI) | DNN, difference in areas (95% CI) | Random forest, difference in areas (95% CI) | XGBoostd, difference in areas (95% CI) | Logistic regression, difference in areas (95% CI) |
| ASAPSe | 0.188 (0.159-0.221)f | 0.170 (0.137-0.201)f | 0.147 (0.122-0.177)f | 0.133 (0.107-0.162)f | 0.127 (0.101-0.154)f |
| Logistic regression | 0.061 (0.051-0.073)f | 0.043 (0.021-0.056)f | 0.020 (0.006-0.031)f | 0.006 (–0.006 to 0.014) | N/Ag |
| XGBoost | 0.055 (0.044-0.068)f | 0.038 (0.024-0.046)f | 0.015 (0.005-0.022)f | N/A | N/A |
| Random forest | 0.040 (0.030-0.054)f | 0.023 (0.010-0.032)f | N/A | N/A | N/A |
| DNN | 0.018 (0.008-0.037)f | N/A | N/A | N/A | N/A |
aAUPRC: area under the precision-recall curve.
bBERT: bidirectional encoder representations from transformers.
cDNN: deep neural network.
dXGBoost: extreme gradient boosting.
eASAPS: American Society of Anesthesiologist Physical Status.
fThe difference in areas achieved statistical significance (P<.05).
gN/A: not applicable.
Figure 4Calibration plot. The observed incidence of mortality was plotted against the calibrated predicted probability of mortality among patients in the test cohort (n=16,267, 14.1%). Predicted probabilities were calibrated by applying the histogram binning technique in the validation cohort using 5 bins. Mean predicted probabilities of in-hospital 30-day mortality were calculated within each group.
Figure 5Word embeddings visualized by t distributed stochastic neighbor embedding. (A) Word embeddings of the training set. (B) Word embeddings of the testing set. “Probs” indicates probabilities predicated by the BERT-DNN model. The intensity of color increased with the probability. “Labels” indicates mortalities by “x” and survivors by “•”. ards: acute respiratory distress syndrome; atfl: anterior talofibular ligament; avg: arteriovenous graft; avp: aortic valvuloplasty; BERT: bidirectional encoder representations from transformers; bct: breast-conserving therapy; bil: bilateral; bph: benign prostate hypertrophy; bx: biopsy; chr: chronic hypertrophic rhinitis; cps: chronic paranasal sinusitis; dbj: double J stent; DNN: deep neural network; ecmo: extracorporeal membrane oxygenation; emh: endometrial hemorrhage; esrd: end-stage renal disease; fess: functional endoscopic sinus surgery; itc: intertrochanter; ivg: intravenous general anesthesia; lih: left inguinal hernia; mvr: mitral valve replacement; nsd: nasal septum deviation; p: post; pcnl: percutaneous nephrolithotomy; perm cath: permanent catheter; psa: prostate-specific antigen; r: rule out; r’t: right; rirs: retrograde intrarenal surgery; rv: right ventricle; slnd: sentinel lymph node dissection; SNE: stochastic neighbor embedding; t colon: transverse colon; tee: transesophageal echocardiography; tep: total extraperitoneal approach; trus: transrectal ultrasound; turp: transurethral resection of the prostate; urs: ureteroscopy; vats: video-assisted thoracic surgery; vhd: valvular heart disease. Higher-resolution version of this figure available in Multimedia Appendix 3.
Texts and their predicted probabilities by language model. Values are probabilities or mortalities.
| Predicted probability | Observed mortality (1=mortality; 0=no mortality) | Original free text combining preoperative diagnosis and proposed procedures |
| 0.951 | 1 | IHCAa pb CPRc ECMOd ACSe ARf full sternotomy CABGg AVRh |
| 0.948 | 0 | AMIi cardiogenic shock p ECMO remove ECMO TEEj |
| 0.940 | 1 | hollow organ perforation rk PPUl related LPPUm possible EXP LAPn |
| 0.936 | 1 | intra-abdominal bleeding EXP LAP |
| 0.932 | 1 | ischemic bowel laparoscopic diagnosis possible EXP LAP |
| 0.927 | 0 | acute pulmonary embolism IHCA p ECMO angiography TEE |
| 0.925 | 1 | duodenal ulcer perforation p duodenorrhaphy leakage bleeding EXP LAP |
| 0.912 | 0 | respiratory failure tracheostomy |
| 0.880 | 0 | hallow organ perforation r PPU LPPU |
| 0.815 | 0 | acute kidney failure perm catho insertion |
| 0.760 | 0 | post UPPPp wound bleeding check bleeding |
| 0.680 | 0 | ESRDq HDr via right perm caths qw2 4 6 perm cath dysfunction perm cath insertion change perm cath right neck |
| 0.527 | 0 | ESRD left AVGt occlusion left AVG thrombectomy |
| 0.415 | 0 | left lower leg soft tissue infection suspect necrotizing fasciitis debridement |
| 0.353 | 0 | ESRD right AVFu dysfunction upper arm angiography PTAv |
| 0.250 | 0 | RLLw lung tumor r lung cancer vats RLL lobectomy wedge first send frozen exam |
| 0.186 | 0 | left lower extremity NFx open BKy |
| 0.114 | 0 | left anterior mediastinal tumor multiple lung nodules rectal cancer p CCRTz VATSaa mediastinal tumor excision LARab |
| 0.042 | 0 | right ACLac MCLad injury arthroscopy ACL reconstruction |
| 0.041 | 0 | 1 C4 5 6 spondylosis 2 right carpal tunnel 1 ACDFae C4 5 6 2 right median nerve decompression |
| 0.031 | 0 | bilaf ovag teratoma laparoscopy adnexectomy |
| 0.030 | 0 | left ureter stone URSLah laser left |
| 0.029 | 0 | uterine myoma robotic myomectomy |
| 0.029 | 0 | acute appendicitis laparoscopic appendectomy |
| 0.029 | 0 | hemorrhoids hemorrhoidectomy |
| 0.029 | 0 | nontoxic goiter thyroidectomy |
| 0.027 | 0 | infertility TVORai |
| 0.027 | 0 | endometrial polyp TCRaj |
| 0.027 | 0 | GAak 38 weeks breech caesarean section |
| 0.025 | 0 | rtal breast lesion MRIam guided biopsy |
| 0.025 | 0 | right inguinal hernia TEPan right |
aIHCA: intrahospital cardiac arrest.
bp: post.
cCPR: cardiopulmonary resuscitation.
dECMO: extracorporeal membrane oxygenation.
eACS: acute coronary syndrome.
fAR: aortic regurgitation.
gCABG: coronary artery bypass graft.
hAVR: aortic valve replacement.
iAMI: acute myocardial infarction.
jTEE: transesophageal echocardiography.
kr: rule out.
lPPU: perforated peptic ulcer.
mLPPU: laparoscopic perforated peptic ulcer surgery.
nEXP LAP: exploratory laparotomy.
ocath: catheter.
pUPPP: uvulopalatopharyngoplasty.
qESRD, end-stage renal disease.
rHD: hemodialysis.
sperm cath: permanent catheter.
tAVG: arteriovenous graft.
uAVF: arteriovenous fistula.
vPTA: percutaneous transluminal angioplasty.
wRLL: right lower lobe.
xNF: necrotizing fasciitis.
yBK: below-knee amputation.
zCCRT: concurrent chemoradiotherapy.
aaVATS: video-assisted thoracic surgery.
abLAR: low anterior resection.
acACL: anterior cruciate ligament.
adMCL: medial collateral ligament.
aeACDF: anterior cervical discectomy and fusion.
afbil: bilateral.
agov: ovarian.
ahURSL: ureteroscopic lithotomy.
aiTVOR: transvaginal oocyte retrieval.
ajTCR: transcervical resectoscope.
akGA: gestational age.
alrt: right.
amMRI: magnetic resonance imaging.
anTEP: total extraperitoneal approach.