| Literature DB >> 36017511 |
Xiran Peng1,2, Tao Zhu1,2, Guo Chen1,2, Yaqiang Wang3, Xuechao Hao1,2.
Abstract
Aim: Postoperative pulmonary complications (PPCs) can increase the risk of postoperative mortality, and the geriatric population has high incidence of PPCs. Early identification of high-risk geriatric patients is of great value for clinical decision making and prognosis improvement. Existing prediction models are based purely on structured data, and they lack predictive accuracy in geriatric patients. We aimed to develop and validate a deep neural network model based on combined natural language data and structured data for improving the prediction of PPCs in geriatric patients.Entities:
Keywords: deep neural network model; electronic health records; geriatric assessment (MeSH); postoperative pulmonary complications; risk assessment
Year: 2022 PMID: 36017511 PMCID: PMC9395933 DOI: 10.3389/fsurg.2022.976536
Source DB: PubMed Journal: Front Surg ISSN: 2296-875X
Performance metrics of the deep neural network model and other machine learning models.
| Model | Precision (95% CI) | Sensitivity (95% CI) | F1 score (95% CI) | AUPRC (95% CI) | Specificity (95% CI) | Accuracy (95% CI) | AUROC (95% CI) |
|---|---|---|---|---|---|---|---|
| Extreme gradient boosting | 0.675 (0.674–0.676) | 0.502 (0.501–0.502) | 0.575 (0.575–0.576) | 0.605 (0.604–0.605) | 0.954 (0.953–0.954) | 0.882 (0.881–0.882) | 0.866 (0.866–0.867) |
| Gradient boosting machine | 0.687 (0.686–0.687) | 0.519 (0.519–0.520) | 0.591 (0.591–0.592) | 0.620 (0.620–0.621) | 0.955 (0.954–0.955) | 0.885 (0.885–0.886) | 0.878 (0.878–0.879) |
| Random Forest | 0.695 (0.687–0.698) | 0.513 (0.512–0.515) | 0.590 (0.589–0.592) | 0.628 (0.621–0.630) | 0.955 (0.954–0.955) | 0.886 (0.885–0.886) | 0.870 (0.868–0.871) |
| Support vector machine | 0.704 (0.703–0.705) | 0.208 (0.207–0.208) | 0.321 (0.320–0.322) | 0.559 (0.558–0.559) | 0.983 (0.983–0.984) | 0.860 (0.859–0.860) | 0.839 (0.838–839) |
| Elastic Net logistic regression | 0.682 (0.681–0.683) | 0.318 (0.318–0.319) | 0.434 (0.433–0.434) | 0.571 (0.570–0.571) | 0.971 (0.971–0.972) | 0.867 (0.867–0.868) | 0.841 (0.840–0.841) |
| Deep neural network | 0.684 (0.682 –0.686) | 0.943 (0.941–0.944) | 0.885 (0.884–0.887) | ||||
| In external validation | |||||||
| Deep neural network | 0.610 (0.609–0.610) | 0.596 (0.595–0.596) | 0.930 (0.929–0.930) | 0.877 (0.877–0.878) | |||
Abbreviations: CI, confidence interval; AUPRC, area under the precision-recall curve; AUROC, area under the receiver operating characteristic curve.
The deep neural network model retrained on the external dataset.
Figure 1Hosmer-Lemeshow calibration plot of the deep neural network model based on the derivation dataset. Values on the x-axis are deciles of predicted risk of postoperative pulmonary complications and values on the y-axis are rates of postoperative pulmonary complications for each decile. The result of Hosmer–Lemeshow test (P = 0.80) showed good agreement between the deep neural network model-based prediction and observed outcome.
Top ten most important variables in the deep neural network model for patients in the derivation dataset and the two case examples.
| Patients in the derivation dataset | Patient A | Patient B | |||
|---|---|---|---|---|---|
| Variable | Importance | Variable | Importance | Variable | Importance |
| Acidophil count | 65.23 | Acidophil count | 50.89 | Triglyceride | 50.99 |
| Triglyceride | 61.93 | Difficult ventilation history | 50.02 | COPD | 50.19 |
| Fibrinogen | 60.73 | COPD | 50.01 | Difficult ventilation history | 49.98 |
| Functional capacity | 59.93 | Airway obstruction | 49.96 | Airway obstruction | 49.79 |
| Platelet count | 58.78 | Respiratory infection within last 1 month | 49.70 | Free text record | 49.48 |
| Acidophil percentage | 57.76 | Low activity | 49.63 | Difficult intubation history | 49.45 |
| Neck movement test | 57.68 | Systolic blood pressure | 49.45 | MCH | 49.30 |
| Hydroxybutyrate dehydrogenase | 57.52 | Free text record | 49.23 | Upper digestive tract hemorrhage within last 1 week | 49.28 |
| MCHC | 57.44 | Total bilirubin | 49.02 | NYHA classification | 49.27 |
| MCH | 57.25 | Monocyte percentage | 48.94 | Decreased endurance | 49.09 |
Abbreviations: COPD, chronic obstructive pulmonary disease; MCH, mean corpuscular hemoglobin; MCHC, mean corpuscular hemoglobin concentration; NYHA, New York Heart Association.
Figure 2Hosmer-Lemeshow calibration plot of the deep neural network model based on the external dataset. Values on the x-axis are deciles of predicted risk of postoperative pulmonary complications and values on the y-axis are rates of postoperative pulmonary complications for each decile. The result of Hosmer–Lemeshow test (P = 0.78) showed good agreement between the deep neural network model-based prediction and observed outcome.