| Literature DB >> 35748075 |
Jung Ho Han1, So Jin Yoon1, Hye Sun Lee2, Goeun Park2, Joohee Lim1, Jeong Eun Shin1, Ho Seon Eun1, Min Soo Park1, Soon Min Lee3.
Abstract
PURPOSE: The aims of the study were to develop and evaluate a machine learning model with which to predict postnatal growth failure (PGF) among very low birth weight (VLBW) infants.Entities:
Keywords: Growth failure; machine learning; neonatal intensive care unit; prediction; very low birth weight infants
Mesh:
Substances:
Year: 2022 PMID: 35748075 PMCID: PMC9226835 DOI: 10.3349/ymj.2022.63.7.640
Source DB: PubMed Journal: Yonsei Med J ISSN: 0513-5796 Impact factor: 3.052
Fig. 1Schematic flow chart of the enrolled very low birth weight (VLBW) infants.
Baseline Demographics of the Training Set and Test Set (n=7954)
| Training (n=6363) | Test (n=1591) | |||
|---|---|---|---|---|
| Variables for Day 0 | ||||
| Sex | 3169 (49.8) | 825 (51.9) | 0.143 | |
| Maternal hypertension | 1433 (22.5) | 327 (20.6) | 0.085 | |
| Maternal PROM | 2297 (36.1) | 578 (36.3) | 0.857 | |
| Gestational age (weeks) | 28.99±2.4 | 28.99±2.4 | 0.515 | |
| Birth weight (g) | 1123.20±252.8 | 1115.23±259.5 | 0.263 | |
| SGA | 1016 (16.0) | 250 (15.7) | 0.804 | |
| Variables added for Day 7 | ||||
| Air leak syndrome | 187 (2.9) | 43 (2.7) | 0.615 | |
| Respiratory distress syndrome | 5049 (79.4) | 1263 (79.4) | 0.976 | |
| Intraventricular hemorrhage | 852 (13.4) | 211 (13.3) | 0.885 | |
| Duration of invasive ventilation until 7 days of age (days) | 3.51±3.0 | 3.57±3.0 | 0.490 | |
| Duration of non-invasive ventilation until 7 days of age (days) | 4.85±2.6 | 4.88±2.6 | 0.646 | |
| Variables added for Day 14 | ||||
| PDA medical treatment | 2139 (33.6) | 560 (35.2) | 0.357 | |
| PDA surgical treatment | 659 (10.4) | 162 (10.2) | 0.700 | |
| Duration of invasive ventilation until 14 days of age (days) | 5.54±5.7 | 5.67±5.7 | 0.396 | |
| Duration of non-invasive ventilation until 14 days of age (days) | 8.56±5.3 | 8.57±5.3 | 0.931 | |
| Variables added for Day 28 | ||||
| Idiopathic spontaneous bowel perforation | 98 (1.5) | 27 (1.7) | 0.653 | |
| Sepsis | 1202 (18.9) | 340 (21.4) | 0.030 | |
| Necrotizing enterocolitis (≥stage 2) | 297 (4.7) | 81 (5.1) | 0.479 | |
| Duration of invasive ventilation until 28 days of age (days) | 8.38±10.5 | 8.74±10.7 | 0.235 | |
| Duration of non-invasive ventilation until 28 days of age (days) | 14.12±10.7 | 14.25±10.8 | 0.654 | |
| Variables added for at discharge | ||||
| Total duration of invasive ventilation (days) | 11.88±19.6 | 13.04±21.1 | 0.048 | |
| Total duration of non-invasive ventilation (days) | 19.47±18.5 | 19.38±18.7 | 0.862 | |
PROM, premature rupture of membrane; SGA, small for gestational age; PDA, patent ductus arteriosus.
Data are presented as mean±standard deviation or n (%).
Predictive Performances of Four Machine Learning Models and MLR for PGF at Discharge
| Models | XGB | RF | SVM | CNN | MLR | |
|---|---|---|---|---|---|---|
| Day 0 | ||||||
| AUROC | 0.72 (0.69–0.74) | 0.67 (0.65–0.70)* | 0.66 (0.64–0.69)* | 0.66 (0.63–0.68)* | 0.71 (0.69–0.74) | |
| Accuracy | 0.66 (0.64–0.69) | 0.63 (0.61–0.66) | 0.62 (0.59–0.64)* | 0.61 (0.59–0.64)* | 0.66 (0.63–0.68) | |
| Error rate | 0.34 (0.31–0.36) | 0.37 (0.34–0.39) | 0.38 (0.36–0.41)* | 0.39 (0.36–0.41)* | 0.34 (0.32–0.37) | |
| Precision | 0.60 (0.57–0.64) | 0.60 (0.56–0.63) | 0.56 (0.52–0.59)* | 0.55 (0.52–0.59)* | 0.60 (0.56–0.63) | |
| Sensitivity | 0.73 (0.70–0.76) | 0.56 (0.52–0.59)* | 0.71 (0.67–0.74) | 0.71 (0.68–0.75) | 0.71 (0.68–0.75) | |
| Specificity | 0.61 (0.58–0.64) | 0.70 (0.67–0.72)* | 0.54 (0.51–0.57)* | 0.53 (0.50–0.56)* | 0.61 (0.58–0.64) | |
| F1 score | 0.66 (0.63–0.69) | 0.58 (0.55–0.61)* | 0.62 (0.59–0.65)* | 0.62 (0.60–0.65)* | 0.65 (0.62–0.68) | |
| Day 7 | ||||||
| AUROC | 0.74 (0.71–0.76)* | 0.72 (0.70–0.75) | 0.67 (0.64–0.69)* | 0.70 (0.68–0.72)* | 0.72 (0.70–0.75) | |
| Accuracy | 0.68 (0.66–0.70) | 0.65 (0.63–0.68) | 0.61 (0.59–0.64)* | 0.66 (0.64–0.68) | 0.67 (0.65–0.69) | |
| Error rate | 0.32 (0.30–0.34) | 0.35(0.32–0.37) | 0.39 (0.36–0.41)* | 0.34 (0.32–0.36) | 0.33 (0.31–0.35) | |
| Precision | 0.62 (0.59–0.65) | 0.59 (0.55–0.62)* | 0.55 (0.52–0.59)* | 0.60 (0.56–0.63)* | 0.62 (0.59–0.65) | |
| Sensitivity | 0.73 (0.70–0.76)* | 0.78 (0.75–0.81)* | 0.72 (0.69–0.75)* | 0.74 (0.71–0.77)* | 0.68 (0.65–0.71) | |
| Specificity | 0.63 (0.60–0.67)* | 0.55 (0.52–0.59)* | 0.53 (0.50–0.56)* | 0.59 (0.56–0.62)* | 0.66 (0.63–0.69) | |
| F1 score | 0.67 (0.64–0.70)* | 0.67 (0.64–0.70) | 0.63 (0.60–0.65) | 0.66 (0.63–0.69) | 0.65 (0.62–0.68) | |
| Day 14 | ||||||
| AUROC | 0.74 (0.72–0.76) | 0.73 (0.71–0.76) | 0.67 (0.65–0.70)* | 0.71 (0.69–0.74)* | 0.73 (0.70–0.75) | |
| Accuracy | 0.68 (0.66–0.70) | 0.68 (0.66–0.70) | 0.62 (0.59–0.64)* | 0.66 (0.64–0.69) | 0.68 (0.66–0.71) | |
| Error rate | 0.32 (0.30–0.34) | 0.32 (0.30–0.34) | 0.38 (0.36–0.41)* | 0.34 (0.31–0.36) | 0.32 (0.29–0.34) | |
| Precision | 0.62 (0.59–0.66)* | 0.64 (0.60–0.67)* | 0.56 (0.52–0.59)* | 0.60 (0.56–0.63)* | 0.67 (0.63–0.70) | |
| Sensitivity | 0.71 (0.68–0.74)* | 0.68 (0.64–0.71)* | 0.72 (0.69–0.76)* | 0.77 (0.74–0.80)* | 0.59 (0.55–0.62) | |
| Specificity | 0.65 (0.62–0.68)* | 0.69 (0.66–0.72)* | 0.53 (0.50–0.56)* | 0.58 (0.54–0.61)* | 0.76 (0.73–0.79) | |
| F1 score | 0.66 (0.64–0.69)* | 0.66 (0.63–0.68)* | 0.63 (0.60–0.66) | 0.67 (0.64–0.70)* | 0.62 (0.59–0.65) | |
| Day 28 | ||||||
| AUROC | 0.74 (0.72–0.77) | 0.75 (0.72–0.77)* | 0.69 (0.66–0.71)* | 0.71 (0.69–0.74)* | 0.73 (0.71–0.76) | |
| Accuracy | 0.70 (0.68–0.72) | 0.70 (0.67–0.72) | 0.62 (0.60–0.65)* | 0.68 (0.65–0.70) | 0.69 (0.67–0.71) | |
| Error rate | 0.30 (0.28–0.32) | 0.30 (0.28–0.33) | 0.38 (0.35–0.40)* | 0.32 (0.30–0.35) | 0.31 (0.29–0.33) | |
| Precision | 0.65 (0.62–0.68) | 0.65 (0.62–0.69) | 0.56 (0.53–0.59)* | 0.63 (0.59–0.66)* | 0.66 (0.63–0.70) | |
| Sensitivity | 0.71 (0.67–0.74)* | 0.69 (0.66–0.72)* | 0.75 (0.72–0.78)* | 0.68 (0.65–0.72)* | 0.62 (0.59–0.66) | |
| Specificity | 0.69 (0.66–0.72)* | 0.70 (0.67–0.73)* | 0.52 (0.49–0.55)* | 0.67 (0.64–0.70)* | 0.74 (0.71–0.77) | |
| F1 score | 0.68 (0.65–0.70)* | 0.67 (0.64–0.70)* | 0.64 (0.61–0.67) | 0.65 (0.63–0.68) | 0.64 (0.61–0.67) | |
| At discharge | ||||||
| AUROC | 0.75 (0.72–0.77) | 0.75 (0.73–0.77)* | 0.69 (0.67–0.72)* | 0.72 (0.70–0.75)* | 0.74 (0.71–0.76) | |
| Accuracy | 0.70 (0.67–0.72) | 0.70 (0.67–0.72) | 0.65 (0.62–0.67)* | 0.67 (0.65–0.69)* | 0.69 (0.67–0.71) | |
| Error rate | 0.30 (0.28–0.32) | 0.30 (0.28–0.33) | 0.38 (0.35–0.40)* | 0.32 (0.30–0.35)* | 0.31 (0.29–0.33) | |
| Precision | 0.65 (0.62–0.68) | 0.65 (0.61–0.68) | 0.61 (0.57–0.65)* | 0.60 (0.57–0.64)* | 0.65 (0.62–0.69) | |
| Sensitivity | 0.69 (0.66–0.73)* | 0.71 (0.68–0.74)* | 0.57 (0.53–0.61)* | 0.76 (0.73–0.79)* | 0.67 (0.63–0.70) | |
| Specificity | 0.70 (0.67–0.72) | 0.69 (0.65–0.72)* | 0.71 (0.68–0.74) | 0.59 (0.56–0.63)* | 0.71 (0.68–0.74) | |
| F1 score | 0.67 (0.64–0.70) | 0.68 (0.65–0.70) | 0.59 (0.56–0.62)* | 0.67 (0.65–0.70) | 0.66 (0.63–0.69) | |
PGF, postnatal growth failure; XGB, extreme gradient boosting; RF, random forest; SVM, support vector machine; CNN, convolutional neural network; MLR, multiple logistic regression; AUROC, area under the receiver operating characteristic curve.
*p<0.05 (compared with MLR).
Optimal Cut-Off Points of Four Machine Learning Models and MLR by Youden’s Index
| XGB | RF | SVM | CNN | MLR | |
|---|---|---|---|---|---|
| Day 0 | >0.3986730 | >0.4748020 | >0.3694691 | >0.3923501 | >0.4053404 |
| Day 7 | >0.4296191 | >0.3707643 | >0.3646140 | >0.3796651 | >0.4227439 |
| Day 14 | >0.4245050 | >0.4427885 | >0.3605821 | >0.3587566 | >0.4829504 |
| Day 28 | >0.4307109 | >0.4462974 | >0.3507205 | >0.4578726 | >0.4578184 |
| At discharge | >0.4497405 | >0.4416529 | >0.4327583 | >0.4162702 | >0.4523962 |
XGB, extreme gradient boosting; RF, random forest; SVM, support vector machine; CNN, convolutional neural network; MLR, multiple logistic regression.
Comparison of AUROCs of the XGB Model among Different Time-Points for Predicting PGF at Discharge Shown as P-Values
| AUROC | ||||||
|---|---|---|---|---|---|---|
| vs. Day 0 | vs. Day 7 | vs. Day 14 | vs. Day 28 | vs. at discharge | ||
| Day 0 | 0.72 (0.69–0.74) | Reference | 0.0045 | 0.0031 | 0.0028 | 0.0004 |
| Day 7 | 0.74 (0.71–0.76) | 0.0045 | Reference | 0.6918 | 0.3205 | 0.0793 |
| Day 14 | 0.74 (0.72–0.76) | 0.0031 | 0.6918 | Reference | 0.4514 | 0.1292 |
| Day 28 | 0.74 (0.72–0.77) | 0.0028 | 0.3205 | 0.4514 | Reference | 0.3790 |
| At discharge | 0.75 (0.72–0.77) | 0.0004 | 0.0793 | 0.1292 | 0.3790 | Reference |
AUROC, area under the receiver operating characteristic curve; XGB, extreme gradient boosting; PGF, postnatal growth failure.