| Literature DB >> 30213963 |
Marco Podda1, Davide Bacciu1, Alessio Micheli1, Roberto Bellù2,3, Giulia Placidi4, Luigi Gagliardi5,6.
Abstract
Estimation of mortality risk of very preterm neonates is carried out in clinical and research settings. We aimed at elaborating a prediction tool using machine learning methods. We developed models on a cohort of 23747 neonates <30 weeks gestational age, or <1501 g birth weight, enrolled in the Italian Neonatal Network in 2008-2014 (development set), using 12 easily collected perinatal variables. We used a cohort from 2015-2016 (N = 5810) as a test set. Among several machine learning methods we chose artificial Neural Networks (NN). The resulting predictor was compared with logistic regression models. In the test cohort, NN had a slightly better discrimination than logistic regression (P < 0.002). The differences were greater in subgroups of neonates (at various gestational age or birth weight intervals, singletons). Using a cutoff of death probability of 0.5, logistic regression misclassified 67/5810 neonates (1.2 percent) more than NN. In conclusion our study - the largest published so far - shows that even in this very simplified scenario, using only limited information available up to 5 minutes after birth, a NN approach had a small but significant advantage over current approaches. The software implementing the predictor is made freely available to the community.Entities:
Mesh:
Year: 2018 PMID: 30213963 PMCID: PMC6137213 DOI: 10.1038/s41598-018-31920-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Descriptive statistics for the development (N = 23747) and test (N = 5810) datasets.
| Characteristics | Levels | DEVELOPMENT | TEST | ||
|---|---|---|---|---|---|
|
| % |
| % | ||
| Birth weight (grams) | <1000 | 8801 | 37.06 | 2142 | 36.87 |
| 1000–1500 | 14650 | 61.69 | 3616 | 62.24 | |
| >1500 | 296 | 1.25 | 52 | 0.89 | |
| GA (weeks) | <26 | 3423 | 14.41 | 867 | 14.92 |
| 26–30 | 12934 | 54.47 | 3204 | 55.15 | |
| 31–35 | 7218 | 30.40 | 1693 | 29.14 | |
| >35 | 172 | 0.72 | 46 | 0.79 | |
| Apgar score (1 min.) | 0–3 | 4206 | 17.71 | 950 | 16.35 |
| 4–6 | 8355 | 35.18 | 2209 | 38.02 | |
| 7–10 | 11186 | 47.11 | 2651 | 45.63 | |
| Apgar score (5 min.) | 0–3 | 760 | 3.20 | 149 | 2.56 |
| 4–6 | 3096 | 13.04 | 754 | 12.98 | |
| 7–10 | 19891 | 83.76 | 4907 | 84.46 | |
| Sex | male | 12006 | 50.56 | 2962 | 50.98 |
| female | 11741 | 49.44 | 2848 | 49.02 | |
| Mode of Delivery | cesarean | 19316 | 81.34 | 4689 | 80.71 |
| vaginal | 4431 | 18.66 | 1121 | 19.29 | |
| Maternal race | black | 1139 | 4.80 | 314 | 5.40 |
| hispanic | 1561 | 6.57 | 374 | 6.44 | |
| white | 19705 | 82.98 | 4809 | 82.77 | |
| asian | 882 | 3.71 | 252 | 4.34 | |
| other | 460 | 1.94 | 61 | 1.05 | |
| Chorioamnionitis | no | 20707 | 87.20 | 4951 | 85.22 |
| yes | 3040 | 12.80 | 859 | 14.78 | |
| Prenatal care | no | 1924 | 8.10 | 320 | 5.51 |
| yes | 21823 | 91.90 | 5490 | 94.49 | |
| Antenatal steroids | no | 4434 | 18.67 | 753 | 12.96 |
| yes | 19313 | 81.33 | 5057 | 87.04 | |
| Maternal hypertension | no | 17635 | 74.26 | 4365 | 75.13 |
| yes | 6112 | 25.74 | 1445 | 24.87 | |
| Multiple birth | no | 15780 | 66.45 | 3801 | 65.42 |
| yes | 7967 | 33.55 | 2009 | 34.58 | |
| Died before discharge | no | 20840 | 87.76 | 5147 | 88.59 |
| yes | 2907 | 12.24 | 663 | 11.41 | |
The development set refers to data from 2008 to 2014, the test set refers to data from 2015 to 2016.
Results of the model selection procedure on the INN development set [N = 23758].
| MODELS | TRAINING | CROSS-VALIDATION |
|---|---|---|
| Logistic Regression | 0.9105 ± 0.0010 | 0.9098 ± 0.0038 |
| K-Nearest Neighbor | 0.9142 ± 0.0012 | 0.9108 ± 0.0040 |
| Random Forest | 0.9373 ± 0.0010 | 0.9138 ± 0.0053 |
| Gradient Boosting Machine | 0.9200 ± 0.0011 | 0.9147 ± 0.0048 |
| Support Vector Machine | 0.9170 ± 0.0013 | 0.9147 ± 0.0047 |
|
| 0.9171 ± 0.0010 |
For each candidate model, its training and validation AUROC (averaged over 5 CV iterations, ±standard deviation) is reported. The selected model is highlighted in bold.
AUROC values of different preterm survival predictors on the 2015–2016 test data under different segmentations of the original population.
| BW | BW + GA | Manktelow[ | Tyson[ | Logistic1 | Logistic2 | PISA | |
|---|---|---|---|---|---|---|---|
| FULL [ | 0.8733 | 0.8875 | 0.8643 | 0.8738 | 0.9023 | 0.9081 |
|
| ELBWI [ | 0.6943 | 0.7233 | 0.6901 | 0.7076 | 0.7584 | 0.7725 |
|
| VLBWI [ | 0.6761 | 0.7156 | 0.6871 | 0.6703 | 0.7878 | 0.7992 |
|
| SINGLETONS [ | 0.8319 | 0.8471 | 0.8165 | 0.8322 | 0.8733 | 0.8802 |
|
The conditions listed in the leftmost column indicate the inclusion criteria for subjects on which the AUROC is computed. N indicates the number of test subjects that meet the inclusion criteria. ELBWI: <26 weeks and 400–999 g; VLBWI: 1000–1500 g. SINGLETONS: singletons with 23 ≤ GA ≤ 32. BW: birth weight; GA: gestational age.
Figure 1Plot of the observed mortality vs. mortality predicted by the two best-scoring models (PISA and Logistic2), per gestational week, in the test dataset (2015–2016).
Brier loss of the examined predictors on the 2015–2016 test data under different segmentations of the original population.
| BW | BW + GA | Manktelow[ | Tyson[ | Logistic1 | Logistic2 | PISA | |
|---|---|---|---|---|---|---|---|
| FULL [ | 0.1001 | 0.0866 | 0.0812 | 0.0692 | 0.0640 | 0.0623 |
|
| ELBWI [ | 0.3588 | 0.2997 | 0.2722 | 0.2174 | 0.1980 | 0.1912 |
|
| VLBWI [ | 0.0316 | 0.0265 | 0.0219 | 0.0220 | 0.0209 | 0.0201 |
|
| SINGLETONS [ | 0.1145 | 0.1019 | 0.0982 | 0.0872 | 0.0797 | 0.0768 |
|
Lower values indicate better goodness of fit.