| Literature DB >> 33077570 |
Innocent B Mboya1,2, Michael J Mahande2, Mohanad Mohammed3, Joseph Obure4, Henry G Mwambi3.
Abstract
OBJECTIVE: We aimed to determine the key predictors of perinatal deaths using machine learning models compared with the logistic regression model.Entities:
Keywords: epidemiology; neonatology; perinatology; prenatal diagnosis; reproductive medicine
Mesh:
Year: 2020 PMID: 33077570 PMCID: PMC7574940 DOI: 10.1136/bmjopen-2020-040132
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Characteristics of study participants (n=42 319)
| Characteristics | Total | Perinatal death | P value* | Characteristics | Total | Perinatal death | P value* |
| Maternal | n (%) | n (%) | Obstetrics | n (%) | n (%) | ||
| <0.001 | 0.89 | ||||||
| 15–19 | 3470 (8.2) | 99 (2.9) | No | 42 288 (99.9) | 1560 (3.7) | ||
| 20–34 | 32 675 (77.2) | 1158 (3.5) | Yes | 31 (0.1) | 1 (3.2) | ||
| 35–39 | 4984 (11.8) | 235 (4.7) | 0.002 | ||||
| 40+ | 1190 (2.8) | 69 (5.8) | No | 42 240 (99.8) | 1553 (3.7) | ||
| <0.001 | Yes | 79 (0.2) | 8 (10.1) | ||||
| None | 567 (1.3) | 32 (5.6) | <0.001 | ||||
| Primary | 23 010 (54.4) | 1019 (4.4) | No | 42 241 (99.8) | 1550 (3.7) | ||
| Secondary | 5275 (12.5) | 159 (3.0) | Yes | 78 (0.2) | 11 (14.1) | ||
| Higher | 13 467 (31.8) | 351 (2.6) | <0.001 | ||||
| 0.37 | No | 41 897 (99.0) | 1528 (3.6) | ||||
| Unemployed | 9316 (22.0) | 365 (3.9) | Yes | 422 (1.0) | 33 (7.8) | ||
| Employed | 30 061 (71.0) | 1085 (3.6) | 0.004 | ||||
| Others | 2942 (7.0) | 111 (3.8) | No | 41 661 (98.4) | 1523 (3.7) | ||
| 0.89 | Yes | 658 (1.6) | 38 (5.8) | ||||
| Single | 4954 (11.7) | 186 (3.8) | 0.79 | ||||
| Married | 37 300 (88.1) | 1372 (3.7) | No | 36 746 (86.8) | 1352 (3.7) | ||
| Widowed/divorced | 65 (0.2) | 3 (4.6) | Yes | 5573 (13.2) | 209 (3.8) | ||
| <0.001 | 0.43 | ||||||
| Urban | 25 056 (59.2) | 725 (2.9) | No | 41 588 (98.3) | 1538 (3.7) | ||
| Rural | 17 263 (40.8) | 836 (4.8) | Yes | 731 (1.7) | 23 (3.1) | ||
| 0.001 | |||||||
| No | 30 759 (72.7) | 1191 (3.9) | <0.001 | ||||
| Yes | 11 560 (27.3) | 370 (3.2) | No | 40 668 (96.1) | 1355 (3.3) | ||
| 0.97 | Yes | 1651 (3.9) | 206 (12.5) | ||||
| Yes | 53 (0.1) | 2 (3.8) | <0.001 | ||||
| No | 42 266 (99.9) | 1559 (3.7) | No | 32 732 (77.3) | 1105 (3.4) | ||
| <0.001 | Yes | 9587 (22.7) | 456 (4.8) | ||||
| ≥4 | 28 742 (67.9) | 760 (2.6) | 0.006 | ||||
| <4 | 13 577 (32.1) | 801 (5.9) | No | 41 416 (97.9) | 1543 (3.7) | ||
| <0.001 | Yes | 903 (2.1) | 18 (2.0) | ||||
| No | 32 762 (77.4) | 819 (2.5) | <0.001 | ||||
| Yes | 9557 (22.6) | 742 (7.8) | No | 42 091 (99.5) | 1516 (3.6) | ||
| Yes | 228 (0.5) | 45 (19.7) | |||||
| 0.001 | 0.49 | ||||||
| <25 | 3938 (9.3) | 122 (3.1) | No | 42 305 (99.9) | 1560 (3.7) | ||
| 25–29 | 10 593 (25.0) | 346 (3.3) | Yes | 14 (0.1) | 1 (7.1) | ||
| 30–34 | 12 303 (29.1) | 457 (3.7) | <0.001 | ||||
| 35+ | 15 485 (36.6) | 636 (4.1) | No | 42 193 (99.7) | 1490 (3.5) | ||
| <0.001 | Yes | 126 (0.3) | 71 (56.3) | ||||
| None | 281 (0.7) | 27 (9.6) | 0.04 | ||||
| Primary | 18 987 (44.9) | 868 (4.6) | No | 42 245 (99.8) | 1555 (3.7) | ||
| Secondary | 4565 (10.8) | 154 (3.4) | Yes | 74 (0.2) | 6 (8.1) | ||
| Higher | 18 486 (43.7) | 512 (2.8) | <0.001 | ||||
| <0.001 | Cephalic | 41 833 (98.9) | 1459 (3.5) | ||||
| Unemployed | 5710 (13.5) | 323 (5.7) | Breach/ Transverse | 486 (1.1) | 102 (21.0) | ||
| Employed | 36 102 (85.3) | 1218 (3.4) | <0.001 | ||||
| Others | 507 (1.2) | 20 (3.9) | Term birth (≥37 weeks) | 37 764 (89.2) | 914 (2.4) | ||
| Preterm birth (<37 weeks) | 4555 (10.8) | 647 (14.2) | |||||
| <0.001 | |||||||
| Normal birth weight | 37 991 (89.8) | 801 (2.1) | |||||
| Low birth weight | 4328 (10.2) | 760 (17.6) | |||||
| 0.42 | |||||||
| Female | 20 430 (48.3) | 738 (3.6) | |||||
| 42 319 | 1561 (3.7%) | Male | 21 889 (51.7) | 823 (3.8) |
*P value based on the χ2 test.
ANC, antenatal care; PPH, postpartum haemorrhage; PROM, premature rupture of the membranes.
Figure 1Schematic diagram showing the number of singleton deliveries analysed, KCMC medical birth registry data, 2000–2015. KCMC, Kilimanjaro Christian Medical Centre.
Figure 2Trends of perinatal death, KCMC medical birth registry data, 2000–2015. KCMC, Kilimanjaro Christian Medical Centre.
Figure 3Variable importance of predictors for perinatal death in the random forest model scaled to have a maximum value of 100. ANC, antenatal care.
Figure 4Prediction ability of perinatal deaths comparing different machine learning models in the test set: (A) Receiver operating characteristics curves. The corresponding values of the area under the receiver operating characteristics curve for each model are in table 2. (B) Decision curve analysis. The net benefit of the machine learning models (except for boosting) is larger over a range of threshold probability values compared with that of the logistic regression model.
Prediction performance of the reference and machine learning models in the test set
| Model | Logistic regression | Artificial neural network | Random forests | Naïve bayes | Bagging | Boosting |
| ACC | 0.86 (0.85 to 0.86) | 0.83 (0.82 to 0.83) | 0.87 (0.86 to 0.87) | 0.84 (0.83 to 0.85) | 0.82 (0.81 to 0.83) | 0.87 (0.87 to 0.88) |
| AUC | 0.78 (0.76 to 0.81) | 0.78 (0.76 to 0.80) | 0.79 (0.76 to 0.81) | 0.79 (0.76 to 0.81) | 0.76 (0.74 to 0.79) | 0.79 (0.76 to 0.81) |
| P value* | Reference | 0.59 | 0.37 | 0.65 | 0.006 | 0.20 |
| Sensitivity | 0.56 (0.51 to 0.60) | 0.60 (0.55 to 0.64) | 0.54 (0.49 to 0.58) | 0.57 (0.52 to 0.62) | 0.55 (0.50 to 0.59) | 0.54 (0.49 to 0.58) |
| Specificity | 0.87 (0.86 to 0.88) | 0.84 (0.83 to 0.84) | 0.88 (0.88 to 0.89) | 0.85 (0.84 to 0.86) | 0.83 (0.82 to 0.84) | 0.89 (0.88 to 0.89) |
| PPV | 0.14 (0.12 to 0.16) | 0.12 (0.11 to 0.14) | 0.15 (0.13 to 0.17) | 0.13 (0.11 to 0.14) | 0.11 (0.10 to 0.12) | 0.15 (0.14 to 0.17) |
| NPV | 0.98 (0.98 to 0.98) | 0.98 (0.98 to 0.98) | 0.98 (0.98 to 0.98) | 0.98 (0.98 to 0.98) | 0.98 (0.98 to 0.98) | 0.98 (0.98 to 0.98) |
*We calculated p values to compare the AUC the receiver operating characteristics curve of logistic with each machine learning model.
ACC, accuracy; AUC, area under the curve; NPV, negative predictive value; PPV, positive predictive value.
The number of actual and predicted outcomes of prediction models in the test set
| Prediction model | Classification | Perinatal status | |
| Alive | Died | ||
| Actual number of events | 12 227 | 468 | |
| Logistic regression | Correctly predicted outcome | 10 627 | 261 |
| Incorrectly predicted outcome | 1600 | 207 | |
| Artificial neural network | Correctly predicted outcome | 10 225 | 280 |
| Incorrectly predicted outcome | 2002 | 188 | |
| Random Fforests | Correctly predicted outcome | 10 774 | 251 |
| Incorrectly predicted outcome | 1453 | 217 | |
| Naïve bayes | Correctly predicted outcome | 10 386 | 267 |
| Incorrectly predicted outcome | 1841 | 201 | |
| Bagging | Correctly predicted outcome | 10 175 | 260 |
| Incorrectly predicted outcome | 2052 | 208 | |
| Boosting | Correctly predicted outcome | 10 852 | 252 |
| Incorrectly predicted outcome | 1375 | 216 | |