| Literature DB >> 34857568 |
Clifford Silver Tarimo1,2, Soumitra S Bhuyan3, Quanman Li1, Michael Johnson J Mahande4, Jian Wu1, Xiaoli Fu5.
Abstract
OBJECTIVES: We aimed at identifying the important variables for labour induction intervention and assessing the predictive performance of machine learning algorithms.Entities:
Keywords: epidemiology; gynaecology; health informatics; obstetrics; paediatrics; public health
Mesh:
Year: 2021 PMID: 34857568 PMCID: PMC8647548 DOI: 10.1136/bmjopen-2021-051925
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 3.006
Figure 1CONSORT diagram for sample size determination. CONSORT, Consolidated Standards of Reporting Trials; IOL, induction of labour.
Sociodemographic characteristics of study participant (N=21 578)
| Characteristics | Induced delivery | Spontaneous delivery | χ2 p value |
| n (%) | n (%) | ||
| Maternal age | |||
| <25 | 3728 (42.3) | 4304 (33.72) | |
| 25–35 | 4162 (47.22) | 6122 (47.96) | |
| >35 | 924 (10.48) | 2338 (18.32) | <0.001 |
| Maternal religion | |||
| Muslim | 3464 (39.3) | 4830 (37.84) | |
| Christian | 5350 (60.7) | 7934 (62.16) | 0.03 |
| Gestational age | |||
| Preterm | 1018 (11.55) | 1802 (14.12) | |
| Term | 7029 (79.75) | 9906 (77.61) | |
| Post-term | 767 (8.70) | 1056 (8.27) | <0.001 |
| Maternal residence | |||
| Rural | 2904 (32.95) | 4795 (37.57) | |
| Urban | 5910 (67.05) | 7969 (62.43) | <0.001 |
| Maternal BMI | |||
| Underweight | 57 (0.65) | 72 (0.56) | |
| Normal weight | 2807 (31.85) | 3813 (29.87) | |
| Overweight | 3667 (41.60) | 5022 (39.35) | |
| Obese | 2283 (25.90) | 3857 (30.22) | <0.001 |
| Birth weight | |||
| Low | 975 (11.06) | 1481 (11.60) | |
| Normal | 6245 (70.85) | 8786 (68.83) | |
| High | 1594 (18.08) | 2497 (19.56) | 0.006 |
| Parity status | |||
| Nulliparous | 6036 (68.48) | 3964 (31.06) | |
| Multiparous | 2778 (31.52) | 8800 (68.94) | <0.001 |
| Multiple gestation | |||
| No | 8466 (96.05) | 12 171 (95.35) | |
| Yes | 348 (3.95) | 593 (4.65) | 0.014 |
| PROM | |||
| No | 8623 (97.83) | 12 567 (98.46) | |
| Yes | 191 (2.17) | 197 (1.54) | 0.001 |
| Child sex | |||
| Male | 4275 (48.5) | 6243 (48.91) | |
| Female | 4539 (51.50) | 6521 (51.09) | 0.555 |
| Circumcision | |||
| No | 7810 (88.61) | 10 785 (84.50) | |
| Yes | 1004 (11.39) | 1979 (15.50) | <0.001 |
| Referred for delivery | |||
| No | 7418 (84.16) | 10 517 (82.4) | |
| Yes | 1396 (15.84) | 2247 (17.60) | 0.001 |
| Alcohol use during pregnancy | |||
| No | 6999 (79.41) | 9578 (75.04) | |
| Yes | 1815 (20.59) | 3186 (24.96) | <0.001 |
| Maternal occupation | |||
| Employed | 6570 (55.64) | 4210 (43.09) | |
| Unemployed | 5237 (44.36) | 5561 (56.91) | <0.001 |
| Maternal education | |||
| None | 1378 (18.97) | 3547 (24.78) | |
| Primary | 3231 (44.48) | 5879 (41.07) | |
| Secondary | 1756 (24.17) | 2877 (20.10) | |
| Higher | 899 (12.38) | 2011 (14.05) | <0.001 |
| No of ANC visits | |||
| ≥4 | 4874 (62.72) | 9765 (70.72) | |
| <4 | 2897 (37.28) | 4042 (29.28) | <0.001 |
| Marital status | |||
| Married | 7479 (84.85) | 11 459 (89.78) | |
| Not married | 1335 (15.15) | 1305 (10.22) | <0.001 |
ANC, Antenatal care; BMI, body mass index; PROM, prelabour rapture of membrane.
Figure 2Variable importance measures for prediction of labour induction intervention.
Figure 3ROC curve for comparing the performance of ML algorithms. ML, machine learning; ROC, receiver operating characteristic.
Overall prediction performance of the machine learning models
| Model | Logistic regression | Artificial neural network | Random forest | Naïve Bayes | Bagging | Boosting |
| ACC | 0.69 (0.68–0.70) | 0.74 (0.73–0.75) | 0.73 (0.72–0.74) | 0.72 (0.71–0.73) | 0.71 (0.70–0.72) | 0.74 (0.73–0.75) |
| AUROC | 0.71 (0.70–0.73) | 0.73 (0.72–0.75) | 0.74 (0.72–0.75) | 0.73 (0.72–0.75) | 0.72 (0.71–0.73) | 0.75 (0.73–0.76) |
| P value* | Reference | <0.001 | <0.001 | <0.001 | 0.3326 | <0.001 |
| Sensitivity | 0.70 (0.68–0.71) | 0.86 (0.85–0.87) | 0.84 (0.83–0.85) | 0.82 (0.80–0.83) | 0.80 (0.79–0.82) | 0.85 (0.83–0.86) |
| Specificity | 0.68 (0.66–0.70) | 0.58 (0.56–0.60) | 0.57 (0.56–0.59) | 0.57 (0.56–0.59) | 0.57 (0.55–0.59) | 0.59 (0.57–0.61) |
| PPV | 0.76 (0.74–0.77) | 0.75 (0.73–0.76) | 0.74 (0.73–0.75) | 0.74 (0.72–0.75) | 0.73 (0.72–0.74) | 0.75 (0.74–0.76) |
| NPV | 0.61 (0.59–0.62) | 0.74 (0.72–0.75) | 0.71 (0.69–0.73) | 0.68 (0.66–0.70) | 0.67 (0.65–0.69) | 0.73 (0.71–0.75) |
N=21 578.
ACC, Accuracy; AUROC, area under the receiver operating characteristic curve; NPV, negative predictive value; PPV, positive predictive value.
Figure 4The decision curve analysis showing the net-benefit of machine learning models for predicting likelihood of labour induction over the range of threshold probabilities. ANN, artificial neural networks; Lreg, logistic regression; NB, naïve Bayes; RF, random forest.
Prediction performance of the ML algorithm by maternal age
| Model | Logistic regression | Artificial neural network | Random forest | Naïve Bayes | Bagging | Boosting | |
| Maternal age <25 (n=8032) | ACC | 0.74 (0.72–0.75) | 0.74 (0.72–0.76) | 0.74 (0.72–0.76) | 0.74 (0.72–0.76) | 0.71 (0.69–0.73) | 0.76 (0.74–0.77) |
| AUROC | 0.75 (0.73–0.77) | 0.74 (0.71–0.75) | 0.76 (0.72–0.76) | 0.76 (0.74–0.78) | 0.74 (0.72–0.76) | 0.77 (0.75–0.78) | |
| Sensitivity | 0.78 (0.76–0.80) | 0.85 (0.83–0.87) | 0.82 (0.80–0.84) | 0.79 (0.77–0.81) | 0.76 (0.74–0.78) | 0.85 (0.83–0.87) | |
| Specificity | 0.68 (0.66–0.71) | 0.62 (0.59–0.64) | 0.65 (0.63–0.68) | 0.68 (0.65–0.70) | 0.65 (0.63–0.68) | 0.64 (0.62–0.67) | |
| PPV | 0.74 (0.72–0.76) | 0.72 (0.69–0.74) | 0.73 (0.71–0.75) | 0.74 (0.72–0.76) | 0.72 (0.69–0.74) | 0.73 (0.71–0.76) | |
| NPV | 0.73 (0.70–0.76) | 0.78 (0.75–0.80) | 0.76 (0.73–0.78) | 0.74 (0.71–0.77) | 0.70 (0.67–0.73) | 0.79 (0.76–0.82) | |
| Maternal age 25–35 (n=10 284) | ACC | 0.74 (0.73–0.76) | 0.74 (0.73–0.76) | 0.73 (0.73–0.76) | 0.75 (0.73–0.76) | 0.71 (0.70–0.73) | 0.75 (0.73–0.76) |
| AUROC | 0.74 (0.72–0.76) | 0.74 (0.72–0.76) | 0.74 (0.72–0.76) | 0.74 (0.72–0.75) | 0.73 (0.71–0.74) | 0.73 (0.73–0.76) | |
| Sensitivity | 0.83 (0.82–0.85) | 0.84 (0.82–0.85) | 0.84 (0.82–0.86) | 0.84 (0.82–0.85) | 0.81 (0.79–0.83) | 0.84 (0.82–0.85) | |
| Specificity | 0.62 (0.59–0.64) | 0.61 (0.58–0.64) | 0.57 (0.54–0.60) | 0.61 (0.58–0.64) | 0.57 (0.54–0.60) | 0.61 (0.58–0.64) | |
| PPV | 0.76 (0.74–0.78) | 0.76 (0.74–0.78) | 0.74 (0.72–0.76) | 0.76 (0.74–0.78) | 0.73 (0.72–0.75) | 0.76 (0.74–0.78) | |
| NPV | 0.72 (0.69–0.74) | 0.72 (0.69–0.74) | 0.71 (0.68–0.74) | 0.72 (0.69–0.74) | 0.67 (0.64–0.70) | 0.72 (0.69–0.74) | |
| Maternal age >35 (n=3262) | ACC | 0.73 (0.71–0.74) | 0.75 (0.74–0.76) | 0.74 (0.73–0.76) | 0.74 (0.72–0.75) | 0.72 (0.71–0.74) | 0.75 (0.74–0.77) |
| AUROC | 0.75 (0.74–0.78) | 0.76 (0.75–0.78) | 0.77 (0.75–0.79) | 0.77 (0.76–0.79) | 0.76 (0.74–0.77) | 0.77 (0.76–0.79) | |
| Sensitivity | 0.69 (0.67–0.71) | 0.80 (0.78–0.82) | 0.77 (0.75–0.79) | 0.72 (0.70–0.74) | 0.74 (0.72–0.76) | 0.78 (0.76–0.80) | |
| Specificity | 0.76 (0.74–0.78) | 0.70 (0.68–0.72) | 0.72 (0.69–0.74) | 0.76 (0.74–0.78) | 0.70 (0.68–0.72) | 0.73 (0.71–0.75) | |
| PPV | 0.75 (0.73–0.77) | 0.73 (0.71–0.75) | 0.73 (0.71–0.75) | 0.75 (0.73–0.77) | 0.71 (0.69–0.73) | 0.74 (0.72–0.76) | |
| NPV | 0.71 (0.69–0.73) | 0.78 (0.76–0.80) | 0.75 (0.73–0.77) | 0.73 (0.71–0.75) | 0.73 (0.71–0.75) | 0.77 (0.75–0.79) |
ACC, Accuracy; AUROC, area under the receiver operating characteristic curve; NPV, negative predictive value; PPV, positive predictive value.
Prediction performance of the ML algorithm by parity status
| Model | Logistic regression | Artificial neural network | Random forest | Naïve Bayes | Bagging | Boosting | |
| Nulliparous women (n=10 000) | ACC | 0.66 (0.64–0.69) | 0.65 (0.64–0.67) | 0.73 (0.72–0.75) | 0.72 (0.70–0.73) | 0.71 (0.70–0.73) | 0.74 (0.72–0.75) |
| AUROC | 0.64 (0.61–0.66) | 0.67 (0.65–0.69) | 0.74 (0.72–0.76) | 0.72 (0.70–0.74) | 0.73 (0.71–0.75) | 0.75 (0.73–0.77) | |
| Sensitivity | 0.31 (0.28–0.33) | 0.44 (0.42–0.47) | 0.55 (0.52–0.58) | 0.53 (0.50–0.56) | 0.56 (0.53–0.59) | 0.54 (0.51–0.57) | |
| Specificity | 0.89 (0.88–0.91) | 0.80 (0.78–0.81) | 0.85 (0.83–0.87) | 0.84 (0.82–0.86) | 0.82 (0.80–0.83) | 0.86 (0.85–0.88) | |
| PPV | 0.66 (0.62–0.70) | 0.59 (0.56–0.62) | 0.71 (0.68–0.74) | 0.69 (0.65–0.72) | 0.67 (0.64–0.70) | 0.72 (0.69–0.75) | |
| NPV | 0.66 (0.64–0.68) | 0.69 (0.67–0.71) | 0.74 (0.72–0.76) | 0.73 (0.71–0.75) | 0.74 (0.72–0.76) | 0.74 (0.72–0.76) | |
| Multiparous women (n=11 578) | ACC | 0.84 (0.83–0.85) | 0.84 (0.82–0.85) | 0.83 (0.82–0.85) | 0.80 (0.79–0.81) | 0.82 (0.81–0.83) | 0.84 (0.83–0.85) |
| AUROC | 0.85 (0.84–0.86) | 0.84 (0.82–0.86) | 0.84 (0.83–0.86) | 0.84 (0.82–0.85) | 0.84 (0.82–0.85) | 0.85 (0.84–0.86) | |
| Sensitivity | 0.99 (0.98–1.00) | 0.98 (0.97–0.99) | 0.96 (0.95–0.97) | 0.89 (0.87–0.90) | 0.93 (0.92–0.94) | 0.99 (0.98–1.00) | |
| Specificity | 0.67 (0.64–0.69) | 0.67 (0.64–0.69) | 0.69 (0.67–0.71) | 0.71 (0.68–0.73) | 0.70 (0.67–0.72) | 0.67 (0.65–0.69) | |
| PPV | 0.76 (0.75–0.78) | 0.77 (0.75–0.78) | 0.77 (0.76–0.79) | 0.77 (0.75–0.78) | 0.77 (0.75–0.79) | 0.77 (0.75–0.78) | |
| NPV | 0.98 (0.97–1.00) | 1.00 (0.99–1.00) | 0.95 (0.93–0.96) | 0.85 (0.83–0.87) | 0.91 (0.89–0.92) | 0.99 (0.99–1.00) |
ACC, Accuracy; AUROC, area under the receiver operating characteristic curve; ML, machine learning; NPV, negative predictive value; PPV, positive predictive value.