| Literature DB >> 35455095 |
Jeong Ha Wie1, Se Jin Lee2, Sae Kyung Choi3, Yun Sung Jo4, Han Sung Hwang5, Mi Hye Park6, Yeon Hee Kim7, Jae Eun Shin8, Ki Cheol Kil9, Su Mi Kim10, Bong Suk Choi11, Hanul Hong11, Hyun-Joo Seol12, Hye-Sung Won13, Hyun Sun Ko14, Sunghun Na2.
Abstract
This study was a multicenter retrospective cohort study of term nulliparous women who underwent labor, and was conducted to develop an automated machine learning model for prediction of emergent cesarean section (CS) before onset of labor. Nine machine learning methods of logistic regression, random forest, Support Vector Machine (SVM), gradient boosting, extreme gradient boosting (XGBoost), light gradient boosting machine (LGBM), k-nearest neighbors (KNN), Voting, and Stacking were applied and compared for prediction of emergent CS during active labor. External validation was performed using a nationwide multicenter dataset for Korean fetal growth. A total of 6549 term nulliparous women was included in the analysis, and the emergent CS rate was 16.1%. The C-statistics values for KNN, Voting, XGBoost, Stacking, gradient boosting, random forest, LGBM, logistic regression, and SVM were 0.6, 0.69, 0.64, 0.59, 0.66, 0.68, 0.68, 0.7, and 0.69, respectively. The logistic regression model showed the best predictive performance with an accuracy of 0.78. The machine learning model identified nine significant variables of maternal age, height, weight at pre-pregnancy, pregnancy-associated hypertension, gestational age, and fetal sonographic findings. The C-statistic value for the logistic regression machine learning model in the external validation set (1391 term nulliparous women) was 0.69, with an overall accuracy of 0.68, a specificity of 0.83, and a sensitivity of 0.41. Machine learning algorithms with clinical and sonographic parameters at near term could be useful tools to predict individual risk of emergent CS during active labor in nulliparous women.Entities:
Keywords: cesarean; emergency; labor; machine learning; nulliparous; prediction
Year: 2022 PMID: 35455095 PMCID: PMC9033083 DOI: 10.3390/life12040604
Source DB: PubMed Journal: Life (Basel) ISSN: 2075-1729
Figure 1Participant selection process.
Baseline characteristics of the study population.
| Characteristics | Successful Vaginal Delivery | Emergent Cesarean Section | |
|---|---|---|---|
| Maternal age, yr | 31.29 ± 3.8 | 32.49 ± 3.95 | <0.001 |
| Gestational age, w | 39.49 ± 0.99 | 39.67 ± 0.99 | <0.001 |
| Birth weight, gm | 3.17 ± 0.36 | 3.31 ± 0.44 | <0.001 |
| Maternal height, cm | 162.24 ± 5.02 | 160.93 ± 5.14 | <0.001 |
| Maternal weight at pre-pregnancy, kg | 53.96 ± 7.93 | 56.05 ± 9.97 | <0.001 |
| Maternal weight at delivery, kg | 67.13 ± 9.11 | 69.92 ± 10.72 | <0.001 |
| Gestational age at sonogram, w | 36.9 ± 0.95 | 36.96 ± 0.95 | 0.075 |
| Sonographic parameters, cm | |||
| Biparietal diameter | 9.05 ± 0.37 | 9.09 ± 0.37 | <0.001 |
| Abdominal circumference | 31.88 ± 1.62 | 32.23 ± 1.88 | <0.001 |
| Femur length | 6.9 ± 0.31 | 6.92 ± 0.33 | 0.125 |
| Estimated fetal weight | 2831.84 ± 320.48 | 2908.89 ± 365.84 | <0.001 |
| Neonate male sex | 2779 (50.46%) | 580 (55.66%) | 0.002 |
| Pregnancy-associated hypertension | 167 (3.03%) | 61 (5.85%) | <0.001 |
| Gestational diabetes | 253 (4.59%) | 62 (5.95%) | 0.072 |
Figure 2All-against-all scatter plot analysis of variables.
Figure 3Correlations between delivery type and patient variables.
Comparison of predictive performance for emergent cesarean section.
| Algorithm | Accuracy | Precision | Sensitivity | F1_Score | Specificity |
|---|---|---|---|---|---|
| Logistic Regression | 0.78 | 0.35 | 0.43 | 0.39 | 0.85 |
| Voting | 0.83 | 0.38 | 0.17 | 0.23 | 0.95 |
| SVM | 0.77 | 0.31 | 0.37 | 0.34 | 0.85 |
| Random Forest | 0.83 | 0.42 | 0.19 | 0.26 | 0.95 |
| LGBM | 0.85 | 0.55 | 0.15 | 0.23 | 0.98 |
| Gradient Boosting | 0.83 | 0.36 | 0.12 | 0.19 | 0.96 |
| XGBoost | 0.82 | 0.34 | 0.12 | 0.17 | 0.96 |
| KNN | 0.69 | 0.24 | 0.42 | 0.3 | 0.74 |
| Stacking | 0.83 | 0.35 | 0.09 | 0.14 | 0.97 |
SVM, Support Vector Machine; LGBM, Light Gradient Boosting Machine; KNN, k-nearest neighbors.
Figure 4Receiver operating characteristic curves of emergency cesarean section prediction models.
Figure 5Odd ratios by logistic regression analysis.
Figure 6Receiver operating characteristic curve of emergency cesarean section prediction model based on logistic regression in an external validation set.
Figure 7The confusion matrix of the prediction for an external validation set.
Figure 8Model performance according to the threshold of the logistic regression model.