| Literature DB >> 36213675 |
Ploywarong Rueangket1, Kristsanamon Rittiluechai1, Akara Prayote2.
Abstract
Objective: Ectopic pregnancy (EP) is well known for its critical maternal outcome. Early detection could make the difference between life and death in pregnancy. Our aim was to make a prompt diagnosis before the rupture occur. Thus, the predictive analytical models using both conventional statistics and machine learning (ML) methods were studied. Materials and methods: A retrospective cohort study was conducted on 407 pregnancies with unknown location (PULs): 306 PULs for internal validation and 101 PULs for external validation, randomized with a nested cross-validation technique. Using a set of 22 study features based on clinical factors, serum marker and ultrasound findings from electronic medical records, analyzing with neural networks (NNs), decision tree (DT), support vector machines (SVMs), and a statistical logistic regression (LR). Diagnostic performances were compared with the area under the curve (ROC-AUC), including sensitivity and specificity for decisional use.Entities:
Keywords: decision tree and support vector machines; ectopic pregnancy; machine learning; neural networks; pregnancy of unknown location
Year: 2022 PMID: 36213675 PMCID: PMC9537586 DOI: 10.3389/fmed.2022.976829
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
FIGURE 1Study flow diagram based on the foundational method for data science (FMD), IBM (27, 28).
FIGURE 2Conceptual overview of four predictive models.
Descriptive demographics and features of the study population.
| Characteristics | Total ( | EP | Non-EP ( | |||||
|
| 29.58 ± 6.08 | 30.86 ± 6.30 | ||||||
|
| ||||||||
| <35 (%) | 304 | (74.69%) | 161 | (52.96%) | 143 | (47.04%) | 0.198 | |
| ≥35 (%) | 103 | (25.31%) | 47 | (45.63%) | 56 | (54.37%) | ||
|
| ||||||||
| <23 (kg/m2) (%) | 272 | (66.83%) | 141 | (51.84%) | 131 | (48.16%) | 0.675 | |
| ≥23 (kg/m2) (%) | 135 | (33.17%) | 67 | (49.63%) | 68 | (50.37%) | ||
|
| ||||||||
| Nulliparity (%) | 182 | (44.72%) | 85 | (46.70%) | 97 | (53.30%) | 0.110 | |
| Multiparity (%) | 225 | (55.28%) | 123 | (54.67%) | 102 | (45.33%) | ||
|
| 52.40 ± 15.78 | 51.82 ± 16.43 | 0.717 | |||||
|
| ||||||||
| No (%) | 304 | (78.15%) | 164 | (53.95%) | 140 | (46.05%) | 0.351 | |
| Yes (%) | 85 | (21.85%) | 41 | (48.24%) | 44 | (51.76%) | ||
|
| ||||||||
| No (%) | 380 | (97.94%) | 198 | (52.11%) | 182 | (47.89%) | 0.007 | |
| Yes (%) | 8 | (2.06%) | 8 | (100.00%) | 0 | (0.00%) | ||
|
| ||||||||
| No (%) | 385 | (95.77%) | 197 | (51.17%) | 188 | (48.83%) | 0.886 | |
| Yes (%) | 17 | (4.23%) | 9 | (52.94%) | 8 | (47.06%) | ||
|
| ||||||||
| No (%) | 347 | (96.12%) | 188 | (54.18%) | 159 | (45.82%) | 0.020 | |
| Yes (%) | 14 | (3.88%) | 12 | (85.71%) | 2 | (14.29%) | ||
|
| ||||||||
| No (%) | 331 | (89.46%) | 162 | (48.94%) | 169 | (51.06%) | <0.001 | |
| Yes (%) | 39 | (10.54%) | 33 | (84.62%) | 6 | (15.38%) | ||
|
| ||||||||
| No (%) | 390 | (97.26%) | 200 | (51.28%) | 190 | (48.72%) | 0.831 | |
| Yes (%) | 11 | (2.74%) | 6 | (54.55%) | 5 | (45.45%) | ||
|
| ||||||||
| No (%) | 79 | (19.46%) | 27 | (34.18%) | 52 | (65.82%) | 0.001 | |
| Yes (%) | 327 | (80.54%) | 180 | (55.05%) | 147 | (44.95%) | ||
|
| ||||||||
| No (%) | 84 | (20.64%) | 51 | (60.71%) | 33 | (39.29%) | 0.048 | |
| Yes (%) | 323 | (79.36%) | 157 | (48.61%) | 166 | (51.39%) | ||
|
| ||||||||
| No (%) | 267 | (85.85%) | 146 | (54.68%) | 121 | (45.32%) | 0.408 | |
| Yes (%) | 44 | (14.15%) | 27 | (61.36%) | 17 | (38.64%) | ||
|
| ||||||||
| No (%) | 293 | (92.43%) | 156 | (53.24%) | 137 | (46.76%) | 0.014 | |
| Yes (%) | 24 | (7.57%) | 19 | (79.17%) | 5 | (20.83%) | ||
|
|
| |||||||
| No (%) | 209 | (51.35%) | 67 | (32.06%) | 142 | (67.94%) | <0.001 | |
| Yes (%) | 198 | (48.65%) | 141 | (71.21%) | 57 | (28.79%) | ||
|
| ||||||||
| No (%) | 324 | (80.19%) | 126 | (38.89%) | 198 | (61.11%) | <0.001 | |
| Yes (%) | 80 | (19.81%) | 79 | (98.75%) | 1 | (1.25%) | ||
|
| ||||||||
| <1,000 (%) | 153 | (53.87%) | 46 | (30.07%) | 107 | (69.93%) | <0.001 | |
| ≥1,000 (%) | 131 | (46.13%) | 74 | (56.49%) | 57 | (43.51%) | ||
|
| ||||||||
| No (%) | 351 | (86.88%) | 182 | (51.85%) | 169 | (48.15%) | 0.251 | |
| Yes (%) | 53 | (13.12%) | 23 | (43.40%) | 30 | (56.60%) | ||
|
| ||||||||
| No (%) | 321 | (83.38%) | 163 | (50.78%) | 158 | (49.22%) | 0.304 | |
| Yes (%) | 64 | (16.62%) | 28 | (43.75%) | 36 | (56.25%) | ||
|
| ||||||||
| No (%) | 191 | (47.04%) | 26 | (13.61%) | 165 | (86.39%) | <0.001 | |
| Yes (%) | 215 | (52.96%) | 181 | (84.19%) | 34 | (15.81%) | ||
|
| ||||||||
| No (%) | 235 | (58.75%) | 74 | (31.49%) | 161 | (68.51%) | <0.001 | |
| Yes (%) | 165 | (41.25%) | 129 | (78.18%) | 36 | (21.82%) | ||
SD, Standard deviation; BMI, body mass index (kg/m2); PID, pelvic inflammatory disease.
aChi-square test. bt-test.
Features selected in the four models.
| Machine learning model | Feature selections (yes/no) |
| Logistic regression, support vector machine | Multipara, vaginal bleeding, cervical tenderness, serum hCG ≥ 1,000 mIU/mL, inhomogeneous adnexal mass in ultrasound |
| Neural network | Multipara, history of pelvic surgery, cervical tenderness, serum hCG ≥ 1,000 mIU/mL, inhomogeneous adnexal mass, intrauterine anechoic sac in ultrasound |
| Decision tree | History of pelvic inflammatory disease, emergency pill, nausea-vomiting, cervical tenderness, serum hCG, inhomogeneous adnexal mass, free fluid in ultrasound |
hCG, human chorionic gonadotropin.
FIGURE 3ROC-AUC (95%CI) performance comparison of the four models using cross-validation (internal validation), created by RapidMiner Studio 9.9.003.
FIGURE 4Predictive performance of the four models (external validation).
FIGURE 5Decision tree model for predictive ectopic pregnancy diagnosis. Adx mass: inhomogeneous adnexal mass, N/V: nausea-vomiting, Cx tender: cervical tenderness, PID: pelvic inflammatory disease, created by RapidMiner Studio 9.9.003.
Average ROC-AUC performance comparison of the four models applied to the internal and external validation datasets.
| Model | Internal validation | External validation | ||
| Average | S.D. | AUC | S.D. | |
| LR | 0.879 | 0.010 | 0.896 | 0.034 |
| SVMs | 0.869 | 0.016 | 0.882 | 0.029 |
| DT | 0.855 | 0.009 | 0.856 | 0.033 |
| NNs | 0.876 | 0.012 | 0.898 | 0.027 |
AUC, area under the curve; S.D., standard deviation.