| Literature DB >> 36035487 |
Mengyuan Liu1, Xiaofeng Yang1, Guolu Chen2, Yuzhen Ding1, Meiting Shi1, Lu Sun1, Zhengrui Huang1, Jia Liu1, Tong Liu2, Ruiling Yan1, Ruiman Li1.
Abstract
Objective: The aim of this study was to use machine learning methods to analyze all available clinical and laboratory data obtained during prenatal screening in early pregnancy to develop predictive models in preeclampsia (PE). Material andEntities:
Keywords: deep neural network; machine learning; prediction; preeclampsia; pregnancy
Year: 2022 PMID: 36035487 PMCID: PMC9413067 DOI: 10.3389/fphys.2022.896969
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.755
FIGURE 1Participant inclusion and exclusion criteria flow diagram.
Demographic and clinical characteristics of the study population.
| Variables | Control ( | Cases ( | p Value |
|---|---|---|---|
| Maternal age, y | 29 (27–33) | 31 (28–36) | <0.001 |
| Weight, kg | 53 (48–58) | 57 (53–65) | <0.001 |
| Height, cm | 160 (156–163) | 159 (156–161) | 0.037 |
| BMI, kg/m2 | 20.58 (19.02–22.52) | 22.88 (20.60–25.30) | <0.001 |
| Gestational age at screening, d | 87 (84–90) | 87 (84–90) | 0.525 |
| Method of conception, n (%) | 0.376 | ||
| Natural | 10,684 (97.89) | 142 (99.30) | |
| Assisted | 230 (2.11) | 1 (0.70) | |
| Smoking, n (%) | 8 (0.07) | 0 (0) | 1.0 |
| Chronic hypertension, n (%) | 15 (0.14) | 15 (10.49) | <0.001 |
| SLE/APS, n (%) | 19 (0.17) | 0 (0) | 1.0 |
| The history of GDM, n (%) | 299 (2.74) | 5 (3.50) | 0.598 |
| The history of DM, n (%) | 11 (0.10) | 2 (1.40) | 0.012 |
| The history of FGR, n (%) | 149 (1.37) | 5 (3.50) | 0.049 |
| Parity, n (%) | 0.933 | ||
| Nulliparous | 5,796 (53.11) | 75 (52.45) | |
| Parous | 5,118 (46.89) | 68 (47.55) | |
| The history of PE, n (%) | 74 (0.68) | 10 (6.99) | <0.001 |
| Mean arterial pressure, mm Hg | 82.80 (78.60–87.10) | 91.20 (86.45–99.65) | <0.001 |
| Free β-HCG, ng/ml | 62.30 (40.30–96.80) | 54.70 (38.28–84.85) | 0.034 |
| PAPP-A, IU/L | 2,890 (1820–4,510) | 2,150 (1,190–3,365) | <0.001 |
| Left uterine artery PI | 1.82 (1.46–2.24) | 2.00 (1.52–2.40) | 0.011 |
| Right uterine artery PI | 1.75 (1.42–2.16) | 1.88 (1.49–2.24) | 0.078 |
| Mean uterine artery PI | 1.82 (1.51–2.15) | 1.91 (1.56–2.25) | 0.022 |
Data are presented as media (interquartile range) unless indicated as n (%).
BMI, body mass index, SLE, systemic lupus erythematosus, APS, antiphospholipid antibody syndrome, GDM, gestational diabetes mellitus, DM, diabetes mellitus, FGR, fetal growth restriction, PE, preeclampsia. PI, pulse index.
Discrimination tests of five machine learning models for predicting preeclampsia.
| Algorithm | Discrimination tests | ||||
|---|---|---|---|---|---|
| AUROC (95% CI) | Prec. (95% CI) | Accuracy (95% CI) | Recall (95% CI) | F1-score (95% CI) | |
| DNN | 0.57 (0.46, 0.69) | 0.43 (0.40, 0.46) | 0.60 (0.57, 0.64) | 0.58 (0.54, 0.62) | 0.49 (0.46, 0.53) |
| LR | 0.69 (0.60, 0.78) | 0.52 (0.51, 0.53) | 0.64 (0.63, 0.65) | 0.60 (0.58, 0.62) | 0.56 (0.54, 0.57) |
| SVM | 0.79 (0.71, 0.86) | 0.56 (0.55, 0.57) | 0.68 (0.67, 0.69) | 0.77 (0.76, 0.79) | 0.65 (0.64, 0.66) |
| DT | 0.71 (0.63, 0.79) | 0.61 (0.60, 0.62) | 0.70 (0.70, 0.71) | 0.62 (0.60, 0.64) | 0.61 (0.60, 0.63) |
| RF | 0.86 (0.80, 0.92) | 0.82 (0.79, 0.84) | 0.74 (0.74, 0.75) | 0.42 (0.41, 0.44) | 0.56 (0.54, 0.57) |
AUROC, area under the receiver operating characteristic curve; Prec. = precision; DNN, deep neural network; LR, logistic regression; SVM, support vector machine; DT, decision tree; RF, random forest.
Calibration tests of five machine learning models for predicting preeclampsia.
| Algorithm | Calibration | ||
|---|---|---|---|
| Brier score (95% CI) | Slope (95% CI) | Intercept (95% CI) | |
| DNN | 0.25 (0.23, 0.26) | 0.17 (–0.00, 0.35) | 0.20 (0.12, 0.29) |
| LR | 0.25 (0.24, 0.26) | 0.44 (0.39, 0.50) | 0.16 (0.14, 0.18) |
| SVM | 0.24 (0.23, 0.25) | 0.48 (0.40, 0.55) | 0.16 (0.11, 0.21) |
| DT | 0.22 (0.21, 0.23) | 0.60 (0.57, 0.63) | 0.18 (0.17, 0.19) |
| RF | 0.17 (0.17, 0.17) | 0.92 (0.87, 0.96) | 0.20 (0.18, 0.21) |
DNN, deep neural network; LR, logistic regression; SVM, support vector machine; DT, decision tree; RF, random forest.
FIGURE 2Receiver operating characteristics (ROC) curves for the five machine learning models. (A) DNN; (B) DT; (C) LR; (D) RF; (E) SVM.
Predictive performances shown by DNN models in this study compared to those from previous studies.
| Source | Predictive performance | ||
|---|---|---|---|
| AUROC (95% CI) | Prec. (95% CI) | Sens. (95% CI) | |
| RF in this study | 0.86 (0.80, 0.92) | 0.82 (0.79, 0.84) | 0.42 (0.41, 0.44) |
|
| 0.96 | 0.45 | 0.79 |
|
| 0.95 (0.91, 0.99) | NA | 0.88 |
|
| 0.83 (0.79, 0.88) | NA | NA |
|
| 0.88 (0.84, 0.92) | NA | 0.90 |
|
| NA | NA | 0.91 |
AUROC, area under the receiver operating characteristic curve; Prec, precision; Sens, sensitivity; DNN, deep artificial neural network; NA, not available.