| Literature DB >> 32348266 |
Herdiantri Sufriyana1,2, Yu-Wei Wu1,3, Emily Chia-Yu Su1,3,4.
Abstract
BACKGROUND: Preeclampsia and intrauterine growth restriction are placental dysfunction-related disorders (PDDs) that require a referral decision be made within a certain time period. An appropriate prediction model should be developed for these diseases. However, previous models did not demonstrate robust performances and/or they were developed from datasets with highly imbalanced classes.Entities:
Keywords: intrauterine growth restriction; machine learning; preeclampsia; sFlt-1/PlGF ratio; uterine artery Doppler
Year: 2020 PMID: 32348266 PMCID: PMC7265111 DOI: 10.2196/15411
Source DB: PubMed Journal: JMIR Med Inform
Descriptive and comparative analyses.
| Feature | Class | |||||||
|
| Control (n=29) | PDDsa (n=66) |
| |||||
|
|
|
|
| |||||
|
| Maternal age (years), mean (95% CI)b | 31.2 (30.9-31.5) | 32.6 (32.4-32.7) | .23c | ||||
|
|
|
|
|
| ||||
|
|
| Nulliparous | 15 (52) | 47 (71) |
| |||
|
|
| Parous | 14 (48) | 19 (29) |
| |||
|
| Maternal weight (kg), median (IQR)f | 58.0 (55.0-65.0) | 68.0 (60.0-76.0) | .001g,h | ||||
|
| Maternal height (m), mean (95% CI) | 1.66 (1.658-1.666) | 1.65 (1.651-1.655) | .51c | ||||
|
| BMI (kg/m2), median (IQR) | 21.6 (19.9-22.5) | 24.4 (23.0-28.2) | <.001g,h,i | ||||
|
|
|
|
|
| ||||
|
|
| <25 kg/m2 | 24 (83) | 36 (55) |
| |||
|
|
| ≥25 kg/m2 | 5 (17) | 30 (45) |
| |||
|
|
|
|
| |||||
|
| Right resistivity index (RI)-UtA | 0.57 (0.49-0.61) | 0.71 (0.63-0.78) | <.001g,h | ||||
|
| Left RI-UtA | 0.59 (0.53-0.64) | 0.73 (0.61-0.78) | <.001g,h | ||||
|
| Mean RI-UtA | 0.57 (0.52-0.62) | 0.71 (0.61-0.77) | <.001g,h | ||||
|
| Right pulsatility index (PI)-UtA | 0.66 (0.60-0.71) | 1.24 (0.79-1.56) | <.001g,h,i | ||||
|
| Left PI-UtA | 0.70 (0.67-0.75) | 1.33 (0.82-1.59) | <.001g,h | ||||
|
| Mean PI-UtA | 0.68 (0.63-0.71) | 1.26 (0.86-1.57) | <.001g,h,i | ||||
|
| Right peak systolic velocity (PSV)-UtA | 58.30 (55.10-62.40) | 59.25 (56.80-64.18) | .09h | ||||
|
| Left PSV-UtA | 60.20 (59.10-64.10) | 60.05 (57.10-63.80) | .99h | ||||
|
| Mean PSV-UtA | 59.55 (58.25-61.40) | 60.38 (57.54-64.06) | .31h | ||||
|
|
|
|
|
| ||||
|
|
| Nulliparous | 0 (0) | 47 (71) |
| |||
|
|
| Parous | 29 (100) | 19 (29) |
| |||
|
| Lowest PI-UtA, median (IQR) | 0.65 (0.57-0.69) | 1.16 (0.74-1.53) | <.001g,h,j | ||||
|
|
|
|
|
| ||||
|
|
| Right UtA | 23 (79) | 43 (65) |
| |||
|
|
| Left UtA | 6 (21) | 23 (35) |
| |||
|
|
|
|
| |||||
|
| sFlt-1 (µg/L) | 3014 (1852-4116) | 13,961 (8893-22,218) | <.001g,h,i | ||||
|
| PlGF (µg/L) | 626.9 (281.3-752.8) | 68.4 (42.9-150.1) | <.001g,h,i | ||||
|
| sFlt-1/PlGF ratio | 4.7 (2.6-15.1) | 230.1 (100.8-483.0) | <.001g,h,i | ||||
aPDD: placental dysfunction–related disorder.
bMean and 95% CI were calculated for numerical values with a normal distribution.
cIndependent t test.
dNumbers and column proportions (%) were calculated for categorical values.
eFisher exact test.
fMedian and IQR were calculated for numerical values without a normal distribution.
gStatistically significant (alpha=.05).
hWilcoxon rank test.
iSelected feature for the best model from automatic selection.
jUsed for manual selection only.
ksFlt-1: soluble fms-like tyrosine kinase receptor-1.
lPlGF: placental growth factor.
Features used by the models in this study compared to those from previous studies.a
| Source | Gestational age at prediction | Features, n (for maternal characteristics) or + (used by the model) or – (not used by the model) | |||||||||||||||
|
|
| Maternal characteristics | MAPb | PI-UtAc | Bilateral notch | sFlt-1d | PlGFe | PAPP-Af | |||||||||
|
|
|
|
|
|
|
|
|
| |||||||||
|
| CVRg1 (right PI-UtA) | 24-37 weeks | 2 | – | + | – | + | + | – | ||||||||
|
| CVR2 (mean PI-UtA) | 24-37 weeks | 2 | – | + | + | + | + | – | ||||||||
|
| CVR3 (lowest PI-UtA) | 24-37 weeks | 2 | – | + | – | + | + | – | ||||||||
|
| 158-tree random forest | 24-37 weeks | 1 | – | + | + | + | + | – | ||||||||
|
|
|
|
|
|
|
|
|
| |||||||||
|
| Wright A et al (2019) [ | 11-13 weeks | 10 | – | + | – | + | + | – | ||||||||
|
| Wright D et al (2019) [ | 11-13 weeks | 11 | + | + | – | + | + | – | ||||||||
|
| Tan MY et al (2018) [ | 11-13 weeks | 11 | + | + | – | – | + | – | ||||||||
|
| Sonek J et al (2018) [ | 11-13 weeks | 10 | – | + | – | – | + | + | ||||||||
|
| Perales A et al (2017) [ | 27-28 weeks | 3 | + | – | – | + | + | – | ||||||||
|
| Nuriyeva G et al (2017) [ | 11-13 weeks | N/Ah | – | + | – | – | + | + | ||||||||
|
| O'Gorman N et al (2017) [ | 11-13 weeks | 11 | + | + | – | – | + | + | ||||||||
|
| Gallo DM et al (2016) [ | 19-24 weeks | 11 | + | + | – | + | + | – | ||||||||
|
| Tsiakkas A et al (2016) [ | 30-34 weeks | 11 | – | – | – | + | + | – | ||||||||
|
| Andrietti S et al (2016) [ | 35-37 weeks | 11 | + | + | – | + | + | – | ||||||||
|
| O'Gorman N et al (2016) [ | 11-13 weeks | 10 | + | + | – | – | + | + | ||||||||
|
| Wright D et al (2015) [ | 11-13 weeks | 11 | – | – | – | – | – | – | ||||||||
aModels that showed the best sensitivity and an acceptable specificity in each study.
bMAP: mean arterial pressure.
cPI-UtA: pulsatility index of the uterine artery.
dsFlt-1: soluble fms-like tyrosine kinase receptor-1.
ePlGF: placental growth factor.
fPAPP-A: pregnancy-associated plasma protein-A.
gCVR: classification via regression.
hN/A: not applicable.
Figure 1Characteristics of the classification via regression model using the lowest pulsatility index of the uterine artery (PI-UtA). Fractions in leaf nodes consist of true predicted numbers (numerators) and all predicted ones (denominators). A ratio of true predicted numbers is shown for control (C), both intrauterine growth restriction (IUGR) and preeclampsia (IP), IUGR only (I), and preeclampsia only (P). BMI_bP: body mass index before pregnancy (kg/m2); LM: linear model; low_PIUtA: the lowest pulsatility index of the uterine artery; MW_bP: maternal weight before pregnancy (kg); PDD: placental dysfunction–related disorder; PlGF: placental growth factor; sFlt: soluble fms-like tyrosine kinase receptor.
The seven best machine learning models.
| Model | Performance metrics and rank | |||||
|
| Area under the ROCa curve | Area under the PRCb | Accuracy (%) | ∆i AICCc | Sensitivity (%) | |
| Automatic selection: random forest | 0.976 (1) | 0.958 (1) | 92.6 (1) | 0 (1) | 90.7 (1) | |
|
|
|
|
|
|
| |
|
| CVRd | 0.954 (5) | 0.922 (3) | 90.6 (4) | 15 (4) | 89.7 (2) |
|
| Naïve Bayes | 0.960 (2) | 0.928 (2) | 90.2 (5) | 25 (5) | 89.0 (3) |
|
| Simple logistic | 0.958 (3) | 0.921 (4) | 90.9 (2) | 6 (2) | 88.2 (4) |
|
| Logistic model tree | 0.957 (4) | 0.920 (5) | 90.8 (3) | 7 (3) | 88.0 (5) |
|
| Multi-class classifier | 0.932 (6) | 0.868 (6) | 89.9 (6) | 30 (6) | 86.8 (6) |
|
| Logistic regression | 0.932 (7) | 0.868 (7) | 89.9 (7) | 30 (7) | 86.8 (7) |
aROC: receiver operating characteristic.
bPRC: precision-recall curve.
cAICC: corrected Akaike’s information criterion (∆i AICC = AIC Ci – AIC C min).
dCVR: classification via regression.
Figure 2Calibration plots of classification via regression (CVR) models using the lowest, right, and mean pulsatility index of the uterine artery (PI-UtA). Each point demonstrates a validation subset taken from repeated 10-fold cross-validation. Colors denote subsets from stratified random sampling. RMSE: root mean square error.
Figure 3Receiver operating characteristic (ROC) curves of classification via regression (CVR) models using the lowest, right, and mean pulsatility index of the uterine artery (PI-UtA). Each ROC curve demonstrates a validation subset taken from repeated 10-fold cross-validation. Colors denote subsets from stratified random sampling. AUC: area under the receiver operating characteristic curve.
Predictive performances shown by models in this study compared to those from recent studies.a
| Source | Predictive performanceb | |||
|
| AUCc | Sensitivity, % | Specificity, % | |
|
|
|
|
| |
|
| CVRd1 (right PI-UtAe) | 0.906 (0.896-0.916) | 91 (85-96) | 97 (90-100) |
|
| CVR2 (mean PI-UtA) | 0.926 (0.919-0.933) | 95 (91-100) | 100 (100-100) |
|
| CVR3 (lowest PI-UtA) | 0.970 (0.966-0.974) | 95 (91-100) | 100 (100-100) |
|
| 158-tree random forest | 0.976 (0.967-0.985) | 91 (87-94) | 93 (92-95) |
|
|
|
|
| |
|
| Wright A et al (2019) [ | N/Af,g | 85 (72-94) | 90 (90-90) |
|
| Wright D et al (2019) [ | 0.970 (0.950-0.990) | 93 (76-99) | 90h |
|
| Tan MY et al (2018) [ | N/Ag | 90 (80-96) | 90h |
|
| Sonek J et al (2018) [ | N/Ag | 85i | 95i |
|
| Perales A et al (2017) [ | 0.930i | 81i | 95i |
|
| Nuriyeva G et al (2017) [ | 0.888i | 76i | 90i |
|
| O'Gorman N et al (2017) [ | 0.987i | 100 (80-100) | 90h |
|
| Gallo DM et al (2016) [ | 0.930 (0.892-0.968) | 85 (74-93) | 90h |
|
| Tsiakkas A et al (2016) [ | 0.987 (0.980-0.994) | 100 (92-100) | 90h |
|
| Andrietti S et al (2016) [ | 0.938 (0.917-0.959) | 82 (70-91) | 90h |
|
| O'Gorman N et al (2016) [ | 0.907i | 89 (79-96) | 90h |
|
| Wright D et al (2015) [ | 0.811i | 67 (59-74) | 90h |
aModels that showed the best sensitivity and an acceptable specificity in each study.
bPoint and interval estimates.
cAUC: area under the receiver operating characteristic (ROC) curve.
dCVR: classification via regression.
ePI-UtA: pulsatility index of the uterine artery.
fN/A: not applicable because it was not available.
gThis study showed an ROC curve without an AUC statement.
hFixed specificity in order to define sensitivity.
iThis study did not report an interval estimate.
Figure 4The Matthew correlation coefficient (MCC) and class balance. Control samples did not include other subtypes of either hypertension in pregnancy or placental dysfunction–related disorders (PDDs). Colors denote validation methods. Several studies did not report interval estimates and/or cross-validation (CV). To improve visualization, the scales for either case or control sample sizes were individually log-transformed. CVR: classification via regression; ePDD: early placental dysfunction–related disorder; ePE: early preeclampsia; ITS: independent test set; PE: preeclampsia; PI-UtA: pulsatility index of the uterine artery; pPE: preterm preeclampsia.