| Literature DB >> 35127665 |
Ayleen Bertini1,2, Rodrigo Salas3,4,5, Steren Chabert3,4,5, Luis Sobrevia6,7,8,9,10,11, Fabián Pardo1,6,12.
Abstract
Introduction: Artificial intelligence is widely used in medical field, and machine learning has been increasingly used in health care, prediction, and diagnosis and as a method of determining priority. Machine learning methods have been features of several tools in the fields of obstetrics and childcare. This present review aims to summarize the machine learning techniques to predict perinatal complications. Objective: To identify the applicability and performance of machine learning methods used to identify pregnancy complications.Entities:
Keywords: artificial intelligence; machine learning; perinatal complications; prediction model; predictive tool; pregnancy
Year: 2022 PMID: 35127665 PMCID: PMC8807522 DOI: 10.3389/fbioe.2021.780389
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
Search expressions used in the systematic review.
| Data base | Search expression | Year of publication |
|---|---|---|
| PubMed | [“Machine learning” (Mesh)] AND “Pregnancy Complications” (Mesh) NOT (“postpartum”) | 2015–2020 |
| Web of Science | (“Machine learning” OR “Deep learning” AND (“complications in pregnancy” OR “pregnancy complications” OR “perinatal complications”) NOT (“postpartum”) | |
| Scopus |
FIGURE 1Process for selecting articles for the systematic review (PRISMA). One hundred four articles were found. Sixteen articles were excluded by title, 18 were excluded by criteria, and 19 were excluded after reading. Finally, 31 articles for the review were selected.
Main characteristics of selected articles.
| Type of study | Temporality | Geographic location of the study group | Year of publication |
|---|---|---|---|
| Cohort (87.2%) | Retrospective (96.8%) | Asia (32.3%) | 2015 (3.2%) |
| Control case (6.4%) | Prospective (3.2%) | Europe (32.3%) | 2016 (9.6%) |
| Exploratory (3.2%) | North America (22.5%) | 2017 (12.9%) | |
| Cross section (3.2%) | South America (6.5%) | 2018 (19.4%) | |
| Africa (3.2%) | 2019 (35.5%) | ||
| Oceania (3.2%) | 2020 (19.4%) |
Perinatal complications predicted through ML models using electronic medical records.
| Electronic medical records | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Ref | Time of data collection | Number of records | Outcome | Validation technique | ML methods | Performance metrics | |||
| AUC | Sen. (%) | Spec. (%) | Acc. (%) | ||||||
|
| During pregnancy with term delivery | 9,888 | TOLAC failure risk | 10-fold cross-validation and deletion of a portion of the data | Gradient increasing machines | 0.793 | — | — | — |
| -High | RF | 0.756 | — | — | — | ||||
| -Medium | RF | 0.782 | — | — | — | ||||
| -Low | AdaBoost set | 0.784 | — | — | — | ||||
|
| <22 gw | 100 | Severe neonatal mortality v/s no severe neonatal mortality | 10 replicates of 10-fold cross-validation and on the one standard error rule | Decision tree | 0.853 | 79.7 | 80.9 | 75.6 |
| SVM | 0.851 | 79.1 | 79.6 | 77.4 | |||||
| Generalized additive model | 0.850 | 80.6 | 81.8 | 75.0 | |||||
| Simple neural network | 0.848 | 78.5 | 80.7 | 73.3 | |||||
|
| <20 gw | 588,622 | High-risk GDM v/s low-risk GDM | Cross-validation on the training set, and resampling from the validation | Gradient augmentation machine built with decision tree base learners | 0.850 | — | — | — |
|
| Early second trimester to 34 gw | 1,006 | Pre-eclampsia v/s no pre-eclampsia | Training (70%) validation set (30%) | Logistic regression | — | 70.3 | — | 86.2 |
| Decision tree | — | 64.8 | — | 87.4 | |||||
| Naive Bayes | — | 50 | — | 89.9 | |||||
| SVM | — | 13.7 | — | 89.2 | |||||
| RF | — | 67.9 | — | 92.3 | |||||
| Stochastic gradient augmentation method | — | 60.3 | — | 97.3 | |||||
|
| During pregnancy (not specified) | 1,450 | Premature v/s not premature | k-fold cross-validation (with 10 folds) | Binary logistic regression model, RF classification, and generalized additive model | 0.868 | 98.9 | — | — |
| — | Gestational age prediction | k-fold cross-validation (with 10 folds) | Combined continuous model of linear regression, RF, regression, and generalized additive models | 0.878 | 90.2 | — | — | ||
|
| Pre-pregnancy at 26 gw | 30,705 | LGA v/s AGA | Test (20%) training (80%) and ten-fold cross-validation in the training data | RF | 0.728 | — | — | 79.9 |
| Decision tree | 0.718 | — | — | 79.4 | |||||
| Elastic net | 0.748 | — | — | 80.9 | |||||
| Gradient increasing machines | 0.748 | — | — | 80.5 | |||||
| Logistic regression | 0.745 | — | — | 81.3 | |||||
| Neural network | 0.746 | — | — | 81.2 | |||||
| SGA v/s AGA | Test (20%) training (80%) and ten-fold cross-validation in the training data | RF | 0.745 | — | — | 90.3 | |||
| Decision tree | 0.713 | — | — | 80.1 | |||||
| Elastic net | 0.771 | — | — | 91.2 | |||||
| Gradient increasing machines | 0.766 | — | — | 91.1 | |||||
| Logistic regression | 0.771 | — | — | 91.2 | |||||
| RF | 0.772 | — | — | 91.4 | |||||
|
| During pregnancy, before 37 gw | 1,547,677 | Non-premature delivery v/s premature | Training dataset | Set of decision trees, SVM and RF | 0.68 | — | — | 81.0 |
|
| During pregnancy (not specified) | 952,813 | Miscarriage v/s born alive | Dataset was randomly divided into 10 folds | Artificial neural networks: multilayer perceptron + radial base networks | — | 80 | 94.1 | 90.9 |
|
| During pregnancy (not specified) | 6,457 | Adverse delivery v/s non-adverse delivery | 10-fold cross-validation and repeated the cross-validation process with new folds 9 more times in the test set | Logistic regression | — | 31.9 | — | — |
| Linear discriminant analysis | — | 31.7 | — | — | |||||
| RF | — | 30.1 | — | — | |||||
| Naive Bayes | — | 29.2 | — | — | |||||
|
| During pregnancy (not specified) | 25 | Hypertensive disorder v/s without hypertensive disorder | 10-fold cross-validation for decision trees | Decision tree J48 | 0.748 | 60 | — | — |
| 5-fold cross-validation method | Naive Bayes | 0.782 | 52 | — | — | ||||
|
| During pregnancy (not specified) | 45,858 | Severe maternal morbidity v/s no serious maternal morbidity | Train dataset and 10-fold stratified cross-validation | Logistic regression | 0.937 | 76.5 | — | — |
|
| Between 22 and 32 gw | 617 | Delivery prediction within 48 h of transfer v/s Before 32 gw | Validation set | Multivariate logistic regression | 0.850 | — | — | — |
|
| Data from the first and last prenatal checkup | 15,263 | Macrosomia v/s No macrosomia | Training dataset (90%) and a validation dataset (10%) | Logistic regression | 0.880 | 88 | 55 | — |
| RF | 0.990 | 60 | 82 | — | |||||
|
| Before the first trimester | 149 | Live births v/s stillbirths | Test (70%) training (30%) | Logistic regression | 0.834 | 40.5 | 99.7 | 94.7 |
| Decision tree | 0.808 | 40.6 | 94.7 | 99.7 | |||||
| RF | 0.836 | 41.1 | 94.7 | 99.7 | |||||
| XGBoost | 0.842 | 45.3 | 94.7 | 99.7 | |||||
| Artificial neural networks multilayer perceptron | 0.840 | 43.5 | 94.7 | 99.7 | |||||
| Spontaneous preterm birth | Multivariate logistic regression | 0.670 | — | — | — | ||||
|
| Each trimester of pregnancy | 36,898 | Pregnancies without congenital abnormality v/s pregnancies with congenital abnormality | Method of data validation is not identified | RF | — | — | — | 88.9 |
Ref., references; ML, machine learning; AUC, area under curve; Sen, sensitivity; Spec, specificity; Acc, accuracy; TOLAC, trial of labor after caesarean, RF, random forest; gw, gestational weeks; SVM, support vector machine; GDM: gestational diabetes mellitus; LGA, large for gestational age; AGA, adequate for gestational age, SGA, mall for gestational age.
This study also uses biological markers.
Perinatal complications predicted through ML models using medical images.
| Medical Images | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Ref | Time of data collection | Number of records | Outcome | Validation technique | ML methods | Performance metrics | |||
| AUC | Sen. (%) | Spec. (%) | Acc. (%) | ||||||
|
| After 24 gw | 155 | Placental invasion v/s placenta previa simple | Test (83%) Training (17%) | Genetic algorithm-based machine learning algorithm implemented in TPOT | 0.980 | 100 | 88.5 | 95.2 |
|
| 150 EHG in pregnancy (not specified) and 150 EHG in labor (24 h before delivery usually) | 300 | Premature v/s born of term | Test (67%) training (33%) | Stacked sparse autocoder | 0.900 | 92 | 88 | 90 |
| Extreme learning machine | 0.840 | 80 | 88 | 83 | |||||
| SVM | 0.850 | 88 | 82 | 85 | |||||
|
| >36 gw | 552 | Vaginal delivery v/s caesarean section | Test (80%) training (30%) | SVM RF and linear discriminant analysis of features | 0.960 | 87 | 90 | |
| — | — | — | — | ||||||
|
| From 24 to 28 gw | 20 | Deliver after 7 days v/s deliver within 7 days | 10-fold cross-validation | PCA + SVM | — | — | — | 83.32 |
| RQA + SVM | — | — | — | 79.3 | |||||
|
| During pregnancy (not specified) | ni | Diagnosis of recurrent lung diseases in the newborn | Test Training | RVM | — | — | — | 100 |
| Multilevel RVM | — | — | — | 90 | |||||
|
| During pregnancy (not specified) | 108 | Delivery with placental accreta spectrum v/s delivery without placental accreta spectrum | Test (75%) training (25%) and a 10-fold cross-validation | RF | — | 93.7 | 93.7 | 95.6 |
| K-nearest neighbor | — | 97.5 | 98.7 | 98.1 | |||||
| Naive Bayes | — | 86.1 | 75 | 80.5 | |||||
| — | Multilayer perceptron | — | 92.4 | 83.8 | 88.6 | ||||
|
| Between the 27th and the 32nd gw | 30 | Premature vs. term | 100 iterations of “holdout” cross-validation for training and test sets | SVM | 0.952 | 98.4 | 93 | 95.7 |
|
| During pregnancy (not specified) | 552 | Presence of fetal hypoxia v/s absence of fetal hypoxia | Test (90%) training (10%) and 10-fold cross-validation | Least squares support vector machines | — | 63.5 | 65.9 | 65.4 |
|
| First prenatal visit | ∼2,700,000 | Born preterm v/s born of term in white women v/s color | Test set and 5-fold cross-validation | Logistic Regression | 0.625 | 56 | 62.5 | — |
Ref., references; ML, machine learning; AUC, area under curve; Sen, sensitivity; Spec, specificity; Acc, accuracy; gw, gestational weeks; TPOT, tree-based pipeline optimization tool; EHG, electrohysterograhic; SVM, support vector machine; PCA, principal components analysis; RQA, recurrence quantification analysis; RVM, relevance vector machine.
Perinatal complications predicted through ML models using biological markers.
| Biological Markers | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Ref | Time of data collection | Numbers of records | Outcome | Validation technique | ML methods | Performance metrics | |||
| AUC | Sen. (%) | Spec. (%) | Acc. (%) | ||||||
|
| For GDM <18 gw | 2,199 | GDM | Training and validation | Logistic regression | 0.732 | — | — | 72.6 |
| For PE | PE | 0.813 | — | — | 81.5 | ||||
| <20gw | MA | 0.766 | 71 | 82.3 | 80.0 | ||||
| For MA and FGR, 12–28 gw | FGR | 0.775 | — | — | 79.5 | ||||
|
| >20 gw | 77 | PE v/s control | Test and training | SVM | 0.958 | 95 | 66.7 | — |
|
| >20 gw | 38 | PE v/s control | Test (85%) training (15%) | Artificial neural networks multilayer perception | 0.908 | — | — | — |
|
| First trimester of gestation | 43 | GDM v/s without GDM | Trained and evaluated the datasets via a leave-one-out cross-validation | Logistic regression | 0.740 | 88 | 40 | 76 |
| RF | 0.810 | 94 | 40 | 81 | |||||
| AdaBoost | 0.770 | 94 | 60 | 86 | |||||
|
| Between 12 and 37 gw | 113 | Severe PE v/s without PE | Dataset trained with 10-fold stratified cross-validation | AdaBoost | 0.964 | 88 | 92 | 89 |
Ref., references; ML, machine learning; AUC, area under curve; Sen, sensitivity; Spec, specificity; Acc, accuracy; GDM, gestational diabetes mellitus; gw, gestational weeks; PE, pre-eclampsia; MA, macrosomia; FGR, fetal growth restriction; SVM, support vector machine.
This study also uses electronic medical records.
Perinatal complications predicted through ML models using sensors and fetal heart rate.
| Other features | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Ref. | Time of data collection | Numbers of records | Outcome | Validation technique | ML methods | Performance metrics | |||
| AUC | Sen. (%) | Spec. (%) | Acc. (%) | ||||||
|
| During pregnancy (not specified) | 25 | Complication in hypertensive disorder v/s without complication in hypertensive disorder | Leave-one-out method of cross-validation | Naive Bayes | 0.687 | 42.3 | 94.4 | 80 |
|
| Intrapartum | 552 | Presence v/s absence of fetal acidemia | Training set and 10-fold cross-validation | Deep convolutional neural network | 0.978 | 98.2 | 94.9 | 98.4 |
Ref., references; ML, machine learning; AUC, area under curve; Sen, sensitivity; Spec, specificity; Acc, accuracy.
Sensors.
Fetal heart rate.
FIGURE 2Number of studies according to the complication to be predicted. Sixteen complications were identified: Prematurity, pre-eclampsia, adverse delivery, size for gestational age, gestational diabetes mellitus, neonatal mortality, fetal acidemia, fetal hypoxia, placental accreta, pulmonary diseases, cesarean section, placental invasion, congenital anomaly, spontaneous abortion and trial of labor after cesarean (TOLAC) failure, and severe maternal morbidity.
Models with best performance according to AUC and accuracy.
| Prediction | Input characteristics | ML model | Performance | No of pregnant women |
|---|---|---|---|---|
| Placental invasion | Magnetic resonance | TPOT | AUC: 0.980 – Acc: 95.2% | 100–1,000 |
| Fetal academia | Maternal sociodemographic characteristics | Neural networks | AUC: 0.978 – Acc: 98.4% | 100–1,000 |
| Pre-eclampsia | Biological marker | AdaBoost | AUC: 0.964 – Acc: 89% | <100 |
| Prematurity | EHG recordings | SVM | AUC: 0.952 – Acc. 95.7% | 100–1,000 |
| Prematurity | EHG recordings | Stacked sparse autocoder | AUC 0.900 – Acc: 90% | 100–1,000 |
| Neonatal mortality | Maternal sociodemographic characteristics | XGBoost | AUC: 0.842 – Acc: 99.7% | >10,000 |
ML, machine learning; TPOT, tree-based pipeline optimization tool; AUC, area under curve; Acc, accuracy; EHG, electrohysterogram; SVM, support vector machine.
Models and precision based on deep learning.
| Prediction | Input characteristics | Deep learning model | Performance | N° of pregnant women |
|---|---|---|---|---|
| Fetal acidemia | Maternal and newborn sociodemographic characteristics | Deep convolutional network | AUC: 0.978, Acc: 98.4% | 100 - 1,000 |
| Spontaneous abortion | Maternal sociodemographic characteristics | Multilayer Perceptron and radial-based networks | Acc: 90.9% | 100 - 1,000 |
| Pre-eclampsia | Biological markers | Multilayer Perceptron | AUC: 0.908 | <100 |
| Neonatal mortality | Maternal sociodemographic characteristics | Multilayer Perceptron | AUC: 0.84 - Acc: 99.7% | >100,000 |
AUC, area under curve; Acc, accuracy.
Main predictive variables for predicting perinatal complications
| Prediction | Predictive variables | Machine learning model | Performance | |
|---|---|---|---|---|
| AUC | Acc | |||
| Premature birth | Gestational diabetes | Set of decision trees, SVM and RF | 0.680 | 81% |
| Cardiovascular disease | ||||
| Underlying diseases | ||||
| Maternal age | ||||
| Chronic arterial hypertension | ||||
| SGA | Smoking | RF | 0.728 | 79.9% |
| A particular values of gestational weight gain | DT | 0.718 | 79.4% | |
| Low–birth weight newborn | Elastic net | 0.748 | 80.9% | |
| Gradient increasing machines | 0.748 | 80.5% | ||
| Logistic regression | 0.745 | 81.3% | ||
| Neural network | 0.746 | 81.2% | ||
| LGA | Pre-pregnancy BMI | RF | 0.745 | 90.3% |
| Gestational weight gain | DT | 0.713 | 80.1% | |
| Macrosomic newborn in a previous delivery | Elastic net | 0.771 | 91.2% | |
| Gradient increasing machines | 0.766 | 91.1% | ||
| Logistic regression | 0.771 | 91.2% | ||
| Neural network | 0.772 | 91.4% | ||
| Fetal Macrosomia | Greater than 30 years-old | Logistic regression | 0.888 | ni |
| Multiparity | RF | 0.990 | ni | |
| A 12 kg total weight gain in pregnancy | ||||
| Abdominal circumference > 95 cm (at last perinatal checkup) | ||||
| Gestation age > 39 weeks | ||||
| Pre-eclampsia | At second trimester | Logistic regression | ni | 86.2% |
| Systolic blood pressure | DT | ni | 87.4% | |
| Serum levels of ureic nitrogen | Naive Bayes | ni | 89.9% | |
| Creatinine in the blood | SVM | ni | 89.2% | |
| Platelet count, serum potassium level | RF | ni | 92.3% | |
| Leukocyte count | Stochastic gradient augmentation method | ni | 97.3% | |
| Blood glucose level | ||||
| Serum calcium and urinary protein levels | ||||
| Adverse delivery (preterm, low birth weight, neonatal/infant death, stay in the neonatal intensive care unit) v/s non-adverse delivery | High pre-pregnancy BMI | Logistic regression | ni | ni |
| Linear discriminant analysis | ni | ni | ||
| Previous preterm births | Random forest | ni | ni | |
| Naive Bayes | ni | ni | ||
| TOLAC Failure Risk | Parity | Gradient increasing machines | 0.793 | ni |
| Age | RF | 0.756 | ni | |
| Vaginal birth with cesarean section in the past Gestational week | RF | 0.782 | ni | |
| Minimum gestation week in previous deliveries | AdaBoost set | 0.784 | ni | |
| The weight of the newborn from the previous delivery | ||||
| Dilation and head position | ||||
| Gestational age (if the newborn will be preterm) | Hypertension during labor | Binary logistic regression model, random forest classification, and generalized additive model | 0.868 | 98.9% |
| HIV serological status | ||||
| Delivery prediction within 48 h of transfer v/s before 32 weeks gestation | Presence of premature rupture of membranes | Multivariate logistic regression | 0.850 | ni |
| Vaginal bleeding | ||||
| Ultrasound cervical length | ||||
| Gestation week | ||||
| Fetal fibronectin and serum C-reactive protein | ||||
| Spontaneous preterm birth | Maternal age | Multivariate logistic regression | 0.670 | ni |
| Black woman | ||||
| Hispanic woman | ||||
| Asian | ||||
| Mother born in the United States | ||||
| Paid delivery by herself or physician | ||||
| Diabetes mellitus | ||||
| Chronic arterial hypertension | ||||
| Thyroid dysfunction | ||||
| Asthma | ||||
| Previous stillbirth | ||||
| Fetal weight loss | ||||
|
| ||||
| Nulliparity | ||||
| Pregnant smoker during the first trimester | ||||
| BMI | ||||
| Stillbirth | Current pregnancy complications | Logistic regression | 0.834 | 94.7% |
| Congenital anomalies | Decision tree | 0.808 | 99.7% | |
| Maternal characteristics | Random forest | 0.836 | 99.7% | |
| Medical history | XGBoost | 0.842 | 99.7% | |
| Artificial neural networks multilayer perceptron | 0.840 | 99.7% | ||
| Prediction of complications in pregnancy: pre-eclampsia, GDM, restriction of fetal growth, macrosomia | Maternal age | Logistic regression | 0.770 | 78.6% |
| BMI | ||||
| Newborn weight | ||||
| Results of adverse events in previous pregnancies | ||||
| Severe maternal morbidity | Ventilator dependence | Logistic regression | 0.937 | ni |
| Intubation | ||||
| Critical care | ||||
| Acute respiratory failure | ||||
| Ventilation | ||||
| Trauma and postoperative pulmonary failure | ||||
| Fluid and electrolyte disorder | ||||
| Systemic inflammatory response syndrome | ||||
| Acidosis and septicemia | ||||
| Fetal acidemia | Maternal age | Deep convolutional neural network | 0.978 | 98.4% |
| Gestational age pH | ||||
| Extracellular fluid deficit pC O 2 | ||||
| Base excess | ||||
| APGAR 1 min, and 5 min | ||||
| Parity | ||||
| Gestational diabetes | ||||
| Birth weight | ||||
| Child sex | ||||
| Type of delivery | ||||
AUC, area under the curve; Acc., accuracy; SVM, support vector machines; RF, random forest; SGA, small for gestational age; DT, decision tree; LGA, large for gestational age; BMI, body index mass; TOLAC, trial of labor of after cesarean; HIV, human immunodeficiency virus; GDM, gestational diabetes mellitus; ni, not informed.
This study does not specify either AUC or accuracy. The only performance metric used is sensitivity; logistic regression: 31.9%, linear discriminant analysis: 31.7%, random forest: 30.1%, naive Bayes: 29.2%.