| Literature DB >> 35327499 |
Konstantinos Sfakianoudis1, Evangelos Maziotis2, Sokratis Grigoriadis2, Agni Pantou1, Georgia Kokkini2, Anna Trypidi2, Polina Giannelou1, Athanasios Zikopoulos3, Irene Angeli1, Terpsithea Vaxevanoglou1, Konstantinos Pantos1, Mara Simopoulou2.
Abstract
Artificial intelligence (AI) has been gaining support in the field of in vitro fertilization (IVF). Despite the promising existing data, AI cannot yet claim gold-standard status, which serves as the rationale for this study. This systematic review and data synthesis aims to evaluate and report on the predictive capabilities of AI-based prediction models regarding IVF outcome. The study has been registered in PROSPERO (CRD42021242097). Following a systematic search of the literature in Pubmed/Medline, Embase, and Cochrane Central Library, 18 studies were identified as eligible for inclusion. Regarding live-birth, the Area Under the Curve (AUC) of the Summary Receiver Operating Characteristics (SROC) was 0.905, while the partial AUC (pAUC) was 0.755. The Observed: Expected ratio was 1.12 (95%CI: 0.26-2.37; 95%PI: 0.02-6.54). Regarding clinical pregnancy with fetal heartbeat, the AUC of the SROC was 0.722, while the pAUC was 0.774. The O:E ratio was 0.77 (95%CI: 0.54-1.05; 95%PI: 0.21-1.62). According to this data synthesis, the majority of the AI-based prediction models are successful in accurately predicting the IVF outcome regarding live birth, clinical pregnancy, clinical pregnancy with fetal heartbeat, and ploidy status. This review attempted to compare between AI and human prediction capabilities, and although studies do not allow for a meta-analysis, this systematic review indicates that the AI-based prediction models perform rather similarly to the embryologists' evaluations. While AI models appear marginally more effective, they still have some way to go before they can claim to significantly surpass the clinical embryologists' predictive competence.Entities:
Keywords: IVF; artificial intelligence; data-synthesis
Year: 2022 PMID: 35327499 PMCID: PMC8945147 DOI: 10.3390/biomedicines10030697
Source DB: PubMed Journal: Biomedicines ISSN: 2227-9059
Figure 1PRISMA flowchart.
Study Characteristics.
| Study | Outcome | Type of Input (TL/Static Images) | Sample Size | Type of AI | Model Optimization |
|---|---|---|---|---|---|
| Alegre 2020 [ | Live-birth | TL | 244 | ANN | NP |
| Meseguer 2019 [ | Live-birth | TL | TL: 111 | ANN | NP |
| Miyagi 2019 [ | Live-birth | TL | 1139 | CNN | 5-fold cross validation |
| Sawada 2021 [ | Live-birth | TL | 376 | CNN with Attention Branch Network | Back-propagation for 5 epochs |
| Hardy 2020 [ | Clinical Pregnancy with FHB | TL | 113 | CNN | NP |
| VerMilyea 2020 [ | Clinical Pregnancy with FHB | STATIC IMAGES | 1667 | ResNet | Back-propagation and SGD for 5 epochs |
| Chavez-Badiola 2020 [ | Clinical Pregnancy | STATIC IMAGES | 221 | SVM | 10-fold cross validation |
| Liao 2021 [ | Clinical Pregnancy with FHB | TL | 209 | DNN | NP |
| Bori 2020 [ | Clinical Pregnancy with FHB | TL | 451 | ANN | 5-fold cross validation |
| Kan-Tor 2020 [ | Clinical Pregnancy | TL | 401 | DNN | 20–60 epochs validation |
| Bormann 2020 [ | Clinical Pregnancy with FHB | STATIC IMAGES | 102 | CNN | Genetic algorithm per 100 samples for a dataset of 3469 embryos |
| Silver 2020 [ | Clinical Pregnancy with FHB | TL | 272 | CNN | NP |
| Cao 2018 [ | Clinical Pregnancy | STATIC IMAGES | 344 | CNN | NP |
| Ueno 2021 [ | Clinical Pregnancy with FHB | TL | 3014 | DNN | Back-propagation for 20 epochs and 5-fold cross validation |
| Bori 2021 [ | Ploidy | TL | 331 | ANN | Back-propagation |
| Aparicio Ruiz 2021 [ | Ploidy | TL | 319 | ANN | NP |
| Lee 2021 [ | Ploidy | TL | 138 | CNN (3D ConvNets) | NP |
| Chavez-Badiola 2020 [ | Ploidy | STATIC IMAGES | 84 | DNN | 10-fold cross validation |
TL: time-lapse microscopy; ANN: artificial neural network; CNN: convolutional network; DNN: deep neural network; ResNet: Residual Neural Networks; SGD: stochastic gradient resent with momentum; epoch: one complete pass through the entire dataset; NP: not provided.
Assessment of Bias.
| Study | Participants | Predictors | Outcomes | Analysis | Overall |
|---|---|---|---|---|---|
| Alegre 2020 [ | - | + | + | + | - |
| Meseguer 2019 [ | - | + | + | + | - |
| Miyagi 2019 [ | + | + | + | - | - |
| Sawada 2021 [ | + | + | + | + | + |
| Hardy 2020 [ | - | + | + | + | - |
| VerMilyea 2020 [ | + | - | + | + | - |
| Chavez-Badiola 2020 [ | - | - | + | + | - |
| Liao 2021 [ | - | + | + | + | - |
| Bori 2020 [ | + | + | + | + | + |
| Kan-Tor 2020 [ | + | + | + | + | + |
| Bormann 2020 [ | - | - | + | + | - |
| Silver 2020 [ | - | + | + | + | - |
| Cao 2018 [ | + | - | + | + | - |
| Ueno 2021 [ | + | + | + | - | - |
| Bori 2021 [ | + | + | + | - | - |
| Aparicio Ruiz 2021 [ | + | + | + | + | + |
| Lee 2021 [ | - | + | + | + | - |
| Chavez-Badiola 2020 [ | - | + | + | + | - |
+: Low risk of bias; -: High risk of bias.
Figure 2Forest plots representing: (A) sensitivity; (B). specificity; (C) DOR; (D) PPV; (E) NPV of the live birth prediction outcome. Subgroup “0” represents static images as the type of input, and subgroup “1” represents time-lapse.
Figure 3SROC of the live birth outcome.
Figure 4Forest plots representing: (A) sensitivity; (B) specificity; (C) DOR; (D) PPV; (E) NPV of the clinical pregnancy prediction outcome. Subgroup “0” represents static images as type of input, and subgroup “1” represents time-lapse.
Figure 5Prediction of pregnancy outcome.
Figure 6Forest plots representing: (A) sensitivity; (B) specificity; (C) DOR; (D) PPV; (E) NPV of the clinical pregnancy with fetal heart beat prediction outcome. Subgroup “0” represents static images as type of input, and subgroup “1” represents time-lapse.
Figure 7SROC of prediction of clinical pregnancy with fetal heartbeat.
Figure 8Forest plots representing: (A) sensitivity; (B) specificity; (C) DOR; (D) PPV; (E) NPV of the ploidy prediction outcome. Subgroup “0” represents static images as type of input, and subgroup “1” represents time-lapse.
Figure 9SROC of the prediction of ploidy status outcome.
Summary of the results.
| Outcomes | Sensitivity | Specificity | PPV | NPV | DOR |
|---|---|---|---|---|---|
| Live-Birth | 70.6% (38.1–90.4%) | 90.6% (79.3–96.1%) | 74.2% (44.1–91.3%) | 88.4% (80.6–93.3%) | 19.662 (5.061–76.397) |
| Live-Birth SI | 90.7% (77.7–96.5%) | 89.7% (79.9–95.0%) | 84.8% (71.4–92.6%) | 93.8% (84.7–97.7%) | 84.964 (23.329–309.437) |
| Live-Birth TL | 62.9% (27.7–88.2%) | 91.0% (75.6–97.1%) | 71.2% (33.7–92.3%) | 86.9% (78.0–92.5%) | 13.204 (3.336–52.264) |
| Clinical Pregnancy | 71.0% (58.1–81.2%) | 62.5% (47.4–75.5%) | 66.4% (51.7–78.5%) | 67.9% (60.7–74.4%) | 3.962 (2.501–6.275) |
| Clinical Pregnancy SI | 72.7% (60.6–82.2%) | 58.6% (49.6–67.1%) | 67.6% (46.6–83.4%) | 66.0% (56.5–74.3%) | 3.861 (1.708–8.729) |
| Clinical Pregnancy TL | 70.0% (49.4–84.8%) | 64.2% (39.9–82.9%) | 65.6% (45.2–81.5%) | 69.2% (58.8–78.0%) | 4.074 (1.880–8.827) |
| Clinical Pregnancy with FHB | 75.2% (66.8–82.0%) | 55.3% (41.2–68.7%) | 62.5% (43.9–78.0%) | 69.5% (60.4–77.2%) | 3.549 (2.113–5.961) |
| Clinical Pregnancy with FHB SI | 69.3% (65.8–72.6%) | 56.7% (43.9–68.7%) | 44.0% (41.1–46.9%) | 75.1% (72.2–77.9%) | 2.415 (1.986–2.937) |
| Clinical Pregnancy with FHB TL | 78.7% (70.3–85.2%) | 53.9% (35.1–71.6%) | 66.8% (42.7–84.5%) | 68.1% (55.1–78.7%) | 4.101 (1.636–10.276) |
| Ploidy | 61.5% (44.1–76.5%) | 79.6% (70.4–86.4%) | 50.5% (34.5–68.1%) | 85.8% (77.3–91.5%) | 5.978 (4.036–8.855) |
| Ploidy TL | 55.7% (37.2–72.8%) | 82.6% (75.1–88.2%) | 49.0% (28.7–69.7%) | 85.7% (74.7–92.5%) | 5.811 (3.807–8.871) |
| Ploidy SI | 78.6% (59.0–90.0%) | 66.1% (52.8–77.2%) | 53.7% (38.5–68.1%) | 86.0% (72.2–93.6%) | 7.140 (2.477–20.583) |
FHB: Fetal Heart Beat; SI: Static Image; TL: Time-Lapse.