| Literature DB >> 35204580 |
Liu Liu1, Fujin Shen1, Hua Liang1, Zhe Yang2, Jing Yang2, Jiao Chen2.
Abstract
Appropriate ovarian responses to the controlled ovarian stimulation strategy is the premise for a good outcome of the in vitro fertilization cycle. With the booming of artificial intelligence, machine learning is becoming a popular and promising approach for tailoring a controlled ovarian stimulation strategy. Nowadays, most machine learning-based tailoring strategies aim to generally classify the controlled ovarian stimulation outcome, lacking the capacity to precisely predict the outcome and evaluate the impact features. Based on a clinical cohort composed of 1365 women and two machine learning methods of artificial neural network and supporting vector regression, a regression prediction model of the number of oocytes retrieved is trained, validated, and selected. Given the proposed model, an index called the normalized mean impact value is defined and calculated to reflect the importance of each impact feature. The proposed models can estimate the number of oocytes retrieved with high precision, with the regression coefficient being 0.882% and 89.84% of the instances having the prediction number ≤ 5. Among the impact features, the antral follicle count has the highest importance, followed by the E2 level on the human chorionic gonadotropin day, the age, and the Anti-Müllerian hormone, with their normalized mean impact value > 0.3. Based on the proposed model, the prognostic results for ovarian response can be predicted, which enables scientific clinical decision support for the customized controlled ovarian stimulation strategies for women, and eventually helps yield better in vitro fertilization outcomes.Entities:
Keywords: clinical decision support; controlled ovarian stimulation; dosage of Gn; machine learning; number of oocytes retrieved
Year: 2022 PMID: 35204580 PMCID: PMC8871024 DOI: 10.3390/diagnostics12020492
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1The overall framework of our study. (A) Data-processing toward the research cohort with 1365 sets of clinical data; Pearson correlation analysis is conducted to identify the impact features with significant correlation value. (B) Based on the two types of methods, ANN and supporting vector regression (SVR), two prediction models of the number of oocytes retrieved are built; the model selected is based on the training and prediction results. (C) Quantitative evaluation of the impact features, and ranking the importance of them. (D) Clinical application of the proposed model.
Demographics and clinical properties of the research cohort.
| Features | Values | ||
|---|---|---|---|
| Age | 32.44 (21–50) | <0.001 | |
| Infertility type | Primary infertility | 680 (49.82) | <0.001 |
| Secondary infertility | 685 (50.18) | ||
| Infertility duration | 3.6 (0–22) | <0.001 | |
| BMI | 22.27 (15.0–36.2) | 0.594 | |
| AFC | 19.60 (2–65) | <0.001 | |
| bFSH | 9.37 (0.97–151.65) | <0.001 | |
| E2 | 83.96 (3.92–5086.19) | 0.312 | |
| AMH | 3.15 (0.1–23) | <0.001 | |
| Infertility cause | Pelvic and fallopian tube factors | 444 (32.53) | <0.001 |
| Polycystic ovary syndrome (PCOS) ovulatory obstacle | 112 (8.21) | ||
| Decreased ovarian reserve | 173 (12.67) | ||
| Endometriosis and uterine factors | 72 (5.27) | ||
| Multiple factors | 494 (36.40) | ||
| Others | 70 (5.13) | ||
| Therapeutic regimen | Long protocol | 222 (16.26) | <0.001 |
| Super-long protocol | 385 (28.21) | ||
| Antagonist regimen | 332 (24.32) | ||
| PPOS | 309 (22.64) | ||
| Others | 117 (8.57) | ||
| Days of Gn | 10.51(0–55) | <0.001 | |
| Dosage of Gn | 2270.15 (0–6262.5) | <0.001 | |
| E2 level on the HCG day | 2284.60 (57.41–19,432.60) | <0.001 | |
| Number of oocytes retrieved | 11.18 (0–29) | / | |
Values are represented as the number of women (%) or average (range).
Figure 2The structure of the proposed ANN.
RMSE and regression coefficient of the two PMORNs.
| Model | ANN-Based PMORN | SVR-Based PMORN |
|---|---|---|
| RMSE value | 2.63 | 3.70 |
| Regression coefficient | 0.882 | 0.799 |
Figure 3Prediction performance of the proposed models. (A) Regression coefficient R for the ANN-based PMORN; (B) regression coefficient R for the SVR-based PMORN. The horizontal and vertical axes, respectively, represent the actual value and the predicted value, which are denoted by the symbols Y and T, respectively.
Figure 4Distribution of the prediction error for the proposed models. (A) prediction error for ANN-based PMORN; (B) prediction error for SVR-based PMORN. The horizontal axis and vertical axis are the prediction error and the corresponding ratio for all the instances, respectively.
NMIVs and correlation coefficients of the impact features.
| Features | Age | Infertility Type | Infertility Duration | AFC | bFSH | AMH | Infertility Cause | Therapeutic Regimen | Days of Gn | Dosage of Gn | E2 Level on the HCG Day |
|---|---|---|---|---|---|---|---|---|---|---|---|
| NIMV | −0.354 | 0.107 | −0.039 | 1.0 | −0.131 | 0.314 | 0.070 | −0.241 | 0.234 | 0.219 | 0.951 |
| r-value | −0.325 | −0.1 | −0.11 | 0.768 | −0.23 | 0.596 | −0.209 | −0.398 | 0.128 | −0.148 | 0.723 |
Figure 5ANN-based PMORN: ranking of the features according to the NMIV, and the comparison of NMIV with correlation coefficient.