Cheng-Wei Wang, Chao-Yang Kuo, Chi-Huang Chen, Yu-Hui Hsieh, Emily Chia-Yu Su.
Abstract
INTRODUCTION: Assisted reproductive technology has been proposed for women with infertility. Moreover, in vitro fertilization (IVF) cycles are increasing. Factors contributing to successful pregnancy have been widely explored. In this study, we used machine learning algorithms to construct prediction models for clinical pregnancies in IVF.
Year: 2022 PMID: 35675328 PMCID: PMC9176781 DOI: 10.1371/journal.pone.0267554
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Characteristics of the analytical variables.
| Variable | Definition or Range |
|---|---|
| Male age (years) | 23 to 78 |
| Female age (years) | 21 to 55 |
| Duration of infertility (years) | 1 to 14 |
| Number of IVF cycles performed | 0 to 16 |
| Number of oocytes retrieved | 0 to 52 |
| The Number of Embryos Transferred | 1 to 4 |
| The Total Number of Frozen Embryos | 0 to 36 |
| Cause of infertility | Tubal factor, ovary factor, endometriosis, uterine factor (myoma adenomyosis, uterine synechia), others (other female factors), male factor, either two factors, unexplained |
| Fertilization method | IVF/ET (in vitro fertilization/embryo transfer), ZIFT/TET (zygote intrafallopian transfer/tubal embryo transfer) |
| Micromanipulation technique | ICSI (intracytoplasmic sperm injection), assisted hatching, ICSI and assisted hatching, PGT (preimplantation genetic testing), no use, others |
| Source of sperm and oocytes | Without donation, oocyte donation, sperm donation |
| The use of fresh/freeze-thaw | Fresh, freeze-thaw |
| Ovarian hyperstimulation syndrome | None, mild, moderate, severe |
| Ovarian stimulation protocol | Natural cycle (including frozen-embryo transfer cycle), oral stimulation drug, short protocol, long protocol, ultra-long protocol, antagonist protocol, others |
Fig 1. Experiment flowchart for predicting clinical pregnancy outcomes.
Descriptive statistics of categorical variables for clinical pregnancy and non-clinical pregnancy groups.
| Variable | Pregnancy (n) | Pregnancy (percentage) | No pregnancy (n) | No pregnancy (percentage) | |
|---|---|---|---|---|---|
| Female age group | |||||
| <30 years | 294 | 4.49 | 267 | 2.49 | |
| 30~34 years | 1903 | 29.06 | 2091 | 19.47 | |
| 35~39 years | 3124 | 47.71 | 4425 | 41.20 | |
| ≥40 years | 1227 | 18.74 | 3957 | 36.84 | |
| Cause of infertility | |||||
| Tubal factor | 902 | 13.78 | 1335 | 12.43 | |
| Ovary factor | 607 | 9.27 | 826 | 7.69 | |
| Endometriosis | 691 | 10.55 | 1037 | 9.66 | |
| Uterine factor (myoma adenomyosis, uterine synechia) | 306 | 4.67 | 454 | 4.23 | |
| Others (other female factors) | 2078 | 31.73 | 3962 | 36.89 | |
| Male factor | 985 | 15.04 | 1311 | 12.21 | |
| Either two factors | 940 | 14.36 | 1779 | 16.56 | |
| Unexplained | 39 | 0.60 | 36 | 0.34 | |
| Oocyte retrieval | |||||
| Yes | 4416 | 67.44 | 7412 | 69.01 | |
| No | 2132 | 32.56 | 3328 | 30.99 | |
| Artificially assisted reproductive method | |||||
| IVF/ET | 6544 | 99.94 | 10,739 | 99.99 | |
| ZIFT/TET | 4 | 0.06 | 1 | 0.01 | |
| Fertilization method | |||||
| ICSI | 4375 | 66.81 | 7352 | 68.45 | |
| Assisted hatching | 223 | 3.41 | 226 | 2.10 | |
| ICSI and assisted hatching | 13 | 0.20 | 27 | 0.25 | |
| PGS | 121 | 1.85 | 117 | 1.09 | |
| PGD | 0 | 0.00 | 0 | 0.00 | |
| No Use | 1760 | 26.88 | 2943 | 27.40 | |
| Others | 56 | 0.86 | 75 | 0.70 | |
| Source of sperm and oocytes | |||||
| Without donation | 6445 | 98.43 | 10,596 | 98.66 | |
| Oocyte donation | 75 | 1.15 | 93 | 0.87 | |
| Sperm donation | 28 | 0.43 | 51 | 0.47 | |
| The use of fresh/freeze-thaw | |||||
| Fresh | 4406 | 67.29 | 7456 | 69.42 | |
| Frozen | 2142 | 32.71 | 3284 | 30.58 | |
| Ovarian hyperstimulation syndrome | |||||
| None | 6323 | 96.56 | 10,526 | 98.01 | |
| Mild | 218 | 3.33 | 212 | 1.97 | |
| Moderate | 6 | 0.09 | 1 | 0.01 | |
| Severe | 1 | 0.02 | 1 | 0.01 | |
| Ovarian stimulation protocol | |||||
| Natural cycle (including frozen-embryo transfer cycle) | 2132 | 55.84 | 3328 | 54.51 | |
| Oral stimulation drug | 0 | 0.00 | 3 | 0.05 | |
| Short protocol | 248 | 6.50 | 442 | 7.24 | |
| Long protocol | 306 | 8.01 | 356 | 5.83 | |
| Ultra-long protocol | 124 | 3.25 | 165 | 2.70 | |
| Antagonist protocol | 1006 | 26.35 | 1803 | 29.53 | |
| Others | 2 | 0.05 | 8 | 0.13 | |
* p < 0.05,
** p < 0.01,
*** p < 0.001 by a chi-squared test for categorical variables.
IVF/ET, in vitro fertilization/embryo transfer; ZIFT/TET, zygote intrafallopian transfer/tubal embryo transfer; ICSI, intracytoplasmic sperm injection; PGS, preimplantation genetic screening; PGD, preimplantation genetic diagnosis.
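As an illustration of the chi-squared test named in the footnote, the sketch below recomputes the test statistic by hand for the female age group rows of the table above. It is a minimal sketch rather than the authors' analysis code: the function name `chi_squared` is hypothetical, and significance is judged against the standard critical value for 3 degrees of freedom instead of an exact p-value.

```python
# Chi-squared test of independence, computed by hand on the
# "Female age group" rows of the categorical table.
observed = [
    # (pregnancy, no pregnancy), youngest to oldest age group
    (294, 267),
    (1903, 2091),
    (3124, 4425),
    (1227, 3957),
]


def chi_squared(table):
    """Return (statistic, degrees of freedom) for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, obs in enumerate(row):
            # Expected count if age group and outcome were independent.
            expected = row_totals[i] * col_totals[j] / grand_total
            stat += (obs - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


statistic, dof = chi_squared(observed)

# Critical value of the chi-squared distribution for p = 0.001, 3 degrees of freedom.
CRITICAL_P001_DOF3 = 16.266

print(f"chi2 = {statistic:.1f}, dof = {dof}")
print("significant at p < 0.001:", statistic > CRITICAL_P001_DOF3)
```

The same function applies to any other block of rows in the table, provided the critical value is chosen for the matching degrees of freedom.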
Descriptive statistics of continuous variables for clinical pregnancy and non-clinical pregnancy groups.
| Variable | Pregnancy (M±SD) | No pregnancy (M±SD) | |
|---|---|---|---|
| Duration of infertility (years) | 3.73±2.75 | 4.24±3.04 | |
| Number of IVF cycles performed | 0.99±1.48 | 1.27±1.77 | |
| Number of oocytes retrieved | 6.96±7.22 | 5.17±6.02 | |
| Embryos | 5.62±4.11 | 4.23±3.38 | |
| The Number of Embryos Transferred | 2.66±0.89 | 2.58±1.01 | |
| The Total Number of Frozen Embryos | 2.79±3.76 | 1.53±3.00 | |
| Female age (years) | 36.07±3.85 | 37.94±4.34 | |
| Male age (years) | 38.29±5.02 | 39.87±5.52 | |
Abbreviations: M, mean; SD, standard deviation; IVF, in vitro fertilization
*** p < 0.001 by Student’s t-test for continuous variables.
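To show how a Student's t-test can be evaluated from summary statistics alone, the sketch below computes the pooled-variance t statistic for female age. The group sizes are an assumption: they are taken as the column sums of the categorical table (6548 and 10,740), which presumes no missing values for the continuous variables. The helper name `pooled_t` is hypothetical.

```python
import math


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Student's two-sample t statistic (equal variances assumed) from summary statistics."""
    dof = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / dof
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / standard_error, dof


# ASSUMPTION: group sizes are the column sums of the categorical table,
# not values stated alongside the continuous variables.
N_PREGNANCY, N_NO_PREGNANCY = 6548, 10740

# Female age (years): 36.07±3.85 (pregnancy) vs 37.94±4.34 (no pregnancy)
t_stat, dof = pooled_t(36.07, 3.85, N_PREGNANCY, 37.94, 4.34, N_NO_PREGNANCY)

print(f"t = {t_stat:.2f}, dof = {dof}")
# With this many degrees of freedom the t distribution is close to normal,
# so |t| > 3.29 corresponds to a two-sided p < 0.001.
print("significant at p < 0.001:", abs(t_stat) > 3.29)
```

A negative statistic here simply reflects the lower mean age in the pregnancy group.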
Predictive performance of different machine learning algorithms for the training and test datasets.
| Algorithm | Dataset | Number | Accuracy (%) | Sensitivity (%) | Specificity (%) | AUC |
|---|---|---|---|---|---|---|
| Logistic regression | Training | 8644 | 62.20 | 62.05 | 62.36 | 0.6783 |
| Logistic regression | Test | 8644 | 62.78 | 61.81 | 63.12 | 0.6766 |
| Random forest | Training | 8644 | 83.39 | 83.02 | 83.76 | 0.9197 |
| Random forest | Test | 8644 | 64.78 | 66.58 | 64.16 | 0.7208 |
AUC, area under the receiver operating characteristics curve.
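For readers unfamiliar with the metrics in the table above, the following sketch shows how accuracy, sensitivity, specificity, and AUC are defined. The labels and scores are invented toy values, not study data, and the 0.5 decision threshold is an assumption. AUC is computed with the rank-based definition: the probability that a randomly chosen positive case scores higher than a randomly chosen negative one.

```python
def classification_metrics(y_true, y_pred):
    """Accuracy, sensitivity, and specificity from binary labels and predictions."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
    }


def auc(y_true, scores):
    """Rank-based AUC: fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    positives = [s for t, s in zip(y_true, scores) if t == 1]
    negatives = [s for t, s in zip(y_true, scores) if t == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Toy example (hypothetical values): 1 = clinical pregnancy, 0 = no pregnancy.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1, 0.6]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

metrics = classification_metrics(y_true, y_pred)
auc_value = auc(y_true, scores)
print(metrics, auc_value)
```

Unlike the other three metrics, AUC does not depend on the decision threshold, which is why it is often reported alongside them.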
Fig 2. Receiver operating characteristic (ROC) curves and the area under the ROC curve (AUC) for clinical pregnancy predictions based on the logistic regression and random forest for the test set.
Predictive performance based on random forest for the training, test, and external datasets.
| Dataset | Year | Number | Accuracy (%) | Sensitivity (%) | Specificity (%) | AUC |
|---|---|---|---|---|---|---|
| Training | 2007–2018 | 8644 | 83.39 | 83.02 | 83.76 | 0.9197 |
| Test | 2007–2018 | 8644 | 64.78 | 66.58 | 64.16 | 0.7208 |
| External | 2019–2020 | 3352 | 62.98 | 68.55 | 60.97 | 0.7123 |
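Accuracy is a weighted average of sensitivity and specificity, with the weight set by the fraction of positive cases: accuracy = sensitivity × π + specificity × (1 − π). The sketch below solves this identity for π using the rows of the table above. It is only an arithmetic consistency check under that identity. The reported values are rounded, so the result is approximate, and it is not a statement about how the datasets were sampled.

```python
def implied_positive_fraction(accuracy, sensitivity, specificity):
    """Solve accuracy = sens * pi + spec * (1 - pi) for pi, the fraction of positives."""
    return (accuracy - specificity) / (sensitivity - specificity)


# (accuracy %, sensitivity %, specificity %) as reported for the random forest.
reported = {
    "Training": (83.39, 83.02, 83.76),
    "Test": (64.78, 66.58, 64.16),
    "External": (62.98, 68.55, 60.97),
}

fractions = {
    name: implied_positive_fraction(acc, sens, spec)
    for name, (acc, sens, spec) in reported.items()
}

for name, pi in fractions.items():
    print(f"{name}: implied fraction of clinical pregnancies ~ {pi:.2f}")
```

The estimate becomes unstable when sensitivity and specificity are nearly equal, because the denominator approaches zero and rounding error dominates.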
Fig 3. Rank of importance of variables based on random forest clinical pregnancy predictions.
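The importance measure behind this ranking is not stated here. As one common way to rank variables for any fitted classifier, the sketch below implements permutation importance: shuffle one feature at a time and record the drop in accuracy. This is a swapped-in illustration, not necessarily the measure used in the study, and the model, data, and function names are all hypothetical.

```python
import random


def accuracy(model, rows, labels):
    """Fraction of rows for which the model's prediction matches the label."""
    return sum(model(r) == y for r, y in zip(rows, labels)) / len(rows)


def permutation_importance(model, rows, labels, n_features, repeats=5, seed=0):
    """Mean drop in accuracy when each feature column is shuffled in turn."""
    rng = random.Random(seed)
    baseline = accuracy(model, rows, labels)
    importances = []
    for j in range(n_features):
        drops = []
        for _ in range(repeats):
            column = [r[j] for r in rows]
            rng.shuffle(column)
            # Rebuild the rows with feature j replaced by its shuffled values.
            permuted = [r[:j] + (v,) + r[j + 1:] for r, v in zip(rows, column)]
            drops.append(baseline - accuracy(model, permuted, labels))
        importances.append(sum(drops) / repeats)
    return importances


# Toy data (hypothetical): feature 0 determines the label, feature 1 is noise.
rows = [(float(i), float((i * 7) % 20)) for i in range(20)]
labels = [1 if r[0] < 10 else 0 for r in rows]


def toy_model(row):
    """Stand-in for a fitted classifier; it only looks at feature 0."""
    return 1 if row[0] < 10 else 0


importances = permutation_importance(toy_model, rows, labels, n_features=2)
print(importances)
```

A feature the model ignores gets an importance of zero, since shuffling it cannot change any prediction.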
Fig 4. Partial dependence plots of the most influential continuous and categorical variables for clinical pregnancy outcomes analyzed for (a) the total number of frozen embryos, (b) female age, (c) the number of embryos transferred, (d) duration of infertility, (e) cause of infertility, (f) the use of fresh/freeze-thaw, and (g) ovarian stimulation protocol.
Continuous variables are presented as line plots, and categorical variables are presented as bar plots.
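A partial dependence curve of the kind shown in Fig 4 can be computed for any model by forcing one variable to each value on a grid and averaging the predictions over the dataset. The sketch below shows that procedure. The prediction function and its coefficients are invented for illustration and carry no clinical meaning, and the three data rows are toy values.

```python
def partial_dependence(predict, rows, feature_index, grid):
    """Average prediction over all rows with one feature fixed to each grid value."""
    curve = []
    for value in grid:
        total = 0.0
        for row in rows:
            modified = list(row)
            modified[feature_index] = value  # override the feature of interest
            total += predict(modified)
        curve.append(total / len(rows))
    return curve


def toy_predict(row):
    """HYPOTHETICAL model: the coefficients are made up, not estimated from the study."""
    female_age, frozen_embryos = row
    return 0.5 - 0.01 * (female_age - 35) + 0.02 * frozen_embryos


# Toy rows: (female age in years, total number of frozen embryos).
rows = [(30, 0), (35, 2), (40, 4)]
age_grid = [30, 35, 40]

curve = partial_dependence(toy_predict, rows, feature_index=0, grid=age_grid)
for age, value in zip(age_grid, curve):
    print(f"female age {age}: mean predicted probability {value:.2f}")
```

For a categorical variable the grid is the list of categories, and the resulting averages are drawn as bars instead of a line.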