Literature DB >> 35587030

Ovarian stimulation strategies for intrauterine insemination in couples with unexplained infertility: a systematic review and individual participant data meta-analysis.

J A Wessel1, N A Danhof1, R van Eekelen1, M P Diamond2, R S Legro3, K Peeraer4, T M D'Hooghe5,6,7, M Erdem8, T Dankert9, B J Cohlen10, C Thyagaraju11, B W J Mol12,13, M Showell14, M van Wely1, M H Mochtar1, R Wang12.   

Abstract

BACKGROUND: Intrauterine insemination with ovarian stimulation (IUI-OS) is a first-line treatment for unexplained infertility. Gonadotrophins, letrozole and clomiphene citrate (CC) are commonly used agents during IUI-OS and have been compared in multiple aggregate data meta-analyses, with substantial heterogeneity and no analysis on time-to-event outcomes. Individual participant data meta-analysis (IPD-MA) is considered the gold standard for evidence synthesis as it can offset inadequate reporting of individual studies by obtaining the IPD, and allows analyses on treatment-covariate interactions to identify couples who benefit most from a particular treatment. OBJECTIVE AND RATIONALE: We performed this IPD-MA to compare the effectiveness and safety of ovarian stimulation with gonadotrophins, letrozole and CC and to explore treatment-covariate interactions for important baseline characteristics in couples undergoing IUI. SEARCH
METHODS: We searched electronic databases including MEDLINE, EMBASE, CENTRAL, CINAHL, and PsycINFO from their inception to 28 June 2021. We included randomized controlled trials (RCTs) comparing IUI-OS with gonadotrophins, letrozole and CC among couples with unexplained infertility. We contacted the authors of eligible RCTs to share the IPD and established the IUI IPD-MA Collaboration. The primary effectiveness outcome was live birth and the primary safety outcome was multiple pregnancy. Secondary outcomes were other reproductive outcomes, including time to conception leading to live birth. We performed a one-stage random effects IPD-MA. OUTCOMES: Seven of 22 (31.8%) eligible RCTs provided IPD of 2495 couples (62.4% of the 3997 couples participating in 22 RCTs), of which 2411 had unexplained infertility and were included in this IPD-MA. Six RCTs (n = 1511) compared gonadotrophins with CC, and one (n = 900) compared gonadotrophins, letrozole and CC. Moderate-certainty evidence showed that gonadotrophins increased the live birth rate compared to CC (6 RCTs, 2058 women, RR 1.30, 95% CI 1.12-1.51, I2 = 26%). Low-certainty evidence showed that gonadotrophins may also increase the multiple pregnancy rate compared to CC (6 RCTs, 2058 women, RR 2.17, 95% CI 1.33-3.54, I2 = 69%). Heterogeneity on multiple pregnancy could be explained by differences in gonadotrophin starting dose and choice of cancellation criteria. Post-hoc sensitivity analysis on RCTs with a low starting dose of gonadotrophins (≤75 IU) confirmed increased live birth rates compared to CC (5 RCTs, 1457 women, RR 1.26, 95% CI 1.05-1.51), but analysis on only RCTs with stricter cancellation criteria showed inconclusive evidence on live birth (4 RCTs, 1238 women, RR 1.15, 95% CI 0.94-1.41). For multiple pregnancy, both sensitivity analyses showed inconclusive findings between gonadotrophins and CC (RR 0.94, 95% CI 0.45-1.96; RR 0.81, 95% CI 0.32-2.03, respectively). Moderate certainty evidence showed that gonadotrophins reduced the time to conception leading to a live birth when compared to CC (6 RCTs, 2058 women, HR 1.37, 95% CI 1.15-1.63, I2 = 22%). No strong evidence on the treatment-covariate (female age, BMI or primary versus secondary infertility) interactions was found. WIDER IMPLICATIONS: In couples with unexplained infertility undergoing IUI-OS, gonadotrophins increased the chance of a live birth and reduced the time to conception compared to CC, at the cost of a higher multiple pregnancy rate, when not differentiating strategies on cancellation criteria or the starting dose. The treatment effects did not seem to differ in women of different age, BMI or primary versus secondary infertility. In a modern practice where a lower starting dose and stricter cancellation criteria are in place, effectiveness and safety of different agents seem both acceptable, and therefore intervention availability, cost and patients' preferences should factor in the clinical decision-making. As the evidence for comparisons to letrozole is based on one RCT providing IPD, further RCTs comparing letrozole and other interventions for unexplained infertility are needed.
© The Author(s) 2022. Published by Oxford University Press on behalf of European Society of Human Reproduction and Embryology.

Entities:  

Keywords:  clomiphene citrate; gonadotrophins; individual participant data; intrauterine insemination; letrozole; meta-analysis; ovarian stimulation; unexplained infertility

Mesh:

Substances:

Year:  2022        PMID: 35587030      PMCID: PMC9434229          DOI: 10.1093/humupd/dmac021

Source DB:  PubMed          Journal:  Hum Reprod Update        ISSN: 1355-4786            Impact factor:   17.179


Introduction

Intrauterine insemination with ovarian stimulation (IUI-OS) is a first-line treatment for couples with unexplained infertility (Practice Committee of the American Society for Reproductive Medicine, 2020). It aims to increase the pregnancy rates by increasing the number of dominant follicles per cycle, which is achieved by increasing the serum levels of FSH (van Rumste ). Agents which increase FSH serum levels include exogenous gonadotrophins, letrozole or clomiphene citrate (CC). Gonadotrophins have a direct effect on follicle growth as they contain FSH and may also contain recombinant LH- or HCG-driven LH activity. Letrozole is a third-generation aromatase inhibitor that interferes with the oestrogenic feedback at the pituitary by blocking oestrogen biosynthesis thus stimulating the production of serum FSH (Mitwally and Casper, 2001). CC is a selective oestrogen modulator and competes with oestrogen for binding to the hypothalamic oestrogen receptors, thus stimulating the production of serum FSH (Mitwally and Casper, 2001). While letrozole and CC are orally taken for 5 days, the gonadotrophins are injected subcutaneously. Multiple systematic reviews have compared these ovarian stimulation agents with each other in women with unexplained infertility undergoing IUI. IUI with gonadotrophins increases live birth and/or ongoing pregnancy rates but also increased multiple pregnancy rates compared to other oral agents (Danhof ; Zolton ). Nevertheless, IUI with adherence to strict cancellation criteria, i.e. withholding insemination if more than three dominant follicles develop, led to an acceptable multiple pregnancy rate without compromising the effectiveness (Danhof ). However, a substantial unexplained heterogeneity across the primary trials comparing gonadotrophins to letrozole and CC was observed, and time-to-event outcomes were not reported in these meta-analyses (Eskew ; Danhof ; Zolton ). The population of couples with unexplained infertility is heterogeneous and the prognostic variables such as female age and duration of infertility affect pregnancy chances and safety issues independent of ovarian stimulation, such that on an individual level certain treatments may be more effective and/or safe than others (Steures ). Given the heterogeneous inclusion criteria of the primary trials, it is impossible to analyse interaction variables of couples with unexplained infertility in an aggregate data meta-analysis. To evaluate whether certain groups of couples benefit more from one treatment than from another, individual participant data meta-analysis (IPD-MA) of RCTs is optimal and therefore considered the as the gold standard for evidence synthesis (Riley ). In addition, IPD-MA also allows us to study time to conception leading to live birth which was impossible in aggregate data meta-analysis. We therefore performed this IPD-MA to compare the effectiveness and safety of ovarian stimulation with gonadotrophins, letrozole and CC and to explore treatment–covariate interactions for important baseline characteristics in couples undergoing IUI-OS.

Methods

Registration and literature search

We conducted this IPD-MA according to a registered protocol (PROSPERO CRD42017053966) and reported it according to the Preferred Reporting Items for Systematic Review and Meta-Analyses of individual participant data (PRISMA-IPD) statement (Stewart ). We performed the search update on the 28th of June, 2021 based on an existing search strategy (Danhof ). In brief, we searched the following electronic databases including MEDLINE, EMBASE, Cochrane Central Register of Controlled Trials (CENTRAL), CINAHL, PsycINFO, Cochrane Gynaecology and Fertility Group trial register and Clinical Trial Registration Databases (clinicaltrial.gov and International Clinical Trials Registry Platform (ICTRP)). The detailed search strategy is presented in Supplementary Table SI.

Eligibility criteria

We included randomized controlled trials (RCTs) comparing IUI-OS with gonadotrophins, letrozole or CC among couples with unexplained infertility. We excluded dose comparing studies of the same drug. If a trial also includes other factors of infertility, for instance ovulatory disorders, the trial was included but participants with other factors of infertility were excluded. We did not apply language restrictions.

Study selection and data collection

Two authors (J.W. and M.v.W.) independently examined the studies for compliance with the inclusion criteria and selected eligible studies. Disagreements were resolved by discussion with a third author (R.W.). We contacted the corresponding authors of all eligible studies to join the IUI IPD-MA collaboration and share their IPD and established the IUI IPD-MA Collaboration. We tried to obtain study protocols where possible. When we did not receive responses, we sent at least two more reminders. All authors sharing the IPD were asked to provide clarifications when information in the publications or datasets were unclear or inconsistent. We evaluated internal data consistency by checking duplicated and missing values as well as possible data errors and contacted the trial investigators for further clarification when needed.

Outcomes

The primary effectiveness outcome was cumulative live birth per woman randomized and the primary safety outcome was multiple pregnancy. Secondary outcomes included ongoing pregnancy, clinical pregnancy, miscarriage, time to conception leading to live birth, cancellation and the total number of follicles > 14 mm at time of ovulation triggering. The unit of analysis was per couple randomized for all outcomes except for cancellation and total number of follicles > 14 mm at the time of ovulation triggering, in which the unit of analysis was per cycle. The definition of miscarriage was harmonized across different trials in this IPD-MA according to The International Glossary on Infertility and Fertility Care 2017 (Zegers-Hochschild ).

Risk of bias and overall certainty of evidence assessment

Two authors (J.W. and R.W.) independently assessed the risk of bias of the included studies using the domain-based evaluation tool described in the Cochrane Handbook for Systematic Reviews of Interventions (Higgins ). Disagreements were resolved by discussion with a third author (M.v.W.). We assessed the following domains as low, unclear or high risk of bias: random sequence generation, allocation concealment, blinding of participant and personnel, blinding of outcome assessors, incomplete outcome data, selective reporting and other bias. The overall certainty of evidence across RCTs were assessed when at least two studies were included by using the Grading of Recommendations Assessment, Development and Evaluation (GRADE) approach, including the risk of bias, consistency of effect, imprecision, indirectness and publication bias.

Statistical analysis

We performed the analysis based on an intention-to-treat principle. We conducted a one-stage IPD-MA including random effects for trial in each pairwise comparison with studies contributing to IPD. We also provided forest plots to visualize the results per trial and used the I2 statistic to quantify heterogeneity. Note that in these forest plots, the summary estimate was the one-stage estimate. For dichotomous outcomes, we estimated risk ratios (RR) using a generalized mixed model with a binomial distribution and a log link with random intercepts for study. For continuous outcomes, we estimated mean differences using a linear mixed model with random intercepts for study and cycle number. For time to conception leading to live birth, we used the number of IUI cycles as a time unit and calculated a pooled hazard ratio (HR) in Cox proportional hazards regression models for discrete time with a random effect (frailty with a normal distribution) for study (Fisher, 2015). Only conception that led to live birth were included. Next, we explored treatment–covariate interaction of the following covariates on live birth: female age, type of infertility (primary/secondary) and body mass index (BMI). These treatment–covariate interactions were conducted using a two-stage approach, and were thus based solely on within‐study information as recommended to avoid ecological bias (Fisher ; Riley ). As we limited the treatment–covariate interaction analysis to covariates that were available in at least 85% of participants, we did not perform analyses on other prespecified covariates duration of infertility, total motile sperm count, Hunault score, smoking status, ethnicity and antral follicle count due to missing data. We then conducted pre-specified sensitivity analysis on studies with overall low risk of bias and studies with low risk of bias at allocation concealment to test the robustness of the findings. In addition, we also performed two post-hoc sensitivity analyses on studies with low starting dose of gonadotrophins (≤75 IU) and on studies with stricter cancellation criteria (≤3 dominant follicles), respectively. All sensitivity analyses were limited to the primary outcomes live birth and multiple pregnancy. Finally, we intended to use funnel plots to explore the possibility of small study effects if at least 10 studies were present per comparison. To examine IPD availability bias, we also presented meta-analyses of trials without IPD. Missing outcome data were not imputed. Data on the covariates female age, primary/secondary infertility and BMI were missing for 9% of participants and imputed using single imputation. Data were prepared in Stata 16.1 and Microsoft Excel. IPD-MA was performed in R version 3.6.0 using the rms, survival, foreign, mice, lme4, meta and miceadds R packages and additional analysis was performed in Stata 16.1 using admetan package.

Results

Study selection

In total, we identified 338 studies, of which 313 studies were excluded after screening titles and abstract (Fig. 1). After screening full text, 22 studies were eligible. IPD was not sought from four studies due to insufficient contact information (n = 4, 224 couples) (Kamel, 1995; Sammour ; Fatemi ; Galal, 2015). The authors of the remaining 18 studies were contacted, among which IPD of 11 studies (1278 couples) were not available, due to either no response (n = 9, 1009 couples) (Balasch ; Nakajima ; El Helw and El Sadek, 2002; Al-Fozan ; Ozmen ; Gregoriou ; Fouda and Sayed, 2011; Ibrahim ; Goldman ) or data loss (n = 2, 269 couples) (Baysoy ; Berker ). These studies are listed in Supplementary Table SII. IPD of seven trials were provided by the trial authors (Ecochard ; Dankert ; Diamond ; Erdem ; Peeraer ; Danhof ; Naidu ).
Figure 1.

PRISMA IPD flow diagram. IPD, individual participant data.

PRISMA IPD flow diagram. IPD, individual participant data.

Study characteristics

Characteristics of trials with and without IPD are presented in Table I and Supplementary Table SIII. No major issues were identified when checking the consistency of IPD.
Table I

Outline and design per trial.

StudyYearCountryNumber of participantsIntervention*
Cancellation criteriaOutcomes
Clomiphene citrateLetrozoleGonadotrophins
Danhof et al. 2018The Netherlands738100 mg75 IUMax 3 follicles of ≥ 14 mm or max 5 follicles of ≥ 12 mmLive birth, multiple pregnancy, clinical pregnancy, ongoing pregnancy, miscarriage, time to conception, cancellation, number of follicles

Dankert et al. 2007The Netherlands138100 mg75 IUMax 3 follicles of ≥ 14 mmLive birth, multiple pregnancy, clinical pregnancy, ongoing pregnancy, miscarriage, time to conception

Diamond et al. 2015USA900100 mg5 mg150 IUMax 4 follicles (mean diameter >18 mm) or max serum E2 levels of 3000 pg/mlLive birth, multiple pregnancy, clinical pregnancy, ongoing pregnancy, miscarriage, time to conception, cancellation, number of follicles

Ecochard et al. 2000France5450 or 100 mg150 IUMax 3 follicles of ≥ 15 mm and/or max serum E2 levels of 1200 pg/mlClinical pregnancy, miscarriage

Erdem et al. 2015Turkey219100 mg75 IUMax 4 follicles of ≥ 14 mm and/or max serum E2 levels of 1500 pg/mlLive birth, multiple pregnancy, clinical pregnancy, ongoing pregnancy, miscarriage, time to conception, cancellation, number of follicles

Naidu et al. 2020India112100 mg75 IUMax 3 follicles of ≥ 18 mmLive birth, multiple pregnancy, clinical pregnancy, ongoing pregnancy, miscarriage, time to conception, cancellation, number of follicles

Peeraer et al. 2015Belgium25050 mg37.5 or 75 IUMax 3 follicles of ≥ 14 mmLive birth, multiple pregnancy, clinical pregnancy, ongoing pregnancy, miscarriage, time to conception, cancellation, number of follicles

Start dosing, during the cycles dosage can be adjusted. ‡No participants had more than three dominant (≥14 mm) follicles in the trial.

Outline and design per trial. Start dosing, during the cycles dosage can be adjusted. ‡No participants had more than three dominant (≥14 mm) follicles in the trial. Of the included RCTs that provided IPD, five were multicentre studies (Ecochard ; Dankert ; Diamond ; Peeraer ; Danhof ) and two were single-centre studies (Erdem ; Naidu ). All these RCTs were published in English between 2000 and 2020, including a conference abstract-only publication (Naidu ). Five of the seven RCTs only included couples with unexplained infertility. The other two RCTs (Ecochard ; Peeraer ) also included women with other factors of infertility (e.g. ovulatory dysfunction and mixed factors) that were excluded from this IPD-MA (n = 84). In six RCTs involving 1511 women undergoing IUI-OS, gonadotrophins were compared to CC (Ecochard ; Dankert ; Erdem ; Peeraer ; Danhof ; Naidu ). One study, investigating 900 women, compared all three medications: gonadotrophins, letrozole and CC (Diamond ). All RCTs had an IUI protocol with cancellation criteria, with five of them being more strict than the other two (maximum of three dominant follicles) (Ecochard ; Dankert ; Peeraer ; Danhof ; Naidu ). One study performed selective ultrasound-guided follicular aspiration or cancelled the cycle (Peeraer ). The seven RCTs provided IPD on 2411 women who received 5678 IUI cycles. There were 1054 women allocated to gonadotrophins, 299 to letrozole and 1058 to CC. Overall characteristics and outcomes of all couples included in this IPD-MA are presented in Table II and Supplementary Table SIV.
Table II

Overall characteristics and outcomes.

Characteristic or outcomeNumber of studiesNumber of womenGonadotrophins mean (25th–75th percentile) or N (%)Clomiphene citrate mean (25th–75th percentile) or N (%)Letrozole mean (25th–75th percentile) or N (%)
Female age (in years) 6227331.9 (29.0–35.0)32.0 (29.0–35.0)32.2 (29.0–35.0)
Body mass index, BMI 5216124.9 (21.2–27.0)24.7 (21.1–26.6)27.3 (22.3–30.9)
Primary infertility (%) 62273714 (72%)695 (70%)180 (60%)
Number of cycles 623812.5 (1.0–4.0)2.6 (1.0–4.0)3.2 (2.0–4.0)
Live birth (%) 62357287 (28%)222 (22%)56 (19%)
Multiple pregnancy (%) 6235747 (5%)22 (2%)9 (3%)
- Twin 35 (74%)21 (95%)9 (100%)
- Triplet 12 (26%)1 (5%)0 (0%)
Ongoing pregnancy (%) 62357299 (29%)233 (23%)60 (20%)
Clinical pregnancy (%) 72411326 (31%)268 (25%)67 (22%)
Miscarriage (%) 7241171 (7%)54 (5%)9 (3%)
Cancellations, per cycle (%) 52163231 (9%)204 (8%)35 (4%)
Number of follicles >14 mm, per cycle 521632.2 (1–3)2.1 (1–3)2.0 (1–2)
Overall characteristics and outcomes.

Risk of bias of individual RCTs

The details of the risk of bias assessment of the individual RCTs are presented in Fig. 2. One study (Erdem ) was scored high risk of bias at allocation concealment because allocation was not concealed. One study (Diamond et al. 2015) was a double-blinded study for CC and letrozole and was scored low risk of performance bias for this comparison. All RCTs involving gonadotrophins were open label and therefore were scored at unclear risk of performance bias. Given that all reproductive outcomes of interest were objective outcomes, it is unlikely that the non-blinded design will affect these outcome measurements and therefore we scored all the included RCTs at low risk for detection bias. Attrition bias was scored at low risk for all trials. For selective reporting, we scored two studies to have unclear risk (Ecochard ; Naidu ) because study protocol or registration was not available for assessments. Other risk of bias was scored unclear for two studies due to lack of information on important baseline variables (Dankert ; Naidu ).
Figure 2.

Risk of bias summery of the included randomized controlled trials (RCTs) according to the bias assessment tool of the Cochrane Collaboration. Performance bias for Diamond was considered as low risk for letrozole versus clomiphene citrate (CC) and as unclear risk for the other two comparisons.

Risk of bias summery of the included randomized controlled trials (RCTs) according to the bias assessment tool of the Cochrane Collaboration. Performance bias for Diamond was considered as low risk for letrozole versus clomiphene citrate (CC) and as unclear risk for the other two comparisons.

Primary outcomes

Live births

Six trials provided data on live birth. The results per study and the pooled one-stage estimated RRs are shown in Fig. 3a–c and Table III. Gonadotrophins increased the chance of a live birth compared to CC (RR 1.30, 95% CI 1.12–1.51, I2 = 26%) and letrozole (RR 1.72, 95% CI 1.29–2.29). This implies that if the live birth rate following IUI with CC is assumed to be 22%, the live birth rate following IUI with gonadotrophins would be between 25% and 33% (NNT = 15 (9–38)). If the live birth rate following IUI with letrozole is assumed to be 19%, the live birth rate following IUI with gonadotrophins would be between 25% and 44% (NNT = 7 (4–17)). There was insufficient evidence of a difference between letrozole and CC on live birth (RR 0.80, 95% CI 0.59–1.10). If the live birth rate following IUI with CC is assumed to be 23%, the live birth rate following IUI with letrozole would be between 13% and 25% (NNT = −22). We did not conduct a network meta-analysis as only one trial included letrozole (Diamond ).
Figure 3.

Forest plots for live birth and multiple pregnancy. (a–c) Live birth: (a) comparing gonadotrophins and CC; (b) comparing letrozole and CC; (c) comparing gonadotrophins and letrozole. (d–f) Multiple pregnancy: (d) comparing gonadotrophins and CC. (e) Comparing letrozole and CC. (f) Comparing gonadotrophins and letrozole. In each forest plot, the study level estimate was based on individual patient data (IPD) of each individual study and the summary estimate was based on a one-stage IPD meta-analysis (IPD-MA). In (a), Ecochard (2000) did not report live birth and therefore was not included in the IPD-MA. In (d), Ecochard (2000) did not report multiple pregnancy and therefore was not included in the IPD-MA. Note that study level estimates were not shown for two other studies (Peeraer, 2015; Naidu, 2020), due to the presence of 0 events in one group (Peeraer, 2015) or 0 events in both groups (Naidu, 2020), but the one-stage IPD-MA for multiple pregnancy included these two studies. CC, clomiphene citrate; RR, relative risk.

Table III

Meta-analyses and GRADE assessments of all outcomes.

ComparisonOutcomeNumber of RCTsNumber of participantsRisk ratio or hazard ratio95% CI I 2 Overall certainty of evidence (GRADE)
Gn vs CC Live birth620581.301.12–1.5126%Moderatea
Multiple pregnancy620582.171.33–3.5469%Lowa,b
Ongoing pregnancy620581.291.11–1.4918%Moderatea
Clinical pregnancy721121.221.07–1.400%Moderatea
Miscarriage721121.320.94–1.860%Lowa,c
Time to conception620581.371.15–1.6322%Moderatea

Letrozole vs CC Live birth15990.800.59–1.10n/an/a
Multiple pregnancy15991.130.44–2.89n/an/a
Ongoing pregnancy15990.790.59–1.07n/an/a
Clinical pregnancy15990.790.60–1.04n/an/a
Miscarriage15990.650.28–1.47n/an/a
Time to conception15990.770.54–1.09n/an/a

Gn vs Letrozole Live birth16001.721.29–2.29n/an/a
Multiple pregnancy16003.751.83–7.69n/an/a
Ongoing pregnancy16001.661.25–2.18n/an/a
Clinical pregnancy16001.591.22–2.06n/an/a
Miscarriage16002.871.37–6.02n/an/a
Time to conception16002.041.47–2.83n/an/a

Hazard ratio for time to conception leading to live birth

Downgraded by one level due to concerns on risk of bias

Downgraded by one level due to inconsistency

Downgraded by one level due to imprecision

Gn, gonadotrophins; CC, clomiphene citrate; GRADE, Grading of Recommendations, Assessment, Development and Evaluations; RCT, randomized controlled trial.

Forest plots for live birth and multiple pregnancy. (a–c) Live birth: (a) comparing gonadotrophins and CC; (b) comparing letrozole and CC; (c) comparing gonadotrophins and letrozole. (d–f) Multiple pregnancy: (d) comparing gonadotrophins and CC. (e) Comparing letrozole and CC. (f) Comparing gonadotrophins and letrozole. In each forest plot, the study level estimate was based on individual patient data (IPD) of each individual study and the summary estimate was based on a one-stage IPD meta-analysis (IPD-MA). In (a), Ecochard (2000) did not report live birth and therefore was not included in the IPD-MA. In (d), Ecochard (2000) did not report multiple pregnancy and therefore was not included in the IPD-MA. Note that study level estimates were not shown for two other studies (Peeraer, 2015; Naidu, 2020), due to the presence of 0 events in one group (Peeraer, 2015) or 0 events in both groups (Naidu, 2020), but the one-stage IPD-MA for multiple pregnancy included these two studies. CC, clomiphene citrate; RR, relative risk. Meta-analyses and GRADE assessments of all outcomes. Hazard ratio for time to conception leading to live birth Downgraded by one level due to concerns on risk of bias Downgraded by one level due to inconsistency Downgraded by one level due to imprecision Gn, gonadotrophins; CC, clomiphene citrate; GRADE, Grading of Recommendations, Assessment, Development and Evaluations; RCT, randomized controlled trial. Sensitivity analysis on five RCTs with low risk of bias at allocation concealment, thereby excluding Erdem et al., were consistent with the main findings (RR 1.23, 95% CI 1.05–1.45, I2 =0;) (Table V). Sensitivity analysis on RCTs with overall low risk of bias was not performed due to the open-label design on the use of gonadotrophins in all RCTs.
Table V

Sensitivity analyses on live birth and multiple pregnancy comparing gonadotrophins versus clomiphene citrate.

Sensitivity analysisOutcomeNumber of RCTsNumber of participantsRisk Ratio (RR)95% CI I 2
RCTs with low risk of bias at allocation concealment* Live birth518391.231.05–1.450%
Multiple pregnancy518392.371.39–4.0478%

RCTs with low starting dose of gonadotrophins (≤75IU)** Live birth514571.261.05–1.5137%
Multiple pregnancy514570.940.45–1.960%

RCTs with stricter cancellation criteria (≤ 3 dominant follicles)*** Live birth412381.150.94–1.410%
Multiple pregnancy412380.810.32–2.030%

Erdem (2015) was excluded;

Diamond (2015) was excluded;

Both Erdem (2015) and Diamond (2015) were excluded.

RCT, randomized controlled trial.

Sensitivity analyses on live birth and multiple pregnancy comparing gonadotrophins versus clomiphene citrate. Erdem (2015) was excluded; Diamond (2015) was excluded; Both Erdem (2015) and Diamond (2015) were excluded. RCT, randomized controlled trial. Post-hoc sensitivity analyses on RCTs with low starting dose of gonadotrophins (≤75 IU) showed similar results (RR 1.26, 95% CI 1.05–1.51) (Table V). This implies that if the live birth rate following IUI with CC is assumed to be 22%, the live birth rate following IUI with gonadotrophins would be between 23% and 33%. For stricter cancellation criteria, we are uncertain whether gonadotrophins lead to higher live birth rates than CC (RR 1.15, 95% CI 0.94–1.41) (Table V). This implies that if the live birth rate following IUI with CC is assumed to be 22%, the live birth rate following IUI with gonadotrophins would be between 21% and 31%.

Multiple pregnancy

Six trials provided data on multiple pregnancy. The results per study and the pooled one-stage estimated RRs are shown in Fig. 3d–f and Table III. Gonadotrophins increased the risk of a multiple pregnancy compared to both CC (RR 2.17, 95% CI 1.33–3.54, I2 = 69%) and letrozole (RR 3.75, 95% CI 1.83–7.69), whereas there was insufficient evidence of a difference between letrozole and CC (RR 1.13, 95% CI 0.44–2.89). There were 12 triplet pregnancies in the gonadotrophins group, one in the CC group (RR 12.06, 95% CI 1.57–92.46, I2 = 0) and none in the letrozole group. Out of the 12 women in the gonadotrophins group who had triplets, 10 used a high dose (9 used 150 IU and 1 used 225 IU) and 2 used a low dose (75 IU) of gonadotrophins. Post-hoc sensitivity analyses on RCTs with low starting dose of gonadotrophins (≤75 IU) and on RCTs with stricter cancellation criteria (≤3 dominant follicles) showed insufficient evidence of a difference in multiple pregnancy (RR 0.94, 95% CI 0.45–1.96; and RR 0.81, 95% CI 0.32–2.03, respectively) (Table V).

Secondary outcomes per woman

Ongoing pregnancy and clinical pregnancy

Six trials provided data on ongoing pregnancy while all seven trials provided data on clinical pregnancy (Table III). The results were consistent with those of the primary outcome; gonadotrophins increased the change of an ongoing pregnancy and clinical pregnancy compared to both CC and letrozole whereas there was insufficient evidence of a difference between letrozole and CC.

Miscarriage

Seven trials provided data on miscarriages (Table III). There was insufficient evidence of a difference between gonadotrophins and CC (RR 1.32, 95% CI 0.94–1.86, I = 0%) or between letrozole and CC (RR 0.65, 95% CI 0.28–1.47), whereas gonadotrophins increased the chance of a miscarriage compared to letrozole (RR 2.87, 95% CI 1.37–6.02).

Time to conception leading to live birth

Six trials provided data on time to conception leading to live birth (Table III) (Dankert ; Diamond ; Erdem ; Peeraer ; Danhof ; Naidu ). Gonadotrophins reduced the time to conception leading to a live birth compared to both CC (HR 1.37, 95% CI 1.15–1.63, I2 = 22%) and letrozole (HR 2.04, 95% CI 1.47–2.83). Letrozole appeared to increase the time to conception leading to a live birth compared to CC (HR 0.77, 95% CI 0.54–1.09).

Ovarian hyperstimulation syndrome

Three trials reported on ovarian hyperstimulation syndrome, with only one case in the gonadotrophins group. Therefore, a meta-analysis was not performed.

Secondary outcomes per cycle

Cancellations

Five trials provided cycle-level data on cancellations of insemination (5958 cycles) for reasons of cancellation (Supplementary Table SV) (Diamond ; Erdem ; Peeraer ; Danhof ; Naidu ). The use of gonadotrophins resulted in a higher risk for cancellation than letrozole (RR 1.84, 95% CI 1.17–2.87), while there was insufficient evidence of a difference in cancellation of insemination between gonadotrophins and CC (RR 1.15, 95% CI 0.96–1.37, I2 = 77%), or between letrozole and CC (RR 1.14, 95% CI 0.69–1.88).

Number of follicles

Five trials provided cycle-level data on the number of follicles of >14 mm (5958 cycles) (Diamond ; Erdem ; Peeraer ; Danhof ; Naidu ). Gonadotrophins led to a larger mean number of follicles compared to CC (MD 0.16, 95% CI 0.07–0.26, I2 = 90%) and inconclusive findings compared to letrozole (MD 1.15, 95% CI 0.95–1.34), whereas letrozole resulted in a smaller mean number of follicles compared to CC (MD −0.59, 95% CI −0.75 to −0.43).

Exploratory treatment–covariate interactions

Female age

Five trials provided data on female age and live birth and were used for the treatment–covariate interaction analyses on age (Table IV) (Diamond ; Erdem ; Peeraer ; Danhof ; Naidu ). When comparing gonadotrophins to CC, the estimated interaction RR per year of female age was 0.94 (95% CI 0.85–1.05, I2 = 61%). Comparing letrozole to CC, the estimated interaction RR was 1.03 (95% CI 0.96–1.10). Comparing gonadotrophins to letrozole, the estimated interaction RR was 1.02 (95% CI 0.95–1.09). Insufficient evidence on the treatment–covariate interaction of female age was found.
Table IV

Meta-analyses of treatment-covariate interactions on live birth.

ComparisonBaseline covariateNumber of RCTsNumber of participantsRisk ratio (RR)95% CI I 2
Gn vs CC Age519200.940.85–1.0561%
BMI418081.030.95–1.1140%
Type of infertility (primary vs secondary)315890.860.58–1.260%

Letrozole vs CC Age15991.030.96–1.10
BMI15991.020.98–1.08
Type of infertility (primary vs secondary)15991.330.70–2.54

Gn vs Letrozole Age16001.020.95–1.09
BMI16000.990.94–1.03
Type of infertility (primary vs secondary)16000.600.33–1.09

Gn, gonadotrophins; CC, clomiphene citrate; RCT, randomized controlled trial.

Meta-analyses of treatment-covariate interactions on live birth. Gn, gonadotrophins; CC, clomiphene citrate; RCT, randomized controlled trial.

BMI

Four trials provided data on BMI and live birth and were used for the treatment–covariate interaction analyses on BMI (Table IV) (Diamond ; Erdem ; Peeraer ; Danhof ). When comparing gonadotrophins to CC, the estimated interaction RR per unit of BMI was 1.03 (95% CI 0.95–1.11, I2 = 40%). Comparing letrozole to CC, the estimated interaction RR was 1.02 (95% CI 0.98–1.08). Comparing gonadotrophins to letrozole, the estimated interaction RR was 0.99 (95% CI 0.94–1.03). Insufficient evidence on the treatment–covariate interaction of BMI was found.

Primary versus secondary infertility

Five trials provided data on live birth and type of infertility but due to small number of events, RR was not estimable in Naidu and due to primary infertility as part of the inclusion criteria, the interaction was not estimable in Erdem ; therefore three trials remained (Diamond ; Peeraer ; Danhof ). When comparing gonadotrophins to CC, the estimated interaction RR for primary versus secondary infertility was 0.86 (95% CI 0.58–1.26, I2 = 0%). Comparing letrozole to CC, the estimated interaction RR was 1.33 (95% CI 0.70–2.54). Comparing gonadotrophins to letrozole, the estimated interaction RR was 0.60 (95% CI 0.33–1.09). Insufficient evidence on the treatment–covariate interaction of primary versus secondary infertility was found.

Additional analysis

We did not present funnel plots as fewer than 10 trials were included. Meta-analyses of trials without IPD showed overlapping confidence intervals with the IPD-MAs in most comparisons, except for letrozole versus CC (Supplementary Table SVI). None of the trials comparing letrozole versus CC that did not contribute to IPD reported live birth. In addition, meta-analysis of trials not contributing to IPD showed higher clinical pregnancy rates in the letrozole group compared to CC (RR 1.66, 95% CI 1.24–2.24), which was inconsistent with the result from the trials contributing to IPD.

Discussion

Summary of evidence

In this IPD-MA, we evaluated the effectiveness and safety of ovarian stimulation with gonadotrophins, letrozole or CC in couples with unexplained infertility undergoing IUI-OS. Moderate-quality evidence showed that gonadotrophins increased the chance of a live birth compared to both CC and letrozole, while low-quality evidence due to substantial heterogeneity, suggested it may also increase the chance of a multiple pregnancy, especially for triplet pregnancy. Gonadotrophins reduced the time to conception leading to live birth when compared to both CC and letrozole. Gonadotrophins gave a significantly higher number of dominant follicles compared to letrozole but also a significantly higher risk of cancellation. Compared to CC, the number of follicles was significantly higher when gonadotrophins were used but this does not necessarily lead to a higher cancellation rate, as the heterogeneity for the number of follicles was very high. We did not find treatment–covariate interactions on live birth for the pre-specified covariates: age, BMI or primary versus secondary infertility. The heterogeneity between trials comparing gonadotrophins and CC on multiple pregnancy could be explained by the choice of different starting doses of gonadotrophins and cancellation criteria in different trials. In the gonadotrophin group, there were 12 triplet pregnancies, and 10 of these women used a high dose of gonadotrophins (9 used 150 IU and 1 used 225 IU). When limiting the analysis to studies with a low starting dose of gonadotrophins (<=75 IU), ovarian stimulation with gonadotrophins still significantly increased the probability of live birth. When limiting the analysis to studies with strict cancellation criteria, the difference was no longer significant, such that we are uncertain whether ovarian stimulation with gonadotrophins lead to higher live birth rates compared to CC. Both sensitivity analyses showed comparable risks of a multiple pregnancy between gonadotrophins and CC.

Strengths and limitations

The strengths of the IPD-MA are the harmonization of the eligibility criteria (by excluding women with anovulatory infertility) and outcome definitions, the analyses of time to conception and treatment–covariate interactions. A potential limitation of this IPD-MA is that we were not able to access the IPD of all eligible studies. IPD was available for just 32% (7/22) of the included trials, however the included studies accounted for 62% (2495/3997) of all participants. Previous empirical evidence in our research field has demonstrated that results of the RCTs without IPD have lower quality and more methodological issues compared to RCTs who shared IPD and therefore may result in different findings (Wicherts ; Bordewijk ). The willingness to share these data may indicate a good quality trial. For example, all 15 studies that did not share IPD lacked adequate trial registration (Supplementary Table SIII). Only two had trial registration, both of which were registered after start of the recruitment process. It is worth noting that one eligible trial with a large sample size (n = 412) comparing letrozole and CC was retracted due to concerns on its validity and this trial was not included in our IPD-MA (Badawy ). As data checking is a mandatory process during IPD-MA, evidence from this IPD-MA should be considered the gold standard to inform clinical practice. Another limitation is that some baseline variables including total sperm motile count, smoking status, ethnicity and Hunault score were not available in the databases of multiple trials and therefore these treatment–covariate interactions were not explored.

Interpretations

Clinical decision-making should be based on a joint assessment of safety, effectiveness, availability, cost of the interventions as well as couples’ preferences. Multiple pregnancy is an important measure for safety. The overall effectiveness on gonadotrophins in multiple pregnancy was dominated by a single study (Diamond ) with the highest number of events in the gonadotrophins group (34/301), in which a less strict cancellation criteria and a conventional starting dose (150 IU) were used. Although the post-hoc sensitivity analyses should be interpreted with caution, they indicate that a lower starting dose of gonadotrophins and stricter cancellation criteria may result in a comparable low multiple pregnancy rate between gonadotrophins to CC, by reducing the chance of multifollicular growth and/or cancellation of the cycle (van Rumste ). In view of ongoing concerns on high multiple pregnancies resulting from non-IVF medically assisted reproduction procedures globally, a low starting dose of gonadotrophins with strict cancellation criteria should be considered (Danhof ; Bergh ), but it is not clear to which extent this recommendation has been applied in daily clinical practice worldwide. The findings of the overall IPD-MA on effectiveness including all studies favours gonadotrophins, but the advantage of gonadotrophins over CC on effectiveness (live birth) becomes smaller when factoring in a low starting dose or even disappears when applying strict cancellation criteria. In a modern practice where a lower starting dose and stricter cancellation criteria are in place, the effectiveness and safety of the two different agents seem both acceptable, and therefore intervention availability, cost and patients’ preferences are valued as more important in clinical decision-making. Letrozole is still an off-label agent for unexplained infertility in many countries, resulting in its unavailability, although it has been widely used in clinical practice with no evidence of an increased risk of congenital foetal malformation (Pundir ). Comparisons including letrozole were underpowered due to letrozole only being used in a single trial in this IPD-MA. The other 11 eligible trials involving letrozole did not contribute IPD, including eight studies which compared letrozole to CC and three studies which compared gonadotrophins to letrozole. RCTs contributing IPD and RCTs not contributing IPD showed inconsistent results on clinical pregnancy (Supplementary Table SVI) and none of the RCTs that did not contribute IPD reported on live birth. Therefore, evidence on its effectiveness in women with unexplained infertility is urgently needed and new RCTs comparing letrozole with other ovarian stimulation agents should be performed. Recent cost-effectiveness analyses on ovarian stimulation agents in IUI showed that in settings where a live birth is valued at €3000 or less, between €3000 and €55 000 and above €55 000, CC, letrozole and gonadotrophins were the most cost-effective option in terms of net benefit, respectively (van Eekelen ). While recommending CC and letrozole, the authors also highlighted the high uncertainty surrounding such findings and call for more research on the relative effectiveness in this area (van Eekelen ). This could be due to the overall small differences between these agents in modern practice where a lower starting dose and stricter cancellation criteria are in place (Danhof ). Given the cost variations across countries, future cost-effectiveness studies in different settings would be helpful to provide further health economic evidence to inform clinical decision-making. This is especially important for clinical decision-making in low or middle resource settings, where limited economic evidence is available. Finally, although preferences would depend on the health system in different countries, oral agents are still likely to be preferred in many settings when the effectiveness and safety are acceptable, given their convenience in use (Practice Committee of the American Society for Reproductive Medicine, 2020).

Conclusion

In couples with unexplained infertility undergoing IUI-OS, gonadotrophins increased the chance of a live birth and reduced the time to conception compared to CC, at the cost of a higher multiple pregnancy rate, when not differentiating strategies on the cancellation criteria or the starting dose. The treatment effects did not seem to differ in women with different age, BMI or primary versus secondary infertility. In a modern practice where a lower starting dose and stricter cancellation criteria are in place, effectiveness and safety of different agents seem both acceptable, and therefore intervention availability, cost and patients’ preferences should factor in the clinical decision-making. As the evidence for comparisons to letrozole is based on one RCT providing IPD, further RCTs comparing letrozole and other interventions for unexplained infertility are needed.

Supplementary data

Supplementary data are available at Human Reproduction Update online. Click here for additional data file.
  35 in total

1.  Preferred Reporting Items for Systematic Review and Meta-Analyses of individual participant data: the PRISMA-IPD Statement.

Authors:  Lesley A Stewart; Mike Clarke; Maroeska Rovers; Richard D Riley; Mark Simmonds; Gavin Stewart; Jayne F Tierney
Journal:  JAMA       Date:  2015-04-28       Impact factor: 56.272

2.  Recombinant FSH increases live birth rates as compared to clomiphene citrate in intrauterine insemination cycles in couples with subfertility: a prospective randomized study.

Authors:  Mehmet Erdem; Seraf Abay; Ahmet Erdem; Mehmet Firat Mutlu; Esra Nas; Ilknur Mutlu; Mesut Oktem
Journal:  Eur J Obstet Gynecol Reprod Biol       Date:  2015-03-28       Impact factor: 2.435

3.  Letrozole versus clomiphene citrate for superovulation in Egyptian women with unexplained infertility: a randomized controlled trial.

Authors:  Moustafa I Ibrahim; Rowaa A Moustafa; Ahmed A Abdel-Azeem
Journal:  Arch Gynecol Obstet       Date:  2012-07-26       Impact factor: 2.344

4.  A randomized clinical trial to determine optimal infertility treatment in older couples: the Forty and Over Treatment Trial (FORT-T).

Authors:  Marlene B Goldman; Kim L Thornton; David Ryley; Michael M Alper; June L Fung; Mark D Hornstein; Richard H Reindollar
Journal:  Fertil Steril       Date:  2014-04-30       Impact factor: 7.329

5.  A randomized trial of letrozole versus clomiphene citrate in women undergoing superovulation.

Authors:  Haya Al-Fozan; Maha Al-Khadouri; Seang Lin Tan; Togas Tulandi
Journal:  Fertil Steril       Date:  2004-12       Impact factor: 7.329

6.  Recombinant FSH versus clomiphene citrate for ovarian stimulation in couples with unexplained infertility and male subfertility undergoing intrauterine insemination: a randomized trial.

Authors:  Bulent Berker; Korhan Kahraman; Salih Taskin; Yavuz Emre Sukur; Murat Sonmezer; Cem Somer Atabekoglu
Journal:  Arch Gynecol Obstet       Date:  2011-07-20       Impact factor: 2.344

7.  Use of an aromatase inhibitor for induction of ovulation in patients with an inadequate response to clomiphene citrate.

Authors:  M F Mitwally; R F Casper
Journal:  Fertil Steril       Date:  2001-02       Impact factor: 7.329

8.  Prediction of an ongoing pregnancy after intrauterine insemination.

Authors:  Pieternel Steures; Jan Willem van der Steeg; Ben W J Mol; Marinus J C Eijkemans; Fulco van der Veen; J Dik F Habbema; Peter G A Hompes; Patrick M M Bossuyt; Harold R Verhoeve; Yvonne M van Kasteren; Peter A van Dop
Journal:  Fertil Steril       Date:  2004-07       Impact factor: 7.329

9.  Risk of foetal harm with letrozole use in fertility treatment: a systematic review and meta-analysis.

Authors:  Jyotsna Pundir; Chiara Achilli; Priya Bhide; Luca Sabatini; Richard S Legro; Luk Rombauts; Helena Teede; Arri Coomarasamy; Javier Zamora; Shakila Thangaratinam
Journal:  Hum Reprod Update       Date:  2021-04-21       Impact factor: 15.610

10.  Meta-analytical methods to identify who benefits most from treatments: daft, deluded, or deft approach?

Authors:  David J Fisher; James R Carpenter; Tim P Morris; Suzanne C Freeman; Jayne F Tierney
Journal:  BMJ       Date:  2017-03-03
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.