Literature DB >> 23178304

Why we should talk about compliance with assisted reproductive technologies (ART): a systematic review and meta-analysis of ART compliance rates.

S Gameiro1, C M Verhaak, J A M Kremer, J Boivin.   

Abstract

BACKGROUND: The goal of this systematic review and meta-analysis was to estimate the rate of compliance with assisted reproductive technologies (ART) and examine its relationship with treatment success rates.
METHODS: Six databases were systematically searched from 1978 to December 2011. Studies were included if they reported data on patient progression through three consecutive standard ART cycles. Compliance was estimated for the first three ART cycles (typical ART Regimen Compliance, TARC) and after the first and the second failed cycles (CAF1, CAF2). Treatment success rates for all patients who started ART and for those who fully complied with the three ART cycles were estimated.
RESULTS: Ten studies with data for 14 810 patients were included. TARC was 78.2% [95% confidence interval (CI) 68.8-85.3%], CAF1 was 81.8% (73.3-88.1%) and CAF2 was 75.3% (68.2-81.2%). The overall success rate was 42.7% (32.6-53.6%) for all patients starting ART and 57.9% (49.4-65.9%) for those who complied with three ART cycles. Compliance rates did not vary according to study quality, but TARC was higher for studies that reported data on doctor-censored patients versus those that did not (84.2% 95% CI 75.5-90.2 versus 70.6% 95% CI 58.3-80.5, P = 0.043). Analysis of funnel plots and the Egger test indicated publication bias for CAF1.
CONCLUSIONS: Findings from this meta-analysis should reassure clinics and patients that most patients are able to comply with three cycles of ART. Compliers could increase their chances of success by as much as 15%. A more detailed assessment of compliance requires monitoring long-term treatment trajectories through the creation of national registries.

Entities:  

Mesh:

Year:  2012        PMID: 23178304      PMCID: PMC3576003          DOI: 10.1093/humupd/dms045

Source DB:  PubMed          Journal:  Hum Reprod Update        ISSN: 1355-4786            Impact factor:   15.610


Introduction

Most couples have life plans that include having children but 9–15% will have problems conceiving spontaneously (Boivin ). Infertility is a significant impairment of function, which the first World Disability Survey ranks as 5th in the list of moderate to severe disabilities within the global population under the age of 60 (World Health Organization and The World Bank, 2011). Fortunately, the chances of achieving parenthood are high for couples undergoing fertility treatment. The world live birth rate with assisted reproductive technologies (ART, e.g. IVF) is 22% per single initiated cycle of treatment (de Mouzon ) but can be 49% (Stern ) or higher (Witsenburg ; Verhagen ) if people undergo the optimal number of cycles, typically three [National Institute for Clinical Excellence (NICE), 2004, p.5]. However, many couples do not undergo multiple cycles of ART, even when there is a favourable prognosis and ability to cover the costs of treatment (Domar 2004; Brandes ). Indeed, discontinuation rates as high as 65% mainly due to psychological demands of treatment (Smeenk ; Brandes ) have been reported (Rajkhowa ). Practice guidelines and national regulations emphasize the importance of discussing treatment success rates but not the rates of discontinuation [National Institute for Clinical Excellence (NICE), 2004; European Society of Human Reproduction and Embryology (ESHRE), 2008; The Practice Committee of the Society for Assisted Reproductive Technology and the Practice Committee of the American Society for Reproductive Medicine, 2008]. Recently, the UK National Institute for Clinical Excellence (NICE) recommended using compliance as a way of auditing treatment delivery at clinics [National Institute for Clinical Excellence (NICE), 2004, p.42], but to our knowledge, this has not been done. The World Health Organization (WHO) defines treatment compliance (or adherence) as ‘… the extent to which a person's behaviour follows medical advice or corresponds with agreed recommendations from a health care provider…’ (WHO, 2003, p.3). In medical practice, in general, compliance means ‘the degree of constancy and accuracy with which a patient follows a prescribed regimen’ (http://medical-dictionary.thefreedictionary.com/compliance). Therefore, in ART, compliance would refer to the uptake of the ART cycles recommended by the doctor until pregnancy is achieved or until there is a recommendation to end treatment (as well as compliance with medication, which is not addressed in the present review.) Although the terminology is compatible with the concepts of shared and informed decision-making on the part of the patient, there has been a reluctance to conceptualize discontinuation in ART as a compliance issue or to influence patient decision-making about pursuing treatment. Reference to compliance is made implicitly, when clinicians mention cumulative pregnancy rates or offer financial packages that take into account better success rates with multiple cycles (Garrido ); however, few patients recall having the opportunity to discuss the advantages (24%) or disadvantages (18%) of ending/continuing treatment (Peddie ). The lack of emphasis on compliance in fertility treatment may be due to several factors. Unlike other disease contexts, people can opt out of fertility treatment without threatening their physical health and opting out can at times have beneficial consequences, for example on mental health (Peddie ). Active intervention to encourage compliance could also be avoided because of popular conceptions of fertility doctors taking advantage of desperate infertile couples (Thompson, 2005). However, even if doctors want to discuss compliance with their patients, they lack precise information as its prevalence has not yet been systematically estimated from the available literature. Whatever the cause, providing explicit information about compliance at the start of treatment (e.g. compliance rate, consequence of ending/continuing treatment on success rate) is essential for informed consent; otherwise, patients begin treatment optimistic about success without fully realizing that the demands of treatment (e.g. physical, emotional and practical) may be such that they are unable to pursue the optimal number of cycles even when their prognosis is favourable and costs of treatment are covered (Domar, 2004; McDowell and Murray, 2011). There is high variability in the discontinuation rate reported in primary research, ranging from 15% (Brandes ) to 65% (Rajkhowa ), which makes it difficult to be confident about compliance. Variability may, in large part, be explained by the lack of consensus on the definition and monitoring of compliance; for example, in many studies the non-complier group includes poor prognosis patients who discontinued treatment because they were advised to stop treatment (De Vries ), some studies monitor patients for too short a follow-up period to accurately conclude on compliance (Land ) and most studies do not control for patients who continue treatment at different clinics (Stolwijk ; Verhagen, ) or at a later time in their lives (Pearson ). Other issues that contribute to variability in the compliance rate reported are treatment reimbursement policy, the type of population under study (e.g. previous experience with ART and parity), the type of ART treatment investigated and other methodological aspects (e.g. design, assessment of treatment initiation and success). Another important issue is that primary research has shown that ART success rates cannot be accurately estimated without considering discontinuation (Land ), and therefore, the aforementioned issues would also impact on the reporting of success rates in ART. Further, the clinics’ success rates may also influence compliance as past research has shown that people move to clinics perceived to have higher pregnancy rates to improve their chances of success (Marcus ). A systematic review taking into account these issues would help achieve greater clarity on compliance in ART and its association with treatment success rates. The aims of the present systematic review and meta-analysis were 3-fold. The first goal was to provide the first estimate of compliance among typical infertile patients undergoing standard ART treatment. In order to promote future consensus on how to define, monitor and report compliance, the second goal was to examine conceptual and methodological causes of variability in compliance. Finally, the third goal was to assess how compliance is associated with treatment success rates.

Methods

Systematic search

The present work is part of a larger review that investigated reasons and predictors of discontinuation from fertility treatment (Gameiro ). The Sure Support Unit for Research Evidence (Cardiff University) searched six databases (Medline, Medline In Progress, EMBASE, BNI, PsycINFO and The Cochrane Library) from 1978 to December 2011 (inclusive). A search strategy was created using terminology from the International Committee for Monitoring Assisted Reproductive Technology and the WHO-revised glossary of ART (Zegers-Hochschild ) for fertility treatment (e.g. ART, IVF) AND discontinuation (e.g. dropout, compliance and discontinuation), which, with small adaptations, was used in all databases (see Supplementary data, Table SI). MeSH terms were used in PubMed. No restriction was made on the type (journal, conference paper or dissertation) or language of publication. The reference sections of all identified articles were examined by S.G. and a research specialist (Debbie Moss, see funding) to identify other relevant manuscripts.

Inclusion and exclusion criteria

Studies were included if data were reported (or could be obtained from the corresponding author) on patient progression through a maximum of three consecutive standard ART (IVF or ICSI) cycles (i.e. number of patients starting, pregnant, discontinuing, continuing after failed treatment) or, if fewer, until pregnancy or until the clinician recommended the patient to end treatment (i.e. doctor censoring, where this information was provided). Three cycles were used because it is the typically recommended and/or subsidized number of cycles that patients face for an optimal chance of pregnancy in an ART programme [National Institute for Clinical Excellence (NICE), 2004]. Only studies that focused on patients with no previous experience of ART were included. Studies that solely investigated single groups (e.g. third-party reproduction, recurrent miscarriage) or specific ART treatment (e.g. modified natural IVF, transport IVF/ICSI) were also excluded to focus on the typical ART population. Duplicate or secondary publications on the same sample were excluded to avoid multiple-publication bias. In these cases, we prioritized the publication that focused on discontinuation from treatment and, if this criterion did not apply, the publication that reported data for the largest sample. Excluded studies were classified according to reason for exclusion (see Fig. 1).
Figure 1

Decision flowchart for identified studies.

Decision flowchart for identified studies.

Data extraction

S.G. and a research specialist (D.B.) extracted data using a standardized protocol. Disagreement was resolved by discussion. Data were extracted or obtained from the corresponding author on characteristics of the study (e.g. country of origin, design), study population (e.g. average female age), clinical protocol (e.g. type of ART), health context (e.g. availability of subsidized/reimbursed treatment) and methodology (e.g. duration of follow-up period, inclusion or exclusion of cryopreserved IVF cycles in data reported). The data extracted to calculate the compliance rates were the numbers of patients who started treatment, who had successful or failed treatment, who were recommended to end treatment by their doctor (i.e. doctor censoring, where provided) and who discontinued or continued after a failed cycle. For those studies that reported on doctor censoring, data on its medical indication were also extracted.

Quality assessment

S.G., J.B. and C.M.V. assessed study quality according to the Newcastle–Ottawa Quality (NOQ) assessment scale (Wells ) adapted for the present study. The NOQ is used to appraise quality in terms of population representativeness, measurement of outcome (compliance), within-population comparability (compliers versus discontinuers) and adequacy of follow-up (completion rates). The specific criteria used for quality assessment were already described elsewhere (Gameiro ). Low-, moderate- and high-quality labels were assigned to scores of 0–2, 3–5 and 6–7, respectively (see Supplementary data, Table SVI).

Data analysis

Studies differed in terms of the number of subsidized treatment cycles and the number of cycles followed up. To control for this variability, we based our compliance calculations on the treatment uptake for the first three ART cycles. Uptake of the first cycle was 100% because studies only followed up patients who did a first cycle. We assumed that after failure on first or second cycles, patients would be expected to undertake a further cycle unless they were recommended to end treatment (i.e. doctor censoring). Ideally, treatment success should be defined as achievement of a live birth. However, that is often not the case in primary research. Thus, treatment success (versus failure) was defined according to the success outcome reported in the primary study, which could be a β-hCG urine or blood test ≤21 days after embryo transfer, an ultrasonographic visualization of fetal heart activity or a live birth, as per standard definitions (Zegers-Hochschild ). Three compliance rates were calculated per study: Typical ART Regimen Compliance (TARC) and compliance after the first and the second failed cycles (Compliance After-Failure, CAF1, CAF2). The TARC rate referred to patients who complied with all treatments recommended to them, that is, patients who continued with treatment for up to three cycles or until treatment success (as defined) or until advised to end treatment (i.e. doctor censoring). TARC was the sum of the number of patients who opted to undergo all three cycles when they failed on the first and the second cycles and of patients who stopped treatment either because it was successful or because they were censored by the doctor (where data on doctor censoring was reported), divided by the total number starting ART: The CAF rates provided an after-failure examination of compliance, that is, of patients who opted to undergo a further cycle after having had a failed cycle and therefore was the sum of the number of patients undergoing a further cycle divided by the number of patients with a failed cycle. Compliance after-failure was calculated for the first (CAF1) and the second (CAF2) failed cycles. Doctor-censored patients were excluded from the calculation of compliance after-failure rates (where such data were reported) because these patients would have been recommended to stop treatment, and therefore were not eligible for cycle uptake. The following formulas were used: To examine whether the clinic's success rates per cycle were associated with compliance after those cycles, we computed treatment success rates per cycle (first and second cycle) for each study. As all but one study (Rufat ) (excluded from this analysis) were single centre, this was equivalent to providing first and second cycle success rates for each clinic. The rates per cycle (first and second cycle) were the number of patients with a successful outcome in the first or the second cycle divided by the number of patients who underwent the first or the second cycle. To investigate how treatment success rates varied when compliance was taken into account, we calculated an overall success rate, which was the number of patients with a successful outcome in the first three ART cycles divided by the number of all patients who started the first ART cycle. We then calculated a separate typical regimen success rate that included only compliers (as defined in preceding TARC formula), that is, the number of patients with a successful outcome in the first three ART cycles divided by the number of compliers. Therefore, for each study, we had three types of success rates: clinic success rates per cycle [first and second cycles, excluding the study by Rufat )], overall success rate and success rate for compliers. In order to correct for variations in study sample size, pooled estimates across studies were obtained by means of random-effects models, after log transformation. We chose a random-effects model because single-group meta-analysis produces substantial heterogeneity. The I2 index was used to describe the proportion of total variation in study estimates that was due to heterogeneity (Higgins ). Subgroup and meta-regression analyses based on the random-model were performed to identify causes of heterogeneity in compliance rates among studies. Causes were defined a priori and referred to characteristics related to the studies' clinical aspects [clinic geographic location, clinic's success rates per cycle (assessed only for CAF), number of embryo transfer policy and whether treatment was subsidized/reimbursed], patients (parity) and methodology (study design, handling of doctor censoring, length of follow-up, definition of start-of-cycle and success, handling of cryopreserved IVF cycles, quality rating, year of publication). The χ2 test was used to assess differences between the subgroups and the significance of the meta-regression coefficients were assessed with a Z-test. Publication bias was examined via visual inspection of the funnel plots (of the natural log of the rates against its standard error) and the Egger's test (Egger ). Trim and fill was used to adjust the pooled rates for the presence of publication bias (Duval and Tweedie, 2000). We used the Comprehensive Meta Analysis software (Biostat Inc, 2011).

Results

Description of studies

The systematic search yielded 1128 non-duplicated records. Figure 1 presents the study decision flow chart. S.G. and D.B. agreed inclusion on all studies and agreed on reasons for exclusion for 91% of studies (see Table II of supplemental material for reasons for exclusion of full manuscripts screened). The authors of the 10 papers with missing or inconsistent data and of 5 other included papers were contacted to obtain missing data from the manuscripts. Four authors replied stating that the requested data were not available.
Table II

Design characteristics of the 10 included studies.

StudyProspective designa (yes/no)Data collection periodData available on number of doctor-censored patients (yes/no, if yes, reason for censoring)Definition of cycle start (started ovarian stimulation, had oocyte retrieval)Definition of treatment success (positive test, positive scan, live birthb)Number of embryo transfer policyFollow-up period (<12 months, ≥12 monthsc)IVF cycles exclude cryopreserved embryo transfers (yes/no)Subsidized/reimbursed treatment (yes/no)
Brandes et al. (2009)No2002–2004Yes, ‘poor prognosis (doctor's refusal)’Ovarian stimulationPositive scanNR≥12 monthsNRYes
De Vries et al. (1999)No1993–1996NoOvarian stimulationPositive testNR≥12 monthsNRNR
Emery et al. (1997)Yes1 yearNoOvarian stimulationPositive testNR≥12 monthsNRYes
Land et al. (1997)No1993–1994Yes, ‘denied further treatment for medical reasons (poor response to hMG or poor fertilization)’Ovarian stimulationPositive scanNR<12 monthsNRYes
Pearson et al. (2009)No1994–1998 and 1999–2003NoOvarian stimulationLive birthNRNRYesNR
Rufat et al. (1994)No1988–1992NoOocyte retrievalPositive scanNR≥12 monthsNRNR
Smeenk et al. (2004)Yes1999–2000Yes, ‘active censuring’Ovarian stimulationPositive scanNR≥12 monthsNodYes
Stolwijk et al. (1996)No1988–1993Yes, ‘a previous treatment with a fertilization rate of <10%, despite the presence of more than three large follicles (15 mm) on the day of HCG administration and the performance of oocyte aspiration, or three or less large follicles during two previous treatments'Ovarian stimulationPositive scanNRNRNoeNR
Verhagen et al. (2008)No2000–2003Yes, ‘active censuring (poor response, poor fertilization, poor response with poor fertilization, overweight with BMI >30 kg/m2, hypertension or improved semen quality not requiring ICSI any more)’Ovarian stimulationPositive testNRNRNoeYes
Witsenburg et al. (2005)No1996–2000NoOvarian stimulationLive birthMaximum of two when age <38, maximum of three when age ≥3<12 monthsNoeYes

NR, not reported; hMG, human menopausal gonadotrophins; HCG, human chorionic gonadotropin; BMI, body mass index; ICSI, intra cytoplasmic sperm injection.

aProspective studies are those where study design and data collection happened before any information on the outcome of interest was collected.

bPositive test: positive βhCG urine/blood test, positive scan: fetal heart activity at 6/7 weeks.

cor adequacy of follow period sufficiently justified by authors.

dNo information was given about how cryopreserved embryo transfer cycles were considered.

eTransfers of cryopreserved embryos were considered to be part of the cycle from which the embryos resulted.

The 10 included studies sampled 14 810 patients from five countries. The population characteristics and design features of the studies are shown in Tables I and II. See Supplementary data, Tables SIII–V for treatment trajectory data. Critical appraisal of the studies is shown in Table III. NOQ ratings indicated no low-quality study, three average studies (30%) and seven high-quality studies (70%) with substantial inter-rater agreement (S.G. and C.M.V.: Cohen's κ = 0.750, P = 0.007; S.G. and J.B.: Cohen's κ = 0.872, P < 0.001). See Table SIV of supplementary data for details on critical appraisal of the studies.
Table I

Sample characteristics reported in the 10 included studies.

StudyCountrySample sizeSelected population If yes, descriptionAge of women in years, mean ± SD (range)Duration of infertility in years, mean ± SDParity (none or at least one child)
Brandes, et al. (2009, 2011)The Netherlands373NoCOMP:31.0 ± 4.1, DISC:33.3 ± 5.1aCOMP:1.31 ± 1.0, DISC:1.9 ± 1.68aNR
De Vries, et al. (1998, 1999)Belgium1169NoCOMP:31 ± 4.3, DISC:32 ± 5.5aNRNR
Emery et al. (1997) and Slade et al. (1997)UK130No32.21 ± 3.378.27 ± 2.97None
Land et al. (1997)The Netherlands197NoNRNRNR
Pearson et al. (2009)USA2245Excluded patients using donor gametes35.2 ± 4.3 (20–49)NRAt least one child
Rufat et al. (1994)France8362No33.1 ± 4.3NRNR
Smeenk et al. (2004)The Netherlands380No34.1 ± 3.9 (21–43)3.7 ± 2.2 (1–16)NR
Stolwijk et al. (1996)The Netherlands616Excluded patients using donor gametesNRNRNR
Verhagen et al. (2008)The Netherlands588Excluded patients starting IVF for preimplantation genetic diagnosis, surgical sperm aspiration or using donor gametesCOMP:32.9 ± 3.6, DISC:33.8 ± 4.1aCOMP:3.0 ± 2.2, DISC:3.5 ± 2.4aNR
Witsenburg et al. (2005)The Netherlands750No33.0 ± 4.0NRNR

IVF, In vitro fertilization; COMP, group of patients who complied with treatment; DISC, group of patients who discontinued; NR, not reported; USA, United States of America; UK, United Kingdom.

aAverage age and duration of infertility for total sample not reported.

Table III

Quality ratings for the 10 included studies using an adapted Newcastle–Ottawa Quality assessment scale.

StudyQuality criterion
Overall quality rating (0–7)
Representative populationa (0–1)Ascertainment of treatment trajectoryb (0–3)Comparabilityc (0–2)Follow-upd (0–1)
Brandes et al. (2009)13217 (high)
De Vries et al. (1999)12216 (high)
Emery et al. (1997)12216 (high)
Land et al. (1997)12216 (high)
Pearson et al. (2009)11114 (moderate)
Rufat et al. (1994)12216 (high)
Smeenk et al. (2004)13206 (high)
Stolwijk et al. (1996)12115 (moderate)
Verhagen et al. (2008)12216 (high)
Witsenburg et al. (2005)11215 (moderate)
% of studies that meet criteria100%20% meet three criteria60% meet two criteria20% meet one criteria80% meet two criteria20% meet one criteria90%70% (high)30% (moderate)0% (low)

aThe ‘representativeness criterion’ was met when >80% of eligible patients were invited and >80% agreed to participate, or when the study reported on all consecutive series of patients over a defined period of time, or when sample size was >300 (1 point).

bThe ‘ascertainment of treatment trajectory’ criterion was met if the study provided enough data to ascertain that withdrawal from treatment was premature (before three cycles completed and not pregnant and not due to poor prognosis; 1 point), that withdrawal was either permanent (at least 12-monthperiod since last treatment cycle or permanence sufficiently justified by authors) or not only from the target clinic (patients did not go to other clinics) (1 point) and that withdrawal was ascertained from secure records (i.e. medical records, 1 point).

cThe ‘comparability criterion’ was met if all participants did treatment during the same period (i.e. data collection period was <5 years) (1 point); and sample was homogeneous regarding access to treatment (i.e. insurance coverage or number of subsidized cycles was described) or poor prognosis factors (i.e. mean age for all sample <40 or no statistical significant difference in age between groups) or type of treatment (all patients received the same treatment protocol), or IVF cycles excluded cryopreserved embryo transfer excluded (1 point).

dThe ‘follow-up criterion’ was met if all cases were accounted for or completion rate (number of patients with outcome at follow-up divided by the number of patients that initiated) was >80% or description of patients lost to follow-up showed lack of bias (1 point).

The overall quality rating was the sum of met criteria (maximum seven). Quality ratings were grouped into low (0–3), moderate (4–5) and high (6–7) quality studies.

Sample characteristics reported in the 10 included studies. IVF, In vitro fertilization; COMP, group of patients who complied with treatment; DISC, group of patients who discontinued; NR, not reported; USA, United States of America; UK, United Kingdom. aAverage age and duration of infertility for total sample not reported. Design characteristics of the 10 included studies. NR, not reported; hMG, human menopausal gonadotrophins; HCG, human chorionic gonadotropin; BMI, body mass index; ICSI, intra cytoplasmic sperm injection. aProspective studies are those where study design and data collection happened before any information on the outcome of interest was collected. bPositive test: positive βhCG urine/blood test, positive scan: fetal heart activity at 6/7 weeks. cor adequacy of follow period sufficiently justified by authors. dNo information was given about how cryopreserved embryo transfer cycles were considered. eTransfers of cryopreserved embryos were considered to be part of the cycle from which the embryos resulted. Quality ratings for the 10 included studies using an adapted Newcastle–Ottawa Quality assessment scale. aThe ‘representativeness criterion’ was met when >80% of eligible patients were invited and >80% agreed to participate, or when the study reported on all consecutive series of patients over a defined period of time, or when sample size was >300 (1 point). bThe ‘ascertainment of treatment trajectory’ criterion was met if the study provided enough data to ascertain that withdrawal from treatment was premature (before three cycles completed and not pregnant and not due to poor prognosis; 1 point), that withdrawal was either permanent (at least 12-monthperiod since last treatment cycle or permanence sufficiently justified by authors) or not only from the target clinic (patients did not go to other clinics) (1 point) and that withdrawal was ascertained from secure records (i.e. medical records, 1 point). cThe ‘comparability criterion’ was met if all participants did treatment during the same period (i.e. data collection period was <5 years) (1 point); and sample was homogeneous regarding access to treatment (i.e. insurance coverage or number of subsidized cycles was described) or poor prognosis factors (i.e. mean age for all sample <40 or no statistical significant difference in age between groups) or type of treatment (all patients received the same treatment protocol), or IVF cycles excluded cryopreserved embryo transfer excluded (1 point). dThe ‘follow-up criterion’ was met if all cases were accounted for or completion rate (number of patients with outcome at follow-up divided by the number of patients that initiated) was >80% or description of patients lost to follow-up showed lack of bias (1 point). The overall quality rating was the sum of met criteria (maximum seven). Quality ratings were grouped into low (0–3), moderate (4–5) and high (6–7) quality studies.

Meta-analysis

Compliance rates

Figure 2 shows the pooled TARC rate for the random-effect model. One study, Brandes ), did not report on data per cycle and was not included in the calculation of CAF rates. The meta-analysis showed that TARC was 78.2% [95% confidence interval (CI) 68.8–85.3%, I2 = 99.17], CAF1 was 81.8% (73.3–88.1%, I2 = 98.66) and CAF2 was 75.3% (68.2–81.2%, I2 = 95.64).
Figure 2

Typical regimen compliance (event rate and 95% CIs) in ART treatment (TARC).

Typical regimen compliance (event rate and 95% CIs) in ART treatment (TARC).

Subgroup and meta-regression analyses

Table IV presents the results of subgroup analysis performed. It was not possible to perform subgroup analysis on the basis of the geographical location of the clinic, parity or embryo transfer policy, whether treatment was subsidized/reimbursed, the definition of initiated cycle and handling of cryopreserved embryo transfers, because at least one of the subgroups had only one or no study. Variability among studies was explained only by how compliance was defined. More precisely, those studies that reported on the number of doctor-censored patients (and thus considered them to be compliers) presented higher TARC rates than studies that did not. The differences observed between subgroups related to the study design and population, length of follow-up and definition of treatment success were not significant. Finally, meta-regressions showed that publication year was not significantly related to compliance (TARC: Slope = 0.08, Z = 1.716, P = 0.086; CAF1: Slope = 0.06, Z = 1.312, P = 0.190; CAF2: Slope = 0.03, Z = 0.813, P = 0.416).
Table IV

Compliance rates (typical and after the first or second failed cycle) according to subgroup analysis.

VariablesTypical ART regimen compliance (TARC)
Compliance after first failed cycle (CAF1)
Compliance after second failed cycle (CAF2)
kCompliance rate95% CI LL95% CI ULχ2kCompliance rate95% CI LL95% CI ULχ2Compliance rate95% CI LL95% CI ULχ2
Clinical
 Population0.0850.1150.351
  General ART777.364.086.7680.969.588.773.762.782.3
  Selected ART population380.260.291.6383.668.292.478.564.188.2
 Geographic locationNANANA
  Europe979.467.287.9882.672.089.876.667.184.1
  USA165.622.692.6176.136.694.664.833.587.1
Patient
 ParityNANANA
  0180.072.286.0191.384.195.477.666.985.6
  ≥1 child165.663.667.5176.173.978.164.861.667.9
Methodological
 Prospective design0.4391.6170.479
  Yes283.263.193.5289.274.395.979.464.589.1
  No876.866.384.8779.469.386.874.166.380.7
 Data available on number doctors-censored patients4.088*0.6423.341
  Yes584.275.590.2484.673.991.480.472.786.3
  No570.658.380.5579.267.987.370.762.777.7
 Length of follow-up0.0070.2670.651
  Twelve months or more577.061.687.4479.165.288.471.360.480.1
  >12 months278.052.791.9283.966.093377.963.687.6
 Definition of initiated cycleNANANA
  Started hormonal stimulation980.573.785.9883.878.787.977.270.183.0
  Had oocyte retrieval149.423.875.4157.735.876.959.235.479.4
 Definition of treatment success0.1410.9050.148
  Live birth276.345.492.6283.861.794.375.453.888.9
  Positive scan at 6/7 weeks577.158.888.9477.259.688.673.858.484.9
  Positive βhCG urine/blood test381.158.492.9386.169.994.377.660.688.7
 IVF cycles exclude cryopreserved embryo transfersNANANA
  No485.380.289.2487.483.490.683.280.785.4
  Yes165.649.279.0176.173.978.164.861.667.9
 Quality0.0150.1010.239
  High778.665.987.4681.070.388.474.063.582.3
  Moderate377.356.490.0383.369.091.877.864.187.3

*P< 0.05, **P< 0.01, *** P< 0.001, k = number of studies, CI = confidence intervals, LL = lower limit, UL = upper limit, NA = not applicable because at least one of the subgroups only has one study, bold indicates P< 0.05.

Compliance rates (typical and after the first or second failed cycle) according to subgroup analysis. *P< 0.05, **P< 0.01, *** P< 0.001, k = number of studies, CI = confidence intervals, LL = lower limit, UL = upper limit, NA = not applicable because at least one of the subgroups only has one study, bold indicates P< 0.05. We excluded the study by Rufat ) from the examination of associations between per cycle success rates of the clinics and subsequent compliance because, as already explained, Rufat pooled data from several fertility clinics. In addition, another study, Brandes , did not report data per cycle and could not be included. The clinic's first-cycle success rate was not significantly associated with compliance after that first cycle (CAF1: Slope = 0.49, Z = 0.196, P = 0.845) and the clinic's second-cycle success rate was not associated with compliance after that second cycle (CAF2: Slope = 0.34, Z = 0.137, P = 0.891).

Study quality and publication bias

We performed subgroup analysis according to study quality (moderate or high) but the results of this analysis were not significant (see Table IV). Egger's test indicated the presence of publication bias for TARC (intercept = 14.21, t = 6.045, P < 0.001), CAF1 (intercept = 10.54, t = 5.31, P = 0.001) and CAF2 (intercept = 6.22, t = 4.70, P = 0.002). Investigation of publication bias through visual inspection of the funnel plot (see Supplementary data, Figs. S1–III) was confirmed only for CAF1, where one study was found to the lower right of the pooled compliance rate and none to the left (Supplementary data, Fig. S2). The trim and fill method only identified one missing study for CAF1, estimating a new compliance rate of 80.5% (95% CI 72.0–86.9).

Compliance and treatment success rates

Two studies (Brandes ; Pearson ) did not report on the number of pregnancies achieved for the first three ART cycles and were not included in the calculation of the overall and typical regimen success rates. The overall success rate for the first three cycles, which included everyone who started treatment, was 42.7% (32.6–53.6%, I2 = 98·8%). The typical regimen cycle success rate, which included only compliers (as defined in the TARC formula), was 57.9% (49.4–65.9%, I2 = 97.0%).

Discussion

This meta-analysis shows that the vast majority of patients will comply with the typical ART regimen of three cycles, with about 2 of 10 patients discontinuing treatment earlier than would have been expected. Although many studies have pointed to alarmingly low compliance rates in ART (Malcolm and Cumming, 2004; Rajkhowa ), doctors can expect that 78% of patients will opt to undergo their ART regimen until they achieve pregnancy or are advised to end treatment. Compliance is likely to decrease with ART failure, from 82% after the first failed cycle to 75% after the second failed cycle, but the decrease does not seem to be a function of the efficacy of the clinic. Compliance rates varied between 71 and 84% as a function of how compliance was defined (especially inclusion or exclusion of doctor-censored patients). Results suggest that a less rigorous definition of compliance may result in it being underestimated. To reach a definitive estimation of compliance in fertility treatment, researchers and practitioners need to reach consensus on the definition, monitoring and reporting of compliance. The chance of achieving a pregnancy for patients who initiated a typical three-cycle ART regimen was 43%, but 58% for those who complied. Patients need to be informed from the start of treatment of the possibility of facing a compliance decision (i.e. to continue treatment or not) and that chances of treatment success are optimal when people comply with recommendations. The typical regimen ART compliance rate was 78% in a patient population who was expected to undergo treatment until they achieved pregnancy or were recommended to end treatment. It is reassuring for patients and clinics alike to realize that only about 2 of 10 patients do not comply with recommendations. Studies that reported on doctor censoring yielded even higher compliance rates. The reporting of active censoring is critical because it allows calculation of a compliance rate that takes into account whether the end of treatment was due to patient initiative or due to doctor recommendation. Including actively censored patients in the discontinuation group is misleading because these patients comply with medical recommendation. However, many studies do not consider this or other conceptual issues such as differentiation between permanent and temporary discontinuation or between definitive abandonment of treatment or of treatment at a given clinic only. This hinders research on compliance in ART, not only when assessing its prevalence but also when trying to understand its causes. This meta-analysis showed that when only the best available evidence is considered, compliance is 84%, supporting the idea that compliance in ART is indeed high. The finding that compliance decreased with successive experience of unsuccessful cycles suggests that failure discourages couples from carrying on with treatment (Akyuz and Sever, 2009), maybe as a result of a subjective perception of poor prognosis or other factors such as cost. Although we could not do a subgroup analysis considering whether treatment was subsidized/reimbursed, the compliance rate when we considered only studies that clearly stated that treatment was subsidized/reimbursed was 84%. It may also be that ART is too demanding, an explanation consistent with patients' own stated reasons for discontinuation (Smeenk ; Verhaak ; Brandes ; Boivin ; Gameiro ). As such, the compliance rate can also indicate that for 22% of couples the cost of treatment (financial, emotional) may be too high. It is relevant to note that the clinic's success rate per cycle (first and second cycles) was not associated with subsequent compliance, indicating that the clinic's efficacy does not dictate the compliance of their patient, despite strong beliefs within clinical communities that patients leave clinics with lower success rates (Marcus ). It may be that patients disregard clinic success rates in favour of subjective perceptions of individual chances of success. It may also be that patients consider other outcomes beyond efficacy such as quality of care (van Empel ) when considering uptake of further treatment. Our results show that in every 100 typical couples starting ART treatment, 78 comply with three cycles and of these, 43 can expect to achieve pregnancy or live birth. However, if full compliance could be reached, 58 patients would achieve a pregnancy or live birth, which represents a 15% higher rate of success (if all other factors, including prognosis, are equal across three ART cycles). Therefore, addressing causes of non-compliance could help more people become parents, with a maximum estimated increase in success rates of 15%. In terms of number of treatment cycles, we would expect each clinic in Europe to carry out an additional 110 cycles per year if there was full compliance (based on European data: 402,039 cycles for 2007 in 1029 reporting clinics, excludes frozen embryo transfers, de Mouzon ). Although a more precise knowledge of why patients discontinue treatment is still lacking, there are indications that to increase compliance clinics should focus on organizing treatments so that burden is diminished as much as possible and ensuring that patients receive support to meet the demands of treatment (see Gameiro for reasons for discontinuation and Boivin for an integrated model of fertility care). In addition, more explicit communication about compliance with patients and between health care providers is needed. Reports have shown that only 60% of women deciding to stop fertility treatment were satisfied with their decision (Peddie ) and most felt they lacked the necessary information and counselling support (Peddie ). Explicit information that ART success is likely to require multiple cycles and that treatment may entail emotional and physical side effects and disruptions to daily life, for example, would help address issues previously cited as causes of discontinuation (Rauprich ; Boivin ; Gameiro ) and help patients have more realistic expectations of what a typical ART regimen entails for an optimal chance of pregnancy.

Strengths and limitations

Considering the increasing debate surrounding the issue of compliance in fertility treatment, and in particular in ART, a meta-analysis on this literature was timely and appropriate. The strengths of this review are its systematic review of 30 years of research on discontinuation from seven databases, which yielded 10 studies from five countries, sampling the treatment trajectories of 14 810 patients. Data were independently extracted and quality evaluations made according to standard protocols for all studies. Compliance rates were calculated according to a clearly defined specification for the typical ART regimen and after-ART failure, which was consistent with ART practice and guidelines. Analytic methods included the overall meta-analysis and a priori-defined subgroup analyses according to relevant clinical, patient and methodological characteristics. Publication bias, including trim and fill, provided reliable estimates for ‘missing’ studies. Finally, although high heterogeneity in compliance rates was observed (above 95%), it was mainly due to statistical artefact and methodological issues. By statistical artefact, we mean that the majority of published meta-analyses report on effect sizes (e.g. risk ratios) from which it is statistically possible to remove the between-studies variance in base rates for the phenomenon under investigation (e.g. 1% difference in a base rate of 3 and 4% versus 80 and 81%). However, this is not the case in single-group studies and therefore meta-analyses of prevalence rates invariably produces high heterogeneity (Borenstein , e.g. I2 of 94% in a recent meta-analysis of the prevalence of depression in primary care, Mitchell and Sanjay Rao, 2009). All studies were published in peer-reviewed journals. They were of moderate to high quality and the quality of the studies was due to the fact that all used representative samples, and most studies could demonstrate homogeneity between compliers and non-compliers at the start of treatment and provided high completion rates for follow-up. The presence of publication bias for compliance after the first failed cycle (CAF1) did not markedly influence the magnitude of the rate reported (estimated to be 1.3% lower). Despite these strengths, there were some limitations in primary research that were transmitted to the meta-analysis. In particular, the research does not provide a full account of patient progression through ART. Studies report on the proportion of patients who opted to undergo or stop treatment, but do not fully explain what then happened to patients registered as ending treatment at a particular clinic. These patients may have permanently ended treatment, as we assume, or they may have temporarily stopped or moved to another clinic. Analysis of the forest plot also revealed that one study (Rufat ) presented a somewhat lower compliance rate with typical ART regimen than the other studies. This is one of the only two studies (Rufat ; Stolwijk ) that cover the pre-ICSI period when many causes of male infertility could not be addressed with treatment, which could explain the lower compliance reported. Studies focusing on groups of patients with poor prognosis or on specific treatments were excluded from analysis to control for clinical heterogeneity (i.e. use of specialist treatments, defined clinical subpopulations) so compliance in these groups is not known. Although these limitations need to be considered and addressed in the interpretation of the study findings and future research, the strengths of the systematic review and meta-analytic procedures adopted support the view that the compliance estimates reported are reliable and reflect current best available evidence.

Conclusions and future research

Our results show that ∼78% of patients undergo the cycles offered as part of the typical ART regimen, with uptake lower after ART failures but still high (82 and 75% after the first and the second failed cycles, respectively). These estimates are reassuring and should be transmitted to patients, who need to be informed from the start of treatment that, although ART is demanding, 8 out of every 10 patients comply with the typical regimen and that compliance with recommended cycles will offer the most optimal chance of success. Decision support should be developed to help people choose the best option (compliance, discontinuation) as ∼22% will decide to end treatment for personal reasons and these patients need to be helped to reach equipoise about this decision. Future research should focus on trying to understand why patients discontinue treatment. Despite these encouraging results, a definitive estimate of compliance may still be lacking because of primary research not providing a full account of patients' progression through the ART cycles. To progress compliance research, clinicians and researchers need to reach conceptual and methodological consensus on what is compliance and how to monitor it. An accurate assessment of compliance requires reporting the number of patients who undergo the typical ART regimen. While we studied three cycles, more or fewer ART cycles could be recommended depending on the patient population (e.g. poor responders) and ART protocol (e.g. minimal stimulation ART). In addition, patients who temporarily stop treatment, move on to another clinic or, as noted, are advised to end treatment should not be considered as non-compliers. In ART, there is no a priori time period in which the typical ART regimen should be completed. Most studies, therefore, set time limits for undergoing another cycle, typically 12 months, after which patients are considered to have abandoned treatment. These time limits should be evaluated for their representativeness of typical cycle uptake and, when used, reported. There is voluminous literature on success rates in ART yet few studies also report the number of patients opting not to undergo ART, which undermines the research base. Finally, it should be noted that the literature focuses exclusively on not undergoing the typical ART regimen (i.e. premature discontinuation). However, non-compliance can also occur when patients are advised to stop treatment but resist this idea (Boivin ) and choose to continue ART at other clinics (i.e. over-persistence). This behaviour should also be monitored to reach an accurate estimation of the prevalence of ‘over-persistence’ and to obtain a better understanding of why couples are not able to follow recommendations to stop treatment. In summary, a precise assessment of compliance implies monitoring patients' long-term treatment trajectories. Such an endeavour requires the inclusion of compliance in national ART registers (e.g. in the UK the Human Fertilisation and Embryology Authority).

Supplementary data

Supplementary data are available at http://humupd.oxfordjournals.org/.

Authors' roles

S.G. did data extraction, critical appraisal, data analysis, data interpretation and writing of the report. J.B. and C.M.V. did critical appraisal, data interpretation and writing of the report. J.A.M.K. contributed to the writing of the report.

Funding

S.G. holds a postdoctoral fellowship from the Portuguese Foundation for Science and Technology (FCT-SFRH/BPD/63063/2009). Merck Serono SA, Switzerland, funded the systematic literature search that was performed by Sure Support Unit for Research Evidence, Cardiff University and the second coding of articles that was performed by Debbie Moss and Caudex Medical. Merck Serono SA performed a scientific review of the publication but the views and opinions described in the publication are those of the authors.
  35 in total

Review 1.  Women's emotional adjustment to IVF: a systematic review of 25 years of research.

Authors:  C M Verhaak; J M J Smeenk; A W M Evers; J A M Kremer; F W Kraaimaat; D D M Braat
Journal:  Hum Reprod Update       Date:  2006-08-29       Impact factor: 15.610

2.  Reasons for discontinuation of IVF treatment: a questionnaire study.

Authors:  M Rajkhowa; A McConnell; G E Thomas
Journal:  Hum Reprod       Date:  2005-11-03       Impact factor: 6.918

3.  World collaborative report on Assisted Reproductive Technology, 2002.

Authors:  Jacques de Mouzon; Paul Lancaster; Karl Gosta Nygren; Elisabeth Sullivan; Fernando Zegers-Hochschild; Ragaa Mansour; Osamu Ishihara; David Adamson
Journal:  Hum Reprod       Date:  2009-05-27       Impact factor: 6.918

4.  Revised minimum standards for practices offering assisted reproductive technologies.

Authors: 
Journal:  Fertil Steril       Date:  2008-11       Impact factor: 7.329

5.  What is the most accurate estimate of pregnancy rates in IVF dropouts?

Authors:  T E M Verhagen; J C M Dumoulin; J L H Evers; J A Land
Journal:  Hum Reprod       Date:  2008-06-20       Impact factor: 6.918

6.  Cumulative live birth rates in cohorts of patients treated with in vitro fertilization or intracytoplasmic sperm injection.

Authors:  Cynthia Witsenburg; Sandra Dieben; Lucette Van der Westerlaken; Harjo Verburg; Nico Naaktgeboren
Journal:  Fertil Steril       Date:  2005-07       Impact factor: 7.329

7.  International estimates of infertility prevalence and treatment-seeking: potential need and demand for infertility medical care.

Authors:  Jacky Boivin; Laura Bunting; John A Collins; Karl G Nygren
Journal:  Hum Reprod       Date:  2007-03-21       Impact factor: 6.918

8.  A qualitative study of women's decision-making at the end of IVF treatment.

Authors:  V L Peddie; E van Teijlingen; S Bhattacharya
Journal:  Hum Reprod       Date:  2005-03-31       Impact factor: 6.918

9.  Point of failure as a predictor of in vitro fertilization treatment discontinuation.

Authors:  Kimberly R Pearson; Russ Hauser; Daniel W Cramer; Stacey A Missmer
Journal:  Fertil Steril       Date:  2008-09-30       Impact factor: 7.329

10.  Clinical diagnosis of depression in primary care: a meta-analysis.

Authors:  Alex J Mitchell; Amol Vaze; Sanjay Rao
Journal:  Lancet       Date:  2009-07-27       Impact factor: 79.321

View more
  12 in total

1.  Couples' discontinuation of fertility treatments: a longitudinal study on demographic, biomedical, and psychosocial risk factors.

Authors:  Juliana Pedro; Maria Pedro Sobral; Joana Mesquita-Guimarães; Carla Leal; Maria E Costa; Mariana V Martins
Journal:  J Assist Reprod Genet       Date:  2016-11-29       Impact factor: 3.412

2.  Burden of care is the primary reason why insured women terminate in vitro fertilization treatment.

Authors:  Alice D Domar; Kristin Rooney; Michele R Hacker; Denny Sakkas; Laura E Dodge
Journal:  Fertil Steril       Date:  2018-06       Impact factor: 7.329

3.  Fertility and pregnancy outcome among women undergoing assisted reproductive technology treatment in Windhoek, Namibia.

Authors:  Adão Francisco Lucas; Dibaba B Gemechu; Stefan S Du Plessis; Yapo G Aboua
Journal:  J Assist Reprod Genet       Date:  2021-02-08       Impact factor: 3.412

4.  Realizing a desired family size: when should couples start?

Authors:  J Dik F Habbema; Marinus J C Eijkemans; Henri Leridon; Egbert R te Velde
Journal:  Hum Reprod       Date:  2015-07-15       Impact factor: 6.918

5.  DrawingOut - An innovative drawing workshop method to support the generation and dissemination of research findings.

Authors:  Sofia Gameiro; Berit Bliesemann de Guevara; Elisabeth El Refaie; Alida Payson
Journal:  PLoS One       Date:  2018-09-20       Impact factor: 3.240

6.  Predictors for the Early Termination of a Psychological Intervention During Treatment with Assisted Reproductive Technologies.

Authors:  Marie Bernd; Maren Schick; Sabine Rösner; Ariane Germeyer; Thomas Strowitzki; Markus Moessner; Stephanie Bauer; Beate Ditzen; Tewes Wischmann
Journal:  Geburtshilfe Frauenheilkd       Date:  2020-02-21       Impact factor: 2.915

7.  Study of pentoxifylline effects on motility and viability of spermatozoa from infertile asthenozoospermic males.

Authors:  Aliye Ghasemzadeh; Farid Karkon-Shayan; Solmaz Yousefzadeh; Mohammad Naghavi-Behzad; Kobra Hamdi
Journal:  Niger Med J       Date:  2016 Nov-Dec

8.  Laughs and Jokes in Assisted Reproductive Technologies: Quantitative and Qualitative Analysis of Video-Recorded Doctor-Couple Visits.

Authors:  Silvia Poli; Lidia Borghi; Martina De Stasio; Daniela Leone; Elena Vegni
Journal:  Front Psychol       Date:  2021-04-14

9.  Talking about possible IVF/ICSI failure and need for multiple cycles in treatment planning: qualitative investigation of multi-cycle planning and its acceptability to patients and staff.

Authors:  C Harrison; J Boivin; S Gameiro
Journal:  Hum Reprod       Date:  2022-03-01       Impact factor: 6.918

10.  Optimising Follicular Development, Pituitary Suppression, Triggering and Luteal Phase Support During Assisted Reproductive Technology: A Delphi Consensus.

Authors:  Raoul Orvieto; Christos A Venetis; Human M Fatemi; Thomas D'Hooghe; Robert Fischer; Yulia Koloda; Marcos Horton; Michael Grynberg; Salvatore Longobardi; Sandro C Esteves; Sesh K Sunkara; Yuan Li; Carlo Alviggi
Journal:  Front Endocrinol (Lausanne)       Date:  2021-05-10       Impact factor: 5.555

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.