| Literature DB >> 25853703 |
Andrew D A C Smith1, Kate Tilling1, Debbie A Lawlor1, Scott M Nelson2.
Abstract
BACKGROUND: Accurately predicting the probability of a live birth after in vitro fertilisation (IVF) is important for patients, healthcare providers and policy makers. Two prediction models (Templeton and IVFpredict) have been previously developed from UK data and are widely used internationally. The more recent of these, IVFpredict, was shown to have greater predictive power in the development dataset. The aim of this study was external validation of the two models and comparison of their predictive ability. METHODS ANDEntities:
Mesh:
Year: 2015 PMID: 25853703 PMCID: PMC4390202 DOI: 10.1371/journal.pone.0121357
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Inclusion/exclusion criteria and formation of the validation sample.
Characteristics of patients and treatment in the validation sample, consisting of 130,960 IVF cycles.
| Characteristic | Number in sample | Percentage of sample | Percentage using donor eggs |
|---|---|---|---|
| Live births | 33,553 | 25.6% | |
| Patient age at treatment (years) | |||
| 18–34 | 53,556 | 40.9% | 1.0% |
| 35–37 | 31,240 | 23.4% | 1.6% |
| 38–39 | 21,348 | 16.3% | 2.2% |
| 40–42 | 18,143 | 13.9% | 4.8% |
| 43–44 | 4,688 | 3.6% | 14.0% |
| 45–50 | 1,985 | 1.5% | 49.0% |
| Duration of infertility (years) | |||
| 0 | 3,125 | 2.4% | |
| 1–3 | 59,996 | 45.8% | |
| 4–6 | 44,400 | 33.9% | |
| 7–9 | 14,485 | 11.1% | |
| 10–12 | 5,560 | 4.3% | |
| 13- | 3,394 | 2.6% | |
| Previous live birth | |||
| Yes | 23,444 | 17.9% | |
| No | 107,516 | 82.1% | |
| Previous unsuccessful IVF cycles | |||
| 0 | 83,838 | 64.0% | |
| 1 | 25,051 | 19.1% | |
| 2 | 11,862 | 9.1% | |
| 3- | 10,209 | 7.8% | |
| Treatment type | |||
| IVF | 63,245 | 48.3% | |
| IVF plus ICSI | 67,715 | 51.7% | |
| Source of egg | |||
| Patient | 126,962 | 97.0% | |
| Donor | 3,998 | 3.0% | |
| Cause of infertility | |||
| Unexplained | 45,928 | 35.1% | |
| Tubal only | 15,244 | 11.6% | |
| Ovulatory only | 8,526 | 6.5% | |
| Endometriosis only | 5,164 | 3.9% | |
| Cervical only | 5 | 0.0% | |
| Male cause only | 43,663 | 33.3% | |
| Multiple causes | 12,430 | 9.5% | |
Area under the receiver-operator curve for IVFpredict and Templeton models, and female age alone, for predicting live birth from 130,960 IVF cycles.
| AUROC (95% CI) | Difference from row above | |
|---|---|---|
| IVFpredict, development sample | 0.635 (0.630, 0.637) | |
| IVFpredict | 0.628 (0.625, 0.631) | 0.006 ( |
| Templeton model | 0.616 (0.613, 0.620) | 0.012 ( |
| Continuous female age | 0.610 (0.606, 0.612) | 0.007 ( |
| Categorical female age | 0.604 (0.601, 0.608) | 0.004 ( |
* Based on 144,018 cycles occurring in the UK between 2003 and 2007 [12].
** Model predicting decreasing probability of live birth with increasing female age.
*** Model predicting decreasing probability of live birth with increasing female age categories (18–34, 35–37, 38–39, 40–42, 43–44 and 45–50 years).
Fig 2Calibration plot for the IVFpredict and Templeton models.
Based on 130,960 IVF cycles. Hosmer-Lemeshow test statistics: p<0.001. Solid line, IVFpredict model; dashed line, Templeton model; dotted, diagonal line, perfect prediction (reference).
Fig 3Calibration plot for updated IVFpredict and Templeton models.
Based on 130,960 IVF cycles. Hosmer-Lemeshow test statistics: p<0.001. Solid line, updated IVFpredict model; dashed line, updated Templeton model; dotted, diagonal line, perfect prediction (reference).
Observed and predicted live birth rates from updated IVFpredict and Templeton models stratified by characteristics of patients and treatment, in 130,960 IVF cycles.
| IVFpredict | Templeton | ||||
|---|---|---|---|---|---|
| Observed live birth rate | Predicted live birth rate (SD) | Ratio predicted to observed (95% CI) | Predicted live birth rate (SD) | Ratio predicted to observed (95% CI) | |
| Female age (years) patient eggs | |||||
| 18–34 | 0.326 | 0.330 (0.060) | 1.013 (1.000, 1.025) | 0.327 (0.058) | 1.004 (0.992, 1.016) |
| 35–37 | 0.273 | 0.273 (0.056) | 0.999 (0.980, 1.017) | 0.272 (0.066) | 0.996 (0.978, 1.014) |
| 38–39 | 0.198 | 0.202 (0.047) | 1.020 (0.992, 1.047) | 0.210 (0.059) | 1.062 (1.033, 1.091) |
| 40–42 | 0.129 | 0.123 (0.033) | 0.949 (0.913, 0.986) | 0.148 (0.051) | 1.140 (1.096, 1.184) |
| 43–44 | 0.050 | 0.045 (0.015) | 0.907 (0.785, 1.029) | 0.079 (0.029) | 1.574 (1.362, 1.785) |
| 45–50 | 0.020 | 0.022 (0.014) | 1.120 (0.631, 1.609) | 0.036 (0.016) | 1.827 (1.031, 2.623) |
|
|
| ||||
| Female age (years) donor eggs | |||||
| 18–34 | 0.301 | 0.317 (0.067) | 1.054 (0.917, 1.191) | 0.322 (0.064) | 1.070 (0.931, 1.209) |
| 35–37 | 0.324 | 0.345 (0.072) | 1.064 (0.930, 1.199) | 0.257 (0.069) | 0.792 (0.692, 0.893) |
| 38–39 | 0.328 | 0.236 (0.055) | 0.721 (0.627, 0.815) | 0.187 (0.052) | 0.569 (0.494, 0.645) |
| 40–42 | 0.328 | 0.300 (0.068) | 0.914 (0.826, 1.002) | 0.123 (0.045) | 0.376 (0.339, 0.412) |
| 43–44 | 0.344 | 0.253 (0.080) | 0.736 (0.656, 0.815) | 0.068 (0.027) | 0.197 (0.175, 0.219) |
| 45–50 | 0.327 | 0.276 (0.117) | 0.844 (0.766, 0.921) | 0.026 (0.017) | 0.079 (0.071, 0.086) |
|
|
| ||||
| Source of egg | |||||
| Patient | 0.254 | 0.255 (0.099) | 1.005 (0.996, 1.014) | 0.260 (0.094) | 1.023 (1.013, 1.032) |
| Donor | 0.326 | 0.287 (0.090) | 0.879 (0.839, 0.919) | 0.141 (0.112) | 0.433 (0.411, 0.455) |
|
|
| ||||
| Duration of infertility (years) | |||||
| 0 | 0.267 | 0.330 (0.118) | 1.237 (1.167, 1.308) | 0.264 (0.105) | 0.988 (0.932, 1.044) |
| 1–3 | 0.269 | 0.271 (0.103) | 1.007 (0.994, 1.020) | 0.281 (0.093) | 1.044 (1.030, 1.057) |
| 4–6 | 0.259 | 0.249 (0.089) | 0.962 (0.947, 0.977) | 0.237 (0.088) | 0.918 (0.904, 0.932) |
| 7–9 | 0.229 | 0.236 (0.093) | 1.032 (1.001, 1.062) | 0.242 (0.104) | 1.060 (1.028, 1.091) |
| 10–12 | 0.212 | 0.211 (0.086) | 0.998 (0.949, 1.048) | 0.230 (0.104) | 1.088 (1.033, 1.143) |
| 13- | 0.181 | 0.191 (0.090) | 1.055 (0.981, 1.130) | 0.164 (0.092) | 0.910 (0.845, 0.975) |
|
|
| ||||
| Previous unsuccessful IVF cycles | |||||
| 0 | 0.275 | 0.289 (0.094) | 1.050 (1.039, 1.061) | 0.280 (0.096) | 1.017 (1.006, 1.028) |
| 1 | 0.224 | 0.205 (0.079) | 0.915 (0.895, 0.936) | 0.232 (0.083) | 1.037 (1.014, 1.061) |
| 2 | 0.227 | 0.196 (0.075) | 0.861 (0.833, 0.889) | 0.209 (0.080) | 0.920 (0.890, 0.951) |
| 3+ | 0.211 | 0.181 (0.080) | 0.859 (0.827, 0.891) | 0.173 (0.077) | 0.821 (0.791, 0.852) |
|
|
| ||||
| Previous live birth by IVF | |||||
| No | 0.253 | 0.251 (0.096) | 0.993 (0.983, 1.003) | 0.246 (0.086) | 0.975 (0.965, 0.985) |
| Yes | 0.272 | 0.280 (0.111) | 1.028 (1.007, 1.049) | 0.301 (0.127) | 1.105 (1.082, 1.128) |
|
|
| ||||
| Treatment type | |||||
| IVF | 0.234 | 0.234 (0.091) | 1.001 (0.987, 1.015) | 0.249 (0.102) | 1.065 (1.050, 1.080) |
| ICSI | 0.277 | 0.277 (0.095) | 0.999 (0.987, 1.011) | 0.263 (0.098) | 0.949 (0.938, 0.960) |
|
|
| ||||
| Cause of infertility | |||||
| Unexplained | 0.242 | 0.244 (0.102) | 1.007 (0.991, 1.023) | 0.237 (0.098) | 0.980 (0.964, 0.995) |
| Tubal only | 0.237 | 0.233 (0.085) | 0.982 (0.955, 1.010) | 0.255 (0.093) | 1.074 (1.044, 1.104) |
| Ovulatory only | 0.281 | 0.267 (0.093) | 0.950 (0.918, 0.982) | 0.266 (0.098) | 0.949 (0.917, 0.981) |
| Endometriosis only | 0.253 | 0.253 (0.085) | 0.998 (0.952, 1.043) | 0.261 (0.086) | 1.029 (0.982, 1.077) |
| Male cause only | 0.271 | 0.274 (0.102) | 1.011 (0.995, 1.026) | 0.271 (0.097) | 1.002 (0.987, 1.017) |
| Multiple causes | 0.264 | 0.264 (0.094) | 0.997 (0.968, 1.026) | 0.266 (0.092) | 1.006 (0.977, 1.035) |
|
|
| ||||
P-values are for differences between observed and predicted number of live births, and those smaller than the threshold for a family-wise error rate of 5% are highlighted.
* Not including 5 cycles where cervical cause of infertility only, for which meaningful confidence intervals cannot be calculated.
Brier scores and decomposition for IVFpredict and Templeton models, from 130,960 IVF cycles.
| Scaled modified (Sanders) Brier score | Scaled reliability (in-the-small) | Scaled uncertainty | |
|---|---|---|---|
| Templeton model | |||
| Original | 1.065 | 0.096 | 0.032 |
| Updated | 0.971 | 0.003 | 0.032 |
| IVFpredict | |||
| Original | 0.965 | 0.004 | 0.039 |
| Updated | 0.963 | 0.001 | 0.039 |
Scaled modified Brier score = 1 + Scaled reliability—Scaled uncertainty.
Net reclassification index for IVFpredict compared with the Templeton model.
| Original | ||||
| Prognosis | No live birth | Live birth | ||
| IVFpredict higher than Templeton | 95,304 | (97.8%) | 33,311 | (99.3%) |
| Templeton higher than IVFpredict | 2,103 | (2.2%) | 242 | (0.7%) |
| Net reclassification index | 2.88% from Templeton to IVFpredict | |||
| Updated | ||||
| Prognosis | No live birth | Live birth | ||
| IVFpredict higher than Templeton | 46,750 | (48.0%) | 18,143 | (54.1%) |
| Templeton higher than IVFpredict | 50,654 | (52.0%) | 15,410 | (45.9%) |
| Net reclassification index | 12.2% from Templeton to IVFpredict | |||
* Not including 3 cycles with equal prognosis in the updated IVFpredict and Templeton models.