| Literature DB >> 21931598 |
Stephanie Medlock1, Anita C J Ravelli, Pieter Tamminga, Ben W M Mol, Ameen Abu-Hanna.
Abstract
CONTEXT: Being born very preterm is associated with elevated risk for neonatal mortality. The aim of this review is to give an overview of prediction models for mortality in very premature infants, assess their quality, identify important predictor variables, and provide recommendations for development of future models.Entities:
Mesh:
Year: 2011 PMID: 21931598 PMCID: PMC3169543 DOI: 10.1371/journal.pone.0023441
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Results of search.
This review focuses on the 41 development studies, which describe previously unpublished models predicting mortality.
Figure 2Classification of studies by the time at which a prediction can be made.
Summary of prediction models by time of prediction. The time of prediction is the point in time where a prediction can be made using the model. A model for antenatal prediction was reported in 5 studies, for prediction at live birth in 17 studies, upon NICU admission in 8 studies, from the first day of life in 12 studies, and after the first day of life in 6 studies. Two studies do not specify when the data should be collected or when a prediction can be made. Studies marked with an asterisk met our criteria for methodological quality.
Summary of performance measures reported in the 41 development studies.
| measure | n studies | lowest | highest | mean | median | |
| discrimination | AUC | 18/41 | 0.698 | 0.954 | 0.8583 | 0.87 |
| calibration | H-L p | 19/41 | <0.01 | 0.99 | 0.5757 | 0.6255 |
| accuracy | % correct / accuracy | 9/41 | 0.62 | 0.883 | 0.7857 | 0.79 |
| PPV | 14/41 | 0.44 | 0.84 | 0.703 | 0.75 | |
| NPV | 15/41 | 0.522 | 0.94 | 0.7986 | 0.81 | |
| sensitivity | 10/41 | 0.17 | 0.95 | 0.6474 | 0.72 | |
| specificity | 13/41 | 0.32% | 1.00 | 0.7848 | 0.85 | |
| discrimination, calibration, accuracy | R2 | 4/41 | 0.30 | 0.69 | 0.3588 | 0.39 |
| range | 15/41 | 27–52% | 1–98% | -- | -- |
Overview of performance measures reported by ≥2 studies. At least one performance measure was required for inclusion; most studies reported more than one. A summary of measures per study is given in Table S2, and the measures reported for each model are in Table S3. The heterogeneity of the studies precludes direct comparison of model performance.
AUC = area under the curve; ideal = 1.0 and chance = 0.5.
R2 = coefficient of determination; the proportion of variability that is accounted for by the model. ideal = 1.0.
H-L p = Hosmer-Lemeshow p value; any non-significant value indicates acceptable calibration.
% correct; ideal = 1.0 and chance ≈ %survival2.
PPV = positive predictive value; ideal = 1.0 and chance related to prevalence and cut-off.
NPV = negative predictive value; ideal = 1.0 and chance related to prevalence and cut-off.
sensitivity; ideal = 1.0 and chance related to prevalence.
specificity; ideal = 1.0 and chance related to prevalence.
range = the range of probabilities generated by the model.
Note that performance measures of unvalidated models may be overestimated.
Comparison of predictive performance of models as reported in the development studies.
| VLGA/BW, post-surfactant, developed countries, AUC | |||
| age/weight range | mortality rate | performance | |
| Zernikow 1998 | <1500 g or <32 w | 8.3% | AUC = 0.954 (SD 0.015) |
| Parry 2003 | < = 32 w | 7.9% | AUC = 0.92 (SE 0.01) |
| Evans 2007 | <1500 g or <32 w | 6.8% | AUC = 0.83 |
|
| |||
| Tyson 1996 | 501–800 g | 33.5% | AUC = 0.76 |
| Ambalavanan 2001 | <1000 g | 34% | AUC = 0.87 (SE 0.03) |
| Ambalavanan 2005 | 401–1000 g | 35% | AUC = 0.854 (SE 0.004) |
| Tyson 2008 | 401–1000 g and 22–25 w | 42% | AUC = 0.753 (95% CI 0.737–0.769) |
|
| |||
| Ambalavanan 2001 | <1000 g | 34% | R2 = 0.36 |
| Locatelli 2005 | <750 g and <34 w | 49.2% | R2 = 0.69 |
| Gargus 2009 | 401–1000 g | 34.4% | R2 = 0.4175 |
Studies which reported the same outcome measure were considered for comparison. Since the performance is only comparable in similar populations, the study populations were compared on whether the population was VLGA/BW or ELGA/BW, from a developed or developing country, from the pre- or post- surfactant era, and the reported mortality rate (an absolute difference of ≤10% was considered comparable). Only three categories contained more than two studies. All performance measures for each prediction model are given in Table S3.
*multiple models, using best performance of those reported.
performance assessed on development set, may be overestimated.
Figure 3Summary of framework for assessing the quality of studies reporting on the development of a new prediction model.
The quality framework consists of two parts, a methodological score with minimal criteria and a reporting score. The complete framework is available in the supporting text S1, along with an assessment of each included study.