| Literature DB >> 26398494 |
Allison Meisner, Kathleen F Kerr, Heather Thiessen-Philbrook, Steven G Coca, Chirag R Parikh.
Abstract
Individual biomarkers of renal injury are only modestly predictive of acute kidney injury (AKI). Using multiple biomarkers has the potential to improve predictive capacity. In this systematic review, statistical methods of articles developing biomarker combinations to predict AKI were assessed. We identified and described three potential sources of bias (resubstitution bias, model selection bias, and bias due to center differences) that may compromise the development of biomarker combinations. Fifteen studies reported developing kidney injury biomarker combinations for the prediction of AKI after cardiac surgery (8 articles), in the intensive care unit (4 articles), or other settings (3 articles). All studies were susceptible to at least one source of bias and did not account for or acknowledge the bias. Inadequate reporting often hindered our assessment of the articles. We then evaluated, when possible (7 articles), the performance of published biomarker combinations in the TRIBE-AKI cardiac surgery cohort. Predictive performance was markedly attenuated in six out of seven cases. Thus, deficiencies in analysis and reporting are avoidable, and care should be taken to provide accurate estimates of risk prediction model performance. Hence, rigorous design, analysis, and reporting of biomarker combination studies are essential to realizing the promise of biomarkers in clinical practice.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26398494 PMCID: PMC4805513 DOI: 10.1038/ki.2015.283
Source DB: PubMed Journal: Kidney Int ISSN: 0085-2538 Impact factor: 10.612
Figure 1Overview of Study Selection
Abbreviations: AKI = acute kidney injury.
Characteristics of Included Studies.
| First Author, Year | Journal | Biomarkers | Clinical Setting | AKI Outcome Definition | Sample Size | Method for Combining Biomarkers | |
|---|---|---|---|---|---|---|---|
| Cases | Controls | ||||||
| Vaidya, 2008 | Clinical and Translational Science | uKIM1, uNGAL, uIL18, uHGF, uCysC, uNAG, uVEGF, uIP10, total protein | Inpatient nephrology consultation service | Peak sCr > 50% increase over admission or known baseline | 102 | 102 | Logic regression |
|
| |||||||
| Han, 2009 | CJASN | uKIM1, uNAG, uNGAL | Cardiac surgery | ≥ 0.3 mg/dl increase in sCr from baseline or increase 2- to 3-fold within 72h | 36 | 54 | Logistic regression |
|
| |||||||
| Liangos, 2009 | Biomarkers | uKIM1, uNAG, uNGAL, uIL18, uCysC, u(α-1 microglobulin) | Cardiac surgery with CPB | ≥ 50% increase in sCr within 72h of CPB | 13 | 90 | Logistic regression |
|
| |||||||
| Che, 2010 | Nephron Clinical Practice | pCysC, uNGAL, uIL18, uRBP, uNAG | Cardiac surgery | ≥ 50% increase in SCr from baseline in 72h | 14 | 15 | Logistic regression |
|
| |||||||
| de Geus, 2011 | Nephron Extra | uNGAL, pNGAL, uCysC, pCysC | ICU admissions | ≥ 50% increase in SCr occurring and persisting for >24h after admission | 47 | 444 | Logistic regression |
|
| |||||||
| Parikh, 2011 | JASN | uIL18, uNGAL, pNGAL | Cardiac surgery | Dialysis or ≥ 100% increase in sCr | 60 | 1159 | Logistic regression |
|
| |||||||
| Katagiri, 2012 | Annals of Thoracic Surgery | uLFABP, uNAG | Cardiac surgery | ≥ 0.3 mg/dL or 50% increase in sCr from baseline within 3 days | 28 | 49 | Logistic regression |
|
| |||||||
| Kokkoris, 2012 | Renal Failure | pNGAL, uNGAL, pCysC, sCr | ICU admissions | Any AKI by RIFLE in 7 days | 36 | 64 | Logistic regression |
|
| |||||||
| Cho, 2013 | Journal of Korean Medical Science | uNGAL, uLFABP | ICU admissions | ≥ 0.3 mg/dL or ≥ 50% increase of sCr in 5 days | 54 | 91 | Unclear |
|
| |||||||
| Kambhampati, 2013 | Journal of Cardiovascular Surgery | Fluid balance, uNGAL, uIL18, pMCP-1, pTNFalpha | Cardiac surgery | ≥ 0.3 mg/dl increase in sCr in 48h | 27 | 73 | Logistic regression |
|
| |||||||
| Liu, 2013 | Biomarkers | uNGAL, uLFABP | Cardiac surgery | ≥ 0.3 mg/dL or ≥ 50% increase of sCr in 72h | 26 | 83 | Logistic regression |
|
| |||||||
| Luo, 2013 | Clinical Nephrology | uKIM1, uNGAL, uIL18 | Percutaneous coronary intervention | ≥ 0.5 mg/dL or ≥ 25% increase of sCr at 48h | 12 | 30 | Logistic regression |
|
| |||||||
| Parikh, 2013 | CJASN | uKIM1, uLFABP, uIL18, uNGAL, pNGAL | Cardiac surgery | Dialysis or ≥ 100% increase in sCr | 60 | 1159 | Logistic regression |
|
| |||||||
| Siew, 2013 | Kidney International | uNGAL, uLFABP, uCysC | ICU admissions | ≥ 0.3 mg/dL or ≥ 50% increase of sCr within 48h of biomarker measurement | 127 | 245 | Logistic regression |
|
| |||||||
| Zeng, 2014 | Clinical Chemistry and Laboratory Medicine | uNGAL, uLFABP | Admitted for major surgery | ≥ 0.3 mg/dL or ≥ 50% increase of SCr within 48h | 37 | 160 | Logistic regression |
Abbreviations: CJASN – Clinical Journal of the American Society of Nephrology; uKIM1 – urine kidney injury marker-1; uNGAL – urine neutrophil gelatinase-associated lipocalin; pNGAL – serum neutrophil gelatinase-associated lipocalin; uIL18 – urine interleukin 18; uCysC – urine cystatin C; pCysC – serum cystatin C; sCr – serum creatinine; uLFABP – urine liver-type fatty acid binding protein; uHGF – urine hepatocyte growth factor; uNAG – urine N-acetyl-β-D-glucosaminidase; uVEGF – urine vascular endothelial growth factor; uIP10 – urine chemokine interferon-inducible protein 10; u(α-1 microglobulin) – urine α-1 microglobulin; uRBP – urine retinol-binding protein; pMCP-1 – serum monocyte chemoattractant protein 1; pTNF-alpha – serum tumor necrosis factor-alpha; CPB – cardiopulmonary bypass; ICU – intensive care unit.
All biomarkers considered for combinations, including injury and functional markers.
Paper considered sustained and transient AKI; here we report analyses related only to sustained vs. no AKI (combinations only reported for sustained AKI).
Sources of Bias.
| First Author, Year | Possible Sources of Bias | Reported AUC | |||
|---|---|---|---|---|---|
| Resubstitution bias | Few events | Fit many models | Multiple centers | ||
| Vaidya, 2008 | No | No (nev = 102, nm = 4) | Yes (up to 29) | No | 0.75–0.78 |
| Han, 2009 | Yes | No (nev = 36, nm = 3) | No (5) | No | 0.75–0.78 |
| Liangos, 2009 | Yes | Yes (nev = 13, nm = 3) | No (7) | Yes | 0.78 |
| Che, 2010 | Yes | Yes (nev = 14, nm = 5) | Yes (29) | No | 0.98 |
| de Geus, 2011 | Yes | No (nev = 47, nm = 2) | No (7) | No | 0.83 |
| Parikh, 2011 | No | No (nev = 60, nm = 3) | Yes (165) | Yes | 0.77 |
| Katagiri, 2012 | Yes | No (nev = 28, nm = 2) | No (9) | No | 0.81 |
| Kokkoris, 2012 | Yes | No (nev = 36, nm = 2–3) | Yes (11) | No | 0.823–0.835 |
| Cho, 2013 | Yes | No (nev = 54, nm = 2) | No (1) | No | 0.800 |
| Kambhampati, 2013 | Yes | Yes (nev = 27, nm = 5) | No (1) | No | 0.80 |
| Liu, 2013 | Yes | No (nev = 26, nm = 2) | No (2) | No | 0.911–0.927 |
| Luo, 2013 | Yes | Yes (nev = 12, nm = 3) | No (2) | No | 0.99 |
| Parikh, 2013 | No | No (nev = 60, nm = 3) | Yes (455) | Yes | 0.78 |
| Siew, 2013 | Yes | No (nev = 127, nm = 2) | No (1) | No | 0.59 |
| Zeng, 2014 | Yes | No (nev = 37, nm = 2) | Yes (64) | No | 0.91–0.94 |
Dark red indicates a high likelihood of bias, dark pink indicates possible bias, and pale pink indicates low likelihood of bias. In the “Resubstitution bias” column, “no” indicated some attempt was made to address resubstitution bias. Under “Few events”, nev is the number of events and nm is the number of markers in the main reported combination; we considered nev:nm < 10 to be few events. The number of events reported may include individuals with missing marker values. We considered more than 10 models to be “many”. We calculated the number of models by hand (i.e., these were not explicitly reported by the authors). Under “Multiple centers”, “yes” indicates that multiple centers were involved and possible differences were not considered, though this does not mean there was bias due to center differences. Abbreviations: AUC – area under the receiver operating characteristic curve.
Includes search over univariate models (if best individual markers were chosen for combination).
Resubstitution bias was addressed, but may not be totally removed.
Reported 10 individuals with sustained AKI also had missing marker values.
Replication in TRIBE.
| Study Information | Reported AUC (95% CI) | AUC in TRIBE (95% CI) | |||||
|---|---|---|---|---|---|---|---|
| First Author, Year | Provided Combination | Biomarkers | Apparent | Center-Adjusted | Optimism-Corrected | Center-Adjusted and Optimism-Corrected | |
| de Geus, 2011 | No | uNGAL, pNGAL at admission | 0.83 (0.75–0.91) | 0.665 (0.585, 0.739) | 0.655 (0.573, 0.736) | 0.663 (0.583, 0.737) | 0.647 (0.565, 0.728) |
|
| |||||||
| Kokkoris, 2012 | Yes | pNGAL, sCr at admission | 0.823 (0.73–0.89) | 0.702 (0.655, 0.745) | 0.691 (0.647, 0.733) | ||
| pNGAL, uNGAL, sCr at admission | 0.835 (0.75–0.90) | 0.704 (0.660, 0.746) | 0.690 (0.649, 0.733) | ||||
|
| |||||||
| Cho, 2013 | No | uNGAL, uLFABP at admission | 0.800 (0.727–0.872) | 0.598 (0.564, 0.630) | 0.585 (0.549, 0.618) | 0.597 (0.563, 0.629) | 0.583 (0.547, 0.616) |
|
| |||||||
| Liu, 2013 | Yes | CuNGAL, CuLFABP at 0h | 0.927 (0.868, 0.986) | 0.587 (0.553, 0.622) | 0.570 (0.533, 0.605) | ||
| CuNGAL, CuLFABP at 2h | 0.911 (0.836, 0.987) | 0.587 (0.551, 0.618) | 0.569 (0.535, 0.606) | ||||
|
| |||||||
| Luo, 2013 | No | uKIM1, uNGAL, uIL18 at 24h | 0.99 (0.90–1.00) | 0.654 (0.607,0.700) | 0.588 (0.544, 0.639) | 0.650 (0.603, 0.696) | 0.580 (0.535, 0.631) |
|
| |||||||
| Siew, 2013 | No | CuNGAL, CuLFABP at 0h and 48h | 0.59 (0.56–0.69) | 0.622 (0.568, 0.676) | 0.610 (0.550, 0.661) | 0.615 (0.560, 0.669) | 0.602 (0.542, 0.653) |
|
| |||||||
| Zeng, 2014 | Yes | CuNGAL at 12h, CuLFABP at 4h | 0.94 (0.89–0.98) | 0.620 (0.564, 0.675) | 0.622 (0.561, 0.674) | ||
| 0.614 (0.561, 0.673) | 0.609 (0.553, 0.668) | ||||||
| CuNGAL at 12h, CuLFABP at 12h | 0.91 (0.85–0.97) | 0.639 (0.591, 0.686) | 0.619 (0.561, 0.674) | ||||
| 0.642 (0.584, 0.696) | 0.630 (0.572, 0.687) | ||||||
We present an overview of the studies replicated in TRIBE, including whether the paper reported the estimated combination, the biomarkers involved in the combination, the reported AUC, and the AUC in TRIBE. For the AUC in TRIBE, we considered the apparent, center-adjusted, optimism-corrected, and center-adjusted and optimism-corrected AUCs for those combinations re-estimated in TRIBE. The optimism-corrected AUC adjusts for resubstitution bias. For the combinations provided in the article, we considered the apparent and center-adjusted AUCs. The 95% confidence intervals in TRIBE were estimated by bootstrapping. Abbreviations: AUC – area under the receiver operating characteristic curve; uKIM1 – urine kidney injury marker-1; uNGAL – urine neutrophil gelatinase-associated lipocalin; CuNGAL – corrected (for urine creatinine) urine neutrophil gelatinase-associated lipocalin; pNGAL – serum neutrophil gelatinase-associated lipocalin; uIL18 – urine interleukin 18; sCr – serum creatinine; uLFABP – urine liver-type fatty acid binding protein; CuLFABP – corrected (for urine creatinine) urine liver-type fatty acid binding protein; CI – confidence interval.
Based on coefficients from paper.
Based on markers measured at 0–6h in TRIBE.
Used KIM1, IL18 and NGAL measured at day 2.
Used uNGAL at 6–12h, uLFABP at 0–6h.
Used uNGAL at 12–18h, uLFABP at 0–6h.
Used uNGAL at 6–12h, uLFABP at 6–12h.
Used uNGAL at 12–18h, uLFABP at 6–12h.
Recommendations Regarding the Design and Analysis of Biomarker Combination Studies.
| Study Design | |
|---|---|
| Sample size | For binary outcomes, the effective sample size is the minimum of the number of events and the number of non-events ( |
| Enrollment and follow-up | Data from a carefully designed and conducted study are preferable to convenience samples ( |
| Measuring biomarkers | Clearly define (including blood/urine and methods of preservation/storage) and measure biomarkers in a uniform, standardized way ( |
| Measuring outcomes | Outcome should be relevant to patients and decision-making ( |
| Timing of measurements | Timing must be carefully defined ( |
| Other design issues | Determine minimally acceptable values of performance measures at the design stage ( |
| Choosing candidate biomarkers | A candidate biomarker is any biomarker associated with the outcome; the association need not be causal ( |
| Handling continuous predictors | Categorizing continuous biomarkers results in a loss of information ( |
| Missing data | Complete case analysis can lead to bias and increased variance ( |
| Predictor selection | Smaller and simpler models may have practical advantages ( |
| Methods for combination | Multiple methods can be used ( |
| Performance metrics | Predictive accuracy, not measures of association or p-values, is what matters ( |
| Internal validation | A necessary step in model development ( |
| External validation | Strongly recommended ( |
| Existing models | Newly developed models should be quantitatively compared to existing models ( |
| Adhere to existing guidelines ( | |