Kathleen F Kerr1, Eric R Morenz1, Jeremy Roth1, Heather Thiessen-Philbrook2, Steven G Coca3, Chirag R Parikh2. 1. Department of Biostatistics, University of Washington, Seattle, Washington, USA. 2. Division of Nephrology, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA. 3. Division of Nephrology, Icahn School of Medicine at Mount Sinai, New York, New York, USA.
Abstract
INTRODUCTION: Acute kidney injury (AKI) is a frequent complication of cardiac surgery, but only a fraction of cardiac surgery patients that experience postoperative AKI have progression to more severe stages. Biomarkers that can distinguish patients that will experience progression of AKI are potentially useful for clinical care and/or the development of therapies. METHODS: Data come from a prospective cohort study of cardiac surgery patients; the analytic dataset contained data from 354 cardiac surgery patients meeting criteria for AKI following surgery. Candidate predictors were 38 biomarkers of kidney function, insult, or injury measured at the time of AKI diagnosis. The outcome was AKI progression, defined as worsening of AKI Network stage. We investigated combining biomarkers with Bayesian model averaging (BMA) and random forests of classification trees, with and without center transformation. For both approaches, we used resampling-based methods to avoid optimistic bias in our assessment of model performance. RESULTS: BMA yielded a combination of 3 biomarkers and an optimism-corrected estimated area under the receiver operating characteristic curve (AUC) of 0.75 (95% confidence interval [CI]: 0.68, 0.82). The random forests approach, which nominally uses all biomarkers, had an estimated AUC of 0.74 (95% CI: 0.66, 0.82). A second application of random forests applied to biomarker values after a center-specific transformation had an estimated AUC of 0.80 (95% CI: 0.72, 0.88). CONCLUSION: These findings suggest that the application of advanced statistical techniques to combine biomarkers offers only modest improvements over use of single biomarkers alone. This exemplifies a common experience in biomarker research: combinations of modestly performing biomarkers often also have modest performance.
INTRODUCTION: Acute kidney injury (AKI) is a frequent complication of cardiac surgery, but only a fraction of cardiac surgery patients that experience postoperative AKI have progression to more severe stages. Biomarkers that can distinguish patients that will experience progression of AKI are potentially useful for clinical care and/or the development of therapies. METHODS: Data come from a prospective cohort study of cardiac surgery patients; the analytic dataset contained data from 354 cardiac surgery patients meeting criteria for AKI following surgery. Candidate predictors were 38 biomarkers of kidney function, insult, or injury measured at the time of AKI diagnosis. The outcome was AKI progression, defined as worsening of AKI Network stage. We investigated combining biomarkers with Bayesian model averaging (BMA) and random forests of classification trees, with and without center transformation. For both approaches, we used resampling-based methods to avoid optimistic bias in our assessment of model performance. RESULTS: BMA yielded a combination of 3 biomarkers and an optimism-corrected estimated area under the receiver operating characteristic curve (AUC) of 0.75 (95% confidence interval [CI]: 0.68, 0.82). The random forests approach, which nominally uses all biomarkers, had an estimated AUC of 0.74 (95% CI: 0.66, 0.82). A second application of random forests applied to biomarker values after a center-specific transformation had an estimated AUC of 0.80 (95% CI: 0.72, 0.88). CONCLUSION: These findings suggest that the application of advanced statistical techniques to combine biomarkers offers only modest improvements over use of single biomarkers alone. This exemplifies a common experience in biomarker research: combinations of modestly performing biomarkers often also have modest performance.
AKI is common after cardiac surgery and is associated with adverse outcomes. However, only a fraction of cardiac surgery patients who experience postoperative AKI have progression to more severe stages. Identifying cardiac surgery patients who are at high risk of AKI progression could potentially improve patient care or aid in the development of novel treatment strategies.Numerous biomarkers of injury, inflammation, repair, and fibrosis have been associated with the development of AKI following cardiac surgery.2, 3, 4, 5, 6, 7 Few studies have investigated biomarkers that can be used to predict AKI progression. We used data from a large multicenter cohort of cardiac surgery patients to seek out prognostic biomarkers of AKI progression. We also investigated whether the prognostic capacity of individual biomarkers could be leveraged to produce a biomarker panel with improved prognostic capacity.
Methods
Study Population
This is a secondary analysis of the Translational Research Investigating Biomarker Endpoints in AKI (TRIBE-AKI study). Briefly, this study prospectively enrolled 1219 adults undergoing coronary artery bypass graft (CABG) and/or valve surgery at 6 academic medical centers in North America between July 2007 and December 2009. Study details have been described previously. Patients who developed at least AKI Network stage 1 were included in this analysis.
Sample Collection
Urine and ethylenediamine tetraacetic acid (EDTA) plasma specimens were collected preoperatively and daily for up to 5 postoperative days. The first postoperative samples were collected soon after patients were admitted to the intensive care unit (0–6 hours after surgery). Details regarding sample collection and storage were provided in earlier reports.
Biomarkers
The present investigation considers biomarkers measured closest to the time of AKI diagnosis. For each patient, we defined the biomarker value as the most recent nonmissing postoperative value up to and including the day of that patient’s AKI diagnosis. For example, suppose a patient is diagnosed with AKI 2 days after surgery but has missing biomarker values on that day. If the patient had nonmissing biomarker values 1 day after surgery, then those values were used as the biomarker measurements. On the other hand, if the patient had no biomarker measurement at any time between cardiac surgery through the day of AKI progression, then the patient’s biomarker value was coded as missing. Supplementary Table S1 summarizes the timing of biomarker measurements used in analysis relative to the first day of clinical AKI. Use of the most-recent-carried-forward measurements reduced the number of patients with missing data for one or more biomarkers from 156 to 84 (of 354).Supplementary Table S2 lists biomarker assays; procedures are detailed in prior publications.3, 4, 6, 9, 10, 11 Our study included 31 blood plasma biomarkers, 7 urine biomarkers, and the change in serum creatinine level (day of AKI diagnosis vs. pre-surgery). Preoperative serum creatinine level, collected about 3 days prior to surgery (interquartile range: 1–11 days), served as a baseline measurement of kidney function. Pre- and post-operative creatinine values were measured by the same laboratory for each patient at all centers. Supplementary Table S1 provides details of the plasma and urine biomarker measurements.
Outcome
The primary outcome was AKI progression, defined as an increase in AKI Network stage from 1 to 2 or 3.
Statistical Methods
We summarized the association of individual biomarkers and AKI progression using odds ratios comparing patients 1 SD apart in the biomarker. We summarized the prognostic capacity of individual biomarkers using the area under the receiver operating characteristic curve (AUC). AUC is a general measure of how well a combination discriminates between patients who progress and those who do not, and we acknowledge that AUC is an incomplete assessment.12, 13We considered 40 candidates (31 plasma biomarkers, 7 urine biomarkers, change in serum creatinine level, and cardiopulmonary bypass surgery time) in developing biomarker combinations. Cardiopulmonary bypass surgery time was included as a binary variable indicating a time >120 minutes. Urine biomarkers were not normalized to urine creatinine level, although urine creatinine level was included as a candidate predictor. All biomarkers were log-transformed, except for serum creatinine level.
Biomarker Combination Methods (BMA and Random Forests)
We used 2 complementary methods to seek biomarker combinations. First, we used methods based on BMA to identify combinations of biomarkers and cardiopulmonary bypass surgery time. Briefly, BMA assigns a prior probability to each candidate predictor that reflects the chance that the predictor is useful for prediction. These prior probabilities induce a prior probability for each biomarker combination. The method uses these prior probabilities and the data to calculate a posterior probability for each candidate predictor. This posterior variable probability reflects the support in the data for the variable as a predictor of the outcome. The BMA methods use a “leaps and bounds” algorithm to strategically search the large space of all possible biomarker combinations to identify the most-promising combinations for further consideration. With 40 candidate predictors, there are >1 trillion possible combinations; the search algorithm makes exploring this space computationally feasible.In our implementation of BMA, we assigned each candidate predictor a prior probability of one-half, implying that each candidate was a priori as likely to be in the model as not. These prior probabilities induce a prior probability for each combination of (0.5)40 = 9.1 × 10–13, because there are 40 candidate predictors. In our application of BMA, we selected the combination of all biomarkers with posterior probability exceeding 0.5, which is sometimes called the median probability combination.Note that BMA seeks linear combinations of biomarkers. In particular, it does not easily handle interactions among biomarkers. As a complementary approach, we also used the random forests method of aggregating classification trees. In this ensemble approach, we took 1000 bootstrap samples of our data and fit a classification decision tree to each sample using a randomly selected set of 6 biomarkers. In each bootstrap sample, we used Gini impurity as the splitting criterion. Predicted risks of AKI progression from a single tree can be obtained as the frequency of the outcome in the nodes of the tree. To obtain predicted risks for individual patients from a collection of trees, we averaged these predicted risks using only the trees that, due to the random bootstrap sampling, did not include that patient (“out-of-bag” trees). Obtaining predictions by averaging across out-of-bag trees should protect against overfitting and avoid optimistic bias in assessment of performance.It is difficult to understand the role of individual predictors in a random forest, which is sometimes referred to as a “black box” prediction instrument. As a first-pass measure of the importance of individual predictors, we repeated the entire random forest procedure 40 times, once for each candidate predictor, excluding one candidate predictor each time. We measured the decrease in estimated AUC when an individual biomarker was excluded. We caution against overinterpretation of this measure of variable importance. For example, suppose there are 2 biomarkers that are each highly predictive but are also highly correlated with each other. Excluding one of the biomarkers as a candidate predictor will have little effect on the performance of the random forest, but this will understate the importance of the marker.
Optimism Correction
For both BMA and random forests, we used methods to correct for optimistic bias that arises when the same data are used to both fit a model and then evaluate the model’s performance. For BMA, we used a bootstrapping procedure to estimate and correct for optimism in estimates of combination performance due to resubstitution bias and model-selection bias (more details follow). For random forests, we used out-of-bag trees to obtain predicted risks for patients, in order to estimate model performance. That is, for a given observation, we used all trees that did not use that observation to grow the tree to make a prediction for that observation, and averaged those predictions.
Multiple Imputations of Missing Biomarker Data
There was a non-negligible amount of missing biomarker data, which was a particular consideration for our investigation of biomarker combinations. Among the 83 individuals with at least one missing biomarker measurement, 14.5% experienced progression. Among the 271 patients with no missing data, 10.0% experienced AKI progression. We used a multiple imputation approach to retain the 83 patients with missing biomarker data in our search for biomarker combinations. Next, we provide an overview of our approach for BMA analysis and provide details in the online Supplementary Methods.First, we created 10 versions of our dataset into which missing biomarker data were imputed. Second, in each imputed dataset, we used BMA to estimate the posterior probability of every candidate predictor. Third, we identified the predictors for which the posterior probability estimates, averaged over the 10 imputed datasets, exceeded 0.5. Fourth, we fit a logistic regression model using the selected predictors in each of the 10 imputed datasets. For each predictor, the final regression coefficient was the average regression coefficient across the 10 imputed datasets. Fifth, we calculated an empirical AUC based on these regression coefficients in each imputed dataset, and averaged these AUC values. In order to correct this estimate of performance for optimistic bias, we took independent bootstrap samples from each imputed dataset, re-running the steps above in each bootstrap sample. Obtaining estimated regression coefficients in a bootstrap dataset, and assessing the combination performance in both the bootstrap dataset and the original dataset, allowed us to estimate and average optimism using a total of 1000 bootstrap datasets. We subtracted this estimated optimism from our nominal estimate of performance to produce an optimism-corrected estimate of performance.
Methods to Handle Study-Center Effect
We aimed to additionally explore the effect of study center on results. Since BMA uses linear combinations of biomarkers, the natural method of center-adjusted performance is to force “center” as a covariate in the linear combinations. However, due to a small number of patients who progressed to AKI in some centers, this approach was not computationally feasible when combined with our resampling approach (described later). For random forests, this integrated approach to center-adjustment is not possible generally. Therefore, to adjust for center with random forests we transformed biomarkers to the percentile scale, using the distribution of the biomarker among those who did not progress to AKI in the same center as the reference distribution. In summary, we applied the random forests approach twice. First, we applied the algorithm to the biomarkers in the same form we used for BMA. In a second application, we applied random forests to the center-specific transformed biomarkers. We note that classification trees, unlike linear model methods such as BMA, are invariant to monotone transformations of predictors. Because the center-specific transformation procedure involves a highly variable nonlinear transformation, we did not apply BMA to the center-specific transformed biomarker values.
Assessing Variability in Predictive Model Performance
For the model selected by BMA, within each complete dataset arising from multiple imputations, we performed bootstrapping to estimate the AUC variance of the fitted BMA combination. We combined these AUC variances across multiple imputation datasets using the Rubin rule. This gave an estimated AUC variance that we used to construct an AUC confidence interval, which we corrected according to the estimated optimism (see earlier description).For the random forest prediction models, within each complete dataset arising from multiple imputations, we ran the forest-building algorithm and estimated its AUC using out-of-bag trees for each observation as described. We estimated the variance of these AUCs using DeLong's method. We combined these estimates using the Rubin rule, and then constructed an AUC confidence interval.
Software
All analyses were completed using R 3.1.2; the BMA package was used for the BMA analyses, the ranger package for random forests, the mice package for multiple imputations, and the pROC package for AUC variances.
Results
Table 1 describes the characteristics of the study population by AKI progression status. There were 354 patients in the full dataset, including 39 patients who progressed to a higher AKI stage. All patients were diagnosed with stage 1 AKI either 1, 2, or 3 days after surgery (33%, 38%, and 29%, respectively). Among the 39 patients with AKI progression, 21 (54%) progressed from stage 1 to stage 2, and 18 (46%) progressed from stage 1 to stage 3. Thirteen of the 18 stage-3 patients received dialysis. Fourteen (4%) patients died in the hospital; 7 each in the subsets of patients with and without AKI progression. Table 2 and Figure 1 summarize the distribution of the biomarkers in those with and without AKI progression (see also Supplementary Figure S1). Table 3 summarizes biomarker AUC and odds ratios (ORs) on the original scale and after the center-specific transformation. A handful of biomarkers show modest predictive capacity in the data, with point estimates for AUC in the range 0.68 to 0.72 and with wide confidence intervals. In terms of individual AUC, the center-specific transformation appeared to lower AUC values.
Table 1
Demographics and clinical variables of study population as a whole and stratified on acute kidney injury (AKI) progression status
Units are pg/ml unless otherwise indicated. Values are median (interquartile range).
Urine biomarkers and serum creatinine were missing for 4 patients. Plasma biomarker values were missing for 27 patients, except for the following biomarkers, which were missing for 71–73 patients: Pro BNP, TNI, TNTHS, CKMB HFABP, MCP-1.
Figure 1
Distributions of select biomarkers in patients who had versus did not have progression of acute kidney injury (AKI). See Supplementary Figure S1 for all biomarkers. hFABP, heart-type fatty acid-binding protein; NGAL, neutrophil gelatinase-associated lipocalin; Tie-2, tyrosine kinase-2; VEGF, vascular endothelial growth factor.
Table 3
Area under the ROC curve (AUC) and odds ratios (ORs) for individual biomarkers measured in this study, with 95% confidence intervals
ORs are per biomarker SD. We report mean AUC and OR values across the 10 imputed datasets resulting from our multiple imputation procedure.
Demographics and clinical variables of study population as a whole and stratified on acute kidney injury (AKI) progression statusCABG, coronary artery bypass grafting, CBP, cardiopulmonary bypass; eGFR, estimated glomerular filtration rate.Values are n (%), unless otherwise indicated.There were 2 missing data values for cardiac catheterization, 5 for preoperative myocardial infarction, and 5 for CPB surgery time.Summary of biomarkers included in this investigationAKI, acute kidney injury; bFGF, basic fibroblast growth factor; CKMB, creatine kinase-MB; EGF, epidermal growth factor; HFABP, heart-type fatty acid-binding protein; IFNg, human interferon gamma; KIM-1, kidney injury molecule-1; L-FABP, liver fatty acid-binding protein; MCP-1, monocyte chemoattractant protein-1; NGAL, neutrophil gelatinase-associated lipocalin; PIGF, placental growth factor; Pro BNP, pro-B-type natriuretic peptide; ST-2, soluble ST2; Tie-2, tyrosine kinase-2; TNFα, human tumor necrosis factor alpha; TNFR-1, tumor necrosis factor receptor 1; TNFR-2, tumor necrosis factor receptor 2; TNI, troponin I; TNTHS, high-sensitivity troponin T; VEGF, vascular endothelial growth factor; VEGFc, vascular endothelial growth factor-C; VEGFd, vascular endothelial growth factor-D; VEGF-R1, vascular endothelial growth factor receptor-1.Units are pg/ml unless otherwise indicated. Values are median (interquartile range).Urine biomarkers and serum creatinine were missing for 4 patients. Plasma biomarker values were missing for 27 patients, except for the following biomarkers, which were missing for 71–73 patients: Pro BNP, TNI, TNTHS, CKMB HFABP, MCP-1.Distributions of select biomarkers in patients who had versus did not have progression of acute kidney injury (AKI). See Supplementary Figure S1 for all biomarkers. hFABP, heart-type fatty acid-binding protein; NGAL, neutrophil gelatinase-associated lipocalin; Tie-2, tyrosine kinase-2; VEGF, vascular endothelial growth factor.Area under the ROC curve (AUC) and odds ratios (ORs) for individual biomarkers measured in this study, with 95% confidence intervalsbFGF, basic fibroblast growth factor; CKMB, creatine kinase-MB; CysC, cystatin C; EGF, epidermal growth factor; hFABP, heart-type fatty acid-binding protein; IFNg, human interferon gamma; KIM-1, kidney injury molecule-1; LFABP, liver fatty acid-binding protein; MCP-1, monocyte chemoattractant protein-1; NGAL, neutrophil gelatinase-associated lipocalin; PIGF, placental growth factor; PROBNP, pro-B-type natriuretic peptide; ST-2, soluble ST2; Tie-2, tyrosine kinase-2; TNFα, human tumor necrosis factor alpha; TNF-R1, tumor necrosis factor receptor 1; TNF-R2, tumor necrosis factor receptor 2; TNI, troponin I; TNTHS, high-sensitivity troponin T; VEGF, vascular endothelial growth factor; VEGFc, vascular endothelial growth factor-C; VEGFd, vascular endothelial growth factor-D; VEGF-R1, vascular endothelial growth factor receptor-1.ORs are per biomarker SD. We report mean AUC and OR values across the 10 imputed datasets resulting from our multiple imputation procedure.Applying BMA to develop biomarker combinations, 3 markers had posterior variable probabilities exceeding 0.5. The markers and their mean posterior probabilities across multiple imputation datasets were the following: plasma heart-type fatty acid-binding protein (95.8%), change in serum creatinine level (89.0%), and plasma IL-8 (66.8%). Figure 2 displays the posterior probabilities for all biomarkers. The optimism-corrected AUC estimate for the 3-biomarker combination was 0.75 (95% CI: 0.68, 0.82). A random forest of classification trees built using untransformed biomarkers had an AUC of 0.74 (95% CI: 0.66, 0.82). According to our measure of variable importance for random forests, the most important biomarkers were plasma IL-8, heart-type fatty acid-binding protein, Tie-2, and urine neutrophil gelatinase-associated lipocalin (Figure 3). When we built a random forest using the biomarkers after the center-specific transformation, the estimated AUC was 0.80 (95% CI: 0.72, 0.88). The strongest predictors after center transformation were plasma Tie-2, followed by plasma heart-type fatty acid-binding protein, urine creatinine, and tumor necrosis factor receptor 1 (Figure 4).
Figure 2
Biomarker posterior probabilities using Bayesian model averaging. The 3 biomarkers in blue have posterior probabilities exceeding 50%; these biomarkers are included in the Bayesian model averaging combination. bFGF, basic fibroblast growth factor; CKMB, creatine kinase-MB; hFABP, heart-type fatty acid-binding protein; IFNg, human interferon gamma; KIM1, kidney injury molecule-1; LFABP, liver fatty acid-binding protein; MCP-1, monocyte chemoattractant protein-1; NGAL, neutrophil gelatinase-associated lipocalin; PIGF, placental growth factor; PROBNP, pro-B-type natriuretic peptide; ST-2, soluble ST2; Tie-2, tyrosine kinase-2; TNFα, human tumor necrosis factor alpha; TNF-R1, tumor necrosis factor receptor 1; TNI, troponin I; TNTHS, high-sensitivity troponin T; VEGF, vascular endothelial growth factor; VEGF R1, vascular endothelial growth factor receptor 1.
Multiple biomarkers were associated with AKI progression among patients diagnosed with AKI following cardiac surgery. However, these associations do not imply high prognostic capacity. Of the 38 biomarkers we examined, the most prognostic individual biomarker had an AUC of only 0.73. The modest prognostic capacity of individual biomarkers motivated our investigation of whether a combination of multiple biomarkers could improve prediction.Figure 5 summarizes this investigation. The 2 methods we applied—BMA and random forests of classification trees—are complementary approaches in the sense that BMA seeks simple linear combinations of candidate predictors and parsimonious models, whereas random forests are well suited to situations in which interactions among candidate predictors are important and the model includes all biomarkers. Although the random forest approach uses all candidate predictors, the bootstrap aggregation component of the algorithm protects against overfitting. Despite these complementary approaches, we did not identify a biomarker combination that was clearly superior to the best-performing individual biomarkers.
Figure 5
Identifying a panel of biomarkers to predict acute kidney injury (AKI) progression. AUC, area under the receiver operating characteristic curve; BMA, Bayesian Model averaging; CPB, cardiopulmonary bypass; hFABP, heart-type fatty acid-binding protein; NGAL, neutrophil gelatinase-associated lipocalin; Tie-2, tyrosine kinase-2.
Identifying a panel of biomarkers to predict acute kidney injury (AKI) progression. AUC, area under the receiver operating characteristic curve; BMA, Bayesian Model averaging; CPB, cardiopulmonary bypass; hFABP, heart-type fatty acid-binding protein; NGAL, neutrophil gelatinase-associated lipocalin; Tie-2, tyrosine kinase-2.A disadvantage of a random forest is that it is a “black box” approach—it is difficult to understand the impact of any given biomarker on predictions. Using our measure of variable importance for random forests, the important biomarkers were roughly similar between the methods.Although it is counterintuitive, biomarkers that perform best in combination are not necessarily those that have the strongest individual performance.19, 20 It is mathematically possible for a biomarker to have null individual prognostic capacity yet contribute substantially to prediction in combination with other variables. On its own, Tie-2 had almost no predictive capacity in our dataset, yet it had the fourth-highest mean posterior probability in the BMA analysis and appeared as one of the top-performing biomarkers in the random forest analyses. Furthermore, although many share the intuition that combinations of modestly performing biomarkers should be able to achieve substantially better prediction, theoretical results teach us that this intuition may be misguided.19, 20The predictive capacity of individual biomarkers in our data tended to be lower after the center-specific transformation. This could suggest that the variability introduced by the transformation hurt performance. An alternative explanation is that performance of the untransformed biomarkers was inflated due to confounding effects of center. However, there was not compelling evidence of center effects for these biomarkers that would support the latter hypothesis. Counterintuitively, the most promising result for biomarker combinations was for the center-specific transformed biomarkers. We caution that the center-specific transformation is highly variable, particularly for smaller centers. Therefore, this result should be considered preliminary.We believe our experience in this endeavor is common across a wide array of fields in which there is interest in using biomarkers to improve risk prediction, prognosis, or other types of forecasting. For example, Wang et al. measured 10 biomarkers in 3209 participants in the Framingham Heart study. Adding multi-marker scores to conventional risk factors for death or cardiovascular events resulted in only small improvements in predictive capacity. Similar experiences are reported in the literature, and we suspect that many more “negative” results go unpublished.A plethora of methods are available for building predictive models. It is possible that another method, perhaps one that has not yet been invented, would yield a combination of our biomarkers that would accurately forecast AKI progression. However, there are reasons to suspect that other methods would not yield better results. Lim et al. compared 33 algorithms for building predictive models, including classical statistical algorithms and “machine learning” algorithms. The investigators had 32 datasets in which to compare the predictive performance of the models produced by the algorithms. Classical logistic regression was the second-best algorithm in terms of accuracy. An important point to note is that results were similar among the best-performing algorithms. A recent survey of the literature found no advantage to machine-learning methods over logistic regression methods.This study had several strengths, including the number of biomarkers measured; the use of biomarkers that are biologically meaningful and measured by high-quality immunoassays; and the use of statistical methods to assess performance while avoiding optimistic bias. Limitations of this study include a modest number of patients progressing in AKI, with the majority of patients with AKI progression (23 of 39) coming from the largest center. The other 5 centers had 2–5 patients progressing in AKI. Missing biomarker data further reduced power, although this reduction was mitigated through application of multiple imputations. Seven patients without AKI progression died in the hospital, raising a potential concern for misclassification, since progression might have been observed had these patients survived. However, these patients comprise only 2% of patients without AKI progression, so the potential impact of hypothetical misclassification is minor. Finally, the outcome of AKI progression may not serve as the ideal outcome to test multiple biomarker combinations due to the fact that the dimensions that determine “AKI progression” (change in serum creatinine level over a given unit of time) may not be congruent with the biological expression of the biomarkers that we measured.The application of advanced statistical techniques to combine the prognostic information in a panel of biomarkers produced a small or negligible improvement over individual biomarkers. These findings are instructive for investigators seeking to develop biomarker panels in many fields. In light of empirical experience and theoretical results, investigators should have realistic expectations for biomarker combinations when the performance of individual biomarkers is insufficient for clinical application: combinations of modestly performing biomarkers are unlikely to yield excellent prediction.
Disclosure
SGC and CRP receive financial compensation as consultants and advisory board members for Renalytix AI, Inc., and own equity in Renalytix. SGC has received consulting fees from Goldfinch Bio, CHF Solutions, Quark Biopharma, Janssen Pharmaceuticals, and Takeda Pharmaceuticals in the past 3 years. SGC was on the advisory board for pulseData and received consulting fees and equity in return. All the other authors declared no competing interests.
Authors: Evangelia Christodoulou; Jie Ma; Gary S Collins; Ewout W Steyerberg; Jan Y Verbakel; Ben Van Calster Journal: J Clin Epidemiol Date: 2019-02-11 Impact factor: 6.437
Authors: Jay L Koyner; Amit X Garg; Steven G Coca; Kyaw Sint; Heather Thiessen-Philbrook; Uptal D Patel; Michael G Shlipak; Chirag R Parikh Journal: J Am Soc Nephrol Date: 2012-03-01 Impact factor: 10.121
Authors: Kathleen F Kerr; Allison Meisner; Heather Thiessen-Philbrook; Steven G Coca; Chirag R Parikh Journal: Clin J Am Soc Nephrol Date: 2014-05-22 Impact factor: 8.237
Authors: Uptal D Patel; Amit X Garg; Harlan M Krumholz; Michael G Shlipak; Steven G Coca; Kyaw Sint; Heather Thiessen-Philbrook; Jay L Koyner; Madhav Swaminathan; Cary S Passik; Chirag R Parikh Journal: Circulation Date: 2012-02-09 Impact factor: 29.690
Authors: Chirag R Parikh; Heather Thiessen-Philbrook; Amit X Garg; Deepak Kadiyala; Michael G Shlipak; Jay L Koyner; Charles L Edelstein; Prasad Devarajan; Uptal D Patel; Michael Zappitelli; Catherine D Krawczeski; Cary S Passik; Steven G Coca Journal: Clin J Am Soc Nephrol Date: 2013-04-18 Impact factor: 8.237
Authors: Emilie P Belley-Côté; Chirag R Parikh; Colleen R Shortt; Steven G Coca; Amit X Garg; John W Eikelboom; Peter Kavsak; Eric McArthur; Heather Thiessen-Philbrook; Richard P Whitlock Journal: J Thorac Cardiovasc Surg Date: 2016-02-24 Impact factor: 5.209
Authors: Ivan Gocze; Matthias Koch; Philipp Renner; Florian Zeman; Bernhard M Graf; Marc H Dahlke; Michael Nerlich; Hans J Schlitt; John A Kellum; Thomas Bein Journal: PLoS One Date: 2015-03-23 Impact factor: 3.240