| Literature DB >> 26380550 |
Ghalib A Bello1, Gerard G Dumancas2, Chris Gennings3.
Abstract
In clinical settings, the diagnosis of medical conditions is often aided by measurement of various serum biomarkers through the use of laboratory tests. These biomarkers provide information about different aspects of a patient's health and overall function of multiple organ systems. We have developed a statistical procedure that condenses the information from a variety of health biomarkers into a composite index, which could be used as a risk score for predicting all-cause mortality. It could also be viewed as a holistic measure of overall physiological health status. This health status metric is computed as a function of standardized values of each biomarker measurement, weighted according to their empirically determined relative strength of association with mortality. The underlying risk model was developed using the biomonitoring and mortality data of a large sample of US residents obtained from the National Health and Nutrition Examination Survey (NHANES) and the National Death Index (NDI). Biomarker concentration levels were standardized using spline-based Cox regression models, and optimization algorithms were used to estimate the weights. The predictive accuracy of the tool was optimized by bootstrap aggregation. We also demonstrate how stacked generalization, a machine learning technique, can be used for further enhancement of the prediction power. The index was shown to be highly predictive of all-cause mortality and long-term outcomes for specific health conditions. It also exhibited a robust association with concurrent chronic conditions, recent hospital utilization, and current health status as assessed by self-rated health.Entities:
Keywords: Predictive tool for composite endpoint; clinical prediction model; composite endpoint prediction model; risk prediction model
Year: 2015 PMID: 26380550 PMCID: PMC4559200 DOI: 10.4137/BBI.S30172
Source DB: PubMed Journal: Bioinform Biol Insights ISSN: 1177-9322
Figure 1(A–C) Examples of relative hazard plots used for transformation of raw biomarker measurements onto the relative hazard scale. Each plot represents the multivariate adjusted, spline-smoothed relative hazard estimates as a function of biomarker level.
Figure 2Bootstrap-averaged weights used to construct the HSM.
NHANES 2003–2008 selected questionnaire items and regression techniques used to model their relationship with HSM.
| VARIABLE NAME | QUESTIONNAIRE ITEM | NO. Of RESPONSE CATEGORIES | ANALYSIS TECHNIQUE |
|---|---|---|---|
| HUQ010 | Self-rated health | 5 | Linear regression |
| HUQ050 | No. of times healthcare received over past year | 6 | Poisson regression |
| HUQ080 | No. of times over past year respondent was overnight hospital patient | 6 | Poisson regression |
| DIQ010 | Doctor ever told respondent they have diabetes? | 2 (Yes/No) | Logistic regression |
| MCQ160L | Doctor ever told respondent they have liver condition? | 2 (Yes/No) | Logistic regression |
| KIQ020 | Doctor ever told respondent they have weak/failing kidneys? | 2 (Yes/No) | Logistic regression |
| MCQ160B–MCQ160F | Doctor ever told respondent they have congestive heart failure, coronary heart disease, angina, heart attack, stroke | 2 (Yes/No) | Logistic regression |
Figure 3Distribution of HSM in NHANES III population.
Figure 4(A) age, 18–39; gender, female. (B) age, 40–64; gender, female. (C) age, ≥65; gender, female. (D) age, 18–39; gender, male. (E) age, 40–64; gender, male. (F) age, ≥65; gender, male.
Predictive Validity of HSM (as measured by P-value and Odds Ratios [covariate-adjusted]) for death caused by a variety of chronic ailments.
| CAUSE OF DEATH | ODDS RATIO (95% CI) | |
|---|---|---|
| Cardiovascular disease | 0.5 | 0.9 (0.8–1.1) |
| Liver disease | <0.0001 | 3.7 (2.3–6.0) |
| Kidney disease | 0.0 0 4 | 2.2 (1.3–3.7) |
| Diabetes | <0.0001 | 2.3 (1.6–3.4) |
HSM relationship with self-reported hospital utilization and physician-diagnosed health conditions.
| QUESTIONNAIRE ITEM | ODDS RATIO (95% CI) | |
|---|---|---|
| Self-rated health | <0.0001 | N/A |
| # of times healthcare received over past year | <0.0001 | N/A |
| # of times over past year respondent was overnight hospital patient | 0.0 0 3 | N/A |
| Doctor ever told respondent they have diabetes? | <0.0001 | 3.0 (2.3–4) |
| Doctor ever told respondent they have liver condition? | <0.0001 | 2.1 (1.5–3) |
| Doctor ever told respondent they have weak/failing kidneys? | <0.0001 | 4.7 (3.2–7) |
| Doctor ever told respondent they have congestive heart failure, coronary heart disease, Angina, heart attack, or stroke | <0.0001 | 2.2 (1.6–3) |
Figure 5(A) Age- and gender-adjusted relationship between HSM score and 5-year mortality risk. (B) age- and gender-adjusted relationship between HSM score and 10-year mortality risk.
Sample sizes and outcome summaries for datasets used in this study.
| DATASET | SAMPLE SIZE | OUTCOME (MORTALITY)
| |
|---|---|---|---|
| PERCENTAGE OF CONFIRMED DEATHS | MEDIAN SURVIVAL TIME (MONTHS) | ||
| NHANES 1999–2002 | 3406 | 4.86% | 68 |
| NHANES III(1988–1994) | 10592 | 22.06% | 166 |
| NHANES 2003–2008 | 4670 | N/A | N/A |