| Literature DB >> 31178945 |
Terrence E Murphy1, Sui W Tsang1, Linda S Leo-Summers1, Mary Geda1, Dae H Kim2, Esther Oh3, Heather G Allore1, John Dodson4, Alexandra M Hajduk1, Thomas M Gill1, Sarwat I Chaudhry1.
Abstract
We describe a selection process for a multivariable risk prediction model of death within 30 days of hospital discharge in the SILVER-AMI study. This large, multi-site observational study included observational data from 2000 persons 75 years and older hospitalized for acute myocardial infarction (AMI) from 94 community and academic hospitals across the United States and featured a large number of candidate variables from demographic, cardiac, and geriatric domains, whose missing values were multiply imputed prior to model selection. Our objective was to demonstrate that Bayesian Model Averaging (BMA) represents a viable model selection approach in this context. BMA was compared to three other backward-selection approaches: Akaike information criterion, Bayesian information criterion, and traditional p-value. Traditional backward-selection was used to choose 20 candidate variables from the initial, larger pool of five imputations. Models were subsequently chosen from those candidates using the four approaches on each of 10 imputations. With average posterior effect probability ≥ 50% as the selection criterion, BMA chose the most parsimonious model with four variables, with average C statistic of 78%, good calibration, optimism of 1.3%, and heuristic shrinkage of 0.93. These findings illustrate the utility and flexibility of using BMA for selecting a multivariable risk prediction model from many candidates over multiply imputed datasets.Entities:
Keywords: AIC; AMI; BIC; Bayesian model averaging; Risk prediction; backward-selection
Year: 2019 PMID: 31178945 PMCID: PMC6553647 DOI: 10.6000/1929-6029.2019.08.01
Source DB: PubMed Journal: Int J Stat Med Res ISSN: 1929-6029
Figure 1Stages of Selection for the Final Multivariable Risk Prediction Model.
Abbreviations: BMA = Bayesian Model Averaging, AIC = Akaike Information Criterion, BIC = Bayesian Information Criterion.
Frequencies of Candidate Variable Selection by Approach
| Twenty Candidate Variables chosen by Backward Selection across Pool of Five Imputations | Percent of Times Chosen over 10 Imputations for Optimal Model by each Discrete Method | Average Posterior Effect Probability (Percent) | ||
|---|---|---|---|---|
| AIC | BIC | BW | BMA | |
| Acute kidney disease | 100 | 0 | 100 | 4.7 |
| Age in years at admission | 100 | 100 | 100 | 99.2 |
| End of month need help with finances | 0 | 0 | 0 | 0.0 |
| ESAS - Depressed | 100 | 0 | 0 | 0.0 |
| ESAS - Drowsy | 100 | 0 | 0 | 0.0 |
| ESAS - Dyspnea | 100 | 50 | 90 | 47.2 |
| First systolic blood pressure | 100 | 0 | 30 | 4.2 |
| Grip strength frailty | 0 | 0 | 0 | 0.0 |
| In-hospital bleeding event | 70 | 0 | 0 | 0.0 |
| In-hospital heart failure | 100 | 0 | 0 | 0.0 |
| In-hospital hyperglycemia | 0 | 0 | 0 | 0.0 |
| Length of stay | 70 | 0 | 0 | 82.7 |
| Month prior walking | 100 | 0 | 0 | 13.0 |
| PHQ evidence of depression | 100 | 0 | 80 | 8.1 |
| Prior history of CABG | 50 | 0 | 0 | 0.0 |
| Short form 12 general health | 100 | 100 | 100 | 82.7 |
| Social support: no one to relax with | 100 | 50 | 90 | 38.0 |
| TICS total score | 100 | 100 | 100 | 81.6 |
| Unintended WL | 100 | 0 | 70 | 33.0 |
| Visually impaired | 100 | 0 | 0 | 0.0 |
| Total number of variables chosen over 10 imputations (out of Possible 200) | 159 | 40 | 76 | N/A |
Abbreviations: AIC = Akaike information criterion; BIC = Bayesian information criterion; BW = backward selection with p-value of 0.05; BMA = Bayesian model averaging; ESAS = Edmonton Symptom Assessment Scale; PHQ = patient health questionnaire; CABG = Coronary artery bypass-graft; TICS = modified telephone interview for cognitive status; WL= weight loss; N/A = not applicable.
Comparing Final Models Chosen by Four Different Backward-Selection Approaches from 20 Candidate Variables across 10 Multiple Imputations
| Final Models Chosen by Backward-Selection | ||||
|---|---|---|---|---|
| Performance Metrics of | AIC | BIC | BW | BMA |
| Variables Selected Across 10 Imputations [ | Acute kidney disease | Age in years | Acute kidney disease | Age in years |
| Calibration | 0.08 | 0.19 | 0.46 | 0.10 |
| Discrimination | 88% | 79% | 84% | 78% |
| Heuristic Shrinkage[ | 0.85 | 0.92 | 0.90 | 0.93 |
| Optimism | 3.5% | 1.7% | 2.0% | 1.3% |
chosen from ≥ 5 imputations (AIC, BIC, and BW) or with average posterior probability ≥ 50% (BMA).
from Van Heuwelingen JC and le Cessie S (LR– number of terms) / (LR).
abbreviations: AIC = Akaike information criterion; BIC = Bayesian information criterion; BMA = Bayesian model averaging; BP = blood pressure; BW = backward selection with p-value of 0.05; ESAS = Edmonton Symptom Assessment Scale; HF = heart failure; HL = Hosmer-Lemeshow goodness of fit ; LR = likelihood ratio chi-square; NR = no relaxation; PHQ = patient health questionnaire; SF12genHealth = general health on Short Form 12; TICS = modified telephone interview for cognitive status; WL= weight loss.