| Literature DB >> 19293999 |
Adam Kiezun1, I-Ting Angelina Lee, Noam Shomron.
Abstract
Logistic regression is often used to help make medical decisions with binary outcomes. Here we evaluate the use of several methods for selection of variables in logistic regression. We use a large dataset to predict the diagnosis of myocardial infarction in patients reporting to an emergency room with chest pain. Our results indicate that some of the examined methods are well suited for variable selection in logistic regression and that our model, and our myocardial infarction risk calculator, can be an additional tool to aid physicians in myocardial infarction diagnosis.Entities:
Keywords: diagnostic markers; logistic regression; myocardial infarction; variable selection methods
Year: 2009 PMID: 19293999 PMCID: PMC2655051 DOI: 10.6026/97320630003311
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Predicting risk for MI using different selection methods applied on different sets of diagnostic parameters. Parameters were subdivided into Symptoms (including medical history), physical signs (Physical) and laboratory tests (Labs) and then the c‐index was calculated using four methods; BPSO, VBCM, SA and Stochastic Search (Random) (see text). Previous modeled data [2] and experts opinion [12] are added for comparison. The c‐index for previous data/work is the result of a deterministic run (marked with an asterisk) rather than an average (Ave) of 100 runs as for the techniques we tested. Also maximum (Max) values for our runs are presented. The heatmap was created using Heatmap Builder: http://quertermous.stanford.edu/heatmap.htm. We use the heatmap color‐coded scale to range from yellow to red, which represents the weight of each parameter, meaning the number of times the parameter is selected in the winning set, out of the 100 runs. The black and white colored boxes represent the inclusion and exclusion of the parameters, respectively. The top three c‐indexes in the list are colored blue.