Literature DB >> 17186501

A comparison of regression trees, logistic regression, generalized additive models, and multivariate adaptive regression splines for predicting AMI mortality.

Peter C Austin1.   

Abstract

Clinicians and health service researchers are frequently interested in predicting patient-specific probabilities of adverse events (e.g. death, disease recurrence, post-operative complications, hospital readmission). There is an increasing interest in the use of classification and regression trees (CART) for predicting outcomes in clinical studies. We compared the predictive accuracy of logistic regression with that of regression trees for predicting mortality after hospitalization with an acute myocardial infarction (AMI). We also examined the predictive ability of two other types of data-driven models: generalized additive models (GAMs) and multivariate adaptive regression splines (MARS). We used data on 9484 patients admitted to hospital with an AMI in Ontario. We used repeated split-sample validation: the data were randomly divided into derivation and validation samples. Predictive models were estimated using the derivation sample and the predictive accuracy of the resultant model was assessed using the area under the receiver operating characteristic (ROC) curve in the validation sample. This process was repeated 1000 times-the initial data set was randomly divided into derivation and validation samples 1000 times, and the predictive accuracy of each method was assessed each time. The mean ROC curve area for the regression tree models in the 1000 derivation samples was 0.762, while the mean ROC curve area of a simple logistic regression model was 0.845. The mean ROC curve areas for the other methods ranged from a low of 0.831 to a high of 0.851. Our study shows that regression trees do not perform as well as logistic regression for predicting mortality following AMI. However, the logistic regression model had performance comparable to that of more flexible, data-driven models such as GAMs and MARS. Copyright 2006 John Wiley & Sons, Ltd.

Entities:  

Mesh:

Year:  2007        PMID: 17186501     DOI: 10.1002/sim.2770

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  43 in total

1.  Boosted classification trees result in minor to modest improvement in the accuracy in classifying cardiovascular outcomes compared to conventional classification trees.

Authors:  Peter C Austin; Douglas S Lee
Journal:  Am J Cardiovasc Dis       Date:  2011-04-23

2.  Type I error rates, coverage of confidence intervals, and variance estimation in propensity-score matched analyses.

Authors:  Peter C Austin
Journal:  Int J Biostat       Date:  2009-04-14       Impact factor: 0.968

3.  Logic Forest: an ensemble classifier for discovering logical combinations of binary markers.

Authors:  Bethany J Wolf; Elizabeth G Hill; Elizabeth H Slate
Journal:  Bioinformatics       Date:  2010-07-13       Impact factor: 6.937

4.  Impact of Anesthetic Predictors on Postpartum Hospital Length of Stay and Adverse Events Following Cesarean Delivery: A Retrospective Study in 840 Consecutive Parturients.

Authors:  Ting Ting Oh; Colleen G Martel; Allison G Clark; Melissa B Russo; Bobby D Nossaman
Journal:  Ochsner J       Date:  2015

5.  Co-occurring risk factors for current cigarette smoking in a U.S. nationally representative sample.

Authors:  Stephen T Higgins; Allison N Kurti; Ryan Redner; Thomas J White; Diana R Keith; Diann E Gaalema; Brian L Sprague; Cassandra A Stanton; Megan E Roberts; Nathan J Doogan; Jeff S Priest
Journal:  Prev Med       Date:  2016-02-21       Impact factor: 4.018

6.  Discovery proteomics and nonparametric modeling pipeline in the development of a candidate biomarker panel for dengue hemorrhagic fever.

Authors:  Allan R Brasier; Josefina Garcia; John E Wiktorowicz; Heidi M Spratt; Guillermo Comach; Hyunsu Ju; Adrian Recinos; Kizhake Soman; Brett M Forshey; Eric S Halsey; Patrick J Blair; Claudio Rocha; Isabel Bazan; Sundar S Victor; Zheng Wu; Susan Stafford; Douglas Watts; Amy C Morrison; Thomas W Scott; Tadeusz J Kochel
Journal:  Clin Transl Sci       Date:  2012-02-23       Impact factor: 4.689

7.  Using methods from the data-mining and machine-learning literature for disease classification and prediction: a case study examining classification of heart failure subtypes.

Authors:  Peter C Austin; Jack V Tu; Jennifer E Ho; Daniel Levy; Douglas S Lee
Journal:  J Clin Epidemiol       Date:  2013-02-04       Impact factor: 6.437

8.  Predicting Survival From Large Echocardiography and Electronic Health Record Datasets: Optimization With Machine Learning.

Authors:  Manar D Samad; Alvaro Ulloa; Gregory J Wehner; Linyuan Jing; Dustin Hartzel; Christopher W Good; Brent A Williams; Christopher M Haggerty; Brandon K Fornwalt
Journal:  JACC Cardiovasc Imaging       Date:  2018-06-13

9.  Classification and clustering analysis of pyruvate dehydrogenase enzyme based on their physicochemical properties.

Authors:  Amit Kumar Banerjee; Sunita M; Naveen M; Upadhyayula Suryanarayana Murty
Journal:  Bioinformation       Date:  2010-04-30

10.  Stratification of the severity of critically ill patients with classification trees.

Authors:  Javier Trujillano; Mariona Badia; Luis Serviá; Jaume March; Angel Rodriguez-Pozo
Journal:  BMC Med Res Methodol       Date:  2009-12-09       Impact factor: 4.615

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.