Geoffrey H Tison1, Robert Avram2, Gregory Nah2, Liviu Klein2, Barbara V Howard3, Matthew A Allison4, Ramon Casanova5, Rachael H Blair6, Khadijah Breathett7, Randi E Foraker8, Jeffrey E Olgin2, Nisha I Parikh2. 1. Division of Cardiology, Department of Medicine, University of California San Francisco, San Francisco, California, USA; Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, California, USA. Electronic address: Geoff.tison@ucsf.edu. 2. Division of Cardiology, Department of Medicine, University of California San Francisco, San Francisco, California, USA. 3. Medstar Health Research Institute and Georgetown/Howard Universities Center for Clinical and Translational Research, Washington DC, USA. 4. Division of Family Medicine and Public Health, University of California, San Diego, San Diego, California, USA. 5. Wake Forest School of Medicine, Winston-Salem, North Carolina, USA. 6. State University of New York at Buffalo, Buffalo, New York, USA. 7. Division of Cardiovascular Medicine, Department of Medicine, University of Arizona, Tucson Arizona, USA. 8. Washington University in St. Louis School of Medicine, St. Louis, Missouri, USA.
Abstract
BACKGROUND: Heart failure (HF) is a leading cause of cardiac morbidity among women, whose risk factors differ from those in men. We used machine-learning approaches to develop risk- prediction models for incident HF in a cohort of postmenopausal women from the Women's Health Initiative (WHI). METHODS: We used 2 machine-learning methods-Least Absolute Shrinkage and Selection Operator (LASSO) and Classification and Regression Trees (CART)-to perform variable selection on 1227 baseline WHI variables for the primary outcome of incident HF. These variables were then used to construct separate Cox proportional hazard models, and we compared these results, using receiver-operating characteristic (ROC) curve analysis, against a comparator model built using variables from the Atherosclerosis Risk in Communities (ARIC) HF prediction model. We analyzed 43,709 women who had 2222 incident HF events; median follow-up was 14.3 years. RESULTS: LASSO selected 10 predictors, and CART selected 11 predictors. The highest correlation between selected variables was 0.46. In addition to selecting well-established predictors such as age, myocardial infarction, and smoking, novel predictors included physical function, number of pregnancies, number of previous live births and age at menopause. In ROC analysis, the CART-derived model had the highest C-statistic of 0.83 (95% confidence interval [CI], 0.81-0.85), followed by LASSO 0.82 (95% CI, 0.81-0.84) and ARIC 0.73 (95% CI, 0.70-0.76). CONCLUSIONS: Machine-learning approaches can be used to develop HF risk-prediction models that can have better discrimination compared with an established HF risk model and may provide a basis for investigating novel HF predictors.
BACKGROUND: Heart failure (HF) is a leading cause of cardiac morbidity among women, whose risk factors differ from those in men. We used machine-learning approaches to develop risk- prediction models for incident HF in a cohort of postmenopausal women from the Women's Health Initiative (WHI). METHODS: We used 2 machine-learning methods-Least Absolute Shrinkage and Selection Operator (LASSO) and Classification and Regression Trees (CART)-to perform variable selection on 1227 baseline WHI variables for the primary outcome of incident HF. These variables were then used to construct separate Cox proportional hazard models, and we compared these results, using receiver-operating characteristic (ROC) curve analysis, against a comparator model built using variables from the Atherosclerosis Risk in Communities (ARIC) HF prediction model. We analyzed 43,709 women who had 2222 incident HF events; median follow-up was 14.3 years. RESULTS: LASSO selected 10 predictors, and CART selected 11 predictors. The highest correlation between selected variables was 0.46. In addition to selecting well-established predictors such as age, myocardial infarction, and smoking, novel predictors included physical function, number of pregnancies, number of previous live births and age at menopause. In ROC analysis, the CART-derived model had the highest C-statistic of 0.83 (95% confidence interval [CI], 0.81-0.85), followed by LASSO 0.82 (95% CI, 0.81-0.84) and ARIC 0.73 (95% CI, 0.70-0.76). CONCLUSIONS: Machine-learning approaches can be used to develop HF risk-prediction models that can have better discrimination compared with an established HF risk model and may provide a basis for investigating novel HF predictors.
Authors: Donna R Parker; Bing Lu; Megan Sands-Lincoln; Candyce H Kroenke; Cathy C Lee; Mary O'Sullivan; Hannah L Park; Nisha Parikh; Robert S Schenken; Charles B Eaton Journal: Ann Fam Med Date: 2014-07 Impact factor: 5.166
Authors: Suveen Angraal; Bobak J Mortazavi; Aakriti Gupta; Rohan Khera; Tariq Ahmad; Nihar R Desai; Daniel L Jacoby; Frederick A Masoudi; John A Spertus; Harlan M Krumholz Journal: JACC Heart Fail Date: 2019-10-09 Impact factor: 12.035
Authors: Philip S Hall; Gregory Nah; Barbara V Howard; Cora E Lewis; Matthew A Allison; Gloria E Sarto; Molly E Waring; Lisette T Jacobson; JoAnn E Manson; Liviu Klein; Nisha I Parikh Journal: J Am Coll Cardiol Date: 2017-05-23 Impact factor: 24.094
Authors: Michael Green; Jonas Björk; Jakob Forberg; Ulf Ekelund; Lars Edenbrandt; Mattias Ohlsson Journal: Artif Intell Med Date: 2006-09-07 Impact factor: 5.326
Authors: Eric D Adler; Adriaan A Voors; Liviu Klein; Fima Macheret; Oscar O Braun; Marcus A Urey; Wenhong Zhu; Iziah Sama; Matevz Tadel; Claudio Campagnari; Barry Greenberg; Avi Yagil Journal: Eur J Heart Fail Date: 2019-11-12 Impact factor: 15.534
Authors: Bharath Ambale-Venkatesh; Xiaoying Yang; Colin O Wu; Kiang Liu; W Gregory Hundley; Robyn McClelland; Antoinette S Gomes; Aaron R Folsom; Steven Shea; Eliseo Guallar; David A Bluemke; João A C Lima Journal: Circ Res Date: 2017-08-09 Impact factor: 17.367
Authors: Alberto Maino; Bob Siegerink; Ale Algra; Ida Martinelli; Flora Peyvandi; Frits R Rosendaal Journal: Br J Haematol Date: 2016-04-07 Impact factor: 6.998
Authors: Sunil K Agarwal; Lloyd E Chambless; Christie M Ballantyne; Brad Astor; Alain G Bertoni; Patricia P Chang; Aaron R Folsom; Max He; Ron C Hoogeveen; Hanyu Ni; Pedro M Quibrera; Wayne D Rosamond; Stuart D Russell; Eyal Shahar; Gerardo Heiss Journal: Circ Heart Fail Date: 2012-05-15 Impact factor: 8.790
Authors: Matthew W Segar; Byron C Jaeger; Kershaw V Patel; Vijay Nambi; Chiadi E Ndumele; Adolfo Correa; Javed Butler; Alvin Chandra; Colby Ayers; Shreya Rao; Alana A Lewis; Laura M Raffield; Carlos J Rodriguez; Erin D Michos; Christie M Ballantyne; Michael E Hall; Robert J Mentz; James A de Lemos; Ambarish Pandey Journal: Circulation Date: 2021-04-13 Impact factor: 29.690