Elsie Gyang Ross1, Nigam H Shah2, Ronald L Dalman1, Kevin T Nead3, John P Cooke4, Nicholas J Leeper5. 1. Division of Vascular Surgery, Stanford Health Care, Stanford, Calif. 2. Center for Biomedical Informatics Research, Stanford University, Stanford, Calif. 3. Department of Radiation Oncology, University of Pennsylvania, Philadelphia, Pa. 4. Department of Cardiovascular Sciences, Houston Methodist Research Institute, Houston, Tex; Center for Cardiovascular Regeneration, Houston Methodist DeBakey Heart and Vascular Center, Houston, Tex. 5. Division of Vascular Surgery, Stanford Health Care, Stanford, Calif. Electronic address: nleeper@stanford.edu.
Abstract
OBJECTIVE: A key aspect of the precision medicine effort is the development of informatics tools that can analyze and interpret "big data" sets in an automated and adaptive fashion while providing accurate and actionable clinical information. The aims of this study were to develop machine learning algorithms for the identification of disease and the prognostication of mortality risk and to determine whether such models perform better than classical statistical analyses. METHODS: Focusing on peripheral artery disease (PAD), patient data were derived from a prospective, observational study of 1755 patients who presented for elective coronary angiography. We employed multiple supervised machine learning algorithms and used diverse clinical, demographic, imaging, and genomic information in a hypothesis-free manner to build models that could identify patients with PAD and predict future mortality. Comparison was made to standard stepwise linear regression models. RESULTS: Our machine-learned models outperformed stepwise logistic regression models both for the identification of patients with PAD (area under the curve, 0.87 vs 0.76, respectively; P = .03) and for the prediction of future mortality (area under the curve, 0.76 vs 0.65, respectively; P = .10). Both machine-learned models were markedly better calibrated than the stepwise logistic regression models, thus providing more accurate disease and mortality risk estimates. CONCLUSIONS: Machine learning approaches can produce more accurate disease classification and prediction models. These tools may prove clinically useful for the automated identification of patients with highly morbid diseases for which aggressive risk factor management can improve outcomes.
OBJECTIVE: A key aspect of the precision medicine effort is the development of informatics tools that can analyze and interpret "big data" sets in an automated and adaptive fashion while providing accurate and actionable clinical information. The aims of this study were to develop machine learning algorithms for the identification of disease and the prognostication of mortality risk and to determine whether such models perform better than classical statistical analyses. METHODS: Focusing on peripheral artery disease (PAD), patient data were derived from a prospective, observational study of 1755 patients who presented for elective coronary angiography. We employed multiple supervised machine learning algorithms and used diverse clinical, demographic, imaging, and genomic information in a hypothesis-free manner to build models that could identify patients with PAD and predict future mortality. Comparison was made to standard stepwise linear regression models. RESULTS: Our machine-learned models outperformed stepwise logistic regression models both for the identification of patients with PAD (area under the curve, 0.87 vs 0.76, respectively; P = .03) and for the prediction of future mortality (area under the curve, 0.76 vs 0.65, respectively; P = .10). Both machine-learned models were markedly better calibrated than the stepwise logistic regression models, thus providing more accurate disease and mortality risk estimates. CONCLUSIONS: Machine learning approaches can produce more accurate disease classification and prediction models. These tools may prove clinically useful for the automated identification of patients with highly morbid diseases for which aggressive risk factor management can improve outcomes.
Authors: E E A Arts; C Popa; A A Den Broeder; A G Semb; T Toms; G D Kitas; P L van Riel; J Fransen Journal: Ann Rheum Dis Date: 2014-01-03 Impact factor: 19.103
Authors: A T Hirsch; M H Criqui; D Treat-Jacobson; J G Regensteiner; M A Creager; J W Olin; S H Krook; D B Hunninghake; A J Comerota; M E Walsh; M M McDermott; W R Hiatt Journal: JAMA Date: 2001-09-19 Impact factor: 56.272
Authors: Ewout W Steyerberg; Andrew J Vickers; Nancy R Cook; Thomas Gerds; Mithat Gonen; Nancy Obuchowski; Michael J Pencina; Michael W Kattan Journal: Epidemiology Date: 2010-01 Impact factor: 4.822
Authors: Therese Tillin; Alun D Hughes; Peter Whincup; Jamil Mayet; Naveed Sattar; Paul M McKeigue; Nish Chaturvedi Journal: Heart Date: 2013-11-01 Impact factor: 5.994
Authors: Tesfaye B Mersha; Yashira Afanador; Elisabet Johansson; Steven P Proper; Jonathan A Bernstein; Marc E Rothenberg; Gurjit K Khurana Hershey Journal: Clin Rev Allergy Immunol Date: 2021-04 Impact factor: 8.667
Authors: Tom Alsaigh; Belinda A Di Bartolo; Jocelyne Mulangala; Gemma A Figtree; Nicholas J Leeper Journal: Circ Res Date: 2021-06-10 Impact factor: 23.213