Wonse Kim1,2, Jin Joo Park3, Hae-Young Lee4, Kye Hun Kim5, Byung-Su Yoo6, Seok-Min Kang7, Sang Hong Baek8, Eun-Seok Jeon9, Jae-Joong Kim10, Myeong-Chan Cho11, Shung Chull Chae12, Byung-Hee Oh13, Woong Kook14, Dong-Ju Choi15,16. 1. Department of Mathematical Sciences, Seoul National University, Gwanak Ro 1, Gwanak-Gu, Seoul, Republic of Korea. 2. MetaEyes, 41, Yonsei-ro 5da-gil, Seodaemun-gu, Seoul, Republic of Korea. 3. Cardiovascular Center, Division of Cardiology, Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam, Republic of Korea. 4. Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea. 5. Heart Research Center, Chonnam National University, Gwangju, Republic of Korea. 6. Department of Internal Medicine, Yonsei University Wonju College of Medicine, Wonju, Republic of Korea. 7. Department of Internal Medicine, Yonsei University College of Medicine, Seoul, Republic of Korea. 8. Department of Internal Medicine, the Catholic University of Korea, Seoul, Republic of Korea. 9. Department of Internal Medicine, Sungkyunkwan University College of Medicine, Seoul, Republic of Korea. 10. Department of Internal Medicine, Asan Medical Center, Seoul, Republic of Korea. 11. Department of Internal Medicine, Chungbuk National University College of Medicine, Cheongju, Republic of Korea. 12. Department of Internal Medicine, Kyungpook National University College of Medicine, Daegu, Republic of Korea. 13. Department of Internal Medicine, Mediplex Sejong Hospital, Incheon, Republic of Korea. 14. Department of Mathematical Sciences, Seoul National University, Gwanak Ro 1, Gwanak-Gu, Seoul, Republic of Korea. woongkook@snu.ac.kr. 15. Cardiovascular Center, Division of Cardiology, Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam, Republic of Korea. djchoi@snubh.org. 16. Cardiovascular Center, Division of Cardiology, Department of Internal Medicine, Seoul National University Bundang Hospital, Gumiro 166, Bundang, Gyeonggi-do, Seongnam, Republic of Korea. djchoi@snubh.org.
Abstract
OBJECTIVE: Machine learning (ML) algorithm can improve risk prediction because ML can select features and segment continuous variables effectively unbiased. We generated a risk score model for mortality with ML algorithms in East-Asian patients with heart failure (HF). METHODS: From the Korean Acute Heart Failure (KorAHF) registry, we used the data of 3683 patients with 27 continuous and 44 categorical variables. Grouped Lasso algorithm was used for the feature selection, and a novel continuous variable segmentation algorithm which is based on change-point analysis was developed for effectively segmenting the ranges of the continuous variables. Then, a risk score was assigned to each feature reflecting nonlinear relationship between features and survival times, and an integer score of maximum 100 was calculated for each patient. RESULTS: During 3-year follow-up time, 32.8% patients died. Using grouped Lasso, we identified 15 highly significant independent clinical features. The calculated risk score of each patient ranged between 1 and 71 points with a median of 36 (interquartile range: 27-45). The 3-year survival differed according to the quintiles of the risk score, being 80% and 17% in the 1st and 5th quintile, respectively. In addition, ML risk score had higher AUCs than MAGGIC-HF score to predict 1-year mortality (0.751 vs. 0.711, P < 0.001). CONCLUSIONS: In East-Asian patients with HF, a novel risk score model based on ML and the new continuous variable segmentation algorithm performs better for mortality prediction than conventional prediction models. CLINICAL TRIAL REGISTRATION: Unique identifier: INCT01389843 https://clinicaltrials.gov/ct2/show/NCT01389843 .
OBJECTIVE: Machine learning (ML) algorithm can improve risk prediction because ML can select features and segment continuous variables effectively unbiased. We generated a risk score model for mortality with ML algorithms in East-Asian patients with heart failure (HF). METHODS: From the Korean Acute Heart Failure (KorAHF) registry, we used the data of 3683 patients with 27 continuous and 44 categorical variables. Grouped Lasso algorithm was used for the feature selection, and a novel continuous variable segmentation algorithm which is based on change-point analysis was developed for effectively segmenting the ranges of the continuous variables. Then, a risk score was assigned to each feature reflecting nonlinear relationship between features and survival times, and an integer score of maximum 100 was calculated for each patient. RESULTS: During 3-year follow-up time, 32.8% patients died. Using grouped Lasso, we identified 15 highly significant independent clinical features. The calculated risk score of each patient ranged between 1 and 71 points with a median of 36 (interquartile range: 27-45). The 3-year survival differed according to the quintiles of the risk score, being 80% and 17% in the 1st and 5th quintile, respectively. In addition, ML risk score had higher AUCs than MAGGIC-HF score to predict 1-year mortality (0.751 vs. 0.711, P < 0.001). CONCLUSIONS: In East-Asian patients with HF, a novel risk score model based on ML and the new continuous variable segmentation algorithm performs better for mortality prediction than conventional prediction models. CLINICAL TRIAL REGISTRATION: Unique identifier: INCT01389843 https://clinicaltrials.gov/ct2/show/NCT01389843 .
Authors: Stuart J Pocock; Cono A Ariti; John J V McMurray; Aldo Maggioni; Lars Køber; Iain B Squire; Karl Swedberg; Joanna Dobson; Katrina K Poppe; Gillian A Whalley; Rob N Doughty Journal: Eur Heart J Date: 2012-10-24 Impact factor: 29.983
Authors: Stuart J Pocock; Duolao Wang; Marc A Pfeffer; Salim Yusuf; John J V McMurray; Karl B Swedberg; Jan Ostergren; Eric L Michelson; Karen S Pieper; Christopher B Granger Journal: Eur Heart J Date: 2005-10-11 Impact factor: 29.983
Authors: Christopher M O'Connor; Vic Hasselblad; Rajendra H Mehta; Gudaye Tasissa; Robert M Califf; Mona Fiuzat; Joseph G Rogers; Carl V Leier; Lynne W Stevenson Journal: J Am Coll Cardiol Date: 2010-03-02 Impact factor: 24.094
Authors: Wayne C Levy; Dariush Mozaffarian; David T Linker; Santosh C Sutradhar; Stefan D Anker; Anne B Cropp; Inder Anand; Aldo Maggioni; Paul Burton; Mark D Sullivan; Bertram Pitt; Philip A Poole-Wilson; Douglas L Mann; Milton Packer Journal: Circulation Date: 2006-03-13 Impact factor: 29.690
Authors: Christopher M O'Connor; William T Abraham; Nancy M Albert; Robert Clare; Wendy Gattis Stough; Mihai Gheorghiade; Barry H Greenberg; Clyde W Yancy; James B Young; Gregg C Fonarow Journal: Am Heart J Date: 2008-10 Impact factor: 4.749