Wei-Syun Hu1,2, Meng-Hsuen Hsieh3, Cheng-Li Lin4. 1. School of Medicine, College of Medicine, China Medical University, Taichung, Taiwan. 2. Division of Cardiovascular Medicine, Department of Medicine, China Medical University Hospital, 2, Yuh-Der Road, Taichung, Taiwan. 3. Department of Electrical Engineering and Computer Sciences, College of Engineering, University of California, Berkeley, CA, USA. 4. Management Office for Health Data, China Medical University Hospital, Taichung, Taiwan.
Abstract
AIMS: We aimed to construct a random forest model to predict atrial fibrillation (AF) in Chinese population. METHODS AND RESULTS: This study was comprised of 682 237 subjects with or without AF. Each subject had 19 features that included the subjects' age, gender, underlying diseases, CHA2DS2-VASc score, and follow-up period. The data were split into train and test sets at an approximate 9:1 ratio: 614 013 data points were placed into the train set and 68 224 data points were placed into the test set. In this study, weighted average F1, precision, and recall values were used to measure prediction model performance. The F1, precision, and recall values were calculated across the train set, the test set, and all data. The area under receiving operating characteristic (ROC) curve was also used to evaluate the performance of the prediction model. The prediction model achieved a k-fold cross-validation accuracy of 0.979 (k = 10). In the test set, the prediction model achieved an F1 value of 0.968, precision value of 0.958, and recall value of 0.979. The area under ROC curve of the model was 0.948 (95% confidence interval 0.947-0.949). This model was validated with a separate dataset. CONCLUSIONS: This study showed a novel AF risk prediction scheme for Chinese individuals with random forest model methodology. Published on behalf of the European Society of Cardiology. All rights reserved.
AIMS: We aimed to construct a random forest model to predict atrial fibrillation (AF) in Chinese population. METHODS AND RESULTS: This study was comprised of 682 237 subjects with or without AF. Each subject had 19 features that included the subjects' age, gender, underlying diseases, CHA2DS2-VASc score, and follow-up period. The data were split into train and test sets at an approximate 9:1 ratio: 614 013 data points were placed into the train set and 68 224 data points were placed into the test set. In this study, weighted average F1, precision, and recall values were used to measure prediction model performance. The F1, precision, and recall values were calculated across the train set, the test set, and all data. The area under receiving operating characteristic (ROC) curve was also used to evaluate the performance of the prediction model. The prediction model achieved a k-fold cross-validation accuracy of 0.979 (k = 10). In the test set, the prediction model achieved an F1 value of 0.968, precision value of 0.958, and recall value of 0.979. The area under ROC curve of the model was 0.948 (95% confidence interval 0.947-0.949). This model was validated with a separate dataset. CONCLUSIONS: This study showed a novel AF risk prediction scheme for Chinese individuals with random forest model methodology. Published on behalf of the European Society of Cardiology. All rights reserved.
Authors: Jens Cosedis Nielsen; Yenn-Jiang Lin; Marcio Jansen de Oliveira Figueiredo; Alireza Sepehri Shamloo; Alberto Alfie; Serge Boveda; Nikolaos Dagres; Dario Di Toro; Lee L Eckhardt; Kenneth Ellenbogen; Carina Hardy; Takanori Ikeda; Aparna Jaswal; Elizabeth Kaufman; Andrew Krahn; Kengo Kusano; Valentina Kutyifa; Han S Lim; Gregory Y H Lip; Santiago Nava-Townsend; Hui-Nam Pak; Gerardo Rodríguez Diez; William Sauer; Anil Saxena; Jesper Hastrup Svendsen; Diego Vanegas; Marmar Vaseghi; Arthur Wilde; T Jared Bunch; Alfred E Buxton; Gonzalo Calvimontes; Tze-Fan Chao; Lars Eckardt; Heidi Estner; Anne M Gillis; Rodrigo Isa; Josef Kautzner; Philippe Maury; Joshua D Moss; Gi-Byung Nam; Brian Olshansky; Luis Fernando Pava Molano; Mauricio Pimentel; Mukund Prabhu; Wendy S Tzou; Philipp Sommer; Janice Swampillai; Alejandro Vidal; Thomas Deneke; Gerhard Hindricks; Christophe Leclercq Journal: Europace Date: 2020-08-01 Impact factor: 5.214
Authors: Ramesh Nadarajah; Jianhua Wu; Alejandro F Frangi; David Hogg; Campbell Cowan; Chris P Gale Journal: Eur Heart J Qual Care Clin Outcomes Date: 2022-06-06
Authors: Cheuk To Chung; George Bazoukis; Sharen Lee; Ying Liu; Tong Liu; Konstantinos P Letsas; Antonis A Armoundas; Gary Tse Journal: Int J Arrhythmia Date: 2022-04-01
Authors: Ramesh Nadarajah; Eman Alsaeed; Ben Hurdus; Suleman Aktaa; David Hogg; Matthew G D Bates; Campbel Cowan; Jianhua Wu; Chris P Gale Journal: Heart Date: 2022-06-10 Impact factor: 7.365