Tae Joon Jun1, Young-Hak Kim2, Imjin Ahn3, Wonjun Na3, Osung Kwon4, Dong Hyun Yang5, Gyung-Min Park6, Hansle Gwon3, Hee Jun Kang7, Yeon Uk Jeong3, Jungsun Yoo7, Yunha Kim3. 1. Big Data Research Center, Asan Institute for Life Sciences, Asan Medical Center, 88, Olumpicro 43gil, Songpagu, Seoul, 05505, Republic of Korea. taejoon@amc.seoul.kr. 2. Division of Cardiology, Department of Internal Medicine, Asan Medical Center, University of Ulsan College of Medicine, 88, Olumpicro 43gil, Songpagu, Seoul, 05505, Republic of Korea. mdyhkim@amc.seoul.kr. 3. Department of Medical Science, Asan Medical Institute of Convergence Science and Technology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea. 4. Division of Cardiology, Department of Internal Medicine, Eunpyeong St. Mary's Hospital, The Catholic University of Korea, Seoul, Republic of Korea. 5. Department of Radiology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea. 6. Department of Cardiology, Ulsan University Hospital, University of Ulsan College of Medicine, Ulsan, Republic of Korea. 7. Division of Cardiology, Department of Internal Medicine, Asan Medical Center, University of Ulsan College of Medicine, 88, Olumpicro 43gil, Songpagu, Seoul, 05505, Republic of Korea.
Abstract
BACKGROUND: Cardiovascular diseases (CVDs) are difficult to diagnose early and have risk factors that are easy to overlook. Early prediction and personalization of treatment through the use of artificial intelligence (AI) may help clinicians and patients manage CVDs more effectively. However, to apply AI approaches to CVDs data, it is necessary to establish and curate a specialized database based on electronic health records (EHRs) and include pre-processed unstructured data. METHODS: To build a suitable database (CardioNet) for CVDs that can utilize AI technology, contributing to the overall care of patients with CVDs. First, we collected the anonymized records of 748,474 patients who had visited the Asan Medical Center (AMC) or Ulsan University Hospital (UUH) because of CVDs. Second, we set clinically plausible criteria to remove errors and duplication. Third, we integrated unstructured data such as readings of medical examinations with structured data sourced from EHRs to create the CardioNet. We subsequently performed natural language processing to structuralize the significant variables associated with CVDs because most results of the principal CVD-related medical examinations are free-text readings. Additionally, to ensure interoperability for convergent multi-center research, we standardized the data using several codes that correspond to the common data model. Finally, we created the descriptive table (i.e., dictionary of the CardioNet) to simplify access and utilization of data for clinicians and engineers and continuously validated the data to ensure reliability. RESULTS: CardioNet is a comprehensive database that can serve as a training set for AI models and assist in all aspects of clinical management of CVDs. It comprises information extracted from EHRs and results of readings of CVD-related digital tests. It consists of 27 tables, a code-master table, and a descriptive table. CONCLUSIONS: CardioNet database specialized in CVDs was established, with continuing data collection. We are actively supporting multi-center research, which may require further data processing, depending on the subject of the study. CardioNet will serve as the fundamental database for future CVD-related research projects.
BACKGROUND:Cardiovascular diseases (CVDs) are difficult to diagnose early and have risk factors that are easy to overlook. Early prediction and personalization of treatment through the use of artificial intelligence (AI) may help clinicians and patients manage CVDs more effectively. However, to apply AI approaches to CVDs data, it is necessary to establish and curate a specialized database based on electronic health records (EHRs) and include pre-processed unstructured data. METHODS: To build a suitable database (CardioNet) for CVDs that can utilize AI technology, contributing to the overall care of patients with CVDs. First, we collected the anonymized records of 748,474 patients who had visited the Asan Medical Center (AMC) or Ulsan University Hospital (UUH) because of CVDs. Second, we set clinically plausible criteria to remove errors and duplication. Third, we integrated unstructured data such as readings of medical examinations with structured data sourced from EHRs to create the CardioNet. We subsequently performed natural language processing to structuralize the significant variables associated with CVDs because most results of the principal CVD-related medical examinations are free-text readings. Additionally, to ensure interoperability for convergent multi-center research, we standardized the data using several codes that correspond to the common data model. Finally, we created the descriptive table (i.e., dictionary of the CardioNet) to simplify access and utilization of data for clinicians and engineers and continuously validated the data to ensure reliability. RESULTS:CardioNet is a comprehensive database that can serve as a training set for AI models and assist in all aspects of clinical management of CVDs. It comprises information extracted from EHRs and results of readings of CVD-related digital tests. It consists of 27 tables, a code-master table, and a descriptive table. CONCLUSIONS:CardioNet database specialized in CVDs was established, with continuing data collection. We are actively supporting multi-center research, which may require further data processing, depending on the subject of the study. CardioNet will serve as the fundamental database for future CVD-related research projects.
Entities:
Keywords:
Artificial intelligence; Cardiovascular diseases; Database; Electronic health records
Authors: Harvey J Murff; Fern FitzHenry; Michael E Matheny; Nancy Gentry; Kristen L Kotter; Kimberly Crimin; Robert S Dittus; Amy K Rosen; Peter L Elkin; Steven H Brown; Theodore Speroff Journal: JAMA Date: 2011-08-24 Impact factor: 56.272
Authors: George Hripcsak; Jon D Duke; Nigam H Shah; Christian G Reich; Vojtech Huser; Martijn J Schuemie; Marc A Suchard; Rae Woong Park; Ian Chi Kei Wong; Peter R Rijnbeek; Johan van der Lei; Nicole Pratt; G Niklas Norén; Yu-Chuan Li; Paul E Stang; David Madigan; Patrick B Ryan Journal: Stud Health Technol Inform Date: 2015
Authors: Ryan Poplin; Avinash V Varadarajan; Katy Blumer; Yun Liu; Michael V McConnell; Greg S Corrado; Lily Peng; Dale R Webster Journal: Nat Biomed Eng Date: 2018-02-19 Impact factor: 25.671
Authors: Scott Mayer McKinney; Marcin Sieniek; Varun Godbole; Jonathan Godwin; Natasha Antropova; Hutan Ashrafian; Trevor Back; Mary Chesus; Greg S Corrado; Ara Darzi; Mozziyar Etemadi; Florencia Garcia-Vicente; Fiona J Gilbert; Mark Halling-Brown; Demis Hassabis; Sunny Jansen; Alan Karthikesalingam; Christopher J Kelly; Dominic King; Joseph R Ledsam; David Melnick; Hormuz Mostofi; Lily Peng; Joshua Jay Reicher; Bernardino Romera-Paredes; Richard Sidebottom; Mustafa Suleyman; Daniel Tse; Kenneth C Young; Jeffrey De Fauw; Shravya Shetty Journal: Nature Date: 2020-01-01 Impact factor: 49.962
Authors: Bjoern H Menze; Andras Jakab; Stefan Bauer; Jayashree Kalpathy-Cramer; Keyvan Farahani; Justin Kirby; Yuliya Burren; Nicole Porz; Johannes Slotboom; Roland Wiest; Levente Lanczi; Elizabeth Gerstner; Marc-André Weber; Tal Arbel; Brian B Avants; Nicholas Ayache; Patricia Buendia; D Louis Collins; Nicolas Cordier; Jason J Corso; Antonio Criminisi; Tilak Das; Hervé Delingette; Çağatay Demiralp; Christopher R Durst; Michel Dojat; Senan Doyle; Joana Festa; Florence Forbes; Ezequiel Geremia; Ben Glocker; Polina Golland; Xiaotao Guo; Andac Hamamci; Khan M Iftekharuddin; Raj Jena; Nigel M John; Ender Konukoglu; Danial Lashkari; José Antonió Mariz; Raphael Meier; Sérgio Pereira; Doina Precup; Stephen J Price; Tammy Riklin Raviv; Syed M S Reza; Michael Ryan; Duygu Sarikaya; Lawrence Schwartz; Hoo-Chang Shin; Jamie Shotton; Carlos A Silva; Nuno Sousa; Nagesh K Subbanna; Gabor Szekely; Thomas J Taylor; Owen M Thomas; Nicholas J Tustison; Gozde Unal; Flor Vasseur; Max Wintermark; Dong Hye Ye; Liang Zhao; Binsheng Zhao; Darko Zikic; Marcel Prastawa; Mauricio Reyes; Koen Van Leemput Journal: IEEE Trans Med Imaging Date: 2014-12-04 Impact factor: 10.048
Authors: Todd C Hollon; Balaji Pandian; Arjun R Adapa; Esteban Urias; Akshay V Save; Siri Sahib S Khalsa; Daniel G Eichberg; Randy S D'Amico; Zia U Farooq; Spencer Lewis; Petros D Petridis; Tamara Marie; Ashish H Shah; Hugh J L Garton; Cormac O Maher; Jason A Heth; Erin L McKean; Stephen E Sullivan; Shawn L Hervey-Jumper; Parag G Patil; B Gregory Thompson; Oren Sagher; Guy M McKhann; Ricardo J Komotar; Michael E Ivan; Matija Snuderl; Marc L Otten; Timothy D Johnson; Michael B Sisti; Jeffrey N Bruce; Karin M Muraszko; Jay Trautman; Christian W Freudiger; Peter Canoll; Honglak Lee; Sandra Camelo-Piragua; Daniel A Orringer Journal: Nat Med Date: 2020-01-06 Impact factor: 53.440
Authors: Chloé Sieber; Deborah Chiavi; Christina Haag; Marco Kaufmann; Andrea B Horn; Holger Dressel; Chiara Zecca; Pasquale Calabrese; Caroline Pot; Christian Philipp Kamm; Viktor von Wyl Journal: JMIR Mhealth Uhealth Date: 2022-10-05 Impact factor: 4.947
Authors: Haijiang Dai; Arwa Younis; Jude Dzevela Kong; Luca Puce; Georges Jabbour; Hong Yuan; Nicola Luigi Bragazzi Journal: Front Cardiovasc Med Date: 2022-04-01