Sungrim Moon1, Sijia Liu1, Christopher G Scott2, Sujith Samudrala3, Mohamed M Abidian3, Jeffrey B Geske3, Peter A Noseworthy3, Jane L Shellum4, Rajeev Chaudhry5, Steve R Ommen3, Rick A Nishimura3, Hongfang Liu1, Adelaide M Arruda-Olson6. 1. Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA. 2. Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA. 3. Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, USA. 4. Robert and Patricia Kern Center for Science of Health Care Delivery, Mayo Clinic, Rochester, MN, USA. 5. Robert and Patricia Kern Center for Science of Health Care Delivery, Mayo Clinic, Rochester, MN, USA; Division of Community Internal Medicine, Mayo Clinic, Rochester, MN, USA. 6. Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA; Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, USA. Electronic address: ArrudaOlson.Adelaide@mayo.edu.
Abstract
BACKGROUND: The management of hypertrophic cardiomyopathy (HCM) patients requires the knowledge of risk factors associated with sudden cardiac death (SCD). SCD risk factors such as syncope and family history of SCD (FH-SCD) as well as family history of HCM (FH-HCM) are documented in electronic health records (EHRs) as clinical narratives. Automated extraction of risk factors from clinical narratives by natural language processing (NLP) may expedite management workflow of HCM patients. The aim of this study was to develop and deploy NLP algorithms for automated extraction of syncope, FH-SCD, and FH-HCM from clinical narratives. METHODS AND RESULTS: We randomly selected 200 patients from the Mayo HCM registry for development (n = 100) and testing (n = 100) of NLP algorithms for extraction of syncope, FH-SCD as well as FH-HCM from clinical narratives of EHRs. The clinical reference standard was manually abstracted by 2 independent annotators. Performance of NLP algorithms was compared to aggregation and summarization of data entries in the HCM registry for syncope, FH-SCD, and FH-HCM. We also compared the NLP algorithms with billing codes for syncope as well as responses to patient survey questions for FH-SCD and FH-HCM. These analyses demonstrated NLP had superior sensitivity (0.96 vs 0.39, p < 0.001) and comparable specificity (0.90 vs 0.92, p = 0.74) and PPV (0.90 vs 0.83, p = 0.37) compared to billing codes for syncope. For FH-SCD, NLP outperformed survey responses for all parameters (sensitivity: 0.91 vs 0.59, p = 0.002; specificity: 0.98 vs 0.50, p < 0.001; PPV: 0.97 vs 0.38, p < 0.001). NLP also achieved superior sensitivity (0.95 vs 0.24, p < 0.001) with comparable specificity (0.95 vs 1.0, p-value not calculable) and positive predictive value (PPV) (0.92 vs 1.0, p = 0.09) compared to survey responses for FH-HCM. CONCLUSIONS: Automated extraction of syncope, FH-SCD and FH-HCM using NLP is feasible and has promise to increase efficiency of workflow for providers managing HCM patients.
BACKGROUND: The management of hypertrophic cardiomyopathy (HCM) patients requires the knowledge of risk factors associated with sudden cardiac death (SCD). SCD risk factors such as syncope and family history of SCD (FH-SCD) as well as family history of HCM (FH-HCM) are documented in electronic health records (EHRs) as clinical narratives. Automated extraction of risk factors from clinical narratives by natural language processing (NLP) may expedite management workflow of HCM patients. The aim of this study was to develop and deploy NLP algorithms for automated extraction of syncope, FH-SCD, and FH-HCM from clinical narratives. METHODS AND RESULTS: We randomly selected 200 patients from the Mayo HCM registry for development (n = 100) and testing (n = 100) of NLP algorithms for extraction of syncope, FH-SCD as well as FH-HCM from clinical narratives of EHRs. The clinical reference standard was manually abstracted by 2 independent annotators. Performance of NLP algorithms was compared to aggregation and summarization of data entries in the HCM registry for syncope, FH-SCD, and FH-HCM. We also compared the NLP algorithms with billing codes for syncope as well as responses to patient survey questions for FH-SCD and FH-HCM. These analyses demonstrated NLP had superior sensitivity (0.96 vs 0.39, p < 0.001) and comparable specificity (0.90 vs 0.92, p = 0.74) and PPV (0.90 vs 0.83, p = 0.37) compared to billing codes for syncope. For FH-SCD, NLP outperformed survey responses for all parameters (sensitivity: 0.91 vs 0.59, p = 0.002; specificity: 0.98 vs 0.50, p < 0.001; PPV: 0.97 vs 0.38, p < 0.001). NLP also achieved superior sensitivity (0.95 vs 0.24, p < 0.001) with comparable specificity (0.95 vs 1.0, p-value not calculable) and positive predictive value (PPV) (0.92 vs 1.0, p = 0.09) compared to survey responses for FH-HCM. CONCLUSIONS: Automated extraction of syncope, FH-SCD and FH-HCM using NLP is feasible and has promise to increase efficiency of workflow for providers managing HCM patients.
Authors: Constantinos O'Mahony; Fatima Jichi; Menelaos Pavlou; Lorenzo Monserrat; Aristides Anastasakis; Claudio Rapezzi; Elena Biagini; Juan Ramon Gimeno; Giuseppe Limongelli; William J McKenna; Rumana Z Omar; Perry M Elliott Journal: Eur Heart J Date: 2013-10-14 Impact factor: 29.983
Authors: Win-Kuang Shen; Robert S Sheldon; David G Benditt; Mitchell I Cohen; Daniel E Forman; Zachary D Goldberger; Blair P Grubb; Mohamed H Hamdan; Andrew D Krahn; Mark S Link; Brian Olshansky; Satish R Raj; Roopinder Kaur Sandhu; Dan Sorajja; Benjamin C Sun; Clyde W Yancy Journal: J Am Coll Cardiol Date: 2017-03-09 Impact factor: 24.094
Authors: Majid Rastegar-Mojarad; Sunghwan Sohn; Liwei Wang; Feichen Shen; Troy C Bleeker; William A Cliby; Hongfang Liu Journal: Int J Med Inform Date: 2017-10-10 Impact factor: 4.046
Authors: Yue Wang; Jin Luo; Shiying Hao; Haihua Xu; Andrew Young Shin; Bo Jin; Rui Liu; Xiaohong Deng; Lijuan Wang; Le Zheng; Yifan Zhao; Chunqing Zhu; Zhongkai Hu; Changlin Fu; Yanpeng Hao; Yingzhen Zhao; Yunliang Jiang; Dorothy Dai; Devore S Culver; Shaun T Alfreds; Rogow Todd; Frank Stearns; Karl G Sylvester; Eric Widen; Xuefeng B Ling Journal: Int J Med Inform Date: 2015-07-02 Impact factor: 4.046
Authors: Mar Pujades-Rodriguez; Oliver P Guttmann; Arturo Gonzalez-Izquierdo; Bram Duyx; Constantinos O'Mahony; Perry Elliott; Harry Hemingway Journal: PLoS One Date: 2018-01-11 Impact factor: 3.240
Authors: Hongfang Liu; Suzette J Bielinski; Sunghwan Sohn; Sean Murphy; Kavishwar B Wagholikar; Siddhartha R Jonnalagadda; K E Ravikumar; Stephen T Wu; Iftikhar J Kullo; Christopher G Chute Journal: AMIA Jt Summits Transl Sci Proc Date: 2013-03-18
Authors: Huzefa Bhopalwala; Nakeya Dewaswala; Sijia Liu; Christopher G Scott; James M Welper; Oluwatoyin Akinnusotu; Johan Martijn Bos; Steve R Ommen; Michael J Ackerman; Patricia A Pellikka; Jeffrey B Geske; Peter Noseworthy; Adelaide M Arruda-Olson Journal: Echocardiography Date: 2020-12-16 Impact factor: 1.724
Authors: Chayakrit Krittanawong; Kipp W Johnson; Edward Choi; Scott Kaplin; Eric Venner; Mullai Murugan; Zhen Wang; Benjamin S Glicksberg; Christopher I Amos; Michael C Schatz; W H Wilson Tang Journal: Life (Basel) Date: 2022-02-14