Joseph S Redman1, Yamini Natarajan2,3, Jason K Hou1,4, Jingqi Wang5, Muzammil Hanif4, Hua Feng4, Jennifer R Kramer4, Roxanne Desiderio4, Hua Xu5, Hashem B El-Serag1,4, Fasiha Kanwal1,4. 1. Baylor College of Medicine, Houston, TX, USA. 2. Baylor College of Medicine, Houston, TX, USA. ynataraj@bcm.edu. 3. Clinical Epidemiology and Comparative Effectiveness Program, Center for Innovations in Quality, Effectiveness and Safety, Michael E. Debakey VA Medical Center, John P. McGovern Campus, 2450 Holcombe Blvd., Suite 01Y, Houston, TX, 77021, USA. ynataraj@bcm.edu. 4. Clinical Epidemiology and Comparative Effectiveness Program, Center for Innovations in Quality, Effectiveness and Safety, Michael E. Debakey VA Medical Center, John P. McGovern Campus, 2450 Holcombe Blvd., Suite 01Y, Houston, TX, 77021, USA. 5. School of Biomedical Informatics, UT Health, Houston, TX, USA.
Abstract
INTRODUCTION: Natural language processing is a powerful technique of machine learning capable of maximizing data extraction from complex electronic medical records. METHODS: We utilized this technique to develop algorithms capable of "reading" full-text radiology reports to accurately identify the presence of fatty liver disease. Abdominal ultrasound, computerized tomography, and magnetic resonance imaging reports were retrieved from the Veterans Affairs Corporate Data Warehouse from a random national sample of 652 patients. Radiographic fatty liver disease was determined by manual review by two physicians and verified with an expert radiologist. A split validation method was utilized for algorithm development. RESULTS: For all three imaging modalities, the algorithms could identify fatty liver disease with >90% recall and precision, with F-measures >90%. DISCUSSION: These algorithms could be used to rapidly screen patient records to establish a large cohort to facilitate epidemiological and clinical studies and examine the clinic course and outcomes of patients with radiographic hepatic steatosis.
INTRODUCTION: Natural language processing is a powerful technique of machine learning capable of maximizing data extraction from complex electronic medical records. METHODS: We utilized this technique to develop algorithms capable of "reading" full-text radiology reports to accurately identify the presence of fatty liver disease. Abdominal ultrasound, computerized tomography, and magnetic resonance imaging reports were retrieved from the Veterans Affairs Corporate Data Warehouse from a random national sample of 652 patients. Radiographic fatty liver disease was determined by manual review by two physicians and verified with an expert radiologist. A split validation method was utilized for algorithm development. RESULTS: For all three imaging modalities, the algorithms could identify fatty liver disease with >90% recall and precision, with F-measures >90%. DISCUSSION: These algorithms could be used to rapidly screen patient records to establish a large cohort to facilitate epidemiological and clinical studies and examine the clinic course and outcomes of patients with radiographic hepatic steatosis.
Entities:
Keywords:
Electronic health records; Epidemiology; Fatty liver; Natural language processing; Nonalcoholic fatty liver disease; Triglycerides
Authors: Andrew Redd; Marjorie Carter; Guy Divita; Shuying Shen; Miland Palmer; Matthew Samore; Adi V Gundlapalli Journal: Stud Health Technol Inform Date: 2014
Authors: Timothy D Imler; Justin Morea; Charles Kahi; Eric A Sherer; Jon Cardwell; Cynthia S Johnson; Huiping Xu; Dennis Ahnen; Fadi Antaki; Christopher Ashley; Gyorgy Baffy; Ilseung Cho; Jason Dominitz; Jason Hou; Mark Korsten; Anil Nagar; Kittichai Promrat; Douglas Robertson; Sameer Saini; Amandeep Shergill; Walter Smalley; Thomas F Imperiale Journal: Am J Gastroenterol Date: 2015-03-10 Impact factor: 10.864
Authors: Mariana Lazo; Ruben Hernaez; Mark S Eberhardt; Susanne Bonekamp; Ihab Kamel; Eliseo Guallar; Ayman Koteish; Frederick L Brancati; Jeanne M Clark Journal: Am J Epidemiol Date: 2013-05-23 Impact factor: 4.897
Authors: Jason K Hou; Mimi Chang; Thien Nguyen; Jennifer R Kramer; Peter Richardson; Shubhada Sansgiry; Leonard W D'Avolio; Hashem B El-Serag Journal: Dig Dis Sci Date: 2012-10-21 Impact factor: 3.199
Authors: Tielman T Van Vleck; Lili Chan; Steven G Coca; Catherine K Craven; Ron Do; Stephen B Ellis; Joseph L Kannry; Ruth J F Loos; Peter A Bonis; Judy Cho; Girish N Nadkarni Journal: Int J Med Inform Date: 2019-07-06 Impact factor: 4.046
Authors: Dawn M Bravata; Jessica M Coffing; Devan Kansagara; Jennifer Myers; Lauren Murphy; Barbara J Homoya; Anthony J Perkins; Kathryn Snow; Jacquelyn A Quin; Ying Zhang; Laura J Myers Journal: JAMA Surg Date: 2019-02-20 Impact factor: 14.766
Authors: Fasiha Kanwal; Jennifer R Kramer; Srikar Mapakshi; Yamini Natarajan; Maneerat Chayanupatkul; Peter A Richardson; Liang Li; Roxanne Desiderio; Aaron P Thrift; Steven M Asch; Jinna Chu; Hashem B El-Serag Journal: Gastroenterology Date: 2018-08-23 Impact factor: 22.682
Authors: Arlene Casey; Emma Davidson; Michael Poon; Hang Dong; Daniel Duma; Andreas Grivas; Claire Grover; Víctor Suárez-Paniagua; Richard Tobin; William Whiteley; Honghan Wu; Beatrice Alex Journal: BMC Med Inform Decis Mak Date: 2021-06-03 Impact factor: 2.796
Authors: Carlos R Oliveira; Patrick Niccolai; Anette Michelle Ortiz; Sangini S Sheth; Eugene D Shapiro; Linda M Niccolai; Cynthia A Brandt Journal: JMIR Med Inform Date: 2020-11-03