Literature DB >> 21890871

Development of an optical character recognition pipeline for handwritten form fields from an electronic health record.

Luke V Rasmussen1, Peggy L Peissig, Catherine A McCarty, Justin Starren.   

Abstract

BACKGROUND: Although the penetration of electronic health records is increasing rapidly, much of the historical medical record is only available in handwritten notes and forms, which require labor-intensive, human chart abstraction for some clinical research. The few previous studies on automated extraction of data from these handwritten notes have focused on monolithic, custom-developed recognition systems or third-party systems that require proprietary forms.
METHODS: We present an optical character recognition processing pipeline, which leverages the capabilities of existing third-party optical character recognition engines, and provides the flexibility offered by a modular custom-developed system. The system was configured and run on a selected set of form fields extracted from a corpus of handwritten ophthalmology forms. OBSERVATIONS: The processing pipeline allowed multiple configurations to be run, with the optimal configuration consisting of the Nuance and LEADTOOLS engines running in parallel with a positive predictive value of 94.6% and a sensitivity of 13.5%. DISCUSSION: While limitations exist, preliminary experience from this project yielded insights on the generalizability and applicability of integrating multiple, inexpensive general-purpose third-party optical character recognition engines in a modular pipeline.

Entities:  

Mesh:

Year:  2011        PMID: 21890871      PMCID: PMC3392858          DOI: 10.1136/amiajnl-2011-000182

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  5 in total

1.  A modern optical character recognition system in a real world clinical setting: some accuracy and feasibility observations.

Authors:  Paul G Biondich; J Marc Overhage; Paul R Dexter; Stephen M Downs; Larry Lemmon; Clement J McDonald
Journal:  Proc AMIA Symp       Date:  2002

2.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors:  Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

3.  Hybrid data capture for monitoring patients on highly active antiretroviral therapy (HAART) in urban Botswana.

Authors:  Hermann Bussmann; C William Wester; Ndwapi Ndwapi; Chris Vanderwarker; Tendani Gaolathe; Geoffrey Tirelo; Ava Avalos; Howard Moffat; Richard G Marlink
Journal:  Bull World Health Organ       Date:  2006-02-23       Impact factor: 9.408

4.  Electronic medical records for genetic research: results of the eMERGE consortium.

Authors:  Abel N Kho; Jennifer A Pacheco; Peggy L Peissig; Luke Rasmussen; Katherine M Newton; Noah Weston; Paul K Crane; Jyotishman Pathak; Christopher G Chute; Suzette J Bielinski; Iftikhar J Kullo; Rongling Li; Teri A Manolio; Rex L Chisholm; Joshua C Denny
Journal:  Sci Transl Med       Date:  2011-04-20       Impact factor: 17.956

5.  Use of document image processing in cancer registration: how and why?

Authors:  G Titlestad
Journal:  Medinfo       Date:  1995
  5 in total
  19 in total

1.  Using Electronic Health Records To Generate Phenotypes For Research.

Authors:  Sarah A Pendergrass; Dana C Crawford
Journal:  Curr Protoc Hum Genet       Date:  2018-12-05

2.  Importance of multi-modal approaches to effectively identify cataract cases from electronic health records.

Authors:  Peggy L Peissig; Luke V Rasmussen; Richard L Berg; James G Linneman; Catherine A McCarty; Carol Waudby; Lin Chen; Joshua C Denny; Russell A Wilke; Jyotishman Pathak; David Carrell; Abel N Kho; Justin B Starren
Journal:  J Am Med Inform Assoc       Date:  2012 Mar-Apr       Impact factor: 4.497

3.  Genetically-guided algorithm development and sample size optimization for age-related macular degeneration cases and controls in electronic health records from the VA Million Veteran Program.

Authors:  Christopher W Halladay; Tamer Hadi; Matthew D Anger; Paul B Greenberg; Jack M Sullivan; P Eric Konicki; Neal S Peachey; Robert P Igo; Sudha K Iyengar; Wen-Chih Wu; Dana C Crawford
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2019-05-06

Review 4.  Multiscale integration of -omic, imaging, and clinical data in biomedical informatics.

Authors:  John H Phan; Chang F Quo; Chihwen Cheng; May Dongmei Wang
Journal:  IEEE Rev Biomed Eng       Date:  2012

5.  Building Structured Personal Health Records from Photographs of Printed Medical Records.

Authors:  Xiang Li; Gang Hu; Xiaofei Teng; Guotong Xie
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

Review 6.  The electronic health record for translational research.

Authors:  Luke V Rasmussen
Journal:  J Cardiovasc Transl Res       Date:  2014-07-29       Impact factor: 4.132

7.  Searching the PDF Haystack: Automated Knowledge Discovery in Scanned EHR Documents.

Authors:  Alexander L Kostrinsky-Thomas; Fuki M Hisama; Thomas H Payne
Journal:  Appl Clin Inform       Date:  2021-03-24       Impact factor: 2.342

8.  Chapter 13: Mining electronic health records in the genomics era.

Authors:  Joshua C Denny
Journal:  PLoS Comput Biol       Date:  2012-12-27       Impact factor: 4.475

9.  Cataract research using electronic health records.

Authors:  Carol J Waudby; Richard L Berg; James G Linneman; Luke V Rasmussen; Peggy L Peissig; Lin Chen; Catherine A McCarty
Journal:  BMC Ophthalmol       Date:  2011-11-11       Impact factor: 2.209

10.  Clinical research informatics: a conceptual perspective.

Authors:  Michael G Kahn; Chunhua Weng
Journal:  J Am Med Inform Assoc       Date:  2012-04-20       Impact factor: 4.497

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.