Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Automatic classification of scanned electronic health record documents.

Literature DB >> 33091829

Automatic classification of scanned electronic health record documents.

Heath Goodrum¹, Kirk Roberts¹, Elmer V Bernstam².

Abstract

OBJECTIVES: Electronic Health Records (EHRs) contain scanned documents from a variety of sources such as identification cards, radiology reports, clinical correspondence, and many other document types. We describe the distribution of scanned documents at one health institution and describe the design and evaluation of a system to categorize documents into clinically relevant and non-clinically relevant categories as well as further sub-classifications. Our objective is to demonstrate that text classification systems can accurately classify scanned documents.
METHODS: We extracted text using Optical Character Recognition (OCR). We then created and evaluated multiple text classification machine learning models, including both "bag of words" and deep learning approaches. We evaluated the system on three different levels of classification using both the entire document as input, as well as the individual pages of the document. Finally, we compared the effects of different text processing methods.
RESULTS: A deep learning model using ClinicalBERT performed best. This model distinguished between clinically-relevant documents and not clinically-relevant documents with an accuracy of 0.973; between intermediate sub-classifications with an accuracy of 0.949; and between individual classes with an accuracy of 0.913. DISCUSSION: Within the EHR, some document categories such as "external medical records" may contain hundreds of scanned pages without clear document boundaries. Without further sub-classification, clinicians must view every page or risk missing clinically-relevant information. Machine learning can automatically classify these scanned documents to reduce clinician burden.
CONCLUSION: Using machine learning applied to OCR-extracted text has the potential to accurately identify clinically-relevant scanned content within EHRs.

Keywords: Classification; Electronic health records; Machine learning; Optical character recognition; Patient safety; Scanned documents

Mesh：

Year: 2020 PMID： 33091829 PMCID： PMC7731898 DOI： 10.1016/j.ijmedinf.2020.104302

Source DB: PubMed Journal: Int J Med Inform ISSN： 1386-5056 Impact factor: 4.046

14 in total

3. A Smartphone App to Increase Immunizations in the Pediatric Solid Organ Transplant Population: Development and Initial Usability Study.

Authors: Amy G Feldman; Susan Moore; Sheana Bull; Megan A Morris; Kumanan Wilson; Cameron Bell; Margaret M Collins; Kathryn M Denize; Allison Kempe
Journal: JMIR Form Res Date: 2022-01-13

3 in total

Automatic classification of scanned electronic health record documents.

1. A frequency-based technique to improve the spelling suggestion rank in medical queries.

2. Practice brief. Document imaging as a bridge to the EHR.

3. Is document imaging the right choice for your organization?

4. Note on the sampling error of the difference between correlated proportions or percentages.

5. A typology of electronic health record workarounds in small-to-medium size primary care practices.

6. An Ensemble Method for Spelling Correction in Consumer Health Questions.

7. Dermatologist-level classification of skin cancer with deep neural networks.

8. CLUSTERING AND PRIORITIZING PATIENT SAFETY ISSUES DURING EHR IMPLEMENTATION AND UPGRADES IN HOSPITAL SETTINGS.

9. MIMIC-III, a freely accessible critical care database.

10. Detecting and classifying lesions in mammograms with Deep Learning.

1. Deep learning-based NLP data pipeline for EHR-scanned document information extraction.

2. Searching the PDF Haystack: Automated Knowledge Discovery in Scanned EHR Documents.

3. A Smartphone App to Increase Immunizations in the Pediatric Solid Organ Transplant Population: Development and Initial Usability Study.