| Literature DB >> 31438086 |
Maxim Topaz1,2, Ludmila Murga3, Ofrit Bar-Bachar3, Kenrick Cato1, Sarah Collins1,4.
Abstract
We applied an open source natural language processing (NLP) system "NimbleMiner" to identify clinical notes with mentions of alcohol and substance abuse. NimbleMiner allows users to rapidly discover clinical vocabularies (using word embedding model) and then implement machine learning for text classification. We used a large inpatient dataset with over 50,000 intensive care unit admissions (MIMIC II). Clinical notes included physician-written discharge summaries (n = 51,201) and nursing notes (n = 412,343). We first used physician-written discharge summaries to train the system's algorithm and then added nursing notes to the physician-written discharge summaries and evaluated algorithms prediction accuracy. Adding nursing notes to the physician-written discharge summaries resulted in almost two-fold vocabulary expansion. NimbleMiner slightly outperformed other state-of-the-art NLP systems (average F-score = .84), while requiring significantly less time for the algorithms development.: Our findings underline the importance of nursing data for the analysis of electronic patient records.Entities:
Keywords: Alcoholism; Nursing informatics; Substance-Related disorders
Mesh:
Year: 2019 PMID: 31438086 DOI: 10.3233/SHTI190386
Source DB: PubMed Journal: Stud Health Technol Inform ISSN: 0926-9630