Literature DB >> 30086416

Conditional random fields for clinical named entity recognition: A comparative study using Korean clinical texts.

Wangjin Lee1, Kyungmo Kim2, Eun Young Lee3, Jinwook Choi4.   

Abstract

BACKGROUND: This study demonstrates clinical named entity recognition (NER) methods on the clinical texts of rheumatism patients in South Korea. Despite the recent increase in the adoption rate of the electronic health record (EHR) system in global health institutions, health information technologies for handling and acquisition of information from numerous unstructured texts in the EHR system are still in their developing stages. The aim of this study is to verify the conventional named entity recognition (NER) methods, namely dictionary-lookup-based string matching and conditional random fields (CRFs).
METHODS: We selected discharge summaries for 200 rheumatic patients from the EHR system of the Seoul National University Hospital and attempted to identify heterogeneous semantic types present in the clinical notes of each patient's history.
RESULTS: CRFs outperform string matching in extracting most semantic types (median F1 = 0.761, minimum = 0.705, maximum = 0.906). String matching is found to be better suited for identifying hospital visit information. The performance of both methods is comparable for identifying medications. The 10-fold cross-validation shows that CRFs had median F1 = 0.811 (minimum = 0.752, maximum = 0.918), and exhibited good performance even when trained with simple features.
CONCLUSION: CRFs are a good candidate for implementing clinical NER in Korean clinical narrative documents. Increasing the training data and incorporating sophisticated feature engineering might improve the accuracy of identifying health information, enabling automated patient history summarization in the future.
Copyright © 2018 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Clinical named entity recognition; Conditional random field; Discharge summary; Medical history; String matching

Mesh:

Year:  2018        PMID: 30086416     DOI: 10.1016/j.compbiomed.2018.07.019

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  3 in total

1.  Findings from the 2019 International Medical Informatics Association Yearbook Section on Health Information Management.

Authors:  Meryl Bloomrosen; Eta S Berner
Journal:  Yearb Med Inform       Date:  2019-08-16

2.  Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition.

Authors:  Wangjin Lee; Jinwook Choi
Journal:  BMC Med Inform Decis Mak       Date:  2019-07-15       Impact factor: 2.796

3.  Multi-task learning for Chinese clinical named entity recognition with external knowledge.

Authors:  Ming Cheng; Shufeng Xiong; Fei Li; Pan Liang; Jianbo Gao
Journal:  BMC Med Inform Decis Mak       Date:  2021-12-31       Impact factor: 2.796

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.