Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Conditional random fields for clinical named entity recognition: A comparative study using Korean clinical texts.

Literature DB >> 30086416

Conditional random fields for clinical named entity recognition: A comparative study using Korean clinical texts.

Wangjin Lee¹, Kyungmo Kim², Eun Young Lee³, Jinwook Choi⁴.

Abstract

BACKGROUND: This study demonstrates clinical named entity recognition (NER) methods on the clinical texts of rheumatism patients in South Korea. Despite the recent increase in the adoption rate of the electronic health record (EHR) system in global health institutions, health information technologies for handling and acquisition of information from numerous unstructured texts in the EHR system are still in their developing stages. The aim of this study is to verify the conventional named entity recognition (NER) methods, namely dictionary-lookup-based string matching and conditional random fields (CRFs).
METHODS: We selected discharge summaries for 200 rheumatic patients from the EHR system of the Seoul National University Hospital and attempted to identify heterogeneous semantic types present in the clinical notes of each patient's history.
RESULTS: CRFs outperform string matching in extracting most semantic types (median F1 = 0.761, minimum = 0.705, maximum = 0.906). String matching is found to be better suited for identifying hospital visit information. The performance of both methods is comparable for identifying medications. The 10-fold cross-validation shows that CRFs had median F1 = 0.811 (minimum = 0.752, maximum = 0.918), and exhibited good performance even when trained with simple features.
CONCLUSION: CRFs are a good candidate for implementing clinical NER in Korean clinical narrative documents. Increasing the training data and incorporating sophisticated feature engineering might improve the accuracy of identifying health information, enabling automated patient history summarization in the future.

Entities: Disease Species

Keywords: Clinical named entity recognition; Conditional random field; Discharge summary; Medical history; String matching

Mesh：

Year: 2018 PMID： 30086416 DOI： 10.1016/j.compbiomed.2018.07.019

Source DB: PubMed Journal: Comput Biol Med ISSN： 0010-4825 Impact factor: 4.589

Keyword Cloud
Cited

3 in total

1. Findings from the 2019 International Medical Informatics Association Yearbook Section on Health Information Management.

Authors: Meryl Bloomrosen; Eta S Berner
Journal: Yearb Med Inform Date: 2019-08-16

2. Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition.

Authors: Wangjin Lee; Jinwook Choi
Journal: BMC Med Inform Decis Mak Date: 2019-07-15 Impact factor: 2.796

3. Multi-task learning for Chinese clinical named entity recognition with external knowledge.

Authors: Ming Cheng; Shufeng Xiong; Fei Li; Pan Liang; Jianbo Gao
Journal: BMC Med Inform Decis Mak Date: 2021-12-31 Impact factor: 2.796

3 in total