Terminologies augmented recurrent neural network model for clinical named entity recognition.

Ivan Lerner¹, Nicolas Paris², Xavier Tannier³.

Abstract

OBJECTIVE: We aimed to enhance the performance of a supervised model for clinical named-entity recognition (NER) using medical terminologies. In order to evaluate our system in French, we built a corpus for 5 types of clinical entities.
METHODS: We used a terminology-based system, built upon UMLS and SNOMED, as a baseline. We then evaluated a biGRU-CRF and a hybrid system that uses the predictions of the terminology-based system as a feature for the biGRU-CRF. In French, we built APcNER, a corpus of 147 documents annotated for 5 entities (Drug names, Signs or symptoms, Diseases or disorders, Diagnostic procedures or lab tests, and Therapeutic procedures). We evaluated each NER system using exact-match and partial-match definitions of the F-measure for NER. APcNER contains 4,837 entities, which took 28 h to annotate. The inter-annotator agreement, as measured by Cohen's kappa, was substantial for non-exact match (κ = 0.61) and moderate for exact match (κ = 0.42). In English, we evaluated the NER systems on the i2b2-2009 Medication Challenge for Drug name recognition, which contained 8,573 entities for 268 documents, and on i2b2-small, a version reduced to match the number of entities in APcNER.
RESULTS: For drug name recognition on both i2b2-2009 and APcNER, the biGRU-CRF performed better than the terminology-based system, with an exact-match F-measure of 91.1% versus 73% and 81.9% versus 75%, respectively. For i2b2-small and APcNER, the hybrid system outperformed the biGRU-CRF, with an exact-match F-measure of 87.8% versus 85.6% and 86.4% versus 81.9%, respectively. On the APcNER corpus, the micro-average F-measure of the hybrid system on the 5 entities was 69.5% in exact match and 84.1% in non-exact match.
CONCLUSION: APcNER is a French corpus for clinical NER of five types of entities, covering a large variety of document types. Extending the supervised model with terminology allowed an easy increase in performance, especially for rare entities, and established near state-of-the-art results on the i2b2-2009 corpus.
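The abstract reports F-measures under both exact and non-exact (partial) matching but does not spell out the matching rules. The following is a minimal sketch of how such scores are commonly computed, assuming that a partial match means two spans with the same label share at least one character; the `Span` type, the `f_measure` function, and the toy offsets are hypothetical and are not taken from the paper.

```python
from typing import List, NamedTuple


class Span(NamedTuple):
    start: int  # inclusive character offset
    end: int    # exclusive character offset
    label: str  # entity type, e.g. "Drug"


def overlaps(a: Span, b: Span) -> bool:
    """Assumed partial-match rule: same label and at least one shared character."""
    return a.label == b.label and a.start < b.end and b.start < a.end


def f_measure(gold: List[Span], pred: List[Span], exact: bool = True) -> float:
    """Span-level F-measure under exact or overlap-based matching."""
    match = (lambda g, p: g == p) if exact else overlaps
    # Predicted spans that match some gold span drive precision.
    matched_pred = sum(any(match(g, p) for g in gold) for p in pred)
    # Gold spans that are matched by some prediction drive recall.
    matched_gold = sum(any(match(g, p) for p in pred) for g in gold)
    precision = matched_pred / len(pred) if pred else 0.0
    recall = matched_gold / len(gold) if gold else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy data (hypothetical): one exact hit, one boundary error, one spurious span.
gold = [Span(0, 11, "Drug"), Span(20, 28, "Disease")]
pred = [Span(0, 11, "Drug"), Span(22, 28, "Disease"), Span(40, 45, "Drug")]

print(round(f_measure(gold, pred, exact=True), 2))   # 0.4
print(round(f_measure(gold, pred, exact=False), 2))  # 0.8
```

The boundary error counts as a miss under exact matching and as a hit under overlap matching, which is the kind of gap the abstract shows between the exact and non-exact scores (69.5% versus 84.1% on APcNER).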

Entities: Disease

Keywords: APcNER; Clinical natural language processing; Information extraction; Machine learning; Named entity recognition

Year: 2019 PMID: 31837473 DOI: 10.1016/j.jbi.2019.103356

Source DB: PubMed Journal: J Biomed Inform ISSN: 1532-0464 Impact factor: 6.317

Cited

4 in total

1. A Novel Algorithm for Detecting Microsatellite Instability Based on Next-Generation Sequencing Data.

Authors: Shijun Li; Bo Wang; Miaomiao Chang; Rui Hou; Geng Tian; Ling Tong
Journal: Front Oncol Date: 2022-06-30 Impact factor: 5.738

2. A Year of Papers Using Biomedical Texts. (Review)

Authors: Cyril Grouin; Natalia Grabar
Journal: Yearb Med Inform Date: 2020-08-21

3. Hybrid Deep Learning for Medication-Related Information Extraction From Clinical Texts in French: MedExt Algorithm Development Study.

Authors: Jordan Jouffroy; Sarah F Feldman; Ivan Lerner; Bastien Rance; Anita Burgun; Antoine Neuraz
Journal: JMIR Med Inform Date: 2021-03-16

4. Extraction of entity relations from Chinese medical literature based on multi-scale CRNN.

Authors: Tingyin Chen; Xuehong Wu; Linyi Li; Jianhua Li; Song Feng
Journal: Ann Transl Med Date: 2022-05
