Clinical concept normalization with a hybrid natural language processing system combining multilevel matching and machine learning ranking.

Long Chen, Wenbo Fu, Yu Gu, Zhiyong Sun, Haodan Li, Enyu Li, Li Jiang, Yuan Gao, Yang Huang.

Abstract

OBJECTIVE: Normalizing clinical mentions to concepts in standardized medical terminologies is challenging because of the complexity and variety of terms in narrative medical records. In this article, we introduce a clinical natural language processing (NLP) system that automatically normalizes clinical mentions to concept unique identifiers (CUIs) in the Unified Medical Language System (UMLS). This work was part of the 2019 n2c2 (National NLP Clinical Challenges) Shared-Task and Workshop on Clinical Concept Normalization.
MATERIALS AND METHODS: We developed a hybrid clinical NLP system that combines a generic multilevel matching framework, customizable matching components, and machine learning ranking systems. We explored 2 machine learning ranking systems, one based on an ensemble of similarity features extracted from pretrained encoders and one based on a Siamese attention network, both targeting efficient and fast semantic search and ranking. We also evaluated the performance of a general-purpose clinical NLP system based on the Unstructured Information Management Architecture.
RESULTS: The systems were evaluated as part of the 2019 n2c2 challenge. Our original best system obtained an accuracy of 0.8101 and ranked fifth in the challenge. An improved system with a newly designed machine learning ranking component based on a Siamese attention network raised the accuracy to 0.8209.
CONCLUSIONS: We demonstrate that multilevel matching and machine learning ranking can be combined successfully for clinical concept normalization. Our results indicate the capability and interpretability of the proposed approach as well as its limitations, and they suggest opportunities to achieve better performance by combining it with general clinical NLP systems.
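
The combination of multilevel matching and ranking described in the abstract can be illustrated with a minimal sketch. This is not the authors' system: the toy terminology, the placeholder identifiers (not real UMLS CUIs), the function names, and the 0.4 threshold are all assumptions made for illustration, and a character-trigram Jaccard score stands in for the machine learning ranker.

```python
import re

# Hypothetical toy terminology: surface term -> placeholder concept ID.
# These IDs are illustrative only and are NOT real UMLS CUIs.
TERMINOLOGY = {
    "Myocardial infarction": "CUI-0001",
    "Heart attack": "CUI-0001",
    "Diabetes mellitus": "CUI-0002",
    "Hypertensive disease": "CUI-0003",
    "High blood pressure": "CUI-0003",
}


def normalize(text):
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    return " ".join(re.sub(r"[^a-z0-9 ]+", " ", text.lower()).split())


# Index of normalized surface forms, used by levels 2 and 3.
NORMALIZED = {normalize(term): cui for term, cui in TERMINOLOGY.items()}


def trigrams(text):
    """Set of character trigrams of the padded string."""
    padded = f"  {text} "
    return {padded[i:i + 3] for i in range(len(padded) - 2)}


def similarity(a, b):
    """Trigram Jaccard similarity; a simple stand-in for a learned ranker."""
    ta, tb = trigrams(a), trigrams(b)
    return len(ta & tb) / len(ta | tb)


def normalize_mention(mention, threshold=0.4):
    """Return (concept_id, matching_level) for a clinical mention."""
    # Level 1: exact string match against the terminology.
    if mention in TERMINOLOGY:
        return TERMINOLOGY[mention], "exact"
    # Level 2: match after light text normalization.
    key = normalize(mention)
    if key in NORMALIZED:
        return NORMALIZED[key], "normalized"
    # Level 3: rank every candidate term and accept the best one
    # only if its score clears the (assumed) threshold.
    best = max(NORMALIZED, key=lambda term: similarity(key, term))
    if similarity(key, best) >= threshold:
        return NORMALIZED[best], "ranked"
    return None, "none"


if __name__ == "__main__":
    for m in ["Heart attack", "heart-attack,", "diabetes melitus", "fractured femur"]:
        print(m, "->", normalize_mention(m))
```

Returning the matching level alongside the identifier keeps each decision traceable to the step that produced it, which is one simple way to get the kind of interpretability the conclusions mention. In a real system the level-3 scorer would be replaced by a trained model and the candidate set would be narrowed before ranking.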

Entities: Chemical

Keywords: CUI; UMLS; attention; clinical natural language processing; concept normalization

Year: 2020 PMID: 33029642 PMCID: PMC7647369 DOI: 10.1093/jamia/ocaa155

Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
