Literature DB >> 33029642

Clinical concept normalization with a hybrid natural language processing system combining multilevel matching and machine learning ranking.

Long Chen1, Wenbo Fu1, Yu Gu1, Zhiyong Sun1, Haodan Li1, Enyu Li1, Li Jiang1, Yuan Gao1, Yang Huang1.   

Abstract

OBJECTIVE: Normalizing clinical mentions to concepts in standardized medical terminologies, in general, is challenging due to the complexity and variety of the terms in narrative medical records. In this article, we introduce our work on a clinical natural language processing (NLP) system to automatically normalize clinical mentions to concept unique identifier in the Unified Medical Language System. This work was part of the 2019 n2c2 (National NLP Clinical Challenges) Shared-Task and Workshop on Clinical Concept Normalization.
MATERIALS AND METHODS: We developed a hybrid clinical NLP system that combines a generic multilevel matching framework, customizable matching components, and machine learning ranking systems. We explored 2 machine leaning ranking systems based on either ensemble of various similarity features extracted from pretrained encoders or a Siamese attention network, targeting at efficient and fast semantic searching/ranking. Besides, we also evaluated the performance of a general-purpose clinical NLP system based on Unstructured Information Management Architecture.
RESULTS: The systems were evaluated as part of the 2019 n2c2 challenge, and our original best system in the challenge obtained an accuracy of 0.8101, ranked fifth in the challenge. The improved system with newly designed machine learning ranking based on Siamese attention network improved the accuracy to 0.8209.
CONCLUSIONS: We demonstrate the successful practice of combining multilevel matching and machine learning ranking for clinical concept normalization. Our results indicate the capability and interpretability of our proposed approach, as well as the limitation, suggesting the opportunities of achieving better performance by combining general clinical NLP systems.
© The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  CUI; UMLS; attention; clinical natural language processing; concept normalization

Mesh:

Year:  2020        PMID: 33029642      PMCID: PMC7647369          DOI: 10.1093/jamia/ocaa155

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  19 in total

1.  Automated encoding of clinical documents based on natural language processing.

Authors:  Carol Friedman; Lyudmila Shagina; Yves Lussier; George Hripcsak
Journal:  J Am Med Inform Assoc       Date:  2004-06-07       Impact factor: 4.497

2.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors:  Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

3.  Normalizing clinical terms using learned edit distance patterns.

Authors:  Rohit J Kate
Journal:  J Am Med Inform Assoc       Date:  2015-07-31       Impact factor: 4.497

4.  The truth about computer-assisted coding.

Authors:  Mark Crawford
Journal:  J AHIMA       Date:  2013-07

5.  MCN: A comprehensive corpus for medical concept normalization.

Authors:  Yen-Fu Luo; Weiyi Sun; Anna Rumshisky
Journal:  J Biomed Inform       Date:  2019-02-22       Impact factor: 6.317

6.  A Hybrid Normalization Method for Medical Concepts in Clinical Narrative using Semantic Matching.

Authors:  Yen-Fu Luo; Weiyi Sun; Anna Rumshisky
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2019-05-06

Review 7.  What can natural language processing do for clinical decision support?

Authors:  Dina Demner-Fushman; Wendy W Chapman; Clement J McDonald
Journal:  J Biomed Inform       Date:  2009-08-13       Impact factor: 6.317

8.  CLAMP - a toolkit for efficiently building customized clinical natural language processing pipelines.

Authors:  Ergin Soysal; Jingqi Wang; Min Jiang; Yonghui Wu; Serguei Pakhomov; Hongfang Liu; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2018-03-01       Impact factor: 4.497

9.  DNorm: disease name normalization with pairwise learning to rank.

Authors:  Robert Leaman; Rezarta Islamaj Dogan; Zhiyong Lu
Journal:  Bioinformatics       Date:  2013-08-21       Impact factor: 6.937

10.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

Authors:  Jinhyuk Lee; Wonjin Yoon; Sungdong Kim; Donghyeon Kim; Sunkyu Kim; Chan Ho So; Jaewoo Kang
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

View more
  3 in total

1.  The UMLS knowledge sources at 30: indispensable to current research and applications in biomedical informatics.

Authors:  Betsy L Humphreys; Guilherme Del Fiol; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2020-10-01       Impact factor: 4.497

2.  Extracting Radiological Findings With Normalized Anatomical Information Using a Span-Based BERT Relation Extraction Model.

Authors:  Kevin Lybarger; Aashka Damani; Martin Gunn; O Zlem Uzuner; Meliha Yetisgen
Journal:  AMIA Annu Symp Proc       Date:  2022-05-23

3.  Chemical identification and indexing in PubMed full-text articles using deep learning and heuristics.

Authors:  Tiago Almeida; Rui Antunes; João F Silva; João R Almeida; Sérgio Matos
Journal:  Database (Oxford)       Date:  2022-07-01       Impact factor: 4.462

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.