Literature DB >> 35472514

A simple neural vector space model for medical concept normalization using concept embeddings.

Dongfang Xu1, Timothy Miller2.   

Abstract

OBJECTIVE: Medical concept normalization (MCN), the task of linking textual mentions to concepts in an ontology, provides a solution to unify different ways of referring to the same concept. In this paper, we present a simple neural MCN model that takes mentions as input and directly predicts concepts.
MATERIALS AND METHODS: We evaluate our proposed model on clinical datasets from ShARe/CLEF eHealth 2013 shared task and 2019 n2c2/OHNLP shared task track 3. Our neural MCN model consists of an encoder, and a normalized temperature-scaled softmax (NT-softmax) layer that maximizes the cosine similarity score of matching the mention to the correct concept. We adopt SAPBERT as the encoder and initialize the weights in the NT-softmax layer with pre-computed concept embeddings from SAPBERT.
RESULTS: Our proposed neural model achieves competitive performance on ShARe/CLEF 2013 and establishes a new state-of-the-art on 2019-n2c2-MCN. Yet this model is simpler than most prior work: it requires no complex pipelines, no hand-crafted rules, and no preprocessing, making it simpler to apply in new settings. DISCUSSION: Analyses of our proposed model show that the NT-softmax is better than the conventional softmax on the MCN task, and both the CUI-less threshold parameter and the initialization of the weight vectors in the NT-softmax layer contribute to the improvements.
CONCLUSION: We propose a simple neural model for clinical MCN, an one-step approach with simpler inference and more effective performance than prior work. Our analyses demonstrate future work on MCN may require more effort on unseen concepts.
Copyright © 2022 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Deep Learning; Medical Concept Normalization; Natural Language Processing; Normalized Temperature-scaled Softmax; Vector Space Model

Mesh:

Year:  2022        PMID: 35472514      PMCID: PMC9351985          DOI: 10.1016/j.jbi.2022.104080

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   8.000


  24 in total

1.  tmChem: a high performance approach for chemical named entity recognition and normalization.

Authors:  Robert Leaman; Chih-Hsuan Wei; Zhiyong Lu
Journal:  J Cheminform       Date:  2015-01-19       Impact factor: 5.514

2.  Cadec: A corpus of adverse drug event annotations.

Authors:  Sarvnaz Karimi; Alejandro Metke-Jimenez; Madonna Kemp; Chen Wang
Journal:  J Biomed Inform       Date:  2015-03-27       Impact factor: 6.317

3.  Medical concept normalization in social media posts with recurrent neural networks.

Authors:  Elena Tutubalina; Zulfat Miftahutdinov; Sergey Nikolenko; Valentin Malykh
Journal:  J Biomed Inform       Date:  2018-06-12       Impact factor: 6.317

4.  The 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task on clinical concept normalization for clinical records.

Authors:  Sam Henry; Yanshan Wang; Feichen Shen; Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2020-10-01       Impact factor: 4.497

5.  A method for controlling complex confounding effects in the detection of adverse drug reactions using electronic health records.

Authors:  Ying Li; Hojjat Salmasian; Santiago Vilar; Herbert Chase; Carol Friedman; Ying Wei
Journal:  J Am Med Inform Assoc       Date:  2013-08-01       Impact factor: 4.497

6.  Automated identification of wound information in clinical notes of patients with heart diseases: Developing and validating a natural language processing application.

Authors:  Maxim Topaz; Kenneth Lai; Dawn Dowding; Victor J Lei; Anna Zisberg; Kathryn H Bowles; Li Zhou
Journal:  Int J Nurs Stud       Date:  2016-09-19       Impact factor: 5.837

7.  BioCreative V CDR task corpus: a resource for chemical disease relation extraction.

Authors:  Jiao Li; Yueping Sun; Robin J Johnson; Daniela Sciaky; Chih-Hsuan Wei; Robert Leaman; Allan Peter Davis; Carolyn J Mattingly; Thomas C Wiegers; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2016-05-09       Impact factor: 3.451

8.  SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research.

Authors:  Honghan Wu; Giulia Toti; Katherine I Morley; Zina M Ibrahim; Amos Folarin; Richard Jackson; Ismail Kartoglu; Asha Agrawal; Clive Stringer; Darren Gale; Genevieve Gorrell; Angus Roberts; Matthew Broadbent; Robert Stewart; Richard J B Dobson
Journal:  J Am Med Inform Assoc       Date:  2018-05-01       Impact factor: 4.497

9.  Overview of BioCreative II gene normalization.

Authors:  Alexander A Morgan; Zhiyong Lu; Xinglong Wang; Aaron M Cohen; Juliane Fluck; Patrick Ruch; Anna Divoli; Katrin Fundel; Robert Leaman; Jörg Hakenberg; Chengjie Sun; Heng-hui Liu; Rafael Torres; Michael Krauthammer; William W Lau; Hongfang Liu; Chun-Nan Hsu; Martijn Schuemie; K Bretonnel Cohen; Lynette Hirschman
Journal:  Genome Biol       Date:  2008-09-01       Impact factor: 13.583

10.  DNorm: disease name normalization with pairwise learning to rank.

Authors:  Robert Leaman; Rezarta Islamaj Dogan; Zhiyong Lu
Journal:  Bioinformatics       Date:  2013-08-21       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.