Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A simple neural vector space model for medical concept normalization using concept embeddings.

Literature DB >> 35472514

A simple neural vector space model for medical concept normalization using concept embeddings.

Abstract

OBJECTIVE: Medical concept normalization (MCN), the task of linking textual mentions to concepts in an ontology, provides a solution to unify different ways of referring to the same concept. In this paper, we present a simple neural MCN model that takes mentions as input and directly predicts concepts.
MATERIALS AND METHODS: We evaluate our proposed model on clinical datasets from ShARe/CLEF eHealth 2013 shared task and 2019 n2c2/OHNLP shared task track 3. Our neural MCN model consists of an encoder, and a normalized temperature-scaled softmax (NT-softmax) layer that maximizes the cosine similarity score of matching the mention to the correct concept. We adopt SAPBERT as the encoder and initialize the weights in the NT-softmax layer with pre-computed concept embeddings from SAPBERT.
RESULTS: Our proposed neural model achieves competitive performance on ShARe/CLEF 2013 and establishes a new state-of-the-art on 2019-n2c2-MCN. Yet this model is simpler than most prior work: it requires no complex pipelines, no hand-crafted rules, and no preprocessing, making it simpler to apply in new settings. DISCUSSION: Analyses of our proposed model show that the NT-softmax is better than the conventional softmax on the MCN task, and both the CUI-less threshold parameter and the initialization of the weight vectors in the NT-softmax layer contribute to the improvements.
CONCLUSION: We propose a simple neural model for clinical MCN, an one-step approach with simpler inference and more effective performance than prior work. Our analyses demonstrate future work on MCN may require more effort on unseen concepts.

Entities: Chemical

Keywords: Deep Learning; Medical Concept Normalization; Natural Language Processing; Normalized Temperature-scaled Softmax; Vector Space Model

Mesh：
Space Simulation

Year: 2022 PMID： 35472514 PMCID： PMC9351985 DOI： 10.1016/j.jbi.2022.104080

Source DB: PubMed Journal: J Biomed Inform ISSN： 1532-0464 Impact factor: 8.000

Keyword Cloud
References

24 in total

1. tmChem: a high performance approach for chemical named entity recognition and normalization.

Authors: Robert Leaman; Chih-Hsuan Wei; Zhiyong Lu
Journal: J Cheminform Date: 2015-01-19 Impact factor: 5.514

2. Cadec: A corpus of adverse drug event annotations.

Authors: Sarvnaz Karimi; Alejandro Metke-Jimenez; Madonna Kemp; Chen Wang
Journal: J Biomed Inform Date: 2015-03-27 Impact factor: 6.317

3. Medical concept normalization in social media posts with recurrent neural networks.

Authors: Elena Tutubalina; Zulfat Miftahutdinov; Sergey Nikolenko; Valentin Malykh
Journal: J Biomed Inform Date: 2018-06-12 Impact factor: 6.317

4. The 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task on clinical concept normalization for clinical records.

Authors: Sam Henry; Yanshan Wang; Feichen Shen; Ozlem Uzuner
Journal: J Am Med Inform Assoc Date: 2020-10-01 Impact factor: 4.497

5. A method for controlling complex confounding effects in the detection of adverse drug reactions using electronic health records.

Authors: Ying Li; Hojjat Salmasian; Santiago Vilar; Herbert Chase; Carol Friedman; Ying Wei
Journal: J Am Med Inform Assoc Date: 2013-08-01 Impact factor: 4.497

6. Automated identification of wound information in clinical notes of patients with heart diseases: Developing and validating a natural language processing application.

Authors: Maxim Topaz; Kenneth Lai; Dawn Dowding; Victor J Lei; Anna Zisberg; Kathryn H Bowles; Li Zhou
Journal: Int J Nurs Stud Date: 2016-09-19 Impact factor: 5.837

7. BioCreative V CDR task corpus: a resource for chemical disease relation extraction.

Authors: Jiao Li; Yueping Sun; Robin J Johnson; Daniela Sciaky; Chih-Hsuan Wei; Robert Leaman; Allan Peter Davis; Carolyn J Mattingly; Thomas C Wiegers; Zhiyong Lu
Journal: Database (Oxford) Date: 2016-05-09 Impact factor: 3.451

8. SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research.

Authors: Honghan Wu; Giulia Toti; Katherine I Morley; Zina M Ibrahim; Amos Folarin; Richard Jackson; Ismail Kartoglu; Asha Agrawal; Clive Stringer; Darren Gale; Genevieve Gorrell; Angus Roberts; Matthew Broadbent; Robert Stewart; Richard J B Dobson
Journal: J Am Med Inform Assoc Date: 2018-05-01 Impact factor: 4.497

9. Overview of BioCreative II gene normalization.

Authors: Alexander A Morgan; Zhiyong Lu; Xinglong Wang; Aaron M Cohen; Juliane Fluck; Patrick Ruch; Anna Divoli; Katrin Fundel; Robert Leaman; Jörg Hakenberg; Chengjie Sun; Heng-hui Liu; Rafael Torres; Michael Krauthammer; William W Lau; Hongfang Liu; Chun-Nan Hsu; Martijn Schuemie; K Bretonnel Cohen; Lynette Hirschman
Journal: Genome Biol Date: 2008-09-01 Impact factor: 13.583

10. DNorm: disease name normalization with pairwise learning to rank.

Authors: Robert Leaman; Rezarta Islamaj Dogan; Zhiyong Lu
Journal: Bioinformatics Date: 2013-08-21 Impact factor: 6.937