Literature DB >> 26958273

A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text.

Yonghui Wu1, Jun Xu1, Min Jiang1, Yaoyun Zhang1, Hua Xu1.   

Abstract

Clinical Named Entity Recognition (NER) is a critical task for extracting important patient information from clinical text to support clinical and translational research. This study explored the neural word embeddings derived from a large unlabeled clinical corpus for clinical NER. We systematically compared two neural word embedding algorithms and three different strategies for deriving distributed word representations. Two neural word embeddings were derived from the unlabeled Multiparameter Intelligent Monitoring in Intensive Care (MIMIC) II corpus (403,871 notes). The results from both 2010 i2b2 and 2014 Semantic Evaluation (SemEval) data showed that the binarized word embedding features outperformed other strategies for deriving distributed word representations. The binarized embedding features improved the F1-score of the Conditional Random Fields based clinical NER system by 2.3% on i2b2 data and 2.4% on SemEval data. The combined feature from the binarized embeddings and the Brown clusters improved the F1-score of the clinical NER system by 2.9% on i2b2 data and 2.7% on SemEval data. Our study also showed that the distributed word embedding features derived from a large unlabeled corpus can be better than the widely used Brown clusters. Further analysis found that the neural word embeddings captured a wide range of semantic relations, which could be discretized into distributed word representations to benefit the clinical NER system. The low-cost distributed feature representation can be adapted to any other clinical natural language processing research.

Entities:  

Mesh:

Year:  2015        PMID: 26958273      PMCID: PMC4765694     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  12 in total

1.  The KnowledgeMap project: development of a concept-based medical school curriculum database.

Authors:  Joshua C Denny; Plomarz R Irani; Firas H Wehbe; Jeffrey D Smithers; Anderson Spickard
Journal:  AMIA Annu Symp Proc       Date:  2003

2.  Enhancing clinical concept extraction with distributional semantics.

Authors:  Siddhartha Jonnalagadda; Trevor Cohen; Stephen Wu; Graciela Gonzalez
Journal:  J Biomed Inform       Date:  2011-11-07       Impact factor: 6.317

3.  Extracting medication information from clinical text.

Authors:  Ozlem Uzuner; Imre Solti; Eithon Cadag
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

4.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors:  Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

5.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

6.  Multiparameter Intelligent Monitoring in Intensive Care II: a public-access intensive care unit database.

Authors:  Mohammed Saeed; Mauricio Villarroel; Andrew T Reisner; Gari Clifford; Li-Wei Lehman; George Moody; Thomas Heldt; Tin H Kyaw; Benjamin Moody; Roger G Mark
Journal:  Crit Care Med       Date:  2011-05       Impact factor: 7.598

7.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

Authors:  Özlem Uzuner; Brett R South; Shuying Shen; Scott L DuVall
Journal:  J Am Med Inform Assoc       Date:  2011-06-16       Impact factor: 4.497

8.  Towards a comprehensive medical language processing system: methods and issues.

Authors:  C Friedman
Journal:  Proc AMIA Annu Fall Symp       Date:  1997

9.  Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010.

Authors:  Berry de Bruijn; Colin Cherry; Svetlana Kiritchenko; Joel Martin; Xiaodan Zhu
Journal:  J Am Med Inform Assoc       Date:  2011-05-12       Impact factor: 4.497

10.  Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.

Authors:  Buzhou Tang; Hongxin Cao; Yonghui Wu; Min Jiang; Hua Xu
Journal:  BMC Med Inform Decis Mak       Date:  2013-04-05       Impact factor: 2.796

View more
  25 in total

1.  Enhancing clinical concept extraction with contextual embeddings.

Authors:  Yuqi Si; Jingqi Wang; Hua Xu; Kirk Roberts
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

2.  ML-Net: multi-label classification of biomedical texts with deep neural networks.

Authors:  Jingcheng Du; Qingyu Chen; Yifan Peng; Yang Xiang; Cui Tao; Zhiyong Lu
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

3.  Combine Factual Medical Knowledge and Distributed Word Representation to Improve Clinical Named Entity Recognition.

Authors:  Yonghui Wu; Xi Yang; Jiang Bian; Yi Guo; Hua Xu; William Hogan
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

4.  Adverse drug event and medication extraction in electronic health records via a cascading architecture with different sequence labeling models and word embeddings.

Authors:  Hong-Jie Dai; Chu-Hsien Su; Chi-Shin Wu
Journal:  J Am Med Inform Assoc       Date:  2020-01-01       Impact factor: 4.497

5.  Clinical Named Entity Recognition Using Deep Learning Models.

Authors:  Yonghui Wu; Min Jiang; Jun Xu; Degui Zhi; Hua Xu
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

6.  Parsing clinical text using the state-of-the-art deep learning based parsers: a systematic comparison.

Authors:  Yaoyun Zhang; Firat Tiryaki; Min Jiang; Hua Xu
Journal:  BMC Med Inform Decis Mak       Date:  2019-04-04       Impact factor: 2.796

7.  Joint Learning of Representations of Medical Concepts and Words from EHR Data.

Authors:  Tian Bai; Ashis Kumar Chanda; Brian L Egleston; Slobodan Vucetic
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2017-12-18

Review 8.  Deep learning in clinical natural language processing: a methodical review.

Authors:  Stephen Wu; Kirk Roberts; Surabhi Datta; Jingcheng Du; Zongcheng Ji; Yuqi Si; Sarvesh Soni; Qiong Wang; Qiang Wei; Yang Xiang; Bo Zhao; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2020-03-01       Impact factor: 4.497

9.  Comparing Different Methods for Named Entity Recognition in Portuguese Neurology Text.

Authors:  Fábio Lopes; César Teixeira; Hugo Gonçalo Oliveira
Journal:  J Med Syst       Date:  2020-02-28       Impact factor: 4.460

10.  Automatic Normalization of Anatomical Phrases in Radiology Reports Using Unsupervised Learning.

Authors:  Amir M Tahmasebi; Henghui Zhu; Gabriel Mankovich; Peter Prinsen; Prescott Klassen; Sam Pilato; Rob van Ommering; Pritesh Patel; Martin L Gunn; Paul Chang
Journal:  J Digit Imaging       Date:  2019-02       Impact factor: 4.056

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.