Literature DB >> 30815153

Combine Factual Medical Knowledge and Distributed Word Representation to Improve Clinical Named Entity Recognition.

Yonghui Wu1, Xi Yang1, Jiang Bian1, Yi Guo1, Hua Xu2, William Hogan1.   

Abstract

There has been an increasing interest in developing deep learning methods to recognize clinical concepts from narrative clinical text. Recently, several studies have reported that Recurrent Neural Networks (RNNs) outperformed traditional machine learning methods such as Conditional Random Fields (CRFs). Deep learning-based Named Entity Recognition (NER) systems often use statistical language models to learn word embeddings from unlabeled corpora. However, current word embedding methods have limitations to learn decent representations for low-frequency words. Medicine is a knowledge-extensive domain; existing medical knowledge has the potential to improve feature representations for less frequent yet important words. However, it is still not clear how existing medical knowledge can help deep learning models in clinical NER tasks. In this study, we integrated medical knowledge from the Unified Medical Language System with word embeddings trained from an unlabeled clinical corpus in RNNs for detection of problems, treatments and lab tests. We examined three different ways to generate medical knowledge features, including a dictionary lookup program, the KnowledgeMap system, and the MedLEE system. We also compared representing medical knowledge as one-hot vectors versus representing medical knowledge as embedding layers. The evaluation results showed that the RNN with medical knowledge as embedding layers achieved new state-of-the-art performance (a strict F1 score of 86.21% and a relaxed F1 score of 92.80%) on the 2010 i2b2 corpus, outperforming an RNN with only word embeddings and RNNs with medical knowledge as one-hot vectors. This study demonstrated an efficient way of integrating medical knowledge with distributed word representations for clinical NER.

Entities:  

Mesh:

Year:  2018        PMID: 30815153      PMCID: PMC6371322     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  17 in total

1.  The KnowledgeMap project: development of a concept-based medical school curriculum database.

Authors:  Joshua C Denny; Plomarz R Irani; Firas H Wehbe; Jeffrey D Smithers; Anderson Spickard
Journal:  AMIA Annu Symp Proc       Date:  2003

2.  Multiparameter Intelligent Monitoring in Intensive Care II: a public-access intensive care unit database.

Authors:  Mohammed Saeed; Mauricio Villarroel; Andrew T Reisner; Gari Clifford; Li-Wei Lehman; George Moody; Thomas Heldt; Tin H Kyaw; Benjamin Moody; Roger G Mark
Journal:  Crit Care Med       Date:  2011-05       Impact factor: 7.598

3.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

Authors:  Özlem Uzuner; Brett R South; Shuying Shen; Scott L DuVall
Journal:  J Am Med Inform Assoc       Date:  2011-06-16       Impact factor: 4.497

4.  A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.

Authors:  Min Jiang; Yukun Chen; Mei Liu; S Trent Rosenbloom; Subramani Mani; Joshua C Denny; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2011-04-20       Impact factor: 4.497

5.  Towards a comprehensive medical language processing system: methods and issues.

Authors:  C Friedman
Journal:  Proc AMIA Annu Fall Symp       Date:  1997

6.  A hybrid system for temporal information extraction from clinical text.

Authors:  Buzhou Tang; Yonghui Wu; Min Jiang; Yukun Chen; Joshua C Denny; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2013-04-09       Impact factor: 4.497

Review 7.  Evaluating temporal relations in clinical text: 2012 i2b2 Challenge.

Authors:  Weiyi Sun; Anna Rumshisky; Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2013-04-05       Impact factor: 4.497

8.  Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010.

Authors:  Berry de Bruijn; Colin Cherry; Svetlana Kiritchenko; Joel Martin; Xiaodan Zhu
Journal:  J Am Med Inform Assoc       Date:  2011-05-12       Impact factor: 4.497

9.  Evaluating the state of the art in disorder recognition and normalization of the clinical narrative.

Authors:  Sameer Pradhan; Noémie Elhadad; Brett R South; David Martinez; Lee Christensen; Amy Vogel; Hanna Suominen; Wendy W Chapman; Guergana Savova
Journal:  J Am Med Inform Assoc       Date:  2014-08-21       Impact factor: 4.497

10.  Entity recognition from clinical texts via recurrent neural network.

Authors:  Zengjian Liu; Ming Yang; Xiaolong Wang; Qingcai Chen; Buzhou Tang; Zhe Wang; Hua Xu
Journal:  BMC Med Inform Decis Mak       Date:  2017-07-05       Impact factor: 2.796

View more
  9 in total

1.  A Study of Deep Learning Methods for De-identification of Clinical Notes at Cross Institute Settings.

Authors:  Xi Yang; Tianchen Lyu; Chih-Yin Lee; Jiang Bian; William R Hogan; Yonghui Wu
Journal:  IEEE Int Conf Healthc Inform       Date:  2019-11-21

Review 2.  Deep learning in clinical natural language processing: a methodical review.

Authors:  Stephen Wu; Kirk Roberts; Surabhi Datta; Jingcheng Du; Zongcheng Ji; Yuqi Si; Sarvesh Soni; Qiong Wang; Qiang Wei; Yang Xiang; Bo Zhao; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2020-03-01       Impact factor: 4.497

3.  TAX-Corpus: Taxonomy based Annotations for Colonoscopy Evaluation.

Authors:  Shorabuddin Syed; Adam Jackson Angel; Hafsa Bareen Syeda; Carole Franc Jennings; Joseph VanScoy; Mahanazuddin Syed; Melody Greer; Sudeepa Bhattacharyya; Shaymaa Al-Shukri; Meredith Zozus; Fred Prior; Benjamin Tharian
Journal:  Biomed Eng Syst Technol Int Jt Conf BIOSTEC Revis Sel Pap       Date:  2022-02

4.  The h-ANN Model: Comprehensive Colonoscopy Concept Compilation Using Combined Contextual Embeddings.

Authors:  Shorabuddin Syed; Adam Jackson Angel; Hafsa Bareen Syeda; Carole France Jennings; Joseph VanScoy; Mahanazuddin Syed; Melody Greer; Sudeepa Bhattacharyya; Meredith Zozus; Benjamin Tharian; Fred Prior
Journal:  Biomed Eng Syst Technol Int Jt Conf BIOSTEC Revis Sel Pap       Date:  2022-02

5.  Identifying relations of medications with adverse drug events using recurrent convolutional neural networks and gradient boosting.

Authors:  Xi Yang; Jiang Bian; Ruogu Fang; Ragnhildur I Bjarnadottir; William R Hogan; Yonghui Wu
Journal:  J Am Med Inform Assoc       Date:  2020-01-01       Impact factor: 4.497

Review 6.  Applications of artificial intelligence in drug development using real-world data.

Authors:  Zhaoyi Chen; Xiong Liu; William Hogan; Elizabeth Shenkman; Jiang Bian
Journal:  Drug Discov Today       Date:  2020-12-24       Impact factor: 7.851

7.  Clinical concept extraction using transformers.

Authors:  Xi Yang; Jiang Bian; William R Hogan; Yonghui Wu
Journal:  J Am Med Inform Assoc       Date:  2020-12-09       Impact factor: 4.497

8.  A study of deep learning methods for de-identification of clinical notes in cross-institute settings.

Authors:  Xi Yang; Tianchen Lyu; Qian Li; Chih-Yin Lee; Jiang Bian; William R Hogan; Yonghui Wu
Journal:  BMC Med Inform Decis Mak       Date:  2019-12-05       Impact factor: 2.796

9.  Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining.

Authors:  Lejun Gong; Zhifei Zhang; Shiqi Chen
Journal:  J Healthc Eng       Date:  2020-11-24       Impact factor: 2.682

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.