Literature DB >> 31569654

Ontology-Based Healthcare Named Entity Recognition from Twitter Messages Using a Recurrent Neural Network Approach.

Erdenebileg Batbaatar1, Keun Ho Ryu2,3.   

Abstract

Named Entity Recognition (NER) in the healthcare domain involves identifying and categorizing disease, drugs, and symptoms for biosurveillance, extracting their related properties and activities, and identifying adverse drug events appearing in texts. These tasks are important challenges in healthcare. Analyzing user messages in social media networks such as Twitter can provide opportunities to detect and manage public health events. Twitter provides a broad range of short messages that contain interesting information for information extraction. In this paper, we present a Health-Related Named Entity Recognition (HNER) task using healthcare-domain ontology that can recognize health-related entities from large numbers of user messages from Twitter. For this task, we employ a deep learning architecture which is based on a recurrent neural network (RNN) with little feature engineering. To achieve our goal, we collected a large number of Twitter messages containing health-related information, and detected biomedical entities from the Unified Medical Language System (UMLS). A bidirectional long short-term memory (BiLSTM) model learned rich context information, and a convolutional neural network (CNN) was used to produce character-level features. The conditional random field (CRF) model predicted a sequence of labels that corresponded to a sequence of inputs, and the Viterbi algorithm was used to detect health-related entities from Twitter messages. We provide comprehensive results giving valuable insights for identifying medical entities in Twitter for various applications. The BiLSTM-CRF model achieved a precision of 93.99%, recall of 73.31%, and F1-score of 81.77% for disease or syndrome HNER; a precision of 90.83%, recall of 81.98%, and F1-score of 87.52% for sign or symptom HNER; and a precision of 94.85%, recall of 73.47%, and F1-score of 84.51% for pharmacologic substance named entities. The ontology-based manual annotation results show that it is possible to perform high-quality annotation despite the complexity of medical terminology and the lack of context in tweets.

Entities:  

Keywords:  Twitter; conditional random field; deep learning; healthcare; named entity recognition; ontology; recurrent neural network; unified medical language system; word embedding

Mesh:

Year:  2019        PMID: 31569654      PMCID: PMC6801946          DOI: 10.3390/ijerph16193628

Source DB:  PubMed          Journal:  Int J Environ Res Public Health        ISSN: 1660-4601            Impact factor:   3.390


  23 in total

1.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  Identifying Diseases, Drugs, and Symptoms in Twitter.

Authors:  Antonio Jimeno-Yepes; Andrew MacKinlay; Bo Han; Qiang Chen
Journal:  Stud Health Technol Inform       Date:  2015

3.  BANNER: an executable survey of advances in biomedical named entity recognition.

Authors:  Robert Leaman; Graciela Gonzalez
Journal:  Pac Symp Biocomput       Date:  2008

4.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

5.  Twitter as a tool for communication and knowledge exchange in academic medicine: A guide for skeptics and novices.

Authors:  Esther K Choo; Megan L Ranney; Teresa M Chan; N Seth Trueger; Amy E Walsh; Ken Tegtmeyer; Shannon O McNamara; Ricky Y Choi; Christopher L Carroll
Journal:  Med Teach       Date:  2014-12-19       Impact factor: 3.650

6.  New technologies for reporting real-time emergent infections.

Authors:  Rumi Chunara; Clark C Freifeld; John S Brownstein
Journal:  Parasitology       Date:  2012-08-16       Impact factor: 3.234

7.  Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks.

Authors:  Qikang Wei; Tao Chen; Ruifeng Xu; Yulan He; Lin Gui
Journal:  Database (Oxford)       Date:  2016-10-24       Impact factor: 3.451

8.  A neural network multi-task learning approach to biomedical named entity recognition.

Authors:  Gamal Crichton; Sampo Pyysalo; Billy Chiu; Anna Korhonen
Journal:  BMC Bioinformatics       Date:  2017-08-15       Impact factor: 3.169

9.  Disease named entity recognition from biomedical literature using a novel convolutional neural network.

Authors:  Zhehuan Zhao; Zhihao Yang; Ling Luo; Lei Wang; Yin Zhang; Hongfei Lin; Jian Wang
Journal:  BMC Med Genomics       Date:  2017-12-28       Impact factor: 3.063

10.  BioWordVec, improving biomedical word embeddings with subword information and MeSH.

Authors:  Yijia Zhang; Qingyu Chen; Zhihao Yang; Hongfei Lin; Zhiyong Lu
Journal:  Sci Data       Date:  2019-05-10       Impact factor: 6.444

View more
  4 in total

1.  Discovering Thematically Coherent Biomedical Documents Using Contextualized Bidirectional Encoder Representations from Transformers-Based Clustering.

Authors:  Khishigsuren Davagdorj; Ling Wang; Meijing Li; Van-Huy Pham; Keun Ho Ryu; Nipon Theera-Umpon
Journal:  Int J Environ Res Public Health       Date:  2022-05-12       Impact factor: 4.614

2.  SEED: Symptom Extraction from English Social Media Posts using Deep Learning and Transfer Learning.

Authors:  Arjun Magge; Davy Weissenbacher; Karen Oâ Connor; Matthew Scotch; Graciela Gonzalez-Hernandez
Journal:  medRxiv       Date:  2022-03-21

3.  Comparing general and specialized word embeddings for biomedical named entity recognition.

Authors:  Rigo E Ramos-Vargas; Israel Román-Godínez; Sulema Torres-Ramos
Journal:  PeerJ Comput Sci       Date:  2021-02-18

4.  Topics, Sentiments, and Emotions Triggered by COVID-19-Related Tweets from IRAN and Turkey Official News Agencies.

Authors:  Waseem Ahmad; Bang Wang; Han Xu; Minghua Xu; Zeng Zeng
Journal:  SN Comput Sci       Date:  2021-07-29
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.