Literature DB >> 31627020

RedMed: Extending drug lexicons for social media applications.

Adam Lavertu1, Russ B Altman2.   

Abstract

Social media has been identified as a promising potential source of information for pharmacovigilance. The adoption of social media data has been hindered by the massive and noisy nature of the data. Initial attempts to use social media data have relied on exact text matches to drugs of interest, and therefore suffer from the gap between formal drug lexicons and the informal nature of social media. The Reddit comment archive represents an ideal corpus for bridging this gap. We trained a word embedding model, RedMed, to facilitate the identification and retrieval of health entities from Reddit data. We compare the performance of our model trained on a consumer-generated corpus against publicly available models trained on expert-generated corpora. Our automated classification pipeline achieves an accuracy of 0.88 and a specificity of >0.9 across four different term classes. Of all drug mentions, an average of 79% (±0.5%) were exact matches to a generic or trademark drug name, 14% (±0.5%) were misspellings, 6.4% (±0.3%) were synonyms, and 0.13% (±0.05%) were pill marks. We find that our system captures an additional 20% of mentions; these would have been missed by approaches that rely solely on exact string matches. We provide a lexicon of misspellings and synonyms for 2978 drugs and a word embedding model trained on a health-oriented subset of Reddit.
Copyright © 2019 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Drug Surveillance; Lexicon; Natural Language Processing; Pharmacovigilance; Social Media

Mesh:

Substances:

Year:  2019        PMID: 31627020      PMCID: PMC6874884          DOI: 10.1016/j.jbi.2019.103307

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  24 in total

1.  Pattern mining for extraction of mentions of Adverse Drug Reactions from user comments.

Authors:  Azadeh Nikfarjam; Graciela H Gonzalez
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  Measures of semantic similarity and relatedness in the biomedical domain.

Authors:  Ted Pedersen; Serguei V S Pakhomov; Siddharth Patwardhan; Christopher G Chute
Journal:  J Biomed Inform       Date:  2006-06-10       Impact factor: 6.317

3.  Semantic Similarity and Relatedness between Clinical Terms: An Experimental Study.

Authors:  Serguei Pakhomov; Bridget McInnes; Terrence Adam; Ying Liu; Ted Pedersen; Genevieve B Melton
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

4.  Exploring brand-name drug mentions on Twitter for pharmacovigilance.

Authors:  Pablo Carbonell; Miguel A Mayer; Àlex Bravo
Journal:  Stud Health Technol Inform       Date:  2015

5.  Term identification methods for consumer health vocabulary development.

Authors:  Qing T Zeng; Tony Tse; Guy Divita; Alla Keselman; Jon Crowell; Allen C Browne; Sergey Goryachev; Long Ngo
Journal:  J Med Internet Res       Date:  2007-02-28       Impact factor: 5.428

6.  Phonetic spelling filter for keyword selection in drug mention mining from social media.

Authors:  Pranoti Pimpalkhute; Apurv Patki; Azadeh Nikfarjam; Graciela Gonzalez
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2014-04-07

7.  The SIDER database of drugs and side effects.

Authors:  Michael Kuhn; Ivica Letunic; Lars Juhl Jensen; Peer Bork
Journal:  Nucleic Acids Res       Date:  2015-10-19       Impact factor: 16.971

8.  Detecting Novel and Emerging Drug Terms Using Natural Language Processing: A Social Media Corpus Study.

Authors:  Sean S Simpson; Nikki Adams; Claudia M Brugman; Thomas J Conners
Journal:  JMIR Public Health Surveill       Date:  2018-01-08

9.  A Data-Driven Method of Discovering Misspellings of Medication Names on Twitter.

Authors:  Keyuan Jiang; Tingyu Chen; Liyuan Huang; Ricardo A Calix; Gordon R Bernard
Journal:  Stud Health Technol Inform       Date:  2018

10.  Comment on: "Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts".

Authors:  Arjun Magge; Abeed Sarker; Azadeh Nikfarjam; Graciela Gonzalez-Hernandez
Journal:  J Am Med Inform Assoc       Date:  2019-06-01       Impact factor: 4.497

View more
  4 in total

1.  Mining Social Media Data for Biomedical Signals and Health-Related Behavior.

Authors:  Rion Brattig Correia; Ian B Wood; Johan Bollen; Luis M Rocha
Journal:  Annu Rev Biomed Data Sci       Date:  2020-05-04

Review 2.  A Year of Papers Using Biomedical Texts.

Authors:  Cyril Grouin; Natalia Grabar
Journal:  Yearb Med Inform       Date:  2020-08-21

3.  Using weak supervision to generate training datasets from social media data: a proof of concept to identify drug mentions.

Authors:  Ramya Tekumalla; Juan M Banda
Journal:  Neural Comput Appl       Date:  2021-10-29       Impact factor: 5.102

4.  Language-agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records.

Authors:  Benjamin Skov Kaas-Hansen; Davide Placido; Cristina Leal Rodríguez; Hans-Christian Thorsen-Meyer; Simona Gentile; Anna Pors Nielsen; Søren Brunak; Gesche Jürgens; Stig Ejdrup Andersen
Journal:  Basic Clin Pharmacol Toxicol       Date:  2022-07-26       Impact factor: 3.688

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.