Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 RedMed: Extending drug lexicons for social media applications.

Literature DB >> 31627020

RedMed: Extending drug lexicons for social media applications.

Abstract

Social media has been identified as a promising potential source of information for pharmacovigilance. The adoption of social media data has been hindered by the massive and noisy nature of the data. Initial attempts to use social media data have relied on exact text matches to drugs of interest, and therefore suffer from the gap between formal drug lexicons and the informal nature of social media. The Reddit comment archive represents an ideal corpus for bridging this gap. We trained a word embedding model, RedMed, to facilitate the identification and retrieval of health entities from Reddit data. We compare the performance of our model trained on a consumer-generated corpus against publicly available models trained on expert-generated corpora. Our automated classification pipeline achieves an accuracy of 0.88 and a specificity of >0.9 across four different term classes. Of all drug mentions, an average of 79% (±0.5%) were exact matches to a generic or trademark drug name, 14% (±0.5%) were misspellings, 6.4% (±0.3%) were synonyms, and 0.13% (±0.05%) were pill marks. We find that our system captures an additional 20% of mentions; these would have been missed by approaches that rely solely on exact string matches. We provide a lexicon of misspellings and synonyms for 2978 drugs and a word embedding model trained on a health-oriented subset of Reddit.

Entities: Chemical Disease Gene Species

Keywords: Drug Surveillance; Lexicon; Natural Language Processing; Pharmacovigilance; Social Media

Mesh：

Substances：

Year: 2019 PMID： 31627020 PMCID： PMC6874884 DOI： 10.1016/j.jbi.2019.103307

Source DB: PubMed Journal: J Biomed Inform ISSN： 1532-0464 Impact factor: 6.317

24 in total

1. Pattern mining for extraction of mentions of Adverse Drug Reactions from user comments.

Authors: Azadeh Nikfarjam; Graciela H Gonzalez
Journal: AMIA Annu Symp Proc Date: 2011-10-22

2. Measures of semantic similarity and relatedness in the biomedical domain.

Authors: Ted Pedersen; Serguei V S Pakhomov; Siddharth Patwardhan; Christopher G Chute
Journal: J Biomed Inform Date: 2006-06-10 Impact factor: 6.317

3. Semantic Similarity and Relatedness between Clinical Terms: An Experimental Study.

Authors: Serguei Pakhomov; Bridget McInnes; Terrence Adam; Ying Liu; Ted Pedersen; Genevieve B Melton
Journal: AMIA Annu Symp Proc Date: 2010-11-13

4. Exploring brand-name drug mentions on Twitter for pharmacovigilance.

Authors: Pablo Carbonell; Miguel A Mayer; Àlex Bravo
Journal: Stud Health Technol Inform Date: 2015

5. Term identification methods for consumer health vocabulary development.

Authors: Qing T Zeng; Tony Tse; Guy Divita; Alla Keselman; Jon Crowell; Allen C Browne; Sergey Goryachev; Long Ngo
Journal: J Med Internet Res Date: 2007-02-28 Impact factor: 5.428

6. Phonetic spelling filter for keyword selection in drug mention mining from social media.

Authors: Pranoti Pimpalkhute; Apurv Patki; Azadeh Nikfarjam; Graciela Gonzalez
Journal: AMIA Jt Summits Transl Sci Proc Date: 2014-04-07

7. The SIDER database of drugs and side effects.

Authors: Michael Kuhn; Ivica Letunic; Lars Juhl Jensen; Peer Bork
Journal: Nucleic Acids Res Date: 2015-10-19 Impact factor: 16.971

8. Detecting Novel and Emerging Drug Terms Using Natural Language Processing: A Social Media Corpus Study.

Authors: Sean S Simpson; Nikki Adams; Claudia M Brugman; Thomas J Conners
Journal: JMIR Public Health Surveill Date: 2018-01-08

9. A Data-Driven Method of Discovering Misspellings of Medication Names on Twitter.

Authors: Keyuan Jiang; Tingyu Chen; Liyuan Huang; Ricardo A Calix; Gordon R Bernard
Journal: Stud Health Technol Inform Date: 2018

10. Comment on: "Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts".

Authors: Arjun Magge; Abeed Sarker; Azadeh Nikfarjam; Graciela Gonzalez-Hernandez
Journal: J Am Med Inform Assoc Date: 2019-06-01 Impact factor: 4.497

4 in total

1. Mining Social Media Data for Biomedical Signals and Health-Related Behavior.

Authors: Rion Brattig Correia; Ian B Wood; Johan Bollen; Luis M Rocha
Journal: Annu Rev Biomed Data Sci Date: 2020-05-04

Review 2. A Year of Papers Using Biomedical Texts.

Authors: Cyril Grouin; Natalia Grabar
Journal: Yearb Med Inform Date: 2020-08-21

3. Using weak supervision to generate training datasets from social media data: a proof of concept to identify drug mentions.

Authors: Ramya Tekumalla; Juan M Banda
Journal: Neural Comput Appl Date: 2021-10-29 Impact factor: 5.102

4. Language-agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records.

Authors: Benjamin Skov Kaas-Hansen; Davide Placido; Cristina Leal Rodríguez; Hans-Christian Thorsen-Meyer; Simona Gentile; Anna Pors Nielsen; Søren Brunak; Gesche Jürgens; Stig Ejdrup Andersen
Journal: Basic Clin Pharmacol Toxicol Date: 2022-07-26 Impact factor: 3.688

4 in total