Literature DB >> 25451103

Portable automatic text classification for adverse drug reaction detection via multi-corpus training.

Abeed Sarker1, Graciela Gonzalez2.   

Abstract

OBJECTIVE: Automatic detection of adverse drug reaction (ADR) mentions from text has recently received significant interest in pharmacovigilance research. Current research focuses on various sources of text-based information, including social media-where enormous amounts of user posted data is available, which have the potential for use in pharmacovigilance if collected and filtered accurately. The aims of this study are: (i) to explore natural language processing (NLP) approaches for generating useful features from text, and utilizing them in optimized machine learning algorithms for automatic classification of ADR assertive text segments; (ii) to present two data sets that we prepared for the task of ADR detection from user posted internet data; and (iii) to investigate if combining training data from distinct corpora can improve automatic classification accuracies.
METHODS: One of our three data sets contains annotated sentences from clinical reports, and the two other data sets, built in-house, consist of annotated posts from social media. Our text classification approach relies on generating a large set of features, representing semantic properties (e.g., sentiment, polarity, and topic), from short text nuggets. Importantly, using our expanded feature sets, we combine training data from different corpora in attempts to boost classification accuracies.
RESULTS: Our feature-rich classification approach performs significantly better than previously published approaches with ADR class F-scores of 0.812 (previously reported best: 0.770), 0.538 and 0.678 for the three data sets. Combining training data from multiple compatible corpora further improves the ADR F-scores for the in-house data sets to 0.597 (improvement of 5.9 units) and 0.704 (improvement of 2.6 units) respectively.
CONCLUSIONS: Our research results indicate that using advanced NLP techniques for generating information rich features from text can significantly improve classification accuracies over existing benchmarks. Our experiments illustrate the benefits of incorporating various semantic features such as topics, concepts, sentiments, and polarities. Finally, we show that integration of information from compatible corpora can significantly improve classification performance. This form of multi-corpus training may be particularly useful in cases where data sets are heavily imbalanced (e.g., social media data), and may reduce the time and costs associated with the annotation of data in the future.
Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Adverse drug reaction; Natural language processing; Pharmacovigilance; Social media monitoring; Text classification

Mesh:

Year:  2014        PMID: 25451103      PMCID: PMC4355323          DOI: 10.1016/j.jbi.2014.11.002

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  30 in total

1.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

2.  Predicting adverse drug events from personal health messages.

Authors:  Brant W Chee; Richard Berlin; Bruce Schatz
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

3.  Understanding interobserver agreement: the kappa statistic.

Authors:  Anthony J Viera; Joanne M Garrett
Journal:  Fam Med       Date:  2005-05       Impact factor: 1.756

4.  Using information mining of the medical literature to improve drug safety.

Authors:  Kanaka D Shetty; Siddhartha R Dalal
Journal:  J Am Med Inform Assoc       Date:  2011-05-05       Impact factor: 4.497

5.  Social media and networks in pharmacovigilance: boon or bane?

Authors:  I Ralph Edwards; Marie Lindquist
Journal:  Drug Saf       Date:  2011-04-01       Impact factor: 5.606

6.  Combing signals from spontaneous reports and electronic health records for detection of adverse drug reactions.

Authors:  Rave Harpaz; Santiago Vilar; William Dumouchel; Hojjat Salmasian; Krystl Haerian; Nigam H Shah; Herbert S Chase; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2012-10-31       Impact factor: 4.497

7.  Identifying potential adverse effects using the web: a new approach to medical hypothesis generation.

Authors:  Adrian Benton; Lyle Ungar; Shawndra Hill; Sean Hennessy; Jun Mao; Annie Chung; Charles E Leonard; John H Holmes
Journal:  J Biomed Inform       Date:  2011-07-26       Impact factor: 6.317

8.  A pipeline to extract drug-adverse event pairs from multiple data sources.

Authors:  Srijyothsna Yeleswarapu; Aditya Rao; Thomas Joseph; Vangala Govindakrishnan Saipradeep; Rajgopal Srinivasan
Journal:  BMC Med Inform Decis Mak       Date:  2014-02-24       Impact factor: 2.796

9.  A side effect resource to capture phenotypic effects of drugs.

Authors:  Michael Kuhn; Monica Campillos; Ivica Letunic; Lars Juhl Jensen; Peer Bork
Journal:  Mol Syst Biol       Date:  2010-01-19       Impact factor: 11.429

10.  Extraction of potential adverse drug events from medical case reports.

Authors:  Harsha Gurulingappa; Abdul Mateen-Rajput; Luca Toldo
Journal:  J Biomed Semantics       Date:  2012-12-20
View more
  60 in total

Review 1.  Clinical Natural Language Processing in 2014: Foundational Methods Supporting Efficient Healthcare.

Authors:  A Névéol; P Zweigenbaum
Journal:  Yearb Med Inform       Date:  2015-08-13

2.  Social Media Listening for Routine Post-Marketing Safety Surveillance.

Authors:  Gregory E Powell; Harry A Seifert; Tjark Reblin; Phil J Burstein; James Blowers; J Alan Menius; Jeffery L Painter; Michele Thomas; Carrie E Pierce; Harold W Rodriguez; John S Brownstein; Clark C Freifeld; Heidi G Bell; Nabarun Dasgupta
Journal:  Drug Saf       Date:  2016-05       Impact factor: 5.606

Review 3.  Utilizing social media data for pharmacovigilance: A review.

Authors:  Abeed Sarker; Rachel Ginn; Azadeh Nikfarjam; Karen O'Connor; Karen Smith; Swetha Jayaraman; Tejaswi Upadhaya; Graciela Gonzalez
Journal:  J Biomed Inform       Date:  2015-02-23       Impact factor: 6.317

4.  Using Electronic Health Records for Quality Measurement and Accountability in Care of the Seriously Ill: Opportunities and Challenges.

Authors:  J Randall Curtis; Seelwan Sathitratanacheewin; Helene Starks; Robert Y Lee; Erin K Kross; Lois Downey; James Sibley; William Lober; Elizabeth T Loggers; James A Fausto; Charlotta Lindvall; Ruth A Engelberg
Journal:  J Palliat Med       Date:  2017-11-28       Impact factor: 2.947

5.  Transparent Reporting on Research Using Unstructured Electronic Health Record Data to Generate 'Real World' Evidence of Comparative Effectiveness and Safety.

Authors:  Shirley V Wang; Olga V Patterson; Joshua J Gagne; Jeffrey S Brown; Robert Ball; Pall Jonsson; Adam Wright; Li Zhou; Wim Goettsch; Andrew Bate
Journal:  Drug Saf       Date:  2019-11       Impact factor: 5.606

6.  Standardizing Heterogeneous Annotation Corpora Using HL7 FHIR for Facilitating their Reuse and Integration in Clinical NLP.

Authors:  Na Hong; Andrew Wen; Majid Rastegar Mojarad; Sunghwan Sohn; Hongfang Liu; Guoqian Jiang
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

7.  Will mHealth Revolutionize Health and Clinical Management and Open up New Horizons for Mental Health?

Authors:  E Conchon; N Bricon-Souf
Journal:  Yearb Med Inform       Date:  2016-11-10

8.  Detection of Adverse Drug Reactions using Medical Named Entities on Twitter.

Authors:  Andrew MacKinlay; Hafsah Aamer; Antonio Jimeno Yepes
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

9.  A comparison of rule-based and machine learning approaches for classifying patient portal messages.

Authors:  Robert M Cronin; Daniel Fabbri; Joshua C Denny; S Trent Rosenbloom; Gretchen Purcell Jackson
Journal:  Int J Med Inform       Date:  2017-06-23       Impact factor: 4.046

10.  Deep learning for pollen allergy surveillance from twitter in Australia.

Authors:  Jia Rong; Sandra Michalska; Sudha Subramani; Jiahua Du; Hua Wang
Journal:  BMC Med Inform Decis Mak       Date:  2019-11-08       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.