Literature DB >> 24928448

Automatic construction of a large-scale and accurate drug-side-effect association knowledge base from biomedical literature.

Rong Xu1, QuanQiu Wang2.   

Abstract

Systems approaches to studying drug-side-effect (drug-SE) associations are emerging as an active research area for drug target discovery, drug repositioning, and drug toxicity prediction. However, currently available drug-SE association databases are far from being complete. Herein, in an effort to increase the data completeness of current drug-SE relationship resources, we present an automatic learning approach to accurately extract drug-SE pairs from the vast amount of published biomedical literature, a rich knowledge source of side effect information for commercial, experimental, and even failed drugs. For the text corpus, we used 119,085,682 MEDLINE sentences and their parse trees. We used known drug-SE associations derived from US Food and Drug Administration (FDA) drug labels as prior knowledge to find relevant sentences and parse trees. We extracted syntactic patterns associated with drug-SE pairs from the resulting set of parse trees. We developed pattern-ranking algorithms to prioritize drug-SE-specific patterns. We then selected a set of patterns with both high precisions and recalls in order to extract drug-SE pairs from the entire MEDLINE. In total, we extracted 38,871 drug-SE pairs from MEDLINE using the learned patterns, the majority of which have not been captured in FDA drug labels to date. On average, our knowledge-driven pattern-learning approach in extracting drug-SE pairs from MEDLINE has achieved a precision of 0.833, a recall of 0.407, and an F1 of 0.545. We compared our approach to a support vector machine (SVM)-based machine learning and a co-occurrence statistics-based approach. We show that the pattern-learning approach is largely complementary to the SVM- and co-occurrence-based approaches with significantly higher precision and F1 but lower recall. We demonstrated by correlation analysis that the extracted drug side effects correlate positively with both drug targets, metabolism, and indications.
Copyright © 2014 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Drug discovery; Drug repositioning; Drug side effect; Drug toxicity prediction; Text mining

Mesh:

Year:  2014        PMID: 24928448      PMCID: PMC4589180          DOI: 10.1016/j.jbi.2014.05.013

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  32 in total

1.  Predicting adverse drug events using pharmacological network models.

Authors:  Aurel Cami; Alana Arnold; Shannon Manzi; Ben Reis
Journal:  Sci Transl Med       Date:  2011-12-21       Impact factor: 17.956

2.  A knowledge-driven conditional approach to extract pharmacogenomics specific drug-gene relationships from free text.

Authors:  Rong Xu; Quanqiu Wang
Journal:  J Biomed Inform       Date:  2012-04-27       Impact factor: 6.317

3.  Towards building a disease-phenotype knowledge base: extracting disease-manifestation relationship from literature.

Authors:  Rong Xu; Li Li; Quanqiu Wang
Journal:  Bioinformatics       Date:  2013-07-04       Impact factor: 6.937

4.  Drug-target network.

Authors:  Muhammed A Yildirim; Kwang-Il Goh; Michael E Cusick; Albert-László Barabási; Marc Vidal
Journal:  Nat Biotechnol       Date:  2007-10       Impact factor: 54.908

5.  Using information mining of the medical literature to improve drug safety.

Authors:  Kanaka D Shetty; Siddhartha R Dalal
Journal:  J Am Med Inform Assoc       Date:  2011-05-05       Impact factor: 4.497

6.  An algorithmic framework for predicting side effects of drugs.

Authors:  Nir Atias; Roded Sharan
Journal:  J Comput Biol       Date:  2011-03       Impact factor: 1.479

Review 7.  Computational drug repositioning: from data to therapeutics.

Authors:  M R Hurle; L Yang; Q Xie; D K Rajpal; P Sanseau; P Agarwal
Journal:  Clin Pharmacol Ther       Date:  2013-01-15       Impact factor: 6.875

8.  Drug discovery using chemical systems biology: weak inhibition of multiple kinases may contribute to the anti-cancer effect of nelfinavir.

Authors:  Li Xie; Thomas Evangelidis; Lei Xie; Philip E Bourne
Journal:  PLoS Comput Biol       Date:  2011-04-28       Impact factor: 4.475

9.  A side effect resource to capture phenotypic effects of drugs.

Authors:  Michael Kuhn; Monica Campillos; Ivica Letunic; Lars Juhl Jensen; Peer Bork
Journal:  Mol Syst Biol       Date:  2010-01-19       Impact factor: 11.429

10.  Extraction of potential adverse drug events from medical case reports.

Authors:  Harsha Gurulingappa; Abdul Mateen-Rajput; Luca Toldo
Journal:  J Biomed Semantics       Date:  2012-12-20
View more
  23 in total

1.  Large-scale automatic extraction of side effects associated with targeted anticancer drugs from full-text oncological articles.

Authors:  Rong Xu; QuanQiu Wang
Journal:  J Biomed Inform       Date:  2015-03-27       Impact factor: 6.317

2.  Development of a Controlled Vocabulary-Based Adverse Drug Reaction Signal Dictionary for Multicenter Electronic Health Record-Based Pharmacovigilance.

Authors:  Suehyun Lee; Jongsoo Han; Rae Woong Park; Grace Juyun Kim; John Hoon Rim; Jooyoung Cho; Kye Hwa Lee; Jisan Lee; Sujeong Kim; Ju Han Kim
Journal:  Drug Saf       Date:  2019-05       Impact factor: 5.606

3.  Discovering biomedical semantic relations in PubMed queries for information retrieval and database curation.

Authors:  Chung-Chi Huang; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2016-03-25       Impact factor: 3.451

4.  Comparing a knowledge-driven approach to a supervised machine learning approach in large-scale extraction of drug-side effect relationships from free-text biomedical literature.

Authors:  Rong Xu; QuanQiu Wang
Journal:  BMC Bioinformatics       Date:  2015-03-18       Impact factor: 3.169

5.  Combining automatic table classification and relationship extraction in extracting anticancer drug-side effect pairs from full-text articles.

Authors:  Rong Xu; QuanQiu Wang
Journal:  J Biomed Inform       Date:  2014-10-13       Impact factor: 6.317

Review 6.  Implications of the small number of distinct ligand binding pockets in proteins for drug discovery, evolution and biochemical function.

Authors:  Jeffrey Skolnick; Mu Gao; Ambrish Roy; Bharath Srinivasan; Hongyi Zhou
Journal:  Bioorg Med Chem Lett       Date:  2015-02-03       Impact factor: 2.823

7.  tcTKB: an integrated cardiovascular toxicity knowledge base for targeted cancer drugs.

Authors:  Rong Xu; QuanQiu Wang
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

Review 8.  An introduction to machine learning and analysis of its use in rheumatic diseases.

Authors:  Kathryn M Kingsmore; Christopher E Puglisi; Amrie C Grammer; Peter E Lipsky
Journal:  Nat Rev Rheumatol       Date:  2021-11-02       Impact factor: 20.543

9.  Chemical-induced disease relation extraction with various linguistic features.

Authors:  Jinghang Gu; Longhua Qian; Guodong Zhou
Journal:  Database (Oxford)       Date:  2016-04-06       Impact factor: 3.451

10.  CD-REST: a system for extracting chemical-induced disease relation in literature.

Authors:  Jun Xu; Yonghui Wu; Yaoyun Zhang; Jingqi Wang; Hee-Jin Lee; Hua Xu
Journal:  Database (Oxford)       Date:  2016-03-25       Impact factor: 3.451

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.