Literature DB >> 23686935

Toward creation of a cancer drug toxicity knowledge base: automatically extracting cancer drug-side effect relationships from the literature.

Rong Xu1, QuanQiu Wang.   

Abstract

OBJECTIVE: A comprehensive and machine-understandable cancer drug-side effect (drug-SE) relationship knowledge base is important for in silico cancer drug target discovery, drug repurposing, and toxicity predication, and for personalized risk-benefit decisions by cancer patients. While US Food and Drug Administration (FDA) drug labels capture well-known cancer drug SE information, much cancer drug SE knowledge remains buried the published biomedical literature. We present a relationship extraction approach to extract cancer drug-SE pairs from the literature. DATA AND METHODS: We used 21,354,075 MEDLINE records as the text corpus. We extracted drug-SE co-occurrence pairs using a cancer drug lexicon and a clean SE lexicon that we created. We then developed two filtering approaches to remove drug-disease treatment pairs and subsequently a ranking scheme to further prioritize filtered pairs. Finally, we analyzed relationships among SEs, gene targets, and indications.
RESULTS: We extracted 56,602 cancer drug-SE pairs. The filtering algorithms improved the precision of extracted pairs from 0.252 at baseline to 0.426, representing a 69% improvement in precision with no decrease in recall. The ranking algorithm further prioritized filtered pairs and achieved a precision of 0.778 for top-ranked pairs. We showed that cancer drugs that share SEs tend to have overlapping gene targets and overlapping indications.
CONCLUSIONS: The relationship extraction approach is effective in extracting many cancer drug-SE pairs from the literature. This unique knowledge base, when combined with existing cancer drug SE knowledge, can facilitate drug target discovery, drug repurposing, and toxicity prediction.

Entities:  

Keywords:  Cancer Drug Toxicity; Drug Repurposing; Drug Target Discovery; Information Extraction; Natural Language Processing; Text Mining

Mesh:

Substances:

Year:  2013        PMID: 23686935      PMCID: PMC3912715          DOI: 10.1136/amiajnl-2012-001584

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  16 in total

1.  Automatic extraction of biological information from scientific text: protein-protein interactions.

Authors:  C Blaschke; M A Andrade; C Ouzounis; A Valencia
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1999

2.  GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles.

Authors:  C Friedman; P Kra; H Yu; M Krauthammer; A Rzhetsky
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

3.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  Dissemination of information on potentially fatal adverse drug reactions for cancer drugs from 2000 to 2002: first results from the research on adverse drug events and reports project.

Authors:  Lisa A Ladewski; Steven M Belknap; Jonathan R Nebeker; Oliver Sartor; E Allison Lyons; Timothy C Kuzel; Martin S Tallman; Dennis W Raisch; Amy R Auerbach; Glen T Schumock; Hau C Kwaan; Charles L Bennett
Journal:  J Clin Oncol       Date:  2003-10-15       Impact factor: 44.544

5.  A drug-adverse event extraction algorithm to support pharmacovigilance knowledge mining from PubMed citations.

Authors:  Wei Wang; Krystl Haerian; Hojjat Salmasian; Rave Harpaz; Herbert Chase; Carol Friedman
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

6.  Obstacles to answering doctors' questions about patient care with evidence: qualitative study.

Authors:  John W Ely; Jerome A Osheroff; Mark H Ebell; M Lee Chambliss; Daniel C Vinson; James J Stevermer; Eric A Pifer
Journal:  BMJ       Date:  2002-03-23

7.  Design and validation of an automated method to detect known adverse drug reactions in MEDLINE: a contribution from the EU-ADR project.

Authors:  Paul Avillach; Jean-Charles Dufour; Gayo Diallo; Francesco Salvo; Michel Joubert; Frantz Thiessard; Fleur Mougin; Gianluca Trifirò; Annie Fourrier-Réglat; Antoine Pariente; Marius Fieschi
Journal:  J Am Med Inform Assoc       Date:  2012-11-29       Impact factor: 4.497

8.  EDGAR: extraction of drugs, genes and relations from the biomedical literature.

Authors:  T C Rindflesch; L Tanabe; J N Weinstein; L Hunter
Journal:  Pac Symp Biocomput       Date:  2000

9.  Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs.

Authors:  Mei Liu; Yonghui Wu; Yukun Chen; Jingchun Sun; Zhongming Zhao; Xue-wen Chen; Michael Edwin Matheny; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2012-06       Impact factor: 4.497

10.  DrugBank: a knowledgebase for drugs, drug actions and drug targets.

Authors:  David S Wishart; Craig Knox; An Chi Guo; Dean Cheng; Savita Shrivastava; Dan Tzur; Bijaya Gautam; Murtaza Hassanali
Journal:  Nucleic Acids Res       Date:  2007-11-29       Impact factor: 16.971

View more
  9 in total

1.  Automatic signal extraction, prioritizing and filtering approaches in detecting post-marketing cardiovascular events associated with targeted cancer drugs from the FDA Adverse Event Reporting System (FAERS).

Authors:  Rong Xu; Quanqiu Wang
Journal:  J Biomed Inform       Date:  2013-10-28       Impact factor: 6.317

2.  Large-scale automatic extraction of side effects associated with targeted anticancer drugs from full-text oncological articles.

Authors:  Rong Xu; QuanQiu Wang
Journal:  J Biomed Inform       Date:  2015-03-27       Impact factor: 6.317

3.  PubMedMiner: Mining and Visualizing MeSH-based Associations in PubMed.

Authors:  Yucan Zhang; Indra Neil Sarkar; Elizabeth S Chen
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

4.  Constructing a knowledge-based heterogeneous information graph for medical health status classification.

Authors:  Thuan Pham; Xiaohui Tao; Ji Zhang; Jianming Yong
Journal:  Health Inf Sci Syst       Date:  2020-02-14

5.  tcTKB: an integrated cardiovascular toxicity knowledge base for targeted cancer drugs.

Authors:  Rong Xu; QuanQiu Wang
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

6.  Large-scale combining signals from both biomedical literature and the FDA Adverse Event Reporting System (FAERS) to improve post-marketing drug safety signal detection.

Authors:  Rong Xu; QuanQiu Wang
Journal:  BMC Bioinformatics       Date:  2014-01-15       Impact factor: 3.169

Review 7.  Big data: the next frontier for innovation in therapeutics and healthcare.

Authors:  Naiem T Issa; Stephen W Byers; Sivanesan Dakshanamurthy
Journal:  Expert Rev Clin Pharmacol       Date:  2014-04-07       Impact factor: 5.045

8.  Immunotherapy-related adverse events (irAEs): extraction from FDA drug labels and comparative analysis.

Authors:  QuanQiu Wang; Rong Xu
Journal:  JAMIA Open       Date:  2018-10-15

9.  Computational advances in cancer informatics (a).

Authors:  Xiaoqian Jiang; Rui Chen; Samuel Cheng; Xia Jiang; Bairong Shen; Rong Xu; Song Yi
Journal:  Cancer Inform       Date:  2014-10-13
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.