Literature DB >> 23570835

A semi-supervised approach to extract pharmacogenomics-specific drug-gene pairs from biomedical literature for personalized medicine.

Rong Xu1, Quanqiu Wang.   

Abstract

Personalized medicine is to deliver the right drug to the right patient in the right dose. Pharmacogenomics (PGx) is to identify genetic variants that may affect drug efficacy and toxicity. The availability of a comprehensive and accurate PGx-specific drug-gene relationship knowledge base is important for personalized medicine. However, building a large-scale PGx-specific drug-gene knowledge base is a difficult task. In this study, we developed a bootstrapping, semi-supervised learning approach to iteratively extract and rank drug-gene pairs according to their relevance to drug pharmacogenomics. Starting with a single PGx-specific seed pair and 20 million MEDLINE abstracts, the extraction algorithm achieved a precision of 0.219, recall of 0.368 and F1 of 0.274 after two iterations, a significant improvement over the results of using non-PGx-specific seeds (precision: 0.011, recall: 0.018, and F1: 0.014) or co-occurrence (precision: 0.015, recall: 1.000, and F1: 0.030). After the extraction step, the ranking algorithm further improved the precision from 0.219 to 0.561 for top ranked pairs. By comparing to a dictionary-based approach with PGx-specific gene lexicon as input, we showed that the bootstrapping approach has better performance in terms of both precision and F1 (precision: 0.251 vs. 0.152, recall: 0.396 vs. 0.856 and F1: 0.292 vs. 0.254). By integrative analysis using a large drug adverse event database, we have shown that the extracted drug-gene pairs strongly correlate with drug adverse events. In conclusion, we developed a novel semi-supervised bootstrapping approach for effective PGx-specific drug-gene pair extraction from large number of MEDLINE articles with minimal human input.
Copyright © 2013 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Information extraction; Personalized medicine; Pharmacogenomics; Text mining

Mesh:

Year:  2013        PMID: 23570835      PMCID: PMC4452014          DOI: 10.1016/j.jbi.2013.04.001

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  24 in total

1.  Unsupervised method for extracting machine understandable medical knowledge from a large free text collection.

Authors:  Rong Xu; Amar K Das; Alan M Garber
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

Review 2.  The microeconomics of personalized medicine: today's challenge and tomorrow's promise.

Authors:  Jerel C Davis; Laura Furstenthal; Amar A Desai; Troy Norris; Saumya Sutaria; Edd Fleming; Philip Ma
Journal:  Nat Rev Drug Discov       Date:  2009-03-20       Impact factor: 84.694

3.  A knowledge-driven conditional approach to extract pharmacogenomics specific drug-gene relationships from free text.

Authors:  Rong Xu; Quanqiu Wang
Journal:  J Biomed Inform       Date:  2012-04-27       Impact factor: 6.317

Review 4.  Creating and evaluating genetic tests predictive of drug response.

Authors:  Scott T Weiss; Howard L McLeod; David A Flockhart; M Eileen Dolan; Neal L Benowitz; Julie A Johnson; Mark J Ratain; Kathleen M Giacomini
Journal:  Nat Rev Drug Discov       Date:  2008-06-20       Impact factor: 84.694

5.  Ranking gene-drug relationships in biomedical literature using Latent Dirichlet Allocation.

Authors:  Yonghui Wu; Mei Liu; W Jim Zheng; Zhongming Zhao; Hua Xu
Journal:  Pac Symp Biocomput       Date:  2012

Review 6.  Data-driven methods to discover molecular determinants of serious adverse drug events.

Authors:  A P Chiang; A J Butte
Journal:  Clin Pharmacol Ther       Date:  2009-01-28       Impact factor: 6.875

7.  An iterative searching and ranking algorithm for prioritising pharmacogenomics genes.

Authors:  Rong Xu; Quanqiu Wang
Journal:  Int J Comput Biol Drug Des       Date:  2013-02-21

8.  Generating genome-scale candidate gene lists for pharmacogenomics.

Authors:  N T Hansen; S Brunak; R B Altman
Journal:  Clin Pharmacol Ther       Date:  2009-04-15       Impact factor: 6.875

9.  Extraction of Conditional Probabilities of the Relationships Between Drugs, Diseases, and Genes from PubMed Guided by Relationships in PharmGKB.

Authors:  Martin Theobald; Nigam Shah; Jeff Shrager
Journal:  Summit Transl Bioinform       Date:  2009-03-01

10.  A side effect resource to capture phenotypic effects of drugs.

Authors:  Michael Kuhn; Monica Campillos; Ivica Letunic; Lars Juhl Jensen; Peer Bork
Journal:  Mol Syst Biol       Date:  2010-01-19       Impact factor: 11.429

View more
  8 in total

1.  Towards building a disease-phenotype knowledge base: extracting disease-manifestation relationship from literature.

Authors:  Rong Xu; Li Li; Quanqiu Wang
Journal:  Bioinformatics       Date:  2013-07-04       Impact factor: 6.937

2.  Automatic construction of a large-scale and accurate drug-side-effect association knowledge base from biomedical literature.

Authors:  Rong Xu; QuanQiu Wang
Journal:  J Biomed Inform       Date:  2014-06-10       Impact factor: 6.317

3.  Clinical Document Classification Using Labeled and Unlabeled Data Across Hospitals.

Authors:  Hamed Hassanzadeh; Mahnoosh Kholghi; Anthony Nguyen; Kevin Chu
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

4.  Comparing a knowledge-driven approach to a supervised machine learning approach in large-scale extraction of drug-side effect relationships from free-text biomedical literature.

Authors:  Rong Xu; QuanQiu Wang
Journal:  BMC Bioinformatics       Date:  2015-03-18       Impact factor: 3.169

Review 5.  Computational dynamic approaches for temporal omics data with applications to systems medicine.

Authors:  Yulan Liang; Arpad Kelemen
Journal:  BioData Min       Date:  2017-06-17       Impact factor: 2.522

6.  Global Text Mining and Development of Pharmacogenomic Knowledge Resource for Precision Medicine.

Authors:  Debleena Guin; Jyoti Rani; Priyanka Singh; Sandeep Grover; Shivangi Bora; Puneet Talwar; Muthusamy Karthikeyan; K Satyamoorthy; C Adithan; S Ramachandran; Luciano Saso; Yasha Hasija; Ritushree Kukreti
Journal:  Front Pharmacol       Date:  2019-08-07       Impact factor: 5.810

Review 7.  Chemical named entities recognition: a review on approaches and applications.

Authors:  Safaa Eltyeb; Naomie Salim
Journal:  J Cheminform       Date:  2014-04-28       Impact factor: 5.514

8.  Identifying Liver Cancer and Its Relations with Diseases, Drugs, and Genes: A Literature-Based Approach.

Authors:  Yongjun Zhu; Min Song; Erjia Yan
Journal:  PLoS One       Date:  2016-05-19       Impact factor: 3.240

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.