Literature DB >> 22564551

Using PharmGKB to train text mining approaches for identifying potential gene targets for pharmacogenomic studies.

S Pakhomov1, B T McInnes, J Lamba, Y Liu, G B Melton, Y Ghodke, N Bhise, V Lamba, A K Birnbaum.   

Abstract

The main objective of this study was to investigate the feasibility of using PharmGKB, a pharmacogenomic database, as a source of training data in combination with text of MEDLINE abstracts for a text mining approach to identification of potential gene targets for pathway-driven pharmacogenomics research. We used the manually curated relations between drugs and genes in PharmGKB database to train a support vector machine predictive model and applied this model prospectively to MEDLINE abstracts. The gene targets suggested by this approach were subsequently manually reviewed. Our quantitative analysis showed that a support vector machine classifiers trained on MEDLINE abstracts with single words (unigrams) used as features and PharmGKB relations used for supervision, achieve an overall sensitivity of 85% and specificity of 69%. The subsequent qualitative analysis showed that gene targets "suggested" by the automatic classifier were not anticipated by expert reviewers but were subsequently found to be relevant to the three drugs that were investigated: carbamazepine, lamivudine and zidovudine. Our results show that this approach is not only feasible but may also find new gene targets not identifiable by other methods thus making it a valuable tool for pathway-driven pharmacogenomics research.
Copyright © 2012 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2012        PMID: 22564551      PMCID: PMC3438361          DOI: 10.1016/j.jbi.2012.04.007

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  12 in total

1.  Automatic resolution of ambiguous terms based on machine learning and conceptual relations in the UMLS.

Authors:  Hongfang Liu; Stephen B Johnson; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2002 Nov-Dec       Impact factor: 4.497

2.  A study of abbreviations in MEDLINE abstracts.

Authors:  Hongfang Liu; Alan R Aronson; Carol Friedman
Journal:  Proc AMIA Symp       Date:  2002

Review 3.  Inheritance and drug response.

Authors:  Richard Weinshilboum
Journal:  N Engl J Med       Date:  2003-02-06       Impact factor: 91.245

4.  Using text to build semantic networks for pharmacogenomics.

Authors:  Adrien Coulet; Nigam H Shah; Yael Garten; Mark Musen; Russ B Altman
Journal:  J Biomed Inform       Date:  2010-08-17       Impact factor: 6.317

5.  Abbreviation and acronym disambiguation in clinical discourse.

Authors:  Sergeui Pakhomov; Ted Pedersen; Christopher G Chute
Journal:  AMIA Annu Symp Proc       Date:  2005

6.  A comparative study of supervised learning as applied to acronym expansion in clinical reports.

Authors:  Mahesh Joshi; Serguei Pakhomov; Ted Pedersen; Christopher G Chute
Journal:  AMIA Annu Symp Proc       Date:  2006

7.  Improving the prediction of pharmacogenes using text-derived drug-gene relationships.

Authors:  Yael Garten; Nicholas P Tatonetti; Russ B Altman
Journal:  Pac Symp Biocomput       Date:  2010

Review 8.  Hospital admissions associated with adverse drug reactions: a systematic review of prospective observational studies.

Authors:  Chuenjid Kongkaew; Peter R Noyce; Darren M Ashcroft
Journal:  Ann Pharmacother       Date:  2008-07-01       Impact factor: 3.154

9.  Generating genome-scale candidate gene lists for pharmacogenomics.

Authors:  N T Hansen; S Brunak; R B Altman
Journal:  Clin Pharmacol Ther       Date:  2009-04-15       Impact factor: 6.875

10.  Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text.

Authors:  Yael Garten; Russ B Altman
Journal:  BMC Bioinformatics       Date:  2009-02-05       Impact factor: 3.169

View more
  7 in total

1.  Text Mining Protocol to Retrieve Significant Drug-Gene Interactions from PubMed Abstracts.

Authors:  Oviya Ramalakshmi Iyyappan; Sharanya Manoharan; Sadhanha Anand; Dheepa Anand; Manonmani Alvin Jose; Raja Ravi Shanker
Journal:  Methods Mol Biol       Date:  2022

Review 2.  Drug target inference through pathway analysis of genomics data.

Authors:  Haisu Ma; Hongyu Zhao
Journal:  Adv Drug Deliv Rev       Date:  2013-01-28       Impact factor: 15.470

3.  Discovery of novel biomarkers and phenotypes by semantic technologies.

Authors:  Carlo A Trugenberger; Christoph Wälti; David Peregrim; Mark E Sharp; Svetlana Bureeva
Journal:  BMC Bioinformatics       Date:  2013-02-13       Impact factor: 3.169

4.  Extraction of relations between genes and diseases from text and large-scale data analysis: implications for translational research.

Authors:  Àlex Bravo; Janet Piñero; Núria Queralt-Rosinach; Michael Rautschka; Laura I Furlong
Journal:  BMC Bioinformatics       Date:  2015-02-21       Impact factor: 3.169

5.  eGARD: Extracting associations between genomic anomalies and drug responses from text.

Authors:  A S M Ashique Mahmood; Shruti Rao; Peter McGarvey; Cathy Wu; Subha Madhavan; K Vijay-Shanker
Journal:  PLoS One       Date:  2017-12-20       Impact factor: 3.240

6.  A Framework of Knowledge Integration and Discovery for Supporting Pharmacogenomics Target Predication of Adverse Drug Events: A Case Study of Drug-Induced Long QT Syndrome.

Authors:  Guoqian Jiang; Chen Wang; Qian Zhu; Christopher G Chute
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2013-03-18

7.  PGxCorpus, a manually annotated corpus for pharmacogenomics.

Authors:  Joël Legrand; Romain Gogdemir; Cédric Bousquet; Kevin Dalleau; Marie-Dominique Devignes; William Digan; Chia-Ju Lee; Ndeye-Coumba Ndiaye; Nadine Petitpain; Patrice Ringot; Malika Smaïl-Tabbone; Yannick Toussaint; Adrien Coulet
Journal:  Sci Data       Date:  2020-01-02       Impact factor: 6.444

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.