Literature DB >> 28427468

Learning from biomedical linked data to suggest valid pharmacogenes.

Kevin Dalleau1, Yassine Marzougui1,2, Sébastien Da Silva1, Patrice Ringot1, Ndeye Coumba Ndiaye3, Adrien Coulet4.   

Abstract

BACKGROUND: A standard task in pharmacogenomics research is identifying genes that may be involved in drug response variability, i.e., pharmacogenes. Because genomic experiments tended to generate many false positives, computational approaches based on the use of background knowledge have been proposed. Until now, only molecular networks or the biomedical literature were used, whereas many other resources are available.
METHOD: We propose here to consume a diverse and larger set of resources using linked data related either to genes, drugs or diseases. One of the advantages of linked data is that they are built on a standard framework that facilitates the joint use of various sources, and thus facilitates considering features of various origins. We propose a selection and linkage of data sources relevant to pharmacogenomics, including for example DisGeNET and Clinvar. We use machine learning to identify and prioritize pharmacogenes that are the most probably valid, considering the selected linked data. This identification relies on the classification of gene-drug pairs as either pharmacogenomically associated or not and was experimented with two machine learning methods -random forest and graph kernel-, which results are compared in this article.
RESULTS: We assembled a set of linked data relative to pharmacogenomics, of 2,610,793 triples, coming from six distinct resources. Learning from these data, random forest enables identifying valid pharmacogenes with a F-measure of 0.73, on a 10 folds cross-validation, whereas graph kernel achieves a F-measure of 0.81. A list of top candidates proposed by both approaches is provided and their obtention is discussed.

Entities:  

Keywords:  Data mining; Knowledge discovery from databases; Linked data; Machine learning; Pharmacogenomics; Valid pharmacogenes

Mesh:

Year:  2017        PMID: 28427468      PMCID: PMC5399403          DOI: 10.1186/s13326-017-0125-1

Source DB:  PubMed          Journal:  J Biomed Semantics


  30 in total

Review 1.  Biological knowledge management: the emerging role of the Semantic Web technologies.

Authors:  Erick Antezana; Martin Kuiper; Vladimir Mironov
Journal:  Brief Bioinform       Date:  2009-05-19       Impact factor: 11.622

2.  To replicate or not to replicate: the case of pharmacogenetic studies: Have pharmacogenomics failed, or do they just need larger-scale evidence and more replication?

Authors:  John P A Ioannidis
Journal:  Circ Cardiovasc Genet       Date:  2013-08

Review 3.  Pharmacogenomics knowledge for personalized medicine.

Authors:  M Whirl-Carrillo; E M McDonagh; J M Hebert; L Gong; K Sangkuhl; C F Thorn; R B Altman; T E Klein
Journal:  Clin Pharmacol Ther       Date:  2012-10       Impact factor: 6.875

4.  Generating genome-scale candidate gene lists for pharmacogenomics.

Authors:  N T Hansen; S Brunak; R B Altman
Journal:  Clin Pharmacol Ther       Date:  2009-04-15       Impact factor: 6.875

5.  Linked open drug data for pharmaceutical research and development.

Authors:  Matthias Samwald; Anja Jentzsch; Christopher Bouton; Claus Stie Kallesøe; Egon Willighagen; Janos Hajagos; M Scott Marshall; Eric Prud'hommeaux; Oktie Hassenzadeh; Elgar Pichler; Susie Stephens
Journal:  J Cheminform       Date:  2011-05-16       Impact factor: 5.514

6.  DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes.

Authors:  Janet Piñero; Núria Queralt-Rosinach; Àlex Bravo; Jordi Deu-Pons; Anna Bauer-Mehren; Martin Baron; Ferran Sanz; Laura I Furlong
Journal:  Database (Oxford)       Date:  2015-04-15       Impact factor: 3.451

7.  Hyperlink Management System and ID Converter System: enabling maintenance-free hyperlinks among major biological databases.

Authors:  Tadashi Imanishi; Hajime Nakaoka
Journal:  Nucleic Acids Res       Date:  2009-05-19       Impact factor: 16.971

8.  A side effect resource to capture phenotypic effects of drugs.

Authors:  Michael Kuhn; Monica Campillos; Ivica Letunic; Lars Juhl Jensen; Peer Bork
Journal:  Mol Syst Biol       Date:  2010-01-19       Impact factor: 11.429

9.  ClinVar: public archive of relationships among sequence variation and human phenotype.

Authors:  Melissa J Landrum; Jennifer M Lee; George R Riley; Wonhee Jang; Wendy S Rubinstein; Deanna M Church; Donna R Maglott
Journal:  Nucleic Acids Res       Date:  2013-11-14       Impact factor: 16.971

10.  DGIdb 2.0: mining clinically relevant drug-gene interactions.

Authors:  Alex H Wagner; Adam C Coffman; Benjamin J Ainscough; Nicholas C Spies; Zachary L Skidmore; Katie M Campbell; Kilannin Krysiak; Deng Pan; Joshua F McMichael; James M Eldred; Jason R Walker; Richard K Wilson; Elaine R Mardis; Malachi Griffith; Obi L Griffith
Journal:  Nucleic Acids Res       Date:  2015-11-03       Impact factor: 16.971

View more
  2 in total

1.  Using predicate and provenance information from a knowledge graph for drug efficacy screening.

Authors:  Wytze J Vlietstra; Rein Vos; Anneke M Sijbers; Erik M van Mulligen; Jan A Kors
Journal:  J Biomed Semantics       Date:  2018-09-06

2.  Identification of early stage recurrence endometrial cancer biomarkers using bioinformatics tools.

Authors:  María José Besso; Luciana Montivero; Ezequiel Lacunza; María Cecilia Argibay; Martín Abba; Laura Inés Furlong; Eva Colas; Antonio Gil-Moreno; Jaume Reventos; Ricardo Bello; Mónica Hebe Vazquez-Levin
Journal:  Oncol Rep       Date:  2020-06-16       Impact factor: 3.906

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.