Literature DB >> 29763706

Exploiting semantic patterns over biomedical knowledge graphs for predicting treatment and causative relations.

Gokhan Bakal1, Preetham Talari2, Elijah V Kakani3, Ramakanth Kavuluru4.   

Abstract

BACKGROUND: Identifying new potential treatment options for medical conditions that cause human disease burden is a central task of biomedical research. Since all candidate drugs cannot be tested with animal and clinical trials, in vitro approaches are first attempted to identify promising candidates. Likewise, identifying different causal relations between biomedical entities is also critical to understand biomedical processes. Generally, natural language processing (NLP) and machine learning are used to predict specific relations between any given pair of entities using the distant supervision approach.
OBJECTIVE: To build high accuracy supervised predictive models to predict previously unknown treatment and causative relations between biomedical entities based only on semantic graph pattern features extracted from biomedical knowledge graphs.
METHODS: We used 7000 treats and 2918 causes hand-curated relations from the UMLS Metathesaurus to train and test our models. Our graph pattern features are extracted from simple paths connecting biomedical entities in the SemMedDB graph (based on the well-known SemMedDB database made available by the U.S. National Library of Medicine). Using these graph patterns connecting biomedical entities as features of logistic regression and decision tree models, we computed mean performance measures (precision, recall, F-score) over 100 distinct 80-20% train-test splits of the datasets. For all experiments, we used a positive:negative class imbalance of 1:10 in the test set to model relatively more realistic scenarios.
RESULTS: Our models predict treats and causes relations with high F-scores of 99% and 90% respectively. Logistic regression model coefficients also help us identify highly discriminative patterns that have an intuitive interpretation. We are also able to predict some new plausible relations based on false positives that our models scored highly based on our collaborations with two physician co-authors. Finally, our decision tree models are able to retrieve over 50% of treatment relations from a recently created external dataset.
CONCLUSIONS: We employed semantic graph patterns connecting pairs of candidate biomedical entities in a knowledge graph as features to predict treatment/causative relations between them. We provide what we believe is the first evidence in direct prediction of biomedical relations based on graph features. Our work complements lexical pattern based approaches in that the graph patterns can be used as additional features for weakly supervised relation prediction.
Copyright © 2018 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Information extraction; Relation prediction; Semantic graph patterns

Mesh:

Year:  2018        PMID: 29763706      PMCID: PMC6070294          DOI: 10.1016/j.jbi.2018.05.003

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  25 in total

1.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.

Authors:  Thomas C Rindflesch; Marcelo Fiszman
Journal:  J Biomed Inform       Date:  2003-12       Impact factor: 6.317

2.  Bridging semantics and syntax with graph algorithms-state-of-the-art of extracting biomedical relations.

Authors:  Yuan Luo; Özlem Uzuner; Peter Szolovits
Journal:  Brief Bioinform       Date:  2016-02-05       Impact factor: 11.622

3.  Using a shallow linguistic kernel for drug-drug interaction extraction.

Authors:  Isabel Segura-Bedmar; Paloma Martínez; Cesar de Pablo-Sánchez
Journal:  J Biomed Inform       Date:  2011-04-24       Impact factor: 6.317

Review 4.  Literature mining, ontologies and information visualization for drug repurposing.

Authors:  Christos Andronis; Anuj Sharma; Vassilis Virvilis; Spyros Deftereos; Aris Persidis
Journal:  Brief Bioinform       Date:  2011-06-28       Impact factor: 11.622

Review 5.  A survey of current trends in computational drug repositioning.

Authors:  Jiao Li; Si Zheng; Bin Chen; Atul J Butte; S Joshua Swamidass; Zhiyong Lu
Journal:  Brief Bioinform       Date:  2015-03-31       Impact factor: 11.622

6.  Using semantic predications to uncover drug-drug interactions in clinical data.

Authors:  Rui Zhang; Michael J Cairelli; Marcelo Fiszman; Graciela Rosemblat; Halil Kilicoglu; Thomas C Rindflesch; Serguei V Pakhomov; Genevieve B Melton
Journal:  J Biomed Inform       Date:  2014-01-19       Impact factor: 6.317

7.  Discovering discovery patterns with Predication-based Semantic Indexing.

Authors:  Trevor Cohen; Dominic Widdows; Roger W Schvaneveldt; Peter Davies; Thomas C Rindflesch
Journal:  J Biomed Inform       Date:  2012-07-26       Impact factor: 6.317

8.  SemMedDB: a PubMed-scale repository of biomedical semantic predications.

Authors:  Halil Kilicoglu; Dongwook Shin; Marcelo Fiszman; Graciela Rosemblat; Thomas C Rindflesch
Journal:  Bioinformatics       Date:  2012-10-08       Impact factor: 6.937

9.  Context-driven automatic subgraph creation for literature-based discovery.

Authors:  Delroy Cameron; Ramakanth Kavuluru; Thomas C Rindflesch; Amit P Sheth; Krishnaprasad Thirunarayan; Olivier Bodenreider
Journal:  J Biomed Inform       Date:  2015-02-07       Impact factor: 6.317

10.  DrugCentral: online drug compendium.

Authors:  Oleg Ursu; Jayme Holmes; Jeffrey Knockel; Cristian G Bologa; Jeremy J Yang; Stephen L Mathias; Stuart J Nelson; Tudor I Oprea
Journal:  Nucleic Acids Res       Date:  2016-10-26       Impact factor: 16.971

View more
  9 in total

1.  Non-Negative Matrix Factorization for Drug Repositioning: Experiments with the repoDB Dataset.

Authors:  Gokhan Bakal; Halil Kilicoglu; Ramakanth Kavuluru
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04

2.  Knowledge-Based Biomedical Data Science.

Authors:  Tiffany J Callahan; Ignacio J Tripodi; Harrison Pielke-Lombardo; Lawrence E Hunter
Journal:  Annu Rev Biomed Data Sci       Date:  2020-04-07

3.  Exploring relationship between emotion and probiotics with knowledge graphs.

Authors:  Yueping Sun; Jiao Li; Zidu Xu; Yan Liu; Li Hou; Zhisheng Huang
Journal:  Health Inf Sci Syst       Date:  2022-09-10

4.  Identifying Cases of Shoulder Injury Related to Vaccine Administration (SIRVA) in the United States: Development and Validation of a Natural Language Processing Method.

Authors:  Chengyi Zheng; Jonathan Duffy; In-Lu Amy Liu; Lina S Sy; Ronald A Navarro; Sunhea S Kim; Denison S Ryan; Wansu Chen; Lei Qian; Cheryl Mercado; Steven J Jacobsen
Journal:  JMIR Public Health Surveill       Date:  2022-05-24

5.  AnthraxKP: a knowledge graph-based, Anthrax Knowledge Portal mined from biomedical literature.

Authors:  Baiyang Feng; Jing Gao
Journal:  Database (Oxford)       Date:  2022-06-02       Impact factor: 4.462

6.  Integrating Unified Medical Language System and Kleinberg's Burst Detection Algorithm into Research Topics of Medications for Post-Traumatic Stress Disorder.

Authors:  Shuang Xu; Dan Xu; Liang Wen; Chen Zhu; Ying Yang; Shuang Han; Peng Guan
Journal:  Drug Des Devel Ther       Date:  2020-09-24       Impact factor: 4.162

7.  A Year of Papers Using Biomedical Texts: Findings from the Section on Natural Language Processing of the IMIA Yearbook.

Authors:  Natalia Grabar; Cyril Grouin
Journal:  Yearb Med Inform       Date:  2019-08-16

8.  Community Approaches for Integrating Environmental Exposures into Human Models of Disease.

Authors:  Anne E Thessen; Cynthia J Grondin; Resham D Kulkarni; Susanne Brander; Lisa Truong; Nicole A Vasilevsky; Tiffany J Callahan; Lauren E Chan; Brian Westra; Mary Willis; Sarah E Rothenberg; Annie M Jarabek; Lyle Burgoon; Susan A Korrick; Melissa A Haendel
Journal:  Environ Health Perspect       Date:  2020-12-28       Impact factor: 9.031

9.  Broad-coverage biomedical relation extraction with SemRep.

Authors:  Halil Kilicoglu; Graciela Rosemblat; Marcelo Fiszman; Dongwook Shin
Journal:  BMC Bioinformatics       Date:  2020-05-14       Impact factor: 3.169

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.