
Application and evaluation of knowledge graph embeddings in biomedical data.

Mona Alshahrani, Maha A Thafar, Magbubah Essack.

Abstract

Linked data and bio-ontologies, which enable knowledge representation, standardization, and dissemination, are integral to the development of biological and biomedical databases. In these databases they are used to maintain data integrity, organize data, and support search. More recently, linked data and bio-ontologies have also been used to represent information as multi-relational heterogeneous graphs, known as "knowledge graphs". The reason is that entities and relations in a knowledge graph can be represented as embedding vectors in a semantic space, and these embedding vectors have been used to predict relationships between entities. Such knowledge graph embedding methods provide a practical approach to data analytics and improve the chances of building machine learning models with high prediction accuracy that can enhance decision support systems. Here, we present a comparative assessment and a standard benchmark for knowledge graph-based representation learning methods, focused on the link prediction task for biological relations. We systematically investigated and compared state-of-the-art embedding methods based on the design settings used for training and evaluation. We further tested various strategies for controlling the amount of information related to each relation in the knowledge graph and examined their effects on final performance. We also assessed the quality of the knowledge graph features through clustering and visualization, and employed several evaluation metrics to examine their uses and differences. Based on this systematic comparison and these assessments, we identify and discuss the limitations of knowledge graph-based representation learning methods and suggest guidelines for developing improved methods.
© 2021 Alshahrani et al.
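
To make the link prediction setting in the abstract concrete, the following is a minimal sketch of how embedding vectors can be used to score and rank candidate relationships. The abstract does not name the embedding methods or metrics that were compared, so everything here is an assumption for illustration: the translation-based (TransE-style) scoring function is just one well-known choice, the entity and relation names and their vectors are hand-made toy values, and mean reciprocal rank and Hits@k are shown only as commonly used ranking metrics.

```python
import numpy as np

def transe_score(head, relation, tail):
    """TransE-style plausibility score (an illustrative choice, not the paper's
    stated method): the closer head + relation is to tail, the higher the score."""
    return -np.linalg.norm(head + relation - tail, axis=-1)

def rank_of_true_tail(entity_emb, relation_emb, h, r, true_t):
    """1-based rank of the true tail when every entity is tried as the tail.
    This is the 'raw' setting: other known true triples are not filtered out."""
    scores = transe_score(entity_emb[h], relation_emb[r], entity_emb)
    # Count candidates that score strictly higher than the true tail.
    return int(np.sum(scores > scores[true_t])) + 1

def mean_reciprocal_rank(ranks):
    """Average of 1/rank over all test triples."""
    return float(np.mean(1.0 / np.asarray(ranks, dtype=float)))

def hits_at_k(ranks, k):
    """Fraction of test triples whose true tail is ranked in the top k."""
    return float(np.mean(np.asarray(ranks) <= k))

# Toy, hand-made 2-d embeddings (hypothetical values, not learned from data).
entity_emb = np.array([
    [0.0, 0.0],   # 0: gene_A
    [1.0, 0.0],   # 1: disease_X
    [0.0, 1.0],   # 2: disease_Y
    [3.0, 3.0],   # 3: drug_Z
])
relation_emb = np.array([
    [1.0, 0.0],   # 0: associated_with
])

# Held-out (head, relation, tail) triples to evaluate.
test_triples = [(0, 0, 1), (0, 0, 2)]
ranks = [rank_of_true_tail(entity_emb, relation_emb, h, r, t)
         for h, r, t in test_triples]

print("ranks:", ranks)
print("MRR:", mean_reciprocal_rank(ranks))
print("Hits@1:", hits_at_k(ranks, 1))
print("Hits@3:", hits_at_k(ranks, 3))
```

In this toy setup the first test triple is ranked first and the second is ranked third, so the two metrics disagree about how well the model does. That is one reason for reporting several evaluation metrics side by side.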


Keywords:  Bio-ontologies; Biomedicine; Comparative evaluation; Embeddings methods; Knowledge graphs; Linked data; Performance studies

Year:  2021        PMID: 33816992      PMCID: PMC7959619          DOI: 10.7717/peerj-cs.341

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


