Literature DB >> 36266295

An automatic hypothesis generation for plausible linkage between xanthium and diabetes.

Arida Ferti Syafiandini1, Gyuri Song1, Yuri Ahn1, Heeyoung Kim1, Min Song2.   

Abstract

There has been a significant increase in text mining implementation for biomedical literature in recent years. Previous studies introduced the implementation of text mining and literature-based discovery to generate hypotheses of potential candidates for drug development. By conducting a hypothesis-generation step and using evidence from published journal articles or proceedings, previous studies have managed to reduce experimental time and costs. First, we applied the closed discovery approach from Swanson's ABC model to collect publications related to 36 Xanthium compounds or diabetes. Second, we extracted biomedical entities and relations using a knowledge extraction engine, the Public Knowledge Discovery Engine for Java or PKDE4J. Third, we built a knowledge graph using the obtained bio entities and relations and then generated paths with Xanthium compounds as source nodes and diabetes as the target node. Lastly, we employed graph embeddings to rank each path and evaluated the results based on domain experts' opinions and literature. Among 36 Xanthium compounds, 35 had direct paths to five diabetes-related nodes. We ranked 2,740,314 paths in total between 35 Xanthium compounds and three diabetes-related phrases: type 1 diabetes, type 2 diabetes, and diabetes mellitus. Based on the top five percentile paths, we concluded that adenosine, choline, beta-sitosterol, rhamnose, and scopoletin were potential candidates for diabetes drug development using natural products. Our framework for hypothesis generation employs a closed discovery from Swanson's ABC model that has proven very helpful in discovering biological linkages between bio entities. The PKDE4J tools we used to capture bio entities from our document collection could label entities into five categories: genes, compounds, phenotypes, biological processes, and molecular functions. Using the BioPREP model, we managed to interpret the semantic relatedness between two nodes and provided paths containing valuable hypotheses. Lastly, using a graph-embedding algorithm in our path-ranking analysis, we exploited the semantic relatedness while preserving the graph structure properties.
© 2022. The Author(s).

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 36266295      PMCID: PMC9585073          DOI: 10.1038/s41598-022-20752-0

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.996


  58 in total

1.  Text-based discovery in biomedicine: the architecture of the DAD-system.

Authors:  M Weeber; H Klein; A R Aronson; J G Mork; L T de Jong-van den Berg; R Vos
Journal:  Proc AMIA Symp       Date:  2000

Review 2.  Adenosine signaling in diabetes mellitus and associated cardiovascular and renal complications.

Authors:  Maria Peleli; Mattias Carlstrom
Journal:  Mol Aspects Med       Date:  2017-01-12

3.  Caffeic acid as active principle from the fruit of Xanthium strumarium to lower plasma glucose in diabetic rats.

Authors:  F L Hsu; Y C Chen; J T Cheng
Journal:  Planta Med       Date:  2000-04       Impact factor: 3.352

4.  Antihyperglycemic and antioxidant properties of caffeic acid in db/db mice.

Authors:  Un Ju Jung; Mi-Kyung Lee; Yong Bok Park; Seon-Min Jeon; Myung-Sook Choi
Journal:  J Pharmacol Exp Ther       Date:  2006-04-27       Impact factor: 4.030

5.  Design of new traditional Chinese medicine herbal formulae for treatment of type 2 diabetes mellitus based on network pharmacology.

Authors:  Rui-Feng Hu; Xiao-Bo Sun
Journal:  Chin J Nat Med       Date:  2017-06

6.  Discovering New Genes in the Pathways of Common Sporadic Neurodegenerative Diseases: A Bioinformatics Approach.

Authors:  Yong Hwan Kim; Seung Han Beak; Andreas Charidimou; Min Song
Journal:  J Alzheimers Dis       Date:  2016       Impact factor: 4.472

7.  A modular framework for biomedical concept recognition.

Authors:  David Campos; Sérgio Matos; José Luís Oliveira
Journal:  BMC Bioinformatics       Date:  2013-09-24       Impact factor: 3.169

8.  UniProt: the universal protein knowledgebase.

Authors: 
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

9.  New approach for understanding genome variations in KEGG.

Authors:  Minoru Kanehisa; Yoko Sato; Miho Furumichi; Kanae Morishima; Mao Tanabe
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  PubChem 2019 update: improved access to chemical data.

Authors:  Sunghwan Kim; Jie Chen; Tiejun Cheng; Asta Gindulyte; Jia He; Siqian He; Qingliang Li; Benjamin A Shoemaker; Paul A Thiessen; Bo Yu; Leonid Zaslavsky; Jian Zhang; Evan E Bolton
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.