Literature DB >> 33622374

Hybrid semantic recommender system for chemical compounds in large-scale datasets.

Marcia Barros1,2, Andre Moitinho3, Francisco M Couto4.   

Abstract

The large, and increasing, number of chemical compounds poses challenges to the exploration of such datasets. In this work, we propose the usage of recommender systems to identify compounds of interest to scientific researchers. Our approach consists of a hybrid recommender model suitable for implicit feedback datasets and focused on retrieving a ranked list according to the relevance of the items. The model integrates collaborative-filtering algorithms for implicit feedback (Alternating Least Squares and Bayesian Personalized Ranking) and a new content-based algorithm, using the semantic similarity between the chemical compounds in the ChEBI ontology. The algorithms were assessed on an implicit dataset of chemical compounds, CheRM-20, with more than 16.000 items (chemical compounds). The hybrid model was able to improve the results of the collaborative-filtering algorithms, by more than ten percentage points in most of the assessed evaluation metrics.

Entities:  

Keywords:  Chemical compound; Ontology; Recommender system; Semantic similarity

Year:  2021        PMID: 33622374      PMCID: PMC7903631          DOI: 10.1186/s13321-021-00495-2

Source DB:  PubMed          Journal:  J Cheminform        ISSN: 1758-2946            Impact factor:   5.514


  13 in total

1.  Exploiting personalized information for reagent selection in drug design.

Authors:  Jonas Boström; Niklas Falk; Christian Tyrchan
Journal:  Drug Discov Today       Date:  2011-01-22       Impact factor: 7.851

2.  Identification of potent orally active factor Xa inhibitors based on conjugation strategy and application of predictable fragment recommender system.

Authors:  Tsukasa Ishihara; Yuji Koga; Yoshiyuki Iwatsuki; Fukushi Hirayama
Journal:  Bioorg Med Chem       Date:  2014-12-05       Impact factor: 3.641

3.  Semantic similarity for automatic classification of chemical compounds.

Authors:  João D Ferreira; Francisco M Couto
Journal:  PLoS Comput Biol       Date:  2010-09-23       Impact factor: 4.475

4.  Enhancement of chemical entity identification in text using semantic similarity validation.

Authors:  Tiago Grego; Francisco M Couto
Journal:  PLoS One       Date:  2013-05-02       Impact factor: 3.240

5.  ChEBI in 2016: Improved services and an expanding collection of metabolites.

Authors:  Janna Hastings; Gareth Owen; Adriano Dekker; Marcus Ennis; Namrata Kale; Venkatesh Muthukrishnan; Steve Turner; Neil Swainston; Pedro Mendes; Christoph Steinbeck
Journal:  Nucleic Acids Res       Date:  2015-10-13       Impact factor: 16.971

6.  Recommender Systems in Antiviral Drug Discovery.

Authors:  Ekaterina A Sosnina; Sergey Sosnin; Anastasia A Nikitina; Ivan Nazarov; Dmitry I Osolodkin; Maxim V Fedorov
Journal:  ACS Omega       Date:  2020-06-21

7.  A new chemoinformatics approach with improved strategies for effective predictions of potential drugs.

Authors:  Ming Hao; Stephen H Bryant; Yanli Wang
Journal:  J Cheminform       Date:  2018-10-11       Impact factor: 5.514

8.  Human Disease Ontology 2018 update: classification, content and workflow expansion.

Authors:  Lynn M Schriml; Elvira Mitraka; James Munro; Becky Tauber; Mike Schor; Lance Nickle; Victor Felix; Linda Jeng; Cynthia Bearer; Richard Lichenstein; Katharine Bisordi; Nicole Campion; Brooke Hyman; David Kurland; Connor Patrick Oates; Siobhan Kibbey; Poorna Sreekumar; Chris Le; Michelle Giglio; Carol Greene
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

9.  Linking chemical and disease entities to ontologies by integrating PageRank with extracted relations from literature.

Authors:  Pedro Ruas; Andre Lamurias; Francisco M Couto
Journal:  J Cheminform       Date:  2020-09-21       Impact factor: 5.514

10.  The Gene Ontology Resource: 20 years and still GOing strong.

Authors: 
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  1 in total

1.  SeEn: Sequential enriched datasets for sequence-aware recommendations.

Authors:  Marcia Barros; André Moitinho; Francisco M Couto
Journal:  Sci Data       Date:  2022-08-04       Impact factor: 8.501

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.