Giovanna Maria Dimitri1, Pietro Lió2. 1. Computer Laboratory, University of Cambridge, 15 JJ Thomson Avenue, Cambridge, UK. Electronic address: gmd43@cam.ac.uk. 2. Computer Laboratory, University of Cambridge, 15 JJ Thomson Avenue, Cambridge, UK.
Abstract
BACKGROUND: Identification of underlying mechanisms behind drugs side effects is of extreme interest and importance in drugs discovery today. Therefore machine learning methodology, linking such different multi features aspects and able to make predictions, are crucial for understanding side effects. METHODS: In this paper we present DrugClust, a machine learning algorithm for drugs side effects prediction. DrugClust pipeline works as follows: first drugs are clustered with respect to their features and then side effects predictions are made, according to Bayesian scores. Biological validation of resulting clusters can be done via enrichment analysis, another functionality implemented in the methodology. This last tool is of extreme interest for drug discovery, given that it can be used as a validation of the clusters obtained, as well as for the study of new possible interactions between certain side effects and nontargeted pathways. RESULTS: Results were evaluated on a 5-folds cross validations procedure, and extensive comparisons were made with available datasets in the field: Zhang et al. (2015), Liu et al. (2012) and Mizutani et al. (2012). Results are promising and show better performances in most of the cases with respect to the available literature. AVAILABILITY: DrugClust is an R package freely available at: https://cran.r-project.org/web/packages/DrugClust/index.html.
BACKGROUND: Identification of underlying mechanisms behind drugs side effects is of extreme interest and importance in drugs discovery today. Therefore machine learning methodology, linking such different multi features aspects and able to make predictions, are crucial for understanding side effects. METHODS: In this paper we present DrugClust, a machine learning algorithm for drugs side effects prediction. DrugClust pipeline works as follows: first drugs are clustered with respect to their features and then side effects predictions are made, according to Bayesian scores. Biological validation of resulting clusters can be done via enrichment analysis, another functionality implemented in the methodology. This last tool is of extreme interest for drug discovery, given that it can be used as a validation of the clusters obtained, as well as for the study of new possible interactions between certain side effects and nontargeted pathways. RESULTS: Results were evaluated on a 5-folds cross validations procedure, and extensive comparisons were made with available datasets in the field: Zhang et al. (2015), Liu et al. (2012) and Mizutani et al. (2012). Results are promising and show better performances in most of the cases with respect to the available literature. AVAILABILITY: DrugClust is an R package freely available at: https://cran.r-project.org/web/packages/DrugClust/index.html.
Authors: Rebecca N Jerome; Meghan Morrison Joly; Nan Kennedy; Jana K Shirey-Rice; Dan M Roden; Gordon R Bernard; Kenneth J Holroyd; Joshua C Denny; Jill M Pulley Journal: Drug Saf Date: 2020-06 Impact factor: 5.606
Authors: Ewerton Cristhian Lima de Oliveira; Kauê Santana; Luiz Josino; Anderson Henrique Lima E Lima; Claudomiro de Souza de Sales Júnior Journal: Sci Rep Date: 2021-04-07 Impact factor: 4.379
Authors: Ewerton Cristhian Lima de Oliveira; Kauê Santana da Costa; Paulo Sérgio Taube; Anderson H Lima; Claudomiro de Souza de Sales Junior Journal: Front Cell Infect Microbiol Date: 2022-03-25 Impact factor: 5.293