Joaquim Aguirre-Plans1, Janet Piñero1, Terezinha Souza2, Giulia Callegaro3, Steven J Kunnen3, Ferran Sanz1, Narcis Fernandez-Fuentes4,5, Laura I Furlong1, Emre Guney1, Baldo Oliva6. 1. Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), DCEXS, Pompeu Fabra University (UPF), Barcelona, Spain. 2. Department of Toxicogenomics, Maastricht University, Maastricht, The Netherlands. 3. Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands. 4. Department of Biosciences, U Science Tech, Universitat de Vic-Universitat Central de Catalunya, Vic, Spain. 5. Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, UK. 6. Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), DCEXS, Pompeu Fabra University (UPF), Barcelona, Spain. baldo.oliva@upf.edu.
Abstract
BACKGROUND: Drug-induced liver injury (DILI) is an adverse reaction caused by the intake of drugs of common use that produces liver damage. The impact of DILI is estimated to affect around 20 in 100,000 inhabitants worldwide each year. Despite being one of the main causes of liver failure, the pathophysiology and mechanisms of DILI are poorly understood. In the present study, we developed an ensemble learning approach based on different features (CMap gene expression, chemical structures, drug targets) to predict drugs that might cause DILI and gain a better understanding of the mechanisms linked to the adverse reaction. RESULTS: We searched for gene signatures in CMap gene expression data by using two approaches: phenotype-gene associations data from DisGeNET, and a non-parametric test comparing gene expression of DILI-Concern and No-DILI-Concern drugs (as per DILIrank definitions). The average accuracy of the classifiers in both approaches was 69%. We used chemical structures as features, obtaining an accuracy of 65%. The combination of both types of features produced an accuracy around 63%, but improved the independent hold-out test up to 67%. The use of drug-target associations as feature obtained the best accuracy (70%) in the independent hold-out test. CONCLUSIONS: When using CMap gene expression data, searching for a specific gene signature among the landmark genes improves the quality of the classifiers, but it is still limited by the intrinsic noise of the dataset. When using chemical structures as a feature, the structural diversity of the known DILI-causing drugs hampers the prediction, which is a similar problem as for the use of gene expression information. The combination of both features did not improve the quality of the classifiers but increased the robustness as shown on independent hold-out tests. The use of drug-target associations as feature improved the prediction, specially the specificity, and the results were comparable to previous research studies.
BACKGROUND: Drug-induced liver injury (DILI) is an adverse reaction caused by the intake of drugs of common use that produces liver damage. The impact of DILI is estimated to affect around 20 in 100,000 inhabitants worldwide each year. Despite being one of the main causes of liver failure, the pathophysiology and mechanisms of DILI are poorly understood. In the present study, we developed an ensemble learning approach based on different features (CMap gene expression, chemical structures, drug targets) to predict drugs that might cause DILI and gain a better understanding of the mechanisms linked to the adverse reaction. RESULTS: We searched for gene signatures in CMap gene expression data by using two approaches: phenotype-gene associations data from DisGeNET, and a non-parametric test comparing gene expression of DILI-Concern and No-DILI-Concern drugs (as per DILIrank definitions). The average accuracy of the classifiers in both approaches was 69%. We used chemical structures as features, obtaining an accuracy of 65%. The combination of both types of features produced an accuracy around 63%, but improved the independent hold-out test up to 67%. The use of drug-target associations as feature obtained the best accuracy (70%) in the independent hold-out test. CONCLUSIONS: When using CMap gene expression data, searching for a specific gene signature among the landmark genes improves the quality of the classifiers, but it is still limited by the intrinsic noise of the dataset. When using chemical structures as a feature, the structural diversity of the known DILI-causing drugs hampers the prediction, which is a similar problem as for the use of gene expression information. The combination of both features did not improve the quality of the classifiers but increased the robustness as shown on independent hold-out tests. The use of drug-target associations as feature improved the prediction, specially the specificity, and the results were comparable to previous research studies.
Entities:
Keywords:
CAMDA; Cmap; Drug safety; Drug structure; Drug-induced liver injury; Hepatotoxicity; Machine learning; Systems biology
Authors: Jörg Menche; Amitabh Sharma; Maksim Kitsak; Susan Dina Ghiassian; Marc Vidal; Joseph Loscalzo; Albert-László Barabási Journal: Science Date: 2015-02-20 Impact factor: 47.728
Authors: Michael J Keiser; Bryan L Roth; Blaine N Armbruster; Paul Ernsberger; John J Irwin; Brian K Shoichet Journal: Nat Biotechnol Date: 2007-02 Impact factor: 54.908
Authors: Gerd A Kullak-Ublick; Raul J Andrade; Michael Merz; Peter End; Andreas Benesic; Alexander L Gerbes; Guruprasad P Aithal Journal: Gut Date: 2017-03-23 Impact factor: 23.059
Authors: Kelsy C Cotto; Alex H Wagner; Yang-Yang Feng; Susanna Kiwala; Adam C Coffman; Gregory Spies; Alex Wollam; Nicholas C Spies; Obi L Griffith; Malachi Griffith Journal: Nucleic Acids Res Date: 2018-01-04 Impact factor: 16.971
Authors: François Pognan; Thomas Steger-Hartmann; Carlos Díaz; Niklas Blomberg; Frank Bringezu; Katharine Briggs; Giulia Callegaro; Salvador Capella-Gutierrez; Emilio Centeno; Javier Corvi; Philip Drew; William C Drewe; José M Fernández; Laura I Furlong; Emre Guney; Jan A Kors; Miguel Angel Mayer; Manuel Pastor; Janet Piñero; Juan Manuel Ramírez-Anguita; Francesco Ronzano; Philip Rowell; Josep Saüch-Pitarch; Alfonso Valencia; Bob van de Water; Johan van der Lei; Erik van Mulligen; Ferran Sanz Journal: Pharmaceuticals (Basel) Date: 2021-03-08