Zichen Wang1, Neil R Clark1, Avi Ma'ayan1. 1. Department of Pharmacology and Systems Therapeutics, One Gustave L. Levy Place Box 1215, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
Abstract
MOTIVATION: Adverse drug reactions (ADRs) are a central consideration during drug development. Here we present a machine learning classifier to prioritize ADRs for approved drugs and pre-clinical small-molecule compounds by combining chemical structure (CS) and gene expression (GE) features. The GE data is from the Library of Integrated Network-based Cellular Signatures (LINCS) L1000 dataset that measured changes in GE before and after treatment of human cells with over 20 000 small-molecule compounds including most of the FDA-approved drugs. Using various benchmarking methods, we show that the integration of GE data with the CS of the drugs can significantly improve the predictability of ADRs. Moreover, transforming GE features to enrichment vectors of biological terms further improves the predictive capability of the classifiers. The most predictive biological-term features can assist in understanding the drug mechanisms of action. Finally, we applied the classifier to all >20 000 small-molecules profiled, and developed a web portal for browsing and searching predictive small-molecule/ADR connections. AVAILABILITY AND IMPLEMENTATION: The interface for the adverse event predictions for the >20 000 LINCS compounds is available at http://maayanlab.net/SEP-L1000/ CONTACT: avi.maayan@mssm.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Adverse drug reactions (ADRs) are a central consideration during drug development. Here we present a machine learning classifier to prioritize ADRs for approved drugs and pre-clinical small-molecule compounds by combining chemical structure (CS) and gene expression (GE) features. The GE data is from the Library of Integrated Network-based Cellular Signatures (LINCS) L1000 dataset that measured changes in GE before and after treatment of human cells with over 20 000 small-molecule compounds including most of the FDA-approved drugs. Using various benchmarking methods, we show that the integration of GE data with the CS of the drugs can significantly improve the predictability of ADRs. Moreover, transforming GE features to enrichment vectors of biological terms further improves the predictive capability of the classifiers. The most predictive biological-term features can assist in understanding the drug mechanisms of action. Finally, we applied the classifier to all >20 000 small-molecules profiled, and developed a web portal for browsing and searching predictive small-molecule/ADR connections. AVAILABILITY AND IMPLEMENTATION: The interface for the adverse event predictions for the >20 000 LINCS compounds is available at http://maayanlab.net/SEP-L1000/ CONTACT: avi.maayan@mssm.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Noel M O'Boyle; Michael Banck; Craig A James; Chris Morley; Tim Vandermeersch; Geoffrey R Hutchison Journal: J Cheminform Date: 2011-10-07 Impact factor: 5.514
Authors: Vivian Law; Craig Knox; Yannick Djoumbou; Tim Jewison; An Chi Guo; Yifeng Liu; Adam Maciejewski; David Arndt; Michael Wilson; Vanessa Neveu; Alexandra Tang; Geraldine Gabriel; Carol Ly; Sakina Adamjee; Zerihun T Dame; Beomsoo Han; You Zhou; David S Wishart Journal: Nucleic Acids Res Date: 2013-11-06 Impact factor: 16.971
Authors: Edward Y Chen; Christopher M Tan; Yan Kou; Qiaonan Duan; Zichen Wang; Gabriela Vaz Meirelles; Neil R Clark; Avi Ma'ayan Journal: BMC Bioinformatics Date: 2013-04-15 Impact factor: 3.169
Authors: Alexandra B Keenan; Sherry L Jenkins; Kathleen M Jagodnik; Simon Koplev; Edward He; Denis Torre; Zichen Wang; Anders B Dohlman; Moshe C Silverstein; Alexander Lachmann; Maxim V Kuleshov; Avi Ma'ayan; Vasileios Stathias; Raymond Terryn; Daniel Cooper; Michele Forlin; Amar Koleti; Dusica Vidovic; Caty Chung; Stephan C Schürer; Jouzas Vasiliauskas; Marcin Pilarczyk; Behrouz Shamsaei; Mehdi Fazel; Yan Ren; Wen Niu; Nicholas A Clark; Shana White; Naim Mahi; Lixia Zhang; Michal Kouril; John F Reichard; Siva Sivaganesan; Mario Medvedovic; Jaroslaw Meller; Rick J Koch; Marc R Birtwistle; Ravi Iyengar; Eric A Sobie; Evren U Azeloglu; Julia Kaye; Jeannette Osterloh; Kelly Haston; Jaslin Kalra; Steve Finkbiener; Jonathan Li; Pamela Milani; Miriam Adam; Renan Escalante-Chong; Karen Sachs; Alex Lenail; Divya Ramamoorthy; Ernest Fraenkel; Gavin Daigle; Uzma Hussain; Alyssa Coye; Jeffrey Rothstein; Dhruv Sareen; Loren Ornelas; Maria Banuelos; Berhan Mandefro; Ritchie Ho; Clive N Svendsen; Ryan G Lim; Jennifer Stocksdale; Malcolm S Casale; Terri G Thompson; Jie Wu; Leslie M Thompson; Victoria Dardov; Vidya Venkatraman; Andrea Matlock; Jennifer E Van Eyk; Jacob D Jaffe; Malvina Papanastasiou; Aravind Subramanian; Todd R Golub; Sean D Erickson; Mohammad Fallahi-Sichani; Marc Hafner; Nathanael S Gray; Jia-Ren Lin; Caitlin E Mills; Jeremy L Muhlich; Mario Niepel; Caroline E Shamu; Elizabeth H Williams; David Wrobel; Peter K Sorger; Laura M Heiser; Joe W Gray; James E Korkola; Gordon B Mills; Mark LaBarge; Heidi S Feiler; Mark A Dane; Elmar Bucher; Michel Nederlof; Damir Sudar; Sean Gross; David F Kilburn; Rebecca Smith; Kaylyn Devlin; Ron Margolis; Leslie Derr; Albert Lee; Ajay Pillai Journal: Cell Syst Date: 2017-11-29 Impact factor: 10.304
Authors: Matthew B A McDermott; Jennifer Wang; Wen-Ning Zhao; Steven D Sheridan; Peter Szolovits; Isaac Kohane; Stephen J Haggarty; Roy H Perlis Journal: IEEE/ACM Trans Comput Biol Bioinform Date: 2020-12-08 Impact factor: 3.710
Authors: Lu Zhang; Zichen Wang; Ruijie Liu; Zhengzhe Li; Jennifer Lin; Megan L Wojciechowicz; Jiyi Huang; Kyung Lee; Avi Ma'ayan; John Cijiang He Journal: Diabetes Date: 2020-10-16 Impact factor: 9.461