Literature DB >> 28872869

Demystifying Multitask Deep Neural Networks for Quantitative Structure-Activity Relationships.

Yuting Xu1, Junshui Ma1, Andy Liaw1, Robert P Sheridan2, Vladimir Svetnik1.   

Abstract

Deep neural networks (DNNs) are complex computational models that have found great success in many artificial intelligence applications, such as computer vision1,2 and natural language processing.3,4 In the past four years, DNNs have also generated promising results for quantitative structure-activity relationship (QSAR) tasks.5,6 Previous work showed that DNNs can routinely make better predictions than traditional methods, such as random forests, on a diverse collection of QSAR data sets. It was also found that multitask DNN models-those trained on and predicting multiple QSAR properties simultaneously-outperform DNNs trained separately on the individual data sets in many, but not all, tasks. To date there has been no satisfactory explanation of why the QSAR of one task embedded in a multitask DNN can borrow information from other unrelated QSAR tasks. Thus, using multitask DNNs in a way that consistently provides a predictive advantage becomes a challenge. In this work, we explored why multitask DNNs make a difference in predictive performance. Our results show that during prediction a multitask DNN does borrow "signal" from molecules with similar structures in the training sets of the other tasks. However, whether this borrowing leads to better or worse predictive performance depends on whether the activities are correlated. On the basis of this, we have developed a strategy to use multitask DNNs that incorporate prior domain knowledge to select training sets with correlated activities, and we demonstrate its effectiveness on several examples.

Mesh:

Substances:

Year:  2017        PMID: 28872869     DOI: 10.1021/acs.jcim.7b00087

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  33 in total

Review 1.  QSAR without borders.

Authors:  Eugene N Muratov; Jürgen Bajorath; Robert P Sheridan; Igor V Tetko; Dmitry Filimonov; Vladimir Poroikov; Tudor I Oprea; Igor I Baskin; Alexandre Varnek; Adrian Roitberg; Olexandr Isayev; Stefano Curtarolo; Denis Fourches; Yoram Cohen; Alan Aspuru-Guzik; David A Winkler; Dimitris Agrafiotis; Artem Cherkasov; Alexander Tropsha
Journal:  Chem Soc Rev       Date:  2020-05-01       Impact factor: 54.564

2.  Multi-task generative topographic mapping in virtual screening.

Authors:  Arkadii Lin; Dragos Horvath; Gilles Marcou; Bernd Beck; Alexandre Varnek
Journal:  J Comput Aided Mol Des       Date:  2019-02-09       Impact factor: 3.686

3.  Opportunities and challenges using artificial intelligence in ADME/Tox.

Authors:  Barun Bhhatarai; W Patrick Walters; Cornelis E C A Hop; Guido Lanza; Sean Ekins
Journal:  Nat Mater       Date:  2019-05       Impact factor: 43.841

4.  Developing a Kinase-Specific Target Selection Method Using a Structure-Based Machine Learning Approach.

Authors:  Arina Afanasyeva; Chioko Nagao; Kenji Mizuguchi
Journal:  Adv Appl Bioinform Chem       Date:  2020-12-02

5.  Introduction to the BioChemical Library (BCL): An Application-Based Open-Source Toolkit for Integrated Cheminformatics and Machine Learning in Computer-Aided Drug Discovery.

Authors:  Benjamin P Brown; Oanh Vu; Alexander R Geanes; Sandeepkumar Kothiwale; Mariusz Butkiewicz; Edward W Lowe; Ralf Mueller; Richard Pape; Jeffrey Mendenhall; Jens Meiler
Journal:  Front Pharmacol       Date:  2022-02-21       Impact factor: 5.810

6.  A multitask GNN-based interpretable model for discovery of selective JAK inhibitors.

Authors:  Yimeng Wang; Yaxin Gu; Chaofeng Lou; Yuning Gong; Zengrui Wu; Weihua Li; Yun Tang; Guixia Liu
Journal:  J Cheminform       Date:  2022-03-15       Impact factor: 5.514

Review 7.  In silico toxicology: From structure-activity relationships towards deep learning and adverse outcome pathways.

Authors:  Jennifer Hemmerich; Gerhard F Ecker
Journal:  Wiley Interdiscip Rev Comput Mol Sci       Date:  2020-03-31

Review 8.  Applications of artificial intelligence to drug design and discovery in the big data era: a comprehensive review.

Authors:  Neetu Tripathi; Manoj Kumar Goshisht; Sanat Kumar Sahu; Charu Arora
Journal:  Mol Divers       Date:  2021-06-10       Impact factor: 2.943

9.  Large-Scale Modeling of Multispecies Acute Toxicity End Points Using Consensus of Multitask Deep Learning Methods.

Authors:  Sankalp Jain; Vishal B Siramshetty; Vinicius M Alves; Eugene N Muratov; Nicole Kleinstreuer; Alexander Tropsha; Marc C Nicklaus; Anton Simeonov; Alexey V Zakharov
Journal:  J Chem Inf Model       Date:  2021-02-03       Impact factor: 4.956

10.  Effect of missing data on multitask prediction methods.

Authors:  Antonio de la Vega de León; Beining Chen; Valerie J Gillet
Journal:  J Cheminform       Date:  2018-05-22       Impact factor: 5.514

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.