Literature DB >> 36272041

Enabling data-limited chemical bioactivity predictions through deep neural network transfer learning.

Ruifeng Liu1,2, Srinivas Laxminarayan1,2, Jaques Reifman1, Anders Wallqvist3.   

Abstract

The main limitation in developing deep neural network (DNN) models to predict bioactivity properties of chemicals is the lack of sufficient assay data to train the network's classification layers. Focusing on feedforward DNNs that use atom- and bond-based structural fingerprints as input, we examined whether layers of a fully trained DNN based on large amounts of data to predict one property could be used to develop DNNs to predict other related or unrelated properties based on limited amounts of data. Hence, we assessed if and under what conditions the dense layers of a pre-trained DNN could be transferred and used for the development of another DNN associated with limited training data. We carried out a quantitative study employing more than 400 pairs of assay datasets, where we used fully trained layers from a large dataset to augment the training of a small dataset. We found that the higher the correlation r between two assay datasets, the more efficient the transfer learning is in reducing prediction errors associated with the smaller dataset DNN predictions. The reduction in mean squared prediction errors ranged from 10 to 20% for every 0.1 increase in r2 between the datasets, with the bulk of the error reductions associated with transfers of the first dense layer. Transfer of other dense layers did not result in additional benefits, suggesting that deeper, dense layers conveyed more specialized and assay-specific information. Importantly, depending on the dataset correlation, training sample size could be reduced by up to tenfold without any loss of prediction accuracy.
© 2022. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.

Entities:  

Keywords:  Deep neural networks; Machine learning; QSAR; Transfer learning

Year:  2022        PMID: 36272041     DOI: 10.1007/s10822-022-00486-x

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   4.179


  12 in total

Review 1.  Connecting chemistry and biology through molecular descriptors.

Authors:  Adrià Fernández-Torras; Arnau Comajuncosa-Creus; Miquel Duran-Frigola; Patrick Aloy
Journal:  Curr Opin Chem Biol       Date:  2021-10-06       Impact factor: 8.822

2.  Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces.

Authors:  Fabio Broccatelli; Richard Trager; Michael Reutlinger; George Karypis; Mufei Li
Journal:  Mol Inform       Date:  2022-02-23       Impact factor: 4.050

3.  QSAR-Based Virtual Screening: Advances and Applications in Drug Discovery.

Authors:  Bruno J Neves; Rodolpho C Braga; Cleber C Melo-Filho; José Teófilo Moreira-Filho; Eugene N Muratov; Carolina Horta Andrade
Journal:  Front Pharmacol       Date:  2018-11-13       Impact factor: 5.810

Review 4.  Best Practices for QSAR Model Reporting: Physical and Chemical Properties, Ecotoxicity, Environmental Fate, Human Health, and Toxicokinetics Endpoints.

Authors:  Geven Piir; Iiris Kahn; Alfonso T García-Sosa; Sulev Sild; Priit Ahte; Uko Maran
Journal:  Environ Health Perspect       Date:  2018-12       Impact factor: 9.031

Review 5.  A Survey of Multi-task Learning Methods in Chemoinformatics.

Authors:  Sergey Sosnin; Mariia Vashurina; Michael Withnall; Pavel Karpov; Maxim Fedorov; Igor V Tetko
Journal:  Mol Inform       Date:  2018-11-28       Impact factor: 3.353

6.  Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models.

Authors:  Dejun Jiang; Zhenxing Wu; Chang-Yu Hsieh; Guangyong Chen; Ben Liao; Zhe Wang; Chao Shen; Dongsheng Cao; Jian Wu; Tingjun Hou
Journal:  J Cheminform       Date:  2021-02-17       Impact factor: 5.514

Review 7.  An Introductory Review of Deep Learning for Prediction Models With Big Data.

Authors:  Frank Emmert-Streib; Zhen Yang; Han Feng; Shailesh Tripathi; Matthias Dehmer
Journal:  Front Artif Intell       Date:  2020-02-28

8.  CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery.

Authors:  Yaqin Li; Yongjin Xu; Yi Yu
Journal:  Molecules       Date:  2021-11-30       Impact factor: 4.411

Review 9.  Opportunities and obstacles for deep learning in biology and medicine.

Authors:  Travers Ching; Daniel S Himmelstein; Brett K Beaulieu-Jones; Alexandr A Kalinin; Brian T Do; Gregory P Way; Enrico Ferrero; Paul-Michael Agapow; Michael Zietz; Michael M Hoffman; Wei Xie; Gail L Rosen; Benjamin J Lengerich; Johnny Israeli; Jack Lanchantin; Stephen Woloszynek; Anne E Carpenter; Avanti Shrikumar; Jinbo Xu; Evan M Cofer; Christopher A Lavender; Srinivas C Turaga; Amr M Alexandari; Zhiyong Lu; David J Harris; Dave DeCaprio; Yanjun Qi; Anshul Kundaje; Yifan Peng; Laura K Wiley; Marwin H S Segler; Simina M Boca; S Joshua Swamidass; Austin Huang; Anthony Gitter; Casey S Greene
Journal:  J R Soc Interface       Date:  2018-04       Impact factor: 4.293

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.