Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Enabling data-limited chemical bioactivity predictions through deep neural network transfer learning.

Literature DB >> 36272041

Enabling data-limited chemical bioactivity predictions through deep neural network transfer learning.

Ruifeng Liu^1,2, Srinivas Laxminarayan^1,2, Jaques Reifman¹, Anders Wallqvist³.

Abstract

The main limitation in developing deep neural network (DNN) models to predict bioactivity properties of chemicals is the lack of sufficient assay data to train the network's classification layers. Focusing on feedforward DNNs that use atom- and bond-based structural fingerprints as input, we examined whether layers of a fully trained DNN based on large amounts of data to predict one property could be used to develop DNNs to predict other related or unrelated properties based on limited amounts of data. Hence, we assessed if and under what conditions the dense layers of a pre-trained DNN could be transferred and used for the development of another DNN associated with limited training data. We carried out a quantitative study employing more than 400 pairs of assay datasets, where we used fully trained layers from a large dataset to augment the training of a small dataset. We found that the higher the correlation r between two assay datasets, the more efficient the transfer learning is in reducing prediction errors associated with the smaller dataset DNN predictions. The reduction in mean squared prediction errors ranged from 10 to 20% for every 0.1 increase in r2 between the datasets, with the bulk of the error reductions associated with transfers of the first dense layer. Transfer of other dense layers did not result in additional benefits, suggesting that deeper, dense layers conveyed more specialized and assay-specific information. Importantly, depending on the dataset correlation, training sample size could be reduced by up to tenfold without any loss of prediction accuracy.

Entities: Chemical

Keywords: Deep neural networks; Machine learning; QSAR; Transfer learning

Year: 2022 PMID： 36272041 DOI： 10.1007/s10822-022-00486-x

Source DB: PubMed Journal: J Comput Aided Mol Des ISSN： 0920-654X Impact factor: 4.179

Keyword Cloud
References

12 in total

Review 1. Connecting chemistry and biology through molecular descriptors.

Authors: Adrià Fernández-Torras; Arnau Comajuncosa-Creus; Miquel Duran-Frigola; Patrick Aloy
Journal: Curr Opin Chem Biol Date: 2021-10-06 Impact factor: 8.822

2. Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces.

Authors: Fabio Broccatelli; Richard Trager; Michael Reutlinger; George Karypis; Mufei Li
Journal: Mol Inform Date: 2022-02-23 Impact factor: 4.050

3. QSAR-Based Virtual Screening: Advances and Applications in Drug Discovery.

Authors: Bruno J Neves; Rodolpho C Braga; Cleber C Melo-Filho; José Teófilo Moreira-Filho; Eugene N Muratov; Carolina Horta Andrade
Journal: Front Pharmacol Date: 2018-11-13 Impact factor: 5.810

Review 4. Best Practices for QSAR Model Reporting: Physical and Chemical Properties, Ecotoxicity, Environmental Fate, Human Health, and Toxicokinetics Endpoints.

Authors: Geven Piir; Iiris Kahn; Alfonso T García-Sosa; Sulev Sild; Priit Ahte; Uko Maran
Journal: Environ Health Perspect Date: 2018-12 Impact factor: 9.031

Review 5. A Survey of Multi-task Learning Methods in Chemoinformatics.

Authors: Sergey Sosnin; Mariia Vashurina; Michael Withnall; Pavel Karpov; Maxim Fedorov; Igor V Tetko
Journal: Mol Inform Date: 2018-11-28 Impact factor: 3.353

6. Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models.

Authors: Dejun Jiang; Zhenxing Wu; Chang-Yu Hsieh; Guangyong Chen; Ben Liao; Zhe Wang; Chao Shen; Dongsheng Cao; Jian Wu; Tingjun Hou
Journal: J Cheminform Date: 2021-02-17 Impact factor: 5.514

Review 7. An Introductory Review of Deep Learning for Prediction Models With Big Data.

Authors: Frank Emmert-Streib; Zhen Yang; Han Feng; Shailesh Tripathi; Matthias Dehmer
Journal: Front Artif Intell Date: 2020-02-28

8. CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery.

Authors: Yaqin Li; Yongjin Xu; Yi Yu
Journal: Molecules Date: 2021-11-30 Impact factor: 4.411

Review 9. Opportunities and obstacles for deep learning in biology and medicine.

Authors: Travers Ching; Daniel S Himmelstein; Brett K Beaulieu-Jones; Alexandr A Kalinin; Brian T Do; Gregory P Way; Enrico Ferrero; Paul-Michael Agapow; Michael Zietz; Michael M Hoffman; Wei Xie; Gail L Rosen; Benjamin J Lengerich; Johnny Israeli; Jack Lanchantin; Stephen Woloszynek; Anne E Carpenter; Avanti Shrikumar; Jinbo Xu; Evan M Cofer; Christopher A Lavender; Srinivas C Turaga; Amr M Alexandari; Zhiyong Lu; David J Harris; Dave DeCaprio; Yanjun Qi; Anshul Kundaje; Yifan Peng; Laura K Wiley; Marwin H S Segler; Simina M Boca; S Joshua Swamidass; Austin Huang; Anthony Gitter; Casey S Greene
Journal: J R Soc Interface Date: 2018-04 Impact factor: 4.293