| Literature DB >> 32701288 |
Jack Scantlebury1, Nathan Brown2, Frank Von Delft3,4,5, Charlotte M Deane1.
Abstract
Current deep learning methods for structure-based virtual screening take the structures of both the protein and the ligand as input but make little or no use of the protein structure when predicting ligand binding. Here, we show how a relatively simple method of data set augmentation forces such deep learning methods to take into account information from the protein. Models trained in this way are more generalizable (make better predictions on protein/ligand complexes from a different distribution to the training data). They also assign more meaningful importance to the protein and ligand atoms involved in binding. Overall, our results show that data set augmentation can help deep learning-based virtual screening to learn physical interactions rather than data set biases.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32701288 PMCID: PMC7611237 DOI: 10.1021/acs.jcim.0c00263
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 4.956