| Literature DB >> 29757353 |
Marta M Stepniewska-Dziubinska1, Piotr Zielenkiewicz1,2, Pawel Siedlecki1,2.
Abstract
Motivation: Structure based ligand discovery is one of the most successful approaches for augmenting the drug discovery process. Currently, there is a notable shift towards machine learning (ML) methodologies to aid such procedures. Deep learning has recently gained considerable attention as it allows the model to 'learn' to extract features that are relevant for the task at hand.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29757353 PMCID: PMC6198856 DOI: 10.1093/bioinformatics/bty374
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Pafnucy’s architecture. The molecular complex is represented with a 4D tensor, processed by threee convolutional layers and three dense (fully-connected) layers to predict the binding affinity
Pafnucy’s performance
| Dataset | ||||
|---|---|---|---|---|
| Test (v. 2016 core set) | 1.42 | 1.13 | 1.37 | 0.78 |
| Validation | 1.44 | 1.14 | 1.43 | 0.72 |
| Training | 1.21 | 0.95 | 1.19 | 0.77 |
Note: Prediction accuracy for each subset was evaluated using four different metrics (see main text).
Fig. 2.Predictions for two test sets (core sets from PDBbind v. 2016 and v. 2013), training set and validation set
Results on the CASF-2013 ‘scoring power’ benchmark (PDBbind v. 2013 core set)
| Pafnucy | X-Score | ChemScore | ChemPLP | PLP1 | G-Score | |
|---|---|---|---|---|---|---|
| SD | 1.78 | 1.82 | 1.84 | 1.86 | 1.87 | |
| R | 0.61 | 0.59 | 0.58 | 0.57 | 0.56 |
Note: Only the five best performing scoring functions are presented, for full results see (Li ).
SYBYL.
GOLD.
Discovery Studio.
Predictions accuracy on the Astex Diverse Set
| Method | ||||
|---|---|---|---|---|
| Pafnucy | 1.43 | 1.13 | 1.43 | 0.57 |
| X-Score | 1.55 | 1.22 | 1.48 | 0.52 |
Fig. 3.Range of weights for each input channel (feature). Outliers are not shown
Fig. 4.The most important parts of the input. Regardless of the complex orientation, the same region of the input had the highest impact on the prediction. Note that the second plot is rotated back about the X-axis to ease the comparison. (a) Original orientation. (b) Rotated by about the X-axis. (c) Protein–ligand interactions. Graphic was generated with Poseview (Stierand and Rarey, 2010)
Fig. 5.Activations on the hidden layers for two orientations of the PDE10A complex (PDB ID: 3WS8). Darker colors indicate higher values. Cosine distances (d) between the activation patterns for each layer are provided