| Literature DB >> 31507420 |
Dingyan Wang1,2, Chen Cui1,2, Xiaoyu Ding1,2, Zhaoping Xiong3, Mingyue Zheng1, Xiaomin Luo1, Hualiang Jiang1,3, Kaixian Chen1,3.
Abstract
Scoring functions play an important role in structure-based virtual screening. It has been widely accepted that target-specific scoring functions (TSSFs) may achieve better performance compared with universal scoring functions in actual drug research and development processes. A method that can effectively construct TSSFs will be of great value to drug design and discovery. In this work, we proposed a deep learning-based model named DeepScore to achieve this goal. DeepScore adopted the form of PMF scoring function to calculate protein-ligand binding affinity. However, different from PMF scoring function, in DeepScore, the score for each protein-ligand atom pair was calculated using a feedforward neural network. Our model significantly outperformed Glide Gscore on validation data set DUD-E. The average ROC-AUC on 102 targets was 0.98. We also combined Gscore and DeepScore together using a consensus method and put forward a consensus model named DeepScoreCS. The comparison results showed that DeepScore outperformed other machine learning-based TSSFs building methods. Furthermore, we presented a strategy to visualize the prediction of DeepScore. All of these results clearly demonstrated that DeepScore would be a useful model in constructing TSSFs and represented a novel way incorporating deep learning and drug design.Entities:
Keywords: DUD-E; deep learning; drug discovery; target-specific scoring function; virtual screening
Year: 2019 PMID: 31507420 PMCID: PMC6713720 DOI: 10.3389/fphar.2019.00924
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Atom features used in DeepScore.
| Atom feature name | Feature length | Features |
|---|---|---|
| Type | 9 | B, C, N, O, P, S, Se, halogen, and metal |
| Hybridization | 4 | 1, 2, 3, other |
| Heavy valencea | 4 | 1, 2, 3, other |
| Hetero valenceb | 5 | 0, 1, 2, 3, other |
| Partial charge | 1 | Value |
| Hydrophobic | 1 | 1 (True) or 0 (false) |
| Aromatic | 1 | 1 (True) or 0 (false) |
| Hydrogen-bond donor | 1 | 1 (True) or 0 (false) |
| Hydrogen-bond acceptor | 1 | 1 (True) or 0 (false) |
| Ring | 1 | 1 (True) or 0 (false) |
aThe number of bonds with other heavy atoms.
bThe number of bonds with other heteroatoms.
Figure 1Workflow of DeepScore model construction.
Figure 2BEDROC scores (α=80.5) on 102 targets of our screening results versus the results from benchmark (Chaput et al., 2016). Each dot represents a target.
Figure 3ROC-AUC (upper panel) and PRC-AUC (lower panel) of cross validation performance on each target. Targets are sorted by the performance of Gscore.
Average performance of Gscore, DeepScore, and DeepScoreCS on DUD-E data set.
| Gscore | DeepScore | DeepScoreCS | |||
|---|---|---|---|---|---|
| Value | Value | Better-1a | Value | Better-2b | |
| ROC-AUC | 0.817 | 0.979 | 102 | 0.978 | 49 |
| PRC-AUC | 0.317 | 0.796 | 100 | 0.814 | |
| EF0.5% | 30.625 | 55.275 | 94 | 57.149 | |
| EF1% | 24.335 | 52.218 | 98 | 53.658 | |
| EF2% | 17.203 | 39.716 | 100 | 40.075 | |
| EF5% | 9.122 | 18.200 | 102 | 18.200 | |
| EF10% | 5.573 | 9.472 | 101 | 9.448 | |
| ROC-EF0.5% | 51.522 | 148.948 | 100 | 151.986 | |
| ROC-EF1% | 31.239 | 81.614 | 102 | 82.164 | |
| ROC-EF2% | 18.689 | 43.320 | 102 | 43.498 | |
| ROC-EF5% | 9.423 | 18.417 | 101 | 18.365 | |
| ROC-EF10% | 5.680 | 9.500 | 101 | 9.484 | |
a Better-1 column refers to the number of targets where DeepScore outperforms Gscore.
b Better-2 column refers to the number of targets where DeepScoreCS outperforms DeepScore.
Figure 4The binding site of FPPS (PDB ID 1zw5). (A) Crystal structure ligand. (B) Superposition of all the docking poses of actives.
Figure 5The improvement of PRC-AUC on each target using consensus method. Each point represents a target. Y-axis represents the value of PRC-AUC of DeepScoreCS minus that of DeepScore. Blue dot means that the improvement is positive while red means negative (the performance became worse through consensus method). X-axis represents the mean value of the coefficient c DeepScoreCS used.
Performance comparison between PLEIC-SVM and DeepScore.
| PLEIC-SVM | DeepScore | |
|---|---|---|
| ROC-AUC | 0.93 |
|
| ROC0.5%a | 0.58 |
|
| ROC1%b | 0.64 |
|
| ROC2%c | 0.69 |
|
| ROC5%d | 0.77 |
|
Performance values of PLEIC-SVM are collected from (Yan et al., 2017). Better results are highlighted in bold.
a ROC0.5% = ROC-EF0.5% / 200.
b ROC1% = ROC-EF1% / 100.
c ROC2% = ROC-EF2% / 50.
d ROC5% = ROC-EF5% / 20.
Figure 6The performance of PLEIC-SVM and DeepScore. Targets are sorted by the performance of PLEIC-SVM.
Performance comparison between RF-Score and DeepScore.
| Model name | ROC-AUC | EF1% | EF2% | EF5% | EF10% |
|---|---|---|---|---|---|
| DeepScore |
|
|
|
|
|
| AV-RF-V1 | 0.82 | 29.69 | 21.07 | 11.74 | 7.1 |
| AV-RF-V2 | 0.84 | 34.75 | 24.37 | 12.99 | 7.55 |
| AV-RF-V3 | 0.84 | 32.72 | 23.04 | 12.6 | 7.47 |
| D3.6-RF-V1 | 0.84 | 36.28 | 25.3 | 13.3 | 7.71 |
| D3.6-RF-V2 | 0.87 | 43.43 | 29.72 | 14.76 | 8.25 |
| D3.6-RF-V3 | 0.87 | 41.1 | 28.27 | 14.61 | 8.2 |
| D6.6-RF-V1 | 0.77 | 27.42 | 18.65 | 10.37 | 6.42 |
| D6.6-RF-V2 | 0.80 | 34.3 | 22.07 | 11.73 | 6.96 |
| D6.6-RF-V3 | 0.79 | 32.05 | 21.56 | 11.47 | 6.88 |
Performance values of RF-Score are collected from the Supporting Information of (Wójcikowski et al., 2017). Best results are highlighted in bold.
Figure 7Binding mode analysis of CHEMBL418564 with AA2AR receptor (DeepScore =1.875, PDB ID 3eml). A to D refer to the four different parts in pharmacophore model of AA2AR antagonists. A, hydrogen-bond donor. B, N-containing aromatic ring. C, large lipophilic region. D, smaller lipophilic region.
Figure 8Binding mode analysis of CHEMBL363077 with CDK2 receptor (DeepScore = 1.805, PDB ID 1h00).
Figure 9Binding mode analysis of CHEMBL56306 with ESR1 receptor (DeepScore = 7.411, PDB ID 1sj0).
Figure 10Binding mode analysis of CHEMBL378637 with DPP4 receptor (DeepScore = 3.549, PDB ID 2i78).