| Literature DB >> 35292100 |
Junjie Wang1, NaiFeng Wen2, Chunyu Wang3, Lingling Zhao4, Liang Cheng5,6.
Abstract
MOTIVATION: Drug-target binding affinity (DTA) reflects the strength of the drug-target interaction; therefore, predicting the DTA can considerably benefit drug discovery by narrowing the search space and pruning drug-target (DT) pairs with low binding affinity scores. Representation learning using deep neural networks has achieved promising performance compared with traditional machine learning methods; hence, extensive research efforts have been made in learning the feature representation of proteins and compounds. However, such feature representation learning relies on a large-scale labelled dataset, which is not always available.Entities:
Keywords: Deep learning; Drug-target affinity prediction; ELECTRA; Representative learning
Year: 2022 PMID: 35292100 PMCID: PMC8922401 DOI: 10.1186/s13321-022-00591-x
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Fig. 1Overview of ELECTRA-DTA
Statistics of the compound SMILES corpora
| No. of corpus | Average length of the corpus | Minimum length of the corpus | No. of vocabulary |
|---|---|---|---|
| 1114424 | 47 | 3 | 72 |
Statistics of protein sequence corpora
| No. of corpus | Average length of the corpus | Minimum length of the corpus | No. of vocabulary |
|---|---|---|---|
| 1868198 | 382 | 2 | 30 |
Fig. 2The principle of ELECTRA pre-training
Fig. 3Compound SMILES and protein sequence embedding
Fig. 4Architecture of the ELECTRA-DTA model
Fig. 5Structure of the squeeze-and-excitation block
The detailed statistics of the datasets, containing the number of proteins, compounds, interactions and average number of interactions per protein and per compound
| No. Proteins | No. Compounds | No. Interaction | No. interactions per protein | No. interactions per compound | No. inactive interactions | No. non-inactive interactions | |
|---|---|---|---|---|---|---|---|
| Davis | 442 | 68 | 30056 | 68 | 442 | 2502 | 27554 |
| KIBA | 229 | 2111 | 118254 | 516 | 56 | 24828 | 93426 |
| BindingDB | 1620 | 87461 | 144525 | 89 | 2 | 47608 | 96917 |
The detailed statistics of the refined datasets, containing the number of proteins, compounds, interactions and average number of interactions per protein and per compound
| No. Proteins | No. Compounds | No. interaction | No. interactions per protein | No. interactions per compound | No. inactive interactions | No. non-inactive interactions | |
|---|---|---|---|---|---|---|---|
| Davis | 361 | 68 | 24548 | 68 | 61 | 1649 | 22899 |
| KIBA | 229 | 2052 | 117184 | 511.720 | 57.107 | 24543 | 92641 |
| BindingDB | 1615 | 129109 | 144525 | 79.944 | 1.541 | 41487 | 87622 |
The detailed training settings of ELECTRA-DTA
| Parameter | Setting | Parameter | Setting |
|---|---|---|---|
| CNN kernel size | 3 | Learning rate (lr) | 3e-4 |
| Length of SMILES sequence | 100 | Length of protein sequence | 1000 |
| Vector dimension | 256 | Number of filters | 256;512 |
| Epoch | 100 | Batchsize | 256 |
| hidden neurons | 1024; 1024; 512 | dropout | 0.4 |
Comparison of all baseline approaches and ELECTRA-DTA on the KIBA datasets
| Dataset | Model | CI | MSE | R | AUPR | |
|---|---|---|---|---|---|---|
| Original KIBA Dataset | KronRLS | 0.782 | 0.411 | - | 0.342 | 0.635 |
| SimBoost | 0.836 | 0.222 | - | 0.629 | 0.760 | |
| DeepDTA | 0.863 | 0.194 | 0.848 | 0.673 | 0.788 | |
| WideDTA | 0.875 | 0.179 | - | - | - | |
| DeepCDA | 0.176 | 0.855 | 0.682 (0.008) | |||
| ELECTRA-DTA | 0.889 (0.003) | 0.795 (0.006) | ||||
| refined KIBA Dataset | DeepDTA | 0.892 (0.026) | 0.152 | 0.766 (0.085) | 0.798 (0.063) | |
| Attention-DTA | 0.880 (0.001) | 0.158 | 0.883 | 0.742 (0.015) | 0.795 (0.003) | |
| ELECTRA-DTA | 0.892 |
Bold values represent the best performance over all competitive methods
Comparison of all baseline approaches and ELECTRA-DTA on the Davis datasets
| Dataset | Model | CI | MSE | R | AUPR | |
|---|---|---|---|---|---|---|
| Original Davis Dataset | KronRLS | 0.871 | 0.379 | – | 0.407 | 0.661 |
| SimBoost | 0.872 | 0.282 | – | 0.644 | 0.709 | |
| DeepDTA | 0.876 (0.004) | 0.261 | 0.846 | 0.630 (0.017) | 0.714 (0.010) | |
| WideDTA | 0.886 | 0.262 | – | – | – | |
| DeepCDA | 0.891 (0.003) | 0.248 | 0.649 (0.009) | |||
| ELECTRA-DTA | 0.844 | 0.698 (0.010) | ||||
| refined Davis Dataset | DeepDTA | 0.882 (0.016) | 0.690 (0.035) | |||
| Attention-DTA | 0.888 (0.007) | 0.195 | 0.836 | 0.677 (0.022) | ||
| ELECTRA-DTA | 0.195 | 0.838 | 0.637 (0.048) | 0.685 (0.026) |
Bold values represent the best performance over all competitive methods
Comparison of all baseline approaches and the ELECTRA-DTA on the BindingDB Dataset
| Dataset | Model | CI | MSE | R | AUPR | |
|---|---|---|---|---|---|---|
| Original BindingDB Dataset | DeepDTA | 0.812 (0.002) | 0.832 | 0.824 | 0.623 (0.02) | 0.443 (0.01) |
| DeepCDA | 0.822 (0.001) | 0.844 | 0.808 | 0.631 (0.002) | 0.459 (0.003) | |
| ELECTRA-DTA | ||||||
| refined BindingDB Dataset | DeepDTA | 0.826 (0.001) | 0.703 | 0.845 | 0.669 (0.004) | 0.795 (0.003) |
| Attention-DTA | 0.804 (0.003) | 0.844 | 0.811 | 0.619 (0.009) | 0.764 (0.004) | |
| ELECTRA-DTA |
Bold values represent the best performance over all competitive methods
Results of ablation experiments
| Dataset | method | CI | MSE | R | AUPR | |
|---|---|---|---|---|---|---|
| Davis | ELECTRA-DTA | |||||
| Onehot-DTA | 0.850 (0.005) | 0.301 | 0.739 | 0.525 (0.018) | 0.561 (0.029) | |
| KIBA | ELECTRA-DTA | |||||
| Onehot-DTA | 0.883 (0.002) | 0.157 | 0.881 | 0.744 (0.012) | 0.792 (0.003) | |
| BindingDB | ELECTRA-DTA | |||||
| Onehot-DTA | 0.830 (0.001) | 0.700 | 0.849 | 0.659 (0.037) | 0.799 (0.004) |
Bold values represent the best performance over all competitive methods
Fig. 6Performance evaluation for ELECTRA-DTA and baseline methods on cold splitting settings, on both Davis, KIBA and BindingDB datasets. The error bar shows the standard error
Fig. 7Radar plot of the ELECTRA-DTA and REDIAL models