| Literature DB >> 36188230 |
Ying Qian1, Jian Wu1, Qian Zhang1.
Abstract
Compound-protein interaction (CPI) prediction is a foundational task for drug discovery, which process is time-consuming and costly. The effectiveness of CPI prediction can be greatly improved using deep learning methods to accelerate drug development. Large number of recent research results in the field of computer vision, especially in deep learning, have proved that the position, geometry, spatial structure and other features of objects in an image can be well characterized. We propose a novel molecular image-based model named CAT-CPI (combining CNN and transformer to predict CPI) for CPI task. We use Convolution Neural Network (CNN) to learn local features of molecular images and then use transformer encoder to capture the semantic relationships of these features. To extract protein sequence feature, we propose to use a k-gram based method and obtain the semantic relationships of sub-sequences by transformer encoder. In addition, we build a Feature Relearning (FR) module to learn interaction features of compounds and proteins. We evaluated CAT-CPI on three benchmark datasets-Human, Celegans, and Davis-and the experimental results demonstrate that CAT-CPI presents competitive performance against state-of-the-art predictors. In addition, we carry out Drug-Drug Interaction (DDI) experiments to verify the strong potential of the methods based on molecular images and FR module.Entities:
Keywords: compound-protein interaction; deep learning; drug-drug interaction; molecular image; transformer encoder
Year: 2022 PMID: 36188230 PMCID: PMC9520300 DOI: 10.3389/fmolb.2022.963912
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
FIGURE 1An overall architecture of the CAT-CPI. The model contains three modules: compound feature extraction, protein feature extraction and feature relearning module.
FIGURE 2An overview of sliding window division method and types of strings of k-gram.
Summary of the datasets.
| Compounds | Proteins | Samples | Pos Samples | |
|---|---|---|---|---|
| Human | 1709 | 2043 | 6212 | 3364 |
| Celegans | 1723 | 1708 | 7511 | 3893 |
| Davis | 68 | 379 | 11,103 | 1506 |
The scores on Human dataset compared to traditional machine learning methods.
| Method | ROC-AUC | Precision | Recall | |
|---|---|---|---|---|
| Human | KNN | 0.860 | 0.927 | 0.798 |
| RF | 0.940 | 0.897 | 0.861 | |
| L2 | 0.911 | 0.913 | 0.867 | |
| SVM | 0.910 | 0.966 | 0.969 | |
| Ours | 0.986 ± 0.001 | 0.948 ± 0.002 | 0.971 ± 0.003 | |
| Celegans | KNN | 0.858 | 0.801 | 0.827 |
| RF | 0.902 | 0.821 | 0.844 | |
| L2 | 0.892 | 0.890 | 0.877 | |
| SVM | 0.894 | 0.785 | 0.818 | |
| Ours | 0.992 ± 0.001 | 0.974 ± 0.004 | 0.948 ± 0.003 |
FIGURE 3Comparison of CAT-CPI and other deep learning methods on Human (A) and Celegans (B) datasets.
Comparison with other methods on Davis dataset.
| Method | ROC-AUC | PR-AUC | Recall |
|---|---|---|---|
| RF | 0.907 | 0.481 | 0.831 |
| SVM | 0.821 | 0.185 | 0.799 |
| GBDT | 0.836 | 0.271 | 0.755 |
| LR | 0.835 ± 0.010 | 0.232 ± 0.023 | 0.699 ± 0.051 |
| GNN-CPI | 0.842 ± 0.006 | 0.269 ± 0.020 | 0.764 ± 0.045 |
| DeepDTA | 0.880 ± 0.007 | 0.302 ± 0.044 | 0.865 ± 0.020 |
| DeepConv-DTI | 0.884 ± 0.008 | 0.299 ± 0.039 | 0.880 ± 0.024 |
| TransformerCPI | 0.841 ± 0.001 | 0.227 ± 0.003 | 0.842 ± 0.004 |
| PWO-CPI | 0.848 ± 0.001 | 0.278 ± 0.001 | 0.884 ± 0.003 |
| MolTrans | 0.907 ± 0.002 | 0.404 ± 0.016 | 0.800 ± 0.022 |
| CAT-CPI | 0.920 ± 0.001 | 0.481 ± 0.001 | 0.888 ± 0.001 |
FIGURE 4Images obtained by Rdkit based on SMILES sequences of Salicylic acid and Phenyl salicylate.
FIGURE 5The flowchart of our DDI model. Two pictures are fed into same CNN Block and Transformer Encoder. Then the feature map is stacked to obtain the prediction results by FR module.
Results of DDI experiments on BIOSNAP dataset.
| Method | ROC-AUC | PR-AUC | F1 |
|---|---|---|---|
| LR | 0.802 ± 0.001 | 0.779 ± 0.001 | 0.741 ± 0.002 |
| Nat.Port | 0.853 ± 0.001 | 0.848 ± 0.001 | 0.714 ± 0.001 |
| Mol2Vec | 0.879 ± 0.006 | 0.861 ± 0.005 | 0.798 ± 0.007 |
| MolVAE | 0.892 ± 0.009 | 0.877 ± 0.009 | 0.788 ± 0.033 |
| DeepDDI | 0.886 ± 0.007 | 0.871 ± 0.007 | 0.817 ± 0.007 |
| CASTER | 0.910 ± 0.005 | 0.887 ± 0.008 | 0.843 ± 0.005 |
| Ours | 0.960 ± 0.002 | 0.938 ± 0.002 | 0.926 ± 0.001 |
Experimental results of CAT-CPI on different size images.
| Image_size | AUC | AUPRC | Recall |
|---|---|---|---|
| 3*64*64 | 0.901 ± 0.001 | 0.371 ± 0.001 | 0.904 ± 0.001 |
| 3*128*128 | 0.920 ± 0.001 | 0.481 ± 0.001 | 0.888 ± 0.001 |
| 3*256*256 | 0.918 ± 0.001 | 0.471 ± 0.002 | 0.870 ± 0.001 |
Ablation study on Human and Davis datasets.
| Method | ROC-AUC | PR-AUC | Recall | |
|---|---|---|---|---|
| Human | CAT-CPI | 0.986 ± 0.001 | 0.948 ± 0.002 | 0.971 ± 0.003 |
| -CNN | 0.980 ± 0.001 | 0.942 ± 0.004 | 0.942 ± 0.003 | |
| -Trans | 0.982 ± 0.001 | 0.939 ± 0.001 | 0.936 ± 0.003 | |
| Word2vec | 0.982 ± 0.001 | 0.925 ± 0.003 | 0.949 ± 0.003 | |
| -P_Trans | 0.981 ± 0.001 | 0.954 ± 0.003 | 0.936 ± 0.003 | |
| -FR | 0.966 ± 0.001 | 0.923 ± 0.002 | 0.955 ± 0.001 | |
| Davis | CAT-CPI | 0.920 ± 0.001 | 0.481 ± 0.001- | 0.888 ± 0.023 |
| -CNN | 0.914 ± 0.003 | 0.473 ± 0.011 | 0.849 ± 0.007 | |
| -Trans | 0.912 ± 0.004 | 0.443 ± 0.005 | 0.848 ± 0.001 | |
| Word2vec | 0.908 ± 0.002 | 0.436 ± 0.003 | 0.881 ± 0.004 | |
| -P_Trans | 0.918 ± 0.002 | 0.478 ± 0.004 | 0.856 ± 0.007 | |
| -FR | 0.853 ± 0.004 | 0.305 ± 0.017 | 0.824 ± 0.022 |
Network component ablation experiments on Davis dataset.
| Step | ROC-AUC | PR-AUC | Recall |
|---|---|---|---|
| GNN-CPI23 | 0.840 ± 0.012 | 0.269 ± 0.020 | 0.696 ± 0.047 |
| our (GNN-CPI) | 0.890 ± 0.002 | 0.312 ± 0.003 | 0.816 ± 0.004 |
| DeepDTA15 | 0.880 ± 0.007 | 0.302 ± 0.044 | 0.764 ± 0.045 |
| our (DeepDTA) | 0.908 ± 0.002 | 0.431 ± 0.002 | 0.845 ± 0.003 |
| CAT-CPI | 0.920 ± 0.001 | 0.481 ± 0.001 | 0.888 ± 0.001 |
Results of model parameters and computational quantities ablation experiments on the Davis dataset.
| Method | Params (M) | FLOPs | ROC-AUC | PR-AUC | Recall |
|---|---|---|---|---|---|
| PWO-CPI | 6.353 | 1.095G | 0.835 ± 0.004 | 0.158 ± 0.003 | 0.798 ± 0.003 |
| CAT-CPI | 6.179 | 935.222M | 0.920 ± 0.001 | 0.481 ± 0.001 | 0.888 ± 0.001 |
| a) | 2.440 | 486.932M | 0.901 ± 0.002 | 0.358 ± 0.003 | 0.825 ± 0.002 |
| b) | 3.827 | 564.921M | 0.854 ± 0.002 | 0.290 ± 0.001 | 0.782 ± 0.001 |
| c) | 2.113 | 931.159M | 0.912 ± 0.001 | 0.441 ± 0.002 | 0.881 ± 0.001 |
| d) | 3.849 | 603.346M | 0.883 ± 0.001 | 0.280 ± 0.002 | 0.860 ± 0.003 |
| e) | 5.881 | 586.045M | 0.877 ± 0.001 | 0.285 ± 0.002 | 0.910 ± 0.002 |
| f) | 6.037 | 783.179M | 0.901 ± 0.001 | 0.371 ± 0.001 | 0.904 ± 0.001 |
| g) | 6.179 | 959.602M | 0.918 ± 0.001 | 0.471 ± 0.002 | 0.870 ± 0.001 |
FIGURE 6Different methods of handling compound images.
Results of the geometric transformation on the Davis dataset.
| Methods | ROC-AUC | PR-AUC | Recall |
|---|---|---|---|
| Rotation | 0.916 ± 0.001 | 0.489 ± 0.001 | 0.866 ± 0.001 |
| HorizontalFlip | 0.918 ± 0.001 | 0.488 ± 0.001 | 0.888 ± 0.001 |
| VerticalFlip | 0.916 ± 0.001 | 0.477 ± 0.001 | 0.867 ± 0.001 |
| Translation, size = (96,96) | 0.916 ± 0.001 | 0.483 ± 0.002 | 0.884 ± 0.002 |
| Translation, size = (64,64) | 0.911 ± 0.001 | 0.464 ± 0.001 | 0.863 ± 0.001 |
| Translation, size = (48,48) | 0.910 ± 0.001 | 0.423 ± 0.003 | 0.849 ± 0.002 |
| Translation, size = (32,32) | 0.908 ± 0.001 | 0.456 ± 0.001 | 0.860 ± 0.001 |
Geometric transformation tests of PWO-CPI and CAT-CPI on the Davis dataset.
| Methods | CAT-CPI (AUC = 0.920) | PWO-CPI (AUC = 0.848) | ||
|---|---|---|---|---|
| ROC-AUC | AUC decrease | ROC-AUC | AUC decrease | |
| Rotation | 0.916 | −0.004 | 0.845 | −0.003 |
| HorizontalFlip | 0.918 | −0.002 | 0.844 | −0.004 |
| VerticalFlip | 0.916 | −0.004 | 0.845 | −0.003 |
| Translation, size = (96,96) | 0.916 | −0.004 | 0.845 | −0.003 |
| Translation, size = (64,64) | 0.911 | −0.009 | 0.835 | −0.013 |
| Translation, size = (48,48) | 0.91 | −0.010 | 0.834 | −0.014 |
| Translation, size = (32,32) | 0.908 | −0.012 | 0.830 | −0.018 |