| Literature DB >> 35350358 |
Vishal Dey1, Raghu Machiraju1,2,3, Xia Ning1,2,3.
Abstract
Recent advances in molecular machine learning, especially deep neural networks such as graph neural networks (GNNs), for predicting structure-activity relationships (SAR) have shown tremendous potential in computer-aided drug discovery. However, the applicability of such deep neural networks is limited by the requirement of large amounts of training data. In order to cope with limited training data for a target task, transfer learning for SAR modeling has been recently adopted to leverage information from data of related tasks. In this work, in contrast to the popular parameter-based transfer learning such as pretraining, we develop novel deep transfer learning methods TAc and TAc-fc to leverage source domain data and transfer useful information to the target domain. TAc learns to generate effective molecular features that can generalize well from one domain to another and increase the classification performance in the target domain. Additionally, TAc-fc extends TAc by incorporating novel components to selectively learn feature-wise and compound-wise transferability. We used the bioassay screening data from PubChem and identified 120 pairs of bioassays such that the active compounds in each pair are more similar to each other compared to their inactive compounds. Overall, TAc achieves the best performance with an average ROC-AUC of 0.801; it significantly improves the ROC-AUC of 83% of target tasks with an average task-wise performance improvement of 7.102%, compared to the best baseline dmpna. Our experiments clearly demonstrate that TAc achieves significant improvement over all baselines across a large number of target tasks. Furthermore, although TAc-fc achieves slightly worse ROC-AUC on average compared to TAc (0.798 vs 0.801), TAc-fc still achieves the best performance on more tasks in terms of PR-AUC and F1 compared to other methods. In summary, TAc-fc is also found to be a strong model with competitive or even better performance than TAc on a notable number of target tasks.Entities:
Year: 2022 PMID: 35350358 PMCID: PMC8945064 DOI: 10.1021/acsomega.1c06805
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
Overall Comparisona
| method | ||||||
|---|---|---|---|---|---|---|
| 0.727 ± 0.124 | 0.729 ± 0.121 | 0.648 ± 0.104 | 0.661 ± 0.110 | |||
| 0.731 ± 0.120 | 0.730 ± 0.118 | 0.653 ± 0.102 | 0.735 ± 0.132 | 0.664 ± 0.107 | 0.682 ± 0.105 | |
| 0.754 ± 0.101 | 0.733 ± 0.102 | 0.619 ± 0.116 | 0.739 ± 0.156 | 0.656 ± 0.087 | 0.655 ± 0.126 | |
| 0.755 ± 0.112 | 0.729 ± 0.112 | 0.660 ± 0.119 | 0.712 ± 0.165 | 0.665 ± 0.101 | 0.651 ± 0.136 | |
| 0.754 ± 0.104 | 0.735 ± 0.105 | 0.687 ± 0.106 | 0.686 ± 0.213 | 0.669 ± 0.088 | 0.655 ± 0.140 | |
| 0.671 ± 0.213 | 0.645 ± 0.148 | |||||
| 0.733 ± 0.103 | 0.715 ± 0.103 | 0.671 ± 0.110 | 0.647 ± 0.215 | 0.649 ± 0.084 | 0.623 ± 0.144 | |
| 0.734 ± 0.102 | 0.716 ± 0.104 | 0.676 ± 0.106 | 0.653 ± 0.226 | 0.651 ± 0.085 | 0.624 ± 0.154 | |
| 0.798 ± 0.103 | 0.785 ± 0.108 | 0.729 ± 0.095 | 0.729 ± 0.146 | 0.714 ± 0.108 | ||
| 0.798 ± 0.102 | 0.784 ± 0.107 | 0.729 ± 0.094 | 0.720 ± 0.091 | |||
| 0.729 ± 0.143 | 0.720 ± 0.090 | 0.713 ± 0.103 | ||||
| 0.798 ± 0.105 | 0.785 ± 0.109 | 0.730 ± 0.097 | 0.728 ± 0.147 | 0.719 ± 0.095 | 0.713 ± 0.109 |
In this table, the columns ROC-AUC, PR-AUC, precision, sens, accuracy, and F1-score have the average and standard deviation over all bioassays in each performance metric. The best performance values are bold. The second best performance values are underlined.
Performance Comparison of TAc-dmpna vs FCN-dmpnaa
| method | ||||||
|---|---|---|---|---|---|---|
| 0.801 | 0.786 | 0.731 | 0.729 | 0.720 | 0.713 | |
| 0.763 | 0.745 | 0.702 | 0.671 | 0.672 | 0.645 | |
| 4.980 | 5.503 | 4.131 | 8.644 | 7.143 | 10.543 | |
| 5.702 | 6.085 | 4.876 | 25.281 | 7.727 | 18.464 | |
| (2.80 × 10–19) | (8.00 × 10–21) | (1.69 × 10–11) | (1.73 × 10–09) | (1.19 × 10–29) | (8.93 × 10–20) | |
| 199 (83%) | 192 (80%) | 157 (65%) | 153 (64%) | 201 (84%) | 198 (82%) | |
| 7.102 | 8.044 | 9.293 | 44.261 | 9.509 | 23.532 | |
| (5.56 × 10–22) | (5.51 × 10–26) | (2.81 × 10–27) | (7.36 × 10–25) | (5.02 × 10–35) | (3.60 × 10–26) |
In this table, the first two rows have the performance from respective methods averaged over all bioassays in each performance metric. The row diff % has the percentage difference of average performance in each metric from TAc-dmpna over FCN-dmpna (DT). The row t-diff % has the average of task-wise percentage improvement from TAc-dmpna over FCN-dmpna (DT) in respective metrics, with the corresponding p-value in parentheses below. The row N-impv has the number and percentage of target tasks where TAc-dmpna performs better than FCN-dmpna (DT) in respective metrics. The row t-impv % has the average of task-wise percentage improvement only among the corresponding improved tasks, with corresponding p-values in parentheses below.
Top-N Performance Comparison (%)a
| method | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| top- | 1 | 3 | 5 | 1 | 3 | 5 | 1 | 3 | 5 |
| 10 | 15 | 20 | 13 | 21 | 30 | 11 | 19 | 23 | |
| 2 | 13 | 18 | 3 | 17 | 22 | 4 | 17 | 22 | |
| 5 | 9 | 17 | 6 | 11 | 17 | 10 | 19 | 30 | |
| 4 | 10 | 26 | 2 | 7 | 20 | 5 | 11 | 26 | |
| 2 | 11 | 23 | 4 | 13 | 24 | 5 | 13 | 22 | |
| 5 | 18 | 33 | 6 | 14 | 31 | 2 | 8 | 23 | |
| 2 | 9 | 16 | 1 | 5 | 15 | 3 | 10 | 19 | |
| 2 | 8 | 17 | 1 | 8 | 15 | 2 | 11 | 19 | |
| 18 | 52 | 85 | 17 | 48 | 81 | 12 | 49 | ||
| 12 | 45 | 79 | 15 | 48 | 80 | 80 | |||
| 14 | 55 | 83 | 13 | 43 | 78 | ||||
| 16 | 51 | 81 | 15 | ||||||
In this table, the columns ROC-AUC, PR-AUC, and F1 have the percentage of tasks for which each method is ranked within the top-1, top-3, and top-5 best methods in respective metrics. The best performance values are in bold.
Figure 7Proposed architecture of TAc-fc. The feature learner F learns compound embedding r given the corresponding molecular graph. The feature-wise discriminator L learns feature-wise transferability given the learned compound embedding r. r is further scaled into z using its feature entropy from p out of L. The compound-wise discriminator G learns the compound-wise transferability given z. The domain-wise classifier S classifies the compound as active/inactive.
Comparison on Discriminators (with dmpna)a
| method | ||||||
|---|---|---|---|---|---|---|
| 0.801 | 0.786 | 0.731 | 0.729 | 0.720 | 0.713 | |
| 0.798 | 0.785 | 0.730 | 0.728 | 0.719 | 0.713 | |
| –0.375 | –0.127 | –0.137 | –0.137 | –0.139 | 0.000 | |
| –0.380 | –0.119 | –0.072 | 0.116 | –0.150 | –0.064 | |
| (2.59 × 10–04) | (4.88 × 10–01) | (7.26 × 10–01) | (7.00 × 10–01) | (5.53 × 10–01) | (8.04 × 10–01) | |
| 93 (39%) | 125 (52%) | 119 (50%) | 111 (46%) | 99 (41%) | 112 (47%) | |
| 0.921 | 1.373 | 2.756 | 7.037 | 2.230 | 3.729 | |
| (7.73 × 10–18) | (4.13 × 10–23) | (1.69 × 10–18) | (5.85 × 10–19) | (4.26 × 10–14) | (1.31 × 10–15) | |
| 0.801 | 0.786 | 0.730 | 0.734 | 0.721 | 0.716 | |
| 0.000 | 0.000 | –0.137 | 0.686 | 0.139 | 0.421 | |
| 0.010 | –0.080 | –0.130 | 1.583 | 0.128 | 0.516 | |
| (7.78 × 10–01) | (6.72 × 10–01) | (6.32 × 10–01) | (3.03 × 10–01) | (4.01 × 10–01) | (3.90 × 10–01) | |
| 135 (56%) | 119 (50%) | 123 (51%) | 126 (52%) | 128 (53%) | 130 (54%) | |
| 0.845 | 1.330 | 2.763 | 8.798 | 1.971 | 4.165 | |
| (1.13 × 10–23) | (5.03 × 10–24) | (2.67 × 10–17) | (8.22 × 10–18) | (4.75 × 10–21) | (1.75 × 10–15) | |
| 0.799 | 0.785 | 0.732 | 0.722 | 0.721 | 0.711 | |
| –0.250 | –0.127 | 0.137 | –0.960 | 0.139 | –0.281 | |
| –0.192 | –0.091 | 0.218 | –0.768 | 0.170 | –0.326 | |
| (4.00 × 10–02) | (3.76 × 10–01) | (5.44 × 10–01) | (1.42 × 10–01) | (3.19 × 10–01) | (4.98 × 10–01) | |
| 100 (42%) | 114 (48%) | 124 (52%) | 117 (49%) | 125 (52%) | 123 (51%) | |
| 1.029 | 1.597 | 2.992 | 6.999 | 2.037 | 3.764 | |
| (3.22 × 10–13) | (2.22 × 10–21) | (3.85 × 10–24) | (1.11 × 10–19) | (9.65 × 10–20) | (1.12 × 10–16) |
In this table, the first row block has the average performance of TAc. Each of the other row blocks has the performance comparison of a TAc-fc variant with respect to TAc. The metric diff % represents the difference of average performance of each comparison method with respect to TAc; t-diff % represents the average of the task-wise improvement, with corresponding p-values in the parentheses below; N-impv represents the number of improved tasks and its proportion in the parentheses; and t-impv % represents the average of the task-wise improvement only among the improved tasks, with corresponding p-values in the parentheses below.
Figure 1Parameter study of TAc-dmpna. The columns represent different evaluation metrics. The values in each cell have the average of the best performance achieved with given α and optimal choice of other hyperparameters. Darker cells indicate better performance.
Figure 2Parameter Study of TAc-fc-dmpna in terms of ROC-AUC.
Figure 3Similarity matrices of target pairs with significant ROC-AUC improvement/degradation.
Figure 4ROC-AUC improvement from TAc over FCN-dmpna vs bioassay similarity.
Figure 5Visualization of a few selected compounds from the target bioassay and their corresponding top 5 most similar compounds from the source bioassay.
Overall Performance Comparison of gnnCP
| method | CI | R@3 | R@5 | ndcg@3 | ndcg@5 | R@5% | ndcg@5% |
|---|---|---|---|---|---|---|---|
| 0.706 | 0.543 | 0.644 | 0.814 | 0.816 | 0.420 | 0.838 | |
| 0.711 | 0.545 | 0.655 | 0.815 | 0.819 | 0.437 | 0.846 | |
| 0.687 | 0.500 | 0.626 | 0.789 | 0.797 | 0.375 | 0.816 | |
| 0.687 | 0.519 | 0.632 | 0.790 | 0.797 | 0.396 | 0.813 | |
| 0.731 | 0.643 | 0.709 | 0.854 | 0.847 | 0.579 | 0.896 | |
| 2.353 | 6.608 | 4.460 | 3.114 | 2.421 | 18.428 | 4.475 | |
| 2.535 | 7.645 | 4.720 | 3.406 | 2.569 | 24.578 | 4.979 | |
| 1.14e-10 | 4.89e-10 | 9.87e-13 | 1.42e-12 | 2.25e-12 | 3.95e-15 | 1.83e-11 |
In this table, the columns have the respective average of each performance metric over all bioassays obtained by the respective optimal hyperparameter settings. The best/second best performance under each metric is bold/underlined.
Notations
| method | meanings |
|---|---|
| compound/bioassay | |
| molecular graph with set of atoms | |
| u | an atom in |
| (u, v) | a bond connecting
atoms u and v in |
| neighbors
of atom u in | |
| set of compounds in a bioassay | |
| set of labels corresponding to | |
| input feature space | |
| label space | |
| a domain consisting of | |
| a task consisting of label space and a decision function ω(·) | |
| hidden state | |
| molecular representation out of | |
| scaled molecular representation |
Figure 6Proposed architecture of TAc. The feature learner F learns compound representations r given the corresponding molecular graph. The domain-wise classifier S classifies the compound as active/inactive.