| Literature DB >> 26445039 |
Mohammed Mumtaz Al-Dabbagh1, Naomie Salim2, Mubarak Himmat3, Ali Ahmed4,5, Faisal Saeed6.
Abstract
One of the most widely-used techniques for ligand-based virtual screening is similarity searching. This study adopted the concepts of quantum mechanics to present as state-of-the-art similarity method of molecules inspired from quantum theory. The representation of molecular compounds in mathematical quantum space plays a vital role in the development of quantum-based similarity approach. One of the key concepts of quantum theory is the use of complex numbers. Hence, this study proposed three various techniques to embed and to re-represent the molecular compounds to correspond with complex numbers format. The quantum-based similarity method that developed in this study depending on complex pure Hilbert space of molecules called Standard Quantum-Based (SQB). The recall of retrieved active molecules were at top 1% and top 5%, and significant test is used to evaluate our proposed methods. The MDL drug data report (MDDR), maximum unbiased validation (MUV) and Directory of Useful Decoys (DUD) data sets were used for experiments and were represented by 2D fingerprints. Simulated virtual screening experiment show that the effectiveness of SQB method was significantly increased due to the role of representational power of molecular compounds in complex numbers forms compared to Tanimoto benchmark similarity measure.Entities:
Keywords: complex numbers; ligand-based; quantum mechanics; quantum-based similarity; similarity searching approach; virtual screening
Mesh:
Year: 2015 PMID: 26445039 PMCID: PMC6331860 DOI: 10.3390/molecules201018107
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 13-D subspaces model.
MDDR-DS1 structure activity classes.
| Activity Index | Activity Class | Active Molecules | Pairwise Similarity |
|---|---|---|---|
| 31420 | Renin inhibitors | 1130 | 0.290 |
| 71523 | HIV protease inhibitors | 750 | 0.198 |
| 37110 | Thrombin inhibitors | 803 | 0.180 |
| 31432 | Angiotensin II AT1 antagonists | 943 | 0.229 |
| 42731 | Substance P antagonists | 1246 | 0.149 |
| 06233 | Substance P antagonists | 752 | 0.140 |
| 06245 | 5HT reuptake inhibitors | 359 | 0.122 |
| 07701 | D2 antagonists | 395 | 0.138 |
| 06235 | 5HT1A agonists | 827 | 0.133 |
| 78374 | Protein kinase C inhibitors | 453 | 0.120 |
| 78331 | Cyclooxygenase inhibitors | 636 | 0.108 |
MDDR-DS2 structure activity classes.
| Activity Index | Activity Class | Active Molecules | Pairwise Similarity |
|---|---|---|---|
| 07707 | Adenosine (A1) agonists | 207 | 0.229 |
| 07708 | Adenosine (A2) agonists | 156 | 0.305 |
| 31420 | Renin inhibitors 1 | 1300 | 0.290 |
| 42710 | CCK agonists | 111 | 0.361 |
| 64100 | Monocyclic-lactams | 1346 | 0.336 |
| 64200 | Cephalosporins | 113 | 0.322 |
| 64220 | Carbacephems | 1051 | 0.269 |
| 64500 | Carbapenems | 126 | 0.260 |
| 64350 | Tribactams | 388 | 0.305 |
| 75755 | Vitamin D analogous | 455 | 0.386 |
MDDR-DS3 structure activity classes.
| Activity Index | Activity Class | Active Molecules | Pairwise Similarity |
|---|---|---|---|
| 09249 | Muscarinic (M1) agonists | 900 | 0.111 |
| 12455 | NMDA receptor antagonists | 1400 | 0.098 |
| 12464 | Nitric oxide synthase inhibitors | 505 | 0.102 |
| 31281 | Dopamine-hydroxylase inhibitors | 106 | 0.125 |
| 43210 | Aldose reductase inhibitors | 957 | 0.119 |
| 71522 | Reverse transcriptase inhibitors | 700 | 0.103 |
| 75721 | Aromatase inhibitors | 636 | 0.110 |
| 78331 | Cyclooxygenase inhibitors | 636 | 0.108 |
| 78348 | Phospholipase A2 inhibitors | 617 | 0.123 |
| 78351 | Lipoxygenase inhibitors | 2111 | 0.113 |
MUV structure activity classes.
| Activity Index | Activity Class | Pairwise Similarity |
|---|---|---|
| 466 | S1P1 rec. (agonists) | 0.117 |
| 548 | PKA (inhibitors | 0.128 |
| 600 | SF1 (inhibitors) | 0.123 |
| 644 | Rho-Kinase2 (inhibitors) | 0.122 |
| 652 | HIV RT-RNase (inhibitors) | 0.099 |
| 689 | Eph rec. A4 (inhibitors | 0.113 |
| 692 | SF1 (agonists) | 0.114 |
| 712 | HSP 90 (inhibitors) 30 | 0.106 |
| 713 | ER-a-Coact. Bind. (inhibitors) | 0.113 |
| 733 | ER-b-Coact. Bind. (inhibitors) | 0.114 |
| 737 | ER-a-Coact. Bind. (potentiators) | 0.129 |
| 810 | FAK (inhibitors) | 0.107 |
| 832 | Cathepsin G (inhibitors) | 0.151 |
| 846 | FXIa (inhibitors) | 0.161 |
| 852 | FXIIa (inhibitors) | 0.150 |
| 858 | D1 rec. (allosteric modulators) | 0.111 |
| 859 | M1 rec. (allosteric inhibitors) | 0.126 |
Number of active and inactive compounds for twelve DUD sub datasets, where Na: number of active compounds; Ndec: number of decoys.
| No. | Data Set | Active and Inactive | |
|---|---|---|---|
| 1 | Fgfr1t | 120 | 4550 |
| 2 | fxa | 146 | 5745 |
| 3 | gart | 40 | 879 |
| 4 | gbp | 52 | 2140 |
| 5 | gr | 78 | 2947 |
| 6 | hivpr | 62 | 2038 |
| 7 | hivrt | 43 | 1519 |
| 8 | hmga | 35 | 1480 |
| 9 | Hsp90 | 37 | 979 |
| 10 | mr | 15 | 636 |
| 11 | na | 49 | 1874 |
| 12 | pr | 27 | 1041 |
| Total | 704 | 25,828 | |
Retrieval results of top 1% and 5% for MDDR-DS1 dataset.
| Activity Index | SQB (Complex) Tech. 1 | SQB (Complex) Tech. 2 | SQB (Complex) Tech. 3 | SQB (Real) | TAN | SQB (Complex) Tech. 1 | SQB (Complex) Tech. 2 | SQB (Complex) Tech. 3 | SQB (Real) | TAN |
|---|---|---|---|---|---|---|---|---|---|---|
| 31420 | 72.18 | 72.44 | 73.73 | 70.03 | 69.69 | 87.75 | 87.24 | 87.22 | 84.03 | 83.49 |
| 71523 | 26.33 | 25.41 | 26.84 | 25.58 | 25.94 | 60.16 | 48.48 | 48.7 | 48.65 | 48.92 |
| 37110 | 18.33 | 22.09 | 24.73 | 9 | 9.63 | 39.81 | 45.77 | 45.62 | 19.56 | 21.01 |
| 31432 | 41.61 | 37.2 | 36.66 | 37.34 | 35.82 | 82 | 70.57 | 70.44 | 76.48 | 74.29 |
| 42731 | 19.06 | 20.49 | 21.17 | 17.34 | 17.77 | 28.77 | 24.58 | 24.35 | 28.19 | 29.68 |
| 06233 | 12.45 | 12.26 | 12.49 | 10.75 | 13.87 | 20.96 | 19.04 | 20.04 | 21.04 | 27.68 |
| 06245 | 7.18 | 6.37 | 6.03 | 6.03 | 6.51 | 15.39 | 13.99 | 13.72 | 13.63 | 16.54 |
| 07701 | 10.33 | 10.91 | 11.35 | 8.25 | 8.63 | 26.9 | 25.41 | 26.73 | 21.85 | 24.09 |
| 06235 | 10.51 | 10.9 | 10.15 | 9.14 | 9.71 | 22.47 | 23.72 | 22.81 | 19.13 | 20.06 |
| 78374 | 12.46 | 11.77 | 13.08 | 13.65 | 13.69 | 20.95 | 20.73 | 19.56 | 20.55 | 20.51 |
| 78331 | 6.08 | 6.54 | 5.92 | 5.78 | 7.17 | 10.31 | 11.48 | 11.37 | 13.1 | 16.2 |
| 21.50 | 21.48 | 19.35 | 19.85 | 35.54 | 35.50 | 33.29 | 34.77 | |||
| 2 | 1 | 5 | 0 | 3 | 4 | 2 | 0 | 0 | 4 | |
Retrieval results of top 1% and 5% for MDDR-DS2 dataset.
| Activity Index | SQB (Complex) Tech. 1 | SQB (Complex) Tech. 2 | SQB (Complex) Tech. 3 | SQB (Real) | TAN | SQB (Complex) Tech. 1 | SQB (Complex) Tech. 2 | SQB (Complex) Tech. 3 | SQB (Real) | TAN |
|---|---|---|---|---|---|---|---|---|---|---|
| 07707 | 72.62 | 71.31 | 72.09 | 58.5 | 61.84 | 75.15 | 74.22 | 74.37 | 70.39 | 70.39 |
| 07708 | 95.87 | 96.06 | 95.68 | 55.61 | 47.03 | 99.87 | 100 | 99.61 | 64.97 | 56.58 |
| 31420 | 72.02 | 71.32 | 78.56 | 62.22 | 65.1 | 95.04 | 95.24 | 94.88 | 87.04 | 88.19 |
| 42710 | 82.18 | 77.45 | 76.82 | 83 | 81.27 | 91.09 | 93 | 91.09 | 89.18 | 88.09 |
| 64100 | 88.9 | 87.92 | 87.8 | 80.73 | 80.31 | 99.23 | 98.94 | 99.03 | 94.59 | 93.75 |
| 64200 | 63.3 | 70 | 70.18 | 53.13 | 53.84 | 95.18 | 98.93 | 99.38 | 81.34 | 77.68 |
| 64220 | 60.9 | 66.79 | 67.58 | 34.61 | 38.64 | 84.06 | 90.9 | 90.62 | 48.11 | 52.19 |
| 64500 | 67.36 | 78.64 | 79.2 | 29.04 | 30.56 | 83.28 | 92.72 | 92.48 | 47.68 | 44.8 |
| 64350 | 82.45 | 80.83 | 81.68 | 81.86 | 80.18 | 96.02 | 93.75 | 90.78 | 87.96 | 91.71 |
| 75755 | 97.6 | 97.91 | 98.02 | 85.4 | 87.56 | 98.17 | 98.39 | 98.37 | 94.07 | 94.82 |
| 78.32 | 79.82 | 62.41 | 62.63 | 91.70 | 93.06 | 76.53 | 75.82 | |||
| 3 | 1 | 5 | 1 | 0 | 3 | 6 | 1 | 0 | 0 | |
Retrieval results of top 1% and 5% for MDDR-DS3 dataset.
| Activity Index | SQB (Complex) Tech. 1 | SQB (Complex) Tech. 2 | SQB (Complex) Tech. 3 | SQB (Real) | TAN | SQB (Complex) Tech. 1 | SQB (Complex) Tech. 2 | SQB (Complex) Tech. 3 | SQB(Real) | TAN |
|---|---|---|---|---|---|---|---|---|---|---|
| 09249 | 10.17 | 10.61 | 10.99 | 9.92 | 12.12 | 18.05 | 18.26 | 17.8 | 21.4 | 24.17 |
| 12455 | 5.65 | 6.65 | 7.03 | 5.12 | 6.57 | 7.59 | 10.23 | 11.42 | 8.1 | 10.29 |
| 12464 | 5.04 | 6.17 | 6.92 | 5.56 | 8.17 | 12.78 | 16.09 | 16.79 | 10.56 | 15.22 |
| 31281 | 15.14 | 18.19 | 18.67 | 10.29 | 16.95 | 20.86 | 27.43 | 29.05 | 15.14 | 29.62 |
| 43210 | 5.77 | 6.93 | 6.83 | 5.31 | 6.27 | 11.83 | 13.54 | 14.12 | 14.47 | 16.07 |
| 71522 | 4.74 | 6.34 | 6.57 | 3.03 | 3.75 | 10.56 | 13.26 | 13.82 | 9.2 | 12.37 |
| 75721 | 18.44 | 20.14 | 20.38 | 15.24 | 17.32 | 25.1 | 30.13 | 30.61 | 22.27 | 25.21 |
| 78331 | 6.16 | 6.03 | 6.16 | 5.48 | 6.31 | 10.16 | 12.11 | 11.97 | 12.03 | 15.01 |
| 78348 | 8.03 | 8 | 8.99 | 9.67 | 10.15 | 20 | 21.89 | 21.14 | 22.72 | 24.67 |
| 78351 | 10.87 | 11.98 | 12.5 | 10.03 | 9.84 | 11.8 | 12.63 | 13.3 | 11.95 | 11.71 |
| 9.0 | 10.10 | 7.96 | 9.74 | 14.87 | 17.55 | 18.0 | 14.78 | |||
| 0 | 1 | 5 | 0 | 4 | 0 | 0 | 5 | 0 | 5 | |
Retrieval results of top 1% and 5% for MUV dataset.
| Activity Index | SQB (Complex) Tech. 1 | SQB (Complex) Tech. 2 | SQB (Complex) Tech. 3 | SQB (Real) | TAN | SQB (Complex) Tech. 1 | SQB (Complex) Tech. 2 | SQB (Complex) Tech. 3 | SQB (Real) | TAN |
|---|---|---|---|---|---|---|---|---|---|---|
| 466 | 2.41 | 1.03 | 1.38 | 2.41 | 3.1 | 5.17 | 8.28 | 8.62 | 6.9 | 5.86 |
| 548 | 8.28 | 10.34 | 11.38 | 7.59 | 8.62 | 22.07 | 24.14 | 24.14 | 21.03 | 22.76 |
| 600 | 3.79 | 4.48 | 5.52 | 2.41 | 3.79 | 13.1 | 14.83 | 16.21 | 10.34 | 11.38 |
| 644 | 7.59 | 8.28 | 8.97 | 7.24 | 7.59 | 14.14 | 17.93 | 17.93 | 17.24 | 17.59 |
| 652 | 2.76 | 4.14 | 3.79 | 2.07 | 2.76 | 7.59 | 8.97 | 9.66 | 8.62 | 7.93 |
| 689 | 3.79 | 5.17 | 4.48 | 2.07 | 3.79 | 8.28 | 11.38 | 11.72 | 8.28 | 9.66 |
| 692 | 0.69 | 1.03 | 1.38 | 0.69 | 0.69 | 3.79 | 5.17 | 4.83 | 6.21 | 4.83 |
| 712 | 3.45 | 4.48 | 5.17 | 4.14 | 4.14 | 9.31 | 12.41 | 11.03 | 16.9 | 10.34 |
| 713 | 2.76 | 2.76 | 2.76 | 2.41 | 3.1 | 7.59 | 6.55 | 5.86 | 7.24 | 7.24 |
| 733 | 3.45 | 4.14 | 4.14 | 1.38 | 3.45 | 9.31 | 8.62 | 8.62 | 8.97 | 8.97 |
| 737 | 2.41 | 1.72 | 1.72 | 1.38 | 2.41 | 8.97 | 8.62 | 8.28 | 12.41 | 8.28 |
| 810 | 1.72 | 2.41 | 1.72 | 2.41 | 2.07 | 7.24 | 10.34 | 11.03 | 10.34 | 6.9 |
| 832 | 6.21 | 7.24 | 8.28 | 4.48 | 6.55 | 13.1 | 14.83 | 14.83 | 11.38 | 13.1 |
| 846 | 10.34 | 12.76 | 12.41 | 8.97 | 9.66 | 25.86 | 25.86 | 26.9 | 23.45 | 28.62 |
| 852 | 9.66 | 9.31 | 9.66 | 8.62 | 12.41 | 19.31 | 20 | 20 | 18.62 | 21.38 |
| 858 | 1.72 | 1.38 | 1.38 | 3.1 | 1.72 | 5.17 | 6.21 | 6.21 | 7.93 | 5.86 |
| 859 | 1.72 | 2.07 | 2.41 | 2.07 | 1.38 | 7.93 | 10 | 8.62 | 10.69 | 8.97 |
| 4.27 | 4.86 | 3.73 | 4.54 | 11.05 | 12.59 | 12.15 | 11.74 | |||
| 2 | 6 | 9 | 2 | 3 | 2 | 3 | 8 | 5 | 2 | |
Retrieval results of top 1% and 5% for DUD dataset.
| Activity Index | SQB (Complex) Tech. 1 | SQB (Complex) Tech. 2 | SQB (Complex) Tech. 3 | SQB (Real) | TAN | SQB (Complex) Tech. 1 | SQB (Complex) Tech. 2 | SQB (Complex) Tech. 3 | SQB (Real) | TAN |
|---|---|---|---|---|---|---|---|---|---|---|
| FGFR1T | 2.67 | 2.92 | 2.92 | 2.33 | 2.5 | 6.75 | 6.5 | 7 | 6.17 | 6.67 |
| FXA | 3.15 | 3.36 | 3.36 | 2.26 | 1.92 | 8.84 | 7.74 | 8.29 | 8.08 | 7.88 |
| GART | 5.25 | 5.75 | 5.75 | 7.5 | 7.75 | 23 | 22.75 | 23.25 | 22.25 | 22.25 |
| GBP | 15.77 | 16.73 | 15.96 | 13.65 | 13.27 | 28.65 | 30.58 | 30.96 | 21.35 | 20.96 |
| GR | 2.18 | 3.46 | 3.21 | 2.31 | 2.31 | 6.79 | 8.21 | 8.46 | 6.67 | 6.41 |
| HIVPR | 4.52 | 2.74 | 3.55 | 3.39 | 3.55 | 13.23 | 10.97 | 11.29 | 11.45 | 11.77 |
| HIVRT | 1.86 | 1.86 | 1.86 | 1.63 | 1.63 | 5.58 | 6.74 | 6.98 | 4.65 | 4.88 |
| HMGA | 6.57 | 5.43 | 5.43 | 5.71 | 6.29 | 10.86 | 11.43 | 13.14 | 10.29 | 10.29 |
| HSP90 | 3.78 | 4.05 | 4.05 | 2.16 | 1.62 | 9.19 | 9.19 | 8.38 | 8.11 | 8.11 |
| MR | 5.33 | 5.33 | 5.33 | 5.33 | 5.33 | 8.67 | 9.33 | 10 | 9.33 | 9.33 |
| NA | 3.06 | 4.9 | 5.31 | 2.24 | 2.24 | 6.33 | 9.59 | 9.8 | 4.9 | 5.1 |
| PR | 2.22 | 2.22 | 2.22 | 1.85 | 1.85 | 5.19 | 6.67 | 5.19 | 4.44 | 4.81 |
| 4.69 | 4.89 | 4.19 | 4.18 | 11.09 | 11.64 | 9.8 | 9.87 | |||
| 5 | 7 | 6 | 1 | 2 | 3 | 2 | 9 | 0 | 0 | |
Rankings of Similarity Methods Based on Kendall W Test Results using MDDR (DS1–DS3), MUV and DUD datasets for Top 1%.
| Data Set | Ranking | |||
|---|---|---|---|---|
| DS1 | 0.222 | 9.808 | 0.043 | SQB(C./T3) > SQB(C./T1) > SQB(C./T2) > TAN > SQB(R.) |
| DS2 | 0.452 | 18.08 | 0.001 | SQB(C./T3) = SQB(C./T1) > SQB(C./T2) > SQB(R.) > TAN |
| DS3 | 0.483 | 19.356 | 0.0006 | SQB(C./T3) > SQB(C./T2) = TAN > SQB(C./T1) > SQB(R.) |
| MUV | 0.272 | 18.518 | 0.0009 | SQB(C./T3) > SQB(C./T2) > TAN > SQB(C./T1) > SQB(R.) |
| DUD | 0.258 | 12.415 | 0.014 | SQB(C./T3) > SQB(C./T2) > SQB(C./T1) > TAN > SQB(R.) |
Rankings of Similarity Methods Based on Kendall W Test Results using MDDR (DS1–DS3), MUV and DUD datasets for Top 5%.
| Data Set | Ranking | |||
|---|---|---|---|---|
| DS1 | 0.525 | 23.127 | 0.0001 | SQB(C./T1) > SQB(C./T2) = TAN > SQB(C./T3) > SQB(R.) |
| DS2 | 0.738 | 29.535 | 0.000006 | SQB(C./T2) > SQB(C./T1) > SQB(C./T3) > SQB(R.) = TAN |
| DS3 | 0.378 | 15.120 | 0.004 | TAN > SQB(C./T3) > SQB(C./T2) > SQB(R.) > SQB(C./T1) |
| MUV | 0.155 | 10.588 | 0.031 | SQB(C./T3) = SQB(C./T2) > SQB(R.) > TAN > SQB(C./T1) |
| DUD | 0.478 | 22.961 | 0.0001 | SQB(C./T3) > SQB(C./T2) > SQB(C./T1) > TAN > SQB(R.) |