| Literature DB >> 31117509 |
Jean-Paul Ebejer1, Paul W Finn2,3, Wing Ki Wong4, Charlotte M Deane4, Garrett M Morris4.
Abstract
We present Ligity, a hybrid ligand-structure-based, non-superpositional method for virtual screening of large databases of small molecules. Ligity uses the relative spatial distribution of pharmacophoric interaction points (PIPs) derived from the conformations of small molecules. These are compared with the PIPs derived from key interaction features found in protein-ligand complexes and are used to prioritize likely binders. We investigated the effect of generating PIPs using the single lowest energy conformer versus an ensemble of conformers for each screened ligand, using different bin sizes for the distance between two features, utilizing triangular sets of pharmacophoric features (3-PIPs) versus chiral tetrahedral sets (4-PIPs), fusing data for targets with multiple protein-ligand complex structures, and applying different similarity measures. Ligity was benchmarked using the Directory of Useful Decoys-Enhanced (DUD-E). Optimal results were obtained using the tetrahedral PIPs derived from an ensemble of bound ligand conformers and a bin size of 1.5 Å, which are used as the default settings for Ligity. The high-throughput screening mode of Ligity, using only the lowest-energy conformer of each ligand, was used for benchmarking against the whole of the DUD-E, and a more resource-intensive, "information-rich" mode of Ligity, using a conformational ensemble of each ligand, were used for a representative subset of 10 targets. Against the full DUD-E database, mean area under the receiver operating characteristic curve (AUC) values ranged from 0.44 to 0.99, while for the representative subset they ranged from 0.61 to 0.86. Data fusion further improved Ligity's performance, with mean AUC values ranging from 0.64 to 0.95. Ligity is very efficient compared to a protein-ligand docking method such as AutoDock Vina: if the time taken for the precalculation of Ligity descriptors is included in the comparason, then Ligity is about 20 times faster than docking. A direct comparison of the virtual screening steps shows Ligity to be over 5000 times faster. Ligity highly ranks the lowest-energy conformers of DUD-E actives, in a statistically significant manner, behavior that is not observed for DUD-E decoys. Thus, our results suggest that active compounds tend to bind in relatively low-energy conformations compared to decoys. This may be because actives-and thus their lowest-energy conformations-have been optimized for conformational complementarity with their cognate binding sites.Entities:
Year: 2019 PMID: 31117509 PMCID: PMC7007185 DOI: 10.1021/acs.jcim.8b00779
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 4.956
Figure 1Ligity algorithm. Note that some processes, ionization and generate PIPs, are repeated because they have different implementations depending on whether their input is a 3D protein–ligand complex or a 2D SMILES molecule. Conformer generation uses the protocol published by Ebejer et al.[49]
Pharmacophoric Features and Distance Thresholds Used To Define Queries in Ligity
| interacting receptor–ligand PIP pairs | distance threshold (Å) |
|---|---|
| (hydrophobic, hydrophobic) | 4.5 |
| (acceptor, donor) | 3.9 |
| (cation, anion) | 4.0 |
| (aromatic, aromatic) | 4.5 |
| (cation, aromatic) | 4.0 |
Figure 2Relationship between PIPs and the Ligity descriptor. The lengths of the edges of each triangle are mapped to their corresponding indices in the array of triangle counts. Shown here is a 3-PIP triangle, with features ⟨HBA, HBA, hydrophobic⟩ and edge lengths of ⟨5.8, 6.1, 4.2⟩ mapping onto a Ligity descriptor that uses an edge-length bin size of 1.0 Å. The blue-bin count will be incremented.
Targets Used To Identify Optimal Parameter Values and Data Structures for Ligity, along with Their sc-PDB Cluster IDs, PDB IDs, and the Number of Conformers, n, for the Active and Decoy Setsa
| PDB ID | ||||||
|---|---|---|---|---|---|---|
| target | sc-PDB cluster ID | site 1 | site 2 | site 3 | for actives | for decoys |
| adenosine deaminase (ADA) | 0085 | 1ndv | 2e1w | 3km8 | 6002 | 6394 |
| cyclin-dependent kinase 2 (CDK2) | 1424 | 1pxm | 2bts | 2c6m | 6839 | 6697 |
| trypsin 1 (TRY1) | 1463 | 1bjv | 1o3o | 3m35 | 4647 | 4956 |
Three structures of each target were chosen so as to capture the protein flexibility. Each receptor has a randomly selected subset of 100 active and 100 decoy molecules taken from DUD-E, except ADA, which has only 93 actives.
DUD-E Target Datasets Used To Benchmark Ligity, along with Their sc-PDB Clusters, and the Number of Actives and Decoys, with the Number of Conformers, n, Given in Parentheses
| target | sc-PDB cluster ID | number of sc-PDB structures | number of
actives ( | number of
decoys ( |
|---|---|---|---|---|
| angiotensin-converting enzyme (ACE) | 0132 | 9 | 282 (27 346) | 16 900 (1 307 531) |
| adenosine deaminase (ADA) | 0085 | 20 | 93 (6 720) | 5 450 (371 990) |
| cyclin-dependent kinase 2 (CDK2) | 1424 | 109 | 474 (20 480) | 27 850 (1 360 619) |
| coagulation factor X (FA10) | 0224 | 81 | 537 (39 732) | 28 325 (1 799 269) |
| coagulation factor VII (FA7) | 0223 | 15 | 114 (9 759) | 6 250 (398 145) |
| glucocorticoid receptor (GCR) | 0367 | 7 | 258 (8 682) | 14 999 (640 882) |
| human immunodeficiency virus | ||||
| type 1 integrase (HIVINT) | 1167 | 3 | 98 (5 096) | 6 650 (327 474) |
| human immunodeficiency virus | ||||
| type 1 protease (HIVPR) | 0654 | 166 | 535 (27 975) | 35 750 (2 189 091) |
| thrombin (THRB) | 0830 | 113 | 461 (34 936) | 27 004 (2 020 395) |
| trypsin I (TRY1) | 0850 | 74 | 449 (30 311) | 25 980 (1 706 265) |
Effect of Different Similarity Measures on Ligity’s ROC AUC Performance Using the Parameter Optimizaion Dataseta
| AUC | ||||||||
|---|---|---|---|---|---|---|---|---|
| Tversky (1) | Tversky (2) | Tversky (3) | Tversky (4) | |||||
| query | α = 1 | α = 0.95 | α = 0.9 | α = 0.85 | ||||
| target | PDB ID | β = 0 | β = 0.05 | β = 0.1 | β = 0.15 | Tanimoto | Cosine | Counts |
| ADA | 1ndv | 0.791 | 0.814 | 0.831 | 0.799 | 0.810 | 0.791 | |
| 2e1w | 0.904 | 0.911 | 0.919 | 0.909 | 0.912 | 0.904 | ||
| 3km8 | 0.790 | 0.784 | 0.770 | 0.672 | 0.714 | |||
| mean | 0.830 | 0.838 | 0.793 | 0.822 | 0.830 | |||
| fusion | 0.893 | 0.913 | 0.925 | 0.912 | 0.926 | 0.913 | ||
| CDK2 | 1pxm | 0.581 | 0.541 | 0.516 | 0.508 | 0.530 | ||
| 2bts | 0.700 | 0.669 | 0.634 | 0.539 | 0.593 | 0.700 | ||
| 2c6m | 0.553 | 0.539 | 0.522 | 0.513 | 0.481 | |||
| mean | 0.611 | 0.583 | 0.557 | 0.520 | 0.535 | |||
| fusion | 0.575 | 0.532 | 0.508 | 0.488 | 0.503 | 0.576 | ||
| TRY1 | 1bjv | 0.511 | 0.479 | 0.467 | 0.412 | 0.557 | ||
| 1o3o | 0.456 | 0.394 | 0.363 | 0.300 | 0.417 | |||
| 3m35 | 0.709 | 0.663 | 0.615 | 0.483 | 0.687 | |||
| mean | 0.559 | 0.512 | 0.482 | 0.398 | 0.554 | |||
| fusion | 0.712 | 0.662 | 0.611 | 0.472 | 0.687 | 0.728 | ||
The best AUC in each row is shown in bold; it can be seen that Tversky with α = 1 and β = 0 performed best. Note that Dice similarity calculations (not shown) gave identical results to Tanimoto. The mean AUC across all three cognate structure queries for each target is shown, as well as the AUC of the MAX-SIM fused results over the three individual queries.
Figure 3Ligity showed little difference when using only the lowest-energy conformer of the active or decoy molecule (left column). Using the lowest-energy conformer when fusing the results also exhibited a minimal effect (right column). The performance of the method using only the lowest-energy conformer is shown with dashed lines, while the performance using the full conformer set is shown with solid lines.
Figure 4Ligity preferentially selects lower-energy conformers for actives but not for decoys: (a) lower conformer identifiers are selected for actives, rather than what one would expect at random, and (b) there is no difference between conformer identifiers selected for decoys and ones selected at random.
Performance of Ligity in HTS Mode against the Ligity-Compatible DUD-E Targetsa
| ROC AUC | BEDROC | EF1% | ||||||
|---|---|---|---|---|---|---|---|---|
| target | no. of actives | no. of decoys | Tanimoto | Tversky | Tanimoto | Tversky | Tanimoto | Tversky |
| ABL1 | 182 | 10 750 | 0.563 | 0.473 | 0.077 | 0.077 | 1.653 | 2.204 |
| ACE | 281 | 16 877 | 0.787 | 0.787 | 0.336 | 0.401 | 12.425 | 19.525 |
| ACES | 453 | 26 242 | 0.634 | 0.645 | 0.077 | 0.155 | 1.766 | 5.518 |
| ADA | 93 | 5 450 | 0.724 | 0.660 | 0.149 | 0.147 | 3.251 | 3.251 |
| ADA17 | 532 | 35 898 | 0.638 | 0.728 | 0.103 | 0.283 | 1.317 | 9.030 |
| ADRB1 | 247 | 15 850 | 0.523 | 0.647 | 0.065 | 0.129 | 1.619 | 5.262 |
| ADRB2 | 231 | 14 999 | 0.523 | 0.589 | 0.052 | 0.040 | 1.735 | 0.000 |
| AKT1 | 293 | 16 450 | 0.386 | 0.548 | 0.039 | 0.107 | 2.737 | 3.080 |
| AKT2 | 117 | 6 900 | 0.511 | 0.685 | 0.140 | 0.194 | 8.568 | 8.568 |
| ALDR | 159 | 8 988 | 0.574 | 0.610 | 0.202 | 0.172 | 10.747 | 6.322 |
| AMPC | 48 | 2 845 | 0.521 | 0.541 | 0.049 | 0.023 | 0.000 | 0.000 |
| ANDR | 269 | 14 349 | 0.722 | 0.742 | 0.194 | 0.354 | 4.839 | 24.938 |
| AOFB | 121 | 6 875 | 0.422 | 0.464 | 0.045 | 0.027 | 1.652 | 0.000 |
| BACE1 | 283 | 18 100 | 0.441 | 0.775 | 0.017 | 0.310 | 0.000 | 13.062 |
| BRAF | 152 | 9 950 | 0.612 | 0.639 | 0.208 | 0.165 | 12.502 | 5.264 |
| CASP3 | 199 | 10 694 | 0.600 | 0.734 | 0.068 | 0.258 | 0.502 | 7.031 |
| CDK2 | 474 | 27 838 | 0.467 | 0.507 | 0.021 | 0.048 | 0.000 | 1.055 |
| COMT | 41 | 3 846 | 0.789 | 0.889 | 0.338 | 0.665 | 19.447 | 58.341 |
| CP2C9 | 120 | 7 449 | 0.518 | 0.634 | 0.058 | 0.186 | 1.660 | 8.299 |
| CP3A4 | 170 | 11 787 | 0.450 | 0.493 | 0.022 | 0.057 | 0.000 | 2.345 |
| CSF1R | 166 | 12 149 | 0.526 | 0.542 | 0.136 | 0.152 | 6.031 | 7.238 |
| CXCR4 | 40 | 3 405 | 0.575 | 0.722 | 0.217 | 0.134 | 12.665 | 0.000 |
| DEF | 102 | 5 699 | 0.732 | 0.833 | 0.212 | 0.379 | 10.786 | 15.689 |
| DHI1 | 330 | 19 348 | 0.481 | 0.595 | 0.089 | 0.062 | 2.422 | 1.211 |
| DPP4 | 533 | 40 941 | 0.586 | 0.591 | 0.154 | 0.157 | 4.312 | 3.937 |
| DRD3 | 480 | 34 048 | 0.484 | 0.441 | 0.043 | 0.046 | 1.251 | 0.626 |
| DYR | 231 | 17 196 | 0.694 | 0.758 | 0.210 | 0.230 | 6.504 | 7.371 |
| EGFR | 542 | 35 047 | 0.593 | 0.491 | 0.054 | 0.037 | 0.922 | 0.000 |
| ESR1 | 383 | 20 683 | 0.838 | 0.861 | 0.527 | 0.594 | 31.281 | 39.101 |
| ESR2 | 367 | 20 199 | 0.844 | 0.870 | 0.563 | 0.644 | 20.130 | 32.644 |
| FA10 | 537 | 28 324 | 0.564 | 0.674 | 0.058 | 0.118 | 0.930 | 2.232 |
| FA7 | 114 | 6 249 | 0.762 | 0.859 | 0.210 | 0.332 | 6.105 | 8.721 |
| FABP4 | 47 | 2 749 | 0.786 | 0.744 | 0.191 | 0.276 | 0.000 | 10.623 |
| FAK1 | 100 | 5 350 | 0.642 | 0.531 | 0.111 | 0.065 | 2.019 | 0.000 |
| FGFR1 | 139 | 8 698 | 0.511 | 0.522 | 0.036 | 0.088 | 0.722 | 1.445 |
| FKB1A | 111 | 5 799 | 0.605 | 0.751 | 0.162 | 0.164 | 8.122 | 3.610 |
| FNTA | 592 | 51 493 | 0.411 | 0.625 | 0.012 | 0.132 | 0.000 | 4.053 |
| FPPS | 85 | 8 842 | 0.917 | 0.985 | 0.323 | 0.776 | 2.360 | 36.581 |
| GCR | 258 | 14 998 | 0.805 | 0.834 | 0.244 | 0.324 | 3.092 | 8.116 |
| GLCM | 54 | 3 790 | 0.667 | 0.685 | 0.182 | 0.279 | 1.873 | 11.240 |
| GRIA2 | 158 | 11 842 | 0.662 | 0.684 | 0.248 | 0.154 | 11.392 | 5.696 |
| GRIK1 | 101 | 6 547 | 0.656 | 0.668 | 0.203 | 0.102 | 7.978 | 1.995 |
| HDAC2 | 185 | 10 300 | 0.676 | 0.734 | 0.187 | 0.201 | 4.318 | 4.318 |
| HDAC8 | 170 | 10 449 | 0.640 | 0.819 | 0.120 | 0.377 | 2.946 | 8.250 |
| HIVINT | 100 | 6 640 | 0.390 | 0.554 | 0.030 | 0.116 | 0.000 | 3.018 |
| HIVPR | 535 | 35 724 | 0.663 | 0.872 | 0.072 | 0.490 | 0.187 | 23.898 |
| HIVRT | 338 | 18 884 | 0.495 | 0.475 | 0.124 | 0.085 | 4.443 | 1.777 |
| HMDH | 170 | 8 750 | 0.480 | 0.906 | 0.068 | 0.652 | 2.358 | 35.963 |
| HS90A | 88 | 4 850 | 0.635 | 0.506 | 0.096 | 0.083 | 0.000 | 3.436 |
| HXK4 | 92 | 4 700 | 0.662 | 0.803 | 0.206 | 0.307 | 15.192 | 9.766 |
| IGF1R | 148 | 9 300 | 0.502 | 0.575 | 0.057 | 0.189 | 2.037 | 14.941 |
| INHA | 43 | 2 300 | 0.493 | 0.575 | 0.031 | 0.045 | 0.000 | 0.000 |
| ITAL | 138 | 8 500 | 0.619 | 0.465 | 0.037 | 0.065 | 0.000 | 0.728 |
| JAK2 | 107 | 6 500 | 0.472 | 0.475 | 0.073 | 0.118 | 2.807 | 6.549 |
| KIF11 | 116 | 6 850 | 0.755 | 0.781 | 0.149 | 0.219 | 4.289 | 2.574 |
| KIT | 166 | 10 449 | 0.463 | 0.437 | 0.045 | 0.030 | 0.000 | 0.000 |
| KITH | 57 | 2 850 | 0.649 | 0.838 | 0.228 | 0.709 | 14.069 | 47.483 |
| KPCB | 135 | 8 699 | 0.753 | 0.813 | 0.220 | 0.338 | 8.923 | 12.641 |
| LCK | 419 | 27 391 | 0.471 | 0.437 | 0.031 | 0.043 | 0.000 | 1.910 |
| LKHA4 | 171 | 9 448 | 0.718 | 0.694 | 0.238 | 0.150 | 8.203 | 1.758 |
| MAPK2 | 101 | 6 148 | 0.660 | 0.670 | 0.174 | 0.199 | 5.988 | 3.992 |
| MCR | 94 | 5 149 | 0.816 | 0.888 | 0.215 | 0.454 | 6.436 | 19.307 |
| MET | 166 | 11 249 | 0.566 | 0.531 | 0.130 | 0.065 | 6.032 | 0.603 |
| MK01 | 79 | 4 550 | 0.518 | 0.602 | 0.121 | 0.206 | 5.095 | 3.821 |
| MK10 | 104 | 6 600 | 0.488 | 0.489 | 0.020 | 0.031 | 0.962 | 0.962 |
| MK14 | 578 | 35 847 | 0.511 | 0.589 | 0.040 | 0.064 | 0.173 | 0.519 |
| MMP13 | 572 | 37 199 | 0.648 | 0.753 | 0.134 | 0.268 | 2.446 | 9.957 |
| MP2K1 | 121 | 8 146 | 0.669 | 0.569 | 0.187 | 0.058 | 3.293 | 0.823 |
| NOS1 | 98 | 8 028 | 0.483 | 0.451 | 0.109 | 0.041 | 3.071 | 0.000 |
| NRAM | 98 | 6 200 | 0.853 | 0.859 | 0.342 | 0.290 | 11.221 | 3.060 |
| PA2GA | 99 | 5 150 | 0.793 | 0.756 | 0.225 | 0.153 | 1.020 | 3.059 |
| PARP1 | 508 | 30 029 | 0.635 | 0.692 | 0.215 | 0.231 | 11.234 | 7.884 |
| PGH1 | 195 | 10 798 | 0.645 | 0.637 | 0.077 | 0.100 | 0.000 | 2.050 |
| PGH2 | 435 | 23 139 | 0.716 | 0.780 | 0.166 | 0.291 | 3.444 | 9.874 |
| PLK1 | 107 | 6 800 | 0.658 | 0.531 | 0.123 | 0.048 | 1.871 | 0.000 |
| PNPH | 103 | 6 946 | 0.575 | 0.578 | 0.161 | 0.181 | 4.888 | 8.799 |
| PPARA | 373 | 19 399 | 0.783 | 0.778 | 0.262 | 0.280 | 6.693 | 7.764 |
| PPARD | 240 | 12 250 | 0.547 | 0.544 | 0.078 | 0.098 | 1.665 | 2.498 |
| PPARG | 484 | 25 299 | 0.515 | 0.605 | 0.055 | 0.118 | 0.619 | 4.955 |
| PRGR | 293 | 15 648 | 0.740 | 0.793 | 0.142 | 0.318 | 2.053 | 14.714 |
| PTN1 | 130 | 7 249 | 0.398 | 0.538 | 0.055 | 0.090 | 0.000 | 3.068 |
| PUR2 | 50 | 2 700 | 0.851 | 0.837 | 0.281 | 0.255 | 7.857 | 1.964 |
| PYGM | 77 | 3 944 | 0.403 | 0.492 | 0.016 | 0.137 | 0.000 | 3.917 |
| PYRD | 111 | 6 449 | 0.682 | 0.710 | 0.462 | 0.413 | 34.027 | 16.118 |
| RENI | 104 | 6 956 | 0.720 | 0.789 | 0.043 | 0.138 | 0.000 | 0.000 |
| ROCK1 | 100 | 6 300 | 0.347 | 0.449 | 0.020 | 0.084 | 1.000 | 4.000 |
| RXRA | 131 | 6 950 | 0.788 | 0.900 | 0.219 | 0.596 | 6.091 | 27.407 |
| SAHH | 63 | 3 450 | 0.874 | 0.852 | 0.598 | 0.542 | 35.050 | 27.084 |
| SRC | 524 | 34 500 | 0.565 | 0.477 | 0.065 | 0.050 | 0.382 | 0.573 |
| TGFR1 | 133 | 8 499 | 0.609 | 0.639 | 0.147 | 0.154 | 10.565 | 4.528 |
| THB | 103 | 7 450 | 0.794 | 0.762 | 0.238 | 0.150 | 10.614 | 0.965 |
| THRB | 461 | 27 000 | 0.605 | 0.706 | 0.063 | 0.166 | 2.166 | 5.632 |
| TRY1 | 449 | 25 975 | 0.711 | 0.815 | 0.147 | 0.280 | 2.898 | 6.688 |
| TRYB1 | 148 | 7 650 | 0.670 | 0.670 | 0.153 | 0.132 | 3.378 | 3.378 |
| TYSY | 109 | 6 745 | 0.594 | 0.725 | 0.071 | 0.226 | 0.911 | 5.468 |
| UROK | 162 | 9 850 | 0.525 | 0.650 | 0.036 | 0.120 | 0.000 | 1.854 |
| VGFR2 | 409 | 24 948 | 0.632 | 0.578 | 0.083 | 0.093 | 1.465 | 1.465 |
| WEE1 | 102 | 6 150 | 0.934 | 0.929 | 0.789 | 0.797 | 59.348 | 61.294 |
| XIAP | 100 | 5 150 | 0.752 | 0.974 | 0.190 | 0.897 | 8.077 | 51.490 |
The mean (and standard deviation in parentheses) values of ROC AUC using Tanimoto is 0.622 (±0.132), while for Tversky it is 0.671 (±0.142); the mean EF1% using Tanimoto is 5.648 (±8.668), while for EF1% using Tversky it is 9.047 (±12.713).
Ligity Results in Information-Rich Mode Using the DUD-E Validation Subseta
| receptor | mean AUC (±σ) | mean BEDROC (±σ) | fusion AUC | fusion BEDROC |
|---|---|---|---|---|
| angiotensin-converting enzyme (ACE) | 0.779 (±0.070) | 0.424 (±0.181) | 0.776 | |
| adenosine deaminase (ADA) | 0.811 (±0.068) | 0.302 (±0.102) | 0.557 | |
| cyclin-dependent kinase 2 (CDK2) | 0.610 (±0.035) | 0.081 (±0.047) | 0.062 | |
| coagulation factor X (FA10) | 0.700 (±0.050) | 0.195 (±0.079) | 0.716 | 0.208 |
| coagulation factor VII (FA7) | 0.750 (±0.026) | 0.277 (±0.540) | 0.270 | |
| glucocorticoid receptor (GCR) | 0.790 (±0.094) | 0.300 (±0.116) | 0.439 | |
| human immunodeficiency virus | ||||
| type 1 integrase (HIVINT) | 0.173 (±0.068) | 0.637 | 0.139 | |
| human immunodeficiency virus | ||||
| type 1 protease (HIVPR) | 0.874 (±0.018) | 0.584 (±0.057) | 0.527 | |
| thrombin (THRB) | 0.747 (±0.035) | 0.220 (±0.079) | 0.185 | |
| trypsin I (TRY1) | 0.725 (±0.060) | 0.171 (±0.076) | 0.167 | |
| mean across all receptors | 0.745 (±0.050) | 0.273 (±0.135) | 0.333 |
It can be seen that Ligity’s mean ROC AUC is moderate to excellent across all targets and generally improves when using data fusion. Standard deviation values, σ, are shown in parentheses. The best ROC AUC for each target is in italic.
ROC AUC Comparison of Methods for a Queries Using Ligity in “Information-Rich Mode”a)
| DUD-E target | PDB ID | ElectroShape | DOCK | Ligity | Ligity fused |
|---|---|---|---|---|---|
| ACE | 3bkl | 0.452 | 0.716 | ||
| ADA | 2e1w | 0.714 | 0.764 | ||
| CDK2 | 1h00 | 0.433 | (0.610) | 0.644 | |
| FA10 | 3kl6 | 0.664 | 0.716 | 0.717 | |
| FA7 | 1w7x | 0.822 | 0.762 | 0.809 | |
| GCR | 3bqd | 0.521 | 0.439 | ||
| HIVINT | 3nf7 | 0.578 | 0.642 | 0.637 | |
| HIVPR | 1xl2 | 0.495 | 0.596 | ||
| THRB | 1ype | 0.646 | 0.709 | 0.752 | |
| TRY1 | 2ayw | 0.320 | (0.725) | 0.778 | |
| mean AUC | 0.565 | 0.744 |
For those cases where the specific PDB ID was not present in the corresponding sc-PDB cluster—CDK2 and TRY1—we use all the sc-PDB query descriptors and report the mean AUC in parentheses. Entries with the best AUC among the methods that used only one structure for comparison—DOCK and Ligity—are highlighted in bold. The best AUC for each target is in italic. Ligity with a single active structure does better than all other methods for 4 out of the 10 DUD-E target classes, and 9 times out of 10 Ligity is better than the other non-superpositional method, ElectroShape. Note that when using more than one protein–ligand complex and fusing the results, with the exception of HIVINT, Ligity does even better (“Ligity fused”) than when using only one complex (“Ligity”).
Ligity Is about 4–5 Orders of Magnitude Times Faster than Protein–Ligand Docking, Once Its Descriptors Have Been Precalculated for the Virtual Library Being Screeneda
| method | mode | CPU time (s) | relative speed-up |
|---|---|---|---|
| AutoDock Vina | flexible ligand | 12586.8 | 1.0 |
| Ligity descriptors + VS | information-rich | 637.3 | 19.9 |
| Ligity descriptors | information-rich | 585.9 | 21.5 |
| Ligity virtual screening | information-rich | 51.4 | 244.9 |
| Ligity virtual screening | HTS | 1.9 | 6815.3 |
This is exemplified by comparing with the already very efficient AutoDock Vina with the ADA target from DUD-E. The “information-rich” mode of Ligity, with multiple conformers per molecule, used a total of 3074 conformers for the 93 actives and 2287 conformers for 100 randomly selected decoys. The “HTS” mode of Ligity used just the lowest-energy conformer in each molecule’s conformer ensemble for each active or decoy.