| Literature DB >> 24883114 |
Ali Ahmed1, Faisal Saeed2, Naomie Salim2, Ammar Abdo3.
Abstract
BACKGROUND: It is known that any individual similarity measure will not always give the best recall of active molecule structure for all types of activity classes. Recently, the effectiveness of ligand-based virtual screening approaches can be enhanced by using data fusion. Data fusion can be implemented using two different approaches: group fusion and similarity fusion. Similarity fusion involves searching using multiple similarity measures. The similarity scores, or ranking, for each similarity measure are combined to obtain the final ranking of the compounds in the database.Entities:
Keywords: Data fusion; Similarity coefficients; Similarity searching; Virtual screening
Year: 2014 PMID: 24883114 PMCID: PMC4026830 DOI: 10.1186/1758-2946-6-19
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
explanation example on electing winner measure based achieved votes or points
| 11 | 9 | 10 | 1 | 3 | 7 | 6 | 4 | 8 | 2 | 5 | |
| 9 | 8 | 11 | 4 | 5 | 3 | 1 | 2 | 7 | 6 | 10 | |
| 7 | 3 | 9 | 1 | 8 | 4 | 11 | 6 | 2 | 5 | 10 | |
MDDR1 structure activity classes
| 07707 | Adenosine (A1) agonists | 207 | 0.424 |
| 07708 | Adenosine (A2) agonists | 156 | 0.484 |
| 31420 | Renin inhibitors 1 | 1300 | 0.584 |
| 42710 | CCK agonists | 111 | 0.596 |
| 64100 | Monocyclic -lactams | 1346 | 0.512 |
| 64200 | Cephalosporins | 113 | 0.503 |
| 64220 | Carbacephems | 1051 | 0.414 |
| 64500 | Carbapenems | 126 | 0.444 |
| 64350 | Tribactams | 388 | 0.673 |
| 75755 | Vitamin D analogous | 455 | 0.569 |
MDDR2 structure activity classes
| 09249 | Muscarinic (M1) agonists | 900 | 0.257 |
| 12455 | NMDA receptor antagonists | 1400 | 0.311 |
| 12464 | Nitric oxide synthase inhibitors | 505 | 0.237 |
| 31281 | Dopamine-hydroxylase inhibitors | 106 | 0.324 |
| 43210 | Aldose reductase inhibitors | 957 | 0.370 |
| 71522 | Reverse transcriptase inhibitors | 700 | 0.311 |
| 75721 | Aromatase inhibitors | 636 | 0.318 |
| 78331 | Cyclooxygenase inhibitors | 636 | 0.382 |
| 78348 | Phospholipase A2 inhibitors | 617 | 0.291 |
| 78351 | Lipoxygenase inhibitors | 2111 | 0.365 |
MUV structure activity classes
| 466 | S1P1 rec. (agonists) | 0.445 |
| 548 | PKA (inhibitors) | 0.430 |
| 600 | SF1 (inhibitors) | 0.445 |
| 644 | Rho-Kinase2 (inhibitors) | 0.416 |
| 652 | HIV RT-RNase (inhibitors) | 0.398 |
| 689 | Eph rec. A4 (inhibitors) | 0.449 |
| 692 | SF1 (agonists) | 0.365 |
| 712 | HSP 90 (inhibitors) 30 | 0.413 |
| 713 | ER-a-Coact. Bind. (inhibitors) | 0.389 |
| 733 | ER-b-Coact. Bind. (inhibitors) | 0.352 |
| 737 | ER-a-Coact. Bind. (potentiators) | 0.502 |
| 810 | FAK (inhibitors) | 0.425 |
| 832 | Cathepsin G (inhibitors) | 0.435 |
| 846 | FXIa (inhibitors) | 0.532 |
| 852 | FXIIa (inhibitors) | 0.492 |
| 858 | D1 rec. (allosteric modulators) | 0.400 |
| 859 | M1 rec. (allosteric inhibitors) | 0.386 |
Number of active and inactive compounds for twelve DUD sub datasets, where N : number of active compounds, N : number of decoys
| 1 | FGFR1 | 120 | 4550 |
| 2 | FXA | 146 | 5745 |
| 3 | GART | 40 | 879 |
| 4 | GPB | 52 | 2140 |
| 5 | GR | 78 | 2947 |
| 6 | HIVPR | 62 | 2038 |
| 7 | HIVRT | 43 | 1519 |
| 8 | HMGA | 35 | 1480 |
| 9 | HSP90 | 37 | 979 |
| 10 | MR | 15 | 636 |
| 11 | NA | 49 | 1874 |
| 12 | PR | 27 | 1041 |
| 704 | 25828 | ||
Retrieval results of top 5% for data set MDDR1
| 07707 | 71.56 | 87.72 | 85.62 | 86.03 | 85.67 | |
| 07708 | 56.96 | 80.46 | 92.15 | 92.25 | 96.85 | |
| 31420 | 88.89 | 89.07 | 88.61 | 89.39 | 89.45 | |
| 42710 | 88.65 | 90.03 | 89.66 | 89.85 | 90.06 | |
| 64100 | 94.85 | 94.19 | 94.29 | 99.31 | 99.26 | |
| 64200 | 78.41 | 91.69 | 91.26 | 91.62 | 96.94 | |
| 64220 | 53.40 | 70.11 | 62.54 | 81.61 | 89.75 | |
| 64500 | 45.28 | 74.17 | 74.72 | 89.06 | 88.64 | |
| 64350 | 92.27 | 92.52 | 99.95 | 97.82 | 98.28 | |
| 75755 | 95.27 | 95.52 | 96.61 | 97.55 | 95.16 | |
| 76.55 | 86.55 | 87.54 | 91.45 | |||
| 45.28 | 70.11 | 62.54 | 81.61 | 85.67 | ||
| 95.27 | 95.52 | 99.95 | 99.31 | 99.26 | ||
| 0 | 1 | 1 | 3 | 5 | ||
Retrieval results of top 5% for data set MDDR2
| 09249 | 24.64 | 23.88 | 24.01 | 24.64 | 23.89 | |
| 12455 | 11.24 | 21.26 | 21.40 | 13.47 | 14.17 | |
| 12464 | 16.25 | 23.72 | 23.60 | 23.64 | 37.29 | |
| 31281 | 30.57 | 39.70 | 32.74 | 39.54 | 50.63 | |
| 43210 | 16.82 | 18.45 | 26.49 | 29.36 | 14.17 | |
| 71522 | 13.22 | 16.16 | 15.87 | 16.12 | 16.22 | |
| 75721 | 25.81 | 28.29 | 27.89 | 48.22 | 31.36 | |
| 78331 | 16.05 | 17.80 | 18.67 | 18.33 | 24.76 | |
| 78348 | 25.16 | 26.20 | 26.29 | 20.97 | 26.92 | |
| 78351 | 12.70 | 14.46 | 14.60 | 14.85 | 15.15 | |
| 19.25 | 22.99 | 23.16 | 24.91 | |||
| 11.24 | 14.46 | 14.60 | 13.47 | 14.17 | ||
| 30.57 | 39.70 | 32.74 | 48.22 | 50.63 | ||
| 1 | 0 | 1 | 3 | 6 | ||
Retrieval results of top 5% for data set MUV
| 466 | 6.72 | 9.05 | 7.97 | 8.69 | 8.90 | |
| 548 | 20.85 | 25.43 | 26.06 | 25.82 | 26.29 | |
| 600 | 9.94 | 13.82 | 13.85 | 13.99 | 15.69 | |
| 644 | 23.07 | 22.47 | 23.27 | 22.93 | 22.85 | |
| 652 | 7.74 | 11.21 | 10.88 | 10.52 | 11.17 | |
| 689 | 12.40 | 13.35 | 14.76 | 16.39 | 12.38 | |
| 692 | 7.25 | 9.28 | 10.23 | 6.80 | 7.76 | |
| 712 | 15.63 | 13.11 | 13.42 | 13.62 | 12.94 | |
| 713 | 6.99 | 9.60 | 10.37 | 9.71 | 11.97 | |
| 733 | 9.13 | 11.60 | 11.00 | 12.75 | 13.31 | |
| 737 | 8.50 | 10.62 | 10.71 | 13.36 | 13.53 | |
| 810 | 8.51 | 9.27 | 9.37 | 9.40 | 10.93 | |
| 832 | 18.75 | 14.23 | 15.10 | 15.20 | 17.38 | |
| 846 | 24.56 | 29.64 | 30.37 | 29.88 | 29.72 | |
| 852 | 15.71 | 24.50 | 24.48 | 23.63 | 23.81 | |
| 858 | 7.38 | 8.29 | 7.53 | 8.39 | 8.47 | |
| 859 | 9.51 | 9.92 | 9.60 | 11.32 | 11.44 | |
| 12.51 | 14.43 | 14.65 | 14.85 | |||
| 6.72 | 8.29 | 7.53 | 6.80 | 7.76 | ||
| 24.56 | 29.64 | 30.37 | 29.88 | 29.72 | ||
| 2 | 3 | 3 | 1 | 8 | ||
Retrieval results of top 5% for data set DUD
| FGFR1 | 7.25 | 6.08 | 6.08 | 7.08 | 12.08 | |
| FXA | 8.70 | 9.52 | 9.52 | 9.52 | 9.52 | |
| GART | 21.25 | 21.5 | 21.5 | 21.5 | 7.25 | |
| GPB | 27.12 | 29.04 | 29.04 | 29.04 | 35 | |
| GR | 6.79 | 6.79 | 6.79 | 8.97 | 13.21 | |
| HIVPR | 12.74 | 6.45 | 6.45 | 12.9 | 6.45 | |
| HIVRT | 4.42 | 8.42 | 9.42 | 4.42 | 8.42 | |
| HMGA | 10.02 | 10.30 | 15.21 | 15.35 | 17.44 | |
| HSP90 | 9.46 | 9.46 | 9.46 | 9.46 | 8.92 | |
| MR | 8.67 | 8.67 | 8.67 | 8.67 | 8.67 | |
| NA | 6.53 | 6.53 | 6.53 | 6.73 | 10.73 | |
| PR | 5.93 | 7.70 | 9.70 | 10.7 | 12.7 | |
| 10.74 | 10.87 | 11.53 | 12.03 | |||
| 4.42 | 6.08 | 6.08 | 4.42 | 6.45 | ||
| 27.12 | 29.04 | 29.04 | 29.04 | 35.00 | ||
| 3 | 3 | 4 | 5 | 8 | ||
Rankings of similarity approaches based on Kendall W test results: MDDR1, MDDR2, MUV and DUD top 5%
| MDDR1 | 0. 594 | 0.001 | Top100 > Top50 > Top20 > Top10 > TAN |
| MDDR2 | 0.479 | 0.008 | Top100 > Top50 > Top20 > Top10 > TAN |
| MUV | 0.447 | 0.002 | Top100 > Top50 > Top20 > Top10 > TAN |
| DUD | 0.441 | 0.009 | Top100 > Top50 > Top20 > Top10 > TAN |
Figure 1Performance with 95% confidence bound for the three screening methods with a) MDDR1, b) MDDR2, c) MUV and d) DUD data sets at top 5%
Figure 2ROC curves and AUCs at 5% cutoff of MDDR1 data set.
Rankings of similarity approaches based on friedman’s test results: MDDR1, MDDR2, MUV and DUD top 5%
| 0.001 | 1.40 | 2.90 | 2.60 | 4.00 | 4.10 | |
| 0.011 | 1.55 | 3.00 | 3.10 | 3.35 | 4.00 | |
| 0.002 | 1.76 | 2.76 | 3.29 | 3.35 | 3.82 | |
| 0.017 | 2.17 | 2.67 | 2.96 | 3.54 | 3.67 | |
Enrichment values of (BEDROC α = 20) and (EF 1%) using our proposed method on MDDR1, MDDR2, MUV and DUD data sets
| | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.49 | 0.47 | 80.52 | 84.99 | 0.30 | 0.35 | 23.25 | 22.98 | 0.38 | 0.36 | 16.94 | 16.87 | 0.17 | 0.15 | 20.87 | 19.55 | |
| 0.50 | 0.54 | 81.85 | 89.10 | 0.32 | 0.36 | 24.85 | 23.11 | 0.40 | 0.37 | 17.99 | 16.64 | 0.19 | 0.18 | 21.67 | 20.83 | |
| 0.52 | 0.49 | 86.31 | 89.10 | 0.35 | 0.36 | 23.99 | 23.79 | 0.41 | 0.35 | 18.70 | 16.90 | 0.21 | 0.16 | 22.59 | 20.74 | |
| 0.55 | 0.47 | 86.99 | 88.55 | 0.36 | 0.34 | 24.72 | 21.45 | 0.42 | 0.39 | 17.72 | 17.01 | 0.29 | 0.28 | 23.22 | 21.05 | |
| 0.63 | 0.62 | 91.17 | 90.05 | 0.47 | 0.51 | 27.87 | 23.99 | 0.45 | 0.40 | 19.94 | 18.64 | 0.35 | 0.30 | 25.61 | 22.14 | |