| Literature DB >> 20037730 |
Yoshifumi Fukunishi1, Haruki Nakamura.
Abstract
A molecular similarity measure has been developed using molecular topological graphs and atomic partial charges. Two kinds of topological graphs were used. One is the ordinary adjacency matrix and the other is a matrix which represents the minimum path length between two atoms of the molecule. The ordinary adjacency matrix is suitable to compare the local structures of molecules such as functional groups, and the other matrix is suitable to compare the global structures of molecules. The combination of these two matrices gave a similarity measure. This method was applied to in silico drug screening, and the results showed that it was effective as a similarity measure.Entities:
Mesh:
Year: 2009 PMID: 20037730 PMCID: PMC2796334 DOI: 10.1155/2009/231780
Source DB: PubMed Journal: J Biomed Biotechnol ISSN: 1110-7243
Figure 1Example of the matrices E and D.
The average q values and the hit ratio with optimized coefficients for various λ values. The coefficient1 is the c parameter in (4) for the E matrix and the coefficient2 is the c parameter in (4) for the D matrix. The hit ratio is the % of the active compounds found within the first 1% of selected compounds of the database. The average and σ are the average value and the standard deviation of the scores. ∗: the top-ranked compounds when diphenhydramine is the template.
| 0 | 0.25 | 0.5 | 0.75 | 1 | |
|---|---|---|---|---|---|
| Hit ratio | 28.86% | 36.48% | 35.76% | 35.69% | 26.93% |
| 80.14 | 82.53 | 80.37 | 82.09 | 72.72 | |
| Coefficient1 | 0.002 | 0.01 | 0.02 | 0.01 | 0 |
| Coeffcient2 | 0 | 0.00005 | 0.00005 | 0.00001 | 0.0001 |
| Average | 0.100 ∗ 10−4 | 3.61 ∗ 10−4 | 7.13 ∗ 10−4 | 10.64 ∗ 10−4 | 14.163 ∗ 10−4 |
| 7.50 ∗ 10−6 | 1.62 ∗ 10−4 | 3.24 ∗ 10−4 | 4.87 ∗ 10−4 | 6.50 ∗ 10−4 | |
| 1st compound* | |||||
| 1.321 | 2.218 | 2.189 | 2.179 | 2.174 | |
| Score | 0.00008295 | 0.001723 | 0.001944 | 0.002164 | 0.002383 |
| 2nd compound* | |||||
| 1.315 | 2.202 | 2.179 | 2.171 | 2.167 | |
| Score | 0.0001266 | 0.004425 | 0.005333 | 0.006242 | 0.00715 |
| 3rd compound* | |||||
| 1.314 | 2.192 | 2.169 | 2.161 | 2.157 | |
| Score | 0.0001338 | 0.005944 | 0.008402 | 0.01085 | 0.01331 |
Figure 2Distribution of the values of Dist. The template is diphenhydramine. The Dist values are multiplied by 10000 times and the frequency is normalized.
Figure 4Histamine H1 receptor antagonists.
Figure 3Average database enrichment curves of 160 active compounds with the current similarity measure. Open circles represent the average database enrichments with the distance measure defined by (6) (MGD) with λ = 0.25. The dashed line represents the random screening result.
Figure 5The three top-ranked compounds by the MGD method for the template: diphenhydramine. (a) The result with λ = 0.25. (b) The result with λ = 0. (c) The result with λ = 1.