| Literature DB >> 20100327 |
Bing Xiong1, Jie Wu, David L Burk, Mengzhu Xue, Hualiang Jiang, Jingkang Shen.
Abstract
BACKGROUND: Genome sequencing and post-genomics projects such as structural genomics are extending the frontier of the study of sequence-structure-function relationship of genes and their products. Although many sequence/structure-based methods have been devised with the aim of deciphering this delicate relationship, there still remain large gaps in this fundamental problem, which continuously drives researchers to develop novel methods to extract relevant information from sequences and structures and to infer the functions of newly identified genes by genomics technology.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20100327 PMCID: PMC3098077 DOI: 10.1186/1471-2105-11-47
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Workflow to compute a fingerprint from a binding site structure. The atoms of the binding site residues are divided into 7 fragment types according to their properties. Distances between the centroids of each pair of fragments were then calculated and the distances are recorded and binned into the corresponding locations in the fingerprint (FP).
Figure 2Binding site residues distribution.
Figure 3Raw fingerprint scores for two simulated binding sites data sets. Here only six groups are shown. The black are scores of Dataset I. The red lines are scores of random Dataset II.
Figure 4Least square fitting of the mean and standard deviation of the raw fingerprint scores.
Figure 5ROC curves calculated from the Dataset I and II. to assess the boundaries of Z score cutoff and fix number.
Figure 6Comparison of the geometric hashing method with the fingerprint Z-score method.
Figure 7Validation the fingerprint Z-score method with SCOP and Blast Dataset.
Case studies by comparison with other servers/softwares.
| PDB1 | PDB2 | ProFunc | SuMo | SitesBase | PocketMatch | PyMol | BSSF | |||
|---|---|---|---|---|---|---|---|---|---|---|
| 112 | 100 | 83.33 | 0 | 85.25 | 85.25 | 0.231 | 61.91 | -3.87 | ||
| 112 | 76 | 93.67 | 0 | 99.91 | 88.39 | 0.141 | 85.58 | -2.90 | ||
| X | NA | 100 | 0 | 98.93 | 88.39 | 0.298 | 40.34 | -5.00 | ||
| 241 | 91 | 76.92 | 0 | 99.4 | 89.29 | 0.088 | 77.96 | -3.20 | ||
| 229 | 89 | 29.38 | 3E-28 | 93.65 | 50.17 | 0.294 | 194.09 | -4.00 | ||
| 217 | 40 | NA | NA | 56.9 | 52.29 | 1.186 | 394.22 | -1.42 | ||
| X | NA | 54.80 | 0 | 100.0 | 88.01 | 0.113 | 87.31 | -2.95 | ||
| 150 | 45 | 93.06 | 0 | 94.72 | 83.59 | 0.149 | 77.22 | -3.21 | ||
| 104.58 | X | NA | NA | 80.48 | 28.40 | 3.527 | 83.63 | -2.98 | ||
| 174 | 27 | 60 | 2.4E-40 | 96.87 | 75.41 | 0.236 | 86.52 | -2.94 | ||
| X | 58 | 86.79 | 2.5E-38 | 100 | 69.05 | 0.160 | 61.35 | -4.16 | ||
| 102 | X | 55.55 | 7.6E009 | 91.53 | 21.16 | 3.019 | 214.00 | -1.78 | ||
| X | X | 54.65 | 8.6E-27 | 74.22 | 55.56 | 0.640 | 84.86 | -3.02 | ||
| X | X | X | X | 67.58 | 63.85 | 5.875 | 149.16 | -0.66 | ||
| X | X | X | X | 58.94 | 56.01 | 5.691 | 176.40 | 0.27 | ||
| X | X | 46.81 | 2.6E-16 | 68.94 | 61.42 | 0.690 | 144.04 | -0.92 | ||