| Literature DB >> 20305268 |
Abstract
MOTIVATION: Exploitation of locally similar 3D patterns of physicochemical properties on the surface of a protein for detection of binding sites that may lack sequence and global structural conservation.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20305268 PMCID: PMC2859123 DOI: 10.1093/bioinformatics/btq100
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Different functional groups in proteins are assigned distinct labels (see color encoding). Subgraphs are generated from the query protein and each database protein and then used to produce product graphs which reveal the extent of superposition of any pair of subgraphs.
Fig. 2.Schematic representation of the ProBiS algorithm. (A) The query protein structure (Q) is compared in a pairwise manner with each of ∼23 000 non-redundant structures (P). (B) Proteins, represented as graphs of vertices (white dots) and edges (not shown), are divided into n overlapping subgraphs, where n equals the number of vertices and all vertices are within 15 Å of a central vertex: three subgraphs per protein are depicted here as distinctly colored encirclements. A fast distance-matrix-based filtering is applied to them to eliminate non-similar subgraphs. (C) A product graph is constructed for each similar pair of subgraphs (see color encoding in B and C). A maximum clique (thick lines) in a product graph represents the largest similarity between two compared protein subgraphs. (D) Each maximum clique produces a structural alignment of two compared proteins (the alignment shown corresponds to the middle maximum clique in C). (E) Steps A–D are repeated for each protein from the nr-PDB and the results are stored in a MySQL database. Structural similarity scores are calculated and projected on the query protein surface. Structurally similar and variable residues are colored red and blue, respectively. High-scoring residues are considered as predicted structurally similar binding sites.
Fig. 3.Schematic representation of nr-PDB database preparation, conversion to a surface representation and ProBiS results database (MySQL).
Binding sites detection with a structural similarity mapping method, evolutionary conservation mapping method and an energy-based method on a set of 39 proteins
| Method | Total no. of residues | Interface no. residues | No. of predicted residues | SP (%) | SE (%) | |
|---|---|---|---|---|---|---|
| ProBiS | 224 | 54 | 62.8 | 1.4 | 39.0 | 43.9 |
| ConSurf | 224 | 54 | 60.6 | 33.0 | 34.3 | 38.1 |
| Q-SiteFinder | 224 | 54 | 48.5 | 6.5 | 39.4 | 38.2 |
Tables with detailed results are available in the Supplementary Material.
aThe alignment scores listed in the ‘Methods’ section were used.
bConSurf uses different methods to count conserved residues. We used the Bayesian method which is enabled by default and gives the best results.
Fig. 4.Structural similarity pattern in the homodimer protein biotin carboxylase (PDB: 1bnc) and in the TATA-binding protein (TBP) (PDB: 1ytf). The proteins are represented as pink cartoon models, TATA box DNA as cyan cartoon model; the structurally similar residues are shown as yellow, orange and red stick models and the interacting residues on the opposing chains are shown as pink stick models. Hydrogen bond pattern occurring (A) between biotin carboxylase and bound ligands, biotin, ADP, Mg2+ and bicarbonate ion; (B) between the two subunits of biotin carboxylase; (C) between the TBP and the transcription factor IIA and (D) between the TBP and the TATA box DNA is shown.
Comparison of structural alignments quality of similar binding sites on 10 protein pairs with dissimilar folds
| First protein structure | Second protein structure | DaliLite | MolLoc | MultiBind | |||||
|---|---|---|---|---|---|---|---|---|---|
| PDB | Residue numbers | Ligand | PDB | Residue numbers | Ligand | RMSDs (Å) | |||
| 1addA | 262,295,181,15,214,238 | ZN | 1bmcA | 168,90,152,86,88,149 | ZN | 6.53 | 14.31 | 13.29 | 14.59 |
| 1eceA | 114,162,238,116,161,27 | BGC | 2dnjA | 212,39,134,252,7,170 | DNA | 9.91 | 12.91 | 12.48 | 8.05 |
| 1phrA | 12,129,18,19 | SO4 | 1vhrA | 124,92,130,131 | EPE | 1.50 | 8.94 | 8.10 | 5.37 |
| 1bmfA | 269,270,273,175,176 | ANP | 1aylA | 268,269,213,254,255 | ATP | 2.34 | n/a | 3.26 | 11.93 |
| 1ampA | 179,117,256,97,228 | ZN | 1alkA | 51,327,331,370,102 | ZN | 2.14 | 10.69 | 6.60 | 11.91 |
| 1ribA | 84,237,238,118,121,122 | FEO | 1vhhA | 130,148,127,135,139,142 | ZN | 4.82 | n/a | n/a | 13.19 |
| 1powA | 308,311,286 | FAD | 1inpA | 311,370,358 | n/a | 1.07 | 35.70 | 19.75 | n/a |
| 1alkA | 369,327,101,331,370,166 | ZN | 1fjmA | 64,92,208,125,173,221 | MN | 7.00 | n/a | 9.63 | 7.92 |
| 1qbaA | 844,845,847,850,849,852 | n/a | 1eurA | 176,177,179,182,181,184 | n/a | 0.27 | 46.12 | 0.30 | n/a |
| 2kauC | 134,136,219,246,320,364 | NI | 2mhrA | 54,25,106,77,73,62 | FEO | 12.68 | 25.95 | 12.41 | 12.62 |
| Mean RMSDs (Å) | 4.82 | 22.09 | 9.54 | 10.69 | |||||
aGeometry+atom type method is used; in the two cases, where ligands are not available (n/a), we restricted the comparison to the corresponding residue numbers given above.
bThe compared binding sites are defined by the surface region within 8.0 Å from small ligands with codes ZN, MN, SO4, EPE, NI and FEO; the default threshold of 4.0 Å is used elsewhere.