| Literature DB >> 24564209 |
Abstract
BACKGROUND:Entities:
Mesh:
Substances:
Year: 2013 PMID: 24564209 PMCID: PMC3817810 DOI: 10.1186/1471-2105-14-S18-S3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Body-bar-hinge modeling of molecules. KINARI builds a body-bar-hinge mechanical model of the input molecule. For example, in ethane, each C atom bonded to four neighbors forms a rigid body. The two bodies share a hinge along the center C-C bond.
Figure 2Hydrogen bond definition. Hydrogen bond definition. (a) A hydrogen bond forms between an electronegative acceptor atom, A, and a hydrogen atom, H, that is covalently bonded to an electronegative donor atom, D. AB is the acceptor base. (b) Hydrogen bonds are calculated using geometric parameters.
Figure 3Hydrogen bond configurations. Configurations of interest when building a mechanical model of the protein. The triangles show the bodies determined by KINARI for the mechanical model.
Prevalence of H-bonds in special configurations.
| 1htg | 1vgc | 1bbp | |
|---|---|---|---|
| total | 146 | 173 | 553 |
| bifurcated D | 1.3% | 2.3% | 5.8% |
| bifurcated A | 15% | 20% | 21% |
| trifurcated A | 0% | 1.7% | 3.8% |
| multi-base A | 4.8% | 6.4% | 6.0% |
We examined the H-bonds calculated with HBPLUS [36] on 3 example proteins from [15]: HIV-1 protease (1htg), serine protease (1vgc), and bilin binding protein (1bbp). We counted how many H-bonds were found in each configuration type shown in Figure 3, which are of special interest in mechanical modeling.
Figure 4Histograms of hydrogen bond energies, by configuration. The distributions of H-bond energies varies based on configuration. H-bonds and associated energies were calculated via HBPLUS [36] software and the Mayo Lab energy function [17] on 3 example proteins from [15]: HIV-1 protease (1htg), serine protease (1vgc), and bilin binding protein (1bbp).
Figure 5Examples of generic and non-generic body-bar-hinge frameworks. Examples of generic and non-generic body-bar-hinge frameworks (a,b,c) and associated graphs (d,e,f). (a) shows a generic body-bar-hinge framework. The bar endpoints, and the continuous sets of points along the hinge axes, are all distinct. Its associated graph, shown in (d), is completely defined. The frameworks of (b,c) contain non-generic features described in this paper: a bar-bar concurrency (b) and a bar-hinge concurrency (c). These two types of degeneracies may occur in mechanical models of proteins when modeling H-bonds or hydrophobic interactions with a bar. Using our heuristic, we build associated graphs for the non-generic frameworks (e,f). Although for these two examples the pebble game will produce the correct result, there is no guarantee for non-generic cases.
Figure 6Calculated noncovalent interactions in an alpha helix. Hydrogen bonds (green) and hydrophobic interactions (blue) computed on a section of alpha helix. In the 18 residue alpha helix, 14 hydrogen bonds, with energies ranging between -2 and -7 kcal/mol, and 51 hydrophobic interactions, with energies ranging between -0.15 and -0.2 kcal/mol, were identified. With the previous version of calculating hydrophobic interactions in KINARI v1.0, no hydrophobic interactions would be identified within the alpha helix.
Figure 7Example rigid cluster decomposition comparison. Decompositions on the same example 10-residue protein to demonstrate the cluster decomposition score. GS represents the gold standard decomposition against which predicted decompositions D1-D3 are compared. D2 and D3 are the all-rigid (100%-recall) and all-floppy (100%-precision) decompositions. D1, D2, and D3 receive B-cubed scores, respectively, of 0.65, 0.46, and 0.55. See Table 2 for calculations.
Example B-cubed scoring calculations.
| D1 | D2 | D3 | ||||
|---|---|---|---|---|---|---|
| 1 | ||||||
| 2 | ||||||
| 3 | ||||||
| 4 | ||||||
| 5 | ||||||
| 6 | ||||||
| 7 | ||||||
| 8 | ||||||
| 9 | ||||||
| 10 | ||||||
| Avg | 0.93 | 0.51 | 1.0 | 0.38 | 0.30 | 1.0 |
| F1 | 0.65 | 0.55 | 0.46 | |||
Shown are the calculations of B-cubed precision, recall, and F1-scores for the small examples shown in Figure 7. The scoring method is described in the Methods section. The B-cubed score (shown as F1) for decomposition D1 is higher than the all-rigid (D2) and all-floppy (D3) decompositions. In the all-rigid baseline, all 10 residues are placed into the same cluster (D2), resulting in 100% recall but low precision. To contrast, in the all-floppy baseline, each residue is placed in a unique cluster resulting in 100% precision but low recall.
Figure 8Results of B-cubed scoring evaluation on benchmark data set. We determined rigid cluster decompositions for each PDB file in the benchmark data set, using the 7 methods listed in Table 3. The data set comes from publications by two different software for performing rigid cluster decompositions. The MSU-FIRST portion of the data set consists of 4 proteins used to validate the MSU-FIRST software [1,5]. The RigidFinder portion of the data set is categorized, from top to bottom, into large (greater than 500 residues), medium (between 200 and 500 residues) and small (fewer than 200 residues) proteins. Decomposition methods 1 through 7 are summarized in Table 3. For each method, the maximum B-cubed score and corresponding hydrogen bond and hydrophobic energy cutoffs (where relevant) are shown. See Figure 9 for barplots comparing each method against methods 2 and 3.
Decomposition methods.
| Decomposition Method | Description |
|---|---|
| 1 | All-floppy decomposition |
| 2 | All-rigid decomposition |
| 3 | KINARI v1.0, default options |
| 4 | KINARI, vary hydrogen bond energy cutoff and exclude weak hydrogen bonds |
| 5 | KINARI, vary hydrogen bond energy cutoff and model weak hydrogen bonds as bars |
| 6 | KINARI, use default options for hydrogen bonds. compute hydrophobics and assign energy with LJ-potential. Exclude weak hphobes and model the rest as bars |
| 7 | KINARI, same as Method 6, but vary the hydrogen bond energy cutoff and model the weaker hydrogen bonds as bars |
We evaluate the 6 decomposition method variants. The first three serve as baselines against which the new modeling options, proposed in this paper, are compared. The data set and results of of our evaluation of the 7 methods are listed in Figure 8.
Figure 9Baseline comparison of B-cubed scores of RigidFinder data set. Mean of differences and p-values for each decomposition method (listed in Table 3), compared with all-rigid baseline (method 2) and KINARI v1.0 (method 3). Results include only the RigidFinder portion of the data set (see Figure 8). The mean of differences measures the change in B-cubed score between the two methods; a better-performing method will have a higher associated mean of differences. The p-value indicates whether the improvement is significant (p-value≤ 0.05 is deemed significant). The greatest improvement in B-cubed scores, most significantly in the small to medium sized proteins, resulted when both the modeling of hydrogen bonds and hydrophobic interactions were varied (method 7).
Comparison of flexible loops detected by MSU-FIRST, KINARI v1.0, and RigidFinder on four proteins.
| Protein | PDB | MSU-FIRST | KINARI v1.0 | RigidFinder |
|---|---|---|---|---|
| LAO-binding 1 | 1lst | 1 | 1 | 1 |
| HIV-1 Protease 1 | 1hhp | 3 | 3 | 2 |
| Dihydrofolate Reductase 1 | 1ra1 | 2 | 2 | 2 |
| Adenylate Kinase 1 | 1aky | 6 | 6 | 3 |
A comparison of the flexible loop regions detected by KINARI v1.0, MSU-FIRST, and RigidFinder. The four proteins, with annotated flexible loops, were used in the validation of MSU-FIRST [1,5]. The table data for MSU-FIRST was taken from the same publications. For KINARI v1.0 and RigidFinder, flexible loops were determined visually. Because RigidFinder computes decompositions based on multiple conformations, the results shown will match for all conformations of the same protein.
Figure 10Rigid cluster decomposition of lysine-arginine-ornithine binding protein. All of the decompositions are depicted on 2lao. (a) LAO-binding protein is composed of two functional domains. (b,c) Decompositions for 1lst and 2lao computed via KINARI v1.0. (c) Decomposition produced by RigidFinder computing using both conformations.
Figure 11Rigid cluster decompositions of HIV-1 Protease. All of the decompositions are depicted on the 1hhp dimer. (a,b) Decompositions for 1hhp and 1htg computed via KINARI v1.0. (c) Decomposition produced by RigidFinder computing using both conformations.
Figure 12Rigid cluster decompositions of Dihydrofolate Reductase. All of the decompositions are depicted on 1ra1. (a,b,c) Decompositions for 1ra1, 1rx1, and 1rx6 computed via KINARI v1.0. (d) Decomposition produced by RigidFinder computing using 1ra1 and 1rx6 conformations.
Figure 13Rigid cluster decompositions of Adenylate Kinase. All of the decompositions are depicted on 1dvr. (a,b) Decompositions for 1aky and 1dvr computed via KINARI v1.0. (c) Decomposition produced by RigidFinder computing using both conformations.
Figure 14Case Study of Pyruvate Phosphate Dikinase. In this case study, we demonstrate how B-cubed scoring may be used to determine the parameter settings for rigidity analysis. (a) shows the RigidFinder decompositions of PPDK, which was validated against literature-annotated functional domains [6]. The KINARI v1.0 decompositions of the open (2r82) and closed (1kc7), shown in (b) and (c), have B-cubed scores of 0.66 and 0.45 respectively. By varying the H-bond energy cutoff (method 2 in Table 3), decompositions with higher B-cubed scores for 1kc7 could be generated. The KINARI decomposition for 2r82 was optimal at cutoff energy 0, meaning all H-bonds were included. For 1kc7, the maximum score, 0.66, was attained at -1.5 kcal/mol when excluding weak H-bonds from the modeling (d). By using a bar to model weak H-bonds, a slightly better score (0.67, cutoff -2.75 kcal/mol) was achieved (e). The B-cubed score plots for the two conformations, using method 7, are shown in (f). As the cutoff is varied, the precision and recall are monotonically increasing and decreasing, shown in (g). An optimal B-cubed score is achieved when the F1-values combining the precision and recall is optimized.
Figure 15Case study of Calmodulin (1ctr). In this case study, we demonstrate how varying the H-bonds and hydrophobic interactions included can produced a cluster decomposition that better matches with the 'gold standard'. (a-c) Rigid cluster decompositions of 1ctr by RigidFinder, KINARI v1.0 (method 3), and by varying hydrogen bonds and hydrophobic interactions (method 7). Using the RigidFinder decomposition as the gold standard for comparison, the decompositions of (b) and (c) attained, respectively, B-cubed scores of 0.48 and 0.90. (d) shows a plot of the B-cubed score as the energy cutoffs for H-bonds and hydrophobic interactions are varied, via method 7. To show how the H-bond and hydrophobic interaction energies are distributed, we include figures (e) and (f). The KINARI-Web server provides the functionality of viewing such histograms when choosing energy cutoffs.