| Literature DB >> 16336694 |
K V Brinda1, Saraswathi Vishveshwara.
Abstract
BACKGROUND: Protein-protein association is essential for a variety of cellular processes and hence a large number of investigations are being carried out to understand the principles of protein-protein interactions. In this study, oligomeric protein structures are viewed from a network perspective to obtain new insights into protein association. Structure graphs of proteins have been constructed from a non-redundant set of protein oligomer crystal structures by considering amino acid residues as nodes and the edges are based on the strength of the non-covalent interactions between the residues. The analysis of such networks has been carried out in terms of amino acid clusters and hubs (highly connected residues) with special emphasis to protein interfaces.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16336694 PMCID: PMC1326230 DOI: 10.1186/1471-2105-6-296
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Size of the largest cluster/N (N = No. of amino acid residues in the protein structure) Vs Imin for a set of oligomers from the dataset.
Figure 2Interface amino acid clusters in Urate oxidase (1UOX) at (a) Imin = 6% and (b) Imin = 4%. The protein monomers and the cluster forming residues are colored differently and are shown in cartoon and van der Waal's representations respectively. This is an example of an interface with more than one patch at higher Imins that do not merge at lower Imins (1UOX) as identified from the interface clusters at different Imins.
Figure 3Amino acid preferences in the interface and the non-interface hubs at (a) Imin = 0% and (b) Imin = 4%. The percentage compositions of amino acid hubs in the interfaces and non-interfaces are presented (i.e., (No. of interface or non-interface amino acid hubs of type 'i' ÷ Total No. of amino acid residues of type 'i' present in the dataset) × 100).
Preferences of interface hubs to interact with other residues at Imin = 4%1
| ALA | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| ARG | 2.06 | 1.56 | 10.08 | 5.74 | 14.32 | 0.45 | 4.62 | 15.15 | 0.50 | 3.34 | 3.90 | 3.96 | 2.28 | 2.34 | 6.24 | 2.79 | 5.40 | 4.51 | 2.45 | 6.63 | 3.73 |
| ASN | 0.92 | 3.70 | 4.27 | 10.67 | 9.96 | 0.43 | 8.11 | 7.54 | 2.56 | 5.97 | 2.42 | 4.27 | 2.13 | 2.42 | 4.84 | 3.70 | 2.84 | 6.40 | 2.42 | 6.26 | 9.10 |
| ASP | 0.27 | 1.51 | 18.87 | 3.02 | 4.15 | 0.00 | 3.02 | 5.28 | 0.00 | 7.92 | 0.38 | 5.66 | 2.64 | 1.89 | 5.28 | 6.79 | 7.55 | 9.43 | 5.66 | 9.06 | 1.89 |
| CYS | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| GLN | 0.70 | 1.35 | 5.84 | 8.09 | 5.62 | 0.90 | 12.58 | 7.64 | 0.45 | 2.47 | 3.82 | 8.31 | 1.35 | 4.04 | 6.52 | 1.80 | 3.82 | 6.74 | 3.60 | 9.66 | 5.39 |
| GLU | 0.53 | 1.04 | 20.42 | 3.11 | 2.60 | 1.21 | 3.46 | 3.29 | 0.87 | 7.44 | 6.23 | 4.84 | 6.06 | 3.29 | 7.44 | 2.60 | 6.06 | 7.61 | 2.60 | 7.79 | 2.08 |
| GLY | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| HIS | 1.48 | 1.18 | 7.24 | 12.46 | 7.24 | 1.52 | 3.03 | 5.89 | 0.84 | 12.12 | 4.04 | 5.72 | 2.53 | 4.71 | 2.69 | 0.34 | 5.72 | 6.90 | 3.54 | 5.56 | 6.73 |
| ILE | 0.31 | 0.33 | 6.31 | 1.66 | 3.32 | 0.00 | 8.64 | 6.31 | 0.33 | 3.32 | 6.64 | 9.63 | 3.32 | 5.32 | 14.95 | 2.33 | 5.65 | 4.65 | 4.65 | 6.64 | 5.98 |
| LEU | 0.33 | 2.56 | 11.44 | 3.75 | 5.92 | 0.79 | 6.51 | 4.14 | 1.38 | 5.72 | 6.31 | 8.09 | 1.78 | 1.18 | 7.69 | 4.54 | 3.94 | 2.37 | 4.34 | 6.31 | 11.24 |
| LYS | 0.48 | 2.52 | 5.03 | 5.49 | 13.04 | 1.14 | 4.35 | 17.39 | 1.14 | 3.89 | 3.43 | 5.49 | 0.23 | 3.66 | 3.20 | 1.37 | 8.47 | 5.72 | 2.75 | 9.15 | 2.52 |
| MET | 1.22 | 4.99 | 6.07 | 7.81 | 5.64 | 1.95 | 3.69 | 3.25 | 1.30 | 3.90 | 5.21 | 8.46 | 3.90 | 7.81 | 8.68 | 2.82 | 5.42 | 1.74 | 3.25 | 7.59 | 6.51 |
| PHE | 1.40 | 1.23 | 5.53 | 3.89 | 5.53 | 1.13 | 4.10 | 2.97 | 1.02 | 3.89 | 8.09 | 7.89 | 2.15 | 4.61 | 14.45 | 2.97 | 3.48 | 4.51 | 3.69 | 10.86 | 7.99 |
| PRO | 0.11 | 0.00 | 5.81 | 5.81 | 6.98 | 1.16 | 0.00 | 3.49 | 0.00 | 4.65 | 0.00 | 4.65 | 0.00 | 0.00 | 10.47 | 13.95 | 6.98 | 3.49 | 5.81 | 20.93 | 5.81 |
| SER | 0.06 | 0.00 | 9.09 | 0.00 | 9.09 | 0.00 | 16.36 | 0.00 | 0.00 | 7.27 | 0.00 | 0.00 | 0.00 | 0.00 | 10.91 | 0.00 | 16.36 | 23.64 | 7.27 | 0.00 | 0.00 |
| THR | 0.05 | 0.00 | 0.00 | 14.00 | 4.00 | 0.00 | 26.00 | 12.00 | 0.00 | 6.00 | 6.00 | 0.00 | 8.00 | 0.00 | 12.00 | 2.00 | 2.00 | 6.00 | 0.00 | 0.00 | 2.00 |
| TRP | 2.28 | 1.95 | 5.07 | 8.38 | 4.29 | 0.39 | 7.21 | 3.51 | 0.97 | 3.51 | 7.80 | 7.21 | 3.90 | 2.53 | 11.31 | 2.73 | 2.92 | 7.02 | 3.31 | 11.31 | 4.68 |
| TYR | 1.47 | 0.56 | 11.26 | 6.31 | 7.77 | 0.56 | 5.07 | 7.21 | 1.69 | 4.17 | 5.07 | 5.52 | 5.29 | 2.48 | 7.09 | 6.64 | 2.93 | 2.48 | 3.60 | 8.45 | 5.86 |
| VAL | 0.03 | 0.00 | 5.56 | 0.00 | 2.78 | 0.00 | 5.56 | 5.56 | 0.00 | 5.56 | 0.00 | 11.11 | 0.00 | 0.00 | 36.11 | 0.00 | 0.00 | 0.00 | 8.33 | 16.67 | 2.78 |
11: The rows of the table (i) correspond to the twenty different types of amino acids forming the interface hubs. The second element in each row gives Hi, the percentage of amino acids of each type 'i' forming interface hubs in the dataset (with respect to the total number of residues of type 'i' in the dataset). Elements three to twenty two in each row correspond to the percentage of interactions that the amino acid hub of type 'i' makes with the twenty different amino acid types (j) [with respect to the total number of interactions made by hub residue type 'i']. This gives the 20 × 20 matrix for the interactions of the interface hubs with other residues at Imin = 4%. Thus, each ijth element in the 20 × 20 matrix gives the percentage interactions of interface hub type 'i' with residue type 'j'.
Figure 4Examples of Arginine and Tyrosine hubs and their interactions. (a) Arginine hubs in the interface of 5-aminolevulinic acid dehydratase tetramer (1B4K) at Imin = 4%. The protein tetramer is shown in cartoon representation with each monomer colored differently. Arg 17 (C), Arg 14 (C) and Arg 198 (B) form hubs, which interact with other residues (including other Arginines) belonging to different chains, thus forming a connected network of amino acid cluster at the interface. All the Arginine residues are shown in ball and stick representation and are colored according to the atom types (carbon in cyan, oxygen in red and nitrogen in blue) and the other residues are shown in bond representations and are colored according to the different residue types. The residue names and numbers are indicated along with the chains to which they belong. (b) Tyrosine hub in the interface of the shaker potassium channel (1A68) obtained at Imin = 4%. The protein backbone is shown in cartoon representation with the monomers colored differently. The interface hub residue (Tyr 123 D) and the residues with which it interacts (Arg 130 D, Ser 127 D, Gln 126 D, Val 141 C, Asn 114 C) are shown in van der Waal's representation. The residue numbers, names and the chain identifiers are given. The Tyrosine hydroxyl is involved in a short-strong hydrogen bond with the Arginine side-chain with a donor-acceptor distance of 2.52 Å. The Tyrosine and the Arginine residues involved in the short hydrogen bond are colored according to atom type (carbon in cyan, nitrogen in blue and oxygen in red) and the other residues are colored differently based on their residue types.
Statistics of interface interactions
| Imin = 4% | |||
| Charged+Polar interactions1 (Salt bridges)3,5 | 34% (11%) | 37% (12%) | 44% (13.7%) |
| Hydrophobic interactions2 (Aromatic-Aromatic interactions)4,5 | 26.5% (3.2%) | 24% (5.7%) | 22% (9%) |
| Imin = 0% | |||
| Charged+Polar interactions (Salt bridges) | 35.3% (8%) | 24.3% (7%) | 21.5% (6%) |
| Hydrophobic interactions (Aromatic-Aromatic interactions) | 23% (0.4%) | 32% (1%) | 40% (5%) |
1Charged and polar interactions involve Arg, Lys, His, Glu, Asp, Gln, Asn, Ser, Thr, Cys.
2Hydrophobic and Aromatic interactions involve Leu, Ile, Val, Ala, Gly, Met, Pro, Phe, Tyr and Trp.
3Salt bridges are between Arg/His/Lys and Glu/Asp.
4Aromatic interactions involve Phe/Tyr/Trp with themselves.
5The values given within parentheses are either the salt bridges amongst the charged+polar interactions or the aromatic-aromatic interactions amongst the hydrophobic interactions.
6The rest of the interactions in each category (adding up to 100%) involve charged/polar versus hydrophobic/aromatic interactions.
Comparison of interface clusters and hubs with experimental dissociation constants of oligomeric proteins
| Protein [Reference] | PDB Code | Kd1 | No. of interface hubs | Size of the largest interface cluster | Nic (Nires)2 | ||||||
| Imin (%) | Imin (%) | Imin (%) | |||||||||
| 0 | 2 | 4 | 4 | 6 | 8 | 4 | 6 | 8 | |||
| E-Cadherin [50] | 1edh | 170 μM | 14 | 3 | 1 | 41 | 3 | 0 | 2(44) | 3(9) | 0 |
| β-Lactoglobulin [50] | 1beb | 20 μM | 13 | 3 | 2 | 32 | 4 | 0 | 3(43) | 2(7) | 0 |
| Insulin [50] | 1trz | 1 μM | 13 | 8 | 2 | 31 | 6 | 0 | 3(37) | 4(18) | 0 |
| Elongation Factor EF-TU/EF-TS Complex [50] | 1efu | 30 nM | 54 | 27 | 11 | 61 | 18 | 11 | 8(190) | 7(63) | 5(28) |
| Rac-ExoS GAP domain [50] | 1he1 | 20–30 nM | 24 | 13 | 2 | 94 | 9 | 7 | 4(146) | 4(23) | 2(10) |
| CD4-GP120 [51] | 1gc1 | 5 nM | 28 | 13 | 5 | 86 | 15 | 8 | 4(152) | 3(32) | 3(21) |
| Ran-Importin-β [50] | 1ibr | 0.6 nM | 49 | 18 | 7 | 131 | 15 | 11 | 6(173) | 5(48) | 4(22) |
| Growth hormone-Receptor [52] | 2hhr | 0.15 nM | 46 | 21 | 12 | 257 | 34 | 22 | 6(344) | 7(86) | 5(57) |
1: Kd: Experimental dissociation constant
2: Nic (Nires): No. of interface clusters (No. of residues in the interface clusters)
Hot spot predictions§ from interface hub analysis on protein complexes at Imin = 4%
| 1A4Y | 35L, 40K, 95R, 117Q | 35D, 63R, 346Q, 349N | ||
| 1CBW2 | 17R, 37G | Chymotrypsin | 57H | |
| 2PTC | **4 | Trypsin | 57H, 99L, 190S | |
| 1BRS | 56F, 103Y | 33N | ||
| 1GC1 | ** | GP120 | 280N, 368D, 370E, 469R | |
| 1DVF | 35Nh5, 45Lh, 50Mh, 103Lh, 106Wh, 36Yl5, 89Ql, 91Fl, 96Rl | 47Wh, 99Yh, 103Wh, 36Yl, 92Nl | ||
| 1VFB | 35Nh, 45Lh, 40Mh, 103Lh, 106Wh, 36Yl, 44Pl, 89Ql, 91Fl, 96Rl | 27N | ||
| 1DAN3 | 128Fh, 164Mh, 208Yh, 230Rh, 95Nl, 101Yl, 118Yl | 19F, 74R, 96N, 100F, 147F | ||
| 3HFM | ** | 33Yh, 39Kh, 50Yh, 98Wh, 103Wh, 166Fh, 32Nl, 50Yl, 94Wl, 96Yl, 121Sl, 123El, 135Fl | ||
| 3HHR3 | 8R, 9L, 12N, 16R, 41K | 150H, 152D, 197V, 200Y, 217R, 218N | ||
| 1BXI | ** | E9DNase | 72N, 75N, 84S, 86F, 97K | |
| 1FC2 | ** | IgG1 | 30M, 31I | |
| 1DFJ | ** | RNase A | 41K, 111E | |
| 1JCK | ** | TcrVb | 43L, 101Y, 108F | |
| 1AHW | 152I, 169K, 171N, 190Q, 192V, 201K | Fab 5G9 | 32Yl, 36Yl, 50Yl, 91Hl, 135Fl, 137Nl, 33Yh, 35Hh, 45Lh, 50Lh, 52Dh, 59Ih, 102Yh, 103Yh, 104Fh, 147Kh, 170Fh |
§ Already mutated hubs have been excluded from this table.
1 The underlined monomers are the ones where some mutations have already been carried out.
2 No hubs are obtained at Imin = 4% in 1CBW. Hence, hubs at Imin = 2% are used for prediction.
3 1DAN and 3HHR are proteins where a large number of mutations (>100) have been carried out, which have ΔΔG < 1 kcal/mol. These mutations have not been included in this analysis.
4 ** All hubs identified at Imin = 4% have already been mutated.
5 h or l accompanying a residue name corresponds to the heavy and the light chains of the corresponding antibodies.
Figure 5Correlation of ΔΔG with hubs obtained at different Imins. The experimental mutations are categorized into different bins based on their ΔΔG values in kcal/mol. The distribution of the hub character of the residues (highest Imin at which a residue is a hub) is shown in different shades within these bins.