Literature DB >> 27924264

Specificity of broad protein interaction surfaces for proteins with multiple binding partners.

Nobuyuki Uchikoga1, Yuri Matsuzaki2, Masahito Ohue3, Yutaka Akiyama4.   

Abstract

Analysis of protein-protein interaction networks has revealed the presence of proteins with multiple interaction ligand proteins, such as hub proteins. For such proteins, multiple ligands would be predicted as interacting partners when predicting all-to-all protein-protein interactions (PPIs). In this work, to obtain a better understanding of PPI mechanisms, we focused on protein interaction surfaces, which differ between protein pairs. We then performed rigid-body docking to obtain information of interfaces of a set of decoy structures, which include many possible interaction surfaces between a certain protein pair. Then, we investigated the specificity of sets of decoy interactions between true binding partners in each case of alpha-chymotrypsin, actin, and cyclin-dependent kinase 2 as test proteins having multiple true binding partners. To observe differences in interaction surfaces of docking decoys, we introduced broad interaction profiles (BIPs), generated by assembling interaction profiles of decoys for each protein pair. After cluster analysis, the specificity of BIPs of true binding partners was observed for each receptor. We used two types of BIPs: those involved in amino acid sequences (BIP-seqs) and those involved in the compositions of interacting amino acid residue pairs (BIP-AAs). The specificity of a BIP was defined as the number of group members including all true binding partners. We found that BIP-AA cases were more specific than BIP-seq cases. These results indicated that the composition of interacting amino acid residue pairs was sufficient for determining the properties of protein interaction surfaces.

Entities:  

Keywords:  hub protein; interaction profile; postdocking process; protein-protein interaction; rigid-body docking process

Year:  2016        PMID: 27924264      PMCID: PMC5042157          DOI: 10.2142/biophysico.13.0_105

Source DB:  PubMed          Journal:  Biophys Physicobiol        ISSN: 2189-4779


Protein-protein interaction (PPI) networks provide key information for understanding living cells. In yeast, PPI networks have been classified as free-scale networks [1,2]. This type of network has proteins with multiple partners, called hub proteins, including calmodulin, p53, and kalirin, among others [3]. Large-scale PPI prediction provides information regarding the presence or absence of protein interactions. However, when all-to-all PPI predictions are performed, some false-positive pairs can be found [4-6] owing to our lack of knowledge regarding PPI pairs. A typical benchmark dataset, e.g., protein-protein docking benchmark dataset ver. 5.0 [7], includes some proteins that exhibit interactions with multiple partners, such as alpha-chymotrypsin, cyclin-dependent kinase 2 (CDK2), and actin. The accuracy of PPI prediction thus depends on whether we have knowledge of interacting protein pairs. When we predict two proteins as binders and we do not have information on the specific binding of the pair, the prediction will be evaluated as false-positive; this problem is directly associated with our understanding of PPI mechanisms. The protein surface is one of the most important factors in protein sciences for understanding PPI mechanisms. We can determine the properties of a protein surface from the tertiary structure of a protein. Databases of protein surface geometries and electrostatic potentials, such as the eF-site [8], are important for predicting the functions of proteins or determining the details of interaction mechanisms. When investigating PPI mechanisms, more information is obtained from analysis of the surface of each protein and the structures of protein complexes, which are associated with their amino acid sequences and physicochemical properties. Additionally, computational simulations provide information for understanding PPI mechanisms. Molecular dynamic simulations provide information regarding numerous states of protein interactions using ab initio calculations [9,10]. Docking simulations can also be used to predict protein complex structures by generating many candidate complex structures (decoys) that exhibit various interaction states. In general, for PPI predictions, near-native structures are evaluated from decoy sets, and most decoys are ignored as false-positives. For example, in drug design, near-native structures are refined to generate high-resolution predicted docking structures. After this process, we can observe the interaction mechanisms in detail with the protein surface area obtained from the small number of near-native structures. In contrast, based on a comprehensive view of protein interaction surfaces, a decoy set, generated by the docking process, includes information for many possible interaction surfaces of protein pairs. Thus, we can obtain information regarding broad protein interaction surfaces or information of interaction surfaces. In this work, we focused not on each protein interaction surface as ‘local’ protein interaction surface but on the sum of protein interaction surfaces derived from decoys as ‘broad’ protein interaction surfaces. A set of interaction surfaces of a protein pair, ‘local’ protein interaction, may differ from sets of pairs with different docking partners because of differences in the shapes of protein surfaces and physicochemical properties of exposed amino acid residues. However, it is unclear how different ‘broad’ protein interaction surfaces are among multiple partners or between partners and nonbinders. Therefore, we attempted to obtain an indicator of broad protein interaction surfaces for discriminating true partners of a receptor protein from other nonbinders using docking decoy sets. Then, we introduced profiles of broad protein interaction surfaces. Profile methods are applied to protein-small molecules or protein-protein interactions in postdocking analysis, including cluster analysis [11-13]. Profiles can easily be compared, and other properties can be added, e.g., flags of donors, acceptors, cations, anions, and aromatic residues, for protein-small molecule interactions [11,12]. To investigate PPIs, it is favorable to set interacting amino acid residue pairs for elements of an interaction profile. This type of profile results in better classification of decoys in cluster analysis compared with cases measured by root mean square deviation (RMSD) [13]. Therefore, in this work, we examined three proteins, i.e., alpha-chymotrypsin (PDB-chainID: 1ACB-E), CDK2 (1BUH-A), and actin (1ATN-A), which had multiple binding partners and were deposited in protein-protein docking benchmark dataset ver. 5.0 [7]. These proteins, used as docking receptors, interacted with multiple true partners, as described in Table 1. To generate decoy sets, rigid-body docking was performed for each docking receptor using MEGADOCK ver. 4.0 [14], and proteins were docked with 44 different ligand proteins, including their true partners. We then introduced the concept of broad interaction profiles (BIPs), which were made by assembling interaction profiles of decoys and provided information of broad protein interaction surfaces. These profiles of protein pairs were compared in cluster analysis, allowing us to observe differences in their profiles.
Table 1

List of proteins used in docking processes. Parentheses indicate PDBIDs of unbound states

Docking receptorsTrue partner proteinsReceptors in crystal structuresf


NamesPDB IDsNamesPDB IDs
Alpha-chymotrypsin1ACB (2CGA)eglin C1ACB (1EGL)Alpha-chymotrypsin
PSTIa1CGI (1HPT)Alpha-chymotrypsinogen
BPTIb1EAW (9PTI)Matriptase

Actin1ATN (1IJJ)DNase I1ATN (3DNI)Actin
Gelsolin1H1V (1D0N)
DBPc1KXP (1KW2)

CDK2d1BUH (1HCL)CKSHS11BUH (1DKS)CDK2
CDKN3e1FQ1 (1FPZ)

pancreatic secretory trypsin inhibitor,

pancreatic trypsin inhibitor,

human vitamin-D binding protein,

cyclin-dependent kinase 2,

cyclin-dependent kinase inhibitor 3,

names of receptors in X-ray crystal structure data with each true partner

Materials and Methods

Docking process

For obtaining decoy sets, we performed docking processes using MEGADOCK ver. 4.0, an FFT-grid-based exhaustive rigid-body docking tool with multiparallel calculations [14,15]. In this work, docking processes were performed on an Intel Xeon E7-4870 CPU (2.4 GHz, 10 cores) at the National Institute of Genetics. A total of 2000 docking decoys were used for analysis.

Tertiary structures of protein pairs

We examined proteins interacting with multiple binding proteins and used the tertiary structures of proteins for docking processes. We defined binding partners within the protein docking benchmark by searching “low-throughput-like” direct interactions derived from public databases (Dr. Vachiranee Limph, personal communication, data retrieved on August 2011). Protein pairs were screened in multiple PPI databases (BIND [16], BioGRID [17], DIP [18], HPRD [19], IntAct [20], MINT [21], MPact [22], MPPI [23]), and high-throughput-like data found only in references that reported 50 or more interactions were eliminated. Then, three proteins, i.e., alpha-chymotrypsin, actin, and CDK2, referred to as the receptor proteins, were chosen from protein-protein docking benchmark dataset ver. 5.0 [7]. Table 1 shows the interactions of each receptor protein with multiple interacting proteins. Alpha-chymotrypsin interacts with Eglin C, pancreatic secretory trypsin inhibitor (PSTI), and pancreatic trypsin inhibitor (BPTI), which form various quaternary structures. For example, for BPTI, x-ray crystal structures are available showing a complex with matriptase (PDBID: 1EAW). We used the tertiary structure of alpha-chymotrypsin (1ACB) as the docking receptor. For actin and CDK2, 1ATN and 1BUH were used as the docking receptors. We also used unbound states of tertiary structures for the docking processes, corresponding with ligands of the bound state [7]; PDBIDs for these are shown in Table 1. In order to observe differences in protein interaction surfaces between true ligand partners and other nonbinders, the following 44 ligands were also docked to generate decoy sets (chain IDs are shown in parentheses): 1ACB(I), 1AK4(D), 1ATN(D), 1AVX(B), 1AY7(B), 1B6C(B), 1BUH(B), 1BVN(T), 1CGI(I), 1D6R(I), 1DFJ(I), 1E6E(B), 1E96(B), 1EAW(B), 1EWY(C), 1F34(B), 1FC2(D), 1FQ1(B), 1FQJ(B), 1GCQ(C), 1GHQ(B), 1GRN(B), 1H1V(G), 1HE1(A), 1HE8(A), 1I2M(B), 1IBR(B), 1KAC(B), 1KTZ(B), 1KXP(D), 1KXQ(A), 1M10(B), 1MAH(F), 1PPE(I), 1QA9(B), 1SBB(B), 1TMQ(B), 1UDI(I), 1WQ1(G), 2BTF(P), 2PCC(B), 2SIC(I), 2SNI(I), and 7CEI(B). For unbound states, PDBID and chain IDs are as follows: 1EGL, 1E6J(P), 3DNI, 1BA7(B), 1A19(B), 1IAS(A), 1DKS(A), 1HOE, 1HPT, 1K9B(A), 2BNH, 1CJE(D), 1HH8(A), 9PTI, 1CZP(A), 1F32(A), 1FC1(A), 1FPZ(E), 1FQI(A), 1GCP(B), 1LY2(A), 1RGP, 1D0N, 1HE9(A), 1E8Z(A), 1A23(A), 1F59(A), 1F5W(B), 1M9Z(A), 1KW2(B), 1PPI, 1M0Z(B), 1FSC, 1LU0(A), 1CCZ(A), 1SE4, 1B1U(A), 2UGI(B), 1WER, 1PNE, 1YCC, 3SSI, 2CI2(I), and 1M08(B) (same order as the bound state list). To calculate conformation changes between bound and unbound states, we used the TMscore program [24].

BIPs

First, we generated each profile from a decoy generated by the rigid-body docking process (Fig. 1). A profile was composed of elements when an interaction residue pair was detected, and the value of the corresponding element was set to 1. Otherwise, the value was set to 0. To determine interaction residue pairs, we used the dimplot command of the LIGPLOT program, whose default cut-off distance between nonhydrogen atoms is 3.9 Å [25], which is longer than the distance between hydrogen and acceptor atoms (2.5 Å) [26]. This type of profile was introduced for comparisons between decoys, as described by Uchikoga and Hirokawa [13]. After generating profiles of decoys, to investigate broad protein interacting surfaces, we introduced the profiles involved in all interaction surfaces included in a decoy set, named BIPs. Each corresponding element of the interaction profile of decoys was added to generate BIPs by assembling decoys after reranking with ZRANK [27].
Figure 1

Flowchart for generating BIPs. After generating 2000 decoys by the rigid-body docking process with MEGADOCK, their tertiary structures were converted to interaction profiles with a reranking process. Broad interaction profiles (BIPs) were generated by assembling certain numbers (N-values) of profiles.

Decoys with low interaction energy scores were regarded as high-ranking decoys. Then, the number of assembled decoys could influence the results of cluster analysis because a set of high-ranking decoys was expected to generate more specific profiles involved in true partners than those generated by low-ranked decoys. In this work, the N-value was defined as the number of top ranked decoys for generating BIPs. Therefore, the BIP depended on a pair of docking proteins and the number of assembled decoys, i.e., the N-value. We used two types of BIPs. The first type of BIP included each element corresponding to the interacting amino acid residue pairs along with amino acid sequences. If two docking protein pairs had lengths of L and L amino acid sequences, the number of elements in this type of profile was L×L. We called this type BIP-seq. For example, when elements of a profile of a decoy are P1(i, j), P2(i, j), ..., P(i, j) with i-th and j-th amino acids in receptor and ligand proteins, respectively, and N is the number of decoys, the element of a BIP with an N-value was . Elements of profiles of a decoy, P(i, j), were 0 or 1. Therefore, elements of BIPs were not only 0 or 1, but could also be integers. The other type of BIP was composed of amino acid residue types, including 400 (20×20) elements. This type was called BIP-AA. Elements of BIP-AA with an N-value could be described as , where m and n are types of amino acids in receptor and ligand proteins, respectively. The condition of the second summation is S={(i, j) | aa(i)=m & aa( j)=n}, where aa(i) is a type of amino acid located at the i-th amino acid residue in a sequence. Then, BIP(m, n)≠BIP(n, m), indicating that, for example, the element of interaction between ALA belonging to the receptor protein and GLY of the other protein was different from that between GLY of the receptor and ALA of the other protein.

Calculation of distance between BIPs and cluster analysis

For cluster analysis, it was necessary to compare all protein pairs between BIPs. We use the Tanimoto coefficient (Tc) to measure the similarities between profiles, which were converted to distance by calculating 1–Tc. Because the elements of BIPs could be integers, the following equation was used to calculate the Tanimoto coefficient between BIP and BIP: where k is the element index of BIPs, and L is the number of elements (Table 2 in [28]). This equation indicates that each BIP was used as a vector. For example, when BIP(N; 0, 0) was converted to BIP(N; 1), and BIP(N; 0, 1) was converted to BIP(N; 2), etc., we obtained the L-dimension BIP vector. Then, the numerator and the third term in the denominator corresponded to the inner product between BIP and BIP. The other terms corresponded to the squares of the length of vectors BIP and BIP, respectively. This indicator of Tanimoto distance was used to classify interaction profiles for effective exploration of docking spaces [29]. However, in the case of integer vectors of BIPs, this indicator cannot satisfy triangle inequality. Therefore, we only discussed the specificity of true partners (M-values) without detailing the topologies of tree diagrams. Additionally, we also calculated distances between BIPs as Euclidean distances: in order to observe M-values. When comparing between BIP-AAs, L was 400 because of the 20 types of amino acid residues in each docked protein. From this equation, because different pairs had different numbers of elements of BIP-seqs, we calculated Tc between BIPs using common receptor information. Therefore, when comparing BIP-seqs, if docked proteins had lengths of amino acid sequences of L=L, which is the length of common receptor amino acid sequences, the elements of the BIP vector were , where L is the length of an amino acid sequence of a ligand protein. On the other hand, in the case of BIP-AAs, although all profiles included the same elements constructed in a 20×20 shape, we used information for receptors in BIP-AAs. Then, distance matrices were generated for docking results of pairs, which included each receptor and all 44 ligands. After distance calculations for all combinations of BIPs, we performed hierarchical clustering using software for statistical analysis (R ver. 3.2.1.) with the function ‘hclust’ with the group average method option.

Specificity of BIPs (M-value) with the T-value

To investigate differences between BIPs of true partners and other proteins, we examined the specificity of BIPs. After cluster analysis, a group including all true partners was evaluated. For this process, a tree diagram was divided using various thresholds from 0.01 to 0.5 with 0.01 step in R ver. 3.2.1 software. Next, we obtained the minimum threshold, T-value by iterative searching from the smallest T-value (0.01) until all target ligands were classified into the same group. At the same time, we counted the number of members in the group, i.e., the M-value. If the M-value was small, BIPs of true partners were more similar than BIPs of other docking ligands, indicating BIPs with higher specificity. When the M-value was 44, corresponding to the nonspecific BIP, the T-value was indicated as N/A (Fig. 2).
Figure 2

Tree diagrams from cluster analysis by BIP similarities as Tanimoto distances. Each square includes all true partners. M-values are the numbers of ligands included in each square with minimum threshold T-values. The true partners with the quaternary structure of the bound state and the corresponding unbound state protein of PDBIDs are marked as “o_”. The other true partners are marked as “x_”.

Results and Discussion

A decoy set generated by the rigid-body docking process includes information for protein surfaces. Decoys are generated using the docking score involved in shape complementarity, electrostatic parameters, and desolvation-like parameters in MEGADOCK ver. 4.0 [14]. Each decoy provides information regarding the quaternary structure. Thus, a set of decoys can be used for the postdocking analysis, e.g., the reranking process, and can provide information regarding broad interaction surfaces in a pair of proteins. In order to observe information regarding broad protein surfaces, we introduced the concept of BIPs, which were generated by assembling high-ranking decoys after reranking. BIP depended on protein pairs and the number of assembled decoys (i.e., N-values). We used the BIP of a receptor molecule to calculate their similarities. Even if an identical receptor protein was used for the docking process, the BIP pattern depended on various ligand proteins, indicating that they were not the same.

Cluster analysis and specificity of BIPs including true partners

To investigate differences in the interacting surfaces with various ligands, we performed cluster analysis using BIPs. Figure 2A shows tree diagrams with the BIP of alpha-chymotrypsin as a bound state receptor. The number of assembled decoys for generating a BIP (N-values) was changed from 100 to 2000, and we examined ligand proteins known as the true partners of a certain receptor. We then counted the number of ligands classified into a group, including all true partner ligands interacting with the receptor (i.e., the M-value). The specificity of BIPs including true partners corresponded to the M-values. When N=100, the distance was greater than that when N≠100 for all receptor cases (Fig. 2). These data indicated that BIPs for which N was 100, involving a small number of high-ranking decoys, were more different in cases with multiple ligands than in cases with larger N-values. Figure 2 also shows tree diagrams generated using the two types of BIPs and various N-values for all receptor proteins, alpha-chymotrypsin, actin, and CDK2. For example, in the case of bound alpha-chymotrypsin (PDBID-chainID: 1ACBE), which is known to interact with Eglin C (1ACB-I), PSTI (1CGI-I), and BPTI (1EAW-B), we obtained a group composed of five ligands (M=5), including all three true partners of alpha-chymotrypsin in the case of BIP-seq cases in which N=100. When the N-value was 2000, most ligand cases were classified into a single group, including all true partners (M=43), indicating that this situation (BIP-seq) did not allow for the distinction of true partners from other ligand cases. Additionally, the case of N=100 for BIP-seq was consistent with the results of reranking decoys because high-ranking decoys generated specific protein interaction surfaces for true partners. Notably, cluster analysis using BIP-AAs showed fewer group members (M-values) for all N-values than for BIP-seq cases, except in the case of N=100. For cases in which N=500, when the M-value was low (M=8), the specificity of BIP-AAs required more low-ranked decoys compared with the BIP-seq case of N=100. Moreover, every T-value in BIP-AA cases was smaller than that in BIP-seq cases, indicating that BIP-AAs had smaller distances between each other than BIP-seq cases, e.g., alpha-chymotrypsin in Figure 2 (other cases of T-values are not shown). M-values for all receptor protein cases are shown in Table 2. In the bound state for BIP-seq cases, we found the lowest M-values at N=2000 for actin, indicating that more decoys were need for highly specific BIP-seq cases compared with those for other receptor cases. For CDK2, the lowest M-values were found at N=500, which was comparable to the N-value of alpha-chymotrypsin.
Table 2

List of the number of ligands in a classified group including all true partners (M-values)

Tanimoto Distance

Receptor proteinBound stateUnbound state


N-valueN-value


1005001,0002,0001005001,0002,000
Alpha-chymotrypsinBIP-seq520174341424334
BIP-AA2081211222697

ActinBIP-seq4140392844231616
BIP-AA4121141225272113

CDK2BIP-seq4116324344444443
BIP-AA4325433915434343

Euclidean Distance

Alpha-chymotrypsinBIP-seq61635144412619
BIP-AA22242010251558

ActinBIP-seq4444424125121315
BIP-AA4122231217431013

CDK2BIP-seq4114154243434343
BIP-AA4043393940334038
In the unbound state (Fig. 2B and Table 2), the lowest M-values for BIP-seq cases were greater than those for bound cases. The lowest M-values for both types of BIPs were found at N=2000 for all receptor cases, except CDK2 in the case of BIP-AA. However, the M-value decreased as the N-values increased. Therefore, it was possible to obtain lower M-values when the N-values were higher. Moreover, the lowest M-value for CDK2 in the case of BIP-AA in the unbound state was found at N=100, indicating that this number of decoys was sufficient to obtain highly specific BIP-AA information. In BIP-AA cases, we also found that the lowest M-values in the unbound state were lower than those in the bound state, except for actin. However, analysis of actin indicated comparable results between M=12 in the bound state and M=13 in the unbound state. Comparisons between BIP-seq and BIP-AA cases in the unbound state revealed that the lowest M-values of BIP-AAs were lower than those of BIP-seq cases for each receptor case. We also obtained M-values by calculating Euclidean distances of BIPs (Table 2). In this case, the lowest M-values in each case were found to be comparable to those of Tanimoto distance cases. However, for alpha-chymotrypsin, smaller M-values were found in the unbound state, particularly for BIP-seq. For actin, in the bound state of BIP-seq, larger M-values were observed than those in Tanimoto distance cases, indicating that the specificity of interaction surfaces of true partners was reduced. For CDK2, we found specificity (low M-values) for BIP-seq in the bound state, although nonspecificity was found for BIP-AA in the bound state. In the case of the BIP-AA of actin, M-values in the unbound state were reduced compared with those in the bound state owing to differences between the tertiary structures of receptor proteins. The RMSDs ranged from 13.27 Å to 19.21 Å in the bound state for all combinations of the three receptors in the tertiary complex structures, i.e., 1ATN-A, 1H1V-A, and 1KXP-A. On the other hand, receptors in the unbound state were similar because of the use of identical structures (1IIJ-B). Thus, structural deviations of receptors influenced their protein surfaces. Additionally, low-specificity BIPs were found in the bound state, indicating that these cases exhibited larger M-values than those in the unbound state. However, for BIP-AAs, the lowest M-values were comparable between the bound (M=12) and unbound states (M=13), indicating that interacting protein surfaces were more specific than those in cases of other ligands. Notably, the results for CDK2 were different. RMSDs were calculated using two receptors in tertiary complex structures, yielding RMSDs of 12.46 Å in the bound state and 8.49 Å in the unbound state. Thus, these data provided information regarding the number of high-ranking decoys suitable for achieving the most specific BIP-AA in the unbound state.

Physicochemical properties of protein-interacting interfaces

Because BIPs included information for interacting amino acid pairs, we could evaluate the physicochemical properties of BIPs. Tables 3 and 4 show the net charge and hydrophobic parameters of the three receptors and their true partners. The hydrophobic properties of ligands decreased to negative values as N-values increased because increasing the number of interacting residue pairs was associated with the number of assembled decoys. In receptors in the bound state (Table 3), negative hydrophobic properties were also found in ligand cases, indicating that these ligands had hydrophilic properties, with the exception of 1ACB (alpha-chymotrypsin) when N=100. The condition of N=100 in 1ACB yielded more hydrophobic receptors. We also observed interaction residues using x-ray crystal complex structure data. Figure 3A shows interaction sites and aligned amino acid sequences for alpha-chymotrypsin (1ACB), alpha-chymotrypsinogen (1CGI), and matriptase (1EAW). Interestingly, the amino acid sequences and interaction sites were similar, as were the physicochemical properties of these interacting residues, as determined using hydropathy parameters assigned by Kyte and Doolittle [30] and electrostatic charges (Arg, Lys, and His: +1; Asp and Glu: −1; and other amino acids: 0) [31]. The net charges and the hydrophobic parameters were defined as the sum of parameters assigned to their residues composing the interaction site. The net charges (E values) of the interaction sites were 10.0, 10.0, and 11.0, and the hydrophobic parameters (H values) were 25.7, 50.7, and 61.30 for the receptors 1ACB, 1CGI, and 1EAW, respectively. For the ligand sides, the following E and H values were obtained for 1ACB, 1CGI, and 1EAW, respectively: −2.0 and 16.40, −12.0 and −145.0, and 66.0 and −242.60. In the unbound state of alpha-chymotrypsin (1ACB), BIPs showed hydrophilic properties (Table 4). These results were associated with the lowest M-value at N=2000 for the unbound case (Table 2), implying that more decoys were necessary to obtain BIPs specific to the true partners. In the other cases, actin (1ATN) had interaction sites with physicochemical properties (E and H values) for 1ATN, 1H1V, and 1KXP as follows: 18.0 and −35.90, −7.0 and −22.90, and 4.0 and −167.60, respectively. In this case, the ligands had E and H values of 6.0 and −83.80, 0.0 and −43.50, and 7.0 and −41.30 for 1ATN, 1H1V, and 1KXP, respectively, in the absence of interaction sites. The interaction sites of receptors were similar, except for that of 1ATN (Fig. 3B). For actin in the bound state, the ligand of 1ATN was consistent at all N-values, indicating the positive net charge of 1ATN receptor and ligand of BIPs at all N-values (Table 3). Moreover, for the 1ATN pair, assembled high-ranking decoys had native-like interaction properties, and N-values were sufficient for obtaining specific BIPs.
Table 3

Physicochemical properties of the bound state

PDBIDNet chargeHydrophobicityNumber of interacting residue pairs



ReceptorLigandN-valueReceptorLigandReceptorLigand
1ACB1ACB10082320392−14283111
1CGI250−23365.4−3708.92935
1EAW105794137.7−2664.82765

1ACB5004781566−139.2−9223.614692
1CGI10124−1556.8−17940.213539
1EAW4203786−2240.4−14204.0913323

1ACB100010622970−5023.4−18962.7128487
1CGI18030−7359−37655.4926563
1EAW8867247−8434.5−29224.9126576

1ACB200023925668−21928.2−41395.5155915
1CGI3022195−24285.7−78172.1252376
1EAW213614199−26546.41−60223.9953624

1ATN1ATN100219139−4507.6−3973.23743
1H1V109121−4634.1−45883864
1KXP3−1−4614.3−2721.14255

1ATN500387894−25735.1−21367.518486
1H1V202798−23722.7−21893.5918350
1KXP−33−4−23312.5−22638.7119491

1ATN10005822058−52365.68−45610.8836613
1H1V711451−51153.1−44999.3936763
1KXP−15922−50730.7−49853.9939312

1ATN20002304587−113680.8−97211.375000
1H1V−7092824−108758.3−96134.1774603
1KXP−1028−130−112267.11−111733.7480426

1BUH1BUH100377−221−3076.9−42932843
1FQ1398325−4381.1−3722.93375

1BUH5001924−701−16989.3−19486.213554
1FQ123061489−23405.9−19446.616929

1BUH10003778−838−34932−37541.226969
1FQ144343003−47284.79−39759.8934296

1BUH20007430−703−74073.6−76021.4254585
1FQ191946874−100196.3−83359.6970848
Table 4

Physicochemical properties of the unbound state

PDBIDNet chargeHydrophobicityNumber of interacting residue pairs



ReceptorLigandN-valueReceptorLigandReceptorLigand
1ACB1ACB10058−5−204.5−2073.22052
1CGI81273−162.3−4592.42126
1EAW61674−366.9−3512.12087

1ACB500321−120−2411.8−10528.29534
1CGI387977−617.5−19018.49720
1EAW2412916−2067.9−14417.59673

1ACB1000676−289−5191.5−19787.218498
1CGI7001669−2326.6−35923.5118394
1EAW5115334−4467.3−26688.418413

1ACB20001300−598−10210−36639.535574
1CGI14272502−6335−66801.5134904
1EAW11019754−9848.8−49854.5135037

1ATN1ATN100−120333−3963−4298.83291
1H1V−16453−5074.8−5474.23613
1KXP−199−42−4733.2−5231.83579

1ATN500−5661437−18381.8−19863.814534
1H1V−711−173−20845.99−25009.8116389
1KXP−665−496−20928.3−26045.515840

1ATN1000−11732545−36763.21−39000.927991
1H1V−1481−142−40160.8−47217.2131144
1KXP−1220−1088−38860.49−50446.6130127

1ATN2000−24684780−70724.91−74852.8153554
1H1V−3070−194−77328.7−89695.4959364
1KXP−2573−2230−75127.91−97115.857889

1BUH1BUH100479161−3238.3−32942361
1FQ1308365−3496.7−39462782

1BUH5001795722−13457.1−17625.710622
1FQ114851516−14542.7−18653.8112177

1BUH100031281254−24678.2−34111.5119918
1FQ128852905−28007.2−36008.723171

1BUH200058022160−46804.9−65434.837755
1FQ152215405−52693.5−66785.1143747
Figure 3

Alignment of receptors and interaction sites with x-ray crystal complex structures. Each receptor indicates PDBIDs corresponding to receptors in Table 1. Squares are interaction sites extracted from complex structure data by LIGPLOT [25].

The BIPs of CDK2 (1BUH) exhibited hydrophilic and positive net charges in receptor proteins in both the bound and unbound states (Tables 3 and 4). In the unbound state, the ligand sides of BIPs exhibited positive net charges for both ligands (Table 4). In contrast, negative net charges for the 1BUH ligand and positive net charges for the 1FQ1 ligand were found in the bound state (Table 4). For the receptors, the interaction sites exhibited E and H values of −18.0 and −45.40 for 1BUH and −18.0 and −110.90 for 1FQ1, respectively (Fig. 3C). Interaction sites on ligands had E and H values of −15.0 and −106.60 for 1BUH and 1.0 and −99.30 for 1FQ1, respectively, in the absence of interaction sites. Although the positive net charges in the ligand of 1FQ1 were close to zero, these parameters appeared to be consistent with the physicochemical properties of BIPs in the bound state (Table 3), in contrast to the results for the unbound state shown in Table 4, indicating that large Mvalues were found as nonspecific BIPs in the unbound state (Table 2). However, we found that M=15 for BIP-AA in the unbound state, suggesting that high-ranking decoys generated specific BIP-AAs for true partners. We observed the physicochemical properties of BIPs under different conditions. Particularly in the bound state, consistent properties were found with actual interaction sites in some cases. In contrast, more decoys appeared to be necessary to obtain specific BIPs in the unbound state.

Conclusion

In this work, we examined proteins that could bind with other multiple proteins, e.g., hub proteins. Generally, hub proteins have disordered regions, interacting with various proteins through large conformation changes [2]. In this work, we examined globular-type proteins selected from a docking benchmark dataset. These proteins can be categorized into three types in terms of conformational changes between the bound and unbound states, i.e., difficult, moderately difficult, and rigid-body [7]. Alpha-chymotrypsin and actin are classified into the difficult type, with RMSDs between interfaces of the bound and unbound states of 2.26 and 3.28, respectively. CDK2 is a rigid-body type, with an RMSD of 0.75. Profile methods may be useful for studies of proteins with large conformation changes. For example, interaction fingerprints have been applied to docking problems using calmodulin with large conformation changes, indicating that profile methods are useful for analysis of protein interaction surfaces [13]. In summary, we observed the specificities of BIPs for true partners and the physicochemical properties of BIPs; these properties were not directly related to the PPI predictions. However, the information provided by analysis of broad possible protein interfaces can yield clues for understanding PPI mechanisms. Three proteins interacting with multiple ligand proteins (true partners) were analyzed to determine the specificity of interaction surfaces among true partners. Then, to observe interaction surfaces, broad interaction profiles (BIPs) were defined as the sum of interaction profiles of decoys, generated by the rigid-body docking process. Two types of BIPs were introduced; one was involved in amino acid sequences as its elements (BIP-seq), and the other was composed of interacting amino acid pairs (BIP-AAs). Notably, the specificity of BIP-AAs for true ligand proteins was higher than that of BIP-seqs.
  30 in total

1.  The Database of Interacting Proteins: 2004 update.

Authors:  Lukasz Salwinski; Christopher S Miller; Adam J Smith; Frank K Pettit; James U Bowie; David Eisenberg
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  Scoring function for automated assessment of protein structure template quality.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  Proteins       Date:  2004-12-01

Review 3.  Flexible nets. The roles of intrinsic disorder in protein interaction networks.

Authors:  A Keith Dunker; Marc S Cortese; Pedro Romero; Lilia M Iakoucheva; Vladimir N Uversky
Journal:  FEBS J       Date:  2005-10       Impact factor: 5.542

4.  Non-interacting surface solvation and dynamics in protein-protein interactions.

Authors:  Koen M Visscher; Panagiotis L Kastritis; Alexandre M J J Bonvin
Journal:  Proteins       Date:  2015-01-21

5.  Satisfying hydrogen bonding potential in proteins.

Authors:  I K McDonald; J M Thornton
Journal:  J Mol Biol       Date:  1994-05-20       Impact factor: 5.469

6.  A simple method for displaying the hydropathic character of a protein.

Authors:  J Kyte; R F Doolittle
Journal:  J Mol Biol       Date:  1982-05-05       Impact factor: 5.469

Review 7.  Hub promiscuity in protein-protein interaction networks.

Authors:  Ashwini Patil; Kengo Kinoshita; Haruki Nakamura
Journal:  Int J Mol Sci       Date:  2010-04-26       Impact factor: 5.923

8.  The BioGRID interaction database: 2015 update.

Authors:  Andrew Chatr-Aryamontri; Bobby-Joe Breitkreutz; Rose Oughtred; Lorrie Boucher; Sven Heinicke; Daici Chen; Chris Stark; Ashton Breitkreutz; Nadine Kolas; Lara O'Donnell; Teresa Reguly; Julie Nixon; Lindsay Ramage; Andrew Winter; Adnane Sellam; Christie Chang; Jodi Hirschman; Chandra Theesfeld; Jennifer Rust; Michael S Livstone; Kara Dolinski; Mike Tyers
Journal:  Nucleic Acids Res       Date:  2014-11-26       Impact factor: 19.160

9.  MEGADOCK 3.0: a high-performance protein-protein interaction prediction software using hybrid parallel computing for petascale supercomputing environments.

Authors:  Yuri Matsuzaki; Nobuyuki Uchikoga; Masahito Ohue; Takehiro Shimoda; Toshiyuki Sato; Takashi Ishida; Yutaka Akiyama
Journal:  Source Code Biol Med       Date:  2013-09-03

10.  MEGADOCK: an all-to-all protein-protein interaction prediction system using tertiary structure data.

Authors:  Masahito Ohue; Yuri Matsuzaki; Nobuyuki Uchikoga; Takashi Ishida; Yutaka Akiyama
Journal:  Protein Pept Lett       Date:  2014       Impact factor: 1.890

View more
  4 in total

1.  Biochemical Reduction of the Topology of the Diverse WDR76 Protein Interactome.

Authors:  Gerald Dayebgadoh; Mihaela E Sardiu; Laurence Florens; Michael P Washburn
Journal:  J Proteome Res       Date:  2019-08-09       Impact factor: 4.466

2.  Conservation of Specificity in Two Low-Specificity Proteins.

Authors:  Lucas C Wheeler; Jeremy A Anderson; Anneliese J Morrison; Caitlyn E Wong; Michael J Harms
Journal:  Biochemistry       Date:  2017-12-29       Impact factor: 3.162

3.  Preface of Special Issue "Protein-Ligand Interactions".

Authors:  Kei Yura
Journal:  Biophys Physicobiol       Date:  2016-07-14

4.  Systematic identification of recognition motifs for the hub protein LC8.

Authors:  Nathan Jespersen; Aidan Estelle; Nathan Waugh; Norman E Davey; Cecilia Blikstad; York-Christoph Ammon; Anna Akhmanova; Ylva Ivarsson; David A Hendrix; Elisar Barbar
Journal:  Life Sci Alliance       Date:  2019-07-02
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.