Literature DB >> 19381265

Allosteric transitions of supramolecular systems explored by network models: application to chaperonin GroEL.

Abstract

Identification of pathways involved in the structural transitions of biomolecular systems is often complicated by the transient nature of the conformations visited across energy barriers and the multiplicity of paths accessible in the multidimensional energy landscape. This task becomes even more challenging in exploring molecular systems on the order of megadaltons. Coarse-grained models that lend themselves to analytical solutions appear to be the only possible means of approaching such cases. Motivated by the utility of elastic network models for describing the collective dynamics of biomolecular systems and by the growing theoretical and experimental evidence in support of the intrinsic accessibility of functional substates, we introduce a new method, adaptive anisotropic network model (aANM), for exploring functional transitions. Application to bacterial chaperonin GroEL and comparisons with experimental data, results from action minimization algorithm, and previous simulations support the utility of aANM as a computationally efficient, yet physically plausible, tool for unraveling potential transition pathways sampled by large complexes/assemblies. An important outcome is the assessment of the critical inter-residue interactions formed/broken near the transition state(s), most of which involve conserved residues.

Entities: Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2009 PMID： 19381265 PMCID： PMC2664929 DOI： 10.1371/journal.pcbi.1000360

Source DB: PubMed Journal: PLoS Comput Biol ISSN： 1553-734X Impact factor: 4.475

Introduction

Many proteins assume more than one functional conformation, stabilized by ligand binding or changes in environmental conditions. A typical example is the bacterial chaperonin GroEL [1], a widely studied ATP-regulated molecular machine and member of heat shock protein Hsp60 family. GroEL assists in unfolding and refolding misfolded or partially folded proteins [2]–[4]. It is composed of two back-to-back stacked rings, each containing seven subunits of 60 kDa. Each subunit is, in turn, composed of three domains, equatorial (E), intermediate (I) and apical (A). GroEL works together with the co-chaperonin, GroES. The activity of GroEL-GroES complex entails a series of allosteric transitions in structure, triggered by ATP binding and hydrolysis, described in Figure 1: In the absence of nucleotide binding, both rings assume the closed (T) state, designated as T/T state for the two rings. Cooperative binding of seven ATP molecules to the subunits in one of the rings, hereafter referred to as the cis ring, drives the conformational change of these subunits to the ‘open’ (R) state, thus leading to the R/T form of the cis/trans rings. The R/T form exposes a number of hydrophobic residues at the apical domains of the cis ring subunits. These groups attract the unfolded or partially folded peptide (substrate) to be encapsulated in the cylindrical chamber following the attachment of GroES (R′/T form of cis/trans rings). ATP hydrolysis provides the energy needed to process (unfold/refold) the substrate and leads to the state R″/T. This process is terminated upon binding of seven ATP molecules to the adjoining (trans) ring, hence the term ‘negative cooperative effect’ induced by ATP binding. The structure with ADP and ATP molecules bound to the respective cis and trans rings (R″/R form) favors the opening of the GroES cap and release of the peptide and ADP molecules to start a new cycle, this time, with the roles of the former cis and trans rings being inverted.

Figure 1

GroEL/GroES allosteric cycle.

GroEL/GroES allosteric cycle.

GroEL consists of two rings, cis and trans, which assume the states: T: ATP-free; R: ATP-bound prior to substrate (peptide) and co-chaperonin (GroES) binding; R′: ATP-, substrate- and GroES-bound; R″: ADP-, substrate- and GroES-bound. Subunits in the T state are shown in red, R in cyan; R′ in green, R″ in blue, and the cap in purple. ATP and ADP are shown by blue and orange boxes. Successive events/reactions along the cycle are (A) binding of seven ATPs to induce the binding of the unfolded substrate (orange), (B) co-chaperonin binding, (C) ATP hydrolysis, (D) ATP binding to trans ring subunits, (E) release of ADPs, substrate (folded or partially folded) and GroES from the cis ring, (F) initiation of a new cycle where the roles of the cis and trans rings are inverted. Top-middle and bottom-left structures are related by rigid body rotation. Diagrams were generated using the data from the PDB in PyMOL (http://www.pymol.org), except for the schematic views of the substrate and ligands included to provide a clearer description. The PDB ids for the structures T/T, R/T, R″/R, R′/T and R″/T are 1GR5, 2C7E and 1GRU [68], 2C7C [15] and 1AON [20], respectively. Of interest is to understand the molecular basis of the negative cooperative effect triggered upon binding of seven ATP molecules to the trans ring (step E in Figure 1). ATP binding induces in this case a structural change on the cap-binding region at a distance of 65 Å. Understanding the mechanism of this allosteric signaling is of fundamental importance because of the critical role chaperonins play in preventing aggregation and regulating folding vs. degradation events. Several studies have been published on the allosteric pathways and dynamics of GroEL [3]–[19] since the original determination of the ADP-bound complex (R″/T) [20]. These studies provide valuable insights into the successive states involved in the chaperonin cycle. The elucidation of allosteric transition mechanisms has been a challenge, however, both experimentally and computationally, due to the transient nature of the high energy conformers between the states, the multiplicity of pathways, and the large size of this biomolecular system.

Computational methods for exploring transition pathways

Several computational methods have been developed in the last two decades for exploring the structural transition pathways of biomolecules (for reviews see [21],[22]). These include methods based on minimizing path-dependent functionals [23],[24], stochastic path integration [25], MaxFlux method for identifying the path of maximum flux [26],[27], nudged elastic band method [28], and the determination of temperature-independent reaction coordinates by action minimization [29]–[31]. Other groups resorted to targeted molecular dynamics (TMD) simulations in the presence of holonomic constraints [11], [32]–[36], Monte Carlo- and MD-based methods for sampling ensembles of stochastic transition paths (e.g., [37],[38] and perturbation-targeted MD combined with umbrella sampling [39],[40]). Yet, the identification of the transition state(s) and accompanying conformational rearrangements are by and large inaccessible for systems of the order of megadaltons like GroEL. Coarse-grained models and methods appear as the only tractable tools in such cases. Perhaps the most comprehensive computational study of GroEL allosteric dynamics is that of Hyeon, Lorimer and Thirumalai, where the T→R→R″ transition has been examined by Brownian dynamics (BD) simulations using a state-dependent (Go-like) self-organized polymer model [18]. These simulations, performed for subunit dimers and heptamers, provided valuable insights on the heterogeneity of the transition pathways and the kinetics of salt bridges' formation/rupture at the successive transitions T→R and R→R″.

Elastic Network Models (ENMs)

Recent years have seen a revival in normal mode analyses (NMAs) of proteins, after realizing that coarse-grained models such as the ENM provide information on functional and robust modes [41]–[45]. ENMs have been recently used for exploring the transition pathways between known end conformations: Kim et al. proposed to interpolate between the end points by controlling the distances between the nodes, rather than the position vectors so as to avoid unrealistic deformation [46]. Onuchic and coworkers extended the concept of minimal frustration to transitions between basins modeled as ENMs, and examined the coupling between the strains in dihedral angles and local unfolding events (also termed cracking) [47],[48]. Maragakis and Karplus explored the transition of adenylate kinase between its T and R forms using a combination of steepest descent and conjugate peak refinement algorithms [49]. More recently, Chu and Voth presented a double-well ENM model for exploring the conformational transitions of G-actin and adenylate kinase [50]. The success of these studies suggests that ENM-based methods may serve as a first approximation for exploring the transitions between not-too-distant pairs of functional states. Here we present a new approach based on the anisotropic network model (ANM) [51],[52] to this aim, and apply it to the allosteric transitions of the GroEL-GroES complex (∼8,000 residues, or ∼1 MDa). The approach, referred to as daptive ANM ( ANM), utilizes the ANM modes to guide the complex along the directions intrinsically favored by its instantaneous inter-residue contact topology. The ANM is based on the following assumptions: (i) the experimentally resolved structures are assumed to represent an energy minimum, (ii) the structures are modeled as a network, the nodes of which represent the individual amino acids, (iii) the positions of the nodes are identified by those of the α-carbons, (iv) all pairs of nodes separated by a distance shorter than a cutoff distance of rc are connected by an elastic spring, (v) a uniform force constant is assumed for all pairs regardless of amino acid type. Thus, the structure is subject to a sum of harmonic potentials corresponding to all connected (bonded or non-bonded) amino acids. The structural coordinates are used to derive the 3N-6 orthonormal modes of motion, called ANM modes, uniquely defined for each equilibrium structure of N modes [51],[52]. The ANM is now widely used in exploring protein dynamics and assessing the global (lowest frequency modes) in particular, in view of the observed robustness and functional relevance of these modes [41]–[45], [53]–[55]. In the present study, the potential of the system is approximated by the sum of two harmonic ANM potentials adopted for the end states, along with a softening term that ensures a smooth transition between them. As will be explained below, the softening term (and the corresponding parameter β; see Methods) is not used in the aANM generation of the pathways, but in the comparison of the results against those predicted by other methods. Multiple paths (or subsets of ANM modes) exist for the passage between the endpoints. The recruitment of the particular subsets of modes results from a tradeoff between minimizing the path length and selecting the direction of the lowest increase in internal energy. The ‘shortest’ path is by definition the interpolation between the two endpoints. However, this path usually incurs unphysical strains in internal coordinates, and is, thereby, unfavorable from energetic point of view. In the other extreme case of movements along the lowest-lying (softest) modes, on the other hand, these particular mode directions may not necessarily lead to the target. The selected pathway will thus depend on the relative importance of environmental (path length) or internal (conformational energy barrier) constraints (see below). The results are organized in two sections: First (Section I), we present the results for the R″←→T transition of a single GroEL subunit (N = 514 residues). These calculations serve two purposes: benchmarking our results against those obtained by action minimization method [29], and estimating the effect of different choices of parameters. These data are then utilized in the second part (Section II) where we examine the intact chaperonin allosteric cycle and compare with experimental data and other simulations. Notably, a small subset of low frequency modes are found to drive the transition at early stages, succeeded by the recruitment of increasingly larger subsets of modes (yet not exceeding 4% of the entire spectrum) to overcome the energy barrier. Native contacts are closely maintained throughout significant portions of transition pathways, except for the close neighborhood of the energy peak, also called putative transition state (PTS), where a major redistribution in intra- and inter-subunit contacts takes place. The results provide new insights and testable hypotheses on the mechanistic involvement of conserved residue pairs in critical interactions.

Theory: Adaptive ANM (aANM)

Definitions

The method is based on the simultaneous generation of pairs of intermediate conformations, and , starting from the known endpoints and . Each conformation along the so-called reaction coordinate is represented by a 3N-dimensional vector, corresponding to the coordinates of α-carbons. The distance vector ( between the pair of conformations generated at iteration k is defined asand the deformation vectors used to generate the k pair of conformations are The original distance vector between the two optimally superimposed end structures is . This distance vector also defines the original root-mean-square deviation (RMSD), | (0)|/N, between the endpoints. The RMSD at iteration k is similarly defined as

Methodology

The ANM method consists of the following steps (see Figure 2):

Figure 2

Schematic description of aANM method.

Schematic description of aANM method.

Two sets of intermediate conformations are generated, and (1≤k≤k), starting from the known substates and , illustrated here for k = 1 and 2. The distance vector between the instantaneous endpoints at the k step is denoted as , and the deformation at each step is or . Dashed ellipses indicate isoenergetic contours. (i) Two sets of intermediates are generated, starting from both ends. The recurrence equation for evaluating the k intermediate starting from state A isand a similar expression holds for state B. Following Eq. (4), we evaluate on the basis of dominant (low frequency) eigenvectors () predicted by the ANM for the conformation . The displacement along the i eigenmode is proportional to the projection of the instantaneous distance vector on , and scaled by the step size . The step sizes s ( and s ( are simultaneously selected at each iteration k, as a fraction f of those, s ( and s (, that minimize (. The limit f→0 refers to infinitesimally small displacements that are strictly accurate but prohibitively expensive, while the other extreme case f→1 is the most efficient move, but may give rise to unphysical deformations in structural coordinates. f = 0.2 is selected here as a scaling factor that optimally balances between efficiency and accuracy (see ). (ii) The number of modes of motion recruited at iteration k is based on a threshold squared cosine, F, that defines the maximal angular departure between the instantaneous displacement direction and that targeted. To this end, we evaluate the cumulative squared cosine and we select the minimal number of modes, starting from the low frequency end of the spectrum, that satisfy the inequality [C(m ()]2 ≥F. It can be shown (see derivation in Text S1) that C(m () is identical to the correlation cosine between the instantaneous deformation and distance vectors, i.e., Therefore, the threshold F ensures the selection of the smallest subset of modes to drive the deformation of the molecule toward a direction that does not deviate by more than a specified correlation cosine (F ½ ) from the target direction (. Note that the use of the complete set of modes leads, by definition, to C(3N-6) = 1. By selecting a subset, we let the molecule undergo a structural change that is not necessarily toward the endpoint, but along the coordinates energetically favored by its architecture. Figure 3 illustrates the procedure for selecting m ( for the transition R″→T (step E in Figure 1). Results are shown for the aANM iterations k = 1 (top), 7 (middle) and 13 (bottom). The bars displays the correlation cosine as a function of mode number 1≤i≤25 (left ordinate), and the blue curve is the cumulative squared cosine [C(m ()]2 (right). For k = 1, the lowest frequency mode (i = 1) alone yields a correlation cosine of 0.82: it suffices, therefore, to take m (1) = 1 mode at this step to meet the criterion [C(m ()]2≥F , if the threshold F = 0.5. For k = 7, on the other hand, the same criterion is met by m (7) = 3 modes (see the red line), and for k = 13, we need m (13) = 23 modes.

Figure 3

Correlation cosine between instantaneous distance vector and eigenmodes.

Results are illustrated for aANM steps k = 1, 7 and 13 along the transition R″→T of a single subunit (subunit A in the respective PDB structures 1GRU and 1GR5). The left ordinate displays the correlation cosine between the distance vector d ( and the eigenvectors for 1≤i≤30 (black bars), and the right ordinate shows the corresponding cumulative squared cosine (Eq. (5)) (blue curve). The threshold F = 0.5 for the cumulative square cosine implies the selection of m (1) = 1, m (7) = 3, and m (13) = 23 in evaluating v ( as indicated by the red lines and filled bars. See Table 1 for the complete list of and values and associated RMSDs between intermediate conformations.

Correlation cosine between instantaneous distance vector and eigenmodes.

Table 1

aANM data for the transition of a GroEL subunit between R″ and T forms(*).

Iteration (k)	F_min = 0.4			F_min = 0.5			F_min = 0.6			F_min = 0.7
	m_R″ ^(k)	m_T ^(k)	RMSD (Å)	m_R″ ^(k)	m_T ^(k)	RMSD (Å)	m_R″ ^(k)	m_T ^(k)	RMSD (Å)	m_R″ ^(k)	m_T ^(k)	RMSD (Å)
0			12.33			12.33			12.33			12.33
1	1	3	11.00	1	4	10.74	1	4	10.74	2	6	10.56
2	1	3	9.77	1	3	9.57	1	4	9.46	2	4	9.16
3	1	3	8.80	1	3	8.48	1	3	8.37	2	4	7.97
4	1	3	8.01	1	3	7.61	1	4	7.37	3	4	6.96
5	1	3	7.36	1	3	6.93	3	5	6.42	5	6	6.12
6	1	3	6.77	3	4	6.16	3	4	5.70	5	7	5.29
7	3	4	6.04	3	4	5.57	5	4	5.11	6	7	4.66
8	3	4	5.46	5	4	5.09	6	4	4.53	13	7	4.14
9	3	4	5.01	6	6	4.50	11	5	4.04	23	12	3.68
10	5	4	4.58	6	7	3.98	15	6	3.60	41	16	3.27
11	6	6	4.18	13	7	3.56	23	10	3.26	73	33	2.91
12	6	7	3.73	17	7	3.21	32	16	2.96	112	44	2.59
13	10	7	3.34	23	7	2.96	64	27	2.68	210	119	2.31
14	14	7	3.06	31	16	2.70	87	39	2.43	377	259	2.06
15	17	7	2.82	54	22	2.49	156	72	2.20	576	455	1.83
16	24	12	2.62	75	35	2.30	266	169	1.99	906	727	1.63
17	32	17	2.44	111	43	2.12	422	312	1.81	1189	1118	0.87
18	52	25	2.27	164	89	1.95	651	488	1.64	-	-	-
19	72	35	2.11	267	172	1.80	953	742	1.49	-	-	-
20	98	41	1.98	385	295	1.66	-	-	-	-	-	-
21	137	68	1.86	545	442	1.53	-	-	-	-	-	-
22	192	124	1.74	769	639	1.05	-	-	-	-	-	-
23	284	207	1.64	-	-	-	-	-	-	-	-	-
24	388	304	1.54	-	-	-	-	-	-	-	-	-
25	498	443	1.44	-	-	-	-	-	-	-	-	-

(*): m ( and m ( are the number of modes recruited at step k, starting from the respective states R″ and T; RMSD values are calculated using Eq. (3). A and B designate the respective states T and R″. The first row indicates the original RMSD of 12.33 Å between the two end points.

Table 1 summarizes the m ( and m ( values for all steps along the R″→T transition for F = 0.4, 0.5, 0.6 and 0.7. Interestingly, an increasingly larger number of modes are recruited as we move away from the original equilibrium state. This important result, also confirmed for the intact complex, will be further elaborated below. (*): m ( and m ( are the number of modes recruited at step k, starting from the respective states R″ and T; RMSD values are calculated using Eq. (3). A and B designate the respective states T and R″. The first row indicates the original RMSD of 12.33 Å between the two end points. The ANM calculations thus involve two parameters, F and f. The former controls the direction of motion, and the latter its size. Smaller F values permit us to proceed via lower energy ascent directions, at the cost of longer excursions. Smaller f implies smaller displacements at each iteration. (iii) The above scheme is repeated to generate a series of intermediate conformations, until the RMSD between the intermediates becomes sufficiently small (1.5 Å in Table 1). The total number of iterations, k, is thus defined by this targeted RMSD. The sequence of conformations along the pathway is denoted as or .

Results/Discussion

I. Allosteric transitions of a single subunit

The allosteric cycle undergone by a given subunit can be summarized as T→R→R′→R″→T. Table 2A lists the RMSDs between these alternative states, derived from the coordinates of the cis ring subunits in different forms of the complex. The RMSD between R′ and R″ is 1.50 Å, which is lower than the resolution of these structures, while those between different states are larger than 12 Å, except for the T and R states which differ by 5.21 Å. A reasonable approximation in view of the similarity of R′ and R″, and the resolution of existing structural data, is to condense the allosteric cycle into T→R→R″ (or R′)→T (see for example [18]).

Table 2

RMSD values (in Å) between (A) different forms of a subunit and (B) different states of the intact GroEL.

A. Single GroEL subunit (*)
RMSD (Å)	1GR5_A (T)	2C7E_A (R)	2C7C_A (R′)	1GRU_A (R″)
1GR5_A (T)	–	5.21	12.06	12.33
2C7E_A (R)	5.21	–	12.23	12.35
2C7C_A (R′)	12.06	12.23	–	1.50
1GRU_A (R″)	12.33	12.35	1.50	–

(*): based on α-carbons. The states refer to cis ring subunit A (indicated by suffix _A).

(*): based on α-carbons. The states refer to cis ring subunit A (indicated by suffix _A). Toward exploring the suitability of aANM for predicting the events T→R→R″→T, we first examine the intrinsic ability of the subunits in the T, R and R″ forms to undertake these particular changes (steps B, D and E, in Figure 1). Second, we focus on the transition R″→T, before proceeding to the intact chaperonin in section II.

Intrinsic dynamics of T, R and R″ states favor functional changes in structure

The lowest frequency modes, also called global modes, represent by definition the collective motions that are most easily accessed near a given equilibrium state, or those intrinsically favored by the 3D structure [45]. These are soft modes along entropically preferred reconfigurations [56]. We will analyze each of the three steps in T→R→R″ (or R′)→T to see to what extent the corresponding changes in structure, which are functionally required, comply with these modes. Results are summarized in Table 3 and Figure S1. As seen in Figure S1, the structural changes induced by the slow modes allow for substantial decrease in the distance (RMSD) from the endpoint except for mode 1 for the step R→R″ (middle panel on the left) where the mode direction is apparently almost orthogonal to (0). Table 3 lists the closest RMSDs that can be achieved upon moving along one, two, or three low frequency modes, starting from either end, or proceeding simultaneously from both ends. Strikingly, in all cases, the three modes permit the molecule to complete more than half of the structural change in the ‘functional’ direction, confirming that the subunits are intrinsically poised to reconfigure in the ‘right’ way. Below are more details. For clarity, the entries discussed below are highlighted (in boldface) in the table and more details can be found in the Supporting Information.

Table 3

Contributions of the lowest frequency modes to the cycle T→R→R″→T (a).

Step	Starting structure	Target structure	Original RMSD (Å)	RMSD by the 1^st ANM mode			RMSD by two ANM modes(b)			RMSD by three ANM modes(b)
				RMSD(Å)	F_min	modes	RMSD(Å)	F_min	modes	RMSD(Å)	F_min	modes
T to R	1GR5_A (T)	2C7E_A (R)	5.21	3.40	0.57	1	3.05	0.65	1+2	2.83	0.70	1+2+3
	2C7E_A (R)	1GR5_A (T)	5.21	2.83	0.70	1	2.70	0.73	1+4	2.65	0.75	1+4+6
	Deform simultaneously		5.21	2.74			2.61			2.53
R to R″	2C7E_A (R)	1GRU_A (R″)	12.35	12.11	0.04	1	8.19	0.55	1+2	7.30	0.65	1+2+3
	1GRU_A (R″)	2C7E_A (R)	12.35	8.84	0.49	1	7.65	0.61	1+2	7.29	0.65	1+2+3
	Deform simultaneously		12.35	7.57			6.02			4.19
R″ to T	1GRU_A (R″)	1GR5_A (T)	12.33	7.02	0.68	1	6.61	0.71	1+2	6.05	0.76	1+2+5
	1GR5_A (T)	1GRU_A (R″)	12.33	12.03	0.05	1	8.78	0.49	1+3	7.38	0.64	1+3+4
	Deform simultaneously		12.33	6.49			6.19			4.91

Lowest frequency modes that exhibit a correlation cosine of >0.1 with (0) are selected. Note that these are all confined to the lowest frequency six ANM modes (see listed mode numbers).

The RMSDs refer to those between the intermediates generated by moving along the 1st ANM mode (columns 5–7), two modes (8–10) and three modes (11–13) in iteration k = 1 (see Eq. (3)). For each step (1st column), the results are separately given for the reconfiguration of the forward (first row), backward (2nd row) and simultaneous (3rd row) passages between the two endpoints. Lowest frequency modes that exhibit a correlation cosine of >0.1 with (0) are selected. Note that these are all confined to the lowest frequency six ANM modes (see listed mode numbers). Step T→R. The T state exhibits a strong tendency to approach the R state. The change is indeed a reconfiguration along the first mode predicted by the ANM for the T state. This type of transition occurs both in steps A and D of the allosteric cycle (Figure 1) at the respective cis and trans ring subunits. As the T subunit changes its conformation along mode 1 alone, the RMSD from state R decreases from 5.21 to 3.40 Å, and upon further recruiting modes 2 and 3, the RMSDs decrease to 3.05 and 2.83 Å, respectively. Interestingly, the end state (R) also tends to move toward the state T, if deformed along its mode 1. These results support the view that the T form possesses an intrinsic tendency to assume its ATP-bound conformation R, prior to ATP binding, and it is also readily recovered upon nucleotide release. Step R→R″ (steps B and C in Figure 1). In this case, the RMSD of 12.35 Å between the end states cannot be reduced upon moving along mode 1. Instead mode 2 appears to play a major role, to decrease the RMSD to 8.19 Å. This step is triggered upon ATP hydrolysis, which apparently favors mode 2. Yet, it is remarkable that the contribution of three modes from both ends is sufficient to reduce the RMSD to 4.19 Å. Step R″→T (step E in Figure 1). Results illustrated in Table 1 already showed that the reconfiguration of R″ is initiated via a deformation along the first mode. This mode indeed yields a correlation cosine of 0.82 with the targeted direction (0) (see Figure 3), and Table 3 shows that the original RMSD of 12.33 Å decreases to 7.02 Å upon moving along mode 1, exclusively. As to the backward transition, the first mode of the T state induces a movement almost orthogonal to (0). Yet, recruitment of the first three modes from both ends has a dramatic effect, as an RMSD of 4.91 Å is reached. The above analysis shows the utility of taking steps along the ANM modes so as to move efficiently toward the endpoint. Slowest modes refer here to lowest-lying eigenvalues, or softest modes. Recruiting three modes from both ends leads to intermediate states that differ by 2.53, 4.19 and 4.91 Å, as opposed to the original RMSDs of 5.21, 12.35 and 12.33 Å between the end points of the respective three steps T→R→R″→T. The extent of productive reconfiguration, or the fraction of the total path travelled (more than ½) by taking steps along these three modes is remarkable, given that (i) these represent 3/1,542 of the complete set of 3N-6 normal axes/directions that could be selected by the starting conformation, and (ii) the ANM modes are solely based on the distribution of inter-residue contacts.

R″→T transition analyzed by aANM: Path lengths vs. energy barriers

Having established the use of moving along ANM modes, we now proceed to an iterative use and (re)generation of ANM modes according to the aANM algorithm. In order to assess the effects of the choice of aANM parameters and establish default values, which will also be adopted for the intact GroEL (see below), we repeated aANM calculations for different F values. Results for the transition R″→T are shown in Figure 4. Panel A displays the gradual decrease in the RMSD between the instantaneous pairs of intermediate conformations (see Eq.(3)) as a function of iteration number k, and panel B the change in energy involved in each case. It can be seen that lower F values allow for larger excursions away from the targeted direction by recruiting relatively smaller numbers of low frequency modes. They consequently require a larger number of steps to be undertaken to reach the target, while the accompanying energy increase is relatively smaller. Higher F, on the other hand, permits us to reach the target faster, but with a higher energy cost. The limit F = 1 corresponds to pure interpolation by recruiting all modes.

Figure 4

R″→T transition for a single subunit of GroEL.

R″→T transition for a single subunit of GroEL.

(A) RMSD values, |d (| /N, between instantaneous endpoints plotted as a function of iteration number k. The end states refer to subunit A in the PDB structures 1GRU and 1GR5. Results are shown for F = 0.4, 0.5, 0.6 and 0.7, corresponding to the allowable angular deviations of up to 50.8°, 45.0°, 39.2° and 33.2°, respectively, between and v ( (or v (). (B) The energy profile for alternative pathways in arbitrary units. Note the significantly lower energy barrier compared to the interpolation (orange curve) between the endpoints. The black curve refers to the SDP trajectory. The reaction coordinate refers to the normalized projection of the instantaneous displacement on the original distance vector. (C) Series of conformations sampled along the reaction coordinate. The diagrams are colored by domains (equatorial, blue; intermediate, green; apical, red). (D) Movements of helices H, K and M sampled along the transition pathway. Three conformations are shown at decreasing transparency levels, starting from R″ (lightest color), to T (darkest). The reaction coordinate in panel B is the projection of the cumulative displacement ( = (− (0) on the original distance vector (0), i.e., x(n) = (0). (/| (0) |2, with R″ and T representing the respective limits x(n) = 0 and 1. The peak in the energy profile tends to be closer to the T state, especially when lower F values (which entail lower energy barriers) are adopted. This may be related to the recruitment of higher modes (steeper ascent along the energy surface) near state T, as opposed to the first mode near R″. Note that in the allosteric transition of hemoglobin between the T and R′ states, T was found to be more predisposed to move toward R″, than R″ undergoing the reverse transition [56],[57]. In particular, the value F = 0.5 leads to a peak around x(n) = 0.52, in accord with the steepest descent path (SDP) derived by action minimization (black curve in Figure 4B), as will be further elaborated below.

Sequence of events

A series of conformations visited along the transition pathway are displayed in Figure 4C for F = 0.5. We observe that, if the E domain (blue) is fixed as a reference, the A domain (red) closes down up and twists about the axial direction, and the intermediate (I) (green) domain moves away. Helix M (386–409) on the I domain, and helices H (233–244) and K (338–354) on the A domain are highlighted in Figure 4D (left diagram) and shown at three successive times from different perspectives (three diagrams on the right) to provide a clearer description of their movements. The three color shades, from light to dark, describe three snapshots along the reaction coordinate. The movement of the I domain, and the helix M in particular, seen on the left panel, will be shown below to be crucial in forming/breaking critical contacts during the allosteric transitions of the intact GroEL. An interesting observation is the sequential order of events: first the E and I domains almost stick to each other and move coherently as a single rigid body, while the A domain undergoes an upward tilting and simultaneous twisting. These movements of the A domain are completed in the first half of the transition pathway from T to R″. Then, slight rearrangements in the relative positions of the E and I domains occur, which expose the top portion of the A domain to bind the substrate and GroES. This sequence of events is consistent with the two-stage transition explored by the TMD simulations [11]. See the animations on http://www.ccbb.pitt.edu/People/yzheng/ for more details.

Which modes?

The number of modes involved in the transition between substates has been a question raised in a number of studies [58],[59]. Table 1 lists the number of modes, and , recruited at each step as the molecule travels between the end points. An important observation is the increased involvement of higher frequency modes as we proceed away from the original state. In another words, the slow modes play an important role at the initial stages of deformation, and continue to play a role throughout the entire trajectory, although they are gradually complemented by increasingly larger subsets of higher frequency modes. The dominance of low frequency modes is consistent with the previously noted driving role of large scale motions in GroEL allostery [18]. Due to the limits of applicability of the harmonic potential, it is arguable how far away from the original energy minima we can extend the quadratic approximation. In the trajectory of F = 0.5, after 10 iterations (Table 1), the distance between the instantaneous intermediates ( () falls from 12.33 Å to less than 4 Å. Only six low frequency modes out of the complete set of 1536 ANM modes accessible to the starting conformation (R″) are recruited to reach this stage, along with seven slowest modes accessible to state T. Low frequency modes are known to be robustly determined by the overall shape and contact topology of the examined structure, i.e., they are insensitive the specific interactions and nonlinear effects, hence the broad use of ENMs in NMAs in recent years [42],[60]. This property emphasized in several reviews [41],[53],[54] is also supported by the close agreement between the low frequency modes from quasiharmonic analysis of MD trajectories and those from coarse-grained NMA; see, for example, references [61] and [62]. The fact that such a large portion (up to 2/3) of the reconfiguration occurs via these robust modes lends support to the applicability of aANM. A small portion near the PTS needs, however, to be interpreted with care, as will be further analyzed below. In a strict sense, the normal modes provide information on the direction of motion near an energy minimum, and steps along these modes will be accurate to the extent that they are infinitesimally small. In order to examine how the step size s ( (or s () affects the course of structural reconfiguration, we repeated our calculations with various scaling factors f in the range 0.1deformations in backbone geometry. In particular, we monitored the Cα-positions and Cα-Cα bond lengths, which refer to bonded and non-bonded interactions in the coarse-grained model used here, to verify that the conservative steps generated with this scaling factor avoid steric clashes between residue pairs and unrealistic fluctuations in bond lengths. See more details (Figure S3 and Table S1) in the Supporting Information.

Comparison with the results independently obtained by action minimization (SDP)

Toward assessing the consistency of our results with those predicted by other analytical methods, we examined the transition pathway predicted for the same model by the action minimization algorithm [29],[63]. The resulting trajectory, called the steepest descent pathway (SDP), led to the energy profile shown by the black curve in Figure 4B. The barrier height is slightly lower than that obtained by aANM, while the shape of the energy profile shows close resemblance: a peak around x(n) = ∼0.52, preceded by minimal energy cost for an extended portion of the reaction coordinate, followed by a sharp increase near the peak, and then a smooth decrease after the peak. Toward a more critical analysis of the modes that contribute to the SDP, we reorganized the SDP trajectory (consisting of 46 frames) into a series of k = 9 (macro)steps by collapsing each set of five consecutive frames into a macrostep (Figure 5A) and we calculated the deformation vector Δ SDP = SDP− SDP for each macrostep. The following questions were raised: Which ANM modes effectively contribute to these macrosteps? Do SDP macrosteps exhibit the same tendency as aANM to originally proceed via softer modes and gradually recruit increasingly larger subsets of modes? How similar are the conformations visited along the aANM and the SDP? To this aim, we evaluated the correlation cosine between Δ SDP and the ANM modes (1) and (1) accessible to original states (0) and (0). The results are shown as a function of mode index i in the respective panels B and C of Figure 5. The correlation cosines represent the relative contributions of the intrinsically accessible ANM modes to the SDP macrosteps. In accord with the results from aANM, only very few modes at the low frequency end of the spectrum contribute to the SDP macrosteps in the close neighborhood of the original states (red plots). The slow modes contribute by almost by the same amount as those observed in aANM at the successive stages of the transition pathway. The contribution of higher frequency modes, which is negligibly small at early stages, gradually increases, consistent with the aANM.

Figure 5

Comparison with the results from steepest descent pathway (SDP) based on action minimization.

(A) Fragmentation of the SDP pathway for the transition 1GRU←→1GR5 of a subunit into nine macrosteps, consisting each of five frames. Same color scheme is adopted in panels B and C. (B) Correlation between SDP macrosteps and ANM modes accessible to the original conformation . (C) Same as panel B, for the right portion of the trajectory, i.e. the reconfiguration from 1GR5_A to 1GRU_A using the eigenvectors generated for 1GRU_A. Note that the early macrosteps from both directions are accounted for by a few slowest ANM modes, while increasingly higher modes are being recruited as the molecule proceeds away from its original conformation, consistent with the results found by aANM (see Table 1). (D) RMSD values between the intermediate conformations sampled by the aANM and SDP methods. The aANM results refer to the trajectory F = 0.5. The RMSDs between pairs of intermediates remain lower than 2.0 Å at all steps (see the color-coded scale on the right).

Comparison with the results from steepest descent pathway (SDP) based on action minimization.

II. Allosteric transitions of the intact chaperonin

Intrinsic dynamics of the T/T, R/T, R′/T and R″/R states

Calculations performed above for a single subunit demonstrated that the transition pathways between the functional forms of the single subunit can be delineated by the aANM algorithm. In particular, Table 3 and Figure S1 showed that a few low frequency modes can account for a significant portion of the structural changes that take place between the functional forms. We now explore the allosteric transitions in the intact chaperonin. We begin by examining if/how the low frequency modes play equally an important role in the allosteric dynamics of the intact GroEL. The counterparts of Table 3 and Figure S1 for the intact chaperonin are Table 4 and Figure S2. Here, the major difference is the fact that the changes occur via non-degenerate modes that maintain the heptameric symmetry of the GroEL structure, similar to the previously noted [64],[65] dominant role of non-degenerate ANM modes in enabling the maturation of icosahedrally symmetric viral capsids. Results are listed for the first 1, 3 and 6 non-degenerate modes. Note that the set of ANM modes accessible to the GroEL complex is larger than that of the single subunit by a factor of 14. We might thus expect to recruit a larger number of modes in the low frequency regime to achieve the same fractional contribution to observed changes. However, it is again striking to observe that six non-degenerate modes are sufficient to generate intermediates that are almost half way through the transition between the end points. The original RMSD values of 6.72, 10.85 and 11.54 Å between the end states of the three respective steps T/T→R/T, R/T→R′/T and R″/R→T/R are reduced to 3.26, 6.35 and 7.87 Å upon moving along these modes (see Table 4). The gradual decreases in the RMSDs as the intact chaperonin reconfigures along these modes are shown in Figure S2.

Table 4

The contribution of lowest frequency modes to the three major steps of the chaperonin allosteric cycle(a).

Step	Starting structure	Target structure	Original RMSD (Å)	RMSD by the 1^st non-degenerate mode			RMSD by the first 3 non-degenerate modes			RMSD by the first 6 non-degenerate mode
				RMSD(Å)	F_min	modes	RMSD(Å)	F_min	modes	RMSD(Å)	F_min	modes
T/T to R/T	1GR5	2C7E	6.72	5.54	0.32	1	4.91	0.46	1+10+11	3.97	0.65	1+10+11+28+51+61
	2C7E	1GR5	6.72	5.83	0.24	3	4.66	0.52	3+12+20	3.42	0.77	3+12+20+35+49+56
	Deform simultaneously		6.72	5.54			4.65			3.26
R/T to R′/T	2C7E	2C7C	10.85	10.34	0.09	3	9.96	0.15	3+17+20	6.35	0.66	3+17+20+29+44+49
	2C7C	2C7E	10.85	10.68	0.03	3	9.96	0.16	3+4+10	9.34	0.27	3+4+10+20+21+22
	Deform simultaneously		10.85	10.34			9.77			6.35
R″/R to T/R	1GRU	2C7E′ (b)	11.54	11.22	0.05	3	9.35	0.34	3+10+17	8.34	0.48	3+10+17+20+22+30
	2C7E′	1GRU	11.54	11.19	0.06	3	9.33	0.35	3+20+27	8.52	0.46	3+20+27+29+32+35
	Deform simultaneously		11.54	11.19			8.43			7.87

Major steps are those involving an RMSD larger than 3.2 Å (∼resolution of some structures) between the end points.

2C7E′ is same as the PDB structure 2C7E, except for the inversion shown in step F of the allosteric cycle in Figure 1.

Major steps are those involving an RMSD larger than 3.2 Å (∼resolution of some structures) between the end points. 2C7E′ is same as the PDB structure 2C7E, except for the inversion shown in step F of the allosteric cycle in Figure 1.

Energy profile and operating modes

We now proceed to the aANM study of the transition T/T→R/T→R″/R. These three states are selected because they differ by an RMSD of 6.72 Å and 11.03 Å, respectively (see Table 2 part B). R′/T and R″/T differ from R″/R by less than 3 Å, which is comparable to the resolution of the structures, hence their exclusion from the above scheme. The results are presented in Figure 6 and Table 5. The energy profile along the transition R/T→R″/R (Figure 6A) exhibits a shape similar to that obtained with the single subunit; the PTS is closer to the R″/R state (x(n) = 0.58), and the barrier is much lower than that encountered upon interpolation (orange curve) between the end points.

Figure 6

Transition among T/T→R/T→R″/R forms of the intact GroEL complex.

Table 5

aANM data for the transition of Intact GroEL complex(*).

k	1GR5 (T/T) to 2C7E (R/T)			2C7E (R/T) To 1GRU (R″/R)
	m _T/T ^(k)	m _R/T ^(k)	RMSD (Å)	m _R/T ^(k)	m _R″/R ^(k)	RMSD (Å)
0	–	–	6.72	–	–	11.03
1	51	20	6.17	49	121	9.70
2	51	35	5.67	49	122	8.53
3	61	51	5.04	49	139	7.58
4	89	51	4.55	49	142	6.86
5	94	65	4.11	54	181	6.14
6	118	104	3.74	79	188	5.58
7	218	118	3.40	120	189	5.07
8	238	183	3.10	95	209	4.61
9	282	205	2.80	86	310	4.21
10	326	229	2.55	142	338	3.80
11	425	282	2.32	198	384	3.44
12	515	346	2.14	240	471	3.10
13	620	510	1.95	318	576	2.80
14	942	616	1.80	321	731	2.53
15	-	-	-	363	1054	2.31
16	-	-	-	519	1380	2.13
17	–	–	–	636	1869	1.93
18	–	–	–	792	2281	1.23

(*): k = 0 refers to the original RMSD between the end points. Results are obtained with F = 0.5.

Transition among T/T→R/T→R″/R forms of the intact GroEL complex.

(A) Energy profiles of the intact GroEL complex (R/T→R″/R) along the reaction coordinate computed by aANM (blue) and Cartesian interpolation (orange) using the double-well potential given by Eq. (7). (B) Contribution of different modes at various steps (1, 6, 11 and 15) along the transition R/T→R″/R. Broader numbers of higher frequency modes are recruited as the structure approaches the energy barrier (see Table 5). (C) Top view of structures sampled along the transition. Snapshots corresponding to conformations , , , and are shown. (D) Side view of the same structures. (E) Close-up views of pairs of adjacent subunits. The diagrams in panels C–E are color-coded according to the mobilities of residues (red: most mobile; blue: almost fixed). Note that the equatorial domains of cis ring subunits are almost fixed, while the largest motions occur at the apical domains of the same subunits. (*): k = 0 refers to the original RMSD between the end points. Results are obtained with F = 0.5. Figure 6B displays the histograms of modes contributing at various iterations (k = 1, 6, 11 and 15), starting from the R/T state. The tendency to recruit higher modes as the PTS is approached is clearly seen, consistent with the trend observed in the single subunit with both aANM and SDP. Table 5 shows that the RMSD between the intermediate conformations reduces to 1.93 Å by selecting modes from amongst the slowest 636 modes. More detailed examination shows that among them 97 non-degenerate modes contribute to 90% of the structural change. This subset amounts to less than 0.5% of the spectrum of ANM modes (∼2.2×104 of them). Panels C, D and E display the evolution of the intact structure viewed from the top (C), and from the side (D), along with a close-up view of two adjacent subunits on the cis ring (E). See http://www.ccbb.pitt.edu/People/yzheng/ for movies. The trans ring is generally observed to undergo moderate changes between the T to R states. The cis ring, on the other hand, undergoes concerted rotations and extensions at the apical and intermediate domains. Notably, the intermediate domains move toward the cleft between neighboring equatorial domains, while the apical domains extend along the vertical (cylindrical axis) direction accompanied by a rotation about the same axis. These motions result in the enlargement of the central cavity along with the exposure of the apical domains on the cis ring to bind the GroES-binding flexible loop K15-T36.

Redistribution of inter-subunit salt bridges and implications on intra-ring cooperativity

Salt bridges at the interface between the cis and trans rings have been pointed out in site-directed mutagenesis experiments by Saibil and coworkers to play a critical role in communicating allosteric signals between the two rings and the co-chaperonin binding site [12]. The redistribution of critical interactions accompanying the global changes in structure has been proposed to be a possible mechanism for regulating allosteric signal transduction [66]. Here we examine more closely the changes in the distances between salt-bridge forming residues during the transitions of the intact chaperonin, and discuss our results in relation to previous experimental and computational data. Experiments conducted with Arg197Ala mutant [67] pointed to the functional role of the salt bridge between E386 and R197 on adjacent subunits in the cis ring. This inter-subunit salt-bridge was also noted in early cryo-EM studies [68]. It forms in the T state of the ring (1GR5) (Figure 7A and 7B), and it has been proposed to be an essential component of the positive intra-ring cooperativity [69]. The R″ state of the same ring (1GRU) indicates, on the other hand, a new salt bridge, formed between E386 and K80 (on the E domain of the neighboring subunit) (Figure 7D). The aANM results shed light to the mechanism of this interchange of salt bridges. The separation between E386 and R197 α-carbons, originally equal to 15.7 Å (their charged ends being separated by 3.2 Å), gradually increases by the downwards motion of the intermediate domain. After 18 iterations (for T/T→R/T), the distance between E386 and R197 becomes larger than that between E386 and K80, and K80 replaces R197 to form a salt bridge with E386 (Figure 7C). Afterwards, R197 moves dramatically away from E386, led by the opening of the apical domain. Meanwhile the downward movement of helix M continues until the distance between the Cα-atoms of the new salt-bridge-forming residues E386 and K80 reduces to 8.6 Å.

Figure 7

The cis ring inter-subunit interactions during the transition T→R″, based on the intact GroEL structure calculation.

The cis ring inter-subunit interactions during the transition T→R″, based on the intact GroEL structure calculation.

(A) Intersubunit interface near the intermediate domains (green) of two adjacent subunits in the cis ring. The backbones are shown in cartoon view and colored by domains: A (orange), I (green), and E (blue). Backbone atoms of three charged residues are shown by spheres. Positively and negatively charged residues are colored blue and red, respectively. (B) The inter-subunit hydrogen bond, E386-R197, in the T state of the cis ring (1GR5). (C) During the transition to state R″/R, residue E386 in the I domain moves towards K80 (blue sphere) in the E domain of the adjacent subunit, while R197 on the A domain moves away from E386. (D) The final configuration in the R″ state of the cis ring, represented by 1GRU. Residue E386 now forms a new hydrogen bond with K80. These intra-ring rearrangements are in accord with previous TMD simulations [11]. However, TMD runs showed heterogeneous subunit behavior, while the aANM trajectory retains the sevenfold rotational symmetry. In each ring, the movements of all subunits are identical and coherent (Figure 6C). In Figure 6B we noted that not all slow modes in the lower end of the spectrum contribute to the transition. Instead a subset inducing cylindrically symmetric changes is recruited. For example, in the first iteration, while 51 modes are recruited, the contributions of modes 2 through 9 are almost zero, and we have large contributions from the non-degenerate modes 1, 10, 11, 28 and 51 (Table 4). Our results support the Monod-Wyman-Changeux (MWC) view [70],[71] of cooperative, symmetric (non-degenerate) modes of motion controlling allostery. Likewise, GroEL engineering experiments provide evidence for the concerted nature of the allosteric transition [69].

Comparison with the results from recent BD simulations

A recent examination of the time evolution of salt-bridges formed/broken at the steps T→R and R→R″ of the GroEL allosteric cycle brought attention to a number of key interactions [18]. Those results have been obtained by Hyeon, Lorimer and Thirumalai, by performing BD simulations, for a dimer, using state-specific Go-like potentials for the end points [18]. Therefore, they differ from ours in their method, model and parameters. Yet, in order to see if similar trends could be detected in spite of the differences in the methodology, we undertook a careful examination of the formation/breakage of salt bridges along the aANM predicted reaction coordinates. Figure 8 displays our results for the respective T→R (panel A) and R→R″ (panel B) transitions of cis ring subunits obtained by aANM for the intact chaperonin. These are the Cα-Cα distances between intra- or intersubunit salt-bridge forming residues, plotted as a function of reaction coordinate. Interestingly, particular pairs or residues exhibit a smooth increase/decrease in their distance, whereas others are more sluggish. The former group of pairs which readily associate/dissociate may be viewed as driving interactions. We notice, for example, the smooth decrease in K80-D359 distance along with increase in D83-K327 distance (panel B), indicative of the compensating formation and breakage of these two intra-subunit salt bridges prompted at an early stage of the R→R″ transition. These changes undergone early on are in sharp contrast to the salt bridges involving E257 and E386, which exhibit strong resistance to dissociate or associate (panels A and B). The leading role of K80-D359 and D83-K327 is in agreement with the mechanism observed in BD simulations [18]. The formation of the K80-D359 salt-bridge has been pointed out therein to be the major driving force for the R→R″ transition [18]. Also, this formation has been pointed out to be coupled to the dissociation of D83-K327 [18]. Other similar features include the fast increase in R58-E209 and decrease in P33-N153 distances at short times during T→R, in contrast to the slower responses of D83-K327 and E257-R268 at the same stages (panel A), succeeded by the high stability of P33-N153 during R→R″, the resistance of E257-K321 and E257-R322 to come closer at the initiation of T→R, in contrast to their predisposition to dissociate at the onset of R→R″. Finally, it is interesting to notice that the non-monotonic behavior of the pair E257-R268 in panel B conforms to previously observed [18] ensemble-averaged behavior for the same pair; while that of R197-E386 observed here departs from the smooth increase previously reported.

Figure 8

Changes in the distances between salt-bridge forming pairs along the aANM reaction coordinate.

Results are shown for (A) T→R and (B) R→R″ transitions. See Figure S4 for the corresponding time dependences, and Table 6 for the kinetic expressions and the comparison with the results from BD simulations by Hyeon et al. [18].

Changes in the distances between salt-bridge forming pairs along the aANM reaction coordinate.

Table 6

Comparison of the kinetics of salt-bridge forming residues obtained by aANM and BD simulations(a).

Transition	Salt Bridge	Time dependence of C^α-C^α distance ( aANM) (b)	R² (aANM)	<d(t)> (from reference [18])	Correlation coefficient (c)
T→R	D83-K327	11.8−2.3e ^−t/101.1	0.91	13.4−4.9 e ^−t/100.0	0.999
	P33-N153	6.8+5.8 e ^-t/7.7	0.99	7.3+4.2e ^−t/6.3	0.998
	R284-E386	30.7−9.9e ^−t/15.8−10.4e ^−t/84.2	0.99	29.7−8.1e ^−t/20.8−8.4e ^−t/85.8	0.999
	R285-E386	Reference curve	–	28.4−6.6e ^−t/19.1−8.1e ^−t/88.8	1
	R197-E386	22.4−4.1e ^−t/20.0−6.4 e ^−t/307.2	0.99	20.9−2.6e ^−t/0.67−6.4e ^−t/96.7	0.975
	K80-E386	11.0+10.1e ^−t/14.1+4.0e ^−t/2.5	0.99	10.4+7.6e ^−t/12.1+2.2e ^−t/61.8	0.993
	E257-R268	22.5−169.2e ^−t/58.3+151.9e ^−t/62.0	0.99	21.8−4.2e ^−t/26.2−7.9e ^−t/66.4	0.998
	E257-R322	10.3+8.1e ^−t/24.8	0.99	N/A	–
	E257-K321	10.1+10.7e ^−t/26.6	0.99	N/A	–
R→R″	D83-K327	Reference curve	–	37.3−26.9 e ^−t/77.9	1
	E257-K321	58.1−55.1e ^−t/73.1	0.94	N/A	–
	E257-R322	54.6−50.7e ^−t/76.2	0.94	N/A	–
	E257-K246	51.6−47.6 e ^−t/95.8	0.86	N/A	–
	K80-D359	15.1+25.2e ^−t/37.4	0.98	14.1+26.4e ^−t/28.0	0.991

BD simulations results were reported by Hyeon et al. (2006) [18]; time (t) in microseconds.

aANM results are reported for all salt-bridges that exhibited a monotonic time dependence (single or double exponential) with R2>0.85.

between the two sets of data (aANM and BD) for each salt bridge.

In the interest of making an even more stringent, quantitative comparison between the two sets of data, we have considered the relative rates of formation/rupture of different salt bridges. Our approach does not contain ‘time’ explicitly. Instead we have the ‘number of iterations’. In order to evaluate the relative time dependence of the salt bridge formation/breakage events, we normalized one of our curves, called ‘reference curve’, with respect to its counterpart obtained [18] by Hyeon et al. (2006). This permitted us to plot the accompanying time dependence of all other salt bridges as a function of time (see Figure S4) and determine the best fitting kinetic data. The results, compiled in Table 6 exhibit striking quantitative agreements between the aANM results and those from BD simulations. BD simulations results were reported by Hyeon et al. (2006) [18]; time (t) in microseconds. aANM results are reported for all salt-bridges that exhibited a monotonic time dependence (single or double exponential) with R2>0.85. between the two sets of data (aANM and BD) for each salt bridge.

Critical interactions formed/broken at the transition involve conserved residues

The above results (Figure 8) reveal that certain interactions play a key role in stabilizing particular states, once formed. For example, the inter-subunit E257-R268 salt bridge appears to be crucial for the T state of the cis ring; whereas the inter-subunit hydrophobic contact I305-A260 seems critical to stabilizing the R″ state. These interactions are usually disrupted only at the PTS. Toward a more systematic assessment of such critical interactions, we focused on the changes in native contacts occurring at the PTSs. A set of 2028 native contacts are shared between the R and R″ states (native contacts being defined as Cα-Cα distances lower than 7 Å, in the present model) while the two respective states exhibit 137 and 105 distinctive native contacts. The questions are: How does the number of native contacts vary along the transition pathway? Which contacts among those differentiating the two end structures are maintained along the trajectory and which ones are broken or formed only at the transition state? Figure 9A displays the time evolution of native contacts observed for a single subunit during the transition R→R″; and panel B shows its counterpart for inter-subunit contacts observed in the complex. Clearly, most of the native contacts remain unchanged throughout a large portion of the trajectory, while significant changes occur near the PTS, both with regard to the disruption (top panel) and formation (bottom panel) of contacts. Trajectories generated by varying aANM model parameters in the ranges 0.4

Figure 9

Evolution native contacts along the structural transition from R to R″ states.

The number of intra-subunit (panel A) and inter-subunit (panel B) native contacts that are disrupted (upper panel) and formed (lower panel) vs. the reaction coordinate. The results refer to F = 0.5 for cis ring subunits along the transition from 2C7E (R/T) to 1GRU (R″/R). Each bar represents the number of native contacts formed/broken at a given aANM iteration. Note the sharp increase near the energy barrier. See Figure 10 for the corresponding critical contacts.

Evolution native contacts along the structural transition from R to R″ states.

Figure 10

Redistribution of inter-residue contacts at the transition from R to R″ state.

A closer examination reveals the residues involved in critical interactions tend to be strongly, if not fully, conserved. Table 7 lists the inter-subunit residue pairs broken/formed at the transition, also called ‘critical intra-ring contacts’, along with their conservation scores evaluated using ConSurf [72],[73]. The E domains make indeed many more inter-subunit contacts than the A domains (Figure 10). As a result the A domains can move independently of one another, while the E domains are constrained [18], which explains their breakage at the critical state only. It is also interesting to note that the only I domain residues that make critical contacts (Ala384 and Thr385 next to the N-terminus of the M helix) undergo large reorientations (Figure 4D). We also note the segment A384-V387 that has been pointed to play an important role in mediating positive intra-ring cooperativity [16].

Table 7

Critical inter-subunit contacts broken/formed during R→R″.

	Chain A		ConSurf score(b)	Chain B		ConSurf score(b)
	Residue type	Index(a)		Residue type	Index(a)
Breaking	VAL	39 (E1)	0	GLU	518 (E2)	9
	ASP	41 (E1)	9	MET	520 (E2)	9
	GLY	45 (E1)	9	GLN	72 (E1)	9
	ALA	46 (E1)	0	MET	69 (E1)	9
	ALA	46 (E1)	0	GLU	76 (E1)	9
Forming	ALA	26 (E1)	9	CYS	519 (E2)	9
	VAL	29 (E1)	9	GLU	518 (E2)	9
	LYS	34 (E1)	0	MET	114 (E1)	9
	ARG	36 (E1)	9	THR	517 (E2)	9
	ASN	37 (E1)	9	CYS	519 (E2)	9
	LEU	40 (E1)	9	THR	522 (E2)	9
	PRO	47 (E1)	9	GLN	72 (E1)	9
	ILE	60 (E1)	7	LYS	4 (E1)	9
	GLU	61 (E1)	9	ALA	2 (E1)	0
	THR	210 (A)	1	GLN	351 (A)	0
	ALA	260 (A)	9	GLU	304 (A)	9
	ALA	260 (A)	9	ILE	305 (A)	0
	ALA	260 (A)	9	GLY	306 (A)	0
	VAL	264 (A)	9	GLY	306 (A)	0
	ALA	384 (I2)	9	TYR	506 (E2)	0
	ALA	384 (I2)	9	SER	509 (E2)	9
	THR	385 (I2)	9	TYR	506 (E2)	0
	THR	385 (I2)	9	SER	509 (E2)	9

Conserved residues are highlighted in boldface. The domains are shown in parentheses. The equatorial domain residues are Met1-Pro137 (E1) and Val411-Pro525 (E2); intermediate domain residues are Cys138-Gly192 (I1) and Val376-Gly419(I2); and the apical domain (A) consists of a contiguous segment Met193-Gly375.

The scores are based ConSurf Server [72],[73] calculation for GroEL sequence.

Redistribution of inter-residue contacts at the transition from R to R″ state.

Panel (A) shows the inter-residue contacts between adjacent subunits of the cis ring, which break up during the transition. The two subunits are colored in light pink and blue. Contact pairs are represented by spheres at the Cα position, and by distinctive colors. Similarly, panel (B) shows the contacts newly formed during the transition. See Table 7 for the complete list of residue pairs shown here. The contacts involving a pair of conserved residues are labeled using the same colors as the corresponding residues. Conserved residues are highlighted in boldface. The domains are shown in parentheses. The equatorial domain residues are Met1-Pro137 (E1) and Val411-Pro525 (E2); intermediate domain residues are Cys138-Gly192 (I1) and Val376-Gly419(I2); and the apical domain (A) consists of a contiguous segment Met193-Gly375. The scores are based ConSurf Server [72],[73] calculation for GroEL sequence. Broken pairs distinguished by their high conservation are Asp41-Met520 and Gly45-Gln72; and those conserved contacts made at the PTS are Ala26-Cys519, Val29-Glu518, Arg36-Thr517, Asn37-Cys519, Leu40-Thr522, Pro47-Gln72, Ala260-Glu304, Ala384-Ser509 and Thr385-Ser509 (Figure 10). Notably, some of these residues have been already reported to be functional. The mutation Gly45Glu, for example, causes defective release of partial folded polypeptide without reducing ATPase or GroES binding [74],[75]. Val264Ser reduces GroES and polypeptide binding affinity [74]; Ala260 is involved in binding and unfolding rhodanese, as shown in MD simulations [76]. Mutants Arg36His, Glu76Lys, Val264Ile, Thr385Ile, Met520Ile and Thr522Ile give rise to active single-ring GroEL [77]. Finally, we note that all four residues, Val39, Asp41, Gly45 and Ala46, listed in Table 7 to break at the energy barrier, belong to the segment Val38-Ile49, pointed out in our previous study [16] to play a key role in establishing the intra-ring allosteric communication.

Conclusion

GroEL is a molecular machine that has been broadly studied in recent years using both experimental and theoretical or computational methods. Yet, a structure-based analysis of the transition of the intact chaperonin between its functional forms has been held back by the large size of the chaperonin. The aANM method is proposed as a first approximation toward approaching this challenging task. The conformational transition results from the competition of two counter effects at a minimum, one intramolecular, and the other intermolecular: (1) any deformation away from the original equilibrium states involves an uphill motion along the energy surface, but the lowest frequency motions incur the lowest ascent in energy (intramolecular); (2) longer pathways experience higher frictional resistance from the surroundings, such that the shortest pathway may be favored (intermolecular) at the expense of internal strains. In the aANM model, one can balance these two counter effects by adopting different threshold correlation cosine (F), for different proteins in various conditions. Higher F values emphasize the effect (2), with the upper limit F = 1 restricting the pathway to a pure interpolation. Lower F values on the other hand allow for occasional departures from the targeted direction, with the limit F = 0 tolerating steps orthogonal to the target direction. Central to the development of this methodology were approximations and parameters that needed to be critically tested, hence our extensive examination of the transitions of a GroEL subunit and comparison with the results from other methods, including SDP (action minimization), MD simulations [11] and experimental studies. Multiple pathways were explored to this aim by varying the parameters F n and f, which control the respective direction and size of the aANM steps. It is conceivable that multiple routes are accessible between the two endpoints, given the multidimensional character of the energy landscape. For example, broad transition state ensembles have been observed by Hyeon, Lorimer and Thirumalai in their simulations of the GroEL allosteric transitions using a self-organized polymer model [18]. However, one or more are more probable, and aANM method by definition aims at sampling such probable (structurally favored) paths. Of interest was to identify the common features, if any, of the paths sampled with different parameters, and to identify among them those most closely agreeing with the data from other methods and experiments. Notably, the generated pathways invariably exhibited the following common features, observed for a single subunit, and confirmed in the intact chaperonin: The recruitment of low frequency modes near the original state, succeeded by the recruitment of higher frequency modes near the barrier (see for example Table 1 and Figures 3, 5A–C, and 6B), The dominant role of a few well-defined low frequency modes for achieving substantial (at least halfway) displacement in the ‘functional’ direction (Tables 2 and 4 and Figures S1 and S2), The close agreement between pathways and energy profiles generated with different parameters, observed over a large portion (x(n)≤0.4 and x(n)≥0.7) of the reaction coordinate, also in accord with SDP results (see for example Figures 4A and 4B, 5D, and S6), consistent with the predominance of well-defined modes at the initial stages of the structural changes, Barrier heights significantly lower than that incurred upon linear interpolation of coordinates between the two endpoints (Figures 3B, 5A, and S6), Protection of the native state inter-residue contacts throughout a surprisingly large portion of the transition pathway, and formation/breakage of native contacts, mostly at the E domains, only in the close neighborhood of the PTS (Figures 10 and S5 and Table 7). Having established these common features, a closer analysis of intermediate conformations, energetics and mode distributions in comparison to those from the rigorous SDP method supported the use of F = 0.5 as an optimal parameter for further calculations for the intact chaperonin. A discriminative feature that supported this parameter was the position of the energy barrier along the reaction coordinate, in accord with SDP. The close neighborhood of the energy barrier is indeed the region most sensitive to the choice of parameters. Notably, this region involves local rearrangements (RMSD<2 Å) comparable (or below) the resolution of examined structures. Therefore the results in this regime should be interpreted with caution. However, the set of ‘critical’ contacts identified for the R→R″ transition (listed in Table 7) have been verified to be robustly reproduced over a relatively broad range of aANM parameters (0.4salt-bridges observed here and those found by BD simulations with state-dependent energy functions and parameters [18], presented in Figures 8 and S4, and Table 6. The application of aANM to GroEL therefore elucidates not only highly probable pathways or the hierarchic contribution of modes to achieve the transition; but it also provides insights into key interactions that initiate the transition (e.g., formation of the K80-D359 salt-bridge), or those that form/break at the transition state(s). Such inter-subunit contacts disrupted and formed at the PTS are illustrated at the respective panels A and B of Figure 10. Notably, the majority of the residues involved in these critical interactions are highly conserved (Table 7), and some of them have been observed in previous site-directed mutagenesis experiments to affect GroEL machinery [74]–[77]. The importance of the other identified critical contacts to the functional transitions of the chaperonin remains to be further established by experiments. On a practical side, the major utility of the method lies in its application to the transitions of supramolecular systems beyond the range of exploration of other computational methods. The computing time in the present method is several orders of magnitude shorter than that required in regular MD or BD simulations. Here, sampling the transition pathway of the intact GroEL (of approximately 8,000 residues) takes less than 255 min CPU time (or about 33 hr real time) on a 2G Hz Linux server. The method has its own limitations, however. The modes predicted by the ANM are those exclusively based on inter-residue contact topology. No specific interactions are taken into consideration. So, the routes predicted here are those selected assuming that mechanical effects purely based on geometry dominate the dynamics. However, specific interactions may become more important as more localized changes are simulated, and these are usually manifested by changes in side chain reorientations, which are beyond the range of aANM calculations. We note in particular that there are well-defined ways to define transition rates through reactive flux theory [78], optimize reaction coordinates and estimate reaction rate coefficients (see for example [79]), which are not addressed in the present study. Finally, from a broader perspective, it is worth noting that this type of calculation is possible only to the extent that the conformational changes intrinsically favored by the overall architecture comply with those required to achieve the biological function (or the allosteric cycle, here). In principle, the configurations are defined in a 3N-dimensional space, defined by 3N-coordinates (provided that we adopt a coarse-grained description of N position vectors, one per residue). Here, we let the molecule move in the subspace of 5–6 coordinates only predicted by the ANM, and we can see that more than ½ of the trajectory along the conformational space is traversed by the molecule by simply moving along these soft coordinates. High frequency modes need not be recruited until the energy barrier is approached. The conformational changes are therefore effectuated to a large extent through low frequency modes, which are the least expensive (from energetic point of view), or the most favorable (from entropic point of view) modes among a multitude (3N-6 of them) of possible modes/directions of reconfiguration. This brings up the issue of a possible evolutionary selection of 3-dimensional structures (or inter-residue contact topologies) whose energy landscape is most conducive to functional changes in conformation. The selection of a conformational changes entropically favored by the structure, in order to achieve biological function, appears to be a common property of proteins, as also evidenced for ubiquitin in a recent study [80].

Materials and Methods

Materials

The following GroEL structures deposited in the Protein Data Bank (PDB) [81] have been considered in the present study (Figure 1): (A) 1GR5; the apo-form of GroEL, representative of the T/T state. (B) 1GRU; identified with the R″/R state; it contains seven ADPs and seven ATPs bound to the cis and trans rings, respectively. All these structures were determined by cryo-electron microscopy by Saibil and coworkers [15],[68],[82]. The structures being compared (the end points and intermediates at each step) were optimally superimposed at their α-carbons using Kabsch method [83] prior to implementing the aANM method.

aANM method and parameters

To explore the transition pathway between the initial and final conformations of subunit A, a series of intermediate conformations has been generated upon successive deformations of both end structures that were regularly updated. The directions of deformations were determined by implementing the deformations along the directions of dominant ANM modes accessible to the intermediate states. The contribution of the i normal mode to the mean-square fluctuations of residues scales with the inverse eigenvalue of the Hessian characteristic of the energy landscape near the equilibrium state; and the eigenvalues, in turn, scale with the squared frequency of the normal modes, hence the dominant role of low frequency modes in shaping the equilibrium dynamics [42]. The aANM scheme involves two parameters, F and f, which, in turn, determine m (, m (, s ( and s (. The dependence of m ( and m ( on F has been described in the Theory. As to s ( and s (, these are determined as follows: first we evaluate the values s ( and s ( that minimize |d (|. Note that there is a unique combination of these parameters (or the sizes || and || instantaneous displacements) which leads to the minimal |d (|. However, the step sizes that meet this criterion may be too large to be extrapolated using a quadratic approximation away from the energy minimum, and the energy cost of the induced deformations may be excessively high and not accurately accounted for by the harmonic approximation. Instead, we take a fraction f of the calculated values, i.e., we implement displacements that scale with fs ( and fs (. Calculations repeated with different f values support the use of relatively small values for two reasons: First, the smaller steps avoid unphysical distortions in the structure (see Supporting Information); second, if a larger displacement along a given mode direction is more productive, the same mode is selected in the succeeding iteration (see for example how modes 1, 3, 4 etc. are selected multiple times at the initial steps of the trajectories in Table 1.) Therefore, in principle, we would like to take steps that are as small as possible. However, the simulations become increasingly time-consuming (due to larger number of iterations) with decreasing step size. Our calculations suggest that the value f = 0.2 leads to sufficiently small step sizes along with computationally tractable iterations. Finally, instead of picking a fixed F throughout the entire trajectories, F can be chosen dynamically, (for k>1). In the first iteration, only the first dominant mode is used to deform the system. The results with this choice of dynamic F ( were found to yield results comparable to F = 0.4−0.5 and SDP results (see Figure S6). The use of this empirical relationship has the advantage of eliminating the variable F from the aANM algorithm.

Potential energy function and parameters

We adopt a double minima quadratic energy function [49] Here the parameter β sets the height (and smoothness) of the energy barrier, and the potentials U and U are defined in the ANM in terms of a uniform force constant γ as [52] Here | is the instantaneous distance between residues i and j (based on α-carbons), | is the equilibrium distance in conformation A, and the summations are performed over all pairs of residues located within a cutoff distance r. The aANM trajectories were generated using Eq. (8) in each iteration by assuming that and are the final conformations of the previous iteration. Eq. (7) has been used for evaluating the energy of the generated conformations and for performing SDP calculations. We adopted the value β = 10 kcal/mol and verified that changes in β within one order of magnitude do not affect the results. Nevertheless, the energies reported below provide information on the shape, rather than absolute values, of the energy profile. The force constant is taken as 0.7 kcal/(mol·Å2), and the cut-off distance as r = 13.0 Å consistent with previous assessment of optimal ANM parameters [55].

Steepest descent path (SDP)

The SDP between the conformations and is calculated by numerical minimization of the integral The product , therefore , where is potential energy difference in the i th monotonic part of the trajectory. The sum is minimal if the molecule undergoes a transition from to through the lowest energy barrier; moreover, the terms in the last inequality are equal if and only if the trajectory proceeds parallel to the potential gradient. Therefore SDP is the unique minimum of the boundary value integral in Eq. (9). To find a path that minimizes S (i.e., the SDP), we search for the global minimum of a discrete approximationsubject to the constraint that successive 's are equally spaced along the trajectory. The critical advantage of this algorithm is that it remains stable and calculates qualitatively reasonable trajectories even when the distances between 's are large. The minimization of is accomplished by simulated annealing. For further details see [29] and [63]. Contribution of low frequency modes to the conformational changes undergone by a single subunit during the cycle T→R→R″→T. (0.84 MB DOC) Click here for additional data file. Contribution of low frequency modes to the conformational changes undergone by the intact chaperonin during the cycle T/T→R/T, R/T→R′/T and R″/R→T/R. (0.79 MB DOC) Click here for additional data file. Distortion in backbone bonds during conformational transitions. (0.32 MB DOC) Click here for additional data file. Time evolution of salt bridges. (0.78 MB DOC) Click here for additional data file. Robustness of the broken/formed native contacts near the transition point. (0.41 MB DOC) Click here for additional data file. Comparison with the paths predicted by MinActionPath, SDP and Interpolation. (0.10 MB DOC) Click here for additional data file. Distortion in backbone bonds during conformational transitions. (0.03 MB DOC) Click here for additional data file. Relationship between cumulative correlation cosine and the angle formed by ( and combined eigenvectors (derivation of Eq.(6)). (0.10 MB DOC) Click here for additional data file.

68 in total

1. The Protein Data Bank.

Authors: H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal: Nucleic Acids Res Date: 2000-01-01 Impact factor: 16.971

2. ATP-bound states of GroEL captured by cryo-electron microscopy.

Authors: N A Ranson; G W Farr; A M Roseman; B Gowen; W A Fenton; A L Horwich; H R Saibil
Journal: Cell Date: 2001-12-28 Impact factor: 41.582

Review 3. Chaperonin-mediated protein folding.

Authors: D Thirumalai; G H Lorimer
Journal: Annu Rev Biophys Biomol Struct Date: 2001

4. Phi value analysis of heterogeneity in pathways of allosteric transitions: Evidence for parallel pathways of ATP-induced conformational changes in a GroEL ring.

Authors: Amnon Horovitz; Amnon Amir; Oded Danziger; Galit Kafri
Journal: Proc Natl Acad Sci U S A Date: 2002-10-18 Impact factor: 11.205

5. Reaction coordinates and rates from transition paths.

Authors: Robert B Best; Gerhard Hummer
Journal: Proc Natl Acad Sci U S A Date: 2005-04-06 Impact factor: 11.205

6. Can conformational change be described by only a few normal modes?

Authors: Paula Petrone; Vijay S Pande
Journal: Biophys J Date: 2005-12-16 Impact factor: 4.033

7. Functional modes of proteins are among the most robust.

Authors: S Nicolay; Y-H Sanejouand
Journal: Phys Rev Lett Date: 2006-02-24 Impact factor: 9.161

8. From A to B in free energy space.

Authors: Davide Branduardi; Francesco Luigi Gervasio; Michele Parrinello
Journal: J Chem Phys Date: 2007-02-07 Impact factor: 3.488

9. ConSurf 2005: the projection of evolutionary conservation scores of residues on protein structures.

Authors: Meytal Landau; Itay Mayrose; Yossi Rosenberg; Fabian Glaser; Eric Martz; Tal Pupko; Nir Ben-Tal
Journal: Nucleic Acids Res Date: 2005-07-01 Impact factor: 16.971

10. Signal propagation in proteins and relation to equilibrium fluctuations.

Authors: Chakra Chennubhotla; Ivet Bahar
Journal: PLoS Comput Biol Date: 2007-09 Impact factor: 4.475

52 in total

1. Mechanism of cohesin loading onto chromosomes: a conformational dynamics study.

Authors: Ozge Kurkcuoglu; Paul A Bates
Journal: Biophys J Date: 2010-08-09 Impact factor: 4.033

2. Dynamic Transmission of Protein Allostery without Structural Change: Spatial Pathways or Global Modes?

Authors: Tom C B McLeish; Martin J Cann; Thomas L Rodgers
Journal: Biophys J Date: 2015-08-31 Impact factor: 4.033

3. Global motions exhibited by proteins in micro- to milliseconds simulations concur with anisotropic network model predictions.

Authors: M Gur; E Zomot; I Bahar
Journal: J Chem Phys Date: 2013-09-28 Impact factor: 3.488

Review 4. Normal mode analysis of biomolecular structures: functional mechanisms of membrane proteins.

Authors: Ivet Bahar; Timothy R Lezon; Ahmet Bakan; Indira H Shrivastava
Journal: Chem Rev Date: 2010-03-10 Impact factor: 60.622

5. The "weighted ensemble" path sampling method is statistically exact for a broad class of stochastic processes and binning procedures.

Authors: Bin W Zhang; David Jasnow; Daniel M Zuckerman
Journal: J Chem Phys Date: 2010-02-07 Impact factor: 3.488

Review 6. Pre-existing soft modes of motion uniquely defined by native contact topology facilitate ligand binding to proteins.

Authors: Lidio Meireles; Mert Gur; Ahmet Bakan; Ivet Bahar
Journal: Protein Sci Date: 2011-09-09 Impact factor: 6.725