Chun-Lai Ren1, Yue Shan1, Pengfei Zhang2, Hong-Ming Ding3, Yu-Qiang Ma1. 1. National Laboratory of Solid State Microstructures and Department of Physics, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China. 2. State Key Laboratory for Modification of Chemical Fibers and Polymer Materials, Center for Advanced Low-Dimension Materials, College of Material Science and Engineering, Donghua University, Shanghai 201620, China. 3. Center for Soft Condensed Matter Physics and Interdisciplinary Research, School of Physical Science and Technology, Soochow University, Suzhou 215006, China.
Abstract
Recent studies reported that adenosine triphosphate (ATP) could inhibit and enhance the phase separation in prion-like proteins. The molecular mechanism underlying such a puzzling phenomenon remains elusive. Here, taking the fused in sarcoma (FUS) solution as an example, we comprehensively reveal the underlying mechanism by which ATP regulates phase separation by combining the semiempirical quantum mechanical method, mean-field theory, and molecular simulation. At the microscopic level, ATP acts as a bivalent or trivalent binder; at the macroscopic level, the reentrant phase separation occurs in dilute FUS solutions, resulting from the ATP concentration-dependent binding ability under different conditions. The ATP concentration for dissolving the protein condensates is about 10 mM, agreeing with experimental results. Furthermore, from a dynamic point of view, the effect of ATP on phase separation is also nonmonotonic. This work provides a clear physical description of the microscopic interaction and macroscopic phase diagram of the ATP-modulated phase separation.
Recent studies reported that adenosine triphosphate (ATP) could inhibit and enhance the phase separation in prion-like proteins. The molecular mechanism underlying such a puzzling phenomenon remains elusive. Here, taking the fused in sarcoma (FUS) solution as an example, we comprehensively reveal the underlying mechanism by which ATP regulates phase separation by combining the semiempirical quantum mechanical method, mean-field theory, and molecular simulation. At the microscopic level, ATP acts as a bivalent or trivalent binder; at the macroscopic level, the reentrant phase separation occurs in dilute FUS solutions, resulting from the ATP concentration-dependent binding ability under different conditions. The ATP concentration for dissolving the protein condensates is about 10 mM, agreeing with experimental results. Furthermore, from a dynamic point of view, the effect of ATP on phase separation is also nonmonotonic. This work provides a clear physical description of the microscopic interaction and macroscopic phase diagram of the ATP-modulated phase separation.
Recent discoveries have shown that phase separation plays an important role in many intracellular biological processes (–), e.g., the formation of membrane-less compartments like stress granules in the cytoplasm (), Cajal bodies (), and nucleoli () in the nucleus. Phase separation results in the compartmentalization of biomolecules into concentrated liquid droplets (with a typical size in micrometers) via weak specific interactions at the molecular level (–). Although their organizations are typically highly dynamic and accompanied by the rapid exchange of components with their surroundings, their overall size and shape are usually stable for minutes or hours (, ).The development of both experiment and theory (–) has revealed the possible mechanisms of phase separation. One central principle is multivalence, which can drive the formation of condensate via phase separation coupled to percolation (–). Specifically, multivalent interaction is facilitated by the intrinsically disordered regions (IDRs) in the proteins (–). As one specific class of IDRs, the prion-like domain (PLD) is frequently found in RNA binding proteins, e.g., the fused in sarcoma (FUS) protein. It was found that the molecular driving forces for phase separation in FUS were cooperatively weak interactions among tyrosine (Tyr) residues from the PLD and arginine (Arg) residues from the RNA binding domain (); the valence of the multivalent interactions was a key factor determining the saturation concentration in FUS solutions.Adenosine triphosphate (ATP) is known as an energy fuel for biological reactions typically at the concentration of micromolar (–). In cells, unexpectedly, the ATP concentration is in the range of 2 to 12 mM, which is much higher than the amount required for the energy fuel. Previous studies showed that the high concentration of ATP was related to the dissolution of protein condensates (), in which ATP was explained as a biological hydrotrope (–). In chemistry, hydrotropes are small-molecule amphiphiles that contain distinct hydrophilic and hydrophobic parts (the aromatic ring/rings in most cases) and can help solubilize the hydrophobic substances in aqueous systems (). Likewise, ATP consists of hydrophobic aromatic pyrimidine base ring and hydrophilic triphosphate moiety, fitting well into the concept of a hydrotrope.However, later works found that like RNA (), single-stranded DNA (), and other molecules (), ATP had two-stage effects on phase separation of prion-like proteins: an enhancement of phase separation at low concentrations and an inhibition of phase separation at high concentrations (, –). The mechanism of these dual effects of ATP is still largely unknown. It is worth noting that such reentrant phase separations are quite general in biomacromolecules, which were observed in many multicomponent biosystems (–). There is emerging evidence that abnormal protein aggregation is associated with many human diseases, including cancer, neurodegeneration, and infectious disease (, ). Therefore, it is of fundamental and physiological significance to uncover the influence of ATP on protein condensation or solubilization.In this work, we combine the semiempirical quantum mechanical (SQM) method, mean-field theory, and coarse-grained molecular dynamics (CGMD) simulation to reveal the mechanism underlying the dual effect of ATP on phase separation. We first use the SQM method to uncover the Tyr-Arg and ATP-Arg bindings at the atomic level, which are taken as the input for the mean-field theory. We then develop a mean-field theory by explicitly taking molecular interactions into account based on a sticker-and-spacer model and provide phase diagrams with full binodals at the macroscopic level, which can be directly compared with the experimental phenomena. Furthermore, we use the CGMD simulation to investigate the phase separation from the dynamic perspective. These three methods complement each other and can build a direct relationship between the microscopic driving force and macroscopic phase separation, offering a better understanding of ATP-modulated phase separation.
RESULTS
The relationship between Tyr-Arg binding and phase separation in FUS solution
Multivalent interactions among Tyr and Arg residues are considered as driving forces for the phase separation in FUS solution (). However, the direct relationship between the multivalent interactions and phase diagrams is still lacking. To achieve a quantitative description, we first used the SQM method to calculate the binding free energy of the Tyr-Arg interaction (more details on the SQM method are given in Materials and Methods). Figure 1 (A and B) shows two typical binding modes of the Tyr-Arg complex, where the ending charged group in Arg may be either perpendicular or parallel to the benzene plane in Tyr. Notably, the binding free energy is about −2.0 kBT in both cases, indicating that the Tyr-Arg binding is a weak interaction. This raises the question of whether Tyr-Arg binding could be sufficient to cause the phase separation in FUS solutions.
Fig. 1.
Tyr-Arg binding resulting in the phase separation in FUS solution.
Snapshots for the two typical binding modes between one Tyr and one Arg obtained by the SQM method (A and B), where ϵ1 denotes the binding free energy in each case. Phase diagram and the fraction of Tyr-Arg binding calculated from the mean-field theory (C and D). The black curve represents the diluted phase, and the red one represents the dense phase. Dash lines are tie-lines relating two coexisting phases. The blue dots at ϵ1 = −2 kBT are taken as the average binding free energy between Tyr and Arg for the following theoretical calculations. (E) Schematic representation of the formation of FUS protein droplets caused by Tyr-Arg binding.
Tyr-Arg binding resulting in the phase separation in FUS solution.
Snapshots for the two typical binding modes between one Tyr and one Arg obtained by the SQM method (A and B), where ϵ1 denotes the binding free energy in each case. Phase diagram and the fraction of Tyr-Arg binding calculated from the mean-field theory (C and D). The black curve represents the diluted phase, and the red one represents the dense phase. Dash lines are tie-lines relating two coexisting phases. The blue dots at ϵ1 = −2 kBT are taken as the average binding free energy between Tyr and Arg for the following theoretical calculations. (E) Schematic representation of the formation of FUS protein droplets caused by Tyr-Arg binding.To answer this question, we developed a mean-field theory for FUS solutions based on a sticker-and-spacer model. This model was first proposed by Semenov and Rubinstein () to treat the gelation and phase separation of thermo-responsive associative polymer solutions, following the pioneering works by Tanaka (, ) for similar systems. Recently, this model has been successfully used to reflect the phase separation in multivalent proteins (, , ). It should be mentioned that two order parameters, i.e., the volume fraction of protein and the fraction of Tyr-Arg bonds, self-consistently appear in the free energy, which is widely used to reflect the concentration and structure in the molecular liquid (, ). To better compare the theoretical and experimental results, we convert the calculated volume fraction into the concentration (see details of the theoretical modeling in Materials and Methods) in the following discussion. Figure 1C shows the phase diagram (i.e., the binodal curve) of phase separation in the FUS solution, depending on the FUS concentration (ρ) and the Tyr-Arg binding free energy (ϵ1). The details of the calculation for the phase diagram are given in the Supplementary Materials. When ϵ1 is lower than −1.4 kBT, the phase separation could be observed in the FUS solution, meaning that the Tyr-Arg binding is strong enough to drive phase separation due to the cooperation of multiple Tyr-Arg bindings. At ϵ1 = −2.0 kBT, the concentration of the condensed phase is two orders of magnitude higher than that of the dilute phase, which is consistent with the characteristics of protein condensates (). Because the binding free energy of Tyr-Arg is weak, there may exist a reversible breaking and formation of the Tyr-Arg binding under the thermal fluctuation. The extent of Tyr-Arg binding can be reflected by the fraction of Tyr-Arg bonds p, namely, the ratio of the number of Tyr-Arg bonds to the number of Tyr residues in the solution. As shown in Fig. 1D, the value of p in the dense phase is larger than that in the diluted phase, indicating that the protein condensates are denser liquids with network structures in which Tyr-Arg bonds act as cross-linkers (Fig. 1E). Collectively, the above results demonstrate that different FUS concentrations, as well as distinct microscopic structures, occur in the two coexisting phases.
SQM methods on the microscopic interactions between ATP and Arg
We then investigated the effect of ATP on the phase separation in FUS solutions. With the addition of ATP, there may exist an interaction between ATP and FUS, in which the ATP-Arg binding is believed to play an important role (, –). However, the binding modes, possible binding numbers of Arg residues (to ATP), and binding strength are still lacking. To reveal the detailed microscopic mechanism of their interactions, the SQM method was again used. One study () assumed that ATP may act as a bivalent binder due to its amphiphilic nature, i.e., two Arg residues could bind to the triphosphate part and the adenine part, respectively. As shown in Fig. 2 (A and B), the Arg can bind to either the triphosphate part or the adenine part of ATP. Nevertheless, the binding strength is quite different in the two cases. Because the Arg carries one net positive charge and the triphosphate part of ATP is highly negatively charged, the binding free energy in the former case is much lower than that in the latter case (Fig. 2, C and D). Thus, the Arg prefers binding to the triphosphate part of the ATP. We also compared the binding preference in the case of multiple Arg residues. Similarly, these Arg residues are more likely to accumulate around the triphosphate part of the ATP because the average binding free energy is much lower. Notably, because of the steric effect, the average binding free energy increases with the increase of the Arg number. The average binding energy increases significantly, and one of the binding free energies becomes positive in the case of four Arg residues (Fig. 2C), indicating that the binding of Arg to ATP is no longer energy-favorable. Thus, the maximum binding number of Arg residues (to ATP) should be three; namely, ATP may act as a trivalent binder.
Fig. 2.
The SQM result for the ATP-Arg binding.
Snapshots for the typical binding modes between one ATP and one/several Arg residues: (A) The Arg residue(s) all bind to the triphosphate part of the ATP; (B) one of the Arg residue(s) binds to the adenine part of the ATP. The number of Arg residues is denoted as the valence of ATP, represented by L. (C and D) shows the corresponding binding free energy of the ATP-Arg interaction as a function of L in the case of (A) and (B), respectively.
The SQM result for the ATP-Arg binding.
Snapshots for the typical binding modes between one ATP and one/several Arg residues: (A) The Arg residue(s) all bind to the triphosphate part of the ATP; (B) one of the Arg residue(s) binds to the adenine part of the ATP. The number of Arg residues is denoted as the valence of ATP, represented by L. (C and D) shows the corresponding binding free energy of the ATP-Arg interaction as a function of L in the case of (A) and (B), respectively.We only considered a single amino acid (i.e., Arg) in the above calculation, which may be different from the real case when the Arg is in the protein chain. To this end, we further considered a tripeptide model [i.e., glycine (Gly)–Arg-Gly] and calculated the binding free energy between the tripeptide and the ATP. The use of Gly as the terminal of the tripeptide is due to the fact that Gly is the most frequently neighboring amino acid of the Arg in FUS (there are two Arg-Gly rich regions in FUS). Gly is a very small amino acid; thus, it could have little impact on the interaction between the Arg and the ATP. As shown in fig. S1, the binding free energy of the ATP-tripeptide interaction is similar to that of ATP-Arg interaction at L = 1, 2, and 3 (Fig. 2C), while some increase in the binding free energy is observed at L = 4, probably due to the steric effect. Generally, the neighboring amino acids may not obviously affect the binding strength and the binding valance of the Arg-ATP interaction in FUS.
Mean-field theory on the relationship between microscopic interactions and macroscopic phase behaviors
According to the above SQM results, the strength of ATP-Arg binding is stronger than that of Tyr-Arg binding, implying that the phase separation of FUS driven by Tyr-Arg binding could be affected by the addition of ATP. As such, to investigate the effect of ATP on the phase separation in FUS solutions, we further developed the mean-field theory by taking one-to-multiple binding between the ATP and Arg residues into account; this was inspired by the idea of dealing with specific interactions in polymer systems (see details of the theoretical modeling in Materials and Methods) (, , ).Figure 3A shows the phase diagram with critical points in the presence of ATP. The saturation concentration (i.e., the left part of the binodal curve separated by the critical point) of FUS shows a nonmonotonic manner with the addition of ATP when ATP serves as a multivalent binder (i.e., L = 2 or 3). When the amount of ATP is small, the saturation concentration decreases with the increase of ATP. However, the saturation concentration increases when the ATP concentration reaches the order of millimolar. As illustrated in Fig. 3B, ATP can bind to more than one Arg residues when the ATP concentration is low, which could be taken as an additional driving force for promoting the aggregation of proteins and causing phase separation in very dilute solutions. However, at a high ATP concentration, each ATP can only bind to one Arg because of the limited number of Arg residues, which leads to the breakage of Tyr-Arg binding and the inhibition of phase separation. This two-stage phenomenon is in good agreement with previous experiments on the effect of ATP on phase separation (, ): first, the enhancement and the subsequent inhibition with the increase of additives. On the contrary, if ATP is just a monovalent binder (i.e., L = 1), then the saturation concentration of FUS monotonically increases with the increase of ATP concentration. In other words, a higher concentration of protein solution is required for the phase separation in the presence of ATP, indicating that the ATP only inhibited phase separation. Therefore, a multivalent binder acted on by ATP is a prerequisite of the two-stage phenomenon.
Fig. 3.
The reentrant phase separation and protein dissolution in mean-field theory.
(A) Phase diagram of FUS solutions as a function of ATP concentration (c) for three different ATP valences L. The parameter βϵ2 at L = 1, 2, and 3 is set based on the average binding free energy in Fig. 2C. Dashed lines are tie-lines relating two coexisting phases. Dots are critical points of each cases. The dotted line with diamond symbols (i) to (iii) indicates the reentrant phase transition with the increase of ATP concentration. (B) Schematic representations for (i) to (iii) indicate the reentrant phase transition occurring in dilute solutions for L = 2. (C) The concentration of protein condensates (ρ2), (D) the extent of Tyr-Arg binding (p2), and (E) the average number of bound Arg residues per ATP (g2) in protein condensates as functions of the ATP concentration.
The reentrant phase separation and protein dissolution in mean-field theory.
(A) Phase diagram of FUS solutions as a function of ATP concentration (c) for three different ATP valences L. The parameter βϵ2 at L = 1, 2, and 3 is set based on the average binding free energy in Fig. 2C. Dashed lines are tie-lines relating two coexisting phases. Dots are critical points of each cases. The dotted line with diamond symbols (i) to (iii) indicates the reentrant phase transition with the increase of ATP concentration. (B) Schematic representations for (i) to (iii) indicate the reentrant phase transition occurring in dilute solutions for L = 2. (C) The concentration of protein condensates (ρ2), (D) the extent of Tyr-Arg binding (p2), and (E) the average number of bound Arg residues per ATP (g2) in protein condensates as functions of the ATP concentration.It was reported that the dissolution of protein droplets can be observed when the ATP concentration reached its physiological concentration (). As shown in Fig. 3C, the protein condensates can completely disappear when the ATP concentration is about 10 mM for L = 2 and 15 mM for L = 3, which agrees with experimental observations (∼8 to 12 mM) by Patel et al. (). In comparison, the protein condensates dissolve when ATP concentration reached 0.001 mM under the condition of L = 1. These results indicate that the multivalence of ATP may be an important reason why the physiological concentration of ATP is in the order of millimolar. With the increase in ATP, the concentration of the protein condensate ρ2 first increases and then decreases in the case of L = 3. This means that the addition of a small amount of ATP not only can stabilize the protein droplet but also can make the droplet denser when the ATP has a high valence (L = 3). Moreover, as shown in Fig. 3 (D and E), both the extent of Tyr-Arg binding and the average number of bound Arg residues per ATP show a similar way of changing with the addition of ATP in the case of L = 2 or 3, which is quite different from that in the case of L = 1. In the latter case, one molecule of ATP always links to one Arg, which strongly decreases the Tyr-Arg binding and shows the inhibition of phase separation with ATP concentration much lower than the physiological concentration. This indicates that the changeable binding ability of ATP (to Arg) is crucial for dissolving the protein droplets at a millimolar ATP concentration.
CGMD simulation on the effect of ATP on the phase separation from the dynamic view
We also used the CGMD simulation (see details of the CG modeling in Materials and Methods) to investigate the effect of ATP on the phase separation. As shown in fig. S2 (A and B), the reentrant phase separation in the diluted solution is also observed once the ATP can bind to more than one Arg residues. Moreover, a large number of ATP molecules can dissolve the condensates at a high concentration of protein solution (fig. S2, C and D). Protein condensates are dynamic, having the characteristic of the exchange of protein molecules inside and outside the condensate. As shown in movies S1 to S3, the phase separation is in a dynamic equilibrium and the protein chains in the surrounding continuously exchange with protein chains in the droplet. In more detail, there are many exchanged chains in the absence of ATP (Fig. 4A). With the addition of a small amount of ATP, because the protein condensate becomes denser, the number of chains participating in the exchange decreases (Fig. 4B). With the further addition of ATP, the protein condensate becomes looser and the molecular exchange is promoted again (Fig. 4C). The quantitative data of the total number of exchanged chains in a given time can be found in fig. S3. Generally, from the dynamic point of view, the influence of ATP on phase behavior is also nonmonotonic.
Fig. 4.
Molecular exchange of protein condensates.
Time sequence of snapshots illustrating the exchange of protein chains inside and outside the protein condensates in CGMD simulations: (A) nATP = 0, (B) nATP = 30, and (C) nATP = 70. The protein chains participating in the molecular exchange are shown in cyan, and others are described as pink chains. The yellow and green beads represent Arg and Tyr residues, respectively, and the purple beads stands for the ATP molecules. The total number of protein chains is npro = 50 in the simulations.
Molecular exchange of protein condensates.
Time sequence of snapshots illustrating the exchange of protein chains inside and outside the protein condensates in CGMD simulations: (A) nATP = 0, (B) nATP = 30, and (C) nATP = 70. The protein chains participating in the molecular exchange are shown in cyan, and others are described as pink chains. The yellow and green beads represent Arg and Tyr residues, respectively, and the purple beads stands for the ATP molecules. The total number of protein chains is npro = 50 in the simulations.
Predictions on phase separation induced by ATP for prion-like proteins with asymmetric numbers of Tyr and Arg residues
After revealing the nonmonotonic effect of the ATP on the phase separation in the FUS solution, we further explored a more general case in prion-like proteins, in which the number of Tyr and Arg residues is not necessarily equal (). To denote such a difference, here, for the sake of simplicity, we only considered two cases: the number of Tyr residues being much larger than (m1 = 34, m2 = 11) or smaller than (m1 = 11, m2 = 34) that of Arg residues. Because of the asymmetric number of Tyr and Arg residues, the phase separation is hardly observed in the pure protein solutions. With the addition of ATP, the phase separation could be observed as two looped phase diagrams (see Fig. 5A). However, the area of the loop is quite different; namely, the area in the case of m1 < m2 is much larger than that in the case of m1 > m2. This difference mainly originated from the ability of ATP binding to Arg; in other words, the average number of bound Arg residues per ATP in the case of m1 < m2 is much larger than that in the case of m1 > m2 (Fig. 5B). Furthermore, with a small amount of ATP, the Tyr-Arg binding is enhanced in the case of m1 < m2, which promotes phase separation (see the inset of Fig. 5B). The above theoretical results are also verified by the CGMD simulations (fig. S4). Generally, the phase separation could also be modulated by the ATP in the protein systems with asymmetrical numbers of Tyr and Arg residues, but the sensitivity of the phase separation (to the ATP concentration) is highly related to the number of Arg residues.
Fig. 5.
The effect of ATP on other prion-like proteins.
(A) Phase diagram of prion-like proteins with asymmetric numbers of Tyr and Arg residues (m1 > m2 and m1 < m2) with the addition of ATP for L = 3. (B) The average number of bound Arg residues per ATP (g2) and the extent of Tyr-Arg binding (p2) (inset) in protein condensates as functions of ATP concentration.
The effect of ATP on other prion-like proteins.
(A) Phase diagram of prion-like proteins with asymmetric numbers of Tyr and Arg residues (m1 > m2 and m1 < m2) with the addition of ATP for L = 3. (B) The average number of bound Arg residues per ATP (g2) and the extent of Tyr-Arg binding (p2) (inset) in protein condensates as functions of ATP concentration.
DISCUSSION
Why the ATP concentration in cells is so high has long been a puzzling problem. When Patel et al. () found that ATP can work as a biological hydrotrope in 2017, researchers began to pay special attention to the effect of ATP on the dissolution of protein condensates (). Later, the dual roles of ATP on phase separation were reported (, –), which again put forward the question of how ATP works.In this work, combining the effort of the SQM method, mean-field theory, and molecular simulation, we perform a systematic study on the modulation of FUS condensates aroused by ATP. The SQM method shows that Arg prefers binding to the triphosphate part of the ATP, with the binding free energy in the order of kBT and a maximum binding number of three. Taking these microscopic features as the input, we develop a mean-field theory to provide macroscopic phase diagrams based on the thermodynamic criterion; this demonstrates that the nonmonotonic modulation largely originates from the distinct number of binding Arg residues (to ATP) at different ATP concentrations. Furthermore, we use the CGMD simulation to demonstrate that the dynamic molecular exchange between the condensate and the surrounding also shows a nonmonotonic behavior as the ATP concentration increases.It should be mentioned that the effects of ligands on the phase separation of proteins may be also the opposite (i.e., enhancement or inhibition), which can be well described within the framework of polyphasic linkage first developed by Wyman and Gill (). Recently, Ruff et al. (, ) reported that the strength, valence, structure, and concentration of the ligand can generate important and subtle effects on the phase behaviors of multivalent macromolecules with sticker-and-spacer architectures. Notably, they were the first to show that monovalent ligands destabilize phase separation driven by homotypic interactions, and the nonmonotonic behavior of ligands as a function of the ligand concentration was also anticipated and demonstrated (, ). Very recently, Dao et al. () found that the type of the polyubiquitin chain could also play distinct roles in the phase separation of UBQLN2, which further bolsters the general conceptual foundations of the findings from Ruff et al. These results share some similarity with the dual effect of ATP (on the phase separation) reported here, which provides a key conceptual lynchpin for our current work.In general, this work well explains the nonmonotonic effect of ATP on the phase separation. This sheds some lights on fine-tuning the phase behaviors of prion-like proteins with additional molecules and may have implications for devastating neurodegenerative disorders.
MATERIALS AND METHODS
The SQM method
Here, the latest SQM method proposed by Bannwarth et al. (), the GFN2-xTB method, is used to calculate the free energy of the Arg binding to the Tyr and/or the ATP. The accuracy of this SQM method in binding energy calculation is believed to be close to that of the density functional theory, but it has a much lower computing cost (). The binding free energy is calculated as ΔG = Gcom − GA − GB, where Gcom is the free energy of the complex, GA is the free energy of the Arg, and GB is the free energy of the Tyr or the ATP. The free energy G usually comes from three sources: the total molecular gas-phase energy, the corresponding free solvation energy, and the thermostatistical contribution to the free energy including the translation, rotation, vibration, and conformation degrees of freedom at a given temperature (, ). To obtain the representative configuration, we first used the Molclus program () to search the possible binding structures in each case. Then, we used the GFN2-xTB method to make a preliminary optimization of the searched structures. Next, we divided the optimized structure into a complex and two monomers and performed the Hessian calculation. All the calculations were performed under the implicit solvent model, namely, using the generalized Born (GB) model with surface area (SA) contributions [i.e., GBSA(H2O)] (). The SQM calculations were performed using the xtb program ().
Theoretical modeling of FUS solutions in the absence of ATP
In the theory, FUS proteins were modeled as linear polymers with a length of N1 = 526. Because FUS belongs to a class of biomolecules with limited aqueous solubility (), solvents were modeled as chains with lengths of N2, which is a simple way to reflect the limited solubility of biomolecules (, ). Each FUS protein contains some Tyr and Arg residues (). The number of Tyr or Arg residues is named as the valence of the multivalent interaction. According to experimental data from Wang et al. (), FUS has m1 Tyr residues and m2 Arg residues, with m1 = m2 = 34. The incompressible solution consists of n FUS proteins and n solvent molecules. The dimensionless free energy of the reference state βFref was written aswhere is the volume fraction of FUS. v is the volume of one residue of the protein, which is taken as the unit volume. V = n1v + n2v is the volume of system. Thus, the reference system is always homogeneous, which is also the system without including Tyr-Arg binding. Next, we took the specific binding between Tyr and Arg residues into account, assuming that each Tyr/Arg has only one binding site. According to previous works on associative polymers (, ), the free energy of Tyr-Arg binding can be written aswhere Zbinding is a partition function and is equal toHere, Pbinding is a combinatorial factor that describes the number of ways to form k Tyr-Arg bonds, which can be written asThe formation of each bond requires the Tyr and the Arg to be in close proximity within some volume scale v. For simplicity, we take v to simply be the residue volume v. The overall probability of forming k bonds is thenIn Eq. 3, βϵ1 is the free energy associated with the binding between Tyr and Arg, which includes the affinity between Tyr and Arg residues, as well as the entropic loss during the formation of the Tyr-Arg bond (). More specifically, the specific chemical structures of Tyr and Arg side chains appear to be important determinants of the formation of the Tyr-Arg bond (). Such a reduction of conformational entropy is a common character for specific interactions, such as hydrogen bonds and cation-π interactions. Substituting Eqs. 3 to 5 into Eq. 2, the dimensionless binding free energy can be written aswhere is the fraction of Tyr in the formation of Tyr-Arg bonds and reflects the extent of protein network cross-linked by the Tyr-Arg bonds. Thus, the total free energy of the FUS solution is Ftot = Fref + Fbinding, which includes the two order parameters ϕ and p. According to the thermodynamic criterion, phase diagrams can be obtained from the equality of chemical potential and the osmotic pressure of two coexisting phases (see the details in the Supplementary Materials). The two coexisting phases consist of a diluted phase and a condensed phase, with distinct volume fractions of ϕ1 and ϕ2, respectively. It should be mentioned that the volume fraction of the diluted phase (ϕ1) can be converted to the saturation concentration (ρ1) by considering that the hydration size of the amino acid is about 0.65 nm (). This can be then mapped with the experimental saturation concentration (i.e., 5 μM) () to determine the fitting parameter N2 = 65 in the theory.
Theoretical modeling of FUS solutions in the presence of ATP
The ATP is modeled as a small molecule with a volume of v (the unit volume), which is the same as one residue of FUS. The reference system is chosen as a FUS solution with freely moving ATP molecules. The dimensionless free energy of the reference state can be written aswhere is the ratio of the number of ATP molecules and the number of FUS residues. nATP is the number of ATP molecules in the solution. The first three terms are mixing the entropy for the three components of FUS, solvent, and ATP. The fourth term reflects the effective interaction between ATP and the solvent with χ = −1.2, because ATP can act as a hydration mediator due to its triphosphate with a unique hydration property (). The last term is introduced to change the system into a semiclosed system (), which is more convenient for calculating the phase diagram. This means that ATP molecules in protein solutions are free to exchange with a reservoir at a fixed chemical potential (μATP) determined by ATP volume fraction (φ), i.e., . The ATP volume fraction (φ) can be converted into the ATP concentration (c) by . Next, we take both the Tyr-Arg binding and ATP-Arg binding into account. The former is a one-to-one binding, and the latter may be one-to-multiple binding. L is used to represent the valence of ATP. According to previous works on one-to-multiple binding (, ), the free energy of the formation of different kinds of bonds isZbinding is the partition function and is given byHere, Pbinding is a combinatorial factor that describes the number of ways to form k Tyr-Arg bonds and b ATP-Arg bonds in the system with n FUS proteins and nATP ATP molecules. It can be written asW in Eq. 9 is a probability of finding (k + b) bonds in the vicinity of each other, which isIn Eq. 9, βϵ2 is the binding free energy aroused by the formation of one ATP-Arg bond. Substituting Eqs. 9 to 11 into Eq. 8, the associated dimensionless free energy is given bywhere is the fraction of ATP participating in ATP-Arg binding. Therefore, the total free energy becomes ℱtot = ℱref + ℱbinding. It can be seen that p and q are coupled in Eq. 12, indicating the competition between ATP-FUS and FUS-FUS interactions. After minimizing ℱtot with respect to p, q, and μ, the phase diagram can be obtained according to the thermodynamic criteria of the same chemical potential and osmotic pressure of two coexisting phases (see the details in the Supplementary Materials).
The CGMD simulation
A CGMD simulation was used to investigate the effect of ATP on the phase separation of the protein solution. The protein was modeled as a chain with connective beads in the implicit solvent (). These beads were connected by the harmonic spring with the potential U(r) = k(r − r)2, where r = 4.5 nm is the mean bond length and is the bond stiffness, where ϵ0 is the energy unit. There were three types of beads in the protein chain, namely, the Tyr beads, the Arg beads, and the noninteracting beads. Here, the percentage of the Tyr or Arg beads was 0.06, similar to that in the mean-field theory. To reflect the effective binding between the Tyr bead and the Arg bead, the attractive potential for r < r0 was used. We chose r0 = 2.0 nm and U0 = −20ϵ0 (). Moreover, a softened and truncated Lennard-Jones potential was adopted to avoid the aggregation of the same type of beads, which was given by when r < r. ϵ = 0.621ϵ0 and ϵ = 0.15ϵ0 were chosen for the Tyr bead and the Arg bead, respectively. Other parameters are fixed as λ = 0.68, σ = 3.5 nm, and r = 5 nm, which are similar to those used in the previous article (). The ATP was modeled as one bead in the simulations. The effective binding between the ATP bead and Arg bead was also modeled as the attraction potential . Here, r0 = 2.0 nm and U0 = − 75ϵ0 were chosen because the binding free energy of ATP-Arg was nearly four times as strong as that of Tyr-Arg according to the SQM result.The coarse-grained simulations were performed at the NVT ensemble with the Langevin thermostat at room temperature. The size of the simulation box was 50 nm by 50 nm by 250 nm, and the periodic boundary conditions were adopted in all three directions. To promote the system to equilibrium, the slab method was adopted in the simulation (). The time step in the equilibrium simulation was dt = 0.5 ns. The data were collected every 50 μs, with a total time of about 50 ms. All the simulations were performed using the LAMMPS software package (29 October 2020) ().
Authors: Jie Wang; Jeong-Mo Choi; Alex S Holehouse; Hyun O Lee; Xiaojie Zhang; Marcus Jahnel; Shovamayee Maharana; Régis Lemaitre; Andrei Pozniakovsky; David Drechsel; Ina Poser; Rohit V Pappu; Simon Alberti; Anthony A Hyman Journal: Cell Date: 2018-06-28 Impact factor: 41.582
Authors: Pilong Li; Sudeep Banjade; Hui-Chun Cheng; Soyeon Kim; Baoyu Chen; Liang Guo; Marc Llaguno; Javoris V Hollingsworth; David S King; Salman F Banani; Paul S Russo; Qiu-Xing Jiang; B Tracy Nixon; Michael K Rosen Journal: Nature Date: 2012-03-07 Impact factor: 49.962