Literature DB >> 28971853

Slow molecular recognition by RNA.

Kristin R Gleitsman1, Raghuvir N Sengupta1, Daniel Herschlag1,2,3.   

Abstract

Molecular recognition is central to biological processes, function, and specificity. Proteins associate with ligands with a wide range of association rate constants, with maximal values matching the theoretical limit set by the rate of diffusional collision. As less is known about RNA association, we compiled association rate constants for all RNA/ligand complexes that we could find in the literature. Like proteins, RNAs exhibit a wide range of association rate constants. However, the fastest RNA association rates are considerably slower than those of the fastest protein associations and fall well below the diffusional limit. The apparently general observation of slow association with RNAs has implications for evolution and for modern-day biology. Our compilation highlights a quantitative molecular property that can contribute to biological understanding and underscores our need to develop a deeper physical understanding of molecular recognition events.
© 2017 Gleitsman et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

Keywords:  RNA structure; binding kinetics; molecular recognition

Mesh:

Substances:

Year:  2017        PMID: 28971853      PMCID: PMC5688996          DOI: 10.1261/rna.062026.117

Source DB:  PubMed          Journal:  RNA        ISSN: 1355-8382            Impact factor:   4.942


INTRODUCTION

Molecular recognition underlies all of life. The individual molecules that determine the accuracy of what complexes form and what reactions occur are responsible for the fidelity of biological responses and ultimately for the cellular and organismal order that is emblematic of life. Nevertheless, the most accurate process is of no value to an organism if it cannot be carried out rapidly, to overcome stray chemical and physical processes and to outcompete other organisms. The rate of association between molecules sets a speed limit for metabolic functions and regulatory responses, with a theoretical maximum set by the rate of diffusional collision (see Eigen and Hammes 1963; Berg and von Hippel 1985; Fersht 1999; Schreiber et al. 2009 and below). Underscoring the importance of speed, nature has evolved clever mechanisms to overcome the physical limit of three-dimensional diffusion, for example, allowing proteins to slide in one dimension along DNA or hop between DNAs and harnessing electrostatic interactions to speed associations (von Hippel and Berg 1989; Halford and Marko 2004; Schreiber et al. 2009). On the other end of the spectrum, association rates can fall many orders of magnitude below diffusion control, and in some cases slow association can contribute to regulation and specificity (e.g., Wu et al. 2002; James and Tawfik 2005). Given the importance of molecular recognition and its speed, and the generally adopted model of an early RNA world that was later overcome by a protein-dominant world (Gilbert 1986; Cech 2009), we wondered whether there might be differences in molecular recognition between RNA and proteins at the level of association rates. Such differences could have contributed to the course of evolution and may impact current-day biology. On the molecular level, differences in association rates may reflect atomic-level properties critical in molecular recognition and may provide a unique window into understanding these processes.

RESULTS

To broadly assess the kinetics of association, we gathered all of the RNA/ligand association rate constants we could find in the literature (n = 64) (Supplemental Tables S1, S2). For comparison, we gathered a large number of protein/ligand association rate constants (n = 207) (Supplemental Table S3). We also examined a third category of association events, those between proteins and RNA molecules (n = 22) (Supplemental Table S4). While we expect that our RNA/ligand list is nearly complete, there are too many protein/ligand measurements to fully assemble (see Materials and Methods). Nevertheless, we expect our list to be reasonably representative and suitable for revealing trends and stimulating ideas and discussion about their physical origins and biological implications.

RNA and protein ligand association rate constants

Figure 1 shows rate constants for ligands binding to structured RNAs and to proteins (leftmost and middle columns, respectively). We expect a generic upper limit corresponding to diffusional collision at ∼109 M−1 sec−1 (Eigen and Hammes 1963; Berg and von Hippel 1985; Fersht 1999; Schreiber et al. 2009). For protein ligands, the values span a wide range of over 108-fold and approach and even exceed the diffusional limit, as electrostatic attraction between the protein and ligand can increase their rate of association (e.g., Lambeth et al. 1980; Wallis et al. 1995; Schreiber and Fersht 1996; Radic et al. 1997; Polticelli et al. 1998; and see Schreiber et al. 2009 and references therein). There is also a large range observed for RNA/ligand association rate constants. However, the highest values are distinctly lower than those for proteins, falling well below the diffusional limit (Fig. 1, compare left and middle columns).
FIGURE 1.

Rate constants for ligands binding to structured RNAs (left column) and to proteins (middle column) and for RNA•protein association (right column). Values are presented in Supplemental Tables S1–S3, and “N” is the number of examples in each column. The upper dark gray box represents a generic diffusion limit centered around 109 M−1 sec−1. In the box plot representations, the line inside the rectangle shows the median, the box spans the first quartile to the third quartile (representing the span from 0.25 to 0.75 of the ranked values), and the whiskers are shown as guides. The median values are as follows: RNA–ligand: 1.0 × 105 M−1 sec−1; Protein–ligand: 6.6 × 106 M−1 sec−1; and Protein–RNA: 6.3 × 106 M−1 sec−1.

Rate constants for ligands binding to structured RNAs (left column) and to proteins (middle column) and for RNA•protein association (right column). Values are presented in Supplemental Tables S1–S3, and “N” is the number of examples in each column. The upper dark gray box represents a generic diffusion limit centered around 109 M−1 sec−1. In the box plot representations, the line inside the rectangle shows the median, the box spans the first quartile to the third quartile (representing the span from 0.25 to 0.75 of the ranked values), and the whiskers are shown as guides. The median values are as follows: RNA–ligand: 1.0 × 105 M−1 sec−1; Protein–ligand: 6.6 × 106 M−1 sec−1; and Protein–RNA: 6.3 × 106 M−1 sec−1. As a crude test of whether there is some type of intrinsic limitation to the rate of associations involving RNA, we separately plotted rate constants for the association between RNA and proteins (Fig. 1, rightmost column). The values exceeded those for RNA with non-protein binding partners (Fig. 1, compare left and right columns) and reach the upper range observed for other protein/ligand associations (Fig. 1, compare middle and right columns), indicating that there is no inherent limitation to association involving RNA. To further probe association rates involving RNA we parsed its association rates (Fig. 2), first separating out RNA/RNA associations (“RNA ligands”) from those with non-RNA ligands (“Small molecule ligands”), and, for the RNA/RNA group, further demarcating subgroups that involve formation of base pairs only (“BP”), formation of base pairs and additional tertiary interactions (“BP + 3°”), and formation of non-Watson–Crick tertiary interactions only (“3°”). RNA binding to structured RNAs that involve base-pair formation is significantly faster than binding that does not (Fig. 2, compare “BP” and “3°”), although the number and diversity of RNAs binding via only tertiary contacts is small. Interestingly, the binding of structured RNAs via base-pairing occurs at rates similar to those for simple duplex formation between short RNA oligomers (Fig. 2, compare “BP” and “RNA duplex”; “RNA duplex” values are from Turner et al. 1990 and are listed in Supplemental Table S5), suggesting that there are no general constraints or accelerants to base-pairing in structured RNAs. Non-RNA ligands bind to structured RNAs with rate constants extending up to those for recognition via base-pair formation (Fig. 2, compare “Small molecule ligands” and “BP”), although on average the values are somewhat lower and a wide range of values is observed. Interestingly, the measured values for naturally occurring and in vitro–selected RNAs have similar means and span similar ranges (Fig. 2, compare “Natural” and “In vitro” columns above “Small molecule ligands”).
FIGURE 2.

Rate constants for association involving RNA, separated into RNA/RNA associations (left), RNA/small molecule associations (middle), and RNA duplex formation (right). Rate constants between structured RNAs and RNA ligands are shown in the first three columns, and are further subdivided into interactions that involve formation of base pairs only (BP), formation of base pairs and additional tertiary interactions (BP + 3°), and formation of non-Watson–Crick, tertiary interactions only (3°). For the binding of non-RNA ligands to structured RNAs, association rate constants are broken down into naturally occurring and in vitro–selected RNAs. Values are presented in Supplemental Tables S1–S3 and S5, and “N” is the number of examples in each column. The median values are as follows: RNA-ligand: BP: 3.3 × 105 M−1 sec−1; (BP + 3°): 5.7 × 106 M−1 sec−1; 3°: 2.0 × 103 M−1 sec−1. The median values for RNA-small molecule ligands are 5.5 × 104 and 8.1 × 104 M−1 sec−1 for natural and in vitro RNAs, respectively. The median value for RNA duplex is 1.3 × 106 M−1 sec−1.

Rate constants for association involving RNA, separated into RNA/RNA associations (left), RNA/small molecule associations (middle), and RNA duplex formation (right). Rate constants between structured RNAs and RNA ligands are shown in the first three columns, and are further subdivided into interactions that involve formation of base pairs only (BP), formation of base pairs and additional tertiary interactions (BP + 3°), and formation of non-Watson–Crick, tertiary interactions only (3°). For the binding of non-RNA ligands to structured RNAs, association rate constants are broken down into naturally occurring and in vitro–selected RNAs. Values are presented in Supplemental Tables S1–S3 and S5, and “N” is the number of examples in each column. The median values are as follows: RNA-ligand: BP: 3.3 × 105 M−1 sec−1; (BP + 3°): 5.7 × 106 M−1 sec−1; 3°: 2.0 × 103 M−1 sec−1. The median values for RNA-small molecule ligands are 5.5 × 104 and 8.1 × 104 M−1 sec−1 for natural and in vitro RNAs, respectively. The median value for RNA duplex is 1.3 × 106 M−1 sec−1.

Implications and mechanisms of slow RNA association

We consider the assembled RNA association data from biological and then physical perspectives. Our goal is to stimulate discussion about potential implications and origins of the observed trends and, ultimately, experiments that reveal the physical bases for the observed behavior and the ensuing biological manifestations.

Implications of slow RNA recognition in early evolution

We previously proposed that nonspecific RNA chaperones provided the first foothold into the RNA world for peptides and then proteins (Tsuchihashi et al. 1993; Herschlag 1995), and it is possible that association rates played a role in these transitions. According to this model, short peptide sequences that nonspecifically bind RNA acted as chaperones; short polymers and nonspecific interactions would be more likely to arise evolutionarily than long polymers and specific interactions, and RNA's tendency to fall into kinetic traps (Gartland and Sueoka 1966; Lindahl et al. 1966; Herschlag 1995; Uhlenbeck 1995; Russell 2008) would allow a selective advantage from nonspecific chaperones that could aid RNAs’ escape from these traps and thereby yield a higher fraction of RNA molecules in functional conformations (Fig. 3). Considering association rates, it is also possible that early peptides provided selective advantages by increasing association rates, as has been seen in model RNA/peptide and protein systems (e.g., Kumar and Wilson 1990; Munroe and Dong 1992; Herschlag et al. 1994; Portman and Dreyfuss 1994).
FIGURE 3.

Schematic depiction of the progression from an RNA world to the modern biological world. We use the RNA and protein subunits of RNase P as well as tRNA synthetase to illustrate this progression. RNA's tendency to form long-lived misfolded states is depicted on the far left. (i) Short peptide sequences that nonspecifically bind RNA can act as chaperones and provide a selective advantage for an organism that can escape these traps to yield a higher fraction of RNA molecules in functional conformations. (ii) Once peptides gained a foothold there would be a selective advantage for longer and more accurate peptide production that could lead to specific binding and to structured proteins. (iii) As protein catalytic function emerged and competed with ribozymes, RNA association rates may have limited catalytic efficiency and contributed to the current widespread use of proteins for catalysis. The blue circle corresponds to the amino acid covalently attached to tRNA by tRNA synthetase.

Schematic depiction of the progression from an RNA world to the modern biological world. We use the RNA and protein subunits of RNase P as well as tRNA synthetase to illustrate this progression. RNA's tendency to form long-lived misfolded states is depicted on the far left. (i) Short peptide sequences that nonspecifically bind RNA can act as chaperones and provide a selective advantage for an organism that can escape these traps to yield a higher fraction of RNA molecules in functional conformations. (ii) Once peptides gained a foothold there would be a selective advantage for longer and more accurate peptide production that could lead to specific binding and to structured proteins. (iii) As protein catalytic function emerged and competed with ribozymes, RNA association rates may have limited catalytic efficiency and contributed to the current widespread use of proteins for catalysis. The blue circle corresponds to the amino acid covalently attached to tRNA by tRNA synthetase. The next evolutionary step, according to this model, was the transition to longer peptides and specific functions. Once peptides gained a foothold there would be advantages and successive selection for longer and more accurate peptide production, presumably leading to specific binding and structured proteins (Fig. 3, “RNA + protein”) (Herschlag 1995; Cech 2009). But why would proteins have supplanted RNAs in functional roles so resoundingly? Common and reasonable ideas include an ability of proteins to create more specific folds and binding pockets (Herschlag 1995; Russell 2008) and to carry out more efficient and a broader range of chemical catalysis with their >20 amino acid side chains than the four canonical RNA side chains (Herschlag 1995; Narlikar and Herschlag 1997). Instead or additionally, the diversity of protein side chains instead of RNA side chains may have arisen because of the constraint on RNA to also serve as genetic material, because of more facile routes to synthesize and incorporate additional amino acid side chains (e.g., Huber et al. 2003; Leman et al. 2004) and/or because of limits imposed on RNA folding, molecular recognition, and catalytic function by its uniformly charged backbone (Narlikar and Herschlag 1997). The observations herein suggest that an additional driving force could have been operative: faster molecular recognition by proteins than by RNA (Figs. 1, 3). Especially if the metabolic milieu in the RNA world was rather complex, with efficient ribozymes limited by their recognition speed, increasing association rates and thus enzyme efficiency beyond that possible with ribozymes could have provided a selective advantage for proteins. Indeed, the potential for faster recognition by proteins may also have provided some of the selective advantage for additional amino acid side chains with different “shapes” that can better preorganize protein structures and thereby speed ligand binding (see below).

Implications of slow RNA recognition in modern-day biology

The simplest model for the presence of complex structured RNAs that carry out functions in modern biology is that these are vestiges from an earlier RNA world (Gilbert 1986; Cech 2009). Indeed, the RNA core of the ribosome provides the strongest single argument for functional RNAs predating proteins (Cech 2000; Nissen et al. 2000). While it is possible that all modern-day RNAs are evolutionary vestiges that would be eliminated in a perfectly engineered—rather than evolved—organism, it is also possible that certain processes benefit from utilization of RNA, and that some of this utilization is connected to how RNA recognizes ligands. Base-pairing is a highly effective and highly specific means to recognize and discriminate between RNAs, and is highly modular. While there are cases of modular proteins that bind individual bases, each protein module involves multiple amino acids and thus requires a longer and less probable an evolutionary search to switch recognition specificity to another base than the single base swap needed for RNA. For this reason, recognition by RNA may be more evolvable (Houle 1992; Wagner and Altenberg 1996; Kirschner and Gerhart 1998; Wagner 2005; Bloom et al. 2006). It may also be harder to achieve near-uniform affinity for any base accompanied by high specificity for that base with protein modules than with individual RNA bases. Finally, base pairs in RNA duplexes are themselves accessible and can therefore be recognized by additional RNA elements (or proteins) to increase (or decrease) base-pair affinity and specificity (e.g., Leontis et al. 2002; Carlson et al. 2003; Arya 2011; Masliah et al. 2013 and Hougland et al. 2006 and references therein). When else might RNA prove advantageous? Folding studies have demonstrated that RNA readily adopts stable alternative structures—and we and others have argued and shown that these alternative structures can be inactive and deleterious (Gartland and Sueoka 1966; Lindahl et al. 1966; Herschlag 1995; Uhlenbeck 1995; Russell 2008); however, riboswitches are biological RNAs that use such metastable states for regulation (Haller et al. 2011; Serganov and Patel 2012; Furtig et al. 2015), and ribosome and spliceosome function rely on multiple conformational states (Guthrie and Patterson 1988; Madhani and Guthrie 1994; Voorhees and Ramakrishnan 2013; Chen and Moore 2014; Warnasooriya and Rueda 2014; Rodnina et al. 2017). Thus, kinetically trapped metastable states could have been co-opted through evolution to provide a series of functional conformational steps, with these steps controlled by proteins such as ATPases and GTPases in pre-mRNA splicing and translation (Staley and Guthrie 1998; Chang et al. 2013; Jarmoskaite and Russell 2014; Bourgeois et al. 2016). Finally, the slow observed ligand recognition by RNA provides an opportunity for kinetic control of specificity when binding is rate limiting (Karbstein and Herschlag 2003; Gleitsman and Herschlag 2014). In this mechanism, referred to as “k specificity,” added neighboring interactions can increase the lifetime of an initial encounter complex and thereby provide an attached ligand multiple chances at productive binding prior to diffusing away and thus a higher k. For RNA, adding neighboring interactions can be as simple as adding residues that can base pair with the structured RNA, and indeed there is evidence for this mechanism with the Tetrahymena group I intron (Karbstein and Herschlag 2003). There, residues neighboring the guanosine at the 3′-intron–exon junction form base pairs with intron sequences near the guanosine binding site and make association of that guanosine faster and more probable than other G residues. Similarly, positively charged functional groups on ligands can provide an “anchor” that increases residence times and thus association rate constants. There is evidence for such an effect for a positively charged guanosine analog binding to a group I intron (RN Sengupta and D Herschlag, unpubl.) and analogous effects may be operative in aminoglycoside antibiotics binding to specific ribosome states (Vicens and Westhof 2003; Trylska and Kulik 2016) and other structured RNAs (Stage et al. 1995; Walter et al. 1999). These interactions can allow a ligand to “wait” for the formation of an inhibitable state and need not even be maintained in the final bound state, and it may be possible to co-opt this strategy for new therapeutics that target RNA.

Physical origins of slow RNA recognition

Molecular binding events are typically represented as simple association processes (Fig. 4A). The widespread awareness that the predominant conformational states of free and bound RNAs often differ has led to considerable discussion about induced fit or conformational capture models in which conformational changes occur after the initial encounter or prior to the encounter, respectively (Fig. 4B) (e.g., Leulliot and Varani 2001; Rose and Weeks 2001; Webb et al. 2001; Boehr et al. 2009; Daniels et al. 2014). Most kinetic experiments cannot distinguish these pathways (Chakraborty and Di Cera 2017), but even more critically, these alternatives represent limiting models for a wide range of possible pathways (Fig. 4C); labeling a process by one of these two designations does not reveal the myriad of molecular rearrangements and atomic-level forces that make up the actual pathway and determine the efficiency of association.
FIGURE 4.

Descriptions of molecular binding events. (A) Simple model of ligand binding. Binding event between receptor (R) and ligand (L) to form receptor-ligand complex (R•L) represented as simple association processes governed by a single set of association and dissociation rates. When association is slower than diffusion-limited collision, additional steps are needed to describe binding. (B) Induced fit and conformational capture models of ligand binding. These models account for slower-than-diffusional binding. The tertiary capture model invokes conformational changes prior to the encounter between R and L, while the induced fit model represents the opposite extreme in which conformational changes occur after the initial encounter. (C) Complex model of ligand binding considering an ensemble of conformational states and series of conformational changes to the receptor during the association process. Even the complex representation in the figure is a vast simplification, as it depicts a linear order of conformation changes (and a limited number of conformational steps) rather than a larger and more realistic multi-dimensional energy landscape that can account for multiple orders for the local conformational changes and multiple possible binding pathways.

Descriptions of molecular binding events. (A) Simple model of ligand binding. Binding event between receptor (R) and ligand (L) to form receptor-ligand complex (R•L) represented as simple association processes governed by a single set of association and dissociation rates. When association is slower than diffusion-limited collision, additional steps are needed to describe binding. (B) Induced fit and conformational capture models of ligand binding. These models account for slower-than-diffusional binding. The tertiary capture model invokes conformational changes prior to the encounter between R and L, while the induced fit model represents the opposite extreme in which conformational changes occur after the initial encounter. (C) Complex model of ligand binding considering an ensemble of conformational states and series of conformational changes to the receptor during the association process. Even the complex representation in the figure is a vast simplification, as it depicts a linear order of conformation changes (and a limited number of conformational steps) rather than a larger and more realistic multi-dimensional energy landscape that can account for multiple orders for the local conformational changes and multiple possible binding pathways. The nucleic acid association process that we know the most about is duplex formation (Fig. 5; see Bloomfield et al. 2000 and references therein). Early work on simple, short DNA oligos showed that duplex formation was much slower than diffusion and revealed a large temperature dependence (Ross and Sturtevant 1960; Wetmur and Davidson 1968), something not observed for diffusional processes (Berg and von Hippel 1985), and similar rates and temperature dependences are observed for RNA duplexes (Pörschke and Eigen 1971; Pörschke et al. 1973; Nelson and Tinoco 1982). The observed enthalpies of activation (ΔH‡) derived from these temperature dependences match enthalpies for the formation of 2–3 base pairs, leading to a zipper model involving nucleation via the formation of 2–3 bp, after which subsequent base-pair formation was favored over the dissociation of the 2–3 bp nucleation complex (Fig. 5; Pörschke 1977). More recent optical trap experiments with DNA hairpins provided strong support for this model (Woodside et al. 2006).
FIGURE 5.

Free energy diagram for simple duplex formation zipper model. In the zipper model, nucleation involves the formation of 2–3 bp, after which subsequent base-pair formation is favored over the dissociation of the nucleation complex. The bimolecular association rate between two ssRNA molecules is limited by this nucleation rate (kduplex). Because the formation of these 2–3 bp is uphill in energy from an initial encounter between the two strands, the association rate for duplex formation (kduplex) is slower than diffusional encounter (kdiff).

Free energy diagram for simple duplex formation zipper model. In the zipper model, nucleation involves the formation of 2–3 bp, after which subsequent base-pair formation is favored over the dissociation of the nucleation complex. The bimolecular association rate between two ssRNA molecules is limited by this nucleation rate (kduplex). Because the formation of these 2–3 bp is uphill in energy from an initial encounter between the two strands, the association rate for duplex formation (kduplex) is slower than diffusional encounter (kdiff). While the above results provide a compelling model, they do not provide the detailed timing and extent of the individual conformational transitions that together constitute the binding process and, correspondingly, they do not explain “why” the process has its maximal barrier subsequent to the formation of two base pairs. Indeed, even duplex formation is extraordinarily complex, involving many bond rotations, changes in solvation, and the breaking and making of multiple hydrogen bonds, van der Waals, and stacking interactions. As the steps involved in association represent basic RNA conformational transitions, learning more about association processes has the potential to deepen our fundamental understanding of RNA. More generally, the observation of slower association with structured RNAs than structured proteins provides a perspective for considering the question of the molecular features involved in RNA conformational dynamics and ligand association. Below we briefly consider molecular properties that differ between proteins and RNA and may contribute to their different association rates. Limited conformational preorganization: The limited number and diversity of RNA side chains and the greater number of backbone rotatable bonds, relative to protein side chains and its backbone, have been suggested to limit RNA packing and preorganization, and electrostatic repulsion from the uniformly charged RNA backbone may provide additional limitations (Narlikar and Herschlag 1997). Limited preorganization in the conformational substates matching that for the bound complex introduce the need for rearrangements prior to and/or accompanying binding and thus additional free energy barriers to binding. Indeed, it has been suggested that protein•RNA complexes recognize RNA or DNA by base-pairing faster than naked RNA because of preorganization of the bound RNA strand, although electrostatic attraction introduced with the protein partner could also or instead be responsible (Gorski et al. 2017 and references therein; see also below). Finally, we note that intrinsically disordered proteins (IDPs) can bind ligands faster than observed for structured RNAs (Shammas et al. 2016), indicating that an absence of structure alone does not compel slow binding, and we discuss this further below. Binding pockets: The concave geometry of many protein binding sites may facilitate association, by increasing the probability of encountering multiple tethering interactions, once the ligand enters the pocket. The shape of the binding pocket may also facilitate desolvation, by not allowing the water–water interactions that are most favorable in bulk water. Nevertheless, concavity may also decrease the number of collisions, underscoring the multiplicity of features that influence complex association events. Steric accessibility: Limited steric accessibility can slow association, but even proteins that surround their bound ligands can bind those ligands rapidly, presumably because the needed loop opening and closings and domain motions associated with binding are sufficiently rapid (e.g., Rozovsky and McDermott 2001; Henzler-Wildman et al. 2007). We know much less about the steric accessibility of binding sites for structured RNAs in solution and their opening and closing or breathing rates. Backbone degrees of freedom: Limited conformational preorganization within binding sites, as noted above, necessitates structural rearrangements to access configurations that allow or favor binding. In the course of these rearrangements, RNA may in general explore more conformational states than proteins that require rearrangement, due to RNA's more flexible backbone (Narlikar and Herschlag 1997). A broader search landscape would be expected to lengthen searches and lower the probability of finding the “right” subset of conformational states—those that allow productive binding to proceed. RNA base and sugar motions: When RNA rearrangements are needed for ligand binding, the bulkiness and rigidity of their base and sugar components might slow the needed motions, as they would require a larger vacated space to move into than a more flexible linear protein side chain, which might be likened to the ability of octopus to fit through narrow crevices (see https://www.youtube.com/watch?v=949eYdEz3Es). Indeed, as noted above, even intrinsically disordered proteins (IDPs) can bind ligands faster than observed for structured RNAs (Shammas et al. 2016). This faster binding is consistent with significant kinetic barriers for RNA rearrangements to binding-competent conformations that are absent for the IDPs. Nevertheless, electrostatic and/or hydrophobic IDP•ligand interactions could also or instead increase residence times and thus association rates, as described above for kon specificity, underscoring the complexity of molecular association events and the need for careful and systematic investigation. Hydrophobic interactions: Hydrophobic interactions, which occur in many protein–ligand binding processes (Fersht 1999), are less directional than hydrogen bonds and less constrained than stacking interactions and thus may allow facile exploration in early association complexes to “find” the additional binding interactions needed to favor formation of the final bound state. Hydrophobic groups may also shed solvating water more readily than polar and charged groups to allow native contacts to form. Nevertheless, protein ligands range in their hydrophobicity, polarity, and charge, so this strategy is unlikely to account for all fast protein associations; different association events are likely to differ substantially. Hydrogen bonds: The ability of RNA bases to form two or three hydrogen bonds with interaction partners would seem to be an advantage for association by increasing the residence time of precursor complex states. Perhaps these hydrogen bonds are weaker and short-lived, relative to at least some protein–ligand hydrogen bonds. Alternatively, hydrogen bonds (and other interactions) stabilizing the alternative state of the RNA may also be more stable or more numerous and thus slower to resolve, thereby leading to more dissociation of encounter complexes prior to formation of a stably bound state. Solvation and electrostatics: Macromolecular motions require or are coupled to solvent motions, and the charged nature of RNA may result in greater coupling and more restricted motion. It is also possible that reorganization of the ion atmosphere that surrounds RNA as a polyelectrolyte (Lipfert et al. 2014) must also readjust to facilitate motions and engenders an additional impediment to rearrangement. The myriad interactions and possibilities outlined above highlight that we have a long way to go to understand association involving both RNA and proteins. We hope that comparing and contrasting these different biomolecules has helped illuminate underlying factors; nevertheless, the factors are too numerous and varied to expect mechanistic clarity from these rudimentary comparisons alone. Understanding RNA and protein associations at the atomic level will require an integration of information-rich experiments that provide detailed information at high time resolution and, given the large number of interactions and degrees of freedom in these systems, tight feedback between these experiments and predictions from atomic-level molecular dynamics models. Finally, we have not proven that RNA cannot bind ligands faster; it will be fascinating to explore this question through SELEX-type experiments that increase the selective pressure for fast and not just strong binding.

Closing remarks

Nature is powered by but also limited by the physical and chemical properties of its constituent materials and by the laws of physics. The slow association observed for ligands binding to structured RNAs may reflect basic physical properties of RNA and may have impacted the course of evolution and the molecules and molecular behaviors seen in modern-day biology. We hope that the observations and initial discussion presented herein will stimulate discussions, models, and new experiments that deepen our understanding across the interface of physics and biology.

MATERIALS AND METHODS

Compiling kinetic data from the literature

Association rate constants were assembled from peer-reviewed literature through an extensive internet search using both Google Scholar and the PubMed database. Search terms included “association rate constant,” “on rate,” “kon,” “binding rate,” “ligand binding kinetics,” “association kinetics,” “fast association rate,” and “slow association rate.” For nucleic acids, the terms “duplex formation,” “duplex association,” and “base-pair association” were also used. Papers identified through this search were manually scanned for relevant information including the association rate constant and the identity of the binding partners and their classification (RNA, DNA, protein, small molecule). For RNA binding processes, we also compiled the method used for determining the association rate constant (e.g., surface plasmon resonance, NMR, etc.), experimental conditions (buffer, temperature, etc.), and, when available, the dissociation rate constant (koff) and equilibrium dissociation constant (Kd). This information was used to create Supplemental Tables S1–S5. Our aim was to create a database that was representative of the repertoire of macromolecular association rate constants. To avoid overrepresentation, for papers that contained multiple mutants of a single protein, typically only the wild-type association rate constant was entered. A notable exception comes from papers where the mutations were specifically aimed at altering binding kinetics; in these cases typically the extrema and wild-type values were included (e.g., work on barnase/barstar [Schreiber and Fersht 1996]). Similarly, for binding experiments where a large number of binding conditions were explored, association rate constants for every condition may not be included in the table; however, the conditions for the listed association rate constant are indicated.

Potential sources of bias

Whereas the protein data set contains different types of proteins from various organisms and biochemical pathways (Supplemental Table S3), the RNA data set (Supplemental Tables S1, S2) is less diverse, having a few predominant RNA types. Further, many of these RNAs have been artificially truncated to facilitate handling and biochemical analyses and these alterations have the potential to affect activity (e.g., Uhlenbeck 2003) and kon (e.g., Soulière et al. 2013). Second, while in principle molecular constants are independent of the technique used to obtain them, there are on occasion technique-specific effects. For example, surface plasmon resonance (SPR) involves association at a surface, so surface effects and accessibility can affect the rate of binding (e.g., Myszka 1997). Also, care is needed in applying this and other techniques and in analyzing results to prevent errors in measurement and interpretation (e.g., Fried 1989; Hellman and Fried 2007; Suddala and Walter 2014; Vander Meulen et al. 2016). For the RNA data presented, there were no systematic differences in association rate constants for SPR versus the other methods (not shown). Finally, binding interactions with a strong electrostatic component will be highly sensitive to salt concentration. For example, the kon value for the theophylline RNA aptamer increases by two orders of magnitude from 0 to 10 mM MgCl2 (Latham et al. 2009). Similarly, the Barnase–Barnstar association rate constant increases by four orders of magnitude from 5 mM to 2 M NaCl (Schreiber and Fersht 1996). These studies underscore the importance of varying reaction conditions to evaluate the sensitivity of kon toward salt concentration, and the ability to learn about this component of binding through such studies.

SUPPLEMENTAL MATERIAL

Supplemental material is available for this article.
  80 in total

1.  Less isn't always more.

Authors:  Olke C Uhlenbeck
Journal:  RNA       Date:  2003-12       Impact factor: 4.942

Review 2.  New approaches toward recognition of nucleic acid triple helices.

Authors:  Dev P Arya
Journal:  Acc Chem Res       Date:  2010-11-12       Impact factor: 22.384

3.  Kinetic analysis of macromolecular interactions using surface plasmon resonance biosensors.

Authors: 
Journal:  Curr Opin Biotechnol       Date:  1997-02-01       Impact factor: 9.740

Review 4.  Understanding nucleic acid-ion interactions.

Authors:  Jan Lipfert; Sebastian Doniach; Rhiju Das; Daniel Herschlag
Journal:  Annu Rev Biochem       Date:  2014-03-05       Impact factor: 23.643

5.  Two interconvertible forms of tryptophanyl sRNA in E. coli.

Authors:  W J Gartland; N Sueoka
Journal:  Proc Natl Acad Sci U S A       Date:  1966-04       Impact factor: 11.205

6.  Co-operative non-enzymic base recognition. 3. Kinetics of the helix-coil transition of the oligoribouridylic--oligoriboadenylic acid system and of oligoriboadenylic acid alone at acidic pH.

Authors:  D Pörschke; M Eigen
Journal:  J Mol Biol       Date:  1971-12-14       Impact factor: 5.469

Review 7.  Fundamental aspects of protein-protein association kinetics.

Authors:  G Schreiber; G Haran; H-X Zhou
Journal:  Chem Rev       Date:  2009-03-11       Impact factor: 60.622

8.  Inhibition of the hammerhead ribozyme by neomycin.

Authors:  T K Stage; K J Hertel; O C Uhlenbeck
Journal:  RNA       Date:  1995-03       Impact factor: 4.942

Review 9.  The spliceosome: disorder and dynamics defined.

Authors:  Weijun Chen; Melissa J Moore
Journal:  Curr Opin Struct Biol       Date:  2014-02-14       Impact factor: 6.809

10.  RNA annealing activities in HeLa nuclei.

Authors:  D S Portman; G Dreyfuss
Journal:  EMBO J       Date:  1994-01-01       Impact factor: 11.598

View more
  14 in total

Review 1.  Approaches for measuring the dynamics of RNA-protein interactions.

Authors:  Donny D Licatalosi; Xuan Ye; Eckhard Jankowsky
Journal:  Wiley Interdiscip Rev RNA       Date:  2019-08-20       Impact factor: 9.957

Review 2.  Understanding the Contributions of Conformational Changes, Thermodynamics, and Kinetics of RNA-Small Molecule Interactions.

Authors:  Aline Umuhire Juru; Neeraj N Patwardhan; Amanda E Hargrove
Journal:  ACS Chem Biol       Date:  2019-05-01       Impact factor: 5.100

Review 3.  Enzymatic Cascade Reactions in Biosynthesis.

Authors:  Christopher T Walsh; Bradley S Moore
Journal:  Angew Chem Int Ed Engl       Date:  2019-02-20       Impact factor: 15.336

4.  RNA drug discovery: Conformational restriction enhances specific modulation of the T-box riboswitch function.

Authors:  Ian Armstrong; Ali H Aldhumani; Jia L Schopis; Fang Fang; Eric Parsons; Chunxi Zeng; Md Ismail Hossain; Stephen C Bergmeier; Jennifer V Hines
Journal:  Bioorg Med Chem       Date:  2020-08-06       Impact factor: 3.641

Review 5.  The roles of structural dynamics in the cellular functions of RNAs.

Authors:  Laura R Ganser; Megan L Kelly; Daniel Herschlag; Hashim M Al-Hashimi
Journal:  Nat Rev Mol Cell Biol       Date:  2019-08       Impact factor: 94.444

6.  Hidden intermediates in Mango III RNA aptamer folding revealed by pressure perturbation.

Authors:  Balasubramanian Harish; Jinqiu Wang; Eric J Hayden; Bastian Grabe; Wolf Hiller; Roland Winter; Catherine A Royer
Journal:  Biophys J       Date:  2021-12-28       Impact factor: 4.033

7.  NMR Chemical Exchange Measurements Reveal That N6-Methyladenosine Slows RNA Annealing.

Authors:  Honglue Shi; Bei Liu; Felix Nussbaumer; Atul Rangadurai; Christoph Kreutz; Hashim M Al-Hashimi
Journal:  J Am Chem Soc       Date:  2019-12-16       Impact factor: 15.419

8.  The kinetic landscape of an RNA-binding protein in cells.

Authors:  Deepak Sharma; Leah L Zagore; Matthew M Brister; Xuan Ye; Carlos E Crespo-Hernández; Donny D Licatalosi; Eckhard Jankowsky
Journal:  Nature       Date:  2021-02-10       Impact factor: 49.962

Review 9.  Principles for targeting RNA with drug-like small molecules.

Authors:  Katherine Deigan Warner; Christine E Hajdin; Kevin M Weeks
Journal:  Nat Rev Drug Discov       Date:  2018-07-06       Impact factor: 84.694

10.  Hierarchical mechanism of amino acid sensing by the T-box riboswitch.

Authors:  Krishna C Suddala; Javier Cabello-Villegas; Malgorzata Michnicka; Collin Marshall; Edward P Nikonowicz; Nils G Walter
Journal:  Nat Commun       Date:  2018-05-14       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.