Literature DB >> 26875531

Availability: A Metric for Nucleic Acid Strand Displacement Systems.

Xiaoping Olson¹, Shohei Kotani¹, Jennifer E Padilla¹, Natalya Hallstrom¹, Sara Goltry¹, Jeunghoon Lee¹, Bernard Yurke¹, William L Hughes¹, Elton Graugnard¹.

Abstract

DNA strand displacement systems have transformative potential in synthetic biology. While powerful examples have been reported in DNA nanotechnology, such systems are plagued by leakage, which limits network stability, sensitivity, and scalability. An approach to mitigate leakage in DNA nanotechnology, which is applicable to synthetic biology, is to introduce mismatches to complementary fuel sequences at key locations. However, this method overlooks nuances in the secondary structure of the fuel and substrate that impact the leakage reaction kinetics in strand displacement systems. In an effort to quantify the impact of secondary structure on leakage, we introduce the concepts of availability and mutual availability and demonstrate their utility for network analysis. Our approach exposes vulnerable locations on the substrate and quantifies the secondary structure of fuel strands. Using these concepts, a 4-fold reduction in leakage has been achieved. The result is a rational design process that efficiently suppresses leakage and provides new insight into dynamic nucleic acid networks.

Entities: Chemical Disease Gene Species

Keywords: DNA; availability; fraying; leakage; mutual availability; nucleic acids; reaction networks; strand displacement

Mesh：

Substances：
DNA

Year: 2016 PMID： 26875531 PMCID： PMC5259754 DOI： 10.1021/acssynbio.5b00231

Source DB: PubMed Journal: ACS Synth Biol ISSN： 2161-5063 Impact factor: 5.110

Nucleic acids are programmable materials because of their predictable Watson–Crick base pairing[1,2] and well-documented thermodynamics,[3−7] kinetics,[8−13] and mechanics.[14] In addition to static structures,[15−19] nucleic acids can be programmed into dynamic devices using toehold-mediated strand displacement,[9,20−22] whereby kinetic barriers to strand exchange are lowered via short complementary sequences that bring components into proximity. Dynamic nucleic acid technology utilizes toehold-mediated DNA strand displacement (DSD) to construct (1) nonenzymatic catalytic chemical reaction networks for isothermal signal amplification;[23−26] (2) catalytic hairpin assemblies for diagnostics, therapeutics, and theranostics;[27,28] (3) nanomachines[9,20,29] and walkers[30,31] for work and motility; (4) circuits for energy transport and logic;[32,33] and (5) networks for computation.[34−37] Although they are compelling, these demonstrations are limited in the scale and complexity necessary for real-world applications by a single fundamental challenge: network leakage. Leakage refers to the production of an unwanted output in the absence of an input, and it is the Achilles’ heel of DSD systems, independent of the DNA/RNA system under consideration. The challenge of leakage must be overcome to achieve the device performance (i.e., speed, sensitivity, selectivity, stability, and scalabilty) necessary for broader adoption.

Leakage Problem

By design, DSD systems are metastable networks designed to be set into operation by the addition of a specific single-stranded sequence that triggers the reaction. Leakage occurs when system components react in the absence of a trigger, and its effect undermines the performance of catalytic networks,[24,38] seesaw gates,[36] catalytic hairpin assemblies,[26,39,40] and hybridization chain reactions.[25] Extrinsic sources of leakage, including chemical impurities, defective oligonucleotides, and malformed network components, can be minimized with careful processing.[26,41] In comparison, intrinsic leakage results from the design of the network, even if the components are perfect, and limits the ultimate DSD performance. Sources of intrinsic leakage may be understood by considering the catalytic reaction network from Zhang et al. illustrated in Figure a.[24] In this representation, unique sequences are represented by labeled domains and complementary domains are denoted with asterisks (strand sequences are provided in Supporting Information S1). This network consists of a three-strand substrate complex in which the “upper” signal and output strands occupy domains of the lower backbone strand. Briefly, network operation is designed to be triggered by a single-stranded catalyst strand that hybridizes with an exposed backbone toehold (y*) and initiates three-way branch migration to displace the signal strand and expose a sequestered backbone toehold (3*). The catalytic cycle is completed by a similar process with the fuel strand reacting with the backbone to displace the output strand, the original catalyst, and form a waste product, as illustrated. As the end of a cycle results in no gain or loss of base pairs, this network is driven forward thermodynamically by a net gain in entropy.

Figure 1

Domain representation of the catalytic DNA strand displacement system from Zhang et al.[24] and four example leakage pathways. (a) In the catalyzed strand displacement pathway of the reaction network, a catalyst strand initiates a reaction cycle driven forward thermodynamically by a net gain in entropy. The strand displacement exchanges the catalyst for the signal strand and exposes a sequestered toehold on the substrate backbone for the fuel, which reacts with the intermediate to complete the cycle and form a waste duplex. Sequences and domains are listed in Supporting Information S1. (b) In the four leakage pathways, the fuel reacts with the substrate backbone in the absence of a catalyst by exploiting fraying at the 5′, nick, and 3′ locations of the substrate. In this network, leakage occurs when the substrate and fuel react to produce signal in the absence of catalyst. This leakage reaction depends on successful nucleation of the fuel strand in the absence of an intended toehold. Fuel and substrate must bump into one another favorably, meaning that key bases must have some chance to interact and nucleate. Once nucleated, the leakage reaction proceeds through a branch migration process until strand invasion is complete. In this process, under the conditions reported here, nucleation is the rate-limiting step for the leakage reaction.[8,11,42] Example leakage reactions are shown in Figure b.

Thermal Fluctuations in DNA

Although often considered to be a zero toehold strand displacement event,[43] intrinsic leakage reactions are enabled by transient toeholds created via thermal fluctuations at the ends of the substrate and at the nick between the output and signal strands. Breathing refers to the spontaneous dissociation of individual base pairs in the interior of the duplex, and fraying is dissociation of the terminal base pairs (at the duplex ends or nicks). Studies of base pair fluctuations indicate that at room temperature the terminal base pairs are 50% open and the penultimate bases (one base pair from the ends of a duplex) are 10–20% open, whereas the fraction of open interior base pairs is ∼10–6 with an open lifetime of ∼0.1 μs.[44−46] Additionally, single-stranded DNA overhangs (toeholds or specificity domains, such as domains y* and 9a in Figure a) increase the stability of the neighboring duplex base pairs, but they do not prevent fraying.[44] Thus, fraying of two base pairs at the ends and nick point of the substrate duplex is expected to be the dominant leakage mechanism. These vulnerable regions are highlighted in Figure a.

Figure 2

(a) Sequence and domain representation of the substrate with fraying locations highlighted. (b) Sequence and domain representation of the original fuel strand. Corresponding to the fraying locations of the substrate, the locations of fuel base mismatches are numbered, highlighted, and shown in bold font. They are 5′ end (bases 1 and 2), nick (bases 24 and 25), and 3′ end (bases 43 and 44). (c) Leakage rate constants for fuel modifications. The concentrations for leakage reactions are fuel (1300 nM), substrate (14 nM), and reporter (20 nM). The black bar represents the leakage rate with the original fuel strand. Pink, orange, and blue bars represent leakage rates for fuels with 1 and 2 nt modifications at 5′, nick, and 3′ locations, respectively. The rates are labeled by the identity of the modified base and its location on the fuel (see panel (b) for locations and originial base idenities). For example, G1T2 indicates that base 1 was changed from C to G and base 2 was changed to C to T. Error bars show the standard deviation from the mean for select samples in triplicate to estimate experimental error. Leakage caused by fraying, when compared to toehold invasion, is approximately 4–6 orders of magnitude slower.[11] Even this small leakage drastically limits the scalability of feed-forward, cross-catalytic, and autocatalytic networks, where fuel invasion will unintentionally release the catalyst of the coupled networks. Thermal fluctuations such as fraying have long been suspected as the source of intrinsic leakage, and strategies to suppress it include (1) careful sequence and domain design such as using GC pairs at the fraying locations,[24] (2) use of proper reaction conditions,[47] (3) use of GC-rich sequences or introduction of buffer or clamping domains that are absent from fuel sequences,[36,40] (4) sequestration of domains in hairpin structures,[48] (5) use of extremely pure DNA strands made in bacteria,[26] (6) incorporation of mismatches,[39] and (7) novel domain level redundancy.[49] While each of these approaches has shown some effect, a clear set of design rules have not emerged for consistently and efficiently reducing leakage.

Insight from Secondary Structure

While previous studies have targeted the location and thermodynamic cost of mismatches in strand displacement systems,[39,50] design principles such as mismatch positions, mismatch numbers, and mismatch identities for suppressing leakage have not emerged. Importantly, base pair mismatch modifications also change the ensemble of DNA secondary structures, which can impact their nucleation and branch migration rates.[51,52] Here, we report a systematic investigation of the effects of mismatches on intrinsic leakage suppression and network performance using the network shown in Figure . All one and two base-pair mismatches produced by fuel strand sequence modifications were characterized at the 5′, 3′, and nick locations (see Figure a,b) by measuring the reaction rates of uncatalyzed (leakage) and catalyzed reactions using fluorescence photometry. These locations are related to locations on the substrate where fraying is expected to occur and enable nucleation between the backbone and fuel in the absence of catalyst. The results were analyzed on the basis of the mismatch identity, mismatch position, mismatch numbers, and the secondary structure of the fuel strands. To quantify the effects of secondary structure on leakage rates, we calculated the probability that a base is unpaired at equilibrium using NUPACK,[7,53] as discussed below. We define this probability as the availability of a base and introduce availability as a design concept for analyzing and engineering the stability of DNA reaction networks. To further understand the relationship between leakage rates and secondary structures, we define total mutual availability as the sum of all pairwise products of the availabilities of corresponding bases between fuels and the backbone. Taking consideration of both mismatches and secondary structure provides a more complete analysis of leakage suppression, and inclusion of the availability and mutual availability during our analysis provides insight toward rational design principles for minimizing leakage.

Results and Discussion

Effect of Mismatch on Leakage Rate Constants

The leakage data for each fuel modification were fit with a second-order kinetics model to extract the leakage rate constant, kleak (Supporting Information S3), and the results are shown in Figure c. The largest leakage suppression was observed for fuel modifications that created two mismatches at the 5′ end of the fuel (bases 1 and 2) and one or two mismatches at the nick in the substrate (base 25 and/or bases 24 and 25). While these locations showed consistent leakage suppression, no clear pattern between mismatch base identities and leakage rates emerged. For example, G–A and G–T mismatches show no suppression at base 1 and a factor of 2 suppression at base 25, whereas a G–G mismatch reduces leakage in both locations despite the fact that G–G mismatches have a lower energy penalty than other G or C mismatches when placed within a DNA duplex.[6] While the G–G mismatch consistently reduces leakage at bases 1, 2, and 25, no clear impact from mismatch identity is observed for bases 43 and 44. Although excess fuel in solution could interfere with leakage from the 3′ end of the fuel (at the toehold of the substrate backbone; see Supporting Information S4), the data indicates that mismatch identity alone or an associated energy penalty does not ensure leakage suppression.

Availability

Beyond mismatch identity, key insight into leakage suppression can be gained by analysis of the secondary structure ensembles of the original and modified fuels. While domain level designs assume the fuel to be purely single-stranded, thermodynamic analysis using NUPACK reveals a range of secondary structures. The minimum free energy (MFE) structures are shown in Figure a and have a moderate level of base pairing between six nucleotides of the fuels. Although the MFE structures indicate base pairing between bases 5–23, 6–22, and 7–21 for all but one (G25) fuel sequence, the probability of pairing is affected by the modifications at bases 1, 2, 24, 25, 43, and 44. The MFE structures for all fuels are provided in Supporting Information S5. G25 indicates that base 25 was changed from C to G. More generally, the letter denotes the base identity and the number denotes the base position from 5′ end of the fuel. While the MFE structures are color-coded by the probability for being in the particular MFE structure shown, greater clarity is obtained by plotting the availability for each base in the fuel sequences, as shown in Figure b,c (upper plots). We define availability as the probability that a base is unpaired at equilibrium, and it quantifies the per-base effects of a sequence’s ensemble of secondary structures. Availability is calculated by NUPACK from the predicted secondary structure ensemble lacking pseudoknots and non-Watson–Crick interactions.[7,53] Modifications to the fuel strand alter the availability of the bases since each sequence has a unique ensemble of secondary structures. Figure b,c (lower plots) shows the changes in base availabilities for modified fuel strands relative to the original fuel sequence. NUPACK calculations were performed using the following parameters: (1) 25 °C operating temperature; (2) 0.05 M Na+ and 0.0115 M Mg2+ ion concentrations; (3) 14 nM substrate component concentrations, allowed complex size of 3; (4) 1.3 μM fuel concentration, allowed complex size of 2; and (5) dangles set to “all” in all cases to account for single-stranded tails.

Figure 3

Minimum free energy (MFE) structures and base availabilities for select fuel strands and the substrate backbone. (a) MFE structures of the original fuel strand and fuel modifications A1A2, G1T2, G1G2, and C24 calculated by NUPACK. The Gibbs free energy of each structure is provided in units of kcal/mol. (b) Base availabilities for the original fuel and fuel modifications A1A2, G1T2, and G1G2 (upper plot) and the differrence in base availabilities (ΔA) for each modification relative to the original fuel (lower plot). (c) Base availabilities for the original fuel and fuel modification C24 (upper plot) and the difference in availability relative to the original fuel (lower plot). (d) MFE structure of the substrate calculated by NUPACK. (e) Availability of each base in the backbone strand of the substrate. Because the fuel strand hybridizes with the backbone strand on the substrate, the base positions of the backbone strand were plotted on the x axis and labeled to correspond to the complement of the fuel strand. Consistent with the MFE structures shown in Figure a, the availabilities of fuel bases 5, 6, 7, 21, 22, and 23 range between 0.1 and 0.6. However, several other bases have availabilities less than 1, which influences the probability of those bases nucleating a leakage reaction. Additionally, availability calculations exhibit subtle changes for modified fuel strands (Figure b,c) that have a large impact on leakage and are not limited to the modified bases. For example, the availabilities of several bases were considerably different between the original fuel and the G1T2 fuel (Figure b), especially for bases 1–4 and bases 13–18, which show a drop, and bases 21–23, which show a rise. While most modifications decreased availability for certain bases or left them nearly unchanged (for example, A1A2; Figure b), the C24 fuel modification increased the availability of several bases when compared to that of the original fuel (Figure c) and exhibited the highest leakage rate measured (Figure c). The base availabilities for all fuel modifications are provided in Supporting Information S6 and are ordered in terms of leakage rate in Figure S4. The data clearly show the positive correlation between lower fuel base availability and lower leakage rate. To fully exploit the concept of availability for understanding the source of leakage, the availability of the bases of the substrate backbone must also be considered because both fuel and backbone bases must be available simultaneously for nucleation to occur. Figure d,e shows the MFE structure of the substrate and availability of the backbone bases. Ideally, the backbone would have zero availability within double-stranded domains (bases 1*–44*) and unity availability at the toehold (bases 45*–50*). However, the availabilities are ∼0.1 at base 1* ∼0.24 at base 24*, and ∼0.23 at base 25*, indicating that the substrate is vulnerable to leakage at these locations (i.e., nucleation with bases 1, 24, and/or 25 of the fuel strands). Thus, in the context of the substrate, availability quantifies the degree of fraying or breathing of the duplex bases.

Base Modifications at the 5′ End of the Fuel

Given that bases 1*, 24*, and 25* of the substrate backbone are most vulnerable to leakage, fuel modifications that reduce availability for fuel bases 1, 24, and 25 can be expected to exhibit the lowest leakage, and this is shown to be the case. For example, leakage was suppressed for the G1, G1T2, and G1G2 fuel modifications. The G1 leakage drop corresponds to a 5% reduction in the availability of bases 1 and 2 (Supporting Information S6). In addition, the availability of bases 1 and 2 of G1T2 decreased 40%, whereas for G1G2, the availability of bases 1 and 2 decreased 40 and 54%, respectively (Figure b). These modified fuels yielded a 4-fold reduction in leakage when compared to that of the original fuel strand. In contrast, the base availabilities in A1, T1, and A1A2 strands are nearly identical to the original fuel strand (Figure b and Supporting Information S6), and their leakage suppression was minimal. Here, the changes in availability for single bases on the modified fuel strands provide a compelling explanation for the variation in leakage rates.

Base Modifications at the 3′ End of the Fuel

The low availabilities at backbone bases 43* and 44* imply a lack of fraying, which would be expected to minimize the impact of changes in the availabilities of fuel bases 43 and 44 at the 3′ end of the fuels. This hypothesis is consistent with the uniform and relatively minor leakage reductions for fuels with reduced availabilities at bases 43 and 44, such as T43T44 and G43T44 (Supporting Information S6). However, the data for leakage at bases 43* and 44* are confounded by spurious hybridization of the fuel’s y domain with the y* toehold domain of the substrate (bases 45* to 50*). This hybridization causes the x domain of both the fuel strand and the signal strand to compete to bind with the x* domain of the substrate (Supporting Information S4). The competition is expected to be significant since the fuel is at 100× excess concentration. This spurious hybridization is expected to sterically hinder leakage at bases 43* and 44* of the backbone and is likely an important factor in the lack of variation in the leakage rate for base modifications at the 3′ end of the fuel strand.

Base Modifications of the Fuel at the Nick Location

Base 24* and base 25* on the substrate backbone have high availabilities, which suggests a greater degree of fraying (Figure e). Consistent with this expectation, all fuel mismatch modifications at base 25 were observed to suppress the leakage rates. We attribute the reduced leakage for mismatch modifications at fuel base 25 to the lower availabilities at base 25 for the modified fuels compared with the original fuel. For example, availabilities at base 25 for A25, T25 and G25 were reduced from 21% to 62% and the leakage was reduced from 49% to 68% compared with original fuel. A similar correlation between availability and leakage rate was observed for mismatch modifications at fuel base 24. The single base mismatch at fuel base 24 reduced the leakage for T24, for which the availability of base 24 decreased by 72%. The leakage nearly doubled for C24, which exhibited a 16% higher availability for base 24. Lastly, no change in leakage rate was observed for A24, for which the base 24 availability increased by 9%. An additional factor in the increased leakage observed for C24 may stem from its increased availability at several bases when compared to the original sequence (Figure c). An increase in availability corresponds to a decrease in secondary structure, which then lowers the activation energy for nucleation between fuel and substrate. For further consideration, an analysis of base availability in the context of the intuitive energy landscape model of Srinivas et al.[11] is provided in Supporting Information S7.

Mutual Availability

On the basis of the above observations, base availability is a potentially powerful new design tool with base-specific resolution. In our qualitative explanations, we focused on the separate availabilities of the bases of the fuel or substrate backbone strands. However, as noted above, leakage reactions require nucleation of these strands with each other. To analyze the combined effects of the availabilities of bases from both strands and to find a quantifiable correlation, we define and analyze a mutual availability (m) and total mutual availability (M). The mutual availability is simply the product of the availabilities of any two bases, defined as m = PF(PB(, where PF( is the availability of base i of the fuel strand and PB( is the availability of base j of the backbone strand within the substrate complex. The total mutual availability is defined as M = ∑(m) = ∑(PF(PB(), where i indexes the complementary base pairs in the fuel-substrate waste product in correct registration. In other words, i* is the base position of backbone strand that matches the complementary position i of the fuel strand. For nucleation to occur, key bases of the fuel and backbone must be available to hybridize. Total mutual availability, M, as defined above, provides a quantitative metric for analyzing fuel and substrate sequence interactions. To assess whether M could be correlated with leakage rate, Figure a plots leakage rates versus the calculated values of M for all fuel sequence modifications. On the basis of the apparent exponential dependence, the natural log of the leakage rate constant is plotted versus M in Figure b and is colored-coded by 5′, 3′, and nick modifications of the fuel. Select experiments were repeated in triplicate, and the scatter of the data is greater than the experimental error. Error bars represent the standard deviation of the mean.

Figure 4

(a) Leakage rate constants for each fuel modification plotted versus total mutual availability between the fuel strand and the backbone strand on the substrate. The leakage rate for the original fuel is shown in black, while the 5′ end, nick and 3′ end fuel modifications are shown in blue, green, and orange, respectively. Representative error bars of select samples are shown, indicating that the scatter of the data is greater than the experimental error. Error bars represent the standard deviation of the mean of three samples. (b) Natural log plot of the leakage rate constant versus the total mutual availability. The green, and blue lines are the fits for the nick modifications, 5′ end modifications. Linear fits to the data are provided as guides to the eye. The 5′ and nick modifications exhibited linear trends and were fit individually. Given the near zero availability of base 44* of the backbone, 3′ fuel modifications had very little impact on total mutual availability. The nick fuel modifications and their corresponding fit are depicted in green and have a slope of 2.87 with an adjusted R2 value of 0.50. 5′ fuel modifications and their corresponding fit are depicted in blue and have a slope of 18.26 with an adjusted R2 of 0.81. While the primary discussion here is focused on single location fuel modifications, multiple location modifications (e.g., 5′ end and nick locations) further reduced the leakage rate to an almost undetectable level (about 100-fold), which are presented and discussed in Supporting Information S8. These data provide further support for total mutual availability as a metric for leakage. The leakage rate constant appears to be exponentially related to the total mutual availability of the fuel and substrate backbone, suggesting that M may be related to a nucleation activation energy barrier. However, our data do not distinguish between barriers to nucleation and branch migration nor can they identify the critical nucleus for leakage to proceed. The scatter in Figure may result from the incompleteness of our mutual availability model, which does not include branch migration steps, and limitations in total mutual availability as a measure of nucleation barriers. For example, NUPACK does not include pseudoknots, G-quartets, nick overhangs, and the coaxial stacking parameter into its calculations. Additionally, base availability, as defined, does not include tertiary nucleic acid structure. The correlation between the leakage rate constant and total mutual availability also needs careful consideration. For example, as the number of fuel mismatches increases, the leakage rate approaches zero, and the reaction stalls because of a lack of thermodynamic driving force. In comparison, when the total mutual availability is high, an effective toehold is formed, and the nucleation barrier is reduced, which means diffusion is the rate-limiting factor. The relationship between M and the leakage rate constant is thus constrained by these limits.

Catalyzed Reactions

It has generally been observed that the rate constants between catalytic reactions and leakage reactions are coupled. It has been shown that when leakage rates were reduced by introducing mismatches, catalytic rates were also decreased or maintained.[39] Likewise, here we also found that some fuel mismatch modifications maintained the original catalytic rate while decreasing the leakage rates. The kinetics data of each fuel modification were fit with a third-order kinetics model with an approximation to extract the catalyzed rate constant, kcat (Supporting Information S3). Catalyzed rate constants ranged 3 orders of magnitude for the fuel modifications and are plotted in Figure .

Figure 5

Rate constants of catalyzed reactions between the catalyst (1 nM), fuel (13 nM), substrate (14 nM), and reporter (20 nM) monitored via fluorescence. The black bar represents the original fuel strand. Pink, orange, and blue bars represent 1 and 2 nt modifications at 5′, nick, and 3′ locations, respectively. Error bars show the standard deviation from the mean for select samples in triplicate to estimate experimental error. The effect of fuel sequence modifications on the catalyzed reaction can be understood via the reaction mechanism. The modification positions play a critical role in the catalyzed reaction, as discussed further in Supporting Information S8 and S9. In Figure , trends can be observed by grouping the modification positions of the fuel strand at the 5′ end (bases 1 and 2) with base 24 of the nick and at the 3′ end (bases 43 and 44) with base 25 of the nick. As expected from the reaction mechanism shown in Figure , mismatches at base 25 of the nick location have the greatest impact because it affects fuel hybridization with the intermediate (I3), followed by mismatches at the 3′ end that impede catalyst release. Fuel modifications at the 5′ end and base 24 of the nick locations have minimal impact on the catalytic rate. A strategy to speed up the catalytic reaction is to increase the fuel toehold length by deleting one nucleotide at the 5′ end of the catalyst. Preliminary experiments for multiple-location fuel modifications indicate that this approach is effective for fuels that include a modification at base 25, whereas it has a counter effect for other fuel modifications (Table S1 and Supporting Information S8). Since one catalytic reaction cycle in this system has many intermediate steps including toehold exchange, toehold-mediated strand displacement, and spontaneous toehold dissociation, the correlation between the overall catalytic rate constant and the total mutual availability of the catalyzed reaction was not studied in this work.

System Performance

An ideal DNA strand displacement system would have elevated selectivity to the catalyst, sensitivity to the catalyst, high catalytic turnover (high kcat), stability in the absence of the catalyst (low kleak), and scalability because of suppressed crosstalk and leakage. Thus, as a practical metric for the performance of the system, we use the ratio, kcat/kleak. The larger the ratio, the greater will be the capacity to distinguish a response to the catalyst from the background leakage. Given that the leakage rate is strongly coupled to the catalytic rate for fuel sequence modifications at bases 25, 43, and 44, the suppression of the catalytic reaction reduced performance more than leakage suppression increased it. Locations of strong coupling between catalytic rate and leakage rate can be considered to be limitations of intrinsic leakage suppression; they are a result of the domain design of this system and will be different for other domain level designs. Modifications at base 24 had no net benefit due to the low availability of the substrate at this location. Improvements in performance came from introducing sequence mismatches at the 5′ end of the fuel (bases 1 and 2), where leakage and catalytic reaction rates are decoupled. As measured by the kcat/kleak ratio, the G1T2 fuel modification has the best performance overall (Supporting Information S10). This modification targeted the vulnerability at base 1* of the backbone strand due to nonzero availability. It reduced the leakage reaction rate by a factor of 4 but maintained a catalytic rate close to the original fuel strand (Figure a).

Figure 6

(a) Ratio of the catalyzed to leakage reaction rates (kcat/kleak) for single location fuel modifications to evaluate overall system performance. Catalyzed reactions were performed with the catalyst (1 nM), fuel (13 nM), substrate (14 nM), and reporter (20 nM), monitored via fluorescence, and uncatalyzed leakage reactions were performed with fuel (1300 nM), substrate (14 nM), and reporter (20 nM). The black bar represents the original fuel strand. Pink, orange, and blue bars represent 1 and 2 nt modifications at 5′, nick, and 3′ locations, respectively. Error bars show the standard deviation from the mean for select samples in triplicate to estimate experimental error. (b) Representative fluorescence data of catalytic reactions: the original fuel (empty black circles), G1T2 modified fuel (empty red triangles), C24T25 modified fuel (yellow diamonds), and G43A44 modified fuel (empty blue stars). (c) Representative fluorescence data of leakage reactions: the original fuel (solid black circles), G1T2 modified fuel (solid red triangles), C24T25 modified fuel (solid yellow diamonds), and G43A44 modified fuel (solid blue stars). The gray lines are the calculated fits to each curve, and the solid blue and purple lines represent reactions between the reporter and the original fuel and the reporter and the substrate, respectively. In the literature, mismatch modifications have shown more dramatic improvements to leaky systems and systems using low-quality strands. Mismatches in Jiang et al.’s DNA catalytic hairpin design with large leakage showed 25-fold improvements in signal-to-background ratio compared with that of the original hairpins.[39] By contrast, Bhadra et al.’s optimized RNA catalytic hairpin system shows only 7-fold leakage reduction, without disturbing the catalytic reaction rate, by introducing mismatch modifications. However, when using unpurified RNA strands in this system, a 13–15-fold reduction in the leakage is observed when compared to that of the control.[54] Zhang et al.’s system was optimized and purified, having an intrinsic leakage rate of only ∼8 M–1 s–1. This work demonstrates that a 4-fold leakage reduction in this system can be achieved while leaving the catalyzed reaction rate nearly unchanged. Mismatches at substrate fraying locations reveal the power of availability to influence circuit performance.

Availability Analysis of Other Networks

The concept of mutual availability is expected to apply to other network designs as well. In an effort to validate the mutual availability concept with another network design, we analyzed a hairpin design from Jiang et al.[39] Their study provided sufficient data to apply an analysis of total mutual availability, and we estimated the rate constants for the hairpin design, as described in Supporting Information S11. Even though the total mutual availability values vary relative to the Zhang et al. network, the observed trend is the same, even in a different buffer. The results provide compelling support for the validity of mutual availability as a metric for sequence-level network analysis and design.

Conclusions

The effects of base-pair mismatches on leakage suppression and total network performance were systematically investigated using the well-established catalytic reaction network from Zhang et al.[24] Fuel modifications at the 5′, 3′, and nick locations were chosen because they correspond to vulnerable substrate locations where nucleation is expected to occur. Qualitatively, availabilities of the substrate and the fuel strand bases were found to correspond well to observed trends in the leakage rate data. Quantitatively, a trend between the total mutual availability and the leakage rates was observed regardless of mismatch identities, mismatch numbers, and mismatch locations. This work suggests availability and mutual availability as design concepts for optimal performance of nucleic acid reaction networks. Future work can further explore the correlation between the total mutual availability and the activation energy, aiming at a more detailed model of leakage mechanisms. In addition, the correlation between the overall catalytic rate constant (kcat) and the total mutual availability of the catalyzed reaction should be studied to allow predictions of the practical metric for the performance of the system (kcat/kleak). This study also leaves room for refinement against other interactions that NUPACK does not calculate, such as G-quartets, non-Watson–Crick interactions, pseudoknots, and geometric constraints. With improved design metrics and refined design tools, nonenzymatic amplification systems can be used as amplifiers for diagnostics, and nucleic acid chemical reaction networks will become more robust tools for theranostics, molecular computation, and synthetic biology.

Methods

Chemicals and DNA Complex Purification

Solvents and chemicals were purchased from Sigma-Aldrich unless otherwise noted. DNA oligonucleotides were synthesized and purified with high-performance liquid chromatography by Integrated DNA Technologies (IDT). Reporter strands were labeled with 5′ fluorophores (TET) and 3′ Iowa Black dark quenchers (IABkFQ) by IDT. Oligonucleotides were prepared in 1× TE buffer (10 mM Tris-HCl, pH 8.0, 1 mM EDTA, diluted from 100× TE). Final stock concentrations (100 μM) were confirmed by measuring the 260 nm absorbance (Eppendorf Biophotometer) using extinction coefficients provided by IDT. TAE buffer (10×; 40 mM Tris, 40 mM acetate, 1 mM EDTA) was purchased from Hoefer or Fisher Scientific and then mixed with 125 mM Mg(C2H3O2)2·4H2O. DNA components were diluted to 30 μM in 1× TAE buffer with 12.5 mM Mg2+. DNA components were annealed at 95 °C for 5 min using a thermocycler (Eppendorf Mastercycler Nexus Gradient) and cooled to room temperature over ∼90 min to form substrates and reporters. Substrate and reporter complexes were purified by native polyacrylamide gel electrophoresis (N-PAGE). To eliminate malformed substrates, fuel and substrate were stoichiometrically incubated at 15 μM for 1 h at room temperature before loading the gel. The loading buffer contained a 1:1 ratio of bromophenol blue dye and ficoll solution (type 400, 20% water). Substrates were purified by N-PAGE in 14% acrylamide gels (made from 30% acrylamide/bis-acrylamide solution in a 29:1 ratio), which were run at 150 V for 7 h. Reporters were also purified by N-PAGE in 10% acrylamide gels, which were run at 150 V for 2 h. For both processes, the cooling system (VWR International) was set to 20 °C. The bands of interest were cut out of the gels and eluted in 1× TE/Mg2+ buffer for 2 days at 4 °C. The buffer included 1× TE with 12.5 mM MgCl2·4H2O (Acros Organics) added. Because Mg2+ binds to EDTA, the effective Mg2+ concentration was estimated to be 11.5 mM.[24] Substrate and reporter concentrations were quantified via measuring absorbance at 260 nm and calculated using extinction coefficients predicted by nearest-neighbor models (Supporting Information S2).[36,55] Typical yields were 30% for the substrate and 50% for the reporter.

Spectrofluorimetry

All experiments were carried out in 1× TE/Mg2+ buffer with a total volume of 1 mL in 4 mL disposable methacrylate cuvettes (Fisher Scientific) at 25 °C. DNA stock solutions were normally diluted to 2 μM before being added to each sample. A poly-T strand (dT20) was added into all dilute stock samples (1 μM and lower) to reach a final concentration of 1 μM and prevent DNA loss via nonspecific binding to the microfuge tubes and pipet tips.[24] All solutions were gently mixed by pipetting. Fluorescence intensity versus time was measured via fluorescence spectrophotometers (Agilent Technologies, Cary Eclipse). Sample solutions were excited at 510 nm, and the emission was measured at 538 nm. Slit sizes used were 2.5 nm for excitation and 10 nm for emission. Fluorescence was normalized so that 1 normalized unit (a.u.) of fluorescence corresponded to 14 nM (the substrate concentration) for leakage reactions and 13 nM (the fuel concentration) for catalyzed reactions.

Reaction Measurements

For leakage reactions, the fluorescence intensity was continuously monitored for the first 12 h (shown in Figure c) with the samples maintained at 25 °C and then periodically measured until the reaction reached completion at room temperature (∼21.5 °C). Substrate (14 nM) and reporter (20 nM) were reacted with ∼100-fold excess of the fuel strands (1300 nM) to expedite leakage reactions and to extract intrinsic leakage specific to fuel and substrate interaction. Reaction between substrate and reporter was undetectable under this condition (Figure c). With the assumption that extrinsic leakage dominates at shorter times and intrinsic leakage dominates at longer times,[26] leakage was measured over the long term to extract intrinsic effects. For catalyzed reactions, the catalyst (1 nM), fuel (13 nM), substrate (14 nM), and reporter (20 nM) were reacted for 10 h (Figure b). During the experiments, substrate reactions were inferred by monitoring the production of signal strand through its reaction with the reporter (Supporting Information S1).

49 in total

1. Molecular structure of nucleic acids; a structure for deoxyribose nucleic acid.

Authors: J D WATSON; F H CRICK
Journal: Nature Date: 1953-04-25 Impact factor: 49.962

2. Thermodynamics of DNA branching.

Authors: M Lu; Q Guo; L A Marky; N C Seeman; N R Kallenbach
Journal: J Mol Biol Date: 1992-02-05 Impact factor: 5.469

3. NUPACK: Analysis and design of nucleic acid systems.

Authors: Joseph N Zadeh; Conrad D Steenberg; Justin S Bois; Brian R Wolfe; Marshall B Pierce; Asif R Khan; Robert M Dirks; Niles A Pierce
Journal: J Comput Chem Date: 2011-01-15 Impact factor: 3.376

4. Programming biomolecular self-assembly pathways.

Authors: Peng Yin; Harry M T Choi; Colby R Calvert; Niles A Pierce
Journal: Nature Date: 2008-01-17 Impact factor: 49.962

5. Mismatches improve the performance of strand-displacement nucleic Acid circuits.

Authors: Yu Sherry Jiang; Sanchita Bhadra; Bingling Li; Andrew D Ellington
Journal: Angew Chem Int Ed Engl Date: 2014-01-08 Impact factor: 15.336

6. Absorbance melting curves of RNA.

Authors: J D Puglisi; I Tinoco
Journal: Methods Enzymol Date: 1989 Impact factor: 1.600

7. Enzymatic production of 'monoclonal stoichiometric' single-stranded DNA oligonucleotides.

Authors: Cosimo Ducani; Corinna Kaul; Martin Moche; William M Shih; Björn Högberg
Journal: Nat Methods Date: 2013-06-02 Impact factor: 28.547

8. Spectroscopic studies of position-specific DNA "breathing" fluctuations at replication forks and primer-template junctions.

Authors: Davis Jose; Kausiki Datta; Neil P Johnson; Peter H von Hippel
Journal: Proc Natl Acad Sci U S A Date: 2009-02-25 Impact factor: 11.205