Oligomers equipped with a sequence of phenol and pyridine N-oxide groups form duplexes via H-bonding interactions between these recognition units. Reductive amination chemistry was used to synthesize all possible 3-mer sequences: AAA, AAD, ADA, DAA, ADD, DAD, DDA, and DDD. Pairwise interactions between the oligomers were investigated using NMR titration and dilution experiments in toluene. The measured association constants vary by 3 orders of magnitude (102 to 105 M-1). Antiparallel sequence-complementary oligomers generally form more stable complexes than mismatched duplexes. Mismatched duplexes that have an excess of H-bond donors are stabilized by the interaction of two phenol donors with one pyridine N-oxide acceptor. Oligomers that have a H-bond donor and acceptor on the ends of the chain can fold to form intramolecular H-bonds in the free state. The 1,3-folding equilibrium competes with duplex formation and lowers the stability of duplexes involving these sequences. As a result, some of the mismatch duplexes are more stable than some of the sequence-complementary duplexes. However, the most stable mismatch duplexes contain DDD and compete with the most stable sequence-complementary duplex, AAA·DDD, so in mixtures that contain all eight sequences, sequence-complementary duplexes dominate. Even higher fidelity sequence selectivity can be achieved if alternating donor-acceptor sequences are avoided.
Oligomers equipped with a sequence of phenol and pyridine N-oxide groups form duplexes via H-bonding interactions between these recognition units. Reductive amination chemistry was used to synthesize all possible 3-mer sequences: AAA, AAD, ADA, DAA, ADD, DAD, DDA, and DDD. Pairwise interactions between the oligomers were investigated using NMR titration and dilution experiments in toluene. The measured association constants vary by 3 orders of magnitude (102 to 105 M-1). Antiparallel sequence-complementary oligomers generally form more stable complexes than mismatched duplexes. Mismatched duplexes that have an excess of H-bond donors are stabilized by the interaction of two phenol donors with one pyridine N-oxide acceptor. Oligomers that have a H-bond donor and acceptor on the ends of the chain can fold to form intramolecular H-bonds in the free state. The 1,3-folding equilibrium competes with duplex formation and lowers the stability of duplexes involving these sequences. As a result, some of the mismatch duplexes are more stable than some of the sequence-complementary duplexes. However, the most stable mismatch duplexes contain DDD and compete with the most stable sequence-complementary duplex, AAA·DDD, so in mixtures that contain all eight sequences, sequence-complementary duplexes dominate. Even higher fidelity sequence selectivity can be achieved if alternating donor-acceptor sequences are avoided.
The encoded recognition
properties of nucleic acids are currently
unrivaled in any other material. High-fidelity sequence-selective
duplex formation is the molecular basis for replication of the genetic
information encoded by DNA and is finding widespread applications
in the programmed assembly of complex nucleic acid nanostructures.[1] There is no fundamental reason that these properties
should be restricted to biological polymers, and a range of synthetic
nucleic acid analogues have been demonstrated to form duplexes.[2−5] In principle, any synthetic polymer equipped with complementary
recognition units has the potential to show sequence-selective duplex
formation and the associated properties found in nucleic acids. Figure a shows a minimalist
blueprint for such polymers. A two-letter recognition alphabet would
be sufficient to encode sequence information in binary form. Then
all that is required is reliable chemistry for the synthesis of oligomers
and a compatible backbone to link the components together.
Figure 1
Blueprint for
assembly of a polymer that forms a duplex with sequence
selectivity based on a two-letter recognition alphabet. The key design
components are the covalent chemistry used for synthesis (red), the
noncovalent chemistry used for recognition (blue), and the backbone
linker that determines the geometric complementarity of the two chains
(black).
Blueprint for
assembly of a polymer that forms a duplex with sequence
selectivity based on a two-letter recognition alphabet. The key design
components are the covalent chemistry used for synthesis (red), the
noncovalent chemistry used for recognition (blue), and the backbone
linker that determines the geometric complementarity of the two chains
(black).A number of synthetic duplex-forming
oligomers have been reported,[6−10] and in some of these systems, it was possible to investigate the
effect of changing the sequence of the building blocks. Lehn et al.
showed that oligomeric bipyridine and terpyridine ligands form duplexes
with complementary metal ions, demonstrating both length and sequence
selectivity.[6] Gong et al. have described
oligomers that form duplexes due to H-bonding interactions between
amide groups located in the backbone. It was possible to control the
recognition properties of these systems by changing both the sequence
and the geometrical spacing of H-bond donors and acceptors along the
chain.[7] Yashima et al. have demonstrated
sequence-selective duplex formation between oligomers equipped with
carboxylate and amidinium recognition units that form salt bridges.[8]We have been investigating a range of different
duplex-forming
oligomer systems based on the blueprint in Figure .[11] The most promising
system that we have characterized to date is shown in Figure . Strong H-bonding interactions
between the phenol and pyridine N-oxide recognition
units give rise to stable duplexes in toluene solution. For duplexes
formed between homo-oligomers, the stability increases by an order
of magnitude for every additional recognition unit in the chain, which
is indicative of cooperative H-bond formation along the duplex. The
X-ray crystal structure of the duplex formed by the self-complementary
AD 2-mer is shown in Figure b.[11e] The recognition units are
too far apart in the duplex for the long-range secondary electrostatic
interactions that are observed in other H-bonded arrays to be important
in this system.[12] The solution phase self-assembly
properties of the AD 2-mer also show that there is no intramolecular
1,2-folding between adjacent H-bond donors and acceptors in the monomeric
free state. This system is therefore ideally suited for a more detailed
investigation of the selectivity of duplex formation for longer mixed
sequence oligomers.
Figure 2
(a) H-bonded duplex formed by a phenol homo-oligomer and
a pyridine N-oxide homo-oligomer. (b) X-ray crystal
structure of the
duplex formed by the corresponding mixed phenol–pyridine N-oxide 2-mer.[11e]
(a) H-bonded duplex formed by a phenol homo-oligomer and
a pyridine N-oxide homo-oligomer. (b) X-ray crystal
structure of the
duplex formed by the corresponding mixed phenol–pyridine N-oxide 2-mer.[11e]The simplest systems for which the sequence selectivity
of duplex
formation can be studied are the mixed sequence 3-mers. Figure shows the structures of all
possible 3-mer sequences of the system shown in Figure . In this paper, we describe the synthesis
of these eight compounds and measurement of the pairwise binding affinities
in toluene. The results allow quantification of the fidelity of the
single H-bond recognition system and provide insights into competing
processes that could be targeted to improve the sequence selectivity
of duplex formation.
Figure 3
Eight different 3-mer sequences of H-bond donors (D) and
acceptors
(A).
Eight different 3-mer sequences of H-bond donors (D) and
acceptors
(A).
Results and Discussion
Synthesis
The
3-mers were synthesized from monomers 1 and 2 by sequential acetal deprotection and
reductive amination reactions (Scheme ). Synthesis of the monomer building blocks, 1 and 2, was described previously.[11a,11c] The phenol group of the H-bond donor monomer, 1, was
protected as the triisopropylsilyl ether, and the protecting groups
were removed using tetra-n-butylammonium fluoride
(TBAF) in the final step of the synthesis. In some cases, the acetal
protecting group on the end of the 3-mer was removed during workup,
so these compounds were isolated as the aldehyde as indicated in Scheme . The H-bond acceptor
properties of aldehydes and acetals are both poor compared with pyridine N-oxide, so the presence of different terminal groups should
not significantly affect the assembly properties of the oligomers.
Scheme 1
(i) 1 or 2, NaBH(OAc)3; (ii)
Aqueous HCl; (iii) TBAF
The use of amino–aldehyde monomer units confers
directionality
on the oligomer backbone, so we will describe the sequence of recognition
units in the direction of synthesis, starting from the amino-terminal
end (the nitrobenzyl group) to the aldehyde-terminal end (acetal or
aldehyde group). For example, the oligomers described as ADD and DDA
in Scheme differ
in the orientation of the backbone with respect to the sequence of
the recognition units.
NMR Titrations
Interactions between
all pairwise combinations
of the 3-mer sequences were investigated by 1H NMR titration
and dilution experiments in toluene-d8. The titration data all fit well to 1:1 binding isotherms, and the
dilution data fit well to dimerization isotherms. The resulting association
constants are reported in Table . The stabilities of the complexes span 3 orders of
magnitude. The most stable complex is the sequence-complementary AAA·DDD
duplex, which has an association constant of 105 M–1, but some of the other sequence-complementary complexes
are significantly less stable.
Table 1
Association Constants
(log K/M–1) for Formation of 1:1
Complexes
Measured by 1H NMR Titrations and Dilutions in Toluene-d8 at 298 Ka
DDD
ADD
DAD
AAD
DDA
ADA
DAA
AAA
DDD
0.6
2.3
3.0
4.2
n.d.b
4.2
4.0
5.0
ADD
2.3
3.3
3.9
3.1
3.4
3.0
3.2
DAD
n.d.
3.8
3.6
4.4
3.9
3.9
AAD
2.1
3.3
3.0
2.9
n.d.
DDA
n.d.
3.7
3.6
3.2
ADA
2.9
3.4
n.d.
DAA
n.d.
2.7
AAA
1.6
Errors in log K are less than ±0.2.
n.d. For some complexes with
low association constants, reliable determination of the association
constant was not possible.
Errors in log K are less than ±0.2.n.d. For some complexes with
low association constants, reliable determination of the association
constant was not possible.For each sequence-complementary combination of recognition units,
up to four different duplexes are possible due to the directionality
of the backbone. For example, Figure a shows the structures of four different duplexes that
have the same arrangement of H-bonded recognition units but different
backbone directions. Using the N-to-C terminal description of sequence,
these four duplexes are designated DDA·AAD, DDA·DAA, ADD·AAD,
and ADD·DAA. In these systems, the structure of the duplex is
dictated by the sequence of the recognition units, so it should be
possible to distinguish parallel (DDA·AAD and ADD·DAA) and
antiparallel (DDA·DAA and ADD·AAD) arrangements of the backbone.
If the sequence of recognition units is symmetric, then it is possible
for parallel and antiparallel duplexes to coexist in equilibrium.
For example, for the AAA·DDD duplex both parallel and antiparallel
directions of the backbone are compatible with the arrangement of
the recognition units (Figure b). To simplify the discussion, we will start by considering
only the arrangement of the recognition units, but we will return
to the directionality of the backbone later.
Figure 4
Structures of duplexes
with parallel and antiparallel backbones.
(a) Duplexes where the sequence of recognition units dictates the
backbone arrangement: parallel for DDA·AAD and ADD·DAA and
antiparallel for DDA·DAA and ADD·AAD. (b) Duplex where the
sequence of recognition units allows parallel and antiparallel backbone
arrangements to coexist in equilibrium: AAA·DDD.
Structures of duplexes
with parallel and antiparallel backbones.
(a) Duplexes where the sequence of recognition units dictates the
backbone arrangement: parallel for DDA·AAD and ADD·DAA and
antiparallel for DDA·DAA and ADD·AAD. (b) Duplex where the
sequence of recognition units allows parallel and antiparallel backbone
arrangements to coexist in equilibrium: AAA·DDD.
Single-Site Mismatch Analysis
Figure shows an analysis
of the data in Table comparing the stabilities
of duplexes formed by complementary sequences of recognition units
with the stabilities of the corresponding duplexes containing a single
mismatch. Where different arrangements of the backbone are possible,
the results for all backbone arrangements are plotted side by side
in the same bar of the chart. For example, the association constants
for the four duplexes illustrated in Figure a are shown as four different values that
make up the first sequence-complementary entry in Figure c. The data in Table can therefore be analyzed in
terms of three sequence-complementary trimer duplexes: the homo-oligomer
duplex, AAA·DDD, the alternating oligomer duplex, ADA·DAD,
and the four duplexes shown in Figure a. If a single recognition unit is modified in a sequence-complementary
3-mer duplex, then a total of three A → D and three D →
A mutations are possible. For symmetric sequences of recognition units,
some of the mutations are equivalent, and in these cases, the data
appear twice in Figure .
Figure 5
Effects of single A → D and D → A mutations (red)
on the stabilities of sequence-complementary duplexes. (a) AAA·DDD.
(b) ADA·DAD. (c) DDA·AAD, DDA·DAA, ADD·AAD, and
ADD·DAA. Duplexes with different arrangements of the backbone
are plotted on the same bar of the chart.
Effects of single A → D and D → A mutations (red)
on the stabilities of sequence-complementary duplexes. (a) AAA·DDD.
(b) ADA·DAD. (c) DDA·AAD, DDA·DAA, ADD·AAD, and
ADD·DAA. Duplexes with different arrangements of the backbone
are plotted on the same bar of the chart.Figure a
shows
that AAA·DDD is the most stable duplex and that mutation of any
of the recognition units leads to a decrease in stability of an order
of magnitude. The stability of the ADA·DAD duplex is 4 times
lower than that of AAA·DDD (Figure b). Again the sequence-complementary duplex
is the most stable complex for this system, but some of the mismatch
sequences are surprisingly stable. For example, the mismatched duplex
involving DDD is only 2-fold lower in stability than the sequence-matched
duplex. Figure c shows
that for the third type of duplex the stabilities of the sequence-complementary
complexes span an order of magnitude, and they are 10–100 times
less stable than AAA·DDD. Moreover, the sequence-matched duplexes
are not the most stable complexes in Figure c, and the mismatched duplexes involving
DDD are more stable.
Stabilization of D-Rich Complexes
Closer examination
of Figure reveals
some interesting patterns. In general, the A →
D mutations give complexes that are more stable than
the D → A mutations. For example
in Figure a, the D
→ A mutants have stabilities of (2–8) × 103 M–1, whereas the A →
D mutants have stabilities of (1–2) ×
104 M–1. These values can be compared
with the stabilities of the corresponding 2-mer duplexes, AA·DD
and AD·AD, where only two H-bonds are made ((2–5) ×
103 M–1). The association constants for
formation of the 2-mer duplexes are comparable to the values for the
D → A mismatch 3-mer complexes and significantly lower than
the values for the A → D mismatch
3-mer complexes, suggesting that A → D mutations introduce additional
stabilizing interactions. There is
a fundamental difference between the D → A and A → D mutations: phenol
has one H-bond donor site and so can only interact with one H-bond
acceptor; in contrast, pyridine N-oxide can accept
more than one H-bond from multiple donors. Thus, a D → A mutation removes all
possibility of
forming a H-bonding interaction, because
there are no additional H-bond donor sites in the oligomers that could
interact with the new mismatch pyridine N-oxide acceptor.
However, when an A → D mutation
is made, the two unpaired phenols that do not have complementary pyridine N-oxide partners to interact with can form additional interactions
with pyridine N-oxides that are
already H-bonded to complementary phenols.Molecular
mechanics calculations on the structures of the duplexes support this
hypothesis. Figure shows the lowest energy conformations of three different duplexes:
AAD·DDA, ADA·DDD, and AAA·DAD. The sequence-complementary
3-mers form a duplex with three H-bonds as expected (Figure a). In the mismatch duplex
that has an excess of H-bond donor recognition units (Figure b), one of the pyridine N-oxide acceptors is H-bonded to one phenol donor, but the
other pyridine N-oxide is H-bonded to two phenol
donors (this structure also shows an additional phenol–phenol
interaction). In the mismatch duplex that has an excess of H-bond
acceptor recognition units (Figure c), two intermolecular H-bonds are formed as expected,
and the unsatisfied acceptor units dangle freely from the side of
the duplex.
Figure 6
Lowest energy conformations of duplexes calculated using molecular
mechanics conformational searches for (a) ADA·DAD, (b) ADA·DDD,
and (c) AAA·DAD.[13] The backbone is
shown in gray, the H-bond donor recognition units are in blue, and
the H-bond acceptor units are in red. The terminal groups were simplified
to methyl and phenyl and are shown as lines for clarity.
Lowest energy conformations of duplexes calculated using molecular
mechanics conformational searches for (a) ADA·DAD, (b) ADA·DDD,
and (c) AAA·DAD.[13] The backbone is
shown in gray, the H-bond donor recognition units are in blue, and
the H-bond acceptor units are in red. The terminal groups were simplified
to methyl and phenyl and are shown as lines for clarity.The thermodynamic consequences of doubly H-bonded
acceptor units
can be tested directly by measuring the interaction of a simple pyridine N-oxide monomer with the DD 2-mer. 1H NMR titrations
of p-cresol (D) or DD into 4-methylpyridine N-oxide (A) were carried out in toluene-d8. Performing the titrations in this way ensures that
the concentration of A is too low for the 2:1 A2·DD
complex to be formed. The titration data fit well to 1:1 binding isotherms
in both cases, and the resulting association constants were 3.3 ±
0.8 × 102 M–1 for the A·D complex
and 1.7 ± 0.2 × 103 M–1 for
the A·DD complex. The larger association constant observed for
the compound with two H-bond donor sites suggests that interactions
of the type illustrated in Figure b stabilize 3-mer duplexes with A → D mutations.The observed equilibrium constant for the formation of the 1:1
A·DD complex is given by eq .where K1 and K2 are the stepwise equilibrium constants illustrated
in Figure .
Figure 7
Pyridine N-oxide can accept two H-bonds, leading
to enhanced stability in complexes with an excess of H-bond donors.
The stepwise equilibrium constants for formation of the doubly H-bonded
complex between DD and A are K1 and K2. The global minimum conformation of the 1:1
complex obtained from a molecular mechanics conformational search
is shown (right).[13] The backbone is shown
in gray, the H-bond donor recognition units are in blue, and the H-bond
acceptor unit is in red.
Pyridine N-oxide can accept two H-bonds, leading
to enhanced stability in complexes with an excess of H-bond donors.
The stepwise equilibrium constants for formation of the doubly H-bonded
complex between DD and A are K1 and K2. The global minimum conformation of the 1:1
complex obtained from a molecular mechanics conformational search
is shown (right).[13] The backbone is shown
in gray, the H-bond donor recognition units are in blue, and the H-bond
acceptor unit is in red.Rearranging eq gives eq , which allows
estimation
of the value of K2, the equilibrium constant
for formation of a second intramolecular H-bond to a H-bonded pyridine N-oxide, assuming that the value of K1 is 2KA·D. The statistical
factor of 2 accounts for the degeneracy of the singly H-bonded complex.The value of K2 for the system shown
in Figure is 3, which
means that the doubly H-bonded complex
represents 75% of the bound state. The presence of the second H-bond
donor in the DD·A complex increases the observed association
constant by a factor of 5 compared with the D·A complex, where
only one H-bond can be formed. This value represents an upper limit
on the increase in association constant that is expected due to formation
of a second intramolecular H-bond to a bound pyridine N-oxide in the 3-mer mismatch duplexes, because the overall geometry
of the duplex is likely to restrict the possible arrangements of the
recognition units. However, stabilization by a factor of 5 is consistent
with the higher association constants observed for D-rich mismatch
complexes in Figure .
Intramolecular Folding
The analysis above indicates
the complexes with D → A mutations are not perturbed by additional
interactions involving unsatisfied recognition units. However, there
is some variation in the stabilities of the complexes with D →
A mutations. In both Figure a and b, making D → A mutations at the chain ends leads
to complexes that are significantly less stable than mutating the
recognition unit in the center. The common feature of the less stable
complexes is that they contain oligomers that have a H-bond donor
at one end of the chain and a H-bond acceptor at the other. Such sequences
could fold via intramolecular H-bonding interactions between the terminal
recognition units, and folding would compete with duplex formation.
This observation would also account for the exceptionally low stability
of the sequence-complementary duplexes in Figure c, because for these systems, both oligomers
can fold in the unbound state (Figure ).
Figure 8
1,3-Folding competes with duplex formation.
1,3-Folding competes with duplex formation.The potential of the oligomers to fold was investigated
using molecular
mechanics calculations. Figure shows an overlay of the lowest energy conformation found
for each of the oligomers AAD, ADD, DAA, and DDA. In all four cases,
there is an intramolecular H-bond between the terminal recognition
units, and the backbones adopt very similar conformations in order
to achieve this interaction. Thus, there appears to be a well-defined
conformation of the backbone that places the recognition units in
an arrangement that allows intramolecular H-bonding in a 1,3-folded
state.
Figure 9
Overlay of the lowest energy conformation calculated for AAD, ADD,
DAA, and DDA using molecular mechanics conformational searches.[13] Intramolecular H-bonds (green) are observed
between the terminal recognition units in all cases. The backbone
is shown in gray, H-bond donor recognition units are in blue, and
H-bond acceptor recognition units are in red. The terminal aromatic
groups and solubilizing groups are omitted for clarity.
Overlay of the lowest energy conformation calculated for AAD, ADD,
DAA, and DDA using molecular mechanics conformational searches.[13] Intramolecular H-bonds (green) are observed
between the terminal recognition units in all cases. The backbone
is shown in gray, H-bond donor recognition units are in blue, and
H-bond acceptor recognition units are in red. The terminal aromatic
groups and solubilizing groups are omitted for clarity.It is possible to estimate the extent of folding
experimentally
by comparing the stabilities of complexes involving oligomers that
can and cannot fold. We have shown previously that 1,2-folding between
neighboring recognition units does not compete with duplex formation
in AD 2-mers, and so we assume that none of the 3-mers discussed here
suffer from 1,2-folding. For the AAA, DDD, ADA, and DAD sequences,
intramolecular 1,3-folding is not possible, so folding equilibria
can only compete for duplex formation for complexes involving AAD,
ADD, DDA, and DAA. Complexes with A → D mutations are complicated
by additional H-bonding interactions as discussed above, so we will
consider only complexes with D → A mutations. A direct comparison
can be made between AAA·DAD, where 1,3-folding is not possible,
and AAA·DDA, AAA·ADD, ADA·DAA, and ADA·AAD, where
one of the two binding partners can form an intramolecular H-bond
between the terminal recognition units. The four duplexes that compete
with 1,3-folding equilibria have very similar association constants
(1.6 × 103, 1.7 × 103, 2.6 ×
103, and 1.0 × 103 M–1), and these values are on average 5 times lower than the association
constant for AAA·DAD (8.4 × 103 M–1), where there are no competing folding equilibria.For duplexes
where one of the two binding partners folds, the observed
association constant, Kobs, is given by eq .where Kfold is
the equilibrium constant for 1,3-folding, and Kduplex is the association constant for formation of the duplex
from the unfolded state (Figure ).Equation can be
rearranged to estimate the value of Kfold using the association constants measured for complexes that compete
with one intramolecular folding equilibrium (Kobs) and complexes that do not (Kduplex) (eq ).The analysis of the D → A mismatch complexes
described above
therefore indicates that Kfold for 1,3-folding
is approximately 4 for oligomers with complementary terminal recognition
units. The folded state therefore represents 80% of the monomeric
unbound state for these oligomers, with 20% in the unfolded state.
For complexes where both binding partners fold, the observed association
constant for duplex formation is given by eq .Using Kfold =
4 in eq suggests that
for sequence-complementary duplexes where both binding partners can
form intramolecular H-bonds between the terminal recognition units,
the stability of the duplex will be reduced by a factor of 25 compared
with sequences where 1,3-folding is not possible. This estimate accounts
rather well for the results shown in Figure c: the association constants for formation
of the four sequence-complementary duplexes are between 10 and 100
times lower than the association constant for formation of the AAA·DDD
duplex. It should be noted that although 1,3-folding competes with
duplex formation in these systems, folding does not abolish duplex
formation. For example, in a 10 mM sample of a 1:1 mixture of AAD
and ADD, the population of duplex is 10 times greater than the population
of the 1,3-folded state.
Backbone Arrangement in Duplexes
Although the duplexes
illustrated in Figure a have the same arrangement of recognition units, the measured association
constants span an order of magnitude (see the first sequence-complementary
bar in Figure c).
The major difference between the structures of these duplexes is the
parallel and antiparallel arrangement of the backbone. The association
constants for formation of the two antiparallel duplexes (4.2 ×
103 and 8.0 × 103 M–1) are higher than for formation of the two parallel duplexes (1.0
× 103 and 2.2 × 103 M–1). These results suggest that on average the antiparallel arrangement
of the backbone is preferred by a factor of 4. For systems where the
arrangement of the backbone is not dictated by the sequence of the
recognition units, a similar preference is expected, i.e., a 20% population
with the backbone in a parallel arrangement in equilibrium with 80%
in an antiparallel arrangement.
Sequence Selectivity in
Mixtures
In order to assess
the sequence selectivity of duplex formation in this system, it is
important to define what is meant by selectivity. The selectivity
of a recognition event depends on what the competition is. For example,
consider formation of the ADD·AAD duplex. If ADD competes with
DDD for duplex formation with AAD, then DDD will win because, as illustrated
in Figure c, the AAD·DDD
mismatch complex is more stable than the sequence-complementary duplex.
However, if this competition is repeated in the presence of AAA, then
the two sequence-complementary duplexes AAA·DDD and ADD·AAD
will be formed, because the AAA·DDD duplex is much more stable
than any other complex in this system.The association constants
in Table can be used
to estimate the speciation of complexes in mixtures of the 3-mers. Figure a illustrates the
populations of all possible duplexes calculated for an equimolar mixture
of all eight 3-mers at a concentration at which all of the compounds
are fully bound (>95% bound at 100 mM).[14] There are six complexes for which association constants are not
reported in Table . However, the titration experiments suggest that these are all weak
binding systems, and assigning association constants in the range
102–103 M–1 to these
complexes has no significant effect on the speciation of the other
complexes or the appearance of Figure a. The sequences in Figure a are organized so that all of the antiparallel
sequence-complementary duplexes lie on the diagonal of this plot,
and it is clear that these duplexes are the most populated complexes
(blue regions). At first sight, this result is counterintuitive, because,
as illustrated in Figure , some of the mismatch duplexes are more stable than the corresponding
sequence-complementary duplexes. However, in a system where all sequences
compete for optimal binding partners, the effects that are apparent
in the mismatch analysis are damped, and the result is that sequence-complementary
duplexes dominate.
Figure 10
Calculated populations of duplexes formed in a 100 mM
equimolar
mixture of 3-mers in toluene (>50% dark blue, >30% royal blue,
>20%
pale blue, <20% pink). (a) Mixture of all eight trimer sequences.
(b) Mixture that does not contain the alternating ADA and DAD sequences.
Each duplex appears twice, and antiparallel sequence-complementary
duplexes lie along the diagonal.
Calculated populations of duplexes formed in a 100 mM
equimolar
mixture of 3-mers in toluene (>50% dark blue, >30% royal blue,
>20%
pale blue, <20% pink). (a) Mixture of all eight trimer sequences.
(b) Mixture that does not contain the alternating ADA and DAD sequences.
Each duplex appears twice, and antiparallel sequence-complementary
duplexes lie along the diagonal.The noncomplementary duplexes that compete most effectively
with
sequence-complementary duplex formation are ADA·DDA and DAD·DAA.
These two duplexes correspond to the four off-diagonal peaks in Figure a (each duplex
appears twice due to symmetry). The populations of the corresponding
sequence-complementary duplexes (ADA·DAD and DDA·DAA, which
each appear twice on the diagonal in Figure ) are somewhat reduced by population of
the two mismatch duplexes. The two mismatch complexes are both less
stable than the sequence-complementary ADA·DAD duplex, but they
are both slightly more stable than the sequence-complementary DDA·DAA
duplex. The appearance of mismatch duplexes is therefore the result
of competition between all of the different possible complexes in
the system. The off-diagonal mismatch peaks in Figure a suggest that longer mixed sequence oligomers
are unlikely to exhibit high-fidelity sequence recognition. However,
the identity of the mismatch duplexes provides a clue as to how this
fidelity might be improved. Although the ADA·DAD duplex is relatively
stable, these two 3-mer sequences participate in the most significant
mismatch duplexes. The speciation in a mixture of the other six 3-mers
that does not contain ADA or DAD is illustrated in Figure b. In this case, the sequence
selectivity is excellent, with a high degree of discrimination between
matched and mismatched duplexes. This result provides an important
strategy for enhancing the fidelity of sequence recognition in longer
oligomers. If alternating donor–acceptor sequences are avoided,
then mismatch duplexes will be suppressed.
Conclusions
Selective
recognition between oligomers programmed with information
encoded in the form of a sequence of recognition sites is the basis
for the unique chemical properties of nucleic acids. We describe a
synthetic oligomer system that recapitulates some of these properties.
Oligomers equipped with a sequence of phenols (H-bond donors, D) and
pyridine N-oxides (H-bond acceptors, A) show sequence-selective
duplex formation due to H-bonding interactions between complementary
recognition sites. This paper describes the synthesis of all eight
3-mer sequences and measurement of the pairwise binding affinities
of the oligomers in toluene. The stabilities of the complexes vary
by 3 orders of magnitude depending on sequence complementarity. There
are three factors that govern the overall stabilities of the complexes
in addition to the number of complementary H-bonding interactions.
Backbone Orientation
For the oligomer
sequences AAD, DAA, ADD, and DDA, it is possible to characterize the
relative stabilities of duplexes that have parallel and antiparallel
backbones, because the orientation of the backbones is dictated by
the sequence of recognition units. These systems show that the antiparallel
arrangement of the backbones is more stable than the parallel arrangement
by a factor of 4. The other duplexes presumably exist as a 80:20 mixture
of the two backbone arrangements.
Doubly
H-Bonded Acceptors
A single
site mismatch analysis reveals that an A → D mutation leads
to unexpectedly stable complexes, because the pyridine N-oxide recognition units can accept a second H-bond from an unpaired
phenol recognition unit. These additional H-bonding interactions can
stabilize D-rich mismatch complexes by up to a factor of 5.
1,3-Folding
We have previously shown
that 1,2-folding of adjacent complementary recognition sites does
not take place in this system. However, for 3-mers that have a H-bond
donor and acceptor at each end of the oligomer, 1,3-folding is significant
in the monomeric free state. Folding equilibria compete with duplex
formation and reduce the stability of the corresponding duplex by
a factor of 5.The latter two factors conspire to make the stabilities
of some of the mismatch complexes greater than the stabilities of
some of the sequence-complementary duplexes. However, the measured
association constants show that in a mixture of all eight 3-mer sequences
the sequence-complementary duplexes are the predominant species present
in solution. The most problematic sequence from the point of view
of mismatched duplex formation is DDD, which competes effectively
with the fully matched sequence in a number of cases. However, DDD
has a much higher affinity for AAA than for any other sequence, and
so, in the presence of one equivalent of AAA, DDD will not form a
mismatched duplex with other sequences. Thus, the fidelity of the
recognition system in a complex mixture is higher than might be expected
by comparing the stabilities of individual duplexes. Moreover, if
alternating donor–acceptor sequences are avoided, it is possible
to show that competition from mismatched duplexes can be almost completely
eliminated. It should therefore be possible to extend these studies
to longer oligomers to obtain high-fidelity sequence recognition.This issue of doubly H-bonded acceptors can be addressed in a straightforward
manner by replacing the pyridine N-oxide recognition units with pyridines,
which can only form a single H-bond with phenol. We have shown previously
that although the pyridine–phenol H-bond is weaker than the
pyridine N-oxide–phenol interaction, the increased
conformational restriction imposed by the oriented pyridine nitrogen
lone pair compensates to yield stable duplexes.[11c] The issue of folding equilibria is more difficult. Folding
will always compete with duplex formation in synthetic information
molecules of this type, because the oligomers carry mutually complementary
recognition units. However, the properties of nucleic acids show that,
for long oligomers, sequence complementarity can be used to ensure
that duplex formation predominates or that the absence of a complementary
partner can be used to ensure that intramolecular folding predominates.
The same should be true of the systems described here, and this duality
of behavior offers interesting avenues for future research. There
are some differences between the synthetic H-bonded duplexes and nucleic
acid duplexes that may lead to differences in behavior. In nucleic
acids, formation of the first base pair is thermodynamically unfavorable,
which leads to a nucleation and growth mechanism of duplex assembly,
whereas formation of the first base pair in the synthetic duplexes
is thermodynamically favorable. Nucleic acid duplexes form compact
structures that promote cooperativity and selectivity, whereas the
synthetic duplexes are less organized. However, the well-defined assembly
properties of nucleic acids are not apparent for short oligomers and
only emerge for sequences several bases long. Work on longer synthetic
oligomers will reveal whether more organized structures emerge for
larger molecules, how the fidelity of sequence recognition is affected,
and the impact on the kinetics of strand exchange.
Authors: Giulia Iadevaia; Diego Núñez-Villanueva; Alexander E Stross; Christopher A Hunter Journal: Org Biomol Chem Date: 2018-06-06 Impact factor: 3.876
Authors: John S Shaw; Rajendran Vaiyapuri; Matthew P Parker; Claire A Murray; Kate J C Lim; Cong Pan; Marcus Knappert; Christine J Cardin; Barnaby W Greenland; Ricardo Grau-Crespo; Howard M Colquhoun Journal: Chem Sci Date: 2018-03-27 Impact factor: 9.825
Authors: Marco Hebel; Andreas Riegger; Maksymilian M Zegota; Gönül Kizilsavas; Jasmina Gačanin; Michaela Pieszka; Thorsten Lückerath; Jaime A S Coelho; Manfred Wagner; Pedro M P Gois; David Y W Ng; Tanja Weil Journal: J Am Chem Soc Date: 2019-08-28 Impact factor: 15.419