Kevin M Rose1, Irani Alves Ferreira-Bravo1, Min Li2, Robert Craigie2, Mark A Ditzler3, Philipp Holliger4, Jeffrey J DeStefano1,5. 1. Cell Biology and Molecular Genetics, Bioscience Research Building , University of Maryland , College Park , Maryland 20742 , United States. 2. Laboratory of Molecular Biology, NIDDK , National Institutes of Health , Bethesda , Maryland 20892 , United States. 3. Exobiology Branch, Space Science and Astrobiology Division , NASA Ames Research Center , Moffett Field , California 94035 , United States. 4. MRC Laboratory of Molecular Biology , Francis Crick Avenue, Cambridge Biomedical Campus , Cambridge CB2 0QH , United Kingdom. 5. Maryland Pathogen Research Institute (MPRI) , College Park , Maryland 20742 , United States.
Abstract
Systematic Evolution of Ligands by Exponential Enrichment (SELEX) is the iterative process by which nucleic acids that can bind with high affinity and specificity (termed aptamers) to specific protein targets are selected. Using a SELEX protocol adapted for Xeno-Nucleic Acid (XNA) as a suitable substrate for aptamer generation, 2'-fluoroarabinonucleic acid (FANA) was used to select several related aptamers to HIV-1 integrase (IN). IN bound FANA aptamers with equilibrium dissociation constants (KD,app) of ∼50-100 pM in a buffer with 200 mM NaCl and 6 mM MgCl2. Comparisons to published HIV-1 IN RNA and DNA aptamers as well as IN genomic binding partners indicated that FANA aptamers bound more than 2 orders of magnitude more tightly to IN. Using a combination of RNA folding algorithms and covariation analysis, all strong binding aptamers demonstrated a common four-way junction structure, despite significant sequence variation. IN aptamers were selected from the same starting library as FA1, a FANA aptamer that binds with pM affinity to HIV-1 Reverse Transcriptase (RT). It contains a 20-nucleotide 5' DNA sequence followed by 59 FANA nucleotides. IN-1.1 (one of the selected aptamers) potently inhibited IN activity and intasome formation in vitro. Replacing the FANA nucleotides of IN-1.1 with 2'-fluororibonucleic acid (F-RNA), which has the same chemical formula but with a ribose rather than arabinose sugar conformation, dramatically reduced binding, suggesting that FANA adopts unique structural conformations that promote binding to HIV-1 IN.
Systematic Evolution of Ligands by Exponential Enrichment (SELEX) is the iterative process by which nucleic acids that can bind with high affinity and specificity (termed aptamers) to specific protein targets are selected. Using a SELEX protocol adapted for Xeno-Nucleic Acid (XNA) as a suitable substrate for aptamer generation, 2'-fluoroarabinonucleic acid (FANA) was used to select several related aptamers to HIV-1 integrase (IN). IN bound FANA aptamers with equilibrium dissociation constants (KD,app) of ∼50-100 pM in a buffer with 200 mM NaCl and 6 mM MgCl2. Comparisons to published HIV-1 IN RNA and DNA aptamers as well as IN genomic binding partners indicated that FANA aptamers bound more than 2 orders of magnitude more tightly to IN. Using a combination of RNA folding algorithms and covariation analysis, all strong binding aptamers demonstrated a common four-way junction structure, despite significant sequence variation. IN aptamers were selected from the same starting library as FA1, a FANA aptamer that binds with pM affinity to HIV-1 Reverse Transcriptase (RT). It contains a 20-nucleotide 5' DNA sequence followed by 59 FANA nucleotides. IN-1.1 (one of the selected aptamers) potently inhibited IN activity and intasome formation in vitro. Replacing the FANA nucleotides of IN-1.1 with 2'-fluororibonucleic acid (F-RNA), which has the same chemical formula but with a ribose rather than arabinose sugar conformation, dramatically reduced binding, suggesting that FANA adopts unique structural conformations that promote binding to HIV-1 IN.
Humanimmunodeficiency
virus
(HIV-1) integrase (IN), the enzyme responsible for incorporating virus-derived
proviral double stranded DNA (dsDNA) into the host cell chromosome,
has become an important drug target for HIV therapy.[1,2] HIV-1 IN is a 32-kilodalton (kDa) protein with three distinct domains:
an N-terminus that coordinates zinc; a catalytic core domain that
catalyzes the transfer of viral nucleic acid into the host cell chromosome
via a conserved “D,D(35)E” motif commonly found in members
of a nucleotidyltransferase superfamily, which includes HIV-1 IN,
other retroviral integrases, and bacterial and eukaryotic transposases;[3,4] and a C-terminus.[5] IN initiates the transfer
of dsDNA to the host cell chromosome through two distinct but catalytically
symmetrical reactions. Initially, IN mediates cleavage of the penultimate
bases from the 3′ ends of the dsDNA, exposing terminal hydroxyl
groups for nucleophilic attack on the target host DNA. These newly
exposed hydroxyl groups are then coordinated to hydrolyze the phosphodiester
backbone of the host cell chromosome that results in covalent attachment
of the viral nucleic acid to the host genome.[6,7] These
reactions are carried out by nucleoprotein complexes called intasomes
in which a pair of viral DNA ends are synapsed by a multimer of IN.
Intasomes are the target of currently approved drugs for the treatment
of AIDS that target DNA integration. The first FDA approved inhibitor
to IN was granted approval in 2007 and is an Integrase Strand Transfer
Inhibitor (INSTI) known as Raltegravir (RAL).[8] Second generation INSTIs have since been approved for HIV treatment
including Dolutegravir (DTG), Elvitegravir (EVG), and Bictegravir
(BIC). INSTIs have dramatically improved the current regimens for
the treatment of HIV and Acquired Immune Deficiency Syndrome (AIDS).
However, emergent drug resistance emphasizes the need to further understand
the mechanisms underlying viral biochemistry and pathogenesis.[9]One approach for studying and potentially
inhibiting protein targets
is to generate aptamers (short nucleic acid molecules that bind with
high affinity and specificity to their target) to the specific protein.
Aptamers can serve a myriad of purposes including, among others, replacement
of antibodies in flow cytometry, cell-phenotyping, and ELISA assays;
biosensors to detect and quantify proteins; and inhibitors for the
neutralization of bacterial toxins and other targets.[10−24] Currently, only one aptamer has received FDA approval (Macugen)
for the treatment of macular degeneration, but several other candidates
are in or entering the drug pipeline.[16,25] Recently a
“primer-template mimicking” aptamer was used to aid
in the crystallization and characterization of HIV and hepatitis B
virus (HBV) reverse transcriptases (RT), uncovering yet another potential
use for aptamers.[21,26,27] In the case of HIV-1, aptamers have been successfully used in vitro and display potent antiviral activity with minor
cytopathic effects, as well as demonstrating efficacy in animal models.[14,27−37] Current DNA aptamers to IN are potent inhibitors of enzyme activity
and resemble G-quadruplex structures.[1,12,27] In some cases, these aptamers can be targeted to
both IN and the RNase H domain of HIV RT, as these proteins are structurally
related through their phosphotransferase domains.[27,38,39] RNA aptamers containing G-rich sequences
have also been raised to IN and exhibited low nanomolar binding KD values.[11,40]Aptamers can
be raised to specific proteins using the iterative
multistep process known as the Systematic Evolution of Ligands by
Exponential Enrichment (SELEX).[41,42] Successful selection
of aptamers relies on several rounds of SELEX and a sufficient degree
of sequence diversity (∼1014 or more sequences)
to identify specific binding partners. Since its inception, SELEX
has been used to select hundreds of aptamers to an array of diverse
proteins from several structural and functional classes.In
order to use aptamers as inhibitors in vivo or in
assays with cellular lysates, they must be stable in an environment
wrought with nucleases that degrade foreign nucleic acid. One interesting
approach to this problem is the replacement of conventional nucleic
acid substrates with their synthetic equivalents, potentially avoiding
detection by the immune system and acquiring enhanced resistance to
nucleases. Xeno-Nucleic Acid (XNA) refers to any nucleic acid analogue
that differs from natural ones in at least one of the three chemical
moieties: sugar, phosphate, or nucleobase.[43] XNAs allow the adaptation of chemical or structural configurations
distinct from traditional hereditary material.[44] Despite chemical and conformational differences that are
theorized to explain the nuclease resistance seen by these analogues,
XNAs retain the ability to adopt secondary structures and store information
similarly to their DNA and RNA counterparts.[45−47] Aptamers containing
XNA can be made by modifying some of the nucleosides in aptamers produced
with conventional nucleic acids (Macugen was produced using this approach[48]) or by direct selection using XNA nucleotides
in a SELEX protocol. Some XNA nucleotides (e.g., 2′-fluororibonucleic
acid (F-RNA) pyrimidines) can be incorporated using natural polymerases;
however, most require engineered polymerases for incorporation.[49]The list of XNA aptamers continues to
grow, although it is restricted
by the limited commercial availability of XNAtriphosphates. Directly
selected aptamers include those produced with HNA (1,5-anhydrohexitol
nucleic acid), TNA (threose nucleic acid (α-l-threofuranosyl-(3′,2′)
nucleic acid), F-RNA/RNA, and FANA among others[49−58] (for a review, see ref (54)). The first FANA aptamer was raised to HIV-1 RT and bound
with picomolar affinity.[55] FANA is structurally
and chemically distinct from F-RNA or RNA due to its difference in
sugar pucker as an arabinose-based analogue and the presence of an
electronegative fluorine atom in the 2′ position in the β
conformation in place of a hydroxyl group.[59,60] The sugar adopts a C2′/O4′-endo conformation much
like DNA, in stark contrast to the C3′-endo conformation seen
by both F-RNA and RNA.[44,59,61−63]This report builds upon the framework that
allowed for the selection
of a FANA aptamer to HIV-1 RT and applies it to HIV IN.[55] FANA aptamers selected for IN binding are the
second set of FANA aptamers selected for binding to a protein (a catalytically
active RNA cleaving FANA aptamer has also been described[56]). These XNA aptamers are structurally and chemically
distinct from other DNA and RNA aptamers that bind IN with low nanomolar
affinity and display markedly tighter affinity in the low picomolar
range.
Materials and Methods
Materials
The
2′-deoxy-2′-fluoro-arabino-nucleotides
(faATP, faCTP, faGTP, faUTP) required for FANA synthesis were obtained
from Metkinen Chemistry (Kuusisto, Finland). Deoxyribonucleotide triphosphates
(dNTPs) were from Roche or United States Biochemical (USB). Enzymes
and buffers including Taq polymerase, T4 polynucleotide
kinase (PNK), 10X ThermoPol buffer (Mg2+-free), MgSO4, and 10× CutSmart buffer were from New England BioLabs.
Radiolabeled ATP (γ-32P) was from PerkinElmer. G-25
spin columns were from Harvard Apparatus. Miniprep DNA preparation
kits were purchased from Qiagen. Nitrocellulose filter disks (Protran
BA 85, 0.45 μm pore size and 25 mm diameter) were from Whatman.
All DNA oligonucleotides were from Integrated DNA Technologies. All
other chemicals were from VWR, Fisher Scientific, or Sigma. The HIV-1
IN wild type (INwt) and mutants (F185H/C280S (INF185H/C280S) and Sso7d-IN fusion (INSso7d)) were prepared as described.[37,64] Prototype Foamy Virus (PFV) IN protein was prepared as described.[65] All IN proteins were stored in aliquots at −80
°C. Thermostable polymerase D4K was prepared as described and
stored in aliquots at −80 °C.[49]
Methods
End-Labeling of Oligonucleotides with T4 PNK
DNA and
RNA oligonucleotides were 5′ end-labeled in a 50 μL volume
containing 10–50 pmol of the oligonucleotide of interest, 1
× T4 PNK reaction buffer (provided by manufacturer), 10 U of
T4 PNK, and 10 μL of (γ-32P) ATP (3000 Ci/mmol,
10 μCi/μL). The labeling reaction was done at 37 °C
for 30 min according to the manufacturer’s protocol. The PNK
enzyme was heat inactivated by incubating the reaction at 75 °C
for 15 min. Excess radiolabeled nucleotides were then removed by centrifugation
using a Sephadex G-25 column.
Selection of FANA Aptamers
with HIV-1 INF185H/C280S Using Nitrocellulose Filter Binding
The 79 nucleotide FANA
random pool starting material for SELEX containing a 40 nucleotide
central random region flanked at the 5′ end by 20 nucleotides
of DNA (5′-AAAAGGTAGTGCTGAATTCG-3′) and at the 3′
end by 19 nucleotides of fixed FANA sequence (5′-UUCGCUAUCCAGUUGGCCU-3′)
was prepared as described previously.[55] About 100 pmol of the 5′-32P-labeled FANA starting
pool was snap-cooled on ice and then incubated with 10 pmol of HIV-1
IN (INF185H/C280S was used for all experiments unless
otherwise indicated) and 10 μg of yeast tRNA competitor in 40
μL of Binding Buffer (20 mM Tris-HCl at pH 7.5, 200 mM NaCl,
6 mM MgCl2, and 1 mM DTT) for 10 min at RT. A separate
control was performed identically except that INF185H/C280S was omitted. The control was used to ensure that selection in the
presence of INF185H/C280S bound significantly more material
than those in its absence. The material was applied to a nitrocellulose
filter that was preincubated in a Low Salt Buffer (20 mM Tris-HCl
at pH 7.5, 10 mM KCl). A vacuum was applied at a rate of ∼0.25
mL/s to pull the sample through the filter. Filters were washed three
times with 1 mL of Low Salt Buffer. Filters were exposed to imaging
screens, and the portion of the filter containing radioactive material
in selections containing INF185H/C280S was excised. The
excised filter slice was mixed with ∼300 μL of 7 M urea
and 5 mM EDTA (pH 8) and heated for 5 min at 95 °C. A solution
of phenol/chloroform/isoamyl alcohol (25:24:1) saturated with 100
mM Tris-HCl (pH 8; 300 μL) was added, and the material was vortexed
then centrifuged in a microfuge to separate phases. FANA sequences
were recovered in the aqueous phase. The phenol-chloroform phase was
re-extracted a second time by adding 400 μL of water and repeating
the centrifugation (selection round 1 only). The recovered material
was reverse transcribed to DNA and processed to produce FANA material
for round 2 as previously described.[55] For
subsequent rounds, ∼ 40 pmol of FANA material was used for
selection with a 1:10 ratio of IN/FANA. In some rounds, extra FANA
was made to perform a control reaction in the absence of INF185H/C280S. After six, eight, and nine rounds, the amount of competitor tRNA
was increased from 10 to 20, 40, and 50 μg, respectively, to
increase the stringency of the binding conditions. After the second
round, some material from PCR 1 was saved as a source to regenerate
the selected material from these rounds. Filter binding assays (see
below) were used to access the binding affinity to INF185H/C280S of the pools recovered from some of the rounds. The SELEX was stopped
after round 10 as no further binding affinity increase was detected
between rounds 8 and 10.
Sequence Analysis of FANA Products Recovered
from Round 10
PCR products were prepared from FANA sequences
recovered from round
10. The PCR material was cloned using a TOPO TA cloning kit from Life
Technologies. DNA mini-preps were prepared, and the products were
sequenced by Macrogen (Rockville, Maryland). The appropriate DNA oligonucleotide
templates for some recovered sequences were synthesized and generation
of FANA material was performed as described.[55]
Determination of Apparent Equilibrium Dissociation Constant
(KD,app) Using Nitrocellulose Filter Binding
Assay
Standard reactions for KD,app determinations were performed in 1 mL of Binding Buffer with 0.1
mg mL–1 BSA and 2 pM 5′-32P end-labeled
aptamer. Increasing amounts of IN (INwt, INF185H/C280S, or INSso7d as indicated) diluted in the above buffer
were added in amounts that approximately flanked the KD,app value (estimated from initial experiments) for the
aptamer. For aptamers with very low KD,app values (e.g., IN-1.1 (Table )), the amount of IN used was typically 0, 4, 8, 16, 31, 63,
125, 250, and 2000 pM. For controls and aptamer constructs with higher KD,app values, the concentrations of IN and aptmer
were increased, and the volume of the reaction was decreased to 100
μL (with 20 pM aptamer) or 20 μL (with 100 pM aptamer).
After 10 min at RT, the reactions were applied to a 25 mm nitrocellulose
disk (0.45 μm pore, Protran BA 85, Whatman) presoaked in filter
wash buffer (25 mM Tris-HCl at pH = 7.5, 10 mM KCl). The filter was
washed three times under a vacuum with 1 mL of wash buffer at a flow
rate of ∼ 0.25 mL/s. Filters were then counted in a scintillation
counter. A plot of bound aptamer vs IN concentration was fit to the
following equation for ligand binding one-site saturation in SigmaPlot
in order to determine the KD,app: y = Bmax(x)/(KD + x) where x is the concentration of IN and y is the amount
of bound aptamer. The experiment was performed three times, and the KD,app values in Table are an average of those experiments ±
standard deviations. See Figure S1 for
a graphical example.
Table 1
Equilibrium Dissociation
Constants
for Aptamers and Constructs
substrate
(IN type)
KD,appa
P5 (INF185H/C280S)
53 ± 26 nM
HIV-1 TAR (INF185H/C280S)
NDc
HIV-1 TAR (INwt)
NDc
FANA random pool (INF185H/C280S)b
4.3 ± 3.2 μM
FANA random pool (INSso7d)
634 ± 355 nM
IN-1.1 (INwt)
103 ± 18 pM
IN-1.1 (INF185H/C280S)
51 ± 18 pM
IN-1.1 (INSso7d)
74 ± 53 pM
IN-2.1 (INF185H/C280S)
44 ± 9 pM
IN-3.1 (INF185H/C280S)
102 ± 42 pM
IN-6.1 (INF185H/C280S)
39 ± 12 pM
IN-8.1 (INF185H/C280S)
NDc
IN-13.1 (INF185H/C280S)
NDc
IN-14.1 (INF185H/C280S)
2.2 ± 1.1 μM
modifications bound to INF185H/C280S
IN-1.1(FANA
to F-RNA)
NDc
IN-1.1(P1flip)
89 ± 31 pM
IN-1.1(−10)
56 ± 28 pM
IN-2.1(P1flip)
106 ± 31 pM
IN-2.1(altP1)
8.9 ± 0.8 nM
IN-2.1(−10)
101 ± 67 pM
IN-6.1/2.1
118 ± 40 pM
IN-6.1/2.1(−10)
146 ± 9 pM
IN 1.1 binding to PFV INd
IN-1.1
(PFV IN)
102 ± 36 nM
Apparent equilibrium
dissociation
constants (KD,app) were measured as described
in the Materials and Methods.
Starting material for the FANA SELEX
experiments.
ND: not determined.
The construct
bound too weakly to determine a binding constant in reaction with
up to 5 μM IN.
PFV
IN, Prototype Foamy Virus Integrase.
Apparent equilibrium
dissociation
constants (KD,app) were measured as described
in the Materials and Methods.Starting material for the FANA SELEX
experiments.ND: not determined.
The construct
bound too weakly to determine a binding constant in reaction with
up to 5 μM IN.PFV
IN, Prototype Foamy Virus Integrase.
Competition Binding Assays
Five
nanomolar 5′-32P end-labeled IN-1.1 was incubated
at room temperature in
Binding Buffer with various amounts of excess unlabeled competitor
(IN-1.1, P5 aptamer,[11] or HIV TAR (1–57),[66] at 0-, 1-, 2-, 4-, 8-, or 16-fold excess over
radiolabeled IN-1.1). HIV INF185H/C280S was added to a
final concentration of 5 nM. The total volume was 20 μL. Incubations
were continued for 50 min. Samples were run over a nitrocellulose
filter and washed and quantified as described above.
Concerted
Integration and Intasome Assembly Assays
INSso7d (2.5 μM) and 1.0 μM FAM-labeled 27-bp
viral DNA substrates were preincubated on ice in 20 mM HEPES at pH
7.5, 25% glycerol, 10 mM DTT, 5 mM MgCl2, 4 μM ZnCl2, 300 ng of target plasmid DNA pGEM-9zf, and 100 mM NaCl in
a 20 μL reaction volume.[64] The reaction
was carried out at 37 °C for 1 h. For integration product analysis,
the reactions were stopped by the addition of SDS and EDTA to 0.2%
and 10 mM, respectively, together with 5 μg of proteinase K.
Incubation was continued at 37 °C for a further 1 h. The DNA
was then recovered by ethanol precipitation and subjected to electrophoresis
in a 1.5% agarose gel in 1 × TBE buffer. Intasome assembly was
carried out in the same way except that no target DNA was added, and
MgCl2 was substituted with CaCl2. For electrophoretic
mobility shift assays of intasomes (EMSA), the reaction was stopped
by chilling on ice and the addition of 10 μg/mL heparin. A 2.5
μL aliquot was subjected to electrophoresis on a 3.0% low melting
1 × TBE agarose gel (SeaKem LE agarose) containing 10 μg/mL
heparin. DNA was visualized by fluorescence using a Typhoon 8600 fluorescence
scanner (GE Healthcare).
Comparative Sequence Analysis and Structure
Prediction
Recovered sequences (26 total) were clustered
into 14 separate lineages
(shown in Figure ).
We then searched for sequence-level patterns that are common to multiple
lineages. We began by identifying all six-nucleotide long sequences
within the variable region of the most abundant sequence (lineage
1) and then searched for those 6-mers within the other 13 lineages.
Two 6-mers, UCAAGU and AUAUUA, are both present in six of the lineages
(lineages 1–6 in Figure ), and in each instance, the 6-mers are separated by two nucleotides.
A very similar sequence pattern was then identified in lineage 7.
Next, we took one representative sequence from each of the lineages
that contain these 6-mers and performed a sequence alignment using
Mafft.[67] We then looked for sequence covariation
between lineages that might suggest base-pairing or other physical
interactions between specific positions in the sequences. From our
alignment, we could see covariation between sequences. We generated
a secondary structure prediction based on this alignment using RNAalifold,[68,69] and it predicts the pairing shown for P3 and P4 (Figure ). Because the sequences within
the 5′ and 3′ constant regions do not vary, we could
not use covariation to support structure predictions in these parts
of the molecule. We therefore looked at the structure predictions
of individual sequences to determine likely base pairing patterns
involving the constant regions. We used RNAfold,[68,69] to predict secondary structures both with and without folding constraints
that force the predicted structure to contain P3 and P4. RNAfold predicted
multiple different base pairing patterns for the constant regions
(Figure S2). To identify which of these
predicted structures is correct, we tested the impact that mutations
in the constant regions had on binding. Mutations predicted to disrupt
either the P1 or P2 pairing shown in Figure have a pronounced negative impact on binding
(Figures S3–S5). Alternatively,
mutations that are predicted to maintain the pairing in P1 and P2
have binding comparable to the original aptamer. We therefore used
the pairing for P1 and P2 shown in Figure to generate our final structure-based alignment.
Figure 1
Sequence
of the random region from 26 sequences obtained after
10 rounds of selection with INF185H/C280S. Constructs recovered
from round 10 of SELEX were PCR amplified and cloned and sequenced
as described in the Methods. The recovered
sequences are divided into 14 different lineages based on sequence
relationships. One sequence (IN-1.1) appeared in the selected pool
three times (parentheses), and four sequences differed from it by
1–3 nucleotides (IN-1.2–1.5). IN-2.1 appeared twice,
and all other sequences were recovered only one time. For members
of the same lineage, an alignment with the first member is shown.
Each sequence was flanked at the 5′ end by 20 DNA nucleotides
(5′-AAAAGGTAGTGCTGAATTCG-3′) and at the 3′
end by 19 nucleotides of fixed FANA sequence (5′-UUCGCUAUCCAGUUGGCCU-3′).
Figure 2
Predicted structure and motif alignment of recovered aptamers.
The first seven lineages shown in Figure are presented and aligned with the various
predicted stem motifs (P1–P4; lower panel). See Methods and Results for
a description of how the motifs were predicted. Structures for three
of the aptamers are shown (upper panel). The model predicts that all
seven lineages form a similar four way junction secondary structure.
For lineages with more than one member, only the first member from Figure is shown.
Sequence
of the random region from 26 sequences obtained after
10 rounds of selection with INF185H/C280S. Constructs recovered
from round 10 of SELEX were PCR amplified and cloned and sequenced
as described in the Methods. The recovered
sequences are divided into 14 different lineages based on sequence
relationships. One sequence (IN-1.1) appeared in the selected pool
three times (parentheses), and four sequences differed from it by
1–3 nucleotides (IN-1.2–1.5). IN-2.1 appeared twice,
and all other sequences were recovered only one time. For members
of the same lineage, an alignment with the first member is shown.
Each sequence was flanked at the 5′ end by 20 DNA nucleotides
(5′-AAAAGGTAGTGCTGAATTCG-3′) and at the 3′
end by 19 nucleotides of fixed FANA sequence (5′-UUCGCUAUCCAGUUGGCCU-3′).Predicted structure and motif alignment of recovered aptamers.
The first seven lineages shown in Figure are presented and aligned with the various
predicted stem motifs (P1–P4; lower panel). See Methods and Results for
a description of how the motifs were predicted. Structures for three
of the aptamers are shown (upper panel). The model predicts that all
seven lineages form a similar four way junction secondary structure.
For lineages with more than one member, only the first member from Figure is shown.
Results
Protocol for FANA Aptamer
Selection
Aptamers were selected
from a starting pool of approximately 6 × 1013 different
sequences using the previously established approach for HIV RT[55] with the modifications noted in the Materials and Methods. Note that this approach produces
“chimeric” aptamers with a 20-nucleotide fixed DNA sequence
at the 5′ end followed by a ∼40-nucleotide region of
random FANA nucleotides, then 19 nucleotides of the FANA fixed sequence.
With HIV RT, most of the DNA region played no role in the binding
of the selected aptamers to RT.[55] This
was not the case for IN where the DNA region was pivotal for tight
binding (see below). Selection was stopped after round 10 of SELEX
as there was essentially no difference in the binding of pooled material
for rounds 9 and 10, both of which bound at ∼2 nM.
Sequences Recovered
from the FANA SELEX
Material from
round 10 of the SELEX was cloned, and 26 sequences were recovered
(Figure , only random
region nucleotides are shown, and Clustal was used to compare the
sequences from the same lineage). The sequences are arranged in 14
different lineages based on sequence relationships. One sequence appeared
three times (IN-1.1) and another twice (IN-2.1). All other sequences
appeared only once. IN-1.1 was also closely related to four other
sequences (IN-1.2–IN-1.5) and showed strong homology at the
5′ end to IN-2.1. A second cluster of four closely related
sequences (IN-3.1–IN-3.4) was also present, although they shared
little sequence homology with lineage 1 or 2. Finally, IN-8.1 and
IN-8.2 differed by only three nucleotides but were not strongly related
to any other sequences. Several sequences (notably lineages 8–14)
showed G-rich runs intermittently spaced throughout the sequence.
This suggested the possibility of intra- or intermolecular G tetraplex
formation, although this was not tested. While relatively little is
known about the ability of XNAs to form G quartets, both FANA[70] and TNA[71] have been
shown to form these structures.For several of the sequences,
the KD,app values for binding to IN were
calculated using filter binding assays as described in the Methods section. Table shows values for the SELEX starting material
and several aptamer constructs. The selection process was performed
using INF185H/C280S, a more soluble version of IN that
retains full activity.[72] For some constructs,
values were calculated for this enzyme as well as INwt and
INSso7d. The Sso7d version contains a ∼7-kDa N-terminal
DNA binding domain corresponding to Sulfolobus solfataricus chromosomal protein Sso7d. This modification improves IN activity
and solubility.[64] Binding to the starting
material was in the high nanomolar to low micromolar range for all
IN enzymes tested. IN-1.1, IN-2.1, IN-3.1, and IN 6.1 bound strongly
to IN showing KD,app values in the 40–100
pM range with INF185H/C280S. Note that this is well over
1000-fold tighter than binding to the starting material. Interestingly,
IN-1.1 and IN-2.1 shared strong homology over the first 16 nucleotides
derived from the random region, but less homology over nucleotides
17–40. IN-1.1 and 3.1 were nearly identical to other members
of these lineages, and it was assumed that these sequences would bind
similarly to IN, although they were not tested. Other sequences from
round 10 (Figure )
that were tested bound poorly. This included the G-rich sequence IN-8.1
(which is nearly identical to IN-8.2), IN-13.1, and IN-14.1. This
suggests that their high prevalence in the round 10 SELEX material
may be related to another selection parameter such as a higher affinity
for nitrocellulose filters or over-representation in the starting
material.
FANA IN Aptamers of the Same Sequence Composed of 2′-Fluororibonucleic
Acid (F-RNA) Bound INF185H/C280S Weakly and Aptamers Bound
Poorly to Protoptype Foamy Virus (PFV) IN
Changing all FANA
nucleotides in IN-1.1 to F-RNA (D4K can incorporate both analogs)
resulted in a loss in binding (IN-1.1 (FANA to F-RNA); Table ). In fact, the IN-1.1 F-RNA
construct bound so poorly that we were unable to determine the KD,apps in standard assays with up to 5 μM
INF185H/C280S. These results illustrated that aptamer binding
specificity cannot be mimicked by F-RNA. This is consistent with findings
for FANA aptamers selected to HIV RT.[55] Since FANA and F-RNA have identical chemical formulas, the complete
loss of specific binding may seem counterintuitive. However, as was
noted in the introduction, these analogs are
structurally divergent with FANA duplexes more closely resembling
DNA (the natural substrate of IN) while F-RNA duplexes are more RNA-like.[43,58,60−62] IN 1.1 binding
to PFV IN was also tested and was much weaker than binding to HIV
IN (Table ). The HIV
and PFV IN share only ∼20% sequence homology[73] but are predicted to fold similarly.[74] This suggests that IN 1.1 has high specificity for HIV
IN, just as FA-1 had high specificity for HIV RT.[55]
Prediction of the Structure of FANA IN Aptamers
Was Guided by
Covariation
The 26 sequences were clustered into 14 different
lineages based on sequence conservation and then analyzed to identify
the presence of shared structural motifs. Predicting the secondary
structure of these FANA aptamers and other XNAs is complicated by
a lack of available programs or enzymatic assays to interrogate XNA
folding. Available programs for RNA or DNA can be used to approximate
some XNA structures as many XNAs retain canonical bases. However,
some XNAs,[44] like FANA,[63,75] form stronger duplex structures with DNA and RNA than natural nucleic
acids, suggesting stronger base-pairing interactions. Although FANA
shares properties with DNA (e.g., stimulation of RNase H activity
in FANA-RNA hybrids,[63,76] susceptibility to DNases,[77] and incorporation by some DNA polymerases[78]), the stability of FANA–RNA hybrids is
significantly greater than DNA–RNA and comparable to RNA–RNA
duplexes.[77] For our structure analysis,
we relied on a combination covariation analysis and RNA folding programs
to predict the structures of the FANA IN aptamers (see Methods). We identified a potential four-way junction
secondary structure for sequences from seven (first seven lineages
in Figure ) of the
14 lineages. Representative aptamers (IN 1.1, 2.1, and 6.1) are shown
in Figure along with
an alignment illustrating the positions of the common stem structures
in the aptamers from the seven different lineages. The presence of
P3 and P4 in the functional aptamer structure is supported by covariation
between sequences that conform to the predicted four-way junction
structure. Because P1 and P2 are either entirely (P1) or in part (P2)
derived from invariable sequences at the 5′ and 3′ end
of the aptamers, covariation cannot be used to support base pairing
of these elements. Additionally, RNA folding software predicts multiple
different base-pairing patterns for the invariable region of these
sequences. For sequence 1.1, 1.2, 1.4, and 2.1, RNA structure prediction
software predicts the pairing shown for P1 and P2, but the other sequences
are predicted to form alternative base pairing patterns (Figure S2) that are inconsistent with the structure
shown in Figure .To test the proposed structure in Figure and to resolve the ambiguity in predicting
base pairs in the invariant region of the sequences, we generated
additional aptamer sequences and tested their affinity for IN. Constructs
designed to stabilize the pairing of P1 as shown in Figure retain picomolar affinity
for IN, but constructs that stabilize competing structures significantly
increased the KD,app (Figures S3–S5). The constructs IN 1.1(P1flip) and IN
2.1(P1flip) contain a set of compensatory mutations that maintain
P1 pairing for IN 1.1 and IN 2.1 but eliminate the base pairs of the
competing alternative structures. IN 1.1(−10) and IN 2.1(−10)
lack the last 10 nucleotides at the 3′ end of the aptamers.
These last 10 nucleotides are involved in base pairs in the alternative
P1 structure, but they are predicted to be single stranded in Figure . The shortened constructs
therefore eliminated competing base pairs in the alternative structure.
All four of these modifications bound IN with affinity that was comparable
to the parent constructs (Table ). Alternatively, IN-2.1(altP1), which contains changes
in the invariant regions that are predicted to stabilize an alternative
structure, exhibited a dramatic loss in affinity for IN, although
it still bound much better than the starting material (Table ). Several additional constructs
(Figure S3) that are predicted to disrupt
the secondary structure shown in Figure exhibited even more dramatic losses in binding
affinity. Overall, these results show that the structure depicted
in Figure likely
represents the aptamer structure responsible for high affinity IN
binding.Finally, it was notable that all the stem-loop structures
of IN
1.1 and IN 2.1 were highly similar despite the differences in sequence
in P3 and P4. In contrast, despite binding strongly (Table ), IN 6.1 was clearly different
with a larger P2 stem and smaller P4 stem and loop (P1 and P3 were
structurally invariant in all aptamers). Folded structures for lineages
3, 4, and 5 had less pronounced changes (when compared to IN 1.1 and
IN 2.1) in P2 and P4 than IN 6.1. To test for possible interactions
between P2 and P4, a construct that contained the P2 stem-loop from
IN 6.1 and P4 stem-loop from IN 2.1 was constructed (IN-(6.1/2.1), Figure S3). This construct bound INF185H/C280S
with affinity similar to the parent constructs (Table ). A version of the construct with the 10
3′ nucleotides deleted (IN-(6.1/2.1)(−10)) was also
made and showed similar binding affinity (Table ). These results suggest that the P2 and
P4 stem loops are interchangeable and probably contribute independently
to IN binding.
Comparison of FANA IN Aptamers with Other
IN Aptamers
DNA and RNA aptamers to HIV IN have been selected
by other groups,[11,27,38,40] and IN has been shown to bind to various
elements in the viral RNA
genome.[66] A 77 nucleotide RNA aptamer termed
“P5” was isolated using SELEX and bound to INF185H/C280S (binding to wild type IN was similar (data not shown)) with a KD of ∼10 nM under conditions with similar
ionic strength to those used here (see below).[11] IN was also shown to bind to the TAR loop (bases 1–57
of the HIV-1 genome) with a KD of ∼3
nM (measured in a buffer with relatively lower ionic strength).[66] Using the same conditions used for FANA aptamer
measurements, a KD,app of 53 ± 26
nM was observed for P5 (Table ). This is somewhat higher than the reported KD but in the same range. Differences may have resulted
from inclusion of Mn2+ rather than Mg2+ in the
original P5 measurements.[11] TAR bound poorly
to INF185H/C280S and INwt under the high salt
conditions used in our assays. We could not determine a KD value for IN binding to TAR under the conditions used
for FANA as the binding was too weak. This may be related to the lower ionic
strength as well as the different measurement methods used in the
previous work.[66]The P5 and TAR RNAs
were also compared to IN-1.1 for binding to INF185H/C280S in a competition binding assay (Figure ). The assay contained 5 nM radiolabeled
IN-1.1 and 5 nM INF185H/C280S with an increasing amount
of unlabeled competitor. As expected, unlabeled IN-1.1 competed the radiolabeled
IN-1.1 off of INF185H/C280S. When equal amounts of labeled
and unlabeled IN-1.1 were added, the level of radiolabeled IN-1.1
bound to INF185H/C280S decreased by about 50% as expected,
and further decreases were observed as more unlabeled IN-1.1 was added.
In contrast, neither P5 nor HIV TAR produced any significant decrease
in binding of radiolabeled IN-1.1, even when they were added at 16-fold
excess. This is consistent with IN-1.1 binding much more strongly
to INF185H/C280S than the other constructs. Alternatively,
the constructs may bind to different locations on INF185H/C280S such that they do not affect each other’s binding. However,
this possibility was ruled out since radiolabeled P5 was completely
displaced from INF185H/C280S with IN-1.1 (data not shown).
Figure 3
HIV TAR
and Aptamer P5 cannot compete with IN-1.1 for binding to
IN. 5 nM 5′ 32P labeled IN-1.1 was incubated with
5 nM HIV INF185H/C280S in 20 mM Tris-HCl at pH 7.5, 200
mM NaCl, 6 mM MgCl2, 1 mM DTT, and 20 μL total volume
at room temperature. Various amounts of excess unlabeled competitor
(IN-1.1, P5 aptamer, or HIV TAR (1–57)) were added (fold excess:
0, 1, 2, 4, 8, 16) and incubated for 50 min. Samples were run over
nitrocellulose filters, washed, and counted in a scintillation counter.
Values on the y axis are relative to the sample with
no competitor added.
HIV TAR
and Aptamer P5 cannot compete with IN-1.1 for binding to
IN. 5 nM 5′ 32P labeled IN-1.1 was incubated with
5 nM HIV INF185H/C280S in 20 mM Tris-HCl at pH 7.5, 200
mM NaCl, 6 mM MgCl2, 1 mM DTT, and 20 μL total volume
at room temperature. Various amounts of excess unlabeled competitor
(IN-1.1, P5 aptamer, or HIV TAR (1–57)) were added (fold excess:
0, 1, 2, 4, 8, 16) and incubated for 50 min. Samples were run over
nitrocellulose filters, washed, and counted in a scintillation counter.
Values on the y axis are relative to the sample with
no competitor added.
The IN-1.1 Aptamer Is a
Potent Inhibitor of HIV IN Activity
and Intasome Formation
HIV IN, together with viral DNA, forms
multiple intasome species, including tetramers and dodecamers that
are functional for DNA integration.[79] Similar
structures with other retroviruses consisting of four or eight and
16 IN subunits have been uncovered.[80−82] Both intasome formation
and concerted integration activity were monitored in the presence
and absence of different concentrations of IN-1.1 using INSso7d (Figure A). Concerted
integration was inhibited by IN-1.1 with an IC50 of approximately
80 nM with complete inhibition at about 160 nM IN-1.1. This correlated
well with the formation of intasomes, which began to decline at ∼80
nM IN-1.1, and no intasome formation was observed past 160 nM IN-1.1
(Figure B). Although
these values are large compared to the measured KD,app for IN binding to IN-1.1 (Table ), it is important to note that the concentration
of INSso7d in the reactions was 2.5 μM; currently
established in vitro integration assays require an
IN protein concentration that is orders of magnitude higher than the KD,apps of the aptamer, which precludes measuring
the true IC50. Complete inhibition of integration and intasome
formation was observed when the ratio of IN-1.1/INSso7d was ∼1:10 in the reaction. This is consistent with concerted
integration requiring the formation of the intasome complex, which
can apparently be completely abrogated when a relatively small proportion
of the total INSso7d is occupied by aptamer.
Figure 4
(A and B) IN-1.1
aptamer inhibits concerted integration and intasome
assembly. (A) Integrase strand transfer assay. 2.5 μM INSso7d and 1 μM 27 bp FAM labeled viral DNA and 300 ng
of target plasmid DNA pGEM-9zf substrate were incubated in 20 mM HEPES
at pH 7.5, 25% glycerol, 10 mM DTT, 5 mM MgCl2, 4 μM
ZnCl2, and 100 mM NaCl in a 20 μL reaction volume.
The reaction was carried out at 37 °C for 1 h. The integration
products were recovered by ethanol precipitation and subjected to
electrophoresis in a 1.5% agarose gel. (B) Intasome assembly was carried
out in the same way except that no target DNA was added, and CaCl2 was substituted for MgCl2. Intasome assembly mix
was subjected to electrophoresis on a 3.0% low melting agarose gel.
DNAs were visualized using a fluorescence scanner.
(A and B) IN-1.1
aptamer inhibits concerted integration and intasome
assembly. (A) Integrase strand transfer assay. 2.5 μM INSso7d and 1 μM 27 bp FAM labeled viral DNA and 300 ng
of target plasmid DNA pGEM-9zf substrate were incubated in 20 mM HEPES
at pH 7.5, 25% glycerol, 10 mM DTT, 5 mM MgCl2, 4 μM
ZnCl2, and 100 mM NaCl in a 20 μL reaction volume.
The reaction was carried out at 37 °C for 1 h. The integration
products were recovered by ethanol precipitation and subjected to
electrophoresis in a 1.5% agarose gel. (B) Intasome assembly was carried
out in the same way except that no target DNA was added, and CaCl2 was substituted for MgCl2. Intasome assembly mix
was subjected to electrophoresis on a 3.0% low melting agarose gel.
DNAs were visualized using a fluorescence scanner.
Discussion
In this report, a previously described method
to directly select
aptamers to HIV-1 RT containing only FANA nucleotides in the randomized
region was employed to generate aptamers to HIV IN. Like the HIV RT
aptamers, the IN aptamers bound with a KD,app in the low picomolar range (Table ). With HIV RT, previous aptamers made with natural
RNA and DNA typically bound RT with modestly lower affinities that
were in the same range as the FANA aptamers.[55] In contrast, previous natural nucleic acid aptamers to IN bind much
more weakly than the FANA aptamers produced here.[31,45,48,49] Like other
XNAs, FANA nucleic acids form unique hybrid structures with binding
free energies that are different from natural nucleic acids.[44,60,83] Coupled with the 2′ fluorine
and arabinose sugar, this may allow FANA to sample unique structural
spaces or make distinct interactions with amino acid moieties and
domains in proteins. Other XNAs or noncanonical nucleotide analogs
would likely have different properties that may make them more or
less amenable to specific proteins. The commercial availability of
FANA and F-RNA nucleotides has contributed to a more rapid advancement
of these XNAs. Whether XNAs or natural nucleic acid will yield better
aptamers with respect to binding affinity likely depends on the particular
protein target.In addition to the altered target binding specificity
imparted
by the chemical and structural diversity noted above, XNA modifications
of the phosphate backbone and sugar groups in particular can enhance
biological stability (reviewed in refs (84), (85)). This property would be pivotal for potential aptamer drugs or
aptamers that would be used in assays with biological material. Macugen,
an aptamer that is used for treatment of macular degeneration, was
originally selected from a nucleic acid pool containing 2′-fluoropyrimidines.
The resulting aptamer bound with low micomolar affinity to the VEGF
target protein. Further postselection modification increased the stability
of the aptamer without substantially sacrificing binding affinity.[48] However, postselection modification can be time-consuming
and is hindered by the potential loss of affinity.The FANA
aptamers described here and previously differ from most
other modified aptamers in that the selected region is composed solely
of FANA nucleotides, a property imparted by the ability of modified
or natural polymerases to efficiently and accurately incorporate all
four FANA nucleotides.[49,50] In contrast, T7 RNA polymerase
efficiently incorporates F-RNApyrimidines, which limits selection,
while other RNA polymerases that can more efficiently incorporate
F-RNA are being explored.[86] During the
current experiments, we found that the modified D4K enzyme used for
FANA incorporation also incorporates all four F-RNA nucleotides (Table , IN-1.1 (FANA to
F-RNA)). The fidelity and efficiency of this process was not explored,
but it has the potential for allowing the selection of homogeneous
F-RNA aptamers. Like the F-RNA aptamers, most other modified aptamers
produced thus far are also heterogeneous. In some cases, the various
nucleotide analogs are incorporated by natural or modified polymerases,
while moieties can also be added to select nucleotides using “click”
chemistry or other techniques.[84,87−89]The structural models for the IN aptamers were guided by covariation
analysis that predicted the presence of two stem structures (P3 and
P4 in Figure ) in
the variable region of several of the sequences from round 10, indicating
that they adopt the same functional structural motif, despite considerable
sequence variation. Interestingly, using RNA free-energy parameters[90] resulted in structure predictions that were
identical or nearly identical to our final structure prediction for
the sequences IN 1.1, IN 1.2, IN 1.4, and IN 2.1 (Figure S2). For these sequences, predicting FANA structures
using RNA folding parameters produced results that were consistent
with both the covariation analysis and mutational analysis. Alternatively,
when DNA free-energy parameters[68] were
used, none of the sequences were predicted to fold into the structure
supported by covariation or mutational analysis. This suggests that
RNA folding parameters, as opposed to DNA, are better predictors of
FANA secondary structure, but further analysis with other aptamers
will be necessary to establish this. Several methods of confirming
the secondary structures of nucleic acids cannot be applied to FANA
and other XNAs. Development of new methods for analyzing XNA structures
should be helpful in future experiments characterizing XNAs and other
modified nucleic acids.
Authors: O M Antipova; E G Zavyalova; A V Golovin; G V Pavlova; A M Kopylov; R V Reshetnikov Journal: Biochemistry (Mosc) Date: 2018-10 Impact factor: 2.487
Authors: Katherine J Lepik; P Richard Harrigan; Benita Yip; Lu Wang; Marjorie A Robbins; Wendy W Zhang; Junine Toy; Linda Akagi; Viviane D Lima; Silvia Guillemi; Julio S G Montaner; Rolando Barrios Journal: AIDS Date: 2017-06-19 Impact factor: 4.177
Authors: David H Mathews; Matthew D Disney; Jessica L Childs; Susan J Schroeder; Michael Zuker; Douglas H Turner Journal: Proc Natl Acad Sci U S A Date: 2004-05-03 Impact factor: 11.205
Authors: Irina Anosova; Ewa A Kowal; Matthew R Dunn; John C Chaput; Wade D Van Horn; Martin Egli Journal: Nucleic Acids Res Date: 2015-12-15 Impact factor: 16.971
Authors: Yee-Wai Cheung; Pascal Röthlisberger; Ariel E Mechaly; Patrick Weber; Fabienne Levi-Acobas; Young Lo; Alvin W C Wong; Andrew B Kinghorn; Ahmed Haouz; G Paul Savage; Marcel Hollenstein; Julian A Tanner Journal: Proc Natl Acad Sci U S A Date: 2020-07-06 Impact factor: 11.205
Authors: Luke K McKenzie; Marie Flamme; Patrick S Felder; Johannes Karges; Frederic Bonhomme; Albert Gandioso; Christian Malosse; Gilles Gasser; Marcel Hollenstein Journal: RSC Chem Biol Date: 2021-11-02