APOBEC3B (A3B) deamination activity on ssDNA is considered a contributing factor to tumor heterogeneity and drug resistance in a number of human cancers. Despite its clinical impact, little is known about A3B ssDNA substrate preference. We have used nuclear magnetic resonance to monitor the catalytic turnover of A3B substrates in real-time. This study reports preferred nucleotide sequences for A3B substrates, including optimized 4-mer oligonucleotides, and reveals a breadth of substrate recognition that includes DNA sequences known to be mutated in drug-resistant cancer clones. Our results are consistent with available clinical and structural data and may inform the design of substrate-based A3B inhibitors.
APOBEC3B (A3B) deamination activity on ssDNA is considered a contributing factor to tumor heterogeneity and drug resistance in a number of human cancers. Despite its clinical impact, little is known about A3B ssDNA substrate preference. We have used nuclear magnetic resonance to monitor the catalytic turnover of A3B substrates in real-time. This study reports preferred nucleotide sequences for A3B substrates, including optimized 4-mer oligonucleotides, and reveals a breadth of substrate recognition that includes DNA sequences known to be mutated in drug-resistant cancer clones. Our results are consistent with available clinical and structural data and may inform the design of substrate-based A3B inhibitors.
The human APOBEC3 (apolipoprotein
B mRNA editing enzyme, catalytic polypeptide-like) cytidine deaminase
family comprises seven members, which convert C to U on single stranded
DNA (ssDNA), among other functions.[1,2] APOBEC proteins
contain either one (A3A/A3C/A3H) or two domains (A3B/A3DE/A3F/A3G)
with similar overall architecture. Although each domain incorporates
a zinc binding site, in the case of A3B only the C-terminal domain
(CTD) has been associated with catalytic activity.[3,4] Activity-modulating
functions of the N-terminal domain (NTD) have been proposed;[2] however, expression and purification of full-length
recombinant wild-type A3B has proven challenging.[4] As part of the innate immune system, APOBEC proteins are
upregulated in response to viral infection, for example, human immunodeficiency
virus type 1 or human papilloma virus, where they deaminate viral
genomes and restrict viral replication.[1,5] However, several
members have also been linked to cancer, and A3B in particular has
been associated with increased mutational burden in head and neck,
cervical, lung, bladder, and breast cancer.[6] Depending on subsequent repair mechanisms, APOBEC-mediated deamination
predominantly causes C→T transition and C→G transversion,[7] thereby contributing to mutational load, tumor
heterogeneity, and the development of drug resistance.[8] A3B inhibition with small molecules is therefore considered
a promising therapeutic strategy to combat drug resistance in cancer
patients. Several studies have analyzed the sequence context of APOBEC-mediated
deamination in vitro and from patient sequencing
data, identifying a consensus sequence of 5′-TC-3′ (the target C is henceforth underlined)
for A3A, A3B, A3F, and A3H, while A3G prefers a 5′-CCC-3′ consensus sequence.[2,9−19] A recent study has further evaluated the effect of oligonucleotide
length and substrate context on the A3B CTD deamination rate.[20] However, a systematic study of A3B substrate
sequence preference has not been reported. We use real-time nuclear
magnetic resonance (NMR) experiments to determine the impact of substrate
sequence and length on turnover rate by wild-type A3B CTD. Through
these studies, we further inform the biological and clinical relevance
of A3B activity and gain insight into its molecular mechanism.We found that the 10-mer APOBEC3B substrate sequence 5′-TTATTCATAT-3′ contained in the 30 nucleotide (nt) FAM-labeled
ssDNA substrate reported by Fu et al.[3] was
readily deaminated by A3B CTD in a fluorescence-based deamination
assay (Figure S1) and by NMR (Figure A, Table S1) with an initial rate of 0.174 ± 0.042 mM·h–1 (n = 3, ±SD). By contrast,
no deamination of the mononucleotide 2′-deoxycytidine 5′-monophosphate
(5′-dC-3′, dCMP) or the commonly
reported A3B preferred motif 5′-TCA-3′[10,13] was observed after a 5 h incubation (Figure A, Table S1).
When we extended the incubation time to 20 h, we detected 20% conversion
for 5′-TCA-3′, a deamination
rate of 0.007 ± 0.001 mM·h–1 (n = 3, ±SD), 25-fold slower than for the 10-mer 5′-TTATTCATAT-3′. We did not observe any deamination for
dCMP even after 50 h incubation.
Figure 1
Impact of (+)
and (−) side nucleotides on substrate turnover.
(A) Representative 1H NMR spectra recorded over a time
course are shown for the 10-mer 5′-TTATTCATAT-3′, dCMP and 5′-TCA-3′. The intensities of the C peak were used to monitor substrate turnover and to calculate the
initial reaction rates. (B) Time course of deamination rates for 10-mer
5′-TTATTCATAT-3′, dCMP, and (+) and (−) side only oligonucleotides.
(C) Addition of +1A (left) and two T nucleotides at the (−)
side (right) increase turnover of the substrate oligo, whereas a single
T on the (−) side appears detrimental. ns not significant,
*p-value < 0.05, ***p-value <
0.001, One-way ANOVA and Bonferroni test.
Impact of (+)
and (−) side nucleotides on substrate turnover.
(A) Representative 1H NMR spectra recorded over a time
course are shown for the 10-mer 5′-TTATTCATAT-3′, dCMP and 5′-TCA-3′. The intensities of the C peak were used to monitor substrate turnover and to calculate the
initial reaction rates. (B) Time course of deamination rates for 10-mer
5′-TTATTCATAT-3′, dCMP, and (+) and (−) side only oligonucleotides.
(C) Addition of +1A (left) and two T nucleotides at the (−)
side (right) increase turnover of the substrate oligo, whereas a single
T on the (−) side appears detrimental. ns not significant,
*p-value < 0.05, ***p-value <
0.001, One-way ANOVA and Bonferroni test.Using the 10-mer oligonucleotide 5′-TTATTCATAT-3′ as a benchmark, we set out to determine
the contribution
to A3B CTD-mediated deamination of nucleotides both (−) (i.e.,
5′) and (+) (i.e., 3′) to C.
Dinucleotides, such as 5′-TC-3′
and 5′-CA-3′, were not deaminated
by A3B CTD. When we added a second nucleotide on the (+) side, we
observed deamination of the trimer 5′-CAT-3′ with an initial rate of 0.023 ± 0.001 mM·h–1 (n = 3, ±SD) (Figure B, Figure S2, and Table S1), 8-fold slower
than the 10-mer. Notably, 5′-CAT-3′
is the smallest oligonucleotide for which we observed deamination
after a 5 h incubation. Shi et al. suggested that aromatic π-stacking
of nucleobases positioned +1 to +3 to the substrate C contribute to productive deamination.[13] However, in our hands, further extension of the 5′-CAT-3′ oligonucleotide to tetra- and pentanucleotides
using a poly-T sequence had little effect (Figure B, Figure S2,
and Table S1), indicating that nucleotides
at positions +3 and +4 do not further enhance substrate turnover rate.
A similar study on the (−) side of the substrate C showed lower deamination rate compared with their matched
(+) side equivalents (Figure B, Figure S2, and Table S1). For example, no deamination was observed for 5′-TTC-3′ after 5 h incubation and a deamination rate
could only be obtained with extended incubation time (15 h) [0.009
± 0.002 mM·h–1 (n = 4,
±SD) (Figure B, Figure S2, and Table S1)]. The initial deamination rates of oligonucleotides
extended in either the (−) or the (+) direction are depicted
as bar charts in Figure S2, and deamination
rates are provided in Table S1.These
results prompted us to further investigate the +1 nucleotide.
Intriguingly, introduction of A to the +1 position of 5′-TTC-3′ increased the deamination rate to a level
similar to the 10-mer rate [5′-TTCA-3′
initial rate = 0.107 ± 0.014 mM·h–1 (n = 3, ±SD)]. This effect was also observed for 5′-ATTC-3′;
addition of a +1 A (5′-ATTCA-3′)
fully restored the deamination rate to that of the 10-mer (Figure C, left, and Table S1). These observations suggest that the
+1 nucleotide is important for productive substrate turnover. Analogously,
we added T to the −1 position of oligonucleotide 5′-CAT-3′. The resultant 5′-TCAT-3′ sequence showed low catalytic turnover, which could
be improved by addition of a second nucleotide on the (−)-side
(Figure C, right,
and Table S1). Thus, only one nucleotide
at the (−)-side appears detrimental, whereas addition of both
−1 and −2 nucleotides enhances the deamination rate
∼6-fold compared to the 5′-CAT-3′
trinucleotide. Crystal structures of A3A and a chimeric A3A/A3B CTD
crystal structure in complex with ssDNA substrates reveal large conformational
changes of Tyr132 (A3A), Tyr315 (A3B), and Arg211 (A3B)[13] in relation to the apo structures[16,21] in order to accommodate T at the −1 position. Our results
suggest that the specific interactions of −1 T with Asp314
are not sufficient to stabilize a productive protein conformation,
and further interactions of the −2 nucleotide with the protein
may be required. This could explain why deamination, albeit weakly,
of (+)-side only nucleotides was observed; the turnover of such substrates
may not require such conformational adaptation of A3B.As the
shortest oligonucleotide substrate subject to notable deamination
was the tetramer 5′-TTCA-3′,
we investigated whether the directionality of ssDNA presentation is
important for short oligonucleotides. We tested the corresponding
5′-ACTT-3′ sequence (equivalent
to 3′-TTCA-5′) and observed no
deamination (Figure A and Table S1). Using a crystal structure
of 5′-TTCA-3′ in complex with
the A3A/A3B CTD chimera,[13] we modeled 3′-dC-5′
into the catalytic site to understand the difference between 5′-dC-3′
and 3′-dC-5′ binding at an atomic level. Constraining
the cytidine base within the catalytic site to a productive binding
mode and altering the ssDNA directionality from 5′-dC-3′
to 3′-dC-5′ revealed severe clashes with the protein
surface (Figure B)
due to the fixed chirality of the ribose sugar (Figure C).
Figure 2
Influence of ssDNA directionality on substrate
turnover. (A) The
5′ → 3′ sequence TTCA
but not the corresponding 3′ → 5′ TTCA (equivalent to 5′-ACTT-3′) oligonucleotide
is deaminated by A3B CTD. (B) Binding of 3′ → 5′
deoxycytidine monophosphate (dCMP, orange), but not 5′ →
3′ dCMP (green) results in severe clashes with the A3B protein
(blue cartoon/spheres and gray surface). Therefore, only the 5′
→ 3′ dCMP is compatible with binding; the crystal structure
conformation of the base is fixed in a productive binding mode; PDB
entry 5TD5 was
used.[13] (C) Due to the chirality of the
deoxyribose sugar, the 3′ and 5′ ends adopt different
vectors in 5′ → 3′ and 3′ → 5′
dCMP; the cytosine base is depicted in the same orientation, which
is required for productive deamination (Figure B).
Influence of ssDNA directionality on substrate
turnover. (A) The
5′ → 3′ sequence TTCA
but not the corresponding 3′ → 5′ TTCA (equivalent to 5′-ACTT-3′) oligonucleotide
is deaminated by A3B CTD. (B) Binding of 3′ → 5′
deoxycytidine monophosphate (dCMP, orange), but not 5′ →
3′ dCMP (green) results in severe clashes with the A3B protein
(blue cartoon/spheres and gray surface). Therefore, only the 5′
→ 3′ dCMP is compatible with binding; the crystal structure
conformation of the base is fixed in a productive binding mode; PDB
entry 5TD5 was
used.[13] (C) Due to the chirality of the
deoxyribose sugar, the 3′ and 5′ ends adopt different
vectors in 5′ → 3′ and 3′ → 5′
dCMP; the cytosine base is depicted in the same orientation, which
is required for productive deamination (Figure B).Work by Byeon et al.[9] suggested
that
A at the −2 position, that is, 5′-ATCA-3′, is a better A3B substrate than 5′-TTCA-3′, and consistent with their report, we determined
an initial rate of 0.205 ± 0.041 mM·h–1 (n = 3, ±SD) for 5′-ATCA-3′ compared to 0.107 ± 0.014 mM·h–1 (n = 3, ±SD) for 5′-TTCA-3′ (p-value = 0.0166, unpaired t test). This observation raised further questions concerning
A3B substrate preference, and we therefore systematically evaluated
the impact of all possible nucleobases at positions −2, −1,
and +1 using the 5′-ATCA-3′ sequence
as a benchmark.Consistent with previous reports,[9] only
−1 T was compatible with substrate turnover in a 4-mer oligonucleotide
substrate. Deamination was reduced with −1 C and absent with
−1 A and −1 G (Figure A and Table S1); probably
due to extensive interactions of −1 T with A3B[13] and potential steric restrictions associated with the binding
of larger purine bases.[22]
Figure 3
A3B CTD base preference
at different substrate positions translates
to biologically relevant DNA sequences. (A) Based on the 5′-ATCA-3′ sequence as a benchmark, the effect of different
bases at positions −2, −1, and +1 on substrate turnover
was evaluated. Preference for G at +1 is also observed with the (+)-side
trimer oligonucleotides (p-value = 0.0046, unpaired t test). (B) Oligonucleotides representing commonly observed
drug resistance mutations containing identified substrate sequences
5′-TTCA-3′ (PIK3CA E452), 5′-ATCA-3′ (Kit D820), 5′-GTCG-3′ (Smo D473), and 5′-ATCG-3′
(Kit D716) are deaminated at a similar or higher rate to the 10-mer
positive control 5′-TTATTCATAT-3′,
whereas no turnover was observed for the negative control 5′-TACA-3′ (Kit T670) after 20 h. Notably, all drug
resistance mutations except Kit T670I affect the complementary strand.
The 10-mer and 5′-CAT-3′ deamination
rates from previous experiments are plotted as a reference. ns not
significant, *p-value < 0.05, **p-value < 0.01, ***p-value < 0.001, One-way
ANOVA and Bonferroni test.
A3B CTD base preference
at different substrate positions translates
to biologically relevant DNA sequences. (A) Based on the 5′-ATCA-3′ sequence as a benchmark, the effect of different
bases at positions −2, −1, and +1 on substrate turnover
was evaluated. Preference for G at +1 is also observed with the (+)-side
trimer oligonucleotides (p-value = 0.0046, unpaired t test). (B) Oligonucleotides representing commonly observed
drug resistance mutations containing identified substrate sequences
5′-TTCA-3′ (PIK3CA E452), 5′-ATCA-3′ (Kit D820), 5′-GTCG-3′ (Smo D473), and 5′-ATCG-3′
(Kit D716) are deaminated at a similar or higher rate to the 10-mer
positive control 5′-TTATTCATAT-3′,
whereas no turnover was observed for the negative control 5′-TACA-3′ (Kit T670) after 20 h. Notably, all drug
resistance mutations except Kit T670I affect the complementary strand.
The 10-mer and 5′-CAT-3′ deamination
rates from previous experiments are plotted as a reference. ns not
significant, *p-value < 0.05, **p-value < 0.01, ***p-value < 0.001, One-way
ANOVA and Bonferroni test.Variation of the −2 nucleobase also influenced substrate
turnover, with −2 G representing the best deamination substrate
(Figure A and Table S1).Variations at the +1 position
of 4-mer substrates had less impact
compared to the −1 position with 5′-ATCG-3′ the best substrate (Figure A and Table S1). Preference for +1 G was also observed in the (+)-side only trinucleotides,
with initial rates of 0.023 ± 0.001 (n = 3,
± SD) vs 0.093 ± 0.021 (n = 3, ± SD)
mM·h–1 for 5′-CAT-3′ and 5′-CGT-3′ (Figure A and Table S1). Combining these position-dependent
results suggested 5′-GTCG-3′
as an optimal A3B substrate; this oligonucleotide was deaminated at
a similar rate to 5′-ATCG-3′
and 5′-GTCA-3′. Intriguingly,
these 4-mer substrates were deaminated at a similar rate compared
to the control 10-mer oligonucleotide 5′-TTATTCATAT-3′ (p-value > 0.05 for 5′-ATCG-3′, 5′-GTCA-3′,
and 5′-GTCG-3′, One-way ANOVA
and Bonferroni test). Other studies have suggested that longer ssDNA
oligo substrates are required for binding[22] and deamination by the closely related A3A enzyme;[17] however, our data demonstrates proficient A3B substrate
turnover with short oligos provided the sequence context is appropriate.To further investigate the biological relevance of our findings,
we tested commonly observed mutations in cancer patients, where mutation
of a C base is consistent with the generation
of clones known to confer resistance to cancer therapy (Figure B and Table S1). For example, mutations within the Kit and Smoothened (Smo)
genes confer resistance to targeted cancer drugs such as imatinib
(Kit D820Y and D716N) and vismodegib (Smo D473H/N/Y).[23] These clinically identified mutations may all result from
A3B-mediated lesions, which can be further processed by DNA repair
mechanisms to give the observed amino acid change. Oligonucleotides
(10-mers) containing mutation sites within the identified substrate
sequences 5′-ATCA-3′ (Kit D820Y),
5′-ATCG-3′ (Kit D716N), and 5′-GTCG-3′ (Smo D473H/N/Y) were rapidly deaminated
at a similar rate to the 5′-TTATTCATAT-3′
10-mer substrate. Furthermore, the oligonucleotide representing the
PIK3CA E542K mutation previously associated with A3B-activity[24] was also deaminated. In contrast, no deamination
of 5′-TACA-3′ (Kit T670I), which
is not a predicted A3B substrate, was observed after 20 h. These findings
provide a mechanistic rationale for A3B-mediated generation of mutations
that sustain the emergence of resistant clones under the selective
pressure of targeted cancer therapy.Deamination of 5′-GTCG-3′
and 5′-ATCG-3′-containing substrates
at rates comparable to the 5′-TTATTCATAT-3′ 10-mer suggests that their productive deamination
is determined by both specific interactions of −1, −2,
and +1 substrate nucleotides, as well as the modified C, with A3B-CTD. Furthermore, the secondary structure and conformation
of the substrate may be important, consistent with the reported increased
A3B binding of hairpin versus linear oligonucleotides.[25]Previous A3B CTD NMR deamination studies
of +1 base variations
in the context of 40-mers are consistent with our results,[9] thereby suggesting that the preference for bases
immediately adjacent to the substrate C is conserved and independent
from the composition or length of the overall oligonucleotide. This
suggests that interactions with residues close to, but not within,
the catalytic site are required for tight and specific binding of
ssDNA, or for optimal positioning of the substrate C for productive deamination, or both. As discussed above, structural
data on A3B ssDNA binding is fully consistent with the preference
for T at position −1 and gives insights into possible induced-fit
mechanisms for substrate binding. However, structural data to inform
on potential interactions of −2 and +1 nucleotides is lacking
because the reported A3B–ssDNA crystal structure includes alterations
to A3B loop regions 1 and 3.[13] Due to their
proximity to the substrate-binding site, these loops are likely to
be critical to binding of the −2 and +1 nucleobases (Figure S3).Conflicting evidence is available
regarding the substrate specificity
of the full-length A3B protein compared to CTD alone: A study by Haché
et al. showed that APOBEC substrate specificity is not affected by
the NTD sequence and is determined by the CTD alone.[26] By contrast, Byeon et al. reported a similar extent of
deamination by full-length A3B in cell extracts at 5 h for 40-mer
substrates with varying nucleobases at the +1 position; however, initial
deamination rates were not reported.[9] More
detailed analyses of the impact of full-length protein on deamination
rate and substrate specificity are required and will be the subject
of future work.Our studies investigate the impact of substrate
length and composition
on deamination rate by A3B. Direct comparison with other A3 isoforms
is limited by the available data; however, a preference of A3A for
a pyrimidine at position −2 has been reported.[9,22] In contrast, we observe that A3B-mediated deamination is enhanced
with purine bases at the −2 position consistent with the findings
of Byeon et al.[9] These results indicate
a potential for A3A and A3B substrate specificity [XTCY (A3A)[22] and YTCZ (A3B; this study), where X is a pyrimidine base, Y is a purine
base, and Z can be any base (although T may be suboptimal)]. Importantly,
this is consistent with differences observed in cancer patient samples,
where A3A and A3B associated mutations can be distinguished based
on the identity of the −2 base.[11] Furthermore, the mutational pattern observed in patient samples with A3B gene deletion
but A3A gene retention is consistent with an A3A-substrate preference.[11] In the cellular environment, ssDNA is coated
with replication protein A (RPA); however, a recent study showed that
full-length A3B can compete with RPA binding to ssDNA.[27] Together, these studies suggest that our findings
are biologically and clinically relevant. Similar studies on related
isoforms such as A3F and A3H would be highly desirable to dissect
the contribution of each APOBEC family member to the mutational load
observed in distinct disease states.In summary, our studies
reveal a broader A3B substrate sequence
recognition than previously reported, which may widen the clinical
context of A3B-mediated mutational signatures. TCG sequences are removed from bioinformatic characterization so that
the events are not confused with CpG islands, where methylated C can be deaminated directly to T in a non-APOBEC-driven
manner.[24] However, our results suggest
that G is equal or even preferred to A and the contribution of A3B
to mutagenesis in the genome may therefore be underestimated. We observe
equivalent deamination rates for optimized short tetranucleotide sequences
compared to 10-mer nucleotides suggesting that induced-fit recognition
of A3B CTD for its substrates can be triggered by an optimal tetranucleotide
binding motif, which may facilitate the rational design of substrate-based
A3B-selective inhibitors. The substrate preference identified in this
study also translates to biologically relevant DNA sequences. We demonstrate
that A3B CTD deaminates oligonucleotides derived from sequences associated
with clinical resistance to cancer therapy, thereby supporting previous
evidence that A3B activity contributes to the evolution of resistance
in cancer.
Authors: Michael B Burns; Lela Lackey; Michael A Carpenter; Anurag Rathore; Allison M Land; Brandon Leonard; Eric W Refsland; Delshanee Kotandeniya; Natalia Tretyakova; Jason B Nikas; Douglas Yee; Nuri A Temiz; Duncan E Donohue; Rebecca M McDougle; William L Brown; Emily K Law; Reuben S Harris Journal: Nature Date: 2013-02-06 Impact factor: 49.962
Authors: Markus-Frederik Bohn; Shivender M D Shandilya; Tania V Silvas; Ellen A Nalivaika; Takahide Kouno; Brian A Kelch; Sean P Ryder; Nese Kurt-Yilmaz; Mohan Somasundaran; Celia A Schiffer Journal: Structure Date: 2015-04-23 Impact factor: 5.006
Authors: Ke Shi; Michael A Carpenter; Surajit Banerjee; Nadine M Shaban; Kayo Kurahashi; Daniel J Salamango; Jennifer L McCann; Gabriel J Starrett; Justin V Duffy; Özlem Demir; Rommie E Amaro; Daniel A Harki; Reuben S Harris; Hideki Aihara Journal: Nat Struct Mol Biol Date: 2016-12-19 Impact factor: 15.369
Authors: Tania V Silvas; Shurong Hou; Wazo Myint; Ellen Nalivaika; Mohan Somasundaran; Brian A Kelch; Hiroshi Matsuo; Nese Kurt Yilmaz; Celia A Schiffer Journal: Sci Rep Date: 2018-05-14 Impact factor: 4.379
Authors: Shurong Hou; Tania V Silvas; Florian Leidner; Ellen A Nalivaika; Hiroshi Matsuo; Nese Kurt Yilmaz; Celia A Schiffer Journal: J Chem Theory Comput Date: 2018-12-11 Impact factor: 6.006
Authors: Maksim V Kvach; Fareeda M Barzak; Stefan Harjes; Henry A M Schares; Harikrishnan M Kurup; Katherine F Jones; Lorraine Sutton; John Donahue; Richard T D'Aquila; Geoffrey B Jameson; Daniel A Harki; Kurt L Krause; Elena Harjes; Vyacheslav V Filichev Journal: Chembiochem Date: 2019-12-19 Impact factor: 3.164