Justin J King1,2, Faezeh Borzooee1,2, Junbum Im2,3, Mahdi Asgharpour1,2, Atefeh Ghorbani1,2, Cody P Diamond2, Heather Fifield2, Lesley Berghuis2, Mani Larijani1,2. 1. Department of Molecular Biology and Biochemistry, Faculty of Science, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada. 2. Program in immunology and Infectious Diseases, Division of Biomedical Sciences, Faculty of Medicine, Memorial University of Newfoundland, St. John's, Newfoundland A1B 3 V6, Canada. 3. BC Cancer Research/Terry Fox Labs, University of British Columbia, Vancouver, British Columbia BC V5Z 1L3, Canada.
Abstract
Activation-induced cytidine deaminase (AID) initiates antibody diversification by mutating immunoglobulin loci in B lymphocytes. AID and related APOBEC3 (A3) enzymes also induce genome-wide mutations and lesions implicated in tumorigenesis and tumor progression. The most prevalent mutation signatures across diverse tumor genomes are attributable to the mistargeted mutagenic activities of AID/A3s. Thus, inhibiting AID/A3s has been suggested to be of therapeutic benefit. We previously used a computational-biochemical approach to gain insight into the structure of AID's catalytic pocket, which resulted in the discovery of a novel type of regulatory catalytic pocket closure that regulates AID/A3s that we termed the "Schrodinger's CATalytic pocket". Our findings were subsequently confirmed by direct structural studies. Here, we describe our search for small molecules that target the catalytic pocket of AID. We identified small molecules that inhibit purified AID, AID in cell extracts, and endogenous AID of lymphoma cells. Analogue expansion yielded derivatives with improved potencies. These were found to also inhibit A3A and A3B, the two most tumorigenic siblings of AID. Two compounds exhibit low micromolar IC50 inhibition of AID and A3A, exhibiting the strongest potency for A3A. Docking suggests key interactions between their warheads and residues lining the catalytic pockets of AID, A3A, and A3B and between the tails and DNA-interacting residues on the surface proximal to the catalytic pocket opening. Accordingly, mutants of these residues decreased inhibition potency. The chemistry and abundance of key stabilizing interactions between the small molecules and residues within and immediately outside the catalytic pockets are promising for therapeutic development.
Activation-induced cytidine deaminase (AID) initiates antibody diversification by mutating immunoglobulin loci in B lymphocytes. AID and related APOBEC3 (A3) enzymes also induce genome-wide mutations and lesions implicated in tumorigenesis and tumor progression. The most prevalent mutation signatures across diverse tumor genomes are attributable to the mistargeted mutagenic activities of AID/A3s. Thus, inhibiting AID/A3s has been suggested to be of therapeutic benefit. We previously used a computational-biochemical approach to gain insight into the structure of AID's catalytic pocket, which resulted in the discovery of a novel type of regulatory catalytic pocket closure that regulates AID/A3s that we termed the "Schrodinger's CATalytic pocket". Our findings were subsequently confirmed by direct structural studies. Here, we describe our search for small molecules that target the catalytic pocket of AID. We identified small molecules that inhibit purified AID, AID in cell extracts, and endogenous AID of lymphoma cells. Analogue expansion yielded derivatives with improved potencies. These were found to also inhibit A3A and A3B, the two most tumorigenic siblings of AID. Two compounds exhibit low micromolar IC50 inhibition of AID and A3A, exhibiting the strongest potency for A3A. Docking suggests key interactions between their warheads and residues lining the catalytic pockets of AID, A3A, and A3B and between the tails and DNA-interacting residues on the surface proximal to the catalytic pocket opening. Accordingly, mutants of these residues decreased inhibition potency. The chemistry and abundance of key stabilizing interactions between the small molecules and residues within and immediately outside the catalytic pockets are promising for therapeutic development.
The DNA-editing
enzyme activation-induced
cytidine deaminase (AID) is expressed in activated B lymphocytes.
AID mutates deoxycytidine (dC) to deoxyuridine (dU) at immunoglobulin
(Ig) genes, triggering somatic hypermutation (SHM) and class switch
recombination (CSR) of antibodies.[1−7] AID deficiency results in a hyper IgM characterized by a lack of
high affinity antibodies of switched isotypes, which is readily treatable
by modern pharmaceuticals.[2,3,8] AID also mediates significant off-target and genome-wide mutagenesis,
some of which result in double stand breaks (DSBs) that mediate chromosomal
translocations.[9−14] Thus, a wealth of literature over two decades implicates AID in
initiation of leukemia/lymphomas including Burkitt’s lymphoma
(BL), diffuse large B cell lymphoma (DLCL), follicular lymphoma (FL),
multiple myeloma, and chronic lymphocytic leukemia.[10,15−19] These tumors arise from centroblasts or postcentroblasts, the narrow
stage in a B cell’s life where AID is expressed, and mutations
and translocation breakpoints (c-myc/IgH in BL, bcl-2/IgH in FL, bcl-6/IgH
in DLCL, IgH-CCND1 in Mantle cell lymphoma) occur at genomic sites
that are frequently targeted by AID.[18,20−27] The causal role of AID in lymphomagenesis was proven in IL-6 transgenic
mice which develop lymphomas that mimic human BL in phenotype/genotype
(c-myc/IgH). In this model, DSBs at both the IgH and translocation
partner c-myc loci are directly caused by AID.[9,15,28−30]Beyond tumor initiation,
AID expression can also exacerbate leukemia/lymphomas.
Genome-wide AID-mediated mutation signatures are prevalent in leukemia/lymphomas,
and numerous studies have shown that AID levels in tumors correlate
with poor diagnosis.[10,19,31−43] In chronic myeloid leukemia (CML), AID was shown to mutate tumor
suppressor and/or DNA repair genes and accelerate imatinib resistance.[31] Moreover, recent studies indicate that some
therapeutic agents, such as idelalisib and duvelisib, can exacerbate
AID-mediated genome mutations in tumors through increased AID expression
and chromosomal translocation frequency between the IgH locus and
off-target sites.[44,45]AID is a member of the
apolipoprotein B mRNA editing enzyme catalytic
polypeptide-like (APOBEC) family of Zn-dependent, single-stranded
polynucleotide-restricted cytidine deaminases.[4,5,46] The AID/APOBEC family includes AID, APOBEC1,
APOBEC2, APOBEC3A,B,C,D,F,G,H, and APOBEC4.[4,5,46] The APOBEC3 (A3) sub-branch carries out
antiviral protective functions by targeting viral DNA for mutation
in the cytoplasm of infected cells.[47−51] This antiviral activity has been the most studied
in the context of retroviruses like HIV whose genomes go through a
ssDNA replication phase; however, like AID, the A3s (particularly
A3A and A3B) are also a major endogenous source of genomic mutations
in many human cancers such as lung cancer, gastrointestinal cancer,
breast cancer, head and neck cancer, and ovarian cancer[2,48,52−67] with prevalent mutational signatures across sequenced tumor genomes.
A3 expression associates with poor prognosis, and A3 action is one
mechanism for the generation of drug resistance. More recently, the
role of A3s as cancer drivers and exacerbators has been confirmed
in mouse models.[68]Given their prominent
roles as mediators of one of the most prevalent
tumor genome mutation signatures, inhibiting AID and A3 activity has
been suggested as a potentially useful approach to treating AID-expressing
malignancies or augmenting other therapies.[19,40,69−79] We previously utilized a combined computational–biochemical
approach to glean insights into AID’s native and functional
structure.[80] This approach is based on
structure prediction using multiple templates, followed by functional
verification of model predictions using a library of AID variants,
including point/multiple mutants, orthologues, and chimeric versions
with portions of other deaminases exchanged into the AID scaffold,
or vice versa. We thus arrived at a map of AID’s functional
structure including surface topology, core architecture, and catalytic
pocket.[80] This map demonstrated that AIDs
form a catalytic pocket with the triad of Zn-coordinating residues
(H56, C87, and C90 in human AID) and catalytic glutamic acid (E58
in human AID) that can accommodate a dC residue in orientations that
support the four-stage deamination chemistry common to cytidine and
cytosine deaminase. This was reassuring because the same arrangement
of primary catalytic residues directly responsible for cytidine deamination
is classically found in cytidine deaminases across evolution. Furthermore,
the periphery of AID’s catalytic pocket houses a network of
noncatalytic residues, termed the “secondary catalytic residues”.
These residues, while not directly involved in deamination catalysis,
contact and/or stabilize the dC in deamination-conducive confirmations
within the catalytic pocket.[80] This network
of amino acids consists of G23, R24, R25, E26, T27, L29, N51, K52,
N53, G54, C55, V57, T82, W84, S85, P86, D89, Y114, F115, C116, and
E122 in human AID.[80] These residues form
the “walls” and “floors” of the catalytic
pocket and interact with substrate dC in several predicted protein
conformations through hydrogen bonding, electrostatic interactions,
and aromatic base stacking. Since publication of the functional structure
of AID using the computational–biochemical method, two partial
crystal structures have become available[81,82] which confirmed the map of AID’s catalytic pocket, including
the aforementioned arrangement of the primary and secondary catalytic
residues.A more detailed observation of the conformational
states of AID’s
catalytic revealed that it shifts dynamically between open and closed
positions and that the majority (∼75%) of conformations at
any time are predicted to exhibit a closed pocket.[80] This dynamic pocket closure, termed the “Schrodinger’s
CATalytic pocket”[83] for its duality,
was the first demonstration of such an inherent regulatory mechanism
in human DNA/RNA-editing enzymes. More recently, this type of catalytic
pocket closure was observed by X-ray and NMR on A3A and A3B, two close
siblings of AID.[84−87]Previous works on small-molecule inhibitors of AID/APOBECs
have
been largely focused on A3G. Screening of a library of 1280 pharmacologically
active compounds against A3G yielded several structurally related
small molecules that inhibited A3G at low micromolar concentrations
through covalent attachment of a nonconserved cysteine (C321) unique
to A3G’s substrate-specificity loop 7.[78] As a follow-up, a screening of >300 000 compounds yielded
different A3G inhibitors that covalently attached to the same cysteine
residue.[88] Although useful for studying
A3G biology, the electrophilic nature of these inhibitors is too cross-reactive
in a cellular context and is thus not suitable for cellular work.
As for AID, 5-aza-deoxycytidine incorporated into ssDNA was shown
to bind the active site of AID and inhibit expression via proteasomal
degradation,[75] though 5-aza-deoxycytidine
is a transition-state analogue which needs to be in ssDNA, as the
free form does not inhibit AID. More recently, 5-fluoro-2′-deoxyzebularine
incorporated into ssDNA was also shown to inhibit A3A and an A3B-A3A
chimera.[89]Thus, to date, there are
no reported specific small molecules of
AID, A3A, or A3B. As an extension of our previous works on delineating
AID’s catalytic pocket, we screened a library of small molecules
against multiple highly accessible conformations of its catalytic
pocket. We identified first-generation small molecule inhibitors that
specifically inhibit the mutagenic activity of purified AID, native
AID in whole cell extracts, and endogenous AID of B lymphoma cells.
Docking and mutational analyses reveal a network of contacts between
the small molecules and AID’s primary and secondary catalytic
residues. We found some analogue derivatives of the first-generation
small molecules that were even more efficient inhibitors of A3A and
A3B compared to AID.
Results
Rationale for Targeting
the Catalytic Pocket of AID
AID’s catalytic pocket
is an ideal target for small molecule
inhibition for several reasons. First, we have previously gleaned
detailed insights into its architecture and conformational breathing.[80] Second, the majority of structural differences
between AID and related cytidine deaminases are concentrated in the
catalytic pocket and proximal surface regions at the pocket opening,[80,83,90−93] thus allowing maximum specificity.
Third, we and others have described the pocket-adjacent main ssDNA
binding groove (groove 1) in detail,[80,82,83,94,95] thus offering a promising target for future derivatization.We reasoned that the functional and breathing structure of AID’s
catalytic pocket as first described through our computational-biochemical
approach[80] is an advantageous template
for inhibitor search for several reasons. First, the architecture
of its catalytic pocket was extensively functionally verified by testing
of mutants and chimeras.[83] Second, using
multiple catalytic pocket conformations is advantageous since AID/APOBEC
enzymes contain highly flexible loops that compose their catalytic
pocket leading to a range of different catalytic pocket conformations
for each enzyme.[80,83,90,91,96] Third, purely
homology modeled structures, using the same homology modeling methodology
used to arrive at AID’s functional structure, were shown to
be as reliable for generating inhibitor “hits”, as crystal
structures.[97] Considering that the functional
AID structure, especially of its inhibitor-target catalytic pocket,
was backed by extensive biochemical validation of homology modeling
predictions using AID mutants and chimeras[80,83] and was also confirmed by crystal structures (Figure S1),[81,82] we reasoned that it ought to
represent an even more high confidence template for inhibitor design.
Structure-Based Virtual Screening of Small Molecules against
the Catalytic Pocket of AID
To carry out a structure-based
docking search for small molecules that bind in its catalytic pocket,
we included five different conformations of AID which are representative
of the full range of conformations with accessible catalytic pockets
(Figure A–C).[80,83] This strategy would allow for the selection of compounds that bind
all active AID conformations. For small molecules, we utilized the
ZINC database which contains >100 million structures of commercially
available compounds.[98] We restricted our
search to the “clean lead” subset of 4.6 million compounds
because this set contains compounds that are lead-like defined by
pharmacological properties such as Lipinski’s rule of 5 and
properties generally amenable for oral intake. This set is composed
of molecules with benign functionality and excludes those with potentially
toxic chains such as aldehydes and thiols.[99,100]
Figure 1
Virtual
high-throughput screening of druglike small molecules against
the catalytic pocket of AID. (A) Ensemble of AID conformations covering
the range of accessible catalytic pockets dynamics used for virtual
high-throughput screening. The structures exhibit conserved overall
structure, with conformational changes localized to the secondary
catalytic loops that compose the walls and floors of the catalytic
pocket. N- to C-termini progression is shown from blue to red. The
purple sphere depicts the catalytic pocket-coordinated Zn. (B) Representative
surface structure of a catalytically productive AID:ssDNA complex
with docked ssDNA (blue) and dC poised for deamination in the catalytic
pocket (magenta). (C) dC bound in a catalytically productive configuration
in the catalytic pocket highlighting the secondary catalytic residues.
Adjacent ssDNA structure was omitted for clarity. This and other energetically
similar conformations of the accessible catalytic pocket state of
AID served as template for virtual high-throughput screening. (D)
The structure-based virtual high-throughput screening scheme. We screened
a large library of lead-like compounds from the ZINC library against
the catalytic pocket of AID. Using several independent and complementary
docking protocols to the catalytic pocket alone or the entire surface
of AID, we identified 10 low-energy compounds predicted to bind in
the catalytic pocket. (E) Structures of the 10 first-generation inhibitor
candidates (C1–C10).
Virtual
high-throughput screening of druglike small molecules against
the catalytic pocket of AID. (A) Ensemble of AID conformations covering
the range of accessible catalytic pockets dynamics used for virtual
high-throughput screening. The structures exhibit conserved overall
structure, with conformational changes localized to the secondary
catalytic loops that compose the walls and floors of the catalytic
pocket. N- to C-termini progression is shown from blue to red. The
purple sphere depicts the catalytic pocket-coordinated Zn. (B) Representative
surface structure of a catalytically productive AID:ssDNA complex
with docked ssDNA (blue) and dC poised for deamination in the catalytic
pocket (magenta). (C) dC bound in a catalytically productive configuration
in the catalytic pocket highlighting the secondary catalytic residues.
Adjacent ssDNA structure was omitted for clarity. This and other energetically
similar conformations of the accessible catalytic pocket state of
AID served as template for virtual high-throughput screening. (D)
The structure-based virtual high-throughput screening scheme. We screened
a large library of lead-like compounds from the ZINC library against
the catalytic pocket of AID. Using several independent and complementary
docking protocols to the catalytic pocket alone or the entire surface
of AID, we identified 10 low-energy compounds predicted to bind in
the catalytic pocket. (E) Structures of the 10 first-generation inhibitor
candidates (C1–C10).Using DOCK Blaster,[101] we screened for
candidates that bind AID in a search space restricted to the catalytic
pocket and proximal surface region (∼10 Å; Figure C and D). For each AID conformation,
we identified the 500 lowest-energy compounds. We then prioritized
compounds that bound several AID conformations over those that bound
only to a single conformation and selected the 40 lowest binding energy
compounds (Table S1). To confirm specificity
for the catalytic pocket, unrestricted docking was repeated using
the entire surface of AID, rather than just the catalytic pocket region.
In addition, we employed a second independent docking algorithm, AutoDock
Vina,[102] to substantiate the 40 candidates
identified by Dock Blaster. Even with access to AID’s whole
surface, 27 of 40 compounds bound preferentially in the pocket, thus
validating our screening methodology while further refining the list
of hits. The 10 inhibitor candidates with the most favorable docking
energies (C1–C10) were selected for functional testing (Figure E).
First Generation
Hits That Inhibit Purified and Endogenous AID
Using the standard
alkaline cleavage enzyme assay for deamination
activity of purified AID/APOBEC enzymes, we measured the catalytic
activity of purified GST-AID in the presence of each compound (Figure S2A and B). We found that 2 of 10 compounds
(C4 and C8) diminished AID activity (8.3% and 17.4% AID catalytic
activity, respectively; Figure A). In addition to purified GST-AID, we tested inhibition
on whole cell extracts of AID-expressing 293T cells transfected with
a CMV-promoter based AID-His expression vector (Figure B). Akin to our results with purified GST-AID,
C4 and C8 inhibited this AID as well (43.9 and 40.3% AID catalytic
activity, respectively). To assess potency, we measured the dose–response
of C4 and C8 against GST-AID and the AID-expressing 293T cell lysate
(Figure C and D).
C4 and C8 showed a similar potency in GST-AID (IC50 = 290
and 230 μM, respectively) as with AID-His (IC50 =
460 and 390 μM, respectively).
Figure 2
First-generation inhibitor candidates
inhibit purified AID and
AID in whole cell extracts. (A) Catalytic activity of bacterially
expressed and purified GST-AID treated with C1–C10 (n = 6 independent experiments conducted with three independently
purified preparations of GST-AID). (B) Catalytic activity of eukaryotic-expressed
AID in whole 293T cell lysate treated with C1–C10 (n = 3 independently prepared AID-expressing whole cell extracts).
(C) Catalytic activity of GST-AID on C4 and C8 as a function of log
inhibitor concentration. (D) Catalytic activity of AID-His 293T lysate
as a function of log inhibitor concentration. (E) Catalytic activity
of eukaryotic-expressed and purified GST-A3A, GST-A3B, GST-A3F, and
GST-A3G treated with 700 μM C8. (F) Catalytic activity of GST-A3A,
GST-A3B, and GST-A3G in comparison to bacterially expressed and purified
GST-AID across a concentration range of C8. All experiments contained
a negative control vehicle-only (140 mM DMSO) reaction which was designated
as 100% AID activity. All AID reactions were performed at 37 °C
for 2–4 h at pH 7.2 using 2 nM of the standard bubble oligonucleotide
substrate TGCbub7 which has previously been demonstrated to be AID’s
most favored substrate in the alkaline cleavage assay. GST-A3A, GST-A3B,
GST-A3F, and GST-A3G reactions were incubated at 37 °C for 2
h in pH 6.0 using 2 nM of standard single-stranded oligonucleotide
substrates containing a single target TTCA motif for A3A, A3B, and
A3F and a single target CCC motif for A3G.
First-generation inhibitor candidates
inhibit purified AID and
AID in whole cell extracts. (A) Catalytic activity of bacterially
expressed and purified GST-AID treated with C1–C10 (n = 6 independent experiments conducted with three independently
purified preparations of GST-AID). (B) Catalytic activity of eukaryotic-expressed
AID in whole 293T cell lysate treated with C1–C10 (n = 3 independently prepared AID-expressing whole cell extracts).
(C) Catalytic activity of GST-AID on C4 and C8 as a function of log
inhibitor concentration. (D) Catalytic activity of AID-His 293T lysate
as a function of log inhibitor concentration. (E) Catalytic activity
of eukaryotic-expressed and purified GST-A3A, GST-A3B, GST-A3F, and
GST-A3G treated with 700 μM C8. (F) Catalytic activity of GST-A3A,
GST-A3B, and GST-A3G in comparison to bacterially expressed and purified
GST-AID across a concentration range of C8. All experiments contained
a negative control vehicle-only (140 mM DMSO) reaction which was designated
as 100% AID activity. All AID reactions were performed at 37 °C
for 2–4 h at pH 7.2 using 2 nM of the standard bubble oligonucleotide
substrate TGCbub7 which has previously been demonstrated to be AID’s
most favored substrate in the alkaline cleavage assay. GST-A3A, GST-A3B,
GST-A3F, and GST-A3G reactions were incubated at 37 °C for 2
h in pH 6.0 using 2 nM of standard single-stranded oligonucleotide
substrates containing a single target TTCA motif for A3A, A3B, and
A3F and a single target CCC motif for A3G.To evaluate toxicity, we incubated multiple cell lines originating
from different tissues (A549, MCF-7, 293T, and Raji) as well as primary
peripheral blood monocytes (PBMC) with C4 or C8 and measured viability
using the standard MTT assay (Figure S2C). C4 caused minor toxicity toward all anchorage-dependent cells
(A549, MCF-7, and 293T) but not Raji or PBMCs, while C8 was found
not to be toxic. We thus focused on C8 for further development.To confirm that C8 was a bona fide AID inhibitor we measured off-target
inhibition of UDG, an enzyme used downstream of AID in the alkaline
cleavage deamination assay and found C8 did not inhibit UDG (Figure S2E). To test specificity, we examined
whether C8 could inhibit the catalytic activity of homologous A3A,
A3B, A3F, and A3G. We found GST-A3A, GST-A3B, and GST-A3G were inhibited
(5%, 5%, and 37% activity, respectively), while GST-A3F was unaffected
(Figure E). Compared
to bacterially expressed GST-AID, 293T-expressed GST-A3A and GST-A3B
were more potently inhibited, and GST-A3G was inhibited to a lesser
degree (IC50 = 220, 60, 120, and 280 μM, respectively; Figure F). Thus, C8 acts
as an AID/A3 inhibitor capable of blocking catalytic activity of specific
family members.C8 inhibition of multiple forms of AID (purified
GST-AID and AID-His
in whole cell lysates of 293T cells) was reassuring. We then examined
the ability of C8 to inhibit native (no fusion tag) full-length AID
expressed in 293T cells and found that surprisingly it was even more
effective at inhibiting untagged AID in whole cell lysates (IC50 = 11 μM) than purified AID was (Figure A). This result prompted us to examine the
ability of C8 to inhibit the endogenous AID of AID-expressing lymphoma
cell lines (Raji, Daudi, and Ramos; Figure B). Given that endogenous AID levels are
significantly lower than that of 293T cells transfected with CMV promoter-driven
AID expression vectors, we were unable to detect AID activity from
cell lysates in the alkaline cleavage assay. Instead, we employed
a more sensitive semiquantitative deamination-specific PCR assay (Deam-PCR)
which we have previously established for measuring AID activity (Figure S3).[7,94,103−105] In this assay, a substrate plasmid is incubated
with AID and subjected to PCR using deamination-specific primers that
amplify DNA deaminated by AID (Figure S4A). Using this assay on C8-treated extracts of all three lymphoma
cells, we also observed inhibition of endogenous AID in the whole
cell extracts of three different lymphoma cell lines (Figure B). The assay also further
confirmed the inhibition of purified AID (Figure B, bottom gel). In addition, we observed
inhibition of exogenous AID expressed in 293T cells, in the whole
cell extract (Figure C).
Figure 3
C8 inhibition of native untagged AID and endogenous AID from B
lymphoma cells. (A) Representative alkaline cleavage experiment demonstrating
C8 inhibition of untagged native AID in 293T whole cell lysate (left
panel). Catalytic activity of eukaryotic-expressed native untagged
AID as a function of log C8 concentration, using 140 mM DMSO as control
for 100% AID activity (right panel). (B) Deam-specific PCR was used
to detect activity of endogenous AID in extracts of AID-expressing
B lymphoma cell lines, with or without addition of C8. The deamination
substrate plasmid was incubated with cell extracts containing endogenously
expressed AID from Raji (top panel), Ramos (middle panel), and Daudi
(bottom panel) with added vehicle (140 mM DMSO) or C8. Incubation
with C8, but not DMSO, abrogated any detectable PCR product. The bottom
gel is a representative experiment demonstrating inhibition of AID-His
using Deam-PCR. (C) Deam-PCR was used to demonstrate inhibition of
AID expressed in 293T cells, in whole cell extract incubated with
the target plasmid.
C8 inhibition of native untagged AID and endogenous AID from B
lymphoma cells. (A) Representative alkaline cleavage experiment demonstrating
C8 inhibition of untagged native AID in 293T whole cell lysate (left
panel). Catalytic activity of eukaryotic-expressed native untagged
AID as a function of log C8 concentration, using 140 mM DMSO as control
for 100% AID activity (right panel). (B) Deam-specific PCR was used
to detect activity of endogenous AID in extracts of AID-expressing
B lymphoma cell lines, with or without addition of C8. The deamination
substrate plasmid was incubated with cell extracts containing endogenously
expressed AID from Raji (top panel), Ramos (middle panel), and Daudi
(bottom panel) with added vehicle (140 mM DMSO) or C8. Incubation
with C8, but not DMSO, abrogated any detectable PCR product. The bottom
gel is a representative experiment demonstrating inhibition of AID-His
using Deam-PCR. (C) Deam-PCR was used to demonstrate inhibition of
AID expressed in 293T cells, in whole cell extract incubated with
the target plasmid.
Inhibition Is Sensitive
to Mutation of Catalytic Pocket Residues
Docking revealed
that C8 binds into the catalytic pocket of AID
(Figure A) and that
akin to AID’s native substrate dC (Figure C) C8 is stabilized through multiple interactions
with secondary catalytic residues in the catalytic pocket.[80] These secondary catalytic residues are housed
on four secondary catalytic loops: Loops 1 (L1), 3 (L3), 5 (L5), and
7 (L7) (L1, L3, L5, and L7 correspond to the β1-α1 loop,
β2-α2 loop, β3-α3 loop, and β4-α4
loop, respectively; Figure A, left panel).[80]Table describes the C8-stabilizing
residues across multiple AID conformations, showing π–π
stacking and hydrogen-bonding residues. In all AID-C8 complexes, the
C8 benzimidazol-2-one is anchored deep in the catalytic pocket where
it can π–π stack with H56 and/or Y114 and hydrogen
bond with T27, E58 backbone, and S85 backbone. Notably, these are
the same residues that interact with AID’s native substrate
dC for stabilization and deamination in the catalytic pocket.[80,82] The C8-tail bound the ssDNA binding grooves with some flexibility;
AID-C8 complexes bound the C8-tail either in the L1-L7 interface (between
R25, W84 and Y114; Figure A, middle panel) or in the L1-L3 interface (between R25, E26,
N51, and K52; Figure A, right panel), with a slight preference for the former. The tail-carbonyl
hydrogen bonded with N51, H56, or Y114, while the furan group π–π
stacked with W84 or Y114 and hydrogen bonded with R25 or Y114 (Table ). Thus, the benzimidazol-2-one
was firmly bound in the catalytic pocket, while the flexible tail
was predicted to adopt several configurations interacting with the
ssDNA binding grooves.
Figure 4
Predicted binding modes of AID-C8 and inhibition-resistant
mutations
(A) Representative AID-C8 docked complex illustrating a low-energy
binding mode (left), with the AID surface colored according to surface
charges (blue = positive, red = negative, purple = Zn-coordinating
residues). The C8 benzimidazol-2-one scaffold was bound deep in the
catalytic pocket with the tail region bound outside, stabilized by
several ssDNA binding residues in either the L1-L7 (middle) or L1-L3
(right) interface. For each inhibitor, the carbon backbone is colored
green, with nitrogen, oxygen, and hydrogen colored blue, red, and
white, respectively. (B) C8 inhibition of AID mutants targeting residues
predicted to interact with C8 by docking analysis, demonstrating that
mutation of ssDNA binding groove residues that directly stabilize
C8 results in resistance to C8 inhibition. Dr-AID, representing an
evolutionary-distant AID orthologue was unaffected by C8.
Table 1
Inhibitor Binding Residues across
All Conformations of AID, A3A, A3B, and A3G
Predicted binding modes of AID-C8 and inhibition-resistant
mutations
(A) Representative AID-C8 docked complex illustrating a low-energy
binding mode (left), with the AID surface colored according to surface
charges (blue = positive, red = negative, purple = Zn-coordinating
residues). The C8 benzimidazol-2-one scaffold was bound deep in the
catalytic pocket with the tail region bound outside, stabilized by
several ssDNA binding residues in either the L1-L7 (middle) or L1-L3
(right) interface. For each inhibitor, the carbon backbone is colored
green, with nitrogen, oxygen, and hydrogen colored blue, red, and
white, respectively. (B) C8 inhibition of AID mutants targeting residues
predicted to interact with C8 by docking analysis, demonstrating that
mutation of ssDNA binding groove residues that directly stabilize
C8 results in resistance to C8 inhibition. Dr-AID, representing an
evolutionary-distant AID orthologue was unaffected by C8.Indicates the peptide backbone of
that residue.To verify
these docking results, we measured C8 inhibition on a
panel of AID mutated at residues predicted to interact with C8 (Figure B). We were limited
in testing AID mutations, as many of the predicted C8-stabilizing
residues are key catalytic residues, wherein even conservative mutations
(e.g., N51Q and Y114F) result in catalytically dead mutants;[80] however, we tested the ability of C8 to inhibit
AID mutated at each of the other contact residues located at the opening
of the catalytic pocket (K22A, E26R, R25H, and R25N) and found that
all were resistant to C8 inhibition when compared to wild type (IC50 = 500, > 1000, 890, and 770 μM, respectively).
Zebrafish
AID (Dr-AID) which naturally has a R25H equivalent, was also resistant
to C8 inhibition (IC50 > 1000 μM). These data
provide
functional validation for the predicted AID-C8 interactions and highlight
the specificity of C8 for the catalytic pocket and adjacent regions
of human AID.
Structural analogues of C8 also exhibit AID
inhibition
Using the ZINC database, we identified 948 structural
analogues of
C8 using a 60% structural similarity cutoff. We docked each analogue
with several conformations of catalytically accessible AID, using
the same approach described in Figure . Analogous to our initial screening approach for identifying
C1–C10, we ranked C8 analogues based upon docking energy and
chemical diversity and selected 15 analogues (C8.1–C8.15; Figure S5A). C8 analogues were tested for inhibition
of purified AID (Figure S5B and C), and
several were found to inhibit AID. Compared to C8, C8.5 and C8.12
inhibited purified GST-AID (230, 140, and 160 μM, respectively)
and 293T-expressed AID-His (390, 340, and 370 μM, respectively)
by a relatively similar degree (Figure A and B). However, C8 potently inhibited AID (untagged)
to a far greater extent in comparison to C8.5 and C8.12 (IC50 = 11, 90, and 130 μM, respectively; Figure C). We performed a similar screen for C4
analogues (C4.1–C4.5), but none were capable of inhibiting
AID (Figure S6A and B). Thus, C8, C8.5,
and C8.12 inhibit AID activity to a similar degree, except for untagged
AID in whole cell extracts which was most potently inhibited by C8.
We then carried out a modified intracellular version of the Deam-PCR
assay described above, wherein the plasmid substrate was transfected
into the lymphoma cells that were either treated or untreated with
C8 and C8.5, followed by cell lysis and Deam-PCR to measure mutations
mediated by endogenous AID (Figure S4A).
Here we also observed that C8 and C8.5 treatment of transfected Raji
cells inhibited AID-mediated mutations (Figure S4B)
Figure 5
Structural analogues of C8 exhibit variable potency against AID,
A3A, A3B, and A3G. Each panel measured catalytic activity in the presence
of C8, C8.5, or C8.12. (A) Bacterially expressed and purifiedGST-AID.
(B) Eukaryotic-expressed AID in whole 293T cell lysate. (C) Eukaryotic-expressed
native untagged AID in whole 293T cell lysate. (D) Eukaryotic-expressed
and purified GST-A3A. (E) Eukaryotic-expressed and purified GST-A3B.
(F) Eukaryotic-expressed and purified GST-A3G. (G) Eukaryotic-expressed
native untagged A3A in 293T cell lysate. (H) List of C8, C8.5, and
C8.12 IC50 values across each enzyme. All experiments used
140 mM DMSO as a negative control, designating 100% enzyme activity.
All AID reactions were performed at 37 °C for 2–4 h at
pH 7.2 using 2 nM of the standard bubble oligonucleotide substrate
TGCbub7. GST-A3A, GST-A3B, GST-A3F, and GST-A3G reactions were incubated
at 37 °C for 2 h in pH 5.5 for A3A/B and pH 6.0 for A3G/F using
2 nM of standard single-stranded oligonucleotide substrates containing
a single target TTCA motif for A3A, A3B, and A3F and a single target
CCC motif for A3G.
Structural analogues of C8 exhibit variable potency against AID,
A3A, A3B, and A3G. Each panel measured catalytic activity in the presence
of C8, C8.5, or C8.12. (A) Bacterially expressed and purifiedGST-AID.
(B) Eukaryotic-expressed AID in whole 293T cell lysate. (C) Eukaryotic-expressed
native untagged AID in whole 293T cell lysate. (D) Eukaryotic-expressed
and purified GST-A3A. (E) Eukaryotic-expressed and purified GST-A3B.
(F) Eukaryotic-expressed and purified GST-A3G. (G) Eukaryotic-expressed
native untagged A3A in 293T cell lysate. (H) List of C8, C8.5, and
C8.12 IC50 values across each enzyme. All experiments used
140 mM DMSO as a negative control, designating 100% enzyme activity.
All AID reactions were performed at 37 °C for 2–4 h at
pH 7.2 using 2 nM of the standard bubble oligonucleotide substrate
TGCbub7. GST-A3A, GST-A3B, GST-A3F, and GST-A3G reactions were incubated
at 37 °C for 2 h in pH 5.5 for A3A/B and pH 6.0 for A3G/F using
2 nM of standard single-stranded oligonucleotide substrates containing
a single target TTCA motif for A3A, A3B, and A3F and a single target
CCC motif for A3G.
C8.5 Inhibits the Catalytic
Activities of A3A, A3B, and A3G
Given inhibition by C8 on
A3A/B/G (Figure E
and F), we next sought to examine inhibition
with C8.5 and C8.12 (Figure D–G). We found that C8.5 inhibits GST-A3A, GST-A3B,
and GST-A3G and is remarkably more potent on inhibition of GST-A3A
than any other inhibition value measured thus far in our assays (IC50 = 3, 40, and 70 μM, respectively). In contrast, C8.12
moderately inhibited GST-A3A but was a poor inhibitor of GST-A3B and
GST-A3G (IC50 = 290, 700, and 740 μM, respectively).
We also performed a similar screen of C8, C8.5, and C8.12 on GST-A3F
but did not detect any inhibition (Figure S6C). Like the AID (untagged) expression system, we also attempted to
express untagged versions of A3A and A3B but were only successful
with A3A. We found that untagged A3A was rendered completely resistant
against C8 (IC50 > 1000 μM), while C8.5 most potently
inhibited its activity (IC50 = 9 μM), and C8.12 moderately
inhibited its activity (IC50 = 170 μM; Figure G). Thus, of all the inhibition
values measured in this study, the most potent inhibition was in the
case of C8.5 and A3A, in single digit micromolar values for two independent
forms of the enzyme.
Predicted AID/APOBEC3-Inhibitor Complexes
Vary by Structural
Differences between Secondary Catalytic Loops
Given the varying
degrees of inhibition by C8, C8.5, and C8.12 on AID, A3A, A3B, and
A3G, we sought a structural rationale for the observed differences.
C8, C8.5, and C8.12 conserve the warhead benzimidazol-2-one group,
the tail carbonyl, and a flexible, largely hydrophobic tail containing
a planar ring available for π–π stacking (Figure , top panel). For
each enzyme, we examined inhibitor binding variability across two
to three catalytically accessible conformations and tabulated the
most frequently contacted residues by the inhibitors (Table ).[80,83]
Figure 6
Favored
binding modes of C8, C8.5, and C8.12 with AID, A3A, A3B,
and A3G. Top panel: Conserved benzimidazol-2-one scaffold (blue) among
C8, C8.5, and C8.12. Each inhibitor contains a tail-carbonyl and a
flexible tail containing a largely hydrophobic aromatic ring. Each
enzyme–inhibitor pair depicts the benzimidazol-2-one group
bound deep in the catalytic pocket, with the flexible tail stabilized
by different combinations of the secondary catalytic loops (L1, L3,
L5, and L7). Stabilizing residues are transparent, colored, and labeled.
The Zn-coordinating residues are colored purple. For each inhibitor,
the carbon backbone is colored green, with nitrogen, oxygen, and hydrogen
colored blue, red, and white, respectively.
Favored
binding modes of C8, C8.5, and C8.12 with AID, A3A, A3B,
and A3G. Top panel: Conserved benzimidazol-2-one scaffold (blue) among
C8, C8.5, and C8.12. Each inhibitor contains a tail-carbonyl and a
flexible tail containing a largely hydrophobic aromatic ring. Each
enzyme–inhibitor pair depicts the benzimidazol-2-one group
bound deep in the catalytic pocket, with the flexible tail stabilized
by different combinations of the secondary catalytic loops (L1, L3,
L5, and L7). Stabilizing residues are transparent, colored, and labeled.
The Zn-coordinating residues are colored purple. For each inhibitor,
the carbon backbone is colored green, with nitrogen, oxygen, and hydrogen
colored blue, red, and white, respectively.Figure illustrates
the dominant binding mode (shown in stick), while stabilizing residues
are shown in colored transparent surfaces. In AID, the C8-tail frequently
bound the L1-L7 interface with some conformations in the L1-L3 interface,
while the C8.5 tail bound the L5-L7-α4 region and the C8.12-tail
bound the L1-L3 interface (Figure , AID panel). In A3A, we found conformations of inhibitors
where the benzimidazol-2-one group bound facing the Zn-coordinating
residues and some that were slightly tilted toward L3 (shown for A3A-C8.5).
For A3A, the C8-tail and C8.12-tail both bound between L1-L3-L7, with
most interactions in the L1-L7 interface. The C8.5-tail preferentially
bound the L5-L7-α4 region, with some conformations in the L1-L7
interface (Figure , A3A panel). Like A3A, we also noted different configurations of
the benzimidazol-2-one group bound in the catalytic pocket of A3B,
either slightly tilting toward L3 or L7. Furthermore, among all enzyme–inhibitor
pairs, we noted the most inhibitor-tail flexibility when bound with
A3B. For A3B, the C8-tail most frequently bound the L1-L3 interface,
with other configurations bound with L1-L7, L1-L5-α4, and L3-L5-L7.
Akin to AID and A3A, in A3B, the C8.5-tail frequently bound the L5-L7-α4
interface, with some binding also observed on the L1-L7 interface.
The C8.12-tail primarily bound the L3-L5-L7-α3 interface, with
some bound in the L1-L7 interface (Figure , A3B panel). Unlike A3A or A3B, we found
A3G bound C8, C8.5, and C8.12 with limited variability, whereby each
tail only bound in the L1-L7 interface, except for the C8.12-tail,
which had some conformations bound in the L1-L3 interface (Figure , A3G panel). Across
AID, A3A, A3B, and A3G, we noted the benzimidazol-2-one was consistently
stabilized by the Zn-coordinating histidine (H56, H70, H253, and H257,
respectively) as well as several secondary catalytic residues, including
the L1 threonine (T27, T31, T214, and T218, respectively), the L5
serine backbone (S85, S99, S282, and S286, respectively), and the
L7 tyrosine (Y114, Y130, Y313, and Y315, respectively; Table ). Interestingly, the C8.5-tail
preferentially bound the L5-L7-α4 interface of AID, A3A, and
A3B, whereby the tail phenylpropane π–π stacked
with one or more aromatic residues.
Discussion
Since
AID expression drives and exacerbates tumorigenesis, an AID
inhibitor has been suggested to be of benefit; however, development
of such an agent has not been possible since structural insights into
AID were not gleaned until recently. The functional and native structure
of AID described in 2015 revealed two structural features that explained
the unusually low catalytic rate and high ssDNA binding affinity of
AID.[80] These include frequent catalytic
pocket closure and sporadic ssDNA binding by a highly positively charged
surface, in positions that are not deamination-viable. This structural
analysis, which has since been confirmed by two partial AID crystal
structures,[81,82] provided an understanding of
AID’s catalytic pocket conformational dynamics and ssDNA stabilization
interactions proximal to the pocket (Figure C), thus offering an opportunity for structure-based
inhibitor design.Using these AID–DNA interactions, we
scanned the ZINC database
of lead-like compounds and identified 10 compounds for testing against
AID (Figure D and
E). We identified two compounds, C4 and C8, that inhibit the enzymatic
activity of AID (Figure ). MTT assay showed that C8 was noncytotoxic across several cell
lines and primary healthy donor cells tested (Figure S2C). This suggested that C8 or derivatizations could
be used successfully in future in vivo studies, and hence, we focused
on C8. Although purified GST-AID was inhibited by C8 with modest IC50 values (230 μM), C8 could inhibit multiple forms of
purified AID (both bacterially expressed GST-AID and 293T-expressed
AID-His) as well as AID in whole cell lysates of 293T cells (Figure ). It was particularly
encouraging that native untagged AID in whole cell 293T extract, which
is the best representative of endogenous cellular AID, was inhibited
by C8 with the most effective IC50 of 11 μM, ∼21-fold
more effectively than purified AID (Figure A). This increased susceptibility of C8 inhibition
is likely due to the higher specific activity of native untagged AID,
thus requiring a lower amount of enzyme to achieve similar catalytic
rates when compared with purified GST- or His-tagged AID.Since
the topological features of the AID catalytic pocket and
surrounding region are unique, we rationalized that these residues
ought to act as specific anchors for small molecule placement. Furthermore,
inhibitor binding in this region would impart a level of structural
specificity to our strategy. Akin to dC stabilization (Figure C), we noted several stabilizing
interactions with secondary catalytic residues and DNA binding residues
in and proximal to the catalytic pocket, respectively (Figures A and 6). The benzimdazol-2-one group was anchored in the catalytic pocket,
while the tail group adopted several conformations in the DNA binding
groove. To bolster our understanding of the AID-C8 complexes, we probed
several nonlethal surface mutants predicted to destabilize C8 binding.
DNA binding groove mutants (K22A, R25H, R25N, and E26R) as well as
evolutionary-distant Dr-AID were all resistant to C8 inhibition with
increased IC50 values (Figure B).Using C8 as a parent compound,
we examined 948 structural analogues,
obtaining 15 (C8.1–C8.15) for analysis. Analogues lacking the
protruding carbonyl characteristic of the benzimidazol-2-one group
(C8.2, C8.3, C8.4, C8.13, and C8.14) resulted in a major loss of AID
inhibition (Figure S5), likely due to a
loss of stabilizing hydrogen bonds. C8.5 and C8.12 inhibited AID with
similar potencies relative to C8 (Figure A and B), except in the case of native AID
(untagged), whereby C8 was superior with an IC50 of 11
μM (Figure C).
Akin to C8, C8.5 and 8.12 conserve the benzimidazol-2-one group, a
tail carbonyl, and a flexible, mostly hydrophobic tail with a planar
ring available for π–π stacking interactions (Figure , top panel).To assess the specificity of C8 within the AID/A3 family, we measured
the inhibition of A3A, A3B, A3F, and A3G. Surprisingly, A3A was potently
inhibited by C8, A3B and A3G were moderately inhibited, and A3F was
unaffected (Figure E and F). Even more surprising is the fact that C8.5, a C8 analogue
screened against the catalytic pocket of AID, potently inhibited A3A,
A3B, and A3G while it moderately inhibited AID (Figure ). However, in retrospect, we believe this
result was achieved for two reasons: First, the catalytic pockets
of AID, A3A, A3B, and A3G each evolved to be conducive to a polynucleotide
containing dC, and, as such, many of the secondary catalytic residues
that stabilize dC also stabilize C8.5 (e.g., hydrogen bonding with
the conserved L1 threonine residue T27, T31, T214, and T218, respectively; Table ). Second, our screening
of AID exclusively included accessible catalytic pockets, which are
predicted to represent the minority of conformations at any given
time.[80,83] A3A and A3B-CTD have high sequence similarity,
with the largest differences observed in the L1 region, with identical
sequences for L5, L7, and α4 (Figure S6D). Given the dominant binding modes of the C8-tail (L1-L7 vs L1-L3
interface for A3A and A3B, respectively) and the C8.12-tail (L1-L7
vs L3-L5-L7-α3 interface for A3A and A3B, respectively; Figure ), we were not surprised
to find differences in inhibition potencies. In support of the importance
of these interactions, we note that A3F, which is unaffected by C8,
C8.5, or C8.12 (Figure S6C), has major
sequence differences of L1/L3/L5/L7 when compared to A3A/B/G as well
as notable differences in its catalytic pocket chemistry that may
explain a lack of inhibition (Figure S6D). Such differences include alternative Zn-coordination[106] and differences among key inhibitor-stabilizing
residues (e.g., L1 threonine stabilizing residue is instead a serine,
S216; Figure S6D). We noticed that A3A-inhibitor
and A3B-inhibitor complexes with the benzimidazol-2-one group bound
directly facing toward either Zn or Zn-coordinating residues, but
also a tilted angle toward L3 or L5 residues (an example is shown
in Figure for A3A-C8.5),
a feature not observed for AID or A3G. We postulate that these tilted
binding modes observed for A3A and A3B may bestow increased conformational
freedom of inhibitors, thus allowing for overall enhanced stabilization
of the benzimidazol-2-one group in the catalytic pocket.For
both AID and A3s, we ensured that multiple enzyme versions
are tested for inhibition, including GST-tagged purified and untagged
in whole cell extracts and, in some cases, several other forms expressed
in different cell hosts. In our view, this approach provides critical
confidence that the observed inhibition is not a property of one version
of the enzyme, as fusion tags and expression hosts are well-known
to impact protein conformations and, in the case of enzymes, specific
activity levels.[107] However reassuring
the results were that in each case multiple forms of AID and A3s were
indeed inhibited, the differences in IC50 values between
the various versions are a testament to such conformational differences
due to expression hosts and fusion tags.In the future, to maximize
binding affinity and specificity among
AID/A3 family members, several strategies could be undertaken to generate
more efficacious inhibitors. Derivatization of the C8/C8.5/C8.12 benzimidazol-2-one
scaffold could introduce additional hydrogen bonding pairs, conducive
to the secondary catalytic residues of the AID/A3 catalytic pocket
to improve affinity. Additionally, these derivatives could be modified
to take advantage of the multiple regions bound by the tail of each
inhibitor. Instead of a single, flexible tail connected to the benzimidazol-2-one
scaffold, a 2- or 3-tail inhibitor could be constructed, such that
the L1-L3, L1-L7, and L5-L7-α4 regions are bound simultaneously.
Because most structural differences reside within these secondary
catalytic loops, in particular the L1-L7 interface, such multitailed
derivatives could achieve improved affinity and specificity to the
desired AID/A3 target. An alternative approach for AID/A3 inhibitor
design might include screening for inhibitors that bind accessible
and/or partially occluded conformations of the catalytic pocket. This
would help improve the specificity of inhibitors toward AID/A3 family
members, based upon their catalytic pocket dynamics. For example,
inhibitors restricted to the partially occluded conformations of A3B
would be less likely to bind A3A, due to A3A’s superior catalytic
pocket accessibility.[83−86,96] Such inhibitors would prove useful
as probes to study AID/A3 biology and as initial scaffolds for future
drug design. This effort could achieve either specific inhibition
of A3A or A3B, or pan inhibition of AID, A3A and A3B, three highly
tumorigenic AID/APOBEC family member enzymes. Unlike C8, the C8.5
and C8.12 tail-structures were much more hydrophobic (Figure ) and required sonication to
coax dissolution (see Methods). Despite our
best efforts, our working aliquots were only partially dissolved in
solution and thus, the true IC50 of C8.5 and C8.12 may
likely be even lower than the experimentally observed IC50. We suggest that future modifications of tail structures that include
hydrogen bond donor/acceptor groups should dramatically improve solubility.Recently, several studies of SARS-CoV-2 have suggested that APOBECs
may contribute to viral mutagenesis.[108−112] Similar to their role in promoting immune
and drug escape in HIV,[47,113−115] APOBEC-induced mutations could generate SARS-CoV-2 variants with
enhanced immune evasion and drug resistance. It is tempting to consider
APOBEC inhibitors in this context, in addition to therapeutics that
thwart cancer genome mutations. Beyond their potential for therapeutic
development, the demonstration that C8 and analogues, which were identified
based on docking into the Schrodinger’s CATalytic pocket, could
indeed functionally inhibit AID and A3s provides further verification
of the pocket structure itself. In the future, these and other derivatized
inhibitors could also be useful as biochemical probes for studying
the interaction of AID and A3s with DNA/RNA and as novel tools for
studying their biology.
Methods
Virtual High-Throughput
Screening of Small Molecules against
the Catalytic Pocket of AID
The AID structure used for high
throughput in silico identification of first generation hits is based
on the functional and native AID structure described previously though
a combined computational–biochemical method.[80] This structure has been verified by a partial AID crystal
structure.[81,82] In addition, since the catalytic
pocket of AID was designated as the inhibitor target for this study,
this structure is advantageous because it includes several dynamic
conformations of AID’s catalytic pocket, verified by biochemical
analysis of AID variants.[80] Briefly, this
structure was generated by modeling full-length AID based on eight
X-ray or APOBEC structures as templates for homology modeling: A2
NMR (PDB: 2RPZ), A241–224 chain A and B X-ray (PDB: 2NYT:A and 2NYT:B, respectively),
A3A NMR (PDB: 2M65), A3C X-ray (PDB: 3VOW), A3F-CTD X-ray (PDB: 4IOU), A3G-CTD NMR (PDB: 3E1U), and A3G-CTD NMR (PDB: 2KBO).[84,116−121] All AID/APOBEC X-ray/NMR structures were obtained from the protein
databank (http://www.rcsb.org) and visualized using PyMOL v1.7.6 (http://www.pymol.org). Using the default parameters of I-TASSER
(http://zhanglab.ccmb.med.umich.edu/I-TASSER/),[122,123] full-length human AID (Hs-AID) and variants
were modeled from APOBEC templates. The catalytic pocket is defined
as the indented space containing Zn and the catalytic residues (H56,
E58, C87, and C90 in Hs-AID).Using DOCK Blaster v1.6.0 (http://blaster.docking.org/),[101] we virtually screened 4.6 ×
106 “clean-lead” small molecules from the
ZINC database (http://zinc.docking.org/) against the catalytic pocket of AID. We used several AID-DNA complexes
containing dC in the catalytic pocket as a template for screening.[80] In total, we screened five low energy conformations
of AID, representative of the range of catalytically accessible catalytic
pocket conformations. As a result, small molecules were screened for
their ability to bind to the catalytic pocket and surrounding DNA
binding grooves across the ensemble of catalytically active AID structures.
Compounds were docked and ranked based on binding energy. The 500
lowest energy compounds bound to each AID conformation were cross-referenced,
and compounds predicted to bind only one catalytically active AID
conformation were excluded. The 40 lowest-energy compounds (Table S1) bound across several catalytically
active AID conformations were selected for additional docking using
Autodock VINA (http://vina.scripps.edu)[102] via PyRx (https://pyrx.sourceforge.io)[124] to confirm specificity to the catalytic
pocket and ranked energies. The top 10 compounds (C1–C10) were
then selected for testing based on binding energy ranking as well
as the chemical diversity of structure side chains. Compounds were
purchased from Molport, and purities were > 90%. C4, C8, C8.5 and
C8.12. Molport IDs are, respectively, 008-366-081, 009-139-310, 005-764-107,
020-119-012. For analogue expansion, we used the same approach against
the ZINC database to identify 948 structural analogues of C4 and C8
considering a 60% similarity cutoff. Using Autodock VINA via PyRx,
we screened the five AID conformations and identified the top ranking
structural analogues of C4 and C8. Additionally, we constructed a
full-length AID model based the crystal structure of near-native AID
(PDB: 5W1C)[82] for binding with inhibitors to confirm AID-inhibitor
binding modes. Two A3A conformations were selected from PDB 2M65 (apo) and PDB 5KEG (ssDNA-bound), so
the catalytic pockets represent both unbound and bound states (with
respective rotations of Y130/Y132, etc). For A3B-CTD, we chose two
NMR conformations (PDB: 2NBQ) that had rotations of the equivalent tyrosine residues
(Y313 and Y315), albeit different from the A3A structures. For A3G-CTD,
we used two structures to represent different conformations (PDB: 3E1U and 3IR2). When examining
AID/A3–inhibitor binding mode interactions, we examined three
or more low-energy clusters per conformation for a total of six or
more docked complexes per enzyme–inhibitor pair.
Expression
and Purification of AID/APOBECs
Expression
and purification of GST-AID and AID-His in bacteria and HEK 293T cells
have previously been described.[80,125,126] Briefly, for bacterially expressed GST-AID, the expression and purification
of human (Hs-AID), Hs-AID mutants, and zebrafish (Dr-AID) GST-AID
were carried out as previously described, using the pGEX5.3 expression
system.[92,126] Point mutants were generated by site-directed
mutagenesis using appropriate GST-AID constructs as templates, as
previously described.[80] Briefly, GST-AID
was expressed in E. coli (DE3-Bl21) and purified
using GST-column chromatography as per the manufacturer’s guidelines.
In total, six independent preparations of GST-AID and two independent
preparations of each mutant/chimeric/orthologous AID were made and
tested in parallel. The expression and catalytic activities of AID-His
and untagged native AID in HEK 293T cells were carried out as previously
described, using the pcDNA3.1 expression system.[94,107] Briefly, 50 × 10 cm plates of 293T cells were transfected with
5 μg of plasmid per plate using Polyjet transfection reagent
(Froggabio), incubated for 48 h, resuspended in 50 mM phosphate buffer
(pH 8.2 for AID-His and pH 7 for untagged AID) + 500 mM NaCl, 0.2
mM PMSF, 50 μg/mL RNase A, and lysed using a French pressure
cell press. Whole cell extracts expressing either AID-His or native
untagged AID were flash frozen in liquid nitrogen and stored for activity
analysis. GST-tagged versions of A3A/B/F/G expressed in HEK-293T were
purified using GST-beads per the manufacturer’s guidelines
as previously described.[107,127] The GST-tagged A3
enzymes were stored in 20 mM Tris-HCl pH 7.5, 100 mM NaCl, 1 mM DTT,
5% glycerol, and 50 μg/mL BSA. Whole cell extracts expressing
native untagged A3A obtained as described above were flash frozen
in liquid nitrogen. Expression of AID/APOBECs was verified using Western
blotting probed with anti-GST (SantaCruz) antibodies, followed by
the secondary detection by goat anti-rabbit IgG (SantaCruz). Western
blots to verify the expression of untagged native AID/APOBECs were
probed with rabbit polyclonal anti-AID antibody (Abcam) or anti-A3A
antibody (Abcam), respectively, followed by the aforementioned secondary
IgG. The relative yield and purity of each purified AID/APOBEC preparation
were evaluated using standard SDS Coomassie staining. The concentration
of 293T-purified AID/APOBECs ranged from 10 to 50 ng/μL. The
concentration of native untagged AID ranged from 0.7 to 7 ng/μL,
and for untagged native A3A it was 4 ng/μL.
Inhibition
of AID/APOBEC Using the Alkaline Cleavage Assay
The standard
alkaline cleavage assay for AID/APOBEC-mediated deamination
was used to screen compounds for inhibition of AID, A3A, A3B, A3F,
and A3G. For AID reactions, we used the standard seven-nucleotide
bubble substrate containing the WRC motif TGC (5′-TTTGCTT-3′) as a substrate, because it has previously
been optimized for the highest levels of AID activity in this enzyme
assay.[80,125,128] For A3A,
A3B, and A3F, we used a single-stranded oligo containing the preferred
5′-TC-3′ dinucleotide;[106,129,130] and for A3G, we used a single-stranded 5′-CCC-3′
oligo (Figure S7). Substrates were labeled
and purified as described previously.[80,92,125] For AID reactions, purified substrate (1.7 nM) was
incubated with AID enzyme (∼0.9 μg of bacterially expressed
GST-AID, 1–10 ng of 293T-expressed AID-His or untagged AID)
in phosphate buffer (100 mM, pH 7.2) with H2O, 140 mM DMSO
as a vehicle control or compound (see below). All AID reactions were
incubated at 37 °C, except for Dr-AID, which was incubated at
25 °C, which was previously shown to be its optimal temperature,[92,126] in a total volume of 10 μL. For A3A, A3B, A3F, and A3G, alkaline
cleavage reactions were conducted in the same manner (12–30
ng of GST-tagged A3s and 12 ng of native untagged A3A used per reaction),
except they were incubated in buffers of more acidic pH since the
A3 family enzymes are optimally active at a more acidic pH as compared
to AID.[131,132] The A3G/A3F activity buffer was 100 mM phosphate
buffer (pH 6.0), 1 mM DTT, and 50 μg/mL RNase A. The A3A/A3B
activity buffer was 25 mM HEPES (pH 5.5), 100 mM NaCl, 1 mM DTT, 0.1%
Triton X-100, and 100 ng/mL RNase A. Concentrated stocks of compound
were sonicated at 37 °C for up to 4 h in 1–10% DMSO to
promote dissolution. Due to solubility differences between compounds,
the highest achieved concentration in 140 mM DMSO was used for initial
screening. For initial C1–C10, C4.1–C4.5, and C8.1–C8.15
screening, final alkaline cleavage concentrations (in 140 mM DMSO)
ranged from 500 to 840 μM, except for C9, C8.4, C8.5, and C8.9
which had lower concentrations (310, 400, 170, and 270 μM, respectively).
The A3 enzymes tolerated a higher [DMSO] as compared to AID which
began to show reduced activity at 10% DMSO; therefore, for the A3
inhibition assays, we were able to use a 1:10 dilution of C8, C8.5,
and C8.12 which were dissolved in 100% DMSO, thus bringing the final
reaction to 10% (1.4 M) [DMSO].
Inhibition of Endogenous
AID in Lymphoma Cells Measured by the
Deamination-Specific PCR Assay
Daudi, Raji, and Ramos cells
(ATCC = CCL-213, CCL-86, and CRL-1596, respectively) were suspension
cultured for 48 h in RPMI 1640 growth media containing 10% FBS. To
lyse the cells, cultures were centrifuged and cells (2.5–3
× 106 cells) were washed twice with PBS. Cell were
lysed using glass beads (425–600 μm) (Sigma). The volume
of glass beads used was twice the volume of the cell pellet. The mixture
was then vortexed for 20–30 s and then incubated on ice for
30 s. This was repeated two to three more times in PBS with 0.2 mM
PMSF, 50 μg/mL RNase A. To detect AID expression in these cells,
we employed quantitative real-time PCR. Total RNA was extracted from
106 cultured cells using TRIzol solution (Invitrogen) according
to the manufacturer’s instructions. The quality and quantity
of the extracted RNA were estimated by spectrometry. Total RNA (1
μg) was used for cDNA synthesis with the ProtoScript First Strand
cDNA Synthesis Kit (NEB, UK). Quantitative real-time PCR was performed
in triplicate using SYBR Green I. The amount of GAPDH housekeeping
gene transcripts was used as a reference for the level of AID gene
expression. Amplification was carried out in a total volume of 20
μL containing 0.5 μg of cDNA prepared as described above,
0.3 μM each of GAPDH- and AID-specific primers, and 1×
reaction mixture consisting of RNase-free water and 2× QuantiTect
SYBR Green PCR Master Mix (Qiagen). Thermal cycling for both genes
was initiated with a denaturation step at 95 °C for 10 min, followed
by 40 cycles of denaturation at 95 °C for 15 s, annealing at
50 °C for 30 s, and elongation at 72 °C for 1 min. Melting
curve analysis of the amplification products was performed at the
end of each PCR by cooling the samples to 60 °C and then increasing
the temperature to 95 °C at 0.2 °C/s. The experiment was
repeated three times for statistical analysis.To detect AID
activity in cell extracts, we utilized a deamination-specific PCR
(Deam-PCR) as previously described for measuring the mutational patterns
of purified AID.[7,94,103−105] In this assay, a plasmid DNA substrate is
incubated with AID and mutations are detected by amplification using
deamination-specific primers. Here, we applied this assay in three
iterations: first, to measure inhibition of purified AID incubated
with the substrate plasmid; second, to measure inhibition of AID in
whole cell extracts of lymphoma cells incubated with the substrate
plasmid; and third, we extended the application of this assay to detecting
intracellular AID-mediated mutations on transfected plasmid substrate.
Briefly, the plasmid used as a substrate for the deamination specific
PCR assay was pcDNA3.1 containing a random WRC-rich target sequence
as previously described. An amount of 50 ng of supercoiled plasmid
was denatured at 98 °C in 100 mM phosphate buffer pH 7.2 for
10 min followed by snap-cooling in an ice bath to generate ssDNA targets
for AID to mutate. A volume of 4 μL of either purified AID or
AID-expressing cell lysates (from Daudi, Raji, or Ramos which are
naturally AID+ B lymphomas, or from 293T cell transfected with an
AID expression vector as described above) was mixed either with C8
or vehicle, added to the target plasmid, and incubated for 4 h at
32 °C. To detect AID-mediated mutations, 1 μL of each reaction
was amplified by deamination-specific nested PCR using mutation-specific
primers, as previously described.[7,94,103,104] The inner primers
used in the second nested PCR reaction generate a 451 nt-product.
PCR amplicons were subsequently TA-cloned, and 10 from each reaction
were sequenced to confirm AID-mutated mutations C to T or G to A on
the sense and nonsense strands. Negative control deamination-specific
reactions were conducted on the substrate alone with no added extract
or AID. Positive control Deam-specific PCRs were conducted on reactions
containing substrate plasmid and extracts of AID-expressing 293T cells,
as described above. An additional negative control PCR reaction was
carried out by adding C8 directly to the PCR reaction of the aforementioned
positive control, in order to make sure that our observation of inhibition
of endogenous lymphoma cell AID by C8 is not due to interference of
C8 with the deamination-specific PCR step itself. For the intracellular
endogenous AID inhibition assay, Raji cells were suspended for 48
h in RPMI 1640 growth media containing 10% FBS. Then 1–2 ×
106 cells were transfected with 5 μg of the same
substrate plasmid DNA as above. Cells were treated with either C8
or C8.5 at a final concentration of 700 μM or treated with vehicle
immediately prior to transfection with the AID substrate plasmid.
Untreated transfected Raji cells and untreated untransfected Raji
cells were used as negative controls. To lyse the cells 24 h post
transfection, cultures were centrifuged and cells were washed twice
with PBS. Cell were lysed using glass beads (425–600 μm)
(Sigma). The volume of glass beads used was twice the volume of the
cell pellet, followed by vortexing for 20–30 s and incubation
on ice for 30 s. This procedure was repeated three times. Then 1 μL
of the cell lysate or 1 μL of each condition diluted 100 times
was subject to Deam-PCR as described above. To confirm that compounds
do not interfere with Taq polymerase, either inhibitor-treated or
untreated/transfected Raji cell lysate was added to the positive PCR
reaction.
MTT Assay
A breast cancer cell line (MCF-7; ATCC =
HTB22), lung cancer cell line (A549; ATCC = CCL185), embryonic kidney
cell line, (293T; ATCC = CRL3216), B cell lymphoma cell line (Raji;
ATCC = CCl-86), and primary peripheral blood mononuclear cells (PBMCs)
from healthy donors were used to test the toxicity of C4 and C8. MCF-7,
A549, and 293T cells were gifted from Dr. Kao (Memorial University
of Newfoundland), while Raji cell lines were gifted from Dr. Hirasawa
(Memorial University of Newfoundland). The B cell lymphoma cell line
and PBMCs were grown in RPMI-1640 medium (Invitrogen, USA) supplemented
with 10% fetal calf serum (Invitrogen, USA), 200 IU/mL penicillin/streptomycin
(Invitrogen, USA), 1% 1 M HEPES (Invitrogen, USA), 1% l-glutamine
(Invitrogen, USA), and 2.0 × 10–5 M 2-mercaptoethanol
(Sigma-Aldrich, USA), whereas MCF-7, A549, and 293T cell lines were
maintained in DMEM containing 100 U/mL penicillin and 100 μg/mL
streptomycin (Invitrogen, USA) and supplemented with 10% fetal calf
serum (Invitrogen, USA). PBMCs from an anonymous healthy donor was
obtained with approval from the Health Research Ethics Authority of
Newfoundland and Labrador, Canada, carried out in accordance with
the recommendations of the Canadian Tri-Council Policy Statement:
Ethical Conduct for Research Involving Humans. All cells were grown
at 37 °C in a humidified condition containing 5% CO2. Cells were transferred into a 96-well plate (104 cells
per well) and treated in eight replicates with C4 and C8 (100, 250,
and 450 μM) separately or with vehicle alone (140 mM DMSO).
Untreated cells were also considered as the negative control. After
24 and 48 h incubation at 37 °C, 10 μL of 12 mM MTT (3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium
bromide) solution (Molecular Probes) was added to each well and incubated
for 4 h. The reaction was terminated by the addition of 100 μL
of detergent reagent and incubation for 4 h in the dark. Colorimetric
evaluation was performed using a spectrophotometer at 490 nm. The
percentage of viable cells was calculated from the absorbance values
of untreated and treated cells as %Viable Cells = (OD490 treated/OD490 untreated) × 100.
Authors: Steven A Roberts; Michael S Lawrence; Leszek J Klimczak; Sara A Grimm; David Fargo; Petar Stojanov; Adam Kiezun; Gregory V Kryukov; Scott L Carter; Gordon Saksena; Shawn Harris; Ruchir R Shah; Michael A Resnick; Gad Getz; Dmitry A Gordenin Journal: Nat Genet Date: 2013-07-14 Impact factor: 38.330
Authors: Mani Larijani; Darina Frieder; Timothy M Sonbuchner; Ronda Bransteitter; Myron F Goodman; Eric E Bouhassira; Matthew D Scharff; Alberto Martin Journal: Mol Immunol Date: 2005-03 Impact factor: 4.407
Authors: Albert G Tsai; Haihui Lu; Sathees C Raghavan; Markus Muschen; Chih-Lin Hsieh; Michael R Lieber Journal: Cell Date: 2008-12-12 Impact factor: 41.582
Authors: Harvey A Greisman; Zhengfei Lu; Albert G Tsai; Timothy C Greiner; Hye Son Yi; Michael R Lieber Journal: Blood Date: 2012-08-20 Impact factor: 22.113
Authors: Juliana D Siqueira; Livia R Goes; Brunna M Alves; Pedro S de Carvalho; Claudia Cicala; James Arthos; João P B Viola; Andréia C de Melo; Marcelo A Soares Journal: Virus Evol Date: 2021-02-16
Authors: Michael J Grillo; Katherine F M Jones; Michael A Carpenter; Reuben S Harris; Daniel A Harki Journal: Trends Pharmacol Sci Date: 2022-05 Impact factor: 17.638