Literature DB >> 34423273

Structure-Based Design of First-Generation Small Molecule Inhibitors Targeting the Catalytic Pockets of AID, APOBEC3A, and APOBEC3B.

Justin J King1,2, Faezeh Borzooee1,2, Junbum Im2,3, Mahdi Asgharpour1,2, Atefeh Ghorbani1,2, Cody P Diamond2, Heather Fifield2, Lesley Berghuis2, Mani Larijani1,2.   

Abstract

Activation-induced cytidine deaminase (AID) initiates antibody diversification by mutating immunoglobulin loci in B lymphocytes. AID and related APOBEC3 (A3) enzymes also induce genome-wide mutations and lesions implicated in tumorigenesis and tumor progression. The most prevalent mutation signatures across diverse tumor genomes are attributable to the mistargeted mutagenic activities of AID/A3s. Thus, inhibiting AID/A3s has been suggested to be of therapeutic benefit. We previously used a computational-biochemical approach to gain insight into the structure of AID's catalytic pocket, which resulted in the discovery of a novel type of regulatory catalytic pocket closure that regulates AID/A3s that we termed the "Schrodinger's CATalytic pocket". Our findings were subsequently confirmed by direct structural studies. Here, we describe our search for small molecules that target the catalytic pocket of AID. We identified small molecules that inhibit purified AID, AID in cell extracts, and endogenous AID of lymphoma cells. Analogue expansion yielded derivatives with improved potencies. These were found to also inhibit A3A and A3B, the two most tumorigenic siblings of AID. Two compounds exhibit low micromolar IC50 inhibition of AID and A3A, exhibiting the strongest potency for A3A. Docking suggests key interactions between their warheads and residues lining the catalytic pockets of AID, A3A, and A3B and between the tails and DNA-interacting residues on the surface proximal to the catalytic pocket opening. Accordingly, mutants of these residues decreased inhibition potency. The chemistry and abundance of key stabilizing interactions between the small molecules and residues within and immediately outside the catalytic pockets are promising for therapeutic development.
© 2021 The Authors. Published by American Chemical Society.

Entities:  

Year:  2021        PMID: 34423273      PMCID: PMC8369683          DOI: 10.1021/acsptsci.1c00091

Source DB:  PubMed          Journal:  ACS Pharmacol Transl Sci        ISSN: 2575-9108


The DNA-editing enzyme activation-induced cytidine deaminase (AID) is expressed in activated B lymphocytes. AID mutates deoxycytidine (dC) to deoxyuridine (dU) at immunoglobulin (Ig) genes, triggering somatic hypermutation (SHM) and class switch recombination (CSR) of antibodies.[1−7] AID deficiency results in a hyper IgM characterized by a lack of high affinity antibodies of switched isotypes, which is readily treatable by modern pharmaceuticals.[2,3,8] AID also mediates significant off-target and genome-wide mutagenesis, some of which result in double stand breaks (DSBs) that mediate chromosomal translocations.[9−14] Thus, a wealth of literature over two decades implicates AID in initiation of leukemia/lymphomas including Burkitt’s lymphoma (BL), diffuse large B cell lymphoma (DLCL), follicular lymphoma (FL), multiple myeloma, and chronic lymphocytic leukemia.[10,15−19] These tumors arise from centroblasts or postcentroblasts, the narrow stage in a B cell’s life where AID is expressed, and mutations and translocation breakpoints (c-myc/IgH in BL, bcl-2/IgH in FL, bcl-6/IgH in DLCL, IgH-CCND1 in Mantle cell lymphoma) occur at genomic sites that are frequently targeted by AID.[18,20−27] The causal role of AID in lymphomagenesis was proven in IL-6 transgenic mice which develop lymphomas that mimic human BL in phenotype/genotype (c-myc/IgH). In this model, DSBs at both the IgH and translocation partner c-myc loci are directly caused by AID.[9,15,28−30] Beyond tumor initiation, AID expression can also exacerbate leukemia/lymphomas. Genome-wide AID-mediated mutation signatures are prevalent in leukemia/lymphomas, and numerous studies have shown that AID levels in tumors correlate with poor diagnosis.[10,19,31−43] In chronic myeloid leukemia (CML), AID was shown to mutate tumor suppressor and/or DNA repair genes and accelerate imatinib resistance.[31] Moreover, recent studies indicate that some therapeutic agents, such as idelalisib and duvelisib, can exacerbate AID-mediated genome mutations in tumors through increased AID expression and chromosomal translocation frequency between the IgH locus and off-target sites.[44,45] AID is a member of the apolipoprotein B mRNA editing enzyme catalytic polypeptide-like (APOBEC) family of Zn-dependent, single-stranded polynucleotide-restricted cytidine deaminases.[4,5,46] The AID/APOBEC family includes AID, APOBEC1, APOBEC2, APOBEC3A,B,C,D,F,G,H, and APOBEC4.[4,5,46] The APOBEC3 (A3) sub-branch carries out antiviral protective functions by targeting viral DNA for mutation in the cytoplasm of infected cells.[47−51] This antiviral activity has been the most studied in the context of retroviruses like HIV whose genomes go through a ssDNA replication phase; however, like AID, the A3s (particularly A3A and A3B) are also a major endogenous source of genomic mutations in many human cancers such as lung cancer, gastrointestinal cancer, breast cancer, head and neck cancer, and ovarian cancer[2,48,52−67] with prevalent mutational signatures across sequenced tumor genomes. A3 expression associates with poor prognosis, and A3 action is one mechanism for the generation of drug resistance. More recently, the role of A3s as cancer drivers and exacerbators has been confirmed in mouse models.[68] Given their prominent roles as mediators of one of the most prevalent tumor genome mutation signatures, inhibiting AID and A3 activity has been suggested as a potentially useful approach to treating AID-expressing malignancies or augmenting other therapies.[19,40,69−79] We previously utilized a combined computational–biochemical approach to glean insights into AID’s native and functional structure.[80] This approach is based on structure prediction using multiple templates, followed by functional verification of model predictions using a library of AID variants, including point/multiple mutants, orthologues, and chimeric versions with portions of other deaminases exchanged into the AID scaffold, or vice versa. We thus arrived at a map of AID’s functional structure including surface topology, core architecture, and catalytic pocket.[80] This map demonstrated that AIDs form a catalytic pocket with the triad of Zn-coordinating residues (H56, C87, and C90 in human AID) and catalytic glutamic acid (E58 in human AID) that can accommodate a dC residue in orientations that support the four-stage deamination chemistry common to cytidine and cytosine deaminase. This was reassuring because the same arrangement of primary catalytic residues directly responsible for cytidine deamination is classically found in cytidine deaminases across evolution. Furthermore, the periphery of AID’s catalytic pocket houses a network of noncatalytic residues, termed the “secondary catalytic residues”. These residues, while not directly involved in deamination catalysis, contact and/or stabilize the dC in deamination-conducive confirmations within the catalytic pocket.[80] This network of amino acids consists of G23, R24, R25, E26, T27, L29, N51, K52, N53, G54, C55, V57, T82, W84, S85, P86, D89, Y114, F115, C116, and E122 in human AID.[80] These residues form the “walls” and “floors” of the catalytic pocket and interact with substrate dC in several predicted protein conformations through hydrogen bonding, electrostatic interactions, and aromatic base stacking. Since publication of the functional structure of AID using the computational–biochemical method, two partial crystal structures have become available[81,82] which confirmed the map of AID’s catalytic pocket, including the aforementioned arrangement of the primary and secondary catalytic residues. A more detailed observation of the conformational states of AID’s catalytic revealed that it shifts dynamically between open and closed positions and that the majority (∼75%) of conformations at any time are predicted to exhibit a closed pocket.[80] This dynamic pocket closure, termed the “Schrodinger’s CATalytic pocket”[83] for its duality, was the first demonstration of such an inherent regulatory mechanism in human DNA/RNA-editing enzymes. More recently, this type of catalytic pocket closure was observed by X-ray and NMR on A3A and A3B, two close siblings of AID.[84−87] Previous works on small-molecule inhibitors of AID/APOBECs have been largely focused on A3G. Screening of a library of 1280 pharmacologically active compounds against A3G yielded several structurally related small molecules that inhibited A3G at low micromolar concentrations through covalent attachment of a nonconserved cysteine (C321) unique to A3G’s substrate-specificity loop 7.[78] As a follow-up, a screening of >300 000 compounds yielded different A3G inhibitors that covalently attached to the same cysteine residue.[88] Although useful for studying A3G biology, the electrophilic nature of these inhibitors is too cross-reactive in a cellular context and is thus not suitable for cellular work. As for AID, 5-aza-deoxycytidine incorporated into ssDNA was shown to bind the active site of AID and inhibit expression via proteasomal degradation,[75] though 5-aza-deoxycytidine is a transition-state analogue which needs to be in ssDNA, as the free form does not inhibit AID. More recently, 5-fluoro-2′-deoxyzebularine incorporated into ssDNA was also shown to inhibit A3A and an A3B-A3A chimera.[89] Thus, to date, there are no reported specific small molecules of AID, A3A, or A3B. As an extension of our previous works on delineating AID’s catalytic pocket, we screened a library of small molecules against multiple highly accessible conformations of its catalytic pocket. We identified first-generation small molecule inhibitors that specifically inhibit the mutagenic activity of purified AID, native AID in whole cell extracts, and endogenous AID of B lymphoma cells. Docking and mutational analyses reveal a network of contacts between the small molecules and AID’s primary and secondary catalytic residues. We found some analogue derivatives of the first-generation small molecules that were even more efficient inhibitors of A3A and A3B compared to AID.

Results

Rationale for Targeting the Catalytic Pocket of AID

AID’s catalytic pocket is an ideal target for small molecule inhibition for several reasons. First, we have previously gleaned detailed insights into its architecture and conformational breathing.[80] Second, the majority of structural differences between AID and related cytidine deaminases are concentrated in the catalytic pocket and proximal surface regions at the pocket opening,[80,83,90−93] thus allowing maximum specificity. Third, we and others have described the pocket-adjacent main ssDNA binding groove (groove 1) in detail,[80,82,83,94,95] thus offering a promising target for future derivatization. We reasoned that the functional and breathing structure of AID’s catalytic pocket as first described through our computational-biochemical approach[80] is an advantageous template for inhibitor search for several reasons. First, the architecture of its catalytic pocket was extensively functionally verified by testing of mutants and chimeras.[83] Second, using multiple catalytic pocket conformations is advantageous since AID/APOBEC enzymes contain highly flexible loops that compose their catalytic pocket leading to a range of different catalytic pocket conformations for each enzyme.[80,83,90,91,96] Third, purely homology modeled structures, using the same homology modeling methodology used to arrive at AID’s functional structure, were shown to be as reliable for generating inhibitor “hits”, as crystal structures.[97] Considering that the functional AID structure, especially of its inhibitor-target catalytic pocket, was backed by extensive biochemical validation of homology modeling predictions using AID mutants and chimeras[80,83] and was also confirmed by crystal structures (Figure S1),[81,82] we reasoned that it ought to represent an even more high confidence template for inhibitor design.

Structure-Based Virtual Screening of Small Molecules against the Catalytic Pocket of AID

To carry out a structure-based docking search for small molecules that bind in its catalytic pocket, we included five different conformations of AID which are representative of the full range of conformations with accessible catalytic pockets (Figure A–C).[80,83] This strategy would allow for the selection of compounds that bind all active AID conformations. For small molecules, we utilized the ZINC database which contains >100 million structures of commercially available compounds.[98] We restricted our search to the “clean lead” subset of 4.6 million compounds because this set contains compounds that are lead-like defined by pharmacological properties such as Lipinski’s rule of 5 and properties generally amenable for oral intake. This set is composed of molecules with benign functionality and excludes those with potentially toxic chains such as aldehydes and thiols.[99,100]
Figure 1

Virtual high-throughput screening of druglike small molecules against the catalytic pocket of AID. (A) Ensemble of AID conformations covering the range of accessible catalytic pockets dynamics used for virtual high-throughput screening. The structures exhibit conserved overall structure, with conformational changes localized to the secondary catalytic loops that compose the walls and floors of the catalytic pocket. N- to C-termini progression is shown from blue to red. The purple sphere depicts the catalytic pocket-coordinated Zn. (B) Representative surface structure of a catalytically productive AID:ssDNA complex with docked ssDNA (blue) and dC poised for deamination in the catalytic pocket (magenta). (C) dC bound in a catalytically productive configuration in the catalytic pocket highlighting the secondary catalytic residues. Adjacent ssDNA structure was omitted for clarity. This and other energetically similar conformations of the accessible catalytic pocket state of AID served as template for virtual high-throughput screening. (D) The structure-based virtual high-throughput screening scheme. We screened a large library of lead-like compounds from the ZINC library against the catalytic pocket of AID. Using several independent and complementary docking protocols to the catalytic pocket alone or the entire surface of AID, we identified 10 low-energy compounds predicted to bind in the catalytic pocket. (E) Structures of the 10 first-generation inhibitor candidates (C1–C10).

Virtual high-throughput screening of druglike small molecules against the catalytic pocket of AID. (A) Ensemble of AID conformations covering the range of accessible catalytic pockets dynamics used for virtual high-throughput screening. The structures exhibit conserved overall structure, with conformational changes localized to the secondary catalytic loops that compose the walls and floors of the catalytic pocket. N- to C-termini progression is shown from blue to red. The purple sphere depicts the catalytic pocket-coordinated Zn. (B) Representative surface structure of a catalytically productive AID:ssDNA complex with docked ssDNA (blue) and dC poised for deamination in the catalytic pocket (magenta). (C) dC bound in a catalytically productive configuration in the catalytic pocket highlighting the secondary catalytic residues. Adjacent ssDNA structure was omitted for clarity. This and other energetically similar conformations of the accessible catalytic pocket state of AID served as template for virtual high-throughput screening. (D) The structure-based virtual high-throughput screening scheme. We screened a large library of lead-like compounds from the ZINC library against the catalytic pocket of AID. Using several independent and complementary docking protocols to the catalytic pocket alone or the entire surface of AID, we identified 10 low-energy compounds predicted to bind in the catalytic pocket. (E) Structures of the 10 first-generation inhibitor candidates (C1–C10). Using DOCK Blaster,[101] we screened for candidates that bind AID in a search space restricted to the catalytic pocket and proximal surface region (∼10 Å; Figure C and D). For each AID conformation, we identified the 500 lowest-energy compounds. We then prioritized compounds that bound several AID conformations over those that bound only to a single conformation and selected the 40 lowest binding energy compounds (Table S1). To confirm specificity for the catalytic pocket, unrestricted docking was repeated using the entire surface of AID, rather than just the catalytic pocket region. In addition, we employed a second independent docking algorithm, AutoDock Vina,[102] to substantiate the 40 candidates identified by Dock Blaster. Even with access to AID’s whole surface, 27 of 40 compounds bound preferentially in the pocket, thus validating our screening methodology while further refining the list of hits. The 10 inhibitor candidates with the most favorable docking energies (C1–C10) were selected for functional testing (Figure E).

First Generation Hits That Inhibit Purified and Endogenous AID

Using the standard alkaline cleavage enzyme assay for deamination activity of purified AID/APOBEC enzymes, we measured the catalytic activity of purified GST-AID in the presence of each compound (Figure S2A and B). We found that 2 of 10 compounds (C4 and C8) diminished AID activity (8.3% and 17.4% AID catalytic activity, respectively; Figure A). In addition to purified GST-AID, we tested inhibition on whole cell extracts of AID-expressing 293T cells transfected with a CMV-promoter based AID-His expression vector (Figure B). Akin to our results with purified GST-AID, C4 and C8 inhibited this AID as well (43.9 and 40.3% AID catalytic activity, respectively). To assess potency, we measured the dose–response of C4 and C8 against GST-AID and the AID-expressing 293T cell lysate (Figure C and D). C4 and C8 showed a similar potency in GST-AID (IC50 = 290 and 230 μM, respectively) as with AID-His (IC50 = 460 and 390 μM, respectively).
Figure 2

First-generation inhibitor candidates inhibit purified AID and AID in whole cell extracts. (A) Catalytic activity of bacterially expressed and purified GST-AID treated with C1–C10 (n = 6 independent experiments conducted with three independently purified preparations of GST-AID). (B) Catalytic activity of eukaryotic-expressed AID in whole 293T cell lysate treated with C1–C10 (n = 3 independently prepared AID-expressing whole cell extracts). (C) Catalytic activity of GST-AID on C4 and C8 as a function of log inhibitor concentration. (D) Catalytic activity of AID-His 293T lysate as a function of log inhibitor concentration. (E) Catalytic activity of eukaryotic-expressed and purified GST-A3A, GST-A3B, GST-A3F, and GST-A3G treated with 700 μM C8. (F) Catalytic activity of GST-A3A, GST-A3B, and GST-A3G in comparison to bacterially expressed and purified GST-AID across a concentration range of C8. All experiments contained a negative control vehicle-only (140 mM DMSO) reaction which was designated as 100% AID activity. All AID reactions were performed at 37 °C for 2–4 h at pH 7.2 using 2 nM of the standard bubble oligonucleotide substrate TGCbub7 which has previously been demonstrated to be AID’s most favored substrate in the alkaline cleavage assay. GST-A3A, GST-A3B, GST-A3F, and GST-A3G reactions were incubated at 37 °C for 2 h in pH 6.0 using 2 nM of standard single-stranded oligonucleotide substrates containing a single target TTCA motif for A3A, A3B, and A3F and a single target CCC motif for A3G.

First-generation inhibitor candidates inhibit purified AID and AID in whole cell extracts. (A) Catalytic activity of bacterially expressed and purified GST-AID treated with C1–C10 (n = 6 independent experiments conducted with three independently purified preparations of GST-AID). (B) Catalytic activity of eukaryotic-expressed AID in whole 293T cell lysate treated with C1–C10 (n = 3 independently prepared AID-expressing whole cell extracts). (C) Catalytic activity of GST-AID on C4 and C8 as a function of log inhibitor concentration. (D) Catalytic activity of AID-His 293T lysate as a function of log inhibitor concentration. (E) Catalytic activity of eukaryotic-expressed and purified GST-A3A, GST-A3B, GST-A3F, and GST-A3G treated with 700 μM C8. (F) Catalytic activity of GST-A3A, GST-A3B, and GST-A3G in comparison to bacterially expressed and purified GST-AID across a concentration range of C8. All experiments contained a negative control vehicle-only (140 mM DMSO) reaction which was designated as 100% AID activity. All AID reactions were performed at 37 °C for 2–4 h at pH 7.2 using 2 nM of the standard bubble oligonucleotide substrate TGCbub7 which has previously been demonstrated to be AID’s most favored substrate in the alkaline cleavage assay. GST-A3A, GST-A3B, GST-A3F, and GST-A3G reactions were incubated at 37 °C for 2 h in pH 6.0 using 2 nM of standard single-stranded oligonucleotide substrates containing a single target TTCA motif for A3A, A3B, and A3F and a single target CCC motif for A3G. To evaluate toxicity, we incubated multiple cell lines originating from different tissues (A549, MCF-7, 293T, and Raji) as well as primary peripheral blood monocytes (PBMC) with C4 or C8 and measured viability using the standard MTT assay (Figure S2C). C4 caused minor toxicity toward all anchorage-dependent cells (A549, MCF-7, and 293T) but not Raji or PBMCs, while C8 was found not to be toxic. We thus focused on C8 for further development. To confirm that C8 was a bona fide AID inhibitor we measured off-target inhibition of UDG, an enzyme used downstream of AID in the alkaline cleavage deamination assay and found C8 did not inhibit UDG (Figure S2E). To test specificity, we examined whether C8 could inhibit the catalytic activity of homologous A3A, A3B, A3F, and A3G. We found GST-A3A, GST-A3B, and GST-A3G were inhibited (5%, 5%, and 37% activity, respectively), while GST-A3F was unaffected (Figure E). Compared to bacterially expressed GST-AID, 293T-expressed GST-A3A and GST-A3B were more potently inhibited, and GST-A3G was inhibited to a lesser degree (IC50 = 220, 60, 120, and 280 μM, respectively; Figure F). Thus, C8 acts as an AID/A3 inhibitor capable of blocking catalytic activity of specific family members. C8 inhibition of multiple forms of AID (purified GST-AID and AID-His in whole cell lysates of 293T cells) was reassuring. We then examined the ability of C8 to inhibit native (no fusion tag) full-length AID expressed in 293T cells and found that surprisingly it was even more effective at inhibiting untagged AID in whole cell lysates (IC50 = 11 μM) than purified AID was (Figure A). This result prompted us to examine the ability of C8 to inhibit the endogenous AID of AID-expressing lymphoma cell lines (Raji, Daudi, and Ramos; Figure B). Given that endogenous AID levels are significantly lower than that of 293T cells transfected with CMV promoter-driven AID expression vectors, we were unable to detect AID activity from cell lysates in the alkaline cleavage assay. Instead, we employed a more sensitive semiquantitative deamination-specific PCR assay (Deam-PCR) which we have previously established for measuring AID activity (Figure S3).[7,94,103−105] In this assay, a substrate plasmid is incubated with AID and subjected to PCR using deamination-specific primers that amplify DNA deaminated by AID (Figure S4A). Using this assay on C8-treated extracts of all three lymphoma cells, we also observed inhibition of endogenous AID in the whole cell extracts of three different lymphoma cell lines (Figure B). The assay also further confirmed the inhibition of purified AID (Figure B, bottom gel). In addition, we observed inhibition of exogenous AID expressed in 293T cells, in the whole cell extract (Figure C).
Figure 3

C8 inhibition of native untagged AID and endogenous AID from B lymphoma cells. (A) Representative alkaline cleavage experiment demonstrating C8 inhibition of untagged native AID in 293T whole cell lysate (left panel). Catalytic activity of eukaryotic-expressed native untagged AID as a function of log C8 concentration, using 140 mM DMSO as control for 100% AID activity (right panel). (B) Deam-specific PCR was used to detect activity of endogenous AID in extracts of AID-expressing B lymphoma cell lines, with or without addition of C8. The deamination substrate plasmid was incubated with cell extracts containing endogenously expressed AID from Raji (top panel), Ramos (middle panel), and Daudi (bottom panel) with added vehicle (140 mM DMSO) or C8. Incubation with C8, but not DMSO, abrogated any detectable PCR product. The bottom gel is a representative experiment demonstrating inhibition of AID-His using Deam-PCR. (C) Deam-PCR was used to demonstrate inhibition of AID expressed in 293T cells, in whole cell extract incubated with the target plasmid.

C8 inhibition of native untagged AID and endogenous AID from B lymphoma cells. (A) Representative alkaline cleavage experiment demonstrating C8 inhibition of untagged native AID in 293T whole cell lysate (left panel). Catalytic activity of eukaryotic-expressed native untagged AID as a function of log C8 concentration, using 140 mM DMSO as control for 100% AID activity (right panel). (B) Deam-specific PCR was used to detect activity of endogenous AID in extracts of AID-expressing B lymphoma cell lines, with or without addition of C8. The deamination substrate plasmid was incubated with cell extracts containing endogenously expressed AID from Raji (top panel), Ramos (middle panel), and Daudi (bottom panel) with added vehicle (140 mM DMSO) or C8. Incubation with C8, but not DMSO, abrogated any detectable PCR product. The bottom gel is a representative experiment demonstrating inhibition of AID-His using Deam-PCR. (C) Deam-PCR was used to demonstrate inhibition of AID expressed in 293T cells, in whole cell extract incubated with the target plasmid.

Inhibition Is Sensitive to Mutation of Catalytic Pocket Residues

Docking revealed that C8 binds into the catalytic pocket of AID (Figure A) and that akin to AID’s native substrate dC (Figure C) C8 is stabilized through multiple interactions with secondary catalytic residues in the catalytic pocket.[80] These secondary catalytic residues are housed on four secondary catalytic loops: Loops 1 (L1), 3 (L3), 5 (L5), and 7 (L7) (L1, L3, L5, and L7 correspond to the β1-α1 loop, β2-α2 loop, β3-α3 loop, and β4-α4 loop, respectively; Figure A, left panel).[80]Table describes the C8-stabilizing residues across multiple AID conformations, showing π–π stacking and hydrogen-bonding residues. In all AID-C8 complexes, the C8 benzimidazol-2-one is anchored deep in the catalytic pocket where it can π–π stack with H56 and/or Y114 and hydrogen bond with T27, E58 backbone, and S85 backbone. Notably, these are the same residues that interact with AID’s native substrate dC for stabilization and deamination in the catalytic pocket.[80,82] The C8-tail bound the ssDNA binding grooves with some flexibility; AID-C8 complexes bound the C8-tail either in the L1-L7 interface (between R25, W84 and Y114; Figure A, middle panel) or in the L1-L3 interface (between R25, E26, N51, and K52; Figure A, right panel), with a slight preference for the former. The tail-carbonyl hydrogen bonded with N51, H56, or Y114, while the furan group π–π stacked with W84 or Y114 and hydrogen bonded with R25 or Y114 (Table ). Thus, the benzimidazol-2-one was firmly bound in the catalytic pocket, while the flexible tail was predicted to adopt several configurations interacting with the ssDNA binding grooves.
Figure 4

Predicted binding modes of AID-C8 and inhibition-resistant mutations (A) Representative AID-C8 docked complex illustrating a low-energy binding mode (left), with the AID surface colored according to surface charges (blue = positive, red = negative, purple = Zn-coordinating residues). The C8 benzimidazol-2-one scaffold was bound deep in the catalytic pocket with the tail region bound outside, stabilized by several ssDNA binding residues in either the L1-L7 (middle) or L1-L3 (right) interface. For each inhibitor, the carbon backbone is colored green, with nitrogen, oxygen, and hydrogen colored blue, red, and white, respectively. (B) C8 inhibition of AID mutants targeting residues predicted to interact with C8 by docking analysis, demonstrating that mutation of ssDNA binding groove residues that directly stabilize C8 results in resistance to C8 inhibition. Dr-AID, representing an evolutionary-distant AID orthologue was unaffected by C8.

Table 1

Inhibitor Binding Residues across All Conformations of AID, A3A, A3B, and A3G

C8benzimidazol-2-one (π–π stacking)benzimidazol-2-one (H-bonding)linker carbonyl (H-bonding)tail furan (π–π stacking)tail (H-bonding)
AIDH56, Y114T27, H56, E58,a S85,a C87aN51, H56, Y114W84, Y114R25, Y114
A3AH70, Y130T31, N57, S99aH29, N57H29, W98, Y130, Y132R28, Y130
A3BY250, H253, Y313, Y315, intramolecularR212,a T214, N240, R252,a S282,a P283,a C284,a D316H253, Y313, Y315Y250, H253, F285, Y313, Y315, intramolecularR211, Y250, H253, C284, Y313, Y315.
A3GH257, W285, Y315, intramolecularH257, E259, S286,a T218noneH216, W285, Y315, intramolecularR213

Indicates the peptide backbone of that residue.

Predicted binding modes of AID-C8 and inhibition-resistant mutations (A) Representative AID-C8 docked complex illustrating a low-energy binding mode (left), with the AID surface colored according to surface charges (blue = positive, red = negative, purple = Zn-coordinating residues). The C8 benzimidazol-2-one scaffold was bound deep in the catalytic pocket with the tail region bound outside, stabilized by several ssDNA binding residues in either the L1-L7 (middle) or L1-L3 (right) interface. For each inhibitor, the carbon backbone is colored green, with nitrogen, oxygen, and hydrogen colored blue, red, and white, respectively. (B) C8 inhibition of AID mutants targeting residues predicted to interact with C8 by docking analysis, demonstrating that mutation of ssDNA binding groove residues that directly stabilize C8 results in resistance to C8 inhibition. Dr-AID, representing an evolutionary-distant AID orthologue was unaffected by C8. Indicates the peptide backbone of that residue. To verify these docking results, we measured C8 inhibition on a panel of AID mutated at residues predicted to interact with C8 (Figure B). We were limited in testing AID mutations, as many of the predicted C8-stabilizing residues are key catalytic residues, wherein even conservative mutations (e.g., N51Q and Y114F) result in catalytically dead mutants;[80] however, we tested the ability of C8 to inhibit AID mutated at each of the other contact residues located at the opening of the catalytic pocket (K22A, E26R, R25H, and R25N) and found that all were resistant to C8 inhibition when compared to wild type (IC50 = 500, > 1000, 890, and 770 μM, respectively). Zebrafish AID (Dr-AID) which naturally has a R25H equivalent, was also resistant to C8 inhibition (IC50 > 1000 μM). These data provide functional validation for the predicted AID-C8 interactions and highlight the specificity of C8 for the catalytic pocket and adjacent regions of human AID.

Structural analogues of C8 also exhibit AID inhibition

Using the ZINC database, we identified 948 structural analogues of C8 using a 60% structural similarity cutoff. We docked each analogue with several conformations of catalytically accessible AID, using the same approach described in Figure . Analogous to our initial screening approach for identifying C1–C10, we ranked C8 analogues based upon docking energy and chemical diversity and selected 15 analogues (C8.1–C8.15; Figure S5A). C8 analogues were tested for inhibition of purified AID (Figure S5B and C), and several were found to inhibit AID. Compared to C8, C8.5 and C8.12 inhibited purified GST-AID (230, 140, and 160 μM, respectively) and 293T-expressed AID-His (390, 340, and 370 μM, respectively) by a relatively similar degree (Figure A and B). However, C8 potently inhibited AID (untagged) to a far greater extent in comparison to C8.5 and C8.12 (IC50 = 11, 90, and 130 μM, respectively; Figure C). We performed a similar screen for C4 analogues (C4.1–C4.5), but none were capable of inhibiting AID (Figure S6A and B). Thus, C8, C8.5, and C8.12 inhibit AID activity to a similar degree, except for untagged AID in whole cell extracts which was most potently inhibited by C8. We then carried out a modified intracellular version of the Deam-PCR assay described above, wherein the plasmid substrate was transfected into the lymphoma cells that were either treated or untreated with C8 and C8.5, followed by cell lysis and Deam-PCR to measure mutations mediated by endogenous AID (Figure S4A). Here we also observed that C8 and C8.5 treatment of transfected Raji cells inhibited AID-mediated mutations (Figure S4B)
Figure 5

Structural analogues of C8 exhibit variable potency against AID, A3A, A3B, and A3G. Each panel measured catalytic activity in the presence of C8, C8.5, or C8.12. (A) Bacterially expressed and purifiedGST-AID. (B) Eukaryotic-expressed AID in whole 293T cell lysate. (C) Eukaryotic-expressed native untagged AID in whole 293T cell lysate. (D) Eukaryotic-expressed and purified GST-A3A. (E) Eukaryotic-expressed and purified GST-A3B. (F) Eukaryotic-expressed and purified GST-A3G. (G) Eukaryotic-expressed native untagged A3A in 293T cell lysate. (H) List of C8, C8.5, and C8.12 IC50 values across each enzyme. All experiments used 140 mM DMSO as a negative control, designating 100% enzyme activity. All AID reactions were performed at 37 °C for 2–4 h at pH 7.2 using 2 nM of the standard bubble oligonucleotide substrate TGCbub7. GST-A3A, GST-A3B, GST-A3F, and GST-A3G reactions were incubated at 37 °C for 2 h in pH 5.5 for A3A/B and pH 6.0 for A3G/F using 2 nM of standard single-stranded oligonucleotide substrates containing a single target TTCA motif for A3A, A3B, and A3F and a single target CCC motif for A3G.

Structural analogues of C8 exhibit variable potency against AID, A3A, A3B, and A3G. Each panel measured catalytic activity in the presence of C8, C8.5, or C8.12. (A) Bacterially expressed and purifiedGST-AID. (B) Eukaryotic-expressed AID in whole 293T cell lysate. (C) Eukaryotic-expressed native untagged AID in whole 293T cell lysate. (D) Eukaryotic-expressed and purified GST-A3A. (E) Eukaryotic-expressed and purified GST-A3B. (F) Eukaryotic-expressed and purified GST-A3G. (G) Eukaryotic-expressed native untagged A3A in 293T cell lysate. (H) List of C8, C8.5, and C8.12 IC50 values across each enzyme. All experiments used 140 mM DMSO as a negative control, designating 100% enzyme activity. All AID reactions were performed at 37 °C for 2–4 h at pH 7.2 using 2 nM of the standard bubble oligonucleotide substrate TGCbub7. GST-A3A, GST-A3B, GST-A3F, and GST-A3G reactions were incubated at 37 °C for 2 h in pH 5.5 for A3A/B and pH 6.0 for A3G/F using 2 nM of standard single-stranded oligonucleotide substrates containing a single target TTCA motif for A3A, A3B, and A3F and a single target CCC motif for A3G.

C8.5 Inhibits the Catalytic Activities of A3A, A3B, and A3G

Given inhibition by C8 on A3A/B/G (Figure E and F), we next sought to examine inhibition with C8.5 and C8.12 (Figure D–G). We found that C8.5 inhibits GST-A3A, GST-A3B, and GST-A3G and is remarkably more potent on inhibition of GST-A3A than any other inhibition value measured thus far in our assays (IC50 = 3, 40, and 70 μM, respectively). In contrast, C8.12 moderately inhibited GST-A3A but was a poor inhibitor of GST-A3B and GST-A3G (IC50 = 290, 700, and 740 μM, respectively). We also performed a similar screen of C8, C8.5, and C8.12 on GST-A3F but did not detect any inhibition (Figure S6C). Like the AID (untagged) expression system, we also attempted to express untagged versions of A3A and A3B but were only successful with A3A. We found that untagged A3A was rendered completely resistant against C8 (IC50 > 1000 μM), while C8.5 most potently inhibited its activity (IC50 = 9 μM), and C8.12 moderately inhibited its activity (IC50 = 170 μM; Figure G). Thus, of all the inhibition values measured in this study, the most potent inhibition was in the case of C8.5 and A3A, in single digit micromolar values for two independent forms of the enzyme.

Predicted AID/APOBEC3-Inhibitor Complexes Vary by Structural Differences between Secondary Catalytic Loops

Given the varying degrees of inhibition by C8, C8.5, and C8.12 on AID, A3A, A3B, and A3G, we sought a structural rationale for the observed differences. C8, C8.5, and C8.12 conserve the warhead benzimidazol-2-one group, the tail carbonyl, and a flexible, largely hydrophobic tail containing a planar ring available for π–π stacking (Figure , top panel). For each enzyme, we examined inhibitor binding variability across two to three catalytically accessible conformations and tabulated the most frequently contacted residues by the inhibitors (Table ).[80,83]
Figure 6

Favored binding modes of C8, C8.5, and C8.12 with AID, A3A, A3B, and A3G. Top panel: Conserved benzimidazol-2-one scaffold (blue) among C8, C8.5, and C8.12. Each inhibitor contains a tail-carbonyl and a flexible tail containing a largely hydrophobic aromatic ring. Each enzyme–inhibitor pair depicts the benzimidazol-2-one group bound deep in the catalytic pocket, with the flexible tail stabilized by different combinations of the secondary catalytic loops (L1, L3, L5, and L7). Stabilizing residues are transparent, colored, and labeled. The Zn-coordinating residues are colored purple. For each inhibitor, the carbon backbone is colored green, with nitrogen, oxygen, and hydrogen colored blue, red, and white, respectively.

Favored binding modes of C8, C8.5, and C8.12 with AID, A3A, A3B, and A3G. Top panel: Conserved benzimidazol-2-one scaffold (blue) among C8, C8.5, and C8.12. Each inhibitor contains a tail-carbonyl and a flexible tail containing a largely hydrophobic aromatic ring. Each enzyme–inhibitor pair depicts the benzimidazol-2-one group bound deep in the catalytic pocket, with the flexible tail stabilized by different combinations of the secondary catalytic loops (L1, L3, L5, and L7). Stabilizing residues are transparent, colored, and labeled. The Zn-coordinating residues are colored purple. For each inhibitor, the carbon backbone is colored green, with nitrogen, oxygen, and hydrogen colored blue, red, and white, respectively. Figure illustrates the dominant binding mode (shown in stick), while stabilizing residues are shown in colored transparent surfaces. In AID, the C8-tail frequently bound the L1-L7 interface with some conformations in the L1-L3 interface, while the C8.5 tail bound the L5-L7-α4 region and the C8.12-tail bound the L1-L3 interface (Figure , AID panel). In A3A, we found conformations of inhibitors where the benzimidazol-2-one group bound facing the Zn-coordinating residues and some that were slightly tilted toward L3 (shown for A3A-C8.5). For A3A, the C8-tail and C8.12-tail both bound between L1-L3-L7, with most interactions in the L1-L7 interface. The C8.5-tail preferentially bound the L5-L7-α4 region, with some conformations in the L1-L7 interface (Figure , A3A panel). Like A3A, we also noted different configurations of the benzimidazol-2-one group bound in the catalytic pocket of A3B, either slightly tilting toward L3 or L7. Furthermore, among all enzyme–inhibitor pairs, we noted the most inhibitor-tail flexibility when bound with A3B. For A3B, the C8-tail most frequently bound the L1-L3 interface, with other configurations bound with L1-L7, L1-L5-α4, and L3-L5-L7. Akin to AID and A3A, in A3B, the C8.5-tail frequently bound the L5-L7-α4 interface, with some binding also observed on the L1-L7 interface. The C8.12-tail primarily bound the L3-L5-L7-α3 interface, with some bound in the L1-L7 interface (Figure , A3B panel). Unlike A3A or A3B, we found A3G bound C8, C8.5, and C8.12 with limited variability, whereby each tail only bound in the L1-L7 interface, except for the C8.12-tail, which had some conformations bound in the L1-L3 interface (Figure , A3G panel). Across AID, A3A, A3B, and A3G, we noted the benzimidazol-2-one was consistently stabilized by the Zn-coordinating histidine (H56, H70, H253, and H257, respectively) as well as several secondary catalytic residues, including the L1 threonine (T27, T31, T214, and T218, respectively), the L5 serine backbone (S85, S99, S282, and S286, respectively), and the L7 tyrosine (Y114, Y130, Y313, and Y315, respectively; Table ). Interestingly, the C8.5-tail preferentially bound the L5-L7-α4 interface of AID, A3A, and A3B, whereby the tail phenylpropane π–π stacked with one or more aromatic residues.

Discussion

Since AID expression drives and exacerbates tumorigenesis, an AID inhibitor has been suggested to be of benefit; however, development of such an agent has not been possible since structural insights into AID were not gleaned until recently. The functional and native structure of AID described in 2015 revealed two structural features that explained the unusually low catalytic rate and high ssDNA binding affinity of AID.[80] These include frequent catalytic pocket closure and sporadic ssDNA binding by a highly positively charged surface, in positions that are not deamination-viable. This structural analysis, which has since been confirmed by two partial AID crystal structures,[81,82] provided an understanding of AID’s catalytic pocket conformational dynamics and ssDNA stabilization interactions proximal to the pocket (Figure C), thus offering an opportunity for structure-based inhibitor design. Using these AID–DNA interactions, we scanned the ZINC database of lead-like compounds and identified 10 compounds for testing against AID (Figure D and E). We identified two compounds, C4 and C8, that inhibit the enzymatic activity of AID (Figure ). MTT assay showed that C8 was noncytotoxic across several cell lines and primary healthy donor cells tested (Figure S2C). This suggested that C8 or derivatizations could be used successfully in future in vivo studies, and hence, we focused on C8. Although purified GST-AID was inhibited by C8 with modest IC50 values (230 μM), C8 could inhibit multiple forms of purified AID (both bacterially expressed GST-AID and 293T-expressed AID-His) as well as AID in whole cell lysates of 293T cells (Figure ). It was particularly encouraging that native untagged AID in whole cell 293T extract, which is the best representative of endogenous cellular AID, was inhibited by C8 with the most effective IC50 of 11 μM, ∼21-fold more effectively than purified AID (Figure A). This increased susceptibility of C8 inhibition is likely due to the higher specific activity of native untagged AID, thus requiring a lower amount of enzyme to achieve similar catalytic rates when compared with purified GST- or His-tagged AID. Since the topological features of the AID catalytic pocket and surrounding region are unique, we rationalized that these residues ought to act as specific anchors for small molecule placement. Furthermore, inhibitor binding in this region would impart a level of structural specificity to our strategy. Akin to dC stabilization (Figure C), we noted several stabilizing interactions with secondary catalytic residues and DNA binding residues in and proximal to the catalytic pocket, respectively (Figures A and 6). The benzimdazol-2-one group was anchored in the catalytic pocket, while the tail group adopted several conformations in the DNA binding groove. To bolster our understanding of the AID-C8 complexes, we probed several nonlethal surface mutants predicted to destabilize C8 binding. DNA binding groove mutants (K22A, R25H, R25N, and E26R) as well as evolutionary-distant Dr-AID were all resistant to C8 inhibition with increased IC50 values (Figure B). Using C8 as a parent compound, we examined 948 structural analogues, obtaining 15 (C8.1–C8.15) for analysis. Analogues lacking the protruding carbonyl characteristic of the benzimidazol-2-one group (C8.2, C8.3, C8.4, C8.13, and C8.14) resulted in a major loss of AID inhibition (Figure S5), likely due to a loss of stabilizing hydrogen bonds. C8.5 and C8.12 inhibited AID with similar potencies relative to C8 (Figure A and B), except in the case of native AID (untagged), whereby C8 was superior with an IC50 of 11 μM (Figure C). Akin to C8, C8.5 and 8.12 conserve the benzimidazol-2-one group, a tail carbonyl, and a flexible, mostly hydrophobic tail with a planar ring available for π–π stacking interactions (Figure , top panel). To assess the specificity of C8 within the AID/A3 family, we measured the inhibition of A3A, A3B, A3F, and A3G. Surprisingly, A3A was potently inhibited by C8, A3B and A3G were moderately inhibited, and A3F was unaffected (Figure E and F). Even more surprising is the fact that C8.5, a C8 analogue screened against the catalytic pocket of AID, potently inhibited A3A, A3B, and A3G while it moderately inhibited AID (Figure ). However, in retrospect, we believe this result was achieved for two reasons: First, the catalytic pockets of AID, A3A, A3B, and A3G each evolved to be conducive to a polynucleotide containing dC, and, as such, many of the secondary catalytic residues that stabilize dC also stabilize C8.5 (e.g., hydrogen bonding with the conserved L1 threonine residue T27, T31, T214, and T218, respectively; Table ). Second, our screening of AID exclusively included accessible catalytic pockets, which are predicted to represent the minority of conformations at any given time.[80,83] A3A and A3B-CTD have high sequence similarity, with the largest differences observed in the L1 region, with identical sequences for L5, L7, and α4 (Figure S6D). Given the dominant binding modes of the C8-tail (L1-L7 vs L1-L3 interface for A3A and A3B, respectively) and the C8.12-tail (L1-L7 vs L3-L5-L7-α3 interface for A3A and A3B, respectively; Figure ), we were not surprised to find differences in inhibition potencies. In support of the importance of these interactions, we note that A3F, which is unaffected by C8, C8.5, or C8.12 (Figure S6C), has major sequence differences of L1/L3/L5/L7 when compared to A3A/B/G as well as notable differences in its catalytic pocket chemistry that may explain a lack of inhibition (Figure S6D). Such differences include alternative Zn-coordination[106] and differences among key inhibitor-stabilizing residues (e.g., L1 threonine stabilizing residue is instead a serine, S216; Figure S6D). We noticed that A3A-inhibitor and A3B-inhibitor complexes with the benzimidazol-2-one group bound directly facing toward either Zn or Zn-coordinating residues, but also a tilted angle toward L3 or L5 residues (an example is shown in Figure for A3A-C8.5), a feature not observed for AID or A3G. We postulate that these tilted binding modes observed for A3A and A3B may bestow increased conformational freedom of inhibitors, thus allowing for overall enhanced stabilization of the benzimidazol-2-one group in the catalytic pocket. For both AID and A3s, we ensured that multiple enzyme versions are tested for inhibition, including GST-tagged purified and untagged in whole cell extracts and, in some cases, several other forms expressed in different cell hosts. In our view, this approach provides critical confidence that the observed inhibition is not a property of one version of the enzyme, as fusion tags and expression hosts are well-known to impact protein conformations and, in the case of enzymes, specific activity levels.[107] However reassuring the results were that in each case multiple forms of AID and A3s were indeed inhibited, the differences in IC50 values between the various versions are a testament to such conformational differences due to expression hosts and fusion tags. In the future, to maximize binding affinity and specificity among AID/A3 family members, several strategies could be undertaken to generate more efficacious inhibitors. Derivatization of the C8/C8.5/C8.12 benzimidazol-2-one scaffold could introduce additional hydrogen bonding pairs, conducive to the secondary catalytic residues of the AID/A3 catalytic pocket to improve affinity. Additionally, these derivatives could be modified to take advantage of the multiple regions bound by the tail of each inhibitor. Instead of a single, flexible tail connected to the benzimidazol-2-one scaffold, a 2- or 3-tail inhibitor could be constructed, such that the L1-L3, L1-L7, and L5-L7-α4 regions are bound simultaneously. Because most structural differences reside within these secondary catalytic loops, in particular the L1-L7 interface, such multitailed derivatives could achieve improved affinity and specificity to the desired AID/A3 target. An alternative approach for AID/A3 inhibitor design might include screening for inhibitors that bind accessible and/or partially occluded conformations of the catalytic pocket. This would help improve the specificity of inhibitors toward AID/A3 family members, based upon their catalytic pocket dynamics. For example, inhibitors restricted to the partially occluded conformations of A3B would be less likely to bind A3A, due to A3A’s superior catalytic pocket accessibility.[83−86,96] Such inhibitors would prove useful as probes to study AID/A3 biology and as initial scaffolds for future drug design. This effort could achieve either specific inhibition of A3A or A3B, or pan inhibition of AID, A3A and A3B, three highly tumorigenic AID/APOBEC family member enzymes. Unlike C8, the C8.5 and C8.12 tail-structures were much more hydrophobic (Figure ) and required sonication to coax dissolution (see Methods). Despite our best efforts, our working aliquots were only partially dissolved in solution and thus, the true IC50 of C8.5 and C8.12 may likely be even lower than the experimentally observed IC50. We suggest that future modifications of tail structures that include hydrogen bond donor/acceptor groups should dramatically improve solubility. Recently, several studies of SARS-CoV-2 have suggested that APOBECs may contribute to viral mutagenesis.[108−112] Similar to their role in promoting immune and drug escape in HIV,[47,113−115] APOBEC-induced mutations could generate SARS-CoV-2 variants with enhanced immune evasion and drug resistance. It is tempting to consider APOBEC inhibitors in this context, in addition to therapeutics that thwart cancer genome mutations. Beyond their potential for therapeutic development, the demonstration that C8 and analogues, which were identified based on docking into the Schrodinger’s CATalytic pocket, could indeed functionally inhibit AID and A3s provides further verification of the pocket structure itself. In the future, these and other derivatized inhibitors could also be useful as biochemical probes for studying the interaction of AID and A3s with DNA/RNA and as novel tools for studying their biology.

Methods

Virtual High-Throughput Screening of Small Molecules against the Catalytic Pocket of AID

The AID structure used for high throughput in silico identification of first generation hits is based on the functional and native AID structure described previously though a combined computational–biochemical method.[80] This structure has been verified by a partial AID crystal structure.[81,82] In addition, since the catalytic pocket of AID was designated as the inhibitor target for this study, this structure is advantageous because it includes several dynamic conformations of AID’s catalytic pocket, verified by biochemical analysis of AID variants.[80] Briefly, this structure was generated by modeling full-length AID based on eight X-ray or APOBEC structures as templates for homology modeling: A2 NMR (PDB: 2RPZ), A241–224 chain A and B X-ray (PDB: 2NYT:A and 2NYT:B, respectively), A3A NMR (PDB: 2M65), A3C X-ray (PDB: 3VOW), A3F-CTD X-ray (PDB: 4IOU), A3G-CTD NMR (PDB: 3E1U), and A3G-CTD NMR (PDB: 2KBO).[84,116−121] All AID/APOBEC X-ray/NMR structures were obtained from the protein databank (http://www.rcsb.org) and visualized using PyMOL v1.7.6 (http://www.pymol.org). Using the default parameters of I-TASSER (http://zhanglab.ccmb.med.umich.edu/I-TASSER/),[122,123] full-length human AID (Hs-AID) and variants were modeled from APOBEC templates. The catalytic pocket is defined as the indented space containing Zn and the catalytic residues (H56, E58, C87, and C90 in Hs-AID). Using DOCK Blaster v1.6.0 (http://blaster.docking.org/),[101] we virtually screened 4.6 × 106 “clean-lead” small molecules from the ZINC database (http://zinc.docking.org/) against the catalytic pocket of AID. We used several AID-DNA complexes containing dC in the catalytic pocket as a template for screening.[80] In total, we screened five low energy conformations of AID, representative of the range of catalytically accessible catalytic pocket conformations. As a result, small molecules were screened for their ability to bind to the catalytic pocket and surrounding DNA binding grooves across the ensemble of catalytically active AID structures. Compounds were docked and ranked based on binding energy. The 500 lowest energy compounds bound to each AID conformation were cross-referenced, and compounds predicted to bind only one catalytically active AID conformation were excluded. The 40 lowest-energy compounds (Table S1) bound across several catalytically active AID conformations were selected for additional docking using Autodock VINA (http://vina.scripps.edu)[102] via PyRx (https://pyrx.sourceforge.io)[124] to confirm specificity to the catalytic pocket and ranked energies. The top 10 compounds (C1–C10) were then selected for testing based on binding energy ranking as well as the chemical diversity of structure side chains. Compounds were purchased from Molport, and purities were > 90%. C4, C8, C8.5 and C8.12. Molport IDs are, respectively, 008-366-081, 009-139-310, 005-764-107, 020-119-012. For analogue expansion, we used the same approach against the ZINC database to identify 948 structural analogues of C4 and C8 considering a 60% similarity cutoff. Using Autodock VINA via PyRx, we screened the five AID conformations and identified the top ranking structural analogues of C4 and C8. Additionally, we constructed a full-length AID model based the crystal structure of near-native AID (PDB: 5W1C)[82] for binding with inhibitors to confirm AID-inhibitor binding modes. Two A3A conformations were selected from PDB 2M65 (apo) and PDB 5KEG (ssDNA-bound), so the catalytic pockets represent both unbound and bound states (with respective rotations of Y130/Y132, etc). For A3B-CTD, we chose two NMR conformations (PDB: 2NBQ) that had rotations of the equivalent tyrosine residues (Y313 and Y315), albeit different from the A3A structures. For A3G-CTD, we used two structures to represent different conformations (PDB: 3E1U and 3IR2). When examining AID/A3–inhibitor binding mode interactions, we examined three or more low-energy clusters per conformation for a total of six or more docked complexes per enzyme–inhibitor pair.

Expression and Purification of AID/APOBECs

Expression and purification of GST-AID and AID-His in bacteria and HEK 293T cells have previously been described.[80,125,126] Briefly, for bacterially expressed GST-AID, the expression and purification of human (Hs-AID), Hs-AID mutants, and zebrafish (Dr-AID) GST-AID were carried out as previously described, using the pGEX5.3 expression system.[92,126] Point mutants were generated by site-directed mutagenesis using appropriate GST-AID constructs as templates, as previously described.[80] Briefly, GST-AID was expressed in E. coli (DE3-Bl21) and purified using GST-column chromatography as per the manufacturer’s guidelines. In total, six independent preparations of GST-AID and two independent preparations of each mutant/chimeric/orthologous AID were made and tested in parallel. The expression and catalytic activities of AID-His and untagged native AID in HEK 293T cells were carried out as previously described, using the pcDNA3.1 expression system.[94,107] Briefly, 50 × 10 cm plates of 293T cells were transfected with 5 μg of plasmid per plate using Polyjet transfection reagent (Froggabio), incubated for 48 h, resuspended in 50 mM phosphate buffer (pH 8.2 for AID-His and pH 7 for untagged AID) + 500 mM NaCl, 0.2 mM PMSF, 50 μg/mL RNase A, and lysed using a French pressure cell press. Whole cell extracts expressing either AID-His or native untagged AID were flash frozen in liquid nitrogen and stored for activity analysis. GST-tagged versions of A3A/B/F/G expressed in HEK-293T were purified using GST-beads per the manufacturer’s guidelines as previously described.[107,127] The GST-tagged A3 enzymes were stored in 20 mM Tris-HCl pH 7.5, 100 mM NaCl, 1 mM DTT, 5% glycerol, and 50 μg/mL BSA. Whole cell extracts expressing native untagged A3A obtained as described above were flash frozen in liquid nitrogen. Expression of AID/APOBECs was verified using Western blotting probed with anti-GST (SantaCruz) antibodies, followed by the secondary detection by goat anti-rabbit IgG (SantaCruz). Western blots to verify the expression of untagged native AID/APOBECs were probed with rabbit polyclonal anti-AID antibody (Abcam) or anti-A3A antibody (Abcam), respectively, followed by the aforementioned secondary IgG. The relative yield and purity of each purified AID/APOBEC preparation were evaluated using standard SDS Coomassie staining. The concentration of 293T-purified AID/APOBECs ranged from 10 to 50 ng/μL. The concentration of native untagged AID ranged from 0.7 to 7 ng/μL, and for untagged native A3A it was 4 ng/μL.

Inhibition of AID/APOBEC Using the Alkaline Cleavage Assay

The standard alkaline cleavage assay for AID/APOBEC-mediated deamination was used to screen compounds for inhibition of AID, A3A, A3B, A3F, and A3G. For AID reactions, we used the standard seven-nucleotide bubble substrate containing the WRC motif TGC (5′-TTTGCTT-3′) as a substrate, because it has previously been optimized for the highest levels of AID activity in this enzyme assay.[80,125,128] For A3A, A3B, and A3F, we used a single-stranded oligo containing the preferred 5′-TC-3′ dinucleotide;[106,129,130] and for A3G, we used a single-stranded 5′-CCC-3′ oligo (Figure S7). Substrates were labeled and purified as described previously.[80,92,125] For AID reactions, purified substrate (1.7 nM) was incubated with AID enzyme (∼0.9 μg of bacterially expressed GST-AID, 1–10 ng of 293T-expressed AID-His or untagged AID) in phosphate buffer (100 mM, pH 7.2) with H2O, 140 mM DMSO as a vehicle control or compound (see below). All AID reactions were incubated at 37 °C, except for Dr-AID, which was incubated at 25 °C, which was previously shown to be its optimal temperature,[92,126] in a total volume of 10 μL. For A3A, A3B, A3F, and A3G, alkaline cleavage reactions were conducted in the same manner (12–30 ng of GST-tagged A3s and 12 ng of native untagged A3A used per reaction), except they were incubated in buffers of more acidic pH since the A3 family enzymes are optimally active at a more acidic pH as compared to AID.[131,132] The A3G/A3F activity buffer was 100 mM phosphate buffer (pH 6.0), 1 mM DTT, and 50 μg/mL RNase A. The A3A/A3B activity buffer was 25 mM HEPES (pH 5.5), 100 mM NaCl, 1 mM DTT, 0.1% Triton X-100, and 100 ng/mL RNase A. Concentrated stocks of compound were sonicated at 37 °C for up to 4 h in 1–10% DMSO to promote dissolution. Due to solubility differences between compounds, the highest achieved concentration in 140 mM DMSO was used for initial screening. For initial C1–C10, C4.1–C4.5, and C8.1–C8.15 screening, final alkaline cleavage concentrations (in 140 mM DMSO) ranged from 500 to 840 μM, except for C9, C8.4, C8.5, and C8.9 which had lower concentrations (310, 400, 170, and 270 μM, respectively). The A3 enzymes tolerated a higher [DMSO] as compared to AID which began to show reduced activity at 10% DMSO; therefore, for the A3 inhibition assays, we were able to use a 1:10 dilution of C8, C8.5, and C8.12 which were dissolved in 100% DMSO, thus bringing the final reaction to 10% (1.4 M) [DMSO].

Inhibition of Endogenous AID in Lymphoma Cells Measured by the Deamination-Specific PCR Assay

Daudi, Raji, and Ramos cells (ATCC = CCL-213, CCL-86, and CRL-1596, respectively) were suspension cultured for 48 h in RPMI 1640 growth media containing 10% FBS. To lyse the cells, cultures were centrifuged and cells (2.5–3 × 106 cells) were washed twice with PBS. Cell were lysed using glass beads (425–600 μm) (Sigma). The volume of glass beads used was twice the volume of the cell pellet. The mixture was then vortexed for 20–30 s and then incubated on ice for 30 s. This was repeated two to three more times in PBS with 0.2 mM PMSF, 50 μg/mL RNase A. To detect AID expression in these cells, we employed quantitative real-time PCR. Total RNA was extracted from 106 cultured cells using TRIzol solution (Invitrogen) according to the manufacturer’s instructions. The quality and quantity of the extracted RNA were estimated by spectrometry. Total RNA (1 μg) was used for cDNA synthesis with the ProtoScript First Strand cDNA Synthesis Kit (NEB, UK). Quantitative real-time PCR was performed in triplicate using SYBR Green I. The amount of GAPDH housekeeping gene transcripts was used as a reference for the level of AID gene expression. Amplification was carried out in a total volume of 20 μL containing 0.5 μg of cDNA prepared as described above, 0.3 μM each of GAPDH- and AID-specific primers, and 1× reaction mixture consisting of RNase-free water and 2× QuantiTect SYBR Green PCR Master Mix (Qiagen). Thermal cycling for both genes was initiated with a denaturation step at 95 °C for 10 min, followed by 40 cycles of denaturation at 95 °C for 15 s, annealing at 50 °C for 30 s, and elongation at 72 °C for 1 min. Melting curve analysis of the amplification products was performed at the end of each PCR by cooling the samples to 60 °C and then increasing the temperature to 95 °C at 0.2 °C/s. The experiment was repeated three times for statistical analysis. To detect AID activity in cell extracts, we utilized a deamination-specific PCR (Deam-PCR) as previously described for measuring the mutational patterns of purified AID.[7,94,103−105] In this assay, a plasmid DNA substrate is incubated with AID and mutations are detected by amplification using deamination-specific primers. Here, we applied this assay in three iterations: first, to measure inhibition of purified AID incubated with the substrate plasmid; second, to measure inhibition of AID in whole cell extracts of lymphoma cells incubated with the substrate plasmid; and third, we extended the application of this assay to detecting intracellular AID-mediated mutations on transfected plasmid substrate. Briefly, the plasmid used as a substrate for the deamination specific PCR assay was pcDNA3.1 containing a random WRC-rich target sequence as previously described. An amount of 50 ng of supercoiled plasmid was denatured at 98 °C in 100 mM phosphate buffer pH 7.2 for 10 min followed by snap-cooling in an ice bath to generate ssDNA targets for AID to mutate. A volume of 4 μL of either purified AID or AID-expressing cell lysates (from Daudi, Raji, or Ramos which are naturally AID+ B lymphomas, or from 293T cell transfected with an AID expression vector as described above) was mixed either with C8 or vehicle, added to the target plasmid, and incubated for 4 h at 32 °C. To detect AID-mediated mutations, 1 μL of each reaction was amplified by deamination-specific nested PCR using mutation-specific primers, as previously described.[7,94,103,104] The inner primers used in the second nested PCR reaction generate a 451 nt-product. PCR amplicons were subsequently TA-cloned, and 10 from each reaction were sequenced to confirm AID-mutated mutations C to T or G to A on the sense and nonsense strands. Negative control deamination-specific reactions were conducted on the substrate alone with no added extract or AID. Positive control Deam-specific PCRs were conducted on reactions containing substrate plasmid and extracts of AID-expressing 293T cells, as described above. An additional negative control PCR reaction was carried out by adding C8 directly to the PCR reaction of the aforementioned positive control, in order to make sure that our observation of inhibition of endogenous lymphoma cell AID by C8 is not due to interference of C8 with the deamination-specific PCR step itself. For the intracellular endogenous AID inhibition assay, Raji cells were suspended for 48 h in RPMI 1640 growth media containing 10% FBS. Then 1–2 × 106 cells were transfected with 5 μg of the same substrate plasmid DNA as above. Cells were treated with either C8 or C8.5 at a final concentration of 700 μM or treated with vehicle immediately prior to transfection with the AID substrate plasmid. Untreated transfected Raji cells and untreated untransfected Raji cells were used as negative controls. To lyse the cells 24 h post transfection, cultures were centrifuged and cells were washed twice with PBS. Cell were lysed using glass beads (425–600 μm) (Sigma). The volume of glass beads used was twice the volume of the cell pellet, followed by vortexing for 20–30 s and incubation on ice for 30 s. This procedure was repeated three times. Then 1 μL of the cell lysate or 1 μL of each condition diluted 100 times was subject to Deam-PCR as described above. To confirm that compounds do not interfere with Taq polymerase, either inhibitor-treated or untreated/transfected Raji cell lysate was added to the positive PCR reaction.

MTT Assay

A breast cancer cell line (MCF-7; ATCC = HTB22), lung cancer cell line (A549; ATCC = CCL185), embryonic kidney cell line, (293T; ATCC = CRL3216), B cell lymphoma cell line (Raji; ATCC = CCl-86), and primary peripheral blood mononuclear cells (PBMCs) from healthy donors were used to test the toxicity of C4 and C8. MCF-7, A549, and 293T cells were gifted from Dr. Kao (Memorial University of Newfoundland), while Raji cell lines were gifted from Dr. Hirasawa (Memorial University of Newfoundland). The B cell lymphoma cell line and PBMCs were grown in RPMI-1640 medium (Invitrogen, USA) supplemented with 10% fetal calf serum (Invitrogen, USA), 200 IU/mL penicillin/streptomycin (Invitrogen, USA), 1% 1 M HEPES (Invitrogen, USA), 1% l-glutamine (Invitrogen, USA), and 2.0 × 10–5 M 2-mercaptoethanol (Sigma-Aldrich, USA), whereas MCF-7, A549, and 293T cell lines were maintained in DMEM containing 100 U/mL penicillin and 100 μg/mL streptomycin (Invitrogen, USA) and supplemented with 10% fetal calf serum (Invitrogen, USA). PBMCs from an anonymous healthy donor was obtained with approval from the Health Research Ethics Authority of Newfoundland and Labrador, Canada, carried out in accordance with the recommendations of the Canadian Tri-Council Policy Statement: Ethical Conduct for Research Involving Humans. All cells were grown at 37 °C in a humidified condition containing 5% CO2. Cells were transferred into a 96-well plate (104 cells per well) and treated in eight replicates with C4 and C8 (100, 250, and 450 μM) separately or with vehicle alone (140 mM DMSO). Untreated cells were also considered as the negative control. After 24 and 48 h incubation at 37 °C, 10 μL of 12 mM MTT (3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide) solution (Molecular Probes) was added to each well and incubated for 4 h. The reaction was terminated by the addition of 100 μL of detergent reagent and incubation for 4 h in the dark. Colorimetric evaluation was performed using a spectrophotometer at 490 nm. The percentage of viable cells was calculated from the absorbance values of untreated and treated cells as %Viable Cells = (OD490 treated/OD490 untreated) × 100.
  131 in total

1.  Activation-induced cytidine deaminase accelerates clonal evolution in BCR-ABL1-driven B-cell lineage acute lymphoblastic leukemia.

Authors:  Tanja Andrea Gruber; Mi Sook Chang; Richard Sposto; Markus Müschen
Journal:  Cancer Res       Date:  2010-09-28       Impact factor: 12.701

2.  An APOBEC cytidine deaminase mutagenesis pattern is widespread in human cancers.

Authors:  Steven A Roberts; Michael S Lawrence; Leszek J Klimczak; Sara A Grimm; David Fargo; Petar Stojanov; Adam Kiezun; Gregory V Kryukov; Scott L Carter; Gordon Saksena; Shawn Harris; Ruchir R Shah; Michael A Resnick; Gad Getz; Dmitry A Gordenin
Journal:  Nat Genet       Date:  2013-07-14       Impact factor: 38.330

3.  Small-molecule library screening by docking with PyRx.

Authors:  Sargis Dallakyan; Arthur J Olson
Journal:  Methods Mol Biol       Date:  2015

4.  The mutation spectrum of purified AID is similar to the mutability index in Ramos cells and in ung(-/-)msh2(-/-) mice.

Authors:  Mani Larijani; Darina Frieder; Wajiha Basit; Alberto Martin
Journal:  Immunogenetics       Date:  2005-01-14       Impact factor: 2.846

5.  APOBEC3G: a double agent in defense.

Authors:  Harold C Smith
Journal:  Trends Biochem Sci       Date:  2011-01-14       Impact factor: 13.807

6.  Methylation protects cytidines from AID-mediated deamination.

Authors:  Mani Larijani; Darina Frieder; Timothy M Sonbuchner; Ronda Bransteitter; Myron F Goodman; Eric E Bouhassira; Matthew D Scharff; Alberto Martin
Journal:  Mol Immunol       Date:  2005-03       Impact factor: 4.407

7.  Human chromosomal translocations at CpG sites and a theoretical basis for their lineage and stage specificity.

Authors:  Albert G Tsai; Haihui Lu; Sathees C Raghavan; Markus Muschen; Chih-Lin Hsieh; Michael R Lieber
Journal:  Cell       Date:  2008-12-12       Impact factor: 41.582

8.  IgH partner breakpoint sequences provide evidence that AID initiates t(11;14) and t(8;14) chromosomal breaks in mantle cell and Burkitt lymphomas.

Authors:  Harvey A Greisman; Zhengfei Lu; Albert G Tsai; Timothy C Greiner; Hye Son Yi; Michael R Lieber
Journal:  Blood       Date:  2012-08-20       Impact factor: 22.113

9.  APOBEC-mediated cytosine deamination links PIK3CA helical domain mutations to human papillomavirus-driven tumor development.

Authors:  Stephen Henderson; Ankur Chakravarthy; Xiaoping Su; Chris Boshoff; Tim Robert Fenton
Journal:  Cell Rep       Date:  2014-06-05       Impact factor: 9.423

10.  SARS-CoV-2 genomic analyses in cancer patients reveal elevated intrahost genetic diversity.

Authors:  Juliana D Siqueira; Livia R Goes; Brunna M Alves; Pedro S de Carvalho; Claudia Cicala; James Arthos; João P B Viola; Andréia C de Melo; Marcelo A Soares
Journal:  Virus Evol       Date:  2021-02-16
View more
  1 in total

Review 1.  The current toolbox for APOBEC drug discovery.

Authors:  Michael J Grillo; Katherine F M Jones; Michael A Carpenter; Reuben S Harris; Daniel A Harki
Journal:  Trends Pharmacol Sci       Date:  2022-05       Impact factor: 17.638

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.