Lizhen Wu1, Vipul Shukla2, Anurupa Devi Yadavalli1, Ravi K Dinesh1, Dijin Xu3, Anjana Rao2,4,5, David G Schatz1. 1. Department of Immunobiology, Yale School of Medicine, New Haven, Connecticut 06520, USA. 2. Division of Signaling and Gene Expression, La Jolla Institute for Immunology, La Jolla, California 92037, USA. 3. Department of Microbial Pathogenesis, Yale University School of Medicine, New Haven, Connecticut 06510, USA. 4. Department of Pharmacology, Moores Cancer Center, University of California at San Diego, La Jolla, California 92093, USA. 5. Consortium for Regenerative Medicine, La Jolla, California 92037, USA.
Abstract
Somatic hypermutation (SHM) produces point mutations in immunoglobulin (Ig) genes in B cells when uracils created by the activation-induced deaminase are processed in a mutagenic manner by enzymes of the base excision repair (BER) and mismatch repair (MMR) pathways. Such uracil processing creates DNA strand breaks and is susceptible to the generation of deleterious deletions. Here, we demonstrate that the DNA repair factor HMCES strongly suppresses deletions without significantly affecting other parameters of SHM in mouse and human B cells, thereby facilitating the production of antigen-specific antibodies. The deletion-prone repair pathway suppressed by HMCES operates downstream from the uracil glycosylase UNG and is mediated by the combined action of BER factor APE2 and MMR factors MSH2, MSH6, and EXO1. HMCES's ability to shield against deletions during SHM requires its capacity to form covalent cross-links with abasic sites, in sharp contrast to its DNA end-joining role in class switch recombination but analogous to its genome-stabilizing role during DNA replication. Our findings lead to a novel model for the protection of Ig gene integrity during SHM in which abasic site cross-linking by HMCES intercedes at a critical juncture during processing of vulnerable gapped DNA intermediates by BER and MMR enzymes.
Somatic hypermutation (SHM) produces point mutations in immunoglobulin (Ig) genes in B cells when uracils created by the activation-induced deaminase are processed in a mutagenic manner by enzymes of the base excision repair (BER) and mismatch repair (MMR) pathways. Such uracil processing creates DNA strand breaks and is susceptible to the generation of deleterious deletions. Here, we demonstrate that the DNA repair factor HMCES strongly suppresses deletions without significantly affecting other parameters of SHM in mouse and human B cells, thereby facilitating the production of antigen-specific antibodies. The deletion-prone repair pathway suppressed by HMCES operates downstream from the uracil glycosylase UNG and is mediated by the combined action of BER factor APE2 and MMR factors MSH2, MSH6, and EXO1. HMCES's ability to shield against deletions during SHM requires its capacity to form covalent cross-links with abasic sites, in sharp contrast to its DNA end-joining role in class switch recombination but analogous to its genome-stabilizing role during DNA replication. Our findings lead to a novel model for the protection of Ig gene integrity during SHM in which abasic site cross-linking by HMCES intercedes at a critical juncture during processing of vulnerable gapped DNA intermediates by BER and MMR enzymes.
Somatic hypermutation (SHM) introduces mutations, predominantly single-nucleotide point mutations, into the variable (V) region exon of assembled immunoglobulin (Ig) heavy and light chain genes in antigen-stimulated germinal center B cells (French et al. 1989; Di Noia and Neuberger 2007; Victora and Nussenzweig 2012). SHM enables the generation of populations of B cells whose surface Ig receptors vary in their antigen binding affinity. This variation is critical for the process of antibody affinity maturation in which B cells with higher-affinity Ig receptors are iteratively selected for preferential survival, proliferation, and additional rounds of SHM and affinity selection (Neuberger 2008; Victora and Nussenzweig 2012). SHM is therefore critical for effective humoral immunity and vaccine responses. SHM also generates deletions and, to a lesser extent, insertions, which are strongly selected against in the Ig V region, presumably because of their propensity to damage the integrity of the open reading frame or protein folding (Yeap et al. 2015). The mechanisms that generate deletions during SHM are not well characterized, although it is plausible that they involve double-strand DNA (dsDNA) breaks, a particularly dangerous DNA lesion (Sale and Neuberger 1998; Bross et al. 2000; Papavasiliou and Schatz 2000; Alt et al. 2013; Yeap et al. 2015; Casellas et al. 2016; Yeap and Meng 2019). Whether dedicated factors or mechanisms exist to regulate the frequency of deletions during SHM and whether they might also regulate the generation of insertions are unknown.SHM is initiated when the activation-induced deaminase (AID), a single-strand DNA (ssDNA)-specific cytidine deaminase, converts a cytosine base (C) to uracil (U) in an Ig V region (Fig. 1A; Di Noia and Neuberger 2007). AID can target C residues on either DNA strand, acts preferentially at hotspots corresponding to the sequence RGYW or its inverse complement WRCY (R = A, G; Y = C, T; and W = A, T), and is thought to access its ssDNA substrate in the context of the transcription bubble created by paused or stalled RNA polymerase 2 (Sun et al. 2013; Methot and Di Noia 2017). Using a similar mechanism, AID deaminates C residues in Ig heavy chain gene (IGH) switch regions to initiate class switch recombination (CSR), a reaction that replaces one IGH constant region with another, thereby altering antibody effector function (Chaudhuri and Alt 2004; Xu et al. 2012).
Figure 1.
Establishing a rapid assay for SHM. (A) Schematic of SHM reaction. During SHM, AID deaminates C to generate uracil, which can act as a template for replication, leading to a C:G-to-T:A transition mutation. Alternatively, noncanonical (error-prone) base excision repair (BER) initiated by UNG can create an abasic site that, if copied by a translesion DNA polymerase, yields mutations at G:C. Finally, U or U:G mismatches can trigger the generation of ssDNA gaps through the action of UNG and mismatch repair (MMR) factors MSH2, MSH6, and EXO1, which primarily yield mutations at A:T upon translesion polymerase synthesis. (B) Scheme for creating RASH-1 and RASH-2. Creation of RASH involved several steps of CRISPR/Cas9 gene targeting in Ramos. AICDA (encoding AID) was disrupted, and a construct was inserted into the AAVS1 safe harbor locus that expresses AID7.3 (AID hyperactive mutant with threefold increased catalytic activity) upon doxycycline (dox)-induced binding of Tet3G to a promoter containing seven copies of its binding site, TetO (7xTetO). Western blot showing AID protein levels in AID7.3in cells without or with 200 ng/mL dox for 24 h. The GFP7-E reporter with EF1α promoter driving transcription of HTS7 (enriched with AID hotspots)-T2A-GFP was integrated into IGH either 38,674 bp upstream of the V leader sequence (RASH-1) or by replacing IGH-VDJ coding sequences so as to retain the leader exon (which now lacks its ATG start codon; RASH-2). Kozak sequence was added to HTS7-T2A-GFP in RASH-2. (C,D) Representative flow cytometry plots of and bar graphs quantifying GFP loss (C) or IgM loss (D) in RASH-1 cells treated with dox for 6 d. (E) Representative flow cytometry plots of and bar graph quantifying GFP loss in RASH-2 treated with dox for 6 d. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using two-tailed Student's t-test. (***) P value < 0.001.
Establishing a rapid assay for SHM. (A) Schematic of SHM reaction. During SHM, AID deaminates C to generate uracil, which can act as a template for replication, leading to a C:G-to-T:A transition mutation. Alternatively, noncanonical (error-prone) base excision repair (BER) initiated by UNG can create an abasic site that, if copied by a translesion DNA polymerase, yields mutations at G:C. Finally, U or U:G mismatches can trigger the generation of ssDNA gaps through the action of UNG and mismatch repair (MMR) factors MSH2, MSH6, and EXO1, which primarily yield mutations at A:T upon translesion polymerase synthesis. (B) Scheme for creating RASH-1 and RASH-2. Creation of RASH involved several steps of CRISPR/Cas9 gene targeting in Ramos. AICDA (encoding AID) was disrupted, and a construct was inserted into the AAVS1 safe harbor locus that expresses AID7.3 (AID hyperactive mutant with threefold increased catalytic activity) upon doxycycline (dox)-induced binding of Tet3G to a promoter containing seven copies of its binding site, TetO (7xTetO). Western blot showing AID protein levels in AID7.3in cells without or with 200 ng/mL dox for 24 h. The GFP7-E reporter with EF1α promoter driving transcription of HTS7 (enriched with AID hotspots)-T2A-GFP was integrated into IGH either 38,674 bp upstream of the V leader sequence (RASH-1) or by replacing IGH-VDJ coding sequences so as to retain the leader exon (which now lacks its ATG start codon; RASH-2). Kozak sequence was added to HTS7-T2A-GFP in RASH-2. (C,D) Representative flow cytometry plots of and bar graphs quantifying GFP loss (C) or IgM loss (D) in RASH-1 cells treated with dox for 6 d. (E) Representative flow cytometry plots of and bar graph quantifying GFP loss in RASH-2 treated with dox for 6 d. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using two-tailed Student's t-test. (***) P value < 0.001.The U created by AID is the source of the mutations generated during SHM (Fig. 1A). It can be replicated over in S phase to generate C/G-to-A/T transition mutations or it (or the U:G mismatch it creates) can be recognized and processed by enzymes in the base excision repair (BER) or mismatch repair (MMR) pathways. In most cells, these pathways process U (U:G) in a high-fidelity manner to prevent mutations, but downstream from AID in germinal center B cells they act both individually and in concert—through multiple error-prone repair pathways—to generate mutations at the site at which AID acted and at surrounding G/C and A/T residues (Liu and Schatz 2009; Methot and Di Noia 2017; Pilzecker and Jacobs 2019).The critical enzymes that initiate these error-prone repair mechanisms are the BER factor UNG (uracil DNA glycosylase) and the MMR heterodimer MSH2/MSH6 (Rada et al. 2004; Di Noia and Neuberger 2007) (Fig. 1A). UNG encodes mitochondrial (UNG1) and nuclear (UNG2) isoforms through the use of alternative promoters and splicing (Nilsen et al. 1997). UNG recognizes and removes the U base from ssDNA or dsDNA to create an abasic site, which can either serve as a noninstructive template for translesion polymerases or trigger endonucleolytic cleavage of the deoxyribose–phosphate backbone by apurinic/apyrimidinic endonucleases APE1 or APE2. APE2 has been implicated in the generation of mutations and deletions in SHM (Sabouri et al. 2009; Stavnezer et al. 2014). MSH2/6 recognizes the U:G mismatch created by the action of AID and can trigger resection of one DNA strand by the exonuclease EXO1. The DNA strand lesions created by BER and MMR factors can then be resolved by one of several pathways involving error-prone translesion polymerases to create the full spectrum of mutations associated with SHM (Methot and Di Noia 2017; Pilzecker and Jacobs 2019; Yeap and Meng 2019; Feng et al. 2020). Extensive evidence indicates that BER and MMR factors act collaboratively to create mutations during SHM and to convert ssDNA breaks and gaps into the dsDNA breaks required for CSR, with loss of either pathway causing a severe reduction in the efficiency of both SHM and CSR (Methot and Di Noia 2017; Pilzecker and Jacobs 2019; Feng et al. 2020, 2021). It is plausible that BER and MMR factors collaborate to generate deletions during SHM, although this idea has not been tested.HMCES (5-hydroxymethylcytosine binding, embryonic stem cell-specific protein) is the sole known mammalian member of the superfamily of proteins containing a SRAP (SOS response-associated peptidase) domain, an evolutionarily ancient domain found in E. coli yedK and in many other species (Aravind et al. 2013). HMCES is recruited to stalled DNA replication forks where its SRAP domain forms a covalent cross-link with abasic sites in ssDNA, thereby shielding the abasic site from the action of endonucleases and translesion polymerases and protecting against DNA breaks and mutations (Mohni et al. 2019; Mehta et al. 2020; Srivastava et al. 2020). In this mode of action, HMCES is a suicide enzyme that is removed from DNA by ubiquitin-mediated proteolysis (Mohni et al. 2019). Structural and biochemical analyses have revealed the basis of HMCES's preference for binding to ssDNA extensions or gaps and identified a thiazolidine linkage formed between an invariant cysteine at position 2 (C2) of HMCES and the deoxyribose of the abasic site (Halabelian et al. 2019; Thompson et al. 2019; Wang et al. 2019). HMCES is highly expressed in germinal center B cells, and Hmces−/− mice exhibit a significant defect in CSR despite normal cellular parameters of the germinal center response (Shukla et al. 2020). The CSR defect is due to a selective failure of the alternative end-joining DNA repair pathway, which works in tandem with classical nonhomologous end joining to complete the CSR reaction (Boboila et al. 2010; Alt et al. 2013; Chang et al. 2017; Shukla et al. 2020). Notably, HMCES function in alternative end joining appears to involve the protection of ssDNA overhangs against excessive deletion and does not require C2, revealing a second mode of action that is independent of cross-linking to abasic sites (Shukla et al. 2020). The role of HMCES in SHM has not been explored.In this study, we report that HMCES plays a highly selective role during SHM to limit the accumulation of deletions in SHM target regions. In both mice and a human cell line model system, loss of HMCES leads to significant increases in SHM-associated deletions with little or no significant effect on point mutation accumulation or spectrum or the frequency of insertions. Taking advantage of a new rapid SHM assay system, we demonstrate that HMCES function during SHM requires C2 and an intact ssDNA binding groove, is completely dependent on UNG, and protects against a deletion pathway that involves APE2 (but not APE1), MSH2, MSH6, and EXO1. Our results lead to a model in which cross-linking of HMCES to abasic sites generated by the combined action of AID and UNG inhibits DNA cleavage of a vulnerable ssDNA repair intermediate by APE2 and indicate that BER and MMR can act in a coordinated manner to create deletions during SHM.
Results
Establishing a rapid assay for SHM (RASH)
To facilitate the analysis of SHM factors and mechanisms, we sought to create a fast and robust cellular assay for SHM that provided high levels of AID activity, a sensitive readout of AID-dependent mutation accumulation, and temporal control. We used the human germinal center-derived lymphoma cell line Ramos because of its extensive prior validation as a SHM model system, including efficient AID-dependent mutation of its Ig heavy (IGH) and λ (IGL) chain variable regions, ability to support Ig enhancer-driven mutation of reporter vectors, and substantial mapping of sites of “off-target” SHM activity in its genome (Sale and Neuberger 1998; Papavasiliou and Schatz 2000; Ronai et al. 2007; Qian et al. 2014; Senigl et al. 2019). Ramos has been found to mirror many aspects of physiological SHM with the exception that its mutation spectrum is biased toward mutations at G/C (75%–80% as opposed to the ∼45% seen in germinal center B cells) (Sale and Neuberger 1998).Creation of cell lines in which SHM could be assessed rapidly and sensitively (Fig. 1B) began with our previously generated AID7.3in cells, in which endogenous AICDA (encoding AID) is disrupted and AID7.3 (AID mutant K10E/E156G/T82I, with threefold increased catalytic activity) (Wang et al. 2009) is expressed from an integrated doxycycline (dox)-inducible construct (Dinesh et al. 2020). AID7.3 is tightly regulated and expressed at levels well above those of endogenous AID upon dox treatment (Fig. 1B; Supplemental Fig. S1A). We infected AID7.3in Ramos cells with a lentiviral SHM reporter vector (GFP7-E), in which the constitutive EF1α promoter drives transcription of a fusion HTS7-T2A-GFP open reading frame. HTS7 contains a dense array of SHM hotspots designed to yield stop codons upon mutation, allowing efficient detection of SHM activity by virtue of loss of GFP fluorescence (referred to here as GFP loss) (Senigl et al. 2019). As expected (Senigl et al. 2019), most cell clones with integrated GFP7-E gave rise to very little (<1%) GFP loss after 6 d of dox induction (279 of 306 clones examined), while one clone exhibited particularly strong GFP loss (Supplemental Fig. S1B, arrow). We found that this clone contained GFP7-E integrated ∼38.6 kb upstream of the rearranged IGH variable region (Fig. 1B) and exhibited robust (>40%) GFP loss with 6 d of dox treatment (Fig. 1C). AID induction also led to substantial accumulation of cells lacking surface IgM expression (referred to here as IgM loss) (Fig. 1D), which has been shown to be caused by deleterious mutations in the rearranged IGH and IGL variable regions (Neuberger et al. 1998; Papavasiliou and Schatz 2000). This clone is referred to here as RASH-1 (rapid assay for SHM).A second cell line (RASH-2) was generated from AID7.3in Ramos cells by targeted integration of HTS7-T2A-GFP into the IGH locus, replacing the rearranged IGH variable region exon and placing the cassette under control of the EF1α promoter (Fig. 1B). The ATG start codon in the L exon was mutated and splicing between the L exon and HTS7-T2A-GFP was retained. RASH-2 cells yielded robust GFP loss upon AID induction, similar in magnitude to RASH-1 cells (Fig. 1E). Insertion of HTS7-T2A-GFP into the IGH locus in RASH-1 and RASH-2 provides a physiological genomic context for SHM, which is thought to be driven by IGH enhancer elements (Rouaud et al. 2013; Buerstedde et al. 2014; Senigl et al. 2019; Dinesh et al. 2020). Cas9-expressing derivatives of RASH-1 and RASH-2 (RASH-1C and RASH-2C) support efficient Cas9-mediated gene disruption (data not shown) and most of the cell line experiments described below were performed in RASH-1C cells or derivatives thereof.
HMCES deficiency leads to increased deletions in SHM target regions
HMCES is required for efficient CSR (Shukla et al. 2020), leading us to ask whether it might also play a role in facilitating SHM; if so, one might expect disruption of HMCES to decrease the extent of GFP loss in our RASH lines. Unexpectedly, however, when HMCES was disrupted in bulk RASH-1C cells using two independent sgRNAs delivered by electroporation (Fig. 2A,B), GFP loss increased significantly upon AID induction. Increased GFP loss was also observed in two independent RASH-2C clones depleted of HMCES and in HMCES-deficient RASH-1C single-cell clones (Fig. 2C,D; Supplemental Fig. S1C,D). Furthermore, loss of HMCES led to increased IgM loss in RASH-1C clones, with the magnitude of the increase correlating well with the increase in GFP loss (Fig. 2E; Supplemental Fig. S1E). To rule out the possibility that this phenotype was due to use of a hyperactive AID mutant, we engineered Cas9-expressing derivatives (Supplemental Fig. S1F) of Ramos line A23, which overexpresses WT human AID from a retroviral vector (Supplemental Fig. S1G; Dinesh et al. 2020). Analysis of bulk HMCES knockout (KO) populations or single-cell HMCES KO clones of A23-Cas9 revealed increased IgM loss relative to the parental A23-Cas9 cells (Supplemental Fig. S1H–J). Finally, to rule out the possibility that the phenotype was due to ectopic overexpression of AID, HMCES was knocked out in six independent WT Ramos single-cell clones, which exhibited a strong increase in IgM loss relative to WT Ramos clones (Fig. 2F,G). Disruption of HMCES did not alter levels of AID expression in these analyses (Fig. 2C,F; Supplemental Fig. S1H). Together, these results demonstrated that loss of HMCES leads to an increase in AID-dependent GFP and IgM loss and suggested the possibility that HMCES is a negative regulator of SHM.
Figure 2.
HMCES deficiency leads to increased deletions in SHM target regions. (A) Representative flow cytometry histogram plot of and graph quantitating GFP loss in RASH-1C cells treated with empty vector or a HMCES sgRNA without or with 4-d dox treatment. (B) Western blot showing HMCES protein levels in RASH-1C cells treated with two different HMCES sgRNAs. (C) Western blot showing HMCES protein levels in 11 independent HMCES KO RASH-1C cell clones. (D) Dot plot of GFP loss in cells shown in C treated with dox for 4 d. (E) Scatter plot of GFP loss versus IgM loss from WT and HMCES KO RASH-1C cell clones treated with dox for 4 d. (F) Western blot showing HMCES protein levels in six independent HMCES KO Ramos cell clones. (G) IgM loss from WT, HMCES KO, or AID KO Ramos cell clones at different time points of culture. (H,I) Deletion frequency in the HTS7 (H) and VDJ (I) regions from WT, HMCES KO, UNG KO, and HMCES/UNG dKO RASH-1C cells with dox for 4 d. Deletion frequency was calculated as (number of deletions/number of sequences) × 100; fold changes compared with WT are marked above the bar. (J,K) Distribution of deletion lengths in HTS7 (J) and VDJ (K) regions from WT, HMCES KO, UNG KO, and HMCES/UNG dKO RASH-1C cells treated with dox for 4 d. Deletions were divided into three groups based on length as indicated and the percentage of sequences with indicated deletion length are shown. Fold changes compared with WT are marked above the bar. (L,M) Deletion frequency in the IGH VDJ (L) and IGL VJ (M) regions from WT, HMCES KO, and AID KO Ramos cells. Single cells were seeded and grown for 6 wk and then harvested for analysis. Fold changes compared with WT are marked above the bar. (N,O) Distribution of deletion lengths in the IGH VDJ (N) and IGL VJ (O) regions from WT, HMCES KO, and AID KO Ramos cells. Deletions were divided into three groups based on length as indicated and the percentage of sequences with the indicated deletion length are shown. Fold changes compared with WT are marked above the bar. (P) Deletion profile in the VDJ region from WT and HMCES KO RASH-1C cells treated with dox for 4 d. AID hotspots are indicated with a vertical line at the X-axis. The CDR1, CDR2, and CDR3 regions are shaded. (Q) Deletion profile in the VDJ region from WT, HMCES KO, and AID KO Ramos cells. AID hotspots are indicated with a vertical line at the X-axis. The CDR1, CDR2, and CDR3 regions are shaded. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using two-tailed Student's t-test for A and D and using one way ANOVA with Dunnett's post-test for G–O. (***) P-value < 0.001, (**) P-value < 0.01, (*) P-value < 0.05, (ns) not significant.
HMCES deficiency leads to increased deletions in SHM target regions. (A) Representative flow cytometry histogram plot of and graph quantitating GFP loss in RASH-1C cells treated with empty vector or a HMCES sgRNA without or with 4-d dox treatment. (B) Western blot showing HMCES protein levels in RASH-1C cells treated with two different HMCES sgRNAs. (C) Western blot showing HMCES protein levels in 11 independent HMCES KO RASH-1C cell clones. (D) Dot plot of GFP loss in cells shown in C treated with dox for 4 d. (E) Scatter plot of GFP loss versus IgM loss from WT and HMCES KO RASH-1C cell clones treated with dox for 4 d. (F) Western blot showing HMCES protein levels in six independent HMCES KO Ramos cell clones. (G) IgM loss from WT, HMCES KO, or AID KO Ramos cell clones at different time points of culture. (H,I) Deletion frequency in the HTS7 (H) and VDJ (I) regions from WT, HMCES KO, UNG KO, and HMCES/UNG dKO RASH-1C cells with dox for 4 d. Deletion frequency was calculated as (number of deletions/number of sequences) × 100; fold changes compared with WT are marked above the bar. (J,K) Distribution of deletion lengths in HTS7 (J) and VDJ (K) regions from WT, HMCES KO, UNG KO, and HMCES/UNG dKO RASH-1C cells treated with dox for 4 d. Deletions were divided into three groups based on length as indicated and the percentage of sequences with indicated deletion length are shown. Fold changes compared with WT are marked above the bar. (L,M) Deletion frequency in the IGH VDJ (L) and IGL VJ (M) regions from WT, HMCES KO, and AID KO Ramos cells. Single cells were seeded and grown for 6 wk and then harvested for analysis. Fold changes compared with WT are marked above the bar. (N,O) Distribution of deletion lengths in the IGH VDJ (N) and IGL VJ (O) regions from WT, HMCES KO, and AID KO Ramos cells. Deletions were divided into three groups based on length as indicated and the percentage of sequences with the indicated deletion length are shown. Fold changes compared with WT are marked above the bar. (P) Deletion profile in the VDJ region from WT and HMCES KO RASH-1C cells treated with dox for 4 d. AID hotspots are indicated with a vertical line at the X-axis. The CDR1, CDR2, and CDR3 regions are shaded. (Q) Deletion profile in the VDJ region from WT, HMCES KO, and AID KO Ramos cells. AID hotspots are indicated with a vertical line at the X-axis. The CDR1, CDR2, and CDR3 regions are shaded. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using two-tailed Student's t-test for A and D and using one way ANOVA with Dunnett's post-test for G–O. (***) P-value < 0.001, (**) P-value < 0.01, (*) P-value < 0.05, (ns) not significant.To ascertain the basis for increased GFP and IgM loss in HMCES-deficient cells, the HTS7 (366-bp) and IGH VDJ (389-bp) regions were PCR-amplified and subjected to high-throughput sequencing. Surprisingly, loss of HMCES in RASH-1C cells did not increase—and in fact tended to decrease—the frequency of point mutations in both HTS7 and VDJ (Supplemental Fig. S2A,B). Instead, deletions were significantly (more than twofold) increased upon knockout of HMCES (Fig. 2H,I). This increase was not observed without dox induction of AID, indicating that loss of HMCES did not detectably increase background (AID-independent) levels of deletions (Fig. 2H,I). HMCES deficiency increased 1-bp deletions, intermediate length deletions (2–20 bp), and long deletions (>20 bp) in both HTS7 and VDJ, with the largest fold increase observed for intermediate length deletions (Fig. 2J,K). Very similar results were obtained from sequencing of IGH VDJ and IGL VJ from WT and HMCES KO Ramos cells (expressing endogenous, WT AID), with the fold increase in deletions caused by loss of HMCES even greater in this context than in RASH-1C (Fig. 2L,M). Essentially all deletions were eliminated in AID KO Ramos cells, with the residual 1-bp deletions likely representing sequencing errors (Fig. 2N,O). As in RASH-1C, loss of HMCES in Ramos did not increase the point mutation frequency, and in this context, little or no significant decrease was detected (Supplemental Fig. S2C,D).In both RASH-1C and Ramos, deletion frequencies were increased across the entire VDJ region and VJ region and peaked in several regions containing clusters of AID hotspots (RGYW or its inverse complement, WRCY; R = A, G; Y = C, T; W = A, T), consistent with the idea that the deletions originate from AID-mediated deamination events (Fig. 2P,Q; Supplemental Fig. S2E). The overall similarities in the shape of the deletion profiles in HMCES-deficient and -proficient cells (Fig. 2P,Q; Supplemental Fig. S2E) argue that loss of HMCES does not strongly perturb AID targeting. These data demonstrate that loss of HMCES results in an AID-dependent increase in deletions and that this increase is observed with WT endogenous AID and overexpressed AID7.3.To determine how this increase in deletions influenced the distribution of events leading to GFP loss, the HTS7-GFP region was amplified from sorted GFP-negative WT and HMCES KO RASH-1C cells and subjected to Sanger sequencing. The percentage of sequences in which GFP loss could be explained by a deletion event increased from 14% in WT to 37%–44% in HMCES KO cells (Supplemental Fig. S2F), providing a ready explanation for the increase in GFP loss seen in HMCES KO cells.A detailed examination of point mutations in the presence and absence of HMCES revealed few if any consistent changes. In RASH-1C, loss of HMCES increased the proportion of transition mutations at G/C at the expense of transversion mutations, but this change was less evident in Ramos (Supplemental Fig. S2G–N). In both RASH-1C and Ramos, loss of HMCES did not substantially alter the distribution of mutations across the IGH VDJ region, and mutations occurred preferentially near AID hotspots (Supplemental Fig. S3A–C). The frequency of insertions was significantly increased in Ramos but not in RASH-1C when HMCES was knocked out (Supplemental Fig. S3D–G). Together, these data demonstrate that disruption of HMCES in Ramos and Ramos derivatives leads to an increase in GFP loss and IgM loss, a phenotype readily explained by a consistent large increase in deletions in SHM target regions, but that changes in mutation frequency and spectrum and in insertion frequency tend to be modest or not significant and are influenced by whether the cells express hyperactive or WT AID.
The function of HMCES in SHM is dependent on abasic site cross-linking and ssDNA binding residues
HMCES (Fig. 3A) contains a well-conserved N-terminal SRAP domain (amino acids 1–270) and a C-terminal region with three proliferating cell nuclear antigen (PCNA)-interacting protein (PIP) motifs implicated in interaction with PCNA and possibly other DNA repair factors (Boehm and Washington 2016) and recruitment of HMCES to DNA replication forks (Aravind et al. 2013; Mohni et al. 2019). The invariant cysteine at position 2 (C2) is required for HMCES to cross-link to abasic sites in DNA and does so by forming a thiazolidine linkage with the deoxyribose of the abasic site (Halabelian et al. 2019; Mohni et al. 2019; Thompson et al. 2019; Wang et al. 2019). Cross-linking requires prior removal of the amino acid N-terminal to C2, which can be accomplished by HMCES autopeptidase activity using catalytic residues C2, E127, and H210 and likely by other peptidases as well (Kweon et al. 2017; Halabelian et al. 2019; Thompson et al. 2019; Wang et al. 2019). Structural analysis of the HMCES SRAP domain in complex with 3′ overhang DNA revealed that R98 and R212 form a single-strand (ss) DNA binding cleft, that E127 and H210 are closely juxtaposed to C2 and the site of cross-linking, and that W81, F92, and R106 contribute to the interaction of HMCES with DNA near the ssDNA–dsDNA junction (Halabelian et al. 2019).
Figure 3.
The function of HMCES in SHM is dependent on abasic site cross-linking and ssDNA binding residues. (A) Schematic of human HMCES protein. (B) Western blot of HMCES and AID in WT or HMCES KO RASH-1C cells reconstituted with WT HMCES or its mutants and induced with dox for 4 d. (C,D) WT HMCES or the indicated HMCES mutants were expressed in RASH-1C cells in which HMCES was knocked out (RASH-1C HMCES KO) or intact (RASH-1C WT). Cells induced with dox for 4 d were assayed for GFP loss. (E) WT HMCES or the indicated HMCES mutants were expressed in HMCES KO RASH-1C cells. Cells induced with dox for 4 d were assayed for IgM loss. (F,G) Deletion frequencies in HTS7 (F) and VDJ (G) from HMCES KO RASH-1C cells reconstituted with WT HMCES or its mutants or overexpressing UNG2, as indicated. (H,I) Distribution of deletion lengths in the HTS7 (H) and VDJ (I) regions from HMCES KO RASH-1C cells reconstituted with WT HMCES or its mutants or overexpressing UNG2. (J,K) Point mutation frequencies in the HTS7 (J) and VDJ (K) regions from HMCES KO RASH-1C cells reconstituted with WT HMCES or its mutants or overexpressing UNG2. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using one-way ANOVA with Dunnett's post-test. (***) P-value < 0.001, (**) P-value < 0.01, (*) P-value < 0.05, (ns) not significant.
The function of HMCES in SHM is dependent on abasic site cross-linking and ssDNA binding residues. (A) Schematic of human HMCES protein. (B) Western blot of HMCES and AID in WT or HMCES KO RASH-1C cells reconstituted with WT HMCES or its mutants and induced with dox for 4 d. (C,D) WT HMCES or the indicated HMCES mutants were expressed in RASH-1C cells in which HMCES was knocked out (RASH-1C HMCES KO) or intact (RASH-1C WT). Cells induced with dox for 4 d were assayed for GFP loss. (E) WT HMCES or the indicated HMCES mutants were expressed in HMCES KO RASH-1C cells. Cells induced with dox for 4 d were assayed for IgM loss. (F,G) Deletion frequencies in HTS7 (F) and VDJ (G) from HMCES KO RASH-1C cells reconstituted with WT HMCES or its mutants or overexpressing UNG2, as indicated. (H,I) Distribution of deletion lengths in the HTS7 (H) and VDJ (I) regions from HMCES KO RASH-1C cells reconstituted with WT HMCES or its mutants or overexpressing UNG2. (J,K) Point mutation frequencies in the HTS7 (J) and VDJ (K) regions from HMCES KO RASH-1C cells reconstituted with WT HMCES or its mutants or overexpressing UNG2. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using one-way ANOVA with Dunnett's post-test. (***) P-value < 0.001, (**) P-value < 0.01, (*) P-value < 0.05, (ns) not significant.To investigate the mechanism by which HMCES functions in SHM, we expressed WT HMCES or HMCES variants in WT or HMCES KO RASH-1C cells (Fig. 3B). We found that the HMCES KO GFP loss phenotype could be fully rescued by reconstitution with WT HMCES or a C-terminal truncated protein containing only the SRAP domain (amino acids 1–270) (Fig. 3C), indicating that the C-terminal region is not required for HMCES function in SHM, as is also the case for HMCES function in CSR (Shukla et al. 2020). However, no reconstitution was observed with C2A point mutant HMCES (Fig. 3C), in striking contrast to CSR in which HMCES function is unaffected by the C2A mutation (Shukla et al. 2020). Overexpression of HMCES or the C2A or 1–270 mutants in WT cells had no significant effect on GFP loss, suggesting that protein levels of HMCES are saturating in these cells and arguing against a dominant-negative phenotype for the C2A mutant (Fig. 3C).Analysis of DNA binding and autopeptidase catalytic residue mutants revealed a range of phenotypes. While R98A, R98E, and R212A single mutants exhibited substantial reconstitution activity, mutation of R212 to E strongly diminished activity, and double mutations of R98 and R212 to A or E eliminated HMCES activity in the GFP loss assay, indicating that the ssDNA binding cleft is important for HMCES function in SHM, as also observed for HMCES function in CSR (Fig. 3D; Supplemental Fig. S4A,B). Further analyses confirmed the lack of activity for C2A and R98E/R212E HMCES mutants, demonstrating that they cannot reconstitute the IgM loss (Fig. 3E), deletion frequency (Fig. 3F–I), point mutation frequency (Fig. 3J,K), or point mutation spectrum (Supplemental Fig. S4C–F) phenotypes in HMCES KO RASH-1C cells. Neither single mutation of W81, F92, or R106 nor triple mutation of these residues affected HMCES activity in the GFP loss assay (Supplemental Fig. S4G,H), arguing that the ssDNA–dsDNA junction binding region of HMCES is not essential for SHM function. Similarly, neither E127A nor H210E mutations compromised HMCES function in the GFP loss assay (Fig. 3D; Supplemental Fig. S4I), indicating that HMCES autopeptidase activity is not required and that a cellular peptidase activity would be predicted to be able to remove the first amino acids of HMCES to enable C2 cross-linking activity. We did not detect an effect on GFP loss of overexpression of any of the mutants in WT RASH-1C cells except for E127A, which reduced GFP loss (Supplemental Fig. S4I–K), an effect that might be attributable to the increased DNA binding observed when this residue is mutated (see the Discussion). In summary, mutational analysis of HMCES demonstrates that its function in SHM in Ramos cells requires the abasic site cross-linking residue C2, is supported by its ssDNA binding surface, and is independent of mutations that alter ssDNA–dsDNA binding residues and of autopeptidase activity.
The function of HMCES in SHM is dependent on the activity of UNG
UNG is the major uracil glycosylase in SHM and should be the primary source of abasic sites during the reaction (Fig. 1A; Di Noia et al. 2006; Methot and Di Noia 2017). If HMCES functions in SHM by cross-linking to abasic sites, as suggested by the requirement for C2, then HMCES should act downstream from UNG and be strongly dependent on UNG to exert its phenotype. To test this prediction, we compared the phenotypes of UNG KO and UNG/HMCES double-KO (dKO) cells. UNG deficiency led to increases in GFP and IgM loss in RASH-1C cells (Fig. 4A,B), as expected from prior studies showing that mutation frequencies increase when UNG activity is reduced or eliminated (Di Noia and Neuberger 2002; Rada et al. 2004; Liu et al. 2008; Feng et al. 2021; Rogier et al. 2021). The increases were comparable in magnitude with those observed in HMCES KO cells. Elimination of UNG decreased deletion frequencies dramatically, almost to the level seen in cells without AID induction (Fig. 2H,I). Strikingly, relative to UNG single-KO cells, HMCES/UNG dKO cells showed no increase in GFP or IgM loss (Fig. 4A,B), no increase in deletion frequency (Fig. 2H–K), no alteration in mutation frequency or spectrum (Supplemental Fig. S2A,B,G–J), and no difference in insertion frequency (Supplemental Fig. S3D,E). Indeed, UNG/HMCES dKO cells closely phenocopied UNG KO cells in all parameters measured.
Figure 4.
The function of HMCES in SHM is dependent on the activity of UNG. (A,B) WT, HMCES KO, UNG KO, and HMCES/UNG dKO (double-KO) RASH-1C cells with dox for 4 d were assayed for GFP loss (A) and IgM loss (B). (C) UNG KO or WT RASH-1C cells overexpressing UNG2 induced with dox for 4 d were assayed for GFP loss. (D) HMCES or UNG2 was overexpressed in WT or HMCES KO RASH-1C cells. Cells induced with dox for 4 d were assayed for GFP loss. (E) HMCES and UNG dKO RASH-1C cells overexpressing UNG2 or HMCES induced with dox for 4 d were assayed for GFP loss. (F,G) Point mutation frequencies in the HTS7 (F) and VDJ (G) regions from WT RASH-1C cells overexpressing UNG2. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using one-way ANOVA with Dunnett's post-test for A, B, D, and E and using two-tailed Student's t-test for C, F, and G. (***) P-value < 0.001, (**) P-value < 0.01, (*) P-value < 0.05, (ns) not significant.
The function of HMCES in SHM is dependent on the activity of UNG. (A,B) WT, HMCES KO, UNG KO, and HMCES/UNG dKO (double-KO) RASH-1C cells with dox for 4 d were assayed for GFP loss (A) and IgM loss (B). (C) UNG KO or WT RASH-1C cells overexpressing UNG2 induced with dox for 4 d were assayed for GFP loss. (D) HMCES or UNG2 was overexpressed in WT or HMCES KO RASH-1C cells. Cells induced with dox for 4 d were assayed for GFP loss. (E) HMCES and UNG dKO RASH-1C cells overexpressing UNG2 or HMCES induced with dox for 4 d were assayed for GFP loss. (F,G) Point mutation frequencies in the HTS7 (F) and VDJ (G) regions from WT RASH-1C cells overexpressing UNG2. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using one-way ANOVA with Dunnett's post-test for A, B, D, and E and using two-tailed Student's t-test for C, F, and G. (***) P-value < 0.001, (**) P-value < 0.01, (*) P-value < 0.05, (ns) not significant.Reconstitution and overexpression experiments in RASH-1C cells further emphasized the lack of a HMCES phenotype in the absence of UNG and demonstrated that UNG exerts its effects in the absence of HMCES. Ectopic expression of UNG2 (the nuclear form of UNG) in WT, UNG KO, HMCES KO, and UNG/HMCES dKO cells reduced GFP loss (Fig. 4C–E; Supplemental Fig. S5A–C). In contrast, HMCES re-expression in UNG/HMCES dKO cells had no effect on the frequency of GFP loss (Fig. 4E) while it reduced GFP loss in HMCES single-KO cells, as expected (Fig. 4D). Ectopic UNG2 expression reduced mutation frequencies (particularly for G/C transitions) in both WT and HMCES KO cells (Figs. 3J,K, 4F,G; Supplemental Figs. S4C,D, S5D,E), demonstrating that the high-fidelity repair pathways initiated by UNG are not HMCES-dependent. UNG hyperactivity or overexpression has been shown to reduce mutation frequencies in mice and cell lines (Feng et al. 2021; Rogier et al. 2021). Ectopic UNG2 expression showed a tendency to increase deletion frequencies in both WT and HMCES KO cells, but 1-bp deletions were selectively and significantly increased only in HMCES KO cells, suggesting that HMCES protects against 1-bp deletions initiated by UNG (Fig. 3H,I; Supplemental Fig. S5F–I). We conclude that HMCES function in SHM is entirely dependent on UNG, a conclusion consistent with a model in which HMCES protects against deletions by acting on the abasic sites created by UNG. In contrast, UNG is able to alter mutation frequency and spectrum in the absence of HMCES.
APE2 contributes to deletions that arise in the absence of HMCES
By cross-linking to abasic sites, HMCES is thought to protect abasic sites from the action of endonucleases and translesion polymerases during DNA replication and, in biochemical assays, blocks cleavage by AP endonuclease APE1 (Halabelian et al. 2019; Mohni et al. 2019; Thompson et al. 2019; Wang et al. 2019; Srivastava et al. 2020). Our data raise the possibility that HMCES performs a similar function during SHM. If this is the case, then the excessive deletions that occur in the absence of HMCES would be predicted to be dependent on the action of APEs. APE2 is up-regulated in germinal center B cells and contributes to efficient point mutation accumulation and the generation of indels during SHM, while APE1, the more efficient endonuclease, is down-regulated in germinal center B cells and functions in high-fidelity short patch BER (Hegde et al. 2008; Sabouri et al. 2009; Stavnezer et al. 2014; Roco et al. 2019).To test the prediction that APEs contribute to the excessive deletions that occur in the absence of HMCES, we knocked out APE1 and APE2 in WT and HMCES KO RASH-1C cells. Knockout of APE2 but not APE1 significantly reduced GFP loss and IgM loss in both the WT and HMCES KO context (Fig. 5A,B). Strikingly, deficiency for APE2 but not APE1 led to a significant reduction in deletion frequencies in HTS7 and VDJ in HMCES KO but not WT cells (Fig. 5C,D). This decrease was consistently observed and significant for intermediate length and long deletions but not for 1-bp deletions (Fig. 5E,F). In some cases, APE1 KO actually resulted in a significant increase in deletions, again observed only in the HMCES KO context (Fig. 5C–F). APE2 deficiency significantly reduced point mutation frequencies in WT and HMCES KO RASH-1C cells (Fig. 5G,H), with the decreases most significantly affecting G/C transversions and A/T mutations (Supplemental Fig. S6A,B). In contrast, APE1 KO led to an increase in mutation frequency specifically in the HMCES KO context (Fig. 5G,H; Supplemental Fig. S6A,B). Insertion frequencies decreased in some contexts but not others upon knockout of APE1 or APE2 (Supplemental Fig. S6C,D). These findings demonstrate that APE2 but not APE1 contributes to the deletions that accumulate in the absence of HMCES. They further suggest that APE1 protects against even higher levels of deletions in HMCES KO cells, which we speculate might occur by virtue of competition between the high-fidelity action of APE1 and the more deletion-prone action of APE2. Competition between APE1 and APE2 in the HMCES-deficient context is also consistent with our observation that while APE2 KO leads to reduced point mutations, APE1 KO has the opposite effect. Finally, to our knowledge, these findings provide the first analysis of SHM in the complete absence of APE1, revealing that it does not contribute significantly to deletions or point mutation frequency or spectrum in HMCES-sufficient cells.
Figure 5.
APE2 contributes to deletions that arise in the absence of HMCES. (A,B) APE1 or APE2 were knocked out in WT or HMCES KO RASH-1C cells, and cells were assayed for GFP loss (A) or IgM loss (B) after being induced with dox for 4 d. (C,D) Deletion frequencies in the HTS7 (C) and VDJ (D) regions from WT and HMCES KO RASH-1C cells with either APE1 or APE2 knocked out. (E,F) Distribution of deletion lengths in the HTS7 (E) and VDJ (F) regions from WT and HMCES KO RASH-1C cells with either APE1 or APE2 knocked out. (G,H) Point mutation frequencies in the HTS7 (G) and VDJ (H) regions from WT and HMCES KO RASH-1C cells with either APE1 or APE2 knocked out. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using one-way ANOVA with Dunnett's post-test. (***) P-value < 0.001, (**) P-value < 0.01, (*) P-value < 0.05, (ns) not significant.
APE2 contributes to deletions that arise in the absence of HMCES. (A,B) APE1 or APE2 were knocked out in WT or HMCES KO RASH-1C cells, and cells were assayed for GFP loss (A) or IgM loss (B) after being induced with dox for 4 d. (C,D) Deletion frequencies in the HTS7 (C) and VDJ (D) regions from WT and HMCES KO RASH-1C cells with either APE1 or APE2 knocked out. (E,F) Distribution of deletion lengths in the HTS7 (E) and VDJ (F) regions from WT and HMCES KO RASH-1C cells with either APE1 or APE2 knocked out. (G,H) Point mutation frequencies in the HTS7 (G) and VDJ (H) regions from WT and HMCES KO RASH-1C cells with either APE1 or APE2 knocked out. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using one-way ANOVA with Dunnett's post-test. (***) P-value < 0.001, (**) P-value < 0.01, (*) P-value < 0.05, (ns) not significant.
Mismatch repair factors contribute to deletions caused by HMCES deficiency
The data presented above indicate that a pathway involving UNG and APE2 contributes to the excess deletions that arise in the absence of HMCES. Because UNG and MMR factors act in concert in some SHM repair pathways to generate G/C transversion and A/T mutations and in CSR to generate DSBs (Frieder et al. 2009; Schrader et al. 2009; Methot and Di Noia 2017; Pilzecker and Jacobs 2019), we considered the possibility that they might also collaborate in the creation of deletions. We therefore assessed the consequences of knocking out MSH2 and MSH6 (which together mediate U:G mismatch recognition to initiate MMR) as well as EXO1 (which acts downstream from MSH2/6 to resect one DNA strand and create ssDNA gaps) (Fig. 6A–C). In both WT and HMCES KO RASH-1C cells, MSH2, MSH6, and EXO1 deficiency each reduced GFP loss, with larger effects observed in the HMCES KO cells (Fig. 6D). In line with these results, knockout of MSH2, MSH6, and EXO1 significantly and consistently reduced deletion frequencies in the HMCES-deficient context, with the effect strongest for MSH2 and MSH6 loss (Fig. 6E,F). Loss of these factors also appears to reduce deletions in HMCES-sufficient cells (Fig. 6E,F). As was the case for APE2, MSH2, MSH6, and EXO1 contributed strongly to intermediate and long deletions but had modest or insignificant effects on 1-bp deletions (Fig. 6G,H). Loss of the MMR factors reduced total point mutation frequencies as expected, primarily by reducing the frequency of G/C transversion mutations with a less consistent effect on A/T mutations (Supplemental Fig. S7A–H). Hence, the MMR pathway contributes less strongly to A/T mutagenesis in Ramos than in germinal center B cells (Bardwell et al. 2004; Rada et al. 2004; Shen et al. 2006), providing a plausible explanation for the known underrepresentation of A/T mutations in the Ramos SHM mutation spectrum (Sale and Neuberger 1998). Interestingly, we observed consistent decreases in insertion frequencies upon knockout of MSH2, MSH6, or EXO1 in HMCES-deficient RASH-1 cells, suggesting that the MMR pathway contributes to insertions as well as deletions when HMCES is absent (Supplemental Fig. S7I,J). In summary, our results demonstrate that MSH2, MSH6, and EXO1 contribute strongly to the generation of excessive deletions >1 bp in length that occurs in the absence of HMCES.
Figure 6.
Mismatch repair factors contribute to deletions caused by HMCES deficiency. (A–C) MSH2, MSH6, or EXO1 was knocked out in WT or HMCES KO RASH-1C cells. Western blots of MSH2 (A), MSH6 (B), and EXO1(C) in WT or HMCES KO RASH-1C cells. (D) MSH2, MSH6, or EXO1 was knocked out in WT or HMCES KO RASH-1C cells, and cells induced with dox for 4 d were assayed for GFP loss. (E,F) Deletion frequencies in the HTS7 (E) and VDJ (F) regions from WT and HMCES KO RASH-1C cells with MSH2, MSH6, or EXO1 knocked out. (G,H) Distribution of deletion lengths in the HTS7 (G) and VDJ (H) regions from WT or HMCES KO RASH-1C cells with MSH2, MSH6, or EXO1 knocked out. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using one-way ANOVA with Dunnett's post-test. (***) P-value < 0.001, (**) P-value < 0.01, (*) P-value < 0.05, (ns) not significant.
Mismatch repair factors contribute to deletions caused by HMCES deficiency. (A–C) MSH2, MSH6, or EXO1 was knocked out in WT or HMCES KO RASH-1C cells. Western blots of MSH2 (A), MSH6 (B), and EXO1(C) in WT or HMCES KO RASH-1C cells. (D) MSH2, MSH6, or EXO1 was knocked out in WT or HMCES KO RASH-1C cells, and cells induced with dox for 4 d were assayed for GFP loss. (E,F) Deletion frequencies in the HTS7 (E) and VDJ (F) regions from WT and HMCES KO RASH-1C cells with MSH2, MSH6, or EXO1 knocked out. (G,H) Distribution of deletion lengths in the HTS7 (G) and VDJ (H) regions from WT or HMCES KO RASH-1C cells with MSH2, MSH6, or EXO1 knocked out. Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using one-way ANOVA with Dunnett's post-test. (***) P-value < 0.001, (**) P-value < 0.01, (*) P-value < 0.05, (ns) not significant.
HMCES deficiency leads to increased deletions in mouse germinal center B cells
Hmces−/− mice were previously shown to exhibit normal parameters of hematopoiesis and normal numbers of splenic B cells, T cells, and germinal center B cells after immunization, but have a B cell-intrinsic defect in CSR (Shukla et al. 2020). We characterized SHM in Hmce−/− and control Hmces+/− and Hmces+/+ mice by analyzing mutations that accumulated in the region downstream from the IGH J4 gene segment in germinal center B cells from spleens of sheep red blood cell-immunized mice and Peyer's patches (PPs) of unimmunized mice. This analysis revealed an approximately sixfold increase in deletions in both the splenic and PP-derived B cells that was due almost entirely to increases in deletions >1 bp in length (Fig. 7A–D). This striking increase in deletions occurred in the absence of significant changes in the frequency of insertions or point mutations or significant alterations in the point mutation spectrum (Fig. 7E–J).
Figure 7.
HMCES deficiency leads to increased deletions in mouse germinal center B cells. (A,B) Deletion frequencies in the region downstream from the Igh J4 gene segment (JH4 intron) in germinal center B (GCB) cells from either the spleen (A) or Peyer's patch (B) of Hmces−/− and control Hmces+/− and Hmces+/+ mice. (C,D) Distribution of deletion lengths in the JH4 intron in splenic (C) and Peyer's patch GCB (D) cells from Hmces−/− and control Hmces+/+ and Hmces+/− mice. (E,F) Insertion frequencies in the JH4 intron in splenic (E) and Peyer's patch GCB (F) cells from Hmces−/− and control Hmces+/+ and Hmces+/− mice. (G,H) Point mutation frequencies in the JH4 intron in splenic (G) and Peyer's patch GCB (H) cells from Hmces−/− and control Hmces+/+ and Hmces+/− mice. (I,J) Mutation frequencies for G/C transitions, G/C transversions, and A/T mutations in the JH4 intron in splenic (I) or Peyer's patch GCB (J) cells from Hmces−/− and control Hmces+/+ and Hmces+/− mice. (K) Flow cytometry plots of GCB cells gated as the FAS+ CD38− population among total B cells in spleens of Hmces+/+ and Hmces−/− mice 14 d after immunization with NP-CGG. (L,M) Bar graphs showing the frequencies of GCB cells in Hmces+/+ or Hmces+/− and Hmces−/− mice in spleens (L) or Peyer's patches (M). (N) Bar graphs quantifying the levels of low- and high-affinity IgM antibodies measured by ELISA in the sera of Hmces−/− and control Hmces+/+ and Hmces+/− mice. NP-BSA with a conjugation ratio of 27 and 2 was used to detect low-and high-affinity antibodies, respectively. (O) Working model for the mechanism by which HMCES suppresses deletions during SHM. After AID acts, MSH2/6, EXO1, and likely other factors can resect one strand to expose a region of ssDNA containing a U residue, on which UNG can act to create an abasic site (AP site). Cleavage of this vulnerable intermediate by APE2 would result in a DNA double-strand break (DSB), which would be prone to end resection by exonucleases (including APE2 itself) and to generating a deletion. HMCES protects against this outcome by forming a covalent cross-link with the abasic site, thereby blocking cleavage by APE2. The events that occur subsequent to HMCES cross-linking are not known but could involve translesion synthesis (TLS) by an error-prone polymerase. The pathway depicted here is just one of the pathways depicted in Figure 1A (the second from the top). Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using two-tailed Student's t-test. (***) |P-value < 0.001, (**) P-value < 0.01, (ns) not significant.
HMCES deficiency leads to increased deletions in mouse germinal center B cells. (A,B) Deletion frequencies in the region downstream from the Igh J4 gene segment (JH4 intron) in germinal center B (GCB) cells from either the spleen (A) or Peyer's patch (B) of Hmces−/− and control Hmces+/− and Hmces+/+ mice. (C,D) Distribution of deletion lengths in the JH4 intron in splenic (C) and Peyer's patch GCB (D) cells from Hmces−/− and control Hmces+/+ and Hmces+/− mice. (E,F) Insertion frequencies in the JH4 intron in splenic (E) and Peyer's patch GCB (F) cells from Hmces−/− and control Hmces+/+ and Hmces+/− mice. (G,H) Point mutation frequencies in the JH4 intron in splenic (G) and Peyer's patch GCB (H) cells from Hmces−/− and control Hmces+/+ and Hmces+/− mice. (I,J) Mutation frequencies for G/C transitions, G/C transversions, and A/T mutations in the JH4 intron in splenic (I) or Peyer's patch GCB (J) cells from Hmces−/− and control Hmces+/+ and Hmces+/− mice. (K) Flow cytometry plots of GCB cells gated as the FAS+ CD38− population among total B cells in spleens of Hmces+/+ and Hmces−/− mice 14 d after immunization with NP-CGG. (L,M) Bar graphs showing the frequencies of GCB cells in Hmces+/+ or Hmces+/− and Hmces−/− mice in spleens (L) or Peyer's patches (M). (N) Bar graphs quantifying the levels of low- and high-affinity IgM antibodies measured by ELISA in the sera of Hmces−/− and control Hmces+/+ and Hmces+/− mice. NP-BSA with a conjugation ratio of 27 and 2 was used to detect low-and high-affinity antibodies, respectively. (O) Working model for the mechanism by which HMCES suppresses deletions during SHM. After AID acts, MSH2/6, EXO1, and likely other factors can resect one strand to expose a region of ssDNA containing a U residue, on which UNG can act to create an abasic site (AP site). Cleavage of this vulnerable intermediate by APE2 would result in a DNA double-strand break (DSB), which would be prone to end resection by exonucleases (including APE2 itself) and to generating a deletion. HMCES protects against this outcome by forming a covalent cross-link with the abasic site, thereby blocking cleavage by APE2. The events that occur subsequent to HMCES cross-linking are not known but could involve translesion synthesis (TLS) by an error-prone polymerase. The pathway depicted here is just one of the pathways depicted in Figure 1A (the second from the top). Throughout the figure, data are presented with the bars representing mean and the error bars as ±SD. Statistical significance was calculated using two-tailed Student's t-test. (***) |P-value < 0.001, (**) P-value < 0.01, (ns) not significant.We reasoned that the generation of excessive deletions in Hmces-deficient mice might compromise the humoral immune response and therefore tested the antigen-specific immune response. While the frequency of germinal center B cells was normal in 4-hydroxyl nitrophenyl (NP)-conjugated ovalbumin (NP-OVA) immunized Hmces−/− mice (Fig. 7K–M), the NP hapten-specific serum antibody response was blunted, particularly for high-affinity antibody (Fig. 7N). We conclude that HMCES protects mouse germinal center B cells and human Ramos B cells from the accumulation of deletions in SHM target regions during SHM, thereby facilitating the antigen-specific antibody response. Furthermore, we conclude that HMCES exerts this protective effect while having modest or insignificant effects on other aspects of the SHM reaction.
Discussion
CSR targets long, repetitive, AID hotspot-rich switch regions and is designed to generate dsDNA breaks and deletions. These deletions can occur productively between two switch regions or nonproductively within a single switch region, with the latter readily tolerated because switch regions lie within introns (Chen et al. 2001; Reina-San-Martin et al. 2003; Chaudhuri and Alt 2004). While SHM also generates deletions, they accumulate to much lower levels in an in-frame, productive IGH V region than in a matched, out-of-frame V region, indicating that they are strongly selected against, presumably because of the damage they inflict on the open reading frame (Yeap et al. 2015). Insertions occur at lower frequencies than deletions during SHM and therefore appear to be less of a threat to reading frame integrity (Yeap et al. 2015). Evolution of a mechanism to specifically limit deletions during SHM without interfering with mutation accumulation would therefore potentially be advantageous for effective humoral immunity. Our findings demonstrate that such a mechanism does indeed exist in both human and mouse B cells, that it is mediated by HMCES acting downstream from UNG, and that in the absence of this mechanism, antibody affinity maturation is compromised. A particularly intriguing aspect of our findings is that the primary mechanisms of action of HMCES in SHM and CSR are distinct—the former but not the latter requiring the abasic site cross-linking residue C2. Hence, as discussed below, the function of HMCES during SHM is likely to resemble more closely its abasic site-protective function during DNA replication than its end-joining-promoting function during CSR.The RASH cell lines described here provide multiple features that should facilitate future studies of SHM. Tightly regulated expression of a highly active form of AID combined with insertion of a sensitized SHM reporter cassette into the IGH locus supports robust, facile, and rapid analysis of SHM in a physiological genomic context. These features, combined with efficient CRISPR/Cas9 targeting of the Ramos genome, allow for rapid generation and analysis of mutant phenotypes and should enable genome-wide screens for factors involved in SHM. Use of a constitutive, heterologous promoter to drive HTS7-GFP expression protects reporter transcription against manipulations that alter the transcriptional activity of endogenous IGH locus control elements and distinguishes RASH from the Ramos IGH locus reporter system of Wang et al. (2014). Inefficient mutation of A/T residues and the reduced contribution of MMR factors to A/T mutagenesis in Ramos cells relative to GCB cells represents one potential limitation of the RASH assay system.Our analysis of HMCES mutants revealed several differences between the protein features necessary for its function in SHM as compared with CSR and at DNA replication forks. The HMCES C-terminal region and the PIP motifs harbored therein are dispensable for SHM and CSR but required for HMCES function in DNA replication, while C2 is required for SHM and function in DNA replication but not CSR (Mohni et al. 2019; Shukla et al. 2020). The ssDNA binding surface is required for all identified HMCES activities but is more resilient to mutation in SHM than in CSR or DNA replication. R212E and double mutants of R98 and R212 to A or E reduce or eliminate HMCES function in SHM, demonstrating the essential contribution of these ssDNA binding residues. However, R98A, R98E, and R212A single mutants retain substantial function in SHM but show defects in ssDNA binding in vitro (Halabelian et al. 2019; Mohni et al. 2019), and R98E loses function in DNA repair after ionizing radiation (Mohni et al. 2019), while R212A fails to reconstitute CSR (Shukla et al. 2020). Furthermore, several residues involved in binding the ssDNA–dsDNA junction in the HMCES–DNA crystal structure (W81, F92, and R106) (Halabelian et al. 2019) are not required for SHM despite the fact that mutation of the corresponding residues in yedK compromises DNA binding (Wang et al. 2019). These comparisons argue that the HMCES–DNA interaction is strongly context-dependent and suggest the possibility that during SHM, other interactions or factors render HMCES less dependent on canonical modes of DNA binding. We cannot rule out that protein overexpression observed in our reconstitution system augments the function of some HMCES mutants, although, with one exception discussed below, overexpression of WT or mutant HMCES proteins has no discernable effect on GFP loss in cells expressing endogenous HMCES (Fig. 3C; Supplemental Fig. S4J,K).Mutation of autopeptidase catalytic residues C2, E127, and H210 yielded strikingly divergent results, with C2 required and E127 and H210 dispensable for HMCES function in SHM. Importantly, of these three residues, only C2 is required for abasic site cross-linking, since E. coli yedK mutants at residue E105 or H160 (which correspond to HMCES E127 and H210) retain substantial cross-linking activity (Thompson et al. 2019; Wang et al. 2019). Hence, our data argue strongly that HMCES cross-linking activity but not autopeptidase activity is required for function during SHM and that a cellular peptidase activity can remove the N-terminal amino acid of HMCES to expose the amino group on C2 required for cross-linking. Interestingly, HMCES E127A and yedK E105A mutants exhibit strongly increased ssDNA binding activity in vitro, apparently through neutralization of charge repulsion between the glutamate side chain and DNA phosphate backbone (Halabelian et al. 2019; Wang et al. 2019). The significant reduction in GFP loss that we observed due to overexpression of HMCES E127A in WT cells (Supplemental Fig. S4I) might therefore be explained by increased HMCES ssDNA binding and protection of abasic sites. Avid ssDNA binding might help HMCES displace UNG from abasic sites, which are bound by UNG even more tightly than its U substrate (Parikh et al. 1998). Release of glycosylases from abasic sites is the rate-limiting step in BER (Jacobs and Schär 2012).HMCES has previously been demonstrated to work through two distinct mechanisms. In the first, HMCES cross-links to abasic sites at stalled DNA replication forks, thereby preventing the genome-destabilizing action of endonucleases and translesion polymerases (Mohni et al. 2019; Mehta et al. 2020; Srivastava et al. 2020). In the second, HMCES facilitates alternative end joining by a mechanism that appears to involve ssDNA end binding and protection (Shukla et al. 2020). Multiple aspects of our findings indicate that the primary function of HMCES during SHM more closely parallels the first mechanism than the second. First, HMCES function in SHM is completely dependent on abasic site cross-linking residue C2. Second, HMCES deficiency has no detectable phenotype in the absence of UNG, consistent with HMCES acting on abasic sites. Third, during SHM, HMCES protects against a deletion pathway that is strongly dependent on APE2, which cleaves DNA at abasic sites. The HMCES–abasic site cross-link blocks endonucleolytic cleavage (e.g., by APE1) (Mohni et al. 2019), and APE2 is the dominant AP endonuclease involved in point mutation and deletion generation during SHM (Sabouri et al. 2009; Stavnezer et al. 2014). Hence, by cross-linking to abasic sites generated by UNG, HMCES could regulate the action of APE2 to protect against deletions. Fourth, our finding that HMCES selectively regulates deletion formation during SHM is fully consistent with the established role for HMCES–abasic site cross-links in suppressing dsDNA breaks during DNA replication (Mohni et al. 2019), including dsDNA breaks triggered by the replication-coupled action of the cytidine deaminase APOBEC3A (Mehta et al. 2020).The function of HMCES in SHM and CSR is united in one respect: its independence from the HMCES C-terminal region and presumably from recruitment by PCNA to replication forks. AID initiates both SHM and CSR in the G1 phase of the cell cycle, and UNG is also thought to act predominantly in G1 during SHM (Sharbeen et al. 2012; Casellas et al. 2016; Wang et al. 2017). It is plausible that HMCES also acts in G1 during SHM, although this remains to be determined.The excess deletions caused by loss of HMCES are almost completely eliminated by deficiency in APE2, MSH2, or MSH6, with the exception of 1-bp deletions (Figs. 5, 6), indicating that the mechanisms that generate most deletions in the absence of HMCES involve all three of these factors. Cooperation between the BER and MMR pathways is well established for both CSR and SHM (Methot and Di Noia 2017). In CSR, MMR-dependent exonuclease activity is thought to facilitate the conversion of ssDNA nicks created by BER enzymes into dsDNA breaks (Stavnezer and Schrader 2006; Methot and Di Noia 2017). In SHM, a noncanonical hybrid BER/MMR pathway appears to be responsible for about half of transversion mutations at G/C and is proposed to involve MMR-mediated strand resection to create ssDNA gaps in which the retained DNA strand contains U (Frieder et al. 2009; Thientosapol et al. 2017, 2018; Pilzecker and Jacobs 2019; Feng et al. 2020). The importance of recruitment of MMR factors for efficient SHM was highlighted in recent studies of Fam72-deficient mice and cells in which hyperactive UNG results in dramatic reductions in SHM and CSR (Feng et al. 2021; Rogier et al. 2021), apparently by quickly eliminating U:G mismatches and thereby subverting MMR engagement (Feng et al. 2021).Based on our findings and these considerations, we propose a model in which HMCES acts to protect a particularly vulnerable ssDNA intermediate created during SHM (Fig. 7O). In this model, uracils generated by AID trigger MMR factor-dependent processing to generate an exposed ssDNA region that either contains U or is deaminated by AID to generate U. After UNG converts the U to an abasic site, HMCES binds the ssDNA region and forms a covalent cross-link with the abasic site, thereby preventing cleavage by APE2 and the generation of a dsDNA break, which could readily yield a deletion upon repair. Notably, APE2 possesses robust 3′-to-5′ exonuclease activity (Burkovics et al. 2006, 2009) that could contribute to the generation of deletions. How the HMCES–abasic site cross-link is subsequently processed during SHM and with what outcome is not known; ubiquitin-mediated proteolysis of HMCES followed by translesion polymerase gap filling is one plausible possibility. This model, while speculative, accommodates the strong ssDNA binding preference of HMCES, the requirement for HMCES C2, HMCES's dependency on UNG to exert its effects, and the major contribution made by both APE2 and MMR factors to the deletions that accumulate in the absence of HMCES. The model is not meant to exclude the participation of HMCES in other pathways and in particular might not explain the origin of many of the 1-bp deletions that arise in the absence of HMCES. One previously proposed source of deletions during SHM is dsDNA breaks created by closely opposed nicks on opposite DNA strands (Yeap and Meng 2019).Our data and that of Shukla et al. (2020) indicate that the principal function of HMCES during SHM is a C2-dependent suppression of deletions and during CSR is a C2-independent facilitation of alternative end joining. These functions reflect the different mechanistic imperatives of SHM and CSR, with the former proceeding primarily through ssDNA lesions and seeking to avoid dsDNA breaks and the latter requiring a dsDNA break intermediate. Existing data are consistent, however, with the possibility that HMCES can cross-link to abasic sites in switch regions and suppress deletions during CSR. Similarly, we cannot rule out the possibility that HMCES sometimes facilitates end joining of dsDNA breaks during SHM.The SRAP domain is evolutionarily ancient and traces its association with DNA repair back to prokaryotes (Aravind et al. 2013). Our findings demonstrate that HMCES, which contains the only known SRAP domain in mammals, has been co-opted for a specific purpose in SHM: that of reducing the frequency of deletions. Elimination of HMCES has no consistent effect on any other parameter of SHM that we have examined. Deletions are selected against and, as suggested by our findings, might be detrimental to the humoral immune response if too frequent. However, they can make important contributions to novel antibody specificities such as broadly neutralizing antibodies against HIV-1 (Kepler et al. 2014), suggesting that their complete suppression might not be advantageous. It will now be important to determine whether HMCES plays a role in protecting non-Ig portions of the genome from AID-initiated deletions and in suppressing other types of instability such as chromosomal translocations that are associated with the action of AID (Casellas et al. 2016).
Materials and methods
Mice
Hmces-deficient mice were generated as previously described (Kweon et al. 2017; Shukla et al. 2020) and maintained on a C57BL/6J genetic background for at least eight generations. All mice used in the studies here were 8–21 wk of age and were housed in the animal facility at the La Jolla Institute for Immunology. Male mice were used in the studies. All procedures were performed according to protocols approved by the Institutional Animal Care and Use Committee of the La Jolla Institute for Immunology.
Cell culture
Ramos cells and cell lines derived from Ramos were grown in RPMI-1640 (Gibco 11875) supplemented with 10% FBS (Gemini) and 0.5 mg/mL penicillin–streptomycin–glutamine (Gibco 10378016) at 37°C and 5% CO2. 293T cells were grown at 37°C and 5% CO2 in DMEM (Gibco 10566) supplemented in the same fashion as Ramos cell media.
Plasmids and antibodies
px458 and lenti-SpCas9-hygro were obtained from AddGene (48138 and 104995). px458-mcherry was generated by replacing GFP in px458 with mCherry. pLW44 was generated by removing Cas9 from px458-mCherry, resulting in a 3.9-kb plasmid expressing a single guide RNA (sgRNA) and mCherry. pLW44 could be delivered into ∼20% of RASH-1C cells, compared with ∼1% when a larger plasmid (px458-mCherry, 9.3 kb) expressing Cas9, sgRNA, and mCherry was used (data not shown), allowing gene targeting with high efficiency in RASH-1C cells. Full-length HMCES was amplified from pLV-EF1a-IRES GFP-human HMCES FL (Shukla et al. 2020) and cloned into retroviral expression vector pMSCV (Papavasiliou and Schatz 2002). Point mutations and truncations of HMCES were introduced using In-Fusion cloning according to the manufacturer's protocol. The GFP7-E vector was generated by replacing the CMV promoter of GFP7 (Senigl et al. 2019) with EF1a promoter. cDNA encoding human UNG2 was amplified by PCR from Ramos cDNA and cloned into pMSCV.The following primary antibodies were used for Western blotting: anti-AID monoclonal antibody (Thermo Fisher 39-2500), anti-HMCES (Sigma HPA044968), antitubulin (Sigma T5168), anti-MSH2 (D24B5; Cell Signaling Technology 2017), anti-MSH6 (P150; Cell Signaling Technology 3995), anti-EXO1 (Proteintech 16253-1-AP), and anti-UNG (Novus Biologicals NBP1-49985).
Lentiviral and retroviral transduction
293T cells in six-well plates were grown to 50%–80% confluence and transfected with 1 μg of lentiviral plasmid, 0.6 μg of psPAX2 (AddGene 12260), and 0.4 μg of pMD2.G (AddGene 12259) or 1 μg of retroviral plasmid and 1 μg of packaging plasmid pkat2 with JetPrime reagent (Polyplus 114-07) according to the manufacturer's protocol. At 48 h after transfection, lentivirus or retroviral-containing media were collected and filtered through a 0.45-μm filter before being used to infect cells. Ramos cells were transduced as reported previously (Dinesh et al. 2020).
Generation of cell lines
Generation of RASH-1 was performed according to the method described by Senigl et al. (2019) with slight modifications. One million AID7.3in cells (Dinesh et al. 2020) were infected with virus expressing GFP7-E vector at MOI < 0.01. Two days after infection, blasticidin (final concentration 5 μg/mL) was added. Two weeks later, GFP-positive cells were sorted in a single-cell sort mode into 96-well plates to obtain single-cell clones. Clones were cultured for 18 d in 96-well plates and then 40 µL of cells was transferred to a new plate and treated with 200 ng/mL dox for 6 d, followed by GFP loss analysis. Clones that yielded a high percentage of GFP loss were analyzed for their GFP7-E integration site using splinkerette-PCR as described by Senigl et al. (2019).Generation of RASH-2 was performed according to the protocol described by Koch et al. (2018) with modifications. px458-mcherry expressing an sgRNA (ATAACACCAACCATACACCC) targeting upstream of the IGH variable region and donor plasmid containing EF1a promoter-HTS7-T2A-GFP (see Fig. 1A) flanked by ∼1000-bp homolog arms of the rearranged IGH variable region were delivered into AID7.3in cells by electroporation using Amaxa Nucleofector II (Lonza) with program O-006 according to the manufacturer's instructions. Homemade 1M buffer was used for electroporation according to Parreno et al. (2016). One day later, mCherry-positive cells were sorted and then expanded for 12 d, and GFP-positive cells were sorted into a 96-well plate to obtain single-cell clones. Clones were cultured for 15 d in a 96-well plate and then 50 µL of cells was transferred to a new plate and treated with 200 ng/mL dox for 6 d, followed by GFP loss analysis. Clones that yielded a high percentage of GFP loss were selected and validated by Sanger sequencing of the PCR product generated using primers (forward: ATGGTCCTGGTGGAGTTCGTGAC, and reverse: GCCGCATCGGTGATTCGGAACAGAAT) surrounding the target region.RASH-1, RASH-2, and A23 (Dinesh et al. 2020) were infected with lentivirus expressing spCas9-hygro. Two days after infection, 0.5 mg/mL hygromycin B was added. Two weeks later, single cells were sorted into a 96-well plate, and clones expressing Cas9 were identified by Western blot. RASH-1 clones, RASH-2 clones, and A23 clones constitutively expressing Cas9 were named RASH-1C, RASH-2C, and A23-Cas9, respectively.
CRISPR/Cas9 editing in Ramos and Ramos derivative lines
Guide RNAs targeting human AICDA, HMCES, UNG, APEX1, APEX2, MSH2, MSH6, and EXO1 were designed by the Broad Institute's sgRNA designer (https://portals.broadinstitute.org/gpp/public/analysis-tools/sgrna-design). Guide RNA sequences were as follows: sgAICDA (ACAGCCTCTTGATGAACCGG), sgHMCES-1 (GCAGTCGAGACAGAAGCACT), sgHMCES-2 (TGCGCCTACCAGGATCGGCG), sgUNG (GCGGCCCGCAACGTGCCCGT), sgAPEX1 (TAACGGGAATGCCGAAGCGT), sgAPEX2 (TAACCCTCAACGATAGCCAG), sgMSH2 (ACAGTGCGCCTTTTCGACCG), sgMSH6 (CCTGGCTAACTATAGTCGAG), and sgEXO1 (TCAGGGGGTAGATTGCCTCG).Guide RNAs were cloned into pLW44 or px458 and delivered into cells by electroporation. One to three days later, for RASH-Cas9 cells, GFP and mCherry double-positive cells were sorted either into 15-mL tubes as bulk KO cells or sorted into 96-well plates for single-KO clone selection. For Ramos and A23-Cas9 cells, GFP-positive cells were sorted either into 15-mL tubes as bulk KO cells or sorted into 96-well plates for single-knockout clone selection.
IgM and GFP loss assay
For the IgM loss assay in RASH cells, ∼1.0 × 106 cells treated with or without 200 ng/mL dox were stained with APC mouse antihuman IgM (BD 551062) diluted 1:100 in FACS buffer (1× PBS with 2% FBS) and incubated for 20 min at room temperature. The starting point for the IgM loss assay in A23 cells and Ramos was a sorted IgM+ population. Cells were cultured and then harvested at different time points and were again stained, and percentages of IgM+ and IgM− cells were measured in accordance with published guidelines on flow cytometry (Cossarizza et al. 2021). For the GFP loss assay, RASH cells and RASH-Cas9 cells treated with or without 200 ng/mL dox for 2–6 d as indicated were assayed for the percentage of GFP+ and GFP− cells in accordance with published guidelines on flow cytometry (Cossarizza et al. 2021).
High-throughput sequencing
Analyses of mutations, deletions, and insertions were performed using a protocol modified from Illumina protocol “16S metagenomic sequencing library preparation.” Genomic DNA was extracted using DNeasy blood and tissue kit (Qiagen) according to the manufacturer's instructions. HTS7 was amplified using the oligos (forward) TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGTTCCAAAGTAGACCCAGCCTTCTAA and (reverse) GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGATTCTCCTCCACATCACCACAG. IGH VDJ was amplified using the oligos (forward) TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGAGGAATGCGGATATGAAGATATGAG and (reverse) GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGAGTAGCAGAGAACAGAGGCCCTAGA. IGLV2-14J2 was amplified using the oligos (forward) TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCACTGACTCACTGGCATGTATTTCT and (reverse) GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGCTGACCACAAGTTGAGACAAGATA.PCR reactions were performed using NEBNext high-fidelity 2× PCR master mix (New England Biolabs) under the following conditions: 3 min at 98°C, followed by 25 cycles of 10 sec at 98°C, 30 sec at 60°C, 30 sec at 72°C, and finally 2 min at 72°C in a 25-µL reaction.The JH4 intron was amplified using nested PCR. The first amplification reaction was performed under the following conditions: 3 min at 98°C, followed by 13 cycles of 10 sec at 98°C,30 sec at 66°C, 30 sec at 72°C, and finally 2 min at 72°C using the oligos (forward) GGAATTCGCCTGACATCTGAGGACTCTGC and (reverse) CTGGACTTTCGGTTTGGTG in a 50-µL reaction. Five microliters was used as a template for the second PCR, which was performed under the following conditions: 3 min at 98°C, followed by 18 cycles of 10 sec at 98°C, 30 sec at 56°C, 30 sec at 72°C, and finally 2 min at 72°C, using the oligos (forward) TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGGGTCAAGGAACCTCAGTCA and (reverse) GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGTCTCTAGACAGCAACTAC.PCR products were multiplexed using Nextera XT index kit (Illumina). Sequencing on the Illumina Miseq platform (two 300-bp paired end sequencing) using Miseq reagent kit V3 was performed at the Yale Center for Genome Analysis.
Bioinformatic analysis of DNA sequence data
Sequence reads were analyzed using a custom pipeline. Paired-end reads were joined using the fastq-join tool requiring that the reads from both sides have at least a 10-bp overlap with mismatch rate ≤8%. Joined reads were aligned to their respective reference sequence using Burrows-Wheeler aligner (BWA) using default parameters. Python utility Pysamstats was then used to calculate statistics against genome positions based on sequence alignments from the aligned BAM file. Position-wise statistics were generated for reads with mapping quality of at least 60 and minimum base quality of 30.For each sample, the aligned file (binary encoded BAM format) was converted to TSV using Java-based utility sam2tsv. A custom AWK script was used to generate the per-read-based summary files for each observed variation (deletions, insertions, and point mutations). For insertions and point mutations, only nucleotide bases with a phred quality score ≥30 were considered. For deletion events, only reads with average phred score ≥30 were considered, since the bases in question were deleted.A custom R script was used to identify AID hotspots from the position-based stats files generated by Pysamstats.
ELISA
For NP-specific enzyme-linked immunosorbent assay (ELISA), polystyrene plates were coated overnight with NP2-BSA (conjugation ratio of 2 for high affinity) or NP27-BSA (conjugation ratio of 27 for low affinity) diluted to 10 µg/mL with PBS (Biosearch). The following day, 200 µL of blocking solution (1% BSA, 0.05% sodium azide in PBS) was added, and the plates were incubated for 2 h at 37°C in an incubator, or overnight at 4°C. After blocking, 50 µL of threefold serially diluted serum (starting dilution 1:20) in dilution buffer (0.1% BSA, 0.05% sodium azide in PBS) was added to each well, and plates were incubated for 2 h at 37°C. Plates were then washed three times with deionized water, 50 µL of the alkaline phosphatase-conjugated antimouse IgM secondary antibody (Bethyl antibody A90-101AP) diluted at 1:2500 was added, and plates were incubated for 2 h at 37°C. Plates were then washed three times with deionized water and 100 µL of p-nitrophenyl phosphate, disodium salt (PNPP from Sigma) substrate was added, and the reaction was allowed to develop for ∼20 min at room temperature before the absorbance was read at 405 nm (M2 spectramax spectrophotometer, Spectral Labs).
Statistical analysis
Data were subjected to statistical analysis and plotted using GraphPad Prism. Single comparisons were performed using two-tailed Student's t-test, whereas multiple comparisons were assessed by one-way ANOVA with Dunnett's multiple comparison test. For all analyses, P-value < 0.05 (*), P-value < 0.01 (**), and P-value < 0.001 (***). Results are reported as mean ± SD as indicated in the figure legends.
Data availability
The mutation sequencing data sets generated during this study are available under the NCBI Bioproject PRJNA820976.
Authors: Andrea Cossarizza; Hyun-Dong Chang; Andreas Radbruch; Sergio Abrignani; Richard Addo; Mübeccel Akdis; Immanuel Andrä; Francesco Andreata; Francesco Annunziato; Eduardo Arranz; Petra Bacher; Sudipto Bari; Vincenzo Barnaba; Joana Barros-Martins; Dirk Baumjohann; Cristian G Beccaria; David Bernardo; Dominic A Boardman; Jessica Borger; Chotima Böttcher; Leonie Brockmann; Marie Burns; Dirk H Busch; Garth Cameron; Ilenia Cammarata; Antonino Cassotta; Yinshui Chang; Fernando Gabriel Chirdo; Eleni Christakou; Luka Čičin-Šain; Laura Cook; Alexandra J Corbett; Rebecca Cornelis; Lorenzo Cosmi; Martin S Davey; Sara De Biasi; Gabriele De Simone; Genny Del Zotto; Michael Delacher; Francesca Di Rosa; James Di Santo; Andreas Diefenbach; Jun Dong; Thomas Dörner; Regine J Dress; Charles-Antoine Dutertre; Sidonia B G Eckle; Pascale Eede; Maximilien Evrard; Christine S Falk; Markus Feuerer; Simon Fillatreau; Aida Fiz-Lopez; Marie Follo; Gemma A Foulds; Julia Fröbel; Nicola Gagliani; Giovanni Galletti; Anastasia Gangaev; Natalio Garbi; José Antonio Garrote; Jens Geginat; Nicholas A Gherardin; Lara Gibellini; Florent Ginhoux; Dale I Godfrey; Paola Gruarin; Claudia Haftmann; Leo Hansmann; Christopher M Harpur; Adrian C Hayday; Guido Heine; Daniela Carolina Hernández; Martin Herrmann; Oliver Hoelsken; Qing Huang; Samuel Huber; Johanna E Huber; Jochen Huehn; Michael Hundemer; William Y K Hwang; Matteo Iannacone; Sabine M Ivison; Hans-Martin Jäck; Peter K Jani; Baerbel Keller; Nina Kessler; Steven Ketelaars; Laura Knop; Jasmin Knopf; Hui-Fern Koay; Katja Kobow; Katharina Kriegsmann; H Kristyanto; Andreas Krueger; Jenny F Kuehne; Heike Kunze-Schumacher; Pia Kvistborg; Immanuel Kwok; Daniela Latorre; Daniel Lenz; Megan K Levings; Andreia C Lino; Francesco Liotta; Heather M Long; Enrico Lugli; Katherine N MacDonald; Laura Maggi; Mala K Maini; Florian Mair; Calin Manta; Rudolf Armin Manz; Mir-Farzin Mashreghi; Alessio Mazzoni; James McCluskey; Henrik E Mei; Fritz Melchers; Susanne Melzer; Dirk Mielenz; Leticia Monin; Lorenzo Moretta; Gabriele Multhoff; Luis Enrique Muñoz; Miguel Muñoz-Ruiz; Franziska Muscate; Ambra Natalini; Katrin Neumann; Lai Guan Ng; Antonia Niedobitek; Jana Niemz; Larissa Nogueira Almeida; Samuele Notarbartolo; Lennard Ostendorf; Laura J Pallett; Amit A Patel; Gulce Itir Percin; Giovanna Peruzzi; Marcello Pinti; A Graham Pockley; Katharina Pracht; Immo Prinz; Irma Pujol-Autonell; Nadia Pulvirenti; Linda Quatrini; Kylie M Quinn; Helena Radbruch; Hefin Rhys; Maria B Rodrigo; Chiara Romagnani; Carina Saggau; Shimon Sakaguchi; Federica Sallusto; Lieke Sanderink; Inga Sandrock; Christine Schauer; Alexander Scheffold; Hans U Scherer; Matthias Schiemann; Frank A Schildberg; Kilian Schober; Janina Schoen; Wolfgang Schuh; Thomas Schüler; Axel R Schulz; Sebastian Schulz; Julia Schulze; Sonia Simonetti; Jeeshan Singh; Katarzyna M Sitnik; Regina Stark; Sarah Starossom; Christina Stehle; Franziska Szelinski; Leonard Tan; Attila Tarnok; Julia Tornack; Timothy I M Tree; Jasper J P van Beek; Willem van de Veen; Klaas van Gisbergen; Chiara Vasco; Nikita A Verheyden; Anouk von Borstel; Kirsten A Ward-Hartstonge; Klaus Warnatz; Claudia Waskow; Annika Wiedemann; Anneke Wilharm; James Wing; Oliver Wirz; Jens Wittner; Jennie H M Yang; Juhao Yang Journal: Eur J Immunol Date: 2021-12-07 Impact factor: 5.532
Authors: Leng-Siew Yeap; Joyce K Hwang; Zhou Du; Robin M Meyers; Fei-Long Meng; Agnė Jakubauskaitė; Mengyuan Liu; Vinidhra Mani; Donna Neuberg; Thomas B Kepler; Jing H Wang; Frederick W Alt Journal: Cell Date: 2015-11-12 Impact factor: 41.582