Literature DB >> 27439715

APOBEC3 proteins can copackage and comutate HIV-1 genomes.

Belete A Desimmie1, Ryan C Burdick1, Taisuke Izumi1, Hibiki Doi1, Wei Shao2, W Gregory Alvord3, Kei Sato4, Yoshio Koyanagi5, Sara Jones6, Eleanor Wilson2, Shawn Hill2, Frank Maldarelli2, Wei-Shau Hu7, Vinay K Pathak8.   

Abstract

Although APOBEC3 cytidine deaminases A3G, A3F, A3D and A3H are packaged into virions and inhibit viral replication by inducing G-to-A hypermutation, it is not known whether they are copackaged and whether they can act additively or synergistically to inhibit HIV-1 replication. Here, we showed that APOBEC3 proteins can be copackaged by visualization of fluorescently-tagged APOBEC3 proteins using single-virion fluorescence microscopy. We further determined that viruses produced in the presence of A3G + A3F and A3G + A3H, exhibited extensive comutation of viral cDNA, as determined by the frequency of G-to-A mutations in the proviral genomes in the contexts of A3G (GG-to-AG) and A3D, A3F or A3H (GA-to-AA) edited sites. The copackaging of A3G + A3F and A3G + A3H resulted in an additive increase and a modest synergistic increase (1.8-fold) in the frequency of GA-to-AA mutations, respectively. We also identified distinct editing site trinucleotide sequence contexts for each APOBEC3 protein and used them to show that hypermutation of proviral DNAs from seven patients was induced by A3G, A3F (or A3H), A3D and A3G + A3F (or A3H). These results indicate that APOBEC3 proteins can be copackaged and can comutate the same genomes, and can cooperate to inhibit HIV replication. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27439715      PMCID: PMC5027510          DOI: 10.1093/nar/gkw653

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

During the last decade, numerous host restriction factors have been identified that inhibit the replication of HIV-1 and other viruses to varying degrees (1–4). Among the restriction factors reported thus far, human apolipoprotein B mRNA-editing enzyme, catalytic polypeptide-like 3 (APOBEC3) cytidine deaminases are among the most potent and well-characterized HIV restriction factors. The APOBEC3 superfamily consists of seven members (A3A, A3B, A3C, A3D, A3F, A3G and A3H); A3D, A3F, A3G and certain haplotypes of A3H (II, V and VII) have been shown to inhibit HIV replication (5–9). APOBEC3 proteins have specificity for single-stranded DNA and deaminate cytidines in the viral minus-strand DNA, which results in extensive G-to-A hypermutation of the viral genome during reverse transcription. In addition to the cytidine deaminase-dependent inhibition of viral replication, the APOBEC3 proteins have been shown to inhibit viral replication by inhibiting viral DNA synthesis and integration of the viral DNA into the host genome (for a recent review see Ref. (3)). The restriction activity of APOBEC3 proteins requires their incorporation into virions (7,10,11). However, lentiviruses such as HIV-1 and HIV-2 express the accessory protein viral infectivity factor (Vif), which can bind to some of the APOBEC3 proteins (A3C, A3D, A3F, A3G and A3H) and mediate their polyubiquitination and proteasomal degradation (5,7,9,11–16). When Vif is absent or defective, the APOBEC3 proteins can be packaged into the assembling nascent virions and exert extensive cytidine deamination in the minus-strand DNA of the viral genome, most often resulting in lethal hypermutation of the viral DNA. APOBEC3 genes have been shown to be induced by interferon (IFN) in macrophages, dendritic cells, resting CD4+ T cells but not in activated CD4+ T cells (17–21). A3D, A3F, A3G and A3H (haplotypes II, V and VII) have each been shown to individually inhibit HIV-1 replication, to our knowledge, there are no studies that have directly investigated the potential for different APOBEC3 proteins to copackage and to comutate the same viral genomes. A3G prefers 5′-GG editing sites and the other APOBEC3 proteins prefer 5′-GA editing sites (3); therefore, a high frequency of mutations in both GG and GA contexts in the same genome can be employed to identify copackaging of functional APOBEC3 proteins. However, a previous study analyzed nearly 100 full-length HIV genome sequences classified as ‘hypermutated’ viral genomes for co-existence of signature A3G- and A3F-induced G-to-A mutations by analyzing the GG and GA dinucleotide motifs of the edited sites and concluded that they rarely comutate the same genome (22). As comutation was rarely observed, it was concluded that A3G and A3F (or other A3F-like proteins) are not copackaged into the same virion. Alternatively, if they are copackaged, their copackaging does not result in comutation because only one of the APOBEC3 proteins hypermutates the viral genome irrespective of the presence of the other APOBEC3 protein. It was also suggested that A3G and A3F share a similar virion-incorporation mechanism and compete for packaging; however, most studies of virion incorporation have focused on A3G packaging and very few studies have examined virion incorporation of A3F or the other APOBEC3 proteins (23–27). The underlying mechanisms by which APOBEC3 proteins are packaged into HIV-1 nascent virions are not fully understood and different mechanisms have been proposed. Previously, we and others have investigated the mechanism by which A3G is packaged into virions and have concluded that interactions of A3G with viral or non-viral RNAs are essential for virion incorporation (25,28–32). Other studies have proposed that APOBEC3 proteins interact with HIV-1 nucleocapsid domain of Gag in an RNA-dependent manner and that this interaction is essential for virion incorporation (30,33,34). It was also proposed that interactions with 7SL RNA and/or other small RNAs are critical for virion incorporation (25,32,35), while other studies found that interactions with 7SL RNA are dispensable for virion incorporation (27–28,36,37). Furthermore, Apolonia et al. demonstrated that the promiscuity of A3G and A3F to non-specifically bind to a variety of RNAs ensures their effective encapsidation into HIV-1 particles (36). In addition to their high affinity to RNAs (36,38,39), APOBEC3 proteins are also known to associate with cellular ribonucleoprotein complexes in the cytosol (37,40,41), suggesting that some unidentified mechanisms may govern the specificity of their virion incorporation. APOBEC3 proteins are expressed in human CD4+ T cells (17–19,42,43) and have been shown to hypermutate HIV-1 DNA in infected cells (42,44–50); however, it is not known whether different APOBEC3 proteins can copackage into the same virions. Although not required for their antiviral action, A3G and A3F were shown to co-localize in mRNA processing (P) bodies and hetero-oligomerize through an RNA-dependent interaction (9,37,51), suggesting that they may bind the same RNA and incorporate into the same virions. Furthermore, their distinct editing site dinucleotide preferences suggest that they may not compete for targets in the genome and could induce comutation of the same genomes. In addition, in vitro biochemical studies have demonstrated that APOBEC3 proteins exhibit different mechanisms of movement on substrate nucleic acid template, which perhaps contribute to their editing site selection and specificity; such studies were done for A3G, A3F as well as A3H haplotype II and haplotype V (52,53). Here, we determined the potential of APOBEC3 proteins to copackage and their ability to comutate the same viral genomes. We used single-virion analysis to unambiguously show that A3G can efficiently copackage into the same HIV-1 particles with A3F, A3D or A3H haplotype II (hereafter referred to as A3H). We analyzed the trinucleotide sequence contexts in which A3G, A3F, A3D and A3H induced hypermutation in cell culture-based experiments as well as in proviral DNA isolated from HIV-1 infected patients. Our results suggest that A3G can be copackaged with the other APOBEC3 proteins (A3F, A3D or A3H) into the same virions and can comutate the same viral genome in single-cycle infection assays as well as during natural HIV-1 infection in vivo.

MATERIALS AND METHODS

Cell culture, transfections and production of VSV-G-pseudotyped HIV-1 containing APOBEC3 proteins

293T and CEM-SS cell lines were obtained from the American Type Culture Collection and maintained in Dulbecco's modified Eagle's medium and RPMI 1640 medium (Corning Cellgro), respectively. Both media were supplemented to contain 10% fetal calf serum (Hyclone), 100 IU/ml penicillin and 100 μg/ml streptomycin (GIBCO). All viruses were prepared by using a previously described HIV-1-based vector pHDV-EGFP (54), which expresses Gag-Pol and eGFP from the HIV-1 LTR promoter. phCMV-G plasmid, which expresses vesicular stomatitis virus glycoprotein (VSV-G), was used to pseudotype the viral particles (55). Venus yellow fluorescent protein (YFP)-tagged APOBEC3G (A3G-YFP) and/or mCherry-tagged A3F (A3F-mCH), mCherry-tagged A3D (A3D-mCH) and mCherry-tagged Hap II A3H expression plasmids were co-transfected during virus production to label the particles as described previously (37). Briefly, 4 × 106 cells 293T cells were seeded and co-transfected the next day with pHDV-EGFP (10 μg), phCMV-G (0.5 μg) and pA3G-YFP (5 μg), pA3F-mCH (5 μg), pA3D-mCH (5 μg), pA3H-mCH (5 μg) or pA3G-YFP (5 μg) and pA3F-mCH (5 μg), pA3D-mCH (5 μg) or pA3H-mCH (5 μg) using polyethylenimine as previously described (56). For visualization of virions, non-infectious Gag-cerulean fluorescent protein (Gag-CeFP)-labeled viral particles were produced using the same cocktails of plasmids described above except pHDV-EGFP was replaced with a mixture of pGag-CeFP-BglSL (5 μg) and pGag-BglSL (5 μg) (57,58). The culture supernatant containing virus was harvested at 24-h (for infection and proviral DNA hypermutation analysis) or 18-h (for single virion analysis) post-transfection, filtered through 0.45-μm filters and concentrated by ultracentrifugation through a 20% sucrose cushion at 25 000 rpm for 1.5 h at 4°C as described previously (37,58). The virus pellets were resuspended in culture medium and kept at −80°C until use.

APOBEC3 protein titration and viral infectivity assays

Human embryonic kidney 293T cells were seeded at 8 × 105 cells per well in 6-well plates. The transfection procedure was the same as above and the plasmid mixes contain 1 μg of the HIV-1 vector pHDV-EGFP and 0.2 μg of VSV-G plasmid phCMV-G. To this mix we added A3G-YFP, A3F-mCH, A3D-mCH and/or A3H-mCH. All APOBEC3 proteins were titrated serially by 2-fold (from 15.6 to 500 ng). For some combination experiments, we kept A3G-YFP at 62.5 ng and added different amounts of three point titration of the other APOBEC3 proteins. In all cases, to maintain equal amounts of DNA, pcDNA3.1noMCS was used when needed. After 48 h, the virus-containing supernatant was clarified through 0.45-μm filters and virus preparations were kept at −80°C until use. To determine the infectivity of viruses produced in the presence of the different APOBEC3 proteins, TZM-bl cells were seeded in 96-well plates (4 × 103 cells per well) a day prior to infection. Next day, we infected TZM-bl cells in triplicates with viruses normalized for p24 (XpressBio) and 48-h later luciferase activity was determined using britelite™ plus (PerkinElmer) following the instructions of the manufacturer and measured using a LUMIstar Galaxy luminometer or 1450 MicroBeta JET (PerkinElmer).

Western blot analysis

Virus samples and cell lysates were analyzed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis and immunoblot analysis. Virus samples input were normalized for p24 by ELISA. While A3G-YFP was detected using anti-GFP (Sigma), A3F-mCH, A3D-mCH and A3H-mCH were detected using anti-mCherry antibodies (Abcam). The precursor p55 Gag and p24 CA were detected using a mouse anti-HIV-1 p24 CA monoclonal antibody (kindly provided by Michael H. Malim, AIDS Research and Reference Reagent Program, Division of AIDS, NIAID, NIH). Tubulin, a loading control for cell lysates, was detected with mouse anti-tubulin antibody (Sigma). The primary antibodies were probed using an IRDye 800CW- or IRDye 680-labeled secondary antibodies (Li-COR). The signal intensities of the protein bands were calculated using the Odyssey system (Li-COR).

Confocal microscopy

Human embryonic kidney 293T cells (20 000 cells per well) were seeded onto ibiTreat 8-well μ-slides (Ibidi) and co-transfected with A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH expression plasmids using the Lipofectamine™2000 transfection reagent (Invitrogen). The cells were fixed 18-h post-transfection with 4.0% paraformaldehyde and cellular DNA was counterstained with DAPI (Sigma). The transfected cells were visualized using a LSM780 laser-scanning confocal microscope (Zeiss).

Single-virion analysis using fluorescence microscopy

Single-virion analysis was performed as described previously (37,57,58) and used to directly visualize virion incorporation of A3G-YFP, A3F-mCH, A3D-mCH or A3H-mCH (37,58). Concentrated virus supernatant (0.4 μl) was briefly mixed with 200 μl phosphate buffered saline and then centrifuged onto ibiTreat μ-slides (Ibidi) at 1200 × g for 1 h. Images of virus-like particles (VLPs) were acquired using epifluorescence microscopy (Nikon). CeFP was used to identify VLPs containing Gag, and YFP and mCH were used to detect virion incorporated A3G-YFP and A3F-mCH, A3D-mCH or A3H-mCH proteins, respectively (37,57,58). The number, positions and integrated intensities of the spots were quantified for each channel using Localize software as previously described (37,57–59). The positions of the spots were also used to determine colocalization; spots were considered colocalized if the centers of the spots were within three pixels of each other. Merged and pseudocolored images were generated using NIS Elements (Nikon). A3G, A3F, A3D and A3H virion incorporation efficiency was calculated by determining the percentage of CeFP+ particles that contained the YFP and/or mCH signals, respectively, by using an in-house MATLAB program (MathWorks).

Proviral DNA sequencing and hypermutation analysis

CEM-SS cells (3 × 105 cells/well in 24-well plates) were infected with 30 ng of p24 CA equivalent HDV-EGFP virus preparations, as determined by ELISA. Cells were collected at 24-h post-infection and genomic DNA was extracted using QIAamp Blood DNA Mini Kit following the manufacturer's instruction (QIAGEN). Approximately 2-kb of the pol (NL4-3 2258–4343 bp) region encompassing the RT reading frame was amplified using a forward primer HIV-07 ([NL4-3 #2258-2280]: 5′-GATCACTCTTTGGCAGCGACCCC-3′) and a reverse primer HIV-11b [NL4-3 #4343-4317]: 5′-GGCTACTATTTC TTTTGCTACTACAGG-3′) (see the schematic in Figure 3A). The polymerase chain reaction (PCR) primers were designed to anneal in regions with the fewest possible putative A3F and A3G deamination sites to avoid potential primer mismatch due to A3-induced mutagenesis. The PCR products were purified using QIAquick PCR Purification Kit (QIAGEN) and cloned into pCR2.1 vector with TA Cloning Kits (Life technologies). The insert was then sequenced using the following overlapping primers: HIV-08N-for (NL4-3 #2553-2590): 5′-ATTAGTCCTATTGAGACTGTACCAGTAAAATTAAAGCC-3′; HIV-08-rev (#3043-3018): 5′-GTCATGCTACACTGGAATATTGCTGG-3′; HIV-10-for (#3296-3321): 5′-GGACAGCTGGACTGTCAATGACATAC-3′; and HIV-11-rev (#4188-4162): 5′-GTTCATTTCCTCCAATTCCTTTGTGTG-3′.
Figure 3.

APOBEC3 proteins can comutate the same HIV genome. (A) Schematic representation of CEM-SS infection with the indicated HDV-EGFP viruses made in the presence of A3G-YFP only, A3F-mCH only, A3D-mCH only, A3H-mCH only, A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH or A3G-YFP + A3H-mCH. Genomic DNA was extracted from the infected cells 24-h post-infection (p.i.), and a 1320-bp fragment encoding reverse transcriptase (RT) was PCR-amplified using the indicated forward and reverse primers and sequenced using four overlapping primer sets. Numbers of GG-to-AG (B) and GA-to-AA (C) mutations per clone are shown for A3G (dark gray bars), A3D, A3F and A3H (light gray bars|), for the combination experiments, the calculation was done using dual-hypermutated sequences only (observed frequencies [O]; black bars). The A3G-YFP + A3D-mCH conditions were not determined (n.d.) because of the limited numbers of dual-hypermutated sequences. The expected frequencies (E; white bars) were calculated by adding the mutation frequencies/clone of each APOBEC3 protein from Table 1. The averages ± S.D. from three to six independent experiments are shown. Welch's unpaired t-test was performed to determine statistical significance (*P < 0.005; ns, not significant). (D) The distribution of GA-to-AA mutations induced by A3G-YFP only (dashed line), A3F-mCH only (black line) and A3G-YFP + A3F-mCH (light gray line), are shown relative to their locations in the 1320-nt RT region. (E) The distribution of GA-to-AA mutations induced by A3G-YFP only (dashed line), A3H-mCH only (black line) and A3G-YFP + A3H-mCH (light gray line). Each point represents the average frequency of mutations/site after combining the sequences from three to six independent experiments (1.0 = 100%). Black vertical tick lines on the X-axis indicate positions of 110 GA target sites (D and E). Data were analyzed with generalized linear model and loglinear modeling techniques. See text for a detailed description of the non-homogeneity statistical analysis and group comparisons (**P < 0.0001).

Study participants

HIV infected Individuals (N = 7) were enrolled in studies at the NIAID/CCMD clinic at the NIH Clinical Center in Bethesda MD and were sampled in the period 1994–2004. All participants were male with median age 40.8 years undergoing antiretroviral therapy with persistent viremia (see Supplementary Table S1 for demographic information); 4/7 had diagnosis of AIDS. Plasma and peripheral blood mononuclear cells (PBMCs) were obtained during protocol visits by phlebotomy.

Ethics statement

All participants in this study were enrolled in clinical protocols approved by the NIAID Institutional Review Board (FWA00005897) administered at the NIH Clinical Center in Bethesda, MD, USA. Individuals underwent an informed consent process and provided written consent for participation in the research described here.

APOBEC3H genotyping and haplotype determination

Genomic DNA was extracted from PBMCs and A3H exons were amplified with PCR Supermix High Fidelity (Invitrogen) using primers and PCR genotyping conditions for previously identified A3H amino acid polymorphisms (ΔN15, rs139292; R18L, rs139293; G105R, rs139297; D121K, rs139299 and rs139298; E178D, rs139302) described by OhAinle et al. (60). PCR products were gel eluted and directly sequenced as well as cloned into TA cloning vector (Invitrogen), in which case multiple clones (4–8 clones) were sequenced.

Viral RNA and proviral DNA sequence analysis

Plasma viral RNA and PBMC proviral DNA were analyzed using single-genome sequencing (SGS) covering ∼1100-bp pro-pol region as previously described (61,62). For each patient, alignments were constructed for both the plasma derived and plasma derived and PBMC DNA derived samples (MEGA 5.0). A patient-specific consensus was obtained from the plasma alignment, and used as a standard sequence to detect hypermutation in HIV from PBMC derived DNA. Each PBMC derived HIV sequence was analyzed for the presence of GG and GA motifs present in the plasma consensus. Patient cellular DNA sequences were considered hypermutated if the number of mutations was significantly above genetic variation in the plasma viral RNA population for each patient. A sequence was considered to be hypermutated in the GG context if it contained at least 5–7 mutations compared to the consensus plasma RNA sequence (average 0–1 mutations/clone). A sequence was considered hypermutated in the GA context if it contained at least 7–10 mutations compared to the consensus plasma RNA sequence (average 1–3 mutations/clone).

Statistical analysis

Data were analyzed with paired and Welch's unpaired t-tests, generalized linear models and loglinear modeling techniques. The numbers of G-to-A mutations were tested for homogeneity (independence), and were further partitioned (as necessary) into orthogonal components such that independent inferences could be drawn for each component involved in the partitioning (63). For some analyses, post hoc Fisher's Exact tests were performed on 2 × 2 subtables that were extracted from 4 × 2 tables and the P-values for these tests were extracted. The P-values for these post hoc tests were adjusted using the Bonferroni correction for multiple comparisons. Calculations were performed with the R Statistical Language and Environment (R Core Team) (https://www.r-project.org) (64). Probability values <0.05 (P < 0.05) were considered significant.

RESULTS

Virion incorporation of APOBEC3 proteins into Vif-deficient HIV-1 particles

To determine whether APOBEC3 proteins can be copackaged into the same virus particles, we constructed expression vectors in which A3G was tagged at its C-terminus with venus YFP (A3G-YFP); we also constructed vectors in which A3F, A3D and A3H were tagged at their C-termini with monomeric cherry fluorescent protein (A3F-mCH, A3D-mCH and A3H-mCH, respectively) (Figure 1A). A3G-YFP, A3F-mCH, A3D-mCH or A3H-mCH were co-expressed alone or in combination in 293T cells with HDV-EGFP, an HIV-1 vector that expresses Gag-Pol and is defective in expression of Vif and Env (Δvif Δenv). Virus particles were harvested 24-h later, and the virus producer cell lysates and viral lysates were analyzed by western blotting analysis (Figure 1B and C). All APOBEC3 proteins were readily detected in cells transfected with A3G-YFP only, A3F-mCH only, A3D-mCH only, A3H-mCH only, A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH or A3G-YFP + A3H-mCH (Figure 1B). Quantitation of the western blots indicated that similar levels of A3G were expressed in cells transfected with A3G-YFP alone and in combination with the other APOBEC3 proteins, and similar levels of A3F, A3D and A3H were expressed in cells transfected only with A3F-mCH, A3D-mCH or A3H-mCH, or in combination with A3G-YFP. Moreover, all APOBEC3 proteins exhibited mostly diffuse distribution in the cytosol of the producer cells when expressed alone or in combination (Supplementary Figure S1). As previously observed by us and others, we also observed accumulation of all APOBEC3 proteins in cytosolic puncta that were previously shown to be RNA processing bodies (P bodies (37,51,65).
Figure 1.

Analysis of copackaging of multiple APOBEC3 proteins into Δvif HIV-1 particles. (A) Schematics of plasmids used to produce viral particles and western blot analysis of purified viral particles and producer cells lysates. (B) Representative immunoblots and quantitation of the expression levels of A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH proteins in producer cell lysates were analyzed by western blotting; p55 Gag and tubulin bands were shown as control. The bar graphs indicate the average expression of A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH in the combination experiments relative to A3G-YFP only, A3F-mCH only, A3D-mCH only or A3H-mCH only, respectively, which were set to 100%. Quantitation was done after normalization for tubulin measured using the Odyssey system. (C) Representative immunoblots and quantitation of the A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH proteins in virus lysates. The level of A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH proteins after normalization for p24 CA was determined using the Odyssey system relative to A3G-YFP only, A3F-mCH only, A3D-mCH only or A3H-mCH only, respectively. (D) Analysis of the relative packaging efficiency of A3D-mCH and A3H-mCH compared to A3F-mCH (set to 100%). (E) Relative infectivity of viruses produced in the presence of the four human APOBEC3 proteins either individually or in combination with A3G compared to the no APOBEC3 condition (set to 100%). Single-cycle HDV-EGFP virus was produced in the presence or absence of human APOBEC3. The A3G concentration was determined after titration and 62.5 ng of plasmid, which led to ∼80% drop in infectivity, was used in these experiments. The expected inhibition of infectivity (white bars) was calculated by multiplying the inhibition by the individual protein and then divided by 100. All quantitative data are from three independent experiments; n.s., no significant differences.

Analysis of copackaging of multiple APOBEC3 proteins into Δvif HIV-1 particles. (A) Schematics of plasmids used to produce viral particles and western blot analysis of purified viral particles and producer cells lysates. (B) Representative immunoblots and quantitation of the expression levels of A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH proteins in producer cell lysates were analyzed by western blotting; p55 Gag and tubulin bands were shown as control. The bar graphs indicate the average expression of A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH in the combination experiments relative to A3G-YFP only, A3F-mCH only, A3D-mCH only or A3H-mCH only, respectively, which were set to 100%. Quantitation was done after normalization for tubulin measured using the Odyssey system. (C) Representative immunoblots and quantitation of the A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH proteins in virus lysates. The level of A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH proteins after normalization for p24 CA was determined using the Odyssey system relative to A3G-YFP only, A3F-mCH only, A3D-mCH only or A3H-mCH only, respectively. (D) Analysis of the relative packaging efficiency of A3D-mCH and A3H-mCH compared to A3F-mCH (set to 100%). (E) Relative infectivity of viruses produced in the presence of the four human APOBEC3 proteins either individually or in combination with A3G compared to the no APOBEC3 condition (set to 100%). Single-cycle HDV-EGFP virus was produced in the presence or absence of human APOBEC3. The A3G concentration was determined after titration and 62.5 ng of plasmid, which led to ∼80% drop in infectivity, was used in these experiments. The expected inhibition of infectivity (white bars) was calculated by multiplying the inhibition by the individual protein and then divided by 100. All quantitative data are from three independent experiments; n.s., no significant differences. We next determined the relative amounts of A3G, A3F, A3D and A3H packaged proteins into purified HIV-1 particles based on the respective APOBEC3 proteins incorporated into virions when expressed alone (Figure 1C); the amounts of viruses used for western blots were normalized by p24 CA amounts using ELISA. Virion incorporation of A3G-YFP was similar in cells transfected with A3G-YFP only or in combination with A3F-mCH, A3D-mCH and A3H-mCH, respectively. The relative virion incorporation of A3F-mCH, A3D-mCH and A3H-mCH was also similar in viruses produced in cells transfected with or without A3G-YFP. Of note, the packaging efficiencies of A3D-mCH only and A3H-mCH only were 5- and 10-fold less, respectively, compared to A3F-mCH only; packaging efficiency was calculated as normalized band intensity of virion-associated protein relative to normalized cellular expression of the respective protein (Figure 1D). Taken together, these results suggest that co-expressed A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH and A3G-YFP + A3H-mCH can be packaged into HIV-1 particles independent of each other. We next assessed the infectivity of viruses produced in the presence of the four APOBEC3 proteins expressed alone or A3G-YFP co-expressed with one of the other APOBEC3 proteins. First, we carefully evaluated the antiviral activity of each APOBEC3 protein by titration of the plasmids transfected in virus producer cells. VSV-G pseudotyped vif-deficient HDV-EGFP single-round virus stocks were produced in the presence of escalating doses of each of the four human APOBEC3 proteins with the same fluorescent protein tags as described above and the infectivity of the viruses was determined in TZM-bl cells using p24 CA-normalized virus input. Inhibition of infectivity by 60–80% compared to the no APOBEC3 control was used to determine the amount of plasmids used in the APOBEC3 proteins combination experiments. Figure 1E shows the relative infectivity of viruses produced in the presence of each APOBEC3 protein alone or combination of A3G-YFP and one of the other three APOBEC3 proteins. We observed a significant reduction in the infectivity of virus produced in the presence of A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH or A3G-YFP + A3H-mCH compared to the respective individual proteins. Co-expression of A3G with one of the other APOBEC3 proteins in the virus producer cells resulted in additive antiviral activities, since the observed antiviral activities were not significantly different from the expected combined antiviral activities of the individual APOBEC3 proteins (the product of their individual antiviral activities; P > 0.05, t-test for comparison of observed versus expected antiviral activities).

Single-virion analysis of the copackaging efficiency of APOBEC3 proteins

The western blotting results shown in Figure 1, or other biochemical analyses, cannot determine whether A3G-YFP and A3F-mCH, A3D-mCH or A3H-mCH are copackaged into the same virions or into different populations of virus particles. To determine whether A3G is copackaged into the same virions with the other APOBEC3 proteins, we exploited the recently developed single-virion fluorescence microscopy imaging technique (57,58). We produced virus by co-transfecting plasmids expressing wild-type HIV-Gag, HIV-Gag-CeFP and A3G-YFP and/or A3F-mCH, A3D-mCH or A3H-mCH expression plasmids into 293T cells (Figure 2A). Transfection of a mixture of HIV-Gag-CeFP and unlabeled HIV-Gag expression plasmids (1:1) produces non-infectious VLPs. Previous studies have shown that >90% of the CeFP+ signals colocalized with virion RNA signals and are thus bona fide viral particles (57). Although some A3G-YFP and A3F-mCH proteins are released from transfected cells in vesicles such as exosomes, we only analyzed CeFP+ particles that contained YFP and/or mCH signals, and the APOBEC3 proteins in exosomes did not have any effect on our analysis.
Figure 2.

Determination of A3G and A3F, A3D or A3H copackaging using single-virion analysis. (A) Protocol for production of virus particles and single-virion analysis for A3G-YFP, and A3F-mCH, A3D-mCH or A3H-mCH copackaging. Plasmids expressing Gag and Gag-CeFP (1:1 ratio) were co-transfected with plasmids expressing A3G-YFP and/or A3F-, A3D- or A3H-mCH into 293T cells, virus particles were harvested 24 h after transfection and single-virion analysis was performed using fluorescence microscopy. (B) Representative images of virus particles labeled with CeFP (Gag), YFP (A3G) and/or mCH (A3F) are shown. The top row of panels shows an example of a particle labeled with Gag-CeFP that is also labeled with A3G-YFP (green arrows). The middle row of panels shows an example of a particle labeled with Gag-CeFP and A3F-mCH (red arrows). The bottom row of the panels shows an example of a particle that is labeled with A3G-YFP and A3F-mCH (yellow arrows) (scale bar, 2 μm). (C) Comparison of the proportion of Gag-CeFP+ particles that are A3G-YFP+ (green bars) and A3F-, A3D- or A3H-mCH+ (red bars) in cells transfected with A3G-YFP, A3F-mCH, A3D-mCH or A3H-mCH alone or co-transfected with A3G-YFP + A3F-, A3D- or A3H-mCH. The proportion of YFP+ and/or mCH+ particles was determined in three independent experiments (∼9000–24 000 particles per experimental group). (D) Relative integrated fluorescence intensity of A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH in dual-labeled particles relative to A3G-YFP only or A3F-mCH only single-labeled particles. Error bars represent SD from three independent experiments. Welch's unpaired t-test was performed to determine statistical significance (*P < 0.05).

Determination of A3G and A3F, A3D or A3H copackaging using single-virion analysis. (A) Protocol for production of virus particles and single-virion analysis for A3G-YFP, and A3F-mCH, A3D-mCH or A3H-mCH copackaging. Plasmids expressing Gag and Gag-CeFP (1:1 ratio) were co-transfected with plasmids expressing A3G-YFP and/or A3F-, A3D- or A3H-mCH into 293T cells, virus particles were harvested 24 h after transfection and single-virion analysis was performed using fluorescence microscopy. (B) Representative images of virus particles labeled with CeFP (Gag), YFP (A3G) and/or mCH (A3F) are shown. The top row of panels shows an example of a particle labeled with Gag-CeFP that is also labeled with A3G-YFP (green arrows). The middle row of panels shows an example of a particle labeled with Gag-CeFP and A3F-mCH (red arrows). The bottom row of the panels shows an example of a particle that is labeled with A3G-YFP and A3F-mCH (yellow arrows) (scale bar, 2 μm). (C) Comparison of the proportion of Gag-CeFP+ particles that are A3G-YFP+ (green bars) and A3F-, A3D- or A3H-mCH+ (red bars) in cells transfected with A3G-YFP, A3F-mCH, A3D-mCH or A3H-mCH alone or co-transfected with A3G-YFP + A3F-, A3D- or A3H-mCH. The proportion of YFP+ and/or mCH+ particles was determined in three independent experiments (∼9000–24 000 particles per experimental group). (D) Relative integrated fluorescence intensity of A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH in dual-labeled particles relative to A3G-YFP only or A3F-mCH only single-labeled particles. Error bars represent SD from three independent experiments. Welch's unpaired t-test was performed to determine statistical significance (*P < 0.05). An example of single-virion analysis is shown in Figure 2B. The results showed that when Gag-CeFP was co-transfected with A3G-YFP alone (top panels) or A3F-mCH alone (middle panels), Gag-CeFP+ particles that were YFP+ or mCH+, respectively, could be detected. When Gag-CeFP was co-transfected with A3G-YFP + A3F-mCH (bottom panels), Gag-CeFP particles that were both YFP+ and mCH+ could be detected. Co-localization of Gag-CeFP with A3G-YFP + A3F-mCH indicated that both A3G-YFP and A3F-mCH were copackaged into the same HIV-1 particles. We then analyzed the proportion of HIV particles that incorporated detectable levels of the four human APOBEC3 proteins (A3D, A3F, A3G and A3H) by quantifying ∼9000 to ∼20 000 Gag-CeFP+ particles from three independent experiments for each condition (Figure 2C). The average labeling efficiencies of Gag-CeFP+ virions produced in the presence of A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH were 78.3 ± 4.5%, 81.8 ± 3.8%, 50.0 ± 6.4% and 83.0 ± 4.7%, respectively. When A3G-YFP and A3F-mCH were co-expressed, A3G-YFP was detected in 66.8 ± 5.8% and A3F-YFP was detected in 76.5 ± 5.7% of the particles; similarly, when A3G-YFP and A3D-mCH were co-expressed, A3G-YFP was detected in 73.6 ± 4.2% and A3D-mCH was detected in 42.0 ± 7.6%, and when A3G-YFP and A3H-mCH were co-expressed, A3G-YFP was detected in 72.8 ± 2.5% and A3H-mCH was detected in 79.7 ± 7.3% of virions. We found that in the combination experiments ∼58%, ∼37% and ∼64% of HIV-1 particles contained A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH and A3G-YFP + A3H-mCH, respectively. The proportions of virions containing APOBEC3 proteins when single APOBEC3 proteins were expressed were not significantly different from the proportions of particles containing detectable levels of these enzymes in the combination experiments (P > 0.05 for all comparisons). This observation indicated that, contrary to a previous report (66), APOBEC3 proteins did not have much (if any) influence on the packaging efficiency of the other APOBEC3 protein. To quantify the amounts of APOBEC3 proteins that were packaged when they were expressed alone or in combination with other APOBEC3 proteins, we compared the fluorescence intensities of YFP and mCH in single and dual-labeled virions that were produced from cells expressing a single APOBEC3 protein or from cells expressing A3G-YFP and one of the other APOBEC3 proteins (Figure 2D). For this analysis, the A3G-YFP only and A3F-mCH only were set to 100% and the fluorescence intensities of the others were plotted relative to the respective fluorescent protein-tagged APOBEC3 protein. The fluorescence intensities of all APOBEC3 proteins were comparable in both single- and dual-labeled HIV-1 particles, further indicating that copackaging of A3G-YFP with the other APOBEC3 proteins did not influence the amounts of the A3G-YFP or other APOBEC3 proteins copackaged into the same particles (P > 0.05; t-test). Interestingly, the proportion of virions containing detectable levels A3D-mCH is ∼2-fold less compared to the other APOBEC3 proteins (Figure 2C); furthermore, the fluorescence intensity analysis suggests that the virions incorporating A3D-mCH contained about half the number of A3D-mCH molecules compared to the other APOBEC3 proteins (Figure 2D; *P < 0.05). Both observations are likely to be due to the reduced levels of A3D-mCH expression compared to the other APOBEC3 proteins (Figure 1C). In any case, the reduced virion incorporation of A3D can at least partially explain its reduced antiviral potency that we and others have observed (Figure 1E, (42,48)). Taken together, these results show that APOBEC3 proteins can be copackaged without any apparent reduction in their packaging efficiencies.

Analysis of patterns of hypermutation in the presence of multiple APOBEC3 proteins

APOBEC3 proteins are known to edit cytidines during synthesis of the minus-strand DNA by selectively targeting 5′-CC-3′ (A3G) and 5′-TC-3′ (A3D, A3F and A3H) dinucleotide sites (7,42,60,67,68). Moreover, these members of the APOBEC3 family of proteins have been demonstrated to be expressed in the natural target cells of HIV (18,43). To study the effect of copackaged APOBEC3 proteins on comutation of the same viral genome, we used viruses produced in the presence of A3G-YFP only, A3F-mCH only, A3D-mCH only, A3H-mCH only, A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH or A3G-YFP + A3H-mCH to infect CEM-SS cells and analyzed the pattern of hypermutations in HIV proviral DNA (Figure 3; Tables 1 and 2). A 1320-nt region of HIV-1 RT was PCR-amplified from proviral DNAs 24-h post-infection, cloned into a plasmid and individual PCR products were sequenced (Figure 3A). GG-to-AG mutations (predominantly induced by A3G) and GA-to-AA mutations (predominantly induced by A3F, A3D and A3H) will be hereafter referred to as GG and GA mutations, respectively. In total, we analyzed 122–250 independent clones from three to six independent experiments performed with viruses produced from cells expressing A3G-YFP only, A3F-mCH only, A3D-mCH only, A3H-mCH only, A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH and A3G-YFP + A3H-mCH (Table 1). Because of differences in their deamination activity, the frequency of clones that were hypermutated ranged from 9% for A3D to 96% for A3G (Table 1). Almost all of the sequences analyzed had unique G-to-A mutation patterns, confirming that they were obtained from independent proviral DNAs; the few sequences that were duplicates (<4 per group) were removed from the analysis to avoid bias resulting from amplification of some clones during PCR.
Table 1.

Hypermutations induced by APOBEC3 proteins during a single cycle infection in cell culture

Hypermutated sequencesGA-to-AA mutationsGG-to-AG mutations
APOBEC3No. of clones sequencedTotal or dualNumber (%)TotalPer cloneaPer siteb × 10−3TotalPer cloneaPer siteb × 10−3
A3G-YFP only171Total164 (96.0)6913.9 ± 1.438.3548132.1 ± 4.8428.5
A3F-mCH only160Total149 (93.0)234813.0 ± 6.1143.34002.2 ± 1.134.4
A3D-mCH only122Total11 (9.0)343.4 ± 4.028.1201.6 ± 1.223.3
A3H-mCH only122Total73 (60.0)3354.6 ± 0.441.7530.6 ± 0.69.3
A3G-YFP + A3F-mCH250Total226 (90.4)244011.1 ± 5.898.1642328.3 ± 5.2364.4
Dual112 (49.6)195416.4 ± 4.9158.6342631.6 ± 4.5392.2
A3G-YFP + A3D-mCH122Total113 (92.6)6605.9 ± 0.953.1384734.0 ± 6.6436.5
Dual2 (1.8)n.d.cn.d.cn.d.cn.d.cn.d.cn.d.c
A3G-YFP + A3H-mCH127Total119 (93.7)123910.1 ± 4.694.7345328.7 ± 3.6372.0
Dual58 (48.7)95917.1 ± 2.2150.3173133.1 ± 5.9382.6

aThe average number of mutations per clone ± standard deviation for three to six independent experiments is shown. For the A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH and A3G-YFP + A3H-mCH groups, the total hypermutated sequences as well as dual-hypermutated sequences were analyzed to determine the average number of mutations per clone.

bMutations/site = total mutations/[sites/clone × total no. of hypermutated sequences]. For the A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH and A3G-YFP + A3H-mCH groups, Mutations/site were calculated using either total hypermutated sequences or dual-hypermutated sequences and both are shown. Each clone contained 78 GG and 110 GA editing target sites.

cDue to the limited number of dual-hypermutated sequences obtained for A3G-YFP + A3D-mCH group, further analysis was not carried out. n.d.; not determined.

Table 2.

Effect of nucleotide at +2 position on APOBEC3-induced G-to-A mutation frequencies in cell culture

Nucleotide at +2 position
ACGT
APOBEC3Trinucleotide contextMut./sitea (× 10−3)Rel. Mut. Freq.bMut./sitea (× 10−3)Rel. Mut. Freq.bMut./sitea (× 10−3)Rel. Mut. Freq.bMut./sitea (× 10−3)Rel. Mut. Freq.b
A3G-YFPGAn22.20.161.90.01136.71.0021.20.16
GGn437.30.6951.80.08629.91.00366.70.58
A3F-mCHGAn197.81.0041.00.21117.30.59126.90.64
GGn61.91.001.10.0214.60.2415.30.25
A3D-mCHGAn28.50.559.60.1819.10.3751.91.00
GGn40.41.000.00.0011.90.2913.00.32
A3H-mCHGAn61.01.0013.00.2131.00.5130.70.50
GGn12.20.970.00.0012.51.000.00.00
A3G-YFP + A3F-mCHGAn196.40.9036.20.17217.61.00124.10.57
GGn396.80.6846.10.08585.81.00325.30.56
A3G-YFP + A3D-mCHGAnn.d.cn.d.cn.d.cn.d.cn.d.cn.d.cn.d.cn.d.c
GGnn.d.cn.d.cn.d.cn.d.cn.d.cn.d.cn.d.cn.d.c
A3G-YFP + A3H-mCHGAn179.50.8239.00.18218.71.00118.20.54
GGn398.90.7554.60.10534.51.00362.10.68

aFor single APOBEC3-containing experiments, Mutations/site = total mutations/[sites/clone × total no. of hypermutated clones]; for the combination experiments, Mutations/site = total mutations/[sites/clone × no. of dual-hypermutated clones]. Each clone contained 78 GG and 110 GA editing target sites. The numbers of sites per clone were as follows: GAA, 51; GAC, 19; GAG, 19; GAT, 21; GGA, 36; GGC, 12; GGG, 23; GGT, 7.

bRelative mutation frequencies were calculated by setting the highest mutation per site frequency to 1.0, and determining the mutation frequencies for the other nucleotides relative to the highest mutation frequency.

cBecause only two clones were dual-hypermutated sequences, further analysis was not carried out. n.d., not determined.

APOBEC3 proteins can comutate the same HIV genome. (A) Schematic representation of CEM-SS infection with the indicated HDV-EGFP viruses made in the presence of A3G-YFP only, A3F-mCH only, A3D-mCH only, A3H-mCH only, A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH or A3G-YFP + A3H-mCH. Genomic DNA was extracted from the infected cells 24-h post-infection (p.i.), and a 1320-bp fragment encoding reverse transcriptase (RT) was PCR-amplified using the indicated forward and reverse primers and sequenced using four overlapping primer sets. Numbers of GG-to-AG (B) and GA-to-AA (C) mutations per clone are shown for A3G (dark gray bars), A3D, A3F and A3H (light gray bars|), for the combination experiments, the calculation was done using dual-hypermutated sequences only (observed frequencies [O]; black bars). The A3G-YFP + A3D-mCH conditions were not determined (n.d.) because of the limited numbers of dual-hypermutated sequences. The expected frequencies (E; white bars) were calculated by adding the mutation frequencies/clone of each APOBEC3 protein from Table 1. The averages ± S.D. from three to six independent experiments are shown. Welch's unpaired t-test was performed to determine statistical significance (*P < 0.005; ns, not significant). (D) The distribution of GA-to-AA mutations induced by A3G-YFP only (dashed line), A3F-mCH only (black line) and A3G-YFP + A3F-mCH (light gray line), are shown relative to their locations in the 1320-nt RT region. (E) The distribution of GA-to-AA mutations induced by A3G-YFP only (dashed line), A3H-mCH only (black line) and A3G-YFP + A3H-mCH (light gray line). Each point represents the average frequency of mutations/site after combining the sequences from three to six independent experiments (1.0 = 100%). Black vertical tick lines on the X-axis indicate positions of 110 GA target sites (D and E). Data were analyzed with generalized linear model and loglinear modeling techniques. See text for a detailed description of the non-homogeneity statistical analysis and group comparisons (**P < 0.0001). aThe average number of mutations per clone ± standard deviation for three to six independent experiments is shown. For the A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH and A3G-YFP + A3H-mCH groups, the total hypermutated sequences as well as dual-hypermutated sequences were analyzed to determine the average number of mutations per clone. bMutations/site = total mutations/[sites/clone × total no. of hypermutated sequences]. For the A3G-YFP + A3F-mCH, A3G-YFP + A3D-mCH and A3G-YFP + A3H-mCH groups, Mutations/site were calculated using either total hypermutated sequences or dual-hypermutated sequences and both are shown. Each clone contained 78 GG and 110 GA editing target sites. cDue to the limited number of dual-hypermutated sequences obtained for A3G-YFP + A3D-mCH group, further analysis was not carried out. n.d.; not determined. aFor single APOBEC3-containing experiments, Mutations/site = total mutations/[sites/clone × total no. of hypermutated clones]; for the combination experiments, Mutations/site = total mutations/[sites/clone × no. of dual-hypermutated clones]. Each clone contained 78 GG and 110 GA editing target sites. The numbers of sites per clone were as follows: GAA, 51; GAC, 19; GAG, 19; GAT, 21; GGA, 36; GGC, 12; GGG, 23; GGT, 7. bRelative mutation frequencies were calculated by setting the highest mutation per site frequency to 1.0, and determining the mutation frequencies for the other nucleotides relative to the highest mutation frequency. cBecause only two clones were dual-hypermutated sequences, further analysis was not carried out. n.d., not determined. The mutations induced by A3G-YFP, A3F-mCH, A3D-mCH and A3H-mCH when expressed alone or in combination were in agreement with the expected dinucleotide sequence contexts (48,67,69) as summarized in Table 1, Figure 2B and C. A3G predominantly edited GG sites (88% of the mutations; 32.1 mutations/clone) and ∼12% of the mutations occurred at the GA sites (3.9 mutations/clone). On the other hand, A3D, A3F and A3H predominantly edited GA sites at frequencies of 3.4, 13.0 and 4.6 mutations/clone, respectively; however, they also induced some mutations at GG sites (0.6–2.2 mutations/clone) (Table 1). We then assessed the hypermutation patterns of cells infected with virions produced in the presence of A3G-YFP and one of the other APOBEC3 proteins (A3F-mCH, A3D-mCH or A3H-mCH), and as expected, the frequencies of clones that were hypermutated were >90% (Table 1). To determine whether there were any potential interactions between copackaged APOBEC3 proteins that influence the pattern of hypermutation, we compared the observed and expected frequencies of hypermutation at GG and GA sites (Figure 3B and C). The expected frequencies of hypermutation were determined by summing the hypermutation frequencies observed when the APOBEC3 proteins were individually expressed (Figure 3B and C, white bars). Sequences that were hypermutated by A3G and another APOBEC3 protein were defined as dual-hypermutated sequences if they met the following two criteria (Table 1). First, we determined the percentage of G-to-A mutations that were at GA sites for virions produced in the presence of A3G-YFP only (12%); proviral sequences that exhibited a high percentage of GA mutations (>2 standard deviations above the mean for virions produced in the presence of A3G-YFP only) were defined as dual-hypermutated sequences. Second, the sequence context in which the GA mutations occurred (discussed in more detail below) was analyzed to verify that the GA mutations were generated by the action of A3G (GAG), A3F/A3H (GAA) or A3D (GAT). In the presence of A3G-YFP + A3F-mCH and A3G-YFP + A3H-mCH, the frequencies of dual-hypermutated proviruses were 49.6 and 48.7%, respectively (Table 1), which was in general agreement with their copackaging efficiencies of 58 and 67%, respectively (Figure 2C; yellow bars). The frequencies of dual-hypermutated clones may have been lower than the copackaging efficiencies because of deamination-independent inhibition of DNA synthesis (70–73). However, we found only 2/113 (1.8%) dual-hypermutated clones in the presence of A3G-YFP + A3D-mCH. Since both A3D and A3H can mutate GA sites with similar frequencies (Table 1), the result indicated that A3D did not significantly increase the frequency of dual-hypermutated clones because it is inefficiently incorporated into virions. The presence of A3F or A3H in virions that also contained A3G-YFP did not alter the average mutation frequency at the GG sites (32.1, 31.6 and 33.1 mutations/clone for A3G-YFP only, A3G-YFP + A3F-mCH and A3G-YFP + A3H-mCH, respectively; Table 1 and Figure 3B; P > 0.05). Co-expression of A3G-YFP + A3D-mCH resulted in very few dual-hypermutated clones (2/113), which precluded any further detailed analysis of potential interactions between A3G and A3D. These data suggested that co-expression of A3F-mCH or A3H-mCH did not influence the ability of A3G-YFP to induce mutations at GG sites (Table 1, Figure 3B; P > 0.05). Analysis of mutation frequencies at GA sites revealed that the presence of A3G-YFP + A3F-YFP or A3G-YFP + A3H-mCH resulted in 16.4 and 17.1 mutations per clone in the dual-hypermutated sequences, respectively, which was not significantly higher than the expected value for A3G only + A3F only samples (3.9 + 13.0 = 16.9 mutations/clone; Table 1 and Figure 3C; P > 0.05), but was significantly higher than the expected value for A3G only + A3H only samples (3.9 + 4.6 = 8.5 mutations/clone; Table 1 and Figure 3C; P < 0.005). This result suggested that the copackaging of A3G-YFP and A3H-mCH resulted in a synergistic increase in mutations at the GA sites. Although we did not observe an increase in the frequency of mutations at GG sites when A3G-YFP was expressed alone or in combination with A3F-mCH or A3H-mCH (Table 1 and Figure 3B), potential interactions between APOBEC3 proteins could have influenced the selection of GG target sites by A3G-YFP. To address this question, we analyzed the distribution of mutations in the 1.3-kb region of pol which contains 78 potential GG editing sites by applying the Likelihood Ratio Chi Square statistical analysis (Supplementary Figure S2A and B). Although the edited GG sites were distributed throughout the sequence, we observed that the distribution of mutations at GG sites was not homogeneous across sites (Supplementary Figure S2A and B; P < 0.0001), indicating that some GG sites were preferred targets for mutation as compared to other GG sites. Moreover, when we compared the A3G-YFP only samples to A3G-YFP + A3F-mCH or A3G-YFP + A3H-mCH, we found that the distribution of GG edited sites was homogenous and overlapped in all the three groups, indicating that copackaging of A3F-mCH or A3H-mCH did not significantly influence the A3G deamination activity at the GG sites. Since A3F-mCH and A3H-mCH induced very few mutations at GG sites (2.2 and 0.6 mutations/clone, respectively, compared to 32.1 mutations/clone by A3G; Table 1), their contribution to mutations at GG sites did not have a significant influence on the overall distribution of mutations at GG sites. We performed the same Likelihood Ratio Chi Square statistical analysis with respect to the 110 GA sites present in the 1.3-kb region of pol to determine whether copackaging of A3G-YFP with A3F-mCH or A3H-mCH influenced the distribution of mutations at GA sites (Figure 3D and E, respectively). We found that the distribution of GA edited sites was not homogenous, indicating that A3G-YFP, A3F-mCH and A3H-mCH induced mutations at some GA target sites at higher frequencies than other GA sites (Figure 3D and E; **P < 0.0001). Next, we performed group comparisons between A3G-YFP versus A3F-mCH, A3G-YFP versus A3H-mCH and A3F-mCH versus A3H-mCH, and found that all groups exhibited significant differences (Figure 3D, E and Supplementary Figure S2C, respectively; P < 0.0001). Thus, the APOBEC3 proteins exhibited unique preferences for GA sites that were targeted for mutation. Similar analysis demonstrated that the distribution of the GA mutations between A3G-YFP versus A3G-YFP + A3F-mCH and A3G-YFP versus A3G-YFP + A3H-mCH were also not homogenous (Figure 3D and E, respectively; P < 0.0001). In addition, we also found a significantly different distribution of the GA mutations between A3F-mCH versus A3G-YFP + A3F-mCH and A3H-mCH versus A3G-YFP + A3H-mCH samples (Figure 3D and E, respectively; **P < 0.0001). Taken together, these results suggest that each APOBEC3 protein mutates a unique set of GA sites, and when two APOBEC3 proteins are copackaged, they both contribute to mutations at GA sites and generate distinct patterns of mutations at GA sites.

Local sequence contexts of APOBEC3 protein-mediated mutations of HIV-1 proviral DNA

The target sequences of APOBEC3 protein-mediated mutations are not random and differences in the distribution of mutations as well as local consensus sequences can be discerned (Figure 3D, E and Supplementary Figure S2A and B) (44–46,48,67,74). Biochemical and structural studies have suggested that the amino acids of polynucleotide cytidine deaminases that interact with the substrate single-stranded DNA and the conformational changes in the substrate may play critical roles to permit the reactive cytidine and the immediate neighboring −1, +1 and +2 nucleotides to access the active site (75,76). To systematically investigate the local sequence contexts in which G-to-A mutations occurred, the mutation frequencies in the context of each base found between −5 to +5 positions relative to the deaminated cytidine were interrogated. We found distinct nucleotide preference hierarchy at the +2 position for each APOBEC3 protein relative to the guanine nucleotide that was mutated to adenine (underlined) in 5′-GA and 5′-GG editing sites upon deamination of the cytidine in the minus-strand DNA (Table 2). We first determined the frequencies of mutations per site for each nucleotide (A, G, C or T) at the +2 position (mutations/[sites per sequence × number of hypermutated sequences analyzed]). We then ranked the mutation frequencies and set the highest mutation frequency to 1.0, which was then used to calculate the relative frequencies of mutations for the other nucleotides. Using the numbers of G-to-A mutations in the trinucleotide context when the +2 nucleotide is an A, C, G or T, we carried out Fisher's Exact tests (with Bonferroni correction for multiple comparisons) to determine whether the highest mutation frequency in a specific +2 nucleotide context was significantly different from the other three +2 nucleotide contexts. The results obtained from cell culture experiments are shown in Table 2, Figure 4A and B. The mutation frequencies induced by A3G in the GG and GA contexts are highest and significantly different (P < 0.05) when the +2 nucleotide is G (Figure 4A and B, respectively); the mutation frequencies are significantly lower when the +2 nucleotide is A or T (Figure 4A and Figure B, respectively). Overall A3G is less likely to mutate GG or GA sites when C is at the +2 nucleotide position (P < 0.05) but A3G can accommodate A or T at the +2 position much more frequently in the GG context compared to the GA context.
Figure 4.

Effect of nucleotide at +2 position on relative mutation frequencies at GA and GG sites in cell culture and in patients. The relative mutation frequencies in cell culture (A and B) are shown in Table 2. Relative preference of nucleotides at the +2 position in both the GG (A) and GA (B) edited sites were plotted for A3G-YFP only, A3F-mCH only, A3D-mCH only, A3H-mCH only, A3G-YFP + A3F-mCH and A3G-YFP + A3H-mCH conditions in the cell culture experiments. For the combination experiments only the dual-hypermutated sequences were analyzed and for A3G-YFP + A3D-mCH combination experiments, only two clones were identified as dual hypermutated and hence the analysis was not determined (n.d.). For all conditions, the mutation frequency of the most preferred nucleotide was set to 1.0 and the mutation frequencies for the other nucleotides relative to the most preferred nucleotide were shown. (C) The contribution of the different APOBEC3 proteins in patients as determined by the proportion of hypermutated sequences. (D) Similar analysis shown in panel A and B was performed to determine relative mutation frequencies for hypermutated sequences isolated from patients and the sequences fall into four categories that fulfill the criteria for A3G only, A3D only, A3F or A3H only, and A3G + A3F (or A3H) induced G-to-A mutations. All the comutated sequences identified in patients have local sequence context preference of A3G + (A3F or A3H) pattern; none for A3G + A3D were identified. The relative mutation frequencies at GA edited sites for sequences that primarily had GG site mutations (A3G pattern) and at GG edited sites for sequences that primarily had GA site mutations (A3D or A3F/A3H) were not determined (n.d.) because of the limited numbers. Data were analyzed with generalized linear model and loglinear modeling techniques. The significantly preferred nucleotide in the +2 positions are indicated (*P < 0.05).

Effect of nucleotide at +2 position on relative mutation frequencies at GA and GG sites in cell culture and in patients. The relative mutation frequencies in cell culture (A and B) are shown in Table 2. Relative preference of nucleotides at the +2 position in both the GG (A) and GA (B) edited sites were plotted for A3G-YFP only, A3F-mCH only, A3D-mCH only, A3H-mCH only, A3G-YFP + A3F-mCH and A3G-YFP + A3H-mCH conditions in the cell culture experiments. For the combination experiments only the dual-hypermutated sequences were analyzed and for A3G-YFP + A3D-mCH combination experiments, only two clones were identified as dual hypermutated and hence the analysis was not determined (n.d.). For all conditions, the mutation frequency of the most preferred nucleotide was set to 1.0 and the mutation frequencies for the other nucleotides relative to the most preferred nucleotide were shown. (C) The contribution of the different APOBEC3 proteins in patients as determined by the proportion of hypermutated sequences. (D) Similar analysis shown in panel A and B was performed to determine relative mutation frequencies for hypermutated sequences isolated from patients and the sequences fall into four categories that fulfill the criteria for A3G only, A3D only, A3F or A3H only, and A3G + A3F (or A3H) induced G-to-A mutations. All the comutated sequences identified in patients have local sequence context preference of A3G + (A3F or A3H) pattern; none for A3G + A3D were identified. The relative mutation frequencies at GA edited sites for sequences that primarily had GG site mutations (A3G pattern) and at GG edited sites for sequences that primarily had GA site mutations (A3D or A3F/A3H) were not determined (n.d.) because of the limited numbers. Data were analyzed with generalized linear model and loglinear modeling techniques. The significantly preferred nucleotide in the +2 positions are indicated (*P < 0.05). The most favored nucleotide at +2 position by A3F and A3H for inducing mutations at the GA sites is A (P < 0.05), followed by G or T (Figure 4B). Unlike A3F and A3H, A3D prefers a T at the +2 position followed by A or G; however, the differences are not statistically significant because very few A3D-induced GA mutations were observed. Interestingly, all APOBEC3 cytidine deaminases highly disfavored GA editing sites when a C was at the +2 position (Figure 4B). Next, we compared the relative nucleotide preferences at the +2 position of G-to-A mutations when A3G-YFP + A3F-mCH or A3G-YFP + A3H-mCH were copackaged into the same virus particles (Table 2; Figure 4A and B). At the GG edited sites, the preferences for nucleotides at the +2 position were nearly identical to the preferences observed when A3G alone was present in the virus particles, further confirming that mutations in the GG context are primarily induced by A3G irrespective of the presence or absence of other APOBEC3 proteins (Table 2 and Figure 4A). Interestingly, the preference for nucleotides at the +2 position in the GA edited sites in the dual-hypermutated proviruses (A3G-YFP + A3F-mCH or A3G-YFP + A3H-mCH) were not similar to the A3G-YFP, A3F-mCH or A3H-mCH alone (Table 2 and Figure 4B; P < 0.0001 for all group comparisons between A3G-YFP versus A3G-YFP + A3F-mCH; A3G-YFP versus A3G-YFP + A3H-mCH, A3F-mCH versus A3G-YFP + A3F-mCH; A3H-mCH versus A3G-YFP + A3H-mCH), indicating that both copackaged APOBEC3 proteins contributed to the observed mutations at the GA sites. Finally, using the +2 nucleotide preference, we investigated whether the synergistic increase in GA mutations observed when A3G-YFP and A3H-mCH were copackaged was the result of an increase in the deaminase activity of A3H and/or A3G at GA sites. A3G and A3H prefer GAG and GAA sites for inducing mutations at GA sites, respectively (Figure 4B and Supplementary Figure S3). When both A3G and A3H were co-expressed, we observed a synergistic increase in GA mutations at the GAA sites that are preferred by A3H (P = 0.027), but not the GAG sites that are preferred by A3G (P > 0.05), suggesting that the increase in GA mutations was due to a synergistic increase in the deaminase activity of A3H but not A3G. In contrast, we observed only an additive increase in GA mutations when A3G and A3F were copackaged; as expected, we observed additive activity at the GAG (A3G preferred sites) and GAA (A3F preferred sites) contexts when A3G-YFP and A3F-mCH were copackaged (Supplementary Figure S3; P > 0.05).

Frequent comutation of proviruses by APOBEC3 proteins in HIV-1 infected patients

In PBMCs isolated from HIV-1 infected patients, proviral DNAs hypermutated by APOBEC3 proteins have been observed. The existence of GG and GA mutations in the same genome has also been described; however, none of the studies have shown that they were the result of copackaged APOBEC3 proteins (50,77–79). In addition, one study analyzed HIV sequences from the Los Alamos database (22), and suggested that A3G and A3F rarely comutated the same genome in vivo. Thus, we sought to determine whether APOBEC3 proteins are copackaged into the same HIV-1 virions and comutate the same viral genome in patients. To address this question, we analyzed APOBEC3-mediated hypermutated sequences obtained from a well-characterized cohort of seven chronically infected HIV-1 patients. Clinical characteristics of the seven patients are shown in Supplementary Table S1. The estimated duration of infection ranged from 1.2 to 21.8 years and all of the study participants had detectable viremia with a median viral load of 4.8 log10 copies/ml at the time of sampling (Supplementary Table S1). A genetic screen for A3H haplotype was determined as described before (6,68) and the poorly expressed hap I was found in one patient (patient 4), hap II, the stably expressed and potent antiviral variant of A3H, was found in six patients of whom one (patient 10) was homozygous at this locus and the remaining five were heterozygous (Supplementary Table S2). We used the local sequence contexts determined in the cell culture experiments described above to determine whether similar sequence contexts were also observed in vivo. We performed SGS of the 1.1-kb fragment of the pro-pol region of HIV-1 plasma viral RNA as well as proviral DNA extracted from PBMCs (Table 3). We and others have observed that as a result of purifying selection, there are few if any hypermutated sequences in viral RNA (47,50,80). We obtained a total 217 independent plasma viral RNA sequences, and as expected, the plasma viral RNA sequences had very few G-to-A mutations per clone (GG-to-AG mutations per clone ranged from 0–1 and GA-to-AA mutations per clone ranges from 1–3) (Table 3). Then we generated a non-hypermutated consensus sequence for each patient from these plasma viral RNA sequences to which we aligned all the proviral DNA sequences. We determined a threshold level of GG-to-AG and GA-to-AA mutations that significantly exceeded levels of these mutations detected in the corresponding plasma viral RNA (Fisher's Exact tests for each patient; P < 0.05); for each patient, threshold levels for hypermutation ranged from 5 to 7 for GG-to-AG and 7 to 10 GA-to-AA mutations compared to the consensus plasma viral RNA sequence for each patient.
Table 3.

Single-genome sequencing analysis of viral RNA and cellular DNA from patients

Viral RNACellular DNA
PatientTotal sequencesHypermutated sequencesaTotal sequencesHypermutated sequencesa
1240520
2500175
4270223
7160448
8210212
10710592
11801134
Total217032824

aA consensus plasma viral RNA sequence was generated for each patient. The average number of G-to-A changes between individual RNA sequences and the consensus RNA sequence was used to determine the number of G-to-A changes needed to achieve statistical significance. A sequence that had a significantly higher number of mutations than the plasma viral RNA consensus sequence was considered a hypermutant (Fisher's Exact test; P < 0.05).

aA consensus plasma viral RNA sequence was generated for each patient. The average number of G-to-A changes between individual RNA sequences and the consensus RNA sequence was used to determine the number of G-to-A changes needed to achieve statistical significance. A sequence that had a significantly higher number of mutations than the plasma viral RNA consensus sequence was considered a hypermutant (Fisher's Exact test; P < 0.05). We then characterized the nature of G-to-A hypermutations in patients by analyzing a total of 328 proviral DNA sequences (17–113 sequences per patient), of which 24 (7.3%) were hypermutated (Tables 3 and 4). This frequency of hypermutation is comparable to the minimal estimate of hypermutation reported previously (47,50). Only one of the seven patients did not have any hypermutated sequences even though 52 proviral DNA sequences were obtained (Table 3). Hypermutants that predominantly had mutations at GG sites but also had a high frequency of GA site mutations (>10% of GG site mutations) were defined as dual-hypermutated proviruses. Similarly, hypermutants that predominantly had GA site mutations but also had a high frequency of GG site mutations (17% of GA site mutations) were defined as dual-hypermutated proviruses. Using these criteria, 62% of the hypermutated proviral sequences were mutated by A3G only (15/24) and 21% of the hypermutated proviruses were mutated by A3F, A3H or A3D). The remaining 17% (4/24) of proviruses were dual-hypermutated, indicating that copackaging of APOBEC3 proteins and comutation of the same viral genome can occur in vivo (Table 4 and Figure 4C).
Table 4.

Hypermutations induced by APOBEC3 cytidine deaminases in patients

GA-to-AA mutationsGG-to-AG mutations
APOBEC3No. of Seq.TotalPer sequenceaPer site (×10−3)bTotalPer sequenceaPer site (×10−3)b
A3G15815.4 ± 3.551.845430.3 ± 10.2457.2
A3D25527.5 ± 27.6241.252.5 ± 3.540.3
A3F or A3H3299.7 ± 2.186.831.0 ± 1.015.5
A3G + A3F or A3H45714.3 ± 4.6132.313132.8 ± 16.5490.6

aThe average number of mutations per sequence ± standard deviation was calculated for each group.

bMutations/site = total mutations/[sites/sequence × no. of sequences]. The numbers of sites per sequence were determined using the plasma viral RNA consensus sequence for each patient.

aThe average number of mutations per sequence ± standard deviation was calculated for each group. bMutations/site = total mutations/[sites/sequence × no. of sequences]. The numbers of sites per sequence were determined using the plasma viral RNA consensus sequence for each patient. In addition to A3F, A3D and certain haplotypes of A3H can predominantly induce mutations in the GA context and potentially contribute to the overall GA mutations in patients (7,68,81). In the cell culture experiments described above, we observed that A3F and A3H exhibit a similar preference for nucleotides at the +2 position, whereas A3D exhibits a distinct preference; we analyzed the patient-derived hypermutated sequences to determine which APOBEC3 protein contributed to the GA site mutations. The results indicated that 8% of the hypermutated proviruses were mutated by A3D only and 13% were mutated by A3F and/or A3H (Table 4; Figure 4C and D). One patient only expressed the unstable A3H Hap I, and its A3H presumably could not contribute to the GA site mutations; however, the hypermutated proviruses derived from this patient were predominantly mutated by A3G, which excluded the possibility of gaining further insights into the identity of the APOBEC3 proteins that contributed to GA site mutations (Table 3 and Supplementary Table S2). Importantly, our results suggest that A3G and A3F (or A3H) were copackaged into the same virions and comutated the same viral genomes. Next, we compared the effects of nucleotides at the +2 position on the frequencies of hypermutation in patients (Table 5 and Figure 4D). The frequencies of mutations per site for each nucleotide context were determined, the highest frequency was set to 1.0, and the relative mutation frequencies for the other nucleotides were determined as described above for the cell culture experiments (Figure 4A and B). The patterns of relative mutation frequencies induced by A3G in the GG context in patients and in cell culture were similar but not identical (P < 0.041); a G was the preferred nucleotide at the +2 position both in cell culture and patients (compare Figure 4A and D; P < 0.0005). The concordance between the patient studies and cell culture results confirmed that most of the GG mutations in patients were primarily induced by A3G cytidine deaminase activity. Moreover, the overall local sequence context preference at GA edited sites by A3D and A3F/A3H deaminases in patients displayed very similar patterns compared to the patterns observed in cell culture when they were expressed alone (compare Figure 4B and D; P > 0.05). In both patients and in cell culture, A3F and A3H preferred an adenine and A3D preferred a thymidine at the +2 position in the GA edited site context (P > 0.05), and cytidine was disfavored at the +2 position in patients as well as in cell culture (P < 0.05).
Table 5.

Effect of nucleotide at +2 position on mutation frequencies in vivo

Nucleotide at +2 position
ACGT
APOBEC3 proteinsDinuc. Seq.Mut./sitea (× 10−3)Rel. Mut. Freq.bMut./sitea (× 10−3)Rel. Mut. Freq.bMut./sitea (× 10−3)Rel. Mut. Freq.bMut./sitea (× 10−3)Rel. Mut. Freq.b
A3GGAn52.8n.d.c10.7n.d.c103.7n.d.c43.4n.d.c
GGn471.50.7580.80.13628.81.00330.60.53
A3DGAn223.40.551250.311750.43407.41.00
GGn44.1n.d.c0.0n.d.c0.0n.d.c111.1n.d.c
A3F or A3HGAn110.31.0064.50.5869.00.6376.90.70
GGn9.5n.d.c58.8n.d.c21.3n.d.c0.0n.d.c
A3G + A3F or A3HGAn216.41.0023.80.1194.60.44107.80.50
GGn517.70.79125.00.19652.21.00303.00.46

aMutations/site = total mutations/[sites/sequence × no. of sequences]. The numbers of sites per sequence were determined using the plasma viral RNA consensus sequence for each patient.

bThe relative mutation frequencies were calculated by setting the highest mutations/site frequency to 1 and determining the mutations/site frequency for the remaining nucleotides relative to the highest frequency.

cThe hypermutated sequences exhibited other nucleotide variation relative to the patient consensus sequences. Although we observed GA mutations in sequences that predominantly had GG mutations (A3G Only), the frequencies of these mutations were not significantly higher than those observed for non-hypermutated sequences. Therefore, we were unable to determine whether the GA mutations observed in sequences predominantly hypermutated by A3G, and GG mutations observed in sequences predominantly hypermutated by A3F (or A3D or A3H) were due to A3G- or A3F-(or A3D- or A3H-) induced deamination, respectively, or whether these mutations arose during multiple rounds of viral replication in the patients. Consequently, we did not analyze the relative mutation frequencies of these mutations. n.d., not determined.

aMutations/site = total mutations/[sites/sequence × no. of sequences]. The numbers of sites per sequence were determined using the plasma viral RNA consensus sequence for each patient. bThe relative mutation frequencies were calculated by setting the highest mutations/site frequency to 1 and determining the mutations/site frequency for the remaining nucleotides relative to the highest frequency. cThe hypermutated sequences exhibited other nucleotide variation relative to the patient consensus sequences. Although we observed GA mutations in sequences that predominantly had GG mutations (A3G Only), the frequencies of these mutations were not significantly higher than those observed for non-hypermutated sequences. Therefore, we were unable to determine whether the GA mutations observed in sequences predominantly hypermutated by A3G, and GG mutations observed in sequences predominantly hypermutated by A3F (or A3D or A3H) were due to A3G- or A3F-(or A3D- or A3H-) induced deamination, respectively, or whether these mutations arose during multiple rounds of viral replication in the patients. Consequently, we did not analyze the relative mutation frequencies of these mutations. n.d., not determined. Next, we analyzed the nucleotide preference at the +2 position in the dual-hypermutated sequences derived from patients (Table 5 and Figure 4D). The preferred nucleotides at the +2 position in the GG context when A3G and A3F (or A3H) deaminases were copackaged into virions in cell culture experiments and patients were similar (compare Figure 4A and D, respectively; P > 0.05), confirming that the GG site mutations in dual-hypermutated sequences derived from patients are primarily induced by A3G. Furthermore, the preferred nucleotides at the +2 position in the GA edited sites of dual-hypermutated patient-derived sequences were not significantly different from that observed in cell culture experiments performed with A3G-YFP + A3F-mCH or A3G-YFP + A3H-mCH (compare Figure 4B and D; P > 0.05). These results are consistent with the interpretation that most of the dual-hypermutated sequences were derived from the deamination activity of A3G as well as A3F or A3H.

DISCUSSION

In this study, we demonstrate for the first time that different APOBEC3 proteins can efficiently copackage and comutate the same viral genomes in a single cycle of HIV-1 replication. Moreover, we showed that comutation can be observed in proviral DNA recovered from PBMCs of chronically infected HIV-1 patients, suggesting that A3G and other APOBEC3 proteins can copackage and comutate HIV-1 in vivo. Using single-virion fluorescence microscopy, we observed that A3D, A3F and A3H can be copackaged with A3G into the same virions; furthermore, the amounts of the APOBEC3 proteins packaged into virions was similar to the amounts in virions containing a single APOBEC3 variant, suggesting that the APOBEC3 proteins do not compete for virion encapsidation. Previous studies by us and others concluded that APOBEC3 proteins are packaged into virions by interacting with viral and non-viral RNAs (25,28–32), and the amounts of APOBEC3 proteins packaged into virions is proportional to the amount of packaged RNA. Furthermore, a recent analysis using cross-linking and immunoprecipitation technique concluded that the promiscuity and high affinity of these proteins to bind to both viral and non-viral RNAs ensures their virion incorporation (36). Although promiscuous binding to RNA provides a potential mechanism for independent packaging of APOBEC3 proteins, the APOBEC3 proteins could compete for virion incorporation at a step prior to their interaction with RNA. The results of our studies directly determined that APOBEC3 proteins are copackaged and do not compete for packaging into virions. Previous studies have observed GG and GA mutations in the same viral genomes (50,69,77–79), which can be explained by three potential mechanisms. First, copackaging and comutation of the viral genomes may have occurred in a single cycle of replication; second, comutation could be the result of hypermutation by different APOBEC3 proteins in sequential rounds of replication; and third, one APOBEC3 protein could induce mutations in both GG and GA contexts. The previous studies could not distinguish between these possibilities, in part because the potential for APOBEC3 proteins to induce mutations at both GG and GA sites was not fully appreciated. As the results of these and other studies show, A3G can induce mutations at GA sites at ∼12% the frequency with which it induces mutations at GG sites (7,48,67). We also observed that A3F and A3H can induce mutations GG sites at 17 and 13% the frequency with which they induce mutations at GA sites, respectively. Thus, it is necessary to determine the ratio of mutations at GG and GA sites and determine whether the frequency of mutations at GA sites is higher than expected by individual APOBEC3 proteins in order to conclude that another APOBEC3 contributed to the hypermutation of genomes containing mutations at both GG and GA sites. Another study analyzed HIV-1 sequences from the Los Alamos HIV database and concluded that comutation of the same HIV-1 genomes by A3G and A3F occurs rarely (22). In this study, a bioinformatics-based determination of G-to-A hypermutations within the same sequence using only the dinucleotide GG and GA motifs representation approach was employed to analyze different groups and subtypes of HIV-1. This discrepancy could be due to the high genetic variation in the HIV-1 population, which limits the utility of population-based sequence analysis; the lack of a consensus sequence for each patient to align patient-derived reads may have reduced the sensitivity of identification of comutated genomes. In our study, we analyzed single-genome sequences and generated a consensus sequence for each patient based on the viral RNA sequences, potentially providing a more sensitive assay for hypermutation. We analyzed single-genome proviral DNA sequences from seven viremic chronically infected HIV-1 patients and mapped these APOBEC3-mediated G-to-A hypermutations onto individual proviral DNA reads by aligning to a consensus sequence generated for each patient based on the non-hypermutant plasma viral RNA sequences. In these studies, we systematically analyzed the +1 and +2 nucleotide positions of G-to-A mutations induced by various APOBEC3 proteins and found distinct trinucleotide contexts for each APOBEC3 protein. Importantly, we used these trinucleotide sequence contexts to identify the APOBEC3 proteins that induced hypermutation in the proviral DNAs derived from patients. The trinucleotide contexts we observed for A3G and A3F are in agreement with recently described trinucleotide contexts (48). In contrast to the previous study, we performed more detailed analysis of the relative frequencies of G-to-A mutations in various trinucleotide contexts for each APOBEC3 protein. In addition, in contrast to the previously described preference of A3H for only GGA sites, we observed a strong preference for both GGA and GGG contexts. Our sequence contexts for A3D also differed in that we observed a preference for GAT sites, while the previous study did not provide any contexts for GA sites but indicated a preference for GGA sites. The reasons for these differences are not clear, but may be due to the low deaminase activity of A3D, which may have resulted in very few G-to-A mutations above the background in the previous study. Combinations of A3G-YFP + A3F-mCH or A3G-YFP + A3H-mCH resulted in either no increase or in a 1.8-fold increase, respectively, in the frequency of mutations at GA sites compared to the additive increase expected from the deaminase activity of both APOBEC3 proteins. These results suggest that A3G-YFP + A3H-mCH, but not A3G-YFP + A3F-mCH, act synergistically to increase mutations at GA sites. A direct or indirect interaction between A3G and A3H could potentially result in synergism and an increase in GA mutations. A direct interaction could involve heterodimerization between the two deaminases and enhancement of the activity of one or both enzymes at 5′-TC sites. Alternatively, an indirect interaction, such as inhibition of the rate of plus-strand DNA synthesis (46,70,72,82,83), could allow the deaminases to act on the substrate minus-strand DNA for a longer period of time, leading to an increase in the frequency of GA mutations. Since we observed a synergistic increase in the activity of A3H but not A3G, the results suggest that the mechanism does not involve inhibition of reverse transcription, which would be expected to increase the deamination activity of both enzymes. However, additional studies are needed to understand the underlying molecular mechanism(s) that contributes to the synergistic increase in A3H deamination activity when copackaged with A3G. In our cell culture based assays, we observed an additive increase, but not a synergistic increase, in the overall antiviral activity when A3G and A3H were co-expressed in virus producing cells. While increased GA mutations would lead to more lethal mutations, we and others have recently concluded that hypermutation usually results in lethal mutagenesis (44,84), while others have concluded that hypermutation can result in sublethal mutagenesis and contribute to viral genetic variation (48). It remains possible that a twofold synergistic increase in the activity of A3H may lead to a small increase in antiviral activity that was not apparent in our single-cycle infectivity assays. Another potential outcome of copackaging and comutation is that the resulting proviruses may possess a greater number of G-to-A mutations, and therefore more stop codons, which would increase their potential for expression of aberrant protein products that can potentially be recognized by cytotoxic T lymphocytes (CTLs), facilitating CTL-mediated clearance of infected cells (85,86). It has been reported that hypermutation by APOBEC3 is associated with strong CTL and natural killer cell responses (87,88); additional studies are needed to determine whether copackaging and comutation can potentially facilitate CTL responses that eliminate infected cells (48,89). The observed synergism between copackaged A3G and A3H may provide insights into the mechanisms by which the APOBEC3 proteins interact with each other, move processively on the substrate DNA and carry out cytidine deamination. It is also possible that A3D, A3F and A3H can be copackaged, and act synergistically to increase GA mutations in a manner that leads to a synergistic increase in their antiviral activity. However, A3F and A3H both prefer GAA sites and it is not possible to clearly define which APOBEC3 protein or proteins contributed to the GA mutations in proviruses that are predominantly mutated at GA sites. While A3D is unique in its preference for GAT sites, A3F also induces mutations at GAT sites at a high frequency, and at present, it is not possible to clearly identify hypermutated genomes that were dual-hypermutated by A3F/A3H and A3D. Additional studies are needed to further define the sequence contexts that are preferred by A3D, A3F and A3H proteins and to determine whether they can be copackaged and can comutate the same genomes. Our cell culture-based data clearly demonstrate that ∼50% of the hypermutated sequences retrieved from cells infected with viruses produced in the presence of A3G-YFP + A3F-mCH or A3G-YFP + A3H-mCH were dual-hypermutated, which is consistent with the copackaging efficiency. Analysis of proviral genomes isolated from PBMCs of chronically infected HIV-1 patients indicated that some genomes were hypermutated at both GG and GA sites, suggesting that A3G and A3F (or A3H/A3D) were copackaged into the same virions and induced comutation. One potential explanation for the co-existence of GG and GA mutations in the same genome in HIV-1 patients is the accumulation of G-to-A mutations induced by different APOBEC3 proteins during different cycles of replication in vivo. However, comutation at GG and GA sites in sequential rounds of viral replication is unlikely for the following reasons. First, our recent studies indicate that hypermutation by one APOBEC3 protein is likely to lead to lethal hypermutation of the viral genome (84). Results from previous studies (47,50,80) as well as from the current study showed that because of purifying selection very few hypermutated genomic RNAs are packaged into virions, making it unlikely that RNAs from hypermutated genomes would be copackaged with another APOBEC3 protein during subsequent rounds of infections. Second, packaging and replication of the hypermutated genomes would require superinfection of the hypermutated provirus-containing cells with another replication-competent virus, which has been shown to be a rare event (90). Third, hypermutation itself is an infrequent event; for example, in our study only 24 of 328 (7.3%) proviral genomes from PBMCs of seven viremic chronically-infected HIV-1 patients were hypermutated, consistent with the previously reported hypermutation frequency (47,50). Since sequential rounds of hypermutation would require all three of these rare events, we believe that proviruses with both GG and GA mutations most likely resulted from copackaging of multiple APOBEC3 proteins and comutation of the same viral genomes in a single round of replication. Additionally, since sequential rounds of hypermutation are likely to be rare, we would expect the frequency of comutation to be less than the frequency of hypermutation by A3F or A3H alone. However, we observed a similar frequency of A3G and A3F/A3H-mediated comutation (4 of 24 sequences; ∼17%) and A3F (or A3H) only induced hypermutation (3 of 24 sequences; ∼13%), which suggests that comutation predominantly occurred through copackaging of different APOBEC3 proteins in a single round of replication. Specific sequences and/or substrate nucleic acid secondary structure could potentially affect target site specificities of APOBEC3 proteins. We analyzed 1320 nucleotides of the HIV-1 RT region, which represents ∼14% of the viral genome. Since APOBEC3 proteins are likely to interact with very short regions near the target nucleotide, we believe analysis of a large portion of the genome provides an average of the sequences and RNA structures that are likely to be encountered throughout the HIV-1 genome and in different HIV-1 isolates. Our experiments were performed with HDV-eGFP, which does not express HIV-1 accessory proteins Vif, Vpu, Vpr and Nef. Therefore, it is possible that the accessory proteins, Vpr, Vpu and Nef could influence the target DNA specificity of APOBEC3 proteins, and the target DNA specificities observed in infected individuals might be different from those determined in our cell-based assays. However, to our knowledge there is no direct evidence in the literature suggesting that these accessory proteins influence the deaminase activities and/or target site specificities of APOBEC3 proteins. In addition, we observed that the target site specificity of A3G in infected individuals and in cell-based assays was very similar, suggesting that the presence of the other accessory proteins did not influence A3G's target site specificity. In summary, the results presented here show that different APOBEC3 proteins can copackage into the same virions independently of each other and comutate the same viral genomes. Analysis of chronically infected patients led to the identification of dual-hypermutated proviruses, providing strong evidence that copackaging of these deaminases and comutation of HIV-1 genomes can frequently occur in vivo. The data reported here, which demonstrate that A3G and A3F or A3H proteins can inactivate the same genomes, indicate that they all are co-expressed co-ordinately in sufficient amounts in individual HIV target cells and copackaged to inhibit HIV replication. Furthermore, the findings suggest the possibility for cooperative inhibition of invading viral pathogens by multiple APOBEC3 proteins in vivo. Additional studies are needed to determine how the immune activation status (91–93), which may impact IFN induction (20,21,91,92,94) and APOBEC3 protein expression (20,93–95), affects copackaging and comutation of HIV-1 genomes.
  93 in total

1.  Induction of APOBEC3G ubiquitination and degradation by an HIV-1 Vif-Cul5-SCF complex.

Authors:  Xianghui Yu; Yunkai Yu; Bindong Liu; Kun Luo; Wei Kong; Panyong Mao; Xiao-Fang Yu
Journal:  Science       Date:  2003-10-16       Impact factor: 47.728

2.  Alpha interferon potently enhances the anti-human immunodeficiency virus type 1 activity of APOBEC3G in resting primary CD4 T cells.

Authors:  Keyang Chen; Jialing Huang; Chune Zhang; Sophia Huang; Giuseppe Nunnari; Feng-xiang Wang; Xiangrong Tong; Ling Gao; Kristi Nikisher; Hui Zhang
Journal:  J Virol       Date:  2006-08       Impact factor: 5.103

3.  Quantitative profiling of the full APOBEC3 mRNA repertoire in lymphocytes and tissues: implications for HIV-1 restriction.

Authors:  Eric W Refsland; Mark D Stenglein; Keisuke Shindo; John S Albin; William L Brown; Reuben S Harris
Journal:  Nucleic Acids Res       Date:  2010-03-22       Impact factor: 16.971

4.  Possible footprints of APOBEC3F and/or other APOBEC3 deaminases, but not APOBEC3G, on HIV-1 from patients with acute/early and chronic infections.

Authors:  Andrew E Armitage; Koen Deforche; John J Welch; Kristel Van Laethem; Ricardo Camacho; Andrew Rambaut; Astrid K N Iversen
Journal:  J Virol       Date:  2014-08-27       Impact factor: 5.103

5.  Single-RNA counting reveals alternative modes of gene expression in yeast.

Authors:  Daniel Zenklusen; Daniel R Larson; Robert H Singer
Journal:  Nat Struct Mol Biol       Date:  2008-11-16       Impact factor: 15.369

6.  HIV-1 Vif adaptation to human APOBEC3H haplotypes.

Authors:  Marcel Ooms; Bonnie Brayton; Michael Letko; Susan M Maio; Christopher D Pilcher; Frederick M Hecht; Jason D Barbour; Viviana Simon
Journal:  Cell Host Microbe       Date:  2013-10-16       Impact factor: 21.023

7.  APOBEC3G-induced hypermutation of human immunodeficiency virus type-1 is typically a discrete "all or nothing" phenomenon.

Authors:  Andrew E Armitage; Koen Deforche; Chih-Hao Chang; Edmund Wee; Beatrice Kramer; John J Welch; Jan Gerstoft; Lars Fugger; Andrew McMichael; Andrew Rambaut; Astrid K N Iversen
Journal:  PLoS Genet       Date:  2012-03-22       Impact factor: 5.917

8.  Different mutagenic potential of HIV-1 restriction factors APOBEC3G and APOBEC3F is determined by distinct single-stranded DNA scanning mechanisms.

Authors:  Anjuman Ara; Robin P Love; Linda Chelico
Journal:  PLoS Pathog       Date:  2014-03-20       Impact factor: 6.823

9.  Sequence and structural determinants of human APOBEC3H deaminase and anti-HIV-1 activities.

Authors:  Mithun Mitra; Dustin Singer; Yu Mano; Jozef Hritz; Gabriel Nam; Robert J Gorelick; In-Ja L Byeon; Angela M Gronenborn; Yasumasa Iwatani; Judith G Levin
Journal:  Retrovirology       Date:  2015-01-22       Impact factor: 4.602

Review 10.  Interferon-alpha, immune activation and immune dysfunction in treated HIV infection.

Authors:  Lilian Cha; Cassandra M Berry; David Nolan; Allison Castley; Sonia Fernandez; Martyn A French
Journal:  Clin Transl Immunology       Date:  2014-02-28
View more
  13 in total

1.  An Alu Element Insertion in Intron 1 Results in Aberrant Alternative Splicing of APOBEC3G Pre-mRNA in Northern Pig-Tailed Macaques (Macaca leonina) That May Reduce APOBEC3G-Mediated Hypermutation Pressure on HIV-1.

Authors:  Xiao-Liang Zhang; Meng-Ting Luo; Jia-Hao Song; Wei Pang; Yong-Tang Zheng
Journal:  J Virol       Date:  2020-01-31       Impact factor: 5.103

2.  Mechanism of Enhanced HIV Restriction by Virion Coencapsidated Cytidine Deaminases APOBEC3F and APOBEC3G.

Authors:  Anjuman Ara; Robin P Love; Tyson B Follack; Khawaja A Ahmed; Madison B Adolph; Linda Chelico
Journal:  J Virol       Date:  2017-01-18       Impact factor: 5.103

Review 3.  Multiple Inhibitory Factors Act in the Late Phase of HIV-1 Replication: a Systematic Review of the Literature.

Authors:  Jean-François Gélinas; Deborah R Gill; Stephen C Hyde
Journal:  Microbiol Mol Biol Rev       Date:  2018-01-10       Impact factor: 11.056

4.  Deamination hotspots among APOBEC3 family members are defined by both target site sequence context and ssDNA secondary structure.

Authors:  Yumeng Z McDaniel; Dake Wang; Robin P Love; Madison B Adolph; Nazanin Mohammadzadeh; Linda Chelico; Louis M Mansky
Journal:  Nucleic Acids Res       Date:  2020-02-20       Impact factor: 16.971

5.  Insights into DNA substrate selection by APOBEC3G from structural, biochemical, and functional studies.

Authors:  Samantha J Ziegler; Chang Liu; Mark Landau; Olga Buzovetsky; Belete A Desimmie; Qi Zhao; Tomoaki Sasaki; Ryan C Burdick; Vinay K Pathak; Karen S Anderson; Yong Xiong
Journal:  PLoS One       Date:  2018-03-29       Impact factor: 3.240

Review 6.  Structural Insights into APOBEC3-Mediated Lentiviral Restriction.

Authors:  Krista A Delviks-Frankenberry; Belete A Desimmie; Vinay K Pathak
Journal:  Viruses       Date:  2020-05-27       Impact factor: 5.048

7.  Role of co-expressed APOBEC3F and APOBEC3G in inducing HIV-1 drug resistance.

Authors:  Nazanin Mohammadzadeh; Robin P Love; Richard Gibson; Eric J Arts; Art F Y Poon; Linda Chelico
Journal:  Heliyon       Date:  2019-04-16

8.  Structural Determinants of the APOBEC3G N-Terminal Domain for HIV-1 RNA Association.

Authors:  Hirofumi Fukuda; Songling Li; Luca Sardo; Jessica L Smith; Kazuo Yamashita; Anamaria D Sarca; Kotaro Shirakawa; Daron M Standley; Akifumi Takaori-Kondo; Taisuke Izumi
Journal:  Front Cell Infect Microbiol       Date:  2019-05-21       Impact factor: 5.293

9.  HPV11 E6 mutation by overexpression of APOBEC3A and effects of interferon-ω on APOBEC3s and HPV11 E6 expression in HPV11.HaCaT cells.

Authors:  Yongfang Wang; Xinyu Li; Shasha Song; Yang Sun; Jiafen Zhang; Changming Yu; Wei Chen
Journal:  Virol J       Date:  2017-11-03       Impact factor: 4.099

10.  APOBEC3G Regulation of the Evolutionary Race Between Adaptive Immunity and Viral Immune Escape Is Deeply Imprinted in the HIV Genome.

Authors:  Faezeh Borzooee; Krista D Joris; Michael D Grant; Mani Larijani
Journal:  Front Immunol       Date:  2019-01-11       Impact factor: 7.561

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.