Literature DB >> 26186661

AAV ancestral reconstruction library enables selection of broadly infectious viral variants.

J Santiago-Ortiz1, D S Ojala1, O Westesson2, J R Weinstein3, S Y Wong2, A Steinsapir1, S Kumar2, I Holmes2, D V Schaffer1,2,4.   

Abstract

Adeno-associated virus (AAV) vectors have achieved clinical efficacy in treating several diseases. However, enhanced vectors are required to extend these landmark successes to other indications and protein engineering approaches may provide the necessary vector improvements to address such unmet medical needs. To generate new capsid variants with potentially enhanced infectious properties and to gain insights into AAV's evolutionary history, we computationally designed and experimentally constructed a putative ancestral AAV library. Combinatorial variations at 32 amino acid sites were introduced to account for uncertainty in their identities. We then analyzed the evolutionary flexibility of these residues, the majority of which have not been previously studied, by subjecting the library to iterative selection on a representative cell line panel. The resulting variants exhibited transduction efficiencies comparable to the most efficient extant serotypes and, in general, ancestral libraries were broadly infectious across the cell line panel, indicating that they favored promiscuity over specificity. Interestingly, putative ancestral AAVs were more thermostable than modern serotypes and did not use sialic acids, galactose or heparan sulfate proteoglycans for cellular entry. Finally, variants mediated 19- to 31-fold higher gene expression in the muscle compared with AAV1, a clinically used serotype for muscle delivery, highlighting their promise for gene therapy.

Entities:  

Mesh:

Year:  2015        PMID: 26186661      PMCID: PMC4509550          DOI: 10.1038/gt.2015.74

Source DB:  PubMed          Journal:  Gene Ther        ISSN: 0969-7128            Impact factor:   5.250


INTRODUCTION

Advances in DNA sequencing, synthesis, and computational phylogenetic analyses are enabling the computational reconstruction and experimental investigation of ancestral protein variants. Following the first ancestral reconstruction study – which resurrected a functional, ancestral digestive ribonuclease from an extinct bovid ruminant using the parsimony principle [1] – reconstructions and functional analyses have been carried out on inferred ancestral proteins belonging to eubacteria, bony vertebrates, mammals, and the least common ancestor of higher primates using several inference methods, including the parsimony, consensus, Bayesian distance, and maximum likelihood methods [2]. Such ancestral reconstructions and subsequent analysis of resurrected variants have yielded insights into the conditions that led to protein evolution as well as the continuous adaption of organisms to changing environmental conditions [3]. Ancestral reconstructions have also been harnessed to incorporate additional sequence diversity into genetic libraries for protein engineering. For instance, small libraries of resurrected ancestral variants were used in evolutionary studies of protein diversification [3-5] and to generate variants that are more tolerant to deleterious mutations. Moreover, inferred ancestral sequences have been combined with extant sequences by swapping residues of interest (e.g. residues in or close to an enzyme’s catalytic site) in modern sequences with those of the inferred ancestor. This residue swapping approach was used in basic evolutionary studies [6] as well as to screen for variants with properties such as increased thermostability [7], improved catalytic activity [8], novel substrate binding [9], and higher solubility [10]. Ancestral reconstruction is thus a versatile approach to explore new sequence space for engineering proteins with novel or enhanced properties, and it may likewise offer potential for gene therapy. This approach has recently been extended to more complex, multimeric proteins including viruses. The evolutionary history of viruses is an especially interesting application given their rapid mutational rates, importance to public health, and promise for gene therapy. For example, ancestral reconstructions of viral proteins have been generated with the goal of developing vaccine candidates against HIV-1 and influenza virus [11, 12], and to study the functionality and properties of the resurrected variants of HIV-1, influenza, and coxsackievirus [13, 14]. These studies demonstrated that viral reconstructions could recapitulate properties of modern variants, including immunogenicity, packaging, tropism, and cell receptor dependencies. These properties are key to the viral life cycle, and they are also important properties for viruses used as gene therapy vectors. Adeno-associated virus (AAV) vectors are highly promising for gene therapy. AAVs are non-pathogenic [15] and can transduce numerous dividing and non-dividing cell types, leading to long term expression in the latter [16]. AAV vectors have accordingly been utilized for gene therapy in various tissues, including liver, lung, brain, eye, and muscle [17, 18]. Furthermore, Glybera, the first gene therapy product approved in the European Union in 2012, employs an AAV1 vector [19]. The amino acid composition of the viral capsid, encoded by the cap gene, affects AAV tropism, cell receptor usage, and susceptibility to anti-AAV neutralizing antibodies [20]. These key properties in turn impact efficacy in therapeutic gene delivery, which is often limited by poor transduction of numerous cell types, off-target transduction, difficulties with biological transport barriers, and neutralization by pre-existing anti-AAV antibodies [18]. However, extensive engineering of the AAV capsid, via modification of the cap gene, promises to improve numerous clinically relevant properties [18]. Given the functional diversity of natural AAV serotypes, availability of numerous genetic sequences, and demonstrated clinical efficacy of recombinant vectors, AAV is an intriguing candidate for ancestral reconstruction, which could further our understanding of its evolutionary history and plasticity. Interesting questions include whether reconstructed variants exhibit higher or lower infectivity on a range of cell types, and whether they are relatively specific for particular cells – an attractive feature for many clinical applications – or are instead promiscuous, as are many extant serotypes. Finally, ancestral sequences and libraries may be useful starting materials for directed evolution studies [8, 21], especially considering that such AAVs likely gave rise to the modern serotypes with their divergent biological properties and tropism. Motivated by these questions, we conducted ancestral reconstruction of the AAV capsid. Acknowledging and taking advantage of the inherent ambiguity in reconstructing sequences containing highly divergent residues, we synthesized the inferred ancestral capsid not as a single “best guess” sequence, but rather as a large combinatorial library of candidate sequences incorporating degenerate residues at positions of low confidence. We then explored whether phenotypic selection of this ancestral sequence space using five cell lines representative of different tissues would lead to highly infectious variants, and whether these would be promiscuous – i.e. broadly infectious particles - or exhibit specific tropisms. The ancestral library was found to be fit, with packaging and transduction efficiencies that were on par with extant serotypes, and genetically selected variants were found to be broadly infectious on different cell lines. Furthermore, putative ancestral clones exhibited strong in vivo gene delivery efficiency, underscoring the potential of such vectors for gene therapy applications.

RESULTS

Ancestral AAV sequence reconstruction

The goals of ancestral sequence reconstruction are, given a set of extant DNA sequences, to generate a phylogenetic tree and sequence alignment that relates these sequences, and to infer the sequences of ancestral variants at different ancestral nodes. Accurate sequence reconstruction is challenging due to ambiguity in the evolutionary relationships between extant variants (which affects the phylogenetic tree-building step) as well as sequence divergence at highly variable residues (which affects the sequence alignment and ancestral reconstruction steps). As a starting point, we reconstructed the phylogeny of human, macaque and rhesus monkey AAV cap sequences retrieved from Genbank (n=52) [22]. We used MrBayes [23], which conducts Bayesian Markov chain Monte Carlo (MCMC) simulation of tree space, to estimate the confidence values at each internal node (shown in curly braces in Figs. 1a and S1). This approach generated a phylogenetic tree relating extant sequences, which is essentially a hypothesis concerning the evolutionary history of AAVs. Each branch on this tree depicts the evolutionary direction that diversified the sequences, and each internal node represents a ‘splitting’ event where two AAV lineages diverged. With many ancestral nodes to choose from (full tree in Fig. S1), we selected node 27 (Fig. 1a) based on its high confidence value (1.00), which minimizes one potential source of uncertainty (at the level of phylogenetic relationships between entire sequences) and thus improves confidence in the finer-grained downstream reconstruction of individual amino acids’ evolutionary histories. This node is also the ancestor of serotypes with demonstrated clinical efficacy (AAV1, Glybera), biomedical interest (AAV6 [24]), or relative resistance to neutralizing antibodies (AAV7 [25]).
Figure 1

Ancestral AAV sequence reconstruction. a) A phylogenetic tree relating a subset of extant AAV variants at node 27. Curly braced numbers indicate clade posterior probabilities [23]. The phylogenetic tree graphic was generated in Dendroscope [62]. b) A multiple sequence alignment of a subset of AAV variants with column-specific confidence annotated along the top with single digits. Confidence ranges from above 0.9 (shaded grey) to 0.3–0.4 (shaded white). c) A distribution of predicted ancestral amino acid sequences for node 27, residues 451–481. The character height of each amino acid is proportional to its posterior probability.

We then used the Markov chain Monte Carlo alignment sampler HandAlign [26] to explore alignment space and predict the ancestral sequence of the most likely alignment at node 27. HandAlign generates a multiple sequence alignment, arranging the sequences of different variants in aligned ‘columns’ such that residues grouped in a column share a common ancestor (Fig. 1b). It also performs the ancestral reconstruction simultaneously with the alignment, and accounts for sequence insertions, deletions, and character substitutions. Figure 1c shows the distribution of predicted amino acids as a sequence logo, with character heights proportional to posterior probabilities. The majority of amino acid positions could be predicted with high confidence (≥ 0.90) and thus represented residues highly conserved during evolution. However, as is common in ancestral reconstruction, other positions were less evolutionarily conserved and could thus be predicted with lower probabilities. A DNA library was designed based on these results, and residues above the 0.90 confidence value were fixed, whereas those below this confidence level were varied by introducing the two or three most likely amino acids (above a threshold value of 0.08), such that the fraction of library members containing each amino acid at a given position reflects the probability of that amino acid appearing in the sequence reconstructions. The locations, identities, and synthesis frequencies of the 32 variable residues are presented in Table 1, and the most likely full ancestral cap amino acid sequence is shown in Fig. S2 and aligned with extant serotypes in Fig. S3. The ancestral cap library was synthesized (GeneArt, Life Technologies), and analysis of 61 sequenced clones from this library revealed that the amino acid frequencies at variable positions were not significantly different from the theoretical probabilities from the library (P < 0.001, see Materials and Methods), highlighting the correctness of the library synthesis.
Table 1

Variable positions synthesized in ancestral AAV library.

PositionResidue 1% Freq.Residue 2% Freq.Residue 3% Freq.
264T55Q25A20
266A63S37
268S70A30
448S71A29
459T69N31
460R63Q20K17
467A75G25
470S85A15
471N60T32S8
474A83E16
495S75T25
516D91N9
533D86E14
547Q81E11T8
551A50K50
555T54A46
557E86D14
561M62L28I10
563S80N19
577E50Q50
583S86D8A6
593A45Q39V16
596A81T19
661A71E19T10
662V53T26A22
664T66S34
665P64A26Q10
710T87A13
717N69D31
718N60S40
719E79D21
723S68T32

Phenotypic selection of ancestral AAV library

Given the inherent probabilistic uncertainty of ancestral reconstruction, rather than investigating many possible, candidate ancestral sequences one by one, we selected the library as a whole for functional clones. Specifically, after validating the initial synthesized distribution of amino acids at the 32 variable positions, we probed how those positions would change when subjected to selective pressure for packaging and infectivity, which are key factors for successful viral replicative fitness during the natural evolution of AAV. The ancestral library was cloned into an AAV packaging plasmid, and viral particles were produced by transfection into human embryonic kidney 293T cells as previously described [27]. The viral genomic titer was comparable to levels obtained when packaging libraries based on extant AAV serotypes (data not shown), indicating that the ancestral library can support robust packaging titers. The amino acid distribution at variable positions was only slightly altered by one round of packaging, and we hypothesized that additional selective pressure for infectivity could reveal more about the significance of each variable position. We chose five cell lines representative of different tissues to conduct rounds of selection: C2C12 mouse myoblast cells, IB3-1 human lung epithelial cells, B16-F10 mouse skin melanoma cells, human embryonic kidney 293T cells, and L0 human glioblastoma (GBM) tumor-initiating cells. Briefly, for each round 1 × 105 of each cell type were infected with iodixanol-purified, replication-competent AAV libraries at an initial genomic multiplicity of infection (MOI) of 5,000, and successful virions were recovered by superinfecting the cells with adenovirus type 5 two days later. Six rounds of selection were conducted on each cell line, resulting in five independently selected pools, and the stringency of selection was increased during subsequent rounds by decreasing the genomic MOI (Table S1). To assess the progression of selection at each variable position, clones were sequenced (n = 14) from each library after initial viral packaging (hereafter referred to as post-packaging), after three rounds of selection, and after six rounds of selection. This analysis revealed a range of outcomes for each variable position across the different cell lines. Figure 2 shows the positions of the variable amino acids mapped onto the crystal structure of AAV1 (the most homologous serotype with a solved structure), and Figure 3 depicts the dominant amino acid at each of these positions for each selected pool as a heat map, with darker shades representing higher convergence. As expected, selection for infection of cell lines led to increased convergence, and Figure 4 shows the percentage change in amino acid frequency in rounds 6 relative to post-packaging (increases in blue, decreases in red, and change of amino acid in yellow). Some amino acid positions approached full convergence to the same residue across all cell lines; other positions were divergent, or even acquired specific identities unique to only one cell line. The majority of residues unique to one cell line are located on the surface of the capsid, and they could for example play a role in altering the affinity of capsid interactions with cell surface receptors.
Figure 2

Variable residues mapped to the crystal structure of homologous AAV1, the closest AAV relative with an available structure. A three-dimensional molecular model of the AAV1 capsid was generated in PyMOL [63]. An amino acid alignment of the ancestral AAV sequence with AAV1 was used to map the highlighted residues to the a) individual asymmetric unit and b) full biological assembly.

Figure 3

Dominant amino acids at variable positions after six rounds of selection. A heat map was generated based on the frequency of the most common amino acid at each position in the different libraries. The dominant amino acid and frequency at each position were determined based on sequencing results from individual clones n = 61 (synthesized library), n = 23 (post-packaging), and n=14 (for each ancestral library after selection on respective cell lines).

Figure 4

Change in amino acid frequency at variable positions after six rounds of selection. The percent change in amino acid frequency between the post-packaging library and evolved libraries after six rounds of selection on each cell line was calculated. If the identity of the dominant amino acid did not change, the increase (blue) or decrease (red) in frequency is displayed. If selection resulted in a change in amino acid identity at that position, the new amino acid and frequency is shown (yellow).

To determine whether the changes in amino acid frequencies imparted by phenotypic selection were statistically significantly different from the initial synthesized distribution, we conducted Bayesian Dirichlet-multinomial model comparison tests (as described in Materials and Methods) to calculate the posterior probability that the two sets of variable amino acids come from different distributions. This analysis identified several amino acid positions that are significantly different after selection (P < 0.05, shown in green), and many more that are moderately different (P < 0.5, shown in yellow) (Fig. 5).
Figure 5

Identification of key variable residues by Bayesian Dirichlet-multinomial model comparison tests. A comparison of the two sets of variable amino acids was conducted to identify positions that changed significantly during selection. The posterior probability that the two sets of amino acids come from two different probability distributions was calculated assuming probability parameters that are Dirichlet-distributed with low pseudocounts to reflect sparse observed sequences. Results colored green indicate a >95% chance that the sets came from different distributions, yellow a >50% chance, red a >5% chance, and no color a <5% chance. Synth, synthesized library; PP, post-packaging; R3, round three of selection; R6, round six of selection.

Transduction efficiency of evolved ancestral libraries

Phenotypic selection could conceivably lead to specific infectivity of a given cell line or may alternatively increase overall infectivity but in a promiscuous manner across all cell types. We investigated these possibilities by evaluating the transduction efficiency of evolved ancestral libraries on the cell line panel. The degree of convergence for each amino acid position after six rounds of selection is shown in Figure 3. Selection did not drive full convergence to a single sequence, potentially due to the presence of neutral positions that conferred no selective advantage. Therefore, rather than packaging individual clones, the libraries selected on each cell line were each packaged as a pool of recombinant virus (at a low ratio of AAV helper plasmid per producer cell to minimize mosaic capsids), resulting in five distinct round 6 ancestral libraries; results thus represent overall or average library infectivities. High titer, iodixanol-purified recombinant AAV (rAAV) encoding the green fluorescent protein (GFP) was produced for the ancestral libraries, as well as for natural serotypes AAV1-6, 8, and 9 for comparison of transduction efficiency and tropism. Infection at a genomic MOI of 2,000 (or 32,000 for C2C12s) revealed a range of properties (Fig. 6). Functionally selected ancestral libraries mediated high delivery efficiencies most comparable to AAV1 and AAV6 and generally superior to AAV4, AAV5, AAV8, and AAV9. Ancestral libraries were especially successful in infecting C2C12 and GBM cell lines relative to natural serotypes. Importantly, we observed a large increase in infectivity when comparing the synthesized vs. the round 6 ancestral libraries, suggesting phenotypic selection of advantageous amino acids at the variable positions. Interestingly, the libraries in general displayed broad infectivity across all cell lines, indicating that this reconstructed ancestral pool contains promiscuous AAVs, a property known to be advantageous for natural evolutionary adaptability [28, 29].
Figure 6

Transduction efficiency of ancestral libraries benchmarked against natural AAV serotypes. After six rounds of selection, viral genomic DNA was recovered from ancestral libraries and packaged as rAAV scCMV-GFP along with wild type AAV 1–6, 8, and 9. Cell lines were infected at a genomic multiplicity of infection (MOI) of 2,000 (293T, IB3, B16- F10, GBM) or 32,000 (C2C12). The fraction of GFP expressing cells was quantified by flow cytometry 72 hours later. Data are presented as mean ± SEM, n = 3. AL, ancestral library.

Characterization of the thermostability of candidate ancestral AAV variants

High thermostability and enhanced tolerance to mutations are also properties that could confer an evolutionary advantage to ancestral viral capsids [3, 7, 30]. We benchmarked the thermostability of AAV variants selected from our reconstructed pool against the natural serotypes AAV1, AAV2, AAV5, and AAV6 by assaying their transduction efficiency after heat treatment. Specifically, for initial analysis we chose the ancestral library selected on C2C12 cells and a representative variant from this library, C7. Virions packaged with self-complementary CMV-GFP were treated for 10 minutes at different temperatures using a thermal gradient before being cooled down to 37°C and used to infect 293T cells. We normalized the resulting fraction of GFP expressing cells after treatment at each temperature to the sample incubated at 37° (Fig. 7). Ancestral variants displayed higher thermostability than natural serotypes and showed moderate transduction levels even at the highest treatment temperature, 78°C, which ablated transduction by natural serotypes. The obtained thermostabilities confirm those previously reported for natural serotypes [31], which showed that AAV5 is more stable than AAV1 and that AAV2 is less stable than both. Enhanced thermostability of the ancestral variants in general could enable a higher tolerance to destabilizing mutations, and consequently a higher evolutionary adaptability.
Figure 7

Candidate ancestral variants display higher thermostability than natural serotypes. The thermostability of the ancestral library selected on C2C12 cells and of the representative ancestral variant C7 was characterized and compared to that of natural serotypes 1, 2, 5, and 6. Virions packaged with scCMV-GFP were incubated at temperatures ranging from 59.6°C to 78°C for 10 minutes before being cooled down to 37°C and used to infect 293T cells. The fraction of GFP expressing cells was quantified by flow cytometry 72 hours later. Data are presented, after being normalized to the fraction of GFP expressing cells after incubation at 37°, as mean ± SEM, n = 3.

Characterization of ancestral AAV glycan dependencies and susceptibility to neutralizing antibodies

Our in vitro transduction experiments demonstrated the broad infectivity of reconstructed variants. Given that ancestral node 27 gave rise to AAV1 and AAV6, we were interested in determining whether the candidate ancestral clones shared the same glycan dependencies, or if those evolved later. AAV1 and AAV6 utilize both alpha 2,3 and alpha 2,6 N-linked sialic acids as their primary receptor, and AAV6 has moderate affinity for heparan sulfate proteoglycans [24]. To probe heparan sulfate proteoglycan (HSPG) usage, we transduced parental CHO-K1 cells and the pgsA CHO variant line deficient in HSPG. To examine sialic acid dependence we transduced parental Pro5 CHO cells presenting glycans with both N- and O-linked sialic acids, a Lec2 CHO variant cell line deficient in all N- and O-linked sialic acids, and a Lec1 line deficient in complex and hybrid type N-glycans including sialic acids[32] (Fig. 8b). Interestingly, candidate ancestral AAVs exhibited no dependence on HSPG or N- and O-linked sialic acids (Fig. 8a). We also verified that selected individual clones exhibited similar transduction behavior as the evolved libraries (Supplementary Fig. S6).
Figure 8

Glycan dependency of candidate ancestral AAV variants. a) After six rounds of selection, the transduction efficiency of ancestral libraries carrying scCMV-GFP was quantified by flow cytometry 72 hours after infection at a genomic MOI of 2,000 (Pro5, Lec1, Lec2) and 50,000 (CHO-K1, pgsA). The CHO-K1/pgsA comparison examines heparan sulfate proteoglycan dependence, while Pro5/Lec1 and Pro5/Lec2 probe sialic acid dependence. Data are presented as mean ± SEM, n = 3. b) Glycans present on CHO glycosylation mutants. AL, ancestral library.

We next examined whether ancestral AAVs were neutralized by antibodies against a broad range of contemporary AAVs, in particular human intravenous immunoglobulin (IVIG) that contains polyclonal antibodies against extant serotypes due to natural exposure across the human population. In vitro incubation with IVIG strongly reduced transduction of ancestral libraries and the AAV1 control (Supplementary Fig. S7), indicating that this ancestral pool is not highly serologically distinct from its progeny. Additional capsid engineering may be necessary to address this clinically relevant problem.

Characterization of ancestral variants in vivo in mouse gastrocnemius muscle

Upon finding that the ancestral AAV libraries exhibited efficiencies comparable to or in some cases higher than extant serotypes on a panel of cell lines from representative tissues, we next probed in vivo infectivity. Based on the high transduction efficiencies of candidate ancestral AAVs on the most nonpermissive cell line (C2C12 mouse myoblasts), we chose to evaluate in vivo transduction of mouse gastrocnemius muscle. In particular, individual ancestral variant clones from the selected viral pools (Table S2) that were closest to the consensus sequences of libraries evolved on C2C12 (clones C4, C7) and glioblastoma cells (clone G4) were chosen, based on the efficiency of these two libraries in transducing C2C12 myoblasts in vitro. In addition, these variants were benchmarked against AAV1, given its clinical efficacy in muscle-targeted gene therapy [33]. We generated recombinant AAV vectors expressing firefly luciferase under the control of the hybrid CAG (CMV early enhancer/chicken β-actin/splice acceptor of β-globin gene) promoter. A volume of 30 μl DNase-resistant genomic particles (5 × 1010 vg) was injected into each gastrocnemius muscle of BALB/c mice, and after six weeks, mice were sacrificed and tissue luciferase activities analyzed (Fig. 9). Ancestral reconstruction variants yielded 19–31 fold higher transgene expression than AAV1 in gastrocnemius muscle, with variant C7 yielding the highest expression. Interestingly, variant C7 was the most abundant sequence (71%) in the round 6 ancestral library selected on C2C12 cells. These results demonstrate that candidate ancestral AAVs also exhibit high infectivity in vivo, and even offer the potential to exceed the performance of the best contemporary natural serotypes in gene therapy applications.
Figure 9

Evaluation of gastrocnemius muscle transduction. Luciferase activity measured in relative light units (RLU) per mg protein was determined in gastrocnemius tissue homogenate 48 days after intramuscular administration of 5 × 1010 viral particles of ancestral clones C4, C7, G4, or AAV1 in adult mice. Controls injected with phosphate-buffered saline displayed no activity (data not shown). *, statistical difference of P < 0.05 by two-tailed Student’s t-test.

DISCUSSION

Ancestral sequence reconstruction offers unique opportunities to study fundamental biological questions of virus evolution and fitness, including the characterization of ancestral sequence space relative to extant serotypes, the importance of mutational tolerance or evolutionary conservation, and the comparative advantages of promiscuous versus selective tropism. The primary challenge of ancestral reconstruction is to accurately infer an ancestral sequence despite uncertainty arising from sequence divergence within hypervariable regions of extant variants. We have combined sophisticated computational and library synthesis approaches to address this uncertainty and thereby generate a functional ancestral AAV library. We then studied the biological properties of this library to learn more about the evolutionary behavior of AAV and the gene therapy potential of reconstructed ancestral variants. The posterior probability that an AAV ancestral sequence accurately reflects the actual ancestral virus is the product of the probabilities that each of the amino acids in the capsid protein is correctly predicted. At positions of high evolutionary convergence the posterior probability nears 1.0, yet there are many sites that diverged during evolution and thus cannot be predicted with such high confidence. Our library synthesis approach addressed this concern by introducing the two or three most likely amino acids at the 32 lowest confidence positions in the AAV cap protein. Interestingly, the majority of positions varied in our ancestral library have not been previously described in studies of the functional importance of single mutations to the AAV capsid [34, 35]. Unlike previous ancestral reconstructions of enzymes and other proteins, which utilized single best guess ancestral sequences [11, 36], or which sampled only a small fraction of library variants due to the low throughput of enzymatic assays [8, 37], our massively parallel phenotypic selection enabled screening of a large library (and is limited only by the transformation efficiency of electrocompetent bacteria). The selection strategy applied pressure for efficient packaging and transduction of cell types representing a variety of tissues. By comparing the frequencies of amino acids selected at variable positions to the theoretical ancestral sequence prediction, one can gain insights into both the accuracy of our sequence reconstruction as well as the functional role of each residue in AAV biology. Comparison of sequences from the synthesized library with those recovered after initial library packaging suggested that one round of packaging imposed no statistically significant changes on the amino acid distribution at variable positions (except for a low 0.076 probability change in preference from a threonine to an alanine at residue 264). However, with selection for infectivity on a range of cell types specific positions begin to diverge, and differences between round six and post-packaging sequences were more significant than between round three and post-packaging sequences, likely because six rounds enabled a larger cumulative effect of positive selection. Genetic drift may also play a role, but is unlikely to be the main driving force given that the time to fixation by genetic drift increases with population size [38], and a large number of virions (>108) was used in each sequential round of selection. By comparing the ancestral libraries after six rounds of selection with the post-packaging library, we identified several trends in the level of convergence of the amino acid residues, suggesting these positions may have potential roles in modulating properties like capsid stability and infectivity. Some amino acid positions approached full convergence to the same residue across all cell lines (268, 460, 474, 516, 547, 583, 665, 710, 717, 719); these positions are distributed throughout the capsid and may for example be important for core viral functions such as capsid stability, uncoating, or endosomal escape. Others showed more divergent outcomes across different cell lines (264, 467, 593, 664, 723) and may be neutral with respect to overall fitness. Finally, some positions (459, 470, 471, 533, 555, 596, 662, 718) acquired identities specific to a given cell line and may confer an infectious advantage on each respective cell line. Positions 264 and 459 showed the strongest evidence of change due to selection (P < 0.05). Position 459 is prominently exposed on loop IV of the AAV capsid surface. Position 264 is positioned on loop I of the capsid and has been identified as a key determinant of muscle tropism in the rationally engineered variant AAV2.5 [39]. There is also suggestive evidence of selection at positions 266, 470, 533, 551, 557, 577, 596, and 723 in various libraries (P < 0.5). Position 533 has been previously described as a key contributor to infectivity and glycan dependence in our previously evolved variant ShH10, a vector differing by only four amino acids from AAV6 but exhibiting unique tropism in the retina [32]. Additionally, Lochrie et al. [40] examined several other of these positions in their thorough mutational analysis of the AAV2 serotype, though AAV2 lies in a different phylogenetic clade than ancestral node 27. The characterization of these variable positions is therefore novel, and lessons learned may inform targeted mutagenesis efforts to improve the fitness of extant variants. The assembly-activating protein (AAP), which is involved in directing capsid proteins to the nucleolus and in assembly of the viral capsid in this organelle[20], is translated from an alternative open reading frame (ORF) with a non-canonical CTG start codon present within the cap gene. This alternate ORF is also present in the ancestral reconstruction of the AAV capsid at node 27 (Fig. S2b). Three of the variable residues (positions 264, 266, and 268) are present within the AAP ORF, and the putative ancestral AAP sequence is otherwise conserved across the reconstruction. As discussed above, residue 264 is among the positions that showed strong and statistically significant changes after six rounds of packaging and infection on the cell line panel, and it is possible that both capsid and AAP may have undergone functional selective pressure during this process. The in vitro transduction results also demonstrate the importance of utilizing a library approach coupled with selection. A single or small number of best guess sequences could likely include deleterious amino acids that significantly impact fitness. Indeed, our data show that the synthetized ancestral AAV library evaluated prior to rounds of selection reproducibly exhibited dramatically lower infectivity than libraries subjected to selective pressure. This is not surprising, given that numerous directed evolution studies demonstrate that even single point mutations can significantly alter enzyme activity or virus infectivity by several orders of magnitude [34, 41–43]. Interestingly, despite differences in amino acid composition at variable positions, the ancestral libraries selected for infecting a number of individual cell lines subsequently demonstrated broad tropism across all of these cell lines. Such promiscuity may have been rewarded during the natural evolution of AAVs, since the ability to replicate in different cell and tissue types enhances virus spread. In fact, most natural AAV serotypes exhibit broad tropism [44, 45], indicating that promiscuity continues to be a valued trait for natural evolution. Such broad tropism, however, has important implications for gene therapy. In cases where disease pathologies affect multiple tissues and cell types (e.g. lysosomal storage disorders), broader infectivity could be an advantageous trait. Expanded tropism may also be useful for infecting cell types refractory to infection by most AAV serotypes, or for ex vivo treatments of homogeneous cell populations where off-target infectivity is not a concern. However, in the majority of gene therapy applications it is desirable to limit transgene expression to a target tissue for several important reasons, including risks associated with off-target expression of the transgene, off-target transduction leading to higher immune presentation and reaction, and higher overall dosages needed to overcome vector dilution into multiple tissues. This is true not only when vector is delivered via routes that lead to intentional exposure to multiple tissues (e.g. intravascular delivery) but also for local injection into multiple tissues in which vector leakage into circulation can lead to widespread distribution to multiple organs. For example, biodistribution studies have shown the spread of AAV vectors to sites distant from the target tissue after injecting viral particles through the hepatic artery, intramuscularly, or into the putamen of the brain [46-48]. To address concerns with off-target transgene expression, strategies for controlling gene and protein expression including cell type specific promoters [49] and microRNA elements [50, 51] are being explored to restrict expression to target cells. These approaches are promising, but would not address immune presentation of the capsid protein. Therefore, the optimal scenario is one in which selective AAV tropism is engineered through modification of the capsid protein. Directed evolution can generate vectors capable of targeted gene delivery [32], and evolution for enhanced AAV infectivity of a given target cell in general can enable a reduction in vector dose and thereby reduce the level of off-target transduction. Ancestral variants may be promising starting points for such directed evolution efforts given their high infectivity and representation of a capsid protein sequence space that is different from and complementary to extant serotypes. High thermostability may also be an advantageous property for AAV engineering. Ancestral sequences have been correlated with increased thermostability in multiple studies [3, 7, 30], and in fact, enriching for seemingly neutral mutations that resemble an ancestral sequence has been shown to increase protein kinetic and thermodynamic stability and to improve the probability of acquiring new function mutations[52]. This work lends additional evidence of the correlation between ancestral sequences and thermostability by demonstrating that candidate ancestral AAV variants are more thermostable than contemporary serotypes. We also characterized the glycan dependencies of ancestral variants and found that previously studied AAV glycan dependencies including N- and O-linked sialic acids, heparan sulfate proteoglycans, and galactose were not utilized. It is conceivable that these dependences may have arisen more recently in the evolution along these AAV lineages. In addition, we found that ancestral libraries were as susceptible to neutralizing antibodies as AAV1, suggesting that this ancestral reconstruction pool exhibits immunogenic properties similar to current serotypes. Multiple antigenic regions have been mapped on natural AAV serotypes, including AAV1, AAV2, AAV5, and AAV8 [53]. Given that AAV1 is a descendant of the node 27 ancestral reconstruction, we aligned known AAV1 epitopes [54] with the ancestral reconstruction sequence. Mapped antigenic regions corresponding to AAV1 residues 496–499, 583, 588–591, and 597 were conserved in the ancestral reconstruction. Additionally, the ancestral sequence is identical to several known AAV2 antigenic regions including residues 272–281, 369–378, and 562–573 [55]. Such conserved regions may contribute to the observed susceptibility of ancestral variants to neutralizing antibodies. Interestingly, previous studies have also demonstrated cross-seroreactivity between ancestral and extant viral capsids. In particular, antiserum against extant viruses has been shown to neutralize reconstructed ancestral variants [14], and ancestral viruses can elicit neutralizing antibodies that protect against currently circulating strains, a property that has been exploited for the development of vaccine candidates [11, 12]. Neutralizing antibodies may therefore pose a significant clinical challenge for ancestral vectors. Further capsid engineering under a strong selective pressure for evading neutralizing antibodies may enable selection of combinations of mutations that promote antibody evasion [56]. For example, there are variable residues in the ancestral reconstruction that map to antigenic regions corresponding to AAV1 residues 456–459, 494, 582, and 593–595, and to antigenic regions in other serotypes [54]. Mutations in these regions could disrupt the binding of antibodies to capsid epitopes, and could potentially be combined with other mutagenesis strategies to engineer variants with enhanced antibody evasion properties [56]. Ancestral AAVs demonstrated efficient in vitro gene transfer to C2C12 mouse myoblast cells comparable to AAV1, a current gold standard for muscle transduction, yet utilized a different receptor for cell entry. This distinction may contribute to their efficient in vivo infectivity, which impressively reached 19–31 fold higher levels of expression than AAV1 in mouse gastrocnemius muscle. If the improved expression observed with ancestral reconstruction vectors is reproducible in human muscle tissue, ancestral variants will be auspicious candidates for clinical translation. In summary, our results indicate that a library of AAV variants representing sequence space around a key ancestral node is rich in broadly infectious variants with potential in gene therapy applications. We have taken initial steps in characterizing this sequence space by varying the amino acids at the lowest confidence positions identified by ancestral sequence reconstruction, followed by phenotypic selection to yield highly functional sets of amino acids at these locations. Sequence analysis of variable residues revealed a variety of outcomes ranging from highly conserved residues to more neutral positions that are pliable to change. Selected variants were promiscuous in their infectivity but showed promise as recombinant vectors in vitro and in vivo, and the putative mutational tolerance and evolvability of this library could be further harnessed in directed evolution studies to overcome gene therapy challenges such as targeted gene delivery and immune evasion.

MATERIAL AND METHODS

Ancestral reconstruction

AAV cap sequences (n=52) from Genbank [22], including those from human and non-human primate origin, were incorporated in this analysis, starting from lists of AAV sequences published in previous phylogenetic analyses [57, 58]. The MrBayes package [23] was used to perform Bayesian Markov chain Monte Carlo (MCMC) simulation of tree space and estimate the confidence values at each internal node. We then used the Markov chain Monte Carlo alignment sampler HandAlign [26] to explore alignment space and estimate regional confidence for the most likely alignment at node 27, discarding all but the sequences descended from this node. HandAlign generates a multiple sequence alignment, arranging the sequences of different variants in aligned ‘columns’ such that residues grouped in a column share a common ancestor. Each alignment column was modeled as a realization of the standard phylogenetic continuous-time Markov process of character evolution, using amino acid and empirical codon substitution rate matrices that were estimated from databases of aligned protein-coding sequence [59]. HandAlign performs the reconstruction simultaneously with the alignment, and accounts for sequence insertions, deletions, and character substitutions. The codon-level model was used to account for the possibility of synonymous substitutions with a phenotype at the DNA level; we also checked for the possibility of dual selection in overlapping reading frames (“overprinted” genes), by reconstructing both ancestral reading frames at the codon level. Neither of these subtle effects appeared significant enough to warrant prioritizing synonymous (silent, DNA-level) variants over the many non-synonymous amino acid variants.

Library construction and vector packaging

The reconstructed ancestral AAV cap sequence was synthesized (GeneArt, Life Technologies) with a library size of 5.6 × 1011, greater than the theoretical diversity of 2.5 × 1011. The library was digested with Hind III and Not I, and ligated into the replication competent AAV packaging plasmid pSub2. The resulting ligation reaction was electroporated into E. coli for plasmid production and purification. Replication competent AAV was then packaged and purified by iodixanol density centrifugation as previously described [56, 60]. DNase-resistant genomic titers were obtained via quantitative real time PCR using a Biorad iCycler (Bio-Rad, Hercules, CA) and Taqman probe (Biosearch Technologies, Novato, CA) [60].

Cell culture

C2C12 mouse myoblast, B16-F10 skin melanoma cells, CHO-K1, pgsA, Pro5, Lec1, and Lec2 cells were obtained from the Tissue Culture Facility at the University of California, Berkeley. IB3-1 lung epithelial and human embryonic kidney 293T cells were obtained from American Type Culture Collection (Manassas, VA). Unless otherwise noted all cell lines were cultured in Dulbecco’s Modified Eagle’s medium (DMEM, Gibco) at 37 °C and 5% CO2. L0 human glioblastoma tumor initiating cells were kindly provided by Dr. Brent Reynolds (University of Florida, Gainesville), and propagated in neurosphere assay growth conditions [61] with serum-free media (Neurocult NS-A Proliferation kit, Stem Cell Technologies) that contained epidermal growth factor (EGF, 20 ng/ml, R&D), basic fibroblast growth factor (bFGF, 10 ng/ml, R&D), and heparin (0.2% diluted in phosphate buffered saline, Sigma). IB3-1 cells were cultured in DMEM/F-12 (1:1) (Invitrogen, Carlsbad, CA). CHO-K1 and pgsA cells were cultured in F-12K medium (ATCC), and Pro5, Lec1, and Lec2 cells were cultured in MEM α nucleosides (Gibco). Except for GBM culture, all media were supplemented with 10% fetal bovine serum (Invitrogen) and 1% penicillin/streptomycin (Invitrogen).

Library selection and evolution

All cell lines were seeded in 6-well tissue culture plates at a density of 1 × 105 cells per well. One day after seeding, cells were infected with replication competent AAV libraries. After 24 hours of exposure, cells were superinfected with adenovirus serotype 5 (Ad5). Approximately 48 hours later, cytopathic effect was observed, and virions were harvested by three freeze/thaw steps followed by treatment with Benzonase nuclease (1 unit/mL) (Sigma-Aldrich) at 37 °C for 30 minutes. Viral lysates were then incubated at 56°C for 30 minutes to inactivate Ad5. The viral genomic titer was determined as described above. To analyze cap sequences, AAV viral genomes were extracted after packaging and rounds 3 and 6 of selection, amplified by PCR, and sequenced at the UC Berkeley DNA Sequencing Facility.

Statistical analysis of variable positions in evolved ancestral libraries

A comparison of the two sets of amino acids at each variable amino acid position was conducted to identify variable positions whose library proportions had changed significantly during selection. The posterior probability that the two sets of variable amino acids come from two different probability distributions was calculated assuming probability parameters that are Dirichlet-distributed with low pseudocounts to reflect sparse observed counts. For comparison of the synthesized and theoretical library, post-synthesis amino acid frequencies distributed via a Dirichlet-multinomial were compared with the theoretical probabilities from the library distributed by a multinomial.

In vitro transduction analysis

After six rounds of selection, ancestral library viral genomes were cloned into the pXX2 recombinant AAV packaging plasmid. To benchmark the infectivity of rAAV ancestral libraries against a panel of natural AAV serotypes, vectors were packaged with a self-complementary CMV-GFP cassette using the transient transfection method previously described [56, 60]. Cell lines (293T, C2C12, IB3-1, B16-F10, CHO-K1, pgsA, Pro5, Lec1, and Lec2) were seeded in 96-well plates at a density of 15,000 cells per well. One day after seeding, cells were infected with rAAV at a genomic MOI of 2,000 (293T, C2C12, IB3-1, B16-F10, GBM), 10,000 (Pro5, Lec1, Lec2), 32,000 (C2C12), or 50,000 (CHO-K1, pgsA) (n = 3). To analyze antibody evasion properties, ancestral rAAV libraries were incubated at 37°C for 1 hour with serial dilutions of heat inactivated IVIG (Gammagard), and then used to infect HEK293T cells at a genomic MOI of 2000 (n = 3). To characterize thermostability, virions packaged with self-complementary CMV-GFP were diluted with DMEM supplemented with 2% FBS and incubated at temperatures ranging from 59.6°C to 78°C for 10 minutes in a thermocycler (Bio-Rad) before being cooled down to 37°C and used to infect 293T cells at genomic MOIs ranging from 1,500–16,000; MOIs were adjusted to ensure an adequate number of GFP-positive cells for analysis. For all studies, the fraction of GFP-expressing cells 72 hours post-infection was quantified with a Guava EasyCyte 6HT flow cytometer (EMD/Millipore) (UC Berkeley Stem Cell Center, Berkeley, CA).

In vivo animal imaging and quantification of luciferase expression

High-titer rAAV CAG-Luciferase vectors were purified by iodixanol gradient and then concentrated and exchanged into PBS using Amicon Ultra-15 centrifugal filter units (Millipore). To study skeletal muscle transduction 5 × 1010 rAAV-Luc DNase-resistant genomic particles were injected in a volume of 30 μl into each gastrocnemius muscle of 7-week-old female BALB/c mice (Jackson Laboratories, n = 3) as previously described [56]. Six weeks after injection, animals were sacrificed, and gastrocnemius muscle was harvested and frozen. Luciferase activity was determined and normalized to total protein as previously described [60]. All animal procedures were approved by the Office of Laboratory Animal Care at the University of California, Berkeley and conducted in accordance with NIH guidelines on laboratory animal care.
  62 in total

1.  Feasibility of reconstructed ancestral H5N1 influenza viruses for cross-clade protective vaccine development.

Authors:  Mariette F Ducatez; Justin Bahl; Yolanda Griffin; Evelyn Stigger-Rosser; John Franks; Subrata Barman; Dhanasekaran Vijaykrishna; Ashley Webb; Yi Guan; Robert G Webster; Gavin J D Smith; Richard J Webby
Journal:  Proc Natl Acad Sci U S A       Date:  2010-12-20       Impact factor: 11.205

2.  Designing thermostable proteins: ancestral mutants of 3-isopropylmalate dehydrogenase designed by using a phylogenetic tree.

Authors:  Keiko Watanabe; Takatoshi Ohkuri; Shin-ichi Yokobori; Akihiko Yamagishi
Journal:  J Mol Biol       Date:  2005-11-08       Impact factor: 5.469

3.  Crystal structure of an ancient protein: evolution by conformational epistasis.

Authors:  Eric A Ortlund; Jamie T Bridgham; Matthew R Redinbo; Joseph W Thornton
Journal:  Science       Date:  2007-08-16       Impact factor: 47.728

4.  Capsid antibodies to different adeno-associated virus serotypes bind common regions.

Authors:  Brittney L Gurda; Michael A DiMattia; Edward B Miller; Antonette Bennett; Robert McKenna; Wendy S Weichert; Christian D Nelson; Wei-jun Chen; Nicholas Muzyczka; Norman H Olson; Robert S Sinkovits; John A Chiorini; Sergei Zolotutkhin; Olga G Kozyreva; R Jude Samulski; Timothy S Baker; Colin R Parrish; Mavis Agbandje-McKenna
Journal:  J Virol       Date:  2013-06-12       Impact factor: 5.103

5.  microRNA122-regulated transgene expression increases specificity of cardiac gene transfer upon intravenous delivery of AAV9 vectors.

Authors:  A Geisler; A Jungmann; J Kurreck; W Poller; H A Katus; R Vetter; H Fechner; O J Müller
Journal:  Gene Ther       Date:  2010-11-04       Impact factor: 5.250

6.  Safety and efficacy of regional intravenous (r.i.) versus intramuscular (i.m.) delivery of rAAV1 and rAAV8 to nonhuman primate skeletal muscle.

Authors:  Alice Toromanoff; Yan Chérel; Mickaël Guilbaud; Magalie Penaud-Budloo; Richard O Snyder; Mark E Haskins; Jack-Yves Deschamps; Lydie Guigand; Guillaume Podevin; Valder R Arruda; Katherine A High; Hansell H Stedman; Fabienne Rolling; Ignacio Anegon; Philippe Moullier; Caroline Le Guiner
Journal:  Mol Ther       Date:  2008-05-06       Impact factor: 11.454

7.  The ribonuclease from an extinct bovid ruminant.

Authors:  J Stackhouse; S R Presnell; G M McGeehan; K P Nambiar; S A Benner
Journal:  FEBS Lett       Date:  1990-03-12       Impact factor: 4.124

8.  Efficacy and long-term safety of alipogene tiparvovec (AAV1-LPLS447X) gene therapy for lipoprotein lipase deficiency: an open-label trial.

Authors:  D Gaudet; J Méthot; S Déry; D Brisson; C Essiembre; G Tremblay; K Tremblay; J de Wal; J Twisk; N van den Bulk; V Sier-Ferreira; S van Deventer
Journal:  Gene Ther       Date:  2012-06-21       Impact factor: 5.250

9.  Recombination and population mosaic of a multifunctional viral gene, adeno-associated virus cap.

Authors:  Yasuhiro Takeuchi; Richard Myers; Olivier Danos
Journal:  PLoS One       Date:  2008-02-20       Impact factor: 3.240

10.  GenBank.

Authors:  Dennis A Benson; Karen Clark; Ilene Karsch-Mizrachi; David J Lipman; James Ostell; Eric W Sayers
Journal:  Nucleic Acids Res       Date:  2013-11-11       Impact factor: 16.971

View more
  31 in total

Review 1.  E Pluribus Unum: 50 Years of Research, Millions of Viruses, and One Goal--Tailored Acceleration of AAV Evolution.

Authors:  Dirk Grimm; Sergei Zolotukhin
Journal:  Mol Ther       Date:  2015-09-21       Impact factor: 11.454

2.  AAV8-vectored suprachoroidal gene transfer produces widespread ocular transgene expression.

Authors:  Kun Ding; Jikui Shen; Zibran Hafiz; Sean F Hackett; Raquel Lima E Silva; Mahmood Khan; Valeria E Lorenc; Daiqin Chen; Rishi Chadha; Minie Zhang; Sherri Van Everen; Nicholas Buss; Michele Fiscella; Olivier Danos; Peter A Campochiaro
Journal:  J Clin Invest       Date:  2019-08-13       Impact factor: 14.808

Review 3.  Toward the Optical Cochlear Implant.

Authors:  Tobias Dombrowski; Vladan Rankovic; Tobias Moser
Journal:  Cold Spring Harb Perspect Med       Date:  2019-08-01       Impact factor: 6.915

4.  Viral infection: A key host receptor for AAV.

Authors:  Sabrina Sun; David V Schaffer
Journal:  Nat Microbiol       Date:  2016-02-01       Impact factor: 17.745

5.  In vivo-directed evolution of adeno-associated virus in the primate retina.

Authors:  Leah C Byrne; Timothy P Day; Meike Visel; Jennifer A Strazzeri; Cécile Fortuny; Deniz Dalkara; William H Merigan; David V Schaffer; John G Flannery
Journal:  JCI Insight       Date:  2020-05-21

Review 6.  Physical, chemical, and synthetic virology: Reprogramming viruses as controllable nanodevices.

Authors:  Maria Yanqing Chen; Susan S Butler; Weitong Chen; Junghae Suh
Journal:  Wiley Interdiscip Rev Nanomed Nanobiotechnol       Date:  2018-11-08

7.  Recombinant Adeno-Associated Virus Gene Therapy in Light of Luxturna (and Zolgensma and Glybera): Where Are We, and How Did We Get Here?

Authors:  Allison M Keeler; Terence R Flotte
Journal:  Annu Rev Virol       Date:  2019-07-05       Impact factor: 10.431

Review 8.  Evaluation of Dose and Safety of AAV7m8 and AAV8BP2 in the Non-Human Primate Retina.

Authors:  Pavitra S Ramachandran; Vivian Lee; Zhangyong Wei; Ji Yun Song; Giulia Casal; Therese Cronin; Keirnan Willett; Rachel Huckfeldt; Jessica I W Morgan; Tomas S Aleman; Albert M Maguire; Jean Bennett
Journal:  Hum Gene Ther       Date:  2016-10-17       Impact factor: 5.695

Review 9.  Systemic delivery of adeno-associated viral vectors.

Authors:  Dongsheng Duan
Journal:  Curr Opin Virol       Date:  2016-07-25       Impact factor: 7.090

Review 10.  Adeno-associated virus (AAV) vectors in cancer gene therapy.

Authors:  Jorge L Santiago-Ortiz; David V Schaffer
Journal:  J Control Release       Date:  2016-01-12       Impact factor: 9.776

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.