Manufacturing of recombinant adeno-associated virus (rAAV) viral vectors remains challenging, with low yields and low full:empty capsid ratios in the harvest. To elucidate the dynamics of recombinant viral production, we develop a mechanistic model for the synthesis of rAAV viral vectors by triple plasmid transfection based on the underlying biological processes derived from wild-type AAV. The model covers major steps starting from exogenous DNA delivery to the reaction cascade that forms viral proteins and DNA, which subsequently result in filled capsids, and the complex functions of the Rep protein as a regulator of the packaging plasmid gene expression and a catalyst for viral DNA packaging. We estimate kinetic parameters using dynamic data from literature and in-house triple transient transfection experiments. Model predictions of productivity changes as a result of the varied input plasmid ratio are benchmarked against transfection data from the literature. Sensitivity analysis suggests that (1) the poorly coordinated timeline of capsid synthesis and viral DNA replication results in a low ratio of full virions in harvest, and (2) repressive function of the Rep protein could be impeding capsid production at a later phase. The analyses from the mathematical model provide testable hypotheses for evaluation and reveal potential process bottlenecks that can be investigated.
Manufacturing of recombinant adeno-associated virus (rAAV) viral vectors remains challenging, with low yields and low full:empty capsid ratios in the harvest. To elucidate the dynamics of recombinant viral production, we develop a mechanistic model for the synthesis of rAAV viral vectors by triple plasmid transfection based on the underlying biological processes derived from wild-type AAV. The model covers major steps starting from exogenous DNA delivery to the reaction cascade that forms viral proteins and DNA, which subsequently result in filled capsids, and the complex functions of the Rep protein as a regulator of the packaging plasmid gene expression and a catalyst for viral DNA packaging. We estimate kinetic parameters using dynamic data from literature and in-house triple transient transfection experiments. Model predictions of productivity changes as a result of the varied input plasmid ratio are benchmarked against transfection data from the literature. Sensitivity analysis suggests that (1) the poorly coordinated timeline of capsid synthesis and viral DNA replication results in a low ratio of full virions in harvest, and (2) repressive function of the Rep protein could be impeding capsid production at a later phase. The analyses from the mathematical model provide testable hypotheses for evaluation and reveal potential process bottlenecks that can be investigated.
Recombinant adeno-associated virus (rAAV) is the replication-incompetent form of the wild-type parvovirus AAV and is employed for in vivo gene-therapy treatments. Its reduced toxicity, robust and long-term transgene expression, and ability to transduce both dividing and non-dividing cells as well as target a wide range of tissues has made rAAV the most widely used viral vector., Increasing demand for gene-therapy products, with 359 in vivo therapies in clinical trials in the first half of 2020 and a predicted 10 to 20 approved cell and gene-therapy treatments by 2025, has created a need for manufacturing large quantities of rAAVs., However, current state-of-the-art viral vector manufacturing methods still fall short of meeting current and future demands.rAAV viral vectors are manufactured by providing mammalian cells or insect cells with genetic elements that express components for rAAV production via infection, transfection, or genetic engineering. In general, the crude harvest of rAAV contains only a small fraction of capsids that contain a therapeutic element, from 5% to 30%,5, 6, 7 contributing to low viral vector yield and high cost because of the extra purification steps to remove empty particles. Transient transfection of mammalian cells with multiple plasmids is the first and remains the most commonly used method to produce rAAVs. The flexibility and speed of transient transfection are advantageous in early-stage clinical product development compared to other methods of viral vector manufacturing, such as using a stable expressing cell line. In double transfection, the helper virus genes required for viral replication are either provided by a helper virus infection or cloned onto the same plasmid that contains the rep/cap genes, whereas in triple transfection, the helper genes are cloned into a third plasmid. Compared to double transfection, triple transfection avoids the potential for incorporation of unwanted viral genomes (vgs) in the product, thus ensuring product safety. However, triple transfection is characterized by low yield. Recent process development efforts on rAAV manufacturing have demonstrated transfection in suspension culture that can produce up to 1 × 1014 vg/L after purification.10, 11, 12, 13 At the current productivity, the total required production of the viral vector, in terms of liters of culture, to treat a population of patients for one disease can be on the order of hundreds of thousands. For instance, the prevalence of Duchenne muscular dystrophy is 19.8 per 100,000 live male births, and each whole-body therapy-treatment requirement of at least 1 × 1015 vg/patient., It follows that the total production needed to treat the whole population for this disorder will be approximately 1 × 105 L of culture per year, with 140 million births per year worldwide. Such demands for viral vectors prompt the need to optimize the production process and increase productivity. A quantitative analysis of the system via a mechanistic, mathematical model will be useful to organize and exploit information obtained from existing data, gain understanding of process dynamics, and predict responses to various process inputs. This approach to facilitate process development can avoid the high cost of experimentation compared to empirical optimization.To our knowledge, there is currently no mechanistic model that captures the triple-transfection process as a whole, starting from the delivery of plasmid to the synthesis and final formation of rAAV particles. To this end, we construct a mechanistic, single-cell, triple-transfection model based on AAV replication biology that includes gene delivery, virion replication, Rep protein and viral protein (VP) synthesis, capsid synthesis, and genome encapsidation (Figures 1 and 2). Kinetic parameters are identified using data from in-house transfection and literature experiments. The model predictions to varied plasmid ratio inputs are benchmarked against published literature results. Sensitivity analysis and dynamics of intermediate components reveal a possible explanation to the low ratio of full-to-empty capsid in crude harvest. Transfection strategies are discussed from an operational standpoint that have the potential to overcome the identified bottlenecks. Lastly, model applications in process development and future research for model improvements are presented.
Figure 1
PEI-mediated plasmid-delivery schematic for triple transfection
(A) Triple transfection. (B) Model of plasmid delivery. Non-viral gene-delivery kinetics are assumed to be similar across all three different plasmids. The cellular uptake parameter (kuptake) describes the combined process of the extracellular complex (ExComplex) binding to the cell membrane followed by its internalization into the cytosol in the form of endosomal complexes (EndosComplex). Endosomal escape (kescape) then follows and releases the complexes into the cytosol (CytoComplex).PEI complexes in the cytosol can either migrate into the nucleus before unpacking to release plasmids, or they can unpack in the cytosol; all of the steps required to deliver the plasmids from inside of the cytosol complexes to inside of the nucleus are described by an effective kinetic event (knuclear_entry) , and the final product is naked plasmid in the nucleus (NuclearPlasmid). In addition, the degradation rates of all plasmid-equivalent species in the cytosol are described by one single parameter (kplasmid_degrade).This framework was adapted from previous non-viral gene-delivery model, and simplified to ensure model identifiability/observability with available measurement data.
Figure 2
Viral vector synthesis network
Helper plasmid (pHelper) activates expression of the rep/cap gene on the packaging plasmid (pPackaging) and synthesis (kCap_syn and kRep_syn) of viral protein (VP) and Rep protein (RepProtein). Capsid proteins are assembled (kassembly) into empty capsids in the nucleus (EmptyCapNuc), and each capsid particle consists of 60 protein subunits. With helper functions from the helper plasmid, the Rep protein replicates (kDNA_rep) viral DNA (vDNA) from the transgene vector plasmid (pVector). Rep proteins dock on empty capsids (kRep_bind_capsid) to form intermediate complexes (CapRepcomplex), which then interact with viral DNA and encapsidate them inside capsids (kDNA_pack) at a 1:1 ratio to form full virions inside the nucleus (RepRCcomplex). Regardless of their contents (empty or full), capsids can be secreted out of the nucleus (ksecrete) into the cytosol (EmptyCapCyto and FullCapCyto). Rep protein binding (kRep_bind_plasmid) forming bounded plasmids (RepRCcomplex) negatively regulates expression of the rep/cap gene on the packaging plasmid. Degradation of proteins (kRep_protein_degrade and kVP_degrade) is possible during the process. Dotted lines imply that reactants do not get consumed in the reaction.
PEI-mediated plasmid-delivery schematic for triple transfection(A) Triple transfection. (B) Model of plasmid delivery. Non-viral gene-delivery kinetics are assumed to be similar across all three different plasmids. The cellular uptake parameter (kuptake) describes the combined process of the extracellular complex (ExComplex) binding to the cell membrane followed by its internalization into the cytosol in the form of endosomal complexes (EndosComplex). Endosomal escape (kescape) then follows and releases the complexes into the cytosol (CytoComplex).PEI complexes in the cytosol can either migrate into the nucleus before unpacking to release plasmids, or they can unpack in the cytosol; all of the steps required to deliver the plasmids from inside of the cytosol complexes to inside of the nucleus are described by an effective kinetic event (knuclear_entry) , and the final product is naked plasmid in the nucleus (NuclearPlasmid). In addition, the degradation rates of all plasmid-equivalent species in the cytosol are described by one single parameter (kplasmid_degrade).This framework was adapted from previous non-viral gene-delivery model, and simplified to ensure model identifiability/observability with available measurement data.Viral vector synthesis networkHelper plasmid (pHelper) activates expression of the rep/cap gene on the packaging plasmid (pPackaging) and synthesis (kCap_syn and kRep_syn) of viral protein (VP) and Rep protein (RepProtein). Capsid proteins are assembled (kassembly) into empty capsids in the nucleus (EmptyCapNuc), and each capsid particle consists of 60 protein subunits. With helper functions from the helper plasmid, the Rep protein replicates (kDNA_rep) viral DNA (vDNA) from the transgene vector plasmid (pVector). Rep proteins dock on empty capsids (kRep_bind_capsid) to form intermediate complexes (CapRepcomplex), which then interact with viral DNA and encapsidate them inside capsids (kDNA_pack) at a 1:1 ratio to form full virions inside the nucleus (RepRCcomplex). Regardless of their contents (empty or full), capsids can be secreted out of the nucleus (ksecrete) into the cytosol (EmptyCapCyto and FullCapCyto). Rep protein binding (kRep_bind_plasmid) forming bounded plasmids (RepRCcomplex) negatively regulates expression of the rep/cap gene on the packaging plasmid. Degradation of proteins (kRep_protein_degrade and kVP_degrade) is possible during the process. Dotted lines imply that reactants do not get consumed in the reaction.
Results
The mechanistic model that describes the end-to-end viral vector production process contains two parts: non-viral gene delivery (Figure 1) and viral production network (Figure 2), with 21 species and 14 parameters in total. Parameter estimates are obtained from literature and fit to data via maximum likelihood (ML) estimation. Details on the model formulation and equations, as well as the parameter estimation strategy can be found in Mathematical modeling in Materials and methods.
Dynamics of polyethylenimine (PEI)-mediated delivery of plasmid DNA
Model simulation with optimized gene-delivery parameter values shows good agreement with the data used for fitting from Carpentier et al. (Figures 3A and S1). Simulation results show total plasmid uptake peaked at 20 h post-transfection (hpt) (Figure 3A), and plasmid copies in the nucleus peaked later at 45 hpt (Figure S1). Only 5% of the initial plasmid added to the cell culture (total plasmid input) is transfected into the cells, and the amount of plasmid that is eventually trafficked inside the nucleus is much smaller—around 0.6% of the total plasmid input. To validate the dynamic trend of nuclear plasmid copies in PEI-mediated gene delivery, model simulation with the estimated parameters was generated and compared against the data from the study by Glover et al. (Figure 3B). The model was able to describe the dynamic trend of the nuclear plasmid copies up to 24 hpt. The slight overprediction of the model can be attributed to the differences in cell lines used in the studies: (1) HEK293 cells have been shown to have better transfectability than HeLa cells at lower doses with calcium phosphate, and (2) the study by Carpentier et al. used HEK293-EBNA1 cells that express EBNA1 protein for episomal replication of the oriP-harboring plasmid, leading to overall higher plasmid copies.,
Figure 3
Dynamics of PEI-mediated gene delivery
(A) Plasmid copies per cell over time. The model was fitted to single-transfection data for HEK293-EBNA1 cells from Carpentier et al. with initial plasmid input of 2.7 × 105 plasmids/cell. (B) Plasmid content in the nucleus over time. The model prediction was generated using estimated parameters from the Carpentier et al. study and plotted against HeLa cell transfection data from Glover et al. The initial plasmid input was 2.2 × 106 plasmids/cell.
Dynamics of PEI-mediated gene delivery(A) Plasmid copies per cell over time. The model was fitted to single-transfection data for HEK293-EBNA1 cells from Carpentier et al. with initial plasmid input of 2.7 × 105 plasmids/cell. (B) Plasmid content in the nucleus over time. The model prediction was generated using estimated parameters from the Carpentier et al. study and plotted against HeLa cell transfection data from Glover et al. The initial plasmid input was 2.2 × 106 plasmids/cell.Parameter estimates are provided in Table 1. The uptake rate constant (kuptake) is higher in the case estimated by the literature than the value estimated from the in-house experiment. The phenomenon is to be expected since Carpentier et al. performed transfection in HEK293-EBNA1 cells that express EBNA1 proteins that enhances replication of the oriP-harboring plasmid., The estimated degradation constant of plasmid-associated species (kplasmid_degrade) in the cytosol is 0.0195 h–1; this rate is lower than the intrinsic instability constant of the uncomplexed plasmid (0.4621–0.8318 h–1) found in other literature. The estimated value is reasonable, as plasmids associated with complexes are expected to be more stable than uncomplexed plasmids. The smaller value of our plasmid nuclear entry constant (knuclear_entry) compared to the range found in the literature is because several steps were combined into one parameter, effectively prolonging the timescale.
Table 1
Model parameters
Parameter
Process
Value (95% confidence interval)
Unit
Evaluating method
Literature range
kuptake
DNA uptake from medium into cell cytosol
2.78 × 10−2 (2.06 × 10−2, 3.75 × 10−2)
h–1
fit to literature measurement of total plasmid20
exponential decay of 9 × 10419
1.19 × 10−3 (1.00 × 10−3, 1.40 × 10−3)
fit to in-house measurement of vDNA
6.527 × 10−3 to 4.2 × 10–118,25
kescape
endosomal escape
6.00 × 10−1
h–1
parameter for PEI 25K18
−
knuclear_entry
nuclear entry
4.30 × 10−3 (2.4 × 10−3, 7.7 × 10–3)
h–1
fit to literature measurement of nuclear plasmid20
8.75 × 10−2 to 3.5418,25
kplasmid_degrade
plasmid degradation
1.95 × 10−2 (1.09 × 10−2, 3.48 × 10−2)
h–1
fit to literature measurement of total plasmid20
0.4621 to 0.831826
kCap_syn
capsid protein synthesis
6.50 × 104 (5.28 × 104, 8.00 × 104)
molecule/cell/h
fit to in-house measurement of vDNA, total capsid, and full virion
−
kRep_syn
Rep protein synthesis
kVP_degrade
VP degradation
2.77 × 10−1
h–1
literature
0.23 to 0.3527
kRep_protein_degrade
Rep protein degradation
2.45 × 10−2
h–1
literature
0.01 to 628, 29, 30
kDNA_rep
transgene rescue and replication
3.10 × 10−7 (1.52 × 10−7, 6.32 × 10−7)
cell2/molecule2/h
fit to in-house measurement of vDNA
−
kassembly
capsid assembly
1.00 × 105
h–1
fast step assumption31
−
ksecrete
capsid secretion from the nucleus to the cytosol
1.00 × 105
h–1
fast step assumption
−
kDNA_pack
viral DNA encapsidation
8.18 × 10−2 (1.1 × 10−9, 6.08 × 106)
molecule/cell/h
fit to in-house measurement of full virion
−
kRep_bind_plasmid
binding of Rep protein on packaging plasmid
3.38 × 10−5
molecule/cell/h
literature
1.0 × 104 M–1s–132,33
kRep_bind_capsid
binding of Rep protein on empty capsid
5.70 × 10−3 (4.10 × 10−3, 8.00 × 10−3)
molecule/cell/h
fit to in-house measurement of total capsid and full virion
−
vDNA, viral DNA.
Model parametersvDNA, viral DNA.
Dynamics of viral vector synthesis in triple transfection
Model simulation with estimated parameters provides a good fit with experimental data (Figure 4). Interestingly, the plasmid kuptake estimated from in-house experimental data is significantly lower than that estimated from literature data. In-house experimental data show that viral DNA started replicating around 12 hpt and doubled every ∼12 h (Figure 4A). The total amount of capsid produced per cell increased sharply at first, with ∼80% of total capsids produced within the first 24 h and then plateaued. Total capsid particles and full virion production display a similar trend and saturate at a slightly later time point between 24 and 36 h; this trend is consistent with data from another transfection study, where the majority of full virions was produced in the first 24 h. Given the assumption that newly synthesized capsid proteins are rapidly assembled, the plateau in capsid particle production would be caused by saturation in VP production regulated by total Rep protein production. The amount of full virion only accounts for a consistent 2%–3% of the total capsid produced (Figure 4B). Although most of viral DNA was encapsidated at 24 hpt, and DNA replication continued at an exponential rate, the rate of encapsidated DNA did not follow DNA replication and decreased significantly beyond that time point. By 48 hpt, only 26% of total viral DNA made it inside capsid particles (Figure 4A).
Figure 4
Dynamics of viral DNA and rAAV5 capsid synthesis in in-house triple-transfection experiments for HEK293 cells
(A) Dynamics of viral GFP DNA replication. (B) Dynamics of total and full rAAV capsid production. Viral production parameters were fitted to in-house triple-transfection experimental data. Measurements of total and full rAAV capsids are per-milliliter culture, harvested from both cell lysate and supernatant, normalized by the cell density at the sampling time. The dotted line indicates time point of media exchange. Initial plasmid input was 7.6 × 104/cell for each of the three plasmids.
Dynamics of viral DNA and rAAV5 capsid synthesis in in-house triple-transfection experiments for HEK293 cells(A) Dynamics of viral GFP DNA replication. (B) Dynamics of total and full rAAV capsid production. Viral production parameters were fitted to in-house triple-transfection experimental data. Measurements of total and full rAAV capsids are per-milliliter culture, harvested from both cell lysate and supernatant, normalized by the cell density at the sampling time. The dotted line indicates time point of media exchange. Initial plasmid input was 7.6 × 104/cell for each of the three plasmids.
Model validation by varied input plasmid ratio
The model was validated by comparing its predictions using the estimated parameters for various plasmid ratio inputs against data published in literature by Grieger et al. (Figure 5). Data from triple-transfection experiments using the same PEI:DNA mass ratio of 2:1 as in-house experiments were used. Model outcomes for the four plasmid ratios experimentally tested in the aforementioned study were produced and plotted side by side. Data were normalized to the maximal value reported, and model predictions were normalized to the maximal value predicted among four cases. The model captured the relative change in full virion quantity as input plasmid ratio varied: case 2, where both helper and packaging plasmids ratios are increased, gives the highest amount of product, whereas case 4, in which only vector plasmid ratio is increased, gives the lowest yield of all. The observation suggests that optimal productivity requires a balance between the packaging gene and helper gene expression and that the increase of the copies of therapeutic gene plasmid does not improve production. This result also verifies the capability of our model to capture the interactions between components during the viral production process.
Figure 5
Quantity of full virions produced as a function of the molecular ratios of helper, packaging, and vector plasmids
Data/model simulation was normalized to the maximum value reported/predicted in the four cases. The experimental data with error bars were taken from a study by Grieger et al.
Quantity of full virions produced as a function of the molecular ratios of helper, packaging, and vector plasmidsData/model simulation was normalized to the maximum value reported/predicted in the four cases. The experimental data with error bars were taken from a study by Grieger et al.
Sensitivity analysis
The dynamic sensitivity profile of the optimized parameters was computed to evaluate their impacts on the final quantity of full virions as production progresses (Figure 6). Plasmid-delivery parameters (kuptake, knuclear_entry, kescape) consistently display positive effects on the quantity of full virion. kplasmid_degrade and degradation constant of Rep protein (kRep_protein_degrade) exhibit consistent inverse effects on full virion production. The assembly and secretion constants (kassembly and ksecrete) have little impact on full virion production, as expected from the fast step assumptions that we made in model formulation. Binding rate constant (kRep_bind_plasmid) has the largest negative impact on productivity. Interestingly, the sensitivity of full virion to the DNA replication rate constant (kDNA_rep) and packaging constant (kDNA_pack) switches from positive to negative at approximately 25 to 26 hpt. The packaging of DNA simultaneously releases the Rep protein to the free Rep protein pool as formulated in the model, and these Rep proteins can bind onto the packaging plasmid, limiting further rep/cap gene expression. On the other hand, the binding of Rep protein on empty capsid (kRep_bind_capsid) as the first step to the packing process has a positive impact on the full virion formation, and its sensitivity dominates the sensitivity of the kDNA_pack. The observation explains the wide confidence of the packing constant estimated. The analysis suggested factors that significantly impact the viral production process include the following: (1) plasmid trafficking, (2) Rep proteins function as a repressor to gene expression and catalyst to DNA packaging, and (3) DNA replication during first 24 hpt.
Figure 6
Dynamic profiles of the full virion sensitivities (Equation 17) for all of the model parameters.
Dynamic profiles of the full virion sensitivities (Equation 17) for all of the model parameters.
Discussion
Bottlenecks in viral production
Gene-delivery efficiency
Transient transfection at its current state is not an efficient process. Although transfection efficiency could be as high as 60%, as seen in our experiment and in the literature, plasmid copies in the nucleus after overcoming trafficking barriers are very low, as observed in most studies regarding gene delivery via non-viral transfection. Data from Carpentier et al. and the in-house experiment show that only a maximal of 5% of initial plasmid input is taken up inside the cells, and less than 1% of initial plasmid input was trafficked inside of the nucleus. This result is also consistent with the conclusion from the study by Pollard et al. Model sensitivity analysis indicates that plasmid-delivery parameters are limiting to full virion production; therefore, it is necessary to engineer a better transfection reagent or devise a delivery strategy that allows more plasmids to enter the nucleus. Varga et al. performed a quantitative analysis on the performance of transfection reagents and found that PEI performs better than lipofectamine. Within the class of polymer transfection reagents, compared to PEI Max that we used, there are other transfection reagents such as PEI Pro and FectorVIR-AAV (Polyplus Transfection, New York, NY, USA) that boast increased production in functional viral titers in suspension cell culture. There has been evidence that an increase in PEI/DNA complex size over longer incubation time with cells had an adverse effect in uptake process efficiency; thus another strategy to get more plasmids inside of the cells can be temporal dosing of cell cultures at lower dosage to prevent complexes from getting too large to enter the cells.
Saturation of capsid production and delayed DNA replication result in empty capsid
We visualized the reaction fluxes simulated by the model using in-house experimental settings (Figure 7B) to better understand the impact of their dynamics in full virion production. Rep protein (rRepSyn) and VP production (rCapSyn) begin shortly after plasmids successfully enter the nucleus at around 6 hpt, peak around 12 hpt, and decline shortly after. Capsid particles are synthesized with the same timeline as the Rep protein and VP since the assembly is assumed to have fast kinetic; the majority of capsids are shown to assemble (rAssembly) during the first 24 hpt. Beyond this point, unbound packaging plasmids that are active for protein synthesis are significantly reduced due to inhibitive binding of the Rep protein (Figure 7A). Analysis of reaction fluxes and intermediate species concentrations also reveals a possible explanation as to why the majority of capsids in harvest are usually empty. Viral DNA starts replicating later than capsid synthesis, around 12 hpt and peaks at ∼24 hpt, and the trend remains steady until at least 48 hpt as simulated by the model (Figure 7B) and also observed in our data (Figure 4). As mentioned above, capsid production is significantly reduced by 24 hpt, and as postulated in the model formulation, replicated DNA in later phases cannot be packed due to the lack of new capsid production, as previously made empty capsid are already secreted out to the cytosol without DNA. This also explains why enhancing the viral DNA replication kinetics at an earlier time point will have a positive effect on full virion production, as indicated in our sensitivity analysis. DNA replication and capsid synthesis dynamics depend largely on Rep protein regulations, which simultaneously negatively and positively affect gene expression. Optimization of rAAV production will require balancing the rep/cap gene with helper gene expression as evident in Figure 5. An essential strategy to enable viral DNA replication before or in parallel with capsid production is early expression of Rep proteins, which is a required machinery. With the current method, the Rep protein and VP are linked on the same plasmid. The decoupling of the expression of the Rep protein and VP by expressing the Rep protein from a separate plasmid from the cap gene will give better control of gene expression levels. In fact, a study by Emmerling et al. employed a four-plasmid transfection strategy by splitting the rep/cap gene on two different plasmids and achieved a higher rAAV yield that corresponded to an increase in VP production. Our analysis should also motivate a transfection strategy that alleviates protein production stress on cellular machinery as well as syncs the timelines of viral DNA and capsid production. From a process-operating standpoint, temporal dosing of plasmid at different time points is a potential solution, although it will be of concern whether the ability of the culture to uptake plasmids decreases over time.
Figure 7
Model simulation of intermediate species.
Dynamic trends of the (A) species concentrations and (B) reaction fluxes. The values are qualitative. The simulation parameters were according to the in-house experiments described in Materials and methods.
Model simulation of intermediate species.Dynamic trends of the (A) species concentrations and (B) reaction fluxes. The values are qualitative. The simulation parameters were according to the in-house experiments described in Materials and methods.
Limitations and future improvements of the model
Currently, all three plasmids are assumed to have the same kinetics; however, the reality could be different: nonlinear effects of circular DNA length on plasmid transfection efficiency have been reported in literature. Therefore, it is important to determine whether there is a significant difference in the kinetics across different plasmid uptake rates. Besides the assumption of similar uptake kinetics for all three different plasmids, several other simplifying assumptions were made to accommodate limited data availability: (1) transcription and translation steps were combined into one kinetic step; (2) capsid assembly and secretion of capsid from the nucleus into the cytosol were assumed to have fast kinetics; (3) all kplasmid_degrade species were described by one parameter; and (4) the DNA encapsidation rate was assumed to only depend on free replicated viral DNA and empty capsid in the nucleus. Experiments that elucidate these intermediate steps have potential for future work and improvements upon the model.Although the developed model enables the prediction of changes in rAAV production in response to several process parameters such as cell growth rate and plasmid quantity, it has not yet captured the effects of many other process parameters that are commonly manipulated during process development, such as cell density, pH, dissolved oxygen, and media component concentrations on rAAV production. As an example, cell density at the time of transfection has also been reported to influence final product quantity through an undefined mechanism., The enhancement of the media with sodium butyrate and certain peptones, which are known to increase recombinant protein expression, can have a positive effect on the recombinant viral vector productivity.,39, 40, 41 Therefore, the model can be improved by correlating the physiological state of the cell culture to its viral vector productivity. Extension of the model to include such correlations will allow predictions from more process production parameters for further support in process development.
Conclusion
At the current state of process development in viral vector manufacturing, evaluating the impact of system inputs on the outcome still relies on the empirical approach. Statistical and empirical approaches, such as design of experiment (DoE) done by Zhao et al., can be straightforward for optimizing processes but only within a defined parameter space, whereas mechanistic models can provide physical insights and understanding of the process, multivariable interactions and dynamics to guide novel production design, and extrapolated hypotheses that can be tested., In this study, we develop a mechanistic model for the triple-transfection production platform to link transfection inputs to outputs and estimate the relevant parameters. Since the biological mechanism of rAAV synthesis will remain largely the same for other transfection methods and even productive infection or a stable production cell line, our model can be easily adapted to other production methods and serotypes with small modifications. For instance, a double transfection can be derived by removing one set of gene-delivery parameters and species and re-estimating protein synthesis constants. Transfection production data from Chahal et al. showed that the quantities of genome-containing particles of different serotypes produced in the same platform are comparable; therefore, the proposed mechanistic model can be adapted to the production of various rAAV serotypes with slight parameter adjustments. This mechanistic model can provide valuable insights to experimental data, identify process bottlenecks, and guide future research. The leveraging of the model in a quality-by-design implementation has great potential to facilitate process development and advance gene-therapy manufacturing.
Materials and methods
Materials
FreeStyle 293-F cells and 293 Expression Medium were purchased from Thermo Fisher Scientific (Waltham, MA, USA). A set of plasmids, including pAAV-GFP (part no. AAV-400), pRC5 (part no. VPK-425), and pHelper (part no. 340202), was purchased from Cell Biolabs (San Diego, CA, USA) for AAV5 production with GFP as the transgene. Plasmid pRC5 is 7.3 kb long with endogenous promoter p5 and p19 for Rep2 and p40 and p81 for Cap5. Plasmid pHelper is 11.6 kb with endogenous sequences for E2A and E4. Plasmid pAAV-GFP is 5.4 kb long and constructed with a cytomegalovirus (CMV) promoter and GFP between two inverted terminal repeat sequences. These plasmids were amplified using One Shot Stbl3 E. coli from Thermo Fisher Scientific (Waltham, MA, USA) and purified by the Plasmid Mega Kit from QIAGEN (Hilden, Germany). The purified plasmids were further sterilized by 0.22 μm filtration and stored at −20°C. PEI transfection agent PEI Max was purchased from Polysciences (Warrington, PA, USA) and dissolved in MilliQ water to make a solution of 1 μg/μL (pH adjusted to 7.0) by adding 1 M HCl. The solution was sterilized by 0.22 μm filtration and stored at 4°C. The AAV5 ELISA kit was from Progen (Wayne, PA, USA). DNA primers were from Integrated DNA Technologies (Coralville, IA, USA). The Monarch Genomic DNA Purification Kit, the DNase buffer, DNase I, and proteinase K were from New England Biolabs (Ipswich, MA, USA). The iTaq Universal SYBR Green Supermix was from Bio-Rad Laboratories (Hercules, CA, USA).
Transfection method
Cell culture
A vial of FreeStyle 293-F cells was thawed and cultured in FreeStyle 293 expression medium. The culture was conducted at 37°C with 135 rpm agitation in a humidified incubator with 5% CO2. The cells were cultured at 30 mL vol in 125 mL shake flasks (flat bottomed with vented caps), and subcultures were performed between 0.3 and 3 million cells/mL. At all times, cell viability was maintained above 90%. Samples of 50 μL cell-culture samples were taken daily from the shake flasks, mixed with 50 μL trypan blue stain, and analyzed for cell count and viability using the Countess II Automated Cell Counter (Thermo Fisher Scientific, Waltham, MA, USA).
Cell transfection for AAV5 production
Prior to transfection, an aliquot of cells at the exponential growth phase was transferred to new shake flasks and diluted to a cell density of 1 million cells/mL to a total volume of 24 mL by adding pre-warmed medium. Each plasmid solution was quantified using a NanoDrop Spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). The three plasmids were mixed at 1:1:1 molar ratio, and the total plasmid dosage was 2 μg plasmid per milliliter of culture. 1 mL of transfection cocktail was prepared by the sequential addition of the plasmid mix, culture medium, and the PEI Max solution. The amount of PEI Max added was calculated so that the PEI:DNA mass ratio came out to 2:1. The transfection cocktail was vortexed vigorously for 10 s, incubated at room temperature for 10 min, and then added to the cell culture. The final volume of the culture in each shake flask was 25 mL. For the control culture, 25 mL culture was set up at 1 million cells/mL without adding transfection cocktail. All of the shake flasks were cultured as described above. Media exchange was performed 6 h after transfection: the cells were transferred to 50 mL centrifuge tubes and centrifuged at 300 × g for 5 min to remove the supernatant; then re-suspended in 25 mL fresh, warmed medium; and transferred back to the original shake flasks and cultured as described above. Culture samples of 2 mL were taken at the time points of 10, 24, 34, 46, and 55 h. A volume of 50 μL culture broth from each sample was used to measure the cell count and viability immediately. The rest of cell-culture samples were aliquoted into 500 μL in each Eppendorf tube and centrifuged at 1,000 rpm for 5 min. Supernatant and cells were separated into different tubes and stored at −80°C until the assays below were performed.
Analytical methods
Measurement of AAV capsids
The quantity of AAV capsids in the supernatant and cell pellet was measured. To quantify the capsids in the cells, one sample tube containing the cells from 500 μL culture was thawed on ice. A volume of 100 μL pre-chilled lysis buffer composed of 150 mM NaCl, 50 mM Tris-HCl (pH 8.5), was added to the cells. The cells were suspended, briefly vortexed, and then processed by a total of three cycles of freezing in an ethanol bath (a mixture of dry ice and 70% ethanol) for 10 min and thawing in a 37°C water bath for 10 min. The sample was clarified by centrifuging at 12,100 × g, 4°C, for 15 min, followed by transferring the supernatant to a new Eppendorf tube and discarding the precipitates. The samples were then analyzed by an AAV5 ELISA kit according to the instruction from the manufacturer (Progen, Wayne, PA, USA). To quantify the capsids in the supernatant, an aliquot of the supernatant saved from cell culture was thawed and further clarified by centrifuging at 12,100 × g, 4°C, for 15 min. The clarified supernatant was then analyzed by the AAV5 ELISA kit. The final absorbance from the ELISA reactions was read by a microplate reader (BioTek Instruments, Winooski, VT, USA). The concentration of capsids in the cell lysate solution was converted to the capsids’ quantity per cell based on the cell number included in the 500-μL culture sample.
Measurement of intracellular plasmids and AAV genome
A sum of the quantity of pAAV-GFP and AAV-GFP genome replicated during AAV production was measured from cells. With the procedures described in the following, we lysed the cells without separating the cytosol and nucleus; therefore, the quantified DNA was a population from the entire cells. To begin with, one sample tube containing the cells from 500 μL culture was thawed on ice. A wash step was performed to cell pellets to remove extracellular plasmids: 1 mL cold 1× PBS was added to suspend cells; the solution was centrifuged at 1,000 rpm and 4°C for 10 min. The supernatant was discarded, and the cell pellets were collected at the bottom of the tubes. DNA was extracted from cell pellets by the Monarch Genomic DNA Purification Kit following the instruction from the manufacturer. The Monarch Genomic DNA Purification Kit uses the same principles for extracting and purifying DNA as the DNeasy Blood & Tissue Kit (QIAGEN), which has been used to extract DNA at the range of 5 kb size. In addition, there is an equal chance for the positive (+) or negative (−) sense DNA strand to have been encapsidated in AAV capsids. Once released by proteinase K, these complementary strands can form double-stranded DNA. Briefly, cell pellets were suspended in 100 μL cold PBS, mixed with 1 μL proteinase K and 100 μL lysis buffer, and incubated at 56°C for 5 min. Subsequently, the sample was mixed with 400 μL binding buffer and transferred to a spin column for centrifugation at 1,000 × g, room temperature for 3 min, followed by centrifugation at 13,000 × g for 1 min. The column was then placed on a clean tube and washed with 500 μL wash buffer and a centrifugation at 13,000 × g for 1 min, followed by one more wash using 500 μL wash buffer and a centrifugation at 18,000 × g for 1 min. The spin column was then transferred to a clean tube and mixed with 100 μL elution buffer that was pre-heated to 60°C and incubated for 1 min. The spin column was then centrifuged at 13,000 × g for 1 min. The eluent was collected and ready for qPCR analysis. To perform qPCR analysis, 5 μL of the extracted DNA was mixed with 2.5 μL of the 4 μM forward primer, 2.5 μL of the 4 μM reverse primer, and 10 μL of iTaq 2× Universal SYBR Green Supermix. The quantity of pAAV-GFP was measured in the samples by a LightCycler 480 System (Roche, Basel, Switzerland) with the following conditions: 95°C for 10 min for the initial denaturation and activation of enzyme, 40 cycles of 95°C for 15 s (for denaturation), 55°C for 15 s (for primer annealing), and 68°C for 1 min (for extension). The primers used targeted the GFP sequence. The forward primer was 5′-GCAAAGACCCCAACGAGAAG-3′, and the reverse primer was 5′-TCACGAACTCCAGCAGGACC-3′. The plasmid pAAV-GFP was diluted to a series concentration from 2 × 109 to 2 × 102 molecules/μL using DNase-free water and used as the standards. The DNase-free water was used as the non-template control.
Measurement of AAV encapsidated genome
To measure the copy number of the AAV genome that was encapsidated in AAV particles, one sample tube containing the cells from 500 μL culture was thawed and lysed by three cycles of freeze and thaw, following the same procedures described in the method for AAV capsid measurement. In the 100-μL lysate, 10 μL of 10× DNase buffer and 5 μL DNase (10 U) were added. The mixture was incubated at 37°C for 1 h, followed by a heating step at 75°C for 10 min to deactivate the DNase. The solution was then cooled down on ice. A 2-μL of proteinase K (1.6 U) was subsequently added to the solution and incubated at 50°C for 1 h. The solution was then heated at 95°C to deactivate the proteinase. Then, the encapsidated genome was quantified from the samples using the primers targeting GFP and the qPCR procedures as described in the method for intracellular plasmids and AAV genome measurement.
Mathematical modeling
Model formulation
The mechanistic model describes kinetic behavior of transient transfection at the subcellular level and is separated into two parts: the trafficking of plasmid from the supernatant to the nucleus and the vg replication process. Schematics of the two parts of the model are shown in Figures 1 and 2. The model has a total of 21 species and 14 parameters.The first part of the model describes the non-viral gene-delivery process. This part was adapted from previously developed non-viral, gene-delivery models.,, Our model was simplified from the models in the literature to exclude some intracellular intermediate species (nuclear pore complex, plasmid associated with PEI versus plasmid-free plasmid, plasmid bound to transport protein) to utilize measurable states and parameters. Specifically, the model bypasses certain steps and combines species to measurable states to ensure model identifiability.The mass balances of the species in the first part of the model are the following:, and These equations include four major steps, namely the uptake of PEI/plasmid complex from culture medium into the cytosol (kuptake) in the form of endosomal vesicles, the escape of complexes from endosomal vesicles into the cytosol (kescape), the entry of plasmid/complex from the cytosol to inside the nucleus (knuclear_entry), and the overall kplasmid_degrade, which describes the combined effect of possible degradation of vesicles containing complexes and uncomplexed plasmids due to various factors such as nucleases in the cytosol. In addition, intracellular species dilution due to cell division is described by a mutual cell growth parameter (μ).In the second part of the model, the knowledge of rAAV production mechanism was adapted to construct a viral vector synthesis network from the three plasmids once they are successfully delivered inside the cell nucleus. The main steps capturing viral production are described in Figure 2, including Rep protein and VP synthesis, capsid assembly, viral DNA replication, DNA packaging into the capsids, and capsid secretion from the nucleus into the cytosol. The model does not include the secretion of viral particles from the cytosol to the supernatant; this step can be added and is expected to vary for different serotypes. The mass balances of the species in the second part of the model are the following:The cellular process corresponding to each kinetic constant is shown in Table 1. The model assumptions and simplifications of the actual events are the following:One kinetic rate is used to describe the combined kinetics of transcription and translation and does not consider the transportation of mRNA into the cytosol and protein into the nucleus after synthesis.The synthesis rates of Rep protein and VP are described by bilinear kinetics to capture the role of the helper plasmid, specifically the E2A gene, in activation of rep/cap gene expression (Equations 5 and 6). Although the helper protein E1A is also crucial in transcription activation and repression relief,, this protein is stably expressed in HEK293 cells,,, so its effects on capsid production are not explicitly modeled.Viral DNA requires both AAV helper functions of the E4 protein and the Rep protein to replicate, and the replication kinetics are formulated as trilinear reactions, including both helper plasmid and Rep protein as reactants (Equation 7).Capsid proteins are VPs of three different types (VP1, VP2, and VP3) but are denoted as one species and assumed to have the same relative synthesis rates.The Rep protein has been reported to negatively regulate the promoter p5 to the rep gene as well as a translational inhibitor of p40 mRNA in AAV2., Although the plasmid used for in-house experiments has the cap gene from AAV5, it has the same promoter p40, and the Rep protein regulatory function remains the same in our case. The Rep protein regulation is captured via inhibitive binding to the packaging plasmid kRep_bind_plasmid, which effectively inactivates that plasmid for gene expression after binding. The binding of such Rep protein to the packaging plasmid does not reduce the quantity of plasmid present but rather simulates the effect of reduced transcription and translation rates; the bound plasmids are still present but no longer active in protein synthesis activities.Packaging of the vg into preformed capsids requires the helicase activity of Rep protein as well as the presence of Rep protein/capsid complex intermediates;57, 58, 59 thus viral DNA encapsidation is formulated as a two-step process, in which Rep protein first docks on kRep_bind_capsid, and DNA is packaged via strand-displacement synthesis kDNA_pack. The Rep protein is postulated to detach from the capsid after packaging, as there has been evidence that Rep proteins accumulate mostly inside the nucleus.
Parameter estimation strategy
Determination of gene-delivery parameters
Existing models for this process contain parameters that do not pertain to the cell line that was used in in-house transfection experiment, or were only validated for shorter time courses (24 h or less,,) than the length of transfection considered (48 h or more). In addition, uptake kinetics can vary between laboratories depending on transfection conditions. Therefore, kinetic parameters were re-estimated to ensure accuracy of the plasmid-delivery dynamics. Although the quantitative dynamic measurement of intracellular plasmid content, including both cytosolic and nuclear plasmids, has been reported in the literature, the corresponding measurement of plasmid content in the nucleus alone is rarely reported considering the often laborious protocol of nuclei isolation. Since the viral vector production requires component synthesis from all three plasmids, quantification of the plasmid copy number in the nucleus relative to the intracellular amount is necessary to constrain plasmid-delivery kinetic constants, specifically knuclear_entry. In this work, the data of a single transfection reported by Carpentier et al. were leveraged to estimate the gene-delivery parameters because (1) they reported quantitative measurements of plasmids both in the cell and the nucleus, and (2) the cell line and transfection used in the experiment (single transfection of HEK293 with PEI40K) were the most similar to our system. Some data were extracted from the figures using WebPlotDigitizer. Although the measurement of plasmids both in the cell and the nucleus cannot uniquely determine kescape and knuclear_entry, we assume prior knowledge on the kescape using results from a previous study.Estimates of delivery parameters can vary between laboratories due to differences in transfection conditions. The plasmid kuptake, which directly affects transfection efficiency (percentage of cells effectively transfected), can change depending on experimental conditions, including the chemical structure and molecular weight of the transfection agent, media composition, cell density, cell health, cell line, complex incubation times, and plasmid dosage. In particular, utilization of HEK293-EBNA1 cells that express the EBNA1 that enhances oriP plasmids will lead to higher plasmid copies overall and will affect the estimated value of kuptake., Therefore, the kuptake was re-estimated to fit in-house experimental data. However, the feature of this cell line will not affect the nuclear entry barrier and the related knuclear_entry parameter. Loss of intracellular plasmid content (μ) also depends on the doubling time of cells in each experiment. Cell cultures in the literature mentioned above had a faster doubling time (18 h) compared to our in-house experiments (∼30 h on average), suggesting that the plasmid dilution rate would be lower in our case. However, it is reasonable to assume that the rates of endosomal escape and nuclear entry are similar for the same cell line and for the same transfection reagent. Although PEI 40K was used in an in-house experiment instead of PEI 25K in the study by Carpentier et al., data provided from the manufacturer suggest that the two reagents are close in transfection efficiency, as measured in level of protein expression, thus making it reasonable to apply the kescape for PEI 25K previously reported and the knuclear_entry estimated from the data by Carpentier et al. to estimate model parameters with in-house experimental data.
Determination of viral production parameters
Due to the intricate viral synthesis network, limited availability of dynamic data, and the lack of prior knowledge on some parameter values, steps are taken to ensure that all parameter values estimated were unique and make physical sense. We leveraged structural analysis and prior parameter knowledge to make assumptions based on observations from literature in order to reduce the number of degrees of freedom and estimate a unique set of parameters.
Protein synthesis constants
Myers and Carter reported the dynamics of Rep protein and VP during wild-type AAV production by co-infection with adenovirus and showed that the synthesis rates of Rep protein and VP are comparable. Therefore, the same value was used for both Rep and VP synthesis constants (kRep_syn and kCap_syn, respectively) in the estimation.
Protein degradation constants
Although assembled capsids are stable and do not degrade easily, Grosse et al. found that unassembled capsid proteins are prone to degradation and have a half-life of 2 to 3 h, in which the first-order kinetic constant equivalence is 0.23 to 0.35 h–1. On the contrary, Redemann et al. reported that Rep proteins are more stable and have a half-life of at least 15 h, which translates to a first-order degradation rate constant of 0.0462 h–1 or lower. The VP degradation constants (kVP_degrade) in mammalian cells including GFP protein in HEK293 cells, as reported in literature, fall within the 0.01 to 0.06 h–1 range., The protein degradation constant can be assumed to be symmetrically and normally distributed in a logarithmic with base 10 (log10) scale, and its range contains 95% confidence intervals. Therefore, the degradation constants are fixed at 0.2773 h–1 for VP and 0.0245 h–1 for the Rep protein.
Rep protein-binding constant
To our knowledge, the association rate constants of the Rep protein to the respective promoter and capsid have not been reported. Transcription factor-binding constants in vitro have been reported to be on the order of 106 (insect) to 108 (bacteria) M–1s–1. However, the binding constants observed in in vivo systems can be two orders of magnitude lower than in vitro as a result of intracellular diffusion and other proteins crowding around binding sites., Because mammalian cells and insect cells are both eukaryotic, whereas bacteria cells are prokaryotic, the kRep_bind_plasmid is fixed at the lower limit adjusted by a factor of 100, at 104 M–1s–1. Unit conversion to (molecule/cell)–1h–1 was performed using 15 μm as the average diameter of HEK nucleus.
Capsid assembly constant
Myers and Carter’s study suggested that capsid proteins are assembled into viral capsids rapidly following synthesis. Therefore, fast kinetics is assumed based on this observation, and kassembly is fixed to be at least 1,000 times faster than the rate of VP synthesis.
Packing and secretion constants
The relationship between DNA packing rates and capsid secretion rates dictates full:empty capsid ratios in the final product, as viral DNA encapsidation into viral capsids only happens inside the nucleus in the case of parvovirus. The total full virions in in-house experimental data does not distinguish between capsids inside or outside the nucleus. The majority of harvested rAAV particles are empty, as reported in literature and observed in experimental data, suggesting that secretion of capsids from the nucleus might be much faster than DNA packing. Accordingly, secretion is assumed to have fast kinetics, and ksecrete was set to be arbitrarily large while estimating kDNA_pack and kRep_bind_capsid.
Computational methods
Variances of parameters predetermined by the literature (kRep_protein_degrade, kVP_degrade) or estimated from gene-delivery data (knuclear_entry) will affect the determination of the remaining viral production parameters (kuptake,
kRep_syn, kCap_syn, kDNA_rep, kDNA_pack, kRep_bind_capsid). ML estimation, i.e.,was used to capture the effect of predetermined parameter variances in the objective function, where k is the vector of parameter estimates, k0 is the vector of predetermined parameters, y is the vector of experimental observations, u is the vector of model predictions, and V is the matrix of experimental data variance taking into account the predetermined parameter variance V and the normal noise Vε. Detailed derivations can be found in the Materials and methods in a past work by our laboratory.Simulation and optimization were performed entirely in MATLAB (MathWorks, Waltham, MA, USA). The differential equations were solved using a stiff ordinary differential equation (ODE) solver (ode15s). The best-fit parameters were computed using a multi-start approach with an interior-point algorithm to look for the global minima by minimizing the objective function locally from 1,000 random starting points. Since the values of the parameters span a large range in order of magnitude, the optimization was performed over log10
k to search over a larger numerical space and speed convergence and ensure that the estimated confidence intervals are positive.The sensitivities of model species with respect to the model parameters were calculated by integrating the sensitivity equations along with the state equations. Let u denote the vector of state variables, f denote the vector of kinetic equations, W denote the sensitivity matrix, and J and J denote the Jacobian matrices with respect to the states and parameters, respectively. ThenThe Jacobian matrices are defined , and , and the partial derivatives can be calculated for each time step by using the auto differentiator ADiGator. Since we optimize over log10
k, the sensitivities are adjusted correspondingly as follows:
Authors: P L Pham; S Perret; B Cass; E Carpentier; G St-Laurent; L Bisson; A Kamen; Y Durocher Journal: Biotechnol Bioeng Date: 2005-05-05 Impact factor: 4.530
Authors: Moo Sun Hong; M Lourdes Velez-Suberbie; Andrew J Maloney; Andrew Biedermann; Kerry R Love; J Christopher Love; Tarit K Mukhopadhyay; Richard D Braatz Journal: Biotechnol Bioeng Date: 2020-12-21 Impact factor: 4.530
Authors: Jia-Shiung Guan; Kai Chen; Yingnan Si; Taehyun Kim; Zhuoxin Zhou; Seulhee Kim; Lufang Zhou; Xiaoguang Margaret Liu Journal: Front Chem Eng Date: 2022-01-28