| Literature DB >> 35145025 |
David A Swan1, Morgane Rolland2,3, Joshua T Herbeck4, Joshua T Schiffer1,5,6, Daniel B Reeves7.
Abstract
Modern HIV research depends crucially on both viral sequencing and population measurements. To directly link mechanistic biological processes and evolutionary dynamics during HIV infection, we developed multiple within-host phylodynamic models of HIV primary infection for comparative validation against viral load and evolutionary dynamics data. The optimal model of primary infection required no positive selection, suggesting that the host adaptive immune system reduces viral load but surprisingly does not drive observed viral evolution. Rather, the fitness (infectivity) of mutant variants is drawn from an exponential distribution in which most variants are slightly less infectious than their parents (nearly neutral evolution). This distribution was not largely different from either in vivo fitness distributions recorded beyond primary infection or in vitro distributions that are observed without adaptive immunity, suggesting the intrinsic viral fitness distribution may drive evolution. Simulated phylogenetic trees also agree with independent data and illuminate how phylogenetic inference must consider viral and immune-cell population dynamics to gain accurate mechanistic insights.Entities:
Keywords: HIV primary infection; phylodynamics; phylogenetics; viral dynamics modeling; viral evolution
Mesh:
Year: 2022 PMID: 35145025 PMCID: PMC8851487 DOI: 10.1073/pnas.2109172119
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Mechanistic WiPhy models and the optimal fit to experimental data. (A) Mechanistic model schematic. Susceptible cells are infected by viral variant with a genotype , generating new infected cells (latent and active), producing more virus, and engendering immune responses. (B) Mutation model governs new variants that are defective (probability ) or intact. (C) If intact, point mutations (probability ) can occur that change variant fitness (infectivity) based the viral fitness (vf) model—exponential model was optimal. (D) Adaptive immune (ai) models were also varied—global was optimal. (E) Three stochastic replicate simulations of the best model (exponential-global, purple) against the five types of experimental data (gray, see for details on data and cohorts).
Fig. 4.Comparative analysis of experimental and model tree estimation. (A) Experimental tree (C1V2 env, p1362). All sampling schemes are based on this individual. (B) Running the best model three times (i), sampling sequences with identical timing and sample size three times (j), and with three tree estimate replicates (k) resulted in 27 trees enumerated i.j.k. Two example simulated trees visually match the experimental tree. (C) Quantitative comparison of trees using phylogenetic summary statistics show some simulations (dots) agree with data (dashed line) and that model run introduces the most variability (solid colored lines are medians across sequence sampling and BEAST run).
Fig. 2.Model-predicted fitness distribution resembles in vivo data not restricted to primary infection as well as in vitro data without the influence of adaptive immune pressure. The exponential distribution predicted by the model was compared to available in vivo sequence entropy and in vitro DMS data that quantified the relative fitness of all amino acid changes within env. Ranked fitness has similar fractions advantageous. Distributions were not significantly different by paired Kolmogorov–Smirnov tests. (Inset) Cumulative distribution functions (cdf).
Fig. 3.Visualizing evolutionary dynamics in the optimal model. Example simulation of the best model (variants ever in top 10 and total viral loads). (A) Coloring by genotype number illustrates population sweeps and >106 intact variants; many more defective variants have been created. (B) Variant trajectories shifted to the time they entered the top 10 by abundance; variants emerging later in infection have different kinetic profiles than those from early infection (compare red and blue). (C) Coloring by HD to founder sequence illustrates most early (red) variants have approximately one point mutation from the founder sequence, whereas later sequential evolution has occurred, with variants emerging with more than two mutations from the founder sequence. (D) Proportional abundance colored by HD illustrates the stark shift from founder predominance to more evenness after viral load nadir. (E) The complete transmission record, or genealogy illustrates the “true tree”—the parental genotype of each variant created on each day. Certain lineages persist for more than a hundred days, meaning that offspring are generated from a parental sequence that was created months prior. (F) The tMRCA of 50 randomly sampled sequences on a given day is bimodal: Variants are created by both more ancestral and more recent parents.