| Literature DB >> 30186625 |
Antoine Chaillon1, Sara Gianella1, Susan J Little1,2, Gemma Caballero1, Francis Barin3, Sergei Kosakovsky Pond1, Douglas D Richman1,2, Davey M Smith1,2, Sanjay R Mehta1,2.
Abstract
Transmission of multiple founder variants has been associated with faster HIV disease progression. Many studies have attempted to determine the number of founder variants, mainly by analysis of sequence diversity and/or tree topology from acutely HIV-infected individuals. We hypothesized that adding sequence data collected from source partners might improve resolution and characterization of transmission events. Blood plasma samples were collected from both the source and recipient in thirty epidemiologically- and phylogenetically linked transmission pairs. All were men who have sex with men, sampled on average 70 days (range 11-170) after the recipient's estimated date of infection. Next generation sequencing (454 FLX, Roche) of HIV-1 env (C2-V3) was performed for all samples. Inspection of sequence alignments, highlighter plots, phylogenetic tree topologies and sequence diversity were used to determine the multiplicity of founder viruses with and without the inclusion of source data. Using only recipient sequence data, we were able to resolve multiplicity in twenty-six of the thirty transmission pairs (87 percent). Among them, five presented with a high viral diversity at baseline (>0.10 subst/site), consistent with multiple founders. By incorporating sequence data collected from the source partner, we were able to characterize all thirty transmission pairs. Overall, sixteen transmission events (53.3 percent) involved multiple founders. Results obtained by combining sequence data from recipient and source were congruent for nineteen of the twenty-six (73 percent) cases where conclusions were made using only recipient sequence data. The multiplicity of founders was associated with significantly higher HIV RNA levels (P = 0.04). To further evaluate the transmission bottleneck, we focused on single founder transmissions (fourteen of the thirty), and identified four recipients (28.6 percent) that had founder variants that were inferred to arise from minority viral populations in the source. These source clades ranged from 1.0 to 5.4 percent of the sampled population. Incorporating sequence data from the source increased of the ability to determine the multiplicity of founder variants, reduced misclassification, and allowed us to infer the transmission of minority variants.Entities:
Keywords: HIV transmission; MSM; deep sequencing; founder
Year: 2016 PMID: 30186625 PMCID: PMC6118227 DOI: 10.1093/ve/vew012
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Figure 1.Distribution of the maximum pairwise diversity of sampled sequences at the time of diagnosis among the thirty recipient partners. The maximum pairwise diversity ranged from 0.01 to 0.41 substitutions/site, with five recipients having distinctly higher maximum diversity (>0.10 substitutions/site [in red]).
Figure 2.Highlighter plots and ML phylogenetic trees from baseline samples for recipients 9, 11, and 15. Baseline samples were collected after 11, 137, and 70 days respectively. Single (recipient 9) and multiple (recipient 11) founders are inferred based upon highlighter plots, tree topologies, and maximum viral diversity are indicated in red (multiple) and blue (single) squared boxes. Unresolved case (recipient 15) is squared with dashed black lines. ML trees are midpoint rooted. The scale bar represents a genetic distance of 0.01 for all pairs. EDI Estimated date of infection. The entire dataset of thirty recipients is proposed in Supplementary Figure S2.
Figure 3.ML phylogenetic trees for transmission pairs 9, 11, and 15. Blood plasma haplotypes from the source are indicated in red. Blood plasma haplotypes collected at baseline in the recipients are colored in green. Multiple and single founders are inferred based upon highlighter plots, tree topologies and viral diversity are indicated in red (multiple) and blue (single squared boxes. The scale bar represents a genetic distance of 0.01 for all pairs. The entire dataset of 30 pairs is proposed in Supplementary Figure S3.
Figure 4.Characteristics of single transmitted HIV-1 Variants. Aligned haplotypes derived from NGS env nucleotide sequences for linked sources (blue and red triangle representing sequences derived from seminal and blood plasma RNA respectively) and recipients (green triangle) were used to generate ML trees for individual transmission pairs. Bootstrap support values are indicated. Distribution and ranking position of the transmitted (red) and not transmitted (blue) haplotypes are indicated for each source. Single transmitted founder variants were defined by a maximum pairwise diversity of <0.10 substitutions/site and a monophyletic clade visualized on the tree topology with a minimum bootstrap support of 70 percent. * The most closely related (transmitted) source variant haplotype to the recipient viral population.
Figure 5.Decision tree to demonstrate how the multiplicity of HIV founder variants in 30 MSM transmission pairs was assigned. Classification (single vs multiple founder) obtained with the sequence data from the recipient only (blue) and with inclusion of the data from the source partner (red) are presented in top and bottom parts of the figure, respectively.