| Literature DB >> 35215999 |
David Seifert1,2, Beda Joos3, Dominique L Braun3,4, Corinna S Oberle3,4, Corinne D Schenkel3,4, Herbert Kuster3,4, Christina Grube3, Jürg Böni4, Sabine Yerly5, Vincent Aubert6, Thomas Klimkait7, Huldrych F Günthard3,4, Niko Beerenwinkel1,2, Karin J Metzner3,4.
Abstract
Little is known about whether and how variation in the HIV-1 genome affects its transmissibility. Assessing which genomic features of HIV-1 are under positive or negative selection during transmission is challenging, because very few virus particles are typically transmitted, and random genetic drift can dilute genetic signals in the recipient virus population. We analyzed 30 transmitter-recipient pairs from the Zurich Primary HIV Infection Study and the Swiss HIV Cohort Study using near full-length HIV-1 genomes. We developed a new statistical test to detect selection during transmission, called Selection Test in Transmission (SeTesT), based on comparing the transmitter and recipient virus population and accounting for the transmission bottleneck. We performed extensive simulations and found that sensitivity of detecting selection during transmission is limited by the strong population bottleneck of few transmitted virions. When pooling individual test results across patients, we found two candidate HIV-1 genomic features for affecting transmission, namely amino acid positions 3 and 18 of Vpu, which were significant before but not after correction for multiple testing. In summary, SeTesT provides a general framework for detecting selection based on genomic sequencing data of transmitted viruses. Our study shows that a higher number of transmitter-recipient pairs is required to improve sensitivity of detecting selection.Entities:
Keywords: HIV-1; SHCS; Selection Test in Transmission (SeTesT); Vpu; ZPHI; transmission; transmitter–recipient pairs
Mesh:
Substances:
Year: 2022 PMID: 35215999 PMCID: PMC8876189 DOI: 10.3390/v14020406
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Determining transmitter–recipient relationships in the ZPHI and SHCS cohorts. We determine direct transmitter–recipient relationships by calling HIV-1 consensus sequences for all patients’ samples, generated using ngshmmalign. Pairwise similarity distances between patients’ samples were computed and combined with additional clinical and epidemiological data in order to identify transmitter–recipient pairs.
Figure 2Modelling the transmission bottleneck. The transmission bottleneck of the HIV-1 population shapes its composition in the recipient drastically. Starting with an infinite pool of three genotypes A, B and C in the transmitter , our model estimates deviations from neutrality by first passing this population through a strong bottleneck, yielding a small founder population in the recipient . Both the transmitter and founder population cannot be estimated directly, and can only be sampled by NGS, yielding the vector of counts and . The Euclidean distance t between vectors and of relative abundances defines our test statistic.
Figure 3Sensitivity of detecting selection in different selection regimes with different numbers of pooled transmitter–recipient pairs. We assessed statistical sensitivity of detecting selection across 2, 30 and 1000 pairs. On the right, we performed the same analysis but in the presence of strong selection.
Figure 4Histogram of number of tests per locus that could be performed for the whole HIV-1 genome. In total, we analyzed 2773 individual amino acid loci, with a maximum of 12 tests performed for any locus. In order to improve sensitivity, we only analyzed those loci further where six or more tests could be performed out of all 30 pairs.
Figure 5Potential selection of Vpu variants during HIV-1 transmission. Heatmap of p-values of the Vpu protein. Each column represents the test outcomes for a transmitter–recipient pair and every row represents one amino acid locus out of 82 across all 30 recipient and transmitters. Significant p-values after pooling across pairs without Benjamini–Hochberg multiple testing correction are shown on the right axis. Amino acid loci 3 and 18 were found to be significant.
Figure 6No evidence for selection of Env gp120 V region variants during HIV-1 transmission. Heatmap of p-values of the gp120 immunogenic loci. Each column represents the test outcomes for a transmitter–recipient pair. The p-values across the 30 transmitter–recipient pairs for the four immunogenic loci (V1/V2, V3, V4 and V5) are shown on the x and y axis, respectively.