| Literature DB >> 30052678 |
Jayna Raghwani1,2, Andrew D Redd3,4, Andrew F Longosz3, Chieh-Hsi Wu5, David Serwadda6,7, Craig Martens8, Joseph Kagaayi6, Nelson Sewankambo6,9, Stephen F Porcella8, Mary K Grabowski10, Thomas C Quinn3,4, Michael A Eller11,12, Leigh Anne Eller11,12, Fred Wabwire-Mangen11,12, Merlin L Robb11,12, Christophe Fraser1, Katrina A Lythgoe1,2.
Abstract
HIV-1 undergoes multiple rounds of error-prone replication between transmission events, resulting in diverse viral populations within and among individuals. In addition, the virus experiences different selective pressures at multiple levels: during the course of infection, at transmission, and among individuals. Disentangling how these evolutionary forces shape the evolution of the virus at the population scale is important for understanding pathogenesis, how drug- and immune-escape variants are likely to spread in populations, and the development of preventive vaccines. To address this, we deep-sequenced two regions of the HIV-1 genome (p24 and gp41) from 34 longitudinally-sampled untreated individuals from Rakai District in Uganda, infected with subtypes A, D, and inter-subtype recombinants. This dataset substantially increases the availability of HIV-1 sequence data that spans multiple years of untreated infection, in particular for different geographical regions and viral subtypes. In line with previous studies, we estimated an approximately five-fold faster rate of evolution at the within-host compared to the population scale for both synonymous and nonsynonymous substitutions, and for all subtypes. We determined the extent to which this mismatch in evolutionary rates can be explained by the evolution of the virus towards population-level consensus, or the transmission of viruses similar to those that establish infection within individuals. Our findings indicate that both processes are likely to be important.Entities:
Mesh:
Year: 2018 PMID: 30052678 PMCID: PMC6082572 DOI: 10.1371/journal.ppat.1007167
Source DB: PubMed Journal: PLoS Pathog ISSN: 1553-7366 Impact factor: 6.823
Summary information for the 34 individuals and corresponding samples analysed in this study.
| Individual | Gender | Age at seroconversion | p24 subtype | gp41 subtype | Log SPVL | First sample | Last sample | Number p24 samples | Number gp41 samples |
|---|---|---|---|---|---|---|---|---|---|
| i1 | F | 46 | D | D | 5.03 | 175 | 1268 | 5 | 5 |
| i2 | F | 31 | D | D | 4.71 | 219 | 2276 | 5 | 5 |
| i3 | F | 28 | D | D | 4.86 | 214 | 2249 | 2 | 4 |
| i4 | F | 30 | D | D | 4.86 | 225 | 1190 | 5 | 5 |
| i5 | F | 31 | A | A | 4.08 | 213 | 1826 | 5 | 5 |
| i6 | M | 40 | C | A | 237 | 672 | 5 | 5 | |
| i7 | F | 26 | D | D | 5.69 | 202 | 2268 | 5 | 5 |
| i8 | F | 32 | D | D | 4.02 | 218 | 2946 | 5 | 5 |
| i9 | M | 38 | D | D | 4.53 | 198 | 910 | 4 | 4 |
| i10 | M | 23 | D | A | 3.17 | 173 | 2110 | 5 | 5 |
| i11 | M | 24 | A | A | 5.81 | 363 | 1776 | 4 | 4 |
| i12 | M | 31 | D | D | 5.12 | 291 | 1734 | 5 | 4 |
| i13 | M | 42 | D | D | 4.28 | 234 | 1757 | 5 | 5 |
| i14 | F | 24 | D | D | 5.29 | 425 | 1688 | 5 | 5 |
| i15 | F | 18 | A | A | 3.66 | 204 | 2042 | 5 | 5 |
| i16 | M | 31 | D | A | 4.96 | 228 | 666 | 5 | 5 |
| i17 | F | 21 | D | D | 3.96 | 191 | 1919 | 5 | 4 |
| i18 | F | 18 | A | A | 5.23 | 150 | 1040 | 5 | 5 |
| i19 | F | 20 | D | D | 5.24 | 228 | 679 | 4 | 4 |
| i20 | F | 35 | D | D | 198 | 910 | 5 | 5 | |
| i21 | M | 23 | A | D | 4.96 | 246 | 1724 | 5 | 5 |
| i22 | M | 37 | D | D | 3.71 | 240 | 674 | 5 | 5 |
| i23 | F | 30 | D | D | 4.22 | 230 | 1863 | 5 | 5 |
| i24 | F | 21 | D | D | 5.02 | 226 | 1207 | 5 | 5 |
| i25 | F | 35 | A | A | 5.35 | 226 | 1806 | 5 | 5 |
| i26 | F | 36 | D | D | 5.1 | 235 | 1144 | 3 | 5 |
| i27 | F | 24 | D | C | 4.13 | 239 | 2365 | 5 | 5 |
| i28 | F | 22 | D | D | 5.6 | 234 | 654 | 5 | 5 |
| i29 | F | 29 | D | D | 4.4 | 168 | 1246 | 5 | 5 |
| i30 | M | 22 | A | A | 5.12 | 187 | 1927 | 4 | 4 |
| i31 | F | 25 | D | A | 4.24 | 174 | 2096 | 5 | 5 |
| i32 | M | 22 | A | A | 4.37 | 204 | 1624 | 5 | 5 |
| i33 | F | 30 | D | D | 4.61 | 208 | 1652 | 5 | 5 |
| i34 | M | 23 | A | D | 4.4 | 193 | 927 | 5 | 5 |
a Set-point viral load
bEstimated number of days since seroconversion for the first and last sequenced samples. The seroconversion date was estimated as the mid-date between the last negative and first positive samples. For all individuals except i11, the first sequenced sample corresponds to the first positive sample, giving a window +/- the number of days shown in the first sample column. For i11, the window is +/- 340 days.
Fig 1Diversity and divergence over time for 34 individuals for both p24 and gp41.
Top Row: Mean pairwise diversity at first, second, and third codon positions over time for individuals (represented in yellow, pink, and light blue, respectively). The average change in mean pairwise diversity over time was inferred by linear regression. Bottom Row: Mean nonsynonymous (purple) and synonymous divergence (blue) over time for individuals.
Fig 2Evolutionary rates for p24 and gp41 at the within- and between-hosts scales measured in substitutions per site per year (subs/site/year).
The within-host nonsynonymous (filled circles) and synonymous (open circles) substitution rates were estimated for p24 and gp41 for all 34 individuals, which are ordered from left to right (subtype A, pink; subtype C, green; subtype D, yellow). Blue and white columns correspond to the individual estimates. The estimates in the grey background indicate the between-host substitution rates for subtypes A, C, and D. The vertical lines represent the 95% credible intervals (solid lines, nonsynonymous; dashed lines, synonymous).
Fig 3Changes towards population consensus.
Top Row: The proportion of polymorphic sites where a mutant allele represents a change towards the (subtype-specific) population consensus. Bottom Row: The bias in changes towards population consensus. This is the ratio of the proportion of changes that are towards (subtype-specific) population consensus, compared to the expected proportion in the absence of selection, measured at polymorphic sites and with an assumed mutational transition to transversion ratio of two. A bias of 1 means the proportion of changes that are towards population consensus matches the expectation. The error bars give the 5 and 95 percentiles from 10,000 bootstraps of the individual data. Black, all changes; Blue, synonymous changes; Red, nonsynonymous changes.
Fig 4Contribution to the mismatch in evolutionary rates if founder-like virus has a transmission advantage.
Each point represents the mean contribution of the 34 individuals to the mismatch in phylogenetic rates if transmission occurs during the given time period, and where each viral sequence has a transmission fitness w = e−. The contribution to the mismatch for each sampling time point for each individual was calculated as the ratio of the mean number of mutations from the founder population, μ, to the expected mean distance of transmitted virus from the founder population, μT, giving m = μ/μT. A mismatch of 1 therefore indicates the case where the within- and between-host rates of evolution are expected to be the same. We show results for a moderate (α = 1), large (α = 2), and very large (α = 3) transmission advantage. The error bars give the 5 and 95 percentiles from bootstrapping over the individuals 100,000 times. Black, all mutations; Blue, only synonymous mutations are considered when calculating the expected mismatch; Red, only nonsynonymous mutations are considered when calculating the expected mismatch (see Methods).