| Literature DB >> 31809523 |
Nuno R Faria1, Nicole Vidal2, José Lourenco1, Jayna Raghwani1, Kim C E Sigaloff3,4, Andy J Tatem5,6, David A M van de Vijver7, Andrea-Clemencia Pineda-Peña8,9, Rebecca Rose10, Carole L Wallis11, Steve Ahuka-Mundeke12, Jean-Jacques Muyembe-Tamfum12, Jérémie Muwonga13, Marc A Suchard14, Tobias F Rinke de Wit3, Raph L Hamers3, Nicaise Ndembi15, Guy Baele16, Martine Peeters2, Oliver G Pybus1, Philippe Lemey16, Simon Dellicour16,17.
Abstract
Since the ignition of the HIV-1 group M pandemic in the beginning of the 20th century, group M lineages have spread heterogeneously throughout the world. Subtype C spread rapidly through sub-Saharan Africa and is currently the dominant HIV lineage worldwide. Yet the epidemiological and evolutionary circumstances that contributed to its epidemiological expansion remain poorly understood. Here, we analyse 346 novel pol sequences from the DRC to compare the evolutionary dynamics of the main HIV-1 lineages, subtypes A1, C and D. Our results place the origins of subtype C in the 1950s in Mbuji-Mayi, the mining city of southern DRC, while subtypes A1 and D emerged in the capital city of Kinshasa, and subtypes H and J in the less accessible port city of Matadi. Following a 15-year period of local transmission in southern DRC, we find that subtype C spread at least three-fold faster than other subtypes circulating in Central and East Africa. In conclusion, our results shed light on the origins of HIV-1 main lineages and suggest that socio-historical rather than evolutionary factors may have determined the epidemiological fate of subtype C in sub-Saharan Africa.Entities:
Mesh:
Year: 2019 PMID: 31809523 PMCID: PMC6897401 DOI: 10.1371/journal.ppat.1007976
Source DB: PubMed Journal: PLoS Pathog ISSN: 1553-7366 Impact factor: 6.823
Characteristics of the data sets and estimated evolutionary parameters.
N: number of sequences per discrete location/country (k) in the DRC and in Central and East Africa (CEA). Δt = time interval of sequence sampling in this study. ρ indicates the Pearson’s correlation coefficient between N and HIV prevalence in 2013 for Burundi, DRC, Kenya, Rwanda, Tanzania and Uganda as recorded by UNAIDS [113]. P-values indicate statistical significance under a linear regression parametric model. The cumulative number of sequences per country and HIV seroprevalence over time can be found in S5 Fig. R2 indicates the correlation between genetic divergence and sampling dates (S6 Fig).
| Characteristics | Subtype C | Subtype A1 | Subtype D |
|---|---|---|---|
| NDRC ( | 91 (4), 304 (20) | 68 (4), 504 (20) | 15 (4), 447 (20) |
| Δt (years) CEA | 1997–2011.03 | 1996–2011.13 | 1996–2010.88 |
| 0.47 (0.079) | 0.71 (0.022) | 0.19 (0.21) | |
| R2 (root-to-tip) CEA | 0.016 | 0.080 | 0.079 |
Fig 1Origins and early spread of HIV-1 virus subtypes in the Democratic Republic of Congo (DRC).
Maximum clade credibility tree based on 346 pol sequences sampled in the DRC in 2008. Branch colours depict the most probable inferred ancestral location as inferred using discrete phylogeographic analysis [26]. Grey boxes indicate the positions of isolates identified as belonging to subtype A1, C and D both by REGAv3.0 [71] and COMET [72] subtyping tools. Mean posterior support for the modal location estimates (upper italic number) and phylogenetic support values (lower number) are shown for specific viral lineages and for the root location. Since the DRC pol sequence alignment did not contain temporal information (all sequences were sampled in 2008), subtype node heights were calibrated using information from a previously published molecular clock analysis of env sequences [2]. The lower panel shows the node age density distributions stratified by location state for each subtype represented in this data set.
Fig 2Pathways of HIV-1 lineage movement across Central and East countries.
Fig 2A, 2B and 2C summarise lineage migration estimates for subtypes C-A1-D across Central East Africa and are represented using circular plots. Origin and destination locations of virus lineage movements are connected by circle segments. The width of the link at its basis indicates the frequency of viral movements as estimated using a robust counting approach and can be interpreted using the tick marks on the outside of the circle’s segments. The directionality of the virus lineage movement is encoded by the origin colour and by the gap between link and circle segment at the source location. Fig 2D, 2E and 2F show the estimated proportion of virus lineage movements through time within sampled countries with more than one sampling location. A proportion equal to 1 would indicate that all inferred movements occurred within a single country. This analysis only included virus lineage migration movements between countries for which data were available for at least two locations.
Fig 3Velocity of spread of main HIV-1 lineages in Central and East Africa.
(a) Posterior estimates for the branch dispersal velocity of HIV-1 subtypes. Filled and non-filled distributions represent respectively estimates from analyses with and without informed root location priors (see Materials and Methods for details). The inset shows the number of location transitions per time unit obtained using a robust counting approach. (b) Change of the epidemic wavefront through time, measured as the furthest extent of the subtype’s inferred location of origin to inferred branch locations.
Geographic origins and rate of spread of HIV-1 in Central and East Africa.
Posterior probability for the ancestral root location in the DRC (details in S1 Table). CEA: Central East Africa. BCI: Bayesian credible interval. Mean branch dispersal velocities and mean diffusion coefficients were estimated using the R package “seraphim” [111].
| Posterior estimates | Subtype C | Subtype A1 | Subtype D |
|---|---|---|---|
| Root location in DRC | Mbuji-Mayi | Kinshasa | Kinshasa |
| Migration rate in DRC | 15.33 | 10.03 | 9.92 |
| Migration Rate in CEA | 14.1 | 8.65 | 6.03 |
| Mean branch velocity in CEA | 82.43 | 25.68 | 14.41 |
| Mean diffusion coefficient | 19562.55 | 3450.91 | 1283.41 |
Fig 4Synonymous and non-synonymous rates of substitution for main subtypes circulating in Central and East Africa.
Panel a shows the posterior distributions for the overall nucleotide substitution rates (substitutions per nucleotide site per year) based on the entire gene region under study. Panel b shows the absolute non-synonymous and the synonymous rates obtained using a renaissance counting approach. Finally, the inset shows the dN/dS ratio obtained using a renaissance counting approach [31], with dashed and solid lines indicating the mean dN/dS ratios estimated with MEME [98] and aBSREL [99]. Please note that the mean dN/dS ratios for subtypes A1 (dark grey solid line) and D (light grey solid line) estimated with aBSREL are overlapping.