| Literature DB >> 29975689 |
Mariya V Sivay1, Sarah E Hudelson1, Jing Wang2, Yaw Agyei1, Erica L Hamilton3, Amanda Selin4, Ann Dennis5, Kathleen Kahn6, F Xavier Gomez-Olive6, Catherine MacPhail6,7, James P Hughes8, Audrey Pettifor6,9, Susan H Eshleman1, Mary Kathryn Grabowski1.
Abstract
BACKGROUND: South Africa has one of the highest rates of HIV-1 (HIV) infection world-wide, with the highest rates among young women. We analyzed the molecular epidemiology and evolutionary history of HIV in young women attending high school in rural South Africa.Entities:
Mesh:
Year: 2018 PMID: 29975689 PMCID: PMC6033411 DOI: 10.1371/journal.pone.0198999
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1HPTN 068 study cohort flowchart.
The figure provides an overview of the study cohort, including the number of samples tested and the number of HIV genotyping results obtained in each participant group: infected at enrollment, infected during the main study (between enrollment and their expected graduation date), or infected during the follow-up study (after their expected graduation date). Abbreviations: mL: milliliter. Footnote for Fig 1: aFour participants were excluded due to unknown HIV status.
Fig 2Approximately maximum-likelihood phylogenetic tree of 2,494 HIV subtype C pol sequences.
Study sequences are indicated in the phylogeny with red branches. Abbreviation: DRM: drug resistance mutation.
Phylogenetic clusters containing two or more study sequences detected using different tree reconstruction and cluster detection algorithms.
| Phylogenetic tree reconstruction method | Cluster detection software | Clade support threshold | Maximum genetic distance threshold | Total number of clusters detected | Cluster size distribution (N) |
|---|---|---|---|---|---|
| FastTree | Cluster Picker | 90% | 4.5% | 21 | 2 (17), 4 (3), 6 (1) |
| RAxML | Cluster Picker | 90% | 4.5% | 21 | 2 (18), 4 (2), 6 (1) |
| FastTree | Cluster Picker | 90% | 2.5% | 16 | 2 (15), 6 (1) |
| RAxML | Cluster Picker | 90% | 2.5% | 16 | 2 (15), 6 (1) |
| FastTree | Cluster Picker | 90% | 1.0% | 6 | 2 (6) |
| RAxML | Cluster Picker | 90% | 1.0% | 6 | 2 (6) |
| - | HIV-TRACE | - | 2.5% | 16 | 2 (15), 6 (1) |
aCluster size distribution (number) is shown for clusters containing two or more study sequences.
Abbreviation: N: number.
Fig 3Transmission clusters from approximately maximum-likelihood phylogenetic tree.
Phylogenetic clusters were detected in the approximately maximum-likelihood phylogenetic tree (FastTree) at a 4.5% genetic distance threshold. Each row represents one of the twenty-one phylogenetic clusters. Symbols representing participants in clusters are colored in red (intervention) or blue (control). Symbols represent the timing of participants’ first positive HIV test (enrollment, main study, follow-up study). Data from participants are shown on the x-axis (calendar time) according to the date of their first HIV-positive visit in the study. Drug resistant viruses are denoted with an asterisk.
Associations between demographic and epidemiological characteristics of 200 HIV-infected study participants and phylogenetic clustering (4.5% genetic distance threshold).
| Characteristic | Total number; N = 200 | Clustered; N = 59 | Odds ratio | P-value |
|---|---|---|---|---|
| 69 (34.5%) | 22 (37.3%) | 1.0 (ref) | … | |
| 46 (23%) | 13 (22%) | 0.84 (0.36–1.89) | 0.68 | |
| 44 (22%) | 11 (18.7%) | 0.71 (0.3–1.64) | 0.43 | |
| 41 (20.5%) | 13 (22%) | 0.99 (0.43–2.26) | 0.99 | |
| 39 (19.5%) | 9 (15.3%) | 1.0 (ref) | … | |
| 41 (20.5%) | 15 (25.4%) | 1.92 (0.43–5.28) | 0.19 | |
| 71 (35.5%) | 19 (32.2%) | 1.22 (0.5–3.14) | 0.67 | |
| 49 (24.5%) | 16 (27.1%) | 1.62 (0.36–4.33) | 0.32 | |
| 95 (47.5%) | 24 (40.7%) | 1.0 (ref) | … | |
| 105 (52.5%) | 35 (59.3%) | 1.48 (0.8–2.76) | 0.21 | |
| 68 (34%) | 16 (27.1%) | 1.0 (ref) | … | |
| 92 (46%) | 32 (54.2%) | 1.73 (0.87–3.27) | 0.65 | |
| 40 (20%) | 11 (18.6%) | 1.23 (0.5–3) | 0.13 | |
| 61 (30.5%) | 22 (37.3%) | 1.0 (ref) | … | |
| 112 (56%) | 27 (45.8%) | 0.56 (0.29–1.11) | 0.1 | |
| 27 (12.5%) | 10 (16.9%) | 1.04 (0.39–2.65) | 0.93 | |
| 92 (46%) | 30 (50.9%) | 1.0 (ref) | … | |
| 75 (37.5%) | 19 (32.2%) | 0.70 (0.35–1.37) | 0.31 | |
| 15 (7.5%) | 4 (6.8%) | 0.75 (0.2–2.4) | 0.65 | |
| 180 (90%) | 55 (93.2) | 1.0 (ref) | … | |
| 20 (10%) | 4 (6.8%) | 0.57 (0.16–1.63) | 0.33 | |
| No | 90 (45%) | 25 (42.4%) | 1.0 (ref) | … |
| Yes | 69 (34.5%) | 23 (39%) | 1.3 (0.65–2.57) | 0.45 |
| No | 122 (61%) | 37 (62.7%) | 1.0 (ref) | … |
| Yes | 49 (24.5%) | 15 (25.4%) | 1.01 (0.48–2.06) | 0.971 |
aCD4 cell count data was not available for 18 women
bHSV-2 infection status was not available for 41 women
cPregnancy history was not available for 29 women.
Abbreviations: N: number; CI: confidence intervals; ref: reference.
Fig 4GMRF Bayesian Skyride plot of HIV subtype C.
The GMRF Bayesian Skyride plot was reconstructed from the 200 pol gene sequences. Bold black line indicates the median effective population size through time; blue shaded area represents the 95% highest posterior density (HPD) interval. The vertical dotted lines represent the estimated date of origin (1958) of HIV subtype C, and lower and upper 95% HPD intervals (1931–1980).