| Literature DB >> 31407606 |
Alexis Kafando1, Bouchra Serhir2, Florence Doualla-Bell2, Eric Fournier2, Mohamed Ndongo Sangaré3, Christine Martineau2, Mohamed Sylla4, Annie Chamberland4, Mohamed El-Far4, Hugues Charest1,2, Cécile L Tremblay1,2,4.
Abstract
The identification of transmission clusters (TCs) of HIV-1 using phylogenetic analyses can provide insights into viral transmission network and help improve prevention strategies. We compared the use of partial HIV-1 envelope fragment of 1,070 bp with its loop 3 (108 bp) to determine its utility in inferring HIV-1 transmission clustering. Serum samples of recently (n = 106) and chronically (n = 156) HIV-1-infected patients with status confirmed were sequenced. HIV-1 envelope nucleotide-based phylogenetic analyses were used to infer HIV-1 TCs. Those were constructed using ClusterPickerGUI_1.2.3 considering a pairwise genetic distance of ≤10% threshold. Logistic regression analyses were used to examine the relationship between the demographic factors that were likely associated with HIV-1 clustering. Ninety-eight distinct consensus envelope sequences were subjected to phylogenetic analyses. Using a partial envelope fragment sequence, 42 sequences were grouped into 15 distinct small TCs while the V3 loop reproduces 10 clusters. The agreement between the partial envelope and the V3 loop fragments was significantly moderate with a Cohen's kappa (κ) coefficient of 0.59, p < .00001. The mean age (<38.8 years) and HIV-1 B subtype are two factors identified that were significantly associated with HIV-1 transmission clustering in the cohort, odds ratio (OR) = 0.25, 95% confidence interval (CI, 0.04-0.66), p = .002 and OR: 0.17, 95% CI (0.10-0.61), p = .011, respectively. The present study confirms that a partial fragment of the HIV-1 envelope sequence is a better predictor of transmission clustering. However, the loop 3 segment may be useful in screening purposes and may be more amenable to integration in surveillance programs.Entities:
Keywords: HIV-1 transmission networks; clusters; envelope gene sequences; pairwise genetic distance
Mesh:
Substances:
Year: 2019 PMID: 31407606 PMCID: PMC6806616 DOI: 10.1089/AID.2019.0142
Source DB: PubMed Journal: AIDS Res Hum Retroviruses ISSN: 0889-2229 Impact factor: 2.205

Flowchart of HIV-1 envelope sequences used in this study. The figure presents the total number of specimens sampled (n = 262) and the final sequences obtained after amplification, sequencing, and sequence analysis processes (n = 102). EIA, enzyme immunoassay; RITA, recent infection testing algorithm.
Population Characteristics
| Gender, | |||
| Female | 1 (1.90) | 13 (28.20) | 14 (14.14) |
| Male | 52 (98.10) | 33 (71.80) | 85 (85.86) |
| Risk factor for HIV acquisition, | |||
| MSM | 36 (85.71) | 19 (48.72) | 55 (67.90) |
| Heterosexual | 0 | 3 (7.69) | 3 (3.70) |
| Relationship with at-risk heterosexual | 2 (4.76) | 2 (5.13) | 4 (4.94) |
| Endemic country | 1 (2.38) | 13 (33.33) | 14 (17.28) |
| MSM/IDU | 3 (7.14) | 0 | 3 (3.70) |
| IDU | 0 | 2 (5.13) | 2 (2.47) |
| Place of birth, | |||
| Canadian | 27 (65.85) | 19 (48.72) | 46 (57.50) |
| Caribbean | 3 (7.32) | 6 (15.38) | 9 (11.25) |
| Europe | 5 (12.20) | 1 (2.56) | 6 (7.50) |
| North Africa and Middle East | 1 (2.44) | 0 | 1 (1.25) |
| Sub-Saharan Africa | 2 (4.83) | 10 (25.64) | 12 (15) |
| Latin, Central, and South America | 4.88 | 2 (5.13) | 4 (5) |
| Asia | 1 (2.44) | 0 | 1 (1.25) |
| Aboriginal (First Nations) | 0 | 1 (2.56) | 1 (1.25) |
| HIV subtype, | |||
| B | 45 (83.30) | 36 (75.00) | 81 (79.40) |
| Non-B | 9 (17.70) | 12 (25.00) | 21 (20.60) |
| Age, | 31 [18–78] | 46.5 [21–77] | 37 [18–78] |
| 15–24 years | 12 | 2 | 14 |
| 25–34 years | 23 | 11 | 34 |
| 35–44 years | 6 | 7 | 13 |
| 45–54 years | 7 | 19 | 26 |
| 55–64 years | 4 | 5 | 9 |
| >64 years | 1 | 2 | 3 |
| Baseline CD4+ T cell count (cells/mm3), no. of individuals [median; range] | 42 [458.5; 12–1,150] | 36 [175; 5–698] | 78 [375; 5–1,150] |
| >500 | 19 [710; 506–1,150] | 3 [609; 580–698] | 22 [700; 506–1,150) |
| 351–500 | 11 [415; 390–490] | 7 [420; 360–485] | 18 [417.5;360–490) |
| 201–350 | 8 [250; 217–320] | 5 [302; 220–340] | 13 [252; 217–340) |
| ≤200 | 4 [185; 12–200] | 21 [59; 5–196] | 25 [63; 5–200] |
| HIV viral load (log10 copy/mL), no. of individuals [median; range] | 42 [4.94; 3.43–8.20] | 38 [4.76; 1.60–6.34] | 80 [4.90;1.60–8.20] |
| >5 | 20 [5.53; 5–8.20] | 15 [5.60; 5.15–6.34] | 35 [5.60; 5–8.20] |
| <5 | 22 [4.46; 3.43–.96] | 23 [4.25; 1.60–4.98] | 45[4.27;1.60–4.98] |
Table presents the repartition of demographic, epidemiologic, clinical, and risk factors of the study population by infection status (recent or chronic).
IDU, injecting drug user; MSM, men who have sex with men.

Molecular phylogenetic reconstruction of HIV-1 TCs among newly and chronically HIV-infected individuals in 2015 in Quebec using partial env-derived sequences (1,070 bp). The cluster tree contains 101 sequences from HIV-1-infected individuals, including 98 from the study cohort and 3 reference sequences introduced in the analyses as controls (AF443074.1_C, AY426110_BaL, and K03455.1-HXB2). A total of 42 HIV-1 individuals' sequences are grouped into 16 distinct small HIV-1 TCs. These clusters are labeled from Clust1 to Clust16 and are depicted using different colors. The cohort-derived clusters are shown in blue for subtype B (Clust1 to Clust15, except Clust4) and orange for subtype D (Clust16). The control sequence names are shown in red (Clust4). Sequences that did not form a cluster are shown in black. Each depicted cluster satisfied ≥99% support on bootstrap resampling of 1,000 replicates and 10% of pairwise distance.[21] CH, chronic HIV-1 infection sequences; p24, recent infection status determined by p24 antigen positivity; RITA, recent infection status determined by RITA[29]; TCs, transmission clusters. Color images are available online.

Molecular phylogenetic reconstruction of HIV-1 TCs among newly and chronically HIV-infected individuals using env gp120 V3 loop-derived sequences (108 bp). The cluster tree contains 101 sequences from HIV-1-infected individual's env-V3 sequences, including 98 from the study cohort and 3 reference sequences introduced in the analyses as controls (AF443074.1_C, AY426110_BaL, and K03455.1-HXB2). Each tip represents an individual patient. A total of 31 HIV-1 sequences grouped into 11 distinct small HIV-1 TCs with two to five members per cluster. These clusters are labeled from Clust1 to Clust11 and are depicted using different colors. Clusters including sequence names are shown in blue for subtype B (Clust2 to Clust11) and orange for subtype D (Clust1). The control sequence names are shown in red. Sequences that did not cluster are shown in black. Each depicted cluster satisfied ≥99% support on bootstrap resampling of 1,000 replicates and 10% of pairwise distance.[21] CH, chronic HIV-1 infection sequences; p24, recent infection status determined by p24 antigen positivity; RITA, recent infection status determined by RITA.[29] Color images are available online.

Tanglegrams of rooted phylogenetic trees and networks comparing the HIV-1 envelope gp120 loop 3 (a) with the HIV-1 partial envelope length (b) sequence-based clustering. The linked clusters and sequences between the two trees are connected by a gray line.
Performance of the env gp120-V3 Sequences in Determining HIV Transmission Clustering Compared with the Use of the Full env Fragment
| Sensitivity | 61.9 (26/42) | 45.6–76.4 |
| Specificity | 95.0 (57/60) | 86.1–99 |
| Positive predictive value | 89.7 (26/29) | 72.6–97.8 |
| Negative predictive value | 78.1 (57/73) | 66.9–86.9 |
Table presents the performance of the env gp120-V3 sequence-based clustering (108 bp) using partial HIV-1 envelope fragment length (1,070 bp) as the gold standard. The sensitivity and specificity were 61.9% (45.6–76.4) and 95% (86.1–99), respectively. In addition, the positive predictive value and negative predictive value were 89.7% (72.6–97.8) and 78.1% (66.9–86.9), respectively. Furthermore, significant and moderate agreement was documented using Cohen's kappa coefficient (κ = 0.59, p < .00001).
CI, confidence interval; n, number of clusters identified by env gp120 V3 loop-derived sequences; N, number of clusters identified by the gold standard.
Summary Statistics of the Distribution of HIV-1 Transmission Clusters by Factors
| Age | ||
| <38 | 23 (39.66) | 27 (65.85) |
| ≥38 | 35 (60.34) | 14 (34.15) |
| Gender | ||
| Male | 46 (79.31) | 39 (95.12) |
| Female | 12 (20.69) | 2 (4.88) |
| HIV-subtype | ||
| B | 42 (70) | 39 (92.86) |
| Non-B | 18 (30) | 3 (7.14) |
| Ethnicity | ||
| White | 34 (85) | 25 (83,33) |
| Black | 16 (4) | 5 (16.16) |
| Motivation for HIV testing | ||
| Symptomatic person | 25 (50) | 16 (51.61) |
| Screening in an asymptomatic person | 17 (24) | 13 (41.94) |
| Confirmation of previous positive test | 8 (16) | 2 (6.45) |
| HIV-1 clinical status | ||
| Asymptomatic | 19 (38) | 15 (48.39) |
| Acute infection | 18 (36) | 12 (38.71) |
| Symptomatic infection + AIDS | 13(26) | 4 (12.90) |
| HIV transmission risk factors | ||
| MSM | 32 (64) | 23 (74.19) |
| MSM/IDU+IDU | 3 (6) | 2 (6.45) |
| Heterosexual | 15(30) | 6 (19.35) |
| Log[ | ||
| <5 | 29 (59.18) | 15 (48.39) |
| ≥5 | 20 (40.82) | 16 (51.61) |
| CD4 count value | ||
| <350 | 24 (51.06) | 14 (45.16) |
| ≥350 | 23 (48.94) | 17 (54.84) |
| HIV infection status by test | ||
| Chronic infection by RITA[ | 33 (55) | 15 (35.71) |
| Acute/early infection by EIA-p24 testing | 9 (15) | 10 (23.81) |
| Recent by RITA[ | 18 (30) | 17 (40.48) |
| HIV-1 infection status | ||
| Recent (acute/early recent by RITA) | ||
| Chronic | ||
Bold values illustrate the number of recent versus chronic HIV-1 infection sequences include or non-include in HIV-1 transmission clusters.
Table presents the summary statistics of the demographic, epidemiologic, clinical, and risk factors of HIV-1-infected individuals associated with inclusion in or not in clusters.
EIA, enzyme immunoassay; RITA, recent infection testing algorithm.
Factors Associated with HIV-1 Transmission Clustering Using Logistic Regression Analysis
| n | p | P | |||
|---|---|---|---|---|---|
| Ethnicity | |||||
| Black | 21 | .74 | NS | ||
| Nonblack | 60 | Ref. | |||
| Gender | |||||
| Male | 85 | .041 | NS | ||
| Female | 14 | Ref | |||
| Age (mean) | |||||
| <38.8 years | |||||
| ≥38.8 years | 47 | Ref. | |||
| HIV transmission route | |||||
| MSM intercourse | 55 | .18 | |||
| Heterosexual intercourse | 21 | .83 | |||
| IDU and MSM/IDU intercourse | 5 | Ref. | |||
| Motivation for HIV testing | |||||
| Screening in an asymptomatic person | 30 | .20 | |||
| Symptomatic person | 41 | .27 | |||
| Confirmation of previous positive test | 10 | Ref. | |||
| Clinical status | |||||
| Acute infections | 13 | .94 | |||
| Chronic infection symptoms | 6 | .18 | |||
| Nonspecific diseases and symptoms | 17 | .38 | |||
| AIDS | 11 | .23 | |||
| Asymptomatic | 34 | Ref. | |||
| HIV subtype | |||||
| B | 81 | ||||
| Non-B | Ref. | ||||
| HIV-1 infection status | |||||
| Recent by RITA[ | 54 | .057 | |||
| chronic by RITA[ | 48 | Ref | |||
| CD4 cell count (cells/mm3) | |||||
| ≥500 | 22 | .895 | |||
| <500 | 56 | Ref | |||
| HIV-1 viral load (copies/mL) | |||||
| ≥100,000 | 36 | .345 | |||
| <100,000 | 44 | Ref. | |||
| HIV-1 viral load (copies/mL) | |||||
| ≥1,500 | 77 | .337 | |||
| <1,500 | 3 | Ref | |||
| VL_log10 | NS | ||||
| >5 | 36 | .345 | |||
| <5 | 44 | Ref | |||
Table presents results of logistic regression analysis, including bivariate analysis showing that the HIV-1 subtype, gender, and mean age of infected individuals were statistically significant (p < .05). However, all factors with a p value ≤.25 were subjected to multivariate analysis. For multivariate analysis, HIV-1 subtype (B vs. non-B), OR = 0.25 (0.04–0.66), p = .002, and patient mean age (<38.8 years compared with those aged ≥38.8 years), OR = 0.17 (0.10–0.61), p = .011, constitute two factors significantly associated with HIV-1 transmission clustering in this study population.
Bold values illustrate statistically significant results.
Ref: variable of comparison. The statistically significant variables were indicated by their p values.
NS, statistically nonsignificant; OR, odds ratio; S, statistically significant.