| Literature DB >> 29295533 |
Raghavan Sampathkumar1,2, Joel Scott-Herridge3, Binhua Liang4,5, Joshua Kimani6,7, Francis A Plummer8,9, Ma Luo10,11.
Abstract
Within the Pumwani sex worker cohort, a subgroup remains seronegative, despite frequent exposure to HIV-1; some of them seroconverted several years later. This study attempts to identify viral variations in 5'LTR-leader sequences (5'LTR-LS) that might contribute to the late seroconversion. The 5'LTR-LS contains sites essential for replication and genome packaging, viz, primer binding site (PBS), major splice donor (SD), and major packaging signal (PS). The 5'LTR-LS of 20 late seroconverters (LSC) and 122 early seroconverters (EC) were amplified, cloned, and sequenced. HelixTree 6.4.3 was employed to classify HIV subtypes and sequence variants based on seroconversion status. We find that HIV-1 subtypes A1.UG and D.UG were overrepresented in the viruses infecting the LSC (P < 0.0001). Specific variants of PBS (Pc < 0.0001), SD1 (Pc < 0.0001), and PS (Pc < 0.0001) were present only in the viral population from EC or LSC. Combinations of PBS [PBS-2 (Pc < 0.0001) and PBS-3 (Pc < 0.0001)] variants with specific SD sequences were only seen in LSC or EC. Combinations of A1.KE or D with specific PBS and SD variants were only present in LSC or EC (Pc < 0.0001). Furthermore, PBS variants only present in LSC co-clustered with PBS references utilizing tRNAArg; whereas, the PBS variants identified only in EC co-clustered with PBS references using tRNALys3 and its variants. This is the first report that specific PBS, SD1, and PS sequence variants within 5'LTR-LS are associated with HIV-1 seroconversion, and it could aid designing effective anti-HIV strategies.Entities:
Keywords: 5’LTR-leader sequence; HIV subtypes; HIV/AIDS; genetic diversity; packaging signal; primer binding site sequences; seroconversion; splicing donor sequences
Mesh:
Substances:
Year: 2017 PMID: 29295533 PMCID: PMC5795417 DOI: 10.3390/v10010004
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1HIV-1 5′LTR leader sequences. (A). A schematic sketch of 5′LTR-leader sequence variant positions analyzed in this study; (B). a schematic sketch of secondary structure of HIV-1 5′leader sequence.
Figure 2HIV-1 subtype distribution among late seroconverters and early seroconverters. *** : p value is less the 0.0001.
Comparison of HIV-1 subtype distribution among late seroconverters and early seroconverters.
| HIV-1 Subtype | A1 | A1.KE | A1.UG | B | C | D | D.UG | Total | Ave Seq./ind. | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A | All 142 individuals | No. | 30 | 2162 | 135 | 97 | 263 | 945 | 46 | 3678 | ||
| % | 0.8 | 58.8 | 3.7 | 2.6 | 7.2 | 25.7 | 1.3 | 100 | ||||
| B | Late seroconverter ( | No. | 0 | 452 | 91 | 0 | 0 | 211 | 46 | 800 | 40 | <0.0001 |
| % | 0 | 56.5 | 11.4 | 0 | 0 | 26.4 | 5.8 | 100 | ||||
| Early seroconverter ( | No. | 30 | 1710 | 44 | 97 | 263 | 734 | 0 | 2878 | 30 | ||
| % | 1.0 | 59.4 | 1.5 | 3.4 | 9.1 | 25.5 | 0 | 100 |
Figure 3Classification of primer binding sequence (PBS) variants according to whether they were identified from late seroconverters (u = 1) or early seroconverters (u = 0). Note: question mark denotes lack of sequence. u—mean value; s—standard deviation; se—standard error; mse—mean square error; p—p value; aP—adjusted p value; bP—Bonferroni corrected p value. (A) Classification tree; (B) PS variants and their distribution with seroconversion status; (C) PBS variants frequencies.
Figure 4Classification of splicing donor (SD) sequence variants according to whether they were identified from late seroconverters (u = 1) or early seroconverters (u = 0). Note: question mark denotes lack of sequence. u—mean value; s—standard deviation; se—standard error; mse—mean square error; p—p value; aP—adjusted p value; bP—Bonferroni corrected p value. (A) Classification tree; (B) PS variants, and their distribution with seroconversion status; (C) SD variants frequencies.
Figure 5Classification of packaging signal (PS) sequence variants according to whether they were identified from late seroconverters (u = 1) or early seroconverters (u = 0). Note: question mark denotes lack of sequence. u—mean value; s—standard deviation; se—standard error; mse—mean square error; p—p value; aP—adjusted p value; bP—Bonferroni corrected p value. (A) Classification tree; (B) PS variants and their distribution with seroconversion status; (C) PS variants frequencies. “?” indicate the absence of the sequences”.
PBS, SD1, and PS sequences listed in Figure 3, Figure 4 and Figure 5.
| Group | Sequences | Subtypes | Frequency in EC or LSC |
|---|---|---|---|
| PBS-1 | GAGAAAGTTAACAGGGAC, | A1.KE (1.1%), A1.UG (93.67%), D (3.2%), D.UG (2.1%) | 0/122 EC, 5/12 LSC |
| PBS-4 | TG-GCGCC-GAACAGGGA, | A1 (2.2%), A1.KE (38%), B (2.2%), C (12.0%), D (42%) | 36/122 EC, 0/20 LSC |
| SD1-1 | AAGGCGAGTAC, GAGGTGAGTAC, CTAGGTGAGTAC, CTAGGTGGGTAC, CTGGTAGGTGC, ACGGTGTTTAC, ATGGTGAGTAC, ACGGTGTGTAC, ACGGTGAATAC | A1.KE (92.4), D (6.2%), D.UG (1.4%) | 0/122 EC, 6/20 LSC |
| SD1-5 | ACGGTAAGTAC, CGGGGGAGTAC, TTGGTGAGTAC, | A1 (1.4%), A1.KE (90%), B (1.4%), C (2.9%), D (4.3%) | 15/122 EC, 0/20 LSC |
| PS-1 | AGTG, | A1.KE (50%), A1.UG (16.7%), D (33.3%) | 0/122 EC, 6/20 LSC |
| PS-3 | GGAA, | A1.KE (66.8%), A1.UG (1.3%), C (24.1%), D (7.8%) | 30/122, EC, 4/20 LSC |
Notes: ? denotes lack of sequence.
Figure 6The combination of PBS variants (PBS-2) and splicing donor sequences identified only in either late (u = 1) or early seroconverters (u = 0). (A) PBS sequence variants in tree node of PBS-2 were further classified with SD sequence variants based on whether they were identified from late or early seroconverters; (B) the figure shows u value of sequence combinations based on whether the PBS-SD combinations were identified from late (1) or early (0) seroconverters. Note: n—counts (clone); u—mean value; s—standard deviation; se—standard error; mse—mean square error; p—p value; aP—adjusted p value; bP—Bonferroni corrected p value. “?” denotes lack of sequences”.
Specific subtype, PBS, and SD variant combinations in late or early seroconverters.
| Subtypes or PBS | SD or PBS | Seroconverter |
|---|---|---|
| TGGCGCCCGAACAGGGGC TGGCGCCCGAACAGGGTC TGGCGCCCGAATAGGGAC TGGCGCCCGAACAGGAAC TGGCGCCCGCACAGGGAC? | CTGGTGAGTAC | LSC |
| CAGGTGAGTAC | EC | |
| TGGCGCCCGAACAGGGAC | AAGGCGAGTAC | LSC |
| ACGGTGAGTGC | EC | |
| Subtype D | AAGGCGAGTAC | LSC |
| ACGGTAAGTAC | EC | |
| Subtype D | TGGCGCCCGAACAGGGTC | LSC |
| TGGCGCCGGAACAGGGAC | EC | |
| A1.KE | CTAGGTGAGTAC | LSC |
| ACGGTGAGTAC | EC |
Note: ‘?’ denotes lack of sequence.
Figure 7The combination of PBS variants (PBS-3) and splicing donor sequences identified only in either late (u = 1) or early seroconverters (u = 0). (A) PBS sequence variants in tree node of PBS-3 were further classified with SD sequence variants based on whether they were identified from late or early seroconverters. (B) The figure shows u value of sequence combinations based on whether the PBS–SD combinations were identified from late (1) or early (0) seroconverters. Note: n—counts (clone); u—mean value; s—standard deviation; se—standard error; mse—mean square error; p—p value; aP—adjusted p value; bP—Bonferroni corrected p value. (C) PBS3–SD variants frequencies. “?” denotes: lack of sequences”.
Figure 8The combination of HIV subtype D and splicing donor sequences (SD) identified only in either late (u = 1) or early seroconverters (u = 0). (A) HIV subtype D in tree node of subtype D was further classified with SD sequence variants based on whether they were identified from late (u = 1) or early seroconverters (u = 0); (B) the figure shows u value of sequence combinations based on whether the subtype D–SD combinations were identified from late (1) or early (0) seroconverters. Note: n—counts (clone); u—mean value; s—standard deviation; se—standard error; mse—mean square error; p—p value; aP—adjusted p value; bP—Bonferroni corrected p value; (C) HIV subtype frequency; (D) Subtype D–SD frequencies in subtype D.
Figure 9The combination of HIV subtype D and primer binding sequences (PBS) identified only in either late (u = 1) or early seroconverters (u = 0). (A) HIV subtype D in tree node of subtype D was further classified with PBS sequence variants, based on whether they were identified from late or early seroconverters. (B) The figure shows u value of sequence combinations based on whether the subtype D–PBS combinations were identified from late (1) or early (0) seroconverters. Note: n—counts (clone); u—mean value; s—standard deviation; se—standard error; mse—mean square error; p—p value; aP—adjusted p value; bP—Bonferroni corrected p value; “?” denotes the lack of sequence. (C) HIV subtype frequency. (D) Subtype D–PBS frequencies in subtype D.
Figure 10The combination of HIV subtype A1.KE and splicing donor sequences (SD) identified SD sequences only in either late (u = 1) or early seroconverters (u = 0). (A) HIV subtype A1.KE in tree node of subtype A1.KE was further classified with SD sequence variants based on whether they were identified from late or early seroconverters. (B) The figure shows u value of sequence combinations based on whether the subtype A1.KE-SD combinations were identified from late (1) or early (0) seroconverters. Note: n—counts (clone); u—mean value; s—standard deviation; se—standard error; mse—mean square error; p—p value; aP—adjusted p value; bP—Bonferroni corrected p value; “?”: denotes lack of sequence. (C) HIV subtype frequency. (D) Subtype A1.KE-SD frequencies in subtype A1.KE.
Later seroconverters with HIV subtypes, PBS, SD, PS variants that are enriched or only identified in LSC.
| mlno | PBS1 | SD1 | PS1 | PBS-2/SD-1 | PBS2/A1.UG | PBS-3/SD-1 | PBS-3/SD-2 | SD2 | A1-UG/D.UG | A1.KE/SD1 | D-SD-1 | D-PBS-1 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 37 | + | |||||||||||
| 58 | + | + | + | + | + | |||||||
| 290 | + | + | + | |||||||||
| 452 | + | + | + | |||||||||
| 546 | + | + | + | |||||||||
| 768 | + | + | + | + | ||||||||
| 814 | + | + | + | |||||||||
| 825 | ||||||||||||
| 888 | + | + | + | + | + | + | + | |||||
| 890 | + | + | + | + | + | |||||||
| 1072 | + | + | + | |||||||||
| 1102 | + | + | + | + | ||||||||
| 1232 | + | + | ||||||||||
| 1248 | ||||||||||||
| 1250 | + | + | + | + | ||||||||
| 1287 | + | + | + | + | + | |||||||
| 1430 | + | |||||||||||
| 1626 | + | + | + | |||||||||
| 1707 | ||||||||||||
| 1730 |
Note: ‘+’ denotes presence of indicated sequences or subtypes, or their combinations.
HIV-1 subtype classification of 5′LTR-leader variants of viruses infecting late seroconverters.
| mlno | A1.KE | A1.UG | D | D.UG |
|---|---|---|---|---|
| 37 | * | |||
| 58 | * | |||
| 290 | * | |||
| 452 | * | |||
| 546 | * | |||
| 768 | * | * | ||
| 814 | * | |||
| 825 | * | * | ||
| 888 | * | |||
| 890 | * | |||
| 1072 | * | * | ||
| 1102 | * | |||
| 1232 | * | |||
| 1248 | * | |||
| 1250 | * | |||
| 1287 | * | |||
| 1430 | * | * | ||
| 1626 | * | |||
| 1707 | * | |||
| 1730 | * |
Note: ‘*’ denotes presence of indicated subtypes.
Figure 11Molecular phylogenetic analysis of PBS sequence variants identified only in late (PBS-1 as PBS-L) or early (PBS-4 as PBS-E) seroconverters by maximum likelihood method with reference sequences.