Literature DB >> 25889106

HIV-1 diversity in an antiretroviral treatment naïve cohort from Bushbuckridge, Mpumalanga Province, South Africa.

Patrick Wela Msimanga^1,2, Efthyia Vardas^3,4, Susan Engelbrecht^5,6.

Abstract

BACKGROUND: South Africa has a generalized and explosive HIV/AIDS epidemic with the largest number of people infected with HIV-1 in the world. Molecular investigations of HIV-1 diversity can help enhance interventions to contain and combat the HIV/AIDS epidemic. However, many studies of HIV-1 diversity in South Africa tend to be limited to the major metropolitan centers and their surrounding provinces. Hardly any studies of HIV diversity have been undertaken in Mpumalanga Province, and this study sought to investigate the HIV-1 diversity in this province, as well as establish the occurrence and extent of transmitted antiretroviral drug resistance mutations.
METHODS: HIV-1 gag p24, pol p10 and p66/p51, pol p31 and env gp41 gene fragments from 43 participants were amplified and sequenced. Quality control on the sequences was carried out using the LANL QC online tool. HIV-1 subtype was preliminary assigned using the REGA 3.0 and jpHMM online tools. Subtype for the pol gene fragment was further designated using the SCUEAL online tool. Phylogenetic analysis was inferred using the Maximum Likelihood methods in MEGA version 6. HIV-1 antiretroviral drug resistance mutations were determined using the Stanford database.
RESULTS: Phylogenetic analysis using Maximum Likelihood methods indicated that all sequences in the study clustered with HIV-1 subtype C. The exception was one putative subtype BC unique recombinant form. Antiretroviral drug resistance mutations K103N and E138A were also detected, indicating possible transmission of anti-retroviral drug resistance mutations.
CONCLUSIONS: The phylogenetic analysis of the HIV sequences revealed that, by 2009, patients in the Bushbuckridge, Mpumalanga were predominantly infected with HIV-1 subtype C. However, the generalized, explosive nature of the HIV/AIDS epidemic in South Africa, in the context of extensive mobility by South Africans who inhabit rural areas, renders the continued molecular monitoring and surveillance of the epidemic imperative.

Entities: CellLine Chemical Disease Gene Mutation Species

Mesh：

Substances：
Anti-Retroviral Agents

Year: 2015 PMID： 25889106 PMCID： PMC4340098 DOI： 10.1186/s12985-015-0244-1

Source DB: PubMed Journal: Virol J ISSN： 1743-422X Impact factor: 4.099

Background

Human immunodeficiency virus (HIV), the etiological agent of acquired immunodeficiency syndrome (AIDS), was first isolated more than 30 years ago [1]. By 2013, an estimated 35 million people were living with HIV-1 globally, of which 24.7 million were living in sub Saharan Africa [2]. During this time period, the HIV-1 prevalence in South Africa was 12.2% (6.4 million people), with 469 000 new infections occurring, suggesting that the epidemic is not only generalized, but also explosive [3]. The HIV-1 epidemic in South Africa is characterized by limited subtype diversity with subtype C accounting for the majority of infections [4,5]. Other non-C subtypes, particularly subtypes B and D, have also been identified [6-8] as well as the occasional unique recombinant forms (URFs) [9-15]. Molecular epidemiological investigations in South Africa have largely focused on provinces with major metropolitan centers such as Johannesburg in Gauteng, Cape Town in the Western Cape and Durban in Kwa-Zulu Natal. No subtype information is available for the Eastern Cape, North West and Northern Cape provinces and limited information is available for the Free State, Limpopo and Mpumalanga Provinces. HIV-1 prevalence in South Africa is also characterized by extreme heterogeneity and there is considerable variation in prevalence amongst the different provinces and districts in each province [16]. The highest prevalence is in Kwa-Zulu Natal with the lowest in the Western Cape Province. South Africa not only has a generalized and explosive HIV/AIDS epidemic, its impact also varies significantly in terms of race, age, gender, and between regions of the country, with poor, young, African women in rural Kwa-Zulu Natal bearing a disproportionate burden of HIV infection [16]. The overall HIV prevalence in Mpumalanga in 2012 was 35.6% [16]. The province consists of 3 districts: Ehlanzeni, Nkangala and Gert Sibande. The Bushbuckridge Local Municipality in the Ehlanzeni District in Mpumalanga Province is a predominantly rural, impoverished area, with only 14% of the adult population employed and over 85% of households living below the house hold subsistence level. Half of males and 14% of females between the ages of 25 and 59 are long-term migrant workers and provide a source of remittances, which comprise the largest proportion of the income of the population of Bushbuckridge [17]. Molecular investigations of HIV diversity can help enhance interventions to contain and combat the HIV-1 epidemic. With this study, we investigated for the first time, HIV-1 diversity in Bushbuckridge, Mpumalanga, as well as the possible occurrence and extent of transmitted antiretroviral drug resistance mutations.

Methods

Study population and RNA extraction

In preparation for HIV prevention trials, a cohort was developed for enrollment. Ethics approval were obtained from the Human Research Ethics Committees (HRECs) from the University of the Witwatersrand (M061129) and Stellenbosch University (N11/02/054), following internationally recognized guidelines. The entry point for this cohort was via a free voluntary counseling and testing service. After HIV testing, individuals were offered the opportunity to be part of the pre-screening cohort. Both HIV negative and HIV positive individuals were allowed to join the cohort in preparation for preventative and therapeutic HIV vaccine trials. Fifty-one samples were obtained with informed consent as part of this pre-screening protocol from 43 HIV positive participants in Bushbuckridge, Mpumalanga (Figure 1). RNA was extracted from stored plasma samples using a QIAamp MinElute Virus Spin Kit in a QIAcube automated extractor (QIAGEN, Dusseldorf, Germany), according to the manufacturer’s instructions. RNA samples were stored at −70°C until used.

Figure 1

Geographical location of samples collected in this study. The South African map with 9 provinces is indicated and the Bushbuckridge local municipality in the Ehlanzeni district of Mupumalanga is enlarged. The “Maputo corridor” or N4 trunk roadway is highlighted in blue.

Reverse transcriptase polymerase chain reaction (RT-PCR) of HIV-1 gene fragments

Four genomic regions were targeted for amplification: the gag p24 region (HXB2 nucleotides 1248 to 1707); a part of the pol gene, that includes the Protease (PR) and a partial segment of the Reverse Transcriptase (RT) region (HXB2 nucleotides 2114 to 3335), the Integrase (IN) region (HXB2 nucleotides 4202 to 5096) and the partial env gp 41 region (HXB2 nucleotides 7877 to 8282). PCR amplification and purification was done using previously described primers and methods for the partial gag, pol integrase (IN) and env [18] genes. The partial pol PR/RT gene was also amplified using primers and a method previously described [19,20]. Briefly, cDNA synthesis and first round PCR amplification was done with the Access-RT PCR system (Promega, Wisconsin, USA), while second round nested PCR amplification was done with the GoTaq DNA polymerase system (Promega, Wisconsin, USA). The oligonucleotide primers used in the amplification of the gene fragments are listed in Table 1.

Table 1

Primers used in the amplification of the partial HIV-1 gene products

Primers ^#	Primer sequence 5’ to 3’	HXB2 position*
p24-1 (gag)	AGYCAAAATTAYCCYATAGT	1174 - 1193
p24-2 (gag)	AGRACYTTRAAYGCATGGGT	1237 - 1256
p24-6 (gag)	TGTGWAGCTTGYTCRGCTC	1673 - 1654
p24-7 (gag)	CCCTGRCATGCTGTCATCA	1844 - 1826
PR-5′ prot-1 (pol)	TAATTTTTTAGGGAAGATCTGGCCTTCC	2082 - 2109
PR-5′ prot-2 (pol)	TCAGAGCAGACCAGAGCCAACAGCCCCA	2136 - 2163
RT-NE135 (pol)	CCTACTAACTTCTGTATGTCATTGACAGTCCAGCT	3334 - 3300
RT-MJ4 (pol)	CTGTTAGTGCTTTGGTTCCTCT	3420 - 3399
Poli 5 (integrase)	CACACAAAGGRATTGGAGGAAATG	4162 - 4185
poli7 (integrase)	AACAAGTAGATAAATTAGTCAGT	4186 - 4208
poli6 (integrase)	ATACATATGRTGTTTTACTAARCT	5130 - 5107
poli8 (integrase)	TAGTGGGATGTGTACTTCTGAAC	5217 - 5195
JH41 (env)	CAGCAGGWAGCACKATGGG	7798 - 7816
Env 27 F (env)	CTGGYATAGTGCARCARCA	7861 - 7879
Menv19 R (env)	AARCCTCCTACTATCATTATRA	8299 - 8278
JH38 (env)	GGTGARTATCCCTKCCTAAC	8346 - 8365

#Primer References [18,20].

*Nucleotide position of the primer according to the HXB2 sequence (K03455) numbering.

Primers used in the amplification of the partial HIV-1 gene products #Primer References [18,20]. *Nucleotide position of the primer according to the HXB2 sequence (K03455) numbering.

Sequencing of HIV-1 gene fragments

The cycle sequencing reactions of the partial gene fragments were done with the Big Dye® Terminator v 3.1 Cycle Sequencing Kit (Applied BioSystems, Foster City, CA, USA) and run on an ABI Prism 3130xl Genetic Analyzer (Applied Biosystems, Foster City, CA, USA), according to the manufacturer’s instructions. Both strands were sequenced using overlapping primers. Sequencher v 5.1 (Gene Codes Corporation, Ann Arbor, MI, USA) was used to assemble the trace data into contiguous fragments, which were then verified, edited and saved as text files for subsequent analysis. All sequences were checked for quality assurance using the Los Alamos HIV-1 Sequence Quality Analysis tool (http://www.hiv.lanl.gov/content/sequence/QC/index.html) before further analyses and submission to GenBank.

Preliminary HIV-1 subtyping using online tools

The REGA HIV-1 subtyping Tool Version 3.0 (http://dbpartners.stanford.edu:8080/RegaSubtyping/stanford-hiv/typingtool) was used to preliminary subtype the sequences [21]. To detect recombinants, we used the jumping profile Hidden Markov Model (jpHMM-HIV) tool (http://jphmm.gobics.de) [22]. Subtype Classification Using Evolutionary Algorithms (SCUEAL) was used to test for both intra and inter subtype recombinants in the partial HIV-1 pol and IN sequences (http://www.datamonkey.org/dataupload_scueal.php) [23].

Phylogenetic inference using Maximum likelihood (ML)

The 2010 HIV-1 Group M reference sequence dataset (n = 39), from the LANL database (http://www.hiv.lanl.gov/), was used to subtype our sequences, using phylogenetic inference. Multiple sequence alignments, comprising the partial gag, pol and env sequences, and the reference dataset, were constructed using MAFFT v7.017 [24] as implemented in Geneious version R7 created by Biomatters (http://www.geneious.com). These multiple sequence alignments were subsequently codon aligned using Codon Alignment v1.1.0 (http://www.hiv.lanl.gov/content/sequence/CodonAlign/codonalign.html) and manually checked. The phylogenetic trees for the different HIV-1 genetic fragments were inferred using ML methods implemented in MEGA version 6 [25]. To find the most appropriate evolutionary model for phylogenetic inference, we used Model Selection (ML) as implemented in MEGA [25]. For each model, BIC scores (Bayesian Information Criterion), AICc value (Akaike Information Criterion, corrected), Maximum Likelihood value (lnL), and a number of different parameters were presented. Models with the lowest BIC scores were considered to describe the substitution pattern the best [25]. For the partial pol PR/RT region, the Integrase (IN) region and the partial env gp 41 region, the BIC, AICc and lnL scores indicated that the General Time Reversible model of evolution with Gamma distribution and invariant rate among sites (GTR + G + I), was the best model. For the gag region, the lnL method indicated the use of the GTR + G + I model and BIC and AICc indicated the use of the TN93 + G + I model. All nucleotide positions in the alignments with less than 95% site coverage were eliminated, thus fewer than 5% alignment gaps, missing data, and ambiguous bases were allowed at any position. The reliability of the inferred trees was evaluated using bootstrap resampling and branches with a bootstrap value of 70% or greater were considered reliable (n = 100) [26].

HIV-1 antiretroviral drug resistance mutations using HIVdb

HIV-1 PR and RT antiretroviral drug resistance mutations were determined using the Stanford University HIV Drug Resistance Database (HIVdb), http://www.hivdb.stanford.edu/ [27].

GenBank accession numbers

GenBank accession numbers of the gag sequences were KM218392 to KM218428; pol sequences, KM218448 to KM218460; integrase sequences, KM218429 to KM218447 and for the env sequences, KM218357 to KM218391.

Results

Demographic information

The demographic and clinical information of the cohort, together with the subtyping, are summarized in Table 2. The study involved 51 plasma samples, collected from 43 participants in Bushbuckridge, between February and July 2009. Forty samples were collected at the recruitment visit and 11 samples at visit one. Only one sample per participant was included in the study. All participants, except for 0064A and 0206A, were female and none were on HIV-1 antiretroviral treatment. The average age of the cohort was 26.7 years and ranged from 16 to 41 years. The CD4 lymphocyte count ranged from 105 to 1263 with an average of 450.

Table 2

Demographic and clinical information of the participants

Study number	Collection Date	Age	Gender	CD4 count
0005A	23/04/2009	16	Female	466
0022A	24/04/2009	32	Female	731
0038	16/03/2009	31	Female	680
0039	17/03/2009	19	Female	307
0040	23/03/2009	23	Female	506
0042A	19/02/2009	26	Female	511
0064A	16/03/2009	25	Male	105
0066A	17/03/2009	22	Female	583
0073	02/04/2009	24	Female	366
0081A	08/04/2009	23	Female	437
0085	14/04/209	32	Female	154
0092A	16/04/2009	23	Female	137
0097A	17/04/2009	25	Female	243
0098A	20/04/2009	24	Female	261
0101A	20/04/2009	30	Female	790
0103	20/04/2009	22	Female	191
0116A	29/07/2009	26	Female	150
0119A	29/04/2009	28	Female	403
0122A	30/04/2009	33	Female	262
0123A	04/05/2009	34	Female	335
0130A	06/05/2009	29	Female	387
0132A	07/05/2009	25	Female	489
0134A	11/05/2009	26	Female	792
0135A	11/05/2009	23	Female	1263
0136A	11/05/2009	23	Female	1192
0143A	25/05/2009	30	Female	353
0147A	27/05/2009	19	Female	785
0152A	03/06/2009	20	Female	691
0165A	22/06/2009	32	Female	198
0173A	29/06/2009	27	Female	367
0185A	02/07/2009	30	Female	311
0189	20/07/2009	33	Female	560
0190A	06/07/2009	37	Female	229
0192A	06/07/2009	41	Female	522
0193A	06/07/2009	29	Female	313
0198A	07/07/2009	27	Female	217
0199A	07/07/2009	30	Female	219
0203A	08/07/2009	25	Female	401
0204A	08/07/2009	21	Female	349
0206A	08/07/2009	31	Male	314
0207A	08/07/2009	22	Female	733
0211	08/07/2009	31	Female	312
0215A	09/07/2009	22	Female	726

Demographic and clinical information of the participants

PCR amplification, sequence data and quality assurance

PCR amplification was successful for most of the samples, with 93% (n = 40) of the partial gag p24 gene, 48.8% (n = 21) of the partial IN p32 gene, and 83.7% (n = 36) of the partial env gp41 showing positive bands in an agarose gel after electrophoresis. However, the PCR amplification of the partial pol PR/RT gene was considerably less successful at 34.8% (n = 15). PCR amplification of 10 samples were positive in all 4 gene regions and only one sample, 0116A, could not be amplified in any of the primers. Thirty-seven (86.0%) of the gag p24, 13 (30.2%) of the pol, RT/PR, 18 (41.8%) of the pol IN and 35 (81.3%) of the env gp41 amplicons were successfully sequenced. The LANL QC tool indicated no stop codons, and no hypermutation was detected in any of the sequences.

Preliminary subtype analysis using online tools

REGA and jpHMM online tools were used to assign subtypes to all the sequences and to detect possible recombinant forms. REGA 3.0 assigned all gag, pol PR/RT, pol IN, and env sequences to subtype C, except for env 0143A, which was assigned subtype B. Similar results were obtained with jpHMM, with the exception of the IN region of 0193A which was assigned as a CK recombinant form. The SCUEAL subtyping of the pol PR/RT and IN gene fragments revealed that all the PR/RT and IN sequences were HIV-1 subtype C. Six of the sequences (18.75%) were intra-subtype C recombinant forms (Table 3).

Table 3

Intra-subtype C recombinants detected using SQUEAL

Sample	Confidence	Recombination	Intra subtype recombination	Breakpoints
0040_pol	0.749277	0.750944	0.750935	112 (111–113); 770 (769–771)
0042A_pol	0.917024	0.945140	0.945136	818 (797–839)
0143A_pol	0.691903	0.957455	0.957454	706 (701–711); 846 (844–848)
0173A_pol	0.736986	0.748732	0.748732	316 (250–382)
0040_IN	0.673024	0.690546	0.690544	551 (499–603)
0098A_IN	0.794292	0.999993	0.999127	172 (171–173); 383 (382–384); 736 (730–742)

Intra-subtype C recombinants detected using SQUEAL

ML Phylogenetic inference

Model Selection (ML) using the BIC, implemented in MEGA, indicated the use of the (GTR + G + I) model for the pol and env regions and the use of the TN93 + G + I model for the gag region (Additional files 1, 2, 3 and 4: Table S1, Table S2, Table S3 and Table S4. Maximum Likelihood fits of 24 different nucleotide substitution models for gag, pol PR/RT, pol IN and env gp41, respectively). ML phylogenetic trees were inferred from the multiple sequence alignments, and branches with a bootstrap value of 70% or greater were considered reliable. None of the sub-genomic regions supported a monophyletic South African lineage. In the gag ML tree (Figure 2A and B) all the sequences clustered within subtype C. Except for slight differences in the bootstrap values, there were no differences in the gag tree topologies inferred with either the GTR + G + I or TN93 + G + I models. Interestingly the 2 outliers to the main subtype C cluster, 0042A and 0143A, were possible intra subtype C recombinants in the pol region. Sequence 0119A had a long branch and 3 sets of sequences, 0189A/0203A, 0064A/190A and 0085A/0101A clustered closely together. This may indicate that these samples may be a possible PCR contamination or that they are epidemiologically linked.

Figure 2

Phylogenetic analysis of the partial gene, using MEGA 6. A. The evolutionary history was inferred by using the ML method based on the GTR model. The tree with the highest log likelihood (−5337.9653) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 1.2415)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 52.1407% sites). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 76 nucleotide sequences and included all codon positions. There were a total of 451 positions in the final dataset. B. The evolutionary history was inferred by using the ML method based on the Tamura-Nei model (TN93 + G + I). The tree with the highest log likelihood (−5340.4505) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 1.1914)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 51.4597% sites).

Figure 3

Figure 4

Phylogenetic analysis of the gene, using MEGA 6. The evolutionary history was inferred by using the ML method based on the GTR model. The tree with the highest log likelihood (−7480.4899) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 0.3186)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 42.6830% sites). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 55 nucleotide sequences and there were a total of 849 positions in the final dataset.

Phylogenetic analysis of the partial gene, using MEGA 6. The evolutionary history was inferred by using the ML method based on the GTR model. The tree with the highest log likelihood (−9574.7386) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 1.1121)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 45.2893% sites). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 49 nucleotide sequences and included all codon positions. There were a total of 1062 positions in the final dataset. Phylogenetic analysis of the gene, using MEGA 6. The evolutionary history was inferred by using the ML method based on the GTR model. The tree with the highest log likelihood (−7480.4899) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 0.3186)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 42.6830% sites). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 55 nucleotide sequences and there were a total of 849 positions in the final dataset. The ML phylogenetic tree for the env gp41 contained 74 sequences and all sequences, except for 0143A, clustered with HIV-1 subtype C sequences (Figure 5). Sequence 0143A clustered with subtype B in the env region and as an outlier to subtype C in the gag region. SQUEAL indicated that 0143A was an intra-subtype C recombinant in the pol region. This is the first indication of a putative unique BC recombinant sequence in Bushbuckridge, Mpumalanga.

Figure 5

HIV-1 antiretroviral drug resistance mutations

Although the participants were from an antiretroviral treatment naïve cohort, some antiretroviral drug mutations were detected (Table 4). The NNRTI mutation K103N detected on the 0143A sequence causes high-level resistance to nevirapine (NVP), and efavirenz (EFV). The NNRTI mutation, E138A, detected on the 0143A sequence is a polymorphism that may contribute to reduced etravirine (ETR) and rilpivirine (RPV) susceptibility in combination with other NNRTI-resistance mutations. The K101E mutation found on the 0189A sequence causes intermediate resistance to NVP and low-level resistance to EFV, ETR, and RPV. No major PI mutations were detected in the Bushbuckridge, Mpumalanga sequences. The T74S minor PI mutation occurs in 5% of untreated persons with subtype C viruses and is associated with reduced NFV susceptibility [28-31].

Table 4

HIV-1 drug resistance mutations detected in the Mpumalanga cohort

Isolate	Minor PI Mutations	NNRTI mutations	Integrase mutations
0005A	None	None	L74I
0042A	T74S	None	None
0081A	None	None	E157Q
0143A	None	K103N	None
0143A	None	E138A	None
0173A	T74S	None	None
0189A	None	K101E	None
0192A	T74S	None	None
0198A	None	None	L74I
0206A	T74S	None	None

HIV-1 drug resistance mutations detected in the Mpumalanga cohort E157Q is an integrase polymorphic accessory mutation that is weakly selected in patients receiving raltegravir (RAL) and causes low level resistance to RAL and elvitegravir (EVG). L74I is an accessory mutation for integrase.

Discussion

The investigation of the HIV subtype diversity of samples obtained from a cohort in Bushbuckridge, Mpumalanga revealed, first, that the HIV-1 from these samples belong almost entirely to HIV-1 subtype C with one BC recombinant; second, that the way in which the sequences derived from these samples cluster in phylogenetic trees suggests there has been multiple introductions of HIV-1 into Bushbuckridge; and third, the prevalence of antiretroviral drug resistance mutations and drug resistance-associated polymorphisms in Bushbuckridge is extremely low.

Bushbuckridge HIV epidemic is predominantly subtype C, with one BC URF

The fact that the HIV samples from Bushbuckridge, Mpumalanga, belong almost entirely to subtype C is consistent not only with the explosive HIV-1 epidemic in southern Africa, but also it’s very limited subtype diversity. HIV-1 subtype C is the most common subtype, accounting for the majority of HIV infections in southern Africa [4,5], while subtype B is responsible for infections in MSM [9,12]. One putative subtype BC unique recombinant form was detected. This indicates that BC URFs are not only found in the Western Cape Province [15], but also in Mpumalanga province.

Multiple introductions of HIV-1 into Bushbuckridge

The fact that HIV-1 subtype C sequences from South Africa tend to intermingle with HIV-1 subtype C sequences from Botswana, Malawi and Zambia suggests they may have a common evolutionary origin [32,33]. The possibility of an underlying common evolutionary origin of isolates in southern Africa is consistent with the history of the population dynamics of the southern African region. While the HIV-1 subtype C isolates from Brazil and Ethiopia tend to cluster separately, the fact that the subtype C isolate from India tends to cluster with the subtype C isolates from southern Africa [34,35] can be explained by the historical connections between the Indian subcontinent and southern Africa, which arises from the roles of both regions as former British colonial territories. Countries in southern Africa in which adult national HIV prevalence rates exceeded 15% in 2007 were all linked by the migrant labor system. This system, which under pinned the population dynamics of both South Africa and the broader southern African region, was critical in shaping the patterns of population mobility and integration that characterizing the entire region. The migrant labor system was integral to the development and structure of the South African economy and apartheid. Botswana, Lesotho, Namibia, South Africa, Swaziland, Zambia, and Zimbabwe, were all historically linked through the migrant labor system that brought men from as far as Zambia and Malawi to the mines initially on the Reef and subsequently elsewhere in the country [36-39]. Migrants are more vulnerable to HIV infection than people who hardly move, both in southern Africa as in other African countries [40-42]. A 1985 survey of workers in the gold mines originating from the entire southern African region found HIV prevalence to be very low among South African miners but among Malawian miners prevalence was already at 3% [36]. High infection levels are being found in Gaza province in Mozambique, where large numbers of migrants working in South Africa originate [43]. Before and after independence foreign migrant workers also crossed borders to work in mines in Namibia, Botswana, Zambia, and Zimbabwe [44-46]. Many of the countries in southern African with explosive HIV/AIDS epidemic are also landlocked, which entails that the region’s road transport networks does not only link these landlocked countries to the ports in Durban, Richards Bay and Maputo, but also facilitate the rapid spread of HIV in the region by ensuring the sexual networks that drive the epidemic transcend national boundaries. The Ehlanzeni District in Mpumalanga Province straddles the Maputo Corridor, a major trade route which connects the Gauteng, Limpopo, and Mpumalanga provinces of South Africa with Maputo, the capital of Mozambique that also has a major port. In Mozambique, HIV is spreading more rapidly in provinces linked by major transport routes to Malawi, South Africa and Zimbabwe. High infection rates have been found in Sofia province, which is traversed by Zimbabwe’s main export route [43]. The peculiarly explosive HIV-1 epidemic in southern Africa could also stem from the unique biological properties of subtype C. HIV-1 subtype C has an additional NF-binding site in the long terminal repeat (LTR), a prematurely truncated Rev protein, a 5′-amino-acid insertion in Vpu, and a more active, catalytically efficient protease, which may influence viral gene expression and alter the transmissibility and pathogenesis of subtype C isolates [31,47-52]. These unique biological properties, including those related to viral entry and pathogenesis such as the CCR5 and non-syncytium-inducing phenotype, may account for the explosive epidemic of HIV-1 subtype C in southern Africa [53-55]. However, the additional NF-κB site in HIV-1 subtype C may be biologically inactive, and enhanced activity of these individual functions may still not be sufficient to overcome the decreased replicative capacity of the CCR5-tropic non-syncytium-inducing phenotype [53].

Drug resistance mutations and polymorphisms

Combination antiretroviral therapy can suppress HIV-1 replication to undetectable levels with concomitant significant clinical outcomes. However, suboptimal suppression HIV-1 replication can result in the emergence of drug resistant virus strains. HIV-1 isolates that have acquired mutations conferring reduced susceptibility to antiretroviral drugs can be can be transmitted, potentially limiting options for first line therapy in untreated individuals [56]. The proportion of patients without prior antiretroviral therapy and who are infected with a virus resistant to at least one antiretroviral drug in Australia, Europe, Japan and the United States of America is 10% to 17%, while data between 2006 and 2010 suggests that transmitted antiretroviral drug resistance among those starting antiretroviral treatment in low- and middle-income countries increasing [2]. South Africa has the largest antiretroviral treatment program in the world. Besides its unprecedented scale, the antiretroviral treatment programme in South Africa is also being rolled out rapidly, such that while only 833653 adults and 86270 children were on antiretroviral treatment through the public sector in South Africa by the end of 2009, the number of those on treatment by 2012 had increased to 2010340 adults and 140541 children [2,3,16]. While the HIV-1 sequences used in this study are derived from treatment-naïve participants from Bushbuckridge, Mpumalanga, the K103N antiretroviral drug resistance mutation was detected. This suggests the participants from Bushbuckridge, Mpumalanga may either have undergone antiretroviral treatment or that they were infected with antiretroviral drug resistant strains [28-31]. The E138A mutation selected for by riplivirine/etravirine, must also be a transmitted mutation. Both riplivirine/etravirine are not part of the first and second line ART regimens in South Africa, while etravirine is part of the third line regimen. Patients in Mpumalanga only started receiving third line ART 2013.

Limitations of this study

The limitations of the study include a relatively small sample size; DNA amplification was not successful for up to 71% of the samples of the partial pol PR/RT sub-genomic region; use of partial gene regions to assign viral subtypes, potentially allowing recombinant viruses to be missed, the use of direct; population sequencing may result in the lack of detection of minority-population viruses; which can lead to an underestimation of viral diversity and drug resistance mutations.

Conclusions

HIV diversity may have implications for diagnosis, pathogenesis, transmission, clinical management and vaccine development. Phylogenetic analysis of HIV sequence diversity has allowed vital insights into the origin, evolution and spread of HIV, which suggests it is imperative to maintain HIV-1 molecular epidemiology surveillance. The extensive population mobility arising from the historical and structural migrant labor system characterizing South Africa, and the concomitant overlapping of sexual networks, seems to have precluded the possibility of distinct geographical lineages developing. However, the demise of apartheid, in particular the end of influx control measures, may have a significant impact on patterns of population mobility and settlement in South Africa, which in turn may affect the patterns of transmission of HIV and ultimately it’s evolution. The possible emergence of various HIV-1 recombinants could suggest that the migration of people into South Africa from Central, West and eastern Africa could also impact on the character and dynamics of the HIV/AIDS epidemic in South Africa.

48 in total

1. Occurrence of additional NF-kappaB-binding motifs in the long terminal repeat region of South African HIV type 1 subtype C isolates.

Authors: G Hunt; C T Tiemessen
Journal: AIDS Res Hum Retroviruses Date: 2000-02-10 Impact factor: 2.205

2. Near full-length clones and reference sequences for subtype C isolates of HIV type 1 from three different continents.

Authors: C M Rodenburg; Y Li; S A Trask; Y Chen; J Decker; D L Robertson; M L Kalish; G M Shaw; S Allen; B H Hahn; F Gao
Journal: AIDS Res Hum Retroviruses Date: 2001-01-20 Impact factor: 2.205

3. A predominantly HIV type 1 subtype C-restricted epidemic in South African urban populations.

Authors: J H Van Harmelen; E Van der Ryst; A S Loubser; D York; S Madurai; S Lyons; R Wood; C Williamson
Journal: AIDS Res Hum Retroviruses Date: 1999-03-01 Impact factor: 2.205

4. Comparison of Vpu sequences from diverse geographical isolates of HIV type 1 identifies the presence of highly variable domains, additional invariant amino acids, and a signature sequence motif common to subtype C isolates.

Authors: C McCormick-Davis; S B Dalton; D K Singh; E B Stephens
Journal: AIDS Res Hum Retroviruses Date: 2000-07-20 Impact factor: 2.205

5. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0.

Authors: Koichiro Tamura; Glen Stecher; Daniel Peterson; Alan Filipski; Sudhir Kumar
Journal: Mol Biol Evol Date: 2013-10-16 Impact factor: 16.240

6. Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: performance evaluation of the new REGA version 3 and seven other tools.

Authors: Andrea-Clemencia Pineda-Peña; Nuno Rodrigues Faria; Stijn Imbrechts; Pieter Libin; Ana Barroso Abecasis; Koen Deforche; Arley Gómez-López; Ricardo J Camacho; Tulio de Oliveira; Anne-Mieke Vandamme
Journal: Infect Genet Evol Date: 2013-05-07 Impact factor: 3.342

7. Genetic analysis of the near full-length genome of an HIV type 1 A1/C unique recombinant form from northern South Africa.

Authors: Benson C Iweriebor; Pascal O Bessong; Lufuno G Mavhandu; Tracy M Masebe; Julius Nwobegahay; Sylvester R Moyo; Jeffrey M Mphahlele
Journal: AIDS Res Hum Retroviruses Date: 2011-01-15 Impact factor: 2.205

8. Epidemiology of HIV-1 subtypes among men who have sex with men in Cape Town, South Africa.

Authors: Keren Middelkoop; Cecilia Rademeyer; Ben B Brown; Tamaryn J Cashmore; Jinny C Marais; Andrew P Scheibe; Gama P Bandawe; Landon Myer; Jonathan D Fuchs; Carolyn Williamson; Linda-Gail Bekker
Journal: J Acquir Immune Defic Syndr Date: 2014-04-01 Impact factor: 3.731

9. MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors: Kazutaka Katoh; Daron M Standley
Journal: Mol Biol Evol Date: 2013-01-16 Impact factor: 16.240

10. HIV-1 subtypes B and C unique recombinant forms (URFs) and transmitted drug resistance identified in the Western Cape Province, South Africa.

Authors: Graeme Brendon Jacobs; Eduan Wilkinson; Shahieda Isaacs; Georgina Spies; Tulio de Oliveira; Soraya Seedat; Susan Engelbrecht
Journal: PLoS One Date: 2014-03-07 Impact factor: 3.240

5 in total

1. HIV Drug Resistance in Antiretroviral Treatment-Naïve Individuals in the Largest Public Hospital in Nicaragua, 2011-2015.

Authors: Santiago Avila-Ríos; Claudia García-Morales; Margarita Matías-Florentino; Daniela Tapia-Trejo; Bismarck F Hernández-Álvarez; Sumaya E Moreira-López; Carlos J Quant-Durán; Guillermo Porras-Cortés; Gustavo Reyes-Terán
Journal: PLoS One Date: 2016-10-13 Impact factor: 3.240

2. HIV-1 diversity among young women in rural South Africa: HPTN 068.

Authors: Mariya V Sivay; Sarah E Hudelson; Jing Wang; Yaw Agyei; Erica L Hamilton; Amanda Selin; Ann Dennis; Kathleen Kahn; F Xavier Gomez-Olive; Catherine MacPhail; James P Hughes; Audrey Pettifor; Susan H Eshleman; Mary Kathryn Grabowski
Journal: PLoS One Date: 2018-07-05 Impact factor: 3.240

3. Trends in Pretreatment HIV-1 Drug Resistance in Antiretroviral Therapy-naive Adults in South Africa, 2000-2016: A Pooled Sequence Analysis.

Authors: Benjamin Chimukangara; Richard J Lessells; Soo-Yon Rhee; Jennifer Giandhari; Ayesha B M Kharsany; Kogieleum Naidoo; Lara Lewis; Cherie Cawood; David Khanyile; Kassahun A Ayalew; Karidia Diallo; Reshmi Samuel; Gillian Hunt; Alain Vandormael; Babill Stray-Pedersen; Michelle Gordon; Tariro Makadzange; Photini Kiepiela; Gita Ramjee; Johanna Ledwaba; Monalisa Kalimashe; Lynn Morris; Urvi M Parikh; John W Mellors; Robert W Shafer; David Katzenstein; Pravi Moodley; Ravindra K Gupta; Deenan Pillay; Salim S Abdool Karim; Tulio de Oliveira
Journal: EClinicalMedicine Date: 2019-03-18

4. Molecular Epidemiology and Trends in HIV-1 Transmitted Drug Resistance in Mozambique 1999-2018.

Authors: Nalia Ismael; Eduan Wilkinson; Isabel Mahumane; Hernane Gemusse; Jennifer Giandhari; Adilson Bauhofer; Adolfo Vubil; Pirolita Mambo; Lavanya Singh; Nédio Mabunda; Dulce Bila; Susan Engelbrecht; Eduardo Gudo; Richard Lessells; Túlio de Oliveira
Journal: Viruses Date: 2022-09-09 Impact factor: 5.818

5. Characterization of HIV-1 Integrase Gene and Resistance Associated Mutations Prior to Roll out of Integrase Inhibitors by Kenyan National HIV-Treatment Program in Kenya.

Authors: Sepha Mabeya; Anthony Nyamache; Caroline Ngugi; Andrew Nyerere; Raphael Lihana
Journal: Ethiop J Health Sci Date: 2020-01

5 in total