Literature DB >> 32637106

Characterization of envelope sequence of HIV virus in children infected with HIV in Vietnam.

Linh Vu Phuong Dang1, Hung Viet Pham2, Thanh Thi Dinh1, Thu Hoai Nguyen3, Quyen Thi Huyen Vu3, Nhung Thi Phuong Vu1, Phuong Thi Bich Le1, Lam Van Nguyen3, Hai Thanh Le3, Phuong Thi Vu4, Linus Olson5,6,7.   

Abstract

BACKGROUND: HIV is characterized by high levels of genetic variability, including increased numbers of heterogeneous sequences of the envelope region. Therefore, studying genetic variability of HIV in relation to viral replication might facilitate prognosis of disease progression.
METHODS: The study was designed as cross-sectional; data and samples of participants collected and analyzed env genes were obtained from 23 children enrolled by Vietnam National Children's Hospital.
RESULTS: Substantial mutations in the C2 region were found in patients with high levels of viral replication while changes in the C3 region were mostly found in patients with low viral load. In the V1 region, we found profound amino acid modifications in patients with low HIV viral loads in contrast to the V2 sequence, where we identified single point mutations in patients with increased HIV viral load. The V3 region was relatively homogeneous, while profound deletions in the V4 region were detected in patients with increased viral replication.
CONCLUSION: Our results suggest that genetic variations in different regions of the HIV envelope sequence, including both conserved C2 and C3 and variable V1/V2 and V4 regions, might be involved in increased viral infectivity and replication capacity. Such knowledge might help improve prediction of HIV progress and treatment in patients.
© The Author(s) 2020.

Entities:  

Keywords:  CD4 T cell counts; HIV; HIV viral load; Vietnam; children; envelope

Year:  2020        PMID: 32637106      PMCID: PMC7323269          DOI: 10.1177/2050312120937198

Source DB:  PubMed          Journal:  SAGE Open Med        ISSN: 2050-3121


Introduction

According to the UNAIDS report, 2.1 million children have been infected with HIV (human immunodeficiency virus) worldwide, of which more than 6500 HIV-infected children were found in Vietnam.[1,2] The majority of the infected children become infected with HIV vertically, through mother-to-child transmission (MTCT). Thus, HIV might already encounter higher levels of genetic variability. The perinatal HIV infection is influenced by a combination of factors including both maternal, infant host, and viral factors. While the genetic polymorphisms in human leukocyte antigen (HLA) genes and chemokine receptor genes in mother and infant host have been shown to play a crucial role in MTCT of HIV, the genetic variability of HIV virus in relation to MTCT has not been studied extensively.[3,4] In addition, it has been shown that the genetic variation due to heterosexual transmission could be different compared to other transmissions, including men who have sex with men (MSM) transmissions and MTCT. Therefore, genetic variability could be crucial factors in order to determine not only the successful transmission but also the course of the HIV infection.[5] Genetic variability of HIV occurs as the result of nucleotide substitutions, deletions, or insertions within each genotype or recombination between different genotypes. Four major genotype groups of HIV have been reported, including M (main), O (outlier), N (non-M, non-O), and P (putative) groups.[6] Among all the groups, group M, the most predominant circulating genotype, has been divided into different subtypes (clades) denoted A, B, C, D, F, G, H, J, and K and sub-subtypes denoted A1, A2, A3, A4, F1, F2, and so on.[7] The variation within subtype could be 15%–20%, while up to 25%–30% difference has been reported[8] between subtype genetic variation. In addition to the single subtypes, researchers have reported other recombinant subtypes, which are classified as circulating recombinant forms (CRFs) and unique recombinant forms (URFs),[9] thought to be the results of combination between subtypes in dually infected persons. These subtypes could pass to other people through different routes of transmission.[10] Subtype AE, which also is defined as CRF01_AE, has been shown to be the most frequent in Vietnam in different groups, regardless of the routes of transmission.[11,12] HIV infects CD4 T cells through binding of its envelope (env) protein gp120 to the CD4 receptor and the CXCR4/CCR5 co-receptor on the cell surface. After binding, HIV releases its genome into the host cells, in which the HIV genome undergoes reverse transcription into DNA, which is then inserted into the host’s genome in the infected cells. The inserted DNA is then transcribed and translated to form new HIV virions including the whole HIV genome and associated proteins. HIV virions then use the host cells’ surface to form the viruses which are later released and able to infect other cells.[13-15] The process has multiple steps and involves complex pathways, yet it is believed that the fitness of HIV is critical for viral replication, transmission, and disease progression. Hence, HIV has been shown to be completely dependent upon the env protein to enter the host cells. The env gene encodes the surface-expressed viral protein env (gp160) and linearly contains five conserved regions (C1–C5) and five variable regions (V1–V5).[16] Among the C regions, the C2 and C3 regions have been shown to be crucial for CD4 binding. C3 contains three residues defined as oligosaccharide sites that are required for structure and function of the protein. These residues consist of N-X-S (Asparagine-X-Serine) or N-X-T (Asparagine-X-Threonine) contexts including NGT (Asparagine-Glycine-Threonine), NVS (Asparagine-Valine-Serine), and NGS (Asparagine-Glycine-Serine).[17] Regarding the V region, the loop structure was shown to be formed by the intermolecular disulfide bonds in the gp120 glycoprotein.[18] The V1 region plays a significant role in early infection.[19,20] Both V1 and V2 regions are associated with increased neutralization susceptibility of HIV.[21,22] V3 is one of the most important determinants of viral tropism and co-receptor usage with the involvement of N-X-S (Asparagine-X-Serine) or N-X-T (Asparagine-X-Threonine) contexts.[23] The functional importance of V3 is illustrated by the fact that deletion of V3 completely abrogates virus infectivity.[24] The role of V4 is unclear regarding HIV infection; it has been shown to be involved in formation of neutralizing antibodies.[25] It has been suggested that HIV has high levels of genetic variability, and thus that evolution of the env gene might occur rapidly, during the course of HIV infection.[26,27] The process, in turn, could facilitate viral replication by increased fitness, and increased ability to escape from the selection pressure caused by anti-retroviral therapy (ART), as well as the immune system. However, other mutations in the env gene might be associated with decreased viral entry and disturbance of certain functions of host cells. Quan et al.[28] have reported that mutations in the env gene, particularly G367R and D368R in the CD4 binding site (CD4bs) of gp120, have led to inactivity of viral infection. To our knowledge, studies of diversifications of the env gene in the CRF01_AE clade of HIV, the predominant clade circulating in Vietnam, are limited. Here we investigate variations of the env region in HIV-infected children registered at Vietnam National Children’s Hospital, Hanoi, Vietnam, and its relation to HIV viral load and CD4 T cell counts.

Methods

Study population and sample collection

The study subjects were HIV-infected children, who have been diagnosed and treated at National Hospital of Pediatrics (NHP), Hanoi, Vietnam, in 2012. Samples chosen from a previous study[29] were those taken during the first year after treatment initiation and they were compared to the variability of the env gene among HIV-infected children who were treated with first-line ART. Among samples we have accessed, we could extract RNA and perform further analysis in 23 samples. The data regarding CD4 T cell counts and HIV viral load were collected from medical records of all patients at the same time point as the one when blood samples were taken. The duration of the study was from August 2017 to October 2017.

Polymerase chain reaction, cloning, and sequencing of HIV-1 env

Blood samples were collected in tubes containing ethylenediaminetetraacetic acid (EDTA) and transported to the laboratory within 6 h to perform RNA extraction. Viral RNA extraction was performed using QIAamp Viral RNA Mini Kit (Qiagen, Hilden, Germany) and purified RNA was used to synthesize cDNA using First-Strand Synthesis System for reverse transcriptase (RT)–polymerase chain reaction (PCR) (Invitrogen, Carlsbad, CA, USA). The env gene was amplified by PCR using a nested-PCR protocol in a PerkinElmer 480 Thermal Cycler (Perkin Elmer, Waltham, MA, USA). PCR amplification of the complete gp160-encoding sequence (the region from 5580 to 8586 of the HIV-1HXB2 genome) was performed using the following external primers: AEs1ext (5ʹ-TGGGTTACAGTTTATTATGGGG-3ʹ) and AEa1ext (5ʹ-TTCTCTTTGCCYTGGTGGGTGCTA-3ʹ) and inner primers including AEs1int (5ʹ-TGCCAAAGCATATGAGACAGARGYGCA-3ʹ) andAEa1int (5ʹ-TTCACTTCTCCAATTGTCCTTYATRTT-3ʹ), which have been described previously.[30] The amplification was carried out in a final volume of 50 µL containing Tris-HCl, MgCl2, deoxyribonucleotide triphosphates (dNTPs), primers, and Taq polymerase. Thermal cycle conditions included heating at 94°C for 2 min, at 94°C for 30 s, at 50°C for 30 s, and at 68°C for 90 s for 35 cycles, and a final extension step at 72°C for 10 min for both cycling conditions. The PCR products were detected by agarose gel electrophoresis, followed by ethidium bromide staining. The sequencing was performed using BigDye terminator reaction kits and an Applied Biosystems 3100 capillary sequencer (Applied Biosystems, Inc., Foster City, CA, USA). The sequences obtained were aligned to a reference sequence, CRF-AE that has been found in Vietnam, using BioEdit software (BioEdit, Tom Hall, Ibis Therapeutics, Carlsbad, CA, USA), and then to blast using the National Center for Biotechnology Information (NCBI) program and the multiple alignment program for amino acid or nucleotide sequences (MAFFT), program version 7 (CBRC). There were certain regions with multiple mutations within the same patients, making it difficult to obtain the clean sequences and hence these were excluded from the analysis.

Results

General characteristics of the HIV-infected children

As shown in Table 1, median age of participants was 4.6 years (interquartile range (IQR): 3–5.5), median CD4 T cell counts were 470 (103–740) cells/mm3 and median levels of HIV viral load were 400 (400–86,000) copies/mL. The participants were 43.5% female and 56.5% male. 95.7% of the participants were infected with at least one opportunistic infection and the majority was at clinical stage 2 (52.2%). 30.4% of the HIV-infected children suffered from severe immune-deficiency (CD4 T cell counts < 350 cells/mm3), 47.8% had high levels of HIV viral load (>1000 copies/mL).
Table 1.

General characteristics of HIV-infected children.

PatientsAgeSexCD4 T cell countsHIV RNAOpportunistic infectionClinical stage
10144.7M680400Y1
10225.1F754400Y1
10272.8F4071848Y3
10321.8M29220Y3
10333.0F888400N1
10355.0M236220Y4
10374.7M1754400Y1
10581.6M470480,000Y1
10605.1F356400Y2
10614.0M1106400Y3
10654.0F6632600Y3
10714.3M114400Y2
10727.8M88442,000Y2
10815.7M272160,000Y2
10820.9F32180,000Y2
10859.5F37416,000Y2
10874.2F541,300,000Y3
10985.5M594400Y2
11066.2M74056,000Y2
11254.6F55208,193Y2
11659.2M543400Y2
11752.2M58586,000Y2
11853.0F400400Y2
CharacteristicsMedianInterquartile range (IQR)
Age4.635.5
Weight151116.5
Height9890103
CD4 T cell counts470103740
HIV RNA40040086,000
General characteristics of HIV-infected children.

Constant regions showed high homology when aligned to reference sequence

Among 23 subjects included in our study, there were no significant variations in C2 regions compared to the reference sequence. We observed that most of the subjects carried Valine (V) instead of Isoleucine (I) at the second amino acid. Subject 1014, who had low levels of both CD T cell counts and HIV viral load, obtained the replacement of Asparagine (N) to Serine (S) at the fifth amino acid. While high CD4 T cell counts and low HIV RNA have been found in subject 1037 with alteration from Asparagine (N) to Aspartate (D) at the seventh amino acid, subject 1185 with substantial mutations in 10 amino acids, and subject 1165 with five variations in amino acids (Figure 1).
Figure 1.

Multi-alignment of C2 and C3 regions of env sequence.

Multi-alignment of C2 and C3 regions of env sequence. Comparing with the reference sequence in C3 region, patients (1085 and 1022) with the highest levels of genetic variations had high CD4 T cell counts and low levels of HIV viral load. On the contrary, subjects 1081 and 1085 with high levels of viral load experienced only two variations at positions 3 and 4. While the majority of patients showed high levels of homology in three residues in C3 domains that are defined as oligosaccharide sites required for the structure/function of the protein consisting of N-X-S (Asparagine-X-Serine) or N-X-T (Asparagine-X-Threonine) contexts including NGT (Asparagine-Glycine-Threonine), NVS (Asparagine-Valine-Serine), and NGS (Asparagine-Glycine-Serine), patient 1022 with high CD4 and low HIV RNA showed mutation in these residues (Figure 1).

The low variation in V1 region and single point mutation were found in patient with high levels of viral load

Sequence comparison with the founder virus showed profound amino acid variation in the V1 region in subjects with low viral load in contrast to single point mutations within the V2 region in subjects with increased viral load (Figure 2). Similarly, patients with the full V1 length experienced high levels of HIV viral load in contrast to missing amino acids in the V2 region in participants with high levels of viral replication (Figure 2).
Figure 2.

Multi-alignment of V1–V4 regions of env sequence.

Multi-alignment of V1–V4 regions of env sequence.

The V3 region remained relatively homogeneous compared to the reference sequence

As can be seen from Figure 2, Cysteine (C), which is responsible for formation of the loop structure, was highly conserved in all patients. In nine patients, four amino acid GPGQ contexts (Glycine-Proline-Glycine-Glutamine) were found to be conserved. The third amino acid was conserved in all patients. While other subjects showed different patterns of variation and viral load at the first and second position, only patients with replacement of Q by R (Arginine) in the last position showed high levels of viral load (Figure 2). Four patients (1081, 1072, 1175, and 1125) with highly unconserved residues in N-X-S (Asparagine-X-Serine) or N-X-T (Asparagine-X-Threonine) contexts were found to have high levels of HIV viral load in contrast to patients with low levels of diversification who had low HIV RNA (Figure 2).

The most frequent variations found in the V4 region

All subjects showed at least nine mutations in the V4 region, in which subjects 1024, 1071, and 1120 showed the most variations, including multiple point mutations, insertions, and deletions. Five patients had V4 regions with multiple deletions: 1120, 1061, 1130, 1082, and 1125 (Figure 2).

Discussion

Alignment of the C2 region taken from participants to a reference sequence revealed different types of genetic variation. Patients with certain types of genetic variation also experienced high levels of viral load, suggesting that these diversifications might facilitate viral replication in HIV-infected patients. Regarding the C3 region, patients who carried Serine instead of Asparagine at the fifth amino acid had low CD4 T cell counts and low viral load. The variation might be associated with decrease of HIV fitness; therefore, even though the immune system has not been normalized, HIV replication remained low. On the contrary, patients with substantial variations were found to have high CD4 T cell counts and low levels of HIV viral load, suggesting that the mutation could both decreased fitness and increased immune recognition. In contrast to the C2 region, significant diversifications in the C3 region were found in patients with increased viral replication, suggesting different roles of the C3 and C2 regions in facilitating viral replication. Higher levels of evolutionary rate in the C2 and C3 regions have been reported in HIV-1 compared to those of HIV-2, and it is suggested that changes in the C2 and C3 regions might be associated with immune escaping mechanisms and viral replication.[17] Moreover, it has been proposed that immune pressure is one of the most important driving forces for rapid intra-host evolution of the C2 and C3 regions. In our study, the evolution of C2 seems to favor viral replication, whereas evolution of C3 appeared to have an opposite role. The V1 region showed abundant diversifications in HIV-infected children, with truncated regions found in most of the patients. Surprisingly, patients with a full length V1 region showed high levels of HIV viral load, whereas those with a truncated V1 region showed the opposite result. Thus, full sequence of V1 might be required for viral replication. The V1 region has been shown to involve in early infection and replication efficiency[19,20] and deletion of the V1 loop enhanced the neutralization capacity of the immune system.[20] In line with these findings, we showed that the presence of a full length V1 region was associated with increased viral replication and that a truncated V1 region was found in patients with low levels of viral load. However, the mechanism behind the truncation of the V1 region is still unclear, probably due to the increased surveillance by the immune system as long as it is intact. For the V2 region, among the highly conserved four amino acid GPGQ context, only the variation in the last residue was found in patients with high levels of HIV viral load, implying a role of the last residue in viral replication. The V1/V2 regions are considered to play an important role in escaping the immune response, since deletions in the V1/V2 regions were associated with poor neutralization susceptibility of HIV.[21,22] However, most of the participants with truncated V1 regions in our study had low viral replication, suggesting that the V1 region might have different roles compared to the V2 region in development of neutralization capacity of the hosts. Consistent with our results, Saunder et al.[23] also emphasized different roles of V1 and V2 loops in association with the neutralization susceptibility profile of HIV-1 and in certain cases, the roles of V1 and V2 might even be opposite. Our study appears to favor opposite roles of V1 and V2 for increased viral replication. In addition, V1 and V2 changes have been found to be associated with alteration of co-receptor usage.[31] However, such changes were not consistently found, and have not been documented in the AE subtype. Thus, it is difficult to compare the changes in association with the changes in co-receptor usage and HIV binding. Trinh et al. showed that alterations of the V2 sequence C-strand (residues 165–186) were associated with viral escape antibody response.[32] However, we did not consistently found these changes in our patients. The V3 region has been reported to be critical for env-mediated HIV-1-cell fusion.[23] However, we did not find any significant mutations associated with viral replication in the V3 region. Consistent with the finding of Morikita et al.,[33] we also found that the V3 region was quite homogeneous, whereas the V1 and V2 regions were relatively heterogeneous. The V3 region has been thought to be the crucial determinant of CCR5 co-receptor usage; when the V1/V2 and V3 domains were present, there was increased HIV viral infectivity.[34] Shalekoff et al.[35] found that the switch from CCR5 usage to CXCR4 usage in HIV-infected children was observed between 9 and 15 months. All our children were older than 15 months, so that we could speculate that most of them were using CXCR4. Therefore, with a crucial role in viral fusion activity and cellular tropism and a minor role in developing immune response, the V3 region could remain homogeneous without affecting viral replication. Mutations in the V4 region including multiple point mutations, insertions, and deletions were detected in patients with high levels of HIV viral load. Sundravaradan et al.[36] have suggested that due to differences in sequences of V3 and V4 regions, patients with subtype C might encounter higher levels of HIV-1 replication, compared to those with subtype B, suggesting that sequences of the V3 and V4 regions might affect viral fitness and viral replication. In our study, the V3 region remained relatively homogeneous; mutations in the V4 region might be associated with increased viral replication. Taken together, our results suggested that HIV might acquire mutations in different regions of the env gene, including both conserved C2 and C3 and variable V1/V2 and V4 regions, in order to increase viral infectivity and replication capacity, while V3, which is required for fusion activity and cellular tropism, remains relatively homogeneous.

Limitation of the study

Our study is restricted to observations of changes of different regions of the env sequences. Since we have consumed all samples collected in a previous study, the numbers of patients for this study were limited. A more comprehensive functional study should be conducted in order to understand the association between the observed env variations and viral binding and immune-escape mutations for the particular AE subtype in Vietnam.

Conclusion

Our results suggest that env sequences of HIV are mutated in different regions, including both conserved C2 and C3 and variable V1/V2 and V4 regions in order to increase viral infectivity and replication capacity. Notably, the V3 region remained relatively homogeneous, supporting the role of V3 for fusion activity and cellular tropism. Studying the genetic variability of the HIV virus in relation to viral replication and immunological status might be beneficial for prediction of disease progression and treatment responses during HIV infections.
  34 in total

1.  Phylogenetic analysis of 49 newly derived HIV-1 group O strains: high viral diversity but no group M-like subtype structure.

Authors:  P Roques; D L Robertson; S Souquière; F Damond; A Ayouba; I Farfara; C Depienne; E Nerrienet; D Dormont; F Brun-Vézinet; F Simon; P Mauclère
Journal:  Virology       Date:  2002-10-25       Impact factor: 3.616

2.  CCR2 polymorphism and HIV: mutation in both mother and child is associated with higher transmission.

Authors:  Marie Nicole Ngoufack; Céline N Nkenfou; Barbara A Tiedeu; Georges Nguefack-Tsague; Linda C Mekue Mouafo; Beatrice Dambaya; Carine N Nguefeu; Elvis N Ndzi; Serge C Billong; Wilfred F Mbacham; Alexis Ndjolo
Journal:  Int J Biochem Mol Biol       Date:  2019-10-15

3.  Global and regional distribution of HIV-1 genetic subtypes and recombinants in 2004.

Authors:  Joris Hemelaar; Eleanor Gouws; Peter D Ghys; Saladin Osmanov
Journal:  AIDS       Date:  2006-10-24       Impact factor: 4.177

4.  Two-year outcome of first-line antiretroviral therapy among HIV-1 vertically-infected children in Hanoi, Vietnam.

Authors:  H V Pham; A Ishizaki; L V Nguyen; C T T Phan; T T B Phung; K Takemoto; A N Pham; X Bi; D T K Khu; H Ichimura
Journal:  Int J STD AIDS       Date:  2014-10-19       Impact factor: 1.359

5.  A replication-competent, neutralization-sensitive variant of simian immunodeficiency virus lacking 100 amino acids of envelope.

Authors:  Welkin E Johnson; Jennifer Morgan; Julie Reitter; Bridget A Puffer; Susan Czajak; Robert W Doms; Ronald C Desrosiers
Journal:  J Virol       Date:  2002-03       Impact factor: 5.103

6.  Characteristics of HIV type 1 (HIV-1) glycoprotein 120 env sequences in mother-infant pairs infected with HIV-1 subtype CRF01_AE.

Authors:  Tanawan Samleerat; Martine Braibant; Gonzague Jourdain; Alain Moreau; Nicole Ngo-Giang-Huong; Pranee Leechanachai; Jittapol Hemvuttiphan; Temsiri Hinjiranandana; Tikamporn Changchit; Boonyarat Warachit; Veera Suraseranivong; Marc Lallemant; Francis Barin
Journal:  J Infect Dis       Date:  2008-09-15       Impact factor: 5.226

7.  Multidrug-resistant variants of HIV type 1 (HIV-1) can exist in cells as defective quasispecies and be rescued by superinfection with other defective HIV-1 variants.

Authors:  Yudong Quan; Chen Liang; Bluma G Brenner; Mark A Wainberg
Journal:  J Infect Dis       Date:  2009-11-01       Impact factor: 5.226

8.  Structure of an HIV gp120 envelope glycoprotein in complex with the CD4 receptor and a neutralizing human antibody.

Authors:  P D Kwong; R Wyatt; J Robinson; R W Sweet; J Sodroski; W A Hendrickson
Journal:  Nature       Date:  1998-06-18       Impact factor: 49.962

9.  Phylodynamic analysis of the dissemination of HIV-1 CRF01_AE in Vietnam.

Authors:  Huanan Liao; Kok Keng Tee; Saiki Hase; Rie Uenishi; Xiao-Jie Li; Shigeru Kusagawa; Pham Hong Thang; Nguyen Tran Hien; Oliver G Pybus; Yutaka Takebe
Journal:  Virology       Date:  2009-06-21       Impact factor: 3.616

10.  Evolutionary and structural features of the C2, V3 and C3 envelope regions underlying the differences in HIV-1 and HIV-2 biology and infection.

Authors:  Helena Barroso; Pedro Borrego; Inês Bártolo; José Maria Marcelino; Carlos Família; Alexandre Quintas; Nuno Taveira
Journal:  PLoS One       Date:  2011-01-20       Impact factor: 3.240

View more
  1 in total

1.  The Characteristics of the HIV-1 Env Glycoprotein Are Linked With Viral Pathogenesis.

Authors:  Silvia Pérez-Yanes; María Pernas; Silvia Marfil; Romina Cabrera-Rodríguez; Raquel Ortiz; Víctor Urrea; Carla Rovirosa; Judith Estévez-Herrera; Isabel Olivares; Concepción Casado; Cecilio Lopez-Galindez; Julià Blanco; Agustín Valenzuela-Fernández
Journal:  Front Microbiol       Date:  2022-03-24       Impact factor: 5.640

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.