| Literature DB >> 22073263 |
Samatchaya Boonchawalit1, Duangrat Jullaksorn, Jiraporn Uttiyoung, Amara Yowang, Nongkran Krathong, Sununta Chautrakul, Akifumi Yamashita, Kazuyoshi Ikuta, Amornsak Roobsoong, Sangkom Kanitvittaya, Pathom Sawanpanyalert, Masanori Kameoka.
Abstract
BACKGROUND: The envelope glycoproteins (Env), gp120 and gp41, are the most variable proteins of human immunodeficiency virus type 1 (HIV-1), and are the major targets of humoral immune responses against HIV-1. A circulating recombinant form of HIV-1, CRF01_AE, is prevalent throughout Southeast Asia; however, only limited information regarding the immunological characteristics of CRF01_AE Env is currently available. In this study, we attempted to examine the evolutionary pattern of CRF01_AE Env under the selection pressure of host immune responses. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2011 PMID: 22073263 PMCID: PMC3206936 DOI: 10.1371/journal.pone.0027098
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Variability of amino acid sequences in Env gp120 regions and gp41.
Nine amino acid sequences derived from samples collected between July 2008 and January 2011 from an individual were aligned with the amino acid sequence derived from the earliest sample collected in April 2008 from a corresponding individual. Aligned sequences were then divided into variable and conserved regions of gp120 and gp41. Pairwise genetic distances (p-distance) between the sequences derived from the earliest and one of the later samples was determined as described in Methods. The data obtained for all Env regions were plotted together in chronological order of sampling, as indicated from 1 vs 2 to 1vs 10. Patient IDs, Env regions and the status of treatment are denoted above, below and beside the panels, respectively.
Figure 2Amino acid residues with high diversity in the C3 and V5 regions of gp120.
The Shannon index was calculated for each amino acid residue in the C3 and V5 regions of gp120 among 10 amino acid sequences derived from an individual, as described in Methods. The data obtained from 15 patient samples were then plotted together. Amino acid numbering is based on the HXB2 Env gp120.
Average numbers of amino acid residues and potential N-linked glycosylation (PNLG) sites in Env regions.
| Env regions | Number of amino acid residues | Number of PNLG sites |
| gp120 | ||
| C1 | 129.9±0.7 | 1.9±0.5 |
| V1/V2 | 72.4±9.1 | 6.4±1.6 |
| C2 | 99.9±1.1 | 6.4±0.8 |
| V3 | 34.1±0.3 | 0.7±0.4 |
| C3 | 53.1±1.0 | 2.6±0.6 |
| V4 | 26.0±4.3 | 3.6±1.2 |
| C4 | 40.0±0 | 1.3±0.5 |
| V5 | 10.6±2.8 | 1.7±0.7 |
| C5 | 42.0±0 | 0 |
| gp41 | 352.8±2.6 | 4.2±0.7 |
*The number of amino acid residues in an Env region was counted manually. PNLG sites were evaluated using N-Glycosite, and then counted manually. The means and standard deviations of the data calculated using 150 Env sequences derived from 15 individuals are shown.
Figure 3Changes in the number of amino acid residues in the gp120 regions.
The numbers of amino acid residues in the gp120 regions were counted manually for 10 amino acid sequences derived from an individual. The data obtained from 15 patient samples were then plotted together in chronological order of sampling, as indicated from 1 to 10. Env regions are denoted above the panels, whereas patient IDs and the status of treatment are denoted below the panels.
Figure 4Changes in the number of PNLG sites in the gp120 regions.
PNLG sites in the gp120 regions were evaluated using N-Glycosite, and the number of the sites was then counted manually for 10 amino acid sequences derived from an individual. The data obtained from15 patient samples were then plotted together in chronological order of sampling, as indicated from 1 to 10. Env regions are denoted above the panels, whereas patient IDs and the status of treatment are denoted below the panels.
Amino acid residues potentially under positive selection.*
| Env regions | Amino acid residue (dN/dS ratio, patient ID) |
| gp120 | |
| C1 | 28 (1.03, CR28), 31 (2.43, CR19), 60 (1.05, CR36), 84 (6.24, CR17),85 (1.14, CR14), 119 (1.46, CR17) |
| C2 | 212 (1.28, CR2), 240 (2.33, CR38), 261 (1.01, CR19), 263 (2.31, CR8),290 (1.28, CR19) |
| V3 | 310 (1.54, CR8), 315 (1.07, CR28), 320 (2.57, CR36) |
| C3 | 332 (2.32, CR19), 335 (1.47, CR11), 336 (1.42, CR11), 341 (1.44, CR8),346 (3.00, CR2; 3.20, CR8; 2.96, CR14; 2.30, CR38), 347 (1.23, CR38),349′ (1.28, CR17), |
| C4 | 421 (1.53, CR14), 439 (1.42, CR38), 440 (16.34, CR12), 442 (1.99, CR14; 1.50, CR29; 1.50, CR38), 447 (1.21, CR14), 450 (1.10, CR14), 454 (1.07, CR19) |
| C5 | ND |
| gp41 | 535 (1.61, CR38), 585 (1.78, CR36), 600 (1.03, CR8), 647 (1.01, CR17),658 (1.76, CR2), 718 (1.60, CR17), 776 (1.93, CR2), 804 (3.14, CR19),853 (1.35, CR17; 1.31, CR28) |
*dN/dS ratio was determined as described in Methods, and the amino acid residue at which dN/dS ratio was greater than 1 was considered to be potentially under positive selection.
**Amino acid numbering is according to that of HXB2 Env. dN/dS ratio and Patient ID are shown in parentheses.
***Inserted amino acid residue between position 349 and 350.
****Not detected.
Figure 5Amino acid residues potentially under positive selection in the C3 region of Env gp120.
The dN/dS ratio was estimated for each amino acid residue in the C3 region of gp120 among 10 amino acid sequences derived from an individual, as described in Methods. The data obtained from 15 patient samples were then plotted together. Amino acid numbering is based on the HXB2 Env gp120.
Association of positive selection and potential APOBEC3-related mutation.
| Env regions | Amino acid residue | Patient ID | G to A mutation |
| gp120 | |||
| C1 | 28 | CR25 | GG to AG x3 |
| 31 | CR19 | GA to AA | |
| 60 | CR36 | GG to AG | |
| 84 | CR17 | GA to AA | |
| 85 | CR8 | GG to AG | |
| C2 | 290 | CR19 | GA to AA x2, GT to AT |
| CR29 | GA to AA x2, GT to AT | ||
| C3 | 332 | CR11 | GA to AA x2 |
| CR19 | GT to AT | ||
| 335 | CR11 | GA to AA x2, GG to AG | |
| 346 | CR2 | GA to AA | |
| 347 | CR38 | GA to AA x3 | |
| 350 | CR14 | GG to AG, GA to AA | |
| CR17 | GG to AG x2 | ||
| 362 | CR19 | GA to AA | |
| C4 | 439 | CR19 | GT to AT |
| 440 | CR12 | GG to AG | |
| CR17 | GG to AG | ||
| 442 | CR29 | GA to AA | |
| CR38 | GA to AA | ||
| 454 | CR19 | GA to AA | |
| gp41 | 535 | CR38 | GA to AA x2 |
| 585 | CR36 | GT to AT x2 | |
| 600 | CR8 | GA to AA x3 | |
| 647 | CR17 | GG to AG | |
| 658 | CR2 | GA to AA x2 |
*Amino acid residues are numbered according to HXB2 Env protein.
**Mutations, GG to AG, GA to AA, GC to AC and GT to AT, at amino acid residues with dN/dS ratio >1 were manually detected.
***GG to AG mutation was sequentially detected 3 times among 10 Env amino acid sequences derived from an individual.