| Literature DB >> 21544164 |
Gopichandran Sowmya, Gunasagaran Shamini, Sathyanarayanan Anita, Meena Sakharkar, Venkat Mathura, Hector Rodriguez, Andrew J Levine, Elyse Singer, Deborah Commins, Charurut Somboonwit, John T Sinnott, Harcharan S Sidhu, Ganapathy Rajaseger, Peter Natesan Pushparaj, Pandajarasamme Kangueane, Paul Shapshak.
Abstract
UNLABELLED: The human immunodeficiency virus type-1 (HIV-1) gp160 (gp120-gp41 complex) trimer envelope (ENV) protein is a potential vaccine candidate for HIV/AIDS. HIV-1 vaccine development has been problematic and charge polarity as well as sequence variation across clades may relate to the difficulties. Further obstacles are caused by sequence variation between blood and brain-derived sequences, since the brain is a separate compartment for HIV-1 infection. We utilize a threedimensional residue measure of solvent exposure, accessible surface area (ASA), which shows that major segments of gp120 and gp41 known structures are solvent exposed across clades. We demonstrate a large percent sequence polarity for solvent exposed residues in gp120 and gp41. The range of sequence polarity varies across clades, blood, and brain from different geographical locations. Regression analysis shows that blood and brain gp120 and gp41 percent sequence polarity range correlate with mean Shannon entropy. These results point to the use of protein modifications to enhance HIV-1 ENV vaccines across multiple clades, blood, and brain. It should be noted that we do not address the issue of protein glycosylation here; however, this is an important issue for vaccine design and development. ABBREVIATIONS: HIV-1 - human immunodeficiency virus type 1, AIDS - acquired immunodeficiency syndrome, ENV - envelope, gp160 - 160,000d glycoprotein, gp120 - 120,000d glycoprotein, gp41 - 41,000d glycoprotein, LANL - Los Alamos National Laboratories, PDB - Protein Data Bank, HVTN - STEP HIV vaccine trial, AA - amino acids, MSA - multiple sequence alignment, ASA - accessible surface area, SNPs- single nucleotide polymorphisms, HAART - Highly Active Antiretroviral Therapy, CCR5 - C-C chemokine receptor type 5, CNS - central nervous system, HIVE - HIV encephalitis, P - polarity, NP - non-polarity, CTL - cytotoxic T lymphocyte, NIAID - National Institute of Allergy and Infectious Diseases.Entities:
Keywords: ENV; HIV-1; Shannon entropy; accessible surface area; blood; brain; clades; compositional polarity; gp120; gp120-gp41 complex; gp160; gp41; trimer; vaccine
Year: 2011 PMID: 21544164 PMCID: PMC3082861 DOI: 10.6026/97320630006048
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1MSA of gp120 sequence dataset (Table 1) derived from known structures from the PDB database #x0005b;http://www.pdb.org/pdb/home]. The position specific mutations are in bold letters. The incomplete sequences in the alignment are indicated with dashes. The V1/V2, V3, V4 and V5 regions are marked. However, all the variable loops are not yet solved by X-ray crystallography in gp120 structures and hence their corresponding atomic coordinates are not available. These regions are indicated with incomplete sequences.
Figure 2MSA of gp41 sequence dataset (Table 2) derived from known structures from the PDB database. [http://www.pdb.org/pdb/home]. The position specific mutations are in bold letters. The incomplete sequences in the alignment are indicated with dashes. The gp41 structures shown consist of the N and C helices representing the N heptad repeat and C heptad repeat, respectively.
Figure 3Structural superimposition of (a) gp120, (b) gp120 trimer, (c) gp41, and (d) gp41 trimer. The gp120 and gp41 structures obtained from PDB [http://www.pdb.org/pdb/home] (Table 1 and Table 2) were superimposed using the software SPDBV (Swiss PDB Viewer version 3.7). [http://www.expasy.org/spdbv/]. The variable loops (V1-V5) and the constant regions (C1-C4) of the GP120 structure, with inner and outer domains are shown (a). The superimposition of the gp120 trimer (PDB ID: 3DNL), solved through NMR at 20.0Å is shown in (b). The N heptad and the C heptad regions of the superimposed gp41 structure, solved by X-ray crystallography with a resolution of 2.10Å, are shown in (c). Only the N heptad region of gp41 is available in trimeric form, as illustrated in (d).
Figure 4Residue position specific ASA distribution profiles for (a) gp120 and (b) gp41, (c) and (d) respective means, (e) and (f) respective standard deviations. The structural datasets were obtained from PDB [http://www.pdb.org/pdb/home] as shown in Tables 1 and 2. The ASA distribution was calculated using Surface racer [41].
Figure 5Percent sequence compositional polarity (%P) and non-polarity (%NP) of gp120 and gp41 sequences.These sequences are from HIV database sequences, the PDB structural database (a, b) [http://www.pdb.org/pdb/home] and the LANL sequence database (c, d) [http://www.hiv.lanl.gov/]. (Geographical information is not available from PDB in (a) and (b) and the A-K sequence geographical information in (c) and (d) from LANL are not shown). Additional sequence compositional polarity comparisons are from blood (e to w) and brain (x to zb). Clade and source geographical locations are stated (from LANL). The numbers in parenthesis represent the number of sequences. The ranges for %P and %NP are stated as well in each figure.
Figure 6Mean Shannon entropy and polarity range of gp120 and gp41 sequences in brain and blood samples from different geographical locations. Shannon entropy of gp120 and gp41 sequences is calculated at the LANL website [http://www.hiv.lanl.gov/]. The mean Shannon entropy is correlated with polarity range by regression coefficient (r) of 0.734 for gp120 (a) and 0.588 for gp41 (b) sequences. The Pearson correlation co-efficient was calculated using Microsoft EXCEL software.