| Literature DB >> 18831795 |
Gerard Kian-Meng Goh1, A Keith Dunker, Vladimir N Uversky.
Abstract
To examine the usefulness of protein disorder predictions as a tool for the comparative analysis of viral proteins, a relational database has been constructed. The database includes proteins from influenza A and HIV-related viruses. Annotations include viral protein sequence, disorder prediction, structure, and function. Location of each protein within a virion, if known, is also denoted. Our analysis reveals a clear relationship between proximity to the RNA core and the percentage of predicted disordered residues for a set of influenza A virus proteins. Neuraminidases (NA) and hemagglutinin (HA) of major influenza A pandemics tend to pair in such a way that both proteins tend to be either ordered-ordered or disordered-disordered by prediction. This may be the result of these proteins evolving from being lipid-associated. High abundance of intrinsic disorder in envelope and matrix proteins from HIV-related viruses likely represents a mechanism where HIV virions can escape immune response despite the availability of antibodies for the HIV-related proteins. This exercise provides an example showing how the combined use of intrinsic disorder predictions and relational databases provides an improved understanding of the functional and structural behaviour of viral proteins.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18831795 PMCID: PMC2559894 DOI: 10.1186/1471-2164-9-S2-S4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Model structures of the influenza A (A) and HIV-1 (B) virions.
HIV and influenza viral proteins. Some of the most important HIV and influenza viral proteins are listed with a brief description of their functions. They are listed by the order of their location in the corresponding virions [7-10].
| Major HIV proteins | ||
| Protein | Function of the Protein | Location |
| SU(gp120) | Binding of host's CD4 to itself | Virion Surface(SU) |
| TM (gp41) | Involved in fusion with host | Transmembrane (TM) at the Virion Envelope |
| MA (p17) | Matrix | Beneath the Enevelope |
| Vpua | Virion release | |
| Vpr | Nucleus transportation of viral proteins | Within capsid and beneath matrix |
| CA (p24) | Main core protein | Nucleocapsid |
| PR (p11) | Protease. | Within the Core |
| RT (p66) | Reverse Transcriptase | Within th Core |
| IN (p32) | Integrase. | Within the Core |
| Tata | Transcription factor | NAa |
| Major influenza A virus proteins | ||
| Protein | Function/Description | Location |
| HA | Hemagglutinin allows the attachhment of host's CD4 to itself | Envelope |
| NA | Neuraminidase cleaves sialic acud group to allow virion release into the extracellular region. | Transmembrane (TM) at the Virion Envelope |
| MA (M1, M2) | Protein assembly with membrane binding and disassociation | Matrix: Beneath the Enevelope |
| NS1 | Non-Structural Protein. Inhibits RNA splicing. RNA binding | Within capsid and beneath matrix |
| NS2 | Transportation of RNPb to cytoplasm | |
| NP | Nucleoprotein | Nucleocapsid |
| PB1 | Main core protein | Binds to Nucleocapsid |
a Positions of VPU and Tat in the virion is unclear
b Ribonucleoprotein (RNP)
Abundance of intrinsic disorder in various datasets.
| Protein | % Predicted Disordered X-raya | % Predicted Disordered NMR | % Predicted Disordered |
| PDBS90 | 24 ± 2 (14 ± 2) | 34 ± 2 (32 ± 1) | 24 ± 2 (15 ± 1) |
| HIV | 27 ± 2 (16 ± 1) | 50 ± 3 (41 ± 3) | 34 ± 2 (19 ± 2) |
| Influenza | 21 ± 2 (10 ± 2) | 34 ± 3 (40 ± 3) | 21 ± 2 (11 ± 2) |
aPercentage of residues that are predicted to be disordered that have been characterized by X-ray crystallography.
Table presents means of predicted disorder by PONDR® VLXT. The corresponding PONDR® VL3 are shown in brackets. The total number of proteins used in this analysis was 34 for lentivirus, while the number of influenza A analyzed proteins is 27.
Summary of the predicted disorder rates in HIV proteins.
| Protein | Accession numbers | %VLXT(VL3)a | Methodsb | Virus Type |
| SU | 11 (0) | X-Ray | HIV-1 | |
| (gp120) | ||||
| TM | 34 (27) | X-Ray | HIV-1 | |
| (gp41) TM | 64 (0) | X-Ray | Visna | |
| MA (p17) | 61 (48) | X-Ray | HIV-1 | |
| VPR | 39 (63) | NMR | HIV-1 | |
| VPUc | 26 (NA)d | NMR | HIV-1 | |
| CA (p24) | 48 (0) | X-Ray | HIV-1 | |
| PR (p11)e | 18 (0) | X-Ray | SIV | |
| RT (p66)e | 18 (10) | X-Ray | HIV-1 | |
| IN (p32)e | 26 (7) | X-Ray | HIV-1 | |
| Nefc | 26 (31) | X-Ray | HIV-1 | |
| Tatc | 100 (100) | NMR | HIV-1 |
aPercentage of residues in a chain that are predicted to be disordered.
bMethods used to characterize the proteins
cLocations of these proteins in virions have yet been determined. VPU is however, placed alongside with VPR for comparative purposes only.
dChain available is too short for VL3 analysis.
eExact locations of these proteins have yet been determined.
Summary of the predicted disorder rates in influenza viral protein.
| Protein | Accessions | %VLXT(VL3)a | Method |
| HA1 (H1) | 12 (0) | X-ray | |
| HA2 (H1) | 12 (0) | ||
| HA1 (H3) | 25 (0) | X-ray | |
| HA2 (H3) | 19 (2) | ||
| NA (N1) | 8 (0) | X-ray | |
| M1 | 25 (0) | X-ray | |
| NS1 | 69 (63) | X-ray | |
| NS2 | 60 (23) | X-ray | |
| PB1 | 47 (NA) | NMR | |
| NP | 44 (0) | X-ray |
aThe percentage of residues that are predicted ordered by VLXT. The parenthesized values are those in VL3.
Predicted intrinsic disorder in surface proteins of influenza virus.
| Disorder prediction in various NA subtypes | ||||
| Protein | Accessions | %VLXT(VL3) | Method | |
| N1 | 8 (0) | X-Ray | ||
| N2 | 25 (13) | X-Ray | ||
| N4 | 4 (0) | X-Ray | ||
| N6 | 15 (27) | X-Ray | ||
| N8 | 4 (48) | X-Ray | ||
| N9 | 15 | X-Ray | ||
| Disorder predictions in various HA subtypes | ||||
| Protein | Subunit | Accessions | %VLXT(VL3) | |
| H1 | HA1 | 12 (0) | X-Ray | |
| HA2 | 12 (0) | |||
| H3 | HA1 | 25 (0) | X-Ray | |
| HA2 | 19 (1) | |||
| H5 | HA1 | 12 (2) | X-RAY | |
| HA2 | 13 (0) | |||
| H7 | HA1 | 22 (11) | X-Ray | |
| HA2 | 30 (0) | |||
Summary of protein types generally predicted to be ordered or disordered. Information is deduced from the literature data mainly reported in [23-26].
| Protein | Observed/Predicted Disorder |
| RNA Binding Proteins | Highly Disordered |
| DNA Binding Protein | Disordered |
| Enzymes (e.g. Proteases, Ribonucleases) | Ordered |
| Non-Multiply Spanning Membrane Proteins | Disordered |
| Transmembrane Proteins (e.g Pores) | Ordered |
Figure 2Chart summarizing the percentages of residues predicted to be disordered in HIV-1 (A) and the influenza A virus proteins (B). Top of each panel represents chart of proteins with percentages of residues that are predicted to be disordered by PONDR® VLXT. The types of proteins are also summarized. It should be noted that the percentages of predicted to be disordered residues of HA vary with subtypes, but HA is essentially predicted to be quite ordered. More information is available in Table 5.
Observed paring of predicted disorder of HA-NA in subtypes involved in major epidemics. Quantitative details of the respective predicted disorder values can be found in Table 5.
| Subtype | Description | HA (Predicted Disorder/Order) | NA (Predicted Disorder/Order) |
| H1N1 | "Spanish Flu" (1918) | Ordered | Ordered |
| H2N2 | "Asian Flu" (1957) | NAa | More Disordered |
| H3N2 | "Hong Kong Flu" (1968) | More Disordered | More Disordered |
| H5N1 | "Avian Flu" (1997) | Ordered | Ordered |
aPDB structure of H2 has not been found, presumably, because the H2N2 subtype evolved quickly into H3N2.
Figure 3An entity-relationship (ER) diagram of the viral protein database. Each entity contains attributes. The type of relationships is shown by the connector and/or the black circle. For example, a black circle followed by dotted line indicates a many to zero or one relationship. Whereas, the black circle followed by a solid line indicates a many to one relationship [76].