| Literature DB >> 20633292 |
David van Dijk1, Gokhan Ertaylan, Charles Ab Boucher, Peter Ma Sloot.
Abstract
BACKGROUND: The National Institute of Allergy and Infectious Diseases has launched the HIV-1 Human Protein Interaction Database in an effort to catalogue all published interactions between HIV-1 and human proteins. In order to systematically investigate these interactions functionally and dynamically, we have constructed an HIV-1 human protein interaction network. This network was analyzed for important proteins and processes that are specific for the HIV life-cycle. In order to expose viral strategies, network motif analysis was carried out showing reoccurring patterns in virus-host dynamics.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20633292 PMCID: PMC2913931 DOI: 10.1186/1752-0509-4-96
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Fourteen most frequent types of interactions between HIV and human proteins.
| interacts with | 575 | processed by | 99 |
| upregulates | 486 | regulated by | 99 |
| Binds | 411 | phosphorylated by | 65 |
| Activates | 365 | enhances | 62 |
| Inhibits | 270 | cleaves | 61 |
| downregulates | 262 | induces phosphorylation of | 53 |
| inhibited by | 188 | stimulates | 53 |
Figure 1HIV-Human protein interaction network. Nineteen HIV proteins that interact with 1452 human proteins through 3959 interactions. Blue nodes are human proteins and red nodes are HIV proteins. Visualization is done with the Cytoscape [54] software using the spring layout algorithm.
Figure 2A: Number of HIV-human interactions for each HIV protein, B: Normalized relative number of HIV-human interactions for each HIV protein. The y-axis enumerates the n-fold representation of interactions per HIV protein divided by the relative number of interactions of the respective protein in the total network. An n-fold representation of n > 1 shows an over-representation, whereas n < 1 signifies an under-representation.
Top ten highest connected HDFs, considering only HIV-HDF connections.
| ATMPK1 [GenBank: | mitogen-activated protein kinase 1 | 10 |
| IFNG [GenBank: | interferon, gamma | 9 |
| PRKCA [GenBank: | protein kinase C, alpha | 9 |
| MAPK3 [GenBank: | mitogen-activated protein kinase 3 isoform 1 | 9 |
| ACTB [GenBank: | beta actin | 8 |
| ACTG1 [GenBank: | actin, gamma 1 propeptide | 8 |
| HLA-A [GenBank: | major histocompatibility complex, class I, A precursor | 8 |
| CD4 [GenBank: | CD4 antigen precursor | 8 |
| IL10 [GenBank: | interleukin 10 precursor | 7 |
| IFNA1 [GenBank: | interferon, alpha 1 | 7 |
Figure 3HIV proteins interact with HIV dependency factors (HDFs) which in turn interact with human non-HDF proteins. Understanding HIV-host interaction requires the understanding of the HDF network and its position within the total human protein interaction network.
Figure 4Degree distributions of HDFs on a log-log scale. P(k) is the fraction of nodes with degree k. A: Only connections of HDFs to HIV proteins. B: Only HDF-HDF connections. Both distributions were fitted with a power law (P(k) = k-γ) with A: γ = 2.3, and B: γ = 2.3, showing the scale-free nature of both networks.
Mean values of centrality measures on HDFs and on proteins of the whole human protein interaction network, with standard deviations between brackets.
| degree | 16 (25) | 6 (11) | 2.42·10-97 |
| betweenness | 63·103 (19·104) | 17·103 (79·103) | 9.22·10-63 |
| eigenv. centr. | 0.049 (0.10) | 0.013 (0.04) | 3.08·10-87 |
is calculated from a one-sided Kolmogorov-Smirnov test with alternative hypothesis: HDF network > total human network regarding degree, betweenness and eigenvector centrality. In the HDF network each node has approximately three times more connections (16 in HDF vs 6 in the total human network), four times higher betweenness (63·103 in HDF vs 17·103 in human) and a four times higher eigenvector centrality score (0,049 in HDF vs 0,013 in human). The significant higher centrality of the HDF sub-network shows that it takes on a central position within the total human protein interaction network.
Set of proteins that are found to be hubs by both the degree and eigenvector centrality metrics.
| TP53 [GenBank: | tumor protein p53 | 93 | 1.00 |
| BRCA1 [GenBank: | breast cancer 1, early onset isoform 1 | 74 | 0.98 |
| ESR1 [GenBank: | estrogen receptor 1 | 59 | 0.83 |
| CREB1 [GenBank: | CREB binding protein isoform a | 58 | 0.81 |
| RB1 [GenBank: | retinoblastoma 1 | 58 | 0.72 |
| RELA [GenBank: | v-rel reticuloendotheliosis viral oncogene homolog A | 57 | 0.75 |
| SRC [GenBank: | proto-oncogene tyrosine-protein kinase SRC | 57 | 0.53 |
| TBP [GenBank: | TATA box binding protein | 56 | 0.61 |
| MYC [GenBank: | myc proto-oncogene protein | 55 | 0.67 |
| JUN [GenBank: | jun oncogene | 51 | 0.72 |
| EP300 [GenBank: | E1A binding protein p300 | 51 | 0.69 |
The top one percent of highest ranked proteins are shown here.
Top one percent of proteins that have the highest score from the betweenness centrality metric.
| TP53 [GenBank: | tumor protein p53 | 44050 |
| UBC [GenBank: | ubiquitin C | 36458 |
| GRB2 [GenBank: | growth factor receptor-bound protein 2 isoform 1 | 22792 |
| BRCA1 [GenBank: | breast cancer 1, early onset isoform 1 | 21622 |
| SRC [GenBank: | proto-oncogene tyrosine-protein kinase | 21568 |
| EGFR [GenBank: | epidermal growth factor receptor isoform a | 20472 |
| STAT3 [GenBank: | signal transducer and activator of transcription 3 isoform 1 | 18503 |
| ESR1 [GenBank: | estrogen receptor 1 | 18424 |
| RB1 [GenBank: | retinoblastoma 1 | 16777 |
| PIK3R1 [GenBank: | phosphoinositide-3-kinase, regulatory subunit, polypeptide 1 isoform 1 | 16048 |
| POLR2A [GenBank: | DNA directed RNA polymerase II polypeptide A | 15896 |
| MYC [GenBank: | myc proto-oncogene protein | 15608 |
| SP1 [GenBank: | Sp1 transcription factor | 14620 |
| RELA [GenBank: | v-rel reticuloendotheliosis viral oncogene homolog A | 14114 |
| SHC1 [GenBank: | Src homology 2 domain containing transforming protein 1 isoform p52Shc | 14011 |
Figure 5A: Local degree versus global degree, with an . B: Local versus global betweenness, with an R2 of 0.771. C: Local versus global eigenvector centrality, with an R2 of 0.942.
Set of proteins that are identified as central using both adjusted centrality metrics (degree and eigenvector centrality).
| TAF1 [GenBank: | TBP-associated factor 1 isoform 1 |
| ATF2 [GenBank: | activating transcription factor 2 |
| GTF2B [GenBank: | general transcription factor IIB |
| CCND1 [GenBank: | cyclin D1 |
| STAT1 [GenBank: | signal transducer and activator of transcription 1 isoform alpha |
| TBP [GenBank: | TATA box binding protein |
| CDKN1A [GenBank: | cyclin-dependent kinase inhibitor 1A |
| CEBPB [GenBank: | CCAAT/enhancer binding protein beta |
The top one percent of highest ranked proteins are shown here.
Top ten bottlenecks after normalization.
| PSMD6 [GenBank: | proteasome (prosome, macropain) 26S subunit, non-ATPase, 6 | 0.44 |
| PSMA2 [GenBank: | proteasome alpha 2 subunit | 0.25 |
| PSMD10 [GenBank: | proteasome 26S non-ATPase subunit 10 isoform 1 | 0.15 |
| DHX9 [GenBank: | DEAH (Asp-Glu-Ala-His) box polypeptide 9 | 0.08 |
| CD4 [GenBank: | CD4 antigen precursor | 0.07 |
| CD82 [GenBank: | CD82 antigen isoform 1 | 0.07 |
| IKBKE [GenBank: | IKK-related kinase epsilon | 0.06 |
| PTPRC [GenBank: | protein tyrosine phosphatase, receptor type, C isoform 1 precursor | 0.06 |
| A2M [GenBank: | alpha-2-macroglobulin precursor | 0.06 |
| CCR5 [GenBank: | chemokine (C-C motif) receptor 5 | 0.05 |
Figure 6Diagram representing the rewiring method used by the randomization algorithm. Two random edges are chosen and either the sources or the targets are switched with equal probability.
Figure 7Significantly over-represented network motifs in HIV-host protein interaction network. Black nodes are HIV proteins and white nodes are human proteins. Interactions can either be activations/up-regulations (+), inhibitions/down-regulations (-), activation/inhibition/regulation (±), or both (arrow without sign). Nis the number of specific motifs found. N± SD is the average number and standard deviation of the motif found in one thousand randomized networks. Pis the probability that Nor more motifs are found in the randomized networks. Zis the number of standard deviations Ndiffers from N. Network motifs were classified as significant when P< 0.02 and Z> 2.
Figure 8General types of motifs found in the HIV-human protein interaction network. Black nodes represent HIV proteins, white nodes represent human proteins.