| Literature DB >> 19405941 |
Taeyoung Hwang1, Taesung Park.
Abstract
BACKGROUND: Since high-throughput protein-protein interaction (PPI) data has recently become available for humans, there has been a growing interest in combining PPI data with other genome-wide data. In particular, the identification of phenotype-related PPI subnetworks using gene expression data has been of great concern. Successful integration for the identification of significant subnetworks requires the use of a search algorithm with a proper scoring method. Here we propose a multivariate analysis of variance (MANOVA)-based scoring method with a greedy search for identifying differentially expressed PPI subnetworks.Entities:
Mesh:
Year: 2009 PMID: 19405941 PMCID: PMC2696448 DOI: 10.1186/1471-2105-10-128
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
GO functional annotations for the ten most significant subnetworks (Serum Response data).
| Sub-networks | GO annotation | P-value (GO) |
| GNB1 GNG4 GNG2 GNAS SNX13 ADCY9 | hormone-mediated signaling | 0 |
| MCM7 MCM4 UBE3A MCM6 UBQLN2 | DNA replication initiation | 0 |
| MAP2K1 ARAF RPS6KA2 ASS MAPK1 KLF11 NEK2 MBP PRKCD | protein amino acid phosphorylation | 0 |
| CEBPA CDK2 ATF2 MAPK11 DUSP14 NR3C1 HNRPU BCL2 PCNA TGFB1I1 SMARCA2 | regulation of DNA replication | 0.000003 |
| DYNLL1 DNMT1 NOS1 TXNL5 BDKRB2 PCNA MSH3 RFC1 DCC1 CDK2 PARD3 CALM2 | regulation of DNA replication | 0.000004 |
GO terms are enriched for the identified subnetworks. P-values are the results of GO term enrichment analyses. The well-known biological processes related to wound healing are denoted by bold-faced letters.
GO functional annotations for the ten most significant subnetworks (Prostate Cancer Metastasis data).
| Sub-networks | GO annotation | P-value (GO) |
| AKT2 PIK3R1 MME NRAS PIP5K2A TUBG1 NFKB1 TRIP4 PIK3C2B VAV2 CEBPB PIK4CA EGR1 | phosphoinositide phosphorylation | 0 |
| ARF1 TMED2 TMED10 ARFGAP1 AP1G1 AFTIPHILIN PSCD2 | vesicle-mediated transport | 0 |
| CANX MBTPS1 ITGB1 CD46 LGALS3BP ITGAV ITGB1BP1 PXN SERP1 ITGA8 | cell-matrix adhesion | 0 |
| CAV1 PTPRF JUP ITGB4 COL17A1 NID1 LGALS3BP IRS1 FRS2 EDG1 PLEC1 APP TGFB1 CALR APPBP1 | cell adhesion | 0 |
GO terms are enriched for identified sub-networks. P-values are the results of GO term enrichment analyses. The well-known biological processes related to prostate cancer metastasis are denoted by bold-faced letters.
The number of significant networks identified by each scoring method.
| Dataset | The number of seed proteins | TΣ | Mutual Information | MANOVA |
| Serum Response | 154 | 129 | 100 | 86 |
| Prostate Cancer Metastasis | 140 | 124 | 92 | 83 |
Figure 1The distribution of the number of proteins in a subnetwork. (a) Serum response data. (b) Prostate cancer metastasis data.
Figure 2The box plots of correlation coefficients between seeds and proteins in identified subnetworks. (a) Serum response data. (b) Prostate cancer metastasis data. The correlation coefficients are absolute values.
The cumulative distribution of the number of proteins in the subnetworks (Serum response data).
| correlation coefficient | TΣ | Mutual Information | MANOVA |
| ≥ 0.5 | 36.14 | 41.28 | |
| ≥ 0.6 | 20.08 | 25.1 | |
| ≥ 0.7 | 10.14 | 17.87 | |
| ≥ 0.8 | 2.49 | 2.93 | |
| ≥ 0.9 | 0 | 0 | 0 |
The numbers denote the percentages of the number of proteins whose correlation coefficients are above the specific values in the subnetworks. The correlation coefficients are absolute values. The largest number in each row is denoted by bold-faced letters.
The cumulative distribution of the number of proteins in the subnetworks (Prostate cancer data).
| correlation | TΣ | Mutual Information | MANOVA |
| ≥ 0.1 | 56.75 | 59.08 | |
| ≥ 0.2 | 28.55 | 33.34 | |
| ≥ 0.3 | 16.44 | 16.95 | |
| ≥ 0.4 | 4.85 | 6.56 | |
| ≥ 0.5 | 1.39 | 1.35 | |
| ≥ 0.6 | 0.7 | 0.84 |
The numbers denote the percentages of the numbers of proteins whose correlation coefficients are above the specific values in the subnetworks. The correlation coefficients are absolute values. The largest number in each row is denoted by bold-faced letters.
Figure 3The structures of the PPI network for the simulation study. (a) An overall PPI network for the simulation study. (b) Five types of target subnetworks. The red nodes represent the seeds. The five types of network structures in (b) are assumed to be target networks for the PPI network (a).
Sensitivity and specificity of each method in the simulation study (ρ = 0.8, ρ' = 0.4).
| Sensitivity | Specificity | |||||
| TΣ | Mutual Information | MANOVA | TΣ | Mutual Information | MANOVA | |
| Type I | 0.36 | 0.27 | 0.56 | 0.66 | 0.69 | 0.5 |
| Type II | 0.07 | 0.04 | 0.18 | 0.9 | 0.94 | 0.79 |
| Type III | 0.01 | 0.01 | 0.07 | 0.98 | 1 | 0.87 |
| Type IV | 0.08 | 0.02 | 0.15 | 0.92 | 0.97 | 0.82 |
| Type V | 0.02 | 0.01 | 0.1 | 0.98 | 1 | 0.92 |
Sensitivity and specificity of each method in the simulation study (ρ = 0.8, ρ' = 0.1).
| Sensitivity | Specificity | |||||
| TΣ | Mutual Information | MANOVA | TΣ | Mutual Information | MANOVA | |
| Type I | 0.41 | 0.41 | 0.57 | 0.6 | 0.78 | 0.47 |
| Type II | 0.07 | 0.02 | 0.14 | 0.83 | 0.96 | 0.76 |
| Type III | 0.02 | 0.01 | 0.09 | 0.98 | 0.97 | 0.91 |
| Type IV | 0.08 | 0 | 0.24 | 0.87 | 0.98 | 0.77 |
| Type V | 0.02 | 0.01 | 0.04 | 0.94 | 1 | 0.92 |