| Literature DB >> 22165958 |
Venkata-Swamy Martha1, Zhichao Liu, Li Guo, Zhenqiang Su, Yanbin Ye, Hong Fang, Don Ding, Weida Tong, Xiaowei Xu.
Abstract
BACKGROUND: Protein-protein interactions (PPIs) are a critical component for many underlying biological processes. A PPI network can provide insight into the mechanisms of these processes, as well as the relationships among different proteins and toxicants that are potentially involved in the processes. There are many PPI databases publicly available, each with a specific focus. The challenge is how to effectively combine their contents to generate a robust and biologically relevant PPI network.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22165958 PMCID: PMC3236850 DOI: 10.1186/1471-2105-12-S10-S7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Information for the seven public PPI databases
| Databases | Number of proteins | Number of interactions | Websites |
|---|---|---|---|
| BioGRID | 8204 | 33625 | |
| DIP | 1137 | 1509 | |
| HPRD | 9553 | 38802 | |
| IntAct | 7495 | 30965 | |
| MINT | 5230 | 15353 | |
| REATOME | 3599 | 74490 | |
| SPIKE | 6927 | 23224 |
Figure 1Network modeling and evaluation flowchart PPI data are taken from seven preprocessed public PPI databases and used to create seven integrated networks using the k-vote method (A). SCAN is used to generate functional modules for each of these integrated networks. (B). Statistical and pathway analyses are performed on these functional modules to assess the networks (C).
Figure 2Optimality measures for the seven consensus networks Figure 2A shows the four optimality measures for Ĝ2: modularity, similarity-based modularity, clustering score, and enrichment score. Figure 2B shows the same measures for the other 6 consensus networks. The value of optimality measures and the corresponding ε values are plotted on the y-axes and x-axes, respectively.
Presences of Optimal Quality Measures
| #Nodes | #Edges | Presence of Optimal Modularity | Presence of Optimal Similarity-based Modularity | Presence of Optimal Clustering Score | Presence of Optimal Enrichment | |
|---|---|---|---|---|---|---|
| 12043 | 132603 | No | No | No | ||
| 9188 | 36086 | No | ||||
| 6464 | 17222 | No | No | |||
| 4209 | 8108 | No | No | |||
| 2286 | 3619 | No | No | |||
| 345 | 302 | No | No | No | No | |
| 59 | 40 | No | No | No | No | |
Top ten modules with significant biological enrichment in KEGG
| Cluster ID | KEGG Pathway | Total number of proteins in the module | Number of proteins in the KEGG Pathway from the module | Total number of proteins in the KEGG Pathway | Fisher’s p-value |
|---|---|---|---|---|---|
| 1 | RNA polymerase / Transcription / Genetic Information Processing | 10 | 10 | 29 | 5.10E-24 |
| 2 | Progesterone-mediated oocyte maturation / Endocrine System / Cellular Processes | 12 | 12 | 86 | 1.91E-22 |
| 3 | Proteasome / Folding_ Sorting and Degradation / Genetic Information Processing | 17 | 12 | 48 | 5.22E-22 |
| 4 | Basal transcription factors / Transcription / Genetic Information Processing | 9 | 9 | 36 | 1.24E-20 |
| 5 | Cell cycle / Cell Growth and Death / Cellular Processes | 12 | 12 | 128 | 2.97E-20 |
| 6 | Ubiquitin mediated proteolysis / Folding_ Sorting and Degradation / Genetic Information Processing | 12 | 12 | 138 | 7.61E-20 |
| 7 | Cell cycle / Cell Growth and Death / Cellular Processes | 13 | 12 | 128 | 3.78E-19 |
| 8 | Pyrimidine metabolism / Nucleotide Metabolism / Metabolism | 10 | 10 | 98 | 3.57E-18 |
| 9 | Oocyte meiosis / Cell Growth and Death / Cellular Processes | 12 | 11 | 114 | 4.09E-18 |
| 10 | RNA degradation / Folding_ Sorting and Degradation / Genetic Information Processing | 11 | 9 | 59 | 8.97E-17 |
*the highest level to the lowest level of KEGG pathways (maptitle / subcategory / category) are show