| Literature DB >> 22536894 |
Boon-Siew Seah1, Sourav S Bhowmick, C Forbes Dewey, Hanry Yu.
Abstract
BACKGROUND: The availability of large-scale curated protein interaction datasets has given rise to the opportunity to investigate higher level organization and modularity within the protein interaction network (PPI) using graph theoretic analysis. Despite the recent progress, systems level analysis of PPIS remains a daunting task as it is challenging to make sense out of the deluge of high-dimensional interaction data. Specifically, techniques that automatically abstract and summarize PPIS at multiple resolutions to provide high level views of its functional landscape are still lacking. We present a novel data-driven and generic algorithm called FUSE (Functional Summary Generator) that generates functional maps of a PPI at different levels of organization, from broad process-process level interactions to in-depth complex-complex level interactions, through a pro t maximization approach that exploits Minimum Description Length (MDL) principle to maximize information gain of the summary graph while satisfying the level of detail constraint.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22536894 PMCID: PMC3402926 DOI: 10.1186/1471-2105-13-S3-S10
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Functional summary (FSG) of the AD network for k = 30 (cluster size indicated in brackets).
Figure 2Functional summary (FSG) of the AD network for k = 10 (cluster size indicated in brackets).
Figure 3Illustration of FUSE algorithm. (a) A toy example of PPI network. (b) A set of functional clusters of the network in (a). (c) Suppose a 3-node summary is required (k = 3). FUSE explores the functional clusters of the PPI network to identify the 3-node functional summary that best partition and represent the underlying network. This functional summary graph (FSG) depicts the functional landscape of the PPI network in 3 nodes. (d) A 5-node partition (k = 5) and its corresponding FSG.
Summary of datasets used
| Dataset | #nodes | #edges | Source |
|---|---|---|---|
| 9181 | 34624 | HPRD [ | |
| 4768 | 177299 | IntAct [ | |
| 3114 | 6472 | IntAct | |
| Alzheimer's disease (AD) | 177 | 1038 | IntAct |
Figure 4Cluster quality of FUSE vs graph clustering-based approaches.
Figure 5Function representativeness.
Figure 6Effect of k.
Figure 7Effect of b and d.
Figure 8Running times of FUSE (in sec.).
Figure 9Scalability of FUSE.
Figure 10Connectivity of functional clusters in H. sapiens network. Functional cluster degree CDF plots for BP and CC summaries at varying cluster granularity. Plots are on a semi-log scale.
High-degree CC and BP functional clusters in the H. sapiens summary (k = 400)
| CC functional cluster | Degree | BP functional cluster | Degree |
|---|---|---|---|
| Heterogeneous nuclear ribonucleoprotein complex | 183 | Actin filament bundle assembly | 208 |
| Cytosolic large ribosomal subunit | 161 | Regulation of defense response to virus by virus | 206 |
| Cytosolic small ribosomal subunit | 158 | Negative regulation of catabolic process | 204 |
| Coated pit | 158 | Peptidyl-threonine phosphorylation | 200 |
| Mitochondrial nucleoid | 149 | Signal complex assembly | 189 |
| Chaperonin-containing T-complex | 148 | Positive regulation of protein complex assembly | 182 |
| CRD-mediated mRNA stability complex | 141 | Regulation of nitric oxide biosynthetic process | 181 |
| NuA4 histone acetyltransferase complex | 136 | Glial cell development | 178 |
| Actin filament | 135 | Cell killing | 178 |
| Actomyosin | 134 | Regulation of cytokine-mediated signaling pathway | 174 |
| Clathrin coat of coated pit | 133 | Protein stabilization | 174 |
| Nonhomologous end joining complex | 124 | Actin filament capping | 170 |
| Endocytic vesicle membrane | 124 | Activation of MAPKK activity | 169 |
| Nucleosome | 124 | T cell receptor signaling pathway | 164 |
| Nuclear inner membrane | 123 | Regulation of RNA splicing | 164 |