| Literature DB >> 28049415 |
Shirin Taghipour1, Peyman Zarrineh1, Mohammad Ganjtabesh2, Abbas Nowzari-Dalini1.
Abstract
BACKGROUND: Although different protein-protein physical interaction (PPI) datasets exist for Escherichia coli, no common methodology exists to integrate these datasets and extract reliable modules reflecting the existing biological process and protein complexes. Naïve Bayesian formula is the highly accepted method to integrate different PPI datasets into a single weighted PPI network, but detecting proper weights in such network is still a major problem.Entities:
Keywords: Biological networks; Functional networks; Network clustering; Protein complexes; Protein-protein interaction networks
Mesh:
Substances:
Year: 2017 PMID: 28049415 PMCID: PMC5209909 DOI: 10.1186/s12859-016-1422-x
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 4Predictions based on highly interacting modules. a, b Flagella signaling example. Predictions based on highly interacting modules. Module 30 consist of four chemotaxis signaling complexes: ribose/galactose/glucose sensing, dipeptide sensing, serine sensing, aspartate sensing. CheY, CheZ, CheR, CheB also interacts with either these complexes or CheA which is in the core of these complexes. c Pilus assembly example. In Module 324: FimC, FimD, FimG, and FimH are part of recently characterized Pilus assembly complex [30]
Fig. 1General protein complex prediction framework
Fig. 2Co-expression of gene pairs in each data source. Co-expression of interacting pairs in each data source is shown as green histogram, and co-expression of all pairs is highlighted as a pink background
Fig. 3Optimization by harmonic search. a The PPI network integration parameters were optimized in a way that detected modules show the highest similarity to the co-expressed modules. Normalized Mutual information (NMI) values were used as a measure to compare similarities at the detected module level and optimization were performed for 10000 iterations. b The NMI values of the best detected module set at each iteration was compared with co-functionality modules using Gene Ontology terms. Although the co-expression module set as functional modules to optimize the NMI values, still the NMI values would increase in most of the interactions when the PPI modules was compared with the co-functionality modules. c The NMI values of the best detected module set at each iteration was also compared with co-regulation modules. Likewise, NMI values would fairly increase in the majority of iterations
Average co-expression and detected weight for each data source
| Dataset | Number of | Average | Detected |
|---|---|---|---|
| interactions | coexpression | weight | |
| HU | 5993 | 0.092 | 0.686 |
| Ariffuzaman | 11447 | 0.033 | 0.015 |
| Butland | 6227 | 0.133 | 0.653 |
| IntAct | 14437 | 0.081 | 0.005 |
| BIND | 487 | 0.068 | 0.036 |
| DIP | 10758 | 0.105 | 1 |
| M. pneumoniae orthologous | 3303 | 0.339 | 0.418 |
| Co-complex | 5541 a | 0.448 | 1 |
aAll co-complex protein pairs have been considered