| Literature DB >> 19828041 |
Sinan Erten1, Xin Li, Gurkan Bebek, Jing Li, Mehmet Koyutürk.
Abstract
BACKGROUND: In systems biology, comparative analyses of molecular interactions across diverse species indicate that conservation and divergence of networks can be used to understand functional evolution from a systems perspective. A key characteristic of these networks is their modularity, which contributes significantly to their robustness, as well as adaptability. Consequently, analysis of modular network structures from a phylogenetic perspective may be useful in understanding the emergence, conservation, and diversification of functional modularity.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19828041 PMCID: PMC2770073 DOI: 10.1186/1471-2105-10-333
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Modularity Based Phylogenetic Analysis of Molecular Interaction Networks.
Comparison of performances of MOPHY with using random protein modules, using protein similarities only, using random homologous protein partners and RDL.
| M | 5.28 | 5.85 | 8.82 | 5.82 | 8.15 | 6.79 |
| RDL | 14.40 | 15.50 | 19.80 | 13.12 | 14.83 | 15.53 |
| Random Modules | 14.81 | 9.33 | 11.54 | 8.11 | 12.6 | 11.29 |
| Only Protein Similarity | 11.72 | 11.56 | 13.96 | 11.22 | 10.75 | 11.84 |
| Random Homolog Selection | 15.01 | 13.60 | 17.73 | 22.72 | 17.61 | 17.33 |
For the simulated networks, we compare the performance of MOPHY with the random module method, random homolog selection method, RDL and a method that only uses protein similarity in the networks. Nodal distance for five simulation instances as well as the average values of these five runs are shown in the table. For MOPHY, the result used is the best performance achieved with coverage 0.60 and diameter 2, by using the most specific modules. Similarly for the random module method, the best performance is achieved for the instance with coverage 0.40 and diameter 4, with the most comprehensive modules. For the random homolog selection method, the best result is achieved with coverage 0.60 and diameter 3, by using the most specific modules. As clearly seen, MOPHY outperforms the other methods in terms of capturing the evolutionary distances between species.
Performance of MOPHY in capturing the topology of underlying phylogeny for simulated networks.
| 20% | 1.6** | 11.2 | 0.0039 | 1.6** | 11.2 | 0.0039 | 1.6** | 12.0 | 0.0029 |
| 40% | 1.6** | 12.0 | 0.0013 | 1.6** | 10.8 | 0.0093 | 1.6** | 12.0 | 0.0014 |
| 60% | 1.6** | 11.2 | 0.0019 | 1.6** | 11.6 | 0.0039 | 1.6** | 11.6 | 0.0021 |
| 20% | 2.4** | 11.6 | 0.0048 | 2.8** | 10.8 | 0.0088 | 4.4* | 10.8 | 0.0121 |
| 40% | 2.8** | 12.0 | 0.0036 | 2.8** | 10.8 | 0.0032 | 4.4* | 10.4 | 0.0179 |
| 60% | 1.6** | 10.8 | 0.0062 | 2.4** | 11.2 | 0.0029 | 3.6** | 10.0 | 0.0054 |
Performance of MOPHY in capturing the topology of underlying phylogeny for simulated networks. For each parameter setting, the symmetric distance between the underlying tree and the tree reconstructed by MOPHY/randomized method is shown. Reported values are averages over five runs. p-values indicate the statistical significance of the performance difference between MOPHY and the randomized method. **: p < 0.01, *: p < 0.05.
Performance of MOPHY in capturing the underlying evolutionary distances for simulated networks.
| 20% | 6.87** | 16.40 | 0.0020 | 6.84** | 15.85 | 0.0017 | 6.97** | 15.78 | 0.0019 |
| 40% | 6.81** | 16.14 | 0.0017 | 6.86** | 15.85 | 0.0017 | 7.01** | 15.53 | 0.0029 |
| 60% | 6.79** | 15.85 | 0.0017 | 6.86** | 15.41 | 0.0016 | 7.02** | 15.25 | 0.0026 |
| 20% | 8.89 | 11.72 | 0.2283 | 9.67 | 11.76 | 0.3512 | 10.83 | 11.82 | 0.6277 |
| 40% | 7.62* | 13.12 | 0.0277 | 8.44 | 11.61 | 0.2029 | 9.63 | 11.29 | 0.4922 |
| 60% | 6.70** | 14.93 | 0.0018 | 7.92 | 12.90 | 0.0529 | 8.96 | 11.51 | 0.3263 |
For each parameter setting, the nodal distance between the underlying tree and the tree reconstructed by MOPHY/randomized method is shown.
Figure 2Effect of Coverage and Noise on Performance. (a), (b): Performance of MOPHY in capturing the underlying evolutionary distances between simulated networks with respect to coverage (fraction of modules that are used in phylogeny reconstruction). (a) Most specific modules, (b) most comprehensive modules. (c): The effect of noise and missing interactions on the performance of MOPHY. Even when the data is perturbed with 50% noise, MOPHY's accuracy in reconstructing the phylogeny is statistically significant. If the missing interactions are correlated across species, then the effect of missing data is comparable to that of random noise. On the other hand, if they are uncorrelated, then the performance of MOPHY is degraded more quickly, which is expected since the networks break apart in different ways in different species.
Figure 3Comparison of the performances of M. (a) Tree based on genome sequences [36], (b) tree reconstructed by MOPHY, (c) tree reconstructed by RDL [16], (d) tree reconstucted by using protein similarities only, (e) tree reconstructed by using random group of proteins as modules, (f) tree reconstructed by Random Homolog Selection method.