| Literature DB >> 23710435 |
Wenhong Tian1, Nagiza F Samatova.
Abstract
A number of tools for the alignment of protein-protein interaction (PPI) networks have laid the foundation for PPI network analysis. Most of alignment tools focus on finding conserved interaction regions across the PPI networks through either local or global mapping of similar sequences. Researchers are still trying to improve the speed, scalability, and accuracy of network alignment. In view of this, we introduce a connected-components based fast algorithm, HopeMap, for network alignment. Observing that the size of true orthologs across species is small comparing to the total number of proteins in all species, we take a different approach based on a precompiled list of homologs identified by KO terms. Applying this approach to S. cerevisiae (yeast) and D. melanogaster (fly), E. coli K12 and S. typhimurium, E. coli K12 and C. crescenttus, we analyze all clusters identified in the alignment. The results are evaluated through up-to-date known gene annotations, gene ontology (GO), and KEGG ortholog groups (KO). Comparing to existing tools, our approach is fast with linear computational cost, highly accurate in terms of KO and GO terms specificity and sensitivity, and can be extended to multiple alignments easily.Entities:
Year: 2013 PMID: 23710435 PMCID: PMC3654364 DOI: 10.1155/2013/670623
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Figure 1global visualization of protein interaction networks (from [15]).
Figure 23D alignment guided by physical interaction (interactome), sequence similarity (genome), and functional coherence (functome) of proteins in the network.
A summary of classification of different network alignment tools.
| Tools | Local | Global | Pairwise | Guided models |
|---|---|---|---|---|
| PathBLAST | × | × | Evolution | |
| NetworkBLAST | × | Evolution | ||
| NetworkBLAST-M | × | Evolution | ||
| MaWish | × | × | Duplication/divergence | |
| Graemlin 1.0 | × | General evolution | ||
| Graemlin 2.0 | × | Evolution, duplication, and so forth | ||
| Isorank | × | × | Evolution | |
| Isorank-M | × | Evolution | ||
| HopeMap | × | × | Evolution, function |
Note*. For multiple alignment of HopeMap, only results of yeast/fly/worm are provided in this paper. We are preparing more results in ongoing research. In HopeMap, the global alignment concept is same as in Isorank and Graemlin 2.0.
Figure 3Building an alignment graph.
Figure 4Three connected components in a graph.
Figure 5HopeMap network alignment in five steps.
The PPI networks analyzed in this paper.
| Species (tax id, short name) | Number of proteins | Number of PPI | Source |
|---|---|---|---|
|
| 4121 | 216426 | SNDB [ |
|
| 4239 | 94609 | SNDB [ |
|
| 3365 | 40524 | SNDB [ |
| Yeast (4932, sce) | 4738 | 15417 | DIP [ |
| Fly (7227, dme) | 7165 | 23484 | DIP [ |
| Worm (6239, cel) | 3029 | 3393 | DIP [ |
A comparison of NetworkBLAST and HopeMap for yeast/fly.
| Methods | Specificity (%) in GO | Specificity (%) in KO | Specificity (%) in INPARANOID | Number of GO categories enriched | Total conserved regions | Unique proteins in alignment graph |
|---|---|---|---|---|---|---|
| 1. NetworkBLAST [ | ||||||
|
| 94.87 | N/A | 100 | 54 | 117 | 348 |
|
| 84.62 | N/A | 100 | 43 | 117 | 256 |
| 2. HopeMap-bestPairs | ||||||
|
| 97.18 | N/A | 100 | 51 | 71 | 1314 |
|
| 76.06 | N/A | 100 | 34 | 71 | 1314 |
| 3. HopeMap—data from 1 | ||||||
|
| 98.73 | N/A | 100 | 54 | 79 | 1645 |
|
| 78.48 | N/A | 100 | 39 | 79 | 1913 |
| 4. HopeMap-KO | ||||||
|
| 100.00 | 100 | N/A | 20 | 26 | 747 |
|
| 92.31 | 100 | N/A | 19 | 26 | 753 |
Specificity and sensitivity results for eco/stm and eco/ccr from HopeMap.
| Species | Specificity (%) in GO | Specificity in KO (%) | Number of GO categories enriched ( | Total conserved regions | Number of unique proteins in alignment graph |
|---|---|---|---|---|---|
|
| 100 | 100 | 49 | 58 | 2085 |
|
| 96.55 | 100 | 46 | 58 | 2183 |
|
| 95.24 | 100 | 37 | 42 | 1069 |
|
| 90.48 | 100 | 31 | 42 | 1138 |
Specificity comparison in terms of KO groups for different alignment tools.
| Tools |
|
|
| |||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| GrG | 0.86 | 0.86 | 0.72 | 0.72 | 0.68 | 0.68 |
| ISO | 0.91 | 0.91 | 0.65 | 0.65 | 0.63 | 0.63 |
| Gr2.0 | 0.96 | 0.96 | 0.78 | 0.78 | 0.73 | 0.73 |
| HopeMap | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 |
Sensitivity comparison in terms of KO groups.
| Tools |
|
|
| |||
|---|---|---|---|---|---|---|
|
| Tot |
| Tot |
| Tot | |
| GrG | 1496 | 720 | 384 | |||
| ISO | 2026 | 1014 | 534 | |||
| Gr2.0 | 2024 | 1012 | 637 | |||
| HopeMap | 2159 | 3151 | 1061 | 1365 | 768 | 1664 |
A comparison of networkBLAST, NetworkBLAST-M, and HopeMap for yeast/fly/worm.
| Species | Specificity (%) in GO | Number of GO categories enriched ( | Total conserved regions |
|---|---|---|---|
| NetworkBLAST | |||
|
| 100 | 14 | 59 |
|
| 88 | 13 | 59 |
|
| 94.9 | 16 | 59 |
| NetBLast-M restriceted | |||
|
| 100 | 29 | 64 |
|
| 68.8 | 32 | 64 |
|
| 98.4 | 37 | 64 |
| NetBLASt-M relaxed order | |||
|
| 94.6 | 45 | 92 |
|
| 67 | 29 | 92 |
|
| 90.1 | 41 | 92 |
| HopeMap-KO | |||
|
| 100 | 15 | 18 |
|
| 80 | 5 | 18 |
|
| 88.89 | 14 | 18 |
Specificity in MIPS for homologues cluster of yeast/fly in HopeMap.
| Type | Note | |
|---|---|---|
| Homolog clusters | 71 | Results of HopeMap-bestPairs in |
| MIPS complexes | 70% | percent of clusters in the same complexes of MIPS for yeast (not available for fly) |
| MIPS FunCats | 86% | Percent of clusters in the same FunCats, using 28 main levels of FunCats in MIPS for Yeast |
| MIPS FunCats | 76% | Percent of clusters in the same FunCats using 28 main levels of FunCats in MIPS for fly and GO2MIPS conversion (excluding 30 proteins in fly without available FunCats) |
The run-time results for pairwise alignment.
| Number of species | Number of proteins | Number of PPI edges | Number of nodes |
Runtime (sec) |
|---|---|---|---|---|
| 2 | 8360 | 311035 | 3151 | 35.21 |