| Literature DB >> 24266981 |
Jialiang Yang, Jun Li, Stefan Grünewald, Xiu-Feng Wan.
Abstract
The advances in high throughput omics technologies have made it possible to characterize molecular interactions within and across various species. Alignments and comparison of molecular networks across species will help detect orthologs and conserved functional modules and provide insights on the evolutionary relationships of the compared species. However, such analyses are not trivial due to the complexity of network and high computational cost. Here we develop a mixture of global and local algorithm, BinAligner, for network alignments. Based on the hypotheses that the similarity between two vertices across networks would be context dependent and that the information from the edges and the structures of subnetworks can be more informative than vertices alone, two scoring schema, 1-neighborhood subnetwork and graphlet, were introduced to derive the scoring matrices between networks, besides the commonly used scoring scheme from vertices. Then the alignment problem is formulated as an assignment problem, which is solved by the combinatorial optimization algorithm, such as the Hungarian method. The proposed algorithm was applied and validated in aligning the protein-protein interaction network of Kaposi's sarcoma associated herpesvirus (KSHV) and that of varicella zoster virus (VZV). Interestingly, we identified several putative functional orthologous proteins with similar functions but very low sequence similarity between the two viruses. For example, KSHV open reading frame 56 (ORF56) and VZV ORF55 are helicase-primase subunits with sequence identity 14.6%, and KSHV ORF75 and VZV ORF44 are tegument proteins with sequence identity 15.3%. These functional pairs can not be identified if one restricts the alignment into orthologous protein pairs. In addition, BinAligner identified a conserved pathway between two viruses, which consists of 7 orthologous protein pairs and these proteins are connected by conserved links. This pathway might be crucial for virus packing and infection.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24266981 PMCID: PMC3851320 DOI: 10.1186/1471-2105-14-S14-S8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1All 76 graphlets with 104 non-isomorphic positions on 2 to 4 taxa. A position in a graphlet is denoted by a circle or solid circle: the solid circle position requires the aligned two vertices at this position to be orthologs, whereas the circle position does not have this requirement.
The best alignment of KSHV and VZV by BinAligner
| KSHV | VZV | orth | KSHV | VZV | orth |
|---|---|---|---|---|---|
| ORF2 | ORF15 | 0 | ORF58 | ORF62 | 0 |
| ORF6 | ORF43 | 0 | ORF59 | ORF3 | 0 |
| ORF9 | ORF28 | 1 | ORF60 | ORF18 | 1 |
| ORF23 | ORF33.5 | 0 | ORF61 | ORF19 | 1 |
| ORF25 | ORF14 | 0 | ORF62 | ORF32 | 0 |
| ORF27 | S/L | 0 | ORF63 | ORF33 | 0 |
| ORF28 | ORF1 | 1 | ORF65 | ORF56 | 0 |
| ORF29b | ORF42 | 1 | ORF67.5 | ORF25 | 1 |
| ORF30 | ORF57 | 1 | ORF68 | ORF26 | 1 |
| ORF31 | ORF24 | 0 | ORF69 | ORF27 | 1 |
| ORF34 | ORF59 | 0 | ORF72 | ORF7 | 1 |
| ORF36 | ORF8 | 0 | ORF74 | ORF36 | 1 |
| ORF37 | ORF68 | 0 | ORF75 | ORF44 | 0 |
| ORF39 | ORF50 | 1 | K3 | ORF9 | 0 |
| ORF41 | ORF21 | 0 | K5 | ORF22 | 0 |
| ORF45 | ORF66 | 0 | K7 | ORF67 | 0 |
| ORF47 | ORF12 | 0 | K8 | ORF23 | 1 |
| ORF49 | ORF17 | 0 | K9 | ORF64 | 0 |
| ORF50 | ORF4 | 0 | K10 | ORF60 | 0 |
| ORF52 | ORF46 | 1 | K10.5 | ORF61 | 0 |
| ORF53 | ORF9a | 1 | K11 | ORF16 | 0 |
| ORF54 | ORF39 | 0 | K12 | ORF41 | 0 |
| ORF56 | ORF55 | 0 | K15 | ORF65 | 1 |
| ORF57 | ORF38 | 0 |
The orthologous pairs are marked with 1 in the column orth.
Figure 2The alignment graph only containing aligned orthologous pairs and matched edges. Each node represents a pair of aligned ORFs and the orthologous pairs are shaded.
Functions of 7 orthologous protein pairs connecting by matched links
| KSHV/ VZV | Function |
|---|---|
| ORF29b/ORF42 | DNA packing proteins |
The influence of balancing parameters on the alignment
|
|
|
| nEdge | nOrth |
|---|---|---|---|---|
| 1 | 0 | 0 | 45 | 16 |
| 0 | 1 | 0 | 54 | 9 |
| 0 | 0 | 1 | 53 | 10 |
| 0.9 | 0.1 | 0 | 57 | 16 |
| 0.5 | 0.5 | 0 | 50 | 16 |
| 0.1 | 0.9 | 0 | 56 | 12 |
| 0.9 | 0 | 0.1 | 54 | 16 |
| 0.5 | 0 | 0.5 | 47 | 16 |
| 0.1 | 0 | 0.9 | 50 | 12 |
| 0 | 0.9 | 0.1 | 52 | 10 |
| 0 | 0.5 | 0.5 | 52 | 10 |
| 0 | 0.1 | 0.9 | 54 | 10 |
| 0.9 | 0.09 | 0.01 | 58 | 16 |
| 0.9 | 0.05 | 0.05 | 43 | 16 |
| 0.9 | 0.01 | 0.09 | 51 | 16 |
nEdge denotes the number of aligned matched edges; nOrth denotes the number of aligned orthologous pairs.
Comparison of four methods on aligning the PPI networks of KSHV and VZV
| Method | nEdge | nOrth | EC | OP | P-value |
|---|---|---|---|---|---|
| GRAAL | 45 | 2 | 39.1% | 12.5% | 2.6 |
| GraphAlignment | 51 | 9 | 44.3 % | 56.3% | 4.3 |
| IsoRank | 48 | 15 | 41.8% | 93.8% | 4.1 |
| BinAligner(S) | 68 | 0 | 59.1% | 0 | 6.2 |
| BinAligner | 58 | 16 | 50.4% | 100% | 1.0 |
BinAligner(S) means pure graph structure alignment using BinAligner; nEdge denotes the number of aligned matched edges; nOrth denotes the number of aligned orthologous pairs; EC denotes edge correctness and OP denotes orthologous percentage.