| Literature DB >> 33203352 |
Mercè Llabrés1,2, Gabriel Riera1,2, Francesc Rosselló1,2, Gabriel Valiente3.
Abstract
BACKGROUND: The alignment of protein-protein interaction networks was recently formulated as an integer quadratic programming problem, along with a linearization that can be solved by integer linear programming software tools. However, the resulting integer linear program has a huge number of variables and constraints, rendering it of no practical use.Entities:
Keywords: Graph matching; Integer linear programming; Network alignment; Systems biology; Virus-host protein-protein interaction
Mesh:
Year: 2020 PMID: 33203352 PMCID: PMC7671827 DOI: 10.1186/s12859-020-03733-w
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1EC Scores (λ=0). Boxplot of EC scores for the 300 alignments of 25 virus-host protein-protein interaction networks from the STRING Viruses database, for λ=0. L-GRAAL and ILP obtained the highest scores
Fig. 2FC Scores (λ=1). Boxplot of FC scores for the 300 alignments of 25 virus-host protein-protein interaction networks from the STRING Viruses database, for λ=1. PINALOG, followed by ILP, obtained the highest scores
Fig. 3Combined EC and FC scores (λ=0.5). Boxplot of the mean of EC and FC scores for the 300 alignments of 25 virus-host protein-protein interaction networks from the STRING Viruses database, for λ=0.5. ILP and L-GRAAL obtained the highest scores
Edge correctness score and sequence similarity score (mean values) for several protein-protein interaction network alignment methods and tools, for 300 pairs of virus-host protein-protein interaction networks from the STRING Viruses database, for λ=0.5. Sequence similarity scores are normalized global alignment scores
| Alignment method or tool | Edge Correctness | Sequence Similarity |
|---|---|---|
| L-GRAAL | 0.8297 | 0.8979 |
| Integer Linear Programming | 0.7845 | 0.9044 |
| AligNet | 0.5471 | 0.8823 |
| PINALOG | 0.3920 | 0.9210 |
| HubAlign | 0.6461 | 0.5777 |
| SPINAL | 0.5054 | 0.6900 |
Edge correctness score and sequence similarity score (mean values) for 45 pairs of virus-host protein-protein interaction networks from the STRING Viruses database, for the integer linear programming formulation and different values of the λ parameter. The maximum sum of the edge correctness and sequence similarity scores is achieved at λ=0.4, followed by λ=0.5. Sequence similarity scores are normalized global alignment scores
| Edge | Sequence | |
|---|---|---|
| Correctness | Similarity | |
| 0.0 | 0.8655 | 0.7444 |
| 0.1 | 0.8654 | 0.8529 |
| 0.2 | 0.8648 | 0.8622 |
| 0.3 | 0.8647 | 0.8681 |
| 0.4 | 0.8612 | 0.8731 |
| 0.5 | 0.8565 | 0.8770 |
| 0.6 | 0.8493 | 0.8801 |
| 0.7 | 0.8299 | 0.8872 |
| 0.8 | 0.7820 | 0.8960 |
| 0.9 | 0.6698 | 0.9057 |
| 1.0 | 0.1054 | 0.9144 |
Edge correctness score and sequence similarity score (mean values) for 45 pairs of virus-host protein-protein interaction networks from the STRING Viruses database, for the integer linear programming formulation with λ=0.5 and different sequence similarity measures [12]
| Sequence | Edge | Sequence |
|---|---|---|
| Similarity | Correctness | Similarity |
| 1-mer | 0.8540 | 0.8831 |
| 2-mer | 0.8572 | 0.7229 |
| 3-mer | 0.8581 | 0.5928 |
| 4-mer | 0.8585 | 0.5619 |
| Alignment | 0.8465 | 0.3194 |
Best alignment for the virus-host protein-protein interaction networks for human viruses in the STRING Viruses database considered in our study. Twenty-one of the 25 networks are aligned with networks corresponding to viruses of the same Baltimore class. Sequence similarity scores are normalized global alignment scores
| Virus | Virus | Edge | Sequence |
|---|---|---|---|
| Tax Id | Tax Id | Correctness | Similarity |
| 10298 | 10310 | 0.9961 | 0.9950 |
| 10310 | 10298 | 0.9961 | 0.9950 |
| 10335 | 10298 | 0.9955 | 0.9877 |
| 10359 | 32604 | 0.9736 | 0.9638 |
| 10376 | 11137 | 1.0000 | 0.8885 |
| 11103 | 11269 | 1.0000 | 0.9057 |
| 11137 | 694009 | 1.0000 | 0.9933 |
| 11161 | 162145 | 0.9953 | 0.9951 |
| 11234 | 162145 | 1.0000 | 0.9934 |
| 11250 | 162145 | 0.9953 | 0.9969 |
| 11269 | 186538 | 1.0000 | 0.9964 |
| 11320 | 11269 | 1.0000 | 0.8824 |
| 11676 | 11269 | 1.0000 | 0.8836 |
| 11709 | 194441 | 1.0000 | 0.9078 |
| 162145 | 11234 | 1.0000 | 0.9934 |
| 186538 | 11269 | 1.0000 | 0.9964 |
| 194440 | 194443 | 1.0000 | 0.9980 |
| 194441 | 194443 | 1.0000 | 0.9984 |
| 194443 | 194441 | 1.0000 | 0.9984 |
| 32603 | 32604 | 0.9956 | 0.9933 |
| 32604 | 32603 | 0.9956 | 0.9933 |
| 337041 | 10310 | 0.9890 | 0.9050 |
| 37296 | 186538 | 1.0000 | 0.8867 |
| 63330 | 162145 | 0.9953 | 0.9949 |
| 694009 | 11137 | 1.0000 | 0.9933 |
The virus-host protein-protein interaction networks for human viruses in the STRING Viruses database considered in our study
| Proteins | |||||||
|---|---|---|---|---|---|---|---|
| Tax Id | Baltimore | Family | Genus | Species | Viral | Host | Interactions |
| 11269 | V | Filoviridae | Marburgvirus | Marburg marburgvirus | 4 | 65 | 65 |
| 186538 | V | Filoviridae | Ebolavirus | Zaire ebolavirus | 5 | 66 | 67 |
| 194443 | VI | Retroviridae | Deltaretrovirus | Primate T-lymphotropic virus 3 | 2 | 54 | 71 |
| 194441 | VI | Retroviridae | Deltaretrovirus | Primate T-lymphotropic virus 2 | 2 | 57 | 74 |
| 11137 | IV | Coronaviridae | Alphacoronavirus | Human coronavirus 229E | 5 | 76 | 76 |
| 194440 | VI | Retroviridae | Deltaretrovirus | Primate T-lymphotropic virus 1 | 5 | 67 | 84 |
| 694009 | IV | Coronaviridae | Betacoronavirus | SARS-related coronavirus | 12 | 87 | 87 |
| 337041 | I | Papillomaviridae | Alphapapillomavirus | Alphapapillomavirus 9 | 5 | 80 | 91 |
| 11320 | V | Orthomyxoviridae | Alphainfluenzavirus | Influenza A virus | 11 | 120 | 144 |
| 11103 | IV | Flaviviridae | Hepacivirus | Hepacivirus C | 8 | 146 | 197 |
| 162145 | V | Pneumoviridae | Metapneumovirus | Human metapneumovirus | 5 | 214 | 215 |
| 11250 | V | Pneumoviridae | Orthopneumovirus | Human orthopneumovirus | 8 | 222 | 225 |
| 32604 | I | Herpesviridae | Roseolovirus | Human betaherpesvirus 6B | 17 | 201 | 227 |
| 32603 | I | Herpesviridae | Roseolovirus | Human betaherpesvirus 6A | 18 | 201 | 227 |
| 11161 | V | Paramyxoviridae | Rubulavirus | Mumps rubulavirus | 4 | 239 | 249 |
| 63330 | V | Paramyxoviridae | Henipavirus | Hendra henipavirus | 6 | 247 | 285 |
| 11234 | V | Paramyxoviridae | Morbillivirus | Measles morbillivirus | 8 | 265 | 289 |
| 11676 | VI | Retroviridae | Lentivirus | Human immunodeficiency virus 1 | 10 | 279 | 301 |
| 11709 | VI | Retroviridae | Lentivirus | Human immunodeficiency virus 2 | 5 | 196 | 315 |
| 37296 | I | Herpesviridae | Rhadinovirus | Human gammaherpesvirus 8 | 36 | 300 | 342 |
| 10359 | I | Herpesviridae | Cytomegalovirus | Human betaherpesvirus 5 | 40 | 313 | 380 |
| 10376 | I | Herpesviridae | Lymphocryptovirus | Human gammaherpesvirus 4 | 35 | 451 | 547 |
| 10335 | I | Herpesviridae | Varicellovirus | Human alphaherpesvirus 3 | 27 | 557 | 665 |
| 10310 | I | Herpesviridae | Simplexvirus | Human alphaherpesvirus 2 | 34 | 607 | 765 |
| 10298 | I | Herpesviridae | Simplexvirus | Human alphaherpesvirus 1 | 45 | 690 | 957 |