| Literature DB >> 31874605 |
Jie Zhao1, Xiujuan Lei2.
Abstract
BACKGROUND: Protein complexes are the cornerstones of many biological processes and gather them to form various types of molecular machinery that perform a vast array of biological functions. In fact, a protein may belong to multiple protein complexes. Most existing protein complex detection algorithms cannot reflect overlapping protein complexes. To solve this problem, a novel overlapping protein complexes identification algorithm is proposed.Entities:
Keywords: Clustering; Gene ontology; Granular computation; Protein complexes; Quotient space
Mesh:
Substances:
Year: 2019 PMID: 31874605 PMCID: PMC6929339 DOI: 10.1186/s12859-019-3256-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Pseudo code of maximum complete subgraph
Fig. 1Construction of overlay network chain in quotient space
Fig. 2An example of overlapping protein complexes
Pseudo code of the ONCQS algorithm
The data information of the experimental data
| Dataset | Number of node | Number of edge | Density | GO annotation data |
|---|---|---|---|---|
| DIP | 5028 | 22,302 | 0.0018 | 4939 (98.23%) |
| Gavin | 1430 | 6531 | 0.0064 | 1430 (100%) |
| Krogan | 2674 | 7075 | 0.0020 | 2671 (99.89%) |
| MIPS | 4546 | 12,319 | 0.0012 | 4508 (99.16%) |
Influence of parameters gc
| Dataset | |||||||
|---|---|---|---|---|---|---|---|
| DIP | 0.1 | 0.4199 | 0.7108 | 0.5279 | 874 | 62 | 5.18 |
| 0.2 | 0.4011 | 0.7206 | 0.5153 | 945 | 60 | 4.79 | |
| 0.3 | 0.3571 | 0.7402 | 0.4818 | 1095 | 69 | 3.70 | |
| 0.4 | 0.3561 | 0.8260 | 0.4976 | 1640 | 103 | 2.67 | |
| 0.5 | 0.3521 | 0.8284 | 0.4942 | 1667 | 104 | 2.60 | |
| 0.6 | 0.3470 | 0.8211 | 0.4878 | 1781 | 105 | 2.53 | |
| 0.7 | 0.3499 | 0.8186 | 0.4902 | 1832 | 103 | 2.54 | |
| 0.8 | 0.3530 | 0.8186 | 0.4933 | 1844 | 102 | 2.56 | |
| 0.9 | 0.3530 | 0.8186 | 0.4933 | 1844 | 102 | 2.56 | |
| Gavin | 0.1 | 0.6581 | 0.4167 | 0.5103 | 310 | 38 | 7.99 |
| 0.2 | 0.6085 | 0.4265 | 0.5015 | 355 | 39 | 6.63 | |
| 0.3 | 0.5630 | 0.4363 | 0.4916 | 405 | 41 | 5.13 | |
| 0.4 | 0.5124 | 0.4510 | 0.4797 | 525 | 49 | 3.98 | |
| 0.5 | 0.4973 | 0.4534 | 0.4743 | 553 | 50 | 3.73 | |
| 0.6 | 0.4879 | 0.4461 | 0.4661 | 621 | 50 | 3.46 | |
| 0.7 | 0.4910 | 0.4436 | 0.4661 | 664 | 46 | 3.43 | |
| 0.8 | 0.4927 | 0.4436 | 0.4669 | 684 | 46 | 3.47 | |
| 0.9 | 0.4949 | 0.4436 | 0.4679 | 687 | 46 | 3.49 | |
| Krogan | 0.1 | 0.5856 | 0.5956 | 0.5906 | 473 | 68 | 4.51 |
| 0.2 | 0.5658 | 0.5980 | 0.5815 | 509 | 67 | 4.27 | |
| 0.3 | 0.5401 | 0.5980 | 0.5676 | 561 | 68 | 3.60 | |
| 0.4 | 0.4888 | 0.6422 | 0.5551 | 759 | 80 | 2.86 | |
| 0.5 | 0.2728 | 0.7230 | 0.3962 | 780 | 81 | 2.82 | |
| 0.6 | 0.3095 | 0.7230 | 0.4335 | 835 | 83 | 2.77 | |
| 0.7 | 0.2984 | 0.7230 | 0.4225 | 858 | 78 | 2.76 | |
| 0.8 | 0.3090 | 0.6005 | 0.4080 | 868 | 79 | 2.79 | |
| 0.9 | 0.4989 | 0.6471 | 0.5634 | 870 | 79 | 2.81 | |
| MIPS | 0.1 | 0.3784 | 0.5735 | 0.4559 | 703 | 46 | 3.95 |
| 0.2 | 0.3689 | 0.5760 | 0.4498 | 721 | 47 | 3.78 | |
| 0.3 | 0.3375 | 0.5980 | 0.4315 | 803 | 54 | 3.10 | |
| 0.4 | 0.3231 | 0.6765 | 0.4373 | 1173 | 72 | 2.33 | |
| 0.5 | 0.3238 | 0.6765 | 0.4379 | 1186 | 71 | 2.32 | |
| 0.6 | 0.3288 | 0.6691 | 0.4409 | 1244 | 69 | 2.31 | |
| 0.7 | 0.3299 | 0.6642 | 0.4408 | 1255 | 67 | 2.32 | |
| 0.8 | 0.3315 | 0.6642 | 0.4423 | 1258 | 67 | 2.32 | |
| 0.9 | 0.3315 | 0.6642 | 0.4423 | 1258 | 67 | 2.33 |
Fig. 3Influence of parameters gc
Fig. 4The performance comparisons of various algorithms on four datasets, the blue bar represents Precision, the green bar represents Recall, the red bar represents F-measure. (a) DIP (b) Gavin (c) Krogan (d) MIPS
The performance comparison of several typical algorithms on four datasets
| Algorithms | DIP | Gavin | Krogan | MIPS | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MCODE | 49 | 1 | 16.73 | 66 | 8 | 9.12 | 76 | 11 | 7.21 | 63 | 3 | 8.33 |
| MCL | 189 | 0 | 3.76 | 217 | 20 | 6.83 | 550 | 17 | 4.63 | 922 | 12 | 4.67 |
| CORE | 1707 | 6 | 3.01 | 294 | 0 | 2.58 | 820 | 0 | 2.32 | 1745 | 0 | 2.18 |
| ClusterONE | 372 | 6 | 4.94 | 243 | 13 | 6.92 | 241 | 12 | 5.26 | 295 | 3 | 4.24 |
| COACH | 899 | 16 | 8.90 | 321 | 12 | 10.18 | 355 | 17 | 7.55 | 489 | 9 | 10.31 |
| ONCQS | 1640 | 103 | 2.67 | 525 | 49 | 3.98 | 759 | 80 | 2.86 | 1173 | 72 | 2.34 |
Fig. 5Visualization of the 379th standard protein complex of Krogan. (a) Standard (red area) (b) MCL (pink area) and MCODE (orange area) (c) ClusterONE (blue area) and COACH (yellow area) (d) ONCQS (green area) and CORE (purple area)
The complexes information of elF3 complex and multi-elF complex
| elF3 complex | multi-elF complex |
|---|---|
| YMR012W YLR192C YMR309C YOR361C YBR079C YMR146C YDR429C | YER025W YMR309C YOR361C YNL244C YJR007W YPL237W YMR146C YPR041W |
The performance comparison of mining overlapping proteins in DIP
| Algorithm | Predicted elF3 complex | Predicted | ||
|---|---|---|---|---|
| MCODE | – | – | – | – |
| MCL | – | – | – | – |
| CORE | 0.4286 | – | – | |
| ClusterONE | YPR041W | 0.5143 | – | – |
| COACH | 0.5714 | – | – | |
| ONCQS | 0.6429 | YBR079C | 0.2813 |
The performance comparison of mining overlapping proteins in Gavin
| Algorithm | Predicted elF3 complex | Predicted | ||
|---|---|---|---|---|
| MCODE | 0.4286 | – | – | |
| MCL | 0.5143 | – | – | |
| CORE | – | – | – | – |
| ClusterONE | 0.4286 | – | – | |
| COACH | YNL096C YPR041W | 0.4286 | 0.2404 | |
| ONCQS | YAL035W | 0.6429 | – | – |
The performance comparison of mining overlapping proteins in Krogan
| Algorithm | Predicted elF3 complex | Predicted | ||
|---|---|---|---|---|
| MCODE | – | – | – | – |
| MCL | YBR065C | 0.2078 | – | – |
| CORE | – | – | – | – |
| ClusterONE | 0.3968 | – | – | |
| COACH | 0.5102 | 0.5625 | ||
| ONCQS | 0.7143 | YBR079C | 0.4000 |
The performance comparison of mining overlapping proteins in MIPS
| Algorithm | Predicted elF3 complex | Predicted | ||
|---|---|---|---|---|
| MCODE | – | – | – | – |
| MCL | 0.4464 | – | – | |
| CORE | – | – | – | – |
| ClusterONE | – | – | 0.5208 | |
| COACH | 0.4571 | 0.5208 | ||
| ONCQS | 0.4286 | YBR079C | 0.4000 |