| Literature DB >> 29045465 |
Xianjun Shen1, Li Yi1,2, Xingpeng Jiang1, Tingting He1, Jincai Yang1, Wei Xie1, Po Hu1, Xiaohua Hu1,3.
Abstract
How to identify protein complex is an important and challenging task in proteomics. It would make great contribution to our knowledge of molecular mechanism in cell life activities. However, the inherent organization and dynamic characteristic of cell system have rarely been incorporated into the existing algorithms for detecting protein complexes because of the limitation of protein-protein interaction (PPI) data produced by high throughput techniques. The availability of time course gene expression profile enables us to uncover the dynamics of molecular networks and improve the detection of protein complexes. In order to achieve this goal, this paper proposes a novel algorithm DCA (Dynamic Core-Attachment). It detects protein-complex core comprising of continually expressed and highly connected proteins in dynamic PPI network, and then the protein complex is formed by including the attachments with high adhesion into the core. The integration of core-attachment feature into the dynamic PPI network is responsible for the superiority of our algorithm. DCA has been applied on two different yeast dynamic PPI networks and the experimental results show that it performs significantly better than the state-of-the-art techniques in terms of prediction accuracy, hF-measure and statistical significance in biology. In addition, the identified complexes with strong biological significance provide potential candidate complexes for biologists to validate.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29045465 PMCID: PMC5646790 DOI: 10.1371/journal.pone.0186134
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1DCA algorithm description.
The number distribution of proteins and interactions in the sub-networks of dynamic network.
| Data set | Proteins | Interactions |
|---|---|---|
| DIP | Ave = 599, SD = 223 | Ave = 898, SD = 668 |
| Krogan_extended | Ave = 298, SD = 166 | Ave = 427, SD = 530 |
The basic results of various algorithms on two dynamic networks.
| DIP | Krogan_extended | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Algorithms | #AS | #MS | #MPC | #PC | #MKC | #AS | #MS | #MPC | #PC | #MKC |
| DCA | 8.2 | 47 | 515 | 885 | 118 | 8.7 | 35 | 558 | 818 | 90 |
| DPC | - | - | - | 766[ | - | - | - | - | - | - |
| TS-OCD | 5.1 | 25 | 279 | 843 | 162 | 5.6 | 21 | 187 | 314 | 114 |
| CAMSE | 4.3 | 22 | 881 | 2433 | 215 | 5.4 | 25 | 627 | 1185 | 158 |
| ClusterONE | 3.8 | 20 | 564 | 1690 | 213 | 4.5 | 20 | 526 | 1309 | 179 |
| CoreAttach | 3.3 | 30 | 652 | 2935 | 282 | 3.3 | 26 | 456 | 1402 | 213 |
| CPM | 7.2 | 345 | 249 | 531 | 125 | 7.6 | 349 | 200 | 300 | 83 |
| MCODE | 6.2 | 43 | 147 | 309 | 82 | 5.8 | 51 | 192 | 272 | 76 |
| SPICI | 4.2 | 25 | 551 | 2089 | 197 | 4.3 | 21 | 399 | 1011 | 155 |
| COACH | 5.6 | 26 | 361 | 899 | 150 | 6.3 | 27 | 286 | 482 | 101 |
#AS: the average size of predicted protein complexes; #MS: the maximum size of predicted protein complexes; #MPC: the number of predicted protein complexes matched by known protein complexes; #PC: the total number of predicted protein complexes; #MKC: the number of known complexes matched by predicted protein complexes
Fig 2Performance comparison of DCA against other algorithms on static DIP network.
Fig 3Performance comparison of DCA against other algorithms on dynamic DIP network.
Fig 4Performance comparison of DCA against other algorithms on static Krogan_extended network.
Fig 5Performance comparison of DCA against other algorithms on dynamic Krogan_extended network.
Statistical p-values of complexes predicted by DCA algorithm on DIP data.
| Algorithms | (0, 1e-15] | (1e-15, 1e-10] | (1e-10,1e-5] | (1e-5, 0.01] | >0.01 |
|---|---|---|---|---|---|
| DCA | 79(8.9%) | 107(12.1%) | 299(33.8%) | 367(41.5%) | 32(3.6%) |
| TS-OCD | 22(2.6%) | 54(6.4%) | 204(24.2%) | 510(60.5%) | 53(6.3%) |
| CAMSE | 74(3.0%) | 125(5.1%) | 449(18.5%) | 1483(61.0%) | 301(12.4%) |
| ClusterONE | 40(2.4%) | 73(4.3%) | 313(18.6%) | 1036(61.3%) | 227(13.4%) |
| CoreAttach | 34(1.2%) | 45(1.5%) | 293(10.0%) | 2046(69.7%) | 516(17.6%) |
| CPM | 23(4.3%) | 42(7.9%) | 147(27.7%) | 285(53.8%) | 33(6.2%) |
| MCODE | 4(1.3%) | 32(10.4%) | 98(31.7%) | 155(50.2%) | 20(6.5%) |
| SPICI | 22(1.1%) | 49(2.3%) | 286(13.7%) | 1504(72.0%) | 227(10.9%) |
| COACH | 44(4.9%) | 58(6.5%) | 246(27.4%) | 508(56.5%) | 43(4.8%) |
Statistical p-values of complexes predicted by DCA algorithm on Krogan_extended data.
| Algorithms | (0, 1e-15] | (1e-15, 1e-10] | (1e-10,1e-5] | (1e-5, 0.01] | >0.01 |
|---|---|---|---|---|---|
| DCA | 119(14.6%) | 122(14.9%) | 272(33.3%) | 266(32.6%) | 38(4.7%) |
| TS-OCD | 34(10.8%) | 23(7.3%) | 98(31.2%) | 136(43.3%) | 23(7.3%) |
| CAMSE | 81(6.8%) | 119(10.1%) | 252(21.3%) | 610(51.5%) | 122(10.3%) |
| ClusterONE | 31(2.4%) | 50(3.8%) | 257(19.6%) | 819(62.6%) | 151(11.5%) |
| CoreAttach | 29(2.1%) | 29(2.1%) | 172(12.3%) | 945(67.5%) | 226(16.1%) |
| CPM | 23(7.7%) | 37(12.4%) | 92(30.8%) | 119(39.8%) | 28(9.4%) |
| MCODE | 14(5.2%) | 35(12.9%) | 95(35.1%) | 90(33.2%) | 37(13.7%) |
| SPICI | 27(2.7%) | 31(3.1%) | 186(18.4%) | 639(63.3%) | 127(12.6%) |
| COACH | 48(10.0%) | 45(9.4%) | 134(27.9%) | 217(45.1%) | 37(7.7%) |
HF-measure comparison of various algorithms.
| DIP | Krogan_extended | |||
|---|---|---|---|---|
| Algorithms | hF-measureTb | hF-measureTf | hF-measureTb | hF-measureTf |
| DCA | 0.324 | 0.341 | 0.350 | 0.362 |
| TS-OCD | 0.333 | 0.345 | 0.357 | 0.363 |
| CAMSE | 0.306 | 0.312 | 0.330 | 0.337 |
| ClusterONE | 0.314 | 0.323 | 0.321 | 0.328 |
| CoreAttach | 0.296 | 0.303 | 0.325 | 0.327 |
| CPM | 0.317 | 0.326 | 0.341 | 0.348 |
| MCODE | 0.310 | 0.320 | 0.311 | 0.316 |
| SPICI | 0.299 | 0.309 | 0.319 | 0.326 |
| COACH | 0.316 | 0.324 | 0.335 | 0.342 |
Selected protein complexes predicted by DCA algorithm on DIP data.
| ID | Core component | Attached component | hfun | hFTb | p-value |
|---|---|---|---|---|---|
| 11 | ypr110c ypr190c ypr187w ypr032w ypr010c ypl235w ypl160w yor341w yor224c yor210w yor207c yor151c yor119c yor116c ynr003c ynl308c ynl248c ynl229c ynl113w ymr285c ylr453c ykl144c yjr063w yjl130c yjl011c yil128w yil035c yhr020w ygr229c ygr094w ygl016w ybr249c ybr245c ybr154c ybr020w ybr019c ybl076c ybl039c | ynl151c ydl150w | GO:0003899 | 0.521 | 7.55e-34 |
| 55 | yor116c ypr190c ypr187w ypr110c yor224c yor207c ynr003c ynl113w ykl144c yjl011c ygl070c yfr037c ydr045c ybr154c | ydl150w ynl151c | GO:0003899 | 0.612 | 7.95e-34 |
| 39 | yol135c yor174w ynr010w ynl189w ymr112c ylr071c ygr104c ygl151w ygl127c yfr019w yfr008w yer022w ydr448w ydr443c ydl005c ybr253w ybr193c | ygl025c | GO:0016455 | 0.683 | 3.94e-32 |
| 62 | yor116c ypr190c ypr187w ypr110c yor207c ynr003c ynl113w ykl144c yjl011c yfr037c ydr045c ybr154c | ykr025w ynl151c ydl150w | GO:0003899 | 0.583 | 1.94e-31 |
| 65 | ygr104c ypr168w ypr070w yor174w yol135c ynr010w ynl236w ylr071c yhr058c yhr041c yer148w yer022w ydr308c ydl005c ybr279w | _ | GO:0016455 | 0.687 | 6.72e-31 |
| 67 | ygr104c ypl248c yor174w yol135c ynr010w ynl025c ymr112c ylr071c ygl151w ygl127c yer022w ydl005c ybr253w ybr193c | ygl025c | GO:0016455 | 0.684 | 6.72e-31 |
| 69 | ybr253w ypr168w ypr070w ypl248c yol135c ynl236w ymr112c ygr104c ygl025c yer022w ydr443c ydl005c ybr193c ybr081c | ycr081w | GO:0016455 | 0.684 | 6.72e-31 |
| 13 | ypr110c ypr190c ypr187w ypl235w yor210w yor207c yor151c yor119c yor116c ynr003c ynl308c ynl248c ynl229c ynl113w ymr285c ylr453c ylr238w ylr086w ylr058c ykl144c yjr063w yjl011c yjl008c yil035c yer025w ydl155w ybr245c ybr154c ybr127c ybl076c ybl039c | ynl151c ykr025w ydl150w | GO:0003899 | 0.507 | 1.38e-30 |
| 53 | ydl005c ypr168w ypr070w yor174w yol135c ynr010w ynl236w ylr071c yhr058c yhr041c ygr104c yer157w yer022w ydl153c ycr081w ybr079c | _ | GO:0016455 | 0.667 | 3.58e-30 |
| 58 | ybr253w ypl248c yor174w yol135c ynr010w ynl025c ymr112c ylr071c ygr104c ygl151w yer022w ydr448w ydr443c ydl005c ybr193c | ygl025c | GO:0016455 | 0.667 | 3.58e-30 |
Selected protein complexes predicted by DCA algorithm on Krogan_extended data.
| ID | Core component | Attached component | hfun | hFTb | p-value |
|---|---|---|---|---|---|
| 8 | ypr187w yor151c ypl129w yor210w yor116c ynl113w ypr110c ypr010c yor341w yor340c yor207c yol021c ynl248c ynl151c yjr063w yjl168c yjl011c yil021w ygr063c ygl070c ydl150w ybr154c ypr190c yhr143w-a ydr045c | ynr003c ydr404c ykl144c | GO:0003899 | 0.800 | 3.96e-51 |
| 1 | ybr154c yol021c ynl151c ypr086w ypr010c yor207c yjr063w yjl168c yjl011c yfr037c ypr187w ypr133c ypr110c ypl129w yor341w yor340c yor224c yor210w yor151c ynl248c ynl113w ylr200w yil021w ygr063c ygr005c ygl070c ygl043w yfl023w ydr156w ydl150w ydr045c yhr143w-a | ynr003c ykl144c ydr404c | GO:0003899 | 0.720 | 7.94e-47 |
| 3 | yor210w ypr187w ypr190c yor341w ypr110c ypr010c yor340c yor224c yor151c yor116c ynl216w ynl113w ykl054c yhr143w-a ygr005c yer179w yor207c ynl248c yjr063w yjl168c yjl011c yil021w ygl070c yfr037c yer162c ydr156w ydl150w ydl042c ybr279w ybr154c | ynr003c ydr404c | GO:0003899 | 0.722 | 2.17e-45 |
| 5 | ypr110c ypr190c yor116c yfl023w yer162c yel048c ypr187w ynl113w yjr132w yjl076w yfl009w ydl116w ypr010c yor341w yor340c yor210w yor207c yol021c ynl248c ynl186w ynl151c yjr063w yjl011c yhr143w-a ydr156w ydl150w ydl042c ybr154c ydr045c | ynr003c ykl144c | GO:0003899 | 0.667 | 4.12e-40 |
| 24 | ynl113w ypr190c ypr187w ypr010c yor116c ynr003c ynl151c ypr161c ypr110c yor341w yor340c yor210w yor207c ynl248c ykl144c yjr063w yjl011c ydr156w ydr026c ydl150w ydl042c ybr154c | ynr003c ykl144c | GO:0003899 | 0.704 | 1.25e-39 |
| 10 | ypr110c ypr190c yor116c yfl023w yel048c ypr187w ynl113w yjr132w yjl076w yfl009w yor340c yor210w yor207c yol021c ynl248c ynl186w ynl151c ykr025w yjr063w yjl011c yhr143w-a ydr156w ydl150w ybr154c ydr045c | ynr003c ydl042c ykl144c | GO:0003899 | 0.667 | 8.83e-39 |
| 18 | yor210w ypr187w ypr190c ypr110c yor340c yor151c yor116c ynl113w yhr143w-a yor207c ynl248c yml010w ykr025w yjr063w yjl168c yjl011c yil021w yfr037c ydr156w ydl150w ybr279w ybr154c | ynr003c ydl042c | GO:0003899 | 0.679 | 3.42e-38 |
| 4 | ybr154c yol021c ynl151c ypr086w yor207c ykr025w yjr063w yjl168c yjl011c ygr140w yfr037c ypr187w ypr133c ypr110c ypl129w yor340c yor210w yor151c ynl248c ynl113w ylr200w yil021w ygl043w yfl023w ydr156w ydl150w ydr045c ydl042c yhr143w-a | ykl144c ynr003c | GO:0003899 | 0.635 | 2.38e-37 |
| 20 | ypr187w yor151c ypl129w yor210w yor116c ynl113w ypr110c yor340c yor207c yol021c ynl248c ynl151c yjr063w yjl168c yjl011c yil021w ydl150w ybr154c yhr143w-a ydr045c | ydl042c ynr003c | GO:0003899 | 0.667 | 1.54e-36 |
| 45 | yor207c ypr190c ypr187w ypr110c yor224c yor210w yor116c ynl113w yhr143w-a ydr045c ybr154c yol021c ynl151c yjl011c ydr167w ydl150w ydl042c | ykl144c ynr003c | GO:0003899 | 0.627 | 1.14e-32 |
Fig 6Topology structure illustration of predicted protein complex.
Fig 7An illustration of predicted protein complex.