| Literature DB >> 25093200 |
Mayumi Kamada1, Yusuke Sakuma2, Morihiro Hayashida3, Tatsuya Akutsu3.
Abstract
Proteins in living organisms express various important functions by interacting with other proteins and molecules. Therefore, many efforts have been made to investigate and predict protein-protein interactions (PPIs). Analysis of strengths of PPIs is also important because such strengths are involved in functionality of proteins. In this paper, we propose several feature space mappings from protein pairs using protein domain information to predict strengths of PPIs. Moreover, we perform computational experiments employing two machine learning methods, support vector regression (SVR) and relevance vector machine (RVM), for dataset obtained from biological experiments. The prediction results showed that both SVR and RVM with our proposed features outperformed the best existing method.Entities:
Mesh:
Year: 2014 PMID: 25093200 PMCID: PMC4095743 DOI: 10.1155/2014/240673
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1(a) Illustration of protein-protein interactions (PPIs) model based on domain-domain interactions (DDIs). (b) Schematic overview of PPIs prediction based on DDIs.
Figure 2Illustration of restricting an amino acid sequence to which the spectrum kernel is applied to the domain regions.
Results of average RMSE for training and test data.
|
|
|
| ||||
|---|---|---|---|---|---|---|
| Training | Test | Training | Test | Training | Test | |
| SVR + DN | 0.10472 | 0.12573 | 0.10656 | 0.12600 | 0.09982 |
|
| RVM + DN | 0.09210 | 0.12873 | 0.09178 | 0.12881 | 0.09474 | 0.12908 |
| SVR + SPD ( | 0.08819 | 0.12699 | 0.08080 | 0.12954 | 0.07927 | 0.12903 |
| RVM + SPD ( | 0.02848 | 0.12743 |
| 0.12706 | 0.03276 | 0.12792 |
| SVR + SPD ( | 0.08891 | 0.12654 | 0.08188 | 0.12782 | 0.08117 | 0.12909 |
| RVM + SPD ( |
|
| 0.02301 |
|
| 0.12493 |
| SVR + APM | 0.06846 | 0.13112 | 0.06795 | 0.13247 | 0.06791 | 0.13277 |
| RVM + APM | 0.07052 | 0.13556 | 0.07037 | 0.13550 | 0.07032 | 0.13493 |
|
| ||||||
| APM | Training = 0.06811, Test = 0.13517 | |||||
The number of relevance vectors (RVs) and support vectors (SVs) for each model with DN, SPD, and APM and the selected σ values for each fold.
|
|
|
| |||||
|---|---|---|---|---|---|---|---|
| SVR | RVM | SVR | RVM | SVR | RVM | ||
| SVs ( | RVs ( | SVs ( | RVs ( | SVs ( | RVs ( | ||
| Fold 1 | DN | 271 (0.02) | 113 (0.05) | 271 (0.01) | 123 (0.07) | 308 (0.01) | 74 (0.02) |
| SPD ( | 367 (0.01) | 448 (0.02) | 402 (0.02) | 680 (0.05) | 402 (0.02) | 537 (0.03) | |
| SPD ( | 392 (0.01) | 502 (0.03) | 409 (0.01) | 628 (0.05) | 421 (0.01) | 628 (0.05) | |
| APM | 362 (0.08) | 4 (5.00) | 361 (0.10) | 6 (4.80) | 357 (0.04) | 6 (5.80) | |
|
| |||||||
| Fold 2 | DN | 280 (0.02) | 94 (0.08) | 281 (0.01) | 92 (0.09) | 314 (0.01) | 82 (0.04) |
| SPD ( | 408 (0.01) | 617 (0.04) | 453 (0.04) | 706 (0.06) | 411 (0.01) | 545 (0.03) | |
| SPD ( | 430 (0.01) | 558 (0.04) | 435 (0.01) | 618 (0.05) | 495 (0.04) | 654 (0.06) | |
| APM | 375 (0.10) | 5 (6.50) | 372 (0.10) | 6 (6.90) | 373 (0.04) | 4 (5.50) | |
|
| |||||||
| Fold 3 | DN | 321 (0.04) | 107 (0.08) | 289 (0.01) | 107 (0.10) | 330 (0.01) | 107 (0.08) |
| SPD ( | 371 (0.01) | 439 (0.02) | 412 (0.03) | 658 (0.05) | 382 (0.01) | 305 (0.01) | |
| SPD ( | 387 (0.01) | 625 (0.06) | 418 (0.02) | 529 (0.04) | 398 (0.01) | 529 (0.04) | |
| APM | 368 (0.08) | 3 (7.10) | 368 (0.04) | 3 (6.70) | 372 (0.01) | 5 (4.20) | |