| Literature DB >> 31333706 |
Jiawei Zhao1, Yu Lei1, Jianwei Hong1, Cunjian Zheng1, Lida Zhang1.
Abstract
Protein-protein interactions (PPIs) play fundamental roles in various cellular processes. Here, we present a new version of computational interactome that contains more than 345,000 predicted PPIs involving about 51.2% of the Arabidopsis proteins. Compared to the earlier version, the updated AraPPINet displays a higher accuracy in predicting protein interactions through performance evaluation with independent datasets. In addition to the experimental verifications of the previous version, the new version has been subjected to further validation test that demonstrates its ability to discover novel PPIs involved in hormone signaling pathways. Moreover, network analysis shows that many overlapping proteins are significantly involved in the interactions which mediated the crosstalk among plant hormones. The new version of AraPPINet provides a more reliable interactome which would facilitate the understanding of crosstalk among hormone signaling pathways in plants.Entities:
Keywords: Arabidopsis thaliana; computational interactome; plant hormone crosstalk; protein-protein interaction network; random forest
Year: 2019 PMID: 31333706 PMCID: PMC6625390 DOI: 10.3389/fpls.2019.00870
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
The gold standard PPI data from various databases.
| Database | BioGRID | IntAct | DIP | BIND | MINT | Unique PPIs in total |
|---|---|---|---|---|---|---|
| BioGRID | 8,953 | 6,603 | 95 | 107 | 22 | 8,953 |
| IntAct | – | 6,861 | 48 | 128 | 30 | 6,861 |
| DIP | – | – | 146 | 3 | 0 | 146 |
| BIND | – | – | – | 153 | 1 | 153 |
| MINT | – | – | – | – | 30 | 30 |
| All | – | – | – | – | – | 9,260 |
FIGURE 1Comparison of feature coverage in the updated and the previous versions. (A) Data coverage of all protein pairs with the listed features in the updated and the previous datasets. The coverage represents a ratio of the number of protein pairs with available information in each feature divided by the number of whole possible protein pairs. (B) Data coverage of the predicted PPIs with the given features in the two AraPPINet versions. The coverage represents a ratio of the number of predicted PPIs with available information in each feature divided by the number of all predicted interactions.
Performance comparisons of PPI predictions with and without SMOTE technique.
| Positive to negative ratio | SMOTE or not | TPR on training set (%) | FPR on training set (%) | TPR on test set (%) | FPR on test set (%) |
|---|---|---|---|---|---|
| 1:1 (9260:9260) | No | 70.62 | 14.97 | 64.01 | 14.07 |
| 1:10 (9260:92600) | No | 40.05 | 0.839 | 22.36 | 0.753 |
| 1:50 (9260:463000) | No | 27.51 | 0.114 | 11.51 | 0.098 |
| 1:60 (9260:555600) | No | 26.34 | 0.092 | 10.76 | 0.086 |
| 1:80 (9260:740800) | No | 24.64 | 0.061 | 9.64 | 0.065 |
| 1:100 (9260:926000) | No | 23.19 | 0.047 | 9.01 | 0.055 |
| 1:10 (92600:1389000) | SMOTE | 84.51 | 0.405 | 18.77 | 0.179 |
| 1:30 (46300:1389000) | SMOTE | 63.33 | 0.103 | 14.47 | 0.101 |
| 1:50 (27780:1389000) | SMOTE | 47.94 | 0.071 | 13.78 | 0.072 |
| 1:75 (18520:1389000) | SMOTE | 28.29 | 0.047 | 12.29 | 0.057 |
FIGURE 2Performance comparison of the updated AraPPINet with other computational methods. (A) PR curves of the updated AraPPINet and the previous version on test dataset. (B) Venn diagram of predicted PPIs overlapping with the experimentally determined protein interactions. (C) Comparison of the updated AraPPINet with other methods based on the high-throughput dataset. (D) Comparison of the updated AraPPINet with other methods based on the newly reported interactions.
F-measure comparison of different prediction methods based on high-throughput dataset.
| Method | Positive PPIs | All predicted PPIs | Precision | Recall | F1 score |
|---|---|---|---|---|---|
| RandNet | 123 | 345,006 | 0.0004 | 0.0083 | 0.0007 |
| AtPID | 222 | 24,418 | 0.0091 | 0.0150 | 0.0113 |
| AtPIN | 165 | 87,936 | 0.0019 | 0.0111 | 0.0032 |
| PAIR | 724 | 137,837 | 0.0053 | 0.0489 | 0.0095 |
| The previous AraPPINet | 1,779 | 316,747 | 0.0056 | 0.1201 | 0.0107 |
| The updated AraPPINet | 2,252 | 345,006 | 0.0065 | 0.152 | 0.0125 |
F-measure comparison of different prediction methods based on newly released dataset.
| Method | Positive PPIs | All predicted PPIs | Precision | Recall | F1 score |
|---|---|---|---|---|---|
| RandNet | 124 | 345,006 | 0.0004 | 0.0058 | 0.0007 |
| AtPID | 154 | 24,418 | 0.0063 | 0.0072 | 0.0067 |
| AtPIN | 187 | 87,936 | 0.0021 | 0.0088 | 0.0034 |
| PAIR | 843 | 137,837 | 0.0061 | 0.0395 | 0.0106 |
| The previous AraPPINet | 2,091 | 316,747 | 0.0066 | 0.098 | 0.0124 |
| The updated AraPPINet | 2,880 | 345,006 | 0.0083 | 0.135 | 0.0157 |
FIGURE 3Performance comparison of the two AraPPINet versions for predicting hormone signaling networks. (A–E) Venn diagram of predicted PPIs involved in different hormone signaling overlapping with the experimentally determined PPIs (F) Comparison of the two AraPPINet versions for predicting PPIs involved in hormone signaling based on the high-throughput dataset. (G) Comparison of the two AraPPINet versions for predicting PPIs involved in hormone signaling based on the newly reported interactions.
FIGURE 4Inference of hormone signaling crosstalk from the updated AraPPINet. (A) Venn diagram of hormone signaling interacting proteins inferred from the updated network. (B) Overlapping interacting proteins of five hormone signaling pathways. Numbers on the diagonal line represent total number of proteins interacting with each hormone signaling. The number of common interacting proteins between any two hormone signaling pathways is indicated above the diagonal. P-value of the overlap obtained from fisher’s exact test is presented below the diagonal line. (C) Enriched GO functional categories of proteins interacting with each hormone signaling pathway. (D) Enrichment folds of interactions connected core components of GA with that in other hormone signaling pathways. (E) Interactions among core components in five hormone signaling pathways. All predicted connections among core proteins are colored in yellow, and experimentally demonstrated interactions are colored in red.
FIGURE 5Functional enrichment and interaction network of overlapping proteins of five hormone signaling pathways. (A) Enriched GO functional categories of 298 overlapping proteins of five hormone signaling pathways. (B) Interaction network of overlapping proteins and core components in hormone signaling pathways. Orange and blue nodes represent overlapping proteins and core components, respectively. All predicted links among these proteins are colored in gray and experimentally demonstrated edges are colored in red.
| Hormone 1 | Not Hormone 1 | |
| Hormone 2 | X 12 | X2–X12 |
| Not Hormone 2 | X1–X12 | 13,929-X1–X2+X12 |