| Literature DB >> 33093807 |
Mst Shamima Khatun1, Watshara Shoombuatong1, Md Mehedi Hasan1, Hiroyuki Kurata1.
Abstract
Protein-protein interactions (PPIs) are the physical connections between two or more proteins via electrostatic forces or hydrophobic effects. Identification of the PPIs is pivotal, which contributes to many biological processes including protein function, disease incidence, and therapy design. The experimental identification of PPIs via high-throughput technology is time-consuming and expensive. Bioinformatics approaches are expected to solve such restrictions. In this review, our main goal is to provide an inclusive view of the existing sequence-based computational prediction of PPIs. Initially, we briefly introduce the currently available PPI databases and then review the state-of-the-art bioinformatics approaches, working principles, and their performances. Finally, we discuss the caveats and future perspective of the next generation algorithms for the prediction of PPIs.Entities:
Keywords: PPIs database; Protein-protein interactions; bioinformatics; feature selection; machine learning; sequence features
Year: 2020 PMID: 33093807 PMCID: PMC7536797 DOI: 10.2174/1389202921999200625103936
Source DB: PubMed Journal: Curr Genomics ISSN: 1389-2029 Impact factor: 2.236
Fig. (1)A general framework of ML-based PPI prediction. (A higher resolution / colour version of this figure is available in the electronic copy of the article).
Currently available databases for PPIs.
|
|
|
|
|
|---|---|---|---|
| DIP | Several species PPIs that are manually curated | 2002 | |
| TAIR | PPI annotations for | 2007 | |
| PPIM | PPI database for Maize | 2016 | comp-sysbio.org/ppim/ |
| PPIM | 2,762,560 interactions among 14,000 proteins | 2016 | |
| HIPPIE | Human PPI references | 2017 | |
| BioGRID | 400,000 PPIs collected from the experimentations and primary literatures | 2018 | |
| APID | Agile protein intercoms database for bacterial PPIs | 2019 | |
| APID | It integrates the existing public resources and provides PPI information of more than 1100 organisms | 2019 |
Currently available tools for PPI prediction.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Pred_PPI | SVM | Auto covariance | Jackknife | 90.67% (human), 88.99% (yeast), 90.09% ( | 2010 | [ | |
| Hotpoint | SVM | PseAAC and local alignment kernel | 5-fold CV | 70% | 2010 | [ | |
| PSOPIA | Domain-based | Sequence similarity | 10-fold CV | 70-85% | 2014 | [ | |
| NIP | SVM | G-gap dipeptide compositions | Jackknife | 92.67% | 2016 | [ | |
| SPRINT | SVM | 10-fold | N/A | 2017 | [ | ||
| SIPMA | RF | Autocorrelation, AAC, | 10-fold CV | 89.9% | 2018 | [ | |
| DPPI | Deep learning | Sequence features | 10-fold CV | 96% | 2018 | [ | |
| PPI-Detect | SVM | BPF and sequence features | 10-fold CV | 91.40% | 2018 | [ | |
| DLPred | Deep learning | PSSM, HI, AAindex, sequence conservation score, and 3D-1D scores. | 10-fold CV | 73.68% | 2019 | [ | |
| GWORVMBIG | Optimizer-Based Relevance Vector Machine | PSSM and evolutionary encoding | 5-fold CV | NA | 2019 | [ | |
| DAMpred | Neural-Network | Protein structure encoding | 10-fold | 86% | 2019 | [ | |
| FCTP-WSRC | SVM and Weighted sparse leraning | Auto covariance and KNN | 5-fold CV | 96.67%, 99.82%, and 98.09% for | 2020 | [ |
Contingency table.
|
| |||
|---|---|---|---|
| Tested/Estimated/Predicted Results | |||
| Positive (+) | n(TP) | n(FP) | |
| Negative (-) | n(FN) | n(TN) | |
n(TP) and n(FP) represent the numbers of correctly and incorrectly predicted positive samples, respectively. n(TN) and n(FN) represent the numbers of the correctly and incorrectly predicted negative samples, respectively.