| Literature DB >> 35353009 |
Katherine James1, Jose Muñoz-Muñoz1.
Abstract
Since the large-scale experimental characterization of protein-protein interactions (PPIs) is not possible for all species, several computational PPI prediction methods have been developed that harness existing data from other species. While PPI network prediction has been extensively used in eukaryotes, microbial network inference has lagged behind. However, bacterial interactomes can be built using the same principles and techniques; in fact, several methods are better suited to bacterial genomes. These predicted networks allow systems-level analyses in species that lack experimental interaction data. This review describes the current network inference and analysis techniques and summarizes the use of computationally-predicted microbial interactomes to date.Entities:
Keywords: cellular network analysis; data integration; interactome; interologs; systems biology
Year: 2022 PMID: 35353009 PMCID: PMC9040873 DOI: 10.1128/msystems.01456-21
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 7.324
FIG 1Protein–protein interaction (PPI) inference methods. (A) Interologs: where experimentally confirmed interaction partners in one species have similarity to proteins in another species, an interaction can be predicted. (B) Domain–domain interactions (DDIs): the presence of a pair of domains with a known interaction can be predictive of PPI in other proteins containing those domains. (C) Structural interaction: protein structures can be mapped to the structure of interacting proteins to infer PPI. (D) Gene neighborhood: conservation of protein pairs’ (green) proximity in multiple genomes can be predictive of interaction between the pair. (E) Gene fusion: proteins that are fused in one species (yellow and green) have a potential PPI in species in which they are separate proteins. (F) Gene cluster: transcription from an operon in one species indicates functional relation and often PPI in another. Here, an operon of four proteins in one species is predictive of six interactions in another. (G) Phylogenetic profile: protein pairs that interact often have a similar pattern of conservation in multiple genomes (green, presence; orange, absence).
Predicted bacterial interactomes
| Species | Methodology | Proteins | Interactions | Source |
|---|---|---|---|---|
|
| ORTH | 533 | 2,737 |
|
|
| ORTH | 296 | 690 |
|
|
| ORTH | 264 | 732 |
|
|
| ORTH, DDI, GE | 2,448 | 15,864 |
|
|
| ORTH | 247 | 707 |
|
|
| ORTH | 238 | 652 |
|
|
| ORTH | 225 | 611 |
|
|
| DDIs | - | - |
|
| ORTH | 334 | 1,028 |
| |
|
| RF: STRING, GO | - | 955 |
|
|
| ORTH, STRING | - | 15,495 |
|
|
| DDIs | - | 1,280 |
|
| ORTH | 400 | 1,473 |
| |
| SVM: GC, CL, PP | 3,798 | 78,122 |
| |
| ML: GC, PP, MT, CM, IH | 4,150 | 1,847,729 |
| |
| EXP, GC | 4,146 | 80,370 |
| |
| PFIN: EXP, DDI, GE, CC, GC, PP | 4,099 | 95,520 |
| |
| PP | 1,479 | 1,618 |
| |
|
| ORTH | 771 | 5,647 |
|
|
| PFIN: ORTH, GC, DDI, PP, GE, CC | 4,674 | 160,450 |
|
|
| ORTH | 176 | 485 |
|
|
| STRING, GC, MET | 637 | 2,194 |
|
|
| STRING, GC, MET | 256 | 2450 |
|
|
| STRING | 3,925 | 29,664 |
|
| ORTH | 738 | 5,639 |
| |
| RF: STRING, GO | - | 1,854 |
| |
| ORTH, SVM: SEQ | 3,465 | 46,119 |
| |
| STRING | 144 | 587 |
| |
| PP | 1,020 | 911 |
| |
|
| ORTH | 333 | 903 |
|
| RF: GE, CL, GN, DDI, SEQ, FUN | 4,181 | 54,107 |
| |
| PFIN: CC, DDI, GC, GE, ORTH, PP | 5,456 | 203,118 |
| |
|
| ORTH, DDI | 3,254 | 82,019 |
|
|
| ORTH, EXP, STRUCT, MET, TF | 30,870 | 81,514 |
|
|
| ORTH | 332 | 1,359 |
|
|
| ORTH | 383 | 4,548 |
|
| DDI, STRUC, STRING | 2,930 | 109,532 |
| |
| ORTH, DDI, GO | 998 | 8,783 |
| |
| NB: ORTH, DDIs, GC | 3,231 | 4,715 |
| |
|
| ORTH | 275 | 1.021 |
|
|
| ORTH | 365 | 1,520 |
|
|
| ORTH | 372 | 1,557 |
|
|
| ORTH | 352 | 1,100 |
|
|
| ORTH, DDI | 1,988 | 36,886 |
|
CC: co-citation; GE, gene expression; CL, cellular localization; CM, context mirror; DDI, domain–domain interaction; EXP, experimental; FUN, functional interaction; GC, genome context; GO, gene ontology; IH, in silico two hybrid; MET, metabolic interactions; ML, machine learning; MT, mirror tree; NB, Naïve Bayes; ORTH, orthology (interologs); PP, phylogenetic profile; PFIN, probabilistic functional integrated network; SEQ, sequence properties; STRING, https://string-db.org; STRUC, structural interactions; SVM, support vector machine; TF, transcription factor interactions.
Combined for multiple strains.