| Literature DB >> 20952401 |
Mingzhi Lin1, Xueling Shen, Xin Chen.
Abstract
The predicted Arabidopsis interactome resource (PAIR, http://www.cls.zju.edu.cn/pair/), comprised of 5990 experimentally reported molecular interactions in Arabidopsis thaliana together with 145,494 predicted interactions, is currently the most comprehensive data set of the Arabidopsis interactome with high reliability. PAIR predicts interactions by a fine-tuned support vector machine model that integrates indirect evidences for interaction, such as gene co-expressions, domain interactions, shared GO annotations, co-localizations, phylogenetic profile similarities and homologous interactions in other organisms (interologs). These predictions were expected to cover 24% of the entire Arabidopsis interactome, and their reliability was estimated to be 44%. Two independent example data sets were used to rigorously validate the prediction accuracy. PAIR features a user-friendly query interface, providing rich annotation on the relationships between two proteins. A graphical interaction network browser has also been integrated into the PAIR web interface to facilitate mining of specific pathways.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20952401 PMCID: PMC3013789 DOI: 10.1093/nar/gkq938
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Assessments of prediction accuracy using independent benchmark data sets
| Independent example data set | PAIR | AtPID | Interologs | Filtered Interologs | AtPIN |
|---|---|---|---|---|---|
| New interactions in BioGrid (448) | 115 | 16 | 50 | 20 | 57 |
| All cell-cycle protein interactions (357) | 170 | 22 | 27 | 11 | 37 |
| New cell-cycle protein interactions (293) | 140 | 16 | 16 | 2 | 26 |
aThe PAIR V3 predicted interactions.
bThe predicted interactions in the AtPID database (15).
cThe interactions predicted by Geisler-Lee et al. (19) based on homology to known interactions in other organisms (interologs).
dThe filtered interolog-based predicted interactions by De Bodt et al. (14).
eThe predicted interactions in the AtPIN database (essentially the assembly of the Interologs data set and the AtPID database) (16).
fThe newly curated interactions in the BioGRID database (8).
gThe 375 interactions between 58 cell-cycle proteins reported in a recent Plant Cell article (21).
hThe 293 interactions reported in (21) that had not been reported before.
Figure 1.The Interaction Information page. (a) Summary of the interaction and its component proteins. (b) Indirect evidences supporting this interaction, including gene co-expressions, domain interactions, shared GO annotations, co-localizations, phylogenetic profile similarities and homologous interactions in other organisms (interologs). Co-publication (two interacting proteins appearing in the same articles) information was not used for interaction prediction, but is also presented here to facilitate literature mining. (c) The confidence scores for this interaction. The SVM score is the overall confidence score.
Figure 2.The graphical interaction network browser showing a network of interactions between proteins in the sulfur metabolism pathway. Several core sulfur metabolism proteins (triangle nodes) are used to search PAIR for interactions between them. This network graph is then expanded to include additional interactions. Double click on a node will bring up a window showing all interactions involving this protein and allow selection of some interactions to be added into this graph. Newly added proteins are shown as circle nodes. Nodes are colored according to their protein functions, as detailed in the Help/FAQ page. Experimentally reported interactions (edges) are in red. Predicted interactions that are homologous to known interactions in other organisms are in blue. Other predicted interactions are in grey.