| Literature DB >> 17962307 |
Jian Cui1, Peng Li, Guang Li, Feng Xu, Chen Zhao, Yuhua Li, Zhongnan Yang, Guang Wang, Qingbo Yu, Yixue Li, Tieliu Shi.
Abstract
Arabidopsis thaliana Protein Interactome Database (AtPID) is an object database that integrates data from several bioinformatics prediction methods and manually collected information from the literature. It contains data relevant to protein-protein interaction, protein subcellular location, ortholog maps, domain attributes and gene regulation. The predicted protein interaction data were obtained from ortholog interactome, microarray profiles, GO annotation, and conserved domain and genome contexts. This database holds 28,062 protein-protein interaction pairs with 23,396 pairs generated from prediction methods. Among the rest 4666 pairs, 3866 pairs of them involving 1875 proteins were manually curated from the literature and 800 pairs were from enzyme complexes in KEGG. In addition, subcellular location information of 5562 proteins is available. AtPID was built via an intuitive query interface that provides easy access to the important features of proteins. Through the incorporation of both experimental and computational methods, AtPID is a rich source of information for system-level understanding of gene function and biological processes in A. thaliana. Public access to the AtPID database is available at http://atpid.biosino.org/.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17962307 PMCID: PMC2238993 DOI: 10.1093/nar/gkm844
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Overview of GSP resources
| PPI Resources | Number of.PPI pairs | Number of proteins in PPI pairs | |
|---|---|---|---|
| GSP PPI | [ | 1259 | 740 |
| [ | 1528 | 677 | |
| [ | 1475 | 538 | |
| [ | 1073 | 698 | |
| [ | 3866 | 1875 | |
| Protein complexes | [ | 1700 | 856 |
| Total | [ | 4666 | 2285 |
[1] Manually collected protein interactions are extracted directly from thousands of published articles in PubMed. [2] InAct provides a freely available, open source database system for protein interaction data in EMBL-EBI. All interactions are derived from literature curation or direct user submissions. [3] BIND is a new resource to perform cross-database searches of available sequence, interaction, complex and pathway information. It integrates a range of component databases including Genbank and BIND, the Biomolecular Interaction Network Database. [4] TAIR provides ‘Tair Protein Interaction’ file by Matt Geisler at its FTP (ftp://ftp.arabidopsis.org/home/tair/Proteins/). [5] KEGG, a reference knowledge base linking genomes to biological systems and environments, provides resourceful enzyme complex information. [1]∼[4] After mapping various symbols to AGI, we found 3866 PPI pairs involving 1875 proteins with literature supports. [1]∼[5] combined with enzyme complexes from KEGG, the total number of GSP is up to 4666 involving with 2285 proteins.
Overview of the number of individual predictive dataset
| Number of predictive PPI pairs | Number of proteins in the PPI pairs | |
|---|---|---|
| O: Ortholog interaction datasets | 3045 | 1359 |
| G: Shared biological function:GO Ontology | 553 | 523 |
| E: Co-expression | 14 837 | 8024 |
| F: Gene fusion method | 6570 | 5671 |
| N: Gene neighbors method | 2008 | 1637 |
| P: Phylogenetic profile method | 15 723 | 8751 |
| D: Enriched domain pair | 2182 | 1288 |
| AtPID | 28 062 (putative PPI with GSP) | 12 506 |
| 23 396 (putative PPI without GSP) | 11 706 |
aThrough integration by Naïve Bays Network, AtPID achieved 28 062 PPI pairs with 23 396 pairs from prediction methods. There are seven individual datasets from various approaches, identified by O, G, E, F, N, P and D. The details of each method can be browsed on AtPID FAQ.
Figure 1.The home page of AtPID.
Figure 2.An overview of querying flow.
Figure 3.The AtPID interface of PPI Simple Search Results for the queried protein, HAP3A. There are generally three tables: (1) protein attributes with gene model descriptions and PPI summary, showing how many protein pairs can be predicted and the inferred interactions overlapped with GSP. (2) The second table represents interactants of the queried protein, HAP3A. Users can view what the experimental evidence is to support this consequence and link to corresponding literature(s). (3) The third table represents potential interactants of the queried protein, HAP3A, by Naïve Bays integration across the seven approaches. Each approach is graphically represented by circles. When the cursor is over a circle, it will display the corresponding score (LR) for the method.
Figure 4.AtPID interface of the (Network Display) window for the queried protein, HAP3. In the (Network Display) window, we can view the interaction network involved with queried protein(s) and the neighbor component(s) intuitively. (Text Format Output) can export the interaction pair information for further analysis.
Figure 5.Domain attributes of one interactant of the queried protein, HAP3A.