Shao-Wu Zhang, Yao-Jun Li, Li Xia, Quan Pan.
Abstract
BACKGROUND: Extracting and visualizing protein-protein interactions (PPIs) from the text literature is a meaningful topic in protein science, because it assists the identification of interactions among proteins. There is a lack of tools that extract PPIs, visualize them, and classify the results.
Year: 2010 PMID: 20550717 PMCID: PMC2906489 DOI: 10.1186/1471-2105-11-326
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1. Architecture of PPLook. PPLook contains four modules: (i) the submission authentication module identifies whether the query word is a protein name; (ii) the article parser module acts as a collector that picks up sentences containing the query protein name; (iii) the full-sentence parser module determines the PPI; and (iv) the PPI visualization module displays the PPI in the form of a 3-D graph.
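The flow through modules (i) to (iii) can be sketched roughly as follows. This is a minimal illustration, not PPLook's implementation: the function names, the tiny `KNOWN_PROTEINS` dictionary, the naive sentence splitter, and the single "interact" keyword check are all assumptions made for the example, and the visualization module (iv) is omitted.

```python
import re

# Hypothetical protein dictionary; the name resource PPLook uses is not given here.
KNOWN_PROTEINS = {"IL-2", "IL-2R", "STAT5"}


def is_protein_name(query):
    """Module (i): accept the query only if it is a known protein name."""
    return query in KNOWN_PROTEINS


def pick_sentences(text, query):
    """Module (ii): keep the sentences that mention the query protein."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    return [s for s in sentences if query in s]


def find_partners(sentence, query):
    """Module (iii), heavily simplified: list other known proteins in a
    sentence that also contains an interaction keyword."""
    if not re.search(r"\binteract", sentence, re.IGNORECASE):
        return []
    tokens = re.findall(r"[A-Za-z0-9-]+", sentence)
    return [t for t in tokens if t in KNOWN_PROTEINS and t != query]


def run_pipeline(text, query):
    """Chain the three stages and return (query, partner) pairs."""
    if not is_protein_name(query):
        return []
    pairs = []
    for sentence in pick_sentences(text, query):
        for partner in find_partners(sentence, query):
            pairs.append((query, partner))
    return pairs
```

Rejecting the query before any text is scanned mirrors the ordering in the caption: a word that is not a protein name never reaches the parser stages.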
Statistical results of six key words.
| ID | Key words | Frequencies | Recall (%) | Precision (%) |
|---|---|---|---|---|
| 1 | interact | 538 | 89.1 | 96.1 |
| 2 | bind | 415 | 80.9 | 90.2 |
| 3 | complex | 1625 | 86.6 | 95.3 |
| 4 | regulate | 617 | 86.4 | 92.7 |
| 5 | activate | 1613 | 82.8 | 91.3 |
| 6 | associate | 483 | 80.4 | 92.5 |
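This record does not define the two measures in the table. Assuming the standard information-retrieval definitions, with $TP$ the number of correctly extracted interactions, $FP$ the number of extracted items that are not real interactions, and $FN$ the number of real interactions that were missed (these symbols are introduced here, not taken from the source), they would read:

$$
\text{Recall} = \frac{TP}{TP + FN} \times 100\%, \qquad
\text{Precision} = \frac{TP}{TP + FP} \times 100\%
$$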
Keywords and corresponding PPI patterns
| Keywords | Patterns |
|---|---|
| Interact | A interact with B |
| | Interaction of A (with \| and) B |
| | Interaction (between \| among) |
| | A - B interact |
| | A and B interact |
| Associate | A associate with B |
| | Association between A and B |
| | Association of A (with \| and) B |
| | A and B associated with each other |
| Bind | Binding of A to B |
| | A and B bind |
| | Binding between A and B |
| | A bind B |
| Complex | A (- \| /) B complex |
| | A and B complex |
| | complex A and B |
| | A complex with B |
| | Complex...contains B... |
| | A complex B |
| Activate | A activate B |
| Regulate | A regulate B |
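A few of the patterns in the table can be expressed as regular expressions, as in the sketch below. This only illustrates keyword-pattern matching and is not PPLook's parser: the `NAME` token shape, the optional verb suffixes, and the `extract_ppi` helper are assumptions introduced for the example, and only a subset of the table is covered.

```python
import re

# A and B stand for protein names; this token shape is an assumption.
NAME = r"[A-Za-z][A-Za-z0-9-]*"

# A subset of the table's patterns. The optional suffixes (s, ed, d) are an
# added assumption so that inflected verb forms also match.
PATTERNS = [
    ("interact", rf"(?P<a>{NAME}) interact(?:s|ed)? with (?P<b>{NAME})"),
    ("interact", rf"[Ii]nteraction of (?P<a>{NAME}) (?:with|and) (?P<b>{NAME})"),
    ("associate", rf"(?P<a>{NAME}) associate(?:s|d)? with (?P<b>{NAME})"),
    ("bind", rf"[Bb]inding of (?P<a>{NAME}) to (?P<b>{NAME})"),
    ("complex", rf"(?P<a>{NAME})[-/](?P<b>{NAME}) complex"),
    ("activate", rf"(?P<a>{NAME}) activate(?:s|d)? (?P<b>{NAME})"),
    ("regulate", rf"(?P<a>{NAME}) regulate(?:s|d)? (?P<b>{NAME})"),
]


def extract_ppi(sentence):
    """Return (keyword, A, B) for every pattern that matches the sentence."""
    hits = []
    for keyword, pattern in PATTERNS:
        for match in re.finditer(pattern, sentence):
            hits.append((keyword, match.group("a"), match.group("b")))
    return hits
```

For instance, `extract_ppi("Binding of Grb2 to Sos")` yields one hit for the keyword `bind` with A = `Grb2` and B = `Sos`. Surface patterns like these cannot handle negation or long-range sentence structure, so they should be read as a starting point only.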
Figure 2. An example of PPLook search results for the protein IL-2. (A) is the protein semantic class selection window, (B) is the IL-2 protein input window, (C) is the text results output window, and (D) is the 3-D display output window.
Figure 3. Heterogeneous search results: structure information and MEDLINE results for the protein IL-2. The top left window shows the structure information for the protein IL-2 from the PDB database. The bottom right window shows the Google search results.