| Literature DB >> 18515840 |
Hong-Jie Dai1, Chi-Hsin Huang, Ryan T K Lin, Richard Tzong-Han Tsai, Wen-Lian Hsu.
Abstract
BIOSMILE web search (BWS), a web-based NCBI-PubMed search application, which can analyze articles for selected biomedical verbs and give users relational information, such as subject, object, location, manner, time, etc. After receiving keyword query input, BWS retrieves matching PubMed abstracts and lists them along with snippets by order of relevancy to protein-protein interaction. Users can then select articles for further analysis, and BWS will find and mark up biomedical relations in the text. The analysis results can be viewed in the abstract text or in table form. To date, BWS has been field tested by over 30 biologists and questionnaires have shown that subjects are highly satisfied with its capabilities and usability. BWS is accessible free of charge at http://bioservices.cse.yzu.edu.tw/BWS.Entities:
Mesh:
Year: 2008 PMID: 18515840 PMCID: PMC2447743 DOI: 10.1093/nar/gkn319
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.BWS search interface. The left frame (No. 1) shows the search history. In the search box (No. 2), a user can enter either a PMID or keywords. The rest of the work space shows a table that retrieved abstracts. Each abstract is hyperlinked to the original PubMed abstract (No. 3). For each snippet, BWS annotates gene or protein names in light blue color (No. 4). A graduated bar meter is also presented (No. 5) in the “Protein-Protein Interaction” column to indicate the abstract's relevance to PPI. (BWS search results).
Figure 2.Relation Analysis. The analysis history is displayed in the left pane (No. 1). Analysis results are shown in the Analysis tab pane (No. 2), with biomedical verbs (No. 3) marked in red. The semantic roles related to a verb are listed on the right-hand side in its corresponding accordion pane (No. 4). When users click on verb (No. 3), its pane will automatically expand (No. 4).
Figure 3.Analysis summary table (simple).
Figure 4.Analysis summary table (advanced).
Figure 5.The architecture of BIOSMILE Web Search.
Figure 6.A parsing tree.
Figure 7.A parsing tree annotated with semantic roles.
Training/Test set of the GENIA Treebank
| Source | Set | Sentences |
|---|---|---|
| GTB 300 | Training | 2768 |
| GTB 200 | Test | 1732 |
The performance of our biomedical SRL system in 100 randomly selected articles
| Semantic Role | |||
|---|---|---|---|
| Arg0 | 91.86 | 82.93 | 87.18 |
| Arg1 | 91.90 | 76.95 | 83.76 |
| Arg2 | 76.00 | 64.04 | 69.51 |
| ArgM-ADV | 76.00 | 57.58 | 69.51 |
| ArgM-DIS | 100.00 | 95.83 | 97.87 |
| ArgM-LOC | 94.29 | 58.93 | 72.53 |
| ArgM-MNR | 95.65 | 75.00 | 84.08 |
| ArgM-MOD | 94.12 | 94.12 | 94.12 |
| ArgM-NEG | 100.00 | 84.62 | 91.67 |
| ArgM-TMP | 42.86 | 21.43 | 28.57 |
| Overall | 90.06 | 74.85 | 81.75 |
Comparison of BWS with existing tools
| Software | Summary | Named entities | PPI abstract classification | Analyzed relation types |
|---|---|---|---|---|
| BWS | (1) Provides an online PubMed search and annotates NEs and PPI relevance for retrieved abstracts. | Protein, DNA, RNA, cell, disease | Yes | Multiple semantic relations, including predicate–agent, predicate–patient, predicate–location, predicate–negation, predicate–extent, predicate–manner, predicate–time, etc. |
| iHOP | (1) Filters and ranks the retrieved sentence that match the given gene or protein names according to significance, impact factor, date of publication and syntax. | Genes, MeSH-terms, chemical compounds | No | Cooccurrence only |
| MEDIE | (1) Provides a search interface with subject, verb and object input fields and retrieves S–V–O syntactic relations. | Genes, diseases | No | Syntactic relations (subject–verb–object) |
ahttp://www.ihop-net.org
bhttp://www-tsujii.is.s.u-tokyo.ac.jp/medie/
Summary of descriptive statistics and construct reliability analysis
| Construct | Mean | SD |
|---|---|---|
| User information satisfaction (10 items) | ||
| Intension to use (3 items) | ||
| Perceived usefulness (6 items) | ||
| Perceived ease of use (6 items) | ||
| System usability | ||
| Overall reaction towards the system (6 items) | 6.97 | 1.17 |
| Screen layout and sequence of system (4 items) | 6.71 | 1.34 |
| Terminology and system information (6 items) | 6.98 | 1.33 |
| Learning to use the system (6 items) | 7.42 | 1.07 |
| Capabilities of the system (5 items) | 6.81 | 1.32 |
PubMed access time comparison
| No. of articles | Average query time | |
|---|---|---|
| Before mirror (s) | After mirror (s) | |
| 5 | 5.05 | 3.24 |
| 10 | 5.83 | 3.48 |
| 20 | 6.73 | 4.06 |
The processing time of NER, PPI abstract classifier and SRL (disable cache)
| NER time | PPI abstract classification time | SRL time |
|---|---|---|
| 0.58 s per sentence | 2.47 s per abstract | 1.99 s per sentence |