| Literature DB >> 20576702 |
Xin He1, Yanen Li, Radhika Khetani, Barry Sanders, Yue Lu, Xu Ling, Chengxiang Zhai, Bruce Schatz.
Abstract
Text mining is one promising way of extracting information automatically from the vast biological literature. To maximize its potential, the knowledge encoded in the text should be translated to some semantic representation such as entities and relations, which could be analyzed by machines. But large-scale practical systems for this purpose are rare. We present BeeSpace question/answering (BSQA) system that performs integrated text mining for insect biology, covering diverse aspects from molecular interactions of genes to insect behavior. BSQA recognizes a number of entities and relations in Medline documents about the model insect, Drosophila melanogaster. For any text query, BSQA exploits entity annotation of retrieved documents to identify important concepts in different categories. By utilizing the extracted relations, BSQA is also able to answer many biologically motivated questions, from simple ones such as, which anatomical part is a gene expressed in, to more complex ones involving multiple types of relations. BSQA is freely available at http://www.beespace.uiuc.edu/QuestionAnswer.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20576702 PMCID: PMC2896161 DOI: 10.1093/nar/gkq544
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The flowchart of the BSQA system. The main steps of the computational procedure are shown (see text for details).
Figure 2.The Entity Ranking subsystem of BSQA. (A) The retrieved documents of the example query ‘courtship’. Clicking on the title of one result entry will expand its abstract, highlighting entities with different colors. The hyperlinks in the entities point to external resources. (B) The genes appearing in the retrieved documents, ranked by their frequencies.
Figure 3.The Relation Mining subsystem of BSQA. In the left panel, a user chooses the template question and types in the query variable(s). The main results in the right panel are sorted by the entities and clicking on the entity names will reveal the associated documents.