| Literature DB >> 26091670 |
Padmini Srinivasan1, Xiao-Ning Zhang2, Roxane Bouten3, Caren Chang4.
Abstract
BACKGROUND: The rapid pace of bioscience research makes it very challenging to track relevant articles in one's area of interest. MEDLINE, a primary source for biomedical literature, offers access to more than 20 million citations with three-quarters of a million new ones added each year. Thus it is not surprising to see active research in building new document retrieval and sentence retrieval systems. We present Ferret, a prototype retrieval system, designed to retrieve and rank sentences (and their documents) conveying gene-centric relationships of interest to a scientist. The prototype has several features. For example, it is designed to handle gene name ambiguity and perform query expansion. Inputs can be a list of genes with an optional list of keywords. Sentences are retrieved across species but the species discussed in the records are identified. Results are presented in the form of a heat map and sentences corresponding to specific cells of the heat map may be selected for display. Ferret is designed to assist bio scientists at different stages of research from early idea exploration to advanced analysis of results from bench experiments.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26091670 PMCID: PMC4474359 DOI: 10.1186/s12859-015-0630-0
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1System components. The figure shows processing sequence with the arrows on the left side. The user may modify aspects of the job (homologs, synonyms etc.) at several points as shown by arrows on the right
Fig. 2Sample heat map output given nine genes and seven keywords
Fig. 3Sample sentence output for a heat map cell. The display also shows PubMed Identifiers (pmid), species identified in the records and relevance rating options
Comparison of systems
| Ferret | Quertle | Textpresso | MEDIE | |
|---|---|---|---|---|
| Brief description | Takes two lists of concepts as input. Retrieves sentences for pairs across two lists and pairs within one list. | Supports semantic searching of concept pairs. A concept may be a single entry or a class of objects via their Power Terms™. | Offers full-text, sentence retrieval from the literature of model organisms or diseases. Although categories of terms such as the cellular component hierarchy of GO may be selected, does not handle a specific list of concepts such as a set of genes. | Retrieves biomedical correlations by specifying pairs of concepts in subject – verb – object structure. Does not handle a list of concepts as input. |
| Results | Retrieved 38 relevant sentences from 25 relevant PubMed documents referencing 26 species. | Found 1 of 25 relevant documents found by Ferret under focused results. | Found all 15 documents that refer to Arabidopsis. The 25 species referred to by 14 relevant sentences from 10 documents are not indexed by Textpresso. | Found 4 of the 25 relevant documents. |