| Literature DB >> 18321884 |
Arek Gladki1, Pawel Siedlecki, Szymon Kaczanowski, Piotr Zielenkiewicz.
Abstract
UNLABELLED: Using literature databases one can find not only known and true relations between processes but also less studied, non-obvious associations. The main problem with discovering such type of relevant biological information is 'selection'. The ability to distinguish between a true correlation (e.g. between different types of biological processes) and random chance that this correlation is statistically significant is crucial for any bio-medical research, literature mining being no exception. This problem is especially visible when searching for information which has not been studied and described in many publications. Therefore, a novel bio-linguistic statistical method is required, capable of 'selecting' true correlations, even when they are low-frequency associations. In this article, we present such statistical approach based on Z-score and implemented in a web-based application 'e-LiSe'. AVAILABILITY: The software is available at http://miron.ibb.waw.pl/elise/Mesh:
Year: 2008 PMID: 18321884 DOI: 10.1093/bioinformatics/btn086
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937