Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Textquest: document clustering of Medline abstracts for concept discovery in molecular biology.

Literature DB >> 11262957

Textquest: document clustering of Medline abstracts for concept discovery in molecular biology.

I Iliopoulos¹, A J Enright, C A Ouzounis.

Abstract

We present an algorithm for large-scale document clustering of biological text, obtained from Medline abstracts. The algorithm is based on statistical treatment of terms, stemming, the idea of a 'go-list', unsupervised machine learning and graph layout optimization. The method is flexible and robust, controlled by a small number of parameter values. Experiments show that the resulting document clusters are meaningful as assessed by cluster-specific terms. Despite the statistical nature of the approach, with minimal semantic analysis, the terms provide a shallow description of the document corpus and support concept discovery.

Mesh：

Year: 2001 PMID： 11262957 DOI： 10.1142/9789814447362_0038

Source DB: PubMed Journal: Pac Symp Biocomput ISSN： 2335-6928

Keyword Cloud
Cited

13 in total

Review 10. A practical application of text mining to literature on cognitive rehabilitation and enhancement through neurostimulation.

Authors: Puiu F Balan; Annelies Gerits; Wim Vanduffel
Journal: Front Syst Neurosci Date: 2014-09-26

Textquest: document clustering of Medline abstracts for concept discovery in molecular biology.

1. Creating an online dictionary of abbreviations from MEDLINE.

2. The computational analysis of scientific literature to define and recognize gene expression clusters.

3. Text mining neuroscience journal articles to populate neuroscience databases.

4. Literature based discovery of gene clusters using phylogenetic methods.

5. A document clustering and ranking system for exploring MEDLINE citations.

6. PESCADOR, a web-based tool to assist text-mining of biointeractions extracted from PubMed queries.

7. CoPub Mapper: mining MEDLINE based on search term co-publication.

8. Discovering semantic features in the literature: a foundation for building functional associations.

Review 9. Linking genes to literature: text mining, information extraction, and retrieval applications for biology.

Review 10. A practical application of text mining to literature on cognitive rehabilitation and enhancement through neurostimulation.