Literature DB >> 11262957

Textquest: document clustering of Medline abstracts for concept discovery in molecular biology.

I Iliopoulos1, A J Enright, C A Ouzounis.   

Abstract

We present an algorithm for large-scale document clustering of biological text, obtained from Medline abstracts. The algorithm is based on statistical treatment of terms, stemming, the idea of a 'go-list', unsupervised machine learning and graph layout optimization. The method is flexible and robust, controlled by a small number of parameter values. Experiments show that the resulting document clusters are meaningful as assessed by cluster-specific terms. Despite the statistical nature of the approach, with minimal semantic analysis, the terms provide a shallow description of the document corpus and support concept discovery.

Mesh:

Year:  2001        PMID: 11262957     DOI: 10.1142/9789814447362_0038

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  13 in total

1.  Creating an online dictionary of abbreviations from MEDLINE.

Authors:  Jeffrey T Chang; Hinrich Schütze; Russ B Altman
Journal:  J Am Med Inform Assoc       Date:  2002 Nov-Dec       Impact factor: 4.497

2.  The computational analysis of scientific literature to define and recognize gene expression clusters.

Authors:  Soumya Raychaudhuri; Jeffrey T Chang; Farhad Imam; Russ B Altman
Journal:  Nucleic Acids Res       Date:  2003-08-01       Impact factor: 16.971

3.  Text mining neuroscience journal articles to populate neuroscience databases.

Authors:  Chiquito J Crasto; Luis N Marenco; Michele Migliore; Buqing Mao; Prakash M Nadkarni; Perry Miller; Gordon M Shepherd
Journal:  Neuroinformatics       Date:  2003

4.  Literature based discovery of gene clusters using phylogenetic methods.

Authors:  Indra Neil Sarkar; Abha Agrawal
Journal:  AMIA Annu Symp Proc       Date:  2006

5.  A document clustering and ranking system for exploring MEDLINE citations.

Authors:  Yongjing Lin; Wenyuan Li; Keke Chen; Ying Liu
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

6.  PESCADOR, a web-based tool to assist text-mining of biointeractions extracted from PubMed queries.

Authors:  Adriano Barbosa-Silva; Jean-Fred Fontaine; Elisa R Donnard; Fernanda Stussi; J Miguel Ortega; Miguel A Andrade-Navarro
Journal:  BMC Bioinformatics       Date:  2011-11-09       Impact factor: 3.307

7.  CoPub Mapper: mining MEDLINE based on search term co-publication.

Authors:  Blaise T F Alako; Antoine Veldhoven; Sjozef van Baal; Rob Jelier; Stefan Verhoeven; Ton Rullmann; Jan Polman; Guido Jenster
Journal:  BMC Bioinformatics       Date:  2005-03-11       Impact factor: 3.169

8.  Discovering semantic features in the literature: a foundation for building functional associations.

Authors:  Monica Chagoyen; Pedro Carmona-Saez; Hagit Shatkay; Jose M Carazo; Alberto Pascual-Montano
Journal:  BMC Bioinformatics       Date:  2006-01-26       Impact factor: 3.169

Review 9.  Linking genes to literature: text mining, information extraction, and retrieval applications for biology.

Authors:  Martin Krallinger; Alfonso Valencia; Lynette Hirschman
Journal:  Genome Biol       Date:  2008-09-01       Impact factor: 13.583

Review 10.  A practical application of text mining to literature on cognitive rehabilitation and enhancement through neurostimulation.

Authors:  Puiu F Balan; Annelies Gerits; Wim Vanduffel
Journal:  Front Syst Neurosci       Date:  2014-09-26
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.