| Literature DB >> 21995939 |
Caroline Baroukh1, Sherry L Jenkins, Ruth Dannenfelser, Avi Ma'ayan.
Abstract
BACKGROUND: Word-clouds recently emerged on the web as a solution for quickly summarizing text by maximizing the display of most relevant terms about a specific topic in the minimum amount of space. As biologists are faced with the daunting amount of new research data commonly presented in textual formats, word-clouds can be used to summarize and represent biological and/or biomedical content for various applications.Entities:
Year: 2011 PMID: 21995939 PMCID: PMC3213042 DOI: 10.1186/1751-0473-6-15
Source DB: PubMed Journal: Source Code Biol Med ISSN: 1751-0473
Figure 1Fetching text for Genes2WordCloud. Text to display the word-clouds can originate from six sources. In some cases several steps are taken to convert the input selection to a body of text for further processing.
Figure 2Text processing pipeline. The extracted text from the different options shown in Figure 1 is then processed by standard text mining algorithms. Several steps are taken to process the text for word-cloud display.
Figure 3The Genes2WordCloud user interface. The initial user interface provides users several options to create word-clouds from different sources: Genes- can be used to create word-clouds from list of genes or single genes; Free Text- can be used to create word-clouds from any body of free text; URL- word-clouds from any given URL; Author- word-clouds for specific authors based on a PubMed query; PubMed Search- word-clouds from any PubMed search; BMC Bioinformatics- word-clouds from the abstracts of the most popular papers published in BMC Bioinformatics.
Figure 4Word-cloud created using two genes. Visualization of a word-cloud for Nanog and Sox2 using the Genes option, showing user options to edit the output display.
Figure 5Word-cloud created from a PubMed search. A word-cloud created for the p38 pathway using the PubMed search option.