| Literature DB >> 18279519 |
Neil R Smalheiser1, Wei Zhou, Vetle I Torvik.
Abstract
BACKGROUND: PubMed is designed to provide rapid, comprehensive retrieval of papers that discuss a given topic. However, because PubMed does not organize the search output further, it is difficult for users to grasp an overview of the retrieved literature according to non-topical dimensions, to drill-down to find individual articles relevant to a particular individual's need, or to browse the collection.Entities:
Year: 2008 PMID: 18279519 PMCID: PMC2276193 DOI: 10.1186/1747-5333-3-2
Source DB: PubMed Journal: J Biomed Discov Collab ISSN: 1747-5333
Figure 1The Cluster-by-topic algorithm.
Top 20 most important words for the PubMed query "Alzheimer Disease [MeSH Term]".
| Rank | Important words | Rank | Important words |
| 1 | alzheimer | 11 | app |
| 2 | ad | 12 | donepezil |
| 3 | abeta | 13 | secretase |
| 4 | dementia | 14 | cognitive |
| 5 | amyloid | 15 | abeta42 |
| 6 | neurofibrillary | 16 | mmse |
| 7 | tangle | 17 | neurodegenerative |
| 8 | presenilin | 18 | presenilin-1 |
| 9 | epsilon4 | 19 | disease |
| 10 | apoe | 20 | ps1 |
Figure 2Screenshot of the Anne O'Tate tool returning the PubMed query "dicer."
Figure 3Screenshot of the Anne O'Tate tool displaying a list of the author names mentioned in the set of articles retrieved by the "dicer" query.
Figure 4Screenshot of the Anne O'Tate tool displaying a histogram of the publication dates of the set of articles retrieved by the "dicer" query.
Figure 5Coverage of the cluster-by-topic list across a range of queries. Anonymous queries in the Anne O'Tate query web log were analyzed. For each query, the coverage was computed (i.e., the proportion of MeSH-indexed articles in the PubMed search output that were included in the 15 MeSH-based topical clusters). The results were averaged for retrieved literatures of different size ranges as follows: 0–100 articles, 6 queries; 101–1000 articles, 9 queries; 1001–10000 articles, 9 queries; and >10000 articles, 3 queries.
Clustering the search results of the PubMed query "Alzheimer Disease [MeSH Term]" using the cluster-by-topic function.
| Rank | Topic | Count* |
| Most recent articles | 399 | |
| 1 | Aged, 80 and over | 6282 |
| 2 | Brain | 4711 |
| 3 | Amyloid beta-Protein | 3983 |
| 4 | Neuropsychological Tests | 2846 |
| 5 | Cognition Disorders | 2454 |
| 6 | Neurons | 2022 |
| 7 | Apolipoproteins E | 1969 |
| 8 | Dementia | 1897 |
| 9 | Risk Factors | 1791 |
| 10 | Aging | 1616 |
| 11 | Cholinesterase Inhibitors | 1535 |
| 12 | Tau Proteins | 1382 |
| 13 | Membrane Proteins | 1289 |
| 14 | Caregivers | 971 |
| 15 | Parkinson Disease | 899 |
| Not indexed by topic | 52 | |
| Miscellaneous | 4949 | |
42,671 articles were retrieved from PubMed, 87% of which are included in the 15 MeSH-based topical clusters.
Some currently available web-based tools that allow users to carry out post-processing of PubMed queries.
| AliBaba [15] | Extract relationships between biological objects and map them into a graphical network | ||
| BioIE [16] | Extract informative sentences from retrieved results | ||
| Chilibot [17] | Extract biological relationships from search results | ||
| ConceptLink [18] | Extract relationships between medical concepts and allow graphical visualization | ||
| PubNet [29] | Extract several relationships from the search results and then map them into networks | ||
| XplorMed [20] | Extract dependency relations among words and allow users to refine queries using these words | ||
| GoPubMed [21] | Sort PubMed query results through Gene Ontology and MeSH hierarchy | ||
| MEVA [22] | Summarize search results according to MeSH hierarchy | ||
| FABLE [23] | Tag gene and protein occurrences in text | ||
| PubReMiner [24] | Categories include words, MeSH, authors, journals, year, substances and country | ||
| PubMed Assistant [25] | Lists MeSH and chemicals, with link-outs to PubMed, Google and Google Scholar | ||
| Vivísimo ClusterMed [26] | Cluster articles into several categories | ||
| HubMed [27] | Cluster related articles and allow for graphical visualization | ||
| PubFocus [28] | Rank articles by the journal impact factor and volume of forward references | ||
| ReleMed [29] | Rank articles by relevance |