| Literature DB >> 27288493 |
Sun Kim1, Lana Yeganova1, W John Wilbur1.
Abstract
UNLABELLED: Medical Subject Headings (MeSH(®)) is a controlled vocabulary for indexing and searching biomedical literature. MeSH terms and subheadings are organized in a hierarchical structure and are used to indicate the topics of an article. Biologists can use either MeSH terms as queries or the MeSH interface provided in PubMed(®) for searching PubMed abstracts. However, these are rarely used, and there is no convenient way to link standardized MeSH terms to user queries. Here, we introduce a web interface which allows users to enter queries to find MeSH terms closely related to the queries. Our method relies on co-occurrence of text words and MeSH terms to find keywords that are related to each MeSH term. A query is then matched with the keywords for MeSH terms, and candidate MeSH terms are ranked based on their relatedness to the query. The experimental results show that our method achieves the best performance among several term extraction approaches in terms of topic coherence. Moreover, the interface can be effectively used to find full names of abbreviations and to disambiguate user queries.Entities:
Mesh:
Year: 2016 PMID: 27288493 PMCID: PMC5039918 DOI: 10.1093/bioinformatics/btw331
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Screenshot of the result for the query, ‘diabetes’
Performance comparison between our theme method and other feature extraction approaches for Top 5 and Top 10 topic terms
| Methods | Top 5 | Top 10 | ||
|---|---|---|---|---|
| UMASS | NPMI | UMASS | NPMI | |
| Theme | −12.3707 | 8.2984 | −76.2566 | 34.9573 |
| Bayes weights | −60.6455 | 4.9002 | −280.0770 | 21.0571 |
| Chi-square | −27.6835 | 7.0750 | −189.4230 | 27.4682 |
| Hypergeometric test | −12.3707 | 8.2984 | −76.2657 | 34.9573 |
The coherence measures, UMASS and NPMI, are used to evaluate topic terms obtained from 100 random MeSH entries, and the scores are averaged.