| Literature DB >> 26471719 |
Jing Zhou1,2, Yuxuan Shui1,2, Shengwen Peng1,2, Xuhui Li3, Hiroshi Mamitsuka4, Shanfeng Zhu1,2.
Abstract
Currently, all MEDLINE documents are indexed by medical subject headings (MeSH). Computing semantic similarity between two MeSH headings as well as two documents has become very important for many biomedical text mining applications. We develop an R package, MeSHSim, which can compute nine similarity measures between MeSH nodes, by which similarity between MeSH headings as well as MEDLINE documents can be easily computed. Also, MeSHSim supports querying hierarchy information of a MeSH heading and retrieving MeSH headings of a query document, and can be easily integrated into pipelines for any biomedical text analysis tasks. MeSHSim is released under general public license (GPL), and available through Bioconductor and from Github at https://github.com/JingZhou2015/MeSHSim.Keywords: MEDLINE documents; MeSH; R/bioconductor package; semantic similarity
Mesh:
Year: 2015 PMID: 26471719 DOI: 10.1142/S0219720015420020
Source DB: PubMed Journal: J Bioinform Comput Biol ISSN: 0219-7200 Impact factor: 1.122