| Literature DB >> 35105306 |
Xianwei Pan1, Peng Huang1, Shan Li1, Lei Cui2.
Abstract
BACKGROUND: Besides Boolean retrieval with medical subject headings (MeSH), PubMed provides users with an alternative way called "Related Articles" to access and collect relevant documents based on semantic similarity. To explore the functionality more efficiently and more accurately, we proposed an improved algorithm by measuring the semantic similarity of PubMed citations based on the MeSH-concept network model.Entities:
Keywords: Medical subject headings; Network analysis; Random walk with restart algorithm; Semantic similarity network
Mesh:
Year: 2022 PMID: 35105306 PMCID: PMC8805236 DOI: 10.1186/s12859-022-04578-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The detail information of the studied corpus
| ID | Semantic template | Topic code | Related articles |
|---|---|---|---|
| 1 | Method or protocol | 100–109 | 922 |
| 2 | Gene(s) & disease | 110–119 | 1413 |
| 3 | Gene & biological process | 120–129 | 927 |
| 4 | Genes & function of organ & disease | 130–134 136–139 | 210 |
| 5 | Gene with mutation & biological impact | 140–149 | 859 |
Fig. 1Flowchart of network modelling
Fig. 2Definitions of TP, FP, FN, TN
Basic information of MeSH network and MeSH-concept network
| Net. type | No. of nodes | No. of MeSH nodes | No. of concept nodes | No. of edges | Average degree | Clustering coefficient |
|---|---|---|---|---|---|---|
| MeSH net | 2438 | 2438 | 0 | 366,614 | 300.75 | 0.66 |
| MeSH-concept net | 7302 | 2438 | 4864 | 417,538 | 114.36 | 0.58 |
Fig. 3Visualization of MeSH network and MeSH-concept network
Fig. 4ROC curves of MCRWR, MRWR and PMRA algorithms
Fig. 5Mean P5 values of 49 topics
Fig. 6Precision curves of topics for three algorithms
Fig. 7The pruned document similarity network of “Genes&Function of organ & Disease”
Topics and corresponding document numbers of golden standards
| Topic ID | 130 | 131 | 132 | 133 | 134 | 136 | 137 | 138 | 139 |
|---|---|---|---|---|---|---|---|---|---|
| Article number | 29 | 42 | 28 | 5 | 9 | 3 | 50 | 11 | 33 |
Communities and corresponding article numbers of the pruned network clustering
| Cluster ID | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
|---|---|---|---|---|---|---|---|---|
| Article number | 51 | 42 | 33 | 27 | 31 | 12 | 11 | 3 |
Fig. 8Example of document similarity network based on MeSH terms
Fig. 9Boxplots of topic precisions at different thresholds by three algorithms