| Literature DB >> 27570676 |
Wei Wei1, Rebecca Marmor1, Siddharth Singh1, Shuang Wang1, Dina Demner-Fushman2, Tsung-Ting Kuo1, Chun-Nan Hsu1, Lucila Ohno-Machado1.
Abstract
Recommendation of related articles is an important feature of the PubMed. The PubMed Related Citations (PRC) algorithm is the engine that enables this feature, and it leverages information on 22 million citations. We analyzed the performance of the PRC algorithm on 4584 annotated articles from the 2005 Text REtrieval Conference (TREC) Genomics Track data. Our analysis indicated that the PRC highest weighted term was not always consistent with the critical term that was most directly related to the topic of the article. We implemented term expansion and found that it was a promising and easy-to-implement approach to improve the performance of the PRC algorithm for the TREC 2005 Genomics data and for the TREC 2014 Clinical Decision Support Track data. For term expansion, we trained a Skip-gram model using the Word2Vec package. This extended PRC algorithm resulted in higher average precision for a large subset of articles. A combination of both algorithms may lead to improved performance in related article recommendations.Entities:
Year: 2016 PMID: 27570676 PMCID: PMC5001748
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1.A comparison of the number of matched term counts at different PRC weight thresholds. The red curve is the smoothed trend of matched terms in TP articles. The blue curve is the smoothed trend of matched terms in FP articles. The two curves cross between X= 0.76 and X=0.8. The green curve illustrates the p-value of the difference between TP and FP for the null hypothesis that the count of matched terms in TP is equal to the count of terms in FP above different weight thresholds. When 0.76
Figure 2.A comparison of PRC and XPRC at five precision levels determined by the PRC algorithm on the Genomics dataset and CDS datasets. For the Genomics articles in which PRC does not achieve perfect precision, XPRC has better overall performance in every group. For the CDS articles, XPRC achieved better performance in PRC’s low precision articles. Values associated with every data point are available in Tables 1 and 2. The error bars indicate standard errors.
Figure 3.A comparison of PRC and XPRC at five average precision (AP) levels determined by the PRC algorithm on the Genomics dataset and CDS dataset. For the Genomics articles in which PRC does not achieve perfect AP, XPRC has better performance in every group. For the CDS articles, XPRC achieved better performance in PRC’s low precision articles. Values of every data point are available in Tables 3 and 4. The error bars indicate standard errors.
A comparison of PRC and XPRC at different precision levels determined by the PRC algorithm on the Genomics dataset. The cumulative article count is the number of articles with PRC precision below a given precision level. For example, there are 158 articles that result in PRC precisions lower than 0.2. PRC has precision 0.0 on all the 58 articles in the 0.0 group, so its macro-average precision and standard error are also 0. XPRC has better performance on these articles. p-value shows the significance of the difference between PRC and XPRC at different precision levels.
|
|
|
|
|
|
|
| |
|
| 58 | 158 | 314 | 603 | 1215 | 4234 | |
|
|
| 0 | 0.127 | 0.262 | 0.424 | 0.613 | 0.889 |
|
| 0 | 0.008 | 0.009 | 0.008 | 0.007 | 0.003 | |
|
|
| 0.338 | 0.443 | 0.523 | 0.614 | 0.705 | 0.864 |
|
| 0.048 | 0.028 | 0.019 | 0.013 | 0.009 | 0.004 | |
|
| 3e-09 | 3e-21 | 3e-30 | 2e-31 | 2e-16 | 4e-07 | |
A comparison of PRC and XPRC at different precision levels determined by the PRC algorithm on the CDS dataset. The format of this table is the same as that of Table 1.
|
|
|
|
|
|
|
| |
|
| 87 | 268 | 539 | 1000 | 1670 | 3034 | |
|
|
| 0 | 0.135 | 0.268 | 0.421 | 0.573 | 0.765 |
|
| 0 | 0.006 | 0.006 | 0.006 | 0.006 | 0.005 | |
|
|
| 0.172 | 0.248 | 0.315 | 0.410 | 0.509 | 0.645 |
|
| 0.024 | 0.015 | 0.011 | 0.009 | 0.007 | 0.005 | |
|
| 2e-10 | 2e-11 | 3e-4 | 0.294 | 4e-12 | 6e-57 | |
A comparison of PRC and XPRC at five average precision (AP) levels determined by the PRC algorithm on the Genomics dataset. The cumulative article count is the number of articles with PRC AP below the given AP level. XPRC has better performance at all levels except for 1.0. p-value shows the significance of difference between PRC and XPRC at different AP levels.
|
|
|
|
|
|
|
| |
|
| 58 | 231 | 420 | 687 | 1215 | 4234 | |
|
|
| 0 | 0.09 | 0.198 | 0.324 | 0.504 | 0.858 |
|
| 0 | 0.005 | 0.007 | 0.007 | 0.007 | 0.004 | |
|
|
| 0.274 | 0.417 | 0.493 | 0.555 | 0.633 | 0.827 |
|
| 0.045 | 0.024 | 0.018 | 0.014 | 0.01 | 0.004 | |
|
| 1e-07 | 8e-31 | 2e-45 | 2e-45 | 2e-25 | 1e-07 | |
A comparison of PRC and XPRC at five average precision (AP) levels determined by the PRC algorithm on the CDS dataset. The format of this table is the same as that of Table 3.
|
|
|
|
|
|
|
| |
|
| 87 | 363 | 663 | 1079 | 1670 | 3034 | |
|
|
| 0 | 0.094 | 0.201 | 0.325 | 0.471 | 0.709 |
|
| 0 | 0.004 | 0.005 | 0.006 | 0.006 | 0.006 | |
|
|
| 0.103 | 0.181 | 0.253 | 0.317 | 0.408 | 0.565 |
|
| 0.020 | 0.011 | 0.010 | 0.009 | 0.008 | 0.006 | |
|
| 2e-6 | 3e-12 | 8e-6 | 0.446 | 2e-10 | 6e-62 | |
Time and memory usage of PRC and XPRC. The corpora were randomly selected from the Genomics dataset. For each algorithm, we ran 10 queries on every corpus. The time shown in this table is the average value of all queries on every corpus. The memory in this table is the maximum value for all queries on every corpus.
| PRC | XPRC | |||
|---|---|---|---|---|
| Corpus Size | Time (s) | Maximum Memory (MB) | Time (s) | Maximum Memory (MB) |
| 10 | 0.08 | 65 | 2.1 | 590 |
| 100 | 0.15 | 68 | 2.2 | 591 |
| 1000 | 0.7 | 144 | 2.6 | 601 |