| Literature DB >> 30153250 |
Nicolas Fiorini1, Kathi Canese1, Grisha Starchenko1, Evgeny Kireev1, Won Kim1, Vadim Miller1, Maxim Osipov1, Michael Kholodov1, Rafis Ismagilov1, Sunil Mohan1, James Ostell1, Zhiyong Lu1.
Abstract
PubMed is a free search engine for biomedical literature accessed by millions of users from around the world each day. With the rapid growth of biomedical literature-about two articles are added every minute on average-finding and retrieving the most relevant papers for a given query is increasingly challenging. We present Best Match, a new relevance search algorithm for PubMed that leverages the intelligence of our users and cutting-edge machine-learning technology as an alternative to the traditional date sort order. The Best Match algorithm is trained with past user searches with dozens of relevance-ranking signals (factors), the most important being the past usage of an article, publication date, relevance score, and type of article. This new algorithm demonstrates state-of-the-art retrieval performance in benchmarking experiments as well as an improved user experience in real-world testing (over 20% increase in user click-through rate). Since its deployment in June 2017, we have observed a significant increase (60%) in PubMed searches with relevance sort order: it now assists millions of PubMed searches each week. In this work, we hope to increase the awareness and transparency of this new relevance sort option for PubMed users, enabling them to retrieve information more effectively.Entities:
Mesh:
Year: 2018 PMID: 30153250 PMCID: PMC6112631 DOI: 10.1371/journal.pbio.2005343
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Fig 1The overall architecture of the new relevance search algorithm in PubMed.
(a) It consists of two stages: processing first by BM25, a classic term-weighting algorithm; the top 500 results are then re-ranked by LambdaMART, a high-performance L2R algorithm. The machine-learning–based ranking model is learned offline using relevance-ranked training data together with a set of features extracted from queries, documents, or both. (b) Features designed and experimented in this study with their brief descriptions and identifiers. D, document; IDF, inverse document frequency; L2R, learning to rank; Q, query; QD, query–document relationship; TIAB, title and abstract
Comparison of the user click-through rate of best match versus the previous TF–IDF method and the default date sort order.
| Ranking Method | CTR@20 | CTR@10 | CTR@5 | CTR@3 |
|---|---|---|---|---|
| Sort by date | 0.32 | 0.29 | 0.24 | 0.20 |
| Sort by TF–IDF | 0.36 | 0.33 | 0.29 | 0.25 |
| Sort by Best Match |
All improvements in CTRs by Best Match are statistically significant with 99% confidence (paired t test). Abbreviations: CTR, click-through rate; TF–IDF, term frequency–inverse document frequency.
Fig 2The Best Match search option in action.
When our system detects that search results by Best Match could be helpful to our users, a Best Match banner is displayed on top of the regular search results (a). A user can click title(s) to view the article abstract (as shown in (b)) or click on the Switch button see complete results returned by Best Match (as shown in (c)).
Fig 3Usage rate of relevance sort order over 6 months (May 2017 to October 2017).
The blue line represents the trend, and the blue area represents the variance. The vertical line denotes the switch to the new relevance algorithm, Best Match, which is followed by a significant and steady increase in usage. Note that the 1% usage rate on the y-axis represents about 30,000 queries on an average work day.