Jiajin Wu, Jimmy Huang, Zheng Ye.
Abstract
BACKGROUND: Unlike traditional information retrieval (IR), diversity-oriented IR considers the relationships between documents in order to promote novelty and reduce redundancy, thereby providing diversified results that satisfy various user intents. Diversity-oriented IR is especially important in the biomedical domain, as biologists sometimes want diversified results pertinent to their query.
Year: 2014 PMID: 25560088 PMCID: PMC4304246 DOI: 10.1186/1475-925X-13-S2-S3
Source DB: PubMed Journal: Biomed Eng Online ISSN: 1475-925X Impact factor: 2.819
Features for general learning-to-rank model.
| Feature | Description |
|---|---|
| Term frequency inverse document frequency. | |
| Okapi BM25 model [ | |
| The DFR version of BM25 [ | |
| An algorithm derived from the divergence from randomness (DFR) framework [ | |
| The DLH hypergeometric DFR model (parameter-free) [ | |
| KL-divergence language model with Dirichlet smoothing [ | |
| Hiemstra's language model [ | |
| Proximity of query terms: intuitively, the closer together the query terms occur in a document, the more likely the document is to be relevant [ | |
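As an illustration of one feature in the table above, the following is a minimal sketch of Okapi BM25 scoring for a single document. The function name `bm25_score`, the parameter defaults `k1=1.2` and `b=0.75`, and the particular IDF variant are assumptions chosen for the example; the record does not state which implementation or parameter settings the authors used.

```python
import math
from collections import Counter


def bm25_score(query_terms, doc_terms, doc_freq, num_docs, avg_doc_len,
               k1=1.2, b=0.75):
    """Score one document against a query with a common BM25 formulation.

    query_terms: list of query tokens
    doc_terms:   list of document tokens
    doc_freq:    dict mapping term -> number of documents containing it
    num_docs:    number of documents in the collection
    avg_doc_len: average document length in tokens
    k1, b:       assumed defaults, not taken from the paper
    """
    term_counts = Counter(doc_terms)
    doc_len = len(doc_terms)
    score = 0.0
    for term in query_terms:
        tf = term_counts.get(term, 0)
        if tf == 0:
            continue  # terms absent from the document contribute nothing
        n = doc_freq.get(term, 0)
        # Non-negative IDF variant (an assumption; other variants exist).
        idf = math.log(1.0 + (num_docs - n + 0.5) / (n + 0.5))
        # Length-normalised term-frequency saturation.
        denom = tf + k1 * (1.0 - b + b * doc_len / avg_doc_len)
        score += idf * tf * (k1 + 1.0) / denom
    return score
```

In a learning-to-rank setting, a score like this would be one entry in the feature vector for a query-passage pair, alongside the other retrieval-model scores listed in the table.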
Additional features for diversity-biased learning-to-rank model.
| Feature | Description |
|---|---|
| Number of relevant aspects the passage contains. | |
| Number of irrelevant aspects the passage contains. | |
| Number of new relevant aspects the passage contains compared with previously ranked passages. | |
| Number of relevant aspects that already appear in previously ranked passages. | |
| Ratio of passages that contain new aspects to all previously ranked passages. | |
| Ratio of the number of relevant aspects to all aspects before the current rank position. | |
| Ratio of unique relevant aspects to all aspects before the current rank position. | |
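The count-based features in the table above can be sketched as follows, assuming each passage is represented by the set of aspects it covers and that a set of relevant aspects is known for the query. The function name `diversity_features`, the dictionary keys, and the input representation are hypothetical; the three ratio features are left out because their exact definitions are not given here.

```python
def diversity_features(ranked_passages, relevant_aspects):
    """Compute the four count-based diversity features per rank position.

    ranked_passages:  list of aspect collections, in rank order
    relevant_aspects: set of aspects judged relevant to the query
    Returns one dict of feature values per passage.
    """
    relevant_aspects = set(relevant_aspects)
    seen_relevant = set()  # relevant aspects covered by earlier passages
    rows = []
    for aspects in ranked_passages:
        aspects = set(aspects)
        relevant_here = aspects & relevant_aspects
        rows.append({
            # relevant aspects the passage contains
            "n_relevant": len(relevant_here),
            # irrelevant aspects the passage contains
            "n_irrelevant": len(aspects - relevant_aspects),
            # relevant aspects not covered by any earlier passage
            "n_new_relevant": len(relevant_here - seen_relevant),
            # relevant aspects already covered by earlier passages
            "n_seen_relevant": len(relevant_here & seen_relevant),
        })
        seen_relevant |= relevant_here
    return rows
```

For example, with `ranked_passages = [{"a", "b", "x"}, {"b", "c"}]` and `relevant_aspects = {"a", "b", "c"}`, the second passage has two relevant aspects, of which one (`"c"`) is new and one (`"b"`) was already covered.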
Performance Comparison with Baselines on 2006 Collection.
| MAP | Aspect | Passage | Document |
|---|---|---|---|
| BM25 | 0.1972 | 0.0362 | 0.3449 |
| DirKL | 0.1591 | 0.0360 | 0.3566 |
| gLTR | 0.2292 | 0.0369 | 0.3547 |
| LTR | | | |
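For readers unfamiliar with the metric, the following is a minimal sketch of how mean average precision (MAP) is computed at the document level. The function names are hypothetical, and the sketch assumes each ranked list contains unique identifiers. The aspect-level and passage-level variants reported in these tables follow their own TREC Genomics definitions, which are not reproduced here.

```python
def average_precision(ranked_ids, relevant_ids):
    """Average precision of one ranked list (assumes unique ids)."""
    relevant_ids = set(relevant_ids)
    if not relevant_ids:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant_ids:
            hits += 1
            precision_sum += hits / rank  # precision at this relevant hit
    # Unretrieved relevant documents contribute zero precision.
    return precision_sum / len(relevant_ids)


def mean_average_precision(runs):
    """Mean of average precision over (ranked_ids, relevant_ids) pairs."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)
```

Here a ranking `["d1", "d2", "d3"]` with relevant set `{"d1", "d3"}` has precision 1/1 at the first hit and 2/3 at the second, so its average precision is (1 + 2/3) / 2.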
Performance Comparison with Baselines on 2007 Collection.
| MAP | Aspect | Passage | Passage2 | Document |
|---|---|---|---|---|
| BM25 | 0.1622 | 0.0651 | 0.0697 | 0.2402 |
| DirKL | 0.1383 | 0.0693 | 0.0637 | 0.2376 |
| gLTR | 0.1878 | 0.0533 | 0.0706 | 0.2179 |
| LTR | | | | |
Performance Comparison with TREC 2006 Submissions.
| MAP | Aspect | Passage | Document |
|---|---|---|---|
| Max | | | |
| Min | 0.011 | 0.0007 | 0.0198 |
| Median | 0.1581 | 0.0345 | 0.3083 |
| gLTR | 0.2292 | 0.0369 | 0.3547 |
| LTR | | | |
Performance Comparison with TREC 2007 Submissions.
| MAP | Aspect | Passage | Passage2 | Document |
|---|---|---|---|---|
| Max | | | | |
| Min | 0.0197 | 0.0029 | 0.0008 | 0.0329 |
| Median | 0.1311 | 0.0565 | 0.0377 | 0.1897 |
| gLTR | 0.1878 | 0.0533 | 0.0706 | 0.2179 |
| LTR | | | | |
Comparison with Re-Ranking Method on 2006 Collection.
| MAP | Aspect | Passage | Document |
|---|---|---|---|
| Re-Rank | 0.2374 | 0.0386 | 0.3549 |
| LTR | | | |
Comparison with Re-Ranking Method on 2007 Collection.
| MAP | Aspect | Passage | Passage2 | Document |
|---|---|---|---|---|
| Re-Rank | 0.1642 | 0.0651 | 0.0679 | 0.2116 |
| LTR | | | | |
Figure 1. Parameter .
Figure 2. Parameter .