Paul Thompson, Juliette C Madan, Jason H Moore.
Abstract
BACKGROUND: Retrieving relevant biomedical literature has become increasingly difficult because of the large volume and rapid growth of biomedical publications. A query to a biomedical retrieval system often returns hundreds of results. Since the searcher is unlikely to consider all of these documents, ranking them is important. Ranking by recency, as PubMed does, takes into account only one factor indicating potential relevance. This study explores the use of the searcher's relevance feedback judgments to support relevance ranking based on features more general than recency.
Year: 2015 PMID: 26361503 PMCID: PMC4564977 DOI: 10.1186/s13040-015-0061-5
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
Fig. 1 Machine learning steps
Ten-fold cross validation results, n = 201
| Metric | Maybe + yes (libsvm) | Maybe + no (libsvm) | Maybe + yes (libsvm, detailed) | Maybe + no (libsvm, detailed) | Maybe + yes (j48) | Maybe + no (j48) |
|---|---|---|---|---|---|---|
| Correctly classified | 130 (64.68 %) | 150 (74.63 %) | 97 (48.26 %) | 150 (75 %) | 120 (59.70 %) | 126 (62.69 %) |
| Incorrectly classified | 71 (35.32 %) | 51 (25.37 %) | 104 (51.74 %) | 50 (25 %) | 81 (40.30 %) | 75 (37.31 %) |
| Kappa statistic | 0.26 | −0.01 | −0.04 | 0 | 0.14 | −0.11 |
| Mean absolute error | 0.35 | 0.26 | 0.52 | 0.25 | 0.41 | 0.39 |
| Root mean squared error | 0.59 | 0.50 | 0.72 | 0.5 | 0.61 | 0.59 |
| Relative absolute error | 72.11 % | 67.64 % | 103.48 % | 66.42 % | 83.59 % | 102.51 % |
| Root relative squared error | 120.09 % | 116.52 % | 143.86 % | 115.47 % | 122.76 % | 135.82 % |
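The "correctly classified" percentages and the kappa statistic in the cross-validation table can be related with a short calculation. The sketch below is illustrative only: the function names are hypothetical, and the confusion matrix is an invented example (the record does not report confusion matrices). It uses the standard definition of Cohen's kappa to show why a classifier can score roughly 75 % correct and still have a kappa near zero, as happens when nearly everything is assigned to the majority class.

```python
from typing import List


def percent_correct(correct: int, total: int) -> float:
    """Share of correctly classified documents, as a percentage (2 decimals)."""
    return round(100.0 * correct / total, 2)


def cohen_kappa(confusion: List[List[int]]) -> float:
    """Cohen's kappa for a square confusion matrix.

    confusion[i][j] = number of items whose actual class is i
    and whose predicted class is j.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    # Observed agreement: fraction of items on the diagonal.
    observed = sum(confusion[i][i] for i in range(n)) / total
    # Chance agreement: sum over classes of (actual share * predicted share).
    expected = sum(
        (sum(confusion[i]) / total) * (sum(row[i] for row in confusion) / total)
        for i in range(n)
    )
    if expected == 1.0:  # degenerate case: only one class present
        return 0.0
    return (observed - expected) / (1.0 - expected)


# Count taken from the table: 130 of 201 documents correctly classified.
print(percent_correct(130, 201))  # 64.68

# HYPOTHETICAL confusion matrix (not from the source): every document is
# predicted as the majority class, so 150 of 201 are "correct" by default.
majority_only = [[150, 0], [51, 0]]
print(cohen_kappa(majority_only))  # 0.0
```

Under this assumption, accuracy alone overstates performance, which is why the kappa row is the more informative one to compare across columns.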
Training on 2008–2010 documents and testing on 2011 documents, n = 19
| Metric | Maybe + yes | Maybe + no |
|---|---|---|
| Correctly classified | 8 (42.11 %) | 12 (63.16 %) |
| Incorrectly classified | 11 (57.90 %) | 7 (36.84 %) |
| Kappa statistic | 0 | 0 |
| Mean absolute error | 0.58 | 0.37 |
| Root mean squared error | 0.76 | 0.61 |
| Relative absolute error | 118.46 % | 78.61 % |
| Root relative squared error | 154.09 % | 125.79 % |
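The second evaluation trains on documents from 2008–2010 and tests on documents from 2011. A minimal sketch of such a year-based split follows; the record layout, field names, and example entries are assumptions made for illustration and do not come from the study's data.

```python
from typing import Dict, Iterable, List, Tuple

Record = Dict[str, object]


def split_by_year(
    records: List[Record],
    train_years: Iterable[int],
    test_years: Iterable[int],
) -> Tuple[List[Record], List[Record]]:
    """Partition judged documents into train and test sets by publication year."""
    train_set, test_set = set(train_years), set(test_years)
    train = [r for r in records if r["year"] in train_set]
    test = [r for r in records if r["year"] in test_set]
    return train, test


# HYPOTHETICAL judged documents: placeholder ids, invented labels.
records = [
    {"id": "doc-a", "year": 2008, "label": "yes"},
    {"id": "doc-b", "year": 2009, "label": "maybe"},
    {"id": "doc-c", "year": 2010, "label": "no"},
    {"id": "doc-d", "year": 2011, "label": "yes"},
    {"id": "doc-e", "year": 2011, "label": "no"},
]

# Train on 2008-2010, test on 2011, mirroring the split named in the caption.
train, test = split_by_year(records, range(2008, 2011), [2011])
print(len(train), len(test))  # 3 2
```

Splitting by year rather than at random keeps later documents out of training, which is closer to how a searcher's earlier judgments would be applied to newly published articles.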