| Literature DB >> 29688374 |
Wei Wei1, Zhanglong Ji1, Yupeng He1, Kai Zhang1, Yuanchi Ha1, Qi Li2, Lucila Ohno-Machado1.
Abstract
Database URL: https://github.com/w2wei/dataset_retrieval_pipeline.Entities:
Mesh:
Year: 2018 PMID: 29688374 PMCID: PMC5861401 DOI: 10.1093/database/bay017
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.The pipeline for dataset retrieval. Additional information was collected as a supplement to the dataset metadata. Indices were built on the combination of metadata and additional information. Once a query was automatically generated from a user's question, the system retrieved relevant datasets. Next, these datasets were re-ranked using two different algorithms, the pseudo sequential dependence model and the snippet-based query expansion method. The re-ranked results could be further merged to get an averaged result using the Ensemble method. Re-ranked datasets were evaluated on the test set provided by the Challenge organizers.
Figure 2.Query interpretation: from a free-text question to a query. Uninformative words were removed using a rule-based method. The query expansion used the same method as PubMed does, relying on NCBI E-utilities.
The performance of the Ensemble methods
| PSD-allwords | PSD-keywords | SQEM | infAP | infNDCG | NDCG@10 | P@10 (+partial) | P@10 (-partial) |
|---|---|---|---|---|---|---|---|
| Y | Y | Y | 0.3120 | 0.4560 | 0.6089 | 0.7267 | 0.3067 |
| N | Y | Y | 0.3120 | 0.4442 | 0.5649 | 0.6800 | 0.2800 |
| Y | N | Y | 0.3216 | 0.4735 | 0.6439 | 0.7733 | 0.3333 |
| Y | Y | N | 0.2801 | 0.5398 | 0.6800 | 0.2400 |
Y, the feature is included; N, the feature is not included.
The performance of five methods in infAP, infNDCG, NDCG@10, P@10(+partial) and P@10(−partial)
| Category | Method | infAP | infNDCG | NDCG@10 | P@10 (+partial) | P@10 (-partial) |
|---|---|---|---|---|---|---|
| No re-ranking | Elasticsearch | 0.2446 | 0.4333 | 0.4228 | 0.5200 | 0.2733 |
| Re-ranking | PSD-allwords | 0.2792 | 0.6152 | 0.3267 | ||
| PSD-keywords | 0.2391 | 0.4490 | 0.4088 | 0.5200 | 0.1667 | |
| SQEM | 0.4783 | 0.7467 | ||||
| Ensemble | 0.2801 | 0.4847 | 0.5398 | 0.6800 | 0.2400 |
The indices were built on the provided metadata and the additional information. All methods used automatically generated queries. Method Elasticsearch did not use any re-ranking methods. The other four methods used re-ranking algorithms. infAP is inferred average precision, infNDCG is inferred NDCG, NDCG@10 is the NDCG score on top 10 results, P@10(+partial) is the precision of top 10 results considering ‘partially relevant’ as ‘relevant,’ P@10(-partial) is the precision of top 10 results considering ‘partially relevant’ as ‘irrelevant.’.
Comparison of the pipeline with settings of combinations of three different features
| Additional fields | Standard fields | Query expansion | infNDCG | |
|---|---|---|---|---|
| 1 | Y | Y | Y | 0.4333 |
| 2 | N | Y | Y | 0.4164 |
| 3 | Y | N | Y | 0.4159 |
| 4 | Y | Y | N | 0.3961 |
| 5 | Y | N | N | 0.3868 |
| 6 | N | Y | N | 0.4015 |
| 7 | N | N | Y | 0.4084 |
| 8 | N | N | N | 0.4019 |
infNDCG measurements are scored in the rightmost column. When both additional fields and standard fields were excluded, all fields in the metadata were searched.
Y, the feature is included; N, the feature is not included.