| Literature DB >> 34927002 |
Chrysoula Zerva1,2, Samuel Taylor1, Axel J Soto3, Nhung T H Nguyen1, Sophia Ananiadou1,4.
Abstract
The COVID-19 pandemic resulted in an unprecedented production of scientific literature spanning several fields. To facilitate navigation of the scientific literature related to various aspects of the pandemic, we developed an exploratory search system. The system is based on automatically identified technical terms, document citations, and their visualization, accelerating identification of relevant documents. It offers a multi-view interactive search and navigation interface, bringing together unsupervised approaches of term extraction and citation analysis. We conducted a user evaluation with domain experts, including epidemiologists, biochemists, medicinal chemists, and medicine students. In general, most users were satisfied with the relevance and speed of the search results. More interestingly, participants mostly agreed on the capacity of the system to enable exploration and discovery of the search space using the graph visualization and filters. The system is updated on a weekly basis and it is publicly available at http://www.nactem.ac.uk/cord/.Entities:
Keywords: citation network; exploratory search systems; term extraction
Year: 2021 PMID: 34927002 PMCID: PMC8672931 DOI: 10.1093/jamiaopen/ooab104
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Performance on the 10APR2020 data (round 1) for different indexing units
| TREC round | nDCG | MAP@10 | P@5 | P@10 | 50Q time (min:s) | |
|---|---|---|---|---|---|---|
| (OUR) Full text | 1 | 0.391 | 0.170 | 0.549 | 0.433 | 03:30 |
| (OUR) Abstract + title | 1 | 0.403 | 0.148 | 0.620 | 0.400 |
|
| (OUR) Paragraph | 1 | 0.582 | 0.165 | 0.680 | 0.648 | 04:23 |
| (OUR) First + last paragraph sentence | 1 | 0.625 | 0.155 | 0.704 | 0.700 | 04:35 |
| (OUR) Term-based reranking | 1 |
| 0.190 | 0.715 |
| 12:56 |
| Sabir (sab20.1.meta.docs) | 1 | 0.608 |
|
|
|
|
| Anserini—title/abstract | 0 | 0.606 | 0.356 |
| 0.510 |
|
| Anserini—paragraph | 0 | 0.503 | 0.395 |
| 0.503 |
|
Note: Bold numbers denote the best results for each metric in round 1. The row in green background highlights the best system in our experiments, and the rows in gray background denote round 0 baselines.
Figure 1.List of results and associated term bubble chart view for the query “transmission chain.” The screenshot was captured on 19 July 2021.
Figure 2.Document graph demonstrating connections between documents returned from the query “transmission chain.” Node size signifies relevance to query. Blue nodes correspond to documents returned within the first 50 results. Orange nodes appear after expanding the blue node in their center. Thicker edges correspond to higher relation weights (see top cluster of purple edges); hovering over an edge will show the weight and direction, and if it is a term edge it will show the cooccurring term. The screenshot was captured using data available on/before 19 July 2021.
Figure 3.Summary of participants’ responses to different aspects of the tool. Although a 5-Likert scale is used in the questionnaire, a 3-Likert scale is used in the plot for better identification of patterns in the responses.