| Literature DB >> 30329013 |
Axel J Soto1, Piotr Przybyła1, Sophia Ananiadou1.
Abstract
SUMMARY: Although the publication rate of the biomedical literature has been growing steadily during the last decades, the accessibility of pertinent research publications for biologist and medical practitioners remains a challenge. This article describes Thalia, which is a semantic search engine that can recognize eight different types of concepts occurring in biomedical abstracts. Thalia is available via a web-based interface or a RESTful API. A key aspect of our search engine is that it is updated from PubMed on a daily basis. We describe here the main building blocks of our tool as well as an evaluation of the retrieval capabilities of Thalia in the context of a precision medicine dataset.Entities:
Mesh:
Year: 2019 PMID: 30329013 PMCID: PMC6513154 DOI: 10.1093/bioinformatics/bty871
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.The user interface of Thalia is divided into: a search area (top), main search results pane (middle) and faceted results for publication metadata (left) and entities (right)
System performance in terms of infNDCG, precision at 10, R-prec and retrieval time per query in seconds depending on whether the semantic concepts are used for retrieval or not
| infNDCG | P@10 | R-prec | Query time | |
|---|---|---|---|---|
| Textual | 0.338 | 0.403 | 0.213 | 1.22 |
| Thalia | 0.383 | 0.427 | 0.230 | 1.86 |