| Literature DB >> 32379317 |
Julien Gobeill1,2, Déborah Caucheteur2, Pierre-André Michel1, Luc Mottin2, Emilie Pasche1,2, Patrick Ruch1,2.
Abstract
Thanks to recent efforts by the text mining community, biocurators have now access to plenty of good tools and Web interfaces for identifying and visualizing biomedical entities in literature. Yet, many of these systems start with a PubMed query, which is limited by strong Boolean constraints. Some semantic search engines exploit entities for Information Retrieval, and/or deliver relevance-based ranked results. Yet, they are not designed for supporting a specific curation workflow, and allow very limited control on the search process. The Swiss Institute of Bioinformatics Literature Services (SIBiLS) provide personalized Information Retrieval in the biological literature. Indeed, SIBiLS allow fully customizable search in semantically enriched contents, based on keywords and/or mapped biomedical entities from a growing set of standardized and legacy vocabularies. The services have been used and favourably evaluated to assist the curation of genes and gene products, by delivering customized literature triage engines to different curation teams. SIBiLS (https://candy.hesge.ch/SIBiLS) are freely accessible via REST APIs and are ready to empower any curation workflow, built on modern technologies scalable with big data: MongoDB and Elasticsearch. They cover MEDLINE and PubMed Central Open Access enriched by nearly 2 billion of mapped biomedical entities, and are daily updated.Entities:
Year: 2020 PMID: 32379317 PMCID: PMC7319474 DOI: 10.1093/nar/gkaa328
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Evaluation of SIBiLS for curation support of the neXtProt database by the SIB Calipho group
| Identified articles | |||
|---|---|---|---|
| Curation axis | Accepted by both | Accepted by one | Rejected |
| Biological processes | 162 ( | 39 ( | 41 ( |
| Diseases | 152 ( | 48 ( | 42 ( |
| Identified concepts | |||
| Accepted for curation | Modified for curation | Rejected for curation | |
| Biological processes | 699 ( | 413 ( | 2061 ( |
| Diseases | 1094 ( | 146 ( | 3727 ( |
Evaluation of PubMed and SIBiLS for Information Retrieval with the TREC 2019 Precision Medicine benchmark
| Search engine | Relevant retrieved | P20 | R100 | R-Prec | MAP |
|---|---|---|---|---|---|
| PubMed | 1437 | 0.23 | 0.18 | 0.14 | 0.10 |
| PubMed (relevance sort) | 1624 | 0.33 | 0.21 | 0.18 | 0.15 |
| SIBiLS | 3212 | 0.47 | 0.29 | 0.27 | 0.22 |
| SIBiLS (normalized queries) | 3468 | 0.50 | 0.31 | 0.30 | 0.25 |
|
|
|
|
|
|
|