
Semantic persistence of ambiguous biomedical names in the citation network.

Raul Rodriguez-Esteban

Abstract

MOTIVATION: Name ambiguity has long been a central problem in biomedical text mining. To tackle it, it has usually been assumed that a name has only one meaning within a given text. It is not known whether this assumption holds beyond the scope of single documents.
RESULTS: Using a new method that leverages large numbers of biomedical annotations and normalized citations, this study shows that ambiguous biomedical names mentioned in scientific articles tend to present the same meaning in articles that cite them or that they cite, and, to a lesser extent, two steps away in the citation network. Citations, therefore, can be regarded as semantic connections between articles and the citation network should be considered for tasks such as automatic name disambiguation, entity linking and biomedical database annotation. A simple experiment shows the applicability of these findings to name disambiguation.
AVAILABILITY AND IMPLEMENTATION: The code used for this analysis is available at: https://github.com/raroes/one-sense-per-citation-network.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
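
As an illustration of how the finding above might be applied to name disambiguation, the following is a minimal sketch, not the author's method (the actual code is in the repository linked above). It assumes a toy citation list and a small set of already-disambiguated mentions, and picks the sense of an ambiguous name by majority vote among an article's direct citation neighbors. The article identifiers, the example name "CAT", and the function names `build_neighbors` and `vote_sense` are all hypothetical.

```python
from collections import Counter, defaultdict


def build_neighbors(citation_pairs):
    """Turn (citing, cited) pairs into an undirected neighbor map."""
    neighbors = defaultdict(set)
    for citing, cited in citation_pairs:
        # An article's neighbors are both the articles it cites
        # and the articles that cite it.
        neighbors[citing].add(cited)
        neighbors[cited].add(citing)
    return neighbors


def vote_sense(article, name, neighbors, known_senses):
    """Return the most frequent sense of `name` among direct neighbors.

    Returns None when no neighbor has a known sense for `name`.
    """
    votes = Counter(
        known_senses[(n, name)]
        for n in sorted(neighbors.get(article, ()))
        if (n, name) in known_senses
    )
    if not votes:
        return None
    return votes.most_common(1)[0][0]


# Toy data (hypothetical): art1 cites art2 and art3, and is cited by art4.
citation_pairs = [("art1", "art2"), ("art1", "art3"), ("art4", "art1")]

# Senses already assigned to the ambiguous name "CAT" in neighboring articles.
known_senses = {
    ("art2", "CAT"): "catalase",
    ("art3", "CAT"): "catalase",
    ("art4", "CAT"): "chloramphenicol acetyltransferase",
}

neighbors = build_neighbors(citation_pairs)
predicted = vote_sense("art1", "CAT", neighbors, known_senses)
print(predicted)
```

The sketch only looks one step away in the citation network, where the abstract reports the stronger effect; extending the vote to two steps would presumably call for a lower weight on the more distant articles.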

Year:  2020        PMID: 31830249     DOI: 10.1093/bioinformatics/btz923

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  2 in total

1.  Biomedical articles share annotations with their citation neighbors.

Authors:  Raul Rodriguez-Esteban
Journal:  BMC Bioinformatics       Date:  2021-02-26       Impact factor: 3.169

2.  The speed of information propagation in the scientific network distorts biomedical research.

Authors:  Raul Rodriguez-Esteban
Journal:  PeerJ       Date:  2022-01-10       Impact factor: 2.984

