| Literature DB >> 35854724 |
David Kauchak1, Jorge Apricio1, Gondy Leroy2.
Abstract
Text continues to be an important medium for communicating health-related information. We have built a text simplification tool that gives concrete suggestions on how to simplify health and medical texts. An important component of the tool identifies difficult words and suggests simpler synonyms based on pre-existing resources (WordNet and UMLS). These candidate substitutions are not always appropriate in all contexts. In this paper, we introduce a filtering algorithm that utilizes semantic similarity based on word embeddings to determine if the candidate substitution is appropriate in the context of the text. We provide an analysis of our approach on a new dataset of 788 labeled substitution examples. The filtering algorithm is particularly helpful at removing obvious examples and can improve the precision by 3% at a recall level of 95%. ©2022 AMIA - All rights reserved.Entities:
Mesh:
Year: 2022 PMID: 35854724 PMCID: PMC9285171
Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN: 1559-4076