| Literature DB >> 31258958 |
Andras Szep1, Marton Szep1, Gondy Leroy1, David Kauchak2, Nick Kloehn1, Debra Revere3, Melissa Just4.
Abstract
There is often a discontinuity between patients' literacy level and educational materials. In response, we are developing an online medical text simplification editor. In this paper, we describe generating grammar simplification rules from a large parallel corpus (N=141,500) containing original sentences and their simplified variants. We algorithmically identified grammatical transformations between sentences (N=26,600) and used distributional characteristics in two corpora to select transformations with the broadest application and the least ambiguity. This resulted in a top set of 146 rules. Two experts evaluated 20 representative rules reflecting 4 characteristics (long/short and weak/strong) each with 5 example sentences. Generally, we found that the rules are helpful for guiding simplification. Using a 5-point Likert scale (5=best), stronger rules scored higher for ease of applying (4.11), overall helpfulness (4.40) and usefulness of examples (4.05). Rule length did not affect the expert scores. The grammar simplification rules are being integrated in our text editor.Entities:
Year: 2019 PMID: 31258958 PMCID: PMC6568077
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc