Literature DB >> 31258958

Algorithmic Generation of Grammar Simplification Rules Using Large Corpora.

Andras Szep1, Marton Szep1, Gondy Leroy1, David Kauchak2, Nick Kloehn1, Debra Revere3, Melissa Just4.   

Abstract

There is often a discontinuity between patients' literacy level and educational materials. In response, we are developing an online medical text simplification editor. In this paper, we describe generating grammar simplification rules from a large parallel corpus (N=141,500) containing original sentences and their simplified variants. We algorithmically identified grammatical transformations between sentences (N=26,600) and used distributional characteristics in two corpora to select transformations with the broadest application and the least ambiguity. This resulted in a top set of 146 rules. Two experts evaluated 20 representative rules reflecting 4 characteristics (long/short and weak/strong) each with 5 example sentences. Generally, we found that the rules are helpful for guiding simplification. Using a 5-point Likert scale (5=best), stronger rules scored higher for ease of applying (4.11), overall helpfulness (4.40) and usefulness of examples (4.05). Rule length did not affect the expert scores. The grammar simplification rules are being integrated in our text editor.

Entities:  

Year:  2019        PMID: 31258958      PMCID: PMC6568077     

Source DB:  PubMed          Journal:  AMIA Jt Summits Transl Sci Proc


  1 in total

1.  Improving the Quality of Suggestions for Medical Text Simplification Tools.

Authors:  David Kauchak; Jorge Apricio; Gondy Leroy
Journal:  AMIA Annu Symp Proc       Date:  2022-05-23
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.