| Literature DB >> 35663117 |
Gondy Leroy1, David Kauchak2, Diane Haeger3, Douglas Spegman3.
Abstract
Objective: Simplifying healthcare text to improve understanding is difficult but critical to improve health literacy. Unfortunately, few tools exist that have been shown objectively to improve text and understanding. We developed an online editor that integrates simplification algorithms that suggest concrete simplifications, all of which have been shown individually to affect text difficulty. Materials andEntities:
Keywords: health literacy; metrics; text difficulty; text simplification; user study
Year: 2022 PMID: 35663117 PMCID: PMC9155254 DOI: 10.1093/jamiaopen/ooac044
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.Screenshot of the editor for word level suggestions (by clicking “Simplify Text”). “Subsequent” has been clicked and potential substitutions are shown as a dropdown menu.
Figure 2.Screenshot of the editor for a sentence-level suggestion (“months or years”). The guidance with a new example, taken from a general corpus, is shown at the bottom in the colored box.
Figure 3.The lexical chains tab of the editor with the same text used in the previous 2 figures.
Text characteristics
| Long | Short | |||||||
|---|---|---|---|---|---|---|---|---|
| Asthma | Liver cirrhosis | Pemphigus | Polycythemia vera | |||||
| Original | Simplified | Original | Simplified | Original | Simplified | Original | Simplified | |
| Word count | 623 | 626 (0.5) | 481 | 517 (7.5) | 198 | 177 (−10.6) | 199 | 199 (0) |
| Sentences | 31 | 34 (9.7) | 25 | 26 (4.0) | 15 | 14 (−6.7) | 11 | 9 (−18.2) |
| Number of (exact) lexical chains | 33 | 37 (12.1) | 26 | 26 (0) | 12 | 13 (8.3) | 12 | 11 (−8.3) |
| Verb count | 93 | 103 (10.8) | 62 | 80 (29.0) | 24 | 22 (−8.3) | 22 | 25 (13.6) |
| Noun count | 212 | 202 (−4.7) | 180 | 184 (2.2) | 67 | 64 (−4.5) | 78 | 76 (−2.6) |
| Avg word frequency | 280 834 273 | 327 044 769 (16.5) | 313 157 350 | 347 837 287 (11.1) | 325 116 425 | 393 023 079 (20.9) | 276 426 270 | 286 907 512 (3.8) |
| Flesch-Kincaid | 13.9 | 11.5 (−17.3) | 14.5 | 12.7 (−12.4) | 12.3 | 14.2 (15.4) | 12.5 | 11.5 (−8.0) |
Participant demographic information
| Variable | Choice | Count (%) |
|---|---|---|
| Sex | ||
| Female | 39 (87) | |
| Male | 6 (13) | |
| Race (multiple options possible) | ||
| American Indian or Alaska Native | 5 (11) | |
| Asian | 1 (2) | |
| Black or African American | 1 (2) | |
| Native Hawaiian or Other Pacific Islander | 1 (2) | |
| White | 40 (89) | |
| Ethnicity | ||
| Hispanic or Latino | 19 (42) | |
| Not Hispanic or Latino | 26 (58) | |
| Education level | ||
| Less than high school degree | 1 (2) | |
| High school diploma | 11 (24) | |
| Associate degree | 7 (16) | |
| Bachelor’s degree | 13 (29) | |
| Master’s degree | 11 (24) | |
| Doctoral degree (PhD, MD, …) | 2 (4) | |
| Age | ||
| Younger than 30 years old | 10 (22) | |
| 31–40 years old | 8(18) | |
| 41–50 years old | 9(20) | |
| 51–60 years old | 7 (16) | |
| 61–70 years old | 9 (20) | |
| 71 years old or better | 2 (2) | |
| Language spoken at home | ||
| Never English | 0 (0) | |
| Rarely English | 1 (2) | |
| Half English | 3 (7) | |
| Mostly English | 14(31) | |
| Only English | 27 (60) |
ANOVA results for perceived difficulty (significant differences are in bold)
| Metric | Condition | ||
|---|---|---|---|
| Original | Simplified | ||
| Mean (SD) | Mean (SD) |
| |
| 4-point Likert Scale | 2.28 (.697) | 2.55 (.681) |
|
ANOVA results for actual difficulty (significant differences are in bold and indicated by * for scores higher with simplified text and + for scores higher with original text)
| Variable | Metric | Condition | |||
|---|---|---|---|---|---|
| Original | Simplified | ||||
| Mean (SD) | Mean (SD) |
| |||
| Questions before reading | True/false (%) | TF1 | 67 (47) | 63 (49) | 0.261/.610 |
| TF2 | 71 (46) | 71 (46) | 0.001/.982 | ||
| TF3 | 51 (50) | 53 (50) | 0.061/.805 | ||
| TF4 | 76 (43) | 72 (45) | 0.257/.613 | ||
| TF5* |
|
|
| ||
| Questions while reading | Multiple-choice (%) | Overview question | 80 (79) | 83 (76) | 0.250/.618 |
| General question | 71 (79) | 70 (76) | 0.024/.877 | ||
| Questions after reading | True/false (%) | TF1 | 66 (48) | 62 (49) | 0.263/609 |
| TF2 | 93 (27) | 88 (32) | 0.793/.375 | ||
| TF3 | 66 (48) | 72 (45) | 0.770/.382 | ||
| TF4 | 93 (27) | 92 (27) | 0.005/.945 | ||
| TF5* |
|
|
| ||
| Multiple-choice (%) | MC1* |
|
|
| |
| MC2 | 68 (47) | 75 (44) | 0.835/.362 | ||
| MC3* |
|
|
| ||
| MC4 | 33 (47) | 29 (46) | 0.282/.596 | ||
| Automated/semantic evaluation | Unique word count ( | 25 (14) | 23 (12) | 0.891.347 | |
| Proportion of words similar to text (%)* |
|
|
| ||
| Proportion of word matching to text (%)* |
|
|
| ||
| Overall cosine similarity* |
|
|
| ||
| Manual/expert evaluation | Correct facts count ( | 7.5 (4.4) | 8.1 (6.6) | 0.357/.551 | |
| Main point made (%) | 67 (69) | 67 (47) | 0.016/.899 | ||
| Completeness (score) | 2.10 (.89) | 2.17 (.84) | 0.145/.704 | ||
| Correctness (score) | 3.18 (.86) | 3.21 (.71) | 0.058/.810 | ||
| Automated/machine learning metric | Rouge recall—longest phrase | 0.0930766 (0.06349999) | 0.1078276 (0.06988539) | 1.894/.171 | |
| Rouge precision—longest phrase | 0.4873848 (0.21197400) | 0.4357437 (0.12496557) | 3.380/.068 | ||
| Rouge F-measure—longest phrase | 0.1514124 (0.09538131) | 0.1637058 (0.09026941) | 0.678.412 | ||
| Rouge recall—unigram* |
|
|
| ||
| Rouge precision—unigram | 0.6893254 (0.15174880) | 0.6535888 (0.13297372) | 2.424/.122 | ||
| Rouge F-measure—unigram* |
|
|
| ||
| Rouge recall—bigram | 0.0368748 (0.03753065) | 0.0437070 (0.03943795) | 1.221/.271 | ||
| Rouge precision—bigram+ |
|
|
| ||
| Rouge F-measure—bigram | 0.0639861 (0.06378179) | 0.0699317 (0.05622001) | 0.378/.540 | ||
Figure 4.Mean accuracy answering questions by education and language level.
Figure 5.Mean accuracy answering questions per text.