| Literature DB >> 32436849 |
Biyang Yu1, Zhe He1, Aiwen Xing2, Mia Liza A Lustria1.
Abstract
BACKGROUND: The language gap between health consumers and health professionals has been long recognized as the main hindrance to effective health information comprehension. Although providing health information access in consumer health language (CHL) is widely accepted as the solution to the problem, health consumers are found to have varying health language preferences and proficiencies. To simplify health documents for heterogeneous consumer groups, it is important to quantify how CHLs are different in terms of complexity among various consumer groups.Entities:
Keywords: consumer health informatics; digital divide; health literacy; readability
Mesh:
Year: 2020 PMID: 32436849 PMCID: PMC7273233 DOI: 10.2196/16795
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Existing metrics for assessing health text complexity.
| Health readability measure | Measure specification | Inclusion | Inclusion or exclusion rationale | |
|
| ||||
|
| Word length or syllable length [ | Average number of characters (eg, syllables) in a given lexical item | No | Already measured in traditional readability metrics |
|
| Sentence length [ | Average number of words in a sentence | No | Already measured in traditional readability metrics |
|
| Paragraph length [ | Average number of sentences in a paragraph | No | Not applicable for CHLa complexity measure |
|
| Traditional readability metrics [ | Flesch-Kincaid grade level, Simple Measure of Gobbledygook, and Gunning fog | Yes | (1) Well-established formulas that are widely utilized in the literature; (2) Combining word, syllable and sentence length; and (3) Flesch-Kincaid grade and Simple Measure of Gobbledygook are the most used readability metrics |
|
| ||||
|
| Ratio of content word [ | Ratio of content words (ie, noun, adjective, verb, and adverb) to functional words (ie, pronoun, determiner, preposition, qualifier, conjunction, interjection) | Yes | Indicator for syntax-level complexity measure; validated in previous literature |
|
| Ratio of nouns [ | Ratio of nouns to all types of parts of speech | Yes | Indicator for syntax-level complexity measure; validated in previous literature |
|
| ||||
|
| Average familiarity score of CHVb [ | Frequency use of each CHV term to the lay people | Yes | Indicator to tell how lay health terms are used in CHL |
|
| Coverage in CHV [ | Ratio of CHV terms of all terms | No | We used the ratio of professional health terms |
|
| Coverage in basic medical dictionary [ | Health terms that are in basic medical dictionaries | No | Not applicable for CHL complexity measure |
|
| Coverage in the Unified Medical Language System [ | Ratio of Unified Medical Language System terms | Yes | We utilized the Systematized Nomenclature of Medicine-Clinical Terms as the source of professional health terms |
|
| Term overlap ratio [ | A higher overlap indicates a more cohesive and easier to read text; overlapped terms/all terms in the document | No | Not applicable for CHL complexity measure |
|
| Vocabulary size [ | Distinct word counts in the corpus | No | Not applicable for CHL complexity measure |
|
| ||||
|
| ||||
|
| Diversity of health topics [ | Ratio of semantic types indicated in the Unified Medical Language System | Yes | Indicator for semantic-level complexity measure; validated in previous literature |
aCHL: consumer health language.
bCHV: consumer health vocabulary.
Figure 1Consumer health language complexity measurement framework (CHELC).
Basic textual characteristics of the 3 health corpora.
| Basic textual characters | Health corpora | ||
|
| AllDeaf (deaf and hearing-impaired people), n | Wrong Planet (people with ASD), n | Yahoo! Answers (general public), n |
| Number of posts | 27,545 | 26,484 | 12,560 |
| Number of threads | 1623 | 2751 | 3756 |
| Number of involved users | 788 | 2978 | 9544 |
| Average number of sentences per post per user | 9.21 | 9.15 | 9.63 |
| Average number of words per sentence per user | 12.14 | 13.99 | 13.09 |
| Average number of syllables per word per user | 1.37 | 1.41 | 1.35 |
| Average number of letters per word per user | 4.14 | 4.23 | 4.11 |
| Distinct health terms per user | 199.87 | 91.63 | 39.09 |
| Mentioned semantics number | 71 | 71 | 72 |
Figure 2Text-level complexity comparison for users in the 3 health corpora. ASD: autism spectrum disorder.
Figure 3Syntax-level complexity comparison for users in the 3 health corpora. ASD: autism spectrum disorder.
Figure 4Term-level complexity comparison for users in the 3 health corpora. ASD: autism spectrum disorder.
Figure 5Semantic-level complexity comparison for users in the 3 health corpora. ASD: autism spectrum disorder.
Figure 6Overall complexity comparison for users in the 3 health corpora. ASD: autism spectrum disorder.