| Literature DB >> 32966630 |
Dean Schillinger1,2,3, Renu Balyan4, Scott A Crossley5, Danielle S McNamara6, Jennifer Y Liu2, Andrew J Karter1,2.
Abstract
OBJECTIVE: To develop novel, scalable, and valid literacy profiles for identifying limited health literacy patients by harnessing natural language processing. DATA SOURCE: With respect to the linguistic content, we analyzed 283 216 secure messages sent by 6941 diabetes patients to physicians within an integrated system's electronic portal. Sociodemographic, clinical, and utilization data were obtained via questionnaire and electronic health records. STUDYEntities:
Keywords: communication; diabetes; health literacy; machine learning; managed care; natural language processing; secure messaging
Mesh:
Year: 2020 PMID: 32966630 PMCID: PMC7839650 DOI: 10.1111/1475-6773.13560
Source DB: PubMed Journal: Health Serv Res ISSN: 0017-9124 Impact factor: 3.734
FIGURE 1Patient and secure messages inclusion/exclusion flowchart*. *MRN#: Patient ID; msg_date: Date of message sent; Svy: survey; SM#: number of secure messages; LP: literacy profile; PCP_ID: primary care provider ID; proxy_pct: % of proxy messages; TOFROM_PAT_C: SM sent by the patient
Linguistic indices used in five literacy profiles
| Literacy profile | Linguistic indices | Description |
|---|---|---|
| LP_FK | Readability | The length of words (ie, number of letters or syllables) and length of sentences (ie, number of words) |
| LP_LD | Lexical Diversity | The variety of words used in a text based on D |
| LP_WQ | Word Frequency | Frequency of word in a reference corpus |
| Syntactic Complexity | Number of words before the main verb in a sentence | |
| Lexical Diversity | The variety of words used in a text based on MTLD | |
| LP_SR | Concreteness | The degree to which a word is concrete |
| Lexical diversity | The variety of words used in a text based on two measures of lexical diversity: MTLD, and D | |
| Present tense | Incidence of present tense | |
| Determiners | Incidence of determiners (eg, | |
| Adjectives | Incidence of adjectives | |
| Function words | Incidence of function words such as prepositions, pronouns etc | |
| LP_Exp | Age of Exposure | The estimated age at which a word first appears in a child's vocabulary |
| Lexical decision response time | The time it takes for a human to judge a string of characters as a word | |
| Attested lemmas | Number of attested lemmas used per verb argument construction | |
| Determiner per nominal phrase | Number of determiners in each noun phrase | |
| Dependents per nominal subject | Number of structural dependents for each subject in a noun phrase | |
| Number of associations | Number of words strongly associated with a single word |
Abbreviations: LP_Exp, Literacy Profile Expert‐Rated Health Literacy; LP_FK, Literacy Profile Flesch‐Kincaid; LP_LD, Literacy Profile Lexical Diversity; LP_SR, Literacy Profile Self‐Reported Health Literacy; LP_WQ, Literacy Profile Writing Quality.
We present examples of linguistic indices for LP_SR (n = 185) and LP_Exp (n = 8).
FIGURE 2ROCs and performance metrics for the literacy profiles relative to self‐reported health literacy. AUC: Area Under Curve; LP_Exp: Literacy Profile Expert‐Rated Health Literacy; LP_FK: Literacy Profile Flesch‐Kincaid; LP_LD: Literacy Profile Lexical Diversity; LP_SR: Literacy Profile Self‐Reported Health Literacy; LP_WQ: Literacy Profile Writing Quality; ML: Machine Learning; SVM: Support Vector Machine [Color figure can be viewed at wileyonlinelibrary.com]
FIGURE 3ROCs and performance metrics for the literacy profiles relative to expert‐rated literacy. AUC: Area Under Curve; LDA: Linear Discriminant Analysis; LP_Exp: Literacy Profile Expert‐Rated Health Literacy; LP_FK: Literacy Profile Flesch‐Kincaid; LP_LD: Literacy Profile Lexical Diversity; LP_SR: Literacy Profile Self‐Reported Health Literacy; LP_WQ: Literacy Profile Writing Quality; ML: Machine Learning; SVM: Support Vector Machine [Color figure can be viewed at wileyonlinelibrary.com]
Prevalence of sociodemographic characteristics by literacy profile
| Literacy profile | Education—College degree % | Race—White % | Age at Survey—Mean (SD) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Limited health literacy | Adequate health literacy |
| Limited health literacy | Adequate health literacy |
| Limited health literacy | Adequate health literacy |
| |
| LP_FK | 66.3 | 60.4 | .076 | 22.0 | 33.7 | <.001 | 56.60 (11.4) | 57.29 (9.89) | .305 |
| LP_LD | 68.9 | 59.9 | .002 | 21.4 | 32.6 | <.001 | 56.72 (9.73) | 56.74 (10.0) | .966 |
| LP_WQ | 60.4 | 57.4 | .070 | 31.7 | 34.8 | .056 | 56.71 (9.89) | 57.44 (10.2) | .032 |
| LP_SR | 72.3 | 53.7 | <.001 | 23.9 | 36.5 | <.001 | 58.88 (9.98) | 55.74 (9.74) | <.001 |
| LP_Exp | 71.2 | 57.4 | <.001 | 23.1 | 33.1 | <.001 | 56.70 (10.2) | 57.80 (10.0) | <.001 |
Abbreviations: LP_Exp, Literacy Profile Expert‐Rated Health Literacy; LP_FK, Literacy Profile Flesch‐Kincaid; LP_LD, Literacy Profile Lexical Diversity; LP_SR, Literacy Profile Self‐Reported Health Literacy; LP_WQ, Literacy Profile Writing Quality; SD, Standard Deviation.
Associations between five literacy profiles and single‐item CAHPS ratings of poor physician communication, diabetes‐related health outcomes (%), and annual health care service utilization—mean visits (SD)
| Health outcomes | Literacy profile | LP_FK | LP_LD | LP_WQ | LP_SR | LP_Exp |
|---|---|---|---|---|---|---|
| Poor Physician Communication (%) | Limited health literacy | 12.2 | 10.6 | 9.2 | 13.8 | 15.5 |
| Adequate health literacy | 8.8 | 9.6 | 10.7 | 7.3 | 11.3 | |
|
| .0919 | .5610 | .1372 | <.001 | <.001 | |
| Poor medication adherence (%) | Limited health literacy | 29.0 | 26.6 | 23.9 | 25.6 | 27.9 |
| Adequate health literacy | 22.9 | 23.8 | 25.3 | 23.4 | 22.9 | |
|
| .043 | .277 | .364 | .047 | <.001 | |
| ≥1 Severe Hypoglycemia (%) | Limited health literacy | 8.9 | 4.3 | 2.9 | 5.1 | 4.3 |
| Adequate health literacy | 3.1 | 3.3 | 3.9 | 2.0 | 3.4 | |
|
| <.001 | .318 | .119 | <.001 | .02 | |
| HbA1c ≤ 7% | Limited health literacy | 40.2 | 44.8 | 47.4 | 45.9 | 43.3 |
| Adequate health literacy | 48.8 | 48.5 | 45.2 | 47.7 | 47.4 | |
|
| .011 | .202 | .193 | .141 | <.001 | |
| HbA1c ≥ 9% | Limited health literacy | 19.1 | 13.3 | 13.8 | 14.6 | 16.4 |
| Adequate health literacy | 13.7 | 13.5 | 14.6 | 13.5 | 13.0 | |
|
| .02 | .91 | .499 | .24 | <.001 | |
| Charlson Index | Limited health literacy | 2.61 (1.84) | 2.36 (1.69) | 2.20 (1.61) | 2.65 (1.91) | 2.42 (1.79) |
| Adequate health literacy | 2.28 (1.68) | 2.31 (1.69) | 2.40 (1.72) | 2.02 (1.41) | 2.32 (1.70) | |
|
| .004 | .636 | <.001 | <.001 | .006 | |
| Outpatient clinic visits | Limited health literacy | 9.10 (7.37) | 9.01 (8.98) | 9.45 (9.75) | 10.29 (10.7) | 9.83 (10.6) |
| Adequate health literacy | 9.53 (9.33) | 9.61 (10.3) | 9.42 (9.53) | 9.01 (9.16) | 9.68 (9.57) | |
|
| .479 | .301 | .931 | <.001 | .499 | |
| ED visits | Limited health literacy | 0.48 (1.00) | 0.47 (1.15) | 0.38 (0.94) | 0.53 (1.20) | 0.47 (1.14) |
| Adequate health literacy | 0.39 (0.94) | 0.38 (0.88) | 0.43 (0.96) | 0.31 (0.76) | 0.42 (1.01) | |
|
| .170 | .102 | .085 | <.001 | .016 | |
| Hospitalization | Limited health literacy | 0.23 (0.71) | 0.21 (0.61) | 0.17 (0.60) | 0.25 (0.73) | 0.20 (0.65) |
| Adequate health literacy | 0.18 (0.62) | 0.19 (0.65) | 0.19 (0.67) | 0.13 (0.54) | 0.20 (0.67) | |
|
| .243 | .604 | .503 | <.001 | .713 |
Abbreviations: Exp, Expert‐Rated; FK, Flesch‐Kincaid; LD, Lexical Diversity; LP, Literacy Profile; SD, Standard Deviation; SR, Self‐Reported; WQ, Writing Quality.