| Literature DB >> 30794616 |
Renu Balyan1, Scott A Crossley2, William Brown3, Andrew J Karter4, Danielle S McNamara5, Jennifer Y Liu4, Courtney R Lyles3,4,6, Dean Schillinger3,4,6.
Abstract
Limited health literacy is a barrier to optimal healthcare delivery and outcomes. Current measures requiring patients to self-report limitations are time-consuming and may be considered intrusive by some. This makes widespread classification of patient health literacy challenging. The objective of this study was to develop and validate "literacy profiles" as automated indicators of patients' health literacy to facilitate a non-intrusive, economic and more comprehensive characterization of health literacy among a health care delivery system's membership. To this end, three literacy profiles were generated based on natural language processing (combining computational linguistics and machine learning) using a sample of 283,216 secure messages sent from 6,941 patients to their primary care physicians. All patients were participants in Kaiser Permanente Northern California's DISTANCE Study. Performance of the three literacy profiles were compared against a gold standard of patient self-reported health literacy. Associations were analyzed between each literacy profile and patient demographics, health outcomes and healthcare utilization. T-tests were used for numeric data such as A1C, Charlson comorbidity index and healthcare utilization rates, and chi-square tests for categorical data such as sex, race, poor adherence and severe hypoglycemia. Literacy profiles varied in their test characteristics, with C-statistics ranging from 0.61-0.74. Relations between literacy profiles and health outcomes revealed patterns consistent with previous health literacy research: patients identified via literacy profiles indicative of limited health literacy: (a) were older and more likely of minority status; (b) had poorer medication adherence and glycemic control; and (c) exhibited higher rates of hypoglycemia, comorbidities and healthcare utilization. This represents the first successful attempt to employ natural language processing to estimate health literacy. Literacy profiles can offer an automated and economical way to identify patients with limited health literacy and greater vulnerability to poor health outcomes.Entities:
Mesh:
Year: 2019 PMID: 30794616 PMCID: PMC6386302 DOI: 10.1371/journal.pone.0212488
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Selected NLP indices and relation to health literacy (HL) scores.
| Linguistic Index | Description | Relation to Health Literacy (HL) |
|---|---|---|
| Concreteness | The degree to which a word is concrete or imageable vs. abstract (e.g., table vs. love) | Less concrete words in high HL patient writing |
| Lexical diversity | Lexical diversity refers to the variety of words used in a text. It is usually measured using type–token ratios (TTR), which is related to text length | More lexical diversity (i.e., more diverse words) in high HL patient writing |
| Present tense | Incidence of present tense | Less use of present tense in high HL patient writing |
| Determiners | Incidence of determiners (e.g., | More determiners in high HL patient writing |
| Adjectives | Incidence of adjectives | More adjectives in high HL patient writing |
| Function words | Incidence of function words such as prepositions, pronouns etc. | More function words in high HL patient writing |
Classification metric statistics of models for different self-reported literacy profiles (Positive class: Adequate HL).
| ML | Literacy Profile | Accuracy | C-statistic | Sensitivity | Specificity | Positive Predictive Value (PPV) | Negative Predictive Value (NPV) | # of Predicted limited vs adequate HL |
|---|---|---|---|---|---|---|---|---|
| LDA | HLCOMB | 60.55 | 0.63 | 56.10 | 64.42 | 57.83 | 62.78 | 1142 / 939 |
| LDA | HLSUMTri | 0.61 | 39.32 | 55.23 | 1498 / 583 | |||
| SVM | HLAVG | 62.52 | 47.11 | 61.79 | 725 / 1356 |
* The numbers are a function of sample size for test set only
Demographics (Sex %, Race % and Age–Mean (SD)).
| ML | Literacy Profile (Dependent | Sex—Men % | Race–White % | Age at Survey–Mean (SD) | P-value | ||||
|---|---|---|---|---|---|---|---|---|---|
| Limited HL | Adequate HL | P-value | Limited HL | Adequate HL | Limited HL | Adequate HL | |||
| LDA | HLCOMB | 54.9 | 53.7 | 0.32 | 25.5 | 40.0 | 57.91 (10.0) | 55.53 (9.66) | < 0.001 |
| LDA | HLSUMTri | 55.8 | 53.6 | 0.08 | 29.2 | 40.1 | 57.34 (10.0) | 55.43 (9.50) | < 0.001 |
| SVM | HLAVG | 53.6 | 56.2 | 0.06 | 23.9 | 36.5 | 58.88 (9.98) | 55.74 (9.74) | < 0.001 |
Poor adherence and hypoglycemia (%).
| ML | Literacy Profile | Poor medication adherence (%) | Severe Hypoglycemia (%) | ||||
|---|---|---|---|---|---|---|---|
| Limited HL | Adequate HL | P-value | Limited HL | Adequate HL | P-value | ||
| LDA | HLCOMB | 24.9 | 23.3 | 0.143 | 4.0 | 2.0 | < 0.001 |
| LDA | HLSUMTri | 24.5 | 23.2 | 0.296 | 3.5 | 2.1 | < 0.001 |
| SVM | HLAVG | 25.6 | 23.4 | 0.047 | 5.1 | 2.0 | < 0.001 |
A1c and Charlson index—Mean (SD).
| ML | Literacy Profile (Dependent | A1c | Charlson Index | ||||
|---|---|---|---|---|---|---|---|
| Limited HL | Adequate HL | P-value | Limited HL | Adequate HL | P-value | ||
| LDA | HLCOMB | 7.51 (1.56) | 7.48 (1.50) | 0.371 | 2.44 (1.78) | 1.99 (1.39) | < 0.001 |
| LDA | HLSUMTri | 7.50 (1.54) | 7.49 (1.52) | 0.786 | 2.34 (1.71) | 1.94 (1.34) | < 0.001 |
| SVM | HLAVG | 7.55 (1.57) | 7.47 (1.51) | 0.038 | 2.65 (1.91) | 2.02 (1.41) | < 0.001 |
Healthcare service utilization (outpatient clinic visit, emergency room encounter and hospitalization–Mean (SD)).
| ML | Literacy Profile (Dependent | Outpatient clinic visit | ED visits | Hospitalization | P-value | |||
|---|---|---|---|---|---|---|---|---|
| Limited HL | Adequate HL | Limited HL | Adequate HL | Limited HL | Adequate HL | |||
| LDA | HLCOMB | 10.02 (10.4) | 8.76 (8.76) | 0.46 (1.07) | 0.30 (0.75) | 0.21 (0.68) | 0.13 (0.51) | < 0.001 |
| LDA | HLSUMTri | 9.69 (10.0) | 8.79 (8.81) | 0.42 (1.00) | 0.31 (0.75) | 0.19 (0.63) | 0.14 (0.56) | < 0.001 |
| SVM | HLAVG | 10.29 (10.7) | 9.01 (9.16) | 0.53 (1.20) | 0.31 (0.76) | 0.25 (0.73) | 0.13 (0.54) | < 0.001 |