Aurelie Mascio, Robert Stewart, Riley Botelle, Marcus Williams, Luwaiza Mirza, Rashmi Patel, Thomas Pollak, Richard Dobson, Angus Roberts.
Abstract
Background: Cognitive impairments are a neglected aspect of schizophrenia despite being a major factor in poor functional outcome. They are usually measured using various rating scales; however, these require trained practitioners and are rarely applied routinely in clinical settings. Recent advances in natural language processing techniques allow us to extract such information from unstructured portions of text at a large scale and in a cost-effective manner. We aimed to identify cognitive problems in the clinical records of a large sample of patients with schizophrenia, and assess their association with clinical outcomes.
Keywords: cognition; data mining; electronic health records; natural language processing; schizophrenia
Year: 2021 PMID: 34713182 PMCID: PMC8521945 DOI: 10.3389/fdgth.2021.711941
Source DB: PubMed Journal: Front Digit Health ISSN: 2673-253X
Figure 1. BioBERT fine-tuning.
Performance of the NLP-CI tool for individual domains (average over 10-fold cross validation).
| | | | | | | | |
|---|---|---|---|---|---|---|---|
| Attention | 2,800 | 93% | 94% | 83% | 96% | 97% | 96% |
| Memory | 2,800 | 96% | 97% | 96% | 97% | 98% | 98% |
| Executive Function | 2,800 | 93% | 93% | 93% | 96% | 96% | 96% |
| Emotion | 2,800 | 93% | 93% | 93% | 93% | 93% | 93% |
| Other cognition | 2,800 | 91% | 91% | 91% | 98% | 98% | 98% |
| | 14,000 | 93% | 94% | 93% | 97% | 97% | 97% |
F1, F1-score; P, precision; R, recall.
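
The figures above are averages over 10 folds. As a minimal sketch of what that averaging involves (not the authors' pipeline), the Python below partitions a set of annotated items into 10 disjoint test folds and averages a per-fold score. The names `k_fold_splits`, `cross_validated_mean` and the `evaluate` callback are hypothetical, and the contiguous, unshuffled split is an assumption made for brevity.

```python
from statistics import mean
from typing import Callable, Iterator, List, Tuple


def k_fold_splits(n_items: int, k: int = 10) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train_indices, test_indices) for each of k contiguous folds."""
    if not 2 <= k <= n_items:
        raise ValueError("k must be between 2 and n_items")
    # Spread any remainder over the first folds so sizes differ by at most 1.
    base, extra = divmod(n_items, k)
    start = 0
    for fold in range(k):
        stop = start + base + (1 if fold < extra else 0)
        test = list(range(start, stop))
        train = list(range(0, start)) + list(range(stop, n_items))
        yield train, test
        start = stop


def cross_validated_mean(
    n_items: int,
    evaluate: Callable[[List[int], List[int]], float],
    k: int = 10,
) -> float:
    """Average the score returned by `evaluate` over all k folds."""
    return mean(evaluate(train, test) for train, test in k_fold_splits(n_items, k))


if __name__ == "__main__":
    # Hypothetical stand-in for "fine-tune on train, score on test":
    # it only reports the held-out fraction so the sketch runs on its own.
    def held_out_fraction(train: List[int], test: List[int]) -> float:
        return len(test) / (len(train) + len(test))

    print(cross_validated_mean(2800, held_out_fraction))
```

With 2,800 items per domain and 10 folds, each item is held out exactly once and each test fold contains 280 items. In practice the items would normally be shuffled (and possibly stratified by label) before splitting.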
Performance of the NLP-CI tool for individual domains (separate blind set).
| | | | | |
|---|---|---|---|---|
| Attention | 160 | 97% | 98% | 96% |
| Memory | 160 | 91% | 86% | 96% |
| Executive function | 160 | 94% | 88% | 100% |
| Emotion | 160 | 97% | 100% | 94% |
| Other cognition | 160 | 92% | 86% | 100% |
F1, F1-score; P, precision; R, recall.
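
The column headers of this table did not survive extraction, but the footnote lists F1, P and R. As a quick consistency check, and assuming the first percentage column is F1 and the other two are precision and recall (in either order, since F1 is symmetric in the two), the sketch below recomputes F1 as the harmonic mean of the other two values. The function name `f1_score` and the row tuples are illustrative; the numbers are copied from the blind-set rows above.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (domain, first column, second column, third column), all in percent.
# Assumption: first column = F1; the other two = precision and recall.
BLIND_SET_ROWS = [
    ("Attention", 97, 98, 96),
    ("Memory", 91, 86, 96),
    ("Executive function", 94, 88, 100),
    ("Emotion", 97, 100, 94),
    ("Other cognition", 92, 86, 100),
]

if __name__ == "__main__":
    for domain, reported, a, b in BLIND_SET_ROWS:
        recomputed = round(100 * f1_score(a / 100, b / 100))
        print(f"{domain}: reported {reported}%, recomputed {recomputed}%")
```

Under that assumption, every recomputed value rounds to the figure reported in the first percentage column, which supports reading that column as F1.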
Cognitive impairments profiles (Sample A).
| | | |
|---|---|---|
| 0 | 45% (11,069) | – |
| 1 | 3% (825) | Executive function |
| 2 | 4% (1,006) | Memory—executive function |
| 3 | 6% (1,435) | Attention—memory—executive function |
| 4 | 9% (2,270) | Attention—memory—executive function—other |
| 5 | 33% (8,009) | Attention—memory—emotion—executive function—other |
| | 100% (24,614) | – |
Analysis of factors associated with cognitive impairments in Sz (Sample B).
| | | | | | | |
|---|---|---|---|---|---|---|
| | | | | | | |
| 20–30 | 22% (2,654) | 17,950 | 1.9 (1.9) | 0.5 (0.8) | 2.08 (0.03) | 2.94 (0.03) |
| 30–40 | 22% (2,710) | 25,177 | 1.7 (1.8) | 0.5 (0.8) | 2.08 (0.03) | 2.8 (0.03) |
| 40–50 | 23% (2,815) | 29,619 | 1.5 (1.7) | 0.5 (0.8) | 1.98 (0.03) | 3.03 (0.03) |
| 50–60 | 13% (1,592) | 22,422 | 1.4 (1.6) | 0.7 (0.9) | 1.89 (0.03) | 3.75 (0.03) |
| 60–70 | 7% (894) | 10,739 | 1.4 (1.6) | 0.8 (1) | 1.82 (0.03) | 4.76 (0.03) |
| 70+ | 7% (852) | 9,175 | 1.4 (1.6) | 1.1 (1.1) | 1.86 (0.03) | 6.07 (0.03) |
| 15–20 | 6% (717) | 1,637 | 1.8 (2.1) | 0.6 (0.8) | | |
| | | | | | | |
| Female | 42% (5,160) | 48,092 | 1.5 (1.7) | 0.6 (0.9) | 0.82 (0.01) | 0.89 (0.01) |
| Male | 58% (7,074) | 68,627 | 1.6 (1.7) | 0.6 (0.9) | | |
| | | | | | | |
| Sz only | 91% (11,078) | 104,724 | 1.5 (1.7) | 0.6 (0.8) | 0.88 (0.02) | 0.68 (0.02) |
| Sz + dementia | 9% (1,156) | 11,995 | 1.8 (1.8) | 1.1 (1.1) | | |
| | | | | | | |
| GCSE+ | 49% (6,000) | 62,361 | 1.7 (1.8) | 0.5 (0.8) | 1.3 (0.01) | 0.91 (0.01) |
| no | 51% (6,234) | 54,358 | 1.5 (1.7) | 0.7 (0.9) | | |
| | | | | | | |
| White | 62% (7,556) | 71,163 | 1.5 (1.7) | 0.6 (0.9) | 0.84 (0.01) | 1.03 (0.01)*, [1, 1.05] |
| Other | 38% (4,678) | 45,556 | 1.6 (1.7) | 0.6 (0.8) | | |
| | | | | | | |
| Married/cohabiting | 11% (1,300) | 11,114 | 1.2 (1.6) | 0.6 (0.9) | 0.72 (0.02) | 0.85 (0.02) |
| Single/separated | 89% (10,934) | 105,605 | 1.4 (1.8) | 0.6 (0.8) | | |
| | | | | | | |
| Employed | 5% (559) | 5,003 | 1.4 (1.7) | 0.4 (0.7) | 0.8 (0.03) | 0.81 (0.03) |
| Other | 95% (11,675) | 111,716 | 1.5 (1.7) | 0.7 (0.9) | | |
| | | | | | | |
| Absent (>2) | 27% (3,256) | 33,184 | 1.7 (1.7) | 1.1 (1) | 0.78 (0.01) | 0.39 (0.01) |
| Present | 73% (8,978) | 83,535 | 1.5 (1.7) | 0.4 (0.7) | | |
| | | | | | | |
| Absent (>2) | 33% (4,035) | 36,291 | 1.6 (1.7) | 0.9 (1) | 0.91 (0.01) | 0.75 (0.01) |
| Present | 67% (8,199) | 80,428 | 1.5 (1.7) | 0.5 (0.8) | | |
| | | | | | | |
| Absent (>2) | 56% (6,891) | 68,809 | 1.3 (1.6) | 0.5 (0.8) | 0.51 (0.01) | 0.66 (0.01) |
| Present | 44% (5,343) | 47,910 | 1.9 (1.8) | 0.8 (0.9) | | |
| | | | | | | |
| Absent (>2) | 77% (9,387) | 95,548 | 1.6 (1.7) | 0.6 (0.8) | 1.76 (0.02) | 0.98 (0.02), [0.95, 1.01] |
| Present | 23% (2,847) | 21,171 | 1.3 (1.6) | 0.8 (1) | | |
| | | | | | | |
| Yes | 12% (1,500) | 7,245 | 1.4 (1.6) | 0.6 (0.8) | 1.44 (0.01) | 1.14 (0.01) |
| No | 88% (10,734) | 109,474 | 1.6 (1.7) | 0.6 (0.9) | | |
*p < 0.05, **p < 0.01, ***p < 0.001.
Prevalence of different NLP-CI symptoms grouped by education.
| | | | | | | | | |
|---|---|---|---|---|---|---|---|---|
| None recorded | 43% (5,290) | 1.3 (1.6) | 0.7 (0.9) | 56% | 51% | 70% | 62% | 54% |
| GCSE | 8% (944) | 1.6 (1.7) | 0.6 (0.9) | 66% | 62% | 74% | 69% | 61% |
| A-level | 24% (2,922) | 1.7 (1.8) | 0.5 (0.8) | 66% | 63% | 71% | 67% | 60% |
| University | 25% (3,078) | 1.7 (1.8) | 0.6 (0.8) | 65% | 64% | 69% | 66% | 61% |
Association between number of cognitive impairments (estimated using NLP and HoNOS) with mental health hospital admission and duration of admission between 2007 and 2020 (Sample B).
| | | |
|---|---|---|
| | | |
| (0) Unadjusted | 0.49 (0) | 23.41 (0.16) |
| ( | 0.4 (0.01) | 18.6 (0.22) |
| ( | 0.3 (0.01) | 15.97 (0.23) |
| ( | 0.26 (0.01) | 14.43 (0.23) |
| | | |
| (0) Unadjusted | 0.22 (0) | 10.18 (0.05) |
| ( | 0.21 (0) | 9.9 (0.06) |
| ( | 0.2 (0) | 9.56 (0.06) |
| ( | 0.19 (0) | 9.3 (0.06) |
| | | |
| (0) Unadjusted | 0.33 (0) | 18.76 (0.19) |
| ( | 0.03 (0.01) | 6.07 (0.25) |
| ( | 0 (0.01), [−0.01, 0.01] | 5.14 (0.24) |
| ( | 0.04 (0.01) | 5.21 (0.25) |
*p < 0.05, **p < 0.01, ***p < 0.001.
Linear regression based on admissions within 6 months of symptom measure or mention.
β-coefficients are per one unit increase on the ordinal scale.
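
To make "per one unit increase" concrete: in a simple linear regression, β is the change in the predicted outcome when the predictor rises by one step. The sketch below fits ordinary least squares to made-up numbers (not study data), with a hypothetical 0 to 5 ordinal score as the predictor. `ols_fit` is an illustrative helper, not the authors' code, and it omits the covariate adjustment used in the adjusted models.

```python
from typing import Sequence, Tuple


def ols_fit(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Return (intercept, beta) of the least-squares line y = intercept + beta * x."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x must take at least two distinct values")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    beta = sxy / sxx
    return mean_y - beta * mean_x, beta


if __name__ == "__main__":
    # Made-up data: ordinal score (0-5) and an outcome such as days in hospital.
    score = [0, 1, 2, 3, 4, 5]
    outcome = [2.0, 5.5, 7.5, 11.0, 14.5, 16.5]
    intercept, beta = ols_fit(score, outcome)
    # beta is the predicted change in outcome for each one-step rise in score.
    step = (intercept + beta * 3) - (intercept + beta * 2)
    print(f"beta = {beta:.2f}, predicted change from score 2 to 3 = {step:.2f}")
```

Because the model is linear, the predicted difference between any two adjacent scores equals β, and a k-step difference equals k times β.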
Figure 2. Association of HoNOS and NLP scores with mental health stays.
Figure 3. Trajectories of HoNOS and NLP-based Cognitive Impairment scores (Sample B).
Estimates for the fully adjusted linear mixed model for HoNOS-CI and NLP-CI trajectories.
| | | | |
|---|---|---|---|
| | 1.32 (0.04) | 0.93 (0.03) | 0.96 (0.03) |
| Gender (female) | −0.04 (0.02), [−0.08, 0] | −0.02 (0.02), [−0.05, 0.01] | −0.05 (0.01) |
| Has dementia (no) | 0.01 (0.04), [−0.07, 0.09] | −0.05 (0.03), [−0.1, 0.01] | −0.34 (0.02) |
| Education (GCSE or above) | 0.6 (0.02) | 0.33 (0.02) | −0.13 (0.01) |
| Ethnicity (white) | −0.05 (0.02)*, [−0.09, −0.01] | −0.02 (0.02), [−0.05, 0.01] | 0.01 (0.01), [−0.02, 0.03] |
| Married / cohabiting (yes) | −0.1 (0.04)**, [−0.18, −0.03] | −0.05 (0.03), [−0.11, 0] | −0.05 (0.02)*, [−0.1, −0.01] |
| Employed (yes) | 0.11 (0.06), [−0.01, 0.22] | 0.05 (0.04), [−0.03, 0.13] | −0.14 (0.03) |
| Antipsychotic (yes) | −0.61 (0.03) | −0.35 (0.02) | −0.04 (0.01)**, [−0.06, −0.01] |
| | 0.05 (0.01) | 0.04 (0.00) | 0.06 (0.00) |
| Gender (female) | 0.01 (0.00)*, [0, 0.01] | 0.01 (0.00)**, [0, 0.01] | 0.01 (0.00)**, [0, 0.01] |
| Has dementia (no) | −0.05 (0.00) | −0.04 (0.00) | −0.05 (0.00) |
| Education (GCSE or above) | −0.02 (0.00) | −0.02 (0.00) | −0.01 (0.00) |
| Ethnicity (white) | 0.01 (0.00), [0, 0.01] | 0 (0.00), [0, 0.01] | 0 (0.00), [0, 0] |
| Married / cohabiting (yes) | −0.01 (0.00)**, [−0.02, −0.01] | −0.01 (0.00)**, [−0.01, 0] | 0 (0.00), [−0.01, 0.01] |
| Employed (yes) | −0.01 (0.01)*, [−0.03, 0] | −0.01 (0.01), [−0.02, 0] | 0 (0.01), [−0.01, 0.01] |
| Antipsychotic (yes) | 0.05 (0.00) | 0.03 (0.00) | 0.01 (0.00) |
| | | | |
| Intercept | 0.95 (0.01) | 0.37 (0.01) | 0.32 (0.01) |
| Slope | −0.07 (0.00) | −0.03 (0.00) | −0.02 (0.00) |
| Residual | 0.01 (0.00) | 0 (0.00) | 0 (0.00) |
| | | | |
| BIC | | | |
| −2LL | −384,229 | −257,483 | −130,724 |
*p < 0.05, **p < 0.01, ***p < 0.001.
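
The header cells of this table were lost in extraction, so the exact model specification cannot be read off this page. For orientation only, one common form of a linear mixed model for score trajectories, with a patient-level random intercept and random slope, is given below. All symbols are assumptions introduced here rather than the authors' notation: $y_{ij}$ is the score of patient $i$ at occasion $j$, $t_{ij}$ is time, $x_i$ is the covariate vector, and $b_{0i}$, $b_{1i}$ are the patient-specific deviations.

$$
y_{ij} = \left(\beta_0 + \gamma_0^{\top} x_i + b_{0i}\right) + \left(\beta_1 + \gamma_1^{\top} x_i + b_{1i}\right) t_{ij} + \varepsilon_{ij},
\qquad
\begin{pmatrix} b_{0i} \\ b_{1i} \end{pmatrix} \sim \mathcal{N}(0, \Sigma),
\quad
\varepsilon_{ij} \sim \mathcal{N}(0, \sigma^2)
$$

In a model of this form, $\gamma_0$ shifts the baseline level and $\gamma_1$ shifts the rate of change over time, which is one plausible reading of why each covariate appears in two blocks of rows.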
Slope estimates for the fully adjusted linear mixed model for HoNOS-CI and NLP-CI trajectories.
| | | | |
|---|---|---|---|
| 0. Unadjusted | 0 (0.00), [0, 0] | 0.01 (0.00) | 0 (0.00), [0, 0] |
| 1. Model 0 + age and gender | 0 (0.00), [0, 0] | 0.01 (0.00) | 0.01 (0.00) |
| 2. Model 1 + socio-demographics | 0.05 (0.01) | 0.05 (0.00) | 0.06 (0.00) |
| 3. Model 2 + antipsychotics | 0.05 (0.01) | 0.04 (0.00) | 0.06 (0.00) |
***p < 0.001.