| Literature DB >> 32973586 |
Sandra A Just1, Erik Haegert2, Nora Kořánová2, Anna-Lena Bröcker1, Ivan Nenchev1, Jakob Funcke1, Andreas Heinz1, Felix Bermpohl1, Manfred Stede2, Christiane Montag1.
Abstract
BACKGROUND: Computational linguistic methodology allows quantification of speech abnormalities in non-affective psychosis. For this patient group, incoherent speech has long been described as a symptom of formal thought disorder. Our study is an interdisciplinary attempt at developing a model of incoherence in non-affective psychosis, informed by computational linguistic methodology as well as psychiatric research, which both conceptualize incoherence as associative loosening. The primary aim of this pilot study was methodological: to validate the model against clinical data and reduce bias in automated coherence analysis.Entities:
Keywords: automated analysis; coherence; psychosis; schizophrenia; speech; thought disorder
Year: 2020 PMID: 32973586 PMCID: PMC7466436 DOI: 10.3389/fpsyt.2020.00846
Source DB: PubMed Journal: Front Psychiatry ISSN: 1664-0640 Impact factor: 4.157
Characteristics of the sample.
| NAP with positive FTD ( | NAP without positive FTD ( | HC ( | Statistics | ||
|---|---|---|---|---|---|
| Age (years) | 45.7 (11.91) † | 41.9 (10.87) | 43.9 (13.29) | .61 | |
| Sex (male) | .04 | ||||
| Verbal IQ | 103.6 (14.86) | 106.1 (12.61) | 103.25 (7.62) | Welch’s | .69 |
| Inpatients | .004 | ||||
| F20.0 | .68 | ||||
| F25.0 | .68 | ||||
| Antipsychotic medication | .15 | ||||
| CGI | 5.2 (1.36) | 3.65 (1.31) | .001 | ||
| Duration of illness (years) | 17.25 (12.03) | 14.35 (9.91) | .41 | ||
| SAPS | |||||
| positive FTD | 2.8 (.7) | .35 (.59) | <.001 | ||
| Incoherence | 1.4 (1.55) | .05 (.22) | <.001 | ||
| Tangentiality | 2.6 (.82) | .05 (.22) | <.001 | ||
| Derailment | 2.25 (1.29) | .15 (.67) | <.001 | ||
| Illogicality | 1.5 (1.54) | .05 (.22) | <.001 | ||
| Circumstantiality | 1.7 (1.66) | .55 (.89) | .009 | ||
| Pressured speech | 2.15 (1.6) | .4 (.88) | <.001 | ||
| Distractibility | 1.25 (1.29) | .1 (.45) | .001 | ||
| Clanging | .6 (1.0) | 0 | .01 | ||
| Hallucinations | 1.6 (1.67) | 1.0 (1.49) | .24 | ||
| Delusions | 2.6 (1.23) | .7 (1.03) | <.001 | ||
| Bizarre Behavior | 1.3 (1.26) | .05 (.22) | <.001 | ||
| Inappropriate Affect | .85 (1.23) | .05 (.22) | .009 | ||
| SANS | – | ||||
| Flat Affect | 1.55 (1.54) | 1.3 (1.13) | .56 | ||
| Alogia | 1.0 (1.38) | .85 (1.09) | .7 | ||
| Avolition/apathy | 1.95 (1.47) | 1.55 (1.47) | .39 | ||
| Anhedonia/asociality | 2.6 (1.31) | 1.95 (1.47) | .15 | ||
| Attention | .95 (1.13) | .15 (.67) | .012 |
†Mean (SD); group comparisons between healthy controls (HC) and patients with non-affective psychoses (NAP) with and without formal thought disorder (FTD): aANOVA; bχ²-test; cWelch’s ANOVA; group comparisons between patients with and without FTD: dχ²-test; et-test for independent samples. CGI: Clinical Global Impression; SANS, Scale for the Assessment of Negative Symptoms; SAPS, Scale for the Assessment of Positive Symptoms.
Data set.
| Word count | Total ( | NAP with positive FTD ( | NAP without positive FTD ( | HC ( | ||
|---|---|---|---|---|---|---|
| Raw data | 46,375772.92 (493.94)† | 18,011900.55 (542.81) | 10,788539.4 (360.12) | 17,576878.8 (496.92) | 3.67 | .03 |
| GloVe | 42,757712.62 (462.57) | 16,624831.2 (504.34) | 9,772488.6 (331.7) | 16,361818.05 (469.81) | 3.86 | .03 |
†Mean (SD); aANOVA; group comparisons of healthy controls (HC) and patients with non-affective psychoses (NAP) with and without formal thought disorder (FTD); GloVe data set: transcripts cleaned of sentences only containing stop words, fillers, unknown words.
Figure 1Steps of coherence analysis. Raw data, represented here by three sentences from a transcript (A), is preprocessed by filtering sentences only containing stop words and verbal fillers (B). The meaning of each word is represented as a vector in a semantic space by the GloVe model (C). TF-IDF allows scaling vectors according to their respective semantic contribution (D). Sentence embeddings are calculated as the mean vector of its word embeddings (E). Cosine similarity between adjacent sentences is taken as a measure for level of coherence between them (F).
Coherence markers, bias and syntactic features: z-standardized independent variables.
| NAP with positive FTD | NAP without positive FTD | HC | Statistics | ||
|---|---|---|---|---|---|
| ACM | −.41 (.89) † | .2 (1.2) | .22 (.77) | .07 | |
| Repetitions | −.32 (.89) | .52 (1.11) | −.2 (.8) | .01 | |
| Ambiguous referents | .56 (1.51) | −.2 (.51) | −.36 (.25) | Welch’s | .03 |
| Neologisms | .59 (1.38) | −.1 (.72) | −.49 (.23) | Welch’s | .002 |
| LIWC | |||||
| Differentiation | −.42 (.95) | .02 (1.08) | .41 (.82) | .03 | |
| Common adverbs | −.09 (1.05) | −.18 (.94) | .27 (1.0) | .33 | |
| Conjunctions | −.33 (1.05) | −.01 (1.12) | .34 (.73) | .11 | |
| Causation | −.05 (.85) | .19 (1.28) | −.15 (.82) | .55 | |
| Focus past | .13 (.98) | .24 (.94) | −.37 (1.01) | .12 | |
| Focus present | −.22 (.98) | .13 (1.03) | .1 (1.0) | .48 | |
| Focus future | .04 (.8) | −.07 (1.32) | .04 (.84) | Welch’s | .95 |
†Mean (SD); aANOVA; bWelch’s ANOVA; group comparisons of healthy controls (HC) and patients with non-affective psychoses (NAP) with and without formal thought disorder (FTD); ACM, automatically derived coherence metrics; LIWC, Linguistic Inquiry Word Count.
Figure 2Linear regression between z-standardized values of automatically derived coherence metrics and clinical SAPS ratings of positive FTD. Trend line b depicted with 95% confidence bands.
Prediction of group membership: results of three multinomial logistic regression analysis models.
| NAP patients without positive FTD | NAP patients with positive FTD | |||||||
|---|---|---|---|---|---|---|---|---|
| SE | OR [95% CI] | SE | OR [95% CI] | |||||
| Model 1 | ||||||||
| Constant | .004 | .323 | .990 | −.089 | .339 | .792 | ||
| ACM | −.020 | .318 | .950 | .980 [.526–1.828] | −.769 | .382 | .044 | .464 [.219–.979] |
| Model 2 | ||||||||
| Constant | −.167 | .369 | .651 | −.148 | .370 | .690 | ||
| ACM | .031 | .353 | .930 | 1.032 [.517–2.058] | −.946 | .434 | .029 | .388 [.166–.909] |
| Repetitions | .938 | .418 | .025 | 2.555 [1.126–5.799] | −.227 | .418 | .588 | .797 [.351–1.809] |
| ACM*Rep | −.520 | .425 | .221 | .594 [.258–1.367] | −.406 | .417 | .330 | .666 [.294–1.508] |
| Model 3 | ||||||||
| Constant | 1.124 | .763 | .141 | .918 | .773 | .235 | ||
| ACM | .230 | .378 | .543 | 1.259 [.599–2.642] | −.235 | .522 | .653 | .791 [.284–2.200] |
| Repetitions | .852 | .422 | .044 | 2.345 [1.025–5.367] | −.493 | .530 | .353 | .611 [.216–1.728] |
| Neologisms | 2.152 | 1.22 | .078 | 8.603 [.787–94.053] | 2.658 | 1.25 | .033 | 14.27 1.239–164.27] |
| Ambig. ref. | 1.318 | 1.01 | .192 | 3.737 [.516–27.069] | 2.485 | 1.04 | .017 | 12.00 [1.560–92.238] |
| Differentiat. | −.282 | .398 | .479 | .754 [.346–1.644] | −1.14 | .514 | .026 | .319 [.116–.872] |
The reference category is healthy controls.
ACM, automatically derived coherence metrics; FTD, formal thought disorder; NAP, non-affective psychosis.
Model evaluation for the multinomial logistic regression analysis.
| Model fitting criteria | Likelihood ratio tests | |||
|---|---|---|---|---|
| AIC | Nagelkerke | |||
| Model 1 | 134.025 | .104 | 5.808 | .055 |
| Model 2 | 130.936 | .276 | 16.897 | .010 |
| Model 3 | 112.867 | .575 | 42.966 | .000 |
AIC, Akaike Information Criteria.