| Literature DB >> 34903724 |
Tom J Spencer1,2, Philip McGuire1, Sarah E Morgan3,4,5, Kelly Diederen1, Petra E Vértes6,7, Samantha H Y Ip8, Bo Wang7,9, Bethany Thompson1, Arsime Demjaha1, Andrea De Micheli10,11, Dominic Oliver10, Maria Liakata7,12, Paolo Fusar-Poli10,11,2.
Abstract
Recent work has suggested that disorganised speech might be a powerful predictor of later psychotic illness in clinical high risk subjects. To that end, several automated measures to quantify disorganisation of transcribed speech have been proposed. However, it remains unclear which measures are most strongly associated with psychosis, how different measures are related to each other and what the best strategies are to collect speech data from participants. Here, we assessed whether twelve automated Natural Language Processing markers could differentiate transcribed speech excerpts from subjects at clinical high risk for psychosis, first episode psychosis patients and healthy control subjects (total N = 54). In-line with previous work, several measures showed significant differences between groups, including semantic coherence, speech graph connectivity and a measure of whether speech was on-topic, the latter of which outperformed the related measure of tangentiality. Most NLP measures examined were only weakly related to each other, suggesting they provide complementary information. Finally, we compared the ability of transcribed speech generated using different tasks to differentiate the groups. Speech generated from picture descriptions of the Thematic Apperception Test and a story re-telling task outperformed free speech, suggesting that choice of speech generation method may be an important consideration. Overall, quantitative speech markers represent a promising direction for future clinical applications.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34903724 PMCID: PMC8669009 DOI: 10.1038/s41398-021-01722-y
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 6.222
Sample characteristics for the three groups: healthy control subjects (CON), clinical high risk subjects (CHR-P) and first episode psychosis patients (FEP).
| CON | CHR-P | FEP | Group difference | |
|---|---|---|---|---|
| Sample size | 13 | 25 | 16 | |
| Age (years) | 26.5 ± 5.2 | 25.1 ± 4.8 | 24.5 ± 3.7 | |
| Sex (M) | 8 (61.5%) | 15 (62.5%) | 13 (81.3%) | |
| No. on antipsychotic medication | 0 | 4 | 6 | |
| Years in education | 18.4 ± 4.2 | 13.0 ± 2.8 | 13.3 ± 1.9 | |
| WRAT IQ | 115.6 ± 5.2 | 103.3 ± 11.8 | 99.8 ± 15.0 | |
| Digit span | 20.7 ± 4.1 | 17.0 ± 3.6 | 13.3 ± 4.5 | |
| TLI total | 0.37 ± 0.51 | 1.8 ± 1.4 | 3.5 ± 2.9 | |
| TLI positive | 0.37 ± 0.51 | 1.4 ± 1.3 | 2.9 ± 3.0 | |
| TLI negative | 0 ± 0 | 0.27 ± 0.61 | 0.58 ± 0.86 |
We note that age information was missing for two participants: one CHR-P subject and one FEP patient and sex information was missing for one CHR-P subject. Results are reported as the mean average and standard deviation where appropriate. Group differences were calculated using a 1-way ANOVA. WRAT IQ, digit span, TLI and education information were missing for one CHR-P subject.
TLI Thought and Language Index, WRAT IQ Wide Range Achievement Test Intelligence Quotient.
Fig. 1Speech profiles.
A Average speech profiles for the control subjects, CHR-P subjects and FEP patients. B, C Example descriptions of one of the TAT pictures, for a particular CHR-P subject and control subject, respectively. The response in part B diverges somewhat from the average control response, with more, shorter sentences, and lower coherence, on-topic score and LCC, for example. The response in part C follows the average control response quite closely, but has a somewhat higher maximum similarity between sentences. We note that the healthy control subject whose speech profile is given in part C was excluded from our calculation of the average control response, to avoid inflating the similarity between their speech profile and the average control profile. Spider plots were generated using code from ref. [48].
Statistical group differences in NLP measures.
| TAT | DCT | Free | |||||||
|---|---|---|---|---|---|---|---|---|---|
| FEP/CON | CHR-P/CON | FEP/CHR-P | FEP/CON | CHR-P/CON | FEP/CHR-P | FEP/CON | CHR-P/CON | FEP/CHR-P | |
| −1.1 (0.26) | −1.3 (0.19) | −1.2 (0.25) | −1.4 (0.17) | 0.50 (0.62) | −0.19 (0.85) | 0.48 (0.63) | |||
| 0.77 (0.44) | 1.1 (0.26) | 0.61 (0.54) | −0.65 (0.52) | 1.2 (0.25) | 0.56 (0.57) | ||||
| Sentence length | −1.9 (0.054) | −1.7 (0.095) | 0 (1) | −0.98 (0.33) | −0.14 (0.89) | −1.8 (0.073) | |||
| Coherence | −1.3 (0.19) | −1.1 (0.28) | −1.8 (0.070) | −1.3 (0.20) | −0.87 (0.39) | ||||
| Tangentiality | −0.95 (0.34) | −0.69 (0.49) | −0.23 (0.81) | 0.76 (0.45) | −0.21 (0.83) | 1.2 (0.22) | N/A | N/A | N/A |
| On-topic | −1.3 (0.20) | −1.6 (0.10) | N/A | N/A | N/A | ||||
| Maximum similarity | 1.7 (0.082) | 0.65 (0.51) | 1.7 (0.090) | −0.72 (0.47) | 0.50 (0.61) | −1.3 (0.20) | 1.6 (0.10) | 0.60 (0.55) | 1.3 (0.20) |
| Ambig. Pronouns | 1.2 (0.25) | 1.8 (0.073) | −0.66 (0.51) | 1.5 (0.14) | 1.1 (0.28) | −0.75 (0.45) | −1.2 (0.23) | 0.20 (0.84) | |
| LCC | −1.5 (0.14) | −1.3 (0.18) | −1.0 (0.31) | 0.77 (0.44) | |||||
| LSC | −1.8 (0.067) | −1.7 (0.090) | −0.95 (0.34) | −1.1 (0.26) | 0.41 (0.68) | 1.3 (0.19) | −0.90 (0.37) | ||
| LCCr | −1.7 (0.084) | −1.3 (0.20) | −1.9 (0.057) | −1.0 (0.31) | 0.55 (0.58) | ||||
| LSCr | −1.4 (0.17) | −0.72 (0.47) | 1.2 (0.23) | ||||||
Z-values are given from Mann–Whitney U-tests, with the corresponding P-values in brackets. Results where P < 0.05 are highlighted in bold. LCC, LSC, LCCr, and LSCr results for the TAT have already been reported by ref. [29].
Fig. 2Box-plots showing group differences in all twelve NLP measures.
Results are shown for speech generated using the TAT.
Fig. 3Relationships between NLP measures.
A Heat mapping showing the relationships (T-statistics) between different NLP measures, calculated using linear regression, controlling for group membership. Colormap from ref. [49]. B Network showing the NLP measures which are significantly associated with each other, with P < 0.01. Corresponding T-statistics are shown on the network edges between measures. *The colorbar was truncated at T = 10 for visualisation purposes; T = 29.79 for the relationship between LCC and LCCr.