| Literature DB >> 35951619 |
Giulia Gargano1, Elisabetta Caletti2, Cinzia Perlini3,4, Nunzio Turtulici1, Marcella Bellani3,4, Carolina Bonivento5, Marco Garzitto6, Francesca Marzia Siri2, Chiara Longo2, Chiara Bonetto3, Doriana Cristofalo3, Paolo Scocco7, Enrico Semrov8, Antonio Preti9, Lorenza Lazzarotto3, Francesco Gardellin10, Antonio Lasalvia3,4, Mirella Ruggeri3,4, Andrea Marini6, Paolo Brambilla1,2.
Abstract
Language production has often been described as impaired in psychiatric diseases such as in psychosis. Nevertheless, little is known about the characteristics of linguistic difficulties and their relation with other cognitive domains in patients with a first episode of psychosis (FEP), either affective or non-affective. To deepen our comprehension of linguistic profile in FEP, 133 patients with FEP (95 non-affective, FEP-NA; 38 affective, FEP-A) and 133 healthy controls (HC) were assessed with a narrative discourse task. Speech samples were systematically analyzed with a well-established multilevel procedure investigating both micro- (lexicon, morphology, syntax) and macro-linguistic (discourse coherence, pragmatics) levels of linguistic processing. Executive functioning and IQ were also evaluated. Both linguistic and neuropsychological measures were secondarily implemented with a machine learning approach in order to explore their predictive accuracy in classifying participants as FEP or HC. Compared to HC, FEP patients showed language production difficulty at both micro- and macro-linguistic levels. As for the former, FEP produced shorter and simpler sentences and fewer words per minute, along with a reduced number of lexical fillers, compared to HC. At the macro-linguistic level, FEP performance was impaired in local coherence, which was paired with a higher percentage of utterances with semantic errors. Linguistic measures were not correlated with any neuropsychological variables. No significant differences emerged between FEP-NA and FEP-A (p≥0.02, after Bonferroni correction). Machine learning analysis showed an accuracy of group prediction of 76.36% using language features only, with semantic variables being the most impactful. Such a percentage was enhanced when paired with clinical and neuropsychological variables. Results confirm the presence of language production deficits already at the first episode of the illness, being such impairment not related to other cognitive domains. The high accuracy obtained by the linguistic set of features in classifying groups support the use of machine learning methods in neuroscience investigations.Entities:
Mesh:
Year: 2022 PMID: 35951619 PMCID: PMC9371299 DOI: 10.1371/journal.pone.0272873
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Sociodemographic and clinical data of the sample of FEP and HC.
| HC (n = 133) | FEP (n = 133) | Comparison | |
|---|---|---|---|
| Female % | 51.88% | 39.85% | χ2(1) = 3.41, p = 0.07 |
| Age in years | 33.07 ±9.56 (19, 64) | 28.93 ±9.05 (16, 53) | t(263.2) = 3.62, p<0.00* |
| Education in years | 15.62 ±3.59 (5.0, 26.0) | 11.58 ±3.06 (5.0, 18.0) | t(257.7) = 9.85, p<0.00* |
| GAF | 83.14 ±6.19 | 46.31 ±12.94 | t(197.3) = 28.64, p<0.00* |
| PANSS | - | 34.95 ±8.93 | - |
| HAM-D | - | 15.23 ±7.875 (0.0, 61.0) | - |
| BRMRS | - | 2.59 ± 3.40 | - |
| DUP | - | 294.87 ± 571.99 | - |
HC, Healthy Controls; FEP, First Episode Psychosis; GAF, Global Assessment of Functioning
PANSS, Positive and Negative Syndrome Scale, General psychopathology subscale; HAM-D, Hamilton’s Depression Rating Scale; BRMRS, Bech-Rafaelsen Mania Rating Scale; DUP, Duration of untreated psychosis.
Linguistic variables in HC and FEP.
| HC (N = 133) | FEP (N = 133) | ||
|---|---|---|---|
| Microlinguistic measures | Mean ± st.dev. | Mean ± st.dev. | Comparison |
| False-starts | 0.94 ±1.14 | 1.19±2.91 | t(171.6) = -0.92, p = 0.361 |
| Speech-rate (words per minute) | 134.29 ±25.31 | 119.43 ±38.37 | t(228.6) = 3.73, p< |
| Mean length of utterances (words) | 7.19±1.67 | 6.23 ±1.53 | t(261.8) = 4.86, p< |
| Phonological paraphasias | 0.05±0.26 | 0.14 ±0.40 | t(223.1) = -1.99, p = |
| Semantic paraphasias | 0.19±0.64 | 0.35±0.75 | t(257.9) = -1.85, p = 0.066 |
| Verbal paraphasias | 0.04±0.19 | 0.10±0.32 | t(214.4) = -1.85, p = 0.066 |
| Undefined words | 0.00±0.00 | 0.02±0.15 | t(132.0) = -1.75, p = 0.083 |
| Lexical fillers | 7.02±8.97 | 4.74 ±7.61 | t(257.1) = 2.23, p = |
| Syntactic completeness (%) | 45.90±18.91 | 40.13 ±21.71 | t(259.1) = 2.31, p = |
|
| |||
| Local coherence errors (ambiguous) | 0.34±0.61 | 0.53 ±0.83 | t(243.1) = -2.18, p = |
| Local coherence errors (missings) | 1.25±1.36 | 0.93 ±1.04 | t(246.7) = 2.13, p = |
| Utterances with semantic errors (words) | 1.58±3.52 | 3.35 ±8.84 | t(172.8) = -2.14, p = |
| Repetitions (%) | 5.08±4.00 | 6.20±5.26 | t(246.5) = -1.96, p = |
| Global coherence errors (%) | 0.00±0.00 | 0.06±0.47 | t(132.0) = -1.40, p = 0.165 |
HC, Healthy Controls; FEP, First Episode Psychosis.
Post-Hoc analysis (HC vs FEP-NA).
| HC (N = 133) | FEP-NA (N = 95) | ||||
|---|---|---|---|---|---|
| Microlinguistic measures | Mean ± st.dev. | Mean ± st.dev. | t-test | df | p-value |
| Speech-rate (words per minute) | 134.29±25.31 | 119.56±40.62 | -3.13 | 145.40 |
|
| Mean length of utterances (words) | 7.19 ±1.67 | 6.23±1.53 | -4.42 | 212.12 |
|
| Phonological paraphasias | 0.05±0.26 | 0.16±0.42 | 2.17 | 143.08 | 0.03 |
| Lexical fillers | 7.02±8.97 | 4.63±7.91 | -2.12 | 216.14 | 0.04 |
| Syntactic completeness (%) | 45.90±18.91 | 38.01±20.92 | -2.87 | 188.62 |
|
|
| |||||
| Local coherence errors (ambiguous) | 0.34±0.61 | 0.53±0.86 | 1.82 | 159.70 | 0.07 |
| Local coherence errors (missings) | 1.25±1.36 | 0.99±1.09 | -.159 | 223.19 | 0.11 |
| Utterances with semantic errors (words) | 1.58±3.52 | 3.06±5.48 | 2.32 | 148.32 |
|
HC, Healthy Controls; FEP-NA, First Episode Psychosis–Non-Affective.
Post-Hoc analysis (FEP-A vs FEP-NA).
| FEP-A (N = 38) | FEP-NA (N = 95) | ||||
|---|---|---|---|---|---|
| Microlinguistic measures | Mean ± st.dev. | Mean ± st.dev. | t-test | df | p-value |
| Speech-rate (words per minute) | 119.11±32.56 | 119.56±40.62 | 0.07 | 84.50 | 0.95 |
| Mean length of utterances (words) | 6.24±1.54 | 6.23±1.53 | -0.02 | 67.56 | 0.98 |
| Phonological paraphasias | 0.08±0.36 | 0.16±0.42 | 1.09 | 79.42 | 0.28 |
| Lexical fillers | 5.00±6.88 | 4.63±7.91 | -0.27 | 77.93 | 0.79 |
| Syntactic completeness (%) | 45.45±22.98 | 38.01±20.92 | -1.73 | 62.88 | 0.09 |
|
| |||||
| Local coherence errors (ambiguous) | 0.55±0.76 | 0.53±0.86 | -0.17 | 76.74 | 0.86 |
| Local coherence errors (missings) | 0.79±0.91 | 0.99±1.09 | 1.09 | 81.32 | 0.28 |
| Utterances with semantic errors (words) | 4.05±14.20 | 3.06±5.48 | -0.42 | 41.48 | 0.68 |
FEP-A, First Episode Psychosis–Affective; FEP-NA, First Episode Psychosis–Non-Affective.
Cognitive and neuropsychological data in FEP and HC.
| HC | FEP | Comparison | |
|---|---|---|---|
| IQ (TIB) | 120.25 ±5.50 | 110.83 ±7.73 | t(195.8) = 10.74, p<0.00* |
| Span of Apprehension 3-trials, Sensitivity | 0.48 ±0.02 | 0.42 ±0.12 | t(146.0) = 5.06, p<0.00* |
| Span of Apprehension 12-trials, Sensitivity | 0.35 ±0.09 | 0.27 ±0.12 | t(178.9) = 5.39, p<0.00* |
| Span of Apprehension 3-trials, Specificity | 0.94 ±0.05 | 0.87 ±0.22 | t(151.9) = 3.50, p<0.00* |
| Span of Apprehension 12-trials, Specificity | 0.87±0.13 | 0.84±0.20 | t(188.8) = 1.37, p = 0.17 |
| N-Back 0-trials, Sensitivity | 0.99 ±0.04 | 0.96 ±0.12 | t(122.6) = 2.61, p = 0.01* |
| N-Back 1-trials, Sensitivity | 0.97 ±0.13 | 0.80 ±0.33 | t(137.0) = 4.95, p<0.00* |
| N-Back 2-trials, Sensitivity | 0.88 ±0.27 | 0.69 ±0.38 | t(192.5) = 4.54, p<0.00* |
| N-Back 3-trials, Sensitivity | 0.63 ±0.37 | 0.44 ±0.37 | t(230.8) = 4.05, p<0.00* |
| N-Back 0-trials, Specificity | 0.98 ±0.03 | 0.93 ±0.15 | t(116.9) = 3.25, p<0.00* |
| N-Back 1-trials, Specificity | 1.00 ±0.01 | 0.95 ±0.18 | t(109.0) = 2.51, p = 0.01* |
| N-Back 2-trials, Specificity | 1.00±0.02 | 0.97±0.16 | t(111.5) = 1.86, p = 0.07 |
| N-Back 3-trials, Specificity | 0.98 ±0.05 | 0.95 ±0.14 | t(125.7) = 1.92, p = 0.06 |
HC, Healthy Controls; FEP, First Episode Psychosis; IQ, Intelligence Quotient; TIB, Brief Intelligence Test.
Fig 1Language production variables feature gain.
XGBoost models precision, recall, f1-score and accuracy metrics by different predictive variables sets.
| Features | precision | recall | f1-score | accuracy |
|---|---|---|---|---|
|
| 79.87% | 91.35% | 85.04% | 84.11% |
|
| 78.57% | 84.03% | 80.83% | 82.53% |
|
| 97.97% | 98.01% | 97.94% | 97.72% |
|
| 73.29% | 63.75% | 67.59% | 70.04% |
|
| 74.86% | 93.40% | 82.81% | 75.05% |
|
| 82.99% | 64.15% | 71.43% | 77.27% |
|
| 77.23% | 92.78% | 83.97% | 75.51% |
|
| 79.80% | 82.45% | 80.49% | 82.39% |
|
| 81.28% | 86.19% | 83.38% | 78.18% |
|
| 76.94% | 67.65% | 71.43% | 75.69% |
SOA, Span of Apprehension; GAF, Global Assessment of Functioning; IQ, Intelligence Quotient; TIB, Brief Intelligence Test
Post-Hoc analysis (HC vs FEP-A).
| HC (N = 133) | FEP-A (N = 38) | ||||
|---|---|---|---|---|---|
| Microlinguistic measures | Mean ± st.dev. | Mean ± st.dev. | t-test | df | p-value |
| Speech-rate (words per minute) | 134.29±25.31 | 119.11±32.56 | -2.65 | 50.45 |
|
| Mean length of utterances (words) | 7.19 ±1.67 | 6.24±1.54 | -3.23 | 63.57 |
|
| Phonological paraphasias | 0.05±0.26 | 0.08±0.36 | 0.42 | 48.23 | 0.68 |
| Lexical fillers | 7.02±8.97 | 5.00±6.88 | -1.48 | 76.64 | 0.14 |
| Syntactic completeness (%) | 45.90±18.91 | 45.45±22.98 | -0.07 | 51.89 | 0.94 |
|
| |||||
| Local coherence errors (ambiguous) | 0.34±0.61 | 0.55±0.76 | 1.60 | 51.57 | 0.12 |
| Local coherence errors (missings) | 1.25±1.36 | 0.79±0.91 | -2.43 | 89.81 |
|
| Utterances with semantic errors (words) | 1.58±3.52 | 4.05±14.20 | 1.06 | 38.30 | 0.29 |
HC, Healthy Controls; FEP-A, First Episode Psychosis—Affective.