| Literature DB >> 36004863 |
Daniela Laricchiuta1, Andrea Termine1, Carlo Fabrizio1, Noemi Passarello1,2, Francesca Greco3, Fabrizio Piras1, Eleonora Picerni1, Debora Cutuli1,4, Andrea Marini5, Laura Mandolesi2, Gianfranco Spalletta1, Laura Petrosini1.
Abstract
The analysis of sequences of words and prosody, meter, and rhythm provided in an interview addressing the capacity to identify and describe emotions represents a powerful tool to reveal emotional processing. The ability to express and identify emotions was analyzed by means of the Toronto Structured Interview for Alexithymia (TSIA), and TSIA transcripts were analyzed by Natural Language Processing to shed light on verbal features. The brain correlates of the capacity to translate emotional experience into words were determined through cortical thickness measures. A machine learning methodology proved that individuals with deficits in identifying and describing emotions (n = 7) produced language distortions, frequently used the present tense of auxiliary verbs, and few possessive determiners, as well as scarcely connected the speech, in comparison to individuals without deficits (n = 7). Interestingly, they showed high cortical thickness at left temporal pole and low at isthmus of the right cingulate cortex. Overall, we identified the neuro-linguistic pattern of the expression of emotional experience.Entities:
Keywords: Natural Language Processing; alexithymia; cortical thickness; emotional processing; machine learning
Year: 2022 PMID: 36004863 PMCID: PMC9404916 DOI: 10.3390/bs12080292
Source DB: PubMed Journal: Behav Sci (Basel) ISSN: 2076-328X
Sociodemographic variables.
| TSIA DIF | TSIA DDF | TSIA EOT | TSIA IP | TSIA TOT | |
|---|---|---|---|---|---|
|
| r = 0.30 | r = 0.32 | r = 0.44 | r = 0.31 | r = 0.41 |
|
| r = −0.39 | r = −0.47 | r = −0.36 | r = −0.30 | r = −0.46 |
Figure 1Verbal analysis of Toronto Structured Interview for Alexithymia (TSIA) transcripts. In the Natural Language Processing (NLP), the Linguistic Profiling Analysis showed High Alexithymia subjects produced more pseudowords or non-words (tag: Other) (A1), used the present tense of auxiliary verbs more (A2), used the possessive determiners less (A3), used fewer coordinating conjunctions (A4), and less coordinating conjunction relation (A5), when compared to Low Alexithymia subjects. In the boxplot are reported the percentages of frequency (median ± percentile). ** P adjusted = 0.01; * at least P adjusted = 0.03. Semantic analysis showed that High Alexithymia subjects used less emotional keywords (feel, happy, to express) and more filler words (maybe, in conclusion, huh) (B). This pattern of differential use of words is confirmed by unsupervised Latent Semantic Analysis (LSA), showing cluster separation between High, Medium (participants with middle alexithymia levels), and Low Alexithymia subjects on a two-dimensional space representing the latent semantic structure. The orthogonal dimensions represent semantically related cluster of words, and Low and High Alexithymia subjects are divided by Dimension 4 and 5 in the LSA space (C). Unsupervised Latent Dirichlet Allocation (LDA) identified 20 topics, described by the top five words for frequency of use. Three topics were significantly different between High and Low Alexithymia subjects (P ≤ 0.05; Topic 1: fashion, reason, end, minute, effect; Topic 2: day, home, week, sister, vacation; Topic 3: sense, parent, news, scene, patient) (D). Exploring sentiment polarization, High Alexithymia subjects were characterized by a reduced frequency of emotion-related words, when compared to Low Alexithymia subjects, specifically for anger and disgust (* at least P = 0.025) (E). In the histograms are reported the frequency mean (mean ± standard deviation).
Figure 2Machine Learning Analysis on Cross-Disciplinary Data. Natural Language Processing (NLP) data were aggregated with demographic, neuroimaging, and psychological data to predict Toronto Structured Interview for Alexithymia (TSIA) scores. Candidate features (n = 92) were extracted using univariate filtering with Spearman correlation method, Principal Components Regression (PCR) model with three components was selected as it had the best performance over evaluation metrics assessed in Leave-One-Out Cross-Validation (LOOCV) (R2 = 0.86; Root Mean Square Error—RMSE = 0.35) (A,B). Variable importance is reported (C1,C2). Namely, 85 candidate features resulted to belong to NLP, 3 to psychological measures, 2 to neuroimaging data, and 2 to demographic data. The variable importance of neuroimaging data showed that the increased left temporal pole thickness (orange colored area, D1) predicted increased TSIA scores, while increased right isthmus cingulate thickness (orange colored area, D2) predicted decreased TSIA scores. In (D1,D2) left is left.
Figure 3Machine Learning Analysis on Speech Records. Univariate correlation filtering was applied to select 212 candidate acoustic features belonging to the ComParE feature set and Principal Components Regression (PCR) model with 5 components was selected as the best fitting based on R2 and Root Mean Square Error (RMSE) metrics in Leave-One-Out Cross-Validation (LOOCV) (R2 = 0.74; RMSE = 6.74) (A,B). Relative variable importance was computed over the domains of ComParE low-level descriptors, namely sound quality, spectral, cepstral, and prosodic (C). Representative differences in audio tracks (recorded in responding to an item of Difficulty in Describing Feelings—DDF) from High Alexithymia subjects and Low Alexithymia subjects can be observed in spectrograms (D1,D2).