| Literature DB >> 30936845 |
Jana M Havigerová1, Jiří Haviger2, Dalibor Kučera3, Petra Hoffmannová1.
Abstract
This study examines the relationship between language use and psychological characteristics of the communicator. The aim of the study was to find models predicting the depressivity of the writer based on the computational linguistic markers of his/her written text. Respondents' linguistic fingerprints were traced in four texts of different genres. Depressivity was measured using the Depression, Anxiety and Stress Scale (DASS-21). The research sample (N = 172, 83 men, 89 women) was created by quota sampling an adult Czech population. Morphological variables of the texts showing differences (M-W test) between the non-depressive and depressive groups were incorporated into predictive models.Entities:
Keywords: depression; genre; morphology; predictive model; quantitative linguistics
Year: 2019 PMID: 30936845 PMCID: PMC6431661 DOI: 10.3389/fpsyg.2019.00513
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Age group, education, and gender of participants (Nresp = 172).
| % of Total | ||||||
|---|---|---|---|---|---|---|
| Education | Total % | |||||
| Gender | Primary % | High school % | University % | |||
| Male | Age group | Younger than 25 | 7.2 | 7.2 | 1.2 | 15.7 |
| 25–34 | 1.2 | 14.5 | 6.0 | 21.7 | ||
| 35–55 | 3.6 | 30.1 | 8.4 | 42.2 | ||
| Older than 55 | 2.4 | 12.0 | 6.0 | 20.5 | ||
| Total | 14.5 | 63.9 | 21.7 | 100.0 | ||
| Female | Age group | Younger than 25 | 5.6 | 5.6 | 1.1 | 12.4 |
| 25 to 34 | 1.1 | 11.2 | 5.6 | 18.0 | ||
| 35 to 55 | 2.2 | 24.7 | 6.7 | 33.7 | ||
| Older than 55 | 2.2 | 27.0 | 6.7 | 36.0 | ||
| Total | 11.2 | 68.5 | 20.2 | 100.0 | ||
| Total | Age group | Younger than 25 | 6.4 | 6.4 | 1.2 | 14.0 |
| 25 to 34 | 1.2 | 12.8 | 5.8 | 19.8 | ||
| 35 to 55 | 2.9 | 27.3 | 7.6 | 37.8 | ||
| Older than 55 | 2.3 | 19.8 | 6.4 | 28.5 | ||
| Total | 12.8 | 66.3 | 20.9 | 100.0 | ||
Frequency of outliers (Nresp = 172, Ntext = 688).
| Male ( | Female ( | |||
|---|---|---|---|---|
| Non-outliers | Outliers | Non-outliers | Outliers | |
| TXT1 | 81 | 2 | 88 | 1 |
| TXT2 | 81 | 2 | 85 | 4 |
| TXT3 | 81 | 2 | 86 | 3 |
| TXT4 | 82 | 1 | 88 | 1 |
Frequency of depression in sample (Nresp = 172).
| DASS_D | Male ( | Female ( |
|---|---|---|
| Non-depressive (D ≤ 6) | 65.5% | 81.3% |
| Depressive (D > 6) | 34.5% | 18.7% |
Significance of intergroup differences in ql variables: Mann–Whitney (Nresp = 172, Ntext = 688).
| Male ( | Female ( | |||||||
|---|---|---|---|---|---|---|---|---|
| TXT1 | TXT2 | TXT3 | TXT4 | TXT1 | TXT2 | TXT3 | TXT4 | |
| Singularity index | 0.213 | 0.175 | 0.503 | 0.374 | 0.120 | 0.653 | 0.008 | 0.459 |
| Singularity_P index | 0.039 | 0.208 | 0.157 | 0.470 | 0.211 | 0.540 | 0.542 | 0.291 |
| Negativity index | 0.026 | 0.000 | 0.013 | 0.122 | 0.664 | 0.613 | 0.863 | 0.399 |
| Words per sentence | 0.125 | 0.266 | 0.062 | 0.020 | 0.676 | 0.475 | 0.916 | 0.750 |
| Sentence complexity | 0.000 | 0.030 | 0.005 | 0.004 | 0.518 | 0.488 | 0.578 | 0.721 |
| Punctuations per sentence | 0.130 | 0.826 | 0.458 | 0.612 | 0.290 | 0.420 | 0.487 | 0.009 |
| Coherence index | 0.528 | 0.613 | 0.732 | 0.204 | 0.747 | 0.236 | 0.748 | 0.530 |
| Pronom index | 0.059 | 0.279 | 0.006 | 0.205 | 0.131 | 0.711 | 0.210 | 0.469 |
| Formality index | 0.023 | 0.086 | 0.001 | 0.062 | 0.112 | 0.566 | 0.523 | 0.317 |
| Trager index | 0.022 | 0.011 | 0.011 | 0.032 | 0.128 | 0.117 | 0.846 | 0.189 |
| Readiness_to_action index | 0.014 | 0.049 | 0.002 | 0.058 | 0.171 | 0.870 | 0.781 | 0.965 |
| Aggressiveness index | 0.013 | 0.063 | 0.008 | 0.243 | 0.234 | 0.468 | 0.653 | 0.910 |
| Activity index | 0.011 | 0.025 | 0.021 | 0.259 | 0.076 | 0.070 | 0.474 | 0.390 |
Predictors of membership in depressive sample: Logistic regression.
| Male (n | Female (n | |||||||
|---|---|---|---|---|---|---|---|---|
| TXT1 | TXT2 | TXT3 | TXT4 | TXT1 | TXT2 | TXT3 | TXT4 | |
| Singularity index | 0.701 | 0.062 | 0.845 | 0.050 | 0.058 | 0.630 | 0.007 | 0.349 |
| Singularity_P index | 0.044 | 0.630 | 0.017 | 0.585 | 0.112 | 0.392 | 0.746 | 0.728 |
| Negativity index | 0.635 | 0.534 | 0.401 | 0.153 | 0.169 | 0.071 | 0.295 | 0.247 |
| Words per sentence | 0.046 | 0.172 | 0.250 | 0.248 | 0.046 | 0.059 | 0.411 | 0.194 |
| Sentence complexity | 0.004 | 0.089 | 0.575 | 0.536 | 0.198 | 0.050 | 0.618 | 0.201 |
| Punctuations per sentence | 0.236 | 0.210 | 0.841 | 0.087 | 0.818 | 0.041 | 0.677 | 0.798 |
| Pronom index | 0.661 | 0.042 | 0.017 | 0.720 | 0.385 | 0.682 | 0.740 | 0.323 |
| Formality index | 0.479 | 0.631 | 0.025 | 0.723 | 0.525 | 0.605 | 0.234 | 0.236 |
| Trager index | 0.639 | 0.759 | 0.019 | 0.841 | 0.599 | 0.139 | 0.186 | 0.182 |
| Readiness_to_action index | 0.333 | 0.034 | 0.017 | 0.688 | 0.052 | 0.941 | 0.506 | 0.083 |
| Aggressiveness index | 0.162 | 0.418 | 0.009 | 0.863 | 0.060 | 0.217 | 0.999 | 0.567 |
| Activity index | 0.374 | 0.352 | 0.007 | 0.694 | 0.059 | 0.554 | 0.622 | 0.353 |
| Constant | 0.431 | 0.170 | 0.059 | 0.637 | 0.639 | 0.248 | 0.086 | 0.914 |
Coefficients of model quality and predictive power: Logistic regression (Nresp = 172, Ntext = 688).
| Male ( | Female ( | |||||||
|---|---|---|---|---|---|---|---|---|
| TXT1 | TXT2 | TXT3 | TXT4 | TXT1 | TXT2 | TXT3 | TXT4 | |
| Nagelkerke | 0.479 | 0.526 | 0.435 | 0.310 | 0.330 | 0.670 | 0.324 | 0.284 |
| Recall | 0.607 | 0.571 | 0.556 | 0.500 | 0.235 | 0.800 | 0.267 | 0.267 |
| Precision | 0.739 | 0.889 | 0.789 | 0.647 | 0.571 | 0.889 | 0.667 | 0.667 |