| Literature DB >> 31877156 |
Indy Wijngaards1,2, Martijn Burger1,3, Job van Exel2.
Abstract
Recent advances in computer-aided text analysis (CATA) have allowed organizational scientists to construct reliable and convenient measures from open texts. As yet, there is a lack of research into using CATA to analyze responses to open survey questions and constructing text-based measures of psychological constructs. In our study, we demonstrated the potential of CATA methods for the construction of text-based job satisfaction measures based on responses to a completely open and semi-open question. To do this, we employed three sentiment analysis techniques: Linguistic Inquiry and Word Count 2015, SentimentR and SentiStrength, and quantified the forms of measurement error they introduced: specific factor error, algorithm error and transient error. We conducted an initial test of the text-based measures' validity, assessing their convergence with closed-question job satisfaction measures. We adopted a time-lagged survey design (Nwave 1 = 996; Nwave 2 = 116) to test our hypotheses. In line with our hypotheses, we found that specific factor error is higher in the open question text-based measure than in the semi-open question text-based measure. As expected, algorithm error was substantial for both the open and semi-open question text-based measures. Transient error in the text-based measures was higher than expected, as it generally exceeded the transient error in the human-coded and the closed job satisfaction question measures. Our initial test of convergent and discriminant validity indicated that the semi-open question text-based measure is especially suitable for measuring job satisfaction. Our article ends with a discussion of limitations and an agenda for future research.Entities:
Mesh:
Year: 2019 PMID: 31877156 PMCID: PMC6932814 DOI: 10.1371/journal.pone.0226408
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Demographics of the wave 1 (N = 997) and wave 2 (N = 116).
| Wave 1 | Wave 2 | |||
|---|---|---|---|---|
| Characteristic | N | % | N | % |
| Age | ||||
| Mean | 35.6 | 39.6 | ||
| Standard deviation | 9.8 | 10.5 | ||
| Gender | ||||
| Female | 744 | 74.6 | 68 | 58.6 |
| Male | 253 | 25.4 | 48 | 41.4 |
| Marital status | ||||
| Divorced | 45 | 4.5 | 9 | 7.8 |
| In a relationship | 317 | 31.8 | 29 | 25.0 |
| Married | 446 | 44.7 | 55 | 47.4 |
| Single | 185 | 18.6 | 23 | 19.8 |
| Widowed | 4 | 0.4 | 0 | 0.0 |
| Education | ||||
| Less than high school | 8 | 0.8 | 1 | 0.9 |
| High school graduate | 139 | 13.9 | 13 | 11.2 |
| Professional degree | 87 | 8.7 | 4 | 3.5 |
| Some college | 258 | 25.9 | 23 | 19.8 |
| 2-year degree | 79 | 7.9 | 13 | 11.2 |
| 4-year degree | 278 | 27.9 | 40 | 34.5 |
| Master’s degree | 126 | 12.6 | 20 | 17.2 |
| Doctorate | 22 | 2.2 | 2 | 1.7 |
N = sample size; % = percentage
Summary of measures.
| Words | Rating/scores | |||
|---|---|---|---|---|
| Measure | M/SD | Mean | SD | α |
| Closed question | ||||
| | ||||
| General job satisfaction | 6.42 | 2.37 | ||
| Satisfaction with work environment | 6.61 | 2.25 | ||
| Satisfaction with work content | 6.56 | 2.26 | ||
| Satisfaction with team | 7.45 | 2.08 | ||
| Satisfaction with supervisor | 6.77 | 2.79 | ||
| Satisfaction with work-life balance | 6.45 | 2.49 | ||
| Satisfaction with company | 6.47 | 2.52 | ||
| Satisfaction with pay | 5.44 | 2.47 | ||
| P-O fit | 5.00 | 1.30 | .87 | |
| Virtuous leadership | 4.78 | 1.38 | .97 | |
| Life satisfaction | 6.80 | 1.85 | ||
| Flourishing | 5.38 | 0.95 | .91 | |
| OCB | 3.13 | 0.78 | .87 | |
| | ||||
| General job satisfaction | 5.99 | 2.91 | ||
| Open question | ||||
| | 48.47/39.72 | |||
| Independent coders | 3.35 | 1.09 | ||
| LIWC 2015 | 4.21 | 1.34 | ||
| SentimentR | 2.81 | 0.62 | ||
| SentiStrength | 3.09 | 0.72 | ||
| | 64.99/42.12 | |||
| Independent coders | 3.24 | 1.30 | ||
| LIWC 2015 | 3.77 | 1.55 | ||
| SentimentR | 3.09 | 0.83 | ||
| SentiStrength | 3.16 | 0.92 | ||
| Semi-open question | ||||
| | 4.65/1.93 | |||
| Independent coders | 3.16 | 0.91 | .92 | |
| LIWC 2015 | 3.69 | 1.73 | .49 | |
| SentimentR | 3.22 | 0.76 | .74 | |
| SentiStrength | 3.41 | 0.68 | .91 | |
| | 4.45/1.62 | |||
| Independent coders | 3.00 | 1.30 | .87 | |
| LIWC 2015 | 3.16 | 1.87 | .69 | |
| SentimentR | 3.05 | 1.07 | .69 | |
| SentiStrength | 3.26 | 0.78 | .92 | |
LIWC = Linguistic Inquiry and Word Count; M = Mean; SD = Standard deviation; α = Cronbach’s α.
Most frequently used words in responses to open and semi-open job satisfaction question.
| Low job satisfaction (N = 193) | Moderate job satisfaction (N = 242) | High job satisfaction (N = 562) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Open | Semi-open | Open | Semi-open | Open | Semi-open | ||||||
| N | Word | N | Word | N | Word | N | Word | N | Word | N | Word |
| 244 | Job | 56 | Boring | 282 | Job | 39 | Rewarding | 650 | Job | 131 | Rewarding |
| 207 | Work | 41 | Stressful | 253 | Work | 35 | Challenging | 546 | Work | 103 | Challenging |
| 111 | Feel | 29 | Repetitive | 133 | Feel | 31 | Stressful | 326 | Feel | 97 | Interesting |
| 109 | Like | 25 | Tiring | 105 | Like | 30 | Busy | 248 | Happy | 79 | Busy |
| 53 | Enjoy | 19 | Busy | 83 | Enjoy | 27 | Interesting | 245 | Like | 73 | Fun |
| 53 | Get | 19 | Frustrating | 67 | Get | 26 | Boring | 224 | Enjoy | 50 | Important |
| 51 | Time | 16 | Challenging | 65 | Happy | 25 | Hard | 141 | Working | 44 | Stressful |
| 50 | People | 16 | Hard | 64 | People | 25 | Tiring | 131 | Can | 42 | Happy |
| 38 | Much | 13 | Rewarding | 58 | Can | 23 | Repetitive | 118 | People | 41 | Enjoyable |
| 37 | However | 13 | Dull | 49 | Time | 19 | Easy | 106 | Get | 40 | Exciting |
N = Number of observations.
* = The most frequently used stop words in the English language are omitted from the textual data [86].
Fig 1Histograms of the sentiment measures based on the open question.
Fig 2Histograms of the sentiment measures based on the semi-open question.
Correlations between open text-based measures and closed job satisfaction question (N = 1,113) and test-retest reliability (N = 116).
| Human coding | LIWC 2015 | SentimentR | SentiStrength | Closed question | |
|---|---|---|---|---|---|
| Human coding | .543 | ||||
| LIWC 2015 | .508 | .249 | |||
| SentimentR | .532 | .512 | .329 | ||
| SentiStrength | .587 | .510 | .487 | .189 | |
| Closed question | .726 | .393 | .407 | .464 | .502 |
Test-retest reliability values are displayed on the diagonal; All correlations significant at the level of p < .05; LIWC = Linguistic Inquiry and Word Count
Correlations between semi-open text-based measures and closed job satisfaction question (N = 1,113) and test-retest reliability (N = 116).
| Human coding | LIWC 2015 | SentimentR | SentiStrength | Closed question | |
|---|---|---|---|---|---|
| Human coding | .314 | ||||
| LIWC 2015 | .775 | .311 | |||
| SentimentR | .772 | .708 | .244 | ||
| SentiStrength | .696 | .704 | .665 | .250 | |
| Closed question | .628 | .576 | .593 | .547 | .502 |
Test-retest reliability values are displayed on the diagonal; All correlations significant at the level of p < .05; LIWC = Linguistic Inquiry and Word Count
Correlations between text-based measures and closed question measures (N = 997).
| Job facet satisfaction | Antecedents | Outcomes | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Text-based measure | General job satisfaction | Work environment | Work content | Team | Supervisor | Work-balance | Company | Pay | P-O fit | Virtuous leadership | Life satisfaction | Flourishing | OCB | |
| Independent coder ratings | Open | .703 | .503 | .598 | .445 | .512 | .412 | .582 | .355 | .539 | .482 | .413 | .400 | .168 |
| Semi-open | .618 | .458 | .555 | .361 | .438 | .370 | .537 | .336 | .487 | .427 | .361 | .326 | .128 | |
| LIWC 2015 | Open | .373 | .303 | .333 | .230 | .244 | .233 | .337 | .200 | .290 | .234 | .233 | .233 | . |
| Semi-open | .564 | .412 | .497 | .329 | .376 | .309 | .474 | .301 | .443 | .359 | .318 | .288 | .092 | |
| SentimentR | Open | .382 | .301 | .345 | .256 | .305 | .245 | .342 | .229 | .279 | .292 | .220 | .194 | .078 |
| Semi-open | .579 | .444 | .511 | .367 | .423 | .328 | .504 | .277 | .440 | .397 | .295 | .279 | .074 | |
| SentiStrength | Open | .457 | .374 | .396 | .288 | .300 | .286 | .368 | .199 | .360 | .289 | .287 | .283 | .074 |
| Semi-open | .541 | .402 | .480 | .348 | .368 | .288 | .475 | .282 | .422 | .359 | .302 | .272 | .070 | |
P-O = Person-Organization; OCB = Organizational citizenship behavior; LIWC = Linguistic Inquiry and Word Count, N = Sample size; All ps < .05, except for the one in italic.