| Literature DB >> 27322382 |
Nan Zhao1, Dongdong Jiao2, Shuotian Bai3, Tingshao Zhu1.
Abstract
The increasing need of automated analyzing web texts especially the short texts on Social Network Services (SNS) brings new demands of computerized text analysis instruments. The psychometric properties are the basis of the extensive use of these instruments such as the Linguistic Inquiry and Word Count (LIWC). For this study, Sina Weibo statuses were analyzed via rater coding and Simplified Chinese version of LIWC (SCLIWC), in order to evaluate the validity of SCLIWC in detecting psychological expressions in Weibo statuses (n = 60) and in identifying the psychological meaning of a single Weibo status (n = 11). Significant correlations between human ratings and SCLIWC scores and the high sensitivities of capturing single statuses with certain expressions identified by raters, proved the validity of SCLIWC in detecting psychological expressions. The results also suggested that, the efficiency of SCLIWC in detecting psychological expressions of SNS short texts could be higher if using status count scoring method, rather than the word count method as the common usage of LIWC. However, SCLIWC may not perform well in identifying the psychological meaning of a single piece of SNS short text because of its over-identification of target expressions. This study provided primary evidence of validity of SCLIWC, as well as the proper way of using it efficiently on SNS short texts.Entities:
Mesh:
Year: 2016 PMID: 27322382 PMCID: PMC4920595 DOI: 10.1371/journal.pone.0157947
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The percentage of total words detected by the SCLIWC categories on Weibo statuses, Renren blogs and news comments.
| self and others | affective processes | cognitive processes | concerned contents | |
|---|---|---|---|---|
| Ist Per. 2.74(1.96) | Pos. Emo. 3.40(1.62) | Insight 1.87(1.06) | Bio. 3.40(1.81) | |
| Family .48(.38) | Neg. Emo. 1.69(.98) | Caus. 1.08(.70) | Work 2.72(1.68) | |
| Friend .19(.15) | Anxiety .23(.21) | Disc. 2.58(1.71) | Achieve 1.30(.82) | |
| Anger .49(.50) | Tent. 2.16(1.23) | Leisure 2.23(1.22) | ||
| Sad .35(.29) | Money .69(.51) | |||
| Death .31(.35) | ||||
| Renren | Ist Per. 3.44(1.73) | Pos. Emo. 2.54(1.04) | Insight 2.52(1.05) | Bio 2.23(1.42) |
| Family .33(.50) | Neg. Emo. 1.62(.81) | Caus. 1.33(.64) | Work 2.73(1.82) | |
| Friend .18(.22) | Anxiety .23(.19) | Disc. 2.40(.98) | Achieve 1.57(1.19) | |
| Anger .32(.25) | Tent. 2.72(1.13) | Leisure 1.46(1.16) | ||
| Sad .37(.42) | Money .49(.50) | |||
| Death .13(.22) | ||||
| Media | Ist Per. .44(.47) | Pos. Emo. 2.11(1.34) | Insight 2.77(.96) | Bio .94(1.55) |
| Family .16(.39) | Neg. Emo. 1.16(.82) | Caus. 2.11(.81) | Work 8.71(3.79) | |
| Friend .13(.54) | Anxiety .20(.24) | Disc. 1.93(.69) | Achieve 3.86(2.01) | |
| Anger .35(.39) | Tent. 1.86(.83) | Leisure .97(1.57) | ||
| Sad .15(.19) | Money 1.50(1.76) | |||
| Death .17(.59) |
The standard deviation (SD) was presented in the parentheses following the mean percentage. The results of Weibo statuses were calculated from all the valid Weibo statuses collected of each Weibo user in our study.
correlations between human ratings and SCLIWC word count scores on Weibo statuses, Renren blogs and news comments.
| self and others | affective processes | cognitive processes | concerned contents | |
|---|---|---|---|---|
| Ist Per. .37 | Pos. Emo. .33 | Insight .25 | Bio. .44 | |
| Family .37 | Neg. Emo. .46 | Caus. .29 | Work .47 | |
| Friend .29 | Anxiety .57 | Disc. .18 | Achieve .35 | |
| Anger .41 | Tent. .17 | Leisure .41 | ||
| Sad .35 | Money .43 | |||
| Death .22 | ||||
| Renren | Ist Per. .37 | Pos. Emo. .37 | Insight .59 | Bio. .70 |
| Family .53 | Neg. Emo. .44 | Caus. .43 | Work .59 | |
| Friend .28 | Anxiety .36 | Disc. -.05 | Achieve .38 | |
| Anger .27 | Tent. .62 | Leisure .57 | ||
| Sad .46 | Money .84 | |||
| Death .73 | ||||
| Media | Ist Per. .27 | Pos. Emo. .37 | Insight .12 | Bio. .70 |
| Family .71 | Neg. Emo. .50 | Caus. .15 | Work .39 | |
| Friend .74 | Anxiety .46 | Disc. .15 | Achieve .53 | |
| Anger .26 | Tent. .57 | Leisure .85 | ||
| Sad .38 | Money .80 | |||
| Death .83 |
*p < 0.05
**p < 0.01.
Correlations between human ratings and SCLIWC scores (word count/status count) on Weibo statuses of different time spans.
| self and others | affective processes | cognitive processes | concerned contents | |
|---|---|---|---|---|
| day | Ist Per. | Pos. Emo. | Insight | Bio. .57 |
| Family .49 | Neg. Emo. | Caus. | Work | |
| Friend .01/.09 | Anxiety | Disc. -.07/.16 | Achieve .22/.14 | |
| Anger .51 | Tent. .01/.17 | Leisure | ||
| Sad | Money | |||
| Death | ||||
| week | Ist Per. | Pos. Emo. | Insight | Bio. |
| Family .42 | Neg. Emo. | Caus. | Work | |
| Friend .16/.17 | Anxiety .39 | Disc. -.05/.08 | Achieve .03/.21 | |
| Anger | Tent. | Leisure | ||
| Sad | Money | |||
| Death | ||||
| month | Ist Per. | Pos. Emo. | Insight | Bio. |
| Family | Neg. Emo. | Caus. | Work | |
| Friend .29 | Anxiety .57 | Disc. | Achieve | |
| Anger | Tent. | Leisure | ||
| Sad | Money . | |||
| Death |
The pairs of data in bold mark the condition that the correlation was significant and the coefficient became higher when using status count as the SCLIWC scoring method.
‘p < 0.10
*p < 0.05
**p < 0.01.
The mean sensitivity, specificity, positive predictive value and negative predictive value of SCLIWC in single Weibo status scoring (N = 11).
| Self-references | .63(.14) | .89(.10) | .78(.22) | .76(.12) |
| Family | .40(.27) | .97(.03) | .32(.28) | .98(.02) |
| Friend | .21(.31) | .99(.01) | .56(.39) | .92(.06) |
| Positive emotion | .84(.19) | .69(.08) | .25(.20) | .96(.05) |
| Negative emotion | .65(.14) | .83(.07) | .40(.12) | .93(.04) |
| Anxiety | .81(.23) | .97(.01) | .38(.22) | .99(.01) |
| Anger | .54(.27) | .95(.03) | .41(.30) | .96(.04) |
| Sad | .49(.28) | .95(.05) | .34(.29) | .98(.01) |
| Insight | .59(.27) | .76(.10) | .19(.21) | .91(.19) |
| Causation | .87(.25) | .86(.08) | .16(.13) | .99(.01) |
| Discrepancy | .45(.31) | .67(.12) | .05(.03) | .96(.04) |
| Tentativeness | .65(.39) | .72(.10) | .02(.03) | .99(.01) |
| Biological processes | .83(.13) | .77(.09) | .54(.15) | .93(.07) |
| Work | .71(.19) | .82(.09) | .38(.19) | .95(.04) |
| Achievement | .62(.24) | .84(.07) | .20(.13) | .97(.02) |
| Leisure | .63(.15) | .86(.08) | .56(.16) | .88(.10) |
| Money | .77(.15) | .97(.03) | .64(.22) | .99(.01) |
| Death | .71(.41) | .97(.02) | .39(.41) | .995(.01) |
The standard deviation of each mean in this table was shown in parenthesis.