| Literature DB >> 29389979 |
Abstract
In this study, we leverage human evaluations, content analysis, and computational modeling to generate a comprehensive analysis of readers' evaluations of authors' communication quality in social media with respect to four factors: author credibility, interpersonal attraction, communication competence, and intent to interact. We review previous research on the human evaluation process and highlight its limitations in providing sufficient information for readers to assess authors' communication quality. From our analysis of the evaluations of 1,000 Twitter authors' communication quality from 300 human evaluators, we provide empirical evidence of the impact of the characteristics of the reader (demographic, social media experience, and personality), author (profile and social media engagement), and content (linguistic, syntactic, similarity, and sentiment) on the evaluation of an author's communication quality. In addition, based on the author and message characteristics, we demonstrate the potential for building accurate models that can indicate an author's communication quality.Entities:
Mesh:
Year: 2018 PMID: 29389979 PMCID: PMC5794137 DOI: 10.1371/journal.pone.0192061
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Overall study procedure.
Instruments to measure communication quality.
We used a 5-point Likert scale for each question (1: Strongly Disagree—5: Strongly Agree). Cronbach’s α results indicate high internal consistency among question items. In interpersonal attraction, the questions for IA1 and IA2 relate to social attraction and those for IA3 and IA4 relate to task attraction.
| Type | Question (1: Strongly Disagree, 5: Strongly Agree) | α | |
|---|---|---|---|
| SC1 | The author seems honest | 0.89 | |
| SC2 | The author seems trustworthy | ||
| SC3 | The author seems genuine | ||
| IA1 | The author could be a friend of mine | 0.92 | |
| IA2 | I could establish a personal friendship with the author | ||
| IA3 | I have confidence in the author’s ability to get the job done | ||
| IA4 | If I wanted to get things done, I could probably depend on the author | ||
| CC1 | The author seems effective in accomplishing what was set out to do | 0.92 | |
| CC2 | The author’s tweets seem easy to understand | ||
| CC3 | The author’s tweets seem written in a confident style | ||
| INT1 | If the topic matches my interest, I may want to follow the author | 0.93 | |
| INT2 | If the topic matches my interest, I may want to receive tweet updates from this author | ||
Correlations among four communication quality factors.
| Credibility | 1.00 | |||
| Attraction | 0.84 | 1.00 | ||
| Competence | 0.84 | 0.86 | 1.00 | |
| Intent | 0.76 | 0.81 | 0.76 | 1.00 |
*p < 0.05
Standardized linear regression coefficients of authors’ communication quality with evaluators’ (readers’) gender, age, social media use length, frequency, and the five features of personality.
The Durbin-Watson results are close to 2.0, showing that there is no presence of autocorrelation in the residuals. The VIF results for all predictors are less than 4.0.
| Category | Credibility | Attraction | Competence | Intent | Comm Quality |
|---|---|---|---|---|---|
| Age | -0.102 | -0.144 | -0.090 | -0.175 | -0.134 |
| Gender | 0.037 | 0.040 | 0.043 | 0.050 | 0.042 |
| SM use length | 0.024 | -0.002 | 0.026 | 0.067 | 0.026 |
| SM use frequency | 0.135 | 0.137 | 0.166 | 0.096 | 0.146 |
| Extraversion | -0.006 | -0.059 | 0.010 | -0.021 | -0.026 |
| Agreeableness | 0.053 | 0.057 | 0.076 | 0.034 | 0.066 |
| Conscientiousness | -0.037 | 0.022 | -0.018 | -0.053 | -0.018 |
| Openness | 0.004 | 0.041 | 0.001 | -0.024 | 0.011 |
| Neuroticism | -0.011 | -0.041 | -0.035 | -0.020 | -0.035 |
| Durbin-Watson | 1.979 | 1.988 | 2.001 | 2.048 | 2.059 |
†p < 0.10
*p < 0.05
Stepwise regression result of Table 3.
The Durbin-Watson results are close to 2.0, showing that there is no presence of autocorrelation in the residuals. The VIF results for all predictors are less than 4. Even we controlled for age, social media use frequency showed a significant influence on all components of communication quality and itself.
| Category | Credibility | Attraction | Competence | Intent | Comm Quality |
|---|---|---|---|---|---|
| Age | - | -0.123 | - | -0.172 | -0.116 |
| SM use frequency | 0.128 | 0.121 | 0.142 | - | 0.121 |
| Durbin-Watson | 1.979 | 1.988 | 2.001 | 2.048 | 2.059 |
| 0.09 | 0.06 | 0.07 | 0.08 | 0.07 | |
| 5.188 | 5.389 | 6.412 | 9.531 | 5.031 |
*p < 0.05
Standardized linear regression coefficients of authors’ communication quality with linguistic characteristics from LIWC and features unique in social media.
The Durbin-Watson results are close to 2.0, showing that there is no presence of autocorrelation in the residuals. The VIF results for all predictors are less than 4.0.
| Category | Credibility | Attraction | Competence | Intent | Comm Quality |
|---|---|---|---|---|---|
| Word count | -0.058 | 0.065 | -0.001 | 0.008 | 0.001 |
| Word per sentence | -0.020 | -0.028 | -0.040 | -0.117 | -0.054 |
| Sixltr words | 0.153 | 0.162 | 0.152 | 0.078 | 0.155 |
| Dictionary | -0.068 | -0.041 | 0.049 | 0.028 | -0.040 |
| 1st person singular | -0.038 | -0.032 | -0.021 | -0.066 | -0.046 |
| 1st person plural | -0.022 | -0.033 | -0.074 | -0.021 | -0.043 |
| 2nd person | -0.023 | -0.036 | -0.025 | -0.041 | -0.036 |
| 3rd person plural | -0.011 | -0.025 | -0.046 | -0.005 | -0.026 |
| 3rd person overall | 0.043 | 0.042 | 0.014 | -0.007 | 0.031 |
| Impersonal pronouns | 0.017 | 0.044 | 0.037 | 0.015 | 0.034 |
| Articles | 0.126 | 0.156 | 0.107 | 0.063 | 0.117 |
| Auxiliary verbs | -0.045 | -0.020 | -0.054 | 0.079 | -0.020 |
| Adverbs | -0.001 | -0.032 | 0.003 | 0.056 | 0.000 |
| Adjectives | 0.013 | 0.016 | 0.037 | 0.013 | 0.022 |
| Past tense | 0.008 | -0.007 | -0.036 | -0.012 | -0.012 |
| Present tense | 0.036 | 0.075 | 0.030 | -0.062 | 0.035 |
| Future tense | 0.024 | 0.044 | 0.021 | 0.014 | 0.032 |
| Prepositions | 0.124 | 0.092 | 0.079 | 0.065 | 0.106 |
| Conjunctions | 0.061 | 0.077 | 0.065 | -0.045 | 0.056 |
| Negations | -0.056 | -0.077 | -0.029 | -0.089 | -0.071 |
| Numbers | 0.073 | 0.079 | 0.056 | -0.032 | 0.060 |
| Quantifiers | 0.087 | 0.061 | 0.048 | -0.072 | 0.047 |
| Swear words | -0.067 | -0.014 | -0.080 | -0.005 | -0.048 |
| Average # hashtags | -0.091 | -0.043 | -0.044 | 0.064 | -0.042 |
| Average # URLs | 0.094 | 0.053 | 0.073 | 0.169 | 0.079† |
| Average # mentions | -0.098 | -0.048 | -0.086 | -0.115 | -0.093 |
| Average # retweets | -0.013 | -0.022 | -0.053 | -0.010 | -0.029 |
| Average # questions | -0.071 | -0.112 | -0.074 | -0.059 | -0.095 |
| Average # exclamations | -0.011 | -0.016 | -0.015 | -0.008 | -0.015 |
| Average # stocks | -0.031 | -0.001 | -0.004 | 0.009 | -0.009 |
| Average # emojis | -0.043 | -0.031 | -0.030 | 0.026 | -0.027 |
| Average tweet length | 0.001 | -0.041 | 0.011 | 0.040 | -0.002 |
| Average sentiment | 0.073 | 0.069 | 0.043 | 0.065 | 0.068 |
| Tweets similarity | -0.170 | -0.146 | -0.152 | -0.166 | -0.179 |
| Durbin-Watson | 1.434 | 1.353 | 1.434 | 1.208 | 1.401 |
†p < 0.10
*p < 0.05
Stepwise regression result of Table 5.
The VIF results for all predictors are less than 4.0.
| Category | Credibility | Attraction | Competence | Intent | Comm Quality |
|---|---|---|---|---|---|
| Sixltr words | 0.148 | 0.139 | 0.136 | - | 0.140 |
| Articles | 0.132 | 0.142 | 0.126 | - | 0.120 |
| Average sentiment | 0.080 | 0.128 | - | 0.133 | 0.073 |
| Tweet similarity | -0.165 | -0.149 | -0.146 | -0.168 | -0.171 |
| Prepositions | 0.128 | 0.090 | 0.111 | - | 0.096 |
| Average # mentions | -0.124 | -0.085 | -0.134 | -0.088 | -0.125 |
| Swear words | -0.070 | - | -0.079 | - | - |
| Average # questions | -0.067 | -0.106 | -0.061 | - | -0.087 |
| Average # hashtags | -0.079 | - | - | - | - |
| Quantifiers | 0.088 | 0.067 | - | - | - |
| Average # URLs | - | - | - | 0.144 | - |
| Negations | - | - | - | -0.086 | - |
| WPS | - | - | - | -0.114 | - |
| Durbin-Watson | 1.428 | 1.314 | 1.422 | 1.210 | 1.415 |
| 0.123 | 0.128 | 0.113 | 0.103 | 0.119 | |
| 3.811 | 3.976 | 3.472 | 3.124 | 3.675 |
*p < 0.05
Standardized linear regression coefficients of communication quality with tweet features for the consensus analysis after controlling for reader’s characteristics.
The Durbin-Watson results are close to 2.0, showing that there is no presence of autocorrelation in the residuals. The VIF results for all predictors are less than 4.0. Higher coefficient means less consensus among the respondents.
| Category | Credibility | Attraction | Competence | Intent | Comm Quality |
|---|---|---|---|---|---|
| Average # hashtags | 0.043 | 0.055 | 0.025 | 0.099 | 0.058 |
| Average # URLs | 0.036 | 0.028 | 0.023 | 0.059 | 0.018 |
| Average # mentions | 0.088 | 0.089 | 0.078 | 0.079 | 0.093 |
| Average # retweets | 0.022 | 0.026 | 0.027 | 0.056 | 0.034 |
| Average # questions | 0.038 | 0.030 | 0.020 | 0.012 | 0.018 |
| Average # exclamations | 0.070 | 0.107 | 0.087 | 0.071 | 0.095 |
| Average # stocks | 0.013 | 0.024 | 0.029 | 0.040 | 0.028 |
| Average # emojis | 0.023 | 0.017 | 0.024 | 0.057 | 0.030 |
| Average sentiment | 0.052 | 0.094 | 0.076 | 0.069 | 0.082 |
| Tweets similarity | 0.213 | 0.224 | 0.216 | 0.198 | 0.236 |
| Durbin-Watson | 1.200 | 1.145 | 1.248 | 1.094 | 1.042 |
†p < 0.10
*p < 0.05
Stepwise regression result of Table 7.
The VIF results for all predictors are less than 4.0.
| Category | Credibility | Attraction | Competence | Intent | Comm Quality |
|---|---|---|---|---|---|
| Average # hashtags | - | - | - | 0.115 | - |
| Average # mentions | 0.092 | 0.081 | 0.093 | 0.105 | 0.109 |
| Average # exclamations | 0.089 | 0.111 | 0.092 | 0.067 | 0.110 |
| Average sentiment | 0.089 | 0.095 | 0.074 | 0.071 | 0.082 |
| Tweets similarity | 0.214 | 0.210 | 0.208 | 0.182 | 0.225 |
| Durbin-Watson | 1.200 | 1.145 | 1.248 | 1.094 | 1.042 |
| 0.07 | 0.08 | 0.08 | 0.07 | 0.07 | |
| 15.43 | 20.94 | 14.82 | 13.74 | 18.07 |
*p < 0.05
Fig 2Three examples of tweets that yielded overall lowest consensus (high disagreement).
First example (top) shows the tweets consisting of only hashtags and links. Second example (mid) shows the tweets with a high similarity score, mentions, and hashtags. Third example (bottom) shows the tweets that were written similarly. User identifiable information (mentions) is anonymized.
Fig 3Example of tweets that yielded overall highest consensus (high agreement).
Summary of accuracy, precision, recall, and F1 scores of the models based on different types of the features.
Models with the tweet features yielded better performance than those with the author features. When all features were used, the models showed the best. Best model for each case is highlighted.
| Feature type | Model | Accuracy | Precision | Recall | F1 |
|---|---|---|---|---|---|
| LR | 0.74 | 0.75 | 0.74 | 0.74 | |
| RF | 0.92 | 0.92 | 0.92 | 0.92 | |
| SVM | 0.90 | 0.90 | 0.91 | 0.90 | |
| ADT | 0.90 | 0.90 | 0.91 | 0.91 | |
| LR | 0.71 | 0.71 | 0.72 | 0.72 | |
| RF | 0.84 | 0.84 | 0.84 | 0.84 | |
| SVM | 0.78 | 0.82 | 0.78 | 0.78 | |
| ADT | 0.83 | 0.84 | 0.83 | 0.84 | |
| LR | 0.76 | 0.76 | 0.75 | 0.76 | |
| RF | 0.94 | 0.95 | 0.93 | 0.94 | |
| SVM | 0.94 | 0.94 | 0.93 | 0.93 | |
| ADT | 0.94 | 0.94 | 0.93 | 0.93 |
Fig 4Result of ROC curve (left) and Precision-Recall Curve (right).
RF and ADT yielded much greater performance than LR and SVM.
Fig 5Top ten important features from the RF model.
Author- and tweet-based feature comparison between the high and the low communication quality groups at p < 0.05 (sorted by F-values).
| Type | Feature | Low | High | t(505) |
|---|---|---|---|---|
| Author | Author Registration Age (days) | 1694.86 | 2357.09 | 40.57 |
| Author # favorites | 18505.79 | 495127 | 9.37 | |
| Tweet | Tweet similarity | 0.49 | 0.32 | 46.02 |
| Articles | 2.01 | 3.07 | 18.29 | |
| 1st person singular | 1.49 | 0.59 | 11.84 | |
| Prepositions | 7.97 | 9.26 | 11.02 | |
| Sixltr words | 23.36 | 25.91 | 10.53 | |
| Swear words | 0.42 | 0.11 | 8.09 | |
| Average # questions | 0.13 | 0.06 | 5.02 | |
| Negations | 0.79 | 0.53 | 4.26 |