| Literature DB >> 30059512 |
Tao Wang1,2, Markus Brede3, Antonella Ianni4, Emmanouil Mentzakis4.
Abstract
Online health communities facilitate communication among people with health problems. Most prior studies focus on examining characteristics of these communities in sharing content, while limited work has explored social interactions between communities with different stances on a health problem. Here, we analyse a large communication network of individuals affected by eating disorders on Twitter and explore how communities of individuals with different stances on the disease interact online. Based on a large set of tweets posted by individuals who self-identify with eating disorders online, we establish the existence of two communities: a large community reinforcing disordered eating behaviours and a second, smaller community supporting efforts to recover from the disease. We find that individuals tend to mainly interact with others within the same community, with limited interactions across communities and inter-community interactions characterized by more negative emotions than intra-community interactions. Moreover, by studying the associations between individuals' behavioural characteristics and interpersonal connections in the communication network, we present the first large-scale investigation of social norms in online health communities, particularly on how a community approves of individuals' behaviours. Our findings shed new light on how people form online health communities and can have broad clinical implications on disease prevention and online intervention.Entities:
Mesh:
Year: 2018 PMID: 30059512 PMCID: PMC6066201 DOI: 10.1371/journal.pone.0200800
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1(a) Distributions of average Silhouette scores with different k values in k-means. Each box shows the quartiles of the scores obtained in 100 rounds running, and the whiskers show the rest of a distribution. (b) and (c) The most frequent hashtags and their co-occurrence networks used by two groups of users in ED-related tweets respectively. Each node is a hashtag and its size is proportional to the frequency of the tag used in a group. Edge width is proportional to the number of co-occurrences of two hashtags in tweets. (d) Average relative sentiments of two groups on different themes: “pro-ED” where each tweet contains a pro-ED hashtag without pro-recovery tags; “pro-recovery” where each tweet has a pro-recovery hashtag without pro-ED tags; “mixed” where a tweet has both pro-ED and pro-recovery tags; and “unspecified” where a tweet has neither a pro-ED nor a pro-recovery tag. Error bars denote 95% CI. Mann-Whitney U tests are used to assess the differences of sentiments between two groups on each theme. All p-values for “pro-ED”, “pro-recovery” and “unspecified” themes are p < 0.001, while no significant difference occurs for the “mixed” theme (see SI).
Fig 2(a) The communication network of users in pro-ED and pro-recovery communities, laid out by ForceAtlas2 [44]. Each node represents a user and edges represent mentioning or replying relationships. Red nodes denote pro-ED users and blue nodes denote pro-recovery users. Node size is proportional to in-degree. (b) Average relative sentiments of intra- and inter-community messages S↻ and S↷ sourced from pro-ED (ED) and pro-recovery (Rec) communities respectively. Error bars denote 95% CI. Differences between S↻ and S↷ are significant (p < 0.01) in U tests in both two communities.
Statistics of the communication networks among pro-ED and pro-recovery communities.
Total number of nodes (N); number of edges (E); average degree per node (〈k〉); average shortest path length of connected node pairs (L); number of weakly connected components (#Comp.); ratio of nodes in the giant connected component (GCR); reciprocity measuring the likelihood of nodes with mutual links (R); global clustering coefficient (or transitivity) measuring the probability that two neighbours of a node are connected (C); assortativity coefficient of degree measuring the preference for nodes to link to others with similar degree values (A). Degree assortativity measured here are the correlations between source out-degree and destination in-degree [43], and z denotes the z-score of a property X observed in an empirical network compared to those observed in null models, i.e., randomized networks by preserving the degrees of the empirical network [45].
| Network | 〈 | # | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pro-ED | 5,708 | 9,023 | 1.58 | 10.76 | 114 | 97.8% | 0.03 | 0.01 | -0.13 | 90.45 | 0.33 | -12.16 |
| Pro-Rec. | 461 | 1,666 | 3.61 | 3.95 | 62 | 84.2% | 0.16 | 0.19 | -0.13 | 20.12 | 10.62 | -5.57 |
| Entire | 6,169 | 11,056 | 1.79 | 10.20 | 1 | 100.0% | 0.05 | 0.03 | -0.14 | 113.41 | 32.14 | -14.45 |
Comparing communities in social activities and language use, where measures on language use count the percentages of words that reflect different psychometric properties, such as concerns, emotions and thinking styles, in a user’s historical tweets.
Two-sided Mann-Whitney U tests evaluate differences between groups, significance levels with Bonferroni correction: * p < 0.05/m; ** p < 0.01/m; *** p < 0.001/m where m = 22.
| #Followees | Number of total followees | 543.70 ± 1,096.09 | 1,561.58 ± 4,022.53 | -14.82 | 0.000 *** |
| #Tweets | Number of total tweets | 2,573.03 ± 6,515.83 | 5,485.02 ± 10,166.14 | -9.24 | 0.000 *** |
| #Followers | Number of total followers | 1,339.11 ± 27,384.37 | 16,299.21 ± 128,844.20 | -17.52 | 0.000 *** |
| #Followees/day | Average number of followees per day | 2.22 ± 7.23 | 1.72 ± 6.18 | 3.36 | 0.001 * |
| #Tweets/day | Average number of tweets per day | 5.48 ± 9.01 | 4.13 ± 8.73 | 7.69 | 0.000 *** |
| #Followers/day | Average number of followers per day | 2.41 ± 12.36 | 7.51 ± 46.21 | -3.81 | 0.000 ** |
| %Re-tweet | Ratio of re-tweets in historical posts | 0.30 ± 0.20 | 0.21 ± 0.18 | 9.76 | 0.000 *** |
| %Mention | Ratio of posts with mentions | 0.31 ± 0.17 | 0.36 ± 0.20 | -5.08 | 0.000 *** |
| %Reply | Ratio of posts with replies | 0.10 ± 0.09 | 0.15 ± 0.13 | -7.10 | 0.000 *** |
| Entropy of re-tweeting others | 4.28 ± 1.20 | 3.36 ± 1.08 | 16.97 | 0.000 *** | |
| Entropy of mentioning others | 2.33 ± 1.09 | 2.91 ± 1.04 | -11.76 | 0.000 *** | |
| Entropy of replying others | 2.71 ± 1.21 | 2.87 ± 1.05 | -3.09 | 0.002 * | |
| Body | Concerns of body image | 0.02 ± 0.01 | 0.01 ± 0.01 | 25.68 | 0.000 *** |
| Ingest | Concerns of ingestion | 0.03 ± 0.02 | 0.02 ± 0.02 | 13.41 | 0.000 *** |
| Health | Concerns of health | 0.02 ± 0.01 | 0.02 ± 0.01 | 0.97 | 0.334 |
| I | 1st personal singular use | 0.11 ± 0.03 | 0.04 ± 0.03 | 30.94 | 0.000 *** |
| We | 1st personal plural use | 0.00 ± 0.00 | 0.01 ± 0.01 | -25.52 | 0.000 *** |
| Social | Social concerns | 0.08 ± 0.02 | 0.11 ± 0.03 | -19.78 | 0.000 *** |
| Swear | Abusive language | 0.01 ± 0.01 | 0.00 ± 0.00 | 28.84 | 0.000 *** |
| Negate | Negation use | 0.03 ± 0.01 | 0.02 ± 0.01 | 25.86 | 0.000 *** |
| Posemo | Positive emotions | 0.05 ± 0.01 | 0.06 ± 0.02 | -18.34 | 0.000 *** |
| Negemo | Negative emotions | 0.04 ± 0.01 | 0.02 ± 0.01 | 26.14 | 0.000 *** |
Fig 3Parameters estimates β and 95% confidence intervals for effects of an attribute on PageRank centralities in pro-ED and pro-recovery communities, estimated using robust linear models with controls on social capital covariates (see Methods).
Coefficients at significance level p < 0.05 are labelled with an asterisk. (Prostr) is the strength that a user promotes a pro-ED or pro-recovery tendency, measured by the average sentiment of the user on pro-ED or pro-recovery content in tweets (see SI).