| Literature DB >> 31023299 |
Hai Liang1,2, Isaac Chun-Hai Fung3,4,5, Zion Tsz Ho Tse6, Jingjing Yin3, Chung-Hong Chan2, Laura E Pechta7, Belinda J Smith8, Rossmary D Marquez-Lameda7, Martin I Meltzer4, Keri M Lubell7, King-Wa Fu9.
Abstract
BACKGROUND: Information and emotions towards public health issues could spread widely through online social networks. Although aggregate metrics on the volume of information diffusion are available, we know little about how information spreads on online social networks. Health information could be transmitted from one to many (i.e. broadcasting) or from a chain of individual to individual (i.e. viral spreading). The aim of this study is to examine the spreading pattern of Ebola information on Twitter and identify influential users regarding Ebola messages.Entities:
Keywords: Broadcast model; Ebola; Network analysis; Social media; Viral diffusion model
Mesh:
Year: 2019 PMID: 31023299 PMCID: PMC6485141 DOI: 10.1186/s12889-019-6747-8
Source DB: PubMed Journal: BMC Public Health ISSN: 1471-2458 Impact factor: 3.295
Fig. 1An example of information cascade and the key measures. In this example, the cascade size is 8, the scale is 4/8 = 50%, and the depth is 3
Definition of three metrics that describe an information cascade
| Metrics | Definitions |
|---|---|
| Cascade size | The number of total retweets received by an original tweet. The cascade size describes the popularity of the seed message |
| Cascade scale | The percentage of all retweets that were retweeted directly from the original tweet. The higher the percentage is, the more likely the diffusion cascade is dominated by the broadcast model |
| Cascade depth | The number of generations in a diffusion path. A large depth value may suggest a long chain of information diffusion and thus implies viral spreading |
Fig. 2An illustration of the reconstruction of a diffusion path. From the Twitter API, we know that user A retweeted a message from user C. User A follows 4 users: B1-B4. Among the followees, users B2 and B3 follow user C and retweeted the same message from user C at time 1 and time 2 respectively. If time 1 is more recent than time 2, we will say that A retweeted C through B2 and information diffused from C to A via B2
Definitions of degree centrality and authority
| Metrics | Definitions |
|---|---|
| Degree centrality | The total number of links of an individual in a network. In a network of followers, this will be the number of followers a user has |
| Authority | The relative importance of a node in a network. In this paper, we measure the authority of a user by calculating the ratio of the number of followees to the number of followers, and the ratio of the number of retweets received from others to the number of retweets the user posted |
Two dimensions of authority and definitions of four user types
| First, we defined two dimensions of authority to classify users into four categories (2 × 2): | |
|---|---|
| a. Followee-follower ratio | The first dimension is the ratio of the number of followees to the number of followers. Users are classified as either ratio > 1 or ≤ 1. |
| b. Retweeted-retweeting ratio | The second dimension is the ratio of the number of retweets received from others to the number of retweets the user posted. Users are classified as either ratio > 1 or ≤ 1. |
| We expect that users, who have more followers than followees, should have more retweets by their own followers than they retweeting their followees’ tweets. Likewise, we expect that users, who have fewer followers than followees, should have fewer retweets by their own followers than they retweeting their followees’ tweets. | |
| Therefore, according to the two dimensions, we defined four types of users: | |
| a. Disseminators (also named as “Broadcasters” by Gonzalez-Bailon et al. [ | followees ≤ followers & being retweeted ≤ retweeting |
| b. Common users | followees > followers & being retweeted ≤ retweeting |
| c. Influential users | followees ≤ followers & being retweeted > retweeting |
| d. Hidden influential users | followees > followers & being retweeted > retweeting |
Fig. 3The probability distribution of normalized structural virality of information cascades of 192,209 original tweets with more than 10 retweets each, selected from a data set of 36,931,362 Ebola-related tweets from March 23, 2014 to May 31, 2015
Number of Twitter users (percentage of all users, n = 4,925,730) in four categories defined according to the following and retweeting characteristics of the users who tweeted about Ebola from March 23, 2014 to May 31, 2015
| One’s tweets being retweeted ≤ Retweeting others’ tweets | One’s tweets being retweeted > Retweeting others’ tweets | |
|---|---|---|
| Followees ≤ | Disseminators | Influential Users |
| Followees > | Common Users | Hidden Influential Users |
Note: “Followees” refers to the number of Twitter accounts that a Twitter user followed. “Followers” refers to the number of Twitter users who followed a Twitter user’s account. “One’s tweets being retweeted” refers to the number of times a Twitter user’s tweet was retweeted by others. “Retweeting others’ tweets” refers to the number of times a Twitter user retweeted another users’ tweets
Cascade size, structural virality and normalized structural virality of information cascades created by four different categories of users who tweeted about Ebola from March 23, 2014 to May 31, 2015
| Categories of users who created the information cascades | Percentage of total cascades | Cascade size (Q1, median, Q3) | Structural virality (Q1, median, Q3) | Normalized structural virality (Q1, median, Q3) |
|---|---|---|---|---|
| Influential users | 91.6% |
| (1.89, 1.98, 2.15) | (0.00, 0.00, 0.04) |
| Hidden influential users | 7.1% |
| (1.93, 2.09, 2.61) | (0.01, 0.04, 0.15) |
| Disseminators | 0.6% | (12, 13, 16) | (1.92, 2.15, 2.64) |
|
| Common users | 0.7% | (12, 14, 18) | (1.98, 2.28, 2.86) |
|
Note: Q1: First quartile (25%); Q3: Third quartile (75%). See the User classification section in the Methods for the definition of disseminators, common users, hidden influential users, and influential users
Information flow, as represented by frequencies of retweets and the expected numbers in bracket, among four categories of Twitter users who tweeted about Ebola from March 23, 2014 to May 31, 2015
| From-To | Disseminators | Common users | Hidden influential users | Influential users |
|---|---|---|---|---|
| Disseminators |
| 146,719 |
|
|
| Common users | 82,712 | 143,088 |
|
|
| Hidden influential users | 174,208 |
|
|
|
| Influential users |
|
| 107,264 | 351,598 |
Note: The numbers in parentheses are the expected values. The cells where empirical values are larger than the expected values are written in italics. The expected numbers were calculated by cross-tabulation analysis by assuming columns and rows are independent. The analysis was based on the 12,426,623 retweets