Fulian Yin1, Hongyu Pang1, Xinyu Xia1, Xueying Shao1, Jianhong Wu2. 1. College of Information and Communication Engineering, Beijing, 100024, PR China. 2. Fields-CQAM Laboratory of Mathematics for Public Health, Laboratory for Industrial and Applied Mathematics, York University, Toronto, M3J1P3, Canada.
Abstract
The outbreak of a novel coronavirus (COVID-19) aroused great public opinion in the Chinese Sina-microblog. To help in designing effective communication strategies during a major public health emergency, we analyze the real data of COVID-19 information and propose a comprehensive susceptible-reading-forwarding-immune (SRFI) model to understand the patterns of key information propagation considering both public contact and participation. We develop the SRFI model, based on the public reading quantity and forwarding quantity that denote contact and participation respectively, and take into account the behavior that users may re-enter another related topic during the attention phase or the participation phase freely. Data fitting using the real data of both reading quantity and forwarding quantity obtained from Chinese Sina-microblog can parameterize the model to make an accurate prediction of the COVID-19 public opinion trend until the next major news item occurs, and the sensitivity analysis provides the basic strategies for communication.
As of 24:00 on April 16, 2020, 2,352,198 cases of novel coronavirus pneumonia (COVID-19) had been reported worldwide since the disease was first found in Wuhan. Major news items have generated strong fluctuations in public opinion. For example, on January 28, 2020, Nanshan Zhong, a well-known expert in infectious disease control, emphasized that people should not go out for the time being [1]; this appeal attracted such wide attention that people said they would go out only when Nanshan Zhong allowed it [2]. Sina-microblog is the most popular social network in China [3], and outbreak-related topics about COVID-19 grew exponentially on that platform. As the reading quantity and forwarding quantity of “Nanshan Zhong” have reached almost 9.40 billion and 2.05 million on Sina-microblog, understanding how public contact and participation spread in social media and alter public behavior is important for designing effective communication strategies for the rapid implementation of public health interventions.

Fig. 1 shows the schematic diagram of COVID-19 information propagation in Sina-microblog, where the nodes represent users in different states. Original post owners (green nodes) can publish single or multiple epidemic topics belonging to different news items. Take Headline News and Pear Video as examples: Headline News reports on multiple topics, whereas Pear Video focuses on a particular topic, and both can be read or forwarded by users attracted by these topics. Reading users (blue nodes) can stay silent and then leave, or can forward after reading and become forwarding users (orange nodes). When users finish one topic, they may join others. Forwarding users can later return to the topics and forward multiple messages, becoming cross-forwarding users (pink nodes), which leads to a multi-level information diffusion process. All users (readers and forwarding users) can choose to read or forward one or multiple topics, so information propagates through one or multiple topics. This rapidly promotes the dissemination of COVID-19 information.
Fig. 1
Information propagation considering both reading and forwarding on multiple topics of the COVID-19.
To the best of our knowledge, there is no appropriate model framework that can be used to analyze public contact propagation and public participation propagation together during a major public health emergency. Given the urgent need to develop theoretical knowledge and practical technologies that support effective communication of public health interventions, we propose a comprehensive susceptible–reading–forwarding–immune (SRFI) model based on both reading quantity and forwarding quantity, which represent public contact and participation respectively, to analyze the public opinion propagation of COVID-19. In particular, we consider the characteristic behavior that users may participate repeatedly in reading or forwarding on different topics of COVID-19 information dissemination.

Traditionally, because rumor spreading resembles epidemic spreading in several ways, many scholars have used the susceptible–infected (SI) model [4], [5], the susceptible–infected–recovered (SIR) model [6], [7], the susceptible–exposed–infected–recovered (SEIR) model [8], [9] and the susceptible–infected–susceptible (SIS) model [10] to represent rumor propagation and address relevant issues. Generally, information propagation in social networks was analyzed by stratifying users into three classes: those who have not heard the rumor (ignorants), those actively spreading the rumor (spreaders) and those no longer spreading the rumor (stiflers) [11]. Scholars then introduced new modules into the classical models to better understand the process of information dissemination and to achieve various research purposes. In 2011, Zhao et al. [12] provided a more detailed and realistic description of the rumor spreading process by combining a forgetting mechanism with the SIR model of epidemics on an online social blogging platform called LiveJournal. In 2012, Zhao et al. [13] extended the classical SIR rumor spreading model by adding a direct link from ignorants to stiflers and a new kind of people, hibernators, in order to reduce the maximum rumor influence; this was called the susceptible–infected–hibernator–removed (SIHR) model. In 2012, Xiong et al. [14] proposed a susceptible–contacted–infected–refractory (SCIR) diffusion model, which contained four possible states to characterize information propagation on online microblogs. In 2014, Zhao et al. [15] added a refutation mechanism in homogeneous social networks to the basic model using the Runge–Kutta method, which could help authorities reduce the maximum influence of the rumor. Zhang et al. [16] emphasized a special rumor spreading characteristic called “the cumulative effects of memory”, added a memory mechanism, and simulated the rumor spreading process on Sina-Microblog. Rui et al. [17] proposed a susceptible–potential–infective–removed (SPIR) model that introduces a potential spreader set, which made the state-changing mechanism more reasonable and accurate for the diffusion process.

In addition, Zhang et al. [18] used an improved SIR model to posit that a coupled network comprises two categories of nodes, and then used data collected from Weibo and WeChat for an actual news event to visualize the information spread process in the cross-network dissemination of public opinion. Huang et al. [19] established a human dynamics model for deducing retweeting behavior and investigated it with data gathered through the Sina API, which revealed that the probability of a message being browsed over time follows a power-law distribution. In 2018, Zan [20] studied the spreading of two rumors with different launch times and introduced two models, the double-susceptible–infected–recovered (DSIR) model and the comprehensive-DSIR (C-DSIR) model, which focus on the influence of the old rumor on the new rumor and on the propagation of two rumors posted successively. In 2019, we [21] proposed an epidemic model called susceptible–forwarding–immune (SFI) to capture the propagation trend of a single piece of information in the Sina-microblog based on the forwarding quantity of users. Trpevski D et al. [22], Qian et al. [23] and Wang et al. [24] also proposed improved models for the spread of information. In particular, Tanaka M et al. [25] added a new module to the traditional model using datasets from the Japanese Mixi and Facebook.

Summarizing the literature, we find that most of the experimental data used for simulation come from well-known social platforms such as Twitter and Facebook. Furthermore, scholars from different countries may choose a local social platform in order to obtain data more conveniently, especially Sina-microblog in China. By analyzing the data sources of other papers, we find that existing studies have used the forwarding quantity of one or multiple Weibo posts, the content of rumors and the browsing behavior of users. To the best of our knowledge, they have not introduced a module that combines the reading quantity and forwarding quantity of topics, which are significant indicators of public contact and participation.

The paper is organized as follows. In Section 2, we analyze the public opinion data at crucial moments of COVID-19. In Section 3, we introduce the mathematical model for information contact and participation, fit the model with the real data of two typical topics, make a staged prediction of the overall public opinion trend, then conduct a parameter sensitivity analysis and give effective intervention strategies for information contact and participation. In Section 4, we draw a conclusion.
COVID-19 information contact and participation analysis
Information reading and forwarding topology
The emergency is often accompanied by a number of topics related to COVID-19, and reading and forwarding behavior usually drives the propagation of the event. Reading reflects users’ contact in information propagation, and forwarding reflects users’ participation in information propagation. To show the propagation process of both public contact and participation more clearly, Fig. 2 gives the network topology, which describes the state of each node in the network at a certain moment in the process of information dissemination. Taking the propagation of three original post owners under three topics as an example, it gives the overall state of users in integrated information propagation.
Fig. 2
Network topology for COVID-19 propagation by reading and forwarding.
The information posted by the original post owners (red nodes) can be read separately by single readers with an interest in it (blue nodes), who can then participate in forwarding information about the outbreak (black nodes) or choose to be silent (yellow nodes). In particular, because the topics about the epidemic are correlated, some co-spreaders repeatedly read and forward related information across different topics and become cross-reading users (pink nodes) and cross-forwarding users (green nodes). Of course, many readers who contact the information choose to be silent, such as the un-forwarding users (yellow nodes). In the real world, the amount of information changes dynamically and cannot be counted exactly.

The reading quantity and forwarding quantity of the whole COVID-19 event are composed of many topics, each with multiple pieces of information. Unlike traditional public hot events, the outbreak has caused great public concern. With the continuous development of COVID-19, there is a high level of repetition in public reading and forwarding on different topics. As reading is a measure of contact and forwarding is a measure of participation in information dissemination, in this paper we build the comprehensive susceptible–reading–forwarding–immune (SRFI) dynamics model, which considers the impact of repeated behavior, including “re-reading” and “re-forwarding”, on public opinion propagation.
Information contact and participation analysis
Since the outbreak of COVID-19, around 7900 major topics have appeared in Sina-microblog. Fig. 3 shows the cumulative reading quantity and forwarding quantity from December 31, 2019 to April 16, 2020, where the ordinate is the logarithm of the cumulative reading and forwarding quantities. From December 31, 2019 to January 16, 2020, there were only a limited number of hot topics about COVID-19. From January 17, 2020, the hot topics about the epidemic gradually increased, and after Nanshan Zhong confirmed on January 20, 2020 that COVID-19 could be transmitted from human to human, both quantities kept increasing dramatically. We therefore analyze the contact and participation data of COVID-19 information from January 17, 2020 to April 16, 2020. Table 1 gives the specific values of the cumulative reading quantity (CR) and forwarding quantity (CF).
Fig. 3
The reading quantity and forwarding quantity about COVID-19: (a) the reading quantity; (b) the forwarding quantity.
Table 1
The cumulative reading quantity and forwarding quantity of COVID-19.
Date        CR (×10^5)    CF
2020.1.17   38,633        225,554
2020.1.18   42,915        264,641
2020.1.19   63,018        434,164
2020.1.20   125,739       1,383,953
2020.1.21   253,288       3,390,573
2020.1.22   476,434       7,639,887
2020.1.23   779,867       23,811,193
2020.1.24   1,009,154     31,929,846
2020.1.25   1,239,081     37,850,190
2020.1.26   1,482,486     43,424,983
2020.1.27   1,702,699     47,026,579
2020.1.28   1,885,614     50,333,088
2020.1.29   2,062,613     54,259,760
2020.1.30   2,235,681     57,741,525
2020.1.31   2,404,628     62,205,935
2020.2.1    2,615,989     69,421,577
2020.2.2    2,816,495     75,392,452
2020.2.3    2,995,967     78,614,456
2020.2.4    3,182,909     84,683,986
2020.2.5    3,375,830     90,049,559
2020.2.6    3,544,372     99,153,988
2020.2.7    3,797,159     111,769,392
2020.2.8    3,957,690     116,979,114
2020.2.9    4,168,665     120,813,833
2020.2.10   4,337,476     125,010,556
2020.2.11   4,539,071     129,008,667
2020.2.12   4,690,281     133,206,593
2020.2.13   4,852,670     138,580,808
2020.2.14   5,028,183     143,356,722
2020.2.15   5,200,905     147,554,717
2020.2.16   5,355,616     151,821,458
2020.2.17   5,507,174     155,138,314
2020.2.18   5,660,835     158,447,426
2020.2.19   5,800,665     161,128,288
2020.2.20   5,934,350     163,301,415
2020.2.21   6,139,063     165,502,316
2020.2.22   6,299,068     167,645,359
2020.2.23   6,465,841     170,198,613
2020.2.24   6,619,762     173,068,348
2020.2.25   6,757,166     176,231,509
2020.2.26   6,885,342     178,567,007
2020.2.27   7,003,691     180,802,728
2020.2.28   7,130,939     183,003,765
2020.2.29   7,246,748     184,900,668
2020.3.1    7,351,752     186,966,898
2020.3.2    7,447,584     188,327,150
2020.3.3    7,534,199     189,521,175
2020.3.4    7,638,853     190,727,593
2020.3.5    7,726,308     191,892,477
2020.3.6    7,798,859     192,894,893
2020.3.7    7,866,021     193,762,501
2020.3.8    7,916,356     194,739,304
2020.3.9    7,977,164     195,609,360
2020.3.10   8,050,796     196,481,124
2020.3.11   8,140,781     197,673,456
2020.3.12   8,286,361     199,077,665
2020.3.13   8,486,884     200,311,755
2020.3.14   8,606,930     201,747,067
2020.3.15   8,741,984     203,341,571
2020.3.16   8,881,456     205,021,835
2020.3.17   9,025,990     213,788,905
2020.3.18   9,159,021     219,382,758
2020.3.19   9,294,382     222,669,494
2020.3.20   9,454,936     225,005,650
2020.3.21   9,613,858     227,086,475
2020.3.22   9,714,204     228,026,043
2020.3.23   9,788,695     228,598,328
2020.3.24   9,846,275     229,318,397
2020.3.25   9,940,630     230,114,834
2020.3.26   10,024,169    230,719,885
2020.3.27   10,116,518    231,501,263
2020.3.28   10,193,220    232,111,489
2020.3.29   10,274,610    232,756,584
2020.3.30   10,401,702    233,655,217
2020.3.31   10,509,970    234,849,798
2020.4.1    10,613,594    235,876,483
2020.4.2    10,719,037    236,758,976
2020.4.3    10,807,269    237,811,162
2020.4.4    11,026,877    252,986,547
2020.4.5    11,138,821    255,099,056
2020.4.6    11,231,162    256,111,309
2020.4.7    11,333,261    257,005,795
2020.4.8    11,422,902    258,764,094
2020.4.9    11,503,056    259,979,183
2020.4.10   11,562,237    260,392,077
2020.4.11   11,617,701    260,727,437
2020.4.12   11,657,360    260,984,402
2020.4.13   11,699,565    261,217,740
2020.4.14   11,743,954    261,480,292
2020.4.15   11,806,848    261,836,224
2020.4.16   11,866,802    262,202,217
Fig. 4 shows the box diagram of the reading quantity and forwarding quantity from January 17, 2020 to April 16, 2020, which reflect users’ contact with and participation in information, respectively. The reading quantity is orders of magnitude larger than the forwarding quantity and its range is wider: the reading quantity and the forwarding quantity vary within about 3 × 10^10 and 7 × 10^6 respectively, apart from the outlier forwarding quantities caused by several emergencies. Nevertheless, the two propagation trends are consistent, and both are in line with the laws of dynamic development. The two quantities also differ in features such as the median, so it is necessary to build a model combining these two attributes.
Fig. 4
The box diagram of the number of reading and forwarding.
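The quantities behind a box diagram of this kind can be sketched in a few lines. The example below assumes that the daily forwarding quantity is the first difference of the cumulative forwarding quantity (CF) in Table 1, and it uses only the first 14 days of that table as an excerpt; the function names and the 1.5 × IQR outlier fences are illustrative choices, not taken from the paper.

```python
from statistics import quantiles

# Excerpt of Table 1: cumulative forwarding quantity (CF), 2020.1.17 to 2020.1.30.
cumulative_cf = [
    225_554, 264_641, 434_164, 1_383_953, 3_390_573, 7_639_887, 23_811_193,
    31_929_846, 37_850_190, 43_424_983, 47_026_579, 50_333_088, 54_259_760,
    57_741_525,
]


def daily_increments(cumulative):
    """Turn a cumulative series into per-day counts (first differences)."""
    return [later - earlier for earlier, later in zip(cumulative, cumulative[1:])]


def box_summary(values):
    """Five-number summary plus conventional 1.5 * IQR outlier fences."""
    q1, med, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return {
        "min": min(values),
        "q1": q1,
        "median": med,
        "q3": q3,
        "max": max(values),
        "outliers": [v for v in values if v < low_fence or v > high_fence],
    }


daily_cf = daily_increments(cumulative_cf)
summary = box_summary(daily_cf)
print(daily_cf)
print(summary)
```

Applying the same two functions to the CR column (after multiplying by 10^5) gives the corresponding summary for the reading quantity.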
To analyze the public opinion trend more clearly, Fig. 5 gives the reading quantity, the forwarding quantity and the number of topics, using line charts and histograms, from January 17, 2020 to April 16, 2020. Since January 17, 2020, the reading and forwarding of COVID-19 information have gradually increased, and the emergence of some important events or hot topics has had a major impact on public opinion. On this basis, the overall development so far can be divided into four stages.

The first stage, from January 17, 2020 to February 19, 2020, in which both quantities increased gradually, is the outbreak stage. In the beginning, the topic # 5 rumors of viral pneumonia in Wuhan # attracted public attention. Wuhan then began its lockdown on January 23, 2020; with the emergence of topics such as # Wuhan Bus Metro Suspension of Operation #, the reading and forwarding quantities of COVID-19 began to increase dramatically and reached their highest values, and the number of topics began to increase gradually, leading public opinion into the outbreak stage. In addition, the topics #Doctor Wenliang Li passed away#, posted on February 7, 2020, and #Easy epidemic prevention station#, posted on February 19, 2020, aroused great attention, causing both quantities to peak again while public opinion continued to ferment.

The second stage is from February 19, 2020 to March 16, 2020, in which both quantities started to decrease as the domestic cases stabilized and improved, while the overall number of topics increased and then stabilized within a certain range in the later period.

The third stage, from March 16, 2020 to April 3, 2020, is another outbreak stage. The overall trend rose as the disease became more serious abroad, and the number of topics rose again. With the release of #support Hubei Medical Team to evacuate # on March 17, 2020 and other topics related to COVID-19 worldwide, both quantities increased dramatically again, and public opinion entered another outbreak stage.

The fourth stage, from April 3, 2020 to April 16, 2020, is the occasional fluctuation stage. April 4, 2020 was Tomb-Sweeping Day in China, and topics mourning the anti-epidemic heroes, such as # Three minutes of silence in all China #, began to increase; these led many users to read and forward and caused some occasional fluctuations in the reading quantity and forwarding quantity. This division of COVID-19 public opinion into stages helps us understand the trend driven by the hot topics of the entire epidemic.
Fig. 5
The trends chart of the topics about COVID-19, where the blue line represents the number of reading and the red line represents the number of forwarding respectively, and the histograms represent the number of topics per day.
COVID-19 information contact and participation prediction
Susceptible–reading–forwarding–immune model (SRFI)
The propagation dynamics model based on the reading quantity and forwarding quantity of COVID-19 constructed in this paper is shown in Fig. 6. Here, we only consider the accessible population in the process of information propagation and pay attention to the information diffusion caused by both users’ reading behavior and forwarding behavior. Assuming that the number of users who can contact information in the process of propagation on Sina-microblog remains unchanged, we stratify the population into four states: the susceptible state (S), in which users are unaware of but susceptible to the information of the event; the reading state (R), in which users have read the information and are susceptible to forwarding it; the forwarding state (F), in which users have actively forwarded the information to influence other users; and the immune state (I), in which users have read or forwarded the information but will no longer read or forward it even if they receive it again.
Fig. 6
A schematic diagram to illustrate information spreading considering both reading and forwarding in the population with four different states: susceptible (S), reading (R), forwarding (F) and immune (I).
A susceptible user reads a piece of information at an average exposure rate, and a user in the reading state leaves that state at a deactivation rate. Forwarding users become immune users, who are inactive to the event, at an average inactive rate, whose reciprocal is the average duration for which an F-user remains active in being contacted.

The core of our model is to study the role of both repeated reading and repeated forwarding, which arise because users are exposed to different information about COVID-19. Hence, in addition to the forwarding probability with which reading users become forwarding users and the immunity probability for those who keep silent in the event and go straight to the immune state, we introduce a “re-reading” probability with which a reading user returns to the susceptible state. We likewise introduce a “re-forwarding” probability with which a forwarding user returns to the susceptible state for a new round of COVID-19 information.

In particular, each user has a unique state: at any given time, a user can be in only one of the susceptible, reading, forwarding or immune states. We obtain the following SRFI dynamics model:
where ′ denotes the derivative with respect to time t. The behavior transformation and state transition of the masses can also be interpreted as follows:

Reading: An active forwarding user contacts an average number of users per unit time, and only the contacted users who are susceptible can become readers; summing over all active forwarding users gives the number of new reading users per unit time.

Forwarding: Reading users leave the reading state at the deactivation rate, and those who do so with the forwarding probability become new forwarding users.

Re-enter: As events unfold, there are two ways to generate repeated behavior and initiate a new round of reading and forwarding. Users who have read one piece of information and want more information about COVID-19 re-enter the susceptible state from the reading state; users who have forwarded one piece of information and are interested in related information about COVID-19 re-enter the susceptible state from the forwarding state.

Immune: Some reading users do not participate in forwarding and become immune users directly because they want to keep silent in the event. Some forwarding users move to the immune state when their active time ends.

The Sina-microblog provides the cumulative reading quantity and forwarding quantity, which are the total numbers of times information is read and forwarded within a topic about COVID-19, and we calculate the sum over the whole event. From Eqs. (4)–(6), the cumulative quantities are increasing, because their derivatives are non-negative. In the final state, the numbers of reading and forwarding users tend to 0, and the cumulative reading and forwarding quantities tend to the final sizes of COVID-19 reading and forwarding. In addition, the model gives the maximal numbers of reading users and forwarding users.

Public opinion reproduction ratio: The reproduction ratio is defined to measure whether public opinion is likely to break out. We use the calculation method for the basic reproduction number developed in [26]: the model is rewritten in the form required by that method, the derivatives are calculated at the no-information-propagation equilibrium, and the eigenvalues of the resulting matrix are obtained from the roots of its characteristic equation.

Building on this ratio, we define the (effective) reproduction number to describe the outbreak of public opinion at each time t, which has more practical significance in the dynamic development process. It represents the propagation capability in each period and is time-varying; it is determined by the average exposure rate, the average inactive rate, the forwarding probability and the number of susceptible users. When it is below 1, the comprehensive public opinion will decline, which implies that the propagation can never break out. A value above 1 indicates that the comprehensive public opinion initially grows exponentially.
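The verbal description of the four states can be turned into a small numerical sketch. The code below is one system of differential equations that is consistent with that description; it is not claimed to be the paper's exact model. All parameter names (`exposure`, `deactivation`, `inactive`, `p_forward`, `p_immune`, `p_reforward`), the assumption that the re-reading probability is one minus the other two probabilities, the form of `effective_reproduction`, and every numerical value are illustrative assumptions.

```python
def srfi_rhs(state, prm):
    """Right-hand side of an assumed SRFI-type system (illustrative, not the paper's)."""
    s, r, f, i, cum_read, cum_fwd = state
    n = s + r + f + i                                  # accessible population
    new_readers = prm["exposure"] * s * f / n          # S -> R, driven by F-users
    leave_r = prm["deactivation"] * r                  # users leaving the reading state
    leave_f = prm["inactive"] * f                      # users leaving the forwarding state
    p_reread = 1.0 - prm["p_forward"] - prm["p_immune"]  # assumed closure of probabilities
    return [
        -new_readers + p_reread * leave_r + prm["p_reforward"] * leave_f,  # S'
        new_readers - leave_r,                                             # R'
        prm["p_forward"] * leave_r - leave_f,                              # F'
        prm["p_immune"] * leave_r + (1.0 - prm["p_reforward"]) * leave_f,  # I'
        new_readers,                                   # cumulative reading quantity
        prm["p_forward"] * leave_r,                    # cumulative forwarding quantity
    ]


def rk4_step(rhs, state, dt, prm):
    """One classical fourth-order Runge-Kutta step."""
    k1 = rhs(state, prm)
    k2 = rhs([x + 0.5 * dt * k for x, k in zip(state, k1)], prm)
    k3 = rhs([x + 0.5 * dt * k for x, k in zip(state, k2)], prm)
    k4 = rhs([x + dt * k for x, k in zip(state, k3)], prm)
    return [x + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
            for x, a, b, c, d in zip(state, k1, k2, k3, k4)]


def simulate(state0, prm, days, steps_per_day=100):
    """Return the state at the end of each day, starting with day 0."""
    dt = 1.0 / steps_per_day
    state = list(state0)
    daily = [state]
    for _ in range(days):
        for _ in range(steps_per_day):
            state = rk4_step(srfi_rhs, state, dt, prm)
        daily.append(state)
    return daily


def effective_reproduction(s, n, prm):
    """Assumed form: exposure rate x forwarding probability x S / (N x inactive rate)."""
    return prm["exposure"] * prm["p_forward"] * s / (n * prm["inactive"])


# Purely illustrative values, not estimates from the paper.
params = {"exposure": 5.0, "deactivation": 1.0, "inactive": 1.0,
          "p_forward": 0.4, "p_immune": 0.5, "p_reforward": 0.1}
initial_state = [1_000_000.0 - 100.0, 0.0, 100.0, 0.0, 0.0, 0.0]  # S, R, F, I, CR, CF
trajectory = simulate(initial_state, params, days=60)

population = sum(initial_state[:4])
peak_day = max(range(len(trajectory)), key=lambda d: trajectory[d][2])
print("day of peak forwarding users:", peak_day)
print("effective reproduction number at day 0:",
      effective_reproduction(trajectory[0][0], population, params))
print("effective reproduction number at day 60:",
      effective_reproduction(trajectory[-1][0], population, params))
```

In this sketch the effective reproduction number depends only on the exposure rate, the inactive rate, the forwarding probability and the current number of susceptible users, which matches the dependence stated in the text; the division by the population size follows from treating the fraction of susceptible users as the probability that a contacted user is susceptible.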
Data fitting
Parameter estimation method: To use our SRFI model to explore qualitative differences in behavior for prediction, we use the least squares (LS) method to estimate the model parameters and the initial data of our SRFI model. For a given parameter vector, the model is solved numerically to give the corresponding cumulative reading quantity and forwarding quantity. The LS error function measures the difference between these numerical values and the actual cumulative reading quantity and forwarding quantity at the sampling times 0, 1, 2, …. In our paper, we use DEDiscover software to solve this LS problem.

Data description: To analyze public opinion with different characteristics, the following two typical events are selected from the whole duration of the public opinion outbreak. Table 2 shows the reading quantity and forwarding quantity of # Refuse to eat wild animals #, which had a slow outbreak. Table 3 shows the reading quantity and forwarding quantity of # Real-time broadcast of joint prevention and control of epidemic situation #, which had a fast outbreak.
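The least-squares idea in the parameter estimation paragraph can be sketched as follows. This is not DEDiscover and not the paper's exact error function: it assumes an unweighted sum of squared residuals over both cumulative series, an SRFI-type model with hypothetical parameter names, a start with a single forwarding user, and a crude explicit Euler integrator. The "observations" are synthetic, generated from known parameters, so that the behavior of the objective can be checked.

```python
def simulate_cumulative(theta, days, steps_per_day=200):
    """Daily cumulative reading/forwarding counts of an assumed SRFI-type model.

    theta = (s0, exposure, deactivation, inactive, p_forward, p_immune, p_reforward);
    every name here is a hypothetical label for this sketch.
    """
    s0, exposure, deactivation, inactive, p_forward, p_immune, p_reforward = theta
    s, r, f = s0, 0.0, 1.0            # assumed start: one active forwarding user
    n = s + r + f                     # accessible population, kept constant
    p_reread = 1.0 - p_forward - p_immune
    cum_read = cum_fwd = 0.0
    dt = 1.0 / steps_per_day
    read_daily, fwd_daily = [cum_read], [cum_fwd]
    for _ in range(days):
        for _ in range(steps_per_day):
            # All flows are computed from the old state (explicit Euler).
            new_readers = exposure * s * f / n
            leave_r = deactivation * r
            leave_f = inactive * f
            s += dt * (-new_readers + p_reread * leave_r + p_reforward * leave_f)
            r += dt * (new_readers - leave_r)
            f += dt * (p_forward * leave_r - leave_f)
            cum_read += dt * new_readers
            cum_fwd += dt * p_forward * leave_r
        read_daily.append(cum_read)
        fwd_daily.append(cum_fwd)
    return read_daily, fwd_daily


def ls_error(theta, observed_read, observed_fwd):
    """Unweighted sum of squared residuals over both cumulative series."""
    model_read, model_fwd = simulate_cumulative(theta, days=len(observed_read) - 1)
    read_part = sum((m - o) ** 2 for m, o in zip(model_read, observed_read))
    fwd_part = sum((m - o) ** 2 for m, o in zip(model_fwd, observed_fwd))
    return read_part + fwd_part


def with_exposure(theta, exposure):
    """Copy of theta with only the exposure rate replaced."""
    return (theta[0], exposure) + tuple(theta[2:])


# Synthetic "observations" from known, purely illustrative parameters.
true_theta = (100_000.0, 4.0, 1.0, 1.0, 0.4, 0.5, 0.1)
observed_read, observed_fwd = simulate_cumulative(true_theta, days=20)

# One-dimensional scan over the exposure rate, all other parameters held fixed.
candidates = (2.0, 3.0, 4.0, 5.0, 6.0)
errors = {c: ls_error(with_exposure(true_theta, c), observed_read, observed_fwd)
          for c in candidates}
best_exposure = min(errors, key=errors.get)
print("best exposure rate on the grid:", best_exposure)
```

A real fit would replace the grid scan with a numerical optimizer over all parameters and the synthetic series with the tabulated CR and CF values; because CR and CF differ by several orders of magnitude, some form of weighting or rescaling of the two residual sums would probably be needed.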
Table 2
The cumulative reading quantity and forwarding quantity of # Refuse to eat wild animals #.
Date        CR (×10^5)    CF
2020.2.29   14            1,270
2020.3.1    19            3,780
2020.3.2    249           9,666
2020.3.3    504           33,657
2020.3.4    712           51,138
2020.3.5    855           64,398
2020.3.6    992           71,008
2020.3.7    1,142         89,619
2020.3.8    1,270         106,938
2020.3.9    1,368         109,223
2020.3.10   1,443         109,589
2020.3.11   1,486         110,431
2020.3.12   1,508         110,888
2020.3.13   1,514         112,583
2020.3.14   1,523         112,861
2020.3.15   1,536         113,140
2020.3.16   1,549         113,640
2020.3.17   1,551         113,778
2020.3.18   1,551         114,011
2020.3.19   1,551         114,117
2020.3.20   1,551         114,170
2020.3.21   1,551         114,247
2020.3.22   1,551         114,281
Table 3
The cumulative reading quantity and forwarding quantity of # Real-time broadcast of joint prevention and control of epidemic situation #.
Date        CR (×10^5)    CF
2020.1.27   2,361         61,134
2020.1.28   8,448         119,339
2020.1.29   14,146        181,997
2020.1.30   19,897        240,858
2020.1.31   23,839        263,552
2020.2.1    26,616        284,178
2020.2.2    30,331        314,132
2020.2.3    30,799        321,897
2020.2.4    31,838        334,686
2020.2.5    32,261        341,232
2020.2.6    32,424        343,963
2020.2.7    32,619        347,308
2020.2.8    33,451        360,530
2020.2.9    33,551        362,177
2020.2.10   33,579        363,453
2020.2.11   33,603        364,203
2020.2.12   33,656        364,682
2020.2.13   33,721        365,318
2020.2.14   33,752        365,673
2020.2.15   33,767        366,002
2020.2.16   33,787        366,323
2020.2.17   33,812        366,652
2020.2.18   33,854        367,272
2020.2.19   33,870        367,698
2020.2.20   33,908        368,275
2020.2.21   33,961        368,875
2020.2.22   33,984        369,261
2020.2.23   34,000        369,629
2020.2.24   34,021        370,098
2020.2.25   34,039        370,585
2020.2.26   34,054        370,899
2020.2.27   34,069        371,286
2020.2.28   34,081        371,720
2020.2.29   34,081        372,086
2020.3.1    34,081        372,420
2020.3.2    34,081        372,593
Data fitting results: As shown in Fig. 7, we performed data fitting on the real data of # Refuse to eat wild animals # in Table 2, where the red stars and blue stars denote the actual cumulative numbers of the reading and forwarding populations, respectively, and the red line and blue line denote the estimated cumulative numbers of the reading and forwarding populations, respectively. It can be seen that the initial outbreak of the topic is slow and that our SRFI model achieves an accurate estimation.
Fig. 7
The data fitting results of # Refuse to eat wild animals #.
Table 4 gives some important values of the early-period parameter estimation of # Refuse to eat wild animals #. Relative to the topic # Real-time broadcast of joint prevention and control of epidemic situation #, the initial number of susceptible users (S0) is smaller, meaning that fewer users are susceptible at the beginning of the information explosion, which leads the topic to break out slowly. In addition, the average inactive rate is larger, which means that the active period of forwarding users is short, so they cannot influence other susceptible people for a long time; this leads to a smaller final size of the cumulative reading and forwarding quantities.
Table 4
Some important values of parameter estimation of # Refuse to eat wild animals #.
Name  Estimated value  Standard error  CI low bound  CI high bound  p-value        t-statistic  Min         Max
S0    4.2710×10^4      52.7936         6.0805×10^5   6.0826×10^6    1.5212×10^−218 1.1520×10^4  1.0000×10^4 1.0000×10^4
α     0.8611           0.0264          0.3484        0.4535         9.8689×10^−24  15.2126      0.0000      2.0000
β     0.1450           0.0029          0.0057        0.0172         1.8223×10^−4   3.9562       0.0000      1.0000
γ     0.9782           0.0262          0.9429        1.0475         5.7330×10^−48  37.9579      0.0000      1.0000
p     7.9000×10^−4     8.3598×10^−5    1.2329×10^−4  4.5684×10^−4   9.0251×10^−4   3.4697       0.0000      1.0000
q     0.2580           0.0117          0.0865        0.1332         5.8565×10^−14  9.3897       0.0000      1.0000
θ     0.9781           0.0322          0.8578        0.9861         4.7696×10^−4   28.6591      0.0000      1.0000
As shown in Fig. 8, we fitted the model to the real data of # Real-time broadcast of joint prevention and control of epidemic situation # in Table 3. The red and blue stars denote the actual cumulative numbers of the reading and forwarding populations, respectively, and the red and blue lines denote the corresponding estimates. The initial outbreak of this topic is fast: it entered a stage of rapid explosion directly at the beginning. Here, too, our SRFI model estimates both curves accurately.
Fig. 8
The data fitting results of # Real-time broadcast of joint prevention and control of epidemic situation #.
Table 5 gives the key parameter estimates for the early period of # Real-time broadcast of joint prevention and control of epidemic situation #. The number of initial susceptible users S0 is larger, meaning that more users are susceptible at the beginning of the information explosion, so the topic breaks out rapidly. In addition, the parameter controlling the active period of forwarding users is smaller, indicating that more users remain active and affect other susceptible users, so the topic quickly grows to larger final sizes of the cumulative reading and forwarding quantities.
Table 5
Some important values of parameter estimation of # Real-time broadcast of joint prevention and control of epidemic situation #.
Name | Estimated value | Standard error | CI low bound | CI high bound | p-value | t-statistic | Min | Max
S0 | 2.1879×10^6 | 29.4563 | 2.1878×10^6 | 2.1879×10^6 | 2.0350×10^−259 | 7.4275×10^4 | 1.0000×10^4 | 1.0000×10^7
α | 0.1959 | 0.0257 | 0.1446 | 0.2472 | 1.3350×10^−10 | 7.6283 | 0.0000 | 4.0000
β | 0.0046 | 2.2555×10^−4 | 0.0042 | 0.0051 | 4.9300×10^−30 | 20.4641 | 0.0000 | 1.0000
γ | 0.8141 | 0.0159 | 0.7822 | 0.8459 | 3.3164×10^−54 | 51.1067 | 0.0000 | 1.0000
p | 9.0233×10^−5 | 8.4946×10^−6 | 7.3269×10^−5 | 1.0720 | 7.6349×10^−16 | 10.6225 | 0.0000 | 1.0000
q | 0.7086 | 0.0228 | 0.6630 | 0.7541 | 1.0095×10^−40 | 31.0714 | 0.0000 | 1.0000
θ | 0.7363 | 0.0157 | 0.7049 | 0.7678 | 9.0737×10^−52 | 46.7712 | 0.0000 | 1.0000
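As a quick consistency check on Table 5, the reported t-statistic of each parameter should be close to the ratio of its estimated value to its standard error. The sketch below assumes this standard definition (the source does not state how the column was computed) and uses the rounded values from Table 5, so small gaps from rounding are expected. The names `TABLE_5`, `t_statistic` and `relative_gap` are illustrative only.

```python
# Rounded (estimate, standard error, reported t-statistic) triples copied from Table 5.
TABLE_5 = {
    "S0": (2.1879e6, 29.4563, 7.4275e4),
    "alpha": (0.1959, 0.0257, 7.6283),
    "beta": (0.0046, 2.2555e-4, 20.4641),
    "gamma": (0.8141, 0.0159, 51.1067),
    "p": (9.0233e-5, 8.4946e-6, 10.6225),
    "q": (0.7086, 0.0228, 31.0714),
    "theta": (0.7363, 0.0157, 46.7712),
}


def t_statistic(estimate, std_error):
    """Assumed definition: t = estimate / standard error (null value 0)."""
    return estimate / std_error


def relative_gap(computed, reported):
    """Relative difference between a recomputed and a reported value."""
    return abs(computed - reported) / abs(reported)


for name, (estimate, std_error, reported) in TABLE_5.items():
    t = t_statistic(estimate, std_error)
    gap = relative_gap(t, reported)
    print(f"{name}: computed t = {t:.4f}, reported t = {reported:.4f}, gap = {gap:.2%}")
```

Because the inputs are rounded to four or five significant digits, the recomputed ratios are only expected to agree with the reported column to within a fraction of a percent.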
COVID-19 information contact and participation prediction
We divided the main development of COVID-19 information dissemination from January 17, 2020 to April 16, 2020 into four stages according to the analysis of information contact and participation. Although we cannot control the occurrence of emergencies or the dissemination of information, it is important to be able to predict, in each stage, the trend of public opinion from the existing data before the next emergency arrives. Fig. 9 shows the prediction of the reading and forwarding quantities of COVID-19, and Table 6 gives the parameters estimated with the DEDiscover software.
Fig. 9
The prediction of reading and forwarding quantities of COVID-19.
Table 6
Parameter results.
Panel | β | α | γ | p | q | θ | S0
Fig. 6(a1) | 0.0049 | 0.0018 | 0.3553 | 6.0042×10^−4 | 0.0307 | 0.3437 | 6.1797×10^5
Fig. 6(a2) | 0.0088 | 0.0310 | 0.4841 | 2.9000×10^−4 | 0.0392 | 0.5191 | 5.2450×10^5
Fig. 6(b1) | 0.0152 | 0.9306 | 0.0450 | 7.9350×10^−5 | 0.3230 | 0.4370 | 2.7494×10^5
Fig. 6(b2) | 0.0150 | 1.2250 | 0.0381 | 8.6383×10^−5 | 0.2831 | 0.5673 | 3.4533×10^5
Fig. 6(c1) | 0.0360 | 0.7140 | 0.0265 | 1.0531×10^−4 | 0.2330 | 0.1050 | 4.5804×10^4
Fig. 6(c2) | 0.0099 | 0.7056 | 0.0140 | 1.5568×10^−4 | 2.7685×10^−4 | 0.0043 | 1.0001×10^5
Fig. 6(d1) | 0.0280 | 2.0401 | 0.1041 | 6.0000×10^−5 | 0.8191 | 0.3230 | 1.1055×10^5
Fig. 6(d2) | 0.0450 | 0.4304 | 0.0110 | 2.4700×10^−4 | 0.4040 | 0.6360 | 1.9527×10^5
In the first stage, the topic #5 rumors of viral pneumonia in Wuhan# attracted public attention, and the cumulative reading and forwarding quantities of COVID-19 began to increase. We predicted the public opinion trend of COVID-19 with the data from January 17, 2020 to January 25, 2020, and the prediction agrees well with the actual data until January 28, 2020, as shown in Fig. 9(a1). We then extended the data to January 31, 2020 and predicted again, obtaining a satisfactory result until the end of this stage, as shown in Fig. 9(a2). The forecast curves of the reading and forwarding populations show that both public contact and participation were increasing at the beginning of the epidemic.
With the gradual stabilization of COVID-19 in China, the cumulative reading and forwarding quantities began to grow slowly, and public opinion entered the second stage. We used the data between February 19, 2020 and February 25, 2020 to estimate the parameters and predict the trend of public opinion over the following days, as shown in Fig. 9(b1). We then extended the data to February 26, 2020 to predict the rest of the trend in this stage and obtained very good prediction results, as shown in Fig. 9(b2). At this stage, the predicted reading and forwarding populations trend downward, which means that public contact and participation are close to saturation.
The second stage ended unexpectedly because of the seriousness of COVID-19 overseas, and public opinion entered the third stage. We used the data between March 16, 2020 and March 22, 2020 to estimate the parameters and predict the trend of public opinion, as shown in Fig. 9(c1), and then extended the data to March 23, 2020 to predict the rest of the trend in this stage, as shown in Fig. 9(c2). At this stage, the predicted reading and forwarding populations start from a large value and then fall. From this decrease we can infer that the dramatic outbreak of public opinion appears only at the beginning of the stage.
Public opinion then entered the fourth stage because of occasional fluctuations in the reading and forwarding quantities. We used the data between April 3, 2020 and April 9, 2020 to obtain a good prediction for the next three days, as shown in Fig. 9(d1), and then extended the data to April 10, 2020 to obtain a good prediction until the end of this stage, as shown in Fig. 9(d2). At the beginning of this stage, the predicted reading and forwarding populations reach their maximum and then trend downward, so we again conclude that the dramatic outbreak of public opinion appears only at the beginning of the stage.
Table 7 gives the predicted effective reproduction ratio ℜe for each day, computed from the most recent parameter estimates. Fig. 10 shows the reading quantity, the forwarding quantity and ℜe per day together from January 17, 2020 to April 16, 2020; the three curves have similar trends at each stage. Our SRFI model predicts the greatest reproduction ratio, ℜe = 6.5656, on January 17, 2020, when the information breaks out quickly in the early stage of COVID-19. As the epidemic develops, ℜe falls below 1 in the second stage (0.6044 on February 19, 2020), meaning that public opinion is gradually calming down. It increases again at the beginning of the third stage (0.9704 on March 16, 2020) but remains below 1, indicating that although reading and forwarding users increase at first, public opinion will not continue to erupt. With the occurrence of an emergency, ℜe suddenly rises above 1 (2.2455 on April 3, 2020), which indicates that, under the overall trend of stability, information on COVID-19 will continue to burst in waves as emergencies occur.
Table 7
The results of the predicted public opinion reproduction ratio ℜe.
Date | 2020.1.17 | 2020.1.18 | 2020.1.19 | 2020.1.20 | 2020.1.21 | 2020.1.22 | 2020.1.23
ℜe | 6.5656 | 6.5444 | 6.3993 | 6.1255 | 5.6959 | 5.1075 | 4.4263
Date | 2020.1.24 | 2020.1.25 | 2020.1.26 | 2020.1.27 | 2020.1.28 | 2020.1.29 | 2020.1.30
ℜe | 3.7758 | 3.2554 | 2.8779 | 2.6011 | 2.3918 | 2.2262 | 2.0905
Date | 2020.1.31 | 2020.2.1 | 2020.2.2 | 2020.2.3 | 2020.2.4 | 2020.2.5 | 2020.2.6
ℜe | 1.9776 | 1.8819 | 1.7994 | 1.7273 | 1.6635 | 1.6066 | 1.5557
Date | 2020.2.7 | 2020.2.8 | 2020.2.9 | 2020.2.10 | 2020.2.11 | 2020.2.12 | 2020.2.13
ℜe | 1.5101 | 1.4686 | 1.43 | 1.3956 | 1.363 | 1.3337 | 1.3058
Date | 2020.2.14 | 2020.2.15 | 2020.2.16 | 2020.2.17 | 2020.2.18 | 2020.2.19 | 2020.2.20
ℜe | 1.28 | 1.2563 | 1.2339 | 1.2126 | 1.1926 | 0.6044 | 0.1452
Date | 2020.2.21 | 2020.2.22 | 2020.2.23 | 2020.2.24 | 2020.2.25 | 2020.2.26 | 2020.2.27
ℜe | 0.2336 | 0.3384 | 0.4409 | 0.5284 | 0.5973 | 0.65 | 0.6902
Date | 2020.2.28 | 2020.2.29 | 2020.3.1 | 2020.3.2 | 2020.3.3 | 2020.3.4 | 2020.3.5
ℜe | 0.7209 | 0.7447 | 0.7633 | 0.7781 | 0.7898 | 0.7993 | 0.8069
Date | 2020.3.6 | 2020.3.7 | 2020.3.8 | 2020.3.9 | 2020.3.10 | 2020.3.11 | 2020.3.12
ℜe | 0.8131 | 0.8181 | 0.8223 | 0.8257 | 0.8285 | 0.8309 | 0.8328
Date | 2020.3.13 | 2020.3.14 | 2020.3.15 | 2020.3.16 | 2020.3.17 | 2020.3.18 | 2020.3.19
ℜe | 0.8344 | 0.8357 | 0.8368 | 0.9704 | 0.9617 | 0.9506 | 0.9361
Date | 2020.3.20 | 2020.3.21 | 2020.3.22 | 2020.3.23 | 2020.3.24 | 2020.3.25 | 2020.3.26
ℜe | 0.9173 | 0.8931 | 0.8620 | 0.8223 | 0.7726 | 0.7114 | 0.6385
Date | 2020.3.27 | 2020.3.28 | 2020.3.29 | 2020.3.30 | 2020.3.31 | 2020.4.1 | 2020.4.2
ℜe | 0.5553 | 0.4656 | 0.3751 | 0.2907 | 0.2178 | 0.1591 | 0.4602
Date | 2020.4.3 | 2020.4.4 | 2020.4.5 | 2020.4.6 | 2020.4.7 | 2020.4.8 | 2020.4.9
ℜe | 2.2455 | 0.162 | 0.1976 | 0.2397 | 0.2884 | 0.343 | 0.402
Date | 2020.4.10 | 2020.4.11 | 2020.4.12 | 2020.4.13 | 2020.4.14 | 2020.4.15 | 2020.4.16
ℜe | 0.4628 | 0.5223 | 0.577 | 0.6246 | 0.6636 | 0.6941 | 0.7167
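The threshold reading of Table 7 can be reproduced mechanically: public opinion is in an outbreak regime while ℜe is above 1 and is calming down while it is below 1. The sketch below takes a handful of entries copied from Table 7 (an ordered sample, not the full series) and reports where consecutive entries fall on different sides of the threshold. The names `RE_SAMPLE` and `threshold_crossings` are illustrative and do not come from the source.

```python
# Ordered (date, reproduction ratio) pairs copied from Table 7; a sample, not the full series.
RE_SAMPLE = [
    ("2020.1.17", 6.5656),
    ("2020.2.18", 1.1926),
    ("2020.2.19", 0.6044),
    ("2020.3.16", 0.9704),
    ("2020.4.2", 0.4602),
    ("2020.4.3", 2.2455),
    ("2020.4.4", 0.162),
    ("2020.4.16", 0.7167),
]


def threshold_crossings(series, threshold=1.0):
    """Return (date, direction) for each entry that lies on the other side
    of the threshold compared with the entry before it."""
    crossings = []
    for (_, previous), (date, current) in zip(series, series[1:]):
        if (previous > threshold) != (current > threshold):
            crossings.append((date, "up" if current > threshold else "down"))
    return crossings


peak_date, peak_value = max(RE_SAMPLE, key=lambda item: item[1])
print("largest sampled ratio:", peak_date, peak_value)
for date, direction in threshold_crossings(RE_SAMPLE):
    print(f"{date}: ratio crosses 1 going {direction}")
```

Applied to the full table, the same helper would flag every day on which the ratio changes regime; the sample here only includes the entries around the stage boundaries discussed in the text.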
Fig. 10
The prediction of reading and forwarding quantities of COVID-19.
COVID-19 information contact and participation sensitivity analysis and intervention strategies
To further analyze the influence of the different parameters of the comprehensive SRFI model, we use partial rank correlation coefficients (PRCCs) [27], based on 1000 samples of the input parameters, against the threshold condition to evaluate the sensitivity. In the histogram and scatter diagrams of dependence, a positive correlation means that the corresponding index increases as the parameter increases; a negative correlation means that the index decreases as the parameter increases.
Fig. 11 shows that the public opinion reproduction ratio ℜe is strongly positively affected by the average exposure rate, the forwarding probability and the initial number of susceptible users S0, and negatively affected by the probability of inactivation after forwarding. This result is consistent with our derivation of ℜe in Formula (15). These four quantities are therefore the key factors in determining whether an event breaks out. Thus, to make public opinion erupt, for example for positive topics about COVID-19, we can increase the average exposure rate, the forwarding probability and S0 and decrease the probability of inactivation after forwarding. Because the average exposure rate describes how readily a user comes into contact with the information and S0 is the initial size of the susceptible population, both can be increased by persuading opinion leaders to participate in the information propagation: each opinion leader motivates a large number of new susceptible users. In addition, richer and more interesting content attracts people to forward a topic and keeps them active in forwarding for longer, which raises the forwarding probability (driven by users’ interest in the topic) and lowers the probability of inactivation. Conversely, to keep public opinion from erupting, for example for rumor topics about COVID-19, we need to reduce the average exposure rate and S0. The platform can be encouraged to delete topics related to COVID-19 rumors, which effectively reduces new reading and curbs the outbreak of public opinion.
Fig. 11
PRCC results and PRCC scatter plots of different parameters with index ℜe.
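The sign convention of the PRCC analysis can be illustrated with a small, self-contained computation. The sketch below is not the paper's analysis: it uses a hypothetical index `toy_index(a, b, c) = a * b / c` in place of Formula (15), plain uniform random sampling (the source does not state its sampling scheme), and a textbook route to partial correlations through the inverse of the rank-correlation matrix. All function and variable names are illustrative.

```python
import math
import random


def ranks(values):
    """Rank-transform a list (1 = smallest); assumes no ties."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0.0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = float(rank)
    return result


def pearson(x, y):
    """Pearson correlation of two equally long lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def invert(matrix):
    """Gauss-Jordan inversion with partial pivoting for a small square matrix."""
    n = len(matrix)
    aug = [list(row) + [float(i == j) for j in range(n)] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def prcc(samples, outputs):
    """Partial rank correlation of each input column with the output,
    controlling for the remaining inputs."""
    columns = [ranks(list(col)) for col in zip(*samples)] + [ranks(outputs)]
    k = len(columns)
    corr = [[pearson(columns[i], columns[j]) for j in range(k)] for i in range(k)]
    precision = invert(corr)
    y = k - 1
    return [-precision[i][y] / math.sqrt(precision[i][i] * precision[y][y]) for i in range(y)]


def toy_index(a, b, c):
    """Hypothetical index: increases with a and b, decreases with c."""
    return a * b / c


rng = random.Random(0)  # fixed seed so the run is repeatable
samples = [tuple(rng.uniform(0.1, 1.0) for _ in range(3)) for _ in range(1000)]
outputs = [toy_index(*sample) for sample in samples]
coefficients = prcc(samples, outputs)
for name, value in zip(("a", "b", "c"), coefficients):
    print(f"PRCC({name}) = {value:+.3f}")
```

For this toy index the coefficients of `a` and `b` come out positive and that of `c` negative, which mirrors how positive and negative bars in a PRCC histogram are read.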
In addition, we are concerned with the final sizes of the cumulative reading and forwarding quantities and with the peak values of the reading and forwarding quantities. The final size of cumulative reading and the peak of the reading quantity reflect users’ contact. Fig. 12 shows that the average exposure rate and the initial susceptible users S0 have a positive impact on both of these indexes. Similarly, the final size of cumulative forwarding users and the peak of the forwarding population reflect users’ participation. Fig. 12 shows that two parameters have a positive impact and a third has a small negative impact on both of these indexes. An increase in the average exposure rate leads to an increase in reading quantity, and in the stage where the forwarding quantity increases rapidly, the two positively correlated forwarding parameters have relatively large values.
Fig. 12
PRCC results of different parameters with the final-size and peak-value indexes of reading and forwarding.
Combined with the PRCC results, we take the topic # Real-time broadcast of joint prevention and control of epidemic situation # as an example to analyze the specific effects of parameters in our SRFI model. Fig. 13 depicts the effects of the average exposure rate and the initial data on the reading population (R-population) and the cumulative reading population. Both have a positive effect on the number of reading users and on the cumulative reading population. Comparatively, the larger the parameter, the earlier the reading peak appears and the shorter the outbreak lasts, whereas the greater the initial data, the larger the reading peak without any change in the outbreak duration. The initial data therefore has a much more important influence on the outbreak behavior of topic reading related to COVID-19.
Fig. 13
Numerical experiments of the impact of the parameter and the initial data on the reading population and the cumulative reading population.
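The mechanics of such a parameter sweep can be sketched with a deliberately simplified stand-in. The code below is not the SRFI system of this paper: it integrates a hypothetical two-compartment susceptible-reading toy model with forward Euler, and the parameter names `beta`, `alpha`, `s0` and all numeric values are assumptions chosen only for illustration. Its outputs are not expected to match Fig. 13; it only shows how a peak value and a peak time are extracted while one input is varied at a time.

```python
def simulate(beta, s0, alpha=0.5, r0=1.0, dt=0.01, days=60.0):
    """Forward-Euler integration of the toy system
        dS/dt = -beta * S * R
        dR/dt =  beta * S * R - alpha * R
    Returns (peak value of R, time at which the peak occurs)."""
    s, r, t = float(s0), float(r0), 0.0
    peak_value, peak_time = r, t
    for _ in range(int(days / dt)):
        new_readers = beta * s * r * dt  # users moving from S to R in this step
        s, r, t = s - new_readers, r + new_readers - alpha * r * dt, t + dt
        if r > peak_value:
            peak_value, peak_time = r, t
    return peak_value, peak_time


# Vary one input at a time around a hypothetical baseline.
baseline = simulate(beta=0.001, s0=1000)
larger_s0 = simulate(beta=0.001, s0=2000)
larger_beta = simulate(beta=0.002, s0=1000)

for label, (peak_value, peak_time) in [
    ("baseline", baseline),
    ("larger s0", larger_s0),
    ("larger beta", larger_beta),
]:
    print(f"{label}: peak R = {peak_value:.1f} at t = {peak_time:.2f}")
```

In this toy model a larger initial susceptible population raises the reading peak and a larger contact parameter moves the peak earlier; whether the duration stays fixed, as reported for the SRFI model, depends on the actual model equations, which this stand-in does not reproduce.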
Fig. 14 depicts the effects of two parameters on the forwarding population (F-population) and the cumulative forwarding population. Both have a positive effect on the number of forwarding users and on the cumulative forwarding population. For the first parameter, the larger it is, the greater the forwarding peak and the final size of forwarding users, without any change in the outbreak duration. By contrast, the second parameter has a subtler impact on forwarding related to COVID-19: the greater it is, the earlier the forwarding peak appears and the slower the outbreak velocity and the propagation decline velocity are.
Fig. 14
Numerical experiments of the impact of two parameters on the forwarding population and the cumulative forwarding population.
Thus, to increase users’ attention to COVID-19 topics, we can persuade opinion leaders with large numbers of fans to participate in the information propagation, which increases the average exposure rate and S0. Conversely, to decrease users’ attention to COVID-19 topics, the platform can be encouraged to delete the relevant topics, which effectively reduces the reading quantity. In addition, to increase users’ participation in COVID-19 topics, we can raise the two parameters examined in Fig. 14 by making content more innovative so that it attracts people to forward the topics. These strategies are similar to those obtained from the sensitivity analysis of the opinion reproduction ratio ℜe.
Conclusions
In this paper, we proposed a comprehensive susceptible-reading-forwarding-immune (SRFI) dynamics model, based on the reading quantity and forwarding quantity in the Chinese Sina-microblog, to understand the contribution of both users’ contact and participation behavior to the propagation of information about COVID-19. We discussed a particular mechanism of social networks, namely that users in the reading state or the forwarding state may choose to re-enter the susceptible state and thereby gain more chances to contact information, which promotes an active understanding of the epidemic. We analyzed the public opinion data on COVID-19 from the crucial period of January 17, 2020 to April 16, 2020 on the Chinese Sina-microblog and divided the development of events into stages according to the development of the disease outbreak. We performed numerical simulations on two typical COVID-19 topics, based on both the cumulative reading and forwarding quantities, to verify the effectiveness of our model. Then, in each stage, we used a small amount of data for parameter estimation and used the parameterized model for trend prediction, which agreed well with the real data until the next event occurred. For the characteristic parameters, we completed a PRCC sensitivity analysis that provides some insight into the design of effective strategies. We hope this paper provides an efficient tool for predicting the direction of public opinion and stabilizing public emotions as COVID-19 continues to develop.
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.