
COVID-19 information contact and participation analysis and dynamic prediction in the Chinese Sina-microblog.

Fulian Yin¹, Hongyu Pang¹, Xinyu Xia¹, Xueying Shao¹, Jianhong Wu².

Abstract

The outbreak of a novel coronavirus (COVID-19) aroused great public opinion in the Chinese Sina-microblog. To help in designing effective communication strategies during a major public health emergency, we analyze the real data of COVID-19 information and propose a comprehensive susceptible-reading-forwarding-immune (SRFI) model to understand the patterns of key information propagation considering both public contact and participation. We develop the SRFI model, based on the public reading quantity and forwarding quantity that denote contact and participation respectively, and take into account the behavior that users may re-enter another related topic during the attention phase or the participation phase freely. Data fitting using the real data of both reading quantity and forwarding quantity obtained from Chinese Sina-microblog can parameterize the model to make an accurate prediction of the COVID-19 public opinion trend until the next major news item occurs, and the sensitivity analysis provides the basic strategies for communication.


Keywords: COVID-19; Dynamic model; Reading and forwarding; Sina-microblog

Year: 2021 PMID: 33551542 PMCID: PMC7845521 DOI: 10.1016/j.physa.2021.125788

Source DB: PubMed Journal: Physica A ISSN: 0378-4371 Impact factor: 3.263

Introduction

By 24:00 on April 16, 2020, 2,352,198 people worldwide had been infected with the novel coronavirus pneumonia (COVID-19) since it was first identified in Wuhan. Major news items have generated strong fluctuations in public opinion. For example, on January 28, 2020, Nanshan Zhong, a well-known expert in infectious disease control, emphasized that people should not go out for the time being [1]; this appeal attracted wide attention, and people said they would go out only when Nanshan Zhong said they could [2]. Sina-microblog is the most popular social network in China [3], and the outbreak-related topics about COVID-19 grew exponentially on that platform. As the reading quantity and forwarding quantity of “Nanshan Zhong” have reached almost 9.40 billion and 2.05 million on Sina-microblog, understanding how public contact and participation spread in social media and alter public behavior is important for designing effective communication strategies for the rapid implementation of public health interventions. Fig. 1 shows the schematic diagram of COVID-19 information propagation in Sina-microblog, in which the nodes represent users in different states. Original post owners (green nodes) can publish single or multiple epidemic topics belonging to different news items. Take Headline News and Pear Video as examples: Headline News reports on multiple topics whereas Pear Video focuses on a particular topic, and both can be read or forwarded by users attracted by these topics. Reading users (blue nodes) can stay silent and then leave, or can forward after reading and become forwarding users (orange nodes). When users finish one topic, they may join others. Forwarding users can return to the topics later and forward multiple messages, becoming cross-forwarding users (pink nodes), which leads to a multi-level information diffusion process. All users (reading and forwarding users) can choose to read or forward one or multiple topics, so information propagates through one or multiple topics. This accelerates the dissemination of COVID-19 information.

Fig. 1

Information propagation considering both reading and forwarding on multiple topics of the COVID-19.

To the best of our knowledge, there is no appropriate model framework that can be used to analyze public contact propagation and public participation propagation together during a major public health emergency. Given the urgent need to develop theoretical knowledge and practical technologies that support effective communication of public health interventions, we propose a comprehensive susceptible–reading–forwarding–immune (SRFI) model based on both reading quantity and forwarding quantity, which represent public contact and participation respectively, to analyze the public opinion propagation of COVID-19. In particular, we consider the characteristic behavior that users may participate repeatedly in reading or forwarding on different topics during COVID-19 information dissemination. Traditionally, because rumor spreading resembles epidemic spreading in several ways, many scholars have used the susceptible–infected (SI) model [4], [5], the susceptible–infected–recovered (SIR) model [6], [7], the susceptible–exposed–infected–recovered (SEIR) model [8], [9] and the susceptible–infected–susceptible (SIS) model [10] to represent rumor propagation and address relevant issues. Generally, information propagation in social networks was analyzed by stratifying users into three classes: those who have not heard the rumor (ignorants), those actively spreading the rumor (spreaders) and those no longer spreading the rumor (stiflers) [11]. Scholars then introduced new modules into the classical models to better understand the process of information dissemination and to serve various research purposes. In 2011, Zhao et al. [12] provided a more detailed and realistic description of the rumor spreading process by combining a forgetting mechanism with the SIR model of epidemics on an online social blogging platform called LiveJournal. In 2012, Zhao et al. 
[13] extended the classical SIR rumor spreading model by adding a direct link from ignorants to stiflers and a new kind of people—Hibernators in order to reduce the maximum rumor influence, which was called susceptible–infected–hibernator–removed (SIHR) model. In 2012, Xiong et al. [14] proposed a susceptible–contacted–infected–refractory (SCIR) diffusion model, which contained four possible states to characterize information propagation on online microblogs. In 2014, Zhao et al. [15] added the refutation mechanism in homogeneous social networks to the basic model using the Runge–Kutta method, which could help authorities reduce the maximum influence of the rumor. Zhang et al. [16] emphasized a special rumor spreading characteristic called “the cumulative effects of memory” and added the memory mechanism, meanwhile simulated the rumor spreading process on Sina-Microblog. Rui et al. [17] proposed a susceptible–potential–infective–removed (SPIR) model introducing a potential spreader set, which made the state-changing mechanism more reasonable and accurate for the diffusion process. In addition, Zhang et al. [18] used an improved SIR model to posit that a coupled network comprised two categories of nodes, then made use of the data collected from Weibo and WeChat of an actual news event to visualize the information spread process in the cross-network dissemination case of public opinion. Huang et al. [19] established a human dynamics model for deducing retweeting behavior and investigated it by gathering data through Sina API, which revealed that the distribution of the probability of a message to be browsed over time presented power-law characters. In 2018, Zan [20] studied the double rumors spreading with different launch times and introduced two kinds of models: double-susceptible–infected–recovered (DSIR) model and comprehensive-DSIR (C-DSIR) model, which focused on the interaction from old rumor to new rumor and the propagation of two rumors posted successively. 
In 2019, we [21] proposed an epidemic model called susceptible–forwarding–immune (SFI) to capture the propagation trend of a single piece of information in the Sina-microblog based on the forwarding quantity of users. Trpevski et al. [22], Qian et al. [23] and Wang et al. [24] also proposed improved models for the spread of information. In particular, Tanaka et al. [25] added a new module to the traditional model using datasets from the Japanese platform Mixi and from Facebook. After reviewing the literature, we find that most of the experimental data used for simulation come from well-known social platforms around the world, such as Twitter and Facebook. Furthermore, scholars from different countries may choose a local social platform in order to obtain data more conveniently, notably Sina-microblog in China. By analyzing the data sources of other papers, we find that existing studies have used the forwarding quantity of one or multiple Weibo posts, the content of rumors and the browsing behavior of users. To the best of our knowledge, they have not introduced a new module that combines the reading quantity and forwarding quantity of topics, which are significant indicators of public contact and participation. The paper is organized as follows: in Section 2, we analyze the public opinion data at crucial moments of COVID-19; in Section 3, we define the mathematical model for information contact and participation, fit the model to the real data of two typical topics, make a staged prediction of the overall public opinion trend, and then conduct a parameter sensitivity analysis and give effective intervention strategies for information contact and participation; and in Section 4, we draw conclusions.

COVID-19 information contact and participation analysis

Information reading and forwarding topology

The emergency is accompanied by a number of topics related to COVID-19, and reading and forwarding behavior drives the propagation of the event. Reading reflects users’ contact with information during propagation, and forwarding reflects users’ participation in it. To show the propagation process of both public contact and participation more clearly, Fig. 2 gives the network topology, which describes the state of each node in the network at a certain moment in the process of information dissemination. Taking the propagation from three original post owners under three topics as an example, it gives the overall state of users in the integrated information propagation.

Fig. 2

Network topology for COVID-19 propagation by reading and forwarding.

The information posted by the original post owners (red nodes) can be read by individual interested readers (blue nodes), who can then participate in forwarding information about the outbreak (black nodes) or choose to be silent (yellow nodes). In particular, because the epidemic topics are correlated, some co-spreaders repeatedly read and forward related information across different topics and thereby become cross-reading users (pink nodes) and cross-forwarding users (green nodes). Many readers who come into contact with the information also choose to be silent; these are the un-forwarding users (yellow nodes). In the real world, the amount of information changes dynamically and cannot be counted exactly. The reading quantity and forwarding quantity of COVID-19 as a whole are composed of many topics, each with multiple pieces of information. Unlike traditional public hot events, the outbreak is causing great public concern. With the continuous development of COVID-19, there is a high level of repetition in public reading and forwarding across different topics. As reading is a measure of contact and forwarding is a measure of participation in information dissemination, in this paper we build the comprehensive susceptible–reading–forwarding–immune (SRFI) dynamics model, which accounts for the impact of repeated behavior, including “re-reading” and “re-forwarding”, on public opinion propagation.

Information contact and participation analysis

Since the outbreak of COVID-19, around 7900 major topics have appeared in Sina-microblog. Fig. 3 shows the cumulative reading quantity and forwarding quantity from December 31, 2019 to April 16, 2020, where the ordinate is the logarithm of the cumulative reading and forwarding quantities. From December 31, 2019 to January 16, 2020, there were only a limited number of hot topics about COVID-19. From January 17, 2020, the hot topics about the epidemic gradually increased, and after Nanshan Zhong confirmed on January 20, 2020 that COVID-19 could be transmitted from human to human, both quantities increased dramatically. We therefore analyze the contact and participation data of COVID-19 information from January 17, 2020 to April 16, 2020; Table 1 gives the specific values of the cumulative reading quantity and forwarding quantity.

Fig. 3

The reading quantity and forwarding quantity about COVID-19: (a) the reading quantity; (b) the forwarding quantity.

Table 1

The cumulative reading quantity and forwarding quantity of COVID-19.

Date	2020.1.17	2020.1.18	2020.1.19	2020.1.20	2020.1.21	2020.1.22	2020.1.23
CR(*10⁵)	38 633	42 915	63 018	125 739	253 288	476 434	779 867
CF	225 554	264 641	434 164	1 383 953	3 390 573	7 639 887	23 811 193

Date	2020.1.24	2020.1.25	2020.1.26	2020.1.27	2020.1.28	2020.1.29	2020.1.30

CR(*10⁵)	1 009 154	1 239 081	1 482 486	1 702 699	1 885 614	2 062 613	2 235 681
CF	31 929 846	37 850 190	43 424 983	47 026 579	50 333 088	54 259 760	57 741 525

Date	2020.1.31	2020.2.1	2020.2.2	2020.2.3	2020.2.4	2020.2.5	2020.2.6

CR(*10⁵)	2 404 628	2 615 989	2 816 495	2 995 967	3 182 909	3 375 830	3 544 372
CF	62 205 935	69 421 577	75 392 452	78 614 456	84 683 986	90 049 559	99 153 988

Date	2020.2.7	2020.2.8	2020.2.9	2020.2.10	2020.2.11	2020.2.12	2020.2.13

CR(*10⁵)	3 797 159	3 957 690	4 168 665	4 337 476	4 539 071	4 690 281	4 852 670
CF	111 769 392	116 979 114	120 813 833	125 010 556	129 008 667	133 206 593	138 580 808

Date	2020.2.14	2020.2.15	2020.2.16	2020.2.17	2020.2.18	2020.2.19	2020.2.20

CR(*10⁵)	5 028 183	5 200 905	5 355 616	5 507 174	5 660 835	5 800 665	5 934 350
CF	143 356 722	147 554 717	151 821 458	155 138 314	158 447 426	161 128 288	163 301 415

Date	2020.2.21	2020.2.22	2020.2.23	2020.2.24	2020.2.25	2020.2.26	2020.2.27

CR(*10⁵)	6 139 063	6 299 068	6 465 841	6 619 762	6 757 166	6 885 342	7 003 691
CF	165 502 316	167 645 359	170 198 613	173 068 348	176 231 509	178 567 007	180 802 728

Date	2020.2.28	2020.2.29	2020.3.1	2020.3.2	2020.3.3	2020.3.4	2020.3.5

CR(*10⁵)	7 130 939	7 246 748	7 351 752	7 447 584	7 534 199	7 638 853	7 726 308
CF	183 003 765	184 900 668	186 966 898	188 327 150	189 521 175	190 727 593	191 892 477

Date	2020.3.6	2020.3.7	2020.3.8	2020.3.9	2020.3.10	2020.3.11	2020.3.12

CR(*10⁵)	7 798 859	7 866 021	7 916 356	7 977 164	8 050 796	8 140 781	8 286 361
CF	192 894 893	193 762 501	194 739 304	195 609 360	196 481 124	197 673 456	199 077 665

Date	2020.3.13	2020.3.14	2020.3.15	2020.3.16	2020.3.17	2020.3.18	2020.3.19

CR(*10⁵)	8 486 884	8 606 930	8 741 984	8 881 456	9 025 990	9 159 021	9 294 382
CF	200 311 755	201 747 067	203 341 571	205 021 835	213 788 905	219 382 758	222 669 494

Date	2020.3.20	2020.3.21	2020.3.22	2020.3.23	2020.3.24	2020.3.25	2020.3.26

CR(*10⁵)	9 454 936	9 613 858	9 714 204	9 788 695	9 846 275	9 940 630	10 024 169
CF	225 005 650	227 086 475	228 026 043	228 598 328	229 318 397	230 114 834	230 719 885

Date	2020.3.27	2020.3.28	2020.3.29	2020.3.30	2020.3.31	2020.4.1	2020.4.2

CR(*10⁵)	10 116 518	10 193 220	10 274 610	10 401 702	10 509 970	10 613 594	10 719 037
CF	231 501 263	232 111 489	232 756 584	233 655 217	234 849 798	235 876 483	236 758 976

Date	2020.4.3	2020.4.4	2020.4.5	2020.4.6	2020.4.7	2020.4.8	2020.4.9

CR(*10⁵)	10 807 269	11 026 877	11 138 821	11 231 162	11 333 261	11 422 902	11 503 056
CF	237 811 162	252 986 547	255 099 056	256 111 309	257 005 795	258 764 094	259 979 183

Date	2020.4.10	2020.4.11	2020.4.12	2020.4.13	2020.4.14	2020.4.15	2020.4.16

CR(*10⁵)	11 562 237	11 617 701	11 657 360	11 699 565	11 743 954	11 806 848	11 866 802
CF	260 392 077	260 727 437	260 984 402	261 217 740	261 480 292	261 836 224	262 202 217

Fig. 4 shows the box diagram of the reading quantity and forwarding quantity from January 17, 2020 to April 16, 2020, which reflect users’ contact with and participation in information, respectively. In terms of order of magnitude, the reading quantity is larger than the forwarding quantity and its range is wider: the reading quantity and the forwarding quantity vary within about 3 × 10¹⁰ and 7 × 10⁶, respectively. Except for the outlying forwarding quantities caused by several emergencies, the two propagation trends are consistent, and both are in line with the laws of dynamic development. However, the reading quantity and forwarding quantity also differ in features such as the median, so it is necessary to build a model that combines these two attributes.
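The magnitudes quoted for the box diagram can be checked against Table 1. The sketch below assumes that the box diagram is drawn from daily increments of the cumulative series (an assumption, since the text does not say how the daily quantities were derived) and uses only the first seven days of Table 1; the function names are illustrative.

```python
from statistics import quantiles

# First seven days of Table 1 (2020.1.17 to 2020.1.23).
# CR is reported in units of 10^5 readings; CF is a raw count.
cr_cumulative = [38633, 42915, 63018, 125739, 253288, 476434, 779867]
cf_cumulative = [225554, 264641, 434164, 1383953, 3390573, 7639887, 23811193]


def daily_increments(cumulative):
    """Differences between consecutive cumulative values."""
    return [b - a for a, b in zip(cumulative, cumulative[1:])]


def five_number_summary(values):
    """Minimum, three quartiles and maximum: the numbers a box diagram draws."""
    q1, q2, q3 = quantiles(values, n=4, method="inclusive")
    return min(values), q1, q2, q3, max(values)


# Assumption: daily quantity = difference of consecutive cumulative values.
daily_reading = [x * 10**5 for x in daily_increments(cr_cumulative)]
daily_forwarding = daily_increments(cf_cumulative)

reading_summary = five_number_summary(daily_reading)
forwarding_summary = five_number_summary(daily_forwarding)
print("reading   :", reading_summary)
print("forwarding:", forwarding_summary)
```

Under this assumption the largest daily reading increment in the first week is 303 433 × 10⁵, which is about 3 × 10¹⁰ and matches the range quoted for the reading quantity.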

Fig. 4

The box diagram of the number of reading and forwarding.

In order to analyze the public opinion trend more clearly, Fig. 5 gives the reading quantity, the forwarding quantity and the number of topics as line charts and histograms from January 17, 2020 to April 16, 2020. Since January 17, 2020, the reading and forwarding quantities of COVID-19 have gradually increased, and the emergence of important events or hot topics has had a major impact on public opinion. On this basis, the overall development so far can be divided into four stages. The first stage, from January 17, 2020 to February 19, 2020, in which both quantities increased gradually, is the outbreak stage. In the beginning, topic # 5 rumors of viral pneumonia in Wuhan # attracted public attention. Wuhan then began its closure on January 23, 2020, and with the emergence of topics such as # Wuhan Bus Metro Suspension of Operation #, the reading and forwarding quantities of COVID-19 began to increase dramatically and reached their highest values, while the number of topics began to increase gradually, leading public opinion into the outbreak stage. In addition, the topics #Doctor Wenliang Li passed away#, posted on February 7, 2020, and #Easy epidemic prevention station#, posted on February 19, 2020, aroused great public attention, causing both quantities to peak again as public opinion continued to ferment. The second stage is from February 19, 2020 to March 16, 2020, in which both quantities started to decrease as the domestic cases stabilized and improved, and the overall number of topics increased and then stabilized within a certain range in the later period. The third stage, from March 16, 2020 to April 3, 2020, is another outbreak stage: the overall trend rose as the disease became more serious abroad, and the number of topics rose again. With the release of #support Hubei Medical Team to evacuate # on March 17, 2020 and of other topics related to COVID-19 around the globe, both quantities increased dramatically again, and public opinion entered another outbreak stage. The fourth stage, from April 3, 2020 to April 16, 2020, is the occasional fluctuation stage. April 4, 2020 was Tomb-Sweeping Day in China, and topics mourning the anti-epidemic heroes, such as # Three minutes of silence in all China #, began to increase; these led many users to read and forward and caused occasional fluctuations in the reading quantity and forwarding quantity. This division of COVID-19 public opinion into stages helps us understand the trend caused by the hot topics of the entire epidemic.

Fig. 5

The trends chart of the topics about COVID-19, where the blue line represents the number of reading and the red line represents the number of forwarding respectively, and the histograms represent the number of topics per day.

COVID-19 information contact and participation prediction

Susceptible–reading–forwarding–immune model (SRFI)

The propagation dynamics model based on the reading quantity and forwarding quantity of COVID-19 constructed in this paper is shown in Fig. 6. Here, we only consider the accessible population in the process of information propagation and pay attention to the information diffusion caused by both users’ reading behavior and forwarding behavior. Assuming that the number of users (N) who can contact information in the process of propagation on Sina-microblog remains unchanged, we stratify the population into four states: the susceptible state (S), in which users are unaware of but susceptible to the information of the event; the reading state (R), in which users have read the information and may forward it; the forwarding state (F), in which users have actively forwarded the information to influence other users; and the immune state (I), in which users have read or forwarded the information but no longer read or forward it even if they receive it again.

Fig. 6

A schematic diagram to illustrate information spreading considering both reading and forwarding in the population with four different states: susceptible (S), reading (R), forwarding (F) and immune (I).

A susceptible user reads a piece of information at an average exposure rate, and a user in the reading state leaves it for another state at a deactivation rate. Forwarding users become immune users, who are inactive with respect to the event, at an average inactive rate, whose reciprocal is the average duration for which an F-user remains active. The core of our model is to study the role of both repeated reading and repeated forwarding, which arise because users are exposed to different information about COVID-19. Hence, alongside the forwarding probability, with which reading users become forwarding users, and the immunity probability, with which reading users keep silent in the event and go straight to the immune state, we introduce a “re-reading” probability with which a reading user returns to the susceptible state. In addition, we use a “re-forwarding” probability with which a forwarding user returns to the susceptible state for a new round of COVID-19 information. Each user has a unique state: at any given time, each user is in exactly one of the susceptible, reading, forwarding or immune states. The resulting SRFI dynamics model is a system of differential equations for the numbers of susceptible, reading, forwarding and immune users, where ′ denotes the derivative with respect to time t. The behavior transformation and state transition of the population can be interpreted as follows: Reading: An active forwarding user contacts an average number of users per unit time, and a contacted user is susceptible with probability equal to the susceptible fraction of the population; multiplying by the total number of active forwarding users gives the number of new reading users. Forwarding: Reading users leave the reading state at the deactivation rate and enter the forwarding state with the forwarding probability, which gives the number of new forwarding users. 
Re-enter: As events unfold, there are two ways to generate repeated behavior and initiate a new round of reading and forwarding: users who contact one information and yearn for getting more information about COVID-19 will re-enter to the susceptible state from reading state, the average number of re-entered users is per unit time; users who have forwarded one information and be interested in related information about COVID-19 will re-enter to susceptible from forwarding state, the average number of the re-entered users is per unit time. Immune: Some reading users will not participate in the forwarding and become the immune users directly because they want to keep silent in the event, and the number of direct immune users is . And some forwarding users will go to the immune state out of active time, the number of inactive users is . The Sina-microblog provides the number of cumulative reading population and forwarding population which is the total times of reading and forwarding within a topic about COVID-19, and we calculate the sum of the whole event, given by The corresponding differential equation can be expressed as: Considering the initial condition: , , and . The final condition from Eqs. (4)–(6), it follows that , and are all increasing since , and . Therefore, the final states are , , , and are tending to 0 , and . Here and are the final size of the COVID-19 reading and forwarding. In addition, the number of maximal reading users and maximal forwarding users are and , respectively. We define the reproduction ratio to measure whether the outbreak forwarding quantity was likely to break out. In the initial post of the COVID-19, the forwarding outbreak is given by , and the population will never take off since due to the decreasing of . Then we deduce Public opinion reproduction ratio : The reproduction ratio is defined to measure whether public opinion was likely to break out. 
We use the calculation method of basic reproduction number developed in [26], and rewrite our model as follows: where and Calculate the derivatives and at no information propagation equilibrium , we can obtain and The roots of the characteristic equation can deduce the eigenvalues of the matrix : Because is not negative, we have Here, based on the extension of , we define the (effective) reproduction number to describe the outbreak of public opinion at each time t, and it has more practical significance in the dynamic development process. Then we have The represents the propagation capability of each period and it is time-varying, which is determined by the average exposures rate , the average inactive rate , the forwarding probability , and the susceptible users . When , it means that the comprehensive public opinion will decline which implies the propagation can never break out. The indicates that the comprehensive public opinion grows exponentially initially.
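The verbal transition rules of this section (Reading, Forwarding, Re-enter, Immune) can be turned into a small simulation. The sketch below is one system consistent with those rules, not the authors' equations: the symbol assignment (`beta` exposure rate, `alpha` deactivation rate, `gamma` inactive rate, `p` forwarding probability, `q` immunity probability, `1 - p - q` re-reading probability, `theta` re-forwarding probability), the exact functional form, and all numerical values are assumptions chosen for illustration. The threshold function is the quantity the next-generation approach would give under this assumed form.

```python
def srfi_rhs(state, params):
    """Right-hand side of an ASSUMED SRFI system (illustrative, not the paper's)."""
    S, R, F, I, CR, CF = state
    N = S + R + F + I                      # total population, constant in time
    beta, alpha, gamma = params["beta"], params["alpha"], params["gamma"]
    p, q, theta = params["p"], params["q"], params["theta"]
    new_reading = beta * S * F / N         # S -> R through contact with forwarders
    leave_reading = alpha * R              # users leaving the reading state
    leave_forwarding = gamma * F           # users leaving the forwarding state
    dS = -new_reading + (1 - p - q) * leave_reading + theta * leave_forwarding
    dR = new_reading - leave_reading
    dF = p * leave_reading - leave_forwarding
    dI = q * leave_reading + (1 - theta) * leave_forwarding
    # CR and CF accumulate every new reading and every new forwarding event.
    return [dS, dR, dF, dI, new_reading, p * leave_reading]


def rk4_step(state, params, h):
    """One classical fourth-order Runge-Kutta step of size h."""
    def shifted(base, slope, scale):
        return [b + scale * s for b, s in zip(base, slope)]
    k1 = srfi_rhs(state, params)
    k2 = srfi_rhs(shifted(state, k1, h / 2), params)
    k3 = srfi_rhs(shifted(state, k2, h / 2), params)
    k4 = srfi_rhs(shifted(state, k3, h), params)
    return [x + h / 6 * (a + 2 * b + 2 * c + d)
            for x, a, b, c, d in zip(state, k1, k2, k3, k4)]


def simulate(initial, params, days, steps_per_day=20):
    """Return the state at day 0, 1, ..., days."""
    h = 1.0 / steps_per_day
    state, daily = list(initial), [list(initial)]
    for _ in range(days):
        for _ in range(steps_per_day):
            state = rk4_step(state, params, h)
        daily.append(state)
    return daily


def reproduction_threshold(S, N, params):
    """Growth threshold p*beta*S/(N*gamma) under the assumed form above."""
    return params["p"] * params["beta"] * S / (N * params["gamma"])


# Illustrative values only; none of them are taken from Table 4 or Table 5.
params = {"beta": 2.0, "alpha": 0.5, "gamma": 0.3, "p": 0.4, "q": 0.3, "theta": 0.2}
initial = [9990.0, 0.0, 10.0, 0.0, 0.0, 10.0]   # S, R, F, I, CR, CF
trajectory = simulate(initial, params, days=60)
print("threshold at t = 0:", reproduction_threshold(9990.0, 10000.0, params))
print("final cumulative reading and forwarding:", trajectory[-1][4], trajectory[-1][5])
```

In this assumed form the threshold depends only on the exposure rate, the inactive rate, the forwarding probability and the current number of susceptible users, which agrees with the dependence stated in the text; the deactivation rate cancels because every reading user eventually leaves the reading state.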

Data fitting

Parameter estimation method: To use our SRFI model to explore qualitative differences in behavior for prediction, we use the least squares (LS) method to estimate the model parameters and the initial data of our SRFI model. The quantities to be estimated are collected in a parameter vector, and the model is solved numerically for each candidate parameter vector to obtain the corresponding cumulative reading quantity and cumulative forwarding quantity. The LS error function compares these computed values with the actual cumulative reading quantity and forwarding quantity at the sampling times 0, 1, 2, …. In our paper, we use DEDiscover software to solve this LS problem. Data description: In order to analyze public opinion with different characteristics, the following two typical events are selected from the whole duration of the public opinion outbreak. Table 2 shows the reading quantity and forwarding quantity of # Refuse to eat wild animals #, which had a slow outbreak, and Table 3 shows the reading quantity and forwarding quantity of # Real-time broadcast of joint prevention and control of epidemic situation #, which had a fast outbreak.
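The least squares idea can be illustrated without DEDiscover. The sketch below assumes an unweighted sum of squared residuals over both cumulative series (the exact error function used in the paper is not reproduced here) and replaces the optimizer with a brute-force search over a few candidates. The saturating toy model, the function names and the synthetic data are all hypothetical and stand in for the SRFI solution and the Sina-microblog data.

```python
import math


def ls_error(model_cr, model_cf, observed_cr, observed_cf):
    """Sum of squared residuals over both series (assumed, unweighted form)."""
    if len(model_cr) != len(observed_cr) or len(model_cf) != len(observed_cf):
        raise ValueError("model and observed series must have the same length")
    reading = sum((m - o) ** 2 for m, o in zip(model_cr, observed_cr))
    forwarding = sum((m - o) ** 2 for m, o in zip(model_cf, observed_cf))
    return reading + forwarding


def toy_model(rate, days=10):
    """Hypothetical saturating curves standing in for the model's CR and CF."""
    cr = [100.0 * (1.0 - math.exp(-rate * k)) for k in range(days)]
    cf = [10.0 * (1.0 - math.exp(-rate * k)) for k in range(days)]
    return cr, cf


def grid_search(candidates, model, observed_cr, observed_cf):
    """Return the candidate with the smallest LS error (brute-force stand-in)."""
    return min(candidates,
               key=lambda c: ls_error(*model(c), observed_cr, observed_cf))


# Synthetic "observations" generated with rate = 0.3, so the search should recover it.
observed_cr, observed_cf = toy_model(0.3)
best_rate = grid_search([0.1, 0.2, 0.3, 0.4], toy_model, observed_cr, observed_cf)
print("best rate:", best_rate)
```

One design point worth noting: because the reading series is several orders of magnitude larger than the forwarding series, an unweighted sum is dominated by the reading residuals, so a practical fit may need to rescale or weight the two terms.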

Table 2

The cumulative reading quantity and forwarding quantity of # Refuse to eat wild animals #.

Date	2020.2.29	2020.3.1	2020.3.2	2020.3.3	2020.3.4	2020.3.5	2020.3.6	2020.3.7
CR(*10⁵)	14	19	249	504	712	855	992	1142
CF	1270	3780	9666	33 657	51 138	64 398	71 008	89 619

Date	2020.3.8	2020.3.9	2020.3.10	2020.3.11	2020.3.12	2020.3.13	2020.3.14	2020.3.15

CR(*10⁵)	1270	1368	1443	1486	1508	1514	1523	1536
CF	106 938	109 223	109 589	110 431	110 888	112 583	112 861	113 140

Date	2020.3.16	2020.3.17	2020.3.18	2020.3.19	2020.3.20	2020.3.21	2020.3.22

CR(*10⁵)	1549	1551	1551	1551	1551	1551	1551
CF	113 640	113 778	114 011	114 117	114 170	114 247	114 281

Table 3

The cumulative reading quantity and forwarding quantity of # Real-time broadcast of joint prevention and control of epidemic situation #.

Date	2020.1.27	2020.1.28	2020.1.29	2020.1.30	2020.1.31	2020.2.1	2020.2.2	2020.2.3
CR(*10⁵)	2361	8448	14 146	19 897	23 839	26 616	30 331	30 799
CF	61 134	119 339	181 997	240 858	263 552	284 178	314 132	321 897

Date	2020.2.4	2020.2.5	2020.2.6	2020.2.7	2020.2.8	2020.2.9	2020.2.10	2020.2.11

CR(*10⁵)	31 838	32 261	32 424	32 619	33 451	33 551	33 579	33 603
CF	334 686	341 232	343 963	347 308	360 530	362 177	363 453	364 203

Date	2020.2.12	2020.2.13	2020.2.14	2020.2.15	2020.2.16	2020.2.17	2020.2.18	2020.2.19

CR(*10⁵)	33 656	33 721	33 752	33 767	33 787	33 812	33 854	33 870
CF	364 682	365 318	365 673	366 002	366 323	366 652	367 272	367 698

Date	2020.2.20	2020.2.21	2020.2.22	2020.2.23	2020.2.24	2020.2.25	2020.2.26	2020.2.27

CR(*10⁵)	33 908	33 961	33 984	34 000	34 021	34 039	34 054	34 069
CF	368 275	368 875	369 261	369 629	370 098	370 585	370 899	371 286

Date	2020.2.28	2020.2.29	2020.3.1	2020.3.2

CR(*10⁵)	34 081	34 081	34 081	34 081
CF	371 720	372 086	372 420	372 593

Data fitting results: As shown in Fig. 7, we performed data fitting on the real data of # Refuse to eat wild animals # in Table 2, where the red stars and blue stars denote the actual cumulative numbers of the reading and forwarding populations, respectively, and the red line and blue line denote the estimated cumulative numbers of the reading and forwarding populations, respectively. The initial outbreak of the topic is slow, and our SRFI model achieves an accurate estimation.

Fig. 7

The data fitting results of # Refuse to eat wild animals #.

Table 4 gives some important values of the early-period parameter estimation of # Refuse to eat wild animals #. Relative to the topic # Real-time broadcast of joint prevention and control of epidemic situation #, the initial number of susceptible users is smaller, meaning that fewer users are susceptible at the beginning of the information explosion, so the topic breaks out slowly. In addition, the average inactive rate is larger, which means that the active period of forwarding users is short, so they cannot influence other susceptible users for long, leading to a smaller final size of the cumulative reading and forwarding quantities.

Table 4

Some important values of parameter estimation of # Refuse to eat wild animals #.

Name	Estimated value	Standard error	CI low bound	CI high bound	p-value	t-statistic	Min	Max
S0	4.2710×10⁴	52.7936	6.0805×10⁵	6.0826×10⁶	1.5212×10⁻²¹⁸	1.1520×10⁴	1.0000×10⁴	1.0000×10⁴
α	0.8611	0.0264	0.3484	0.4535	9.8689×10⁻²⁴	15.2126	0.0000	2.0000
β	0.1450	0.0029	0.0057	0.0172	1.8223×10⁻⁴	3.9562	0.0000	1.0000
γ	0.9782	0.0262	0.9429	1.0475	5.7330×10⁻⁴⁸	37.9579	0.0000	1.0000
p	7.9000×10⁻⁴	8.3598×10⁻⁵	1.2329×10⁻⁴	4.5684×10⁻⁴	9.0251×10⁻⁴	3.4697	0.0000	1.0000
q	0.2580	0.0117	0.0865	0.1332	5.8565×10⁻¹⁴	9.3897	0.0000	1.0000
θ	0.9781	0.0322	0.8578	0.9861	4.7696×10⁻⁴	28.6591	0.0000	1.0000

As shown in Fig. 8, we performed data fitting on the real data of # Real-time broadcast of joint prevention and control of epidemic situation # in Table 3, where the red stars and blue stars denote the actual cumulative numbers of the reading and forwarding populations, respectively, and the red line and blue line denote the estimated cumulative numbers of the reading and forwarding populations, respectively. The initial outbreak of the topic # Real-time broadcast of joint prevention and control of epidemic situation # is fast: it entered a stage of rapid explosion directly at the beginning, and our SRFI model achieves an accurate estimation.

Fig. 8

The data fitting results of # Real-time broadcast of joint prevention and control of epidemic situation #.

Table 5 gives some important values of the early-period parameter estimation for # Real-time broadcast of joint prevention and control of epidemic situation #. The initial number of susceptible users is larger, meaning that more users are susceptible at the beginning of the information explosion, so the topic breaks out rapidly. In addition, the parameter is smaller, indicating that more users remain active and affect other susceptible users, so the topic quickly grows to a larger final size of the cumulative reading and forwarding quantities.

Table 5

Some important values of parameter estimation of # Real-time broadcast of joint prevention and control of epidemic situation #.

Name	Estimated value	Standard error	CI low bound	CI high bound	p-value	t-statistic	Min	Max
S0	2.1879×10^6	29.4563	2.1878×10^6	2.1879×10^6	2.0350×10^−259	7.4275×10^4	1.0000×10^4	1.0000×10^7
α	0.1959	0.0257	0.1446	0.2472	1.3350×10^−10	7.6283	0.0000	4.0000
β	0.0046	2.2555×10^−4	0.0042	0.0051	4.9300×10^−30	20.4641	0.0000	1.0000
γ	0.8141	0.0159	0.7822	0.8459	3.3164×10^−54	51.1067	0.0000	1.0000
p	9.0233×10^−5	8.4946×10^−6	7.3269×10^−5	1.0720×10^−4	7.6349×10^−16	10.6225	0.0000	1.0000
q	0.7086	0.0228	0.6630	0.7541	1.0095×10^−40	31.0714	0.0000	1.0000
θ	0.7363	0.0157	0.7049	0.7678	9.0737×10^−52	46.7712	0.0000	1.0000
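
The columns of Table 5 are related by simple arithmetic, which gives a quick consistency check on the reported values. The following is a minimal sketch, assuming that the t-statistic is the estimate divided by its standard error and that the interval is the estimate plus or minus a fixed multiple of the standard error. The multiplier `T_CRIT = 1.996` is an assumption back-calculated from the α row and is not stated in the paper; the function names are illustrative.

```python
# Consistency check on Table 5: estimate, standard error, t-statistic and CI bounds.
# ASSUMPTION: symmetric interval estimate +/- T_CRIT * SE, with T_CRIT inferred
# from the alpha row of the table (the paper does not state the multiplier).

T_CRIT = 1.996  # assumed two-sided multiplier


def t_statistic(estimate: float, std_error: float) -> float:
    """Ratio of a parameter estimate to its standard error."""
    return estimate / std_error


def confidence_interval(estimate: float, std_error: float, t_crit: float = T_CRIT):
    """Symmetric interval around the estimate."""
    half_width = t_crit * std_error
    return estimate - half_width, estimate + half_width


# (estimate, standard error) pairs copied from Table 5
table5 = {
    "alpha": (0.1959, 0.0257),
    "gamma": (0.8141, 0.0159),
    "q": (0.7086, 0.0228),
}

for name, (est, se) in table5.items():
    low, high = confidence_interval(est, se)
    print(f"{name}: t = {t_statistic(est, se):.2f}, CI = [{low:.4f}, {high:.4f}]")
```

Small differences from the tabulated t-statistics and bounds are expected, because the table entries are themselves rounded.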

COVID-19 information contact and participation prediction

We have divided the main development of COVID-19 information dissemination into four stages according to the analysis of information contact and participation from January 17, 2020, to April 16, 2020. Although we cannot control the occurrence of emergency incidents and the dissemination of information, it is very important that, in each stage, we can predict the trend of public opinion based on the existing data before the emergency comes. Fig. 9 shows the prediction of reading and forwarding quantities of COVID-19 and Table 6 gives the estimated parameters with DEDiscover software.

Fig. 9

The prediction of reading and forwarding quantities of COVID-19.

Table 6

Parameter results.

	β	α	γ	p	q	θ	S0
Fig. 6(a1)	0.0049	0.0018	0.3553	6.0042×10^−4	0.0307	0.3437	6.1797×10^5
Fig. 6(a2)	0.0088	0.0310	0.4841	2.9000×10^−4	0.0392	0.5191	5.2450×10^5
Fig. 6(b1)	0.0152	0.9306	0.0450	7.9350×10^−5	0.3230	0.4370	2.7494×10^5
Fig. 6(b2)	0.0150	1.2250	0.0381	8.6383×10^−5	0.2831	0.5673	3.4533×10^5
Fig. 6(c1)	0.0360	0.7140	0.0265	1.0531×10^−4	0.2330	0.1050	4.5804×10^4
Fig. 6(c2)	0.0099	0.7056	0.0140	1.5568×10^−4	2.7685×10^−4	0.0043	1.0001×10^5
Fig. 6(d1)	0.0280	2.0401	0.1041	6.0000×10^−5	0.8191	0.3230	1.1055×10^5
Fig. 6(d2)	0.0450	0.4304	0.0110	2.4700×10^−4	0.4040	0.6360	1.9527×10^5

At the first stage, the topic #5 rumors of viral pneumonia in Wuhan# attracted public attention, and the cumulative reading and forwarding quantities of COVID-19 began to increase. We predict the public opinion trend of COVID-19 with the data from January 17, 2020 to January 25, 2020, and the prediction fits the actual data well until January 28, 2020, as shown in Fig. 9(a1). We then extend the data to January 31, 2020 and predict again, obtaining a satisfactory result until the end of this stage, as shown in Fig. 9(a2). The forecast curves of the predicted reading and forwarding populations show that both public contact and participation are increasing at the beginning of the epidemic.

With the gradual stabilization of COVID-19 in China, the cumulative reading and forwarding quantities began to grow slowly and public opinion entered the second stage. We use the data between February 19, 2020 and February 25, 2020 to estimate the parameters and predict the public opinion trend of the following days, as shown in Fig. 9(b1). We then extend the data until February 26, 2020 to predict the rest of the public opinion trend in this phase, with very good prediction results, as shown in Fig. 9(b2). At this stage, the predicted reading and forwarding populations have a downward trend, which means that public contact and participation are close to saturation.

The second stage ended unexpectedly because of the seriousness of COVID-19 overseas, and public opinion entered the third stage. We use the data between March 16, 2020 and March 22, 2020 to estimate the parameters and predict the public opinion trend, as shown in Fig. 9(c1). In addition, we extend the data until March 23, 2020 to predict the rest of the public opinion trend in this phase, as shown in Fig. 9(c2). At this stage, the predicted reading and forwarding populations have a large value at the starting point and then begin to fall. From this decrease in reading and forwarding, we can infer that the dramatic outbreak of public opinion appears only at the beginning of this stage.

Public opinion then entered the fourth stage because of occasional fluctuations in the reading quantity and forwarding quantity. We use the data between April 3, 2020 and April 9, 2020 to obtain a good prediction for the next three days, as shown in Fig. 9(d1). We then extend the data until April 10, 2020 and obtain a good prediction until the end of this stage, as shown in Fig. 9(d2). At the beginning of this stage, the predicted reading and forwarding populations have a maximum value and then show a downward trend. From this decrease in reading and forwarding, we can conclude that the dramatic outbreak of public opinion appears only at the beginning of this stage.

Table 7 gives the predicted effective reproduction ratio ℜe at each date, computed with the most recent estimation results. Fig. 10 shows the daily reading quantity, forwarding quantity and ℜe together from January 17, 2020 to April 16, 2020; the three curves have similar trends at each stage. Our SRFI model predicts that ℜe takes its greatest value, 6.5656, in the early stage of COVID-19, when public opinion breaks out quickly. With the development of the epidemic, ℜe falls below 1 in the second stage, meaning that public opinion is gradually calming down. It then increases again at the beginning of the third stage but remains below 1, indicating that although reading and forwarding users increased at first, public opinion will not continue to erupt. With the occurrence of an emergency, ℜe suddenly rises above 1, which indicates that, under the trend of overall stability, information on COVID-19 will continue to burst in waves as emergencies occur.

Table 7

The results of the predicted public opinion reproduction ratio ℜe.

Data	2020.1.17	2020.1.18	2020.1.19	2020.1.20	2020.1.21	2020.1.22	2020.1.23
ℜe	6.5656	6.5444	6.3993	6.1255	5.6959	5.1075	4.4263
Data	2020.1.24	2020.1.25	2020.1.26	2020.1.27	2020.1.28	2020.1.29	2020.1.30
ℜe	3.7758	3.2554	2.8779	2.6011	2.3918	2.2262	2.0905

Data	2020.1.31	2020.2.1	2020.2.2	2020.2.3	2020.2.4	2020.2.5	2020.2.6
ℜe	1.9776	1.8819	1.7994	1.7273	1.6635	1.6066	1.5557

Data	2020.2.7	2020.2.8	2020.2.9	2020.2.10	2020.2.11	2020.2.12	2020.2.13
ℜe	1.5101	1.4686	1.43	1.3956	1.363	1.3337	1.3058

Data	2020.2.14	2020.2.15	2020.2.16	2020.2.17	2020.2.18	2020.2.19	2020.2.20
ℜe	1.28	1.2563	1.2339	1.2126	1.1926	0.6044	0.1452

Data	2020.2.21	2020.2.22	2020.2.23	2020.2.24	2020.2.25	2020.2.26	2020.2.27
ℜe	0.2336	0.3384	0.4409	0.5284	0.5973	0.65	0.6902

Data	2020.2.28	2020.2.29	2020.3.1	2020.3.2	2020.3.3	2020.3.4	2020.3.5
ℜe	0.7209	0.7447	0.7633	0.7781	0.7898	0.7993	0.8069

Data	2020.3.6	2020.3.7	2020.3.8	2020.3.9	2020.3.10	2020.3.11	2020.3.12
ℜe	0.8131	0.8181	0.8223	0.8257	0.8285	0.8309	0.8328

Data	2020.3.13	2020.3.14	2020.3.15	2020.3.16	2020.3.17	2020.3.18	2020.3.19
ℜe	0.8344	0.8357	0.8368	0.9704	0.9617	0.9506	0.9361

Data	2020.3.20	2020.3.21	2020.3.22	2020.3.23	2020.3.24	2020.3.25	2020.3.26
ℜe	0.9173	0.8931	0.8620	0.8223	0.7726	0.7114	0.6385

Data	2020.3.27	2020.3.28	2020.3.29	2020.3.30	2020.3.31	2020.4.1	2020.4.2
ℜe	0.5553	0.4656	0.3751	0.2907	0.2178	0.1591	0.4602

Data	2020.4.3	2020.4.4	2020.4.5	2020.4.6	2020.4.7	2020.4.8	2020.4.9
ℜe	2.2455	0.162	0.1976	0.2397	0.2884	0.343	0.402

Data	2020.4.10	2020.4.11	2020.4.12	2020.4.13	2020.4.14	2020.4.15	2020.4.16
ℜe	0.4628	0.5223	0.577	0.6246	0.6636	0.6941	0.7167
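
The threshold reading of Table 7 (ℜe above 1 means public opinion keeps erupting, ℜe below 1 means it calms down) can be checked mechanically. The sketch below uses a short excerpt of Table 7 and reports the dates on which ℜe changes side relative to 1. The function name and the choice of excerpt are illustrative assumptions; all omitted dates between the listed ones have ℜe below 1 in the table.

```python
# Excerpt of Table 7: (date, predicted reproduction ratio). Dates omitted
# between entries all have a ratio below 1 in the table.
re_series = [
    ("2020.2.17", 1.2126),
    ("2020.2.18", 1.1926),
    ("2020.2.19", 0.6044),
    ("2020.2.20", 0.1452),
    ("2020.3.16", 0.9704),
    ("2020.4.2", 0.4602),
    ("2020.4.3", 2.2455),
    ("2020.4.4", 0.162),
]


def threshold_crossings(series, threshold=1.0):
    """Return (date, direction) for every change of side relative to the threshold."""
    crossings = []
    for (_, previous), (date, current) in zip(series, series[1:]):
        if previous >= threshold > current:
            crossings.append((date, "falls below"))
        elif previous < threshold <= current:
            crossings.append((date, "rises above"))
    return crossings


for date, direction in threshold_crossings(re_series):
    print(f"{date}: reproduction ratio {direction} 1")
```

On this excerpt the crossings fall on February 19 (start of the second stage) and on April 3 and April 4 (the short burst at the start of the fourth stage), which matches the stage boundaries described in the text.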

Fig. 10

The prediction of reading and forwarding quantities of COVID-19.

COVID-19 information contact and participation sensitivity analysis and intervention strategies

To further analyze the different parameters of the comprehensive SRFI model, we use partial rank correlation coefficients (PRCCs) [27], based on 1000 samples of the various input parameters, against the threshold condition to evaluate the sensitivity. In the histogram and scatter diagrams of dependence, a positive correlation means that the corresponding index increases as the parameter value increases; conversely, a negative correlation means that the index decreases as the parameter increases. Fig. 11 shows that the public opinion reproduction ratio ℜe is strongly positively affected by the average exposure rate, the forwarding probability and the initial number of susceptible users S0, and negatively affected by the probability of inactivation after forwarding. This result confirms the correctness of our derivation of ℜe in Formula (15). Therefore, the average exposure rate, the forwarding probability, the probability of inactivation after forwarding, and the initial value S0 are the key factors in determining whether an event breaks out. Thus, if we want public opinion to explode, as for positive topics about COVID-19, we can increase the average exposure rate, the forwarding probability and S0, and decrease the probability of inactivation after forwarding. Since the average exposure rate is the rate at which a user contacts the information and S0 is the initial value of the susceptible population, we can increase these two values by persuading opinion leaders to participate in the information propagation: each opinion leader motivates a large number of new susceptible users. Besides, we can make the content richer and more interesting, attracting people to forward the topics and keeping them active in forwarding for longer, which increases the forwarding probability (influenced by users' interest in topics) and decreases the probability of inactivation after forwarding.

Correspondingly, if we do not want public opinion to erupt, as for rumor topics about COVID-19, we need to reduce the parameter values that raise ℜe. For example, we can motivate the platform to delete the relevant rumor topics about COVID-19, which effectively reduces new reading and curbs the outbreak of public opinion.
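
To make the PRCC procedure concrete, the following is a minimal pure-Python sketch, not the authors' implementation. It draws 1000 independent samples, rank-transforms inputs and output, and reads each partial correlation off the inverse of the rank correlation matrix. The function `toy_index` is a hypothetical stand-in for the threshold quantity: it is not Formula (15), and the sampling ranges and the names `exposure_rate`, `forwarding_prob` and `inactivation_prob` are assumptions chosen only to illustrate the sign pattern described above.

```python
import random


def ranks(values):
    """Rank-transform a list (1 = smallest); continuous samples are assumed tie-free."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0.0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = float(rank)
    return result


def correlation_matrix(columns):
    """Pearson correlation matrix of a list of equally long columns."""
    n = len(columns[0])
    means = [sum(c) / n for c in columns]
    centered = [[v - m for v in c] for c, m in zip(columns, means)]
    norms = [sum(v * v for v in c) ** 0.5 for c in centered]
    return [[sum(a * b for a, b in zip(ci, cj)) / (ni * nj)
             for cj, nj in zip(centered, norms)]
            for ci, ni in zip(centered, norms)]


def invert(matrix):
    """Gauss-Jordan inversion with partial pivoting."""
    n = len(matrix)
    aug = [row[:] + [float(i == j) for j in range(n)] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def prcc(samples, outputs):
    """Partial rank correlation of each input with the output, given the other inputs."""
    names = list(samples)
    columns = [ranks(samples[name]) for name in names] + [ranks(outputs)]
    precision = invert(correlation_matrix(columns))
    y = len(names)  # index of the output column
    return {name: -precision[i][y] / (precision[i][i] * precision[y][y]) ** 0.5
            for i, name in enumerate(names)}


def toy_index(exposure, forwarding, inactivation, s0):
    """HYPOTHETICAL index, monotone in each input; not the paper's Formula (15)."""
    return s0 * exposure * forwarding / inactivation


random.seed(0)
n = 1000  # sample count used in the text
samples = {  # ASSUMED sampling ranges, for illustration only
    "exposure_rate": [random.uniform(0.5, 1.0) for _ in range(n)],
    "forwarding_prob": [random.uniform(0.5, 1.0) for _ in range(n)],
    "inactivation_prob": [random.uniform(0.5, 1.0) for _ in range(n)],
    "S0": [random.uniform(1e5, 2e5) for _ in range(n)],
}
outputs = [toy_index(*row) for row in zip(*samples.values())]
sensitivities = prcc(samples, outputs)
for name, value in sensitivities.items():
    print(f"{name}: PRCC = {value:+.3f}")
```

Reading partial correlations from the inverse correlation matrix is equivalent to the usual residual-regression formulation of PRCC and keeps the sketch free of external dependencies. With the toy index, the three inputs in the numerator receive positive coefficients and the inactivation probability a negative one.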

Fig. 11

PRCC results and PRCC scatter plots of different parameters with index ℜe.

In addition, the final sizes and the maximum values of the reading and forwarding quantities are also of concern. The final size of cumulative reading and the peak of the reading quantity reflect users' contact. From Fig. 12 we can see that the parameter and the initial susceptible users S0 have a positive impact on both of these indexes. Similarly, the final size of cumulative forwarding users and the peak of the forwarding population reflect users' participation. From Fig. 12 we can see that two parameters have a positive impact, and a third has a small negative impact, on both of these indexes. An increase in the reading-related parameter leads to an increase in the reading quantity, and the two positively acting forwarding parameters take relatively large values in the stage of rapid increase in the forwarding quantity.

Fig. 12

PRCC results of different parameters with the final-size and maximum-value indexes.

Combined with the PRCC results, we take the topic # Real-time broadcast of joint prevention and control of epidemic situation # as an example to analyze the specific effects of parameters in our SRFI model. Fig. 13 depicts the effects of the parameter and the initial data on the R-population and the cumulative reading population. Both the parameter and the initial data have a positive effect on the number of reading users and the cumulative reading population. Comparatively speaking, the larger the parameter is, the earlier the reading peak appears and the shorter the outbreak duration; the greater the initial data is, the larger the reading peak value will be, without changing the outbreak duration. Furthermore, the initial data has a much more important influence on the outbreak behavior of topic reading related to COVID-19.

Fig. 13

Numerical experiments of the impact of parameters on the number of reading users and the cumulative reading population; panels (a) and (b) each vary one parameter.

Fig. 14 depicts the effects of two parameters on the F-population and the cumulative forwarding population. Both parameters have a positive effect on the number of forwarding users and the cumulative forwarding population. The larger one of them is, the greater the forwarding peak value and the final size of forwarding users will be, without changing the outbreak duration. By contrast, the other has a more subtle impact on forwarding related to COVID-19: the greater it is, the earlier the forwarding peak appears and the slower the outbreak velocity and the propagation decline velocity are.

Fig. 14

Numerical experiments of the impact of parameters on the number of forwarding users and the cumulative forwarding population; panels (a) and (b) each vary one parameter.

Thus, if we want to increase users' attention to topics about COVID-19, we can persuade opinion leaders who have a large number of fans to participate in the information propagation, which increases the average exposure rate and the initial number of susceptible users S0. Correspondingly, if we want to decrease users' attention to topics about COVID-19, we can motivate the platform to delete the relevant topics, which effectively reduces the reading quantity. In addition, if we want to increase users' participation in topics about COVID-19, we can increase the values of the two parameters examined in Fig. 14 by making content more innovative, so that people are attracted to forward the topics. These strategies are similar to those obtained from the sensitivity analysis of the opinion reproduction ratio ℜe.

Conclusions

In this paper, we proposed a comprehensive susceptible–reading–forwarding–immune (SRFI) dynamics model, based on the reading quantity and forwarding quantity in the Chinese Sina-microblog, to understand the contribution of both users' contact and participation behavior to information propagation about COVID-19. We discussed a particular mechanism of social networks: users in the reading state or the forwarding state may voluntarily re-enter the susceptible state and so have more chances to contact information, which promotes an active understanding of the epidemic. We analyzed the public opinion data of crucial moments of COVID-19 from January 17, 2020 to April 16, 2020 on the Chinese Sina-microblog and divided the development of events into stages according to the development of the disease outbreak. We performed numerical simulations on two typical COVID-19 topics, based on both cumulative reading and forwarding quantities, to verify the effectiveness of our model. Then, in each stage, we used a small amount of data for parameter estimation and used the parameterized model for trend prediction, which agreed well with the real data until the next event occurred. For the characteristic parameters, we completed a PRCC sensitivity analysis that offers some insights into the design of effective strategies. We hope this paper provides an efficient tool for predicting the direction of public opinion and stabilizing public emotions as COVID-19 continues to develop.

CRediT authorship contribution statement

Fulian Yin: Conceptualization, Methodology, Software, Project administration, Funding acquisition. Hongyu Pang: Formal analysis, Software, Data curation, Writing - original draft. Xinyu Xia: Validation, Visualization, Investigation, Supervision. Xueying Shao: Software, Validation. Jianhong Wu: Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
