Literature DB >> 33551542

COVID-19 information contact and participation analysis and dynamic prediction in the Chinese Sina-microblog.

Fulian Yin1, Hongyu Pang1, Xinyu Xia1, Xueying Shao1, Jianhong Wu2.   

Abstract

The outbreak of a novel coronavirus (COVID-19) aroused great public opinion in the Chinese Sina-microblog. To help in designing effective communication strategies during a major public health emergency, we analyze the real data of COVID-19 information and propose a comprehensive susceptible-reading-forwarding-immune (SRFI) model to understand the patterns of key information propagation considering both public contact and participation. We develop the SRFI model, based on the public reading quantity and forwarding quantity that denote contact and participation respectively, and take into account the behavior that users may re-enter another related topic during the attention phase or the participation phase freely. Data fitting using the real data of both reading quantity and forwarding quantity obtained from Chinese Sina-microblog can parameterize the model to make an accurate prediction of the COVID-19 public opinion trend until the next major news item occurs, and the sensitivity analysis provides the basic strategies for communication.
© 2021 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  COVID-19; Dynamic model; Reading and forwarding; Sina-microblog

Year:  2021        PMID: 33551542      PMCID: PMC7845521          DOI: 10.1016/j.physa.2021.125788

Source DB:  PubMed          Journal:  Physica A        ISSN: 0378-4371            Impact factor:   3.263


Introduction

Up to 24:00 on April 16, 2020, 2,352,198 cases have been affected by a novel coronavirus pneumonia (COVID-19) worldwide since it has been successively found in Wuhan. Major news items combined have generated quite strong fluctuations in public opinions. For example, on January 28, 2020, Nanshan Zhong, a well-known expert in infectious disease control, emphasized that people should not go out at present [1], this appeal attracted wide attention and warning that people said they could go out only if Nanshan Zhong admitted [2]. Sina-microblog is the most popular social network in China [3] and the outbreak-related topics about COVID-19 grew exponentially on that platform. As reading quantity and forwarding quantity of “Nanshan Zhong” have reached almost 9.40 billion and 2.05 million on Sina-microblog, understanding how these emerging public contact and participation spread in social media to alter the public behaviors is important to help to design effective communication strategies for rapid implementation of public health interventions. Fig. 1 shows the schematic diagram of COVID-19 information propagation in Sina-microblog, and the nodes represent users in different states. Many original post owners (green nodes) can publish single or multiple epidemic topics subordinate to different news. Take Headline News and Pear Video for example. While Headline News reports on multiple topics, and Pear Video focuses on a particular topic, and both of them can be read or forwarded by users attracted by both topics. Especially, reading users (blue nodes) can choose to be silent then leave and also can forward after reading becoming forwarding users (orange nodes). When users finish one topic, they may join others. And the relevant forwarding users can resume in the topics later and forward multiple messages becoming cross-forwarding users (pink nodes), leading to a multi-level information diffusion process. All users (readers and forwarding users) can choose to read or forward only one or multiple topics and therefore information propagates through one or multiple topics. This promotes the COVID-19 information dissemination rapidly.
Fig. 1

Information propagation considering both reading and forwarding on multiple topics of the COVID-19.

Information propagation considering both reading and forwarding on multiple topics of the COVID-19. To our best knowledge, there is no appropriate model framework that can be used to analyze public contact propagation and public participation propagation together during a major public health emergency. In consideration of the urgent need of developing theoretical knowledge and practical technologies to help effective communications of public health interventions, we propose a comprehensive susceptible–reading–forwarding–immune (SRFI) model based on both reading quantity and forwarding quantity that represent public contact and participation respectively to analyze the public opinion propagation of the COVID-19. In particular, we consider the characteristic user behavior that users may participate repeatedly in the reading or forwarding on different topics of the COVID-19 information dissemination. Traditionally, considering rumor is similar to epidemiology in several propagation ways, many scholars usedsusceptible–infected (SI) model [4], [5], susceptible–infected–recovered (SIR) model [6], [7], susceptible–exposed–infected–recovered (SEIR) model [8], [9] and susceptible–infected–susceptible (SIS) [10] model to represent rumor propagation and address relevant issues. Generally, information propagation in social networks was analyzed by stratifying users into three classes: heard rumor (ignorants), actively spreading rumor (spreaders) and no longer spreading rumor (stiflers) [11]. Then, scholars introduced new modules into classical models to understand the process of information dissemination better and then achieve various research purposes. In 2011, Zhao et al. [12] provided a more detailed and realistic description of the rumor spreading process with combination of forgetting mechanism and the SIR model of epidemics on an online social blogging platform called LiveJournal. In 2012, Zhao et al. [13] extended the classical SIR rumor spreading model by adding a direct link from ignorants to stiflers and a new kind of people—Hibernators in order to reduce the maximum rumor influence, which was called susceptible–infected–hibernator–removed (SIHR) model. In 2012, Xiong et al. [14] proposed a susceptible–contacted–infected–refractory (SCIR) diffusion model, which contained four possible states to characterize information propagation on online microblogs. In 2014, Zhao et al. [15] added the refutation mechanism in homogeneous social networks to the basic model using the Runge–Kutta method, which could help authorities reduce the maximum influence of the rumor. Zhang et al. [16] emphasized a special rumor spreading characteristic called “the cumulative effects of memory” and added the memory mechanism, meanwhile simulated the rumor spreading process on Sina-Microblog. Rui et al. [17] proposed a susceptible–potential–infective–removed (SPIR) model introducing a potential spreader set, which made the state-changing mechanism more reasonable and accurate for the diffusion process. In addition, Zhang et al. [18] used an improved SIR model to posit that a coupled network comprised two categories of nodes, then made use of the data collected from Weibo and WeChat of an actual news event to visualize the information spread process in the cross-network dissemination case of public opinion. Huang et al. [19] established a human dynamics model for deducing retweeting behavior and investigated it by gathering data through Sina API, which revealed that the distribution of the probability of a message to be browsed over time presented power-law characters. In 2018, Zan [20] studied the double rumors spreading with different launch times and introduced two kinds of models: double-susceptible–infected–recovered (DSIR) model and comprehensive-DSIR (C-DSIR) model, which focused on the interaction from old rumor to new rumor and the propagation of two rumors posted successively. And in 2019, we [21] proposed an epidemic model called susceptible–forwarding–immune (SFI) to capture a single information propagation trend in the Sina-microblog considering the forwarding quantity of users. Trpevski D et al. [22], Qian et al. [23], Wang et al. [24] also proposed many improved models for the spread of information. Especially Tanaka M et al. [25] added a new module to the traditional model using the datasets from the Japanese Mixi and Facebook. After summarizing a lot of literature, we find that most of the experimental data used for simulation are from well-known social platforms in the world, such as Twitter and Facebook. Furthermore, scholars from different countries may choose a local social platform in order to obtain data more convenient, especially Sina-microblog in China. By analyzing different data sources from other papers, we discover that existing studies have used forwarding quantity of one Weibo or multiple Weibos, the content of rumors and browsing behavior of users. To our best knowledge, they have not introduced the new module by combining with reading quantity and forwarding quantity of topics which are significant indicators to measure public contact and participation. The paper is organized as follows: in Section 2, we analyze the public opinion data of crucial moment on the COVID-19; in Section 3, we introduce mathematical model definition for information contact and participation, fit the model with the real data of two typical topics, make a staged prediction of the overall public opinion trend, then conduct a parameter sensitivity analysis and give effective intervention strategies about the information contact and participation; and in Section 4, we draw a conclusion.

COVID-19 information contact and participation analysis

Information reading and forwarding topology

The urgency is often accompanied with a number of topics related to the COVID-19 and the behavior of reading and forwarding usually prompts the propagation of the event. Reading is a kind of behavior that reflects users’ contact in information propagation and forwarding is a kind of behavior that reflects users’ participation in information propagation. In order to more clearly show the propagation process of both public contact and participation, the network topology is shown in Fig. 2 which describes the state of each node in the network at a certain moment in the process of information dissemination. Taking the propagation of three original post owners under three topics as an example, the user’s overall state of integrated information propagation is given.
Fig. 2

Network topology for COVID-19 propagation by reading and forwarding.

The information posted by the original post owners (red nodes), and can be read separately by single readers with interest (blue nodes) in which readers can then participate in the forwarding about the outbreaks (black nodes) or choose to be silent (yellow nodes). Especially, some co-spreaders between different topics repeatedly read and forward the related information successively then become the cross-reading users (pink nodes) and cross-forwarding users (green nodes) because of the correlation about the epidemic. Of course, there will also be many readers who contact information choose to be silent, such as the un-forwarding users (yellow nodes). In the real-world, the number of information is dynamically changing and cannot be clearly calculated. The reading quantity and forwarding quantity of the whole COVID-19 are composed of many topics with multiple information. Different from the traditional public hot events, the outbreak is causing great public concern. With the continuous development of the COVID-19, there is a high level of repetition in public reading and forwarding on different topics. As reading is a measure of contact and forwarding is a measure of participation for information dissemination, in this paper, we build the comprehensive susceptible–reading–forwarding–immune (SRFI) dynamics model with considering the repeated behavior including “re-reading” and “re-forwarding” on the impact of public opinion propagation. Network topology for COVID-19 propagation by reading and forwarding.

Information contact and participation analysis

Since the outbreak of COVID-19, around 7900 major topics appeared in Sina-microblog. Fig. 3 shows the cumulative reading quantity and forwarding quantity from December 31, 2019 to April 16, 2020, where the ordinate is the logarithm of the cumulative reading and forwarding quantities. It can be roughly seen that during the period from December 31, 2019 to January 16, 2020, there was only a certain amount of hot topics of COVID-19, however, from January 17, 2020, the hot topics about the epidemic gradually increased, especially after Nanshan Zhong confirmed that the COVID-19 could be transmitted from human to human on January 20, 2020, both the two quantities kept increasing dramatically, and then we will analyze the contact and participation data of COVID-19 information from January 17, 2020 to April 16, 2020 and Table 1 gives the specific values of the cumulative reading quantity and forwarding quantity.
Fig. 3

The reading quantity and forwarding quantity about COVID-19: (a) the reading quantity; (b) the forwarding quantity.

Table 1

The cumulative reading quantity and forwarding quantity of COVID-19.

Date2020.1.172020.1.182020.1.192020.1.202020.1.212020.1.222020.1.23
CR(*105)38 63342 91563 018125 739253 288476 434779 867
CF225 554264 641434 1641 383 9533 390 5737 639 88723 811 193

Date2020.1.242020.1.252020.1.262020.1.272020.1.282020.1.292020.1.30

CR(*105)1 009 1541 239 0811 482 4861 702 6991 885 6142 062 6132 235 681
CF31 929 84637 850 19043 424 98347 026 57950 333 08854 259 76057 741 525

Date2020.1.312020.2.12020.2.22020.2.32020.2.42020.2.52020.2.6

CR(*105)2 404 6282 615 9892 816 4952 995 9673 182 9093 375 8303 544 372
CF62 205 93569 421 57775 392 45278 614 45684 683 98690 049 55999 153 988

Date2020.2.72020.2.72020.2.92020.2.102020.2.112020.2.122020.2.13

CR(*105)3 797 1593 957 6904 168 6654 337 4764 539 0714 690 2814 852 670
CF111 769 392116 979 114120 813 833125 010 556129 008 667133 206 593138 580 808

Date2020.2.142020.2.152020.2.162020.2.172020.2.182020.2.192020.2.20

CR(*105)5 028 1835 200 9055 355 6165 507 1745 660 8355 800 6655 934 350
CF143 356 722147 554 717151 821 458155 138 314158 447 426161 128 288163 301 415

Date2020.2.212020.2.222020.2.232020.2.242020.2.252020.2.262020.2.27

CR(*105)6 139 0636 299 0686 465 8416 619 7626 757 1666 885 3427 003 691
CF165 502 316167 645 359170 198 613173 068 348176 231 509178 567 007180 802 728

Date2020.2.282020.2.292020.3.12020.3.22020.3.32020.3.42020.3.5

CR(*105)7 130 9397 246 7487 351 7527 447 5847 534 1997 638 8537 726 308
CF183 003 765184 900 668186 966 898188 327 150189 521 175190 727 593191 892 477

Date2020.3.62020.3.72020.3.82020.3.92020.3.102020.3.112020.3.12

CR(*105)7 798 8597 866 0217 916 3567 977 1648 050 7968 140 7818 286 361
CF192 894 893193 762 501194 739 304195 609 360196 481 124197 673 456199 077 665

Date2020.3.132020.3.142020.3.152020.3.162020.3.172020.3.182020.3.19

CR(*105)8 486 8848 606 9308 741 9848 881 4569 025 9909 159 0219 294 382
CF200 311 755201 747 067203 341 571205 021 835213 788 905219 382 758222 669 494

Date2020.3.202020.3.212020.3.222020.3.232020.3.242020.3.252020.3.26

CR(*105)9 454 9369 613 8589 714 2049 788 6959 846 2759 940 63010 024 169
CF225 005 650227 086 475228 026 043228 598 328229 318 397230 114 834230 719 885

Date2020.3.272020.3.282020.3.292020.3.302020.3.312020.4.12020.4.2

CR(*105)10 116 51810 193 22010 274 61010 401 70210 509 97010 613 59410 719 037
CF231 501 263232 111 489232 756 584233 655 217234 849 798235 876 483236 758 976

Date2020.4.32020.4.42020.4.52020.4.62020.4.72020.4.82020.4.9

CR(*105)10 807 26911 026 87711 138 82111 231 16211 333 26111 422 90211 503 056
CF237 811 162252 986 547255 099 056256 111 309257 005 795258 764 094259 979 183

Date2020.4.102020.4.112020.4.122020.4.132020.4.142020.4.152020.4.16

CR(*105)11 562 23711 617 70111 657 36011 699 56511 743 95411 806 84811 866 802
CF260 392 077260 727 437260 984 402261 217 740261 480 292261 836 224262 202 217
The reading quantity and forwarding quantity about COVID-19: (a) the reading quantity; (b) the forwarding quantity. Fig. 4 shows the box diagram of reading quantity and forwarding quantity from January 17, 2020 to April 16, 2020, which reflect the user’s contact and participation in information, respectively. Although in terms of the order of magnitude, the reading quantity is larger than the forwarding quantity and its range is relatively wide, in particular, the reading quantity and the forwarding quantity vary within about 3 × 1010 and 7 × 106 respectively, except for the outlier forwarding quantity caused by several emergencies, these two propagation trends are consistent, both in line with the laws of dynamics development. Besides, there are also different features in reading quantity and forwarding quantity such as the median, so it is necessary to build a model combining these two attributes.
Fig. 4

The box diagram of the number of reading and forwarding.

The cumulative reading quantity and forwarding quantity of COVID-19. The box diagram of the number of reading and forwarding. In order to analyze the public opinion trend more clearly, Fig. 5 gives the reading quantity, forwarding quantity and the number of topics together with line charts and histograms from January 17, 2020 to April 16, 2020. We can see that since January 17, 2020, the number of reading and forwarding of COVID-19 has gradually increased, and the emergence of some important events or hot topics has a major impact on public opinion and the overall development can be divided into four stages so far based on this. The first stage, from January 17, 2020 to February 19, 2020, which both two quantities increased gradually, is the outbreak stage. In the beginning, topic # 5 rumors of viral pneumonia in Wuhan # attracted public attention, then Wuhan began its closure on January 23, 2020, with the emergence of topics such as # Wuhan Bus Metro Suspension of Operation #, the reading and forwarding quantities of COVID-19 began to increase dramatically and reached the highest value and the number of topics began to gradually increase leading public opinion enter the outbreak stage. Besides, topic #Doctor Wenliang Li passed away# posted on February 7, 2020 and #Easy epidemic prevention station# posted on February 19, 2020 have aroused great attention from people, causing both two quantities to reach the extreme again and public opinion continues to ferment. The second stage is from February 19, 2020 to March 16, 2020, in which both two quantities start to decrease with the stabilization and improvement of the domestic cases, and the overall number of topics increased and stabilized in a certain range in the later period. The third stage is another outbreak stage from March 16, 2020 to April 3, 2020, the overall trend is on the rise as the disease becomes more serious abroad, and the number of topics has risen again. With the release of #support Hubei Medical Team to evacuate # on March 17, 2020 and other topics related to COVID-19 of the global, both two quantities increased dramatically again, and public opinion entered into another outbreak stage. The fourth stage, from April 3, 2020 to April 16, 2020 is the occasional fluctuation stage. April 4, 2020 is Tomb-Sweeping Day of China, some topics for mourning the anti-epidemic heroes began to increase such as # Three minutes of silence in all China # which caused a lot of users to read and forward and led to some occasional fluctuations in the reading quantity and forwarding quantity. The division of the public opinion of COVID-19 helps us understand the trend caused by the entire epidemic hot topics.
Fig. 5

The trends chart of the topics about COVID-19, where the blue line represents the number of reading and the red line represents the number of forwarding respectively, and the histograms represent the number of topics per day.

The trends chart of the topics about COVID-19, where the blue line represents the number of reading and the red line represents the number of forwarding respectively, and the histograms represent the number of topics per day.

COVID-19 information contact and participation prediction

Susceptible–reading–forwarding–immune model (SRFI)

The propagation dynamics model based on the reading quantity and forwarding quantity of COVID-19 constructed in this paper is shown in Fig. 6. Here, we only consider the accessible population in the process of information propagation and pay attention to both the information diffusion caused by users’ reading behavior and forwarding behavior. Assuming that the number of users () who can contact information in the process of propagation on Sina-microblog remains unchanged, we stratify the population into four states: the susceptible state (), in which the users unaware of but susceptible to the information of the event; the reading state (), in which the users have read information and are susceptible to forward it; the forwarding state (), in which the users have forwarded the information actively to influence other users; and the immune state (), in which the users have read or forwarded the information, but are no longer read or forward the information even if receive them again.
Fig. 6

A schematic diagram to illustrate information spreading considering both reading and forwarding in the population with four different states: susceptible (), reading (), forwarding () and immune ().

A susceptible user can read one information with an average exposure rate and a user in reading state will leave and become other states with a deactivation rate . The forwarding users can become immune users who are inactive to the event with an average inactive rate , with being the average duration where an F-user remains active in being contacted. A schematic diagram to illustrate information spreading considering both reading and forwarding in the population with four different states: susceptible (), reading (), forwarding () and immune (). The core of our model is to study the role of both repeated reading and forwarding because users exposure to different information about COVID-19. Hence, with forwarding probability from reading users to forwarding users and immunity probability denotes those who keep silent in the event and go straight to the immune state, we use to represent the “re-reading” probability for a reading user returns to the susceptible state. Besides, we use parameter to describe the “re-forwarding” probability for a forwarding user who can return a new round of susceptible state of COVID-19. In particular, each user may have a unique state, that is, at the same time, each user can be only one of the susceptible, reading, forwarding or immune states. We obtain the following SRFI dynamics model: where ’ is the derivative with respect to . The behavior transformation and state transition of the masses can also be interpreted as follow: Reading: Since an active forwarding user will contact an average number of users per unit time, the probability of a normal user is a susceptible user is , and there are active forwarding users in total, then the number of new reading users is . Forwarding: Some reading users will inactive with the deactivation rate and participate in the forwarding state with the forwarding probability , the number of new forwarding users is . Re-enter: As events unfold, there are two ways to generate repeated behavior and initiate a new round of reading and forwarding: users who contact one information and yearn for getting more information about COVID-19 will re-enter to the susceptible state from reading state, the average number of re-entered users is per unit time; users who have forwarded one information and be interested in related information about COVID-19 will re-enter to susceptible from forwarding state, the average number of the re-entered users is per unit time. Immune: Some reading users will not participate in the forwarding and become the immune users directly because they want to keep silent in the event, and the number of direct immune users is . And some forwarding users will go to the immune state out of active time, the number of inactive users is . The Sina-microblog provides the number of cumulative reading population and forwarding population which is the total times of reading and forwarding within a topic about COVID-19, and we calculate the sum of the whole event, given by The corresponding differential equation can be expressed as: Considering the initial condition: , , and . The final condition from Eqs. (4)–(6), it follows that , and are all increasing since , and . Therefore, the final states are , , , and are tending to 0 , and . Here and are the final size of the COVID-19 reading and forwarding. In addition, the number of maximal reading users and maximal forwarding users are and , respectively. We define the reproduction ratio to measure whether the outbreak forwarding quantity was likely to break out. In the initial post of the COVID-19, the forwarding outbreak is given by , and the population will never take off since due to the decreasing of . Then we deduce Public opinion reproduction ratio : The reproduction ratio is defined to measure whether public opinion was likely to break out. We use the calculation method of basic reproduction number developed in [26], and rewrite our model as follows: where and Calculate the derivatives and at no information propagation equilibrium , we can obtain and The roots of the characteristic equation can deduce the eigenvalues of the matrix : Because is not negative, we have Here, based on the extension of , we define the (effective) reproduction number to describe the outbreak of public opinion at each time t, and it has more practical significance in the dynamic development process. Then we have The represents the propagation capability of each period and it is time-varying, which is determined by the average exposures rate , the average inactive rate , the forwarding probability , and the susceptible users . When , it means that the comprehensive public opinion will decline which implies the propagation can never break out. The indicates that the comprehensive public opinion grows exponentially initially.

Data fitting

Parameter estimation method: To use our SRFI model to explore some distinctions of qualitative behaviors for prediction, we use the LS method to estimate the model parameters and the initial data of our SRFI model. The parameter vector can be set as , and the corresponding numerical calculation based on the parameter vectors for and are denoted by and , respectively. The LS error function is used in our calculation, where and denote the actual cumulative reading quantity and forwarding quantity, and 0, 1, 2, …is the sampling time. In our paper, we use DEDiscover software to solve this LS problem. Data description: In order to analyze the public opinions with different characteristics, the following two typical events are selected from the whole public opinion outbreak duration. Table 2 shows the reading quantity and forwarding quantity of # Refuse to eat wild animals # with a slow outbreak. And Table 3 shows the reading quantity and forwarding quantity of # Real-time broadcast of joint prevention and control of epidemic situation # with a fast outbreak.
Table 2

The cumulative reading quantity and forwarding quantity of # Refuse to eat wild animals #.

Data2020.2.292020.3.12020.3.22020.3.32020.3.42020.3.52020.3.62020.3.7
CR(*105)14192495047128559921142
CF12703780966633 65751 13864 39871 00889 619

Data2020.3.82020.3.92020.3.102020.3.112020.3.122020.3.132020.3.142020.3.15

CR(*105)12701368144314861508151415231536
CF106 938109 223109 589110 431110 888112 583112 861113 140

Data2020.3.162020.3.172020.3.182020.3.192020.3.202020.3.212020.3.22

CR(*105)1549155115511551155115511551
CF113 640113 778114 011114 117114 170114 247114 281
Table 3

The cumulative reading quantity and forwarding quantity of # Real-time broadcast of joint prevention and control of epidemic situation #.

Data2020.1.272020.1.282020.1.292020.1.302020.1.312020.2.12020.2.22020.2.3
CR(*105)2361844814 14619 89723 83926 61630 33130 799
CF61 134119 339181 997240 858263 552284 178314 132321 897

Data2020.2.42020.2.52020.2.62020.2.72020.2.82020.2.92020.2.102020.2.11

CR(*105)31 83832 26132 42432 61933 45133 55133 57933 603
CF334 686341 232343 963347 308360 530362 177363 453364 203

Data2020.2.122020.2.132020.2.142020.2.152020.2.162020.2.172020.2.182020.2.19

CR(*105)33 65633 72133 75233 76733 78733 81233 85433 870
CF364 682365 318365 673366 002366 323366 652367 272367 698

Data2020.2.202020.2.212020.2.222020.2.232020.2.242020.2.252020.2.262020.2.27

CR(*105)33 90833 96133 98434 00034 02134 03934 05434 069
CF368 275368 875369 261369 629370 098370 585370 899371 286

Data2020.2.282020.2.292020.3.12020.3.2

CR(*105)34 08134 08134 08134 081
CF371 720372 086372 420372 593
Data fitting results: The cumulative reading quantity and forwarding quantity of # Refuse to eat wild animals #. The cumulative reading quantity and forwarding quantity of # Real-time broadcast of joint prevention and control of epidemic situation #. As shown in Fig. 7, we performed data fitting on the real data of # Refuse to eat wild animals# in Table 2, where the red star and blue star denotes the actual cumulative number of reading and forwarding population, respectively, the red line and blue line denotes the estimated cumulative number of reading and forwarding population, respectively. It can be seen that the initial outbreak of the topic is slow and our SRFI model achieves accurate estimation.
Fig. 7

The data fitting results of # Refuse to eat wild animals #.

Table 4 gives some important values of early period parameter estimation of # Refuse to eat wild animals #. We can see relative to the topic # Real-time broadcast of joint prevention and control of epidemic situation # the initial susceptible users is smaller meaning that there are fewer users are susceptible at the beginning of the information explosion, leading the topic outbreak slowly. Besides, the parameter is larger, which means the active period of forwarding users is short so that they cannot influence other susceptible people for a long time, leading a smaller final size of cumulative reading and forwarding quantities.
Table 4

Some important values of parameter estimation of # Refuse to eat wild animals #.

NameEstimated valueStandard errorCI low boundCI high boundp-valuet-statisticMinMax
S04.2710×10452.79366.0805×1056.0826×1061.5212×102181.1520×1041.0000×1041.0000×104
α0.86110.02640.34840.45359.8689×102415.21260.00002.0000
β0.14500.00290.00570.01721.8223×1043.95620.00001.0000
γ0.97820.02620.94291.04755.7330×104837.95790.00001.0000
p7.9000×1048.3598×1051.2329×1044.5684×1049.0251×1043.46970.00001.0000
q0.25800.01170.08650.13325.8565×10149.38970.00001.0000
θ0.97810.03220.85780.98614.7696×10428.65910.00001.0000
As shown in Fig. 8, we performed data fitting on the real data of # Real-time broadcast of joint prevention and control of epidemic situation # in Table 3, where the red star and blue star denotes the actual cumulative number of reading and forwarding population, respectively, the red line and blue line denotes the estimated cumulative number of reading and forwarding population, respectively. It can be seen that the initial outbreak of topic # Real-time broadcast of joint prevention and control of epidemic situation # is fast, which entered a stage of rapid explosion directly at the beginning and our SRFI model achieves accurate estimation.
Fig. 8

The data fitting results of # Real-time broadcast of joint prevention and control of epidemic situation #.

The data fitting results of # Refuse to eat wild animals #. Some important values of parameter estimation of # Refuse to eat wild animals #. Table 5 gives some important values of early period parameter estimation of # Real-time broadcast of joint prevention and control of epidemic situation #. We can see the initial susceptible users is larger meaning that more users become susceptible at the beginning of the information explosion, leading the topic outbreaks rapidly. In addition, the parameter is smaller, indicating more users will remain active and affect other susceptible users, which leads it quickly increases to a larger final size of cumulative reading and forwarding quantities.
Table 5

Some important values of parameter estimation of # Real-time broadcast of joint prevention and control of epidemic situation #.

NameEstimated valueStandard errorCI low boundCI high boundp-valuet-statisticMinMax
S02.1879×10629.45632.1878×1062.1879×1062.0350×102597.4275×1041.0000×1041.0000×107
α0.19590.02570.14460.24721.3350×10107.62830.00004.0000
β0.00462.2555×1040.00420.00514.9300×103020.46410.00001.0000
γ0.81410.01590.78220.84593.3164×105451.10670.00001.0000
p9.0233×1058.4946×1067.3269×1051.07207.6349×101610.62250.00001.0000
q0.70860.02280.66300.75411.0095×104031.07140.00001.0000
θ0.73630.01570.70490.76789.0737×105246.77120.00001.0000
The data fitting results of # Real-time broadcast of joint prevention and control of epidemic situation #. Some important values of parameter estimation of # Real-time broadcast of joint prevention and control of epidemic situation #.

COVID-19 information contact and participation prediction

We have divided the main development of COVID-19 information dissemination into four stages according to the analysis of information contact and participation from January 17, 2020, to April 16, 2020. Although we cannot control the occurrence of emergency incidents and the dissemination of information, it is very important that, in each stage, we can predict the trend of public opinion based on the existing data before the emergency comes. Fig. 9 shows the prediction of reading and forwarding quantities of COVID-19 and Table 6 gives the estimated parameters with DEDiscover software.
Fig. 9

The prediction of reading and forwarding quantities of COVID-19.

Table 6

Parameter results.

βαγpqθS0
Fig. 6(a1)0.00490.00180.35536.0042×1040.03070.34376.1797×105
Fig. 6(a2)0.00880.03100.48412.9000×1040.03920.51915.2450×105
Fig. 6(b1)0.01520.93060.04507.9350×1050.32300.43702.7494×105
Fig. 6(b2)0.01501.22500.03818.6383×1050.28310.56733.4533×105
Fig. 6(c1)0.03600.71400.02651.0531×1040.23300.10504.5804×104
Fig. 6(c2)0.00990.70560.01401.5568×1042.7685×1040.00431.0001×105
Fig. 6(d1)0.02802.04010.10416.0000×1050.81910.32301.1055×105
Fig. 6(d2)0.04500.43040.01102.4700×1040.40400.63601.9527×105
At the first stage, topic #5 rumors of viral pneumonia in Wuhan# attracted public attention and the cumulative reading and forwarding quantities of COVID-19 began to increase. We predict the public opinion trend of COVID-19 with the data from January 17, 2020 to January 25, 2020 and it achieves good data fitting with the actual data until January 28, 2020, as shown in Fig. 9(a1). Thus, we extend the data to January 31, 2020 and predict again, fortunately, we get a satisfactory result until the end of this stage, as shown in Fig. 9(a2). It can be seen from the forecast curve of predicted reading population and forwarding population that both the public contact and participation are increasing at the beginning of the epidemic. The prediction of reading and forwarding quantities of COVID-19. Parameter results. With the gradual stabilization of COVID-19 in China, the cumulative reading and forwarding quantities began to grow slowly and the public opinion enters the second stage. We use the data between February 19, 2020 and February 25, 2020 to estimate the parameters and predict the trend of public opinion of the next days, as shown in Fig. 9(b1). Besides, we extend the data until February 26, 2020 to predict the rest of the public opinion trend in this phase and we have achieved very good prediction results, as shown in Fig. 9(b2). At this stage, the predicted reading and forwarding population have a downward trend, which means the public contact and participation are close to saturation. Unexpectedly, the second stage ended since the seriousness of overseas COVID-19 and public opinion enters the third stage. We use the data between March 16, 2020 and March 22, 2020 to estimate the parameters and predict the trend of public opinion, as shown in Fig. 9(c1). In addition, we extend the data until March 23, 2020 to predict the rest of the public opinion trend in this phase, as shown in Fig. 9(c2). At this stage, the predicted reading and forwarding population have a big value at the starting point and then it starts to fall. With the decrease in reading and forwarding, we can infer that the dramatic outbreak of public opinion only appears at the beginning of this stage. Then public opinion entered the fourth stage due to occasional fluctuations in the reading quantity and forwarding quantity, we use the data between April 3, 2020 and April 9, 2020 to realize a good prediction for the next three days, as shown in Fig. 9(d1). Then we extend the data until April 10, 2020 and have a good prediction until the end of this stage, as shown in Fig. 9(d2). At the beginning of this stage, the predicted reading and forwarding population have a maximum value and show a downward trend. With the decrease in reading and forwarding, we can give the conclusion that the dramatic outbreak of public opinion only appears at the beginning of this stage. Table 7 gives the results of the predicted effective reproduction ratio at each time using the estimation results of the last time. We can see the reading quantity, forwarding quantity and per day together from January 17, 2020 to April 16, 2020 more clearly, and the three curves have similar trends at each stage as shown in Fig. 10. Our SRFI model predicts it has the greatest reproduction ratio 6.5656 and breaks out quickly in the early stage of COVID-19. With the development of the epidemic, starts to be less than 1 in the second stage, meaning that public opinion is gradually calming down. Then it started to increase again at the beginning of the third stage, but it is still less than 1, indicating that although reading and forwarding users have increased at first, public opinion will not continue to erupt. With the occurrence of emergency suddenly rises greater than 1 which indicates that in the future, under the trend of overall stability, the information on COVID-19 will continue to wave burst with the occurrence of emergencies.
Table 7

The results of predicted public opinion reproduction ratio .

Data2020.1.172020.1.182020.1.192020.1.202020.1.212020.1.222020.1.23
e6.56566.54446.39936.12555.69595.10754.4263
Data2020.1.242020.1.252020.1.262020.1.272020.1.282020.1.292020.1.30
e3.77583.25542.87792.60112.39182.22622.0905

Data2020.1.312020.2.12020.2.22020.2.32020.2.42020.2.52020.2.6
e1.97761.88191.79941.72731.66351.60661.5557

Data2020.2.72020.2.82020.2.92020.2.102020.2.112020.2.122020.2.13
e1.51011.46861.431.39561.3631.33371.3058

Data2020.2.142020.2.152020.2.162020.2.172020.2.182020.2.192020.2.20
e1.281.25631.23391.21261.19260.60440.1452

Data2020.2.212020.2.222020.2.232020.2.242020.2.252020.2.262020.2.27
e0.23360.33840.44090.52840.59730.650.6902

Data2020.2.282020.2.292020.3.12020.3.22020.3.32020.3.42020.3.5
e0.72090.74470.76330.77810.78980.79930.8069

Data2020.3.62020.3.72020.3.82020.3.92020.3.102020.3.112020.3.12
e0.81310.81810.82230.82570.82850.83090.8328

Data2020.3.132020.3.142020.3.152020.3.162020.3.172020.3.182020.3.19
e0.83440.83570.83680.97040.96170.95060.9361

Data2020.3.202020.3.212020.3.222020.3.232020.3.242020.3.252020.3.26
e0.91730.89310.86200.82230.77260.71140.6385

Data2020.3.272020.3.282020.3.292020.3.302020.3.312020.4.12020.4.2
e0.55530.46560.37510.29070.21780.15910.4602

Data2020.4.32020.4.42020.4.52020.4.62020.4.72020.4.82020.4.9
e2.24550.1620.19760.23970.28840.3430.402

Data2020.4.102020.4.112020.4.122020.4.132020.4.142020.4.152020.4.16
e0.46280.52230.5770.62460.66360.69410.7167
Fig. 10

The prediction of reading and forwarding quantities of COVID-19.

The prediction of reading and forwarding quantities of COVID-19. The results of predicted public opinion reproduction ratio .

COVID-19 information contact and participation sensitivity analysis and intervention strategies

To further analyze the different parameters responsible for the comprehensive SRFI model, we use the partial rank correlation coefficients (PRCCs) [27] based on 1000 samples for various input parameters against the threshold condition to evaluate the sensitivity. According to the histogram and scatter diagram of dependence, when the correlation is positive, it means that with the increase of the value of the parameter, the value of corresponding index will increase. On the contrary, when the correlation is negative, the index will decrease as the parameter decreases. Fig. 11 shows that the values of the public opinion reproduction ratio is strongly positively affected by parameters , and the initial susceptible users , and negatively affected by parameter . This result confirms the correctness of our derivation of in Formula (15). Therefore, the average exposure rate , the forwarding probability , the probability of inactivation after forwarding , and the initial value are the key factors in determining the event outbreak. Thus, if we want to make public opinion explode, such as positive topics of the COVID-19, we can achieve it by increasing the value of , , and decreasing the value of . Since the parameter is the average exposure rate for a user to contact the information and the is the initial value of the susceptible population, we can increase these two values by persuading some opinion leaders to participate in the information propagation. Since each opinion leader will motivate a large number of new susceptible users, it can effectively increase the value of and . Besides, we can make the content richer and more interesting to attract people to participate in the forwarding of topics and keep users active in the forwarding for a longer time to increase the value of parameter which is influenced by users’ interest in topics and decrease the value of . Correspondingly, if we do not want public opinion to erupt, such as rumor topics of the COVID-19, we need to reduce the value of parameter and . Thus, we can motivate the platform to delete the relevant topics of the rumors about the COVID-19, which can effectively reduce the new reading and curb the outbreak of public opinion.
Fig. 11

PRCC results and PRCC scatter plots with index of different parameters.

In addition, the final size , and the maximum value and are also our concern. and denote the final size of cumulative reading and the high peak of reading quantity which can reflect the users’ contact. From Fig. 12 we can see that the parameter and the initial susceptible users have a positive impact on both two indexes. Similarly, and denote the final size of cumulative forwarding users and the high peak of the forwarding population which can reflect the users’ participation. From Fig. 12 we can see that the parameter and have a positive impact and has a small negative impact on both two indexes. The increase in parameter leads to an increase in reading quantity, and in the stage of a rapid increase in forwarding quantity, parameter and have a relatively large value.
Fig. 12

PRCC results with indexes , and of different parameters.

PRCC results and PRCC scatter plots with index of different parameters. PRCC results with indexes , and of different parameters. Combined with PRCC results, here we take topic # Real-time broadcast of joint prevention and control of epidemic situation # as an example to analyze the specific effects of parameters in our SRFI model. Fig. 13 depicts the effects of parameter and initial data on R-population and respectively. It shows that both the parameter and initial data have a positive effect on the number of reading users () and cumulative reading population (). Comparatively speaking, the larger the parameter is, the earlier the reading peak appears and the shorter the outbreak duration. And the greater the initial data is, the larger the reading peak value will be without changing the outbreak duration is. Furthermore, it shows that the initial data has a much important influence on the outbreaking behaviors of the topic reading related to COVID-19.
Fig. 13

Numerical experiments of the impact of parameters on , : (a) parameter ; (b) parameter .

Fig. 14 depicts the effects of parameters and on F-population and respectively. It shows that both the parameters and have a positive effect on the number of forwarding users () and cumulative forwarding population (). The larger the parameter is, the greater the forwarding peak value and the final size of forwarding users will be without changing the outbreak duration. By contrast, the parameter has a more subtle impact on forwarding related to COVID-19, the greater the parameter is, the earlier the forwarding peak appears and the slower the outbreak velocity and the propagation decline velocity are.
Fig. 14

Numerical experiments of the impact of parameters on , : (a) parameter ; (b) parameter .

Numerical experiments of the impact of parameters on , : (a) parameter ; (b) parameter . Thus, if we want to increase users’ attention on topics of the COVID-19, we can achieve it by persuading some opinion leaders who have a large number of fans to participate in the information propagation to increase the value of and . Correspondingly, if we want to decrease users’ attention on topics of the COVID-19, we can motivate the platform to delete the relevant topics, which can effectively reduce the reading quantity. In addition, if we want to increase users’ participation in topics of the COVID-19, we can achieve it through increasing the value of and by making content more innovative to attract people to participate in the forwarding of topics. Here, we have similar strategies as the sensitivity analysis of opinion reproduction ratio . Numerical experiments of the impact of parameters on , : (a) parameter ; (b) parameter .

Conclusions

In this paper, we proposed a comprehensive susceptible–reading–forwarding–immune (SRFI) dynamics model based on the reading quantity and forwarding quantity in Chinese Sina-Microblog to understand both the contribution of users’ contact and participation behavior to information propagation about the COVID-19. The particular feature mechanism of social network that users may re-enter to the susceptible state to have more chance to contact information from reading state or from forwarding state subjectively is discussed, which prompt an active understanding of the epidemic. We analyzed the public opinion data of crucial moment about the COVID-19 from January 17, 2020 to April 16, 2020 on Chinese Sina-microblog and stratified events development into different stages according to the disease outbreak development. We performed the numerical simulation on two typical topics about the COVID-19 based on both cumulative reading and forwarding quantities to verify the effectiveness of our model. Then in each stage, we used a small amount of data for parameter estimation and then used the parameterized model for trend prediction which agreed with both the real data well until the next event occurred. For characteristic parameters, a PRCC sensitivity analysis was completed that provides some perceptions in design of some effective strategies. We hope this paper could provide a tool efficiently for predicting the direction of public opinion and stabilizing public emotions with on-going COVID-19 development.

CRediT authorship contribution statement

Fulian Yin: Conceptualization, Methodology, Software, Project administration, Funding acquisition. Hongyu Pang: Formal analysis, Software, Data curation, Writing - original draft. Xinyu Xia: Validation, Visualization, Investigation, Supervision. Xueying Shao: Software, Validation. Jianhong Wu: Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  6 in total

1.  EPIDEMICS AND RUMOURS.

Authors:  D J DALEY; D G KENDALL
Journal:  Nature       Date:  1964-12-12       Impact factor: 49.962

2.  A susceptible-infected epidemic model with voluntary vaccinations.

Authors:  Frederick H Chen
Journal:  J Math Biol       Date:  2006-06-07       Impact factor: 2.259

3.  Nearcasting forwarding behaviors and information propagation in Chinese Sina-Microblog.

Authors:  Fu Lian Yin; Xue Ying Shao; Jian Hong Wu
Journal:  Math Biosci Eng       Date:  2019-06-11       Impact factor: 2.080

4.  Global stability for the SEIR model in epidemiology.

Authors:  M Y Li; J S Muldowney
Journal:  Math Biosci       Date:  1995-02       Impact factor: 2.144

5.  Occurrence of the potent mutagens 2- nitrobenzanthrone and 3-nitrobenzanthrone in fine airborne particles.

Authors:  Aldenor G Santos; Gisele O da Rocha; Jailson B de Andrade
Journal:  Sci Rep       Date:  2019-01-09       Impact factor: 4.379

6.  Global analysis of an epidemic model with nonmonotone incidence rate.

Authors:  Dongmei Xiao; Shigui Ruan
Journal:  Math Biosci       Date:  2006-12-12       Impact factor: 2.144

  6 in total
  1 in total

1.  Public sentiments toward COVID-19 vaccines in South African cities: An analysis of Twitter posts.

Authors:  Blessing Ogbuokiri; Ali Ahmadi; Nicola Luigi Bragazzi; Zahra Movahedi Nia; Bruce Mellado; Jianhong Wu; James Orbinski; Ali Asgary; Jude Kong
Journal:  Front Public Health       Date:  2022-08-12
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.