| Literature DB >> 33519039 |
Wensen Huang1, Bolin Cao1, Guang Yang1, Ningzheng Luo2, Naipeng Chao1.
Abstract
The surveillance and forecast of newly confirmed cases are important to mobilize medical resources and facilitate policymaking during a public health emergency. Digital surveillance using data available online has increasingly become a trend with the advancement of the Internet. In this study, we assessed the predictive value of multiple online medical behavioral data, including online medical consultation (OMC), online medical appointment (OMA), and online medical search (OMS) for the regional outbreak of coronavirus disease 2019 in Shenzhen, China during January 1, 2020 to March 5, 2020. Multivariate vector autoregression models were used for the prediction. The results identified a novel predictor, OMC, which can forecast the disease trend up to 2 days ahead of the official reports of confirmed cases from the local health department. OMS data had relatively weaker predictive power than OMC in our model, and OMA data failed to predict the confirmed cases. This study highlights the importance of OMC data and has implication in providing evidence-based guidelines for local authorities to evaluate risks and allocate resources during the pandemic.Entities:
Keywords: COVID-19; eHealth; online medical consultation; pandemic; telehealth
Year: 2020 PMID: 33519039 PMCID: PMC7836698 DOI: 10.1016/j.ipm.2020.102486
Source DB: PubMed Journal: Inf Process Manag ISSN: 0306-4573 Impact factor: 6.222
Fig. 1Daily counts of new confirmed COVID-19 cases in Shenzhen, China (2020.1.19–2020.2.16).
Fig. 3Time plots of NCC, OMC, OMA, and OMS (2020.1.1–2020.3.5)
Fig. 2Data collection process for the three types of online medical behaviors
Keywords list used to collect data for OMC and OMS
| Chinese | English | |
|---|---|---|
| 1 | 发热 | Fever |
| 2 | 咳嗽 | Cough |
| 3 | 咽痛 | Pharyngalgia |
| 4 | 胸闷 | Chest tightness |
| 5 | 呼吸困难 | Dyspnea |
| 6 | 乏力 | Fatigue |
| 7 | 肌肉酸痛 | Myalgia |
| 8 | 恶心 | Nausea |
| 9 | 呕吐 | Vomiting |
| 10 | 腹泻 | Diarrhea |
| 11 | 嗜睡 | Somnolence |
| 12 | 惊厥 | Convulsion |
Core keyword was used to query with Baidu Index.
Basic statistics of variables and unit root test results
| ADF | ADF 1st difference | Critical values | |||||
|---|---|---|---|---|---|---|---|
| 1% | 5% | 10% | |||||
| NCC | 416 | 6.42±11.41 | -1.67 | -8.32 | -4.04 | -3.45 | -3.15 |
| OMC | 20547 | 316.11±170.98 | -1.08 | -7.74 | -4.04 | -3.45 | -3.15 |
| OMA | 108777 | 557.83 ± 372.93 | -0.90 | -5.87 | -4.04 | -3.45 | -3.15 |
| OMS | 128958 | 165.33 ± 23.19 | -2.09 | -5.66 | -4.04 | -3.45 | -3.15 |
Selection of optimal lag order of VAR model
| Lag | AIC | HQ | SC | FPE |
|---|---|---|---|---|
| lag-1 | 25.90 | 26.23 | 26.75* | 1.77E+11 |
| lag-2 | 25.49* | 26.05* | 26.91 | 1.19E+11* |
| lag-3 | 25.61 | 26.38 | 27.60 | 1.38E+11 |
| lag-4 | 25.55 | 26.55 | 28.11 | 1.36E+11 |
| lag-5 | 25.57 | 26.79 | 28.70 | 1.50E+11 |
| lag-6 | 25.78 | 27.22 | 29.48 | 2.07E+11 |
Optimal lag order. +P<0.10, * P<0.05, ** P<0.01, *** P<0.001.
Estimation results for the equation of ∆NCCt in VAR(1) and VAR(2).
| ∆NCC | ∆NCC | |||||||
|---|---|---|---|---|---|---|---|---|
| Estimate | Estimate | |||||||
| ∆NCC | −0.484⁎⁎⁎ | 0.111 | −4.372 | 0.000 | −0.658⁎⁎⁎ | 0.104 | −6.313 | 0.000 |
| ∆OMC | 0.006 | 0.014 | 0.426 | 0.672 | 0.006 | 0.012 | 0.497 | 0.621 |
| ∆OMA | −0.002 | 0.011 | −0.213 | 0.832 | −0.004 | 0.010 | −0.418 | 0.678 |
| ∆OMS | 0.125* | 0.061 | 2.050 | 0.045 | 0.010 | 0.051 | 0.197 | 0.845 |
| ∆NCC | −0.324⁎⁎ | 0.111 | −2.919 | 0.005 | ||||
| ∆OMC | 0.059⁎⁎⁎ | 0.012 | 5.073 | 0.000 | ||||
| ∆OMA | 0.002 | 0.009 | 0.244 | 0.809 | ||||
| ∆OMS | −0.103* | 0.051 | −2.025 | 0.048 | ||||
| SPV | 5.049* | 2.424 | 2.083 | 0.042 | 5.291* | 2.326 | 2.274 | 0.027 |
| Intercept | −0.918 | 0.913 | −1.006 | 0.319 | −1.001 | 0.709 | −1.411 | 0.164 |
| 0.325 | 0.658 | |||||||
| Adjusted | 0.265 | 0.599 | ||||||
+P<0.10, * P<0.05, ** P<0.01, *** P<0.001.
Results of Granger causality test for the VAR(2) model
| Granger | Bivariate test | Multivariate test | |||
|---|---|---|---|---|---|
| Cause | ∆NCC | ∆OMC | ∆OMA | ∆OMS | |
| ∆NCC | - | 1.50(0.230) | 0.34(0.713) | 0.49(0.613) | 0.68(0.664) |
| ∆OMC | 23.12(0.000) | - | 0.28(0.755) | 3.03(0.055)+ | 5.48(0.000) |
| ∆OMA | 0.07 (0.928) | 1.02(0.366) | - | 0.92(0.404) | 0.43(0.862) |
| ∆OMS | 3.41(0.040) | 0.28(0.758) | 5.75(0.005) | - | 2.55(0.021) |
+P<0.10, * P<0.05, ** P<0.01, *** P<0.001.
Fig. AImpulse response function for the four variables of the VAR(2) model (10 days).
Fig. 4IRF for the response of NCC to other variables (95% CI).
Results of FEVD for the four variables of the VAR(2) model (10 steps).
| Period | NCC | OMC | ONA | OMS | |
|---|---|---|---|---|---|
| NCC | 1 | 100.00% | 0.00% | 0.00% | 0.00% |
| 2 | 99.55% | 0.21% | 0.19% | 0.05% | |
| 3 | 73.08% | 22.99% | 0.14% | 3.78% | |
| 4 | 53.73% | 41.63% | 0.10% | 4.54% | |
| 5 | 52.22% | 42.06% | 0.18% | 5.54% | |
| 6 | 51.16% | 42.80% | 0.18% | 5.86% | |
| 7 | 50.78% | 43.10% | 0.20% | 5.93% | |
| 8 | 50.79% | 43.08% | 0.20% | 5.93% | |
| 9 | 50.76% | 43.09% | 0.21% | 5.95% | |
| 10 | 50.73% | 43.11% | 0.21% | 5.95% | |
| OMC | 1 | 0.08% | 99.92% | 0.00% | 0.00% |
| 2 | 0.94% | 98.64% | 0.41% | 0.00% | |
| 3 | 1.28% | 97.45% | 0.49% | 0.78% | |
| 4 | 1.36% | 96.63% | 0.58% | 1.43% | |
| 5 | 1.64% | 96.26% | 0.60% | 1.50% | |
| 6 | 1.65% | 96.24% | 0.61% | 1.49% | |
| 7 | 1.66% | 96.23% | 0.62% | 1.49% | |
| 8 | 1.65% | 96.24% | 0.62% | 1.49% | |
| 9 | 1.65% | 96.23% | 0.63% | 1.49% | |
| 10 | 1.65% | 96.23% | 0.63% | 1.49% | |
| OMA | 1 | 0.01% | 1.47% | 98.52% | 0.00% |
| 2 | 0.48% | 1.44% | 86.54% | 11.54% | |
| 3 | 0.47% | 1.84% | 81.43% | 16.26% | |
| 4 | 0.85% | 2.19% | 80.60% | 16.35% | |
| 5 | 0.86% | 2.33% | 80.35% | 16.47% | |
| 6 | 0.86% | 2.88% | 79.83% | 16.42% | |
| 7 | 0.86% | 2.94% | 79.74% | 16.46% | |
| 8 | 0.86% | 3.00% | 79.68% | 16.45% | |
| 9 | 0.86% | 3.03% | 79.65% | 16.45% | |
| 10 | 0.87% | 3.03% | 79.65% | 16.45% | |
| OMS | 1 | 0.45% | 2.21% | 2.43% | 94.91% |
| 2 | 0.47% | 10.82% | 3.53% | 85.18% | |
| 3 | 0.98% | 14.38% | 4.20% | 80.44% | |
| 4 | 1.40% | 14.35% | 4.18% | 80.07% | |
| 5 | 1.53% | 14.80% | 4.14% | 79.54% | |
| 6 | 1.55% | 15.13% | 4.14% | 79.18% | |
| 7 | 1.55% | 15.14% | 4.14% | 79.16% | |
| 8 | 1.55% | 15.14% | 4.14% | 79.17% | |
| 9 | 1.56% | 15.14% | 4.14% | 79.17% | |
| 10 | 1.56% | 15.14% | 4.14% | 79.16% |
Fig. 5Variance decomposition in NCC by other variables in the VAR(2) model.
Results of Granger causality test between OMC and NCC
| Lag | ∆OMC does not Granger cause ∆NCC | ||||
|---|---|---|---|---|---|
| ∆OMC_18 | ∆OMC_19a | ∆OMC_19b | ∆OMC_20a | ∆OMC_20b | |
| lag-1 | 0.43(0.514) | 1.02(0.318) | 0.11(0.734) | 0.53(0.470) | 0.52(0.470) |
| lag-2 | 0.37(0.694) | 1.00(0.374) | 0.09(0.913) | 1.87(0.162) | 23.12(0.000)⁎⁎⁎ |
| lag-3 | 0.55(0.647) | 0.66(0.578) | 0.73(0.539) | 1.23(0.308) | 19.56(0.000)⁎⁎⁎ |
| lag-4 | 0.39(0.808) | 0.48(0.753) | 0.92(0.455) | 1.00(0.416) | 14.34(0.000)⁎⁎⁎ |
| lag-5 | 0.23(0.948) | 0.49(0.785) | 0.82(0.535) | 0.98(0.436) | 12.97(0.000)⁎⁎⁎ |
| lag-6 | 0.17(0.981) | 0.45(0.843) | 0.69(0.655) | 1.50(0.201) | 9.91(0.000)⁎⁎⁎ |
OMC_18: the daily amount of OMC containing symptom keywords within 65 days (2018.1.1-2018.3.6);
OMC_19a: the daily amount of OMC containing symptom keywords within 65 days (2019.1.1-2019.3.6);
OMC_19b: the daily amount of OMC containing symptom keywords within 65 days (2019.10.28-2019.12.31);
OMC_20a: the daily amount of OMC excluding symptom keywords within 65 days (2020.1.1-2020.3.5);
OMC_20b: the daily amount of OMC containing symptom keywords within 65 days (2020.1.1-2020.3.5).
⁎⁎⁎P<0.001, ⁎⁎P<0.01, * P<0.05, +P<0.10.
Fig. 6Variance decomposition plots for alternative VAR models in different timeframes.
Fig. 7IRF for the response of NCC to OMA in alternative models (95% CI).
Results of Granger causality test from OMS to NCC by keywords.
| A Granger causes B | ||||
|---|---|---|---|---|
| lag-1 | lag-2 | lag-3 | lag-4 | |
| ∆OMS-S→∆NCC | 3.40(0.070)+ | 3.41(0.040)* | 2.76(0.051)+ | 2.32(0.069)+ |
| ∆OMS-Q1→∆NCC | 0.47(0.498) | 0.48(0.624) | 0.98(0.408) | 1.06(0.386) |
| ∆OMS-Q2→∆NCC | 1.80(0.185) | 2.79(0.070)+ | 1.94(0.134) | 1.46(0.223) |
| ∆OMS-Q3→∆NCC | 0.60(0.440) | 0.36(0.700) | 0.58(0.628) | 0.50(0.735) |
| ∆OMS-M→∆NCC | 2.12(0.151) | 2.89(0.063)+ | 1.91(0.139) | 1.37(0.257) |
+P<0.10, * P<0.05, ** P<0.01, *** P<0.001; -S= sum, -Q1=quantile 1, - Q3 =quantile 2, - Q3= quantile 3, -M= maximum.