| Literature DB >> 33147252 |
Soo Beom Choi1,2, Insung Ahn1,2.
Abstract
As the number of global coronavirus disease (COVID-19) cases increases, the number of imported cases is gradually rising. Furthermore, there is no reduction in domestic outbreaks. To assess the risks from imported COVID-19 cases in South Korea, we suggest using the daily risk score. Confirmed COVID-19 cases reported by John Hopkins University Center, roaming data collected from Korea Telecom, and the Oxford COVID-19 Government Response Tracker index were included in calculating the risk score. The risk score was highly correlated with imported COVID-19 cases after 12 days. To forecast daily imported COVID-19 cases after 12 days in South Korea, we developed prediction models using simple linear regression and autoregressive integrated moving average, including exogenous variables (ARIMAX). In the validation set, the root mean squared error of the linear regression model using the risk score was 6.2, which was lower than that of the autoregressive integrated moving average (ARIMA; 22.3) without the risk score as a reference. Correlation coefficient of ARIMAX using the risk score (0.925) was higher than that of ARIMA (0.899). A possible reason for this time lag of 12 days between imported cases and the risk score could be the delay that occurs before the effect of government policies such as closure of airports or lockdown of cities. Roaming data could help warn roaming users regarding their COVID-19 risk status and inform the national health agency of possible high-risk areas for domestic outbreaks.Entities:
Mesh:
Year: 2020 PMID: 33147252 PMCID: PMC7641397 DOI: 10.1371/journal.pone.0241466
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Imported COVID-19 cases in South Korea.
(A) The time-series data of imported COVID-19 cases in South Korea. (B) Global COVID-19 incidence rate. (C) Korea Telecom (KT) roaming data. (D) Risk score of imported COVID-19 cases. The stacked vertical bar graph at 100% represents the ratio of daily values of countries categorized as China, Asia except China, Europe, America, and Africa. KT roaming data were converted to have a range from 0 to 1 by Min-Max Scaling.
Performance of the forecasting models for imported COVID-19 cases after 12 days.
| Model | (p,d,q) | Training set | First validation set (3.28~4.30) | Second validation set (5.1~6.30) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AIC | BIC | P-value | RMSE | MAE | MAPE | RMSE | MAE | MAPE | ||||
| LR with risk score | 269.0 | 273.0 | < 0.001 | 0.885 | 6.2 | 5.0 | 56.5 | 0.835 | 4.4 | 3.5 | 32.8 | |
| ARIMA (Reference) | (1,1,0) | 165.3 | 169.2 | - | 0.899 | 22.3 | 15.3 | 117.1 | 0.716 | 4.7 | 3.5 | 30.9 |
| (1,1,1) | 135.3 | 141.1 | - | 0.885 | 96.9 | 40.6 | 186.8 | 0.657 | 5.1 | 4.0 | 39.4 | |
| (2,1,0) | 154.4 | 160.2 | - | 0.894 | 36.4 | 19.4 | 115.9 | 0.662 | 5.0 | 3.9 | 37.6 | |
| (2,1,1) | 137.3 | 145.1 | - | 0.879 | 98.3 | 41.6 | 196.5 | 0.667 | 5.0 | 3.7 | 34.1 | |
| (2,1,2) | 137.1 | 146.8 | - | 0.875 | 99.6 | 41.6 | 192.5 | 0.650 | 5.2 | 4.0 | 38.4 | |
| ARIMAX with risk score | (1,1,0) | 99.3 | 105.1 | 0.029 | 0.925 | 6.3 | 5.0 | 49.9 | 0.798 | 4.1 | 2.8 | 22.3 |
| (1,1,1) | 100.6 | 108.3 | 0.040 | 0.924 | 6.4 | 5.1 | 50.7 | 0.799 | 4.2 | 2.8 | 22.7 | |
| (2,1,0) | 101.5 | 109.3 | 0.026 | 0.925 | 6.3 | 5.0 | 49.9 | 0.798 | 4.1 | 2.8 | 22.3 | |
| (2,1,1) | 101.3 | 110.9 | 0.050 | 0.935 | 6.8 | 5.4 | 51.2 | 0.798 | 4.1 | 2.8 | 22.3 | |
| (2,1,2) | 104.5 | 115.9 | 0.034 | 0.752 | 14.2 | 8.8 | 74.0 | 0.799 | 4.2 | 2.8 | 22.9 | |
p, autoregressive order; d, differencing; q, moving average order; AIC, Akaike information criterion; BIC, Bayesian information criterion; LR, linear regression; R, correlation coefficient; RMSE, root mean square error; MAE, mean absolute error; MAPE, mean absolute percentage error; ARIMA, autoregressive moving average; ARIMAX, ARIMA including exogenous variables.
* P-value for the risk score in LR and ARIMAX models.
Fig 2The forecast of imported COVID-19 cases after 12 days in South Korea.
The forecast in both first and second validation sets is displayed. The black line denotes real imported cases in South Korea, the red line denotes the prediction values, and the red bars represent the 50% confidence intervals.
Fig 3Heat maps of imported COVID-19 cases in South Korea.
(A) Heat maps for the imported COVID-19 cases. (B) The risk score calculated using the roaming data. Darker shades of red indicate that more imported COVID-19 cases have been confirmed (A) and present a higher risk (B).