| Literature DB >> 30103690 |
Yang Zhao1, Qinneng Xu2, Yupeng Chen2, Kwok Leung Tsui3,2.
Abstract
BACKGROUND: Hand, foot, and mouth disease (HFMD) has been recognized as one of the leading infectious diseases among children in China, which causes hundreds of annual deaths since 2008. In China, the reports of monthly HFMD cases usually have a delay of 1-2 months due to the time needed for collecting and processing clinical information. This time lag is far from optimal for policymakers making decisions. To alleviate this information gap, this study uses a meta learning framework and combines publicly Internet-based information (Baidu search queries) for real-time estimation of HFMD cases.Entities:
Keywords: Baidu index; HFMD; Meta-learning; Predictive model
Mesh:
Year: 2018 PMID: 30103690 PMCID: PMC6090735 DOI: 10.1186/s12879-018-3285-4
Source DB: PubMed Journal: BMC Infect Dis ISSN: 1471-2334 Impact factor: 3.090
Fig. 1Monthly HFMD cases in China and search frequency of ‘hand-foot-mouth’. Blue: the variation trend of monthly HFMD incidences in China; Orange: Baidu search volume of ‘hand-footmouth’
Correlation coefficients of HFMD cases at lag 0 with cases at lag 1, 2, 3, 4, 5, and 6
| Region | Lag 1 | Lag 2 | Lag 3 | Lag 4 | Lag 5 | Lag 6 |
|---|---|---|---|---|---|---|
| China | 0.744 | 0.236 | -0.194 | -0.398 | -0.387 | -0.368 |
| Guangxi | 0.667 | 0.089 | -0.241 | -0.281 | -0.206 | -0.125 |
| Henan | 0.714 | 0.198 | -0.152 | -0.298 | -0.324 | -0.33 |
| Zhejiang | 0.675 | 0.197 | -0.106 | -0.211 | -0.083 | -0.039 |
Fig. 2Meta learning framework
Meta features description
| Feature | Explanation |
|---|---|
| Min | Minimum of the HFMD cases over the time period. |
| Max | Maximum of the HFMD cases over the time period. |
| Mean | Mean of the HFMD cases over the time period. |
| SD | Standard deviation of the HFMD cases over the time period. |
| SKEW | Skewness of the HFMD cases over the time period. |
| KURT | Kurtosis of the HFMD cases over the time period. |
| Q1 | First quartile of the HFMD cases over the time period. |
| Q2 | Second quartile of the HFMD cases over the time period. |
| Q3 | Third quartile of the HFMD cases over the time period. |
| Month | Calendar month of the forecast point. |
| Ratio of turning points | Percentage of turning points in the series. |
Fig. 3Evaluation metric: RMSE. Dark blue: the RMSE of PCA; Red: the RMSE of LASSO; Green: the RMSE of RR; Purple: the RMSE of ARIMA; Light blue: the RMSE of ML
Fig. 4Evaluation metric: correlation coefficient. Dark blue: the correlation coefficient of PCA; Red: the correlation coefficient of LASSO; Green: the correlation coefficient of RR; Purple: the correlation coefficient of ARIMA; Light blue: the correlation coefficient of ML
Fig. 5Forecasting results. Black: the true value; Orangered: the nowcasting results of ARIMA; Gray: the nowcasting results of PCA; Orange: the nowcasting results of RR; Dark blue: the nowcasting results of LASSO; Green: the nowcasting results of Meta learning
RMSE of different forecasting methods
| Model without Baidu data | Model with Baidu data | |||
|---|---|---|---|---|
| Region | LR | PCA.LR | LASSO | RR |
| China | 150860 | 108510 | 57573 | 54326 |
| Guangxi | 13062 | 11453 | 10123 | 10778 |
| Zhejiang | 7682 | 4210 | 3445 | 4214 |
| Henan | 7028 | 3210 | 4561 | 2165 |
Corr of different forecasting methods
| Model without Baidu data | Model with Baidu data | |||
|---|---|---|---|---|
| Region | LR | PCA.LR | LASSO | RR |
| China | 0.74 | 0.92 | 0.95 | 0.97 |
| Guangxi | 0.65 | 0.91 | 0.88 | 0.90 |
| Zhejiang | 0.64 | 0.93 | 0.97 | 0.96 |
| Henan | 0.65 | 0.96 | 0.89 | 0.97 |
Fig. 6Evaluation metric of different lag time: RMSE. Blue: the RMSE of ARIMA; Orange: the RMSE of PCA+LR; Yellow: the RMSE of LASSO; Orangered: the RMSE of RR; Brown: the RMSE of ML