| Literature DB >> 30337302 |
Qingpeng Zhang1,2, Yi Chai1,3, Xiaoming Li4, Sean D Young5, Jiaqi Zhou1.
Abstract
OBJECTIVES: Internet data are important sources of abundant information regarding HIV epidemics and risk factors. A number of case studies found an association between internet searches and outbreaks of infectious diseases, including HIV. In this research, we examined the feasibility of using search query data to predict the number of new HIV diagnoses in China.Entities:
Keywords: health informatics; internet; predictive model; search query; surveillance
Mesh:
Year: 2018 PMID: 30337302 PMCID: PMC6196849 DOI: 10.1136/bmjopen-2017-018335
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Terms included in the composite search indices at the national and provincial levels
| Chinese | English translation | Correlation coefficient (China) | Correlation coefficient (Guangdong) |
| 艾滋病 | HIV/AIDS | 0.54 (p<0.001) | 0.43 (p<0.001) |
| 艾滋病初期症状 | Initial symptoms of HIV | 0.44 (p<0.001) | 0.49 (p<0.001) |
| 艾滋病试纸 | HIV test kits/strips | 0.36 (p<0.01) | 0.42 (p<0.001) |
| 艾滋病检测 | HIV testing | 0.35 (p<0.01) | 0.41 (p<0.001) |
| 艾滋病窗口期 | HIV test window period | 0.34 (p<0.01) | 0.36 (p<0.01) |
| 艾滋病能活多久 | How long can HIV patients live | 0.33 (p<0.01) | 0.43 (p<0.001) |
| 艾滋病传播途径 | Ways of HIV transmission | 0.31 (p<0.01) | 0.37 (p<0.01) |
| 艾滋病潜伏期 | Incubation period of HIV | Excluded | 0.31 (p<0.01) |
Figure 1Number of new HIV diagnoses (black) and composite search index (red) from 2011 to 2016 in China (A) and Guangdong province (B).
Figure 2Actual number of new HIV diagnoses and prediction results of the six proposed models (China). The black curve represents the actual data of new HIV diagnoses. The red curve represents the fitted values. The blue curve represents the prediction result. BnbGLM-AR, Bayesian negative binomial generalised linear model (BnbGLM) with autoregressive terms; BnbGLM-AR-Baidu, BnbGLM with autoregressive terms and the composite Baidu Search Index; BnbGLM-Baidu, BnbGLM with a variable representing the composite Baidu Search Index; nbGLM-AR, negative binomial generalised linear model (nbGLM) with autoregressive terms; nbGLM-AR-Baidu, nbGLM with autoregressive terms and the composite Baidu Search Index; nbGLM-Baidu, nbGLM with a variable representing the composite Baidu Search Index.
Accuracy in predicting the number of new HIV diagnoses in China
| Model | Nowcasting | One-month ahead forecasting | Two-month ahead forecasting | |||
| RMSE | NRMSE | RMSE | NRSME | RMSE | NRMSE | |
| nbGLM-AR | 473.35 | 11.71% | 484.21 | 11.98% | 528.38 | 13.08% |
| nbGLM-Baidu | 957.58 | 23.7% | 1166.38 | 28.86% | 1176.06 | 29.1% |
| nbGLM-AR-Baidu | 420.68 | 10.41% | 482.79 | 11.95% | 539.37 | 13.35% |
| BnbGLM-AR | 455.65 | 11.27% | 456.95 | 11.31% | 497.11 | 12.3% |
| BnbGLM-Baidu | 976.99 | 24.18% | 1176.16 | 29.11% | 1145.23 | 28.34% |
| BnbGLM-AR-Baidu | 423.17 | 10.47% | 451.75 | 11.18% | 508.31 | 12.58% |
BnbGLM-AR, Bayesian negative binomial generalised linear model (BnbGLM) with autoregressive terms; BnbGLM-AR-Baidu, BnbGLM with autoregressive terms and the composite Baidu Search Index; BnbGLM-Baidu, BnbGLM with a variable representing the composite Baidu Search Index; nbGLM-AR, negative binomial generalised linear model (nbGLM) with autoregressive terms; nbGLM-AR-Baidu, nbGLM with autoregressive terms and the composite Baidu Search Index; nbGLM-Baidu, nbGLM with a variable representing the composite Baidu Search Index; NRMSE, normalised root mean square error; RMSE, root mean square error.
Accuracy in predicting the number of new HIV diagnoses in Guangdong province
| Model | Nowcasting | One-month ahead forecasting | Two-month ahead forecasting | |||
| RMSE | NRMSE | RMSE | NRSME | RMSE | NRMSE | |
| nbGLM-AR | 56.39 | 21.28% | 54.73 | 20.65% | 54.66 | 20.6% |
| nbGLM-Baidu | 64.08 | 24.18% | 80.65 | 30.43% | 82.84% | 31.26% |
| nbGLM-AR-Baidu | 54.64 | 20.62% | 55.09 | 20.79% | 55.25 | 20.85% |
| BnbGLM-AR | 55.2 | 20.83% | 53.75 | 20.28% | 52.59 | 19.84% |
| BnbGLM-Baidu | 63.03 | 23.79% | 79.35 | 29.94% | 81.38 | 30.71% |
| BnbGLM-AR-Baidu | 54.18 | 20.45% | 54.21 | 20.46% | 53.01 | 20.04% |
BnbGLM-AR, Bayesian negative binomial generalised linear model (BnbGLM) with autoregressive terms; BnbGLM-AR-Baidu, BnbGLM with autoregressive terms and the composite Baidu Search Index; BnbGLM-Baidu, BnbGLM with a variable representing the composite Baidu Search Index; nbGLM-AR, negative binomial generalised linear model (nbGLM) with autoregressive terms; nbGLM-AR-Baidu, nbGLM with autoregressive terms and the composite Baidu Search Index; nbGLM-Baidu, nbGLM with a variable representing the composite Baidu Search Index; NRMSE, normalised root mean square error; RMSE, root mean square error.