| Literature DB >> 28727821 |
Shihao Yang1, Samuel C Kou1, Fred Lu2, John S Brownstein2,3, Nicholas Brooke4, Mauricio Santillana2,3.
Abstract
Dengue is a mosquito-borne disease that threatens over half of the world's population. Despite being endemic to more than 100 countries, government-led efforts and tools for timely identification and tracking of new infections are still lacking in many affected areas. Multiple methodologies that leverage the use of Internet-based data sources have been proposed as a way to complement dengue surveillance efforts. Among these, dengue-related Google search trends have been shown to correlate with dengue activity. We extend a methodological framework, initially proposed and validated for flu surveillance, to produce near real-time estimates of dengue cases in five countries/states: Mexico, Brazil, Thailand, Singapore and Taiwan. Our result shows that our modeling framework can be used to improve the tracking of dengue activity in multiple locations around the world.Entities:
Mesh:
Year: 2017 PMID: 28727821 PMCID: PMC5519005 DOI: 10.1371/journal.pcbi.1005607
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Comparison of ARGO to benchmark models across countries and evaluation metrics.
The bold face value is the best value among all methods according to each performance metric. Google Dengue Trends was not published for Taiwan and therefore the GDT benchmark is not available for Taiwan. The assessment period for the five regions, chosen based on the common available periods for all methods, are: Brazil (Mar 2006–Dec 2012), Mexico (Mar 2006–Aug 2015), Thailand (Oct 2010–Aug 2015), Singapore (Feb 2008–Aug 2015), Taiwan (Jan 2013–Mar 2016). The error value is relative to the naive, whose absolute error value is reported in the parenthesis.
| RMSE | MAE | RMSPE | MAPE | CORR | |
|---|---|---|---|---|---|
| ARGO | |||||
| GDT | 0.666 | 0.633 | 0.984 | 0.817 | 0.916 |
| GT | 0.902 | 0.829 | 0.877 | 0.838 | 0.861 |
| SAR | 0.660 | 0.563 | 0.664 | 0.583 | 0.917 |
| SAR+GDT | 0.629 | 0.587 | 0.564 | 0.560 | 0.938 |
| naive | 1 (30560.436) | 1 (21677.634) | 1 (0.703) | 1 (0.546) | 0.812 |
| ARGO | |||||
| GDT | 0.944 | 0.961 | 1.270 | 1.311 | 0.863 |
| GT | 0.950 | 0.927 | 1.097 | 1.100 | 0.861 |
| SAR | 0.790 | 0.737 | 0.776 | 0.815 | 0.911 |
| SAR+GDT | 1.249 | 0.986 | 0.779 | 0.854 | 0.891 |
| naive | 1 (3570.105) | 1 (2161.018) | 1 (0.816) | 1 (0.492) | 0.833 |
| ARGO | |||||
| GDT | 0.880 | 0.868 | 1.494 | 1.284 | 0.884 |
| GT | 1.364 | 1.224 | 1.510 | 1.368 | 0.833 |
| SAR | 0.774 | 0.836 | 0.906 | 0.898 | 0.917 |
| SAR+GDT | 1.157 | 0.983 | 0.923 | 0.936 | 0.903 |
| naive | 1 (2058.891) | 1 (1276.068) | 1 (0.426) | 1 (0.326) | 0.852 |
| ARGO | |||||
| GDT | 1.182 | 1.285 | 1.427 | 1.439 | 0.821 |
| GT | 1.287 | 1.165 | 1.287 | 1.254 | 0.796 |
| SAR | 1.153 | 1.104 | 1.166 | 1.087 | 0.847 |
| SAR+GDT | 2.452 | 1.297 | 1.185 | 1.009 | 0.775 |
| naive | 1 (329.318) | 1 (202.651) | 1 (0.283) | 1 (0.230) | 0.878 |
| ARGO | 2.180 | 1.264 | 0.834 | ||
| GT | 12.211 | 4.904 | 1.069 | 0.898 | 0.724 |
| SAR | 1.852 | 1.397 | 0.247 | 0.408 | |
| naive | 1 (3.248) | 1 (1.601) | 0.734 |
Fig 1Estimation results.
Monthly dengue case-count estimations are displayed for all studied countries for four different estimation methodologies: ARGO, a seasonal auto-regressive model with and without Google Dengue Trends information (SAR+GDT, and SAR, respectively), and a naive detection that estimates current month case counts using the last month’s observed cases. Historical dengue case counts, as reported by local health authorities, are shown for reference (black line), as well as the corresponding estimation errors associated to each methodology when compared to the reference.
Comparison of countries/states.
| Brazil | Mexico | Thailand | Singapore | Taiwan | |
|---|---|---|---|---|---|
| ARGO correlation | 0.971 | 0.924 | 0.928 | 0.903 | 0.834 |
| Median yearly case count | 590,000 | 48,000 | 47,000 | 5,400 | 1,700 |
| Seasonality (correlation of SAR) | 0.917 | 0.911 | 0.917 | 0.847 | 0.878 |
| Internet penetration [ | 50% | 38% | 27% | 74% | 76% |
| Google market share [ | 97% | 93% | 99% | 84% | 42% |
| Report frequency | monthly | monthly | monthly | weekly | weekly |
| Population (avg. in millions) [ | 198 | 120 | 67 | 5.2 | 23 |
| Median yearly incidence (per 10,000) | 29.1 | 4.1 | 8.0 | 10.5 | 6.8 |
| Country size (103 mi2) | 3,290 | 758 | 198 | 0.28 | 14 |
| Population density (per mi2) [ | 60 | 160 | 340 | 18,700 | 1,600 |
| GDP (per capita avg. over study period) [ | $10,100 | $9,200 | $5,800 | $55,000 | $31,900 |