| Literature DB >> 27014744 |
David Andre Broniatowski1, Mark Dredze2, Michael J Paul3, Andrea Dugas4.
Abstract
BACKGROUND: Public health officials and policy makers in the United States expend significant resources at the national, state, county, and city levels to measure the rate of influenza infection. These individuals rely on influenza infection rate information to make important decisions during the course of an influenza season driving vaccination campaigns, clinical guidelines, and medical staffing. Web and social media data sources have emerged as attractive alternatives to supplement existing practices. While traditional surveillance methods take 1-2 weeks, and significant labor, to produce an infection estimate in each locale, web and social media data are available in near real-time for a broad range of locations.Entities:
Keywords: Web mining; social computing; time series analysis
Year: 2015 PMID: 27014744 PMCID: PMC4803078 DOI: 10.2196/publichealth.4472
Source DB: PubMed Journal: JMIR Public Health Surveill ISSN: 2369-2960
Figure 1Screenshot of HealthTweets.
Figure 2Equations defining the ARIMAX model.
Log-likelihood (AICa) for each surveillance method.
| Laboratory-confirmed influenza | Influenza like illness (ILI) | ||||||
| City | Region | US | City | Region | US | ||
| USc | -311 (627)0,1,0e | -317g(653)5,1,3 | -235g(484)0,1,5 | -502g(1009)0,2,1 | -66g(143)0,1,0 | -27g(61)1,1,1 | |
| MDd | -310 (624)0,1,0 | -321 (661)5,1,3 | -236 (486)0,1,5 | -503 (1012)0,1,0 | -70 (144)0,1,0 | -30 (68)1,1,1 | |
| Baltimore | -308g(620)0,1,0 | -323 (666)5,1,3 | -235 (484)0,1,5 | -504 (1013)0,2,1 | -74 (158)0,1,3 | -32 (74)1,1,1 | |
| US | -291g(596)1,1,4 | -313g(648)5,1,4 | -230f,g(475)0,1,5 | -494f,g(1002)1,2,4 | -49f,g(110)0,1,4 | -1f,g(15)1,1,4 | |
| MD | -299 (612)1,1,4 | -318 (656)5,1,3 | -236 (486)0,1,5 | -498 (1010)1,2,4 | -58 (129)0,1,4 | -27 (61)1,1,1 | |
| Baltimore | -295 (604)1,1,4 | -320 (660)5,1,3 | -236 (486)0,1,5 | -495 (1005)1,2,4 | -60 (132)0,1,4 | -23 (56)1,1,2 | |
| US | -289f,g(594)1,1,4 | -312f,g(646)5,1,3 | -230g(477)0,1,5 | -495g(1003)0,1,4 | -49g(112)0,1,4 | -0g(17)1,1,4 | |
| MD | -299 (613)1,14 | -318 (657)5,1,3 | -235 (485)0,1,5 | -498 (1011)1,2,4 | -58 (130)0,1,4 | -27 (68)1,1,1 | |
| Baltimore | -294 (604)1,1,4 | -319 (659)5,1,3 | -235 (486)0,1,5 | -500 (1007)0,2,1 | -60 (134)0,1,4 | -22 (55)1,1,2 | |
AIC=Aikake Information Criterion
bTwitter data from the HealthTweets website.
cUS=United States
dMD=Maryland
eSuperscript numerals indicate the autoregressive order, the order of differencing, and the moving average order, respectively. Models were chosen to minimize AIC, guided by examinations of autocorrelation and partial autocorrelation values.
fThe best predictor across all data sources.
gThe best predictor within each data source (HealthTweets website, Google, or a linear combination of both).
Figure 3Plot of weekly confirmed influenza cases (right axis) as compared to standardized Baltimore social media data (left axis).
Figure 4Plot of weekly influenza-like illness cases (right axis) as compared to standardized US social media data (left axis).