| Literature DB >> 32214588 |
Shannon M Fast1, Louis Kim1, Emily L Cohn2, Sumiko R Mekaru2, John S Brownstein2, Natasha Markuzon1.
Abstract
Infectious disease outbreaks often have consequences beyond human health, including concern among the population, economic instability, and sometimes violence. A warning system capable of anticipating social disruptions resulting from disease outbreaks is urgently needed to help decision makers prepare appropriately. We designed a system that operates in near real-time to identify and predict social response. Over 150,000 Internet-based news articles related to outbreaks of 16 diseases in 72 countries and territories were provided by HealthMap. These articles were automatically tagged with indicators of the disease activity and population reaction. An anomaly detection algorithm was implemented on the population reaction indicators to identify periods of unusually severe social response. Then a model was developed to predict the probability of these periods of unusually severe social response occurring in the coming week, 2 and 3 weeks. This model exhibited remarkably strong performance for diseases with substantial media coverage. For country-disease pairs with a median of 20 or more articles per year, the onset of social response in the next week was correctly predicted over 60% of the time, and 87% of weeks were correctly predicted. Performance was weaker for diseases with little media coverage, and, for these diseases, the main utility of our system is in identifying social response when it occurs, rather than predicting when it will happen in the future. Overall, the developed near real-time prediction approach is a promising step toward developing predictive models to inform responders of the likely social consequences of disease spread. © Springer Science+Business Media New York 2017.Entities:
Keywords: Anomaly detection; Biosurveillance; Epidemics; Near real-time prediction; Social response
Year: 2017 PMID: 32214588 PMCID: PMC7088430 DOI: 10.1007/s10479-017-2480-9
Source DB: PubMed Journal: Ann Oper Res ISSN: 0254-5330 Impact factor: 4.854
Fig. 1Overview of methods. a First, news articles were automatically collected and tagged with indicators of disease activity and social response. b Next, an anomaly detection approach was used to identify periods of time with unusually severe social response profiles. These periods were used as targets for social response forecasting. c Finally, the occurrence of unusually severe social response was forecast for the coming week, 2 and 3 weeks
Fig. 2Bayesian network describing relationships between country, disease, and social response indicator counts. All social response indicator counts were dependent upon the country and the disease. We allowed relationships between social response indicator counts (e.g. the count for Violence depends upon the count for Population Fear) to be learned, but did not require such relationships. The pictured network is the Bayesian network in the case where no relationships were learned among the social response indicator counts
Fig. 3Identification of periods of unusually severe social response for dengue fever outbreaks in India. a The social response indicator counts are shown by social response type. Overall, the peaks in social response indicator counts align well with the binary social response indicator, . b The exponentially weighted moving average of the anomaly scores () is shown along with the upper control limit (). The binary social response indicator is 1 when the exponentially weighted moving average surpasses the control limit
Features of the social response transition models
| Model | Model |
|---|---|
| Country name | Country name |
| Disease name | Disease name |
| Number of articles about country and disease in week (with and without duplicates) | Number of articles about country and disease in week (with and without duplicates) |
| Indicator counts for all 20 indicators | Indicator counts for all 20 indicators |
| 3 week slopes of indicator counts | EWMA as a percentage of UCL |
| Duration of current no social response period | 3 week slope of EWMA |
| Duration of current social response period |
Overall performance of the social response prediction models. Model performance was evaluated based on six metrics: accuracy, sensitivity, sensitivity looking only at weeks with articles in the preceding 3 weeks, sensitivity looking only at weeks with articles in the preceding week, specificity, and precision. Model predicted the onset of periods of social response, while Model predicted the end of such periods
| Accuracy | Sensitivity | Specificity | Precision | |||
|---|---|---|---|---|---|---|
| All weeks | Weeks with one or more articles in prior 3 weeks | Weeks with one or more articles in prior week | ||||
| Next week ( | ||||||
| Model | 0.991 | 0.213 | 0.389 | 0.460 | 0.995 | 0.146 |
| Model | 0.923 | 0.956 | – | – | 0.821 | 0.943 |
| Next 2 weeks ( | ||||||
| Model | 0.988 | 0.179 | 0.367 | 0.452 | 0.994 | 0.193 |
| Model | 0.916 | 0.954 | – | – | 0.780 | 0.940 |
| Next 3 weeks ( | ||||||
| Model | 0.984 | 0.162 | 0.363 | 0.457 | 0.993 | 0.221 |
| Model | 0.909 | 0.951 | – | – | 0.745 | 0.936 |
Fig. 4Predicted probability of unusually severe social response in the next 2 weeks for dengue fever outbreaks in India. a The social response indicator counts are shown by social response type. Periods during which social response was occurring () are shaded in grey. b The predicted probability of social response in the next 2 weeks () is shown. The predictions are colored according to whether an incorrect (false positive or false negative; red) or correct (true positive or true negative; black) prediction was made. Overall, the predictions exhibit the desired behavior—low probability of social response in the next 2 weeks was predicted during no social response periods, and, as a social response period were approached, the predicted probability increased. (Color figure online)
Comparison of performance for predicting the onset of social response (Model ) for country-disease pairs with a median of 20 or more news articles per year and those with fewer articles per year. Model performance was evaluated based on six metrics: accuracy, sensitivity, sensitivity looking only at weeks with articles in the preceding 3 weeks, sensitivity looking only at weeks with articles in the preceding week, specificity, and precision. Model sensitivity and precision were dramatically higher for the country-disease pairs with a median of 20 or more articles per year, than for the pairs with fewer articles per year
| Accuracy | Sensitivity | Specificity | Precision | |||
|---|---|---|---|---|---|---|
| All weeks | Weeks with one or more articles in prior 3 weeks | Weeks with one or more articles in prior week | ||||
| Next week ( | ||||||
|
| 0.887 | 0.602 | 0.633 | 0.671 | 0.897 | 0.169 |
|
| 0.994 | 0.123 | 0.260 | 0.311 | 0.997 | 0.127 |
| Next 2 weeks ( | ||||||
|
| 0.858 | 0.536 | 0.569 | 0.619 | 0.877 | 0.206 |
|
| 0.991 | 0.107 | 0.265 | 0.337 | 0.997 | 0.182 |
| Next 3 weeks ( | ||||||
|
| 0.838 | 0.516 | 0.552 | 0.607 | 0.864 | 0.238 |
|
| 0.988 | 0.096 | 0.267 | 0.348 | 0.996 | 0.206 |