Literature DB >> 28782059

Measuring Global Disease with Wikipedia: Success, Failure, and a Research Agenda.

Reid Priedhorsky1, Dave Osthus2, Ashlynn R Daughton3, Kelly R Moran3, Nicholas Generous3, Geoffrey Fairchild3, Alina Deshpande3, Sara Y Del Valle3.   

Abstract

Effective disease monitoring provides a foundation for effective public health systems. This has historically been accomplished with patient contact and bureaucratic aggregation, which tends to be slow and expensive. Recent internet-based approaches promise to be real-time and cheap, with few parameters. However, the question of when and how these approaches work remains open. We addressed this question using Wikipedia access logs and category links. Our experiments, replicable and extensible using our open source code and data, test the effect of semantic article filtering, amount of training data, forecast horizon, and model staleness by comparing across 6 diseases and 4 countries using thousands of individual models. We found that our minimal-configuration, language-agnostic article selection process based on semantic relatedness is effective for improving predictions, and that our approach is relatively insensitive to the amount and age of training data. We also found, in contrast to prior work, very little forecasting value, and we argue that this is consistent with theoretical considerations about the nature of forecasting. These mixed results lead us to propose that the currently observational field of internet-based disease surveillance must pivot to include theoretical models of information flow as well as controlled experiments based on simulations of disease.

Entities:  

Keywords:  Disease; G.3. Probability and statistics: Correlation and regression analysis; H.1.1. Systems and information theory: Value of information; H.3.5. Online information services: Web-based services; J.3. Life and medical sciences: Health; Wikipedia; epidemiology; forecasting; modeling

Year:  2017        PMID: 28782059      PMCID: PMC5542563          DOI: 10.1145/2998181.2998183

Source DB:  PubMed          Journal:  CSCW Conf Comput Support Coop Work


  100 in total

Review 1.  Systematic review: surveillance systems for early detection of bioterrorism-related diseases.

Authors:  Dena M Bravata; Kathryn M McDonald; Wendy M Smith; Chara Rydzak; Herbert Szeto; David L Buckeridge; Corinna Haberland; Douglas K Owens
Journal:  Ann Intern Med       Date:  2004-06-01       Impact factor: 25.391

2.  Norovirus disease surveillance using Google Internet query share data.

Authors:  Rishi Desai; Aron J Hall; Benjamin A Lopman; Yair Shimshoni; Marcus Rennick; Niv Efron; Yossi Matias; Manish M Patel; Umesh D Parashar
Journal:  Clin Infect Dis       Date:  2012-06-19       Impact factor: 9.079

Review 3.  Google trends: a web-based tool for real-time surveillance of disease outbreaks.

Authors:  Herman Anthony Carneiro; Eleftherios Mylonakis
Journal:  Clin Infect Dis       Date:  2009-11-15       Impact factor: 9.079

4.  Seasonality in seeking mental health information on Google.

Authors:  John W Ayers; Benjamin M Althouse; Jon-Patrick Allem; J Niels Rosenquist; Daniel E Ford
Journal:  Am J Prev Med       Date:  2013-05       Impact factor: 5.043

5.  The use of Twitter to track levels of disease activity and public concern in the U.S. during the influenza A H1N1 pandemic.

Authors:  Alessio Signorini; Alberto Maria Segre; Philip M Polgreen
Journal:  PLoS One       Date:  2011-05-04       Impact factor: 3.240

6.  Emergency department and 'Google flu trends' data as syndromic surveillance indicators for seasonal influenza.

Authors:  L H Thompson; M T Malik; A Gumel; T Strome; S M Mahmud
Journal:  Epidemiol Infect       Date:  2014-01-20       Impact factor: 4.434

7.  The reliability of tweets as a supplementary method of seasonal influenza surveillance.

Authors:  Anoshé A Aslam; Ming-Hsiang Tsou; Brian H Spitzberg; Li An; J Mark Gawron; Dipak K Gupta; K Michael Peddecord; Anna C Nagel; Christopher Allen; Jiue-An Yang; Suzanne Lindsay
Journal:  J Med Internet Res       Date:  2014-11-14       Impact factor: 5.428

8.  Correlation Between UpToDate Searches and Reported Cases of Middle East Respiratory Syndrome During Outbreaks in Saudi Arabia.

Authors:  Anna R Thorner; Bin Cao; Terrence Jiang; Amy J Warner; Peter A Bonis
Journal:  Open Forum Infect Dis       Date:  2016-02-18       Impact factor: 3.835

9.  Use of Internet Search Queries to Enhance Surveillance of Foodborne Illness.

Authors:  Gyung Jin Bahk; Yong Soo Kim; Myoung Su Park
Journal:  Emerg Infect Dis       Date:  2015-11       Impact factor: 6.883

10.  Wikipedia usage estimates prevalence of influenza-like illness in the United States in near real-time.

Authors:  David J McIver; John S Brownstein
Journal:  PLoS Comput Biol       Date:  2014-04-17       Impact factor: 4.475

View more
  8 in total

Review 1.  Social Media- and Internet-Based Disease Surveillance for Public Health.

Authors:  Allison E Aiello; Audrey Renson; Paul N Zivich
Journal:  Annu Rev Public Health       Date:  2020-01-06       Impact factor: 21.981

2.  Surveilling Influenza Incidence With Centers for Disease Control and Prevention Web Traffic Data: Demonstration Using a Novel Dataset.

Authors:  Wendy K Caldwell; Geoffrey Fairchild; Sara Y Del Valle
Journal:  J Med Internet Res       Date:  2020-07-03       Impact factor: 5.428

3.  Comparison of Social Media, Syndromic Surveillance, and Microbiologic Acute Respiratory Infection Data: Observational Study.

Authors:  Ashlynn R Daughton; Rumi Chunara; Michael J Paul
Journal:  JMIR Public Health Surveill       Date:  2020-04-24

4.  Estimating influenza incidence using search query deceptiveness and generalized ridge regression.

Authors:  Reid Priedhorsky; Ashlynn R Daughton; Martha Barnard; Fiona O'Connell; Dave Osthus
Journal:  PLoS Comput Biol       Date:  2019-10-01       Impact factor: 4.475

5.  The impact of news exposure on collective attention in the United States during the 2016 Zika epidemic.

Authors:  Michele Tizzoni; André Panisson; Daniela Paolotti; Ciro Cattuto
Journal:  PLoS Comput Biol       Date:  2020-03-12       Impact factor: 4.475

6.  Even a good influenza forecasting model can benefit from internet-based nowcasts, but those benefits are limited.

Authors:  Dave Osthus; Ashlynn R Daughton; Reid Priedhorsky
Journal:  PLoS Comput Biol       Date:  2019-02-01       Impact factor: 4.475

7.  Situating Wikipedia as a health information resource in various contexts: A scoping review.

Authors:  Denise A Smith
Journal:  PLoS One       Date:  2020-02-18       Impact factor: 3.240

8.  Google Health Trends performance reflecting dengue incidence for the Brazilian states.

Authors:  Daniel Romero-Alvarez; Nidhi Parikh; Dave Osthus; Kaitlyn Martinez; Nicholas Generous; Sara Del Valle; Carrie A Manore
Journal:  BMC Infect Dis       Date:  2020-03-26       Impact factor: 3.090

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.