Literature DB >> 28721308

Eliciting Disease Data from Wikipedia Articles.

Geoffrey Fairchild1, Sara Y Del Valle1, Lalindra De Silva2, Alberto M Segre3.   

Abstract

Traditional disease surveillance systems suffer from several disadvantages, including reporting lags and antiquated technology, that have caused a movement towards internet-based disease surveillance systems. Internet systems are particularly attractive for disease outbreaks because they can provide data in near real-time and can be verified by individuals around the globe. However, most existing systems have focused on disease monitoring and do not provide a data repository for policy makers or researchers. In order to fill this gap, we analyzed Wikipedia article content. We demonstrate how a named-entity recognizer can be trained to tag case counts, death counts, and hospitalization counts in the article narrative that achieves an F1 score of 0.753. We also show, using the 2014 West African Ebola virus disease epidemic article as a case study, that there are detailed time series data that are consistently updated that closely align with ground truth data. We argue that Wikipedia can be used to create the first community-driven open-source emerging disease detection, monitoring, and repository system.

Entities:  

Year:  2015        PMID: 28721308      PMCID: PMC5511739     

Source DB:  PubMed          Journal:  Proc Int AAAI Conf Weblogs Soc Media        ISSN: 2162-3449


  7 in total

1.  ProMED-mail: an early warning system for emerging diseases.

Authors:  Lawrence C Madoff
Journal:  Clin Infect Dis       Date:  2004-06-28       Impact factor: 9.079

2.  Using internet searches for influenza surveillance.

Authors:  Philip M Polgreen; Yiling Chen; David M Pennock; Forrest D Nelson
Journal:  Clin Infect Dis       Date:  2008-12-01       Impact factor: 9.079

3.  Detecting influenza epidemics using search engine query data.

Authors:  Jeremy Ginsberg; Matthew H Mohebbi; Rajan S Patel; Lynnette Brammer; Mark S Smolinski; Larry Brilliant
Journal:  Nature       Date:  2009-02-19       Impact factor: 49.962

4.  The use of Twitter to track levels of disease activity and public concern in the U.S. during the influenza A H1N1 pandemic.

Authors:  Alessio Signorini; Alberto Maria Segre; Philip M Polgreen
Journal:  PLoS One       Date:  2011-05-04       Impact factor: 3.240

5.  Global disease monitoring and forecasting with Wikipedia.

Authors:  Nicholas Generous; Geoffrey Fairchild; Alina Deshpande; Sara Y Del Valle; Reid Priedhorsky
Journal:  PLoS Comput Biol       Date:  2014-11-13       Impact factor: 4.475

6.  HealthMap: global infectious disease monitoring through automated classification and visualization of Internet media reports.

Authors:  Clark C Freifeld; Kenneth D Mandl; Ben Y Reis; John S Brownstein
Journal:  J Am Med Inform Assoc       Date:  2007-12-20       Impact factor: 4.497

7.  Wikipedia usage estimates prevalence of influenza-like illness in the United States in near real-time.

Authors:  David J McIver; John S Brownstein
Journal:  PLoS Comput Biol       Date:  2014-04-17       Impact factor: 4.475

  7 in total
  4 in total

1.  Measuring Global Disease with Wikipedia: Success, Failure, and a Research Agenda.

Authors:  Reid Priedhorsky; Dave Osthus; Ashlynn R Daughton; Kelly R Moran; Nicholas Generous; Geoffrey Fairchild; Alina Deshpande; Sara Y Del Valle
Journal:  CSCW Conf Comput Support Coop Work       Date:  2017 Feb-Mar

2.  Evaluating Google, Twitter, and Wikipedia as Tools for Influenza Surveillance Using Bayesian Change Point Analysis: A Comparative Analysis.

Authors:  J Danielle Sharpe; Richard S Hopkins; Robert L Cook; Catherine W Striley
Journal:  JMIR Public Health Surveill       Date:  2016-10-20

3.  Public reaction to Chikungunya outbreaks in Italy-Insights from an extensive novel data streams-based structural equation modeling analysis.

Authors:  Naim Mahroum; Mohammad Adawi; Kassem Sharif; Roy Waknin; Hussein Mahagna; Bishara Bisharat; Mahmud Mahamid; Arsalan Abu-Much; Howard Amital; Nicola Luigi Bragazzi; Abdulla Watad
Journal:  PLoS One       Date:  2018-05-24       Impact factor: 3.240

4.  Epidemiological Data Challenges: Planning for a More Robust Future Through Data Standards.

Authors:  Geoffrey Fairchild; Byron Tasseff; Hari Khalsa; Nicholas Generous; Ashlynn R Daughton; Nileena Velappan; Reid Priedhorsky; Alina Deshpande
Journal:  Front Public Health       Date:  2018-11-23
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.